Learning Modern Algebra
Learning Modern Algebra
Learning Modern Algebra
i i
i i
“book2” — 2013/5/24 — 8:18 — page ii — #2
i i
c 2013 by
The Mathematical Association of America (Incorporated)
Library of Congress Control Number: 2013940990
Print ISBN: 978-1-93951-201-7
Electronic ISBN: 978-1-61444-612-5
Printed in the United States of America
Current Printing (last digit):
10 9 8 7 6 5 4 3 2 1
i i
“book2” — 2013/5/24 — 8:18 — page iii — #3
i i
Al Cuoco
EDC, Waltham MA
and
Joseph J. Rotman
University of Illinois at Urbana–Champaign
i i
“book2” — 2013/5/24 — 8:18 — page iv — #4
i i
Committee on Books
Frank Farris, Chair
MAA Textbooks Editorial Board
Zaven A. Karian, Editor
Matthias Beck
Richard E. Bedient
Thomas A. Garrity
Charles R. Hampton
John Lorch
Susan F. Pustejovsky
Elsa J. Schaefer
Stanley E. Seltzer
Kay B. Somers
MAA TEXTBOOKS
Bridge to Abstract Mathematics, Ralph W. Oberste-Vorth, Aristides Mouzakitis, and
Bonita A. Lawrence
Calculus Deconstructed: A Second Course in First-Year Calculus, Zbigniew H. Nitecki
Combinatorics: A Guided Tour, David R. Mazur
Combinatorics: A Problem Oriented Approach, Daniel A. Marcus
Complex Numbers and Geometry, Liang-shin Hahn
A Course in Mathematical Modeling, Douglas Mooney and Randall Swift
Cryptological Mathematics, Robert Edward Lewand
Differential Geometry and its Applications, John Oprea
Elementary Cryptanalysis, Abraham Sinkov
Elementary Mathematical Models, Dan Kalman
An Episodic History of Mathematics: Mathematical Culture Through Problem Solving,
Steven G. Krantz
Essentials of Mathematics, Margie Hale
Field Theory and its Classical Problems, Charles Hadlock
Fourier Series, Rajendra Bhatia
Game Theory and Strategy, Philip D. Straffin
Geometry Revisited, H. S. M. Coxeter and S. L. Greitzer
Graph Theory: A Problem Oriented Approach, Daniel Marcus
Knot Theory, Charles Livingston
Learning Modern Algebra: From Early Attempts to Prove Fermat’s Last Theorem, Al
Cuoco and and Joseph J. Rotman
Lie Groups: A Problem-Oriented Introduction via Matrix Groups, Harriet Pollatsek
Mathematical Connections: A Companion for Teachers and Others, Al Cuoco
Mathematical Interest Theory, Second Edition, Leslie Jane Federer Vaaler and James
W. Daniel
i i
“book2” — 2013/5/24 — 8:18 — page v — #5
i i
i i
“book2” — 2013/5/24 — 8:18 — page vi — #6
i i
i i
“book2” — 2013/5/24 — 8:18 — page vii — #7
i i
vii
i i
“book2” — 2013/5/24 — 8:18 — page viii — #8
i i
i i
“book2” — 2013/5/24 — 8:18 — page ix — #9
i i
Contents
Preface xiii
Some Features of This Book . . . . . . . . . . . . . . . . . . . . . xiv
A Note to Students . . . . . . . . . . . . . . . . . . . . . . . . . . xv
A Note to Instructors . . . . . . . . . . . . . . . . . . . . . . . . . xv
Notation xvii
2 Induction 45
2.1 Induction and Applications . . . . . . . . . . . . . . . . . . . 45
Unique Factorization . . . . . . . . . . . . . . . . . . . . . 53
Strong Induction . . . . . . . . . . . . . . . . . . . . . . . 57
Differential Equations . . . . . . . . . . . . . . . . . . . . 60
2.2 Binomial Theorem . . . . . . . . . . . . . . . . . . . . . . . 63
Combinatorics . . . . . . . . . . . . . . . . . . . . . . . . 69
2.3 Connections . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
An Approach to Induction . . . . . . . . . . . . . . . . . . 73
Fibonacci Sequence . . . . . . . . . . . . . . . . . . . . . 75
3 Renaissance 81
3.1 Classical Formulas . . . . . . . . . . . . . . . . . . . . . . . 82
3.2 Complex Numbers . . . . . . . . . . . . . . . . . . . . . . . 91
ix
i i
“book2” — 2013/5/29 — 16:18 — page x — #10
i i
x Contents
Algebraic Operations . . . . . . . . . . . . . . . . . . . . 92
Absolute Value and Direction . . . . . . . . . . . . . . . . 99
The Geometry Behind Multiplication . . . . . . . . . . . . 101
3.3 Roots and Powers . . . . . . . . . . . . . . . . . . . . . . . . 106
3.4 Connections: Designing Good Problems . . . . . . . . . . . . 116
Norms . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Pippins and Cheese . . . . . . . . . . . . . . . . . . . . . 118
Gaussian Integers: Pythagorean Triples Revisited . . . . . . 119
Eisenstein Triples and Diophantus . . . . . . . . . . . . . . 122
Nice Boxes . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Nice Functions for Calculus Problems . . . . . . . . . . . 124
Lattice Point Triangles . . . . . . . . . . . . . . . . . . . . 126
i i
“book2” — 2013/5/24 — 8:18 — page xi — #11
i i
Contents xi
9 Epilog 379
9.1 Abel and Galois . . . . . . . . . . . . . . . . . . . . . . . . . 379
9.2 Solvability by Radicals . . . . . . . . . . . . . . . . . . . . . 381
9.3 Symmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . 384
9.4 Groups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 389
9.5 Wiles and Fermat’s Last Theorem . . . . . . . . . . . . . . . 396
Elliptic Integrals and Elliptic Functions . . . . . . . . . . . 397
Congruent Numbers Revisited . . . . . . . . . . . . . . . . 400
Elliptic Curves . . . . . . . . . . . . . . . . . . . . . . . . 404
A Appendices 409
A.1 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
A.2 Equivalence Relations . . . . . . . . . . . . . . . . . . . . . . 420
A.3 Vector Spaces . . . . . . . . . . . . . . . . . . . . . . . . . . 424
Bases and Dimension . . . . . . . . . . . . . . . . . . . . 427
Linear Transformations . . . . . . . . . . . . . . . . . . . 435
A.4 Inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . 441
A.5 Generalized Associativity . . . . . . . . . . . . . . . . . . . . 442
A.6 A Cyclotomic Integer Calculator . . . . . . . . . . . . . . . . 444
Eisenstein Integers . . . . . . . . . . . . . . . . . . . . . . 445
Symmetric Polynomials . . . . . . . . . . . . . . . . . . . 446
Algebra with Periods . . . . . . . . . . . . . . . . . . . . . 446
References 449
Index 451
i i
“book2” — 2013/5/24 — 8:18 — page xii — #12
i i
i i
“book2” — 2013/5/24 — 8:18 — page xiii — #13
i i
Preface
This book is designed for college students who want to teach mathematics in
high school, but it can serve as a text for standard abstract algebra courses as
well. First courses in abstract algebra usually cover number theory, groups,
and commutative rings. We have found that the first encounter with groups is
not only inadequate for future teachers of high school mathematics, it is also
unsatisfying for other mathematics students. Hence, we focus here on number
theory, polynomials, and commutative rings. We introduce groups in our last
chapter, for the earlier discussion of commutative rings allows us to explain
how groups are used to prove Abel’s Theorem: there is no generalization of the
quadratic, cubic, and quartic formulas giving the roots of the general quintic
polynomial. A modest proposal: undergraduate abstract algebra should be a
sequence of two courses, with number theory and commutative rings in the
first course, and groups and linear algebra (with scalars in arbitrary fields) in
the second.
We invoke an historically accurate organizing principle: Fermat’s Last The-
orem (in Victorian times, the title of this book would have been Learning Mod-
ern Algebra by Studying Early Attempts, Especially Those in the Nineteenth
Century, that Tried to Prove Fermat’s Last Theorem Using Elementary Meth-
ods). To be sure, another important problem at that time that contributed to
modern algebra was the search for formulas giving the roots of polynomials.
This search is intertwined with the algebra involved in Fermat’s Last Theo-
rem, and we do treat this part of algebra as well. The difference between our
approach and the standard approach is one of emphasis: the natural direction
for us is towards algebraic number theory, whereas the usual direction is to-
wards Galois theory.
Four thousand years ago, the quadratic formula and the Pythagorean The-
orem were seen to be very useful. To teach them to new generations, it was
best to avoid square roots (which, at the time, were complicated to compute),
and so problems were designed to have integer solutions. This led to Pythag-
orean triples: positive integers a; b; c satisfying a2 C b 2 D c 2 . Two thousand
years ago, all such triples were found and, when studying them in the seven-
teenth century, Fermat wondered whether there are positive integer solutions
to an C b n D c n for n > 2. He claimed in a famous marginal note that there
are no solutions, but only his proof of the case n D 4 is known. This problem,
called Fermat’s Last Theorem, intrigued many of the finest mathematicians,
but it long resisted all attempts to solve it. Finally, using sophisticated tech-
niques of algebraic geometry developed at the end of the twentieth century,
Andrew Wiles proved Fermat’s Last Theorem in 1995.
xiii
i i
“book2” — 2013/5/24 — 8:18 — page xiv — #14
i i
xiv Preface
i i
“book2” — 2013/5/24 — 8:18 — page xv — #15
i i
Preface xv
A Note to Students
The heart of a mathematics course lies in its problems. We have tried to or-
chestrate them to help you build a solid understanding of the mathematics in
the sections. Everything afterward will make much more sense if you work
through as many exercises as you can, especially those that appear difficult.
Quite often, you will learn something valuable from an exercise even if you
don’t solve it completely. For example, a problem you can’t solve may show
that you haven’t fully understood an idea you thought you knew; or it may
force you to discover a fact that needs to be established to finish the solution.
There are two special kinds of exercises.
Those labeled Preview may seem to have little to do with the section at hand;
they are designed to foreshadow upcoming topics, often with numerical ex-
periments.
Those labeled Take it Further develop interesting ideas that are connected
to the main themes of the text, but are somewhat off the beaten path. They
are not essential for understanding what comes later in the text.
An exercise marked with an asterisk, such as 1.8*, means that it is either
used in some proof or it is referred to elsewhere in the text. For ease of finding
such exercises, all references to them have the form “Exercise 1.8 on page 6”
giving both its number and the number of the page on which it occurs.
A Note to Instructors
We recommend giving reading assignments to preview upcoming material.
This contributes to balancing experience and formality as described above, and
it saves time. Many important pages can be read and understood by students,
and they should be discussed in class only if students ask questions about them.
It is possible to use this book as a text for a three hour one-semester course,
but we strongly recommend that it be taught four hours per week.
i i
“book2” — 2013/5/24 — 8:18 — page xvi — #16
i i
i i
“book2” — 2013/5/24 — 8:18 — page xvii — #17
i i
Notation
xvii
i i
“book2” — 2013/5/24 — 8:18 — page xviii — #18
i i
xviii Notation
i i
“book2” — 2013/5/24 — 8:18 — page xix — #19
i i
Notation xix
i i
“book2” — 2013/5/24 — 8:18 — page xx — #20
i i
i i
“book2” — 2013/5/24 — 8:18 — page 1 — #21
i i
Algebra, geometry, and number theory have been used for millennia. Of course,
numbers are involved in counting and measuring, enabling commerce and ar-
chitecture. But reckoning was also involved in life and death matters such as
astronomy, which was necessary for navigation on the high seas (naval com-
merce flourished four thousand years ago) as well as to predict the seasons,
to apprise farmers when to plant and when to harvest. Ancient texts that have
survived from Babylon, China, Egypt, Greece, and India provide evidence for
this. For example, the Nile River was the source of life in ancient Egypt, for
its banks were the only arable land in the midst of desert. Mathematics was
used by the priestly class to predict flooding as well as to calculate area (taxes
were assessed according to the area of land, which changed after flood waters
subsided). And their temples and pyramids are marvels of engineering.
I have subtracted the side of the square from the area, and it is 870. What
is the side of my square?
The text rewrites the data as the quadratic equation x 2 x D 870; it then
gives a series of steps showing how to find the solution, illustrating that the
Babylonians knew the quadratic formula.
Historians say that teaching played an important role in ancient mathe-
matics (see van der Waerden [35], pp. 32–33). To illustrate, the coefficients
of the quadratic equation were chosen wisely: the discriminant b 2 4ac D
1 4. 870/ D 3481 D 592 is a perfect square. Were the discriminant not a The number 59 may have
perfect square, the problem would have been much harder, for finding square been chosen because
the Babylonians wrote
roots was not routine in those days. Thus, the quadratic in the text is well-
numbers in base 60, and
chosen for teaching the quadratic formula; a good teaching prize would not be 59 D 60 1.
awarded for x 2 47x D 210.
The Babylonians were not afraid of cubics. Another of their problems from
about the same time is
1
i i
“book2” — 2013/5/24 — 8:18 — page 2 — #22
i i
and the answer was given. The solution was, most likely, obtained by using
tables of approximations of cube roots.
A standard proof of the quadratic formula is by “completing the square.”
This phrase can be taken literally. Given a quadratic x 2 C bx D c with b and c
positive, we can view x 2 C bx as the shaded area in Figure 1.1. Complete the
c C 14 b 2 D x 2 C bx C 14 b 2 D .x C 12 b/2 :
q
Thus, x C 12 b D c C 14 b 2, which simplifies to the usual formula giving
2
the roots of x C bx c. The algebraic proof of the validity of the quadratic
formula works without assuming that b and c are positive, but the idea of the
proof is geometric.
b a b
a a a2
c2
b b b2
In [35], pp. 26–35, van
der Waerden considers a
the origin of proofs in
mathematics, suggesting Figure 1.2. Pythagorean Theorem.
that they arose in Europe
and Asia in Neolithic
(late Stone Age) times,
The Babylonians were aware of the Pythagorean Theorem. Although they
4500 BCE –2000 BCE . believed it, there is no evidence that the Babylonians had proved the Pythag-
orean Theorem; indeed, no evidence exists that they even saw a need for a
proof. Tradition attributes the first proof of this theorem to Pythagoras, who
Exercise 1.4 on page 5
asks you to show that the lived around 500 BCE, but no primary documents extant support this. An ele-
rhombus in Figure 1.2 gant proof of the Pythagorean Theorem is given on page 354 of Heath’s 1926
with sides of length c is a translation [16] of Euclid’s The Elements; the theorem follows from equality
square. of the areas of the two squares in Figure 1.2.
i i
“book2” — 2013/5/24 — 8:18 — page 3 — #23
i i
S M
a
E
jSEj2 D jSM j2 C jMEj2. Thus, the Earth is farther from the Sun than it is
from the Moon. Indeed, at sunset, ˛ D †E seems to be very close to 90ı: if we
are looking at the Moon and we wish to watch the Sun dip below the horizon,
we must turn our head all the way to the left. Aristarchus knew trigonometry;
he reckoned that cos ˛ was small, and he concluded that the Sun is very much
further from the Earth than is the Moon.
There is a door whose height and width are unknown, and a pole whose There are similar problems
length p is also unknown. Carried horizontally, the pole does not fit by 4 from the Babylonians and
other ancient cultures.
ch’ihI vertically, it does not fit by 2 ch’ihI slantwise, it fits exactly. What
are the height, width, and diagonal of the door?
p
p–2
p–4
The data give a right triangle with sides p 4, p 2, and p, and the Py-
thagorean Theorem gives the equation .p 4/2 C .p 2/2 D p 2, which
simplifies to p 2 12p C 20 D 0. The discriminant b 2 4ac is 144 80 D 64,
a perfect square, so that p D 10 and the door has height 8 and width 6 (the
other root of the quadratic is p D 2, which does not fit the physical data).
The sides of the right triangle are 6, 8, 10, and it is similar to the triangle with
sides 3; 4; 5. Again, the numbers have been chosen wisely. The idea is to teach
i i
“book2” — 2013/5/24 — 8:18 — page 4 — #24
i i
students how to use the Pythagorean Theorem and the quadratic formula. As
we have already remarked, computing square roots was then quite difficult, so
that the same problem for a pole of length p D 12 would not have been very
The word hypotenuse bright because there is no right triangle with sides of integral length that has
comes from the Greek verb hypotenuse 12. N
meaning to stretch.
Are there right triangles whose three sides have integral length that are not
similar to the 3; 4; 5 triangle? You are probably familiar with the 5; 12; 13 tri-
angle. Let’s use 4.a; b; c/ (lower case letters) to denote the triangle whose
sides have length a, b, and c; if 4.a; b; c/ is a right triangle, then c denotes
the length of its hypotenuse, while a and b are its legs. Thus, the right trian-
gle with side-lengths 5, 12, 13 is denoted by 4.5; 12; 13/. (We use the usual
notation, 4ABC , to denote a triangle whose vertices are A; B; C .)
Historical Note. Pythagorean triples are the good choices for problems teach-
ing the Pythagorean Theorem. There are many of them: Figure 1.5 shows a
Babylonian cuneiform tablet dating from the dynasty of Hammurabi, about
1800 BCE, whose museum name is Plimpton 322, which displays fifteen
Pythagorean triples (translated into our number system).
b a c
120 119 169
3456 3367 4825
4800 4601 6649
13500 12709 18541
72 65 97
360 319 481
2700 2291 3541
960 799 1249
600 481 769
6480 4961 8161
60 45 75
2400 1679 2929
240 161 289
2700 1771 3229
90 56 106
Figure 1.5. Plimpton 322.
i i
“book2” — 2013/5/24 — 8:18 — page 5 — #25
i i
then
p
aD .c C b/.c b/ D mn: (1.1)
Summarizing, here is what we call the Babylonian method. Choose odd num-
bers m and n (forcing m2 C n2 and m2 n2 to be even, so that b and c are
integers), and define a, b, and c by Eqs. (1.1), (1.2), and (1.3). For example, if
m D 7 and n D 5, we obtain 35, 12, 37. If we choose m D 179 and n D 71,
we obtain 13500, 12709, 18541, the largest triple on Plimpton 322.
The Babylonian method does not give all Pythagorean triples. For example,
.6; 8; 10/ is a Pythagorean triple, but there are no odd numbers m > n with
6 D mn or 8 D mn. Of course, .6; 8; 10/ is not signifcantly different from
.3; 4; 5/, which arises from 3 > 1. In the next section, we will show, follow-
ing Diophantus, ca. 250 CE, how to find all Pythagorean triples. But now we
should recognize that practical problems involving applications of pure math-
ematics (e.g., surveying) led to efforts to teach this mathematics effectively, After all, what practi-
which led to more pure mathematics (Pythagorean triples) that seems at first to cal application does
have no application outside of teaching. The remarkable, empirical, fact is that the Pythagorean triple
.13500; 12709; 18541/
pure mathematics yields new and valuable applications. For example, we shall
have?
see in the next section that classifying Pythagorean triples leads to simplifying
the verification of some trigonometric identities as well as the solution of cer-
tain integration problems (for example, we will see a natural way to integrate
sec x).
Exercises
1.1 Prove the quadratic formula for the roots of ax 2 C bx C c D 0 whose coefficients
a, b, and c may not be positive.
1.2 Give a geometric proof that .a C b/2 D a2 C 2ab C b 2 for a; b positive.
1.3 * Let f .x/ D ax 2 C bx C c be a quadratic whose coefficients a; b; c are rational.
Prove that if f .x/ has one rational root, then its other root is also rational.
1.4 *
(i) Prove that the rhombus with side lengths c in the left square of Figure 1.2 is The book by Loomis [20]
a square. contains 370 different
(ii) Prove the Pythagorean Theorem in a way suggested by Figure 1.2. proofs of the Pythagorean
Theorem, by the author’s
(iii) Give a proof of the Pythagorean Theorem different from the one suggested count.
by Figure 1.2.
i i
“book2” — 2013/5/24 — 8:18 — page 6 — #26
i i
1.5 Here is another problem from Nine Chapters on the Mathematical Art. A pond is
10 ch’ih square. A reed grows at its center and extends 1 ch’ih out of the water.
If the reed is pulled to the side of the pond, it reaches the side precisely. What are
the depth of the water and the length of the reed?
Answer: Depth = 12 ch’ih and length = 13 ch’ih.
1.6 *
(i) Establish the algebraic identity
2 2
aCb a b
D ab:
2 2
cosh x D 21 .ex C e x
/:
Prove that cosh x 1 for all real numbers x, while cosh x D 1 if and only if
x D 0.
(vi) Use Figure 1.6 to give another proof of the Arithmetic-Geometric Mean In-
equality.
a b
i i
“book2” — 2013/5/24 — 8:18 — page 7 — #27
i i
1.2 Diophantus 7
1.9 * Show that 5 is not the area of a triangle whose side-lengths form a Pythagorean
triple.
1.10 * Let .a; b; c/ be a Pythagorean triple. If m is a positive integer, prove that
.ma; mb; mc/ is also a Pythagorean triple.
1.11 .Converse of Pythagorean Theorem/: * Let 4 D 4.a; b; c/ be a triangle with
sides of lengths a; b; c (positive real numbers, not necessarily integers). Prove that
if a2 C b 2 D c 2 , then 4 is a right triangle.
Hint: Construct a right triangle 40 with legs of lengths a; b, and prove that 40 is
congruent to 4 by side-side-side.
1.12 * Prove that every Pythagorean triple .a; b; c/ arises from a right triangle 4.a; b; c/
having sides of lengths a; b; c.
1.13 If P D .a; b; c/ is a Pythagorean triple, define r.P / D c=a. If we label the Py-
thagorean triples on Plimpton 322 as P1 ; : : : ; P15 , show that r.Pi / is decreasing:
r.Pi / > r.Pi C1 / for all i 14.
1.14 * If .a; b; c/ is a Pythagorean triple, show that .a=c; b=c/ is a point on the graph
of x 2 C y 2 D 1. What is the graph of x 2 C y 2 D 1?
1.15 Preview: Let L be the line through . 1; 0/ with slope t .
1
(i) If t D 2, find all the points where L intersects the graph of x 2 C y 2 D 1.
Answer: . 35 ; 54 /.
3
(ii) If t D 2, find all the points where L intersects the graph of x 2 C y 2 D 1.
Answer: . 135 ; 12
13 /.
1
(iii) Pick a rational number t , not 2 or 32 , and find all the points where L intersects
the graph of x 2 C y 2 D 1.
(iv) Suppose ` is a line that contains . 1; 0/ with slope r. If r is a rational number,
show that ` intersects the graph of x 2 C y 2 D 1 in two points, each of which
has rational number coordinates.
1.16 Preview: A Gaussian integer is a complex number a C bi where both a and b
are integers. Pick six Gaussian integers r C si with r > s > 0 and square them.
State something interesting that you see in your results.
1.17 Preview: Consider a complex number z D q C ip, where q > p are positive
integers. Prove that
1.2 Diophantus
We are going to classify Pythagorean triples using a geometric method of Dio-
phantus that describes all Pythagorean triples.
Historical Note. We know very little about the life of Diophantus. He was
a mathematician who lived in Alexandria, Egypt, but his precise dates are
i i
“book2” — 2013/5/24 — 8:18 — page 8 — #28
i i
unknown; most historians believe he lived around 250 CE. His extant work
shows systematic algebraic procedures and notation, but his leaps of intuition
strongly suggest that he was thinking geometrically; indeed, Newton called
Diophantus’s discussion of Pythagorean triples the chord method (see Fig-
ure 1.7). Thus, geometry (the Pythagorean Theorem) and applied problems
(teaching) suggested an algebraic problem (find all Pythagorean triples), and
we now return to geometry to solve it. Here is evidence that the distinction
between algebra and geometry is an artificial one; both are parts of the same
subject.
l
slope = t
P = (x, y)
(–1, 0)
intersects the unit circle in a unique second point, P D .x; y/; let t be the
slope of `. As t varies through all real numbers, 1 < t < 1, the intersection
points P of ` and the unit circle trace out the entire circle (except for . 1; 0/).
Proposition 1.2. The points P on the unit circle .other than . 1; 0// are
parametrized as
1 t2 2t
P D ; ; wher e 1 < t < 1:
1 C t2 1 C t2
i i
“book2” — 2013/5/24 — 8:18 — page 9 — #29
i i
1.2 Diophantus 9
Proof. The line through points .a; b/ and .c; d / has equation y b D t.x a/,
where t D .d b/=.c a/, so the line ` through . 1; 0/ and a point P D
.x; y/ on the unit circle has an equation of the form y D t.x C 1/, so that
x D .y t/=t. Thus, .x; y/ is a solution of the system
y D t.x C 1/
x C y 2 D 1:
2
An obvious solution of this system is . 1; 0/, because this point lies on both
the line and the circle. Let’s find x and y in terms of t. If the slope t D 0,
then ` is the x-axis and the other solution is .1; 0/. To find the solutions when
t ¤ 0, eliminate x: the equations
y t
Dx and x2 C y2 D 1
t
give
2
y t
C y 2 D 1:
t
Expanding and simplifying, we obtain
y .1 C t 2 /y 2t D 0:
Definition. Two Pythagorean triples .a; b; c/ and .u; v; z/ are similar if their
right triangles 4.a; b; c/ and 4.u; v; z/ are similar triangles.
i i
“book2” — 2013/5/24 — 8:18 — page 10 — #30
i i
The method of Diophantus will give a formula for certain special Pythago-
rean triples, and it will then show that every Pythagorean triple is similar to a
special one.
Definition. A point .x; y/ in the plane is a rational point if both x and y are
rational numbers.
A Pythagorean point is a rational point in the first quadrant, lying on the
unit circle, and above the diagonal line with equation y D x.
Proof. Let .a; b; c/ be a Pythagorean triple. Dividing both sides of the defining
equation a2 C b 2 D c 2 by c 2 gives
.a=c/2 C .b=c/2 D 1;
so that the triple gives an ordered pair of positive rational numbers .x; y/ D
.a=c; b=c/ with x 2 C y 2 D 1. Thus, the rational point P D .x; y/ lies in the
first quadrant. As both .a; b; c/ and .b; a; c/ are the same Pythagorean triple,
we may assume that
x D a=c b=c D y;
so that .x; y/ lies above the diagonal line with equation y D x. Hence, .x; y/
is a Pythagorean point.
Conversely, let’s now see that a Pythagorean point .x; y/ gives rise to a
Pythagorean triple. Write the rational numbers x y with the same denomi-
nator, say, x D a=c and y D b=c, where a; b; and c are positive integers and
a b c. Now
a2 b2
1 D x2 C y2 D 2 C 2 ;
c c
so that a2 C b 2 D c 2 and hence .a; b; c/ is a Pythagorean triple.
In summary, the problem of finding all Pythagorean triples corresponds to
the problem of finding all Pythagorean points. This is exactly what the geo-
metric idea of Diophantus does. In fact, a Pythagorean point .x; y/ gives rise
to infinitely many Pythagorean triples. Write the coordinates with another de-
nominator, say x D u=z and y D v=z. The calculation at the end of the proof
of Proposition 1.3 shows that .u; v; z/ is another Pythagorean triple arising
from .x; y/.
i i
“book2” — 2013/5/24 — 8:18 — page 11 — #31
i i
1.2 Diophantus 11
Theorem 1.4. Let P D .x; y/ ¤ . 1; 0/ be a point on the unit circle, and let
t be the slope of the line ` joining . 1; 0/ and P .
(i) The slope t is a rational number if and only if P is a rational point.
(ii) The point p
P is a Pythagorean point if and only if t is a rational number
satisfying 2 1 < t < 1.
1 t2 2t
xD and y D :
1 C t2 1 C t2
i i
“book2” — 2013/5/24 — 8:18 — page 12 — #32
i i
.2qp; q 2p 2 ; q 2 C p 2/;
p
where p and q are positive integers with q > p > 2 1.
How to Think About It. The strategy of Diophantus is quite elegant. The
problem of determining all Pythagorean triples is reduced from finding three
unknowns, a; b; and c, to two unknowns, x D a=c and y D b=c, to one
unknown, t D p=q. In effect, all Pythagorean triples p are parametrized by t;
that is, as t varies over all rational numbers between 2 1 and 1, the formulas
involving t vary over all Pythagorean points and hence over all Pythagorean
triples.
We can now show that the Babylonians had, in fact, found all Pythagorean
We are tacitly using a
technique of proof called triples.
Infinite Descent. If, for a
given positive integer n Corollary 1.6. Every Pythagorean triple is similar to one arising from the
with certain properties, Babylonian method.
there always exists a
strictly smaller positive
integer n1 having the Proof. By Theorem 1.5, every Pythagorean triple is similar to one of the form
same properties, then .2qp; q 2 p 2 ; q 2 C p 2 /, where q > p are positive integers. If both q and p are
there are infinitely many even, then we can replace q > p by 21 q > 12 p, obtaining a Pythagorean triple
such integers. But this is
impossible; there are only . 14 2qp; 41 .q 2 p 2/; 14 .q 2 C p 2// similar to the original one. If both parameters
finitely many integers with of the new triple are still even, replace 12 q > 12 p by 41 q > 14 p. Eventually, we
n > n1 > n2 > > 0. arrive at a Pythagorean triple .2r s; r 2 s 2 ; r 2 C s 2 /, similar to the original
triple, that arises from parameters r > s, at least one of which is odd.
i i
“book2” — 2013/5/24 — 8:18 — page 13 — #33
i i
1.2 Diophantus 13
Substitute:
mn D .r Cs/.r s/ D r 2 s 2; 1
2 .m
2
n2 / D 2r s; and 12 .m2 n2/ D r 2 Cs 2:
Exercises
1.19 Find q and p in Theorem 1.5 for each of the following Pythagorean triples.
(i) .7; 24; 25/.
Answer: q D 5 and p D 3.
(ii) .129396; 261547;291805/:
Answer: q D 526 and p D 123.
1.20 * Show that every Pythagorean triple .x; y; z/ with x; y; z having no common
factor d > 1 is of the form
.r 2 s 2; 2rs; r 2 C s 2 /
for positive integers r > s having no common factor > 1; that is,
x D r2 s 2; y D 2rs; z D r 2 C s2:
1.21 A line in the plane with equation y D mx C c is called a rational line if m and c
are rational numbers. If P and Q are distinct rational points, prove that the line
joining them is a rational line.
1.22 A lattice point is a point in the plane whose coordinates are integers. Let P D
.x; y/ be a Pythagorean point and ` the line through P and the origin. Prove that
if Q D .a; b/ is a lattice point on ` and c is the distance from Q to the origin,
then .a; b; c/ is a Pythagorean triple.
i i
“book2” — 2013/5/24 — 8:18 — page 14 — #34
i i
1.23 * Let P D .x0 ; y0 / be a Pythagorean point and L the line joining P and the origin
(so the equation of L is y D mx, where m D y0 =x0 ). Show that if .a=c; b=c/ is
a rational point on L, then .a; b; c/ is a Pythagorean triple.
1.24 Does every rational point in the plane correspond to a Pythagorean point? If so,
prove it. If not, characterize the ones that do.
Answer: No. For example, . 12 ; 12 / does not correspond.
2 2
1.25 * Prove the identity x 2 C y 2 D x 2 y 2 C .2xy/2 :
1.26 *
(i) Show that the same number can occur as a leg in two nonsimilar Pythagorean
triangles.
(ii) Prove that the area of 4.a; b; c/, a right triangle with integer side lengths, is
an integer.
(iii) A Heron triangle is a triangle with integer side lengths and area. Find a Heron
triangle that is not a right triangle.
Hint: Use parts (i) and (ii).
1.27 Show that every integer n 3 occurs as a leg of some Pythagorean triple.
Hint: The cases n even and n odd should be done separately.
1.28 Distinct Pythagorean triples can have the same hypotenuse: both .33; 56; 65/ and
.16; 63; 65/ are Pythagorean triples. Find another pair of distinct Pythagorean
triples having the same hypotenuse.
1.29 * If .cos ; sin / is a rational point, prove that both cos. C30ı / and sin. C30ı/
are irrational.
Fermat was not the first Fermat never returned to this problem (at least, not publicly) except for his
mathematician to write a proof of the case n D 4, which we give below. The statement: If n > 2,
marginal note in a copy
there are no positive integers a; b; c with an C b n D c n , was called Fer-
of Diophantus. Next to
the same problem, the mat’s Last Theorem, perhaps in jest. The original text in which Fermat wrote
Byzantine mathematician his famous marginal note is lost today. Fermat’s son edited the next edition
Maximus Planudes wrote, of Diophantus, published in 1670; this version contains Fermat’s annotations,
Thy soul, Diophantus, be including his famous “Last Theorem;” it contained other unproved assertions
with Satan because of the
as well, most true, some not. By the early 1800s, only Fermat’s Last Theorem
difficulty of your theorems.
remained undecided. It became a famous problem, resisting the attempts of
mathematicians of the highest order for 350 years, until it was finally proved,
in 1995, by Wiles. His proof is very sophisticated, and most mathematicians
i i
“book2” — 2013/5/24 — 8:18 — page 15 — #35
i i
1.2 Diophantus 15
believe that Fermat did not have a correct proof. The quest for a proof of Fer-
mat’s Last Theorem generated much beautiful mathematics. In particular, it led
to an understanding of complex numbers, factorization, and polynomials. We’ll
see, in the Epilog, that extending the method of Diophantus from quadratics to
cubics involves elliptic curves, the study of which is the setting for Wiles’
proof of Fermat’s Last Theorem.
Fermat proved the next theorem (which implies the case n D 4 of Fermat’s
Last Theorem) because he was interested in the geometric problem of deter-
mining which right triangles having all sides of rational length have integer
area (we’ll soon discuss this problem in more detail).
i i
“book2” — 2013/5/24 — 8:18 — page 16 — #36
i i
As a and b are relatively prime (no common factor d > 1), each pair from
the three factors on the right-hand side of Eq. (1.6) is relatively prime. Since
the left-hand side .y=2/2 is a square, each factor on the right is a square (Ex-
ercise 2.12 on page 59). In other words, there are integers u, v, and w such
that
a D u2 ; b D v 2 ; and a2 C b 2 D w 2 :
And, since a and b are relatively prime, so, too, are u and v relatively prime.
Hence, we have
u4 C v 4 D w 2 :
We can now repeat this process on .u; v; w/. By infinite descent, there is no
solution to Eq. (1.5).
Corollary 1.8 (Fermat’s Last Theorem for Exponent 4). There are no pos-
itive integers x; y; z with
x4 C y4 D z4:
i i
“book2” — 2013/5/24 — 8:18 — page 17 — #37
i i
1.2 Diophantus 17
the side-lengths of 4 by 16 scales the area by 36
1
, so that 4 32 ; 20
3
; 41
6
has area
180=36 D 5.
So, the question arises: “Is every integer the area of a right triangle with
rational side-lengths?” Fermat showed that 1 and 2 are not, and his proof for 2
involved Eq. (1.5).
Theorem 1.9. There is no right triangle with rational side-lengths and area 2.
Proof. Suppose, on the contrary, that the rational numbers r; s; t are the lengths
of the sides of a right triangle with area 2. Then we have two equations:
r 2 C s2 D t 2
1
2
rs D 2:
r 4 C .r s/2 D .r t/2 ;
so that (since r s D 4),
r 4 C 24 D .r t/2 :
Write the rational numbers r and t as fractions with the same denominator:
r D a=c and t D b=c. When we clear denominators, we get a4 C z 4 c 4 D t 2 ,
an equation in integers x; y; z of the form
x4 C y4 D z2 :
This is Eq. (1.5), and Theorem 1.7 says that this cannot occur.
So, not every positive integer is the area of a right triangle with rational
side-lengths.
Theorem 1.9 says that 2 is not a congruent number. Using similar ideas,
Fermat showed that 1 is not a congruent number (Exercise 1.31 below).
One way to generate congruent numbers is to scale a Pythagorean triple
using the largest perfect square that divides its area. For example, the area of We have already used this
4.7; 24; 25/ is 84 D 22 21. Since 4 D 22 is the largest perfect square in 84, method on the Pythag-
orean triple .9; 40; 41/
scaling the sides by 2 will produce a triangle of area 21, so that 21 is the area of
when we showed that 5 is
4. 72 ; 12; 25
2
/ and, hence, 21 is a congruent number. More generally, we have a congruent number.
i i
“book2” — 2013/5/24 — 8:18 — page 18 — #38
i i
a2 C b 2 D c 2
1
2 ab D m2 n:
In light of Exercise 1.33 on page 20, we now have a method for determin-
ing all congruent numbers: generate the areas of all Pythagorean triangles (we
know how to do that), and then divide out its largest perfect square factor: case
closed.
Not quite. The trouble with this method is that you have no idea how many
triangle areas to calculate before (if ever) you get to an area of m2 n for a
particular n. For some congruent numbers, it takes a long time. For example,
157 is a congruent number, but the smallest rational right triangle with area
157 has side lengths
224403517704336969924557513090674863160948472041
This triangle was found ;
by Don Zagier, using
8912332268928859588025535178967163570016480830
sophisticated techniques
investigating elliptic curves,
6803294847826435051217540 411340519227716149383203
; :
and using a substantial 411340519227716149383203 21666555693714761309610
amount of computer power
(see [19] for more details). A method for effectively determining whether or not an integer is a congru-
ent number is an unsolved problem (this problem is at least a thousand years
old, for historians have found it in manuscripts dating from the late tenth cen-
A readable account of tury). A detailed discussion of the Congruent Number Problem is in [19].
the congruent prob-
lem, with more exam-
ples than we provide How to Think About It. Proposition 1.10 shows that every squarefree con-
here, can be found at
gruent number n is the area of a scaled Pythagorean triangle. But there might
www.math.uconn.edu/
˜kconrad/blurbs/
be more than one Pythagorean triangle whose area has n as its squarefree
part. The search for more than one rational right triangle with the same area
leads to some fantastic calculations. For example, we saw that 5 is the area of
4 32 ; 20 41
3 ; 6 , which comes from the Pythagorean triangle 4.9; 40; 41/ whose
area is 5 62 . But 5 is also the area of
1519 4920 3344161
4 ; ; ;
492 1519 747348
and this comes from the Pythagorean triangle 4.2420640; 2307361; 3344161/
whose area is 5 7473482.
As usual, this isn’t magic; in Chapter 9, we’ll show how to find infinitely
many rational right triangles with the same congruent number as area.
i i
“book2” — 2013/5/24 — 8:18 — page 19 — #39
i i
1.2 Diophantus 19
a2 C b 2 D c 2
ab D 2n
might lead us to think that we could find a and b by finding their sum and
product, for this would lead to a quadratic equation whose roots are a and b.
Well, we know ab, and
p p p p
c 2 C 4n C .c 2 C 4n/ 4.2n/ c 2 C 4n C c 2 4n
aD D
2 2
and
p p p p
c 2 C 4n .c 2 C 4n/ 4.2n/ c 2 C 4n c2 4n
bD D :
2 2
But we want a and b to be rational, so we want c 2 ˙4n to be perfect squares.
That produces an arithmetic sequence of three perfect squares:
c2 4n; c 2; c 2 C 4n:
There are details to settle, but that’s the gist of the proof of the following theo-
rem.
Exercises
1.30 * Show that 1 is not a congruent number.
1.31 Show that there are no positive rational numbers x and y so that
x4 ˙ 1 D y 2:
i i
“book2” — 2013/5/24 — 8:18 — page 20 — #40
i i
1.3 Euclid
Euclid of Alexandria (ca. 325 BCE–ca.265 BCE) is one of the most prominent
mathematicians of antiquity. He is best known for The Elements, his treatise
consisting of thirteen books: six on plane geometry, four on number theory,
and three on solid geometry. The Elements has been used for over two thou-
sand years, which must make Euclid the leading mathematics teacher of all
time. We do not know much about Euclid himself other than that he taught in
Alexandria, Egypt around 270 BCE. We quote from Sir Thomas Heath [16],
the great translator and commentator on The Elements.
Eight hundred years after Euclid, Proclus (412 CE–485 CE) wrote:
Not much younger than these .pupils of Plato/ is Euclid, who put to-
gether The Elements, collecting many of Eudoxus’s theorems, perfecting
many of Theaetetus’s, and also bringing to irrefragable demonstration
the things which were only somewhat loosely proved by his predecessors.
This man lived in the time of the first Ptolemy .323 BCE 283 BCE /. For
Archimedes, who came immediately after the first Ptolemy makes men-
tion of EuclidI and further they say that Ptolemy once asked him if there
were a shorter way to study geometry than The Elements, to which he
replied that there was no royal road to geometry. He is therefore younger
than Plato’s circle, but older than Eratosthenes and ArchimedesI for
these were contemporaries, as Eratosthenes somewhere says.
The Elements is remarkable for the clarity with which its theorems are stated
and proved. The standard of rigor was a goal (rarely achieved!) for the inven-
tors of calculus centuries later. As Heath writes in the preface to the second
edition of his translation [16] of The Elements,
i i
“book2” — 2013/5/24 — 8:18 — page 21 — #41
i i
1.3 Euclid 21
More than one thousand editions of The Elements have been published since
it was first printed in 1482. In the Encyclopedia Britannica, van der Waerden
wrote,
Almost from the time of its writing and lasting almost to the present,
The Elements has exerted a continuous and major influence on human
affairs. It was the primary source of geometric reasoning, theorems, and
methods at least until the advent of non-Euclidean geometry in the 19th
century. It is sometimes said that, next to the Bible, The Elements may
be the most translated, published, and studied of all the books produced
in the Western world.
Definition. The Least Integer Axiom (often called the Well-Ordering Axiom)
states that every nonempty collection C of natural numbers contains a smallest Note that the set of positive
element; that is, there is a number c0 2 C with c0 c for all c 2 C . rationals QC does not
satisfy an analogous
property: the nonempty
This axiom is surely plausible. If 0 2 C , then c0 D 0. If 0 … C and 1 2 C , subset fx 2 QC W x 2 > 2g
then c0 D 1. If 0; 1 … C and 2 2 C , then c0 D 2. Since C is not empty, you contains no smallest
will eventually bump into C , and c0 is the first number you’ll meet. element.
We now define some familiar terms.
i i
“book2” — 2013/5/24 — 8:18 — page 22 — #42
i i
Inequalities are discussed Proof. There is a positive integer c with b D ca; note that 1 c, for 1 is
in Appendix A. 4. the smallest positive integer. Multiplying by the positive number a, we have
a ac D b.
The first few primes are 2, 3, 5, 7, 11, 13; : : : : We will soon see that there
are infinitely many primes.
The reason we do not consider 1 to be a prime is that theorems about primes
would then require special cases treating the behavior of 1. For example, we
will prove later that every positive integer a 2 has exactly one factorization
of the form a D p1 p2 p t , where p1 p2 p t are primes. This
statement would be more complicated if we allowed 1 to be a prime.
b=a D q C r=a;
becomes 22 D 4 5 C 2.
b = 4a + r
A A A A
a a a a r
B
b
i i
“book2” — 2013/5/24 — 8:18 — page 23 — #43
i i
1.3 Euclid 23
b D qa C r and 0 r < a:
The hypothesis of Theo-
Proof. We first prove that q and r exist; afterward, we’ll prove their unique- rem 1.15 can be weakened
ness. to a; b 2 Z and a ¤ 0; the
inequalities for the remain-
If b < a, set q D 0 and r D b. Now b D qa C r D 0 a C b, while
der now read 0 r < jaj.
0 b < a. Hence, we may assume that b a; that is, b a 0. Consider the
sequence b > b a > b 2a > b 3a > . There must be an integer q 1
with b qa 0 and b .q C1/a < 0 (this is just Infinite Descent, described on
page 12; in more down-to-earth language, there can be at most b steps before
this sequence becomes negative). If we define r D b qa, then b D qa C r . If a b, the quotient q is
We also have the desired inequalities. Clearly, 0 r . If r D b qa a, then the largest multiple qa with
qa b. This is very much
b qa a 0; that is, b .q C 1/a 0, contradicting the definition of q.
the way young children are
Let’s prove uniqueness. If there are integers Q and R with b D Qa C R taught to find the integer
and 0 R < a, then qa C r D b D Qa C R and quotient in division when a
and b are small.
.Q q/a D r R:
How to Think About It. We have been trained to regard the quotient q as
more important than the remainder; r is just the little bit left over. But our
viewpoint now is just the reverse. Given a and b, the important question for us
is whether a is a divisor of b. The remainder is the obstruction: a j b if and
only if r D 0. This will be a common strategy: to see whether a j b, use the
Division Algorithm to get b D qa C r , and then try to show that r D 0.
The next result shows that there is no largest prime. The proof shows, given
any finite set of primes, that there always exists another one.
Proof. .Euclid/ Suppose, on the contrary, that there are only finitely many
primes. If p1 ; p2; : : : ; pk is the complete list of all the primes, define
M D .p1 pk / C 1:
i i
“book2” — 2013/5/24 — 8:18 — page 24 — #44
i i
sa C tb;
i i
“book2” — 2013/5/24 — 8:18 — page 25 — #45
i i
1.3 Euclid 25
Proof. We may assume that at least one of a and b is not zero (otherwise,
the gcd is 0 and the result is obvious). Consider the set I of all the linear
combinations of a and b:
I D fsa C tb W s; t 2 Zg:
Both a and b are in I (take s D 1 and t D 0 or vice versa). It follows that The proof of Theorem 1.19
I contains positive integers (if a < 0, then a is positive, and I contains contains an idea that
will be used again, as in
a D . 1/a C 0b); hence, the set C of all those positive integers lying in I is
Exercise 1.49 on page 30.
nonempty. By the Least Integer Axiom, C contains a smallest positive integer,
say, d ; we claim that d is the gcd. In other words, d is the
Since d 2 I , it is a linear combination of a and b: there are integers s and smallest positive linear
combination of a and b.
t with
d D sa C tb:
We’ll show that d is a common divisor by trying to divide each of a and b
by d . The Division Algorithm gives integers q and r with a D qd C r , where
0 r < d . If r > 0, then
r Da qd D a q.sa C tb/ D .1 qs/a C . qt/b 2 C;
contradicting d being the smallest element of C . Hence r D 0 and d j a; a
similar argument shows that d j b.
Finally, if c is a common divisor of a and b, then Exercise 1.46 on page 29
shows that c divides every linear combination of a and b; in particular, c j d .
By Lemma 1.13, we have c d .
If d D gcd.a; b/ and if c is a common divisor of a and b, then c d , by
Lemma 1.13. The next corollary shows that more is true: c is a divisor of d ;
that is, c j d for every common divisor c.
Corollary 1.20. Let a and b be integers. A nonnegative common divisor d is In some treatments of num-
their gcd if and only if c j d for every common divisor c of a and b. ber theory, Corollary 1.20
is taken as the definition
of gcd. Later, we will want
Proof. Necessity (the implication )). We showed that any common divisor of
to define greatest common
a and b divides gcd.a; b/ at the end of the proof of Theorem 1.19. divisor in other algebraic
Sufficiency (the implication (). Let d D gcd.a; b/, and let D 0 be a structures. It often will not
common divisor of a and b with c j D for every common divisor c of a and make sense to say that one
b. Now D is a common divisor, so that d j D, by hypothesis; hence, d D, element of such a structure
is greater than another, but
by Lemma 1.13. But the definition of gcd (d is the greatest common divisor)
it will make sense to say
gives D d , and so D D d . that one element divides
another. Corollary 1.20
The next theorem is of great interest: not only is it very useful, but it also will allow us to extend the
characterizes prime numbers. notion of gcd.
Proof. ()): Suppose that p j ab and that p − a; that is, p does not divide a;
we must show that p j b. Since gcd.p; a/ D 1 (by Lemma 1.17), Theorem 1.19
gives integers s and t with 1 D sp C ta. Hence,
b D spb C tab:
i i
“book2” — 2013/5/24 — 8:18 — page 26 — #46
i i
Thus, a and b are relatively prime if their only common divisors are ˙1.
For example, 2 and 3 are relatively prime, as are 8 and 15.
Here is a generalization of Euclid’s Lemma having the same proof.
Corollary 1.22. Let a, b, and c be integers. If c and a are relatively prime and
c j ab, then c j b.
Definition. An expression a=b for a rational number (where a and b are inte-
gers and b ¤ 0) is in lowest terms if a and b are relatively prime.
i i
“book2” — 2013/5/24 — 8:18 — page 27 — #47
i i
1.3 Euclid 27
a a0 d a0
Proof. If d D gcd.a; b/, then a D a0 d , b D b 0 d , and D 0 D 0 . But
b bd b
a b
a0 D and b 0 D , so gcd.a0 ; b 0 / D 1 by Lemma 1.23.
d d
Since the side lengths are integers, h is rational, say h D m=`, and we may
assume that it is in lowest terms; that is, gcd.m; `/ D 1. Cross multiply:
Proof. Suppose, on the contrary, that .a=b/2 D 2. We may assume that a=b is An indirect proof or proof
in lowest terms; that is, gcd.a; b/ D 1. Since a2 D 2b 2, Euclid’s Lemma gives by contradiction has the
following structure. We
2 j a, and so 2m D a. Hence, 4m2 D a2 D 2b 2, and 2m2 D b 2. Euclid’s
assume that the desired
Lemma now gives 2 j b, contradicting gcd.a; b/ D 1. statement is false and
reach a contradiction. We
conclude that the original
It follows that the legs of a Pythagorean triple .a; b; c/ cannot be equal, for statement must be true.
if a D b, then a2 C a2 D c 2 , which implies that 2 D .c=a/2 .
i i
“book2” — 2013/5/24 — 8:18 — page 28 — #48
i i
p
Proposition 1.26 is often stated as “ 2 is irrational,”p
which is a stronger
statement than what we’ve just proved. We can assert that 2 is irrational only
if we further assume that there exists a number u with u2 D 2.
Our proof can be made more elementary; we need assume only that at least
one of a; b is odd. Also, see Exercise 1.75 on page 41.
Historical Note. The ancient Greeks defined number to mean “positive inte-
ger.” Rationals were not viewed as numbers but, rather, as ways of comparing
two lengths. They called two segments of lengths a and b commensurable
if there is a third segment of
p length c with a D mc and b D nc for posi-
tive integers m and n. That 2 is irrational was a shock to the Pythagoreans
(ca. 500 BCE); given a square with p sides of length 1, its diagonal and side
are not commensurable; that is, 2 cannot be defined in terms of numbers
(positive integers) alone. Thus, there is no numerical solution to the equation
To bridge the gap between x 2 D 2, but there is a geometric solution.
numbers and geometric By the time of Euclid, around 270 BCE, this problem had been resolved by
magnitudes, Eudoxus
splitting mathematics into two disciplines: number theory and geometry.
(408 BCE–355 BCE) intro-
duced the sophisticated In ancient Greece, algebra as we know it did not really exist. Euclid and
notion of proportions (this the Greek mathematicians did geometric algebra. For simple ideas, e.g.,
idea, discussed in The Ele- .a C b/2 D a2 C 2ab C b 2 or completing the square, geometry clarifies al-
ments, is equivalent to our gebraic formulas (for example, see the right-hand part of Figure 1.2 on page 2
contemporary definition of
without the dashed lines). For more difficult ideas, say equations of higher de-
real numbers).
gree, the geometric figures involved are very complicated, so that geometry is
no longer clarifying. As van der Waerden writes in [34], p. 266,
one has to be a mathematician of genius, thoroughly versed in trans-
forming proportions with the aid of geometric figures, to obtain results
by this extremely cumbersome method. Anyone can use our algebraic
notation, but only a gifted mathematician can deal with the Greek theory
of proportions and with geometric algebra.
The problem of defining number has arisen several times since the classical
Greek era. Mathematicians had to deal with negative numbers and with com-
plex numbers in the 1500s after the discovery of the Cubic Formula, because
that formula often gives real roots of a cubic polynomial, even integer roots, in
unrecognizable form (see Chapter 3). The definition of real numbers generally
accepted today dates from the late 1800s. But there are echos of ancient Athens
in our time. Kronecker (1823–1891) wrote,
Die ganzen Zahlen hat der liebe Gott gemacht, alles andere ist Men-
schenwerk. (God created the integers; everything else is the work of
Man.)
Even today some logicians argue for a new definition of number.
Exercises
1.37 True or false, with reasons. Of course, it is important to get the right answer, but
most attention should be paid to your reasoning.
(i) 6 j 2. Answer: False. (ii) 2 j 6. Answer: True.
(iii) 6 j 0. Answer: True. (iv) 0 j 6. Answer: False.
(v) 0 j 0. Answer: True.
i i
“book2” — 2013/5/24 — 8:18 — page 29 — #49
i i
1.3 Euclid 29
(iv) gcd.a; a/ D a
(v) gcd.a; b/ D gcd.b; a/
(vi) gcd.a; 1/ D 1
(vii) gcd.a; b/ D gcd. a; b/
(viii) gcd.a; 2b/ D 2 gcd.a; b/
1.40 * If x is a real number, let bxc denote the greatest integer n with n x. (For
example, 3 D bc and 5 D b5c.) If q is the quotient in Theorem 1.15, show that
q D bb=ac.
1.41 *
(i) Given integers a and b (possibly negative) with a ¤ 0, prove that there exist
unique integers q and r with b D qa C r and 0 r < jaj.
Hint: Use the portion of the Division Algorithm that has already been proved.
(ii) If b and a are positive integers, do b and b have the same remainder after
dividing by a? Answer: No.
1.42 For each of the following pairs a; b, find the largest nonnegative integer n with
n b=a < n C 1.
(i) a D 4 and b D 5. Answer: n D 1.
(ii) a D 5 and b D 4. Answer: n D 0.
(iii) a D 16 and b D 36. Answer: n D 2.
(iv) a D 36 and b D 124. Answer: n D 3.
(v) a D 124 and b D 1028. Answer: n D 7.
1.43 Let p1 ; p2 ; p3 ; : : : be the list of the primes in ascending order: p1 D 2, p2 D 3,
p3 D 5, and so forth. Define fk D 1 C p1 p2 pk for k 1. Find the smallest
k for which fk is not a prime.
Hint: 19 j f7 , but 7 is not the smallest k.
1.44 What can you say about two integers a and b with the property that a j b and
b j a? What if both a and b are positive?
1.45 * Show that if a is positive and a j b, then gcd.a; b/ D a. Why do we assume
that a is positive?
1.46 *.Two Out of Three/: Suppose that m, n, and q are integers and m D n C q. If c
is an integer that divides any two of m; n; q, show that c divides the third one as
well.
i i
“book2” — 2013/5/24 — 8:18 — page 30 — #50
i i
1.47 *
(i) For each a and b, give the smallest positive integer d that can be written as
Allow for positive and sa C t b for integers s and t :
negative values of s and t.
a D 12 and b D 16. Answer: d D 4.
a D 12 and b D 17. Answer: d D 1.
a D 12 and b D 36. Answer: d D 12.
a D 0 and b D 4. Answer: d D 4.
a D 4 and b D 16. Answer: d D 4.
a D 16 and b D 36. Answer: d D 4.
a D 36 and b D 124. Answer: d D 4.
a D 124 and b D 1028. Answer: d D 4.
(ii) How is “smallest positive integer d expressible as sa C t b” related to a and b
in each case? Is d a divisor of both a and b?
1.48 * Show that the set of all linear combinations of two integers is precisely the set
of all multiples of their gcd.
1.49 * Let I be a subset of Z such that
(i) 0 2 I
(ii) if a; b 2 I , then a b2I
(iii) if a 2 I and q 2 Z, then qa 2 I .
Prove that there is a nonnegative integer d 2 I with I consisting precisely of all
the multiples of d .
1.50 How might one define the gcd.a; b; c/ of three integers? When applied to a prim-
itive Pythagorean triple .a; b; c/, your definition should say that gcd.a; b; c/ D 1.
Euclidean Algorithm
Our discussion of gcd’s is incomplete. What is gcd.12327; 2409/? To ask the
question another way, is the expression 2409/12327 in lowest terms? The next
result enables us to compute gcd’s efficiently. We first prove another lemma
from Greek times.
Example 1.28. In this example, we will abbreviate gcd.b; a/ to .b; a/. Com-
puting .b; a/ is simple when a and b are small. If b a, then Lemma 1.27
i i
“book2” — 2013/5/24 — 8:18 — page 31 — #51
i i
1.3 Euclid 31
.326; 78/ D .248; 78/ D .170; 78/ D .92; 78/ D .14; 78/:
So far, we have been subtracting 78 from the other larger numbers. At this
point, we now start subtracting 14 (this is the reciprocal, direction-changing,
aspect of antanairesis), for 78 > 14.
.78; 14/ D .64; 14/ D .50; 14/ D .36; 14/ D .22; 14/ D .8; 14/:
Change direction once again to get .8; 6/ D .2; 6/, and change direction one
last time to get
326 D 4 78 C 14:
There are then five subtractions in the passage from .78; 14/ to .8; 14/; the
Division Algorithm expresses this as
78 D 5 14 C 8:
14 D 1 8 C 6:
8 D 1 6 C 2;
6 D 3 2: N
i i
“book2” — 2013/5/24 — 8:18 — page 32 — #52
i i
b D r0 D q1 a C r2 ; r2 < a
a D r1 D q2 r2 C r3 ; r3 < r2
r2 D q3 r3 C r4 ; r4 < r3
:: ::
: :
rn 3 D qn 2 rn 2 C rn 1; rn 1 < rn 2
rn 2 D qn 1 rn 1 C rn ; rn < rn 1
rn 1 D qn rn
(remember that all qj and rj are explicitly known from the Division Algo-
Lamé (1795–1870) proved rithm). There is a last remainder: the procedure stops (by Infinite Descent!)
that the number of steps because the remainders form a strictly decreasing sequence of nonnegative in-
in the Euclidean Algorithm
cannot exceed 5 times
tegers (indeed, the number of steps needed is less than a).
the number of digits in the We now show that the last remainder rn is the gcd.
smaller number (see [26],
p. 49). b D q1 a C r2 ) gcd.a; b/ D gcd.a; r2 /
a D q2 r2 C r3 ) gcd.a; r2 / D gcd.r2 ; r3 /
r2 D q3 r3 C r4 ) gcd.r2 ; r3 / D gcd.r3 ; r4 /
::
:
rn 2 D qn 1 rn 1 C rn ) gcd.rn 2 ; rn 1 /D gcd.rn 1 ; rn /
rn 1 D qn rn ) gcd.rn 1 ; rn / D rn :
All the implications except the last follow from Lemma 1.27. The last one
follows from Exercise 1.45 on page 29.
Let’s rewrite the previous example in the notation of the proof of Theo-
rem 1.29. The passage from one line to the line below it involves moving the
boldface numbers “southwest.”
326 D 4 78 C 14 (1.7)
78 D 5 14 C 8 (1.8)
14 D 1 8 C 6 (1.9)
8D16C2 (1.10)
6 D 3 2:
i i
“book2” — 2013/5/24 — 8:18 — page 33 — #53
i i
1.3 Euclid 33
b D qa C r
a D q 0r C r 0
r D q 00 r 0 C r 00;
r 00 D r q 00 r 0 :
r 00 D .1 q 00 q 0 /r q 00 a D .1 q 00 q 0 /.b qa/ q 00 a:
Thus, r 00 is a linear combination of b and a.
We use the equations to find coefficients s and t expressing 2 as a linear
combination of 326 and 78. Work from the bottom up.
2D8 16 by Eq. (1.10)
D8 1 .14 1 8/ by Eq. (1.9)
D 2 8 1 14
D 2 .78 5 14/ 1 14 by Eq. (1.8)
D 2 78 11 14
D 2 78 11 .326 4 78/ by Eq. (1.7)
D 46 78 11 326:
Thus, s D 46 and t D 11.
How to Think About It. The algorithm produces one pair of coefficients that
works. However, it’s not the only pair. For example, consider gcd.2; 3/ D 1.
A moment’s thought gives s D 1 and t D 1; but another moment’s thought
gives s D 2 and t D 1 (see Exercise 1.57 on page 35). However, the Eu-
clidean Algorithm always produces a specific pair of coefficients; assuming
that no mistakes in arithmetic are made, two people using the algorithm al-
ways come up with the same s and t.
i i
“book2” — 2013/5/24 — 8:18 — page 34 — #54
i i
How to Think About It. In calculating gcd’s with the Euclidean Algorithm,
many students get confused keeping track of the divisors and remainders. We
illustrate one way to organize the steps that has been effective with high school
students. Arrange the steps computing gcd.124; 1028/ as on the left:
8
4 D 36 2 16
124 1028 &
992 3 D 36 2 .124 3 36/
36 124 D 2 124 C 7 36
108 2 &
D 2 124 C 7 .1028 8 124/
16 36
D 7 1028 58 124
32 4
4 16
16
0
The last nonzero remainder is the gcd, so gcd.124; 1028/ D 4. This arrange-
ment can be used to read off coefficients s and t so that 4 D 124s C 128t. Start
at the next to last division and solve for each remainder.
Exercises
1.51 If a and b are positive integers, then gcd.a; b/ D sa C t b. Prove that either s or t
is negative.
1.52 * Use Infinite Descent to prove that every positive integer
p a has a factorization
a D 2k m, where k 0 and m is odd. Now prove that 2 is irrational using this
fact instead of Euclid’s Lemma.
1.53 Prove that if n is squarefree (i.e., n > 1 and n is not divisible by the square of
any prime), then there is no rational number x with x 2 D n.
Hint: Adapt the proof of Proposition 1.26.
1.54 * Assuming there is a real number x with x 3 D 2, prove that x is irrational.
1.55 (i) Find d D gcd.326; 78/, find integers s and t with d D 326s C 78t , and put
the expression 326/78 in lowest terms.
163
Answer: d D 2, s D 11, t D 46, and 39 .
(ii) Find d D gcd.12327;2409/, find integers s and t with d D 12327s C 2409t ,
and put the expression 2409/12327 in lowest terms.
803
Answer: d D 3, s D 299, t D 1530, and 4109 .
(iii) Find d D gcd.7563; 526/, and express d as a linear combination of 7563 and
526.
Answer: d D 1, s D 37, t D 532.
(iv) Find d D gcd.73122; 7404621/ and express d as a linear combination of
73122 and 7404621.
Answer: d D 21, s D 34531, t D 7404621.
i i
“book2” — 2013/5/24 — 8:18 — page 35 — #55
i i
1.3 Euclid 35
i i
“book2” — 2013/5/24 — 8:18 — page 36 — #56
i i
1.65 Consider the calculation of gcd.124; 1028/ on page 34. Show that the integer pairs
Notation. The set of all rational numbers is denoted by Q, and the set of all
real numbers is denoted by R.
We begin by stating some basic properties of real numbers (of course, inte-
gers and rationals are special cases). These properties undergird a great deal of
high school algebra; they are essential for the rest of this book and, indeed, for
abstract algebra.
Functions are discussed in Addition and multiplication are functions R R ! R, namely, .a; b/ 7!
Appendix A.1. a C b and .a; b/ 7! ab. The Laws of Substitution say that if a; a0 ; b; b 0 are
i i
“book2” — 2013/5/24 — 8:18 — page 37 — #57
i i
a C b D a0 C b 0 and ab D a0 b 0 :
The Laws of Substitution are used extensively (usually tacitly) when solving
equations or transforming expressions, and they merely say that addition and
multiplication are single-valued. For example, since 5 C 5 D 0, we have
. 5 C 5/ . 1/ D 0 . 1/ D 0:
i i
“book2” — 2013/5/24 — 8:18 — page 38 — #58
i i
b a D b C . a/;
where a is the negative of a; that is, a is the number which, when added
to a, gives 0.
The word quotient is used here in a different way than in the Division Algo-
rithm, where it is bb=ac, the integer part of b=a (see Exercise 1.40 on page 29).
How to Think About It. Almost all the properties just listed for the set R
of real numbers also hold for the set Z of integers—these properties are “in-
herited” from R because integers are real numbers. The only property that Z
doesn’t inherit is the existence of multiplicative inverses. While every nonzero
integer does have an inverse in R, it may not be an integer; in fact, the only
nonzero integers whose inverses also lie in Z are 1 and 1. There are other
familiar algebraic systems that are more like Z than R in the sense that multi-
plicative inverses may not exist in the system. For example, all polynomials in
one variable with rational coefficients form such a system, but the multiplica-
tive inverse 1=x of x is not a polynomial.
0 a D .0 C 0/a D .0 a/ C .0 a/:
. a/ . 1/ D a:
i i
“book2” — 2013/5/24 — 8:18 — page 39 — #59
i i
In particular,
. 1/ . 1/ D 1:
Addition Rule (iii) states that every real number has a negative, an additive
inverse. Can a number a have more than one negative? Intuition tells us no,
and this can be proved using the nine fundamental properties.
a C .a C b/ D a:
a C .a C b/ D a
. a C a/ C b D a
0Cb D a
b D a:
i i
“book2” — 2013/5/24 — 8:18 — page 40 — #60
i i
. 1/a C a D . 1/a C 1 a D a. 1 C 1/ D a 0 D 0:
Exercises
1.68 (i) Prove the additive cancellation law using only the nine properties: if a; b; c
are real numbers with a C c D b C c, then a D b.
(ii) Prove the multiplicative cancellation law for real numbers using only the nine
properties: if a; b; c are real numbers with ac D bc and c ¤ 0, then a D b.
1.69 Suppose that b ¤ 0. Show that a=b is the unique real number whose product with
b is a.
1.70 (i) Prove that a real number a is a square if and only if a 0.
(ii) Prove that every complex number is a square.
1.71 * Let a; b; c be numbers.
(i) Prove that ac, the negative of ac, is equal to . a/c; that is, ac C. a/c D 0.
(ii) In the proof of Corollary 1.35, we stated that
ab C a. 1/c D ab ac:
Prove this.
Hint: Evaluate a.0 C 0/ in two ways.
1.72 * Suppose that e and f are integers and let m D minfe; f g and M D maxfe; f g.
Show that
m C M D e C f:
i i
“book2” — 2013/5/24 — 8:18 — page 41 — #61
i i
1.5 Connections 41
1.73 *
(i) If a is a positive real number such that an D 1 for an integer n 1, prove
that a D 1.
(ii) If a is a real number such that an D 1 for an integer n 1, prove that
a D ˙1.
1.74 The Post Office has only 5 and 8 cent stamps today. Which denominations of
postage can you buy?
1.75 * Later in this book, we’ll prove Theorem 2.10: every integer can be factored into
primes in essentially only one way. You may use this theorem here.
(i) If a 2 Z, prove that every prime p that divides a2 shows up with even expo-
nent; that is, if p j a2 , then p2 j a2 .
(ii) Show that there are no integers a and b so that 2a2 D b 2 .
(iii) Use part(ii) to show that there is no rational number x with x 2 D 2.
1.76 Use Euclid’s idea of a geometric Division Algorithm (see Figure 1.8 on page 22)
to give a geometric version of the Euclidean Algorithm that uses repeated geo-
metric division. Apply your geometric algorithm to
(i) two segments of length 12 and 90.
(ii) the diagonal and the side of a square.
1.5 Connections
This section applies the method of Diophantus to trigonometry and to calculus.
Trigonometry
The formulas x D .1 t 2 /=.1 C t 2 / and y D 2t=.1 C t 2 /, where t is a
real number, parametrize all the points on the unit circle except . 1; 0/. But
we know that if A D .x; y/ is a point on the unit circle, then x D cos and
y D sin , where D †DOA (see Figure 1.9).
A = (cos q, sin q)
q
O D
i i
“book2” — 2013/5/24 — 8:18 — page 42 — #62
i i
and also infinitely many angles with both cosine and sine irrational (see Exer-
cise 1.29 on page14).
The parametrization of the unit circle in Proposition 1.2,
1 t2 2t
cos D and sin D ; 1 < t < 1;
1 C t2 1 C t2
enables us to prove some trigonometric identities. For example, let’s prove the
identity
1 C cos C sin
D sec C tan :
1 C cos sin
First, rewrite everything in terms of sin and cos . The left-hand side is fine;
the right-hand side is .1= cos / C .sin = cos /. Now replace these by their
formulas in t. The left-hand side is
1 t2 2t
1C 1Ct 2
C 1Ct 2
;
1 t2 2t
1C 1Ct 2 1Ct 2
and this simplifies to a rational function of t (that is, a quotient of two poly-
nomials). Similarly, the right-hand side is also a rational function of t, for
1 1 C t2 2t
sec D D and tan D . Thus, verifying whether the
cos 1 t2 1 t2
trigonometric identity is true is the same thing as verifying whether one ra-
tional expression is equal to another. This problem involves no ingenuity at
all. Just cross multiply and check whether the polynomials on either side are
equal; that is, check whether the monomials on either side having the same
degree have the same coefficients.
Integration
The parametrization of the unit circle is useful for certain integration problems.
In Figure 1.10, we see that 4AOB is isosceles, for two sides are radii; thus, the
B = (cos q, sin q)
q
(–1,0) = A O
i i
“book2” — 2013/5/24 — 8:18 — page 43 — #63
i i
1.5 Connections 43
R Let’s apply this substitution. In most calculus courses, the indefinite integral
sec d D log j sec C tan j is found by some unmotivated trick, but this
integration is quite natural when we use the method of Diophantus.
Z Z Z Z
d 1 C t 2 2 dt 2 dt
sec d D D 2
2
D :
cos 1 t 1Ct 1 t2
Since
2 1 1
D C ;
1 t2 1Ct 1 t
we have
Z Z Z
2 dt dt dt
D C D log j1 C tj log j1 tj:
1 t2 1Ct 1 t
The hard part is now done;
ˇ1 C t ˇ
log j1 C tj log j1 tj D log ˇ ˇ;
1 t
and it is cosmetic to rewrite, using the formula relating t and
1Ct .1 C t/2 1 C 2t C t 2 1 C t2 2t
D D D C D sec C tan :
1 t 1 t2 1 t2 1 t2 1 t2
Other integrands can also be integrated using the tangent half-angle formula
(see Exercise 1.78 below). Similar parametrizations of other conic sections also
lead to integration formulas (see Exercises 1.80–1.82 below and [28, pp. 86–
97]).
Exercises
1.77 Verify the following trigonometric identities.
cos cot
(i) 1 C csc D .
1 sin
1 1
(ii) D 2 cot .
csc cot csc C cot
i i
“book2” — 2013/5/24 — 8:18 — page 44 — #64
i i
1.79 * Preview.
(i) Sketch the graph of x 2 xy C y 2 D 1.
(ii) Find a “sweeping lines” parametrization for the points on the graph of
x 2 xy C y 2 D 1.
(iii) Find a scalene triangle with integer side lengths and a 60ı angle.
1.80 Take It Further.
(i) Find a “sweeping lines” parametrization for the points on the graph of the
parabola x D y 2 , using lines joining A D .0; 0/ to points P D .x; y/ on the
parabola.
Z
dx
(ii) Use this parametrization to evaluate p .
1C x
1.81 Take It Further. Show that a “sweeping lines” parametrization for the points
on the ellipse x 2 =a2 C y 2 =b 2 D 1, using lines joining A D . a; 0/ to points
P D .x; y/ on the ellipse, is
a.b 2 a2 t 2 / 2ab 2 t
xD and :
b 2 C a2 t 2 b2 C a2 t 2
1.82 Take It Further. Show that a “sweeping lines” parametrization for the points on
the hyperbola x 2 =a2 y 2 =b 2 D 1, using lines joining A D . a; 0/ to points
P D .x; y/ on the hyperbola, is
a.b 2 C a2 t 2/ 2ab 2t
xD and y D :
b 2 a2 t 2 b2 a2 t 2
1.83 * Take It Further. Most high school texts derive the quadratic formula by “com-
pleting the square,” a method we’ll discuss and generalize in Chapter 3. Here’s
another way to derive the formula.
(i) Show that if r and s are the roots of x 2 C bx C c D 0, then
r Cs D b and
rs D c
.r s/2 D b 2 4c;
p
so that r s D ˙ b2 4c.
(iii) Solve the system
r Cs D b
p
r s D ˙ b2 4c
for r and s.
i i
“book2” — 2013/5/24 — 8:18 — page 45 — #65
i i
2 Induction
41; 43; 47; 53; 61; 71; 83; 97; 113; 131;
151; 173; 197; 223; 251; 281; 313; 347; 383; 421;
461; 503; 547; 593; 641; 691; 743; 797; 853; 911;
971; 1033; 1097; 1163; 1231; 1301; 1373; 1447; 1523; 1601:
It is tedious, but not very difficult (see Exercise 2.2 on page 52), to show that
every one of these numbers is prime. Inductive reasoning leads you to expect
that all numbers of the form f .n/ are prime. But the next number, f .41/ D
1681, is not prime, for f .41/ D 412 41 C 41 D 412 , which is obviously
composite.
An even more spectacular example of the failure of inductive reasoning is
given by the harmonic series 1 C 21 C 13 C C n1 C , which diverges
(first proved by Oresme (ca.1320–1382)), and so its partial sums get arbitrarily
large. Given a number N , there is a partial sum
Xm
1
†m D D1C 1
2
CC 1
m
nD1
n
45
i i
“book2” — 2013/5/24 — 8:18 — page 46 — #66
i i
46 Chapter 2 Induction
with †m > N . A high school student, unaware of this, playing with his cal-
culator and seeing that †315 6:33137, would probably make the reasonable
guess that †m < 100 for all m. But he’s wrong; the series diverges! It is
known that if m < 1:5 1043 , then †m < 100. The most generous estimate
of the age of the Earth is ten billion (10,000,000,000) years, or 3:65 1012
days, a number insignificant when compared to 1:5 1043 . Therefore, starting
from the Earth’s very first day, if the statement †m < 100 was verified on the
mth day, then there would be today as much evidence of the general truth of
these statements as there is that the Sun will rise tomorrow morning. And yet
most statements †m < 100 are false!
Inductive reasoning is valuable in mathematics, as it is in natural science,
because seeing patterns in data often helps us guess what may be true in general
(see Exercise 2.1 on page 52, for example). However, merely checking whether
the first few (or first few trillion) statements are true is not enough. We have
just seen that checking the first 1:5 1043 statements is inadequate to establish
a general rule.
Let’s now discuss mathematical induction. Suppose we are given a se-
quence of statements
For example, the formula 2n > n for all n 1 can be viewed as the sequence
of statements
Mathematical induction is a technique for proving that all the statements are
true.
The key idea is just this. Imagine a stairway to the sky. We claim that if
its bottom step is white and the next step above any white step is also white,
then all the steps of the stairway are white. Here’s our reasoning. If some steps
aren’t white, walk up to the first non-white step; call it Fido. Now Fido can’t
be at the bottom, for the bottom step is white, and so there is a step just below
Fido. This lower step must be white, because Fido is the first non-white one.
But Fido, being the next step above a white step, must also be white. This is a
contradiction; there is no Fido. All the steps are white.
To sum up, given a list of statements, we are claiming that if
(i) the first statement is true, and
(ii) whenever a statement is true, so is the next one,
then all the statements on the list are true.
Let’s apply this idea to the list of inequalities S.n/W 2n > n. Now S.1/ is
true, for 21 D 2 > 1. Suppose we believe, for every n > 1, that the implication
The symbol ) means 2n 1 > n 1 ) 2n > n is true. Since S.1/ is true and S.1/ ) S.2/ is true,
implies. we have S.2/ true; that is, if 21 > 1 and 21 > 1 ) 22 > 2 are both true,
then 22 > 2. Since 22 > 2 is true and 22 > 2 ) 23 > 3 is true, we have
23 > 3; since 23 > 3 is true and 23 > 3 ) 24 > 4 is true, we have 24 > 4;
and so forth. Mathematical induction replaces the phrase and so forth with
statement (ii), which guarantees, for every n, that there is never an obstruction
in the passage from the truth of any statement S.n 1/ to the truth of the next
one S.n/. We will prove 2n > n for all n 1 in Proposition 2.2.
Here is the formal statement of mathematical induction.
i i
“book2” — 2013/5/24 — 8:18 — page 47 — #67
i i
S.n 1/ W 2n 1
>n 1
is true, then so is S.n/; that is, 2n 1 > n 1 implies 2n > n. Multiply both
sides of the inequality S.n 1/ by 2: if 2n 1
> n 1, then
2n D 2 2n 1
> 2.n 1/ D .n 1/ C .n 1/ .n 1/ C 1 D n
(the last inequality holds because n > 1 implies n 1 1). Thus, if 2n 1 >
n 1 is true, then 2n > n is also true.
Since both the base step and the inductive step hold, Theorem 2.1 says that
all the statements are true: 2n > n for all n 1.
Etymology. The word induction comes from the Latin word meaning to lead
into or to influence. It is used here because, as we have just seen, the truth of
the nth statement arises from the truth of the previous statement.
Usually the base step in an inductive proof occurs when k D 1, although Define S.0/ W 20 > 0.
many proofs occur when k D 0 (see Exercise 2.4 on page 52). Here is an Suppose we had taken
example of an induction whose base step occurs when k D 5. Consider the the base step in Propo-
sition 2.2 at k D 0. Can
statements you write out a proof that
S.0/ ) S.1/?
S.n/ W 2n > n2 :
This is not true for small values of n: if n D 2 or 4, then there is equality, not
inequality; if n D 3, the left side, 8, is smaller than the right side, 9. However,
S.5/ is true: 32 > 25.
i i
“book2” — 2013/5/24 — 8:18 — page 48 — #68
i i
48 Chapter 2 Induction
Proof. We have just checked the base step S.5/. Suppose that n > 5 and that
2n 1
> .n 1/2 : (2.1)
Can we use this to show that 2n > n2 ? Multiply both sides of inequality (2.1)
by 2 to obtain
We’ll be done if we show, for n > 5, that 2.n 1/2 > n2 . Now
Definition. A polygon P in the plane is convex if, for every pair of distinct
points A; B on its perimeter, the line segment AB lies inside of P .
For example, every triangle is convex, but there are quadrilaterals that are
not convex. For example, the shaded quadrilateral in Figure 2.1 is not convex,
for the line segment joining boundary points A and B is not wholly inside it.
B
A
1 C C n D .n 2/180ı:
i i
“book2” — 2013/5/24 — 8:18 — page 49 — #69
i i
V1
V2
q1
q2 V3
D q3
Vn qn P¢
qn–1
qn–2
Vn – 1
Vn – 2
In any proof by induction, we must verify both the base step and the induc-
tive step; verification of only one of them is insufficient. For example, consider
the statements S.n/W n2 D n. The base step, S.1/, is true, but the inductive
step is false; of course, these statements S.n/ are false for all n > 1. Another
example is given by the statements S.n/W n D n C 1. It is easy to see that the
inductive step is true: if S.n 1/ is true, i.e., if n 1 D .n 1/C 1, then adding
1 to both sides gives n D .n 1/ C 2 D n C 1, which is the next statement,
S.n/. But the base step is false; of course, all these statements S.n/ are false.
How to Think About It. When first seeing induction, many people sus-
pect that the inductive step is circular reasoning. Why are you allowed to use
statement S.n 1/, which you don’t know is true, to prove that S.n/ is true?
Isn’t the truth of S.n 1/ essentially what you are supposed to be proving?
A closer analysis shows that this is not at all what is happening. The inductive
step, by itself, does not prove that S.n/ is true. Rather, it says that if S.n 1/
is true, then S.n/ is also true. In other words, the inductive step proves that the
implication “If S.n 1/ is true, then S.n/ is true” is correct. The truth of this
implication is not the same thing as the truth of its conclusion. For example,
consider the two statements: “Your grade on every exam is 100%” and “Your
grade for the course is A.” The implication “If all your exams are perfect, then
you will get the highest grade for the course” is true. Unfortunately, this does
not say it is inevitable that your grade for the course will be A. Here is a math-
ematical example: the implication “If n 1 D n, then n D n C 1” is true, but
the conclusion “n D n C 1” is false.
i i
“book2” — 2013/5/24 — 8:18 — page 50 — #70
i i
50 Chapter 2 Induction
But 12 .n 1/n C n D 1
2
n.n C 1/. By induction, the formula holds for all
n 1.
s D 1 C 2 C C 99 C 100
s D 100 C 99 C C 2 C 1
and add
the sum 101 occurring 100 times. We now solve: s D 21 .100 101/ D 5050.
This argument is valid for any number n in place of 100 (and there is no obvi-
ous use of induction!). Not only does this give a new proof of Proposition 2.5,
it shows how the formula could have been discovered.
Example 2.6. Another proof of the formula in Proposition 2.5 comes from an
analysis of the square in Figure 2.3.
Pn
Figure 2.3. kD1 k.
.n C 1/2 .n C 1/ D n2 C n
unit squares off the diagonal. Half of them, 12 .n2 C n/, are above the diagonal.
But, if you count by rows, there are
1C2CCn
i i
“book2” — 2013/5/24 — 8:18 — page 51 — #71
i i
1 C 2 C C n D 21 .n2 C n/: N
How to Think About It. Proposition 2.5 illustrates a common problem stu-
dents have when learning induction. Induction itself is a technique of proof
(involving just two steps), but it is not a method of discovery. However, the
two notions of proof and discovery are often intertwined. For example, merely
applying mathematical induction, as we did in the proof of Proposition 2.5,
is straightforward. But many beginning students get confused because, at the
same time as they are following the steps of the proof, they are also wonder-
ing where the formula for the sum comes from. In contrast, neither Gauss’s
proof nor the proof using the .n C 1/ .n C 1/ square is confusing, for the
ideas of these proofs and their techniques of proof are separate. In Section 2.3,
we’ll describe a method for introducing mathematical induction to high school
students that usually minimizes this confusion.
Aside from proving statements, induction can also be used to define terms.
For example, here is an inductive definition of factorial. Inductive definitions are
often called recursive
definitions.
Definition. Define 0Š D 1 and, if n > 0, define nŠ D n .n 1/Š. In other
words, nŠ is defined by Defining 0Š D 1 is con-
( venient, as we shall see
1 if n D 0 in the next section when
nŠ D we discuss the Binomial
n .n 1/Š if n > 0: Theorem.
If a D 0, we have defined 00 D 1.
i i
“book2” — 2013/5/24 — 8:18 — page 52 — #72
i i
52 Chapter 2 Induction
Exercises
P
2.1 * Guess a formula for 1 C jnD1 j Šj , and use mathematical induction to prove
that your formula is correct.
p
2.2 * Prove that if m 2 is an integer not divisible by any prime p with p m,
then m is a prime. Use this to prove that the numbers n2 n C 41 are prime for
all n 40.
2.3 * Let m1 ; m2 ; : : : ; mn be integers such that gcd.mi ; mj / D 1 for all i ¤ j . If
each mi divides an integer k, prove that their product m1 m2 mn also divides k.
Hint: Use Exercise 1.58 on page 35.
2.4 * If a is positive, give two proofs that
an 1
1 C a C a2 C C an 1
D ;
a 1
by induction on n 0 and by multiplying the left-hand expression by .a 1/.
i i
“book2” — 2013/5/24 — 8:18 — page 53 — #73
i i
2.5 Let x > 1 be a real number. Prove that .1 C x/n 1 C nx for all n 1.
2.6 What is the smallest value of k so that 2n > n3 for all n k? Why?
2.7 Assuming the product rule for derivatives, .fg/0 D f 0 g C fg0 , prove that
.x n /0 D nx n 1
for all integers n 1:
2.8 In high school, nŠ is usually defined as 1 2 3 n. Show that this agrees with
the definition on page 51 for all n 1.
2.9 (Double Induction) Let k; k 0 be integers, and let S.m; n/ be a doubly indexed
family of statements, one for each pair of integers m k and n k 0 . Suppose
that
(i) S.k; k 0 / is true,
(ii) if S.m 1; k 0 / is true, then S.m; k 0 / is true,
(iii) if S.m; n 1/ is true for all m k, then S.m; n/ is true for all m k.
Prove that S.m; n/ is true for all m k and n k 0 .
2.10 Prove that .m C 1/n > mn for all m; n 1.
2.11 Prove the Laws of Exponents by Double Induction.
Unique Factorization
Induction is useful in number theory. As a simple example, we generalize Eu-
clid’s Lemma to more than two factors.
Proof. The proof is by induction on n 2. The base step is Theorem 1.21. To “The proof is by induction
prove the inductive step, suppose that p j a1 a2 an . We may group the fac- on n 2” not only
indicates the base step, it
tors on the right side together so there are only two factors: .a1 a2 an 1 /an .
also tells which variable
By Theorem 1.21, either p j a1 a2 an 1 or p j an . In the first case, the in- will be changing in the
ductive hypothesis gives p j ai for some i n 1 and we are done. In the inductive step.
second case, p j an , and we are also done.
This proof illustrates an empirical fact. It is not always the case, in an in-
ductive proof, that the base step is very simple. In fact, all possibilities can
occur: both steps can be easy, both can be difficult, or one can be harder than
the other.
Here is an amusing inductive proof (due to Peter Braunfeld) of the existence
of the quotient and remainder in the Division Algorithm.
Proposition 2.9. If a and b are positive integers, then there are integers q
and r with b D qa C r and 0 r < a.
Proof. We do induction on b 1.
The base step: b D 1. Now a 1, because it is a positive integer. If a D 1,
choose q D 1 and r D 0; if a > 1, choose q D 0 and r D 1.
Let’s prove the inductive step. The inductive hypothesis is b 1 D qa C r ,
where 0 r < a. It follows that b D qaCr C1. Now r < a implies r C1 a.
If r C 1 < a, we are done. If r C 1 D a, then b D qa C .r C 1/ D qa C a D
.q C 1/a, and we are done in this case as well.
i i
“book2” — 2013/5/24 — 8:18 — page 54 — #74
i i
54 Chapter 2 Induction
p1 pm D q1 qn
i i
“book2” — 2013/5/24 — 8:18 — page 55 — #75
i i
Proof. The result follows from Proposition 2.13 and Exercise 1.72 on page 40: Notice how a computa-
tional inquiry has given a
mi C Mi D ei C fi ; theorem.
i i
“book2” — 2013/5/24 — 8:18 — page 56 — #76
i i
56 Chapter 2 Induction
6 9
3 4 3 5
1 3
8 5 2 3
4 7 8 3 5 1
5 7 6 8
1 7
5 9 6 7
3 6
4– 12´ 3– 4– 12´ 3–
5 3 2 1 4
12+ 1– 12+ 1–
1 2 3 4 5
2¸ 4– 3 2¸ 4– 3
2 1 4 5 3
60´ 5+ 2¸ 60´ 5+ 2¸
4 5 1 3 2
3 4 5 2 1
The difficulty in solving a KenKen puzzle arises from there being too many
ways to fill in each cage. Sometimes, the Fundamental Theorem of Arithmetic
can help. Let’s start solving the puzzle in Figure 2.5. We view the grid as a
5 5 matrix, and we’ll abbreviate “target-operation” to T-O. Consider the L-
shaped cage consisting of 4 cells whose target operation is 60. There are two
possibilities: its cells are filled with an arrangement either of 2, 2, 3, 5 or of
1, 3, 4, 5. Assume the first possibility holds. Since we cannot have both 2s in
the same row or the same column, one 2 is in position .4; 3/; the other 2 is
either in position .5; 1/ or .5; 2/. Suppose 2 sits in the .5; 1/ position. There is
a cage in the first column with T-O 4 ; its cells must contain 1 and 5. Hence,
the other cage, with T-O 2, must contain 3 and 5; it cannot. Thus, 2 sits in
i i
“book2” — 2013/5/24 — 8:18 — page 57 — #77
i i
position .5; 2/. There is a cage in the second column with T-O 4 , and its cells
must contain 1 and 5. This says that the top two cells in the second column
contain 3 and 4. But the L-shaped cage with T-O 12 must now have 1 in
position .1; 3/. This forces the column one cage, with T-O 4 , to have 5 in
position .1; 1/, because it can’t be 1. Thus, the last cage in the first row cannot
involve 1 or 5. But the only ways to fill in a 2-cell cage with T-O 3 are with
1 and 4 or with 2 and 5. Conclusion: The 4-cell cage with T-O 60 must be an
arrangement of 1, 3, 4, 5. The full solution is given in Figure 2.6. N
Strong Induction
Certain situations call for a variant of induction, called Strong Induction (or
the Second Form of Induction).
How to Think About It. Let’s compare the two forms of induction. Both
start by verifying the base step, and both have an inductive step to prove
S.n/. The inductive hypothesis in the first form is that S.n 1/ is true; the
inductive hypothesis in Strong Induction is that all the preceding statements
S.k/; : : : ; S.n 1/ are true. Thus, Strong Induction has a stronger inductive
hypothesis (actually, each of Theorems 2.1 and 2.16 implies the other).
We are going to prove Theorem 2.16 and Theorem 2.1 simultaneously (we
haven’t yet proved the latter theorem). But first we need an easy technical
remark. The Least Integer Axiom says that every nonempty subset C of the
natural numbers N contains a smallest number; that is, there is some c0 2 C
with c0 c for all c 2 C . This axiom holds, not only for N, but for any subset
Nk D fn 2 Z W n kg
i i
“book2” — 2013/5/24 — 8:18 — page 58 — #78
i i
58 Chapter 2 Induction
Case 1. As S.c0 / is the first false statement, S.c0 1/ must be true. Assuming
inductive step (i), S.c0 / D S ..c0 1/ C 1/ is true, and this is a contradiction.
Case 2. As S.c0 / is the first false statement, all the statements S.`/, where ` is
a predecessor of c0 , are true. Assuming inductive step (ii), the strong version,
we again reach the contradiction that S.c0 / is true.
Proof. The base step S.2/ is true because 2 is a prime. We prove the inductive
step. If n 2 is a prime, we are done. Otherwise, n D ab, where 2 a < n
and 2 b < n. As a and b are predecessors of n, each of them is a product of
primes:
a D pp 0 and b D qq 0 :
The reason why strong induction is more convenient here is that it is more
natural to use S.a/ and S.b/ than to use S.n 1/; indeed, it is not at all clear
how to use S.n 1/.
The next result says that we can always factor out a largest power of 2 from
any integer. Of course, this follows easily from the Fundamental Theorem of
Arithmetic, but we prove the proposition to illustrate further situations in which
strong induction is more appropriate than the first form.
i i
“book2” — 2013/5/24 — 8:18 — page 59 — #79
i i
two cases. If n is odd, take k D 0 and m D n. If n is even, then n D 2b. Why isn’t the first form of
Since b < n, it is a predecessor of n, and so the inductive hypothesis allows us induction convenient here?
to assume b D 2` m, where ` 0 and m is odd. The desired factorization is
n D 2b D 2`C1 m.
To prove uniqueness (induction is not needed here), suppose that 2k m D
n D 2t m0 , where both k and t are nonnegative and both m and m0 are odd.
We may assume that k t. If k > t, then canceling 2t from both sides gives
2k t m D m0 . Since k t > 0, the left side is even while the right side is odd;
this contradiction shows that k D t. We may thus cancel 2k from both sides,
leaving m D m0 .
Exercises
2.12 (i) Prove that an integer a 2 is a perfect square if and only if whenever p is
prime and p j a, then p2 j a.
(ii) Prove that if an integer z 2 is a perfect square and d 4 j z 2 , then d 2 j z.
2.13 Let a and b be relatively prime positive integers. If ab is a perfect square, prove
that both a and b are perfect squares.
2.14 * Let a; b; c; n be positive integers with ab D c n . Prove that if a and b are
relatively prime, then both a and b are nth powers; that is, there are positive
integers k and ` with a D k n and b D `n .
2.15 * For any prime p and any positive integer n, denote the highest power of p
dividing n by Op .n/. That is, Corollary 2.11 guarantees
that Op is well-defined.
Op .n/ D e;
where pe j n but peC1 − n. If m and n are positive integers, prove that
(i) Op .mn/ D Op .m/ C Op .n/
˚
(ii) Op .m C n/ min Op .m/; Op .n/ . When does equality occur?
There is a generalization of Exercise 1.6 on page 6. Using a (tricky) inductive
proof (see FCAA [26], p. 11), we can prove the Inequality of the Means: if n 2
and a1 ; : : : ; an are positive numbers, then
p
n
a1 an n1 .a1 C C an /:
2.16 (i) Using the Inequality of the Means for n D 3, prove, for all triangles having a
given perimeter, that the equilateral triangle has the largest area.
Hint: Use Heron’s Formula for the area A of a triangle with sides of lengths
a; b; c: if the semiperimeter is s D 21 .a C b C c/, then
i i
“book2” — 2013/5/24 — 8:18 — page 60 — #80
i i
60 Chapter 2 Induction
Differential Equations
You may have seen differential equations in other courses. If not, don’t worry;
the next example is self-contained.
so that
an an 1 n
xn 1
D x 1
for all n 1I
.n 1/Š .n 1/Š
y.x/ D a0 .1 C x C 12 x 2 C C 1 n
nŠ
x C / D a0 e x :
It is true that an D a0 for all n; one proof is by induction (see Exercise 2.18
on page 62). N
i i
“book2” — 2013/5/24 — 8:18 — page 61 — #81
i i
y 00 D y 0 C 2y (2.3)
an D an 1 C 2an 2:
Tabulating an for a few values shows a pattern. All the outputs an seem to be
1 away from a power of 2, either 1 more or 1 less. The first two entries record
the initial conditions.
n an
0 2
1 1
2 5D1C22
3 7D5C21
4 17 D 7 C 2 5
5 31 D 17 C 2 7
6 65
7 127
8 257
9 511
10 1025
You can check that e 2x C e x works by substituting it into Eq. (2.3); we have
solved the differential equation. N
i i
“book2” — 2013/5/24 — 8:18 — page 62 — #82
i i
62 Chapter 2 Induction
Proof. Because the definition has two initial values, we need to check two base
steps:
an D an 1 C 2an 2
n 1
D 2 C . 1/n 1
C 2 2n 2
C . 1/n 2
D 2n 1
C . 1/n 1
C 2 2n
C 2 . 1/n 2 2
D 2n 1 C . 1/n 1 C 2n 1 C 2 . 1/n 2
D 2n 1 C 2n 1 C . 1/n 1 C 2 . 1/n 2
D 2 2n 1
C . 1/n 2
. 1 C 2/
n n 2
D 2 C . 1/
D 2n C . 1/n :
Exercises
2.18 * Complete the discussion in Example 2.20: show that if
a2 2 an n
y.x/ D a0 C a1 x C x C C x C
2Š nŠ
and y 0 D y, then an D a0 .
2.19 Assume that “term-by-term” differentiation holds for power series: if f .x/ D
c0 C c1 x C c2 x 2 C C cn x n C , then the power series for the derivative
f 0 .x/ is
i i
“book2” — 2013/5/24 — 8:18 — page 63 — #83
i i
2y 00 y0 3y D 0:
Answer: y D 51 e x C 54 e3x=2 .
.1 C x/n D c0 C c1x C c2 x 2 C C cn x n :
The binomial coefficient nr is pronounced “n choose r ” because it also
arises in counting problems, as we shall soon see. Thus,
n
!
X n r
n
.1 C x/ D x :
r D0
r
For example,
.1 C x/0 D 1
.1 C x/1 D 1 C 1x
.1 C x/2 D 1 C 2x C 1x 2
.1 C x/3 D 1 C 3x C 3x 2 C 1x 3
.1 C x/4 D 1 C 4x C 6x 2 C 4x 3 C 1x 4:
i i
“book2” — 2013/5/24 — 8:18 — page 64 — #84
i i
64 Chapter 2 Induction
1
1 1
1 2 1
1 3 3 1
1 4 6 4 1
1 5 10 10 5 1
1 6 15 20 15 6 1
1 7 21 35 35 21 7 1
Proof. The nth row of Pascal’s triangle is the coefficient list for .1 C x/n . The
fact that the constant term and the highest degree term have coefficient 1 is
Exercise 2.21 on page 67. For the inside terms, we claim that the coefficient of
You can also prove x r in .1 C x/n is the sum of two neighboring coefficients in .1 C x/n 1 . More
Lemma 2.23 by induc- precisely, we claim that if
tion. See Exercise 2.22 on
page 67. .1 C x/n 1
D c0 C c1 x C c2 x 2 C C cn 1x
n 1
;
and 0 < r < n, then the coefficient of x r in .1 C x/n is cr 1 C cr . We have
.1 C x/n D .1 C x/.1 C x/n 1
D .1 C x/n 1
C x.1 C x/n 1
n 1 n 1
D .c0 C C cn 1x / C x.c0 C C cn 1x /
n 1 2 n
D .c0 C C cn 1x / C .c0 x C c1x C C cn 1x /
2
D 1 C .c0 C c1 /x C .c1 C c2 /x C :
Thus nr D cr 1 C cr D nr 11 C n r 1 .
Pascal’s triangle was known centuries before Pascal’s birth; Figure 2.7 shows
a Chinese scroll from the year 1303 depicting it. Pascal’s contribution (around
1650) is a formula for the binomial coefficients.
i i
“book2” — 2013/5/24 — 8:18 — page 65 — #85
i i
For the inductive step, note first that the formula holds when r D 0 and Here is another place
r D n: showing that defining
! 0Š D 1 is convenient.
n nŠ
D1D
0 0Š.n 0/Š
and
!
n nŠ
D1D :
n nŠ 0Š
i i
“book2” — 2013/5/24 — 8:18 — page 66 — #86
i i
66 Chapter 2 Induction
Theorem 2.25 (Binomial Theorem). (i) For all real numbers x and all in-
tegers n 0,
n
X n
X
n nŠ
.1 C x/n D xr D xr :
r D0
r r D0
r Š.n r /Š
i i
“book2” — 2013/5/24 — 8:18 — page 67 — #87
i i
Example 2.27. The Binomial Theorem can be used to express the sum of the
nth powers of two variables a and b in terms of the “elementary symmetric
functions” a C b and ab. Here are some examples for n D 2; 3; 4; from
.a C b/2 D a2 C 2ab C b 2
we have
a2 C b 2 D .a C b/2 2ab:
From
we conclude
For n D 4,
Hence,
Exercises
2.21 * Show, without using the Binomial Theorem, that, if n 0 is an integer, then
(i) the degree of .1 C x/n is n
(ii) the leading coefficient of .1 C x/n is 1
(iii) the constant term of .1 C x/n is 1.
2.22 Prove Lemma 2.23 by induction on n 0.
2.23 Prove that the binomial coefficients are symmetric: for all r with 0 r n,
! !
n n
D :
r n r
i i
“book2” — 2013/5/24 — 8:18 — page 68 — #88
i i
68 Chapter 2 Induction
2.24 Find a formula for the sum of the entries in the nth row of Pascal’s triangle and
prove your assertion.
2.25 If n 1, find a formula for the alternating sum of the binomial coefficients in the
nth row of Pascal’s triangle:
! ! ! !
n n n n n
C C . 1/ :
0 1 2 n
(i) Find a formula for the nth triangular number in terms of binomial coefficients,
and prove your assertion. Compare this exercise with the discussion of the
.n C 1/ .n C 1/ square in Example 2.6.
(ii) Show that the sum of two consecutive triangular numbers is a perfect square.
2.29 Take It Further. Using the notation of Example 2.27, use the Binomial Theorem
and induction to show that an C b n can be expressed in terms of a C b and ab.
2.30 Pascal’s triangle enjoys a sort of hockey stick property: if you start at the end of
any row and draw a hockey stick along a diagonal, as in Figure 2.9, the sum of the
entries on the handle of the stick is the entry at the tip of the blade. Express the
hockey stick property as an identity involving binomial coefficients and prove the
identity.
2.31 (Leibniz) A function f W R ! R is called a C1 -function if it has an nth deriva-
tive f .n/ .x/ for every n 1. Prove that if f and g are C 1 -functions, then
n
!
X n
.n/
.fg/ .x/ D f .k/ .x/ g.n k/ .x/:
k
kD0
i i
“book2” — 2013/5/24 — 8:18 — page 69 — #89
i i
1
1 1
1 2 1
1 3 3 1
1 4 6 4 1
1 5 10 10 5 1
1 6 15 20 15 6 1
1 7 21 35 35 21 7 1
1 8 28 56 70 56 28 8 1
1 9 36 84 126 126 84 36 9 1
1 10 45 120 210 252 210 120 45 10 1
1 11 55 165 330 462 462 330 165 55 11 1
1 12 66 220 495 792 924 792 495 220 66 12 1
1 13 78 286 715 1287 1716 1716 1287 715 286 78 13 1
1 14 91 364 1001 2002 3003 3432 3003 2002 1001 364 91 14 1
1 15 105 455 1365 3003 5005 6435 6435 5005 3003 1365 455 105 15 1
2.32 * If p is a prime and a and b are integers, prove that there is an integer c with
.a C b/p D ap C b p C pc:
Combinatorics
Binomial coefficients have a combinatorial interpretation. Given a set X with
n elements, define an r-subset of X to be a subset having exactly r elements.
How many r -subsets of X are there?
Example 2.28. There are ten 3-element subsets of the 5-element set X D
fA; B; C; D; Eg. Think of forming a 3-person committee from 5 people. A com-
mittee either contains Elvis or doesn’t. The committees are
fA; B; C g fA; B; Dg fB; C; Dg fA; C; Dg
fA; B; Eg fA; C; Eg fA; D; Eg fB; C; Eg fB; D; Eg fC; D; Eg
The first row consists of the 3-subsets that don’t contain Elvis (there are four
such); the second row displays the 3-subsets that do contain Elvis (there are
six of these). N
i i
“book2” — 2013/5/24 — 8:18 — page 70 — #90
i i
70 Chapter 2 Induction
Case 1. If Elvis is in your r -subset, then you must pick r 1 elements from
the remaining n 1; by definition, there are Œn 1; r 1 ways to do this.
Case 2. If Elvis is not in your r -subset, then you must pick all r elements from
the remaining n 1; there are Œn 1; r ways to do this.
D Œn 1; r 1 C Œn 1; r inductive hypothesis
D Œn; r Lemma 2.29.
(note that aa, bb, cc are not on this list). How many r -anagrams are there on an
alphabet with n letters? We count the number of such anagrams in two ways.
i i
“book2” — 2013/5/24 — 8:18 — page 71 — #91
i i
(i) There are n choices for the first letter; since no letter is repeated, there are
only n 1 choices for the second letter, only n 2 choices for the third
letter, and so forth. Thus, the number of r -anagrams is
r Š Œn; r :
We conclude that
If you piece together the results of this section, you’ll see that we have
shown that the following ways to define binomial coefficients are all equiva-
lent: starting from any one of them, you can derive the others.
n
Algebraic: r
is the coefficient of x r in the expansion of .1 C x/n .
Pascal:
!
n nŠ
D :
r r Š .n r /Š
n
Combinatorial: r
is the number of r -element subsets of an n-element set.
Inductive:
! (
n 1 if n D 0 or n D r
D n 1 n 1
r r 1 C r if 0 < r < n.
Example 2.31. If you replace the symbols by their definition, Theorem 2.30
says something that is far from obvious: the coefficient of x r in .1 C x/n is
the same as the number of r -element subsets of an n-element set. The proof by
induction of Theorem 2.30 establishes this, but many people are left wondering
if there is a more intuitive reason why the expansion of .1 C x/n contains all
the information about subsets of various sizes from an n-element set.
i i
“book2” — 2013/5/24 — 8:18 — page 72 — #92
i i
72 Chapter 2 Induction
If you were going to multiply out .1 C x/5 by hand, you could view the
calculation like this:
The expansion is carried out by taking one term (1 or x) from each binomial
factor 1 C x, multiplying them together, and then collecting like powers of x.
For example, you could take a “1” from each of the first three binomials and
an x from the last two. That would produce 1 1 1 x x D x 2 . But that’s
not the only way to get an x 2 . You could have taken an x from the first and
third binomials and 1 from the rest. Or an x from the first two binomials and 1
from the last three. Do this in every possible way; the coefficient of x 2 in the
expansion will be the number of ways you can pick two binomials from the set
of five to be “x terms.” And there are precisely 10 D Œ5; 2 ways to do this.
Generalizing, view .1 C x/n as a product of n binomials:
The coefficient of x r in this product is the number of ways you can choose r of
the binomials to be “x terms” (and the rest to be 1). This number is precisely
Œn; r . Hence
n
X
.1 C x/n D Œn; r x r :
r D0
When combined with the definition of binomial coefficients on page 63, this
gives another proof that nr D Œn; r . N
Exercises
2.33 *
(i) For each value of r, 0 r 4, how many r-element subsets of the set
fA; B; C; Dg are there?
(ii) For each value of r, 0 r 5, how many r-element subsets of the set
fA; B; C; D; Eg are there?
2.34 How many subsets (of any size) are there in an n-element set? Prove your asser-
tion.
2.35 Show that
! r
! !
2n X n n
D :
r k r k
kD0
i i
“book2” — 2013/5/24 — 8:18 — page 73 — #93
i i
2.3 Connections 73
2.39 How many ways can you choose two hats from a closet containing 14 different
hats? (One of our friends does not like the phrasing of this exercise. After all,
you can choose two hats with your left hand, with your right hand, with your
teeth, : : : , but we continue the evil tradition.)
2.40 Let D be a collection of ten different dogs, and let C be a collection of ten dif-
ferent cats. Prove that there are the same number of quartets of dogs as there are
sextets of cats.
2.41 (i) What is the coefficient of x 16 in .1 C x/20 ?
(ii) How many ways are there to choose 4 colors from a palette containing paints
of 20 different colors?
2.42 A weekly lottery asks you to select 5 different numbers between 1 and 45. At the
week’s end, 5 such numbers are drawn at random, and you win the jackpot if all
your numbers match the drawn numbers. What is your chance of winning?
The number of selections of 5 numbers is “45 choose 5”, which is 45 5 D
1; 221; 759. The odds against your winning are more than a million to one.
2.3 Connections
An Approach to Induction
Teaching mathematical induction to high school students is often tough. In
particular, many students fall into the trap we described on page 49: in spite
of all our explanations to the contrary, they think that the inductive hypothesis
assumes what it is they are supposed to be proving. In this section, we look at
a well-tested method that avoids this trap.
Suppose you ask a class to come up with a function that agrees with the
table
Input Output
0 4
1 7
2 10
3 13
4 16
5 19
We’ve found that about half a high school class (beginning algebra, say) comes
up with a closed form definition, something like f .n/ D 3nC4; while the other
i i
“book2” — 2013/5/24 — 8:18 — page 74 — #94
i i
74 Chapter 2 Induction
i i
“book2” — 2013/5/24 — 8:18 — page 75 — #95
i i
2.3 Connections 75
n 1, but then crashed when you asked for g.n/. You could show that f and
g are equal at n by the same argument:
g.n/ D g.n 1/ C 3 (this is how g is defined)
D f .n 1/ C 3 (the virtual calculator said so)
D .3 .n 1/ C 4/ C 3 (this is how f is defined)
D 3n C 4 (algebra)
D f .n/ (this is how f is defined).
So, every time f and g are equal at one integer, they are equal at the next
one. Since f and g are equal at 0 (in fact, since they are equal at every integer
between 0 and 254), they are equal at every nonnegative integer.
This argument is the essence of mathematical induction. In the example, it
shows that if two functions f and g are equal at one integer, then they are
equal at the next one. Coupled with the fact that they are equal at 0, it makes
sense that they are equal for all integers greater than or equal to 0; that is,
f .n/ D g.n/ for all nonnegative integers n.
We have seen that induction applies in much more general situations than
this one. But this simple context is quite effective in starting students onto a
path that helps them understand induction.
Fibonacci Sequence
Many interesting investigations in high school center around the following se-
quence, which describes a pattern frequently found in nature and in art.
There are two base steps in the definition: n D 0 and n D 1. The Fibonacci
sequence begins: 0; 1; 1; 2; 3; 5; 8; 13; : : :.
i i
“book2” — 2013/5/24 — 8:18 — page 76 — #96
i i
76 Chapter 2 Induction
The number
D 1:61803 : : :, is called the golden ratio. Since both
and ı
are roots of x 2 x 1, we have
2 D C 1 and ı 2 D ı C 1: (2.4)
So, what’s the connection of the golden ratio to the Fibonacci sequence?
We discovered the closed form for the sequence cn in Proposition 2.22 by
tabulating the first few terms of the sequence and looking for regularity—it
seemed “almost” exponential, off by 1 from a power of 2.
Let’s tabulate the first few ratios of consecutive terms Fn =Fn 1 of the Fi-
bonacci sequence.
F1 =F0 D 1=1 D 1
F2 =F1 D 2=1 D 2
F3 =F2 D 3=2 D 1:5
F4 =F3 D 5=3 D 1:666
F5 =F4 D 8=5 D 1:6
F6 =F5 D 13=8 D 1:625:
If you tabulate a few more ratios (try it), a conjecture emerges—it appears that
the ratio of two consecutive terms in the Fibonacci sequence might converge to
the golden ratio
1:61803 (if the ratios were actually constant, Fn would
be a geometric sequence (why?)). This is, in fact, the case, and you’ll see, in
Exercise 2.50 on page 78, how to refine the conjecture into the statement of the
following theorem (the exercise will also help you develop a method that will
let you find closed forms for many 2-term recurrences).
Theorem 2.32. For all n 0, the nth term of the Fibonacci sequence satisfies
Fn D p1 .
n ı n /;
5
p p
where
D 21 .1 C 5/ and ı D 12 .1 5/:
Proof. We use strong induction because the inductive step involves the formu-
las for both Fn 1 and Fn 2 . The base steps S.0/ and S.1/ are true:
p1 .
0 ı 0/ D 0 D F0
5
p p
p1 .
ı/ D p1 21 .1 C 5/ 1
5 5 2 .1 5/ D 1 D F1 :
If n 2, then
i i
“book2” — 2013/5/24 — 8:18 — page 77 — #97
i i
2.3 Connections 77
Fn D Fn 1 C Fn 2
D p1 .
n 1 ı n 1 / C p1 .
n 2 ı n 2 /
5 5
n 1 n 2
D p1
5
.
C
/ .ı n 1 C ı n 2 /
D p1
n 2 .
C 1/ ı n 2 .ı C 1/
5
D p1
n 2 .
2 / ı n 2 .ı 2 / by Eq. (2.4)
5
D p1 .
n ı n /:
5
Isn’t it
pcurious that the integers Fn are expressed in terms of the irrational
number 5?
p If n D 2, then F2 D 1 D
1
Corollary 2.33. Fn >
n 2
for all integers n 3, where
D 2
1C 5.
0 , and so there is equality,
not inequality.
Proof. The proof is by induction on n 3. The base step S.3/ is true, for
F3 D 2 >
1:618. For the inductive step, we must show that FnC1 >
n 1 .
By the inductive hypothesis,
FnC1 D Fn C Fn 1 >
n 2
C
n 3
D
n 3
.
C 1/ D
n 3 2
D
n 1
:
Exercises
2.43 Show that the following functions agree on all natural numbers.
(
5 if n D 0
f .n/ D 3n C 5 and g.n/ D
g.n 1/ C 3 if n > 0.
2.44 Show that the following two functions agree on all natural numbers.
(
4 if n D 0
f .n/ D 4n and g.n/ D
4g.n 1/ if n > 0.
Find a polynomial function p that agrees with h on all natural numbers, and prove
that your functions are equal on N.
Answer: n2 C n C 4.
2.46 Define the function m inductively:
(
0 if n D 0
m.n/ D
m.n 1/ C n2 if n > 0
Find a polynomial function s that agrees with m on all natural numbers, and prove
that your functions are equal on N.
2n3 C3n2 Cn n.nC1/.2nC1/
Answer: 6 D 6 .
i i
“book2” — 2013/5/24 — 8:18 — page 78 — #98
i i
78 Chapter 2 Induction
f .x/ D x 4 6x 3 C 14x 2 6x C 2
and
(
2 if x D 0
g.x/ D
s.x 1/ C 6x 3 if x > 0.
Fn D Fn 1 C Fn 2 for all n 2;
show that
Fn 1
p
lim D 2 .1 C 5/:
n!1 Fn 1
2.50 * You saw, on page 76, that the Fibonacci sequence seems to be “almost” expo-
nential.
(i) Suppose the Fibonacci sequence actually was exponential: Fn D r n . Show
that r would have to be either
p p
1C 5 1 5
D or ı D :
2 2
(ii) Show that the sequences
n and ın satisfy the recurrence
fn D fn 1 C fn 2:
(iii) If a and b are any real numbers, show that a n C bın satisfies the recurrence
fn D fn 1 C fn 2:
a n C bın D Fn :
2.51 Ms. D’Amato likes to take a different route to work every day. She will quit her
job the day she has to repeat her route. Her home and work are pictured in the grid
of streets in Figure 2.11. If she never backtracks (she only travels north or east),
how many days will she work at her job?
i i
“book2” — 2013/5/24 — 8:18 — page 79 — #99
i i
2.3 Connections 79
2.52 Find a closed form for each of the following functions and prove your assertions.
(
4 nD0
(i) f .n/ D
f .n 1/ C 3 n > 0:
Answer: f .n/ D 3n C 4.
(
4 nD0
(ii) f .n/ D
3f .n 1/ n > 0:
Answer: f .n/ D 4 3n .
˚2 nD0
(iii) f .n/ D 4 nD1
4f .n 1/ 3f .n 2/ n > 1:
Answer: f .n/ D 3n C 1.
˚4 nD0
(iv) f .n/ D 4 nD1
4f .n 1/ 3f .n 2/ n > 1:
Answer: f .n/ D 4.
2.53 Find a closed form for the following function and prove your assertion.
„3 nD0
4 nD1
f .n/ D
14 nD2
4f .n 1/ f .n 2/ 6f .n 3/ n > 2:
2.54 Take It Further. Find or generate a copy of the first 30 rows of Pascal’s triangle.
Color the odd numbers red and the even numbers black. Explain any patterns that
you see. (Alternatively, you can use a spreadsheet to generate the triangle of 0s
and 1s that are the remainders when each entry is divided by 2.) For more on this
exercise, see
ecademy.agnesscott.edu/˜ lriddle/ifs/siertri/Pascalmath.htm
i i
“book2” — 2013/5/24 — 8:18 — page 80 — #100
i i
i i
“book2” — 2013/5/24 — 8:18 — page 81 — #101
i i
3 Renaissance
For centuries, the Western World believed that the high point of civilization
took place from the Greek and Roman eras through the beginning of Chris-
tianity. But this worldview began to change dramatically about five hundred
years ago. The printing press was invented around 1450, by Johannes Guten-
berg, Christopher Columbus landed in North America in 1492, Martin Luther
began the Reformation in 1517, and Nicolas Copernicus published De Revolu-
tionibus in 1530.
Mathematics was also developing. A formula giving the roots of certain
cubic polynomials, similar to the quadratic formula, was discovered by Scipi-
one del Ferro around 1515; by 1545, it was extended to all cubics by Fontana
(Tartaglia) and Cardano. The cubic formula contributed to the change in world-
view that was the essence of the Renaissance, for it was one of the first math-
ematical results not known to the ancients. But its impact on contemporary
mathematics was much deeper, for it introduced complex numbers. As we shall
see, the cubic formula is not as useful for numerical computations as we’d like,
because it often gives roots in unrecognizable form. Its importance, however,
lies in the ideas it generated. Trying to understand the formula, searching for
generalizations of it, and studying questions naturally arising from such en-
deavors, were driving forces in the development of abstract algebra.
In many high school algebra courses today, the complex numbers, usually
denoted by C, are introduced to find the roots of ax 2 Cbx Cc when b 2 4ac <
0. That’s not how it happened. Square roots of negative real numbers occur in
the cubic formula, but not as roots; indeed, in the 16th century, complex roots
would have been considered useless. But complex numbers arose in the middle
of calculations, eventually producing real numbers (we will see this explicitly
in the next section). To understand this phenomenon, mathematicians were
forced to investigate the meaning of number; are complex numbers bona fide
numbers? Are negative numbers bona fide numbers?
Section 3.1 discusses the classical formulas giving the roots of cubic and
quartic polynomials. We will look more carefully at the complex numbers
themselves in Section 3.2. Although initially used in purely algebraic contexts,
C has a rich geometric and analytic structure that, when taken together with its
algebraic properties, can tie together many of the ideas in high school mathe-
matics. Indeed, C finds applications all over mathematics. Section 3.4 uses C
to solve some problems that are especially useful for teachers (and interesting
for all mathematicians). Just as the method of Diophantus was used to create
Pythagorean triples, C can be used to invent problems whose solutions “come
out nice.”
81
i i
“book2” — 2013/5/24 — 8:18 — page 82 — #102
i i
82 Chapter 3 Renaissance
Cubics
The following familiar fact (to be proved in Chapter 6) was known and used
by Renaissance mathematicians, and we will use it in this section.
Proposition 6.15. If r is a root of a polynomial f .x/, then x r is a factor
of f .x/I that is, f .x/ D .x r /g.x/ for some polynomial g.x/.
One of the simplest cubics is f .x/ D x 3 1. Obviously, 1 is root of f , and
so x 3 1 D .x 1/g.x/, where
g.x/ D .x 3 1/=.x 1/ D x 2 C x C 1:
by the quadratic formula. Both ! and ! are called cube roots of unity, for
! 3 D 1 D ! 3 . Note that ! D ! 2 D 1=!. p
We know that a positive number a has two square roots. By convention,
p a
denotes the positive square root, so that the two square
p roots are ˙ a. Any
3
real number a has three cube roots. Byp convention,
p pa denotes the real cube
3 3 2 3
root, so that the three cube roots are a; ! a; ! a. Thus, cube roots of
unity generalize ˙.
The general cubic equation aX 3 C bX 2 C cX C d D 0 can be simplified
by dividing both sides by a; this procedure does not affect the roots, and so
i i
“book2” — 2013/5/24 — 8:18 — page 83 — #103
i i
we may assume that a D 1. Thus, we seek the roots of the cubic polynomial
F .X/ D X 3 C bX 2 C cX C d , where b; c; d 2 R. The change of variable
1
XDx 3
b
Lemma 3.2. Given a pair of numbers M and N , there are .possibly complex/
numbers g and h with g C h D M and gh D N . In fact, g and h are roots of
x2 M x C N .
Proof. We have
.x g/.x h/ D x 2 .g C h/x C gh:
Thus, the roots g; h of f .x/ D x 2 M x C N (which exist, thanks to the
quadratic formula) satisfy the given equations g C h D M and gh D N .
Let’s try to find a general method for solving cubic equations—a method
that doesn’t depend on the specific values of the coefficients—by first solving
a numerical equation.
Consider the polynomial f .x/ D x 3 18x 35. Since the constant term
35 D 5 7, we check whether ˙1; ˙5; ˙7 are roots. It turns out that 5 is a
root and, dividing by x 5, we can find the other two roots by solving the You can check that the
quadratic f .x/=.x 5/ D x 2 C 5x C 7. But we are looking for a general other two roots are
complex. Renaissance
method applicable to other cubics, so let’s pretend we don’t know that 5 is a
mathematicians would
root. have dismissed these as
It’s natural to look for a polynomial identity having the same form as the meaningless. But stay
equation we are trying to solve. Example 2.27 provides one. From tuned—we’ll soon see that
they, too, can be generated
a3 C b 3 D .a C b/3 3ab.a C b/; by the emerging method.
i i
“book2” — 2013/5/24 — 8:18 — page 84 — #104
i i
84 Chapter 3 Renaissance
x2 35x C 216 D 0:
g C h; !g C ! 2 h; and ! 2 g C !h;
p
1
where ! D 2
1 C i 3 is a cube root of unity,
p
3 rC R q 4q 3
g D ; hD ; and R D r2 C :
2 3g 27
u D g C h:
We are led to
g3 C h3 D r
1
gh D 3
q:
1
Cube gh D 3
q, obtaining the pair of equations
g3 C h3 D r
g3 h3 D 1 3
27
q :
x2 C r x 1 3
27
q D 0: (3.1)
i i
“book2” — 2013/5/24 — 8:18 — page 85 — #105
i i
and
q
3
p
1 4 3 1
h D 2
r r2 C 27
q D 2
r R :
Now there are three cube roots of g3 , namely, g, !g, and ! 2 g. Because
of the constraint gh D 31 q, each has a “mate,” namely, q=.3g/ D h,
q=.3!g/ D ! 2 h, and q=.3! 2 g/ D !h. Thus, the roots of f are
g C h; !g C its mate; ! 2 g C its mateI
that is, the roots of f are g C h, !g C ! 2 h, and ! 2 g C !h.
3
p If f .x/ D x
Example 3.4 (Good Example). 15x 126, then q D 15, Alternatively, having found
r D 126, R D 15376, and R D 124. Hence, g3 D 125, so we can take one root to be 6, the
g D 5. Thus, h D q=3g D 1. Therefore, the roots of f are other two roots can be
p p found as the roots of the
6; 5! C ! 2 D 3 C 2i 3; 5! 2 C ! D 3 2i 3: quadratic f .x/=.x 6/ D
x 2 C 6x C 21.
For Renaissance mathematicians, this cubic would have only one root—they
would have ignored the complex roots. N
But things don’t always work out as we expect, as the next surprising ex-
ample shows.
Example 3.5 (Bad Example). The cubic formula may give the roots in unrec-
ognizable form. Let
f .x/ D .x 1/.x 2/.x C 3/ D x 3 7x C 6I
the roots of f are, obviously, 1; 2, and 3. But the cubic formula gives
q q
3 1 400 3 1 400
g D 2 6C 27
and h D 2 6 27
:
The first question is how to compute cube roots of “numbers” of the form
a C bi , where i 2 D 1. Specifically, we want u C iv with
p
.u C iv/3 D 3 C i 109 3 :
i i
“book2” — 2013/5/24 — 8:18 — page 86 — #106
i i
86 Chapter 3 Renaissance
Hmm! Perhaps it’s smart to separate terms involving i from honest numbers.
.u C iv/2 D u2 C 2i uv v 2 D a C i bI
u2 v2 D a (3.2)
and
2uv D b: (3.3)
For example, our proof gives a method finding a square root of i . Set a D 0
and b D 1 to obtain
2
i D p1 .1 C i / :
2
We’ll see how to find the
roots of a complex number Alas, this square root success doesn’t lead to a cube root success, although it
in Section 3.3. does give us some confidence that our manipulations may be legitimate.
i i
“book2” — 2013/5/24 — 8:18 — page 87 — #107
i i
You can now appreciate the confusion produced by the cubic formula; a
cloud enveloped our ancestors. First of all, what are these “numbers” a C i b?
Sometimes they can help. Can we trust them to always give us the truth? Is it
true that we can separate terms involving i from those that don’t? When are
two complex numbers equal? Does it make sense to do arithmetic with these
guys? Do they obey the nine properties of arithmetic on page 40 that familiar
numbers do? It took mathematicians about 100 years to become comfortable
with complex numbers, and another 100 years until all was set on a firm foun-
dation.
Quartics
A method for solving fourth degree equations was found by Lodovico Ferrari
in the 1540s, but we present the version given by Descartes in 1637.
Consider the quartic F .X/ D X 4 C bX 3 C cX 2 C dX C e. The change
of variable X D x 41 b yields a simpler polynomial, f .x/ D F .x 41 b/ D
x 4 C qx 2 C r x C s, whose roots give the roots of F : if u is a root of f , then See Exercise 3.6 on
u 41 b is a root of F . Write f as a product of two quadratics: page 89.
Substituting these values for m and ` into the third equation and simplifying
yield a degree 6 polynomial which is a cubic in j 2 (called the resolvent cubic):
.j 2 /3 C 2q.j 2 /2 C .q 2 4s/j 2 r 2:
The cubic formula gives a root j 2 , from which we can determine m and ` and,
hence, the roots of the quartic.
This process is an algorithm that can easily be encoded in a computer al-
gebra system; it is known as the quartic formula. The quartic formula has the
same disadvantage as the cubic formula: even though it gives correct answers,
the values it gives for the roots are usually unrecognizable. But there are some
good examples.
i i
“book2” — 2013/5/24 — 8:18 — page 88 — #108
i i
88 Chapter 3 Renaissance
What’s going on? Why is in our earlier notation, q D 10, r D 0, and s D 1. The quartic formula shows
there a choice for ` and us how to find j; `; m. Since r D 0, we have 2` D j 2 10 D 2m; hence,
j 2 ? If the roots of f are
` D m. But `m D 1, so that either ` D 1 and j 2 D 12 or ` D 1 and j 2 D 8.
˛i for 1 Q i 4, then
f .x/ D i .x ˛i /. A
Taking ` D 1 and j 2 D 12 gives
factorization of f into p p
quadratics arises from
f .x/ D .x 2 C 12x C 1/.x 2 12x C 1/;
grouping these four factors and the quadratic formula gives the four roots of f :
into pairs, and there is p p p p p p p p
no reason why different ˛ D 2 C 3; ˇ D 2 C 3;
D 2 3; ı D 2 3: N
groupings should give
the same quadratics.
Of course, any such
The quadratic formula can be derived in a way similar to the derivations
factorization gives the of the cubic and quartic formulas (in Chapter 1, we derived the formula by
1
same roots of f . completing the square). The change of variable X D x 2 b replaces the
polynomial F .X/ D X CbX Cc with the simpler polynomial f .x/ D x 2 Cq,
2
p
where q D c 14 b 2 ; the roots u D ˙ q of f .x/ give the roots u 12 b of
F . Since the roots of f are
q p
p
uD˙ qD˙ .c 14 b 2 / D ˙ 12 b 2 4c;
the roots of F are our old friends
p p
˙ 21 b 2 4c 12 b D 1
2 b˙ b2 4c :
It is now tempting, as it was for our ancestors, to try to find the roots of the
general quintic F .X/ D X 5 C bX 4 C cX 3 C dX 2 C eX C f and to express
them in a form similar to those for quadratic, cubic, and quartic polynomials;
that is, using only extraction of roots, addition, subtraction, multiplication, and
division (of course, our ancestors hoped to find roots of polynomials of any
degree). They began with the change of variable X D x 51 b to eliminate
the X 4 term. It was natural to expect that some further ingenious substitution
together with the formulas for roots of polynomials of lower degree, analogous
to the resolvent cubic, would yield the roots of F . For almost 300 years, no
such formula was found. But, in 1824, Abel proved that there is no such quintic
formula.
How to Think About It. Abel’s theorem is often misquoted. It says: there is
no formula involving only extraction of roots and the four basic operations of
arithmetic that expresses the roots of the general quintic polynomial in terms
of its coefficients. Succinctly, the general quintic is not solvable by radicals.
But there are other kinds of formulas giving roots of polynomials. For exam-
ple, here is a formula, due to Viète, giving the roots in terms of trigonometric
functions. If f .x/ D x 3 C qx C r has three real
p roots, then its roots are t cos ,3
t cos. C 120ı/, t cos. C 240ı/, where t D 4q=3 and cos.3/ D 4r=t
(there are variations using cosh and sinh when f has complex roots ([26],
p. 445–447)). You may recall Newton’s method giving the roots as limn!1 xn ,
where xnC1 D xn f .xn /=f 0 .xn /. Now some quintic polynomials are solv-
able by radicals; for example, we’ll see in Section 3.3 that x 5 1 is one such.
Another theorem of Abel gives a class of polynomials, of any degree, which
Abel (1802–1829) also are solvable by radicals. Galois, the young wizard who was killed before his
died young. 21st birthday, characterized all the polynomials which are solvable by radi-
cals, greatly generalizing Abel’s theorem. We will look at this more closely in
Chapter 9.
i i
“book2” — 2013/5/24 — 8:18 — page 89 — #109
i i
Exercises
3.1 For each equation, find all roots in R and in C
(i) x 2 2x D 15 (ii) x 2 2x D 16 (iii) x 2 2x D 16
(iv) 6x 2 C x D 15 (v) 6x 2 C x D 16 (vi) 6x 2 C x D 16
(vii) x2 D1 (viii) x3 D1 (ix) x4 D1
(x) x3 D8
3.2 * We know that i satisfies x 2 C 1 D 0 in C (is there another solution?).
(i) Show, for all n 2 Z, that the value of i n is one of 1; i; 1; i .
(ii) Use the Division Algorithm to decide which of the four values i 247 will have.
p
3.3 Let ! D 12 . 1 C i 3/ be a cube root of unity.
(i) Show, for every integer n, the value of ! n is one of 1; !; ! 2 .
(ii) Use the Division Algorithm to decide, for any fixed n, which of the three
values ! n will have.
3.4 Find two numbers whose
(i) sum is 5 and product is 6. (ii) sum is 0 and product is 2.
(iii) sum is 3 and product is 3. (iv) sum is 1 and product is 1.
(v) sum is b and product is c (in terms of(vi)
b and c).
1
3.5 * If F .X / D X 3 C bX 2 C cX C d , show that the change of variable X D x 3b
produces a polynomial f with no quadratic term,
1
f .x/ D F .x 3 b/ D x 3 C qx C r:
be a polynomial of degree n.
1
(a) Show that the change of variable X D x n an 1 produces a polynomial
f with no term of degree n 1,
f .x/ D F .x 1
n an 1 / D x n C qn 2x
n 2
C C q0 :
1
(b) Show that if u is a root of f , then u n an 1 is a root of F:
3.7 Take It Further. Suppose that g and h are complex numbers and
p
! D 12 1Ci 3 :
Show that
i i
“book2” — 2013/5/24 — 8:18 — page 90 — #110
i i
90 Chapter 3 Renaissance
3.8 In Example 3.7, we found the roots of x 4 10x 2 C 1 by factoring it into two
quadratics (which came from taking, in the notation of page 87, ` D 1 and j 2 D
12). Another choice was ` D 1 and j 2 D 8.
(i) Using the alternate choice, get a different factorization of the quartic into
quadratic factors.
(ii) Show that the two factorizations produce the same linear factors.
3.9 The following problem, from an old Chinese text, was solved by Qin Jiushao
(Ch’in Chiu-shao) in 1247. There is a circular castle (see Figure 3.1) whose di-
ameter is unknown; it is provided with four gates, and two lengths out of the
north gate there is a large tree, which is visible from a point six lengths east of the
south gate. What is the length of the diameter? (The answer is a root of a cubic
polynomial.)
T
2 a
N C
r
r
O
E
S 6
i i
“book2” — 2013/5/24 — 8:18 — page 91 — #111
i i
3.16 (i) Show that a rectangle is determined by its perimeter and area.
(ii) Is a rectangular box determined by its volume and surface area? Explain.
3.17 Suppose that ˛, ˇ, and
are roots of the cubic equation x 3 C bx 2 C cx C d D 0. Exercise 3.17 can be done
Find, in terms of b, c, and d , without the cubic formula.
(i) ˛2 C ˇ 2 C
2
(ii) ˛3 C ˇ 3 C
3
(iii) ˛2 ˇ 2
C ˛2 ˇ
2 C ˛ˇ 2
2
3.18 Take It Further.
(i) Suppose that ˛, ˇ, and
are three numbers whose sum is 0. Show that
2 3
.˛ ˇ/.˛
/.ˇ
/ C 4.˛ˇ C ˛
C ˇ
/3 C 27 ˛ˇ
D 0:
3.19 Take It Further. When finding the roots of x 3 C qx C r with the cubic formula,
1 3
you are led to Eq. (3.1): x 2 C rx 27 q , whose roots are g3 and h3 .
(i) Show that the discriminant ı of this quadratic is
ı D r2 C 4 3
27 q
(ii) If ı > 0, show that the cubic has one real root and two complex conjugate
roots. The discriminant of the
cubic x 3 Cqx Cr is defined
(iii) If ı D 0, show that the cubic has two real roots, one of them with multiplic-
to be D 4q 3 27r 2 .
ity 2.
(iv) If ı < 0, show that the cubic has three distinct real roots.
i i
“book2” — 2013/5/24 — 8:18 — page 92 — #112
i i
92 Chapter 3 Renaissance
In Chapter 7, using ideas actual numbers enjoying the simplification rule i 2 D 1. It was during this
of abstract algebra, we’ll time that the terms real and imaginary arose. In this section and the next, we’ll
see that the naive way of
develop complex numbers in a more careful and formal way, and we’ll see that
thinking about complex
numbers, as polynomials in
complex numbers are as real as real numbers!
i obeying the rule i 2 D 1,
can be made precise.
The Complex Plane
When considering expressions of the form a C bi , it is natural to separate the
two summands. Geometry rears its head.
Historical Note. Surprisingly, it took a very long time for people to embrace
the idea of representing the elements of C as points in the plane. It wasn’t until
Wessel presented a paper in 1797 to the Royal Danish Academy of Sciences,
A bijection is a one-to-
entitled On the Analytic Representation of Direction: An Attempt, did this rep-
one correspondence. See
Appendix A.1, page 416, resentation crystallize. Wessel’s discovery was not adopted immediately but,
for the precise definition. by 1830, most mathematicians routinely used the bijection a C bi $ .a; b/
between complex numbers and points of the plane. The complex plane has
gone by other names in its history: for example, Argand Diagram and Gaus-
sian Plane.
Algebraic Operations
In Section 3.1, you saw that mathematicians were forced to add and multiply
complex numbers. However, without precise definitions of the operations or of
the complex numbers themselves, they could not trust many of their results.
The complex plane allows us to resolve the many doubts our ancestors had
about the algebra of complex numbers.
i i
“book2” — 2013/5/24 — 8:18 — page 93 — #113
i i
.a C bi / C .c C d i / D .a C c/ C .b C d /i:
r .a C bi / D ra C r b i;
R = (a+c,b+d )
P = (a,b)
Q = (c,d )
i i
“book2” — 2013/5/24 — 8:18 — page 94 — #114
i i
94 Chapter 3 Renaissance
Proof. The proofs are routine, just reducing each to a familiar statement about
real numbers, and so we’ll only prove the longest such: associativity of addi-
tion. It is clearer if we use ordered pairs.
a C bi D .a; b/
D .a; 0/ C .0; b/
D a.1; 0/ C b.0; 1/:
It follows that the + in the notation a C bi really does mean add and that bi is
the product of b and i ; that is, bi D b.0; 1/ D .0; b/.
The set C of complex numbers has more algebraic structure: any two com-
plex numbers can be multiplied, not just when one of them is real. The defini-
tion arises from pretending that
.a C bi /.c C d i / D ac C ad i C bci C bd i 2
D .ac bd / C i.ad C bc/;
where we have set i 2 D 1. This is precisely what our ancestors did, which
motivates the formal definition. But our definition involves no pretending.
Notice that
i 2 D .0; 1/.0; 1/ D . 1; 0/ D 1;
for ac D 0 D ad D bc.
We are now obliged to prove that the familiar properties of multiplication
actually do hold for complex multiplication.
i i
“book2” — 2013/5/24 — 8:18 — page 95 — #115
i i
Proof. Again, the proofs are routine, for each reduces to a familiar statement
about real numbers. We’ll only prove associativity. As in the proof of Proposi-
tion 3.8, it is clearer if we use ordered pairs.
xa yb D 1 and ya C xb D 0:
The second equation gives x D ay=b; substitute this into the first equation
and obtain
b a
yD and xD :
a2 C b2 a2 C b2
i i
“book2” — 2013/5/24 — 8:18 — page 96 — #116
i i
96 Chapter 3 Renaissance
1 a b
z D i :
a2 C b2 a2 C b2
zDa bi:
Proof. We’ll prove (i), leaving the rest to Exercise 3.25 on page 98.
z C w D .a C c/ C .b C d /i
D .a C c/ .b C d /i
D .a bi / C .c d i /
D z C w:
Using induction, the first two statements in the proposition can be general-
ized:
z1 C C zn D z 1 C C z n
z1 zn D z 1 z n :
The formula for the multiplicative inverse of a complex number can be writ-
ten in terms of conjugates. Informally, cancel z to see that z=zz D 1=z.
1 a bi z
z D 2 2
D :
a Cb zz
i i
“book2” — 2013/5/24 — 8:18 — page 97 — #117
i i
Proof. It’s enough to see that you get 1 if you multiply z by .a bi /=.a2 Cb 2 /.
And so it is:
1 1
z z D z z D 1:
zz zz
It wouldn’t be worth introducing the new term complex conjugation if our
only use of it was to give a neat proof of the formula for reciprocals. The notion
has many other uses as well. For example, if f .x/ D ax 2 C bx C c has real
coefficients, then the quadratic formula implies that whenever z is a complex
root of f , then so is z. In fact, this is true for polynomials of any degree, and Is this true if z is real?
the proof depends only on Proposition 3.10.
f .x/ D a0 C a1 x C C ai x i C C an x n ;
0 D a0 C a1 z C C ai z i C C an z n :
Hence,
0 D 0 D a0 C a1 z C C ai z i C C an z n
D a0 C a1 z C C ai z i C C an z n
D a0 C a1 z C C ai z i C C an z n
D a0 C a1 z C C ai z i C C an z n (because all ai are real)
D f .z/ :
Therefore, z is a root of f .
Exercises
3.20 In Appendix A.4, we considered the subset P of all (strictly) positive real num-
bers; it satisfies: If we define a < b to
mean b a 2 P , then we
if a; b 2 P , then a C b 2 P and ab 2 P ; can prove all the familiar
if r 2 R, then exactly one of the following is true: properties of inequality.
For example, if a < b and
r 2 P; r D 0; or r 2 P: c < 0, then bc < ac. See
page 441.
(i) Using only the two properties of P , prove that if a 2 R, then either a D 0 or
a2 2 P .
(ii) Prove that there is no subset Q C, closed under addition and multiplica-
tion, such that if z 2 C, then exactly one of the following is true:
z 2 Q; z D 0; or z 2 Q:
Conclude that it’s impossible to order the complex numbers in a way that
preserves the basic rules for inequality listed in Proposition A.51.
i i
“book2” — 2013/5/24 — 8:18 — page 98 — #118
i i
98 Chapter 3 Renaissance
3.21 * Suppose that z is a complex number. Generalize Propositions 1.33 and 1.32.
(i) Show that z is unique.
(ii) Show that z D . 1/z.
(iii) If z ¤ 0, show that z 1 is unique.
3.22 *
(i) We may think of real numbers r and s as complex numbers. Show that their
sum r C s and their product rs in C are the same as their sum and product
in R.
(ii) If z is a complex number and r is a real number, you can think of the complex
number rz in two ways: as the product of scalar multiplication, or as the
product of two complex numbers r C 0i and z. Show that the two calculations
give the same result.
3.23 If z 2 C, show that z C z D 2.<z/ and z z D 2.=z/.
3.24 Find a complex number z such that z C z D 14 and zz D 49.
3.25 * Finish the proof of Proposition 3.10. If z and w are complex numbers, prove
(i) zw D z w
(ii) z D z
(iii) z D z if and only if z is a real number
3.26 If z is a complex number and n is a natural number, show that
z n D .z/n :
.3 C 2i /z D 3 C 11i:
i i
“book2” — 2013/5/24 — 8:18 — page 99 — #119
i i
The absolute value of a real number is its distance to the origin, and so we
have just extended the notion of absolute value from R to C. Thus, if z D
a C bi , then jzj is the distance from the point P D .a; b/ to the origin O;
!
equivalently, it is the length of the arrow OP . Because z z D a2 C b 2 , we can
write
p Does this equation hold if
jzj D z z: z is a real number?
i i
“book2” — 2013/5/24 — 8:18 — page 100 — #120
i i
.1; 0/ to P . When we view the point P D .cos ; sin / on the unit circle as a
complex number, it is equal to cos C i sin .
Figure 3.3 shows z D a C bi as the tip of an arrow of length jzj D r . The
!
direction of this arrow is the same as the direction of the unit vector OP having
!
the same direction as z. If is the angle between the x-axis and OP , then the
coordinates of P are jOAj D cos and jAP j D sin .
z = (a, b)
P = (cos q, sin q)
r = |z|
q (1, 0)
O A
arg.z/;
!
is the counterclockwise angle from the positive real axis to OP .
Finding arg.z/ requires In Figure 3.3, we see that the coordinates of P are cos and sin ; that is,
some way of comput- if P D .a; b/, then a D cos and b D sin . Thus, for any nonzero complex
ing values of inverse
number z D a C bi , not necessarily of absolute value 1, the definitions of
trigonometric functions.
Nowadays, we use com-
cosine and sine (in terms of right triangles) give arg.z/ D , where cos D
puters; in earlier times, a=jzj and sin D b=jzj. Note that
tables of values of cosine
and sine were used. Fairly a b
accurate trigonometric z D jzj Ci D jzj .cos C i sin / : (3.4)
jzj jzj
tables were known over
two thousand years ago.
i i
“book2” — 2013/5/24 — 8:18 — page 101 — #121
i i
i i
“book2” — 2013/5/24 — 8:18 — page 102 — #122
i i
W = (r, s)
U = (c, d )
! !
We are proving that per- Lemma 3.16. If W D .r; s/ and U D .c; d /, then the arrows OW and OU
pendicularity is equivalent are perpendicular if and only if r c C sd D 0.
to the dot product being 0.
Proof. We use the Pythagorean Theorem and its converse, Exercise 1.11 on
! !
page 7: OU ? OW if and only if jU W j2 D jOU j2 C jOW j2 . Let h D jU W j;
then
h2 D .r c/2 C .s d /2
D r2 2r c C c 2 C s 2 2sd C d 2
D r 2 C s2 C c2 C d 2 2.r c C sd /:
jOW j2 C jOU j2 D .r 2 C s 2 / C .c 2 C d 2 /
and
In Figure 3.4, the coor-
dinates of U are labeled h2 D r 2 C s 2 C c 2 C d 2 2.r c C sd /:
.c; d /. This lemma shows
that .c; d / D . s; r /. Therefore, h2 D jOU j2 C jOW j2 if and only if r c C sd D 0.
Now for the addition formulas.
We are looking at points Proof. In Figure 3.5, we have a picture of the unit circle. Let Z D .a; b/ D
here as elements of R2 , al- .cos ˛; sin ˛/ and W D .r; s/ D .cos ˇ; sin ˇ/. Rotate 4OQZ counterclock-
though we’ll soon interpret
wise through †ˇ to get 4OQ0 Z 0 , so that 4OQZ and 4OQ0 Z 0 are congru-
this diagram in the complex
plane.
ent. Thus, Z 0 D .cos.˛ C ˇ/; sin.˛ C ˇ// : Our task is to find the coordinates
of Z 0 in terms of r; s; a, and b.
Define U D . s; r /. Since W D .r; s/ is on the unit circle, we have
r 2 C s 2 D 1, and so U D . s; r / is also on the unit circle. Moreover, since
! !
. s/r C r s D 0, Lemma 3.16 says that OU is orthogonal to OW . Therefore,
O Q0 Z 0 M is a rectangle.
!
Decompose OZ 0 as the sum of two vectors:
! ! !
OZ 0 D OQ0 C OM ;
i i
“book2” — 2013/5/24 — 8:18 — page 103 — #123
i i
Z¢ W = (r, s)
Z = (a, b)
Q¢
U = (–s, r)
M a
b
a
O Q = (a, 0)
! ! !
where OM is the projection of Q0 Z 0 onto OU . We can get explicit expres-
! !
sions for Q0 and M . First, OQ0 is a scalar multiple of OW and, because
jOQj D a, we know the scalar:
M D b. s; r / D . bs; br /:
Therefore,
! ! !
OZ 0 D OQ0 C OM D .ar; as/ C . bs; br / D .ar bs; as C br /:
Theorem 3.18 (The Geometry of Multiplication.). If z and w are complex In words, the length of
numbers, then a product is the product
of the lengths, and the
(i) jzwj D jzj jwj, and argument of a product is
the sum of the arguments.
(ii) arg.zw/ D arg.z/ C arg.w/.
The equality in Theo-
rem 3.18(ii) holds up to a
Proof. The first statement is Proposition 3.13, and the second follows from multiple of 2.
Theorem 3.17 and Eq. (3.5) on page 101.
i i
“book2” — 2013/5/24 — 8:18 — page 104 — #124
i i
Proof. Both sides equal jzj .cos ˛ C i sin ˛/ jwj .cos ˇ C i sin ˇ/.
How to Think About It. There’s a way to see, without using trigonometry,
that angles add in the product of two complex numbers. Essentially, we recast
the proof of Theorem 3.17 in terms of complex numbers. Given z D a C bi
and w D r C si , we want to determine arg.zw/ in terms of arg.z/ D ˛ and
arg.w/ D ˇ. We can assume that z and w are unit vectors; this implies that zw
is also a unit vector, by Proposition 3.13. The key insight is that
zw D .a C bi /w D aw C .bi /w D aw C b.iw/:
This approach does not You know the geometric effect of scalar multiplication (Exercise 3.27 on page 98),
seem to be very well you know how to add geometrically (parallelogram law), and you know
known. It appears in [22]
and Kerins, B. “Gauss,
iw D i.r C si / D s C r i I
Pythagoras, and Heron”
(Mathematics Teacher,
96:5, 2003), but we can’t using Lemma 3.16, it follows that iw is obtained from w by counterclock-
find any older sources. wise rotation by 90ı. Figure 3.6 below is almost the same as Figure 3.5; the
difference is that points are now labeled as complex numbers.
zw = aw + b(iw) w = r + si
z = a + bi
aw
u = iw = –s + ri
b(iw) a
b
a
O a
i i
“book2” — 2013/5/24 — 8:18 — page 105 — #125
i i
Hence
cos C D p1 .cos sin /
4 2
and, as a bonus,
sin C D p1 .cos C sin /:
4 2
Over the next two centuries, people became comfortable with the fact that
polynomial equations with real coefficients can have complex solutions. It was
eventually proved that every polynomial f .x/ D x n C cn 1x n 1 C C c1x C
c0 with real coefficients has a factorization
f .x/ D .x ˛1 / .x ˛n /;
where ˛1 ; : : : ; ˛n are complex numbers. This amazing result holds for any
nonzero polynomial f with complex coefficients; it is known as the Funda-
mental Theorem of Algebra. We won’t prove this result here because, in spite
of its name, it is a theorem of analysis, not of algebra; you can find a readable
account in [4], pp. 142–152.
Exercises
!
3.34 If z D a C bi , prove that the arrow corresponding to z, namely OP , where
P D .a; b/, is perpendicular to the arrow corresponding to iz.
3.35 If z and w are complex numbers with w ¤ 0, show that
3.36 (i) Prove that the quadratic formula holds for polynomials with complex coeffi-
cients (use Proposition 3.6).
(ii) Find the roots of x 2 C 2ix 1. Why aren’t these roots conjugate?
3.37 If z and w are complex numbers, find a necessary and sufficient condition that
jz C wj to equal jzj C jwj.
i i
“book2” — 2013/5/24 — 8:18 — page 106 — #126
i i
i i
“book2” — 2013/5/24 — 8:18 — page 107 — #127
i i
Equating real parts and imaginary parts gives the double angle formulas:
Theorem 3.20 (De Moivre). For every angle and all integers n 0,
i i
“book2” — 2013/5/24 — 8:18 — page 108 — #128
i i
Corollary 3.23. Let r .cos Ci sin / be the polar form of a complex number z.
If n 1 is an integer, then
hp in
n
r cos =n C i sin =n D r .cos C i sin / D z:
Of course, we must find Example 3.24. In Example 3.15, we saw that the polar form of z D 3 C 4i is
the polar form of z, approximately 5 .cos.53:13ı / C i sin.53:13
p
ı
//. Now .53:13/=3 D 17:71, and
which involves finding 3
so a cube root of z is approximately 5 .cos.17:71ı / C i sin.17:71ı //. Our
D cos 1 .a=jzj/.
calculator says that
p 3
3
5 cos 17:71ı C i sin 17:71ı D 3:000001 C 3:99999i: N
and
x3 x5 . 1/n 1 x 2nC1
sin x D x C C C :
3Š 5Š .2n C 1/Š
P
We can define convergence of power series 1 n
nD0 cn z for z and cn com-
plex numbers, and we can then show that the series
z2 zn
1CzC CC C
2Š nŠ
converges for every complex number z. The complex exponential e z is defined
to be the sum of this series. In particular, the series for e ix converges for all
real numbers x, and
.ix/2 .ix/n
e ix D 1 C ix C CC C :
2Š nŠ
It is said that Euler was Theorem 3.25 (Euler). For all real numbers x,
delighted by the special
case e ix D cos x C i sin x:
ei C 1 D 0; Sketch of proof. We will not discuss necessary arguments involving conver-
for it contains five important
gence. As n varies over 0; 1; 2; 3; 4; 5; : : :, the powers of i repeat every four
constants in one equation. steps: that is, the sequence
1; i; i 2 ; i 3 ; i 4 ; i 5 ; i 6 ; i 7 ; i 8 ; i 9 ; i 10 ; i 11 ; : : :
is actually
1; i; 1; i; 1; i; 1; i; 1; i; 1; i; : : : I
i i
“book2” — 2013/5/24 — 8:18 — page 109 — #129
i i
the even powers of i are all real, whereas the odd powers all involve i . It
follows, for every real number x, that .ix/n D i n x n takes values
.ix/2 .ix/n
e ix D 1 C ix C CC C ;
2Š nŠ
the even powers of ix do not involve i , whereas the odd powers do. Collecting
terms, one has e ix D even terms C odd terms. But
.ix/2 .ix/4
even terms D 1 C C C
2Š 4Š
x2 x4
D1 C
2Š 4Š
and
.ix/3 .ix/5
odd terms D ix C C C :
3Š 5Š
x3 x5
D i.x C /:
3Š 5Š
Therefore, e ix D cos x C i sin x.
As a consequence of Euler’s Theorem, the polar decomposition can be
rewritten in exponential form: every complex number z has a factorization
z D r e i ;
Theorem 3.26 (Exponential Addition Theorem). For all real numbers x and
y,
e ix e iy D e i.xCy/ :
Corollary 3.27 (Exponential De Moivre). For every real number x and all
integers n 1,
.e ix /n D e i nx :
i i
“book2” — 2013/5/24 — 8:18 — page 110 — #130
i i
e i 3x D cos.3x/ C i sin.3x/:
e i 3x D .e ix /3
3
D cos x C i sin x
D cos3 x C 3i cos2 x sin x C 3i 2 cos x sin2 x C i 3 sin3 x
D cos3 x 3 cos x sin2 x C i 3 cos2 x sin x sin3 x :
and
Roots of Unity
De Moivre’s Theorem can be used to find the roots of an important family of
polynomials: those of the form x n 1.
1; ; 2 ; ; n 1
;
j D q nCr D q n r D r ;
because q n D 1.
On the other hand, all the k , for 0 k n 1, are distinct. After all, by
De Moivre’s Theorem,
and Proposition 3.14, uniqueness of polar forms, applies, for 0 2 k=n < 2
are n distinct angles. Therefore, we have displayed n distinct roots of x n 1.
i i
“book2” — 2013/5/24 — 8:18 — page 111 — #131
i i
These are all the roots of x n 1, for a polynomial of degree n can have at
most n distinct roots. We’ll give a proof of this later (see Theorem 6.16) but,
since we haven’t yet proved this result, we now proceed in a different way.
If z 2 C is a root of x n 1, then 1 D jz n j D jzjn , so that jzj D 1,
by Exercise 1.73 on page 41, and z D cos C i sin for some . By De
Moivre’s Theorem, 1 D z n D cos.n/ C i sin.n/, so that n D 2 k for
some integer k; hence, D 2 k=n. Write k D qn C r , where 0 r < n, and
Definition. The roots of x n 1 are called the nth roots of unity. An nth root
of unity is a primitive nth root of unity if n is the smallest positive integer
for which n D 1.
For every n 1, we see that D e 2 i=n is a primitive nth root of unity, for
if 1 m < n, then m D cos.2 m=n/ C i sin.2 m=n/ ¤ 1. In particular,
i D cos.2=4/
p C i sin.2=4/ is a primitive fourth root of unity, and ! D
1
2
. 1 C 3/ D cos.2=3/ C i sin.2=3/ is a primitive cube root of unity.
Corollary 3.29. Let k D cos.2 k=n/Ci sin.2 k=n/ be an nth root of unity.
(i) k is a primitive nth root of unity if and only if gcd.k; n/ D 1.
(ii) If k is a primitive nth root of unity, then every nth root of unity is a power
of k .
Proof. (i) Suppose that k is a primitive nth root of unity. If d D gcd.k; n/ >
1, then n=d < n, and
. k /n=d D . n /k=d D 1:
m D mskCmt n D msk mt n D 1;
i i
“book2” — 2013/5/24 — 8:18 — page 112 — #132
i i
Corollary 3.30. For every positive integer n, there are exactly .n/ primitive
nth roots of unity.
cos. 2
8
/ C i sin. 2
8
/; cos. 6
8
/ C i sin. 6
8
/;
cos. 10
8
/ C i sin. 10
8
/; cos. 14
8
/ C i sin. 14
8
/I
that is, the primitive 8th roots of unity are all those cos. 2k
8
/ C i sin. 2k
8
/ for
which gcd.k; 8/ D 1. N
-1 1
-i
Figure 3.7. 8th roots of unity.
The nth roots of unity enjoy some remarkable properties that we’ll use in
upcoming chapters; here are some of them. (See Exercise 3.51 on page 115
and Proposition 6.63 for some other interesting properties.)
. 1/.1 C C 2 C C n 1
/ D n 1 D 0:
i i
“book2” — 2013/5/24 — 8:18 — page 113 — #133
i i
Example 3.33. The vertices of unit n-gons for small values of n can be calcu-
lated with plane geometry.
p p
The vertices of the unit 3-gon are 1; 1 . 1 C i 3/; 1 . 1 i 3/: See Exercise 3.49 on
2 2
The vertices of the unit 4-gon are 1; i; 1; i . page 115 for more exam-
ples.
The vertices of the unit 6-gon are
p p p p
1; 21 .1Ci 3/; 12 . 1Ci 3/; 1; 21 . 1 i 3/; 21 .1 i 3/: N
Can we find explicit expressions for these vertices that don’t involve trigonom-
etry? We’ll obtain such an expression for cos.2=5/, but we’ll leave the rest of
the details for you (Exercise 3.49(i) on page 115); after all, you can evaluate,
say, cos.8=5/.
We have D 4 , by Theorem 3.32(ii). Inspired by Lemma 3.2, we define
g D C 4 and h D 2 C 3:
Now
g D C 4 D 2 cos.2=5/;
a real number that is twice the number we are after. Similarly,
h D 2 C 3 D 2 cos.4=5/;
i i
“book2” — 2013/5/24 — 8:18 — page 114 — #134
i i
g C h D C 4 C 2 C 3
D 1:
x2 C x 1:
Now x 2 C x 1 has a positive root and a negative one. Since g > h (why?),
the positive root is g, and so
cos.2=5/ D 21 g
p
D 14 . 1 C 5/: N
Exercises
3.46 Is De Moivre’s Theorem true for negative integer exponents? Explain.
3.47 Let z D cos C i sin . Show, for all nonnegative integers n, that
3.48 This exercise shows that there’s something special about a 72ı angle: there’s only
one isosceles triangle (up to similarity) whose base angle is twice the vertex an-
gle, namely, the “72-72-36 triangle.” Let the equal sides of such a triangle have
length 1, and let q denote the length of the base.
(i) Bisect one of the base angles of the triangle.
(ii) Show that the small triangle is similar to the whole triangle.
1 q
(iii) Use (ii) to show that q D 1 q; and solve for q.
(iv) Show that q=2 D cos 72ı .
i i
“book2” — 2013/5/24 — 8:18 — page 115 — #135
i i
36°
q
1 1 1
q
1–q
72° 72°
q q
3.49 * Find explicit formulas (i.e., without trigonometry) for the vertices of the unit
(i) pentagon. (ii) decagon. (iii) 20-gon.
3.51 Take It Further. We saw, on page 111, that .p/ D p 1, where p is prime
and is the Euler- function. Note that if n D p1 p2 ,
then .p1 1/.p2 1/ D
(i) Suppose n is the product of two primes, n D p1 p2 . Show that
n pn n n
p C p p :
1 2 1 2
.n/ D .p1 1/.p2 1/:
e e
(ii) Suppose n is the product of two primes powers, n D p11 p22 . Show that
n n n 1 1
.n/ D n C Dn 1 1 :
p1 p2 p1 p2 p1 p2
e e
(iii) Generalize to show that, if n D p11 p22 : : : pnen , then
n
Y
1
.n/ D n 1 :
pk
kD1
3.52 Prove or disprove and salvage if possible. If a and b are positive integers,
.ab/ D .a/.b/:
3.53 Find explicit formulas (i.e., without trigonometry) for the vertices of the unit n-
gon if
(i) n D 3 (ii) n D 4
(iii) n D 6 (iv) n D 8
(v) n D 12 (vi) n D 16
i i
“book2” — 2013/5/24 — 8:18 — page 116 — #136
i i
3.54 For all integers n between 3 and 9, find all the primitive nth roots of unity.
3.55 Find a primitive 12th root of unity . Is unique?
3.56 Suppose D cos. 2 2
7 / C i sin. 7 /.
(i) Plot the roots of x 7 1 in the complex plane.
(ii) Show that ˛ D C 6 , ˇ D 2 C 5 , and
D 3 C 4 are real numbers.
(iii) Find a cubic equation satisfied by 2 cos. 2
7 / by finding the values of ˛CˇC
,
˛ˇ C ˛
C ˇ
, and ˛ˇ
.
Pn 1 k
3.57 If n D cos. 2 2
n / C i sin. n /, evaluate kD0 n :
n Number of Factors of x n 1
1
2
3
4
5
6
7
8
9
10
11
12
Norms
We begin by introducing a function C ! R, called the norm, that is closely
related to absolute value. It will be an important tool for our applications; it will
also be very useful in Chapter 8 when we do some algebraic number theory.
N.z/ D zz D a2 C b 2 :
i i
“book2” — 2013/5/24 — 8:18 — page 117 — #137
i i
Example 3.36. Let’s revisit Example 3.5, the “bad example,” in which the
cubic formula gives the roots of x 3 7x C 6 D .x 1/.x 2/.x C 3/ in
unrecognizable form.
Imagine again that you have just left the contest in Piazza San Marco, think-
ing about how g C h could possibly equal 1, where
p p
g3 D 3 C i 10 9
3 and h3 D 3 i 10 9
3:
Had you known about conjugates, you’d have seen that g3 D h3 . It would have
been natural to guess that the cube roots g and h are also complex conjugates
(you’d have guessed right: see Exercise 3.64 on page 127); thus, g D a C i b
and h D a i b. Now if g C h D 1, as your opponent loudly proclaimed, then
.a C i b/ C .a i b/ D 2a D 1; that is,
1 1
gD 2 C ib and hD 2 i b:
You really want to find g and h now—what is b? Using the norm function, you
see that
p 2
N.g/3 D N.g3 / D . 3/2 C 109 3 D 343 27
:
(the other cube roots are complex; they are 73 ! and 37 ! 2 , where ! is a primitive
cube root of unity). But if g D 21 C i b, then N.g/ D 14 C b 2 . Hence, 14 C b 2 D
7 5
3 , and b D ˙ 2 3 . Thus,
p
1 5 1 5
gD 2
Ci p and hD 2
i p :
2 3 2 3
Bingo! For these “values” of g and h, we have g C h D 1. You were right! To find the other two roots,
Elated, you run back to the square to show off g and h, but everyone has gone see Exercise 3.65 on
home. N page 127.
i i
“book2” — 2013/5/24 — 8:18 — page 118 — #138
i i
(i) A triangle has vertices . 18; 49/; .15; 7/, and .30; 15/: How long are
its sides?
(ii) In Figure 3.11, the side lengths of 4QS U are as marked. How big is
†Q?
5 7
Q S
8
(iii) An open box is formed by cutting out squares from a 7 15 rectangle and
folding up the sides (see Figure 3.12). What size cut-out x maximizes the
volume of the box?
15
(iv) Find the zeros, extrema, and inflection points of the function
(v) Find the area of the triangle with sides of lengths 13, 14, and 15.
i i
“book2” — 2013/5/29 — 12:13 — page 119 — #139
i i
.a C bi /.a bi / D c 2 ;
or even
N.a C bi / D c 2:
So, we’re looking for complex numbers z D a C bi whose norms are perfect
squares of integers. The Pythagorean equation now looks like
N.z/ D c 2 :
For example, N.3 C 4i / D 52, N.5 C 12i / D 132, and N.8 C 15i / D 172.
This idea doesn’t work for every complex number. What’s needed are com-
plex numbers whose real and imaginary parts are integers (and, besides, whose
norms are perfect squares). We’d like the real and imaginary parts to be posi-
tive integers, but any integers will do, because changing the sign of the real or
imaginary part of a complex number doesn’t change its norm (why?).
Definition. The Gaussian integers is the set ZŒi of all complex numbers
whose real and imaginary parts are integers. In symbols,
ZŒi D fa C bi 2 C W a 2 Z and b 2 Zg :
Proposition 3.37. (i) The set ZŒi of Gaussian integers is closed under ad-
dition and multiplicationW If a C bi; c C d i 2 ZŒi , then
.a C bi / C .c C d i / D .a C c/ C .b C d /i 2 ZŒi
.a C bi /.c C d i / D .ac bd / C .ad C bc/i 2 ZŒi :
(ii) If z D a C bi , then
N.z/ D a2 C b 2 :
Proof. The formula for addition is clear; for multiplication, use the fact that
i 2 D 1. Of course, part (ii) is just the definition of the norm.
We’ll investigate the Gaussian integers in more detail in Chapter 8.
Let’s return to the norm equation N.z/ D c 2 arising from Pythagorean
triples, but with z a Gaussian integer. Our question is now “Which Gaussian
integers have perfect squares as norms?” The answer comes from Proposi-
tion 3.35(iii): if z and w are complex numbers, then N.zw/ D N.z/ N.w/: In
particular (letting z D w),
N z 2 D N.z/2 :
i i
“book2” — 2013/5/24 — 8:18 — page 120 — #140
i i
perfect squares if a > 0, b > 0, and a ¤ b. Now the right-hand side is the
square of an integer, namely, N.z/2 , which produces a Pythagorean triple. For
example, if z D 3 C 2i , then N.z/ D 13 and z 2 D 5 C 12i , and we get the
Pythagorean triple .5; 12; 13/, for
52 C 122 D N .3 C 2i /2 D N.3 C 2i /2 D 132 :
We now have a quick way to generate Pythagorean triples (by hand or with
a computer; one of our colleagues uses this method to amaze friends at parties).
Pick a Gaussian integer r C si (with r > 0, s > 0, and r ¤ s), and square it.
The r; s entry in the following table is Œ.r C i s/2 ; N.r C i s/. For exam-
ple, the top entry in the first column, arising from r D 2 and s D 1, is
Œ.2 C i /2 ; N.2 C i / D Œ3 C 4i; 5; the corresponding Pythagorean triple is
.3; 4; 5/.
Eisenstein Integers.
Let’s now look at the meta-problem of creating triangles with integer side-
lengths and a 60ı angle.
Let †C D 60ı in Figure 3.13, so that cos.†C / D 21 . By the Law of
Cosines,
c 2 D a2 C b 2 2ab cos †C
2 2
Da Cb ab:
2 2
What’s important here is that the right-hand side
pof the equation, a ab C b ,
1
is the norm of a C b!, where ! D 2 . 1 C i 3/ is a primitive cube root of
unity (Exercise 3.72 on page 128). This leads to the following definition.
B
c
a
A
C b
i i
“book2” — 2013/5/24 — 8:18 — page 121 — #141
i i
Proposition 3.38. (i) The set ZŒ! of Eisenstein integers is closed under ad-
dition and multiplication: If .a C b!/; .c C d!/ 2 ZŒ!, then
a2 ab C b 2 D c 2:
The same idea that produces Pythagorean triples from norms of squares of
Gaussian integers applies to produce Eisenstein triples from norms of squares
of Eisenstein integers. If z is an Eisenstein integer, then
N.z 2 / D N.z/2 :
The left-hand side of this equation, being the norm of an Eisenstein integer,
is of the form a2 ab C b 2 . And the right-hand side is the square of the
integer N.z/. Hence a2 ab C b 2 is a perfect square, and we have produced
an Eisenstein triple.
z 2 D 9 C 12! C 4! 2
D 9 C 12! C 4. 1 !/
D 5 C 8!:
Hence, 52 5 8 C 82 D N z 2 D N.z/2 D 72 ; and .5; 8; 7/ is an Eisenstein
triple. N
i i
“book2” — 2013/5/24 — 8:18 — page 122 — #142
i i
5 7
Q S
8
is .2 C !/2 ; N.2 C !/ D .3 C 3!; 3/; the corresponding Eisenstein triple
gives .3; 3; 3/, which is an equilateral triangle. One of our friends calls this
table a “candy store of patterns.” Which entries give equilateral triangles?
x2 xy C y 2 D 1:
(See Figure 3.15.) As with the unit circle, the graph contains . 1; 0/, and we
See Exercise 3.66 on can use the chord method idea of Diophantus.
page 128.
Proposition 3.40. Let ` be a line through . 1; 0/ which intersects the ellipse
with equation x 2 xy C y 2 D 1 in a point P . If ` has rational slope, then P
has rational coordinates, P D .a=c; b=c/, and
a2 ab C b 2 D c 2 :
i i
“book2” — 2013/5/29 — 16:15 — page 123 — #143
i i
y
1.2
0.8
0.4
x
–1.6 –1.2 –0.8 –0.4 0.4 0.8 1.2 1.6
–0.4
–0.8
–1.2
Proof. The proof is almost identical to the proof of Proposition 1.2. We leave
it to you to fill in the details.
For example, if ` has slope 14 and equation y D 14 .x C 1/, then ` intersects
15 7
the ellipse in 13 ; 13 , and .15; 7; 13/ is an Eisenstein triple. So, the triangle
whose side lengths are 15, 7, and 13 has a 60ı angle. Which angle is it?
Nice Boxes
Our next application is to a “box problem.” In an a b rectangle, cut out little
squares at the corners, and then fold up the sides to form an open-top box (see
Figure 3.16). What size cut-out maximizes the volume of the box? For most
rectangles, the best cut-out has irrational side length. The meta-problem:
How can we find a and b to make the optimal cut-out a rational number?
x
b
As we tell our students, let the size of the cut-out be x. Then the volume of
the box is a function of x:
V .x/ D .a 2x/.b 2x/x D 4x 3 2.a C b/x 2 C abx;
and its derivative is
V 0 .x/ D 12x 2 4.a C b/x C ab:
We want V 0 .x/ to have rational zeros, and so its discriminant
16.a C b/2 48ab
i i
“book2” — 2013/5/29 — 16:18 — page 124 — #144
i i
.a C b/2 3ab D a2 ab C b 2
should be a perfect square. This will be so if a and b are the legs of an Eisen-
stein triple .a; b; c/.
For example, from the Eisenstein triple .7; 15; 13/, we get a 7 15 rectangle
that can be used to create a box whose maximum volume occurs at a rational-
length cut-out. The volume of the resulting box is
So, V 0 .x/ D 12x 2 88x C 105. The roots of V 0 .x/ are 23 and 35 6
. Both are
3
rational, but only 2 fits the context and maximizes V . (Why doesn’t 35
6 fit the
context? What significance does it have? Also, see Exercise 3.69 on page 128.)
How do you find cubic polynomials f .x/ with integer coefficients and
rational roots, whose extrema and inflection points have rational coordi-
nates?
Using the notation of Theorem 3.3, we can first assume that the cubic f is
reduced; that is, it has form
f .x/ D x 3 C qx C r:
˛ 3 C 3p 2˛ D ˇ 3 3p 2ˇ
or
ˇ 3 C ˛ 3 D 3p 2.˛ C ˇ/:
˛2 ˛ˇ C ˇ 2 D 3p 2:
i i
“book2” — 2013/5/24 — 8:18 — page 125 — #145
i i
Creating examples like this is not hard by hand, but a computer algebra
system makes it automatic. The next table was generated by a CAS, and it
shows the results of our algorithm for small values of r and s.
s D1 s D2 sD3
r D2 54 27x C x3 128 48x C x3
r D3 286 147x C x3 286 147x C x 3 1458 243x C x 3
r D4 506 507x C x3 3456 432x C x3 506 507x C x 3
r D5 7722 1323x C x 3 10582 1083x C x 3 10582 1083x C x 3
r D6 35282 2883x C x3 18304 2352x C x 3 39366 2187x C x 3
i i
“book2” — 2013/5/24 — 8:18 — page 126 — #146
i i
Clearly, solutions are invariant under translation by a lattice point; that is,
if A, B, and C form a lattice point solution and U is any lattice point, then
A U , B U , and C U form another solution: since d.A U; B U / D
d.A; B/ (where d.P; Q/ is the distance between points P and Q), we have
d.A; B/ D jA Bj. Hence, we can assume that one of the points, say C , is at
the origin.
Now view the plane as the complex plane, so that lattice points are Gaussian
integers. Thus, we want Gaussian integers z and w such that jzj, jwj, and
jz wj are integers. But if z D a C bi , then
p p
jzj D a2 C b 2 D N.z/:
Hence, to make the length an integer, make the norm a perfect square and, to
make the norm a perfect square, make the Gaussian integer a perfect square in
ZŒi . That is, we want Gaussian integers z and w so that z, w, and z w are
perfect squares in ZŒi . Hence, we choose z and w so that
˛2 ˇ2 D
2
or
˛2 D ˇ2 C
2:
which holds in any commutative ring, holds, in particular, in ZŒi . So, the trick
is to pick Gaussian integers x and y, set
i i
“book2” — 2013/5/24 — 8:18 — page 127 — #147
i i
Now put
Hence, .0; 0/, . 192; 256/, and . 60; 32/ are vertices of an integer-sided tri-
angle. Moreover, adding a lattice point to each vertex produces another such
triangle with no vertex at the origin. Once again, a CAS can be used to generate
many more. N
Exercises
3.60 For each integer n between 3 and 9, find a polynomial of smallest degree with
integer coefficients whose roots are the primitive nth roots of unity.
3.61 * Let a and b be real numbers, and let z be a complex number.
(i) Show that a C bz D a C b z:
(ii) Show that N.a C bz/ D a2 C 2<.z/ab C b 2N.z/.
3.62 * If z and w are complex numbers, show that N.z/ < N.w/ if and only if
jzj < jwj.
3.63 Let be an isosceles triangle with side lengths 13, 13, and 10.
(i) Show that the altitude to the base has length 12, and that it divides into two
5, 12, 13 triangles.
(ii) Show that the altitude to one of the sides of length 13 divides into two right
triangles whose side lengths are rational.
(iii) Each of the side lengths can thus be scaled to get a Pythagorean triple. Show
that one triple is similar to .5; 12; 13/ and that other comes from .5 C 12i /2.
(iv) Generalize this result to any isosceles triangle formed by two copies of a
Pythagorean triple, joined along a leg.
i i
“book2” — 2013/5/24 — 8:18 — page 128 — #148
i i
.2xy; x 2 y 2 ; x 2 C y 2 /:
Show how this can be obtained via the “norm from ZŒi ” method.
3.68 Obtain a formula for Eisenstein triples analgous to the one for Gaussian inte-
gers in Theorem 1.5 using norms from ZŒ! and rational points on the graph of
x 2 xy C y 2 D 1.
3.69 * Assume that the square of the Eisenstein integer r C s! is used to generate an
Eisenstein triple, and that the triple is used to create a “nice box,” as on page 123.
Express the volume of the box in terms of r and s.
Replacing x by x C 1 just 3.70 * Replace x by x C 1 in several of the cubics in the table on page 125 to produce
translates the graph by one nice cubics whose coefficient of x 2 is nonzero. Show that your cubics are indeed
unit. Which way? nice.
3.71 Describe where Gaussian integers are situated in the complex plane.
3.72 * Suppose that a and b are real numbers and
p
! D cos. 2 2
3 / C i sin. 3 / D
1
2. 1Ci 3/:
Show that
N.a C b!/ D a2 ab C b 2:
3.73 Describe where Eisenstein integers are situated in the complex plane.
3.74 Find an integer-sided triangle one of whose angles has cosine equal to 3=5.
Hint: Let D cos. 53 / C i sin. 35 / and consider norms from ZŒ. What conic
would help here?
3.75 A Heron triangle is a triangle with integer side lengths and integer area. In Ex-
ercise 1.26 on page 14, you found a Heron triangle by joining two Pythagorean
triangles together along a common leg. Show that the following method also pro-
duces Heron triangles.
Pick a rational point .cos ; sin / on the unit circle, where 0 < < , and let
˛ D cos C i sin . Then pick any number z of the form r C s˛, where r and
s are rational numbers and r > s > 0.
(i) What is the norm of r C s˛?
(ii) Show that
˛2 C 2˛ cos C 1 D 0:
(iii) Show that if z 2 D a C b˛, then the triangle with side lengths a and b and
included angle will have a rational number, say c, as its third side length
and a rational number as an area. (This triangle can be then scaled to produce
a Heron triangle.) Use this method to generate a few Heron triangles.
3.76 Show that a triangle with lattice point vertices and integer side-lengths is a Heron
triangle.
i i
“book2” — 2013/5/24 — 8:18 — page 129 — #149
i i
3.77 Take It Further. Here’s a typical current problem, taken from B. Kerins, Gauss,
Pythagoras, and Heron, Mathematics Teacher, 2003, 350-357:
A boat is making a round trip, 135 miles in each direction. Without a cur-
rent, the boat’s speed would be 32 miles per hour. However, there is a con-
stant current that increases the boat’s speed in one direction and decreases
it in the other. If the round trip takes exactly 9 hours, what is the speed of
the current?
(i) Solve the problem.
(ii) Solve the corresponding meta-problem: find a method for generating current
problems that come out nice.
i i
“book2” — 2013/5/24 — 8:18 — page 130 — #150
i i
i i
“book2” — 2013/5/24 — 8:18 — page 131 — #151
i i
4 Modular Arithmetic
x n C y n D .x C y/.x C y/ .x C n 1
y/;
where D e 2 i=n is an nth root of unity. It turns out that the most fruitful
way to understand this factorization is within the system ZŒ of cyclotomic
integers, the collection of all polynomials in with coefficients in Z (a com- We’ll discuss cyclotomic
mon generalization of Gaussian integers ZŒi and Eisenstein integers ZŒ!). integers in Chapter 8.
Numbers in these systems can be added and multiplied, and they satisfy all but
one of the nine fundamental properties that ordinary numbers do (reciprocals
of cyclotomic integers need not be such); we will call such systems commuta-
tive rings. But for some roots of unity , the commutative ring ZŒ does not
enjoy the unique factorization property that Z, ZŒi , and ZŒ! have, and this
caused early “proofs” of Fermat’s Last Theorem to be false. Dealing with the
lack of unique factorization was one important problem that led naturally to
the modern way of studying algebra.
In Section 4.1, we shall see that the distinction between even and odd can
be generalized, using congruences: studying remainders in the Division Algo-
rithm. It turns out, as we’ll see in Section 4.3, that, for any fixed positive integer
m, the set of its remainders, 0; 1; :::; m 1, can be viewed as a commutative
ring, as can cyclotomic integers, and they behave in many, but not all, ways It turns out that many of
as do ordinary integers. Finally, in Section 4.5, we’ll apply these results to an the “number systems”
analysis of decimal expansions of rational numbers. studied in high school are
commutative rings.
4.1 Congruence
It is often useful to know the parity of an integer n; that is, whether n is even
or odd (why else would these words be in the language?). But n being even or
odd is equivalent to whether its remainder after dividing by 2 is 0 or 1. Modular
arithmetic, introduced by Euler around 1750, studies the generalization of par-
ity arising from considering remainders after dividing by any positive integer.
At a low level, it will help us answer questions of the following sort:
London time is 6 hours ahead of Chicago time; if it is now 9:00 AM in
Chicago, what time is it in London?
If April 12 falls on a Thursday this year, on what day of the week is May 26?
131
i i
“book2” — 2013/5/24 — 8:18 — page 132 — #152
i i
We claim that integers a and b have the same parity if and only if a
b mod 2. Assume that a and b have the same parity. If both are even, then
a D 2a0 and b D 2b 0. Hence, a b D 2.a0 b 0 /, 2 j .a b/, and a
b mod 2. Similarly, if both are odd, then a D 2a0 C 1 and b D 2b 0 C 1. Hence,
a b D .2a0 C 1/ .2b 0 C 1/ D 2.a0 b 0 /, 2 j .a b/, and a b mod 2
in this case as well. Conversely, suppose that a b mod 2. If a and b have
different parity, then one is even, the other is odd, and so their difference is
odd. Hence, 2 − .a b/, and a 6 b mod 2. Having proved the contrapositive,
we may now assert that a and b have the same parity.
If today is Tuesday, what day of the week is 90 days from now? Since 2C90 D
92 1 mod 7, the answer is Monday.
Let’s now answer the question: if April 12 falls on Thursday this year, on
what day of the week is May 26? There are 18 days to April 30, so there are
18 C 26 D 44 days until May 26 (for April has only 30 days). Now Thursday
corresponds to 4, so that May 26 corresponds to 4 C 44 D 48 6 mod 7;
therefore, May 26 falls on Saturday. N
There are at least two ways to state the solutions of Exercises 3.2 and 3.3
on page 89. We expected you to say then that i n D i m if and only if n and m
leave the same remainder when divided by 4 and, if ! is a primitive cube root
of unity, that ! n D ! m if and only if n and m leave the same remainder when
i i
“book2” — 2013/5/24 — 8:18 — page 133 — #153
i i
Notice that Proposition 4.2 generalizes the fact that integers a and b have
the same parity if and only if a b mod 2.
We are now going to see that congruence modulo m behaves very much
like ordinary equality; more precisely, it is an equivalence relation (see Ap-
pendix A.2): it is reflexive, symmetric, and transitive.
How to Think About It. Congruence mod 1 makes sense, but it is not very
interesting, for a b mod 1 if and only if 1 j .a b/. But this latter condition
is always true, for 1 is a divisor of every integer. Thus, every two integers are
congruent mod 1. Similarly, congruence mod 0 makes sense, but it, too, is not
very interesting, for 0 j c if and only if c D 0. Thus, a b mod 0 if and only If 0 j c, then there is some
if 0 j .a b/; that is, a b mod 0 if and only if a D b, and so congruence k with c D 0 k D 0; that
is, c D 0.
mod 0 is just ordinary equality. You should not be surprised that we usually
assume that m 2.
Corollary 4.4. If m 2, then every integer a is congruent mod m to exactly See Exercise 4.5 on
one integer on the list page 140 for a generaliza-
tion.
0; 1; : : : ; m 1:
i i
“book2” — 2013/5/24 — 8:18 — page 134 — #154
i i
a C b a0 C b 0 mod m:
ab a0 b 0 mod m:
ab a0 b 0 D ab ab 0 C ab 0 a0 b 0 D a.b b 0 / C .a a0 /b 0 :
How to Think About It. The key idea in calculating with congruences
mod m is that every number can be replaced by its remainder after dividing
by m, for this is precisely what Proposition 4.5 permits; it allows you to “re-
duce as you go” in calculations, as the next example shows.
Example 4.6. The last (units) digit of a positive integer is the remainder when
it is divided by 10. What is the last digit of
We could do this by brute force: cube 10324, multiply 2348 and 5267, add, and
look at the last digit. But, as one of our friends says, why should the calculator
have all the fun? You can do this more cleverly using congruence.
i i
“book2” — 2013/5/24 — 8:18 — page 135 — #155
i i
Now 43 D 64 4 mod 10, so that 103243 4 mod 10, and the last digit
of 103243 is 4.
To multiply 2348 and 5267, note that 2348 8 mod 10 and 5267 More simply, think of
7 mod 10. Hence, multiplying 2348 and 5267
by hand. What’s the last
2348 5267 8 7 D 56 6 mod 10: digit? This is what most
middle school students
Thus, would do. We just want
to illustrate the general
103243 C 2348 5267 4 C 6 D 10 mod 10; principle here.
r 0 1 2 3 4 5 6 7
2
r 0 1 4 9 16 25 36 49
2
r mod 8 0 1 4 1 0 1 4 1
Figure 4.1. Squares mod 8.
i i
“book2” — 2013/5/24 — 8:18 — page 136 — #156
i i
See Corollary 4.67 for Theorem 4.9 (Fermat). Let p be a prime and a 2 Z.
another proof.
(i) ap a mod p.
n
(ii) ap a mod p for all n 1.
(iii) If p − a, then ap 1 1 mod p.
.a C 1/p ap C 1 a C 1 mod p:
i i
“book2” — 2013/5/24 — 8:18 — page 137 — #157
i i
Later in this chapter, we will use the next corollary to construct codes that
are extremely difficult for spies to decode.
for ap 1
1 mod p, by Theorem 4.9(iii)
We can now explain a well-known divisibility test.
a D dk 10k C C d1 10 C d0 ;
How to Think About It. The proof of Proposition 4.11 shows more than its
statement claims: the sum of the (decimal) digits of any positive integer a is
congruent to a mod 3, whether or not a is divisible by 3. For example,
172 1 C 7 C 2 mod 3I
that is, both 172 and 10 (the sum of its digits) are 1 mod 3.
(It is now clear why this procedure is called casting out 9s.) In light of a
†.a/ mod 9, we have †.a/ r .a/ mod 9, so that r .a/, which seems to de-
pend on a choice of operations (i) and (ii), depends only on a, for the variation
of Proposition 4.11 for 9 says that †.a/ is the remainder after dividing a by 9.
i i
“book2” — 2013/5/24 — 8:18 — page 138 — #158
i i
.7 C 3/3 D 8;
5 103 C 7 102 C 5 10 C 4:
Example 4.12. Let’s write 12345 in “base 7.” Repeated use of the Division
Algorithm gives
12345 D 1763 7 C 4
1763 D 251 7 C 6
251 D 35 7 C 6
35 D 5 7 C 0
5 D 0 7 C 5:
07C5D 5
5 7 C 0 D 35
.0 7 C 5/ 7 C 0 D 35
35 7 C 6 D 251
..0 7 C 5/ 7 C 0/ 7 C 6 D 251
251 7 C 6 D 1763
...0 7 C 5/ 7 C 0/ 7 C 6/ 7 C 6 D 1763
1763 7 C 4 D 12345
....0 7 C 5/ 7 C 0/ 7 C 6/ 7 C 6/ 7 C 4 D 12345:
i i
“book2” — 2013/5/24 — 8:18 — page 139 — #159
i i
h D dk b k C dk 1b
k 1
C C d0 :
h D dk b k C C d1 b C d0 D e m b m C C e 1 b C e 0 ;
dk b k 1
C C d1 D e m b m 1
C C e1 and d0 D e 0 :
Example 4.14. Let’s calculate the 13-adic digits of 441. The only complica-
tion here is that we need 13 digits d (for 0 d < 13), and so we augment 0
through 9 with three new symbols
So, 441 D 2 132 C 7 13 C 12, and the 13-adic expansion for 441 is
27w:
i i
“book2” — 2013/5/24 — 8:18 — page 140 — #160
i i
k k 1
D ndk p ndk 1 p nd1 p nd0
k d
k 1 dk 1 d
D np k np np 1 nd0
ndk ndk 1 nd1 nd0 mod p
ndk CCd1 Cd0 mod p
n†.h/ mod p:
Exercises
4.1 Show that if integers a and b are congruent mod m to the same thing, say r, then
they are congruent to each other.
4.2 We saw in Exercise 1.41 on page 29 that an integer b and its negative b can have
different remainders, say r and s, after dividing by some nonzero a. Prove that
s r mod a.
4.3 Show that if a b mod n and m j n, then a b mod m.
4.4 A googol is 10100; that is, 1 followed by 100 zeros. Compute the remainder mod 7
of a googol.
4.5 *
(i) If m 2, show that every integer a is
(ii) congruent mod m to exactly one integer on the list
1; 2; : : : ; m:
(iii) Generalize Corollary 4.4 by showing that if m 2, every integer a is congru-
ent mod m to exactly one integer on any list of m consecutive integers.
i i
“book2” — 2013/5/24 — 8:18 — page 141 — #161
i i
4.6 (i) Show that every nonnegative integer is congruent mod 6 to the sum of its
7-adic digits.
(ii) Show that every nonnegative integer is congruent mod 3 to the sum of its
7-adic digits.
(iii) Suppose b and n are nonnegative integers. If n j .b 1/, show that every
integer is congruent mod n to the sum of its b-adic digits.
4.7 (i) Show that every nonnegative integer is congruent mod 11 to the alternating
sum of its decimal digits.
(ii) Show that every nonnegative integer is congruent mod b C 1 to the alternating
sum of its b-adic digits.
P
4.8 Let a nonnegative integer n have decimal expansion n D kiD0 di 10i . Define
t .n/ D n 10d0 4d0 .
(i) Show that n is divisible by 41 if and only if t .n/ is.
(ii) Is n t .n/ mod 41 for all nonnegative n?
You will have to invent
4.9 Find the b-adic digits of 1000 for b D 2; 3; 4; 5; and 20. symbols for some 20-adic
digits.
4.10 (i) Find the 11-adic digits of 115 .
(ii) What is the b-adic expansion for b k (k a nonnegative integer)?
4.11 Let a be a positive integer, and let a0 be obtained from a by rearranging its (dec-
imal) digits (e.g., a D 12345 and a0 D 52314). Prove that a a0 is a multiple
of 9.
4.12 Prove that there are no positive integers a; b; c with
a2 C b 2 C c 2 D 999:
4.13 Prove that there is no perfect square whose last two decimal digits are 35.
4.14 Using Fermat’s Theorem 4.9, prove that if ap Cb p D c p , then a Cb c mod p.
Linear congruences
We are now going to solve linear congruences; that is, we’ll find all the inte-
gers x, if any, satisfying
ax b mod m:
Later, we will consider several linear congruences in one unknown with dis-
tinct moduli (see Theorems 4.21, 4.25, and 4.27). And we’ll even consider two
linear congruences in more than one unknown (see Theorem 4.44).
ax b mod m
i i
“book2” — 2013/5/24 — 8:18 — page 142 — #162
i i
Corollary 4.18. If p is prime and p − a .i.e., p does not divide a/, then the
congruence ax b mod p is always solvable.
Example 4.19. When gcd.a; m/ D 1, Theorem 4.17 says that the set of solu-
tions of ax b mod m is
2x 9 mod 13:
x 11 mod 13;
x b mod m
x b 0 mod m0
Theorem 4.27 will gener-
alize Theorem 4.21 to any have a common solution. Moreover, any two solutions are congruent mod
number of moduli. mm0 .
i i
“book2” — 2013/5/24 — 8:18 — page 143 — #163
i i
Proof. Every solution of the first congruence has the form x D b C km for
some integer k; hence, we must find k such that b C km b 0 mod m0 ; that is,
km b 0 b mod m0 . Since gcd.m; m0 / D 1, however, Theorem 4.17 applies
at once to show that such an integer k does exist.
If y is another common solution, then both m and m0 divide x y; by
Exercise 1.58 on page 35, mm0 j .x y/, and so x y mod mm0 .
Example 4.22. Let’s find all the solutions to the simultaneous congruences
x 5 mod 8
x 11 mod 15:
Every solution to the first congruence has the form
x D 5 C 8k;
for some integer k. Substituting, x D 5 C 8k 11 mod 15, so that
8k 6 mod 15:
But 2 8 D 16 1 mod 15, so that multiplying by 2 gives
16k k 12 mod 15:
We conclude that x D 5C812 D 101 is a solution, and the Chinese Remainder
Theorem (which applies because 8 and 15 are relatively prime) says that every
solution has the form 101 C 120n for n 2 Z (because 120 D 8 15). N
By the Chinese Remainder Theorem, all the simultaneous solutions x have the
form
x D 13k 6 .13 18/ 6 228 mod 260I
that is, the solutions are
: : : ; 32; 228; 488; : : : : N
i i
“book2” — 2013/5/24 — 8:18 — page 144 — #164
i i
Œm; d ;
We now ask how many days have elapsed between Oc 11 and Etznab 5.
More generally, let x be the number of days from tzolkin Œm; d to tzolkin
Œm0 ; d 0 . As we remarked at the beginning of this example, the cyclic behavior
of the days gives the congruence
x d0 d mod 13;
x m0 m mod 20:
To answer the original question, Oc 11 corresponds to the ordered pair Œ10; 11
and Etznab 5 corresponds to Œ18; 5. Since 5 11 D 6 and 18 10 D 8, the
simultaneous congruences are
x 6 mod 13
x 8 mod 20:
i i
“book2” — 2013/5/24 — 8:18 — page 145 — #165
i i
If we do not assume that the moduli m and m0 are relatively prime, then
there may be no solutions to a linear system. For example, if m D m0 > 1,
then uniqueness of the remainder in the Division Algorithm shows that there
is no solution to
x 0 mod m
x 1 mod m:
x b mod m
x b 0 mod m0
h D b 0 sc C btc 0
D .b C kd /sc C btc 0
D b.sc C tc 0 / C kdsc
D b C ksm
b mod m:
x 1 mod 6
x 4 mod 15:
h D 4 3 2 C 1 . 1/ 5 D 19:
We check that 19 1 mod 6 and 19 4 mod 15. Since lcm.6; 15/ D 30, the
solutions are : : : ; 41; 11; 19; 49; 79; : : :. N
We are now going to generalize the Chinese Remainder Theorem for any
number of linear congruences whose moduli are pairwise relatively prime. We
i i
“book2” — 2013/5/24 — 8:18 — page 146 — #166
i i
shall see in Chapter 6 that this new version, whose solutions are given more
explicitly, can be used to reveal a connection with Lagrange Interpolation, a
method for finding a polynomial that agrees with a finite set of data.
Consider the following problem, adapted from Qin Jiushao, Nine Chapters
on the Mathematical Art, 1247 CE.
Three farmers equally divide the rice that they have grown. One goes to
a market where an 83-pound weight is used, another to a market that
uses a 112-pound weight, and the third to a market using a 135-pound
weight. Each farmer sells as many full measures as possible, and when
the three return home, the first has 32 pounds of rice left, the second 70
pounds, and the third 30 pounds. Find the total amount of rice they took
to market.
i i
“book2” — 2013/5/24 — 8:18 — page 147 — #167
i i
Now comes the important step: since 112 and 135 are relatively prime to 83,
so is their product (Exercise 1.56 on page 35). Hence, 14 (which is the same as
112135 modulo 83) is also relatively prime to 83, and so Theorem 4.17 implies
that we can solve Eq. (4.2) for k. There is an integer s with 14s 1 mod 83,
and multiplying both sides of 14k 32 mod 83 by s gives
There are several methods for finding s (since 83 is not so small, the Euclidean
Algorithm is probably the most efficient); in fact, s D 6, and so k satisfies
Hence,
To get a feel for this method, it’s a good idea to go through it twice more,
finding v and w. In fact, that’s Exercise 4.22 on page 149.
The method just developed generalizes to a proof of the extended Chinese
Remainder Theorem. Let’s first introduce some notation.
bi mr D m1 mi
Mi D m1 m2 m 1 mi C1 mr I
x b1 mod m1
x b2 mod m2
:: ::
: :
x br mod mr
bi mr
Mi D m1 m2 m and si Mi 1 mod mi for 1 i r:
Proof. Use our discussion on the previous page as a model for the proof.
That the specified x works is a consequence of Proposition 4.5. That all solu-
tions are congruent modm1 m2 : : : mr is a consequence of Exercise 1.58 on
page 35.
i i
“book2” — 2013/5/24 — 8:18 — page 148 — #168
i i
Exercises
4.15 * Complete the proof of Theorem 4.27.
4.16 (i) Solve
x 5 mod 7
x 2 mod 11:
(ii) In the year 2000, the remainder after dividing my age by 3 was 2, and the
remainder after dividing by 8 was 3. If I was a child when people first walked
on the Moon, how old was I in 2000?
(iii) Solve
x 7 5 mod 7
x 11 2 mod 11:
4.17 (i) Find a solution v to
v 3 mod 17
v 0 mod 11:
i i
“book2” — 2013/5/24 — 8:18 — page 149 — #169
i i
4.22 * Finish the calculations solving Qin Jiushao’s problem on page 146 by first find-
ing s and t , and then finding the smallest positive solution.
4.23 A band of 17 pirates stole a sack of gold coins. When the coins were divided
equally, there were three left over. So, one pirate was made to walk the plank.
Again the sack was divided equally; this time there were 10 gold coins left over.
So, another unlucky member of the crew took a walk. Now, the gold coins could
be distributed evenly with none left over. How many gold coins were in the sack?
4.24 .Bhaskara I; ca: 650 C . E ./: If eggs in a basket are taken out 2, 3, 4, 5, and 6 at a
time, there are 1, 2, 3, 4, and 5 eggs left over, respectively. If they are taken out 7
at a time, there are no eggs left over. What is the least number of eggs that can be
in the basket?
. , ; : ! ? - ’ " ( )
In all, there are 64 symbols. Assign a two-digit number to each symbol. For
example,
Notice that any message coded in this cipher has an even number of digits, and
so decoding, converting the number into English, is a simple matter. Thus,
i i
“book2” — 2013/5/24 — 8:18 — page 150 — #170
i i
should not be able to decode it. An ingenious way to find a code with these
properties, now called an RSA code, was found in 1978 by Rivest, Shamir,
and Adleman; they received the 2002 Turing Award for their discovery.
The following terms describe two basic ingredients of RSA codes.
The numbers N and e are public—they are published on the web—but the
primes p and q are kept secret. In practice, the primes p and q are very large.
If x is a message, encoded by assigning natural numbers to its letters as
discussed above, then the encoded message sent is
x e mod N:
Definition. Given a public key .N; e/, a private key is a number d such that
x ed x mod N for all x 2 Z:
i i
“book2” — 2013/5/24 — 8:18 — page 151 — #171
i i
p j .x m x/ and q j .x m x/:
As p and q are distinct primes, they are relatively prime, and so pq j .x m x/,
by Exercise 2.20 on page 33. Hence, x m x mod pq for all x; that is,
x m x mod N for all x 2 Z.
Return now to the special case m D ed ; can we find a private key d so that
ed 1 mod .p 1/ and ed 1 mod .q 1/? By hypothesis, gcd.e; p 1/ D
1 D gcd.e; q 1/; by Exercise 1.56 on page 35, gcd e; .p 1/.q 1/ D 1.
We can now find d with Proposition 4.17, which shows how to construct an
integer d such that
ed 1 mod .p 1/.q 1/:
Example 4.28. Let’s create a public key and a private key using p D 11 and
q D 13. (This is just for the sake of illustration; in practice, both p and q need
to be extremely large primes.)
The modulus is N D pq D 11 13 D 143, and so p 1 D 10 and
q 1 D 12. Let’s choose e D 7 (note that gcd.7; 10 12/ D 1). Hence the
public key is
.N; e/ D .143; 7/:
If x is a message in cipher (i.e., a natural number), then the encoded message is
the congruence class x 7 mod 143. To find the private key, we need a number d
so that 7d 1 mod 120. Using Euclidean Algorithm II or a CAS, we find a
private key
d D 103;
for 7 103 D 721 D 6 120 C 1.
Let’s encode and decode the word “dog”: d D 4; o D 15; g D 7. Thus,
the cipher for dog is 041507. In the real world, the encoding is .41507/7, and
the message sent out is the congruence class .41507/7 mod 143. Decoding
involves computing .41507/721 mod 143. As we said earlier, decoding is not
finished by finding this congruence class; the numbers in this class are of the
form .41507/721 C 143k, and only one of these must be determined. As we
said above, the method used in actual RSA transmissions encodes blocks of
letters to get around this ambiguity. For this example, however, we’ll use a
simpler method—we’ll send each letter separately, so that “dog” is sent as as
three codes
i i
“book2” — 2013/5/24 — 8:18 — page 152 — #172
i i
d: 47 82 mod 143
o: 157 115 mod 143
g: 77 6 mod 143:
How to Think About It. A CAS can easily tell you that 82103 4 mod
143, but it’s interesting to see how the theorems developed in this chapter can
allow you to do the computation by hand. Start with the fact that the reduction
of 82103 mod 143 is equivalent to two calculations, since 143 is 11 13:
82103 mod 11
82103 mod 13:
and
10
4103 D 4128C7 D 412 47 47 mod 13 (Little Fermat)
D 43 4.43 / D 64 4.64/ . 1/ 4. 1/ mod 13 4 mod 13:
i i
“book2” — 2013/5/24 — 8:18 — page 153 — #173
i i
Exercises
4.25 For this exercise, use the primes p D 5 and q D 17 to create public and private
keys.
(i) What will be the modulus N for the public key?
(ii) The exponent e for the public key must have no common factors with p 1
and q 1. List the five smallest numbers relatively prime to .p 1/.q 1/.
(iii) There are many possibilities for e; for now, use e D 3. To encode letters (a The public key reveals
computer would do blocks of letters), use the rule x 7! x 3 mod 85. e D 3 and N D pq, but
p and q are not revealed.
(iv) Encode the phrase “cell phones” using this method.
(Why not?)
(v) The private key d satisfies
ed 1 mod .p 1/.q 1/:
Find d , decode your message using the private key, and verify that it is, in-
deed, what was sent.
4.26 The following message was encoded using the public key .85; 3/:
01 42 59 10 49 27 56I
decode this message. It answers the question, “What do you call a boomerang that
doesn’t come back when you throw it?”
4.27 Decode the following message encoded using the public key .91; 5/:
04 31 38 38 23 71 14 31:
4.28 Let m and r be nonnegative integers, and p be a prime. If m r mod .p 1/,
show that x m x r for all integers x.
4.29 Take It Further. (Electronic Signatures) Consider this scenario: Elvis receives
an email, encoded with his public key, from his abstract algebra instructor Mr. Jag-
ger, which says that algebra is a waste of time and Elvis should spend all his time
watching TV. Elvis suspects that the message didn’t really come from Mr. J., but
how can he be sure?
Suppose both Elvis and Mr. Jagger have private keys, and each knows the
other’s public keys. They can communicate in total privacy, with no one able to
read their messages. Here’s how: if Elvis wants to send a message to Mr. J., he
follows these steps:
Write the message to get x1 .
Encode the message with his private key to get x2 .
Encode x2 with Mr. J.’s public key to get x3 .
Send x3 .
i i
“book2” — 2013/5/24 — 8:18 — page 154 — #174
i i
When Mr. J. receives the message, he can follow a procedure to get the original
message back.
(i) What is the procedure?
(ii) Explain why no one besides Mr. Jagger could read the message from Elvis.
4.30 Take It Further. Elvis is home sick with the flu. He decides to send a message
These public keys are not to Mr. Jagger, using the method from Exercise 4.29. Suppose Elvis’s public key
realistic. In reality, public is .253; 7/ and Mr. J.’s public key is .203; 5/. Elvis sends the message
keys use much larger
FIDO ATE MY HOMEWORK.
primes.
What is the encoded message that Mr. J. receives? Show how Elvis encodes it and
how Mr. J. decodes it.
Œa D fx 2 X W x ag:
Corollary 4.4 says that the list Œ0; Œ1; : : : ; Œm 1 is complete; that is, there
are no other congruence classes mod m.
For example, Z2 , the integers mod 2, is the set fŒ0; Œ1g; we may think of
Œ0 as even (for Œ0 D fa 2 Z W a 0 mod 2g is the set of all even integers)
and Œ1 as odd (for Œ1 is the set of all odds).
Here is the “theological reason” for introducing congruence classes. We
could continue to deal with integers and congruence; this is, after all, what
Gauss did. We saw in Proposition 4.5 that + and are compatible with con-
gruence: if a b mod m and a0 b 0 mod m, then a C b a0 C b 0 mod m
and ab a0 b 0 mod m. But wouldn’t life be simpler if we could replace by
=; that is, if we could replace congruence by equality? We state the following
special case of Lemma A.16 in Appendix A.2 explicitly:
i i
“book2” — 2013/5/24 — 8:18 — page 155 — #175
i i
We often say “odd + odd = even,” which does replace by = at the cost of
replacing integers by their congruence classes. Thus, we should define addition
of these congruence classes so that Œ1 C Œ1 D Œ0.
Addition and multiplication of evens and odds leads to the following tables.
We saw above that Œ1 C Œ1 D Œ0 says that “odd C odd D even;” note that
Œ1 Œ1 D Œ1 says “odd odd D odd.” The table above on the left de- A binary operation on a set
fines addition ˛W Z2 Z2 ! Z2 ; the table on the right defines multiplication R is a function R R ! R
(in particular, R is closed
W Z2 Z2 ! Z2 . As usual, we view congruence as generalizing parity, and
under f : if a and b are in
we now extend the definitions to give addition and multiplication of congru- R, then f .a; b/ is in R).
ence classes mod m for all m 2. Can you prove associativity
of the binary operations ˛
Definition. If m 2, addition and multiplication Zm Zm ! Zm are defined and when R D Z2 ?
by
Œr C Œs D Œr C s and Œr Œs D Œr s:
The definitions are simple and natural.However, we are adding and multi-
plying congruence classes, not remainders. After all, remainders are integers
between 0 and m 1, but the sum and product of remainders can exceed m 1,
and hence are not remainders.
i i
“book2” — 2013/5/24 — 8:18 — page 156 — #176
i i
How to Think About It. There are more general (non-commutative) rings
in which (v), commutativity of multiplication, is not assumed, while (vi) is
modified to say that 1 a D a D a 1 and (viii) is modified to say a.b C c/ D
ab C ac and .b C c/a D ba C bc. A good example is the ring of all 2 2
matrices with entries in R, with identity element 10 01 , and binary operations
ordinary matrix addition and multiplication:
" # " # " #
a b a0 b 0 a C a0 b C b 0
C 0 D
c d c d0 c C c0 d C d 0
and
" #" # " #
a b a0 b0 aa0 C bc 0 ab 0 C bd 0
D :
c d c0 d 0
ca0 C dc 0 cb 0 C dd 0
Since all rings in this book are commutative, we will often abuse language and
abbreviate “commutative ring” to “ring.”
How to Think About It. The notion of commutative ring wasn’t conceived
in a vacuum. Mathematicians noticed that several useful systems shared the
basic algebraic properties listed in the definition. Definitions usually emerge
in this way, distilling common features of different interesting examples.
Precise definitions are valuable; we couldn’t prove anything without them.
For example, political discourse is often vapid because terms are not defined:
what is a liberal; what is a conservative? A mathematician who asserts that
there are infinitely many primes can be believed. But can you believe a politi-
cian who says his opponent is a fool because he’s a liberal (or she’s a conser-
vative)?
i i
“book2” — 2013/5/24 — 8:18 — page 157 — #177
i i
Example 4.30. (i) Z, Q, and R are commutative rings. The ninth funda-
mental property, reciprocals, does not hold in Z; for example, 2 1 D 21
does not lie in Z.
(ii) Propositions 3.8 and 3.9 show that C is a commutative ring, while Propo-
sition 3.11 shows that every nonzero complex number has an inverse.
(iii) The set of even integers does not form a commutative ring, for it has no
identity.
(iv) The Gaussian integers ZŒi form a commutative ring (see Exercise 4.64
on page 168).
(v) The Eisenstein integers ZŒ!, where ! is a primitive cube root of unity,
form a commutative ring (see Exercise 4.64).
(vi) More generally, the cyclotomic integers ZŒ, where is any primitive
root of unity, form a commutative ring (see Exercise 4.65 on page 168).
(vii) The next theorem shows that Zm is a commutative ring for every integer
m 2.
(viii) We’ll see, in the next chapter, that all polynomials whose coefficients lie
in a commutative ring (e.g., all polynomials with coefficients in Z) is
itself a commutative ring with the usual addition and multiplication. N
C.X/ D ff W X ! R W f is continuousg
is a commutative ring under pointwise operations. If both f; g 2 C.X/
are continuous, then it is shown in calculus that both f C g and fg are
also continuous. The constant function e with e.t/ D 1 for all t 2 X is
continuous; we let the reader prove that the other axioms in the definition
of commutative ring hold. N
Etymology. The word ring was probably coined by Hilbert in 1897 when
he wrote Zahlring. One of the meanings of the word ring, in German as in
English, is “collection,” as in the phrase “a ring of thieves.” It has also been
suggested that Hilbert used this term because, for a commutative ring such as
the Gaussian integers ZŒi , powers of some elements “cycle back” to being a
linear combination of smaller powers (for example, i; i 2; i 3 ; i 4 D 1, i 5 D i ).
i i
“book2” — 2013/5/24 — 8:18 — page 158 — #178
i i
Proof. The proof of each of the eight statements is routine; in essence, they
are inherited from the analogous statement in Z (the inheritance is made pos-
sible by Proposition 4.5). We prove only statements (i), (vii), and (viii) in the
definition of commutative ring; the other proofs are left to Exercise 4.31 below.
Exercises
4.31 * Prove the remaining parts of Theorem 4.32
4.32 Prove that every commutative ring R has a unique identity 1.
4.33 (i) Prove that subtraction in Z is not an associative operation.
(ii) Give an example of a commutative ring in which subtraction is associative.
4.34 * If R is a commutative ring and S is a set, verify that RS is a commutative ring
under pointwise operations. (See Example 4.31.)
4.35 * Define the weird integers W as the integers with the usual addition, but with
multiplication defined by
(
ab if a or b is odd
ab D
ab if both a and b are even:
Prove that W is a commutative ring.
Hint: It is clear that 1 is the identity and that * is commutative; only associativity
of * and distributivity must be checked.
4.36 For each integer a between 1 and 11, find all solutions to Œax D Œ9 in Z12 .
(There may be no solutions for some a.)
4.37 In Z8 , find all values of x so that .x 1/.x C 1/ D 0.
4.38 Solve the equation x 2 C 3x 3 D 0 in Z5 .
4.39 How many roots does the polynomial x 2 C 1 D 0 have in each of the following
commutative rings?
(i) Z5 (ii) Z7 (iii) Z11
(iv) Z101 (v) Z13
i i
“book2” — 2013/5/24 — 8:18 — page 159 — #179
i i
i i
“book2” — 2013/5/24 — 8:18 — page 160 — #180
i i
The hybrid ka can be viewed as the product of two elements in the com-
mutative ring R. If e D 1 (the identity element in R), then ke 2 R and
ka D .ke/a. For example, if k > 0, then
ka D a C a C C a D .e C e C C e/a D .ke/a:
Œ2Œ0 D Œ0; Œ2Œ1 D Œ2; Œ2Œ2 D Œ4 D Œ0; Œ2Œ3 D Œ6 D Œ2I
i i
“book2” — 2013/5/24 — 8:18 — page 161 — #181
i i
How to Think About It. There is a strong analogy between the method for
solving linear equations in elementary algebra and the proof of Theorem 4.17.
When solving an equation like 3x D 4 in first-year algebra, you multiply both
sides by the number u with u3 D 1, namely, u D 13 :
3x D 4
1 1
3
.3x/ D 3
4
1
4
3
3 x D 3
4
xD 3
:
Œ3x D Œ4;
and go through the same steps as above, using the fact that Œ5 Œ3 D Œ1:
Œ3x D Œ4
Œ5.Œ3x/ D Œ5 Œ4
.Œ5 Œ3/ x D Œ6
x D Œ6:
i i
“book2” — 2013/5/24 — 8:18 — page 162 — #182
i i
the context make things clear. For example, the calculation in Z7 above will
usually be written
3x D 4
5.3x/ D 5 4
.5 3/ x D 6
x D 6:
What are the units in ZŒi ? Our work in Chapter 3 lets us find the answer.
Every nonzero Gaussian integer z has an inverse in C, but that inverse may not
be in ZŒi . Proposition 3.11 shows, in C, that
1 z
z D :
zz
The denominator on the right-hand side is none other than N.z/, the norm of z,
and this suggests the following proposition.
i i
“book2” — 2013/5/24 — 8:18 — page 163 — #183
i i
1; i ; 1; i:
Proof. If m is prime and 0 < a < m, then gcd.a; m/ D 1, and Proposition 4.39
says that a is a unit in Zm . Hence, Zm is a field. We have removed the
Conversely, suppose that m is not prime; that is, m D ab, where 0 < a, brackets from the notation
for elements of Zm .
b < m. In Zm , both a and b are nonzero, and ab D 0. If a has an inverse in
Zm , say, s, then sa D 1,which gives the contradiction:
0 D s0 D s.ab/ D .sa/b D 1b D b:
Who would have thought that a field could have a finite number of elements?
When one of us was a graduate student, a fellow student was tutoring a 10-year
old prodigy. To illustrate the boy’s talent, he described teaching him how to
multiply 2 2 matrices. As soon as he was shown that the 2 2 identity matrix
I satisfies IA D A for all matrices A, the boy immediately began writing;
after a few minutes he smiled, for he had just discovered that A D ac db has
an inverse if and only if ad bc ¤ 0! Later, when this boy was told the
definition of a field, he smiled as the usual examples of Q, R, and C were
trotted out. But when he was shown Z2 , he threw a temper tantrum and ended
the lesson.
In Theorem 4.17, we considered linear congruences in one variable. We
now consider linear systems in two variables.
ax C by u mod p
cx C dy v mod p
ax C by D u
cx C dy D v:
i i
“book2” — 2013/5/24 — 8:18 — page 164 — #184
i i
How to Think About It. Had you mimicked the method in the example
when proving Theorem 4.44, you would have found Cramer’s Rule, a generic
formula for the solution to the system
ax C by D u
cx C dy D v:
Exercises
4.40 Give an example of a commutative ring R containing an element a with a ¤ 0,
a ¤ 1, and a2 D a.
4.41 * The notation in this exercise is that of Example 4.31.
(i) Find all the units in Fun.R/ D RR .
(ii) Prove that a continuous function uW X ! R is a unit in C.X / if and only if
u.t / ¤ 0 for all t 2 X .
p p
4.42 Let R D ZŒ 3 D fa C b 3 W a; b 2 Zg:
(i) Show, with the usual addition and multiplication of real numbers, that R is a
commutative ring.
p
(ii) Show that u D 2 C 3 is a unit in R.
(iii) Show that R has infinitely many units.
i i
“book2” — 2013/5/24 — 8:18 — page 165 — #185
i i
3x 2y C z 1 mod 7
x Cy 2z 0 mod 7
x C 2y C z 4 mod 7:
2x C 5y D 7
x C 4y D 9
i i
“book2” — 2013/5/24 — 8:18 — page 166 — #186
i i
i i
“book2” — 2013/5/24 — 8:18 — page 167 — #187
i i
A B
commutative ring. The empty set ¿ is the zero element, for A C ¿ D A, while
each subset A is its own negative, for A C A D ¿. These exercises also show
that symmetric difference is associative and that the distributive law holds. Fi-
nally, X itself is the identity element, for X \ A D A for every subset A. We
call 2X a Boolean ring.
Suppose now that Y ¨ X is a proper subset of X; is 2Y a subring of 2X ?
If A and B are subsets of Y , then A C B and A \ B are also subsets of Y ;
that is, 2Y is closed under the addition and multiplication on 2X . However, the
identity element in 2Y is Y , not X, and so 2Y is not a subring of 2X . N
The example of 2X may have surprised you. It was natural for us to in-
troduce the notion of commutative ring, for we had already seen many ex-
amples of numbers or of functions in which addition and multiplication make
sense and obey the usual rules. But the elements of 2X are neither numbers nor
functions. And even though we call their binary operations addition and mul-
tiplication, they are operations from set theory. This is a happy circumstance,
which we will exploit in the next chapter. It’s not really important what we
call addition and multiplication; what is important is that the operations satisfy
eight fundamental properties; that is, the axioms in the definition of commuta-
tive ring.
Just as the notion of a subring of a commutative ring is useful, so too is the
notion of a subfield of a field.
Exercises
4.56 Give an example of a subring of a field that is not a field.
4.57 * Prove Proposition 4.48.
4.58 (i) Show that f0; 2g Z4 has the same addition and multiplication tables as Z2 .
(ii) Is Z2 a subring of Z4 ?
(iii) Is f0; 2; 4; 6g a subring of Z8 ?
i i
“book2” — 2013/5/24 — 8:18 — page 168 — #188
i i
p p
4.59 Let R D ZŒ 3 D fa C b 3 W a; b 2 Zg.
(i) Show that R is a subring of the Eisenstein integers.
(ii) What are the units in R?
4.60 (i) If S and T are subrings of a ring R, show that S \ T is also a subring of R.
(ii) Show that the intersection of the Gaussian and Eisenstein integers is Z.
4.61 *
(i) If .Si /i 2I isTa family of subrings of a commutative ring R, prove that their
intersection i 2I Si is also a subring of R.
(ii) If X is a subset of a commutative ring R, define G.X /, the subring generated
by X , to be the intersection of all the subrings of R that contain X .
Prove that G.X / is the smallest subring containing X in the following
sense: if S is any subring of R containing X , then G.X / S.
(iii) Let .Si /i 2I be a family of subrings
T of a commutative ring R, each of which
is a field. Prove that the subring i 2I Si is a field. Conclude that the inter-
section of a family of subfields of a field is a subfield.
4.62 Let p be a prime and let Ap be the set of all fractions with denominator a power
of p.
(i) Show, with the usual operations of addition and multiplication, that Ap is a
subring of Q.
(ii) Describe the smallest subring of Q that contains both A2 and A5 .
4.63 Let p be a prime and let Qp be the set of rational numbers whose denominator
(when written in lowest terms) is not divisible by p.
(i) Show, with the usual operations of addition and multiplication, that Qp is a
subring of Q.
(ii) Show that Q2 \ Q5 is a subring of Q.
(iii) Is Qp a field? Explain.
(iv) What is Qp \ Ap , where Ap is defined in Exercise 4.62?
4.64 *
(i) Prove that ZŒi D fa C bi W i 2 D 1 and a; b 2 Zg, the Gaussian integers,
is a commutative ring.
(ii) Prove that ZŒ! D fa C b! W ! 3 D 1 and a; b 2 Zg, the Eisenstein integers,
is a commutative ring.
4.65 * Prove that ZŒ D fa C b i W 0 i < n and a; b 2 Zg is a commutative ring,
where is a primitive nth root of unity.
4.66 * It may seem more natural to define addition in 2X as union rather than symmet-
ric difference. Is 2X a commutative ring if addition A ˚ B is defined as A [ B
and AB is defined as A \ B?
4.67 If X is a finite set with exactly n elements, how many elements are in 2X ?
4.68 * If A and B are subsets of a set X , prove that A B if and only if A D A \ B.
4.69 * Recall that if A is a subset of a set X , then its complement is
Ac D fx 2 X W x … Ag:
i i
“book2” — 2013/5/24 — 8:18 — page 169 — #189
i i
A B
C
Figure 4.3. Associativity.
i i
“book2” — 2013/5/24 — 8:18 — page 170 — #190
i i
would be fine if the year were exactly 365.25 days long, but it has the effect of
making the year 365:25 365:2422 D :0078 days (about 11 minutes and 14
seconds) too long. After 128 years, a full day was added to the calendar; that
is, the Julian calendar overcounted the number of days. In the year 1582, the
vernal equinox (the Spring day on which there are exactly 12 hours of daylight
and 12 hours of night) occurred on March 11 instead of on March 21. Pope
Gregory XIII (and his scientific advisors) then installed the Gregorian calen-
dar by erasing 10 days that year; the day after October 4, 1582 was October
15, 1582. This caused confusion and fear among the people; they thought their
lives had been shortened by ten days.
The Gregorian calendar modified the Julian calendar as follows. Call a
year y ending in 00 a century year. If a year y is not a century year, then
it is a leap year if it is divisible by 4; if y is a century year, it is a leap year only
if it is divisible by 400. For example, 1900 is not a leap year, but 2000 is a leap
year. The Gregorian calendar is the one in common use today, but it was not
uniformly adopted throughout Europe. For example, the British empire didn’t
accept it until 1752, when 11 days were erased, and the Russians didn’t ac-
cept it until 1918, when 13 days were erased (thus, Trotsky called the Russian
revolution, which occurred in 1917, the October Revolution, even though it
occurred in November of the Gregorian calendar).
The true number of days in 400 years is about
while the Gregorian calendar, which eliminates three leap years from this time
period, has 146,097 days. Thus, the Julian calendar gains about 3.12 days every
400 years, while the Gregorian calendar gains only 0.12 days (about 2 hours
and 53 minutes.
Historical Note. There are 1628 years from 46 BCE to 1582 CE. The Julian
calendar overcounts one day every 128 years, and so it overcounted 12 days in
this period (for 12 128 D 1536). Why didn’t Gregory have to erase 12 days?
The Council of Nicaea, meeting in the year 325 CE, defined Easter as the first
Sunday strictly after the Paschal full moon, which is the first full moon on or
after the vernal equinox (now you know why Pope Gregory was interested in
the calendar). The vernal equinox in 325 CE fell on March 21, and the Synod
of Whitby, in 664 CE, officially defined the vernal equinox to be March 21. The
discrepancy observed in 1582 was thus the result of only 1257 D 1582 325
years of the Julian calendar: approximately 10 days.
We now seek a calendar formula. For easier calculation, choose 0000 as our
reference year, even though there was no year zero! Assign a number to each
day of the week, according to the scheme
i i
“book2” — 2013/5/24 — 8:18 — page 171 — #191
i i
365 D 52 7 C 1 1 mod 7:
Similarly, March 1, 0002, has number a0 C 2, and March 1, 0003, has number
a0 C 3. However, March 1, 0004, has number a0 C 5, for February 29, 0004,
fell between March 1, 0003, and March 1, 0004, and so 366 2 mod 7 days
had elapsed since the previous March 1. We see, therefore, that every common
year adds 1 to the previous number for March 1, while each leap year adds 2.
Thus, if March 1, 0000, has number a0 , then the number a0 of March 1, year y,
is
a0 a0 C y C L mod 7;
where L is the number of leap years from year 0001 to year y. To compute L,
count all those years divisible by 4, then throw away all the century years, and
then put back those century years that are leap years. Thus,
a0 a0 C y C L
a0 C y C by=4c b y=100c C by=400c mod 7:
and so
a0 2496 4 3 mod 7
(that is, March 1, 0000 fell on Wednesday). We can now determine the day of
the week a0 on which March 1 will fall in any year y > 0, for
Let us now analyze February 28. For example, suppose that February 28,
1600, has number b. As 1600 is a leap year, February 29, 1600, occurs between
February 28, 1600, and February 28, 1601; hence, 366 days have elapsed be-
tween these two February 28s, so that February 28, 1601, has number b C 2.
February 28, 1602, has number b C 3, February 28, 1603, has number b C 4,
i i
“book2” — 2013/5/24 — 8:18 — page 172 — #192
i i
February 28, 1604, has number b C 5, but February 28, 1605, has number b C 7
(for there was a February 29 in 1604).
Let us compare the pattern of behavior of February 28, 1600, namely, b;
b C 2; b C 3; b C 4; b C 5; b C 7; : : : ; with that of some date in 1599. If May 26,
1599, has number c, then May 26, 1600, has number c C 2, for February 29,
1600, comes between these two May 26s, and so there are 366 2 mod 7 in-
tervening days. The numbers of the next few May 26s, beginning with May 26,
1601, are c C 3, c C 4, c C 5, c C 7. We see that the pattern of the days for
February 28, starting in 1600, is exactly the same as the pattern of the days
for May 26, starting in 1599; indeed, the same is true for any date in January
or February. Thus, the pattern of the days for any date in January or February
of a year y is the same as the pattern for a date occurring in the preceding
year y 1: a year preceding a leap year adds 2 to the number for such a date,
whereas all other years add 1. Therefore, we pretend we have reverted to the
ancient calendar by making New Year’s Day fall on March 1, so that any date
in January or February is treated as if it had occurred in the previous year.
How do we find the day corresponding to a date other than March 1? Since
March 1, 0000, has number 3 (as we have seen above), April 1, 0000, has
number 6, for March has 31 days and 3 C 31 6 mod 7. Since April has 30
days, May 1, 0000, has number 6 C 30 1 mod 7. Figure 4.4 is the table
giving the number of the first day of each month in year 0000.
Remember that we are pretending that March is month 1, April is month 2,
and so on. Let us denote these numbers by 1 C j.m/, where j.m/, for m D
1; 2; : : : ; 12, is defined by
j.m/ W 2, 5, 0, 3, 5, 1, 4, 6, 2, 4, 0, 3.
It follows that month m, day 1, year y, has number
1 C j.m/ C g.y/ mod 7;
where
g.y/ D y C b y=4c b y=100c C by=400c:
Note that a0 D 1 C j.1/, so that the values of j.m/ depend on our knowing a0 .
Here’s a formula for j.m/:
j.m/ D b2:6m 0:2c; where 1 m 12I
the values are displayed in Figure 4.4. This formula is not quite accurate. For
example, this number for December, that is, for m D 10, is b2:6m 0:2c D 25;
but j.10/ D 4. However, 25 4 mod 7, and so the formula for j.m/ really
gives the congruence class mod 7.
i i
“book2” — 2013/5/24 — 8:18 — page 173 — #193
i i
Theorem 4.49 (Calendar Formula). The date with month m, day d , year y The word calendar comes
has number from the Greek “to call,”
which evolved into the
Latin word for the first day
d C j.m/ C g.y/ mod 7;
of a month (when accounts
were due).
where j.m/ is given in Figure 4.4,
and dates in January and February are treated as having occurred in the pre-
vious year.
1 C j.m/ C g.y/:
Let’s find the day of the week on which July 4, 1776 fell; here m D 5,
d D 4, and y D 1776. Substituting in the formula, we obtain the number
2; 5; [0; 3; 5; 1; 4; 6; 2]; 4; 0; 3:
Indeed, we see that there must be a Friday 13 occurring between May and
November. No number occurs three times on the list, but it is possible that
there are three Friday 13s in a year because January and February are viewed as
having occurred in the previous year; for example, there were three Friday 13s
in 1987 (see Exercise 4.79 on page 176). Of course, we may replace Friday by
any other day of the week, and we may replace 13 by any number between 1
and 28. N
i i
“book2” — 2013/5/24 — 8:18 — page 174 — #194
i i
Most of us need paper and pencil (or a calculator) to use the calendar for-
mula in the theorem, but here’s a way to simplify the formula so you can do
the calculation in your head and amaze your friends. A mnemonic for j.m/ is
the sentence
Corollary 4.51. The date with month m, day d , year y D 100C C N , where
0 N 99, has number
provided that dates in January and February are treated as having occurred in
the previous year.
y D 100C C N 2C C N mod 7;
b y=4c D 25C C bN=4c 4C C bN=4c mod 7;
b y=100c D C; and b y=400c D bC=4c:
Therefore,
This formula is simpler than the first one. For example, the number corre-
sponding to July 4, 1776 is now obtained as
4 C 5 C 76 C 19 C 4 34 D 74 4 mod 7;
agreeing with our calculation above. The reader may now compute the day of
his or her birth.
Example 4.52. The birthday of Rose, the grandmother of Danny and Ella, was
January is counted as January 1, 1909; on what day of the week was she born?
belonging to the previous
year 1908. We use Corollary 4.51. If A is the number of the day, then j.m/ D 0 (for
January corresponds to month 11), and
A 1 C 0 C 8 C b8=4c C b19=4c 38
23 mod 7
5 mod 7:
J. H. Conway found an even simpler calendar formula. The day of the week
on which the last day of February occurs is called the doomsday of the year.
We can compute doomsdays using Corollary 4.51.
Knowing the doomsday D of a century year 100C finds the doomsday D 0
of any other year y D 100C C N in that century. Since 100C is a century
i i
“book2” — 2013/5/24 — 8:18 — page 175 — #195
i i
year, the number of leap years from 100C to y does not involve the Gregorian
alteration. Thus,
D 0 D C N C bN=4c mod 7:
3 C 94 C 23 D 120 1 mod 7:
D C q C r C br=4c mod 7:
Proof.
D 0 D C N C bN=4c
D C 12q C r C b.12q C r /=4c
D C 15q C r C br=4c
D C q C r C br=4c mod 7:
If we return to the everyday listing beginning with January as the first month,
then it is easier to remember these dates using the notation month/day:
i i
“book2” — 2013/5/24 — 8:18 — page 176 — #196
i i
Since doomsday corresponds to the last day of February, we are now within
a few weeks of any date in the year, and we can easily interpolate to find the
desired day. For example, let’s use this method for July 4, 1776. Notice that
July 4 occurs on the same day of the week as July 11, and so we need only find
doomsday 1776. By Proposition 4.53,
D 0 0 C 76 C b76=4c D 95 4 mod 7:
We see again that July 4, 1776 fell on a Thursday.
Example 4.54. Let’s use Conway’s method to compute Rose’s birthday again
(recall Example 4.52: Rose was born on January 1, 1909). Since Conway’s
method applies within a given century, there is no need to pretend that Jan-
uary and February live in the preceding year; we can work within 1909. Now
doomsday 1900 is 3, so that Proposition 4.53 gives doomsday 1909 D 0; that
is, Sunday. By definition, doomsday is the number corresponding to the last
date in February, which is here February 28 (for 1909 is not a leap year). Thus,
we interpolate that 3 is the number for 1=31, 1=24, 1=3; that is, January 3 fell
on Sunday, and so January 1 fell on Friday (which agrees with what we saw in
Example 4.52). N
Exercises
4.76 A suspect said that he had spent the Easter holiday April 21, 1893, with his ailing
mother; Sherlock Holmes challenged his veracity at once. How could the great
detective have been so certain?
Hint: Easter always falls on Sunday. (There is a Jewish variation of this problem,
for Yom Kippur must fall on either Monday, Wednesday, Thursday, or Saturday;
secular variants can involve Thanksgiving Day, which always falls on a Thursday,
or Election Day in the US, which always falls on a Tuesday.)
4.77 How many times in 1900 did the first day of a month fall on a Tuesday?
Hint: The year y D 1900 was not a leap year.
4.78 On what day of the week did February 29, 1896 fall?
Hint: On what day did March 1, 1896, fall? Conclude from your method of solu-
tion that no extra fuss is needed to find leap days.
4.79 *
(i) Show that 1987 had three Friday 13s.
Hint: See Example 4.50.
(ii) Show, for any year y > 0, that g.y/ g.y 1/ D 1 or 2, where g.y/ D
y C b y=4c by=100c C b y=400c.
(iii) Can there be a year with exactly one Friday 13?
Hint: Either use congruences or scan the 14 possible calendars: there are 7
possible common years and 7 possible leap years, for January 1 can fall on
any of the 7 days of the week.
4.80 * JJR’s Uncle Ben was born in Pogrebishte, a village near Kiev, and he claimed
that his birthday was February 29, 1900. JJR told him that this could not be, for
1900 was not a leap year. Why was JJR wrong?
Hint: When did Russia adopt the Gregorian calendar?
i i
“book2” — 2013/5/24 — 8:18 — page 177 — #197
i i
Real Numbers
We assume that every real number x has a decimal expansion; for example,
D 3:14159 : : : . This follows from identifying each real number x with
a “point on a number line” having signed distance from a fixed origin on a
coordinatized line. In particular, rational numbers have decimal expansions,
which you can find by long division.
The term expansion will be used in a nonstandard way: we restrict the ter-
minology so that, from now on, the decimal expansion of a real number is the
sequence of digits after the decimal point. With this usage, for example, the
decimal expansion of is :14159 : : : .
We are going to see that decimal expansions of real numbers are unique,
with one possible exception: if there is an infinite string of all 9s. For example,
:328 D :327999 : : : :
i i
“book2” — 2013/5/24 — 8:18 — page 178 — #198
i i
i i
“book2” — 2013/5/24 — 8:18 — page 179 — #199
i i
dk C 1 ek
:dk dkC1 : : : < :ek ekC1 : : : ;
10 10
contradicting the fact that the extreme left-hand and right-hand expressions are
equal.
r D :d1 d2 d3 : : : I
di di C1 : : : di Cm 1:
We could say that “terminating” and “repeating” decimals are not really
different, for terminating rationals have decimal expansions that repeat with
period 1 and with block having the single digit 0, but it’s convenient and natural
to distinguish such rationals from those having infinitely many nonzero digits,
as you’ll see in Proposition 4.59.
The way to get the decimal expansion for 1=7 is to divide 7 into 1 via long
division, as in Figure 4.6. Each of the remainders 1 through 6 shows up exactly
once in this calculation, in the order 3; 2; 6; 4; 5; 1. Once you get a remainder
of 1, the process will start over again, and the digits in the quotient, namely,
1; 4; 2; 8; 5; 7, will repeat. The block of 1=7 is 142857. However, even though
i i
“book2” — 2013/5/24 — 8:18 — page 180 — #200
i i
0.142857. . . 0.153846. . .
7 1.000000. . . 13 2.000000. . .
7 13
30 70
28 65
20 50
14 39
60 110
56 104
40 60
35 52
50 80
49 78
1 2
Figure 4.6. 1=7 D :142857142857142857: : : : Figure 4.7. 2=13 D :153846 : : : :
142857142857 also repeats, it is not a block because it is too long: 1=7 has
period 6, not 12.
Consider a second example: the calculation of 2=13 in Figure 4.7. It too has
period 6.
Next, we’ll see that every rational number terminates or repeats; that is, the
two types in the definition are the only possibilities.
Proof. The arguments for 1=7 and 2=13 generalize. Imagine expressing a frac-
tion a=b (with a; b > 0) as a decimal by dividing b into a via long division.
There are at most b possible remainders in this process (integers between 0
and b 1), so after at most b steps a remainder appears that has shown up
before. After that, the process repeats.
Conversely, let’s see that if a real number x terminates or repeats, then x is
rational. A terminating decimal is just a fraction whose denominator is a power
of 10, while a repeating decimal is made up of such a fraction plus the sum of
Middle school students a convergent geometric series. An example is sufficient to see what’s going on.
practice another method
for doing this (for days on :1323232 : : : D :1 C :0323232 : : :
end). See Exercise 4.84 on
32 32 32
page 181. D :1 C 3 C 5 C 7 C : : :
10 10 10
32 1 1
D :1 C 3 1 C 2 C 4 C : : :
10 10 10
!
1 4 1
D C 1
: (by Lemma 4.55)
10 25 1
102
The last expression is clearly a rational number. The general proof is a generic
version of this idea; it is left as Exercise 4.81 below.
i i
“book2” — 2013/5/24 — 8:18 — page 181 — #201
i i
How to Think About It. Exercise 4.83 shows that if r is a rational number
and 5m r or 2` r terminates, then r also terminates. However, if kr terminates
(for some integer k), then r need not terminate; for example, r D :271333 : : :
does not terminate, but 3r D :814 does terminate.
Exercises
4.81 * Complete the proof of Proposition 4.58 that a decimal that eventually repeats is
the decimal expansion of a rational number.
4.82 * Let r D a=b be rational.
(i) If r terminates, then kr terminates for every integer k.
(ii) If gcd.a; b/ D 1, prove that a=b terminates if and only if 1=b terminates.
Hint: 1=b D .sa C t b/=b D sa=b C t .
4.83 * If ` 0, m 0, and 2` 5m r terminates, prove that r terminates.
4.84 * Here’s a method used by many precollege texts for converting repeating dec-
imals to fractions. Suppose that you want to convert :324324 : : : to a fraction.
Calculate like this: If x D :324324 : : :, then 1000x D 324:324324 : : : , and
Hence, x D 324=999:
(i) There is a hidden assumption about geometric series in this method. Where is
it?
(ii) Try this method with the following decimal expansions:
(a) :356356 : : : (b) :5353 : : :
(c) :2222 : : : (d) :07593 : : :
(e) :0123563563: : :
i i
“book2” — 2013/5/24 — 8:18 — page 182 — #202
i i
x D 1 C 2 C 22 C : : :
2x D 2 C 22 C 23 C : : : :
0.076923. . . 0.153846. . .
13 1.000000. . . 13 2.000000. . .
00 13
1 00 70
91 65
90 50
78 39
120 110
117 104
30 60
26 52
40 80
39 78
1 2
Figure 4.8. Decimal expansions of 1=13 and 2=13.
An analysis of the calculation for 1=13 yields another insight (see Figure
4.8). Pretend that the decimal point isn’t there, so we are dividing 1;000;000 D
106 by 13. Since 1 appears as a remainder, the initial sequence of remainders
will repeat, and the period of 1=13 is 6. Thus, the period of 1=13 is the smallest
power of 10 congruent to 1 mod 13. In other words, the period of 1=13 is the
order of 10 in Z13 (see Exercise 4.49 on page 165).
We will generalize this observation in Theorem 4.61: the period of any frac-
tion 1=n is the order of 10 in Zn as long as there is some positive integer e with
i i
“book2” — 2013/5/24 — 8:18 — page 183 — #203
i i
10e 1 mod n. But, by Exercise 4.48 on page 165, some power of 10 is con-
gruent to 1 mod n if and only if 10 is a unit in Zn . Now this condition is
equivalent to gcd.10; n/ D 1; that is, if and only if n is not of the form 2u 5v .
Thus, Proposition 4.59 shows why the dichotomy of terminating and repeating
rationals is so natural.
To prove the general result for 1=n, we just need to make sure that the first
remainder that shows up twice is, in fact, 1. That’s the content of the next
lemma.
1 10e2 e1
mod n:
And since e2 e1 < e, this would contradict the fact that e is the order of 10
in Zm .
Proof. Part (i) was proved in Proposition 4.59. The essence of the proof of
part (ii) lies in the discussion on page 182 about the decimal expansion of 1=13:
the expansion for 1=n repeats after e steps, where e is the order of 10 in Zn ;
that is, the first occurrence of remainder 1 occurs at the eth step of the long
division. And Lemma 4.60 shows that there can be no earlier occurrences.
So, if gcd.n; 10/ D 1, then 1=n repeats, and we know that its period is the
order of 10 in Zn . What about fractions of the form a=n? The next corollary
shows that the same thing is true, as long as the fraction is in lowest terms.
Corollary 4.62. If gcd.a; n/ D 1 and a < n, then the period of a=n is the
same as that of 1=n, namely the order of 10 in Zn .
Proof. Suppose the period of a=n is `. Then, arguing as in Lemma 4.60, the
expansion will repeat only after the remainder a occurs in the long division
i i
“book2” — 2013/5/24 — 8:18 — page 184 — #204
i i
1 1
p period for p p period for p
3 1 97 96
7 6 101 4
11 2 103 34
13 6 107 53
17 16 109 108
19 18 113 112
23 22 127 42
29 28 131 130
31 15 137 8
37 3 139 46
41 5 149 148
43 21 151 75
47 46 157 78
53 13 163 81
59 58 167 166
61 60 173 43
67 33 179 178
71 35 181 180
73 8 191 95
79 13 193 192
83 41 197 98
89 44 199 99
of a by n (see Exercise 4.88 on page 190). But this implies that ` is the smallest
positive integer such that
a 10` a mod n:
1
Since a is a unit in Zn , multiplying by a gives
`
10 1 mod n:
It follows that ` D m, the order of 10 in Zn .
Theorem 4.61 doesn’t answer every question about the periods of 1=p,
where p is a prime other than 2 or 5. Sometimes the period is p 1, as when
p D 7, but this not always so, for 1=13 has period 6, not 12. In all the entries in
Figure 4.9, we see that periods of 1=p are divisors of p 1. This turns out to
be always true, and you’ll prove it soon. What about non-prime denominators?
Perhaps the length of the period of the expansion of 1=n is a factor of n 1?
But stay tuned—we’ll No such luck: 1=21 D :047619047619 : : : has period 6 which is not a divisor
return to the period of 1=n of 20.
shortly.
i i
“book2” — 2013/5/24 — 8:18 — page 185 — #205
i i
follows: there are infinitely many primes p for which the order of 10 in Zp is
p 1. E. Artin generalized Gauss’s conjecture. He claimed that if b is a posi- See Exercises 4.92
tive integer that is not a perfect square, then there are infinitely many primes p and 4.93 on page 190.
for which the b-adic expansion of 1=p has period p 1. These are still con-
jectures (as Gauss’s conjecture above), and very celebrated ones at that. Many
seemingly simple questions in arithmetic are extremely hard to answer.
But some things are known. For example, Gauss proved in Disquisitiones
that for any prime p, there is always at least one number (not necessarily 10)
whose order in Zp is p 1. Such a number is called a primitive root mod p.
We now know that the period of 1=n, where gcd.10; n/ D 1, is the order of
10 in Zn . In Exercise 4.49 on page 165, you did some calculations of orders of
units. We can now say a little more.
u.n/ D 1
where is the Euler -function.
Proof. By Proposition 4.39, there are .n/ units in Zn . Suppose we list them
all:
u1 ; u2 ; : : : ; u.n/ :
One of these units is u. Now multiply all these units by u; you get
.n/ D qe C r 0 r < e:
Then
q
u.n/ D uqeCr D .ue / ur :
Now use Theorem 4.63 and the fact that e is the minimal positive exponent
such that ue D 1 to conclude that r D 0.
i i
“book2” — 2013/5/24 — 8:18 — page 186 — #206
i i
Theorem 4.65. If n is relatively prime to 10 .that is, if 1=n repeats/, then the
period of 1=n is a divisor of .n/.
This greatly reduces the number of possibilities. For example, all we could
say about the length of the period of 1=231 before is that it is at most 230. Now
we can say it is a factor of .231/ D 120. Which one is it?
ap a 0 mod p;
and ap D a in Zp .
If gcd.a; p/ D 1, then a is a unit in Zp , and Proposition 4.63 gives
a.p/ D 1
ap 1
D 1:
Proof. If n is not prime, then .n/ < n 1, and the period of 1=n is not
n 1.
How to Think About It. The converse of Corollary 4.68 is not true, as the
example of 1=13 shows. As we said on page 184, it’s still an open question
about which primes p have the property that the decimal expansion for 1=p
has maximal period. All we can say is that the decimal expansion is a divi-
sor of .p/ D p 1, providing an explanation for the evidence gathered in
Figure 4.9.
i i
“book2” — 2013/5/24 — 8:18 — page 187 — #207
i i
other decimal expansions to look for interesting patterns. For example, calcu-
late the decimal expansions of
1 2 1 1 2 1 1
; ; ; ; ; ;
8 3 15 19 19 13 20
to see whether you can come up with some conjectures for connections be-
tween the integers a and b and the blocks in the decimal expansion of a=b.
Figure 4.10 displays the digits in the blocks of k=7 for 1 k 6. Is there a
way to explain where each rearrangement starts?
1
D :142857142857 : : :
7
2
D :285714285714 : : :
7
3
D :428571428571 : : :
7
4
D :571428571428 : : :
7
5
D :714285714285 : : :
7
6
D :857142857142 : : :
7
There are quite a few patterns here. For example, each block consists of six
repeating digits—some “cyclic” permutation of 142857:
It’s the sequence of remainders that explains the various decimal expansions
of k=7—what they are and why they are in a particular order. For example, in
calculating 6=7, you look down the remainder list and see where you get a 6.
The process for 6=7 will start there, as in Figure 4.11.
The point of Figure 4.11 is that you can “pick up” the calculation at any step
in the process—in a way, the calculation of 6=7 is embedded in the calculation
of 1=7. So are the calculations for all the other k=7 for 2 k 5.
So, the sequence of remainders in a long division provides the key to which
decimal expansions can be obtained from the same long division. For the ra-
tionals k=7, there were six remainders before things started to repeat, so we
get all the expansions 1=7; 2=7; : : : ; 6=7 from one calculation. But it isn’t al-
ways the case that you get all the expansions for k=n (where 1 k < n) from
the calculation of 1=n. That only happens when the period for the decimal ex-
pansion of 1=n has the maximal length n 1 (implying that n is prime). For
example, for the various k=13, you need two calculations, because the period
of the expansion for 1=13 is 6, not 12.
Earlier, on page 187, we listed the blocks for the various k=7, noting that
there seemed to be no apparent pattern to where each block starts. In fact, a
closer analysis of the long division gives us a way to calculate the digits in each
block. Consider again the calculation of the expansion for 6=7. As before, if
i i
“book2” — 2013/5/24 — 8:18 — page 188 — #208
i i
0. 142857. . . 0 . 857142. . .
7 1. 000000. . . 7 6 . 000000. . .
7 56
30 40
28 35
20 50
14 49
60 10
56 7
40 30
35 28
50 20
49 14
1 6
we “forget” the decimal point, each new remainder gives the remainder when 6
times a power of 10 is divided by 7. Referring to Figure 4.11, we have
6 6 1 mod 7
4 6 10 mod 7
5 6 100 mod 7
1 6 1000 mod 7
3 6 10000 mod 7
2 6 100000 mod 7
6 6 1000000 mod 7:
Now, these are the remainders, not the digits in the block. Still, we have an
interesting preliminary result.
cj a 10j mod n
with 0 cj < n.
.q1 q2 q3 q4 . . . qj
n a. 0 0 0 0. . . 0
i i
“book2” — 2013/5/24 — 8:18 — page 189 — #209
i i
a10j D .n q1 q2 q3 q4 : : : qj / C cj :
cj a 10j mod n:
What about the digits in the blocks? As in Lemma 4.69, let gcd.10; n/ D
gcd.a; n/ D 1 and e be the order of 10 in Zn . Then we know that the eth
remainder is a, where
What is the block? Our old friend the Division Algorithm gives the answer:
a.10e 1/
:
n
Proof. The above discussion shows that the block is the partial quotient up to
a remainder of a in the division. Rewrite Eq. (4.3) as:
a 10e D qn C a
1
D :142857 : : : and 1.106 1/=7 D 142857
7
2
D :285714 : : : and 2.106 1/=7 D 285714
7
3
D :428571 : : : and 3.106 1/=7 D 428571
7
4
D :571428 : : : and 4.106 1/=7 D 571428
7
5
D :714285 : : : and 5.106 1/=7 D 714285
7
6
D :857142 : : : and 6.106 1/=7 D 857142:
7
It’s an interesting calculation to go through the same process for the vari-
ous k=13. N
i i
“book2” — 2013/5/24 — 8:18 — page 190 — #210
i i
Exercises
4.87 Find the order of 10 modulo n (if it exists) for each value of n, and verify that the
decimal expansion of 1=n has period equal to the order.
(i) 7 (ii) 9 (iii) 3 (iv) 6 (v) 8
(vi) 11 (vii) 13 (viii) 39 (ix) 22 (x) 41
(xi) 73 (xii) 79 (xiii) 123 (xiv) 71 (xv) 61
4.88 * Finish the proof of Corollary 4.62 by showing that the decimal expansion of
1=n, where 1 a < n and gcd.a; n/ D 1, will repeat only after a remainder of a
occurs in the long division of a by n.
4.89 If
L D fu1 ; u2 ; : : : ; u.n/ g
is the list of units in Zn and u is any unit, show that the elements of
i i
“book2” — 2013/5/24 — 8:18 — page 191 — #211
i i
5 Abstract Algebra
191
i i
“book2” — 2013/5/24 — 8:18 — page 192 — #212
i i
Many texts say integral Definition. A domain D is a nonzero commutative ring in which every prod-
domain instead of domain. uct of nonzero elements is nonzero.
i i
“book2” — 2013/5/24 — 8:18 — page 193 — #213
i i
Fraction Fields
The converse of Proposition 5.3—every domain is a subring of a field—is
much more interesting than the proposition. Just as the domain Z is a sub-
ring of the field Q, so, too, is any domain a subring of its fraction field. We’ll
construct such a field containing a given domain using the construction of Q
from Z as inspiration. This is not mere generalization for generalization’s sake;
we shall see, for example, that it will show that certain polynomial rings are
subrings of fields of rational functions.
How to Think About It. Warning! Over the years, school curricula have
tried using the coming discussion to teach fractions to precollege students,
even to fourth graders. This is a very bad idea. Experience should precede
formalism and, in this particular case, introducing rational numbers as ordered
pairs of integers was a pedagogical disaster.
How to Think About It. One reason cross multiplication is important is that
it converts many problems about fractions into problems about integers.
i i
“book2” — 2013/5/24 — 8:18 — page 194 — #214
i i
Proof. The argument given above for Z is valid for D. The assumption that D
is a domain is present so that we can use the cancellation law to prove trans-
itivity.
How to Think About It. In the back of our minds, we think of Œa; b as
the fraction a=b. But, in everyday experience, fractions (especially rational
numbers) are used in calculations—they can be added, multiplied, subtracted,
and divided. The next theorem equips Frac.D/ with binary operations that will
look familiar to you if you keep thinking that Œa; b stands for a=b.
D 0 D fŒa; 1 W a 2 Dg;
is a subring of Frac.D/.
1
(iii) Every h 2 Frac.D/ has the form uv , where u; v 2 D 0 .
i i
“book2” — 2013/5/24 — 8:18 — page 195 — #215
i i
Œ1; 1 2 D 0
Œa; 1 C Œc; 1 D Œa C c; 1 2 D 0
Œa; 1Œc; 1 D Œac; 1 2 D 0 :
Notation. From now on, we use standard notation: If D is a domain, then the But be careful: for arbitrary
element Œa; b in Frac.D/ will be denoted by fraction fields, the notation
a=b is just an alias for
Œa; b. For Q, the notation
a=b:
is loaded with all kinds of
extra meanings that don’t
Of course, Q D Frac.Z/. Not surprisingly, elementary school teachers are carry over to the general
correct: it is, indeed, true that a=b D c=d if and only if ad D bc. setting (for example, as a
We started this section with two goals: to show that every domain is a sub- number having a decimal
expansion obtained by
ring of a field, and to make precise the notion of “fraction.” We’ve done the
dividing a by b).
second, but we didn’t quite show that a domain D is a subring of Frac.D/;
instead, we showed that D 0 is a subring of Frac.D/, where D 0 consists of all
Œa; 1 for a 2 D. Now D and D 0 do bear a strong resemblance to each other.
If we identify each a in D with Œa; 1 in D 0 (which is reminiscent of identify-
ing an integer m with the fraction m=1), then not only do elements correspond
nicely, but so, too, do the operations: a C b corresponds to Œa C b; 1:
similarly, ab corresponds to Œab; 1 D Œa; 1Œb; 1. In Section 5.3, we will dis-
cuss the important idea of isomorphism which will make our identification here
precise. For the moment, you may regard D and D 0 as algebraically the same.
Exercises
5.1 Let R be a domain. If a 2 R and a2 D a, prove that a D 0 or a D 1. Compare
with Exercise 4.40 on page 164.
5.2 Prove that the Gaussian integers ZŒi and the Eisenstein integers ZŒ! are do-
mains.
5.3 * Prove that Zm is a domain if and only if Zm is a field. Conclude, using Theo-
rem 4.43, that Zm is a domain if and only if m is prime.
5.4 Prove that every finite domain D (i.e., jDj < 1) is a field.
Hint: Use the Pigeonhole Principle, Exercise A.11 on page 419.
i i
“book2” — 2013/5/24 — 8:18 — page 196 — #216
i i
5.2 Polynomials
You are surely familiar with polynomials; since they can be added and mul-
tiplied, it is not surprising that they form commutative rings. However, there
are some basic questions about them whose answers may be less familiar. Is a
polynomial a function? Is x a variable? If not, just what is x? After all, we first
encounter polynomials as real-valued functions having simple formulas; for
example, f .x/ D x 3 2x 2 C 7 is viewed as the function f # W R ! R defined
by f # .a/ D a3 2a2 C7 for every a 2 R. But some polynomials have complex
coefficients. Is it legitimate to consider polynomials whose coefficients lie in
any commutative ring R? When are two polynomials equal? Every high school
algebra student would say that the functions defined by f .x/ D x 7 C 2x 1
and g.x/ D 3x C 6 are not the same, because they are defined by different
polynomials. But these two functions are, in fact, equal when viewed as func-
tions Z7 ! Z7 , a fact that you can check by direct calculation. Here’s another
example. Is it legitimate to treat 2x C 1 as a polynomial whose coefficients lie
in Z4 ? If so, then .2x C 1/2 D 4x 2 C 4x C 1 D 1 (for 4 D 0 in Z4 ); that
is, the square of this linear polynomial is a constant! Sometimes polynomials
are treated as formal expressions in which x is just a symbol, as, for example,
when you factor x 6 1 or expand .x C 1/5 . And sometimes polynomials are
treated as functions that can be graphed or composed. Both of these perspec-
tives are important and useful, but they are clearly different.
We now introduce polynomials rigorously, for this will enable us to answer
these questions. In this section, we’ll first study polynomials from the formal
viewpoint, after which we’ll consider polynomial functions. In the next sec-
tion, we will see that the notion of homomorphism will link the formal and the
function viewpoints, revealing their intimate connection.
How to Think About It. As we said on page 193 in the context of fractions,
rigorous developments should not be points of entry. One goal of this section is
to put polynomials on a firm footing. This will prepare you for any future work
you do with beginning algebra students, but it is in no way meant to take the
place of all of the informal experience that’s necessary before the formalities
can be appreciated and understood.
i i
“book2” — 2013/5/24 — 8:18 — page 197 — #217
i i
Be patient. The reason for this terminology will be apparent in a few pages. In linear algebra, you may
In the meantime, pretend that .s0 ; s1; s2 ; : : : ; si ; : : : / is really s0 C s1 x C have seen the example of
s2 x 2 C C si x i C . the vector space V of all
polynomials of degree, say,
A formal power series over R is a sequence, but a sequence is just a 3 or less, with coefficients
function W N ! R (where N is the set of natural numbers) with .i / D si in R. As a vector space,
for all i 0. By Proposition A.2 in Appendix A.1, two sequences and are V can be thought of as
equal if and only if .i / D .i / for all i 2 N. So, formal power series are R4 , where the 4-tuple
equal if and only if they are equal “coefficient by coefficient.” .5; 6; 8; 9/ corresponds to
the polynomial 5 C 6x C
8x 2 C 9x 3 .
Proposition 5.6. Formal power series D .s0 ; s1; s2 ; : : : ; si ; : : : / and D
.t0 ; t1; t2 ; : : : ; ti ; : : : / over a commutative ring R are equal if and only if si D ti
for all i 0.
i i
“book2” — 2013/5/24 — 8:18 — page 198 — #218
i i
RŒŒx
denotes the set of all formal power series over R, and
RŒx RŒŒx
C D .s0 C t0 ; s1 C t1 ; : : : ; si C ti ; : : : /:
What about multiplication? The product of two power series is also computed
term by term; multiply formally and collect like powers of x:
.s0 C s1 x C s2 x 2 C C si x i C /.t0 C t1 x C t2 x 2 C C tj x j C /
D s0 .t0 C t1 x C t2 x 2 C / C s1 x.t0 C t1 x C t2 x 2 C / C
D .s0 t0 C s0 t1 x C s0 t2 x 2 C / C .s1 t0 x C s1 t1 x 2 C s1 t2 x 3 C / C
D s0 t0 C .s1 t0 C s0 t1 /x C .s0 t2 C s1 t1 C s2 t0 /x 2 C :
Motivated by this, we define multiplication of formal power series by
D .s0 t0 ; s0 t1 C s1 t0 ; s0 t2 C s1 t1 C s2 t0 ; : : : /I
more precisely,
D .c0 ; c1; : : : ; ck ; : : : /;
P Pk
where ck D i Cj Dk si tj D i D0 si tk i .
i i
“book2” — 2013/5/24 — 8:18 — page 199 — #219
i i
Proof. Addition and multiplication are operations on RŒŒx: the sum and prod-
uct of two formal power series are also formal power series. Define zero to be
the zero polynomial, define the identity to be the polynomial .1; 0; 0; : : : /, and
define the negative of .s0 ; s1; : : : ; si ; : : : / to be . s0 ; s1; : : : ; si ; : : : /. Veri-
fications of the axioms of a commutative ring are routine, and we leave them as
Exercise 5.8 on page 202. The only difficulty that might arise is proving the as-
sociativity of multiplication. Hint: if D .r0 ; r1P ; : : : ; ri ; : : : /, then the `th co-
ordinate of the polynomial ./ turns out to be i Cj CkD` Pri .sj tk /, while the
`th coordinate of the power series ./ turns out to be i Cj CkD` .ri sj /tk ;
these are equal because associativity of multiplication in R gives ri .sj tk / D
.ri sj /tk for all i; j; k.
We’ll see in a moment that the subset RŒx of polynomials is a subring of
the commutative ring of formal power series RŒŒx.
ck D s0 tk C C smtk m C smC1 tk m 1 C sk t0 :
cmCn D sm tn :
i i
“book2” — 2013/5/24 — 8:18 — page 200 — #220
i i
Corollary 5.9. (i) If R is a commutative ring, then RŒx and R are subrings
of RŒŒx.
Exercise 5.22 on page (ii) If R is a domain, then RŒx is a domain.
203 shows that if R is a
domain, then RŒŒx is a
domain.
Proof. (i) Let ; 2 RŒx. Now C is a polynomial, for either C D 0
or deg. C / maxfdeg./; deg./g. By Lemma 5.8(i), the product
of two polynomials is also a polynomial. Finally, 1 D .1; 0; 0; : : :/ is a
polynomial, and so RŒx is a subring of RŒŒx.
It is easy to check that R0 D f.r; 0; 0; : : : / W r 2 Rg is a subring of
See Exercise 5.9 on RŒx, and we may view R0 as R by identifying r 2 R with .r; 0; 0; : : : /.
page 202.
(ii) If and are nonzero polynomials, then Lemma 5.8(ii) shows that ¤
0. Therefore, RŒx is a domain.
Here is the link between this discussion and the usual notation.
x D .0; 1; 0; 0; : : : / 2 RŒx:
How to Think About It. Thus, x is neither “the unknown” nor a variable;
it is a specific element in the commutative ring RŒx, namely, the polynomial
.a0 ; a1 ; a2 ; : : : / with a1 D 1 and all other ai D 0; it is a polynomial of de-
gree 1.
Note that we need the unit 1 in a commutative ring R in order to define the
indeterminate in RŒx.
x D .0; s0 ; s1; : : : ; sj ; : : : /I
that is, multiplying by x shifts each coefficient one step to the right.
(ii) If n 0, then x n is the polynomial having 0 everywhere except for 1 in
the nth coordinate.
(iii) If r 2 R and .s0 ; s1; : : : ; sj ; : : : / 2 RŒŒx, then
i i
“book2” — 2013/5/24 — 8:18 — page 201 — #221
i i
i i
“book2” — 2013/5/24 — 8:18 — page 202 — #222
i i
Exercises
5.8 * Fill in the details and complete the proof of Proposition 5.7.
5.9 * Suppose that R is a commutative ring. In the proof of Corollary 5.9(i), we de-
fined R0 as the set of all power series of the form .r; 0; 0; 0; : : : / where r 2 R, and
we said “we may view R0 as R by identifying r 2 R with .r; 0; 0; : : : /.” Show, if
r; s 2 R, that
(i) r C s is identified with .r; 0; 0; 0; : : : / C .s; 0; 0; 0; : : : /
(ii) rs is identified with .r; 0; 0; 0; : : : /.s; 0; 0; 0; : : : /.
5.10 If .t0 ; t1 ; t2 ; : : : / is a power series over R and r 2 R, show that
5.11 * Suppose that F is a field. Show that F ŒŒx is a vector space over F where
Vector spaces over arbi- addition is defined as addition of power series and scalar multiplication is defined
trary fields are discussed by
in Appendix A.3.
r.s0 ; s1 ; s2 ; : : : / D .rs0 ; rs1 ; rs2 ; : : : /:
5.12 If R is the zero ring, what are RŒx and RŒŒx? Why?
5.13 Prove that if R is a commutative ring, then RŒx is never a field.
Hint: If x 1 exists, what is its degree?
5.14 (i) Let R be a domain. Prove that if a polynomial in RŒx is a unit, then it is a
nonzero constant (the converse is true if R is a field).
Hint: Compute degrees.
(ii) Show that .2x C 1/2 D 1 in Z4 Œx. Conclude that 2x C 1 is a unit in Z4 Œx,
and that the hypothesis in part (i) that R be a domain is necessary.
5.15 * If R is a commutative ring and
f .x/ D s0 C s1 x C s2 x 2 C C sn x n 2 RŒx
.f C g/0 D f 0 C g0
.rf /0 D r.f 0 / if r 2 R
.fg/ D fg C f 0 g
0 0
Œ1pt .f n /0 D nf n 1
f0 for all n 1:
i i
“book2” — 2013/5/24 — 8:18 — page 203 — #223
i i
R
5.16 Take It Further. Define W QŒx ! QŒx by
Z
f D a0 x C 12 a1 x 2 C C nC1
1
an x nC1 2 QŒx;
.x C 1/p D x p C 1:
5.20 *
(i) If R is a domain and D 1 C x C x 2 C C x n C 2 RŒŒx, prove that
is a unit in RŒŒx; in fact, .1 x/ D 1.
(ii) Show that .1 x/2 is a unit in QŒŒx, and express 1=.1 x/2 as a power
series.
Hint: See Exercise 5.22 below.
5.21 Show that 1 x x 2 is a unit in QŒŒx, and express 1=.1 x x 2 / as a power
series.
5.22 *
(i) Prove that if R is a domain, then RŒŒx is a domain.
Hint: If D .s0 ; s1 ; : : : / 2 RŒŒx is nonzero, define the order of , denoted
by ord. /, to be the smallest n 0 for which sn ¤ 0. If R is a domain and
; 2 RŒŒx are nonzero, prove that ord. / ord. / C ord. /, and use
this to conclude that ¤ 0.
(ii) Let k be a field. Prove that a formal power series 2 kŒŒx is a unit if and
only if its constant term is nonzero; that is, ord. / D 0.
(iii) Prove that if 2 kŒŒx and ord. / D n, then D x n u, where u is a unit
in kŒŒx.
5.23 *
(i) Prove that Frac.ZŒx/ D Q.x/.
(ii) Let D be a domain with K D Frac.D/. Prove that Frac.DŒx/ D K.x/.
5.24 (i) Expand .C 2 C S 2 1/.S 2 C 2CS C 2 /, where C and S are elements in
some commutative ring.
(ii) Establish the trigonometric identity
i i
“book2” — 2013/5/24 — 8:18 — page 204 — #224
i i
xp 1
fp .x/ D :
x 1
Polynomial Functions
Let’s now pass to viewing polynomials as functions. Each polynomial f .x/ D
s0 Cs1x Cs2x 2 C Csn x n 2 RŒx defines its associated polynomial function
f # W R ! R by evaluation:
f # .a/ D s0 C s1 a C s2 a2 C C sn an 2 R;
is a subring of Fun.R/ (we admit that we are being very pedantic, but you
will see in the next section that there’s a good reason for this fussiness). The
identity u of RR is the constant function with value 1, where 1 is the identity
element of R; that is, u D 1# , where 1 is the constant polynomial. We claim
that if f .x/; g.x/ 2 RŒx, then
(In the equation f # C g# D .f C g/# , the plus sign on the left means addition
of functions, while the plus sign on the right means the usual addition of poly-
nomials in RŒx; a similar remark holds for multiplication.) The proof of these
equations is left as Exercise 5.27 on page 206.
i i
“book2” — 2013/5/24 — 8:18 — page 205 — #225
i i
Definition. If k is a field, then the fraction field Frac.kŒx/ of kŒx, denoted We can define R.x/ for
by arbitrary domains R. See
Exercise 5.23 on page 203.
k.x/;
is called the field of rational functions over k.
How to Think About It. By convention, the elements of k.x/ are called
rational “functions” but they are simply elements of the fraction field for kŒx.
Of course, a rational function can be viewed as an actual function via evalu-
ation at elements of k, in the same way that a polynomial in kŒx gives rise
to its associated polynomial function defined on k. But the domain of such a
rational function may not be all of k (why?).
We’ll use the standard notation for elements in fraction fields (introduced
on page 195) for rational functions over a field: Œf; g will be denoted by f =g.
i i
“book2” — 2013/5/24 — 8:18 — page 206 — #226
i i
its elements have the form f =g, where f; g 2 KŒx1 ; x2; : : : ; xn and g ¤ 0.
Exercises
5.26 Let R be a commutative ring. Show that if two polynomials f .x/; g.x/ 2 RŒx
are equal, then their associated polynomial functions are equal; that is, f # D g# .
5.27 * If R is a commutative ring, prove that Poly.R/ is a subring of Fun.R/ D RR .
5.28 True or false, with reasons:
(i) .x 2 9/=.x 2 2x 3/ D .x C 3/=.x C 1/ in Q.x/.
(ii) What are the domains of the functions x 7! .x 2 9/=.x 2 2x 3/ and
x 7! .x C 3/=.x C 1/? Are the functions equal?
5.3 Homomorphisms
The question whether two given commutative rings R and S are somehow the
same has already arisen, at least twice.
(i) On page 195 we said
For the moment, you may regard D and D 0 as algebraically the
same.
(ii) And on page 201 we said
If we identify .r; 0; 0; : : : / with r , then Lemma 5.10(iii) reads
r .s0; s1 ; : : : ; si ; : : : / D .r s0 ; r s1; : : : ; r si ; : : : /:
What does “the same” mean in statement (i)? What does “identify” mean in
statement (ii)? More important, if R is a commutative ring, we wish to compare
the (formal) polynomial ring RŒx with the ring Poly.R/ of all polynomial
functions on R.
We begin our discussion by considering the ring Z2 ; it has two elements, the
congruence classes 0; 1, and the following addition and multiplication tables.
C 0 1 0 1
0 0 1 0 0 0
1 1 0 1 0 1
The two words even, odd also form a commutative ring, call it P; its addi-
tion and multiplication are pictured in the following tables.
C even odd even odd
even even odd even even even
odd odd even odd even odd
i i
“book2” — 2013/5/24 — 8:18 — page 207 — #227
i i
Thus, odd C odd D even and odd odd D odd. It is clear that the commutative
rings Z2 and P are distinct; on the other hand, it is equally clear that there
is no significant difference between them. The elements of Z2 are given in
terms of numbers; those of P in terms of words. We may think of P as a trans-
lation of Z2 into another language. And more than just a correspondence of
elements, the operations of addition and multiplication (that is, the two tables)
get translated, too.
A reasonable way to compare two systems is to set up a function between
them that preserves certain essential structural properties (we hinted at this
idea earlier when we noted that a ring R is essentially a subring of RŒx).
The notions of homomorphism and isomorphism will make this intuitive idea
precise. Here are the definitions; we will discuss what they mean afterward.
Etymology. The word homomorphism comes from the Greek homo, mean-
ing “same,” and morph, meaning “shape” or “form.” Thus, a homomorphism
carries a commutative ring to another commutative ring of similar form. The
word isomorphism involves the Greek iso, meaning “equal,” and isomorphic
rings have identical form.
Consider the two simple examples above of addition tables arising from
the rings Z2 and P (the symbol P stands for “parity.”). The rings Z2 and P
0 1 even odd
Z2 W I PW :
1 0 odd even
i i
“book2” — 2013/5/24 — 8:18 — page 208 — #228
i i
are isomorphic, for the function 'W Z2 ! P, defined by '.0/ D even and
'.1/ D odd, is an isomorphism, as the reader can quickly check (of course,
you must also check the multiplication tables).
Let a1 ; a2 ; : : : ; aj ; : : : be a list with no repetitions of all the elements of a
ring R. An addition table for R is a matrix whose ij entry is ai C aj .
C a1 aj
a1 a1 C a1 a1 C aj
ai ai C a1 ai C aj
i i
“book2” — 2013/5/24 — 8:18 — page 209 — #229
i i
Example 5.16. Example 4.31 on page 157 shows, for a commutative ring R
and a set X, that the family RX of all functions from X to R, equipped with
pointwise addition and multiplication, is a commutative ring. We’ve also used
the notation 2X in Example 4.47 on page 166 to stand for the Boolean ring of
all subsets of a set X. The goal of this example is to prove that 2X and .Z2 /X This example is rather
are isomorphic rings. dense. It’s a good idea
to pick a concrete set,
The basic idea is to associate every subset A X with its characteristic
say X D f1; 2; 3g, and
function fA 2 .Z2 /X , defined by work out the characteristic
( function for each of the 8
1 if x 2 A subsets of X.
fA .x/ D
0 if x 62 A:
i i
“book2” — 2013/5/24 — 8:18 — page 210 — #230
i i
(iii) ' preserves multiplication: we must show that '.AB/ D '.A/'.B/ for
all A; B X. The proof is similar to that in part (ii), using a table for
fA fB to prove that fA fB D fAB ; you will supply the details in Exer-
cise 5.39 on page 212.
We conclude that 2X and .Z2 /X are isomorphic. In Exercise 5.40 on page 212,
we will see that if jXj D n, then .Z2 /X Š .Z2 /n , the ring of all n-tuples
having coordinates in Z2 with pointwise operations. N
How to Think About It. There are two strategies in trying to show that a
homomorphism 'W R ! S is an isomorphism. One way is to show that ' is
a bijection; that is, it is injective and surjective. A second way is to show that
the inverse function ' 1 W S ! R exists (see Exercise 5.30 on page 211 and
Exercise 5.39(ii) on page 212).
i i
“book2” — 2013/5/24 — 8:18 — page 211 — #231
i i
(i) '.0/ D 0.
(ii) '. a/ D '.a/.
(iii) '.na/ D n'.a/ for all n 2 Z.
(iv) '.an / D '.a/n for all n 2 N.
1 1
(v) If a is a unit in R, then '.a/ is a unit in S , and '.a / D '.a/ .
Exercises
5.29 Let R and S be commutative rings, and let 'W R ! S be an isomorphism.
(i) If R is a field, prove that S is a field.
(ii) If R is a domain, prove that S is a domain.
5.30 *
(i) If ' is an isomorphism, prove that its inverse function ' 1W S ! R is also an
isomorphism.
(ii) Show that ' is an isomorphism if and only if ' has an inverse function ' 1.
5.31 (i) Show that the composite of two homomorphisms (isomorphisms) is again a
homomorphism (an isomorphism).
(ii) Show that R Š S defines an equivalence relation on the class of all commu-
tative rings.
i i
“book2” — 2013/5/24 — 8:18 — page 212 — #232
i i
5.32 Prove that the weird integers W (see Exercise 4.35 on page 158) is not isomorphic
to Z.
p
1 3
5.33 Recall that ZŒ! D fa C b! W a; b 2 Zg, where ! D 2 Ci 2 . Show that
' W ZŒ! ! ZŒ!, defined by
'W a C b! 7! a C b! 2 ;
and
i i
“book2” — 2013/5/24 — 8:18 — page 213 — #233
i i
Extensions of Homomorphisms
Suppose that a ring R is a subring of a commutative ring E with inclusion
i W R ! E. Given a homomorphism ' W R ! S , an extension ˆ of ' is a
homomorphism ˆW E ! S with restriction ˆjR D ˆi D '. If ˆW U ! Y is any
function, then its restriction
E ˆjX to a subset X U
is equal to the composite
ˆ
i ˆi, where iW X ! U is the
inclusion.
R ' S:
ˆW RŒx ! S
i i
“book2” — 2013/5/24 — 8:18 — page 214 — #234
i i
P P
Third, let f .x/g.x/ D ck x k , where ck D
k i Cj Dk ri aj . Then
!
X
k
ˆ.fg/ D ˆ ck x
k
X
D '.ck /s k
k
0 1
X X
D '@ ri aj A s k
k i Cj Dk
0 1
X X
D @ '.ri /'.aj /A s k :
k i Cj Dk
ˆW RŒx1 ; : : : ; xn ! S
Proof. The proof is by induction on n 1. The base step is Theorem 5.19. For
the inductive step, let n > 1 and define A D RŒx1; : : : ; xn 1 . The inductive
hypothesis gives a homomorphism W A ! S with .xi / D si for all i
n 1 and .r / D '.r / for all r 2 R. The base step gives a homomorphism
‰W AŒxn ! S with ‰.xn / D sn and ‰.a/ D .a/ for all a 2 A. The
result follows, because RŒx1; : : : ; xn D AŒxn , ‰.xi / D .xi / D si for all
i n 1, ‰.xn / D .xn / D sn , and ‰.r / D .r / D '.r / for all r 2 R.
How to Think About It. There is an analogy between Theorem 5.20 and
an important theorem of linear algebra, Theorem A.43 in Appendix A.4: Let
V and W be vector spaces over a field k; if v1 ; : : : ; vn is a basis of V and
w1 ; : : : ; wn 2 W , then there exists a unique linear transformation T W V ! W
with T .vi / D wi for all i (linear transformations are homomorphisms of vec-
tor spaces). The theorem is actually the reason why matrices can describe linear
transformations.
i i
“book2” — 2013/5/24 — 8:18 — page 215 — #235
i i
RŒx
ˆ
R ' S S Œx:
1 1
If ' is an isomorphism, then ˆ is the inverse of the extension of ' .
i i
“book2” — 2013/5/24 — 8:18 — page 216 — #236
i i
Exercises
5.41 If R is a commutative ring, prove that RŒx; y Š RŒy; x. In fact, prove that there
is an isomorphism ˆ with ˆ.x/ D y, ˆ.y/ D x, and ˆ.r/ D r for all r 2 R.
Hint: Use Theorem 5.20.
If you look very carefully at the definitions, you’ll see that RŒx; y and RŒy; x
are different rings. Recall that elements a in a ring A correspond to .a; 0; : : :/
in AŒx. In particular, the element x 2 RŒx corresponds
to .x; 0; 0; : : :/ in RŒxŒy;
that is, we have x D .0; 1; 0; : : : /; 0; 0; : : : so that in RŒxŒy the element x has
.0; 1; 0; : : : / in coordinate 1. This is not the same element as x in RŒyŒx, which
has 1 sitting in coordinate 1. However, this exercise allows you to relax and regard
these polynomials rings as the same.
5.42 *
(i) If R is a commutative ring and c 2 R, prove that there is a homomorphism
'W P
RŒx ! RŒx Pwith '.x/ D x C c and '.r/ D r for all r 2 R; that is,
'. i ri x i / D i ri .x C c/i . Is ' an isomorphism?
No calculus is needed for (ii) If deg.f / D n, show that
this exercise.
f 00 .c/ f .n/ .c/
'.f / D f .c/ C f 0 .c/.x C c/ C .x C c/2 C C .x C c/n ;
2Š nŠ
where f 0 .x/ is the formal derivative of f defined in Exercise 5.15 on page 202.
r3 . 1/ D r3 .k 2 / D r3.k/2 ;
i i
“book2” — 2013/5/24 — 8:18 — page 217 — #237
i i
Here are the first properties of these subsets. Note that Lemma 5.17 says,
for every homomorphism 'W R ! S , that 0 2 ker ' and 0 2 im '. In general,
im ' is a subset of S but, as for any function, ' is surjective if and only if
im ' D S .
Proof. (i) To see that im ' is a subring of S , note first that 1 2 im ', because
'.1/ D 1. If s; t 2 im ', then there are a; b 2 R with s D '.a/ and
t D '.b/. Hence, s C t D '.a/ C '.b/ D '.a C b/ 2 im ', and
st D '.a/'.b/ D '.ab/ 2 im '. Therefore, im ' is a subring of S .
(ii) If a; b 2 ker ', then '.a/ D 0 D '.b/. Hence, '.aCb/ D '.a/C'.b/ D
0 C 0 D 0, and a C b 2 ker '.
(iii) If a 2 ker ', then '.a/ D 0. Hence, '.ra/ D '.r /'.a/ D '.r / 0 D 0,
and so ra 2 ker '.
Here are some examples of kernels and images.
Example 5.26. (i) If 'W R ! S is an isomorphism, then ker ' D f0g and
im ' D S .
(ii) If ' is injective, then ker ' D f0g, for if r ¤ 0, then '.r / ¤ '.0/ D 0.
We will soon see that the converse is true, so that ' is injective if and only
ker ' D f0g.
(iii) If rm W Z ! Zm is reduction mod m, then ker rm consists of all the multi-
ples of m.
(iv) Let k be a commutative ring, let a 2 k, and let ea W kŒx ! k be the
evaluation homomorphism f .x/ 7! f .a/. Now ea is always surjective: if
b 2 k, then b D ea .f /, where f .x/ D x a C b. By definition, ker ea
consists of all those polynomials g for which g.a/ D 0.
In particular, let 'W RŒx ! C be defined by '.x/ D i and '.a/ D a
for all a 2 R. Then ker ' is the set of all polynomials f .x/ 2 RŒx having
i as a root. For example, x 2 C 1 2 ker '. N
i i
“book2” — 2013/5/24 — 8:18 — page 218 — #238
i i
Proposition 5.25 suggests that ker ' is a subring of R but, in fact, it almost
never is because it usually doesn’t contain 1. The definition of homomorphism
says that '.1/ D 1. If 1 2 ker ', then '.1/ D 0, and so 1 D 0 in S ; that is, S
is the zero ring. We conclude that if S has more than one element, then ker '
is not a subring of R. However, kernels are always ideals.
The ring R itself and f0g, the subset of R consisting of 0 alone, are always
ideals in a commutative ring R. Proposition 5.25 says that the kernel of a ho-
momorphism 'W R ! S is always an ideal in R; it is a proper ideal if S is not
the zero ring because 1 … ker '.
We have seen ideals in a completely different context. Theorem 1.19, which
says that gcd.a; b/ is a linear combination of a; b, involved showing that the set
of all linear combinations is an ideal in Z. Indeed, Exercise 1.49 on page 30
makes this explicit (of course, we had not introduced the term ideal at that
time).
It is very easy to check Definition. If b1 ; b2 ; : : : ; bn lie in a commutative ring R, then the set of all
that .b1 ; b2 ; : : : ; bn / is an linear combinations, denoted by
ideal.
.b1 ; b2 ; : : : ; bn /;
is an ideal in R, called the ideal generated by b1 ; b2 ; : : : ; bn . In particular, if
n D 1, then
.b/ D fr b W r 2 Rg
The principal ideal .b/ is
sometimes denoted by Rb. consists of all the multiples of b; it is called the principal ideal generated by b.
i i
“book2” — 2013/5/24 — 8:18 — page 219 — #239
i i
Both R and f0g are ideals; indeed, both are principal ideals, for R D .1/
and, obviously, f0g D .0/ is generated by 0. Henceforth, we will denote the
zero ideal f0g by .0/.
Example 5.27. (i) The even integers comprise an ideal in Z, namely, .2/.
(ii) Proposition 5.25 says that if 'W R ! S is a homomorphism, then ker ' is
an ideal in R. In particular, we can generalize part (i): if rm W Z ! Zm is
reduction mod m, then ker rm D .m/.
(iii) If I and J are ideals in a commutative ring R, then it is routine to check
that I \ J is also an ideal in R. More generally,
T if .Ij /j 2J is a family
of ideals in a commutative ring R, then j 2J Ij is an ideal in R (see
Exercise 5.53 below).
(iv) By Example 5.26(iv), the set I , consisting of all polynomials f .x/ in
RŒx having i as a root, is an ideal in RŒx containing x 2 C 1 (it is the
kernel of the evaluation ei ). We shall see, in Corollary 6.26, that I D
.x 2 C 1/. N
i i
“book2” — 2013/5/24 — 8:18 — page 220 — #240
i i
Proof. The only proper ideal in k is .0/, by Example 5.30; now apply Propo-
sition 5.31.
Exercises
5.43 Construct a homomorphism from ZŒi ! ZŒi that has i in its kernel. What is the
entire kernel?
p p
5.44 Find the kernel
p of the homomorphism
p QŒx ! QŒ 2 defined by f 7! f . 2/,
where QŒ 2 D fa C b 2 W a; b 2 Qg.
5.45 Show that the kernel of the evaluation homomorphism ea in Corollary 5.21 is the
set of polynomials in RŒx that have a as a root.
p p
5.46 Consider the set I of polynomials in RŒx that vanish on the set f3˙ 5; 5˙ 7g.
The notation ¨ means Show that I is a principal ideal in RŒx.
“is a proper subset of”
(in contrast to which 5.47 * Find three ideals .a/ in Z with the property that
indicates a subset which
may or may not be proper). .24/ ¨ .a/:
5.48 * Suppose a and b are integers. Show that a j b if and only if .b/ .a/.
5.49 * If a; b 2 Z, prove that .a/ \ .b/ D .m/, where m D lcm.a; b/.
5.50 * Define the sum of ideals I and J in a commutative ring R by
I C J D fu C v W u 2 I and v 2 J g:
IJ D fa1 b1 C C an bn W ai 2 I; bi 2 J; n 1g:
IJ D .ai bj W 1 i s and 1 j t /:
i i
“book2” — 2013/5/24 — 8:18 — page 221 — #241
i i
.r; s/ C .r 0 ; s 0 / D .r C r 0 ; s C s 0 /
and
(i) Prove that F is a field (with operations matrix addition and matrix multipli-
cation).
(ii) Prove that 'W F ! C, defined by '.A/ D a C i b, is an isomorphism.
A \ .B [ C / D .A \ B/ [ .A \ C /
i i
“book2” — 2013/5/24 — 8:18 — page 222 — #242
i i
for subsets A, B, and C of a set X. Such formulas are proved by showing that
an element lies in the left-hand side if and only it lies in the right-hand side.
Exercises 4.68 through 4.74 on page 169 gave you practice in doing this
sort of thing, but they actually showed more. Recall Example 4.47: if 2X is
the family of all the subsets of a set X, then 2X is a commutative ring with
addition defined as symmetric difference,
A C B D .A B/ [ .B A/ D A [ B .A \ B/;
Recall: if U; V are subsets and multiplication defined as intersection,
of X, then U V D
fx 2 X W x 2 U and x … AB D A \ B:
V g.
It follows, for all subsets A of X, that
A2 D A; A C ¿ D A; A C A D ¿; and AX D AI
the identity element 1 is the subset X itself. It follows from ACA D ¿ that ev-
ery A 2 2X is its own additive inverse; that is, A D A. Indeed, Exercise 5.58
on page 226 says that 1 D 1 in 2X . Since we often pass back and forth be-
tween the commutative ring 2X and set theory, we say out loud that a minus
sign will be used in set theory, as in the definition of symmetric difference, but
it shall never be used when we are working in 2X viewed as a ring.
We are going to show that calculations in the ring 2X give more satifisfy-
ing proofs of set-theoretic identities; thus, regarding all subsets as forming a
commutative ring is a definite advantage. Another goal is to use the calcula-
tions to establish the inclusion-exclusion principle, a very useful technique in
counting problems.
Venn diagrams are visual representations in the plane of relationships among
subsets in X. They convert words into pictures. For example, symmetric dif-
ference and intersection are illustrated by the Venn diagram in Figure 5.2.
Some standard words occurring in set theory, actually in logic, are NOT,
AND, OR, and EXCLUSIVE OR. If we picture a statement a as the inside of
a region A in the plane, then the Venn diagram of “NOT a” is the outside of A;
it is the complement
Ac D fx 2 X W x … Ag:
In Figure 5.2, A C B is the Exercise 4.69 on page 168 says that Ac D X C A. If a and b are statements,
shaded region, AB is the then the Venn diagram of the statement “a AND b” is the intersection A \ B,
unshaded region.
while the diagram of “a OR b” is the union A [ B. EXCLUSIVE OR is the
symmetric difference A C B; it pictures the statement “a OR b but not both”
(as in the statement “Take it or leave it!”).
The next result is Exercise 4.73 on page 169; you probably solved this ex-
ercise then using elements, as we now do.
A B
i i
“book2” — 2013/5/24 — 8:18 — page 223 — #243
i i
.A [ B/c D Ac \ B c :
a C a D .a C a/.a C a/
D a2 C a2 C a2 C a2
D a C a C a C a:
i i
“book2” — 2013/5/24 — 8:18 — page 224 — #244
i i
a C b D a.b C b 0 / C b.a C a0 / D ab C ab 0 C ab C a0 b D ab 0 C a0 b:
Let’s now see how working in an arbitrary Boolean ring reduces the proofs
about facts in specific such rings like 2X to algebraic calculations. Compare
the set-theoretic proof of Proposition 5.33 with the following proof.
.A [ B/c D Ac \ B c :
.A \ B/c D Ac [ B c :
.ab/0 D a0 _ b 0 :
The idea again is to first use “pure algebra,” reducing everything to statements
about addition and multiplication in R, and then translate the result into the
language of 2X . Now .ab/0 D 1 C ab, and
a0 _ b 0 D a0 C b 0 C a0 b 0 D .1 C a/ C .1 C b/ C .1 C a/.1 C b/:
i i
“book2” — 2013/5/24 — 8:18 — page 225 — #245
i i
Calculate:
.1 C a/ C .1 C b/ C .1 C a/.1 C b/ D 1 C a C 1 C b C 1 C a C b C ab
D .1 C 1/ C .a C a/
C .b C b/ C .1 C ab/
D 1 C ab
D .ab/0 :
Proof. It suffices to work in a Boolean ring and then to see what it says in the
particular Boolean ring 2X .
Assume that s D a0 D 1 C a. Then
as D a.1 C a/ D a C a2 D a C a D 0;
and
a _ s D a _ .1 C a/ D a C .1 C a/ C a.1 C a/ D a C 1 C a C a C a2 D 1:
The usual distributive law in a commutative ring is a.b Cc/ D ab Cac. The
proof that the equation holds in 2X essentially follows from the set-theoretic
identity
A \ .B [ C / D .A \ B/ [ .A \ C /:
We are now going to show that interchanging \ and [ gives another valid
identity.
A [ .B \ C / D .A [ B/ \ .A [ C /:
i i
“book2” — 2013/5/24 — 8:18 — page 226 — #246
i i
Exercises
5.58 * Prove that 1 D 1 in every Boolean ring.
5.59 * Proposition 5.40 proves that if A; B, and C are subsets of a set X , then
A [ .B \ C / D .A [ B/ \ .A [ C /:
a _ .b _ c/ D .a _ b/ _ c:
n
5.66 Suppose that R is a Boolean ring and ai i D1 is a collection of n elements in R.
Show that
n
Y n
_
1 ai0 D ai :
i D1 i D1
n
5.67 Suppose that R is a Boolean ring and ai i D1 is a collection of n elements in R.
Show that
n
Y X X
ai0 D 1 ai C ai aj C . 1/n a1 a2 : : : an :
i D1 1i n 1i <j n
Hint: a0 D 1 a.
n
5.68 Suppose that R is a Boolean ring and ai i D1
is a collection of n elements in R.
Show that
n
_ X X
ai D ai ai aj C C . 1/n 1
a1 a2 : : : an :
i D1 1i n 1i <j n
i i
“book2” — 2013/5/24 — 8:18 — page 227 — #247
i i
Inclusion-Exclusion
Suppose you have a class of students, all of whom take either French or Span-
ish, but none of whom take both. If 15 students are studying French and 12
students are studying Spanish, you have 15 C 12 D 27 students in your class.
Denote the number of elements in a finite set A by
jAj:
Then one way to state the above fact is that if F is the set of students studying
French and S is the set of students studying Spanish, then
jF [ S j D jF j C jS j:
jA [ Bj D jAj C jBj:
Lemma 5.41. If .Ai /niD1 is a family of pairwise disjoint finite sets, then
ˇ[n ˇ X n
ˇ ˇ
ˇ Ai ˇ D jAi j:
i D1 i D1
n[1
!
Ai \ An D .A1 \ An / [ [ .An 1 \ An /;
i D1
i i
“book2” — 2013/5/24 — 8:18 — page 228 — #248
i i
Let’s return to your class of students, 15 of whom are studying French and
12 of whom are studying Spanish. What if 4 of them are studying both French
and Spanish? You’d then have fewer than 27 students in the class because of
double counting. A Venn diagram can help you figure out how to calculate
the actual number. The goal of this subsection is to develop a general method
of calculating the number of elements in the union of a finite collection of
possibly overlapping finite sets.
As a Venn diagram illustrates, the Addition Principle no longer holds if A
and B overlap, for elements in A \ B are counted twice in jAj C jBj. What is
the formula giving a precise count of jA [ Bj? The number of things that get
counted twice must be subtracted once.
i i
“book2” — 2013/5/24 — 8:18 — page 229 — #249
i i
Example 5.43. How many positive integers < 1000 are there that are not di-
visible by 5 or by 7? If the number of positive integers that are divisible by 5
or 7 is D, then the answer is 999 D. We compute D using Lemma 5.42.
Let
A D fn 2 Z W 5 j n and 0 < n < 1000g
and
B D fn 2 Z W 7 j n and 0 < n < 1000g:
The Division Algorithm gives jAj D 199, because 999 D 199 5 C 4; similarly,
jBj D 142 and jA \ Bj D 28, where A \ B D fn 2 Z W 35 j n and 0 < n <
1000g. Hence,
jA [ Bj D jAj C jBj jA \ Bj
D 199 C 142 28 D 313:
Therefore, there are exactly 999 313 D 686 positive numbers < 1000 that
are not divisible by 5 or by 7. N
How to Think About It. You could probably convince yourself of the result
in Lemma 5.42 with a Venn diagram accompanied by a few examples. While
diagrams and examples can motivate insight, they are not substitutes for rig-
orous proof. The reason is that a picture can be misleading. For example, if
you aren’t careful about drawing a Venn diagram for the union of four or more
regions, then some possible intersections might be overlooked.
Example 5.44. Let’s look at the case of three finite subsets A, B, and C of a
set X. Before reading on, what do you think the formula should be? The basic
idea is to apply Lemma 5.42 twice.
jA [ B [ C j D j.A [ B/ [ C j
D jA [ Bj C jC j j.A [ B/ \ C j
D jAj C jBj jA \ Bj C jC j j.A \ C / [ .B \ C /j
D jAj C jBj C jC j jA \ Bj
jA \ C j C jB \ C j jA \ B \ C j
D jAj C jBj C jC j jA \ Bj C jA \ C j C jB \ C j
C jA \ B \ C j:
So, the number of elements in the union of three sets is the sum of the number
of elements in each, minus the sum of the number of elements in the pairwise
intersections, plus the number of elements that are common to all three. N
i i
“book2˙color” — 2013/5/29 — 21:56 — page 230 — #250
i i
A B
C
Figure 5.3. Subsets of three regions. Figure 5.4. Subsets of four regions.
Now
j.A1 [ [ Ar 1/ \ Ar j D j.A1 \ Ar / [ [ .Ar 1 \ Ar /j
D jA1r [ [ Ar 1 r j;
as well as
X X
jA1r [ [ Ar 1 rj D jAi r j jAij r j C C . 1/r 2
jA12r j:
i <r i <j <r
i i
“book2” — 2013/5/24 — 8:18 — page 231 — #251
i i
jAi1 \ \ Aiq j D sq ;
then
! !
r r
jA1 [ [ Ar j D r s1 s2 C s3 C . 1/r 1
sr :
2 3
P
Proof. By hypothesis, jAiP j D s1 for all i , and so i jAi j D r s1 . How many
terms are there in the sum 1i1 <:::<iq r jAi1 iq j? If q D 2, there is one term
jAij j D jAi \ Aj j for each pair of distinct Ai , Aj in fA1 ; : : : ; Ar g; that is,
there’s one term for each choice of 2 of the r subsets. If q D 3, there is one term
jAij k j D jAi \ Aj \ Ak j for each triple of distinct Ai , Aj , Ak in fA1 ; : : : ; Ar g;
that is, there’s one term for each choice
P of 3 of the r subsets. In general, there
are r choose q terms in the sum 1i1<i2 <<iq r jAi1 i2 iq j; thus, there are
r P
q
terms of the form jAi1 i2 iq j.Therefore, the sum 1i1 <:::<iq r jAi1 iq j in
the Inclusion-Exclusion formula is here equal to qr sq .
Example 5.47. Social Security numbers are 9-digit numbers of the form xxx-
xx-xxxx (there are some constraints on the digits, but let’s not worry about
them here). How many Social Security numbers are there that contain all the
odd digits?
As usual, it is easier to compute the size of the complement of a union. Let
X be the set of all 9-digit numbers and, for i D 1; 3; 5; 7; 9, let
Ri D fn 2 X W i is not a digit in ng:
i i
“book2” — 2013/5/24 — 8:18 — page 232 — #252
i i
5.73 There is a class of students, all of whom are taking either French, German, or
Spanish. Suppose that 15 students are studying French, 12 students are studying
German, and 10 students are studying Spanish; moreover, 4 students are study-
ing French and German, 5 are studying German and Spanish, and 3 are studying
French and Spanish. One brave soul is studying all three at once. How many stu-
dents are in the class?
Answer: 26.
5.74 Is “Inclusion-Exclusion” an appropriate name for Theorem 5.45? Why?
5.75 Elvis is playing a game in which he tosses a fair coin and rolls a fair die. He wins
if either the coin comes up heads or the die rolls a multiple of 3. What is the
probability that Elvis wins the game?
5.76 * Recall that if p is a prime and is the Euler- function, then .p/ D p 1
(see page 111).
e e e
(i) Suppose n D p11 p22 p33 is a product of three prime powers. Show that
n n n n n n n
.n/ D n C C C :
p1 p2 p3 p1 p2 p1 p3 p2 p3 p1 p2 p3
e e
(ii) Generalize to show that if n D p11 p22 : : : pnen , where p1 ; : : : ; pn are distinct
primes, then
0 1
X 1 X 1 X 1 1
.n/ D n @1 C C C . 1/k A:
pi pi pj pi pj pl p1 pk
i i;j i;j;l
i i
“book2” — 2013/5/24 — 8:18 — page 233 — #253
i i
6 Arithmetic of Polynomials
The two most important rings appearing in precollege mathematics are Z and
kŒx (where k is usually Q, R, or C). The goal of this chapter is to show that
these rings share some basic structural properties: both are domains, each has a
division algorithm, and non-units in each are products, in essentially only one
way, of irreducibles (primes in Z, polynomials in kŒx having no nontrivial
factorizations); there are numerous other parallels as well. Our program is to
take familiar results about Z and investigate their analogs in kŒx. Sometimes a
translation from Z to kŒx is quite simple—not only is the analog of a theorem
in Chapter 1 true, but so is its proof, mutatis mutandis; in other cases, however,
some modifications in proofs are necessary.
6.1 Parallels to Z
Divisibility
Let’s begin with a discussion of divisibility.
The next result, analogous to Lemma 1.13, will be very useful in what fol-
lows. It allows us to use degree in kŒx as a proxy for absolute value in Z.
Lemma 6.1. Let k be a field and let f .x/; g.x/ 2 kŒx. If f ¤ 0 and f j g,
then
deg.f / deg.g/:
i i
“book2” — 2013/5/24 — 8:18 — page 234 — #254
i i
Example 6.3. (i) Since the only units in Z are ˙1, the associates of an inte-
ger m are ˙m.
(ii) There are only four units in the Gaussian integers ZŒi , by Proposition
4.42: namely ˙1 and ˙i . Hence, every nonzero Gaussian integer z has
four associates: z; z; iz; iz.
(iii) There
are exactly six units in the Eisenstein integers ZŒ!, where ! D
1
p
2
1 C i 3 , by Exercise 4.45 on page 165. Hence, every Eisenstein
integer z has exactly six associates: ˙z; ˙!z; ˙! 2 z:
(iv) If k is a field, Proposition 6.2 says that the associates of f .x/ 2 kŒx are
nonzero multiples uf for u 2 k. N
The definition of prime on An integer n is irreducible in Z if and only if n D ˙p for some prime p;
page 22 says that primes that is, n is an associate of a prime. When k is a field, Proposition 6.2 implies
are positive. that every associate uf of a polynomial f .x/ 2 kŒx has the same degree as f ,
and it is easy to see that if f is irreducible, then uf is also irreducible.
i i
“book2” — 2013/5/24 — 8:18 — page 235 — #255
i i
Proposition 6.5. Let k be a field and let f .x/ 2 kŒx be a nonconstant polyno-
mial. Then f is irreducible in kŒx if and only if it has no factorization f D gh
in kŒx with both factors having degree < deg.f /.
The next example shows that we need the hypothesis in Proposition 6.6 that
R be a domain.
i i
“book2” — 2013/5/24 — 8:18 — page 236 — #256
i i
Example 6.7 (Kaplansky). Let X be the interval Œ0; 3. We claim that there
are elements a; b 2 C.X/ (see Example 4.31(ii)) each of which divides the
other yet which are not associates. Define
If v 2 C.X/ satisfies v.t/ D 1 for all t 2 Œ0; 1 and v.t/ D 1 for all t 2 Œ2; 3,
then it is easy to see that b D av and a D bv (same v); hence, a and b divide
each other.
Suppose a and b are associates: there is a unit u 2 C.X/ with b D au. As
for v above, u.t/ D 1 for all t 2 Œ0; 1 and u.t/ D 1 for all t 2 Œ2; 3; in
particular, u.1/ D 1 and u.2/ D 1. Since u is continuous, the Intermediate
Value Theorem of calculus says that u.t/ D 0 for some t 2 Œ1; 2. But this
contradicts Exercise 4.41(ii) on page 164, which says that units in C.X/ are
never 0. N
The next result shows that irreducible polynomials over a field behave like
primes in Z; they are “building blocks” in the sense that every nonconstant
polynomial can be expressed in terms of them.
We continue to use the Proof. If the proposition is false, then the set
term product as we have in
earlier chapters: a product C D fa.x/ 2 kŒx W a is neither a constant nor a product of irreduciblesg
can have only one factor.
Thus, it’s okay to say that
is nonempty. Let h.x/ 2 C have least degree (the Least Integer Axiom guaran-
a single irreducible is a
product of irreducibles. tees h exists). Since h 2 C , it is not a unit, and so 0 < deg.h/; since h is not ir-
reducible, h D fg, where neither f nor g is a unit, and so, by Proposition 6.2,
neither f nor g is constant. Hence, Lemma 6.1 gives 0 < deg.f / < deg.h/
and 0 < deg.g/ < deg.h/. It follows that f … C and g … C , for their degrees
are too small (h has the smallest degree of polynomials in C ). Thus, both f
and g are products of irreducibles and, hence, h D fg is a product of irre-
ducibles, contradicting h 2 C . Therefore, C is empty, and the proposition is
true.
Corollary 6.9. If k is a field, then every nonconstant f .x/ 2 kŒx has a fac-
torization
f .x/ D ap1 .x/ pn .x/;
i i
“book2” — 2013/5/24 — 8:18 — page 237 — #257
i i
4x 3 14x 2
x 2 C 3x 2 4x 5 2x 4 C x 3
4x 5 C12x 4 8x 3
14x 4 C9x 3
::
:
g D qf C r;
Proof. Let
f D x n C an 1x
n 1
C C a0 and g D bm x m C bm 1x
m 1
C C b0 :
G.x/ D g bm x m n
f;
Theorem 6.11 (Division Algorithm). Let k be a field and f .x/; g.x/ 2 kŒx.
If f ¤ 0, then there exist unique q.x/; r .x/ 2 kŒx with
g D qf C r;
g D q 0 .an 1 f / C r;
i i
“book2” — 2013/5/24 — 8:18 — page 238 — #258
i i
g D qf C r;
where q D q 0 an 1 .
To prove uniqueness of q and r , assume that g D Qf C R, where R D 0
or deg.R/ < deg.f /. Then qf C r D g D Qf C R, and
.q Q/f D R r:
If R ¤ r , then each side, being nonzero, has a degree. Since k is a field, kŒx
is a domain (Lemma 5.8), and so
deg .q Q/f D deg.q Q/ C deg.f /
deg.f /;
Example 6.12. This example shows that quotients and remainders may not
be unique when the coefficients do not lie in a domain. In Z4 Œx, let b.x/ D
2x 3 C 3 and a.x/ D 2x 2 C 2x C 1. Then
2x 3 C 3 D .x C 1/.2x 2 C 2x C 1/ C .x C 2/
D .x C 3/.2x 2 C 2x C 1/ C x:
The quotient and remainder in the first equation are x C 1 and x C 2, while the
quotient and remainder in the second equation are x C 3 and x. Note that both
x C 2 and x are linear, and hence
deg.x C 2/ D deg.x/
D1
< deg.a/
D 2: N
xn 1 D x mq 1
D .x m /q 1
D .x m 1/ .x m /q 1
C .x m /q 2
C C .x m /2 C x m C 1 :
The converse is also true, and the proof uses the Division Algorithms in Z and
in kŒx.
i i
“book2” — 2013/5/24 — 8:18 — page 239 — #259
i i
xn 1 D x mqCr 1
mqCr
Dx xr C xr 1
r mq
D x .x 1/ C .x r 1/ :
Roots
We are going to apply the preceding results to roots of polynomials. We’ve
been using the word “root” all along; let’s begin with a formal definition.
How to Think About It. We have just defined “root in k,” not “root.” Often,
a root of a polynomial f .x/ 2 kŒx may live in a larger field K containing k,
For example, f .x/ D x 2 2 has
but we still call it a root of f . p pits coefficients
in Q, but
p we usually say that 2 is a root of f even though 2 is irrational;
that is, 2 … Q.
Etymology. Why is a root so called? Just as the Greeks called the bottom
side of a triangle its base (as in the area formula 12 altitude base), they also
called the bottom side of a square its base. A natural question for the Greeks
was: givenpa square of area A, what is the length
p of its side? Of course, the
answer is A. Were we inventing a word for A, we might have called it the
base of A or the side of A. Similarly, consider the analogous three-dimensional
question:
p given a cube of volume V , what is the length
p of its edge? The answer
3
V might be called the cube base of V , and A might then be called the
square base of A. Why, then, do we call these numbers cube root and square
root? What has any of this to do with plants?
Since tracing the etymology of words is not a simple matter, we only sug-
gest the following explanation. Through 400 CE, most mathematics was written
in Greek, but, by the fifth century, India had become a center of mathematics,
and important mathematical texts were also written in Sanskrit. The Sanskrit
term for square root is pada. Both Sanskrit and Greek are Indo-European lan-
guages, and the Sanskrit word pada is a cognate of the Greek word podos;
both mean base in the sense of the foot of a pillar or, as above, the bottom of a
square. In both languages, however, there is a secondary meaning “the root of a
plant.” In translating from Sanskrit, Arab mathematicians chose the secondary
i i
“book2” — 2013/5/24 — 8:18 — page 240 — #260
i i
f .x/ D q.x/.x u/ C r;
Proposition 6.14 is often paraphrased to say that f .u/ is the remainder after
dividing f .x/ by x u.
Here is a connection between roots and factoring.
Corollary 6.15 (Factor Theorem). Let f .x/ 2 kŒx, where k is a field, and
let a 2 k. Then a is a root of f in k if and only if x a divides f .
Theorem 6.16. Let k be a field. If f .x/ 2 kŒx has degree n, then f has at
most n roots in k.
i i
“book2” — 2013/5/24 — 8:18 — page 241 — #261
i i
Example 6.17. Theorem 6.16 is not true for polynomials with coefficients in
an arbitrary commutative ring. For example, the quadratic polynomial
x 2 1 in Z8 Œx has four roots in Z8 , namely 1; 3; 5, and 7. On the other hand,
Exercise 6.14 on page 247 says that Theorem 6.16 remains true if we assume
that the coefficient ring is only a domain. N
Recall that every polynomial f .x/ 2 kŒx determines the polynomial func-
tion f # 2 Poly.k/, where f # W k ! k is defined by a 7! f .a/ for all a 2 k.
On page 204, however, we saw that the nonzero polynomial f .x/ D x p x 2
Fp Œx determines the constant function zero; different polynomials can deter-
mine the same polynomial function. This pathology vanishes when the field k
is infinite.
Proposition 6.18. Let k be an infinite field and f .x/; g.x/ 2 kŒx. If f and
g determine the same polynomial function .that is, f # D g# , so that f .a/ D
g.a/ for all a 2 k/, then f D g.
Corollary 6.19. Let k be a .possibly finite/ field, and let f .x/; g.x/ 2 kŒx,
where deg.f / deg.g/ D n. If f .a/ D g.a/ for n C 1 elements a 2 k, then
f D g.
We can now show that kŒx and Poly.k/ are structurally the same for the
most familiar fields k.
kŒx Š Poly.k/:
i i
“book2” — 2013/5/24 — 8:18 — page 242 — #262
i i
Exercises
6.1 Prove that the only units in ZŒx are ˙1, and that the only associates of a polyno-
mial f .x/ 2 ZŒx are ˙f .
6.2 * Let R be a domain, and let p.x/; q.x/ 2 RŒx.
(i) If p and q are irreducible, prove that p j q if and only if there is a unit u with
q D up.
(ii) If, in addition, both p and q are monic, prove that p j q implies u D 1 and
p D q.
i i
“book2” — 2013/5/24 — 8:18 — page 243 — #263
i i
6.3 (i) If R is a domain, prove that the only units in RŒx are units in R.
(ii) The domain Z2 has only one unit. Give an example of an infinite domain
having only one unit.
6.4 Let R be a commutative ring and let a.x/; b.x/ 2 RŒx, where a ¤ 0. Prove that
Proposition 6.10 generalizes: if the leading coefficient of a is a unit, then there
exist q.x/; r.x/ 2 RŒx with b D qa C r, where either r D 0 or deg.r/ < deg.a/.
6.5 * Let k be a domain and let a.x/; b.x/ 2 kŒx, where a ¤ 0. Prove that the
uniqueness statement in the Division Algorithm generalizes: if there are q; r; Q; R
in kŒx with qa C r D b D Qa C R, where r D 0 or deg.r/ < deg.a/, and where
R D 0 or deg.R/ < deg.a/, then R D r and Q D q.
6.6 Let k be a domain and let f .x/ 2 kŒx. If a.x/ is an associate of f , prove that
deg.f / D deg.a/. Give an example to show that the statement may be false if k
is not a domain.
6.7 * Show that there is a “2 out of 3” result for polynomials, analogous to the one for
integers: if k is a field and f; g; h 2 kŒx are polynomials such that f D g C h,
then a polynomial that divides two of the three will divide the third.
6.8 * Let R be a domain and f .x/ 2 RŒx be nonzero. If f D g1 gn , where
gi .x/ 2 RŒx for all i , show that there exist a nonzero a 2 R and monic gi0 .x/ 2
RŒx with f D ag10 gn0 .
6.9 (i) Let f .x/; g.x/ 2 QŒx with f monic. Write a pseudocode (or a program
in a CAS) implementing the Division Algorithm with input f; g and output
q.x/; r.x/, the quotient and remainder.
(ii) Find the quotient and remainder by dividing x 3 C 2x 2 8x C 6 by x 1 as
you would in high school.
6.10 * If R is a commutative ring, define a relation on R by a b if they are
associates. Prove that is an equivalence relation on R.
6.11 A student claims
p that x p1 is not irreducible in QŒx because there is a factoriza-
tion x 1 D . x C 1/. x 1/. Explain the error of his ways.
6.12 * Prove that the ideal .x; y/ in kŒx; y, where k is a field, is not a principal ideal.
The next proposition shows that gcd’s exist; it is true, but not obvious, that
every pair a; b 2 kŒx has a unique gcd (Corollary 6.29).
i i
“book2” — 2013/5/24 — 8:18 — page 244 — #264
i i
We are going to see that gcd’s of polynomials are linear combinations. The
proof of this fact for gcd’s of integers essentially involved ideals in Z, and so
we now examine ideals in kŒx.
In any commutative ring R, associates a and b generate the same principal
ideal (the converse may be false if R is not a domain).
Proof. If I D .0/, then I is a principal ideal with generator 0; that is, I D .0/.
Otherwise, let a.x/ be a polynomial in I of least degree. Since a 2 kŒx
is nonzero, its leading coefficient c ¤ 0; since k is a field, c 1 exists, and
d D c 1 a is monic. By Proposition 6.24, .a/ D .d /.
i i
“book2” — 2013/5/24 — 8:18 — page 245 — #265
i i
Example 6.27. Let R D ZŒx, the commutative ring of all polynomials over
Z. It is easy to see that the set I of all polynomials with even constant term is
an ideal in ZŒx. We show that I is not a principal ideal.
Suppose there is d.x/ 2 ZŒx with I D .d /. The constant 2 2 I , so that
there is f .x/ 2 ZŒx with 2 D df . Since the degree of a product is the sum
of the degrees of the factors, 0 D deg.2/ D deg.d / C deg.f /. Since degrees
are nonnegative, it follows that deg.d / D 0; i.e., d is a nonzero constant.
As constants here are integers, the candidates for d are ˙1 and ˙2. Suppose
d D ˙2; since x 2 I , there is g.x/ 2 ZŒx with x D dg D ˙2g. But every
coefficient on the right side is even, while the coefficient of x on the left side is
1. This contradiction gives d D ˙1. Thus, d is a unit and, by Example 5.30,
I D .d / D ZŒx, another contradiction. Therefore, no such d exists; that is, I But see Exercise 6.22
is not a principal ideal. N on page 248. There
is h.x/ 2 ZŒx with
I D .2; h/
Recall that if R is any commutative ring and a; b 2 R, then a linear combi-
nation of a; b is an element of R of the form sa C tb, where s; t 2 R. Given
a; b, the set I of all linear combinations of a; b is an ideal in R. The next
theorem parallels Theorem 1.19.
Theorem 6.28. If k is a field and f .x/, g.x/ 2 kŒx, then any gcd of f; g
is a linear combination of f and gI that is, if d.x/ is a gcd, then there are
s.x/; t.x/ 2 kŒx with
d D sf C tg:
i i
“book2” — 2013/5/24 — 8:18 — page 246 — #266
i i
Corollary 6.29. Let k be a field and let f .x/, g.x/ 2 kŒx. A monic common
divisor d.x/ is a gcd of f; g if and only if d is divisible by every common
divisor; that is, if h is any common divisor of f; g, then h j d .
Proof. The end of the proof of Theorem 6.28 shows that if h is a common
divisor, then h j d . Conversely, if h j d , then deg.h/ deg.d /, and so d is a
monic common divisor of largest degree.
Theorem 6.30. Let f .x/, g.x/ 2 kŒx, where k is a field, and let I D .f; g/
be the ideal of all linear combinations of f and g.
(i) If d.x/ 2 kŒx is monic, then d D gcd.f; g/ if and only if I D .d /.
Recall that gcd’s are re- (ii) f and g have a unique gcd.
quired to be monic. That’s
essential to uniqueness.
Proof. (i) Suppose that d D gcd.f; g/. We show that .d / I and I
.d /. Theorem 6.28 shows that d 2 I ; therefore, .d / I (for every
multiple of d is also a linear combination). For the reverse inclusion, let
h D uf C vg 2 I . Now d j f and d j g, because d is a common divisor,
and so d j h. Hence, h D rd 2 .d /; that is, I .d /, and so I D .d /.
Conversely, suppose that I D .d /. Then d D sf C tg, and so every
common divisor h of f; g is a divisor of d . Hence, Corollary 6.29 gives
d D gcd.f; g/.
(ii) If d and d 0 are gcd’s of f and g, then .d / D .d 0 /, by part (i). Since both
d and d 0 are monic, we must have d D d 0 , by Theorem 6.25.
How to Think About It. It’s a good idea to stop and take stock of where we
are in our program of displaying parallels between integers and polynomials.
For polynomials over a field, we have, so far
extended the notion of divisibility
generalized “prime” to “irreducible”
shown that factorizations into irreducibles exist
established a division algorithm
shown that the gcd of two polynomials exists and is unique
shown that the gcd of two polynomials is a linear combination of them.
Thinking back to Chapter 1, what’s next? There were two main paths we took
then: one led to unique factorization—the Fundamental Theorem of Arith-
metic; one led to Euclidean Algorithms. We’ll follow both these paths for poly-
nomials.
i i
“book2” — 2013/5/24 — 8:18 — page 247 — #267
i i
Exercises
6.13 Find the gcd of each pair .f; g/ in QŒx and write it as a linear combination of f
and g.
(i) .x 3 x 2 x 2; x 3 3x 2 C 3x 2/
(ii) .x 6 1; x 5 1/
(iii) .x 3 x 2 x 2; 2x 3 4x 2 C 2x 4/
(iv) .x 6 1; x 6 C x 5 2/
(v) .2x C 1/.x 6 1/; .2x C 1/.x 5 1/
(vi) .3x 6 3; 2x 5 2/
6.14 * Let R be a domain. If f .x/ 2 RŒx has degree n, prove that f has at most n
roots in R.
Hint: Use Frac.R/.
p
6.15 If k is a field in which 1 C 1 ¤ 0, prove that 1 x 2 is not a rational function
over k.
p
Hint: Mimic the classical proof that 2 is irrational.
6.16 In Exercise 6.10 on page 243, we saw that the relation on a commutative ring R,
defined by a b if they are associates, is an equivalence relation. Prove that if R
is a domain, then there is a bijection from the family of all equivalence classes to
the family of all principal ideals in R.
6.17 *
(i) If f .x/ and g.x/ are relatively prime in kŒx (k a field) and each divides a
polynomial h, prove that their product fg also divides h.
(ii) If p1 ; p2 ; : : : ; pn are polynomials so that gcd.pi ; pj / D 1, and each pi di-
vides a polynomial h, prove that p1 p2 pn also divides h.
6.18 *
(i) Find gcd.3x 3 2x 2 C 3x 2; 3x 2 C x 2/ in CŒx.
(ii) Write a pseudocode (or a program in a CAS) implementing Euclidean Algo-
rithm I.
(iii) Write a pseudocode (or a program in a CAS) implementing Euclidean Algo-
rithm II.
Hint: Model your routine after the functions in Exercise 1.67 on page 36.
6.19 * Prove the converse of Euclid’s Lemma. Let k be a field and let f .x/ 2 kŒx be a
nonconstant polynomial; if, whenever f divides a product of two polynomials, it
necessarily divides one of the factors, then f is irreducible. (See Theorem 1.21.)
6.20 (i) Find two polynomials in QŒx whose associated polynomial functions agree
with this input-output table:
Input Output
1 3
4 17
5 26
(ii) Classify the set of all polynomials that agree on the table.
6.21 (i) Show that the set of polynomials in QŒx that vanish on f1; 2; 3g is an ideal in
QŒx.
(ii) What is a generator of this ideal?
i i
“book2” — 2013/5/24 — 8:18 — page 248 — #268
i i
6.22 * In Example 6.27, we saw that the ideal I in ZŒx consisting of all polynomials
with even constant term is not a principal ideal. Find a polynomial h.x/ 2 ZŒx
so that I D .2; h/; that is, I consists of all the linear combinations of 2 and h.
6.23 Let k be a field and f .x/; g.x/ 2 kŒx. Generalize Exercises 5.49 and 5.50 on
page 220: if d.x/ D gcd.f; g/ and m.x/ D lcm.f; g/, prove that
6.24 Show, in Z8 Œx, that x 2 1 has two distinct factorizations into irreducibles.
Hint: See Example 6.17.
Unique Factorization
The main result in this subsection is a generalization of the Fundamental Theo-
rem of Arithmetic to polynomials: the factorization of every nonconstant poly-
nomial over a field as a product of irreducibles is essentially unique.
We begin by proving Euclid’s Lemma for polynomials. As for integers, it
shows that irreducibility is a strong assumption when dealing with divisibility.
Theorem 6.31 (Euclid’s Lemma). Let k be a field and let f .x/; g.x/ 2 kŒx.
If p.x/ is an irreducible polynomial in kŒx and p j fg, then
pjf or p j g:
g D spg C tfg:
But p j fg, by hypothesis, and so Exercise 6.7 on page 243 gives p j g. The
last statement follows by induction on n 2.
The converse of Euclid’s Lemma is true; see Exercise 6.19 on page 247.
Polynomial versions of arithmetic theorems in Chapter 1 now follow.
Definition. Two polynomials f .x/; g.x/ 2 kŒx, where k is a field, are called
relatively prime if their gcd is 1.
Corollary 6.32. Let f .x/; g.x/; h.x/ 2 kŒx, where k is a field, and let h and
f be relatively prime. If h j fg, then h j g.
Proof. The proof of Theorem 6.31 works here. Since gcd.h; f / D 1, we have
1 D sh C tf , and so g D shg C tfg. But fg D hh1 for some h1 .x/ 2 kŒx,
and so g D h.sg C th1 /.
i i
“book2” — 2013/5/24 — 8:18 — page 249 — #269
i i
Theorem 6.34 (Euclidean Algorithm I). If k is a field and f .x/; g.x/ 2 kŒx,
then there is an algorithm computing gcd.f; g/.
Proof. The proof is essentially a repetition of the proof of the Euclidean Algo-
rithm in Z; just iterate the Divison Algorithm. Each line comes from the line
above it by moving some terms “southwest.”
g D q1 f C r1
f D q2 r1 C r2
r1 D q3 r2 C r3
::
:
rn 3 D qn 1 rn 2 C rn 1
rn 2 D qn rn 1 C rn
rn 1 D qnC1 rn :
Since the degrees of the remainders are strictly decreasing, the procedure must
stop after at most deg.f / steps. The claim is that d D rn is the gcd, once
it is made monic. We see that d is a common divisor of f and g by back
substitution: repeated applications of “2 out of 3,” working from the bottom
up. To see that d is the gcd, work from the top down to show that if c is any
common divisor of f and g, then c j ri for every i .
The Euclidean Algorithm may not produce a monic last remainder. The gcd
is the monic associate of the last nonzero remainder.
3x 3 2x 2 C 3x 2 D .x 1/.3x 2 C x 2/ C .6x 4/
2
3x C x 2 D 12 x C 21 /.6x 4/ C 0:
f D .x 1/g C r
1
gD 2
x C 21 r:
The last remainder is 6x 4. As we warned, it’s not monic, and we must make
it so. Thus, we need to take its monic associate (multiplying by 61 ):
2
gcd.f; g/ D x 3: N
i i
“book2” — 2013/5/24 — 8:18 — page 250 — #270
i i
g D Qf C R;
g D qf C r;
where q; r 2 kŒx and either r D 0 or deg.r / < deg.f /. But the equation
g D qf C r also holds in KŒx because kŒx KŒx, so that the uniqueness
of quotient and remainder in the Division Algorithm in KŒx gives Q D q 2
kŒx and R D r 2 kŒx. Therefore, the list of equations occurring in the
Euclidean Algorithm in KŒx is exactly the same as the list occurring in the
Euclidean Algorithm in the smaller ring kŒx. In particular, the gcd, being the
last remainder (made monic), is the same in both polynomial rings.
To illustrate, even though there are more divisors with complex coefficients,
See Exercise 6.18(i) on the gcd of 3x 3 2x 2 C 3x 2 and 3x 2 C x 2, computed in RŒx, is equal
page 247. to their gcd computed in CŒx.
As in Z, the Division Algorithm in kŒx can also be used to compute coef-
ficients occurring in an expression of the gcd as a linear combination.
i i
“book2” — 2013/5/24 — 8:18 — page 251 — #271
i i
Proof. Let d D gcd.f; g/. To find s and t with d D sf C tg, again work from
the last remainder back to f and g:
rn D rn 2 qn rn 1
D rn 2 qn .rn 3 qn 1 rn 2 /
D .1 C qn qn 1 /rn 2 qn rn 3
::
:
D sf C tg
i i
“book2” — 2013/5/24 — 8:18 — page 252 — #272
i i
.that is, a and b are nonzero constants and the p’s and q’s are monic irredu-
cibles/, then a D b, m D n, and the q’s may be re-indexed so that qi D pi for
all i .
ap1 pm D bq1 qn
in which a and b are nonzero constants and the p’s and q’s are monic ir-
reducibles. We prove, by induction on M D maxfm; ng 1, that a D b,
m D n, and the q’s may be re-indexed so that qi D pi for all i . For the base
step M D 1, we have ap1 D bq1 . Now a is the leading coefficient, because
p1 is monic, while b is the leading coefficient, because q1 is monic. Therefore,
a D b, and canceling gives p1 D q1 . For the inductive step, the given equation
shows that pm j q1 qn . By Euclid’s Lemma for polynomials, there is some
i with pm j qi . But qi , being monic irreducible, has no monic divisors other
than 1 and itself, so that qi D pm . Re-indexing, we may assume that qn D pm .
Canceling this factor, we have ap1 pm 1 D bq1 qn 1 . By the inductive
hypothesis, a D b, m 1 D n 1 (hence m D n) and, after re-indexing,
qi D pi for all i .
Here is another way to state uniqueness, using Proposition 6.24: after re-
indexing, the ideals .p1 /; : : : ; .pm / and .q1 /; : : : ; .qm / are the same.
Collect like factors.
Theorem 6.40 shows that every nonconstant polynomial f has prime fac-
torizations; moreover, if all the exponents ei > 0, then the factors in it are
unique. Let f .x/; g.x/ 2 kŒx, where k is a field. As with integers, using zero
exponents allows us to assume that the same irreducible factors occur in both
prime factorizations:
a am b bm
f D p1 1 pm and g D p1 1 pm :
i i
“book2” — 2013/5/24 — 8:18 — page 253 — #273
i i
Proposition 6.41. Let f .x/; g.x/ 2 kŒx, where k is a field, have prime fac-
torizations f D p1a1 pnan and g D p1b1 pnbn in kŒx, where ai ; bi 0 for
all i .
(i) f j g if and only if ai bi for all i .
(ii) If mi D minfai ; bi g and Mi D maxfai ; bi g, then
Corollary 6.42. If k is a field and f .x/; g.x/ 2 kŒx are monic polynomials,
then
lcm.f; g/ gcd.f; g/ D fg:
i i
“book2” — 2013/5/24 — 8:18 — page 254 — #274
i i
How to Think About It. There is nothing magic about R and C. It can be
proved that every field k has an algebraic closure k; that is, there is a field k
containing k as a subfield, and every f .x/ 2 kŒx is a product of linear factors.
In particular, since kŒx kŒx, every f .x/ 2 kŒx is a product of linear
factors in kŒx; that is, k contains all the roots of f . Thus, Corollary 6.43 can
be generalized by replacing R and C by k and k.
We know that C can be viewed as a vector space over its subfield R, and
dimR .C/ D 2. But things are not so simple for algebraic closures k of other
fields k. It is always true that k is a vector space over k, but its dimension need
not be 2. In fact, dimk .k/ need not even be finite: for example, dimQ .Q/ D 1
and, if k is finite, then dimk .k/ D 1.
Let k be a field, and assume that all the roots of a polynomial f .x/ 2 kŒx
lie in k: there are a; r1 ; : : : ; rn 2 k with
n
Y
f .x/ D a .x ri /:
i D1
Exercises
6.25 Let f .x/; g.x/ 2 kŒx, where k is a field. If fg is a square, must f or g be a
square? What if gcd.f; g/ D 1?
6.26 Let f .x/; g.x/ 2 kŒx, where k is a field, be relatively prime. If h.x/ 2 kŒx and
h2 j fg, prove that h2 j f or h2 j g.
6.27 Let k D F2 .x/. Prove that f .t / D t 2 x 2 kŒt is an irreducible polynomial. (We
shall see later that there is a field K containing k and an element u with u2 D x,
so that f .t / D .t u/2 in KŒt .)
n
6.28 In Zp Œx, show that if f is an irreducible factor of x p x, then f 2 does not
n
divide x p x.
6.29 Determine, for each of the following polynomials in QŒx whether or not it is
irreducible in QŒx, in RŒx, or in CŒx.
(i) x 2 7x C 6.
(ii) x 2 C 2x 1.
(iii) x2 C x C 1.
6.30 * Show that f .x/ D x 3 C 5x 2 10x C 15 is irreducible in QŒx.
In Section 6.2, we will give different criteria for determining whether poly-
nomials are irreducible (in particular, we will discuss f on page 267). However,
we ask you to solve this problem now so you will appreciate the theorems to be
proved.
i i
“book2” — 2013/5/24 — 8:18 — page 255 — #275
i i
Although there are domains with elements not having a gcd, we now show
gcd’s always exist in PIDs.
i i
“book2” — 2013/5/24 — 8:18 — page 256 — #276
i i
Proof. If p − a, then 1 is a gcd of p; a, for the only divisors of p are units and
This proof should look associates. Thus, Theorem 6.46 says that there exist s; t 2 R with 1 D sp C ta.
quite familiar to you. Hence, b D spbCtab. But ab D pr , for some r 2 R, and so p j b, as desired.
I1 I2 In InC1
Proof. Suppose there is an ascending chain of ideals that does not stop. Throw-
ing away any repetitions In D InC1 if necessary, we may assume that there is
i i
“book2” — 2013/5/24 — 8:18 — page 257 — #277
i i
I1 ¨ I2 ¨ ¨ In ¨ InC1 ¨ :
S
By Exercise 6.31 on page 258, J D n1 In is an ideal in R. And since R
is a PID, J is principal; there is d 2 J with J D .d /. Now d got into J by
being in Im for some m; that is, .d / Im . Hence,
J D .d / Im ¨ ImC1 J:
This is a contradiction.
Proposition 6.49 gives existence. The next theorem gives a fundamental the-
orem of arithmetic for PIDs: every nonzero non-unit has a unique factorization
as a product of irreducibles.
Theorem 6.50. Let R be a PID. Every r 2 R, neither 0 nor a unit, has a fac-
torization as a product of irreducibles which is unique in the following senseW
if
p1 pn D r D q1 qm ;
where the p’s and q’s are irreducible, then m D n and the q’s can be re-
indexed so that qi and pi are associates for all i .
p1 pn D r D q1 qm ;
i i
“book2” — 2013/5/24 — 8:18 — page 258 — #278
i i
where the p’s and q’s are irreducible. By Euclid’s Lemma, pn irreducible im-
plies pn j qi for some i . Since qi is irreducible, we have pn and qi are asso-
ciates: there is a unit u 2 R with qi D upn . Re-index the q’s so that qi is now
qm D upn , cancel pn from both sides, and replace q1 by uq1 . Thus,
p1 pn 1 D r D .uq1 / qm 1:
Further Results. We’ve just seen that every PID is a UFD. The converse is
false: there are UFDs that are not principal ideal domains. A theorem of Gauss
states that if a domain A is a UFD, then AŒx is also a UFD. For example, ZŒx
is a UFD (this is not a PID). If k is a field, then it follows by induction on
n 1 that R D kŒx1 ; : : : ; xn , polynomials in several variables, is a UFD (R
is not a PID if n 2).
Exercises
6.31 *
(i) Let I and J be ideals in a commutative ring R. Prove that their union I [ J
is an ideal if and only if I J or J I .
(ii) Let I1 I2 In be an ascending chain of ideals in a commu-
tative ring R. Prove that
1
[
In
n1
is an ideal in R.
i i
“book2” — 2013/5/24 — 8:18 — page 259 — #279
i i
f .x/ D a0 C a2 x 2 C a3 x 3 C :
I2 ¨ I4 ¨ I8 ¨ ¨ I2j ¨ : : :
(iv) Show that this ascending chain of ideals does not stop.
(v) Conclude that there are ideals in RR that are not principal.
6.2 Irreducibility
Although there are some techniques to help decide whether an integer is prime,
the general problem is open and is very difficult (indeed, this is precisely why
RSA public key codes are secure). Similarly, it is very difficult to determine
whether a polynomial is irreducible, but there are some useful techniques that
frequently work. Most of our attention will be on QŒx and ZŒx, but some of
the results do generalize to other rings of coefficients.
For polynomials of low degree, we have a simple and useful irreducibility
criterion.
Proposition 6.51. Let k be a field and let f .x/ 2 kŒx be a quadratic or cubic
polynomial. Then f is irreducible in kŒx if and only if f has no root in k.
i i
“book2” — 2013/5/24 — 8:18 — page 260 — #280
i i
Proof. We may assume that a root b=c is in lowest terms; that is, gcd.b; c/ D
1. Evaluating gives 0 D f .b=c/ D a0 C a1 b=c C C an b n =c n , and multi-
plying through by c n gives
0 D a0 c n C a1 bc n 1
C C an b n :
Reducing this mod b shows that b j a0 c n ; since gcd.b; c/ D 1, Corollary 1.22
gives b j a0 . Similarly, reducing mod c gives c j an b n . Since gcd.b; c/ D 1,
we have c j an .
It follows from the second statement that if an integer a p
is not the nth power
n n
of an integer,
p then x a has no rational roots; that is, a is irrational. In
particular, 2 is irrational. Thus, Theorem 6.52 is a vast generalization of
Proposition 1.26.
Had we known Theorem 6.52 earlier, we could have easily dealt with the
“bad cubic” f .x/ D x 3 7x C 6 in Example 3.5. Since the candidates for its
rational roots are ˙1; ˙2; ˙3; ˙6, we would have quickly found the factor-
ization f .x/ D .x 1/.x 2/.x C 3/.
If f .x/ 2 QŒx happens to be in ZŒx, there is a useful theorem of Gauss
comparing the factorizations of f in ZŒx and in QŒx that concludes that f
is irreducible over Q. Our proof involves Example 5.23: the homomorphism
rp W Z ! Zp , sending j 7! Œj , gives a homomorphism rp W ZŒx ! Zp Œx,
called reduction mod p. If f .x/ D a0 C a1 x C C an x n 2 ZŒx, then
rp W f 7! f ; where f .x/ D Œa0 C Œa1 x C C Œan x n 2 Zp Œx:
Thus, rp merely reduces all coefficients mod p.
Theorem 6.53 (Gauss’s Lemma). Let f .x/ 2 ZŒx. If there are G.x/; H.x/ 2
QŒx with f D GH , then there are g.x/; h.x/ 2 ZŒx with deg.g/ D deg.G/,
deg.h/ D deg.H /, and f D gh.
i i
“book2” — 2013/5/24 — 8:18 — page 261 — #281
i i
How to Think About It. We agree that Gauss’s Lemma, though very useful,
is rather technical. Gauss saw that the ideas in the proof could be generalized
to apply to polynomials in several variables over a field.
The basic use of reduction mod p was previewed on page 216 when we
proved that 1 is not a square in Z by showing that it’s not a square in Z3 .
Reduction mod p gives a criterion for irreducibility of f in ZŒx by testing the
irreducibility of f in Fp Œx. The precise statement is:
i i
“book2” — 2013/5/24 — 8:18 — page 262 — #282
i i
There are eight cubics, of which four are reducible because their constant
term is 0 (so that x is a factor). The remaining polynomials are
x 3 C 1; x 3 C x C 1; x 3 C x 2 C 1; x 3 C x 2 C x C 1:
Since 1 is a root of the first and fourth, the middle two are the only irreducible
cubics. Proposition 6.51 now applies.
There are sixteen quartics, of which eight are reducible because their con-
stant term is 0. Of the eight with nonzero constant term, those having an even
number of nonzero coefficients have 1 as a root. There are now only four sur-
viving polynomials f , and each has no roots in F2 ; that is, they have no linear
factors. The only possible factorization for any of them is f D gh, where both
g and h are irreducible quadratics. But there is only one irreducible quadratic,
namely, x 2 C x C 1. Therefore, x 4 C x 2 C 1 D .x 2 C x C 1/2 factors, and the
other three quartics are irreducible.
Example 6.57. Here is a list of the monic irreducible quadratics and cubics
in F3 Œx. You can verify that the list is correct by first enumerating all such
polynomials; there are six monic quadratics having nonzero constant term, and
Note that 1 D 2 in Z3 . there are eighteen monic cubics having nonzero constant term. It must then be
checked which of these have 1 or 1 as a root, for Proposition 6.51 applies.
x 3 C x 2 C x 1, x 3 x 2 x 1. N
i i
“book2” — 2013/5/24 — 8:18 — page 263 — #283
i i
Further Results.
We can count the number Nn of irrreducible polynomials of degree n in
Fp Œx. In [17], pp. 83–84, it is shown that
X
pn D d Nd ; (6.2)
djn
Exercises
6.35 Let f .x/ D x 2 C x C 1 2 F2 Œx. Prove that f is irreducible in F2 Œx, but that
f has a root ˛ 2 F4 . Use the construction of F4 in Exercise 4.55 on page 165 to
display ˛ explicitly.
6.36 Show that x 4 C x C 1 is not irreducible in RŒx even though it has no roots in R.
6.37 (i) If k is a field and each of f .x/; g.x/ 2 kŒx has a root ˛ in k, show that ˛ is
a root of gcd.f; g/.
(ii) How does this apply to the polynomials in Examples 6.35 and 6.36?
6.38 If p is a prime, show that, in Zp Œx,
pY1 pY1
xp xD .x i/ and xp 1
1D .x i /:
i D0 i D1
.p 1/Š 1 mod p:
6.40 *
(i) Let f .x/ D .x a1 / .x an / 2 kŒx, where k is a field. Show that f has
no repeated roots (i.e., all the ai are distinct) if and only if gcd.f; f 0 / D 1,
where f 0 is the derivative of f .
Hint: Use Exercise 5.17 on page 203.
(ii) Prove that if p.x/ 2 QŒx is an irreducible polynomial, then p has no re-
peated roots in C.
Hint: Use Corollary 6.37.
i i
“book2” — 2013/5/24 — 8:18 — page 264 — #284
i i
1 3
6.41 If p is prime, prove that there are exactly 3 .p p/ monic irreducible cubic
polynomials in Fp Œx.
6.42 Determine whether the following polynomials are irreducible in QŒx.
(i) f .x/ D x 5 4x C 2.
(ii) f .x/ D x 4 C x 2 C x C 1.
Hint: Show that f has no roots in F3 and that a factorization of f as a product
of quadratics would force impossible restrictions on the coefficients.
(iii) f .x/ D x 4 10x 2 C 1.
Hint: Show that f has no rational roots and that a factorization of f as a
product of quadratics would force impossible restrictions on the coefficients.
6.43 Is x 5 C x C 1 irreducible in F2 Œx?
Hint: Use Example 6.56.
6.44 Let f .x/ D .x p 1/=.x 1/, where p is prime. Using the identity
f .x C 1/ D x p 1
C pq.x/;
Roots of Unity
In Chapter 3, we defined an nth root of unity to be primitive if every nth root
of unity is a power of . For example, i is a primitive 4th root of unity. Note
that i ispalso an 8th root of unity, for i 8 D 1, but it’s not a primitive 8th root of
unity; 22 .1 C i / is a primitive 8th root of unity.
Proof. We know that n D 1; let d be the smallest positive integer for which
d D 1. By the Division Algorithm, there are integers q and r with n D qd Cr ,
where 0 r < d . Now
1 D n D qdCr D dq r D r ;
1; ; 2 ; : : : ; d 1
;
are all distinct (Exercise 6.51 on page 269). Since there are exactly d d th roots
of unity, they are all powers of , and so is primitive.
i i
“book2” — 2013/5/24 — 8:18 — page 265 — #285
i i
Proposition 6.60. Let n be a positive integer and regard x n 1 2 ZŒx. Then This proposition sheds
light on your discovery in
(i) Exercise 3.59 on page 116.
Y
xn 1D ˆd .x/;
djn
where d ranges over all the positive divisors d of n (in particular, both
ˆ1 .x/ and ˆn .x/ are factors).
(ii) ˆn .x/ is a monic polynomial in ZŒx.
Proof.Q (i) For each divisor d of n, collect all terms in the equation x n 1D
.x / with a primitive d th root of unity. Thus,
Y
xn 1 D hd .x/;
djn
Q
where hd .x/ D .x / with an nth root of unity that is also a prim-
itive d th root of unity. But every such must be an nth root of unity: by
Lemma 6.59, n D dq for some integer d , and 1 D n D dq . Therefore,
hd .x/ D ˆd .x/.
(ii) The proof is by strong induction on n 1. The base step is true, for
ˆ1 .x/ D x 1. For the inductive step n > 1, write
xn 1 D ˆn .x/F .x/;
Q
where F .x/ D ˆd .x/ with d j n and d < n. The inductive hypoth-
esis says that all the factors ˆd of F are monic polynomials in ZŒx;
hence, F is a monic polynomial in ZŒx. By Proposition 6.10, ˆn .x/ D
.x n 1/=F .x/ is a monic polynomial in ZŒx, as desired.
we have
xn 1
ˆn .x/ D Y
ˆd .x/
djn; d<n
i i
“book2” — 2013/5/24 — 8:18 — page 266 — #286
i i
n ˆn .x/
1 x 1
2 xC1
2
3 x CxC1
4 x2 C 1
5 x4 C x3 C x2 C 1
6 x2 xC1
7 x6 C x5 C x4 C x3 C x2 C 1
8 x4 C 1
9 x6 C x3 C 1
10 x4 x3 C x2 xC1
10
11 x C x C x C x C x C x C x4 C x3 C x2 C 1
9 8 7 6 5
12 x4 x2 C 1
Q
You should verify that x 12 1 D d2f1;2;3;4;6;12g ˆd .x/. The recursive def-
inition can be programmed into a CAS (see Exercise 6.55 on page 270); Fig-
ure 6.1 displays the first dozen cyclotomic polynomials. There’s no simple
pattern to these polynomials, but calculating a good number of them gives
you food for thought and leads to interesting conjectures. For example, can
you conjecture anything about deg.ˆn /? All the coefficients of the cyclotomic
polynomials displayed in Figure 6.1 are 0 and ˙1, but your guess that this is
always true is wrong [see Exercise 6.55(iii) on page 270]. Do any of the ˆn .x/
factor in ZŒx? N
ˆp .x/ D x p 1
C xp 2
C C x 2 C x C 1:
The next proposition shows that .n/ is intimately related to ˆn .x/, and this
leads to a simple proof of a fact from number theory.
i i
“book2” — 2013/5/24 — 8:18 — page 267 — #287
i i
We’ve shown that ˆn .x/ 2 ZŒx, and we’ll finish this section by showing
that ˆp is irreducible in QŒx when p is prime. It turns out that ˆn is actually
irreducible in QŒx for every n, but the proof is more difficult (see [17] p.195).
As any linear polynomial over a field, the cyclotomic polynomial ˆ2 D
x C 1 is irreducible in QŒx; ˆ3 D x 2 C x C 1 is irreducible in QŒx because it
has no rational roots; we saw, in Example 6.58, that ˆ5 is irreducible in QŒx.
We’ll next introduce another irreducibility criterion, useful in its own right,
that will allow us to prove that ˆp is irreducible in QŒx for all primes p. An
example will motivate the criterion.
Example 6.64. Exercise 6.30 on page 254 asked you to show that f .x/ D
x 3 C5x 2 10x C15 is irreducible in ZŒx. You now have machinery that makes
this easy. For example, you could invoke Theorem 6.52 (the Rational Root
Theorem) to show that f has no root in Q (or Z) and then use Proposition 6.51.
But let’s use another technique that shows the power of reducing coeffi-
cients. Suppose that f .x/; g.x/; h.x/ 2 ZŒx and f D gh, where neither g
nor h is constant; reduce the coefficients mod 5. Because reduction mod 5 is
a homomorphism, we have f D g h. But all the coefficients of f (except the
leading one) are divisible by 5, so we have
x 3 D gh in Z5 Œx:
i i
“book2” — 2013/5/24 — 8:18 — page 268 — #288
i i
Proof. By Exercise 5.42 on page 216, the function 'W ZŒx ! ZŒx, given by
f .x/ 7! f .x C c/;
But '.g/ D g.x C c/, so that Eq. (6.3) is a forbidden factorization of g.x C c/.
Hence, Corollary 6.54 says that g is irreducible in QŒx.
f .x C 1/ D .x C 1/4 C 1 D x 4 C 4x 3 C 6x 3 C 4x C 2:
Theorem 6.68 (Gauss). For every prime p, the pth cyclotomic polynomial
ˆp .x/ is irreducible in QŒx.
i i
“book2” — 2013/5/24 — 8:18 — page 269 — #289
i i
Exercises
6.46 * Let D e2 i=n be a primitive nth root of unity.
(i) Prove, for all n 1, that
xn 1 D .x 1/.x /.x 2 / .x n 1
/;
and, if n is odd, that
x n C 1 D .x C 1/.x C /.x C 2 / .x C n 1
/:
(ii) For numbers a and b, prove that
an b n D .a b/.a b/.a 2 b/ .a n 1
b/;
and, if n is odd, that
an C b n D .a C b/.a C b/.a C 2 b/ .a C n 1
b/:
Hint: Set x D a=b if b ¤ 0.
6.47 * Let k be a field and a 2 k. Show that, in kŒx,
xn an D .x a/ x n 1
C xn 1
a C xn 2 2
a C : : : an 1
x C an :
6.48 If k is a field, a 2 k, and f .x/ D cn x n C cn 1x
n 1 C C c0 2 kŒx, then
rewrite
f .x/ f .a/ D cn x n C cn 1 x n 1 C C c0 cn a n C cn 1a
n 1
C C c0
and use Exercise 6.47 to give another proof of Corollary 6.15.
6.49 Determine whether the following polynomials are irreducible in QŒx.
(i) f .x/ D 3x 2 7x 5. (ii) f .x/ D 2x 3 x 6.
(iii) f .x/ D 8x 3 6x 1. (iv) f .x/ D x 3 C 6x 2 C 5x C 25.
(v) f .x/ D x 4 C 8x C 12.
Hint: In F5 Œx, f .x/ D .x C 1/g.x/, where g is irreducible.
6.50 Use the Eisenstein Criterion to prove that if a is a squarefree integer, then
x n a is irreducible in QŒx for every n 1. Conclude that there are irreducible
polynomials in QŒx of every degree n 1.
6.51 * In the proof of Lemma 6.59, we claimed that the first d powers of are distinct,
where is an nth root of unity and d is the smallest positive integer with d D 1.
Prove this claim.
6.52 * Let be an nth root of unity. Lemma 6.59 shows that is a primitive d th root
of unity for some divisor d of n. Show that the divisor is unique.
6.53 Consider a finite table of data:
Input Output
a1 b1
a2 b2
a3 b3
:: ::
: :
an bn
Show that two polynomial functions (defined over Q) agree on the table if and
only if their difference is divisible by
n
Y
.x ai /:
i D1
i i
“book2” — 2013/5/24 — 8:18 — page 270 — #290
i i
6.54 (i) Show that the set of polynomials in QŒx that vanish on f˛1 ; : : : ; ˛n g is an
ideal in QŒx.
(ii) What is a generator of this ideal?
6.55 *
(i) Implement the recursively defined function for ˆn .x/ given in Example 6.61
in a CAS.
(ii) Use it to generate ˆn .x/ for, say, 1 n 50.
(iii) Use the CAS to find the smallest value of n for which a coefficient of ˆn .x/
is not 0, 1, or 1.
On the surface, the problem of fitting data seems to have little to do with the
ideas in this chapter. However, by placing it in a more abstract setting, we’ll
see that it yields Lagrange Interpolation, a result useful in its own right. Thus,
this problem fits right into the theory of commutative rings; in fact, it’s really
the Chinese Remainder Theorem! But isn’t the Chinese Remainder Theorem
about solving some congruences? Well, yes, and we’ll see that we can make
the notion of congruence apply here. But first, let’s find a polynomial f .x/ by
hand that fits the table.
f g mod m;
if m j .f g/.
i i
“book2” — 2013/5/24 — 8:18 — page 271 — #291
i i
a b mod I
We can now rephrase the Division Algorithm in kŒx using congruence. The
statement: given m.x/; f .x/ 2 kŒx with m.x/ ¤ 0, there exist q.x/; r .x/ 2
kŒx with f D qm C r , where r D 0 or deg.r / < deg.m/, can be rewritten to
say
f r mod m:
f 12 mod .x C 3/
f 22 mod .x 2/
f 72 mod .x 3/
f 26 mod .x C 4/:
Notice that the four linear polynomials are pairwise relatively prime.
Let’s push the similarity a little further, using the localization idea on page 146.
Suppose we can find polynomials g, h, k, and ` satisfying
g. 3/ D 1 h. 3/ D 0 k. 3/ D 0 `. 3/ D 0
g.2/ D 0 h.2/ D 1 k.2/ D 0 `.2/ D 0
g.3/ D 0 h.3/ D 0 k.3/ D 1 `.3/ D 0
g. 4/ D 0 h. 4/ D 0 k. 4/ D 0 `. 4/ D 1:
Setting f D 12g C 22h C 72k 26`, we have a polynomial that fits the
original table (why?). Now Proposition 6.15, the Factor Theorem, shows that
g is divisible by the linear polynomials x 2, x 3, and x C 4. Since they are
irreducible in QŒx, they are pairwise relatively prime; hence, Exercise 6.17(ii)
on page 247 says that g is divisible by their product: there is A.x/ such that
i i
“book2” — 2013/5/24 — 8:18 — page 272 — #292
i i
Proof. The proof of Theorem 4.27 can be easily adapted to prove this.
Example 6.70. The calculations we have just made can be used to illustrate
the theorem. The table gives
b1 D 12; b2 D 22; b3 D 72; b4 D 26I
applying the Remainder Theorem to the table entries,
m1 D x C 3; m2 D x 2; m3 D x 3; m4 D x C 4:
Then
M1 D .x 2/.x 3/.x C 4/;
M2 D .x C 3/.x 3/.x C 4/;
M3 D .x C 3/.x 2/.x C 4/;
M4 D .x C 3/.x 2/.x 3/;
i i
“book2” — 2013/5/24 — 8:18 — page 273 — #293
i i
and
s1 D 1=30; s2 D 1=30; s3 D 1=42; s4 D 1=42:
Note, for example, that
1
s1 M1 1D 30 .x 2/.x 3/.x C 4/ 1
3 2
x x 14x C 24
D 1
30
x3 x2 14x 6
D
30
4x 2 x2
D .x C 3/ ;
30
so that s1 M1 1 mod m1 . This is not magic; if you look carefully at how s1
is calculated (we called it A on the previous page), you’ll see that it is none
other than
1=M1 . 3/:
And the Remainder Theorem (again) says that the remainder when g.x/ D
M1 .x/=M1 . 3/ is divided by x C 3 is
g. 3/ D M1 . 3/=M1 . 3/ D 1:
Similarly, we have si Mi 1 mod mi for the other values of i .
Finally, the statement about any other solution to the system follows from
Exercise 6.53 on page 269. N
Mi .x/ D .x a1 /.x a2 / .x ai 3
1 /.xa /.x i ai C1 / .x ar /:
i i
“book2” — 2013/5/24 — 8:18 — page 274 — #294
i i
Exercises
6.56 Find a polynomial that agrees with the table
Input Output
0 3
1 4
2 7
3 48
4 211
6.57 A radio show offered a prize to the first caller who could predict the next term in
the sequence
1; 2; 4; 8; 16:
(i) What would you get if you used “common sense?”
(ii) What would you get if you used Lagrange Interpolation?
6.58 Another radio show offered a prize to the first caller who could predict the next
term in the sequence
14; 3; 26; 8; 30:
After no one got it for a few days, the host announced that these are the first
five numbers that were retired from the Mudville Sluggers baseball team. Use
Lagrange Interpolation to predict the next number that was retired in Mudville.
6.59 The following table fits the quadratic f .x/ D x 2 3x C 5; that is, f .0/ D 5,
f .1/ D 3, etc. Now forget about f and use Lagrange Interpolation to find a
The result of Exercise 6.53 polynomial that fits the table.
allows you to fool many
standardized tests.
Input Output
0 5
1 3
2 3
3 5
4 9
It seems that this table should fool Lagrange Interpolation, which produces a de-
gree 4 polynomial. Does it?
6.60 * Show that Lagrange Interpolation produces a polynomial of smallest degree that
agrees with a given input-output table.
6.61 (i) Find a polynomial g.x/ that agrees with the table
Input Output
4 24
5 60
6 120
7 210
8 336
i i
“book2” — 2013/5/24 — 8:18 — page 275 — #295
i i
6.62 It is known that there’s a cubic polynomial function f .x/ 2 QŒx such that, for
positive integers n,
n
X1
f .n/ D k2:
kD0
(i) Find f .
Hint: A cubic is determined by four inputs.
P 1 2
(ii) Prove that f .n/ D nkD0 k for all positive integers n.
6.63 * Using the notation of Theorem 6.71, show that
i i
“book2” — 2013/5/24 — 8:18 — page 276 — #296
i i
i i
“book2” — 2013/5/24 — 8:18 — page 277 — #297
i i
i i
“book2” — 2013/5/24 — 8:18 — page 278 — #298
i i
the ideal .m/ in Z. Push this analogy further. Elements in Zm are congruence
classes Œa, where a 2 Z. Let’s invent new elements Œf corresponding to poly-
nomials f .x/. In more detail, Œa denotes fa C qm W q 2 Zg, and we forced
qm D 0 (actually, using the Division Algorithm in Z, this allowed us to focus
on remainders after dividing by m). Defining Œf D ff .x/ C q.x/.x 2 C 1/ W
q.x/ 2 RŒxg would allow us to focus on the remainder after dividing f .x/ by
x 2 C 1. Indeed, the Division Algorithm in RŒx writes
f .x/ D q.x/.x 2 C 1/ C r .x/;
where r .x/ D 0 or deg.r / < 2. In other words, we could write Œf D Œr ,
where r .x/ D a C bx for a; b 2 R. Hold it! If the bracket notation makes
x 2 C 1 D 0, then x 2 D 1, and we may as well write i instead of Œx. Looks a
lot like C to us! Now it turns out that this idea is also compatible with addition
and multiplication, as we shall see when we introduce quotient rings precisely.
The construction makes sense for any commutative ring R and any ideal I
in R; moreover, it constructs not only C but many other important systems
as well.
i i
“book2” — 2013/5/24 — 8:18 — page 279 — #299
i i
Thus, the coset a C I is the set of all those elements in R that are congru-
ent to a mod I . Cosets generalize the notion of congruence class and so, by
analogy, the coset a C I is often called a mod I .
i i
“book2” — 2013/5/24 — 8:18 — page 280 — #300
i i
Once the set Zm was built, we equipped it with the structure of a commu-
tative ring by defining addition and multiplication of congruence classes. We
carry out that program now for R=I .
˛W .a C I; b C I / 7! a C b C I
W .a C I; b C I / 7! ab C I:
Example 7.5. Suppose that R D ZŒx and I is the principal ideal .x 2 Cx C1/.
If a D 3 C 2x and b D 4 C 3x, then
.3 C 2x C I /.4 C 3x C I / D 6 C 11x C I: N
Lemma 7.6. Addition and multiplication R=I R=I ! R=I are well-defined
functions.
.a C b/ .a0 C b 0 / D .a a0 / C .b b 0 / 2 I:
ab a0 b 0 D ab a0 b C a0 b a0 b 0 D .a a0 /b C a0 .b b 0 / 2 I:
i i
“book2” — 2013/5/24 — 8:18 — page 281 — #301
i i
Proof. Each of the eight axioms in the definition of commutative ring must be
verified; all the proofs are routine, for they are inherited from the correspond-
ing property in R. If a; b; c 2 R, then we have
(i) Commutativity of addition:
.a C I / C .b C I / D a C b C I D b C a C I D .b C I / C .a C I /:
(ii) The zero element is I D 0 C I , for I C .a C I / D 0 C a C I D a C I .
(iii) The negative of a C I is a C I , for .a C I / C . a C I / D 0 C I D I .
(iv) Associativity of addition:
Œ.a C I / C .b C I / C .c C I / D .a C b C I / C .c C I /
D Œ.a C b/ C c C I D Œa C .b C c/ C I
D .a C I / C .b C c C I / D .a C I / C Œ.b C I / C .c C I /:
(viii) Distributivity:
.a C I / .b C I / C .c C I / D .a C I /.b C c C I /
D Œa.b C c/ C I D .ab C ac/ C I
D .ab C I / C .ac C I /
D .a C I /.b C I / C .a C I /.c C I /:
i i
“book2” — 2013/5/24 — 8:18 — page 282 — #302
i i
zero ring is not very interesting, we usually assume, when forming quotient
rings, that ideals are proper ideals. Recall, in constructing Zm , that we usually
assumed that m 2.
The definitions of addition and multiplication in R=I involve an interplay
between reducing modulo the ideal I and the operations of addition and multi-
plication in R=I . In the special case of Zm , we called this interplay “reduce as
you go.” But that’s just an informal way of describing a homomorphism. More
precisely, if we define a function W R ! R=I by W a 7! a C I , then we can
rewrite a C b C I D .a C I / C .b C I / as .a C b/ D .a/ C .b/; similarly,
ab C I D .a C I /.b C I / can be rewritten as .ab/ D .a/.b/.
The word “map” is often Definition. If I is an ideal in a commutative ring R, then the natural map is
used as a synonym for the function W R ! R=I given by
function or homomor-
phism.
a 7! a C I I
Proof. We have just seen that .aCb/ D .a/C.b/ and .ab/ D .a/.b/.
Since .1/ D 1 C I , the multiplicative identity in R=I , we see that is a ho-
momorphism.
Now is surjective: if a C I 2 R=I , then a C I D .a/. Finally, by
definition, ker D fa 2 R j .a/ D 0 C I g. But .a/ D a C I , and
a C I D 0 C I if and only if a 2 I (Proposition 7.4). The result follows.
Here is the converse of Proposition 5.25: Every ideal is the kernel of some
homomorphism.
We know that isomorphic commutative rings are essentially the same, being
“translations” of one another; that is, if 'W R ! S is an isomorphism, we may
think of r 2 R as being in English while '.r / 2 S is in French. The next
theorem shows that quotient rings are essentially images of homomorphisms.
It also shows how to modify a homomorphism to make it an isomorphism.
Proof. Let I D ker '. We have already seen, in Proposition 5.25, that I is an
ideal in R and im ' is a subring of A.
i i
“book2” — 2013/5/24 — 8:18 — page 283 — #303
i i
Define e
' W R=I ! im ' by
e
' .r C I / D '.r /:
We claim that e
' is an isomorphism. First, e
' is well-defined. If r C I D s C I ,
then r s 2 I D ker ', '.r s/ D 0, and '.r / D '.s/. Hence
e
' .r C I / D '.r / D '.s/ D e
' .s C I /:
Next, e
' is a homomorphism because ' is.
e
' .r C I / C .s C I / D e ' .r C s C I /
D '.r C s/ D '.r / C '.s/
De ' .r C I / C e' .s C I /:
Similarly, e
' .r C I /.s C I / D e ' .s C I / (Exercise 7.7 on page 285).
' .r C I /e
As e
' .1 C I / D '.1/ D 1, we see that e ' a homomorphism.
We show that e' is surjective. If a 2 im ', then there is r 2 R with a D '.r /;
plainly, a D '.r / D e' .r C I /.
Finally, we show that e ' is injective. If e
' .r C I / D 0, then '.r / D 0, and
r 2 ker ' D I . Hence, r C I D I ; that is, ker e ' D fI g and e ' is injective, by
Proposition 5.31. Therefore, e ' is an isomorphism.
We can illustrate this last proof with a picture; such a picture is often called
a commutative diagram if composites of maps having same domain and same
target are equal. Here, i W im ' ! A is the inclusion, and ' D ie ' .
'
R A
i
R=I im '
e
'
Theorem 7.11. The quotient ring RŒx=.x 2 C 1/ is a field isomorphic to the Hence, the high school
complex numbers C. approach to complex
numbers contains the
germ of a correct idea.
Proof. Consider the evaluation 'W RŒx ! C (as in Corollary 5.21) with
'.x/ D i and '.a/ D a for all a 2 R; that is,
'W f .x/ D a0 C a1 x C a2 x 2 C 7! f .i / D a0 C a1 i C a2 i 2 C :
Now ' is surjective, for a C i b D '.a C bx/, and so the First Isomorphism
Theorem gives an isomorphism e ' W RŒx= ker ' ! C, namely f .x/ C ker ' 7!
f .i /. But Corollary 6.26 gives ker ' D .x 2 C1/; therefore, RŒx=.x 2 C1/ Š C
as commutative rings, by the First Isomorphism Theorem. We know that C is
a field, and any commutative ring isomorphic to a field must, itself, be a field.
Thus, the quotient ring RŒx=.x 2 C 1/ is another construction of C.
i i
“book2” — 2013/5/24 — 8:18 — page 284 — #304
i i
1 a bi
D 2 ;
a C bi a C b2
to RŒx=.x 2 C 1/, using the inverse of the isomorphism ' in the proof of
Theorem 7.11. Now ' 1 .a C bi / D a C bx C .x 2 C 1/; so that
1 a bx
2
D 2 C .x 2 C 1/:
a C bx C .x C 1/ a C b2
(ii) Euclidean Algorithm II gives another way of finding the inverse, writing
gcd.a C bx; x 2 C 1/ as a linear combination of a C bx and x 2 C 1. The
algorithms in Exercise 6.18(iii) produce the linear combination in RŒx:
a bx 2 a2 C b 2
.a C bx/ C 1.x C 1/ D :
b2 b2
a bx b2
.a C bx/ C .x 2 C 1/ D 1:
a2 C b 2 a2 C b 2
Proof. It suffices to show that every nonzero element a C .p/ in the commu-
tative ring R=.p/ has a multiplicative inverse. Since a C .p/ ¤ 0, we have
a … .p/; that is, p − a. Since R is a PID, Theorem 6.46 says that gcd’s exist
and are linear combinations. In particular, gcd.a; p/ D 1, so there are s; t 2 R
with sa C tp D 1. Thus,
1 C .p/ D sa C .p/ D sa C .p/ D s C .p/ a C .p/
1
in R=.p/, and a C .p/ D s C .p/. Therefore, R=.p/ is a field.
i i
“book2” — 2013/5/24 — 8:18 — page 285 — #305
i i
Exercises
7.1 Are any cosets of .5/ in Z ideals?
7.2 Prove Proposition 7.1.
7.3 Prove Proposition 7.2.
7.4 * In QŒx=.x 2 C x C 1/, write each term in the form a C bx with a; b 2 Q.
(i) .3 C 2x/.4 C 3x/ (ii) 12 C 17x C 6x 2
(iii) x2 (iv) x 3
(v) .1 x/2 (vi) .1 x/.1 x2 /
(vii) .a C bx/.a C bx 2 / (viii) .a C bx/2 .
S=I D fr C I W r 2 Sg
is a subring of R=I .
(ii) If J is an ideal in R and I J , prove that
J =I D fr C I W r 2 J g
is an ideal in R=I .
7.10 Show that the subring ZŒx=.x 2 C 1/ of RŒx=.x 2 C 1/ is isomorphic to the
Gaussian integers ZŒi .
7.11 Show that there is an isomorphism of fields:
RŒx=.x 2 C 1/ Š RŒx=.x 2 C x C 1/
i i
“book2” — 2013/5/24 — 8:18 — page 286 — #306
i i
7.14 For each element of QŒx=.x 2 C x C 1/, find the multiplicative inverse.
(i) 3 C 2x C .x 2 C x C 1/
(ii) 5 x C .x 2 C x C 1/
(iii) 15 C 7x 2x 2 C .x 2 C x C 1/
(iv) a C bx C .x 2 C x C 1/ (in terms of a and b).
7.15 * Prove the Third Isomorphism Theorem: If R is a commutative ring having
ideals I J , then J =I is an ideal in R=I , and there is an isomorphism
.R=I /=.J =I / Š R=J .
Hint: Show that the function 'W R=I ! R=J , given by a C I 7! a C J , is a
homomorphism, and apply the First Isomorphism Theorem.
7.16 For every commutative ring R, prove that RŒx=.x/ Š R.
7.17 An ideal I in a commutative ring R is called a prime ideal if I is a proper ideal
such that ab 2 I implies a 2 I or b 2 I .
(i) If p is a prime number, prove that .p/ is a prime ideal in Z.
Hint: Euclid’s Lemma.
(ii) Prove that if an ideal .m/ in Z is a prime ideal, then m D 0 or jmj is a prime
number.
7.18 Let I be a proper ideal in kŒx, where k is a field.
(i) If p is an irreducible polynomial, prove that .p/ is a prime ideal in kŒx.
(ii) Prove that if an ideal .f / in kŒx is a prime ideal, then f D 0 or f is an
irreducible polynomial.
7.19 Let I be a proper ideal in a commutative ring R.
(i) Prove that .0/ is a prime ideal in R if and only if R is a domain.
(ii) Prove that I is a prime ideal if and only if a … I and b … I imply ab … I .
(iii) Prove that I is a prime ideal if and only if R=I is a domain.
7.20 Prove that .x/ is a prime ideal in ZŒx.
Hint: Is ZŒx=.x/ a domain?
7.21 An ideal I in a commutative ring R is called a maximal ideal if I is a proper ideal
for which there is no proper ideal J with I ¨ J .
(i) If p is a prime number, prove that .p/ is a maximal ideal in Z.
(ii) Prove that if an ideal .m/ in Z is a maximal ideal, then jmj is a prime number.
7.22 Let I be a proper ideal in kŒx, where k is a field.
(i) If p is an irreducible polynomial, prove that .p/ is a maximal ideal in kŒx.
(ii) Prove that if an ideal .f / in kŒx is a maximal ideal, then f is an irreducible
polynomial.
7.23 * Let I be a proper ideal in a commutative ring R.
(i) Prove that .0/ is a maximal ideal in R if and only if R is a field.
(ii) Prove that I is a maximal ideal if and only if R=I is a field. Conclude that if
k is a field and p.x/ 2 kŒx is irreducible, then kŒx=.p/ is a field.
(iii) Prove that every maximal ideal is a prime ideal.
7.24 (i) Prove that J is a maximal ideal in ZŒx, where J consists of all polynomials
with even constant term.
Hint: Prove that ZŒx=J Š F2 .
(ii) Prove that the prime ideal .x/ in ZŒx is not a maximal ideal.
i i
“book2” — 2013/5/24 — 8:18 — page 287 — #307
i i
RŒx=.x 2 C 1/ Š C:
We are going to apply quotient rings to prove some interesting results: for every
polynomial f .x/ 2 kŒx, where k is a field, there exists a field extension E= k
containing all the roots of f ; we will also be able to prove the existence of
finite fields other than Fp .
Characteristics
Contemplating “any field” seems quite daunting, and so it makes sense for us to
begin classifying fields. First of all, fields come in two types: those that contain
a subfield isomorphic to Q, and those that contain a subfield isomorphic to Fp
for some prime p.
Recall the definition of na on page 160, where n 2 Z and a is an element
of a commutative ring R. For example, 3a means a C a C a and . 3/a means
a a a. More generally, if n is a nonnegative integer, then na means
a C a C C a;
„ ƒ‚ …
n times
.n/ D ne;
is a homomorphism.
Proof. Since every ideal in Z is principal, ker D .m/ for some integer
m 0. If m D 0, then is an injection, and im Š Z. If m ¤ 0, the First
Isomorphism Theorem gives Zm D Z=.m/ Š im k. Since k is a field,
im is a domain, and so m is prime (Exercise 5.3 on page 195). Writing p
instead of m, we have im Š Zp D Fp .
i i
“book2” — 2013/5/24 — 8:18 — page 288 — #308
i i
'W a 7! ap ;
is an isomorphism.
Proof. (i) Since k has characteristic p, we have ker./ D .p/; that is, .p/ D
p1 D 0 (we have reverted to our usual notation, so that 1 denotes the mul-
tiplicative identity). But the hybrid product pa can be viewed as a product
of two ring elements: pa D .p1/a D 0a D 0.
(ii) Expand .a C b/p by the Binomial Theorem, and note that p j pj for
all 1 j p 1. By (i), all the inside terms vanish. The argument is
completed by induction on n 1.
(iii) It is obvious that '.1/ D 1 and
i i
“book2” — 2013/5/24 — 8:18 — page 289 — #309
i i
Proposition 7.18. If K is a finite field, then jKj D p n for some prime p and
some n 1.
Proof. The prime field of K is isomorphic to Fp for some prime p, by Propo- If K is a vector space
sition 7.15. As we remarked, K is a vector space over Fp ; as K is finite, it is over k, its dimension is
denoted by dimk .K/ or,
obviously finite-dimensional. If dimFp .K/ D n, then jKj D p n , by Corol-
more briefly, by dim.K/.
lary A.34 in the Appendix.
Extension Fields
The Fundamental Theorem of Algebra states that every nonconstant polyno-
mial in CŒx is a product of linear polynomials in CŒx; that is, C contains all
the roots of every polynomial in CŒx. Using ideas similar to those allowing us
to view C as a quotient ring, we’ll prove Kronecker’s Theorem, a local ana-
log of the Fundamental Theorem of Algebra for polynomials over an arbitrary
field k: given f .x/ 2 kŒx, there is some field E containing k as a subfield that
also contains all the roots of f . (We call this a local analog, for even though
the larger field E contains all the roots of the polynomial f , it may not contain
roots of some other polynomials in kŒx.) In fact, we’ll see how to construct
such an E making basic use of quotient rings of the form kŒx=I , where k is a
field. Theorem 4.43 says that
Zm is a field if and only
if m is a prime in Z;
Proposition 7.19. If k is a field and I D .f /, where f .x/ 2 kŒx is noncon-
Proposition 7.19 is the
stant, then the following are equivalentW analog for kŒx.
(i) f is irreducible
(ii) kŒx=I is a field
(iii) kŒx=I is a domain.
Proof. (i) ) (ii) Since kŒx is a PID, this follows at once from Proposi-
tion 7.13.
(ii) ) (iii) Every field is a domain.
(iii) ) (i) Assume that kŒx=I is a domain. If f is not irreducible, then
there are g.x/; h.x/ 2 kŒx with f D gh, where deg.g/ < deg.f / and
deg.h/ < deg.f /. Recall that the zero in kŒx=I is 0 C I D I . Thus, if
gCI D I , then g 2 I D .f / and f j g, contradicting deg.g/ < deg.f /.
Similarly, hC I ¤ I . However, the product .g C I /.hC I / D f C I D I
is zero in the quotient ring, which contradicts kŒx=I being a domain.
Therefore, f is irreducible.
The structure of general quotient rings R=I can be complicated, but for This section will be using
special choices of R and I , the commutative ring R=I can be easily described. various facts about dimen-
For example, when k is a field and p.x/ 2 kŒx is an irreducible polynomial, sion, and you may wish
to look in Appendix A.3 to
the following proposition gives a complete description of the field R=I D refresh your memory.
kŒx=.p/, and it shows how to build a field K in which p.x/ has a root.
i i
“book2” — 2013/5/24 — 8:18 — page 290 — #310
i i
k 0 D fa C I W a 2 kg
dimk .K/ D d:
Proof. (i) Since p is irreducible, Proposition 7.19 says that the quotient ring
K D kŒx=I is a field, while Corollary 5.32 on page 220 says that the
natural map a 7! a C I restricts to an isomorphism k ! k 0 .
(ii) Let p.x/ D a0 C a1 x C C ad 1 x d 1 C x d , where ai 2 k for all i .
0
In
P light of thej identification of k and k in (i), we may view p.x/ as
.a
j j C I /x . Hence, since z D x C I ,
i i
“book2” — 2013/5/24 — 8:18 — page 291 — #311
i i
Pd 1 i
so that p j i D0 ci x in kŒx. But deg.p/ D d , so that all ci D 0.
Definition. If K is a field containing k as a subfield, then K is called an exten- This notation should not be
sion field of k, and we write “K= k is an extension field.” An extension field confused with the notation
for a quotient ring, for a
K= k is a finite extension if K is a finite-dimensional vector space over k. The
field K has no interesting
dimension of K, denoted by ideals; in particular, if
ŒK W k; k ¨ K, then k is not an
ideal in K.
is called the degree of K= k.
Œk.z/ W k D deg.p/:
How to Think About It. At first glance, many people see Proposition 7.20
as a cheat: we cook up a field that contains a root of p.x/ by reducing mod p.
The root is thus a coset, not a “number.” But, just as mathematicians gradually
came to see the elements of C as numbers (through their constant use in calcu-
lations), one can develop a feel for arithmetic in kŒx=.p/ in which the cosets
become concrete objects in their own right, as in the next example.
p.x/ D x 3 C x 2 2x 1:
You can check that p is irreducible (it’s a cubic without a rational root (why?),
so it can’t factor). Now
K D QŒx=.x 3 C x 2 2x 1/
a C bx C cx 2 C .p/;
i i
“book2” — 2013/5/24 — 8:18 — page 292 — #312
i i
polynomial in its congruence class mod p. Using this convention, the elements
of K are thus named by quadratic polynomials in QŒx.
What about the arithmetic? Just as in C D RŒx=.x 2 C1/, calculations in K
are carried out by calculating in QŒx, dividing by p, and taking the remainder.
Indeed, because x 3 C x 2 2x 1 D 0 in K, we have an equation in K,
x3 D x 2 C 2x C 1:
a fact that you can verify (by hand or CAS) by expanding the left-hand
side and reducing mod .p/.
In general, expand
a C bx C cx 2 d C ex C f x 2
as
A little practice with such calculations gives you the feeling that you are indeed
working with “numbers” in a system and, if K had any use, you’d soon become
very much at home in it just as our Renaissance predecessors became at home
in C. N
i i
“book2” — 2013/5/24 — 8:18 — page 293 — #313
i i
Exercises
7.25 * Prove Lemma 7.14.
˝ ˛
7.26 * If X is a subset of a field k, then X , the subfield generated by X , is the
intersection of all the subfields containing X (by Exercise 4.61(iii) on page 168,
the intersection of any family of subfields of k is itself a subfield of k).
˝ ˛
(i) Prove that X is the smallest ˝ ˛such subfield in the sense that any subfield F
containing X must contain X .
(ii) Define the prime field of a field k to be the intersection of all the subfields
of k. Prove that the prime field of k is the subfield generated by 1.
(iii) Prove that the prime field of a field is isomorphic to either Q or Fp .
7.27 * If k is a field of characteristic p > 0 and a 2 k, prove that
.x C a/p D x p C ap :
˛ D x; ˇ D x 2 2; and D x 3 3x D x 2 x C 1:
Algebraic Extensions
The first step in classifying fields is by their characteristics. Here is the second
step: we define algebraic extensions.
i i
“book2” — 2013/5/24 — 8:18 — page 294 — #314
i i
In Exercise 7.43 on page 308, you’ll show that k.A/ is the smallest subfield
of K containing k and A; that is, if E is any subfield of K containing k and A,
then k.A/ E.
Proposition 7.20 starts with an irreducible polynomial p.x/ 2 kŒx and
constructs an extension K= k in which p has a root. Suppose we start with the
root; that is, suppose that z is algebraic over k. Can we find a polynomial p
so that k.z/ (the smallest extension of k that contains z) can be realized as
kŒx=.p/? Let’s look at an example.
p p
Example 7.24. Suppose that K D R, k D Q, and z D 2 C 3. First of
all, z is algebraic over Q. To see this, proceed as you would in high school
algebra.
p
z 2 D 5 C 2 6;
.z 2 5/2 D 24;
z4 10z 2 C 1 D 0:
W f .x/ 7! f .z/:
The First Isomorphism Theorem suggests that we look at im and ker .
im contains Q (because .a/ D a for all a 2 Q) and z (because .x/ D z).
It follows that any subfield of R that contains Q and z contains im . In other
words,
im D Q.z/:
i i
“book2” — 2013/5/24 — 8:18 — page 295 — #315
i i
QŒx=I Š im :
im is a subring of R, so it is a domain. And I is an ideal in QŒx, so it is
principal, say I D .p/, where p.x/ 2 QŒx.
Furthermore, since QŒx=I is a domain, p is an irreducible polynomial in
QŒx, which we can take to be monic.
Thus, we have an isomorphism:
‰W QŒx=.p/ Š im ;
QŒx=.p/ Š Q.z/:
‰ W kŒx=I ! k.z/
W k.z/ ! k.z 0 /
W f 7! f .z/:
i i
“book2” — 2013/5/24 — 8:18 — page 296 — #316
i i
D‰ 0 ı‰ 1
‰0
k.z 0 /
i i
“book2” — 2013/5/24 — 8:18 — page 297 — #317
i i
‰
QŒx=.x 3 5/ Q.z/
D‰ı‰ 1
‰0
Q.!z/ N
ŒK W k D ŒK W EŒE W k:
for all j . Since A is linearly independent over k, it follows that j i D 0 for all
j and i , as desired.
Example 7.28. We now show how Theorem 7.27, the multiplicativity of de-
gree in a tower of extension fields, can be used to calculate degrees; we also
show, given an extension field E= k, that an explicit basis of E over k can
sometimes be constructed. We urge you to work though this example care-
fully; it will help make the preceding development much more concrete, and
you will see how all these ideas come together.
Let’s return to Exercise 3.56 on page 116 (if you haven’t attempted this
exercise, you should try it now). It involves D cos.2=7/ C i sin.2=7/, a
primitive 7th root of unity; note that the powers of are the vertices of a regular
7-gon in the complex plane. Using Proposition 6.62 and the language we have
since introduced, we can now say that
irr.; Q/ D ˆ7 .x/ D x 6 C x 5 C x 4 C x 3 C x 2 C 1;
i i
“book2” — 2013/5/24 — 8:18 — page 298 — #318
i i
˛ D C 6 D 2 cos.2=7/
ˇ D 2 C 5 D 2 cos.4=7/
D 3 C 4 D 2 cos.6=7/;
x3 C x2 2x 1:
Ah, but this is precisely the irreducible p.x/ in Example 7.22. There, you
constructed a field in which p has a root, but you didn’t know what the roots
are. Now you know: they are ˛, ˇ, and
, all real numbers, determined by
expressions involving cosines. Furthermore, the construction in Theorem 7.25
gives a field isomorphic to Q.˛/. But Q.˛/ contains all the roots of p; for
example,
2
˛2 D C 6
D 2 C 2 7 C 12
D 2 C 2 C 5
D ˇ C 2;
and, hence,
ˇ D ˛2 2 2 Q.˛/:
3
In the same way, you can expand ˛ to see that
D ˛ 3 C 3˛;
Q.˛/ Hence, ŒQ./ W Q.˛/ 3 D 6, and ŒQ./ W Q.˛/ D 2. Therefore, the exten-
sion Q./=Q decomposes into a cubic extension of Q followed by a quadratic
3 extension of Q.˛/. This implies that
Q
deg irr .; Q.˛// D 2I
i i
“book2” — 2013/5/24 — 8:18 — page 299 — #319
i i
Q./
2
3
Q.˛/
Q.ı/ 3
i i
“book2” — 2013/5/24 — 8:18 — page 300 — #320
i i
Exercises
7.31 As usual, let n D cos.2=n/ C i sin.2=n/.
(i) Find the minimal polynomial of n over Q for all n between 1 and 10.
(ii) What is minimal polynomial of p over Q if p is prime?
(iii) What is minimal polynomial of p2 over Q if p is prime?
7.32 If p is a prime, and p D cos.2=p/ C i sin.2=p/, show that
Q.p / W Q D p 1:
p
7.33 Show that x 2 3 is irreducible in Q 2 Œx.
7.34 Let k K E be fields. Prove that if E is a finite extension of k, then E is a
finite extension of K, and K is a finite extension of k.
7.35 Show that
(i) cos.2=7/ C cos.4=7/ C cos.8=7/ D 21 .
2
(ii) sin.2=7/ C sin.4=7/ C sin.8=7/ D 72 .
7.36 Let k F K be a tower of fields, and let z 2 K. Prove that if k.z/=k is finite,
then ŒF .z/ W F Œk.z/ W k. Conclude that ŒF .z/ W F is finite.
Hint: Use Proposition 7.20 to obtain an irreducible polynomial p.x/ 2 kŒx; the
polynomial p may factor in KŒx.
7.37 Let K=k be an extension field. If A K and u 2 k.A/, prove that there are only
finitely many a1 ; : : : ; an 2 A with u 2 k.a1 ; : : : ; an /.
7.38 Let E=k be a field extension. If v 2 E is algebraic over k, prove that v 1 is
algebraic over k.
Splitting Fields
We now prove a result of Kronecker that says that if f .x/ 2 kŒx is not con-
stant, where k is a field, then there is some extension field K= k containing all
the roots of f .
How to Think About It. For the familiar fields Q, R, and C, Kronecker’s
Theorem offers nothing new. The Fundamental Theorem of Algebra says that
every nonconstant f .x/ 2 CŒx has a root in C; it follows, by induction on
the degree of f , that all the roots of f lie in C; that is, f .x/ D a.x z1 /
.x zn /, where a 2 C and zj 2 C for all j . On the other hand, if k D Fp or
i i
“book2” — 2013/5/24 — 8:18 — page 301 — #321
i i
Example 7.31. (i) Let f .x/ D x n 1 2 QŒx, and let E=Q be a splitting
field. If D e 2 i=n is a primitive nth root of unity, then Q./ D E is
a splitting field of f , for every nth root of unity is a power of , and
j 2 Q./ for all j .
(ii) There are n distinct nth roots of unity in C, but there may be fewer roots
of unity over fields of characteristic p. For example, let f .x/ D x 3 1 2
Z3 Œx. Since x 3 1 D .x 1/3 , by Exercise 7.27 on page 293, we see
that there is only one cube root of unity here. N
i i
“book2” — 2013/5/24 — 8:18 — page 302 — #322
i i
How to Think About It. When we defined the field k.A/ obtained from a
field k by adjoining a set A, we assumed that A K for some extension field
K= k. But suppose no larger field K is given at the outset. For example, can
the roots of f .x/ D x 2 x 1 2 F3 Œx be adjoined to F3? Yes. In light
of Kronecker’s Theorem, there is some field extension K=F3 containing the
roots of f , say ˛; ˇ; now we do have the larger field, and so F3 .˛; ˇ/ makes
sense; we can adjoin the roots of f to F3 . Such an extension field K may not
be unique, but we shall see that any two of them are isomorphic.
By Theorem 6.52, the only possible rational roots of h are ˙1, and so we have
just proved that all these roots are irrational.
We claim that h is irreducible in QŒx (so, p D h after all). It suffices to
show that h has no quadratic factor q.x/ 2 QŒx (why?). If, on the contrary,
h D qq 0 for two monic quadratic polynomials in QŒx, then the roots of h are
paired up, two for q and two for q 0 . Suppose q.z/ D 0. Then the other root of
q, call it z 0 , is one of
p p p p p p
2 3; 2 3; 2 C 3:
Now, if q.x/ D x 2 Cbx Cc, then b D zCz 0 and c D zz 0 . But you can check,
for each choice of z 0 , that either z C z 0 or zz 0 is irrational. Since q 2 QŒx, this
is a contradiction, and so h is irreducible. p p
We now know that ŒE W Q D 4. Let F D Q. 2; 3/, so that we have a
F tower of fields Q E F . Theorem 7.27 tells us that
ŒF W Q D ŒF W EŒE W Q:
E 2
On the other hand,
p p
p ŒF W Q D ŒF W Q 2 ŒQ 2 W Q:
4 Q. 2/
p p
Now ŒQ 2 W Q D 2, because 2 is a root of the irreducible quadratic
2 p
x 2 2 in QŒx. We claim that ŒF W Q 2 2. The field F arises by ad-
Q p p p p
joining 3 to Q 2 ; either 3 2 Q 2 , in which case the degree is 1,
i i
“book2” — 2013/5/24 — 8:18 — page 303 — #323
i i
Our next goal is to show that splitting fields are unique up to isomorphism.
We paraphrase Theorem 7.25(ii).
longer equal; they are only isomorphic. The upshot is that we have to compli-
cate the statement of what we are going to prove in order to take account of
this.
First, recall Corollary 5.22:
i i
“book2” — 2013/5/24 — 8:18 — page 304 — #324
i i
Lemma 7.34. Let 'W k ! k 0 be an isomorphism of fields and ' W kŒx ! k 0 Œx
the isomorphism of Corollary 5.22; let p.x/ 2 kŒx be irreducible, and let
p 0 D ' .p/.
(i) p 0 is irreducible in k 0 Œx, and the map ˆW kŒx=.p/ ! k 0 Œx=.p 0 /, defined
by ˆW f C .p/ 7! ' .f / C .p 0 /; is an isomorphism of fields.
(ii) Let K= k be a field extension, let z 2 K be algebraic over k, and let
p.x/ D irr.z; k/. If p 0 D ' .p/ 2 k 0 Œx and z 0 is a root of p 0 in some
extension of k 0 , then ' can be extended to an isomorphism k.z/ ! k 0 .z 0 /
that maps z to z 0 .
Proof. (i) This is straightforward, for ' carries the ideal .p/ in kŒx onto the
ideal .p 0 / in k 0 Œx, and Exercise 7.8 on page 285 applies. Exercise 7.44
on page 308 asks you to give the details.
(ii) By (i), there are isomorphisms
0
W kŒx=.p/ ! k.z/ and W k 0 Œx=.p 0 / ! k 0 .z 0 /:
ˆ W kŒx=.p/ ! k 0 .x/=.p 0 /;
0 1
and the composite ıˆı is the desired isomorphism.
Here is a picture of the Lemma’s proof.
kŒx=.p/ k.z/
ˆ 0 ıˆı 1
0
k 0 Œx=.p 0/ k 0 .z 0 /
E D k.z/.z1 ; : : : ; zn /;
i i
“book2” — 2013/5/24 — 8:18 — page 305 — #325
i i
E D k.z; z1 ; : : : ; zn / D k.z/.z1 ; : : : ; zn /:
Corollary 7.36. If k is a field and f .x/ 2 kŒx, then any two splitting fields
of f over k are isomorphic via an isomorphism that fixes k pointwise.
Proof. Let E and E 0 be splitting fields of f .x/ over k. If ' is the identity, then
Theorem 7.35 applies at once.
We now show, given a prime power q D p n , that there exists a field with
n
p elements. Our guess is that Galois realized that C can be constructed by
adjoining a root of x 2 C 1 to R, so that it was natural for him (but not for
any of his contemporaries!) to adjoin a root of a polynomial to Fp . However,
Kronecker’s Theorem was not proved until a half century after Galois’s death.
g.x/ D x q x 2 Fp Œx:
i i
“book2” — 2013/5/24 — 8:18 — page 306 — #326
i i
E D fz 2 K W g.z/ D 0gI
that is, E is the set of all the roots of g. We claim that all the roots of g are
distinct. Since the derivative g0 .x/ D qx q 1 1 D p n x q 1 1 D 1 (by
Proposition 7.17), we have gcd.g; g0 / D 1. By Exercise 6.40 on page 263, all
the roots of g are, indeed, distinct; that is, E has exactly q D p n elements.
The theorem will follow if E is a subfield of K. Of course, 1 2 E. If a,
b 2 E, then aq D a and b q D b. Hence, .ab/q D aq b q D ab, and ab 2 E.
By Proposition 7.17, .a C b/q D aq C b q D a C b, so that a C b 2 E.
Therefore, E is a subring of K. Finally, if a ¤ 0, then Lemma 7.37 says that
aq 1 D 1, and so the inverse of a is aq 2 (which lies in E because E is closed
under multiplication).
It is remarkable that the next theorem was not proved until the 1890s, 60
E. H. Moore was an years after Galois discovered finite fields.
algebraist who later did
research in geometry and
Corollary 7.40 (Moore). Any two finite fields having exactly p n elements are
foundations of analysis.
isomorphic.
Finite fields are often called Galois fields in honor of their discoverer. In
light of Corollary 7.40, we may speak of the field with q elements, where
q D p n is a power of a prime p, and we denote it by
Fq :
The next example displays different finite fields with the same number of
elements; by Moore’s Theorem, they are isomorphic.
Example 7.41. (i) In Exercise 4.55 on page 165, we constructed the field F4
with four elements:
˚ b
F4 D ab aCb W a; b 2 F2 :
i i
“book2” — 2013/5/24 — 8:18 — page 307 — #327
i i
E D fa C bz W where z 2 C 1 D 0g:
F D fa C bu W where u2 C u 1 D 0g:
Without Moore’s Theorem, it is not instantly obvious that the two fields
are isomorphic. You can check that the map 'W E ! F (found by trial
and error), defined by '.a C bz/ D a C b.1 u/, is an isomorphism.
Now F3 Œx=.x 2 x 1/ is another field with nine elements; Exer-
cise 7.46 asks for an explicit isomorphism with E.
(iii) In Example 6.57, we exhibited eight monic irreducible cubics p.x/ 2
F3 Œx; each gives rise to a field F3 Œx=.p/ having 27 D 33 elements, and
Moore’s Theorem says that they are all isomorphic to one another. N
Actually, more is known when K is finite: it can be shown that every nonzero
element of K is a power of ˛ (not merely a linear combination of powers of ˛).
Exercises
7.39 Let f .x/; g.x/ 2 kŒx be monic polynomials, where k is a field. If g is irreducible
and every root of f (in an appropriate splitting field) is also a root of g, prove that
f D gm for some integer m 1.
Hint: Use induction on deg.h/.
7.40 Determine whether any of the following pairs of fields are isomorphic.
(i) Q.i / and Q. 21 .1 C i //
p
(ii) Q.i / and Q. 3/
p p
(iii) Q. 2/ and Q. 3/
p p
(iv) Q. 2/ and Q. 6/
i i
“book2” — 2013/5/24 — 8:18 — page 308 — #328
i i
F3 =.x 2 C 1/ Š F3 Œx=.x 2 x 1/
7.3 Connections:
Ruler–Compass Constructions
There are myths in several ancient civilizations in which the gods demand
precise solutions of mathematical problems in return for granting relief from
catastrophes. We quote from van der Waerden [35].
i i
“book2” — 2013/5/24 — 8:18 — page 309 — #329
i i
diagonal of a square with p sides of length 1), we are going to prove that it is
3
impossible to construct 2 by the methods of Euclidean geometry — that is,
by using only ruler and compass. (Actually, the gods were not so p cruel, for
the Greeks did use other methods. Thus, Menaechmus constructed 3 2 as the
intersection of the parabolas y 2 D 2x and x 2 D y; this is elementary for
us, but it was an ingenious feat when there was no analytic geometry and no
algebra. There was also a solution found by Nicomedes.)
There are several other geometric problems handed down from the Greeks.
Can one trisect every angle? Can one construct a regular n-gon? More pre-
cisely, can one inscribe a regular n-gon in the unit circle? Can one “square the
circle;” that is, can one construct a square whose area is equal to the area of
a given circle? Since the diskp with radius 1 has area , can one construct a
square with sides of length ?
If we are not careful, some of these problems appear ridiculously easy. For
example, a 60ı angle can be trisected using a protractor: just find 20ı and draw
the angle. Thus, it is essential to state the problems carefully and to agree on
certain ground rules. The Greek problems specify that only two tools, ruler
and compass, are allowed, and each must be used in only one way. The goal
of this section is to determine exactly what can be constructed using the two
“Euclidean tools.” The answer will involve some surprising applications of
ideas from this chapter.
How to Think About It. In many geometry classes, constructions are now
taught using dynamic geometry software. These environments can be used in
the same way that one uses physical rulers and compasses; the principles are
the same, and what’s possible in them is what’s possible with pencil and paper.
This brings up an important point. Constructions made in dynamic geometry
environments are likely to be more accurate than those carried out with pencil
and paper, but
p the goal here is not approximation—we are not contentp with
3 3
constructing 2 correct
p to 100 decimal places; the goal is to find 2 exactly,
just as we can find 2 exactly as the length of the diagonal of the unit square.
We now seek to determine just what constructions are possible, and so we must
use precise definitions.
The formal discussion begins with defining the tools by saying exactly what
each is allowed to do.
Definition. A ruler is a tool that can be used to draw the line L.P; Q/ deter-
mined by points P and Q.
A compass is a tool that can be used to draw circles; given two points P
and Q, it can draw C.P; Q/ and C.Q; P /.
What we are calling a ruler, others call a straightedge. For them, a ruler
can be used not only to draw lines but to measure distances as well.
i i
“book2” — 2013/5/24 — 8:18 — page 310 — #330
i i
How to Think About It. Just to show you how fussy we are, let us point
out a subtlety about what a compass cannot do. Suppose we are given three
points: P , Q, and R. We are allowed to draw the circle C.P; Q/ with center P
and radius r D PQ. But we are not allowed to draw the circle C.R; r / with
center R and radius r . Reason: a compass is allowed to draw a circle only
if two points are given at the outset; but the circle C.R; r / cannot be drawn
(using the compass as in the definition) because only one point, namely R, is
given at the outset. Our compass is called a collapsible compass as compared
to the more versatile compass that’s allowed to draw C.R; r /. We mention this
now only because the proof of Theorem 7.48(ii) may appear more complicated
than necessary (we’ll say something more there).
Constructions with ruler and compass are carried out in the plane. Since ev-
ery construction has only a finite number of steps, we shall be able to define
constructible points inductively. Once this precise definition is given, we will
be able to show that it is impossible to double the cube or to trisect arbitrary
angles using only a ruler and compass. Angles such as 90ı and 45ı can be
About 425 BCE , Hippias trisected using a ruler and compass (for we can construct a 30ı angle, which
of Elis was able to square can then be bisected), but we shall see that a 60ı angle is impossible to tri-
the circle by drawing a
sect. When we say impossible, we mean what we say; we do not mean that it is
certain curve as well as
lines and circles. We shall merely very difficult. You should ponder how anything can be proved to be im-
see that this construction is possible. This is an important idea, and we recommend letting students spend
impossible using only ruler an evening trying to trisect a 60ı angle by themselves as one step in teaching
and compass. them the difference between hard and impossible.
Given the plane, we establish a coordinate system by first choosing two dis-
tinct points, A and A0 ; call the line they determine the x-axis. Use a compass
to draw the two circles C.A; A0 / and C.A0 ; A/ of radius AA0 with centers A
and A0 , respectively (see Figure 7.1). These two circles intersect in two points
P1 and P2 ; the line L.P1 ; P2 / they determine is called the y-axis; it is the
perpendicular-bisector of AA0 , and it intersects the x-axis in a point O, called
the origin. We define the distance OA to be 1. We have introduced coordi-
nates into the plane; of course, O D .0; 0/, A D .1; 0/, and A0 D . 1; 0/.
Consider the point P1 in Figure 7.1. Now OAP1 is a right triangle with legs
OA and OP1 . The hypotenuse AP1 has length 2 D AA0 (for this is the radius p
of C.A; A0 /). Since OAp D 1, the Pythagorean Theorem gives P1 D .0; 3/.
Similarly, P2 D .0; 3/.
Informally, we construct a new point Q from old points E; F; G, and H by
using the first pair E ¤ F to draw a line or circle, the second pair G ¤ H to
draw a line or circle, and then obtaining Q as one of the points of intersection
P1
A¢ A
O
P2
i i
“book2” — 2013/5/24 — 8:18 — page 311 — #331
i i
of the two lines, of the line and the circle, or of the two circles. More generally,
a point is called constructible if it is obtained from A and A0 by a finite number
of such steps.
Given a pair of constructible points, we do not assert that every point on the
line or the circle they determine is constructible. For example, we can draw the
x-axis L.A0 ; A/, but, as we’ll see, not every point on it is constructible.
We now begin the formal discussion. Our goal is Theorem 7.52 which gives
an algebraic characterization of constructibility. Recall, given distinct points
P and Q in the plane, that L.P; Q/ is the line they determine and C.P; Q/ is
the circle with center P and radius PQ.
Proof. (i) The construction is the same as in Figure 7.1. Here, there are two
points of intersection of the circles C.A; B/ and C.B; A/, say P1 and P2 ,
and L.P1 ; P2 / is the perpendicular-bisector of AB.
(ii) The midpoint is the intersection of AB and its perpendicular-bisector.
P = (cos q, sin q)
X
Q
q
O A = (1, 0)
i i
“book2” — 2013/5/24 — 8:18 — page 312 — #332
i i
We know that the Parallel Here is the (tricky) constructible version of the Parallel Postulate.
Postulate is not true in
non-Euclidean geometry.
Lemma 7.45. If U; V; P are distinct constructible points with P … L.U; V /,
What hidden hypotheses
of Euclidean geometry then there is a constructible point Q with L.P; Q/ parallel to L.U; V /.
are we using to make this
construction?
Q P
L(U, V )
U B V
Proof. The proof refers to Figure 7.3. Choose U so that L.P; U / is not per-
pendicular to L D L.U; V /. Thus, L is not tangent to the circle C.P; U /, and
so C.P; U / meets L in another point, say B (of course, B is constructible).
Let Q 2 C.P; U / \ C.U; P /. Clearly, Q is constructible, and we claim that
L.P; Q/ is parallel to L. Indeed, we claim that the quadrilateral PBUQ is
a rhombus and hence it is a parallelogram. Now PQ is a radius of C.P; U /,
P U is a radius of both C.P; U / and C.B; U /, and BU is a radius of both
C.B; U / and C.U; P /. Hence, PQ D P U D PB, as we want.
In high school geometry, the goal is to construct certain figures with ruler
and compass. We are about to shift the focus, considering instead the notion
of constructible numbers. “Numbers?” Well, analytic geometry equips points
with coordinates, and we have seen how to regard points as complex numbers.
How to Think About It. We asked you earlier to contemplate how we could
prove that something is impossible. The basic strategy is an elaborate indirect
proof: assuming a certain point Y is constructible, we will reach a contra-
diction. The first step is essentially analytic geometry: replace points by their
coordinates, as we have just done by defining constructible complex numbers.
The next step involves modern algebra; don’t just consider one constructible
number; consider the set K of all constructible numbers, for the totality of
them may have extra structure that we can exploit. In fact, we will see that K
is a subfield of C. Not only can we translate points to numbers, we can also
i i
“book2” — 2013/5/24 — 8:18 — page 313 — #333
i i
Q = (0, b) P = ( a, b )
O B = (b, 0) A = (a, 0)
i i
“book2” — 2013/5/24 — 8:18 — page 314 — #334
i i
I b Q
a b
O P S
Figure 7.5. a C b.
Assume that a and b are positive. Let I D .0; 1/, P D .a; 0/, and
You are tempted to use Q D .b; 1/. Now Q is constructible: it is the intersection of the horizontal
a compass with center I line through I and the vertical line through .b; 0/, both of which can be
and radius b to draw
drawn by Lemma 7.45 (the latter point is constructible, by hypothesis).
C.P; b/. But this is illegal.
Remember: we’re using a The line through Q parallel to IP intersects the x-axis in S D .a C b; 0/,
collapsible compass that as desired.
requires two points given To construct b a, let P D . a; 0/ in Figure 7.5. Thus, a C b and
at the outset; here, only a C b are constructible; by part (i), a b and a b are also con-
one is available, namely P .
structible. Thus, a C b is constructible, no matter whether a and b are
both positive, both negative, or have opposite sign.
(iii) ab is constructible.
By part (i), we may assume that both a and b are positive. In Fig-
ure 7.6, A D .1; 0/, B D .1 C a; 0/, and C D .0; b/. Define D to be the
intersection of the y-axis and the line through B parallel to AC . Since the
triangles OAC and OBD are similar,
OB=OA D OD=OC I
i i
“book2” — 2013/5/24 — 8:18 — page 315 — #335
i i
C
b
1 a
O A B
T
1
S
a
1
O A B
Figure 7.7. a 1.
Let A D .1; 0/, S D .0; a/, and T D .0; 1 C a/. Define B as the
intersection of the x-axis and the line through T parallel to AS ; thus,
B D .1 C u; 0/ for some u. Similarity of the triangles OSA and OTB
gives
OT =OS D OB=OA:
a
O A Q P
p
Figure 7.8. a.
i i
“book2” — 2013/5/24 — 8:18 — page 316 — #336
i i
vertical line through A. The (right) triangles AOR and ARP are sim-
ilar, so that
OA=AR D AR=AP;
p
and hence AR D a.
(vi) If z D a C i b 2 K; then z D a i b is constructible.
By Lemma 7.47, K is a subfield of C. Now a; b 2 K, by Lemma 7.46,
and i 2 K, as we saw on page 310. It follows that bi 2 K, and so
a i b 2 K.
Proof. This follows from Theorem 7.48 and the quadratic formula.
As we said earlier, the next result is intuitively obvious, for the equation of
a line is linear and the equation of a circle is quadratic. However, the coming
proof involves some calculations.
then ŒF .˛/ W F 2.
i i
“book2” — 2013/5/24 — 8:18 — page 317 — #337
i i
Proof. If L.P; Q/ is not vertical, then Lemma 7.50(ii) says that L.P; Q/ has
equation y D mxCb, where m; b 2 F . If L.P; Q/ is vertical, then its equation
is x D b because P D .a; b/ 2 L.P; Q/, and so b 2 F , by Lemma 7.50(i).
Similarly, L.R; S / has equation y D nx C c or x D c, where m; b; n; c 2 F .
Since these lines are not parallel, one can solve the pair of linear equations for
.u; v/, the coordinates of ˛ 2 L.P; Q/ \ L.R; S /, and they also lie in F . In
this case, therefore, ŒF .˛/ W F D 1. See Exercises 7.52
Let L.P; Q/ have equation y D mx C b or x D b, and let C.R; S / have and 7.53 on page 326.
equation .x c/2 C .y d /2 D r 2 ; by Lemma 7.50, we have m; q; r 2 2 F .
Since ˛ D u C iv 2 L.P; Q/ \ C.R; S /,
r 2 D .u c/2 C .v d /2
D .u c/2 C .mu C q d /2 ;
Q D K0 K1 Kn C;
Pj C1 2 L.E; F / \ L.G; H /;
Pj C1 2 L.E; F / \ C.G; H /;
Pj C1 2 C.E; F / \ C.G; H /:
i i
“book2” — 2013/5/24 — 8:18 — page 318 — #338
i i
Proof. This follows from Theorems 7.52 and 7.27: If k E K are fields
with E= k and K=E finite extension fields, then ŒK W k D ŒK W EŒE W k.
The converse of Corollary 7.53 is false; it can be shown that there are non-
constructible numbers z with ŒQ.z/ W Q D 4 (see [27], p. 136).
Corollary 7.54. (i) The real number cos .2=7/ is not constructible.
(ii) The complex 7th root of unity 7 is not constructible.
Nicomedes solved the Theorem 7.55 (Wantzel). It is impossible to duplicate the cube using only
Delian problem of doubling ruler and compass.
the cube using a marked
ruler and compass. p3
Proof. The question is whether z D 2 is constructible. Since x 3 2 is
irreducible, ŒQ.z/ W Q D 3, by Theorem 7.20; but 3 is not a power of 2.
Consider how ingenious this proof is. At the beginning of this section, you
were asked to ponder how we can prove impossibility. As we said when we
outlined this argument, the constructibility of a point was translated into al-
gebra, and the existence of a geometric construction produces an arithmetic
contradiction. This is a spectacular use of the idea of modeling!
A student in one of our classes, imbued with the idea of continual progress
through technology, asked, “Will it ever be possible to duplicate the cube with
ruler and compass?” Impossible here is used in its literal sense.
Proof. We may assume that one side of the angle is on the x-axis, and so
the question is whether z D cos.20ı / C i sin.20ı / is constructible. If z were
constructible, then Lemma 7.46 would show that cos.20ı / is constructible.
i i
“book2” — 2013/5/24 — 8:18 — page 319 — #339
i i
A U V
U
V
U V
E X
O F
i i
“book2” — 2013/5/24 — 8:18 — page 320 — #340
i i
a g b
E C
O F
˛ D ı C ˇ;
because ˛ is an exterior angle of AOV , and hence it is the sum of the two
opposite internal angles. Since OAU is isosceles .OA and OU are radii),
ı D ", and so
˛ D " C ˇ:
˛ D 2ˇ C ˇ D 3ˇ:
In addition to investigating more powerful tools, one can look at what can
be accomplished with fewer tools. It was proved by Mohr in 1672 and, inde-
pendently, by Mascheroni in 1797, that every geometric construction carried
out by ruler and compass can be done without the ruler. There is a short proof
of the theorem given by Hungerbühler in American Mathematical Monthly,
101 (1994), pp. 784–787.
Which regular polygons can be inscribed in the unit circle using only
ruler and compass?
i i
“book2” — 2013/5/24 — 8:18 — page 321 — #341
i i
Because they can construct 90ı and 60ı angles, high school students can con-
struct squares and hexagons (just make right and 60ı central angles), and they
can connect every other vertex of their hexagon to inscribe an equilateral trian-
gle. Also, by using the perpendicular-bisector construction, they can inscribe
a regular polygon with twice as many sides as an already constructed one, so
they can inscribe regular polygons with 3 2n and 4 2n sides for any positive
integer n. Archimedes knew that is the area of the unit circle, and he approx-
imated it by inscribing and circumscribing a regular 96-gon (he began with a
regular hexagon and then doubled the number of sides four times).
This is about as far as most high school programs get, although some treat
polygons with 5 2n sides (using Exercise 3.48 to construct the decagon and We’ll revisit the construc-
then connecting every other vertex). This is also about as far as Greek geome- tion of the pentagon in just
a minute, putting it a more
ters got, although they also were able to show (see Exercise 7.67) that if a
general setting.
regular m-gon and and a regular n-gon are inscribable in a circle (again, with
only ruler and compass), then so is a regular nm-gon; for example, a regular
15-gon can be inscribed. However, it was unknown whether all regular poly-
gons could be so inscribed.
About 2000 years later, around 1796, Gauss—still in his teens—essentially
invented the main results in this section, and he applied them to the problem of
determining whether a regular polygon could be inscribed in a circle with ruler
and compass (he wrote that his main result on this problem led to his decision
to become a mathematician). We’ll develop his methods here.
Theorem 3.28 tells us that the vertices of a regular n-gon inscribed in the
unit circle can be realized in the complex plane as the set of roots to x n 1, Given the development
and that these roots are all powers of so far, you may already
see that the problem
can be translated to the
n D cos.2=n/ C i sin.2=n/:
algebra of constructible
complex numbers, but
So, we can recast our question about inscribability and ask: this was a huge leap
for mathematicians of
For which values of n is n a constructible number? Gauss’s time and certainly
out of reach for Greek
geometers.
Well, we can hit this question with Theorem 7.52:
Corollary 7.59. A regular n-gon can be inscribed in the unit circle with ruler
and compass if and only if there is a tower of fields
Q D K0 K1 Kn C;
Proof. Indeed, a regular n-gon can be so inscribed if and only if n and, hence,
all its powers, are constructible numbers.
Gauss showed how to construct such a tower when n D 17, and his method
was general in principle, leading to a complete classification of inscribable
regular polygons. Before we state the main result, let’s work through two ex-
amples as Gauss did (all laid out in detail by him in [14], Section VII).
Example 7.60. In Example 3.34 on page 113, we showed how to find explicit
formulas for the vertices of a regular pentagon inscribed in the unit circle.
i i
“book2” — 2013/5/24 — 8:18 — page 322 — #342
i i
Let’s look at this from the perspective of this chapter. Write instead of 5 .
The nonreal roots of x 5 1, namely ; 2 ; 3 ; 4 ; are the roots of the irre-
ducible polynomial
ˆ5 .x/ D x 4 C x 3 C x 2 C x C 1:
It follows that
ŒQ./ W Q D 4;
so Corollary 7.53 tells us that there’s a chance that is constructible. In Ex-
ample 3.34, without using this language, we actually constructed the tower of
quadratic extensions necessary to guarantee that is, in fact, constructible. We
showed that if g D C 4 D 2 cos.2=5/ and h D 2 C 3 D cos.4=5/,
then g and h are roots of the quadratic equation x 2 C x 1, so that
ŒQ. C 4 / W Q D 2:
By Theorem 7.27,
ŒQ./ W Q. C 4 /ŒQ. C 4 / W Q D ŒQ./ W Q D 4;
Q./
so that
2 ŒQ./ W Q. C 4 / D 2;
Q. C 4 / and we have our tower of quadratic extensions:
2 Q Q. C 4 / Q./ N
Q
Gauss’s construction of the 17-gon
Stepping back a bit, we can describe what we did with the pentagon: the non-
real roots are
f; 2 ; 3 ; 4 g
(we are still writing instead of 5 ). There are four roots. The first story of our
tower is Q. C 4 /, generated by the sums of pairs of the roots: C 4 and
2 C 3 . The top story, Q./, is generated by the individual roots themselves.
This is the basic idea behind Gauss’s insight into the 17-gon, but the situa-
tion here is more complicated. Change notation again; now let
D 17 D cos.2=17/ C i sin.2=17/:
i i
“book2” — 2013/5/24 — 8:18 — page 323 — #343
i i
ˆ.x/ D x 16 C x 15 C C x 2 C x C 1;
˚
we have ŒQ./ W Q D 16. There are sixteen roots of ˆ17 : k W 0 k 15 .
Together with 1, these points on the unit circle are the vertices of our regular
17-gon.
For each factorization 16 D ef , Gauss divided the roots into e sums of We’ll see what method
f D 16=e roots each: e;0 ; e;1; : : : ; e;e 1 , where each e;k is a sum of f Gauss used to partition the
roots into these sums in
roots. For example, he divided the sixteen roots into two sums of eight each,
just a minute.
which we can call 2;0 and 2;1 , as follows:
2;0 D C 9 C 13 C 15 C 16 C 8 C 4 C 2
2;1 D 3 C 10 C 5 C 11 C 14 C 7 C 12 C 6 :
4;0 D C 13 C 16 C 4
4;1 D 3 C 5 C 14 C 12
4;2 D 9 C 15 C 8 C 2
4;3 D 10 C 11 C 7 C 6 :
And there are eight sums of two each: Each period of length > 1
is a real number; you can
8;0 D C 16 check that if k occurs in
e;k , so does
8;1 D 3 C 14
8;2 D 9 C 8 17 k
D k
D k;
8;6 D 15 C 2
8;7 D 11 C 6 :
Gauss called each of the e;k a period of length f D 16=e. The plan is
to show that the periods of length eight lie in a quadratic extension K1 of Q,
the periods of length four lie in a quadratic extension K2 of K1 , and so on,
building a tower of quadratic extensions ending with Q./.
The calculations will sometimes be involved so, once again, pull out your If you use a CAS, you can
pencil or computer. perform all of these calcu-
lations in QŒx=.ˆ.x//.
Because ˆ./ D 0, we see that 2;0 C 2;1 D 1.
With a little patience and care (or a CAS), you can check that
i i
“book2” — 2013/5/24 — 8:18 — page 324 — #344
i i
ŒQ.2;1 / W Q 2:
You can also check that The first step in our tower is Q Q.2;1 /. Note that 2;0 D 4=2;1 , so that
2
2;0 D n;1 (Exercise 7.60 2;1 2 Q.2;0 /. Next, we move up to the periods of length 4. You can check
on page 327). (Exercise 7.61 on page 327) that
ŒQ.4;1 / W Q.2;1 / 2:
Q Q.2;1 / Q.4;1 /:
So, 8;1 and 8;5 are roots of x 2 4;1 x C 4;2 . This says that
all the degrees are equal to 2 (Theorem 7.27). Hence, we have constructed a
tower of fields, each quadratic over the one below, starting with Q and ending
with Q./. We have proved that is constructible.
Theorem 7.61. A regular 17-gon can be inscribed in the unit circle with ruler
and compass.
i i
“book2” — 2013/5/24 — 8:18 — page 325 — #345
i i
One detail that remains is to see what method Gauss used to assign different One of the reasons has
powers of to each period; how did Gauss decide which powers of should already been mentioned:
each period should contain
occur in each e;k ? The answer comes from Galois theory (a subject we only
a sum of terms, each
briefly touch on in Chapter 9). He employed an ingenious method using the of form k C k . This
fact that 3 is a primitive element in F17; that is, every nonzero element in F17 ensures that every story in
is a power of 3. our tower except the last
is contained in R, so we
“save the complex step” for
k 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
the end (why is this a good
3k 1 3 9 10 13 5 15 11 16 14 8 7 4 12 2 6 thing?).
Gauss used this special property of 3 to define the periods: if ef D 16, there
are k periods of length f defined by
f
X1 kCje
e;k D 3 :
j D0
Sufficiency of the following theorem, a feat the Greeks would have envied,
was discovered by Gauss around 1796. He claimed necessity as well, but none
of his published papers contains a complete proof of it. The first published
proof of necessity is due to Wantzel, in 1837.
p D 2s C 1:
x k C 1 D .x C 1/.x k 1
xk 2
C xk 3
C 1/:
p D 2s C 1 D .2m /k C 1
D Œ2m C 1Œ.2m /k 1
.2m /k 2
C .2m /k 3
C 1:
The only known Fermat primes are 3, 5, 17, 257, and 65537. It follows from
Theorem 7.62, for example, that it is impossible to construct regular 7-gons,
11-gons, or 13-gons.
i i
“book2” — 2013/5/24 — 8:18 — page 326 — #346
i i
Theorem 7.63. A regular n-gon is constructible with ruler and compass if and
only if n D 2k p1 p t , where k 0 and the pi ’s are distinct Fermat primes.
Exercises
7.50 Explain how to carry out each of the following constructions with ruler and com-
pass. Prove that your method works.
(i) Copy a segment.
(ii) Copy an angle.
(iii) Construct a line parallel to a given one through a given point not on the line.
(iv) Construct a line perpendicular to a given one through a given point either on
or off the line.
7.51 *
(i) Prove that every lattice point .m; n/ in the plane is constructible. Conclude
that every Gaussian integer is constructible.
(ii) Prove that every Eisenstein integer is constructible.
7.52 Suppose that ` and `0 are lines with equations ax C by D c and dx C ey D f ,
and suppose that the coefficients of the equations are all in a field k.
(i) What condition on the coefficients guarantees that ` and `0 intersect in a
unique point?
(ii) If ` and `0 intersect in a unique point P , show that P is point whose coordi-
nates are in k.
7.53 If the quadratic polynomial ax 2 C bx C c has coefficients in a field k, show that
its roots are a quadratic extension of k.
7.54 Given a segment of length a, show how to construct a segment of length a=5.
7.55 Given a segment of length a, show how to construct a segment of length a=n,
where n is any positive integer.
7.56 Show how to construct segments of length
p p
(i) 5 (ii) 3 C 5
p p p p
3C 5
(iii) 3 C 5 (iv) 3
p p
3C 5
(v) p .
3 5
i i
“book2” — 2013/5/24 — 8:18 — page 327 — #347
i i
k C k
D 2 cos.2k=17/:
7.65 Show that 3 is a primitive element for Z5 , and apply this to Example 7.60.
7.66 Find the minimal polynomial over Q of 8;k for all 0 k 7.
7.67 Show that if .m; n/ D 1, and if n and m are constructible, so is mn .
Hint: Use Theorem 7.63.
i i
“book2” — 2013/5/24 — 8:18 — page 328 — #348
i i
i i
“book2” — 2013/5/24 — 8:18 — page 329 — #349
i i
8 Cyclotomic Integers
After proving Corollary 1.8, the special case of Fermat’s Last Theorem for
exponent n D 4, we observed that the full theorem would follow if we could
prove, for every odd prime p, that there are no positive integers a; b; c with
ap C b p D c p . It is natural to factor this expression as in Exercise 3.50(ii) on
page 115: for odd p, we have
.a C b/.a C b/ .a C p 1
b/ D c p ; (8.1)
where D e 2 i=p is a pth root of unity. We didn’t have the language of rings
at the time but, later you showed, in Exercise 4.65 on page 168, that the cyclo-
tomic integers ZŒ is a domain. How could we begin to use this observation?
Recall Exercise 2.14 on page 59: if ab D c n in Z, where n is a positive integer
and a; b are relatively prime, then both a and b are nth powers. If ZŒ behaved
like Z and the factors on the left-hand side of Eq. (8.1) are pairwise relatively
prime, then all the factors a C j b would be pth powers; that is,
a C j b D djp
for some dj 2 ZŒ. For example, consider the case p D 3, so that ZŒ D
ZŒ! is the ring of Eisenstein integers. The factorization is
.a C b/.a C !b/.a C ! 2 b/ D c 3 :
If the factors on the left-hand side are pairwise relatively prime, then each
of them is a cube of an Eisenstein integer. We can even say something if the
factors are not relatively prime: assuming that ZŒ! has unique factorization,
any prime divisor of c must occur three times on the left-hand side (prime now
means a prime in ZŒ!; that is, an element whose only divisors are units and
associates—we will use the term prime here instead of irreducible).
But does the ring of cyclotomic integers behave as Z? To solve Exercise
2.14, we need to use the Fundamental Theorem of Arithmetic: factorization
into primes exists and is essentially unique. It turns out that some rings of cy-
clotomic integers do enjoy unique factorization into primes, but some do not.
Indeed, it is known (see [36], p. 7) that ZŒ23 does not have unique factoriza-
tion.
It’s clear that we need a more thorough investigation of the arithmetic in
rings of cyclotomic integers. In particular, we already know the units in ZŒi
and ZŒ! (Example 6.3); what are the primes?
In Section 8.1, we retrace the by-now-familiar developments in Chapters 1
and 6 to establish division algorithms in ZŒi and ZŒ! (using norm to mea-
sure size). Even though these are the easiest rings of cyclotomic integers, this
329
i i
“book2” — 2013/5/24 — 8:18 — page 330 — #350
i i
will give us a clue how to proceed with ZŒ for other roots of unity . There
will be a bonus: we’ll be able to prove Fermat’s Two-Square Theorem that
characterizes all primes in Z which are sums of two squares.
As is our custom (because it is so useful), we’ll generalize from these and
the earlier examples of Z and kŒx, where k is a field, to define a Euclidean
domain—a domain having a generalized division algorithm. We’ll show that
every Euclidean domain is a PID, so that, by Theorem 6.50, Euclidean domains
are UFDs and thus have unique factorization.
In Section 8.2, we’ll see that there are primes in ZŒi and ZŒ! that are not
ordinary integers. On the other hand, some primes in Z remain prime in the
larger rings, while some split into non-unit factors. We’ll then make a complete
analysis of this phenomenon for ZŒi and sketch the analogous theory for ZŒ!.
In Section 8.3, we’ll prove Fermat’s theorem for exponent 3. The fact that
there are no non-trivial integer solutions to x 3 C y 3 D z 3 is attributed to Euler;
we’ll prove the result making basic use of the arithmetic of ZŒ!.
In Section 8.4, we’ll briefly sketch how the proof for exponent 3 gener-
alizes to prime exponent p when the ring of cyclotomic integers ZŒp is a
UFD, where D e 2 i=p . But there are primes p for which ZŒp does not
have unique factorization. What then? We’ll finish this section with a brief
discussion about how Kummer’s construct of ideal numbers (which Dedekind
recognized as equivalent to what we now call ideals, which is why they are
so-called) could be used to restore a kind of unique factorization to ZŒp .
Finally, in Section 8.5 we develop the machinery to prove a lovely theorem
of Fermat that determines the number of ways a positive integer can be written
as the sum of two perfect squares.
i i
“book2” — 2013/5/24 — 8:18 — page 331 — #351
i i
The idea is to take q to be the Gaussian integer closest to w=z in the complex
plane, and then to find an r that makes up the difference. Since
w 137 126
D C i 3:34 C 3:07i;
z 41 41
we’ll take q D 3 C 3i . What about r ? There’s no choice; since we want w D
qz C r , set r D w qz:
r Dw qz D . 211 C 102i / .3 C 3i /. 19 C 48i / D 10 C 15i:
By construction, w D qz C r . What’s more, that q is the closest Gaussian
integer to w=z implies, as we’ll see in the proof of the next proposition, that
N.r / < N.z/. Indeed, N.z/ is much bigger that N.r / in this example, because
w=z is so close to q.
N.r / D . 10/2 C .15/2 D 325 and N.z/ D . 19/2 C .48/2 D 2665: N
Proof. Suppose that w=z D a C bi , where a and b are rational numbers (but
not necessarily integers). As in Example 8.1, take q to be a Gaussian integer
closest to w=z in the complex plane; more precisely, choose integers m and n
so that
1
ja mj 2
and jb nj 21 ;
and let q D m C ni . Now define r to be the difference:
rDw qz:
Clearly, w D qz C r , so the only thing to check is whether N.r / < N.z/. To
this end, we have
w w
N.r / D N.w qz/ D N z q D N.z/N q :
z z
But w=z q D .a m/ C .b n/i , so that
w
N q D .a m/2 C .b n/2 14 C 14 < 1:
z
It follows that N.r / < N.z/.
How to Think About It. The earlier statements of the division algorithms
for Z (Theorem 1.15) and for kŒx, where k is a field (Theorem 6.11), differ
from that in Proposition 8.2; the latter statement does not assert uniqueness of
quotient and remainder.
In fact, the way q and r are constructed shows that there may be several
choices for q and r —locate w=z inside a unit square in the complex plane
whose vertices are Gaussian integers, and then pick a closest vertex. There
may be several of these, as the next example shows. Luckily, we won’t need
uniqueness of quotients and remainders to get unique factorization into primes.
i i
“book2” — 2013/5/24 — 8:18 — page 332 — #352
i i
9 C 17i D .2 C 3i /.2 C 4i / C . 1 C 3i /I
9 C 17i D .3 C 3i /.2 C 4i / C . 3 i /I
9 C 17i D .3 C 4i /.2 C 4i / C .1 3i /I
9 C 17i D .2 C 4i /.2 C 4i / C .3 C i /:
All of these work. In fact, all the remainders have (the same) norm 10 < 20 D
N.z/. Even more: the remainders are all associates. Is this an accident? See
Exercises 8.1 through 8.4 on page 336. N
Alas, there are other There is an analogous result for the Eisenstein integers ZŒ!, and its proof
rings ZŒ of cyclotomic is almost identical to that for the Gaussian integers. Recall that c C d! D
integers which do not have c C d! 2 D c C d. 1 !/, and that
a generalized division
algorithm.
N.c C d!/ D c 2 cd C d 2 :
Proof. Suppose that w=z D a C b!, where a and b are rational numbers (but
not necessarily integers). Take q to be a Eisenstein integer closest to w=z in
the complex plane (with respect to the norm); more precisely, choose integers
m and n so that
1
ja mj 2
and jb nj 21 ;
r Dw qz:
i i
“book2” — 2013/5/24 — 8:18 — page 333 — #353
i i
Why can’t we modify the proof of Proposition 8.4 to prove the result for
every ring of cyclotomic integers? The short answer is that there are counter-
examples. But the reason the proof fails to generalize is that we can’t verify
N.r / < N.z/ in every ZŒ.
Euclidean Domains
Looking at our main examples—Z; kŒx; ZŒi ; ZŒ!—we see that one key
to a division algorithm is a measure of size: absolute value for Z, degree for
kŒx, norm for ZŒi and ZŒ!. Now we generalize.
Definition. A Euclidean domain is a domain R equipped with a size function @ is defined on the nonzero
elements of R and takes
@WR f0g ! N nonnegative integer val-
ues.
such that, for all a; b 2 R with a ¤ 0, there exist q and r in R with
b D qa C r;
i i
“book2” — 2013/5/24 — 8:18 — page 334 — #354
i i
Some size functions have extra properties. For example, when R is a do-
main, then RŒx is a domain, and degree (which is a size function on RŒx)
satisfies deg.fg/ D deg.f / C deg.g/, while norm N (which is a size function
on ZŒi and ZŒ!) satisfies N.˛ˇ/ D N.˛/N.ˇ/. On the other hand, if @ is a
size function of a Euclidean domain R, then so is @0 , where @0 .a/ D @.a/ C 1
for all a 2 R f0g. It follows that a size function may have no algebraic
properties; moreover, there may be no elements in R having size 0.
Euclidean domains have nice properties. The proof of the next proposition
is essentially the same as that of Theorem 1.19.
S D [email protected]/ W z 2 I g
z D qd C r;
Corollary 8.8 probably piques your curiosity about what primes look like in
Euclidean domains. We’ll consider this question for ZŒi and ZŒ! in the next
section.
i i
“book2” — 2013/5/24 — 8:18 — page 335 — #355
i i
and ZŒ!, are not deterministic: there is a choice about how to calculate
quotients and remainders.
In Section 1.3, we studied a direct path in Euclid’s Elements from the Divi-
sion Algorithm in Z to the Fundamental Theorem of Arithmetic. This path
can be followed in a much more general setting. We just saw that every Eu-
clidean domain enjoys Euclid’s Lemma and a fundamental theorem.
One way to show that a domain is a PID is to show that it is Euclidean—
indeed, this is one of the most important uses of this notion. On the other
hand, it’s hard from first principles to show that a domain is not Euclidean
(you have to show that no size function exists). Often, the easiest way to
show that a domain is not Euclidean is to show that it’s not a PID. So, for
example, ZŒx is not a Euclidean domain.
There are PIDs that are not Euclidean, so that the converse of Proposition 8.6
is false. Motzkin found a property of Euclidean domains that can be de-
fined without mentioning its size function. He called an element d in an
arbitrary domain R a universal side divisor if d is not a unit and, for ev-
ery r 2 R, either d j r or there is some unit u 2 R with d j .r C u/.
He then proved that every Euclidean domain contains a universal side divi-
sor, namely
p any non-unit of smallest size. Now it was known that if ˛ D
1
2
.1 C 19/, then the ring ZŒ˛ is a PID. Motzkin then showed that ZŒ˛
has no universal side divisors, and he concluded that ZŒ˛ is a PID that is
not a Euclidean domain (see Wilson, A principal ideal ring that is not a
Euclidean ring. Math. Magazine 46 (1973), 34–38 and Williams, Note
on non-Euclidean principal ideal domains, Math. Magazine 48 (1975),
176–177).
The fact that a Euclidean domain is a PID allows us a to talk about gcd’s,
thanks to Theorem 6.46. Using exactly the same logic as in Chapter 1, we
can iterate division, creating a Euclidean algorithm that finds a gcd for us:
just move factors on one line southwest on the next line (as in the next
example).
Example 8.9. Building on the calculation in Example 8.5, let’s find a gcd for
91 C 84! and 34 C 53! in ZŒ!. We’ll use the algorithm outlined in Proposi-
tion 8.4 to carry out the divisions (a CAS is very useful here). There are four
equations, which we present in “southwestern style:” if a row has the form
f D qh C r , then the next row moves h and r southwest and looks like
h D q 0r C r 0.
i i
“book2” — 2013/5/24 — 8:18 — page 336 — #356
i i
1 !
34C53! 91C84!
87C72! 3 !
4C12! 34C53!
24C44! 1C !
10C 9! 4C12!
1C10! 4 C !
3C 2! 10C9!
10C9!
0
Recall that in a general So, we end with 3 C 2!. Repeated application of Exercise 8.6 on page 337
PID, the gcd of two shows that there is a chain of equalities of ideals:
elements a and b is a
generator d of the principal .91 C 84!; 34 C 53!/ D .34 C 53!; 4 C 12!/ D .4 C 12!; 10 C 9!/
ideal consisting of all linear
combinations of a and b; in D .10 C 9!; 3 C 2!/ D .3 C 2!/I
symbols, .a; b/ D .d /.
that is,
How to Think About It. We’ve seen, in ZŒi or ZŒ!, that there are some-
times choices for quotients and corresponding remainders in the generalized
division algorithms. Hence, there may be more than one way to implement the
Euclidean Algorithm and, so, more than one end result. But, thanks to Proposi-
tion 6.45 and the fact that ZŒi and ZŒ! are PIDs, any two gcd’s are associates.
See Exercise 8.7 below for an example.
Exercises
8.1 Prove or Disprove and Salvage if Possible. Two Gaussian integers are associates
if and only if they have the same norm.
8.2 How many possible numbers of “closest Gaussian integers” to a complex number
are there? For each number, give an example.
8.3 Let z and w be Gaussian integers, and suppose that q and q 0 are Gaussian integers
equidistant from w=z in the complex plane. Show that
w w
N. q/ D N. q 0 /:
z z
8.4 Let z and w be Gaussian integers, and suppose that q and q 0 are Gaussian integers
equidistant from w=z in the complex plane. Are w=z q and w=zz q 0 associates?
If so, prove it; if not, give a counterexample.
i i
“book2” — 2013/5/24 — 8:18 — page 337 — #357
i i
8.5 If z ¤ 0 and w are Eisenstein integers, how many possible numbers of quotients
w=z are there in ZŒ! satisfying the conditions of the division algorithm? For
each number, give an example.
8.6 * Let R be a commutative ring. If a, b, c, and d are elements of R such that
b D da C c, show that there is equality of ideals .b; a/ D .a; c/.
8.7 If z D 6 C 12i and w D 13 C 74i , show that
w 9 10
D C i:
z 2 3
(i) Show that there are two q’s with w D qz C r in the generalized division
algorithm, namely q D 4 C 3i and q D 5 C 3i .
(ii) Apply the Euclidean Algorithm to find a gcd of z and w starting with q D
4 C 3i .
(iii) Apply the Euclidean Algorithm to find a gcd of z and w starting with q D
5 C 3i .
(iv) Are the two gcd’s associates in ZŒi ?
i i
“book2” — 2013/5/24 — 8:18 — page 338 — #358
i i
The converse of Proposition 8.11 is false; we’ll soon see that 7 is prime
in ZŒi , but that N.7/ D 49.
w D .3 C 3i /z C . 10 C 15i /
3 C 3i D .3 i /. 10 C 15i / C . 4 7i /
10 C 15i D . 1 2i /. 4 7i /:
3C 3i
19C48i 211C102i
201C 87i 3 i
10C 15i 19C48i
15C55i 1 2i
4 7i 10C15i
10C15i
0
Again we see that gcd.z; w/ D 4 C 7i . So 4 C 7i is a factor of both z
and w.
(ii) Is the gcd 4 C 7i prime? If not, can we factor it explicitly? Since
N.4 C 7i / D 65 D 13 5;
i i
“book2” — 2013/5/24 — 8:18 — page 339 — #359
i i
Putting it all together, we have the prime factorization of w: In Section 3.4, we used the
fact that
w D 211 C 102i 5 C 12i D .3 C 2i/2 when
we generated Pythagorean
D .4 C 7i /. 2 C 29i /
triples with Gaussian
D .2 C i /.3 C 2i /.2 C i /.3 C 2i /2 integers.
D .2 C i /2 .3 C 2i /3 :
We leave it to you to find the prime factorization of z and to show that
gcd.z; w/ lcm.z; w/ D zw. N
Laws of Decomposition
We now describe the primes in ZŒi (there will be a similar story for ZŒ!).
The next lemma lets us concentrate on how primes downstairs in Z behave
when they are viewed as elements upstairs in ZŒi .
Notation. It gets tedious to keep saying “let p be a prime in Z:” From now on,
let’s call primes in Z rational primes to distinguish them from primes in other
rings. Remember that a prime (or irreducible) element in a commutative ring
is one whose only divisors are units and associates. We may also say rational
integer to distinguish an ordinary integer in Z from a Gaussian integer, an
Eisenstein integer or, more generally, a cyclotomic integer.
Proof. Every Gaussian integer z divides its norm in ZŒi , for N.z/ D zz. In
particular, divides a rational integer, namely its norm. Now N. / factors
into primes in Z:
D N. / D p1 p2 : : : pk ;
and so
j p1 p2 : : : pk : The primes on the right-
hand side are elements of
But is a prime in ZŒi ; hence, by Euclid’s Lemma in ZŒi , it divides one of ZŒi as well as of Z.
the (prime) factors pj on the right.
i i
“book2” — 2013/5/24 — 8:18 — page 340 — #360
i i
As we said earlier, there are primes in Z that remain prime in ZŒi and
others that factor into new primes; the same is true for ZŒ!. It turns out that
there’s a beautiful theory, going back to Gauss, for how primes decompose in
these rings, a theory that brings together many of the ideas you’ve studied so
far. For example, here are some factorizations of rational primes when viewed
as elements in ZŒi :
5 D .2 C i /.2 i /I 13 D .3 C 2i /.3 2i /I 29 D .5 C 2i /.5 2i /:
In each of these cases, the rational prime decomposes as a norm: the product
of a Gaussian integer and its conjugate. This is always the case.
Lemma 8.15. Let p be a rational prime. If p is not prime in ZŒi , then there
exists some prime z in ZŒi with
p D zz D N.z/:
Proof. Suppose p D zw, where z and w are non-unit Gaussian integers. Then
p 2 D N.p/ D N.zw/ D N.z/ N.w/;
And, in fact, z and w must where neither N.z/ nor N.w/ is 1. But this is an equation in Z, and so unique
be associates. factorization in Z gives p D N.z/ D zz. Finally, z is prime in ZŒi , by
Proposition 8.11, because N.z/ is a rational prime.
Lemma 8.15 narrows the situation quite a bit. It says that if a rational prime
factors in ZŒi , it factors into exactly two conjugate Gaussian integers, each
prime in ZŒi . We say that such a rational prime splits in ZŒi . We can state
the result of Lemma 8.15 using only the arithmetic of Z. Since N.a C bi / D
a2 C b 2 , the lemma says that a prime splits if it can be written as the sum of
two perfect squares. And the converse is also true.
Proof. If p splits in ZŒi , then Lemma 8.15 says that there is a Gaussian integer
z D a C bi such that p D N.z/ D a2 C b 2 .
Conversely, if p D a2 C b 2, then p D .a C bi /.a bi /. But a C bi is prime
in ZŒi , by Proposition 8.11, because its norm, N.a C bi / D p, is a rational
prime.
The question of which rational primes split in ZŒi thus comes down to the
question of which primes are sums of two squares. Not every rational prime is
a sum of two squares; for example, it’s easy to see that 11 is not. Here is a nice
(and perhaps surprising) connection to modular arithmetic. A quick example
gives the idea. The prime 29 is the sum of two squares:
29 D 22 C 52 :
i i
“book2” — 2013/5/24 — 8:18 — page 341 — #361
i i
As an equation in F29 , this says that 22 C 52 D 0. Multiply both sides by 62 , Recall that Fp is another
for 6 D 5 1 in F29 : notation for Zp , the field of
integers modulo p.
0 D 22 62 C 52 62
D .2 6/2 C .5 6/2
D .2 6/2 C 1;
so that 2 6 D 12 is a root of x 2 C 1 in F29 . More generally, suppose that p
is a prime and p D a2 C b 2. We can assume that 0 < a; b < p, so that both
a and b are units in Fp . We can write this as an equation: a2 C b 2 D 0 in Fp .
2
Multiplying both sides by b 1 , we get:
2
ab 1 C 1 D 0I
1
that is, ab is a root of x 2 C 1 in Fp . And the converse is true as well:
Corollary 8.18. A rational prime p factors in ZŒi if and only if x 2 C 1 has a So, p factors in ZŒi if and
root in Fp . only if x 2 C 1 factors in
Fp Œx.
Proof. Apply Proposition 8.17 and the Factor Theorem (Corollary 6.15).
Let’s summarize these various equivalent statements about a rational prime p.
p factors in ZŒi .
p D N.z/ for some z in ZŒi .
p D a2 C b 2 in Z.
x 2 C 1 has a root in Fp .
The last criterion may seem the most remote, but it is actually the easiest to
use—you have to check at most .p 1/=2 possible solutions to x 2 C 1 D 0
(because if ˛ is a solution, so is ˛). If you try a few numerical cases, a pat-
tern begins to emerge—the primes satisfying the last criterion all seem to be
congruent to 1 mod 4. That’s quite a beautiful and elegant result, which adds
one more equivalent statement to the summarizing list above.
i i
“book2” — 2013/5/24 — 8:18 — page 342 — #362
i i
Hence,
xp 1
1 D x 4k 1
4 k
D x 1
h k k k i
1 2 3
D .x 4 1/ x4 C x4 C x4 CC1
by Exercise 6.47 on page 269
h k 1 k k i
2 3
D .x C 1/ .x 2 1/ x 4
2
C x4 C x4 CC1
D .x 2 C 1/h.x/: (8.3)
Comparing Eqs. (8.2) and (8.3), the two factorizations of x p 1 1, and using
unique factorization in Fp Œx, we see that x 2 C 1 D .x ˛/.x ˇ/ for some
˛; ˇ 2 Fp .
Conversely, if p is odd and p 6 1 mod 4, then p 3 mod 4 (it can’t be
congruent to 0 or 2). But, by Proposition 8.17, if x 2 C 1 has a root in Fp , then
p is the sum of two squares in Z. However, the sum of two squares in Z is
never congruent to 3 mod 4: If a D 0; 1; 2; 3, then a2 0; 1; 0; 1 mod 4, and
so a2 C b 2 0; 1; 2 mod 4.
Propostion 8.16, when combined with Theorem 8.19, gives us a nice fact of
arithmetic, first established by Gauss.
The name of a theorem
may not coincide with the Corollary 8.20 (Fermat’s Two-Square Theorem). An odd rational prime p
name of the first person is a sum of two squares if and only if p 1 mod 4.
who proved it.
Theorem 8.19 tells the whole story for odd primes: primes that are congru-
ent to 1 mod 4 split into two conjugate factors, and primes that are congruent to
3 mod 4 stay prime (we call primes downstairs that stay prime upstairs inert).
There is one prime we haven’t yet considered: p D 2. Now 2 factors in ZŒi ,
because x 2 C 1 factors in Z2 Œx. In fact
2 D .1 C i /.1 i /:
Are there other primes in But note that these two factors are associates:
ZŒi that are associate
to their conjugates? That 1 C i D i.1 i /;
question is Exercise 8.10
on page 343. and so
2 D i.1 i /2 :
i i
“book2” — 2013/5/24 — 8:18 — page 343 — #363
i i
How to Think About It. The fact that 2 D i.1 i /2 can be stated in terms
of ideals in Z[i]: there is equality of ideals
.2/ D .1 i /2 :
In fact, if we use the definition of the product of ideals from Exercise 5.51 on
page 220, the above equation of ideals can be written as
.2/ D .1 i / .1 i / D .1 i /2 :
Exercises
8.8 (i) In Example 8.12 we found a gcd of z D 19 C 48i and w D 211 C 102i
to be 4 C 7i . Write 4 C 7i as a linear combination of z and w.
(ii) Use part (i) to find the prime factorization of z.
(iii) Show that gcd.z; w/ lcm.z; w/ D zw.
8.9 Show that if two Gaussian integers z and w have relatively prime norms in Z,
then z and w are relatively prime in ZŒi . Is the converse true?
8.10 * Which primes in ZŒi are associate to their conjugates?
8.11 How many non-associate primes in ZŒi lie above 5?
8.12 In ZŒi , show that every associate of a C bi is conjugate to an associate of b C ai .
8.13 Show that every Gaussian integer is associate to one in the first quadrant of the
complex plane. (We define the first quadrant to include the nonnegative x-axis but
not the positive y-axis.)
8.14 Show that if two integers a and b can each be written as the sum of two squares,
so can ab.
i i
“book2” — 2013/5/24 — 8:18 — page 344 — #364
i i
Eisenstein Primes
The whole theory just given for ZŒi carries over to ZŒ!. Of course, the state-
ments have to be modified slightly, but the proofs are almost identical to the
corresponding results in ZŒi . If you think about it, this shouldn’t be a surprise:
a proof using only algebraic properties of norm (for example, it is multiplica-
tive) and properties of PIDs (unique factorization and Euclid’s Lemma) should
carry over mutatis mutandis.
We summarize the results for Eisenstein integers, providing sketches of
proofs where we think it’s necessary, but we leave the details to you. And
these are important exercises, because they will help you digest the ideas in
both rings.
In each of these cases, the prime in Z decomposes in ZŒ! into a norm: the
product of an Eisenstein integer and its conjugate. This is always the case.
i i
“book2” — 2013/5/24 — 8:18 — page 345 — #365
i i
Corollary 8.27. A rational prime p factors in ZŒ! if and only if x 2 C x C 1 So, p factors in ZŒ! if and
has a root in Fp . only if x 2 C x C 1 factors
in Fp Œx.
Proof. Apply Proposition 8.26 and the Factor Theorem (Corollary 6.15).
We summarize the chain of equivalent statements.
p factors in ZŒ!.
p D N.z/ for some z in ZŒ!.
p D a2 ab C b 2 in Z.
x 2 C x C 1 has a root in Fp .
Onward to a law of decomposition in ZŒ!. Numerical experiments (we
hope you’ll try some) suggest that if p is a rational prime and p 1 mod 3,
then x 2 C x C 1 has a root in Fp . The proof of Theorem 8.19 suggests a reason
why.
i i
“book2” — 2013/5/24 — 8:18 — page 346 — #366
i i
But
xp 1
1 D x 3k 1
3 k
D x 1
h k k k i
3 1 2 3
D .x 1/ x3 C x3 C x3 CC1
Exercise 6.47 on page 269
h k 1 k 2 k 3
i
D .x 2 C x C 1/ .x 1/ x 3 C x3 C x3 CC1
D .x 2 C x C 1/h.x/: (8.5)
Recall that the units in But the important thing is that the two factors on the right are associates. You
ZŒ! are ˙1, ˙!, and can check that
˙! 2 .
! 2 .1 !/ D 2 C !:
Are there other primes in
ZŒ! that are associate
So, our factorization of 3 can be written as
to their conjugates? That
question is Exercise 8.18
on page 349. 3 D ! 2 .1 !/2 ;
In terms of ideals, .3/ D and 3 is a ramified prime. Putting it all together, we have the law of decompo-
.1 !/2 D .1 !/2 . sition in ZŒ! as well as a description of all Eisenstein primes.
i i
“book2” — 2013/5/24 — 8:18 — page 347 — #367
i i
The relation between the factorization of a prime p in ZŒ! and the factor-
ization of x 2 C x C 1 in Fp Œx can be used to factor Eisenstein integers.
Example 8.31. (i) Let p D 31. There are two roots of x 2 C x C 1 in F31 ,
namely 5 and 25, and so x 2 C x C 1 D .x 5/.x 25/ in F31 Œx. Lift
this equation to ZŒx:
x 2 C x C 1 D .x 5/.x 25/ C 31.x 4/:
So, letting x D !, we have
.5 !/.25 !/ D 31.4 !/:
Now N.5 !/ D 31, so 5 ! is a prime factor of 31, and the other is This example connects to
Exercise 8.24 on page 349.
25 !
D 6 C !:
4 !
(ii) Let p D 97. There are two roots of x 2 C x C 1 in F97 , namely 61 and 35.
In fact,
x 2 C x C 1 D .x 61/.x 35/ C 97.x 22/:
Letting x D !,
.61 !/.35 !/ D 97.22 !/;
and so
.61 !/.35 !/
D 97:
22 !
Now, N.22 !/ D 507 D 3132; since N.1 !/ D 3 and N.4C!/ D 13,
so (checking for unit factors), we have
22 ! D .1 !/.4 C !/2 :
Some of these factors divide 61 !; the rest divide 35 !. We have
N.61 !/ D 3 13 97
N.35 !/ D 13 97:
We can cancel the factor of 13 by dividing by 4 C !; it’s easier to work
with 35 !:
35 ! .35 !/.4 C ! 2 /
D D8 3!:
4C! 13
Bingo: N.8 3!/ D 97, so that
97 D .8 3!/.8 3! 2 /: N
i i
“book2” — 2013/5/24 — 8:18 — page 348 — #368
i i
How to Think About It. Because ZŒi and ZŒ! are commutative rings, we
can construct quotient rings. And, since both rings are PIDs, they often look
very similar to rings we have already met.
The result of this example Example 8.32. We investigate the quotient ring R D ZŒ!=./, where
will be useful in the next
section. D1 !
is the prime lying over the rational (and ramified) prime 3. For any Eisenstein
integer z, let’s look at the remainder after dividing z by . Proposition 8.4 gives
Eisenstein integers q and r such that
z D q C r; with r D 0 or N.r / < N./.
Now N./ D 3, so that N.r / must be 0, 1, or 2. There are no Eisenstein
integers of norm 2, because 2 is inert in ZŒ!. Hence N.r / is 0 or 1. If N.r / D
0, then r D 0; if N.r / D 1, then r is a unit in ZŒ!. So, aside from 0, we
need only investigate the six Eisenstein units. It turns out that each of these is
congruent to 1 or 1 modulo :
Proof. By Proposition 7.13, the quotient ring ZŒ!=./ is a field, while Exam-
ple 8.32 shows that the field has exactly 3 elements. Therefore, ZŒ!=./ Š
F3 , for Corollary 7.40 says that two finite fields with the same number of ele-
ments are isomorphic.
The results in this section just scratch the surface, for life is more compli-
cated; there are rings of cyclotomic integers that are not PIDs. We shall have
more to say about this when we discuss the work of Kummer.
Further Results. The laws of decomposition for ZŒi (Theorem 8.21) and
ZŒ! (Theorem 8.29) show that the decomposition of a rational prime depends
only on its congruence class modulo a fixed integer: 4 for ZŒi and 3 for ZŒ!.
This theory was greatly generalized in the twentieth century to Class Field
Theory, which determines laws of decomposition of primes in rings of cy-
clotomic integers, thereby bringing together under one roof many of the main
ideas in modern algebra.
i i
“book2” — 2013/5/24 — 8:18 — page 349 — #369
i i
Exercises
8.18 * Which primes in ZŒ! are associate to their conjugates?
8.19 For which primes p is x 2 C x C 1 a perfect square in Fp Œx?
8.20 Working in ZŒ!], under what conditions are a C b! and b C a! associates?
8.21 In ZŒ!],
(i) What are all the associates of the prime 1 !?
(ii) Show that 1 ! and 1 ! 2 are associates.
(iii) Write .1 !/.1 ! 2 / as a C b!. Note that x 2 C x C 1 D
.x !/.x ! 2 /. What
(iv) What is the minimal polynomial of 1 !?
happens if you put x D 1?
8.22 If z, w, v are elements of ZŒ! (or ZŒi ), show that
(i) If z j w, then z j w.
(ii) If z j w, then N.z/ j N.w/ in Z.
(iii) If z w mod v then z w mod v.
8.23 Show that a rational prime p splits in ZŒ! if and only if 3 is a square mod p.
8.24 Show there are isomorphisms of commutative rings,
(i) ZŒi Š ZŒx=.x 2 C 1/.
(ii) ZŒ! Š ZŒx=.x 2 C x C 1/.
8.25 * Find all units u in ZŒ! such that u 1 mod 3. Note that 3 is a unit times
.1 !/2 .
8.26 Factor into primes in ZŒ!.
(i) 301 (ii) 307 (iii) 5 C 8!
(iv) 5 C ! (v) 19 C 18! (vi) 39 C 55!
(vii) 61 ! (viii) 62 C 149! (ix) 87 62!
8.27 Find the number of elements in
(i) ZŒ!=.2 C !/ (ii) ZŒ!=.4 !/
(iii) ZŒ!=.6 C !/ (iv) ZŒ!=.31/
8.28 Take It Further. If z is an Eisenstein integer, show that
jZŒ!=.z/j D N.z/:
How to Think About It. The development of the proof is quite technical
(we’ve polished it as much as we were able), but the essential idea is straight-
forward and has already been mentioned several times. It’s based on the fac-
torization of x 3 C y 3 in ZŒ! (see Exercise 3.50 on page 115):
x 3 C y 3 D .x C y/.x C y!/.x C y! 2 /:
i i
“book2” — 2013/5/24 — 8:18 — page 350 — #370
i i
z 3 D .x C y/.x C y!/.x C y! 2 /:
The primes dividing z all show up with exponent at least 3 in z 3 , and the idea is
to show that this can’t happen on the right-hand side. Heuristically, if the three
factors on the right are relatively prime and none is divisible by the square of
a prime, we’re done. But it’s not so easy, mainly because of some mischief
caused by D 1 !, the prime lying above 3. So, pull out your pencil again
and follow along.
Preliminaries
Our development will often make use of a fact about ZŒ! adapted from Corol-
lary 6.37.
Proposition 8.34. If x and y are rational integers that are relatively prime
in Z, then they are relatively prime in ZŒ!.
Proof. If is a prime in ZŒ! dividing both x and y, then N. / j N.x/ and
N. / j N.y/. That is, N. / j x 2 and N. / j y 2 (for both x and y lie in Z).
See Exercise 8.22 on Now N. / 2 Z; hence, if p is a prime factor of N. /, then p is a common
page 349. factor of x and y, a contradiction.
3 D ! 2 2 : (8.6)
That lies above 3 implies that a rational integer divisible by in ZŒ! is di-
visible by 3 in Z. The next lemma explains the ubiquity of in the forthcoming
proofs.
i i
“book2” — 2013/5/24 — 8:18 — page 351 — #371
i i
Thus, .z/ is the exponent of the highest power of dividing z: Some treatments define
.0/ to be 1, but we
.z/ D n if and only if z D n z 0 and − z 0 : won’t do that here. Also, a
valuation can be defined
Put another way, .z/ j z and .z/C1 − z. in an analogous way for
For example, ./ D 1, .!/ D 0, and .3/ D 2; in Example 8.31, we saw any prime q in a UFD; just
replace by q.
that .61 !/ D 1. Indeed, .u/ D 0 for every unit u.
The valuation enjoys some properties that come from the properties of
exponentiation. The next proposition reminds us of Exercise 2.15 on page 59.
How to Think About It. Most proofs of the theorem for exponent 3 are
broken into two parts: the first case in which 3 doesn’t divide x, y, or z, and
the second case in which 3 does divide one of them. We’ll follow this program
and treat the two cases in turn. There are many proofs in the literature; our
proof of the first case is not the easiest (see Exercise 8.31 on page 358 for a
fairly simple alternative approach), but we choose it because it generalizes to
a proof of the first case for any odd prime exponent p when ZŒp has unique
factorization (see Chapter 1 of [36] for the details). Our proof of the second
case is based on the development in Chapter 17 of [17].
x3 C y3 D z3 :
Proof. Suppose that is a prime in ZŒ! that divides two of the three integers,
say
j x C !i y and j x C ! j y;
i i
“book2” — 2013/5/24 — 8:18 — page 352 — #372
i i
Proposition 8.38 (First Case for Exponent 3). There are no positive integers
x; y; z with gcd.x; y/ D 1 and 3 − xyz such that
x3 C y3 D z3: (8.7)
i i
“book2” — 2013/5/24 — 8:18 — page 353 — #373
i i
x C !y ˙! i n mod 3:
1
But ! D ! , so that
1
xC! y ˙! i n mod 3: (8.9)
hence,
i
! .x C !y/ ! i x C ! 1
y mod 3:
Multiplying by ! i gives
x C !y ! 2i x C ! 1
y mod 3;
and so
x C !y ! 2i x ! 2i 1
y 0 mod 3: (8.10)
We claim, for each possible value of i , namely 0, 1, or 2, that Eq. (8.10) leads
to a contradiction.
(i) i D 0: Eq. (8.10) becomes
1
x C !y x ! y 0 mod 3I
that is,
1
! y 0 mod 3:
!
Multiplying both sides by ! 2 gives
.1 !/ y 0 mod 3I
x C !y !2 x !y 0 mod 3:
Thus, the !y’s drop out, and there is ˛ 2 ZŒ! with x.1 ! 2 / D 3˛.
But 1 ! 2 D i , by Exercise 8.21 on page 349, and so xi D ! 2 2 ˛.
Hence, j x in ZŒ!, and Lemma 8.35 gives 3 j x in Z, another contra-
diction.
i i
“book2” — 2013/5/24 — 8:18 — page 354 — #374
i i
˛ ˙1 mod :
Gauss’s proof requires a lemma that shows how an “extra ” sneaks into the
cube of this congruence.
for some ˇ 2 ZŒ!. Substitute this into the usual factorization in ZŒ!:
˛3 1 D .˛ 1/.˛ !/.˛ ! 2 /:
Now rewrite the third factor, using Exercise 8.21 on page 349, which says that
1 ! 2 D ! 2 :
˛ ! 2 D 1 C ˇ ! 2 D ˇ ! 2 D .ˇ ! 2 /:
i i
“book2” — 2013/5/24 — 8:18 — page 355 — #375
i i
Therefore,
˛3 1 D 3 ˇ.1 C ˇ/.ˇ ! 2 /: (8.11)
ˇ ! 2 D 1 C ˇ 0 !2
D1 ! 2 C ˇ 0
D ! 2 C ˇ 0
D . ! 2 C ˇ 0 /:
and so ˛ 3 1 mod 4 .
Gauss used infinite descent on .z/ and showed (as we will shortly) that if
there was a solution to x 3 C y 3 D uz 3 of the desired type, then one could find
another solution .x 0 ; y 0 ; z 0 / of the same type with .z 0 / < .z/. Iterating this
process will eventually contradict the next lemma.
Proof. Since − xy, Euclid’s Lemma in ZŒ! says that − x and − y, and
so Lemma 8.39 applies to say that both x 3 and y 3 are congruent to ˙1 mod 4 .
Hence, reducing x 3 C y 3 D uz 3 mod 4 yields
The left-hand side of these congruences is one of 0, 2, or 2. Since j z and Note that j z implies that
− 2 (why?), we see that ˙2 are impossible. Thus, 0 uz 3 mod 4 , so that .z/ 1.
4 j z 3 and .z 3 / D 3.z/ 4. But .z) is an integer; hence, .z/ 2 and
2 j z.
Here’s the main piece of the puzzle: the key step for infinite descent. Before we dig into the
proof, think about why
Proposition 8.41. Suppose that u is a unit in ZŒ! and x 3 C y 3 D uz 3 for this result, combined with
Lemma 8.40, implies
Eisenstein integers x; y; z with − xy and j z. Then there exists a unit u0 that there is no solution
and x 0 , y 0 ; z 0 2 ZŒ! with − x 0 y 0 and .z 0 / D .z/ 1, such that to x 3 C y 3 D uz 3 in
Eisenstein integers with
3 3 3
x0 C y0 D u0 z 0 : − xy and j z.
i i
“book2” — 2013/5/24 — 8:18 — page 356 — #376
i i
i i
“book2” — 2013/5/24 — 8:18 — page 357 — #377
i i
i i
“book2” — 2013/5/24 — 8:18 — page 358 — #378
i i
x 3 C y 3 D uz 3 :
After all this work, we have, as a simple corollary, what we wanted in the
first place.
Theorem 8.44 (Fermat’s Last Theorem for Exponent 3). There are no pos-
itive integers x, y, z such that x 3 C y 3 D z 3 .
Proof. Since j 3, Proposition 8.38 and Theorem 8.43 (with u D 1) cover all
the possible cases for x, y, z.
Proving Fermat’s Last Theorem for a given exponent n was split into two
cases, as we have just seen for n D 3; the second case was also divided into
two parts. The first case for all n < 100 was proved, around 1806, by Germain.
In 1825, Legendre proved one part of the second case for n D 5, while Dirich-
let proved the other part. In 1839, Lamé proved Fermat’s Last Theorem for
exponent n D 7. The level of difficulty increased with the exponent. It was not
until Kummer that many exponents were completely settled simultaneously.
Exercises
8.29 Show that none of the six units u in ZŒ! is congruent mod to 0, 2, or 2. (As
usual, D 1 !.)
8.30 Prove Proposition 8.36.
8.31 Without using Proposition 8.38, show that there are no integers x, y, z with
3 − xyz such that x 3 C y 3 D z 3 mod 9. This exercise gives an alternative proof
of Proposition 8.38.
8.32 Show that there are no integers x, y, z with 5 − xyz such that x 5 C y 5 D
z 5 mod 25. This exercise implies Fermat’s Last Theorem for exponent 5 in the
case that 5 − xyz.
8.33 Are there any integers x; y; z with 7 − xyz such that x 7 C y 7 D z 7 mod 49?
8.34 (i) Sketch the graph of x 3 C y 3 D 1.
(ii) Show that the only rational points on the graph are .1; 0/ and .0; 1/.
8.35 Take It Further. Let G be the graph of x 3 C y 3 D 9.
(i) Sketch G.
(ii) Find the equation of the line ` tangent to G at .2; 1/.
(iii) Find the intersection of ` and G.
(iv) Show that there are infinitely many triples of integers .x; y; z/ such that
x 3 C y 3 D 9z 3 :
i i
“book2” — 2013/5/24 — 8:18 — page 359 — #379
i i
x p C y p D .x C y/.x C p y/ : : : .x C pp 1
y/ (8.16)
The basic idea is to use the fact, in a UFD, that if a product of relatively prime
elements is a pth power, then each of its factors is also a pth power. The proof
is more complicated for large p because, while ZŒ3 D ZŒ! has only six
units, the ring ZŒp for p > 3 may have infinitely many units. As we saw
in the proof of Proposition 8.38, much of the argument depends on a careful
analysis of how units enter into the calculations.
The commutative rings ZŒp are called rings of cyclotomic integers, and
investigating them has played an important part of the story of Fermat’s Last
Theorem, well into the 20th century. We’ll start this section with a brief sketch
of arithmetic in ZŒp , pointing to some major results, perhaps without proof,
that generalize results we’ve already established for ZŒ!.
After that, we’ll sketch the work of Kummer that deals with the situation
when unique factorization fails. While these efforts didn’t lead him to a proof
of Fermat’s Last Theorem, they did lead to some ideas that have had real stay-
ing power in algebra. One of them is his introduction of ideals as an important
structural component of a commutative ring (Kummer called them divisors),
not merely as subsets that happen to arise, say in studying gcd’s. Another im-
portant idea is that of class number, a measure of how far off ZŒp is from
having unique factorization.
Here is a biography of Kummer we have adapted from that given in the
history archives of the School of Mathematics and Statistics of the University
of St. Andrews in Scotland.
Ernst Eduard Kummer was born in Sorau, Prussia, in 1810. He entered the
University of Halle in 1828 with the intention of studying Protestant theology,
but he received mathematics teaching as part of his degree which was designed
to provide a proper foundation to the study of philosophy. Kummer’s mathe-
matics lecturer H. F. Scherk inspired his interest in mathematics, and Kummer
soon was studying mathematics as his main subject.
In 1831 Kummer was awarded a prize for a mathematical essay he wrote on
a topic set by Scherk. In the same year he was awarded his certificate enabling
him to teach in schools and, on the strength of his prize-winning essay, he
was awarded a doctorate. In 1832, Kummer was appointed to a teaching post Joachimsthal was famed
at the Gymnasium in Liegnitz, now Legnica in Poland. He held this post for for the high quality of his
ten years, where he taught mathematics and physics. Some of his pupils had lectures. His colleagues
in Berlin included many
great ability and, conversely, they were extremely fortunate to find a school
famous mathematicians
teacher of Kummer’s quality and ability to inspire. His two most famous pupils such as Eisenstein, Dirich-
were Kronecker and Joachimsthal and, under Kummer’s guidance, they began let, Jacobi, Steiner, and
mathematical research while at school, as did Kummer himself. He published Borchardt.
i i
“book2” — 2013/5/24 — 8:18 — page 360 — #380
i i
Cyclotomic integers
We shall assume throughout this section that p is an odd prime and D p D
cos.2=p/ C i sin.2=p/. Recall some facts about Q./.
(1) irr.; Q/ D ˆp .x/ D 1 C x C x 2 C C x p 2 C x p 1
(Theorem 6.68
and Exercise 7.31 on page 300).
(2) Q.p / W Q D p 1 (Exercise 7.32 on page 300).
(3) x p 1 D .x 1/.x /.x 2 / : : : .x p 1
/ (Exercise 6.46(i) on
page 269).
(4) Q./ Š QŒx= ˆp .x/ (Theorem 7.25(i)).
B D 1; ; 2; : : : ; p 2
:
i i
“book2” — 2013/5/24 — 8:18 — page 361 — #381
i i
Pp 2
The ring ZŒ Q./ is thus the set of all linear combinations i D0 ai i
with ai 2 Z. It shares many of the algebraic properties of the Gaussian and
Eisenstein integers except, alas, it is not always a UFD (more about this in
the next section). But there are analogs for the laws of decomposition that we
developed in ZŒi and in ZŒ!. Recall, for example, that there is equality of
ideals in ZŒi :
.2/ D .1 i /2 ;
and also in ZŒ!,
.3/ D .1 !/2 :
It turns out that the ideal .p/ ramifies in ZŒ in a similar way. Let’s look into
this.
Now put x D 1.
Lemma 8.46 gives a factorization of p in ZŒ into p factors. Our next goal
is to show that the factors are all associates.
i i
“book2” — 2013/5/24 — 8:18 — page 362 — #382
i i
p D u.1 /p 1
;
.p/ D .1 /p 1
:
1 k D uk .1 /:
Factoring out the units from Eq. (8.17) and writing their product as u, we see
that
p D u.1 /p 1
:
.p/ D .1 /p 1
:
1 s
1 t
is a unit in ZŒ.
1; ; 2 ; :::; p 1
:
s
In particular, if 1 s p 1, then D p s
. We can calculate in ZŒ by
calculating in
ZŒx=.x p 1
C xp 2
C C 1/:
This allows us to use a CAS to do calculations and then to translate to ZŒ via
the map f .x/ 7! f ./.
i i
“book2” — 2013/5/24 — 8:18 — page 363 — #383
i i
There are other units in ZŒ that are real numbers; Corollary 8.49 gives a
way to produce them.
i i
“book2” — 2013/5/24 — 8:18 — page 364 — #384
i i
The results in this section set the stage for a proof of Fermat’s Last Theorem,
along the same lines as our proof of the theorem for exponent 3, for arbitrary
prime exponents p, as long as ZŒp has unique factorization. Kummer did
exactly this, for both cases of the theorem (a detailed historical account is
in [23]). As in the case p D 3, the key players are Eq. (8.16), the prime D
1 , and the units s where is a real unit in ZŒ. We leave the story here,
pointing to [5] Chapter 3 for the rest of the technical details.
Exercises
8.36 As usual, let D cos.2=p/ C i sin.2=p/, where p is a rational prime.
(i) Show that Q./ W Q C 1 D 2, and find
1
irr.; Q C :
1
(ii) What is Q C WQ ?
pY1
1 C pi :
i D1
pY1
1 C pi :
i D1
ps C p s
1 53 sin.3=5/
5 1 D :
1 5 sin.=5/
i i
“book2” — 2013/5/24 — 8:18 — page 365 — #385
i i
We claim that they are essentially different ways to factor 6 into primes. Let’s
see why.
p
Lemma 8.51. (i) The rational integers 2 and 3 are prime in R D ZŒ 5.
(ii) 1 C ˛ and 1 ˛ are prime in R.
i i
“book2” — 2013/5/24 — 8:18 — page 366 — #386
i i
But that doesn’t happen here because neither 2 nor 3 is associate to 1 C ˛, for
neither has norm 6 D N.1 C ˛/. We have a problem!
Kummer was working on methods for factoring cyclotomic integers (not,
as it turns out, towards a proof of Fermat’s Last Theorem, but towards another,
related question). He devised a way to think about our problem that actually
shows up in elementary school when children think that 14 15 and 10 21
are different factorizations of 210. The students are not going far enough in
their factorizations: if they write
14 D 2 7 and 15 D 3 5;
they see that the “other” factorization is just a rearrangement of the prime
factors of 14 15:
10 D 2 5 and 21 D 3 7:
Now, our problem is different in the sense that we already have prime factor-
izations. But Kummer’s idea was to imagine some “ghost factors” for each of
2, 3, 1 C ˛, and 1 ˛, sort of “super primes” behind the scenes, that could be
rearranged to produce the different factorizations. Kummer called these ideal
numbers or divisors, and he imagined there was a further factorization into
ideal numbers J1 ; J2 ; J3; J4 :
2 D J1 J2
3 D J3 J4
1 C ˛ D J1 J3
1 ˛ D J2 J4 :
Kummer knew that no such Ji existed in R, but he was able to model these
ghost factors, not as elements of R but as “lists” of elements, each list contain-
ing the non-associate divisors of 2, 3, 1 C ˛, and 1 ˛. And he developed a
theory extending R to a new system R0 in which there was unique factorization
Actually, Kummer consid- into ideal numbers. Later, Dedekind refined Kummer’s ideas, recasting ideal
ered rings of cyclotomic numbers into what we nowadays call ideals, a notion, as we’ve seen in this
integers. We’re using R
book, that has utility far beyond investigations into Fermat’s Last Theorem.
here just for the sake of
example. We’ll use the contemporary notion of ideal to continue our story.
The basic idea is that products of elements are replaced by products of ide-
Exercises 5.51 and 5.52 als. In a PID, nothing new is added, because there’s a bijection between ring
on page 220 define the
elements (up to associates) and principal ideals (Exercise 5.51(ii) on page 220).
product of two ideals and
develop the properties of But rings that are not UFDs are not PIDs (Theorem 6.50), so there’s a larger
the multiplication. stash of ideals that can enter into factorizations.
p
Example 8.52. We’ve seen, in R D ZŒ˛, where ˛ D 5, that
6 D 2 3 D .1 C ˛/.1 ˛/:
The ghost factors that will resolve our problem are ideals in R generated by
two elements:
J1 D .2; 1 C ˛/ D f2a C b.1 C ˛/ W a; b 2 Rg
J2 D .2; 1 ˛/ D f2a C b.1 ˛/ W a; b 2 Rg
J3 D .3; 1 C ˛/ D f3a C b.1 C ˛/ W a; b 2 Rg
J4 D .3; 1 ˛/ D f3a C b.1 ˛/ W a; b 2 Rg:
i i
“book2” — 2013/5/24 — 8:18 — page 367 — #387
i i
We claim that
.2/ D J1 J2
.3/ D J3 J4 Note that these equations
are equalities of ideals, not
.1 C ˛/ D J1 J3 numbers.
.1 ˛/ D J2 J4 :
The verifications all use the same method, so we’ll carry it out for the first case
only, leaving the rest for you as Exercise 8.45 below.
Let’s show that .2/ D J1 J2 . Now the product of two ideals I and J is
the set of all linear combinations of products r s where r 2 I and s 2 J
(Exercise 5.51 on page 220). So, J1 J2 D .2; 1 C ˛/.2; 1 ˛/ is the set of all
linear combinations of the form (recall that .1 ˛/.1 C ˛/ D 6):
a.2 2/ C b .2.1 ˛// Cc .2.1 C ˛// C d.1 C ˛/.1 ˛/
D 4a C 2b.1 ˛/ C 2c.1 C ˛/ C 6d;
where a; b; c; d 2 R. Well,
4a C 2b.1 ˛/ C 2c.1 C ˛/ C 6d D 2 Œ2a C b.1 ˛/ C 2c.1 C ˛/ C 3d ;
so J1 J2 .2/. And, if
.a; b; c; d / D . 1; 0; 0; 1/;
we have
4a C 2b.1 ˛/ C 2c.1 C ˛/ C 6d D 2;
so that .2/ J1 J2 . Hence
.2/ D J1 J2
as claimed. The other verifications follow in the same way.
Ah, but there’s one glitch. What if one of the four ideals is .1/, the unit
ideal? If J1 D .1/ for example, we’d have .2/ D .1 ˛/, and we’d still have
the same problem. But we can show that none of the Ji is the unit ideal. Let’s
show that J1 ¤ .1/—the arguments for the others are the same (Exercise 8.46
below).
Suppose, on the contrary, that J1 D .2; 1 C ˛/ D .1/. Then there exist
elements r C s˛ and t C u˛ in R, where r; s; t; u 2 Z, so that
1 D .r C s˛/ 2 C .t C u˛/.1 C ˛/:
Multiply this out, using the fact that ˛ 2 D 5, and write the result as x C y˛
to obtain
1 D .2r C t 5u/ C .2s C t C u/˛:
It follows that
2r C t 5u D 1
2s C t C u D 0:
Replace u by 2s t in the first equation to obtain
2r 4t C 10u D 1:
Since the left-hand side is even, this is impossible. N For rings of cyclotomic
integers ZŒp , it turns
Kummer introduced another brilliant idea. Call two ideals I and J equiva- out that this new kind of
factorization into ideals is
lent if there is a cyclotomic integer z so that unique.
I D .z/J D fzb W b 2 J g:
i i
“book2” — 2013/5/24 — 8:18 — page 368 — #388
i i
He was able to show that this gives an equivalence relation on nonzero ideals in
ZŒp (for symmetry, the set of all ideals must be enlarged by adding in certain
subsets of Frac.ZŒp / D Q.p / called fractional ideals). Most importantly,
Kummer showed that this relation has only finitely many equivalence classes,
and he called the number h.p/ of them the class number of ZŒp . If ZŒp
has class number 1, then all ideals are principal, there is unique factorization,
and our proof of Fermat’s Last Theorem can be refined to produce a proof
for such exponents. In fact, Kummer generalized this, proving that if the class
number h.p/ is not divisible by p, then there are no positive integer solutions to
ap Cb p D c p . This was a monumental achievement. Kummer called primes p
such that p − h.p/ regular primes. For example, even though ZŒ23 doesn’t
have unique factorization, 23 is a regular prime—h.23/ D 3, and so Fermat’s
Last Theorem holds for it. Alas, there are irregular primes. The smallest is
37, and the next two are 59 and 67. Unfortunately, it is known that there are
infinitely many irregular primes, and it’s unknown whether there are infinitely
many regular primes.
Let’s now say a bit more about Kummer’s ideal numbers (nowadays called
divisors), but we view his idea through the eyes of Dedekind. Take a cyclo-
tomic integer a 2 ZŒ, and define its divisor
D.a/ D fz 2 ZŒ W a is a divisor of zg:
Now D.a/ is closed under addition and multiplication by other cyclotomic
integers; that is, if z; z 0 2 D.a/, then z C z 0 2 D.a/; if z 2 D.a/ and r 2 ZŒ,
then r z 2 D.a/. In other words, D.a/ is an ideal (in fact, a principal ideal)
If a; b 2 R, where R in precisely the sense we have been using the term in this book (and we see
is a commutative ring, how natural the idea is when viewed in this context). The definition of divisor
then a j b if and only
makes sense for any commutative ring R, not just for the rings ZŒ.
if D.a/ D.b/; thus,
if R is a domain, then
Now generalize the notion of divisor so that, instead of being a subset of
D.a/ D D.b/ if and only if a commutative ring R of the form D.a/ for some a 2 R, it is a subset of R
a and b are associates. closed under addition and multiplication by elements of R; that is, let’s replace
D.a/, which is a principal ideal, by any ideal. Thus, if a; b 2 R, then
D.a/ C D.b/ D fz C w W z 2 D.a/ and w 2 D.b/g
is a generalized divisor. If we denote D.a/ C D.b/ by D.c/, pretending that
generalized divisors are just ordinary divisors, then we cannot declare that c is
an element of R. Thus, c is a “ghost” element. Of course, if R is a PID, then
c is an element of R, but if R is not a PID, then c may be a creature of our
imagination. p
Consider the ring R D ZŒ˛ in Example 8.52, where ˛ D 5. The fac-
torizations of 6,
6 D 2 3 and 6 D .1 C ˛/.1 ˛/;
involve four elements of R, each of which gives a divisor. As in the example,
define
J1 D D.2/ C D.˛/
J2 D D.2/ C D.1 ˛/
J3 D D.3/ C D.1 C ˛/
J4 D D.3/ C D.1 ˛/:
We can pretend that there are ghosts ci so that Ji D D.ci / for i D 1; 2; 3; 4.
i i
“book2” — 2013/5/24 — 8:18 — page 369 — #389
i i
To complete the story, we report that ghosts are primes: the ideals Ji can
be shown to be prime ideals, using the notion of the norm of ideals. Moreover,
one can prove that factorizations in terms of such ghosts are unique, using
fractional ideals.
How to Think About It. One of the contributions of Fermat’s Last Theorem
to algebra is that it attracted mathematicians of the first order and, as they
studied it, they enhanced the areas of mathematics impinging on it. For algebra
in particular, it brought the idea of commutative rings, factorization, and unique
factorization to the forefront. Kummer’s recognition that unique factorization
was not always present, and his restoration of it with his “ideal numbers,” led
Dedekind to introduce ideals into the study of rings. Dedekind’s notion of ideal
was taken up by Hilbert and then later by Emmy Noether. It is today one of the
most fundamental ideas in modern algebra.
We have a confession to make. Our discussion in Chapter 6 explains par-
allels of the arithmetic of polynomials with coefficients in a field k with the
arithmetic of integers by saying that both kŒx and Z are PIDs. No doubt, our
ancestors were aware of the analogy between these two systems, but viewing
them in terms of ideals is a modern viewpoint, after Dedekind, dating from
the 1920s. We wrote Chapter 6 using contemporary ideas because it unifies the
exposition.
Richard Dedekind was born in 1831 in Braunschweig (in what is now Ger-
many). He entered the University of Göttingen in 1850; it was a rather disap-
pointing place to study mathematics at the time, for it had not yet become the
vigorous research center it turned into soon afterwards. Gauss taught courses
in mathematics, but mostly at an elementary level. Dedekind did his doctoral
work under Gauss’s supervision, receiving his doctorate in 1852; he was to be
the last pupil of Gauss.
In 1854, both Riemann and Dedekind were awarded their habilitation de-
grees within a few weeks of each other. Dedekind was then qualified as a uni-
versity teacher, and he began teaching at Göttingen. Gauss died in 1855, and
Dirichlet was appointed to fill the vacant chair. This was an extremely impor-
tant event for Dedekind, who found working with Dirichlet extremely prof-
itable. He attended courses by Dirichlet, and they soon became close friends;
the relationship was in many ways the making of Dedekind, whose mathemat-
ical interests took a new lease on life with their discussions. Around this time
Dedekind studied the work of Galois, and he was the first to lecture on Galois
theory when he taught a course on the topic at Göttingen.
In the spring of 1858, Dedekind was appointed to the Polytechnikum in
Zurich. It was while he was thinking how to teach differential and integral
calculus that the idea of a Dedekind cut came to him. His idea was that every
real number r divides the rational numbers into two subsets, namely those
greater than r and those less than r . Dedekind’s brilliant idea was to represent
the real numbers by such divisions of the rationals.
The Collegium Carolinum in Brunswick had been upgraded to the Brunswick
Polytechnikum by the 1860s, and Dedekind was appointed there in 1862. He
returned to his home town, remaining there for the rest of his life, retiring in
1894. Dedekind died in 1916.
i i
“book2” — 2013/5/24 — 8:18 — page 370 — #390
i i
Exercises
p
8.43 Let R D ZŒ 5 and let N W R ! Z be the norm map: N.z/ D z z. Show that
(i) N.zw/ D N.z/ N.w/ for all z; w 2 R.
(ii) u is a unit in R if and only if N.u/ D 1.
(iii) If z 2 R, N.z/ D N.z/.
(iv) If a 2 Z, N.a/ D a2 .
p
8.44 Find all the units in R D ZŒ 5.
.3/ D J3 J4
.1 C ˛/ D J1 J3
.1 ˛/ D J2 J4 :
i i
“book2” — 2013/5/24 — 8:18 — page 371 — #391
i i
8.46 Referring to Example 8.52, show that none of J2 ; J3 ; J4 is the unit ideal in R.
8.47 Referring to Example 8.52,
(i) The ideal generated by the norms of elements in J1 is an ideal in Z, and hence
is principal. Find a generator for it.
(ii) Do the same for the other ideals Ji (i D 2; 3; 4).
8.48 Take It Further. Figure 8.1 is the front of a T-shirt that illustrates that
p p
2 3 D .1 C 5/.1 C 5/:
Explain.
In how many ways can a positive integer be written as a sum of two We should say “as a
squares? sum of two squares of
nonnegative integers.”
The surprising answer to this question was first discovered by Fermat. Just as
we used the arithmetic of Eisenstein integers to prove Fermat’s Last Theorem
for exponent 3, we’ll use the arithmetic of Gaussian integers to understand
Fermat’s discovery.
Before continuing, let’s first consider n D 5. Now 5 is a sum of two squares:
5 D 22 C 12 . We recognize the norm of a Gaussian integer, for 5 D 22 C 12 D
N.2 C i /. Is there another way to write 5 as a sum of two squares? Recall that 5
splits in ZŒi as .2 C i /.2 i /, which suggests writing 5 as N.2 i /; that is,
5 D 22 C . 1/2 . If we agree, when we write n D a2 C b 2 , that both a and b
are nonnegative, then we can ignore the second equation 5 D N.2 i /. Ah,
but there’s another way to write it as N.a C bi / with both a; b nonnegative.
While 2 i doesn’t have nonnegative real and imaginary parts, it is associate
to 2 C i , because
i.2 i / D 1 C 2i I
and 2 C i and 1 C 2i are not associates (why?). So there are two bonafide non-
associate Gaussian integers a C bi with nonnegative a and b and norm 5. Let’s
agree, then, that 5 is a sum of two squares in two ways: 22 C 12 and 12 C 22 .
The following definition of a counting function makes sense.
i i
“book2” — 2013/5/24 — 8:18 — page 372 — #392
i i
the complex plane, We’ve been using the term “first quadrant” throughout the
book, often without properly defining it. We now insist that the positive x-axis
is in the first quadrant but that the positive y-axis is not. The reason is, viewing
R2 as C, that we want to find a piece of the complex plane that contains one
Gaussian integer from each class of associates.
In light of these remarks, we modify the definition of r .n/ for the purpose at
hand, counting only Gaussian integers in the first quadrant (as we wrote above,
Exercise 8.13 says that two such integers are necessarily not associate).
It’s worth calculating r .n/ to get a feel for what it is counting. High school
students should enjoy working out some of these numbers by hand (there is no
need to mention machinery of ZŒi ). Here are some values for them to check.
i i
“book2” — 2013/5/24 — 8:18 — page 373 — #393
i i
The proof that we’ll develop uses some new machinery as well as the law
of decomposition for Gaussian integers. First, a few examples that show some
of the delightful consequences of the theorem.
1; 5; 13; 65:
65 D 1 C 64
D 64 C 1
D 16 C 49
D 49 C 16:
There are two divisors that are 1 mod 4 and two that are 3 mod 4, so
r .21/ D 0. Thus, 21 is not a sum of two squares.
(iii) Let n D 3m for some integer m. Odd powers of 3 are congruent to
3 mod 4, while even powers are 1 mod 4. The divisors of 3m are
1; 3; 32 ; 33 ; : : : ; 3m :
Corollary 8.55. For any positive integer n, we have A.n/ B.n/I that is, n
has at least as many divisors of the form 4k C 1 as it has divisors of the form
4k C 3.
X1
a.n/ a.2/ a.3/
s
D a.1/ C s C s C ;
nD1
n 2 3
where the a.n/ are complex numbers. (It will be useful to write a.n/ instead
of the usual an .)
i i
“book2” — 2013/5/24 — 8:18 — page 374 — #394
i i
Multiplication is also done term by term, but then one gathers up all terms with
Dirichlet series are not the same denominator. So, for example, if we’re looking for c.12/=12s in
formal power series, and
X1 1 1
multiplication is not the a.n/ X b.n/ X c.n/
same as in CŒŒx. s s
D ;
nD1
n nD1
n nD1
ns
Theorem 8.56. If
X1 X1
a.n/ c.n/
.s/ s
D ;
nD1
n nD1
ns
P
then c.n/ D djn a.d /:
i i
“book2” — 2013/5/24 — 8:18 — page 375 — #395
i i
Proof. Expand
1
! 1
!
X 1 X a.n/
ns ns
nD1 nD1
a.mn/ D a.m/a.n/:
Proof. Each factor on the right side is a geometric series: To be rigorous, we should
put some restrictions on
1 a.p/ a.p/ 2 a.p/ 3 the values of a.k/ to
D 1C C C C ::: ensure that the series
a.p/ ps ps ps converges.
1
ps
a.p/ a.p 2 / a.p 3 /
D 1C C C C ::::
ps p 2s p 3s
Multiply these together (one for each prime) and you get the sum of every
possible expression of the form
Since every n 2 Z can be written in one and only one way as a product of
powers of primes (the fundamental theorem again), this is the same as the sum
X1
a.n/
:
nD1
ns
i i
“book2” — 2013/5/24 — 8:18 — page 376 — #396
i i
(ii) Here’s a multiplicative function that’s connected to our work with Gaus-
sian integers:
8̂
<1 if n 1 .mod 4/
.n/ D 1 if n 3 .mod 4/ :
:̂
0 if n is even
is called a quadratic You can check that is strongly multiplicative, and so
character.
1
!
X .n/ Y 1
D : N
ns p 1 .p/ s
nD1 p
So, a.n/ is the excess of the number of divisors of n of the form 4k C1 over the
number of divisors of n of the form 4k C 3. Bingo: this is exactly the function
that is the at heart of Theorem 8.53. The idea, then, is to form the Dirichlet
series with coefficients r .n/ and to show that
1
X 1
X
r .n/ .n/
s
D .s/ :
n ns
nD1 nD1
To do this, we’ll convert each of the sums to products. We already have done
this in Example 8.58 for the sums on the right-hand side; for the left-hand side,
we argue as follows.
Each term in the left-hand sum is a sum of unit fractions, and the number
of such fractions is the number of Gaussian integers with given norm. For
example, 3=25s comes from
1 1 1
C C :
N.3 C 4i / N.4 C 3i / N.5 C 0i /
Using this idea and the multiplicativity of N , we get a product formula for the
left-hand side.
X1 X
r .n/ 1
s
D s
nD1
n .N.˛//
˛2Q1
1
Y X 1
D s (use the fundamental theorem in ZŒi /
p2Q1 kD0 .N.p//k
Y 1
D 1
(sum a geometric series) :
p2Q1
1 N.p/s
i i
“book2” — 2013/5/24 — 8:18 — page 377 — #397
i i
Here, the product is over all Gaussian primes in the first quadrant. This is
another example that is best understood by calculating a few coefficients by
hand.
Now use Theorem 8.21 (the law of decomposition for ZŒi ). Every prime in
Q1 lies over one of these:
the prime 2. There’s only one in the first quadrant: 1 C i , and N.1 C i / D 2.
a prime p congruent to 1 mod 4. There are two for each such p—if
p D ;
then both and have an associate in Q1 (and they are different), and each
has norm p.
a prime p congruent to 3 mod 4. There’s only one such prime in Q1 , because
such a p is inert and N.p/ D p 2.
So,
1 Y 1
2 Y 1
X r.n/ Y 1 1
D D 1 1
ns 1 1 p 1 p 1
nD1 z2Q1 1 1 1 mod 4 ps 3 mod 4 p2s
N.z/s 2s
Y 1
2 Y 1
Y 1
1
D 1 1 1
1 p 1 p 1 p 1C
1 1 mod 4 ps 3 mod 4 ps 3 mod 4 ps
2s
Y 1
Y 1
Y 1
1
D 1 1 1
1 p odd 1 p1 mod 4 1 p3 mod 4 1C
1 ps ps ps
2s
Y 1
Y 1
D .s/ .p/ .p/
p1 mod 4 1 p3 mod 4 1
ps ps
Y 1
1
X .n/
D .s/ .p/ D .s/
p odd 1 ns
ps nD1
1
X a.n/
D ;
ns
nD1
Exercises
8.49 Suppose that m 1 is an integer. Show that if p is a prime and p 3 mod 4,
(
m
0 if m is odd
r p D
1 if m is even:
i i
“book2” — 2013/5/24 — 8:18 — page 378 — #398
i i
r.mn/ D r.m/r.n/:
8.55 Show that an integer can be written as a sum of two squares if and only if the
primes in its prime factorization that are congruent to 3 mod 4 show up with even
exponents.
8.56 Take It Further. Show that
1
X .n/
.s/ D .s 1/;
ns
nD1
i i
“book2” — 2013/5/24 — 8:18 — page 379 — #399
i i
9 Epilog
Attempts to resolve Fermat’s Last Theorem have led to much modern alge-
bra. There were many other areas of mathematical research in the seventeenth,
eighteenth and nineteenth centuries, one of which was determining the roots of
polynomials. Informally, a polynomial is solvable by radicals if its roots can
be given by a formula generalizing the classical quadratic, cubic, and quar-
tic formulas. In 1824, Abel proved that there are quintic polynomials that are
not solvable by radicals and in 1828 he found a class of polynomials, of any
degree, that are solvable by radicals. In 1830, Galois, the young wizard who
was killed before his 21st birthday, characterized all the polynomials which
are solvable by radicals, greatly generalizing Abel’s theorem. Galois’ brilliant
idea was to exploit symmetry through his invention of group theory.
After a brief account of the lives of Abel and Galois, we will use ring theory
to make the notion of solvability by radicals precise. This will enable us to un-
derstand the work of Abel and Galois showing why there is no generalization
of the classical formulas to polynomials of higher degree. We will then intro-
duce some group theory, not only because groups were the basic new idea in
the study of polynomials, but because they are one of the essential ingredients
in Wiles’ proof of Fermat’s Last Theorem in 1995. In fact, symmetry is an im-
portant fundamental idea arising throughout mathematics. In the last section,
we will say a bit about Andrew Wiles and his proof of Fermat’s Last Theorem.
i i
“book2” — 2013/5/24 — 8:18 — page 380 — #400
i i
In 1821, while in his final year at the University of Christiana, Abel thought
he had proved that quintic polynomials are solvable by radicals, and he submit-
ted a paper to the Danish mathematician Degen for publication by the Royal
The importance of nu- Society of Copenhagen. Degen asked Abel to give a numerical example of his
merical examples can’t be method and, in trying to do this, Abel discovered a mistake in his paper. De-
overestimated.
gen had also advised Abel to study elliptic integrals, and Abel wrote several
important fundamental papers on the subject. In 1824, Abel returned to quin-
tic polynomials, proving that the general quintic polynomial is not solvable by
radicals.
In 1825, having now done brilliant work in two areas of mathematics, the
Norwegian government gave Abel a scholarship to travel abroad. He went to
Germany and France, hoping to meet eminent mathematicians, but Gauss was
not interested in Abel’s work on the quintic, and the mathematicians in Paris
did not yet appreciate his remarkable theorems on elliptic functions. By 1827,
Abel’s health deteriorated, he was heavily in debt, and he returned home to
Norway. In 1828, he briefly returned to polynomials, proving a theorem de-
scribing a class of polynomials (of any degree) that are solvable by radicals.
By this time, Abel’s fame had spread to all mathematical centers. Legendre
saw the new ideas in papers of Abel and of Jacobi, and he wrote
Through these works you two .Abel and Jacobi/ will be placed in the
class of the foremost analysts of our times.
Strong efforts were made to secure a suitable position for Abel by a group from
the French Academy, who addressed King Bernadotte of Norway-Sweden;
Crelle also worked to secure a professorship for him in Berlin. But it was too
late. Abel died in 1829, at age 26.
An imprecise measure of Abel’s influence on modern mathematics is the
number of areas named after him: abelian groups, abelian varieties, abelian
differentials, abelian integrals, abelian categories, abelian extensions, abelian
number fields, abelian functions. The Niels Henrik Abel Memorial Fund was
established in 2002, and the Norwegian Academy of Science and Letters awards
the Abel Prize for outstanding scientific work.
Évariste Galois was born in Bourg La Reine, near Paris, in 1811. France,
and especially Paris, was then in the throes of great political and social change
as a consequence of the French Revolution in 1789, the Napoleonic era 1799–
1815, the restoration of the French monarchy with King Louis XVIII in 1815,
his overthrow by King Charles X in 1824, and another revolution in 1830.
In April 1829, Galois’ first mathematics paper (on continued fractions) was
published; he was then 17 years old. In May and June, he submitted articles
on the algebraic solution of equations to Cauchy at the Academy of Science.
Cauchy advised him to rewrite his article, and Galois submitted On the condi-
tion that an equation be solvable by radicals in February 1830. The paper was
sent to Fourier, the secretary of the Academy, to be considered for the Grand
Prize in mathematics. But Fourier died in April 1830, Galois’ paper was never
subsequently found, and so it was never considered for the prize. July 1830
saw another revolution. King Charles X fled France, and there was rioting in
the streets of Paris. Later that year, Galois (now age 19) was arrested for mak-
ing threats against the king at a public dinner, but he was acquitted. Galois
was invited by Poisson to submit a third version of his memoir on equations to
the Academy, and he did so in January 1831. On July 14, Galois was arrested
i i
“book2” — 2013/5/24 — 8:18 — page 381 — #401
i i
i i
“book2” — 2013/5/24 — 8:18 — page 382 — #402
i i
us now examine the classical formulas to make this rather vague idea more
precise.
How to Think About It. Even though much of what we shall say applies
to polynomials over any field, the reader may assume all fields coming up are
subfields of the complex numbers C. We point out, however, that some famil-
iar results may not be true for all fields. For example, the quadratic formula
doesn’t hold in kŒx when k has characteristic 2 (for 12 doesn’t make sense in
k); similarly, neither the cubic formula nor the quartic formula holds in kŒx
when k has characteristic either 2 or 3.
In more detail,
p K D k.u/, where u is a root of x n a for some a 2 k;
n
that is, u D a, and so we are adjoining an nth root of a to k. But there are
several nth roots of a in C, namely
p p p
n
a; n a; : : : ; n 1 n a;
g C h; !g C ! 2 h; and ! 2 g C !h;
p
where g3 D 12 r C R , h D q=3g, R D r 2 C 27 4 3
q , and ! is a primitive
cube root of unity. Because of the constraint gh D 31 q, each choice of g D
q p
3 1
2
. r C R/ has a “mate,” namely h D q=.3g/, q=.3!g/ D ! 2 h, and
q=.3! 2 g/ D !h.
i i
“book2” — 2013/5/24 — 8:18 — page 383 — #403
i i
k D K0 K1 K t D K;
Quadratics are solvable by radicals, and the cubic formula shows that every
cubic f .x/ D x 3 C qx C r 2 QŒx is solvable by radicals: a radical extension
containing a splitting field of f is
p
Q.q; r / D K0 K1 D K0 .!/ K2 D K1 . R/ K3 D K2 .g/;
i i
“book2” — 2013/5/24 — 8:18 — page 384 — #404
i i
e.x/ D x 4 C qx 2 C r x C s D .x 2 C jx C `/.x 2
f jx C m/;
.j 2 /3 C 2q.j 2 /2 C .q 2 4s/j 2 r 2:
The cubic formula gives j 2 , from which we can determine m and `, and hence
the roots of the quartic.
Define a radical extension
k D K0 K1 K2 K3 ;
We have just seen that quadratics, cubics, and quartics in QŒx are solvable
by radicals. Conversely, let f .x/ 2 QŒx be a polynomial of any degree, and
let E=Q be a splitting field. If f is solvable by radicals, we claim that there is
Q can be replaced by any a formula that expresses its roots in terms of its coefficients. Suppose that
field of characteristic 0.
Q D K0 K1 K t
9.3 Symmetry
Recognizing and exploiting symmetry is an important ingredient in geometry,
algebra, number theory, and, indeed, in all of mathematics.
Here is the basic idea: an object is symmetric if, when you transform it in a
certain way, you get the same object back. For example, what do we mean
when we say that an isosceles triangle is symmetric? Figure 9.1 shows
D ABC with its base AB on the x-axis and with the y-axis being the
i i
“book2” — 2013/5/24 — 8:18 — page 385 — #405
i i
A B
perpendicular-bisector of AB. Close your eyes; pretend that the y-axis is a Reflection in the y-axis
mirror, and let be reflected in the y-axis (so that the vertices A and B are is the function .x; y/ 7!
. x; y/.
interchanged); open your eyes. You cannot tell that has been reflected; that
Reflection in the x-axis
is, is symmetric in the y-axis. On the other hand, if were reflected in the is the function .x; y/ 7!
x-axis, then it would be obvious, once your eyes are reopened, that a reflection .x; y/.
had taken place; that is, is not symmetric in the x-axis.
Here is a non-geometric example: the polynomial f .x; y/ D x 3 C y 3 xy
is symmetric because, if you transform it by interchanging x and y, you get
the same polynomial back. Another example arises from g.x/ D x 6 x 2 C 3.
This polynomial is symmetric because g. x/ D g.x/; this symmetry induces
symmetry of the graph of g.x/ in the y-axis, for . x; y/ lies on the graph if
and only if .x; y/ does.
The transformations involved in defining symmetry are usually permuta-
tions.
i i
“book2” — 2013/5/24 — 8:18 — page 386 — #406
i i
P
P
U Q U Q U Q
The following theorem should remind you of Theorem 3.12 (which is the
special case when E= k D C=R and is complex conjugation). Of course,
automorphisms are certain kinds of permutations.
Theorem 9.4. Let k be a field, let f .x/ 2 kŒx, and let E= k be a splitting
field of f . If W E ! E is an automorphism fixing k, then permutes the set
of all the roots of f .
0 D .f .u//
D .a0 C a1 u C C an un /
D .a0 / C .a1 /.u/ C C .an /.un /
D a0 C a1 .u/ C C an .u/n
D f ..u//:
The following definition, due to E. Artin around 1930, modernizes and sim-
plifies Galois’ original definition given 100 years earlier (it is equivalent to
Galois’ definition).
i i
“book2” — 2013/5/24 — 8:18 — page 387 — #407
i i
the other hand, the splitting field of g is Q, for both 3 and 3 lie in Q, and so
Gal.g/ consists only of the identity permutation.
The astute reader may have noticed that Gal.f / really depends only on the
fields k and E; two polynomials in kŒx having the same splitting field have
the same Galois group. For this reason, we usually write
Gal.f / D Gal.E= k/:
Example 9.5. We show that not every permutation of the roots of a polyno-
mial f is the restriction of some automorphism 2 Gal.f /. We saw in Ex-
ample 3.7 that the roots of
f .x/ D x 4 10x 2 C 1 2 QŒx
are
p p p p p p p p
˛D 2 C 3; ˇD 2 3;
D 2C 3; ıD 2 3:
Let E=Q be a field extension containing these four roots, and let be the
permutation that interchanges ˇ and
and fixes the other two roots:
.˛/ D ˛; .ˇ/ D
; .
/ D ˇ; .ı/ D ı:
p
In E, we have ˛ ˇ D 2 3. Suppose there p is an automorphism of E with
jf˛; ˇ;
; ıg D . Then .˛ ˇ/ D .2 3/, and
p
.˛ ˇ/ D .˛/ .ˇ/ D .˛/ .ˇ/ D ˛
D 2 2:
p p
Hence, .2 3/ D 2 2. Square both sides:
p p
.2 3/2 D .2 2/2 D 8:
p p
The left-hand side is .2 3/2 D .2 3/2 D .12/ D 12, and this is a
contradiction. Therefore, … Gal.f /. N
i i
“book2” — 2013/5/27 — 21:32 — page 388 — #408
i i
Example 9.6. The elementary symmetric polynomials can be used to give al-
ternate derivations of the quadratic and cubic formulas (Exercises 9.1 and 9.3
below). Let’s sketch a derivation of the cubic formula along these lines.
Assume our cubic has been reduced and is, as before, of the form
x 3 C qx C r:
Suppose further that we let its roots be ˛1 , ˛2 and ˛3 . Then we know that
˛1 C ˛2 C ˛3 D 0;
˛1˛2 C ˛1 ˛3 C ˛2 ˛3 D q;
˛1 ˛2 ˛3 D r:
As usual, ! D Form the two expressions s and u:
1
p
1Ci 3 .
2
s D ˛1 C ˛2 ! C ˛3 ! 2
u D ˛1 C ˛2 ! 2 C ˛3 !:
So, we have three expressions in the roots
0 D ˛1 C ˛2 C ˛3
s D ˛1 C ˛2 ! C ˛3 ! 2
u D ˛1 C ˛2 ! 2 C ˛3 !:
Adding the equations, we see that s C u D 3˛1 . Hence, if s and u can be
expressed in terms of q and r , then ˛1 can be so expressed (and, by symmetry,
the other roots can be expressed in terms of q and r ). Experimenting with a
CAS or by hand, we find that
su D ˛12 C ˛22 C ˛32 ˛1 ˛2 ˛1 ˛3 ˛2 ˛3
2
D .˛1 C ˛2 C ˛3 / 3 .˛1 ˛2 C ˛1 ˛3 C ˛2 ˛3/
D 0 3q D 3q:
A CAS is a great help here Expanding s 3 C u3 and factoring the result, we get
(See Appendix A.6).
s 3 Cu3
D .˛1 C ˛2 2˛3 /.˛1 C ˛3 2˛2/.˛2 C ˛3 2˛1/
D .˛1 C ˛2 C ˛3 3˛3/.˛1 C ˛2 C ˛3 3˛2/.˛1 C ˛2 C ˛3 3˛1 /
D . 3˛3 /. 3˛2 /. 3˛1 /
D 27˛1˛2 ˛3 D 27r:
From su D 3q, we get s 3 u3 D 27q 3 . Coupled with s 3 C u3 D 27r , we
see that s 3 and u3 are roots of the quadratic polynomial
x 2 C 27r x 27q 3 :
We can solve this for s 3 and u3 , take cube roots, and recover ˛1 , leading to
Cardano’s formula (Exercise 9.3 below). N
Exercises
Exercise 9.1 shows how
to derive the quadratic 9.1 * Suppose the roots of x 2 C bx C c are ˛ and ˇ. Find, without using the quadratic
formula without completing formula, an expression for and ˛ ˇ in terms of ˛ C ˇ and ˛ˇ . Use it and the
the square. fact that ˛ C ˇ D b to find ˛ in terms of b and c.
i i
“book2” — 2013/5/24 — 8:18 — page 389 — #409
i i
x3 18x 35;
9.4 Groups
Galois invented groups to exploit symmetry. Our purpose here is only to dis-
play Galois’ ideas in enough detail so that Theorem 9.16 below is plausible;
we wish to dispel some of the mystery that would arise if we merely cited the
ultimate result (you can follow the proofs in [26], Chapter 5).
Commutative rings are sets with two binary operations; a group is a set
having only one binary operation. Permutations, as any functions from a set
X to itself, can be composed and, as we show in Appendix A.1, composition
equips the family of all permutations of X with a a binary operation. This
viewpoint begets a kind of algebra, called group theory.
W G G ! G;
It is not difficult to prove, for groups as for commutative rings, that the
identity element is unique (if e 0 a D a D a e 0 for all a 2 G, then e 0 D e),
and the inverse of every element is unique (if a00 a D e D a a00 , then
a00 D a0 ).
Example 9.7. Theorem A.12 in Appendix A.1 shows that SX , the family of
all the permutations of a nonempty set X, is a group with composition as its
binary operation. In the special case when X D f1; 2; : : : ; ng, denote SX by
Sn ;
Example 9.8. Just because we call a Galois group a group doesn’t make it so.
Recall that the Galois group Gal.E= k/ of a field extension E= k consists of all
the automorphisms of E that fix k. We now show that Gal.E= k/ with binary
operation composition is a group.
If ; 2 Gal.E= k/, then their composite is an automorphism of E
fixing k; that is, 2 Gal.E= k/, so that composition is a binary operation
on Gal.E= k/. Proposition A.5 says that composition of functions is always
i i
“book2” — 2013/5/24 — 8:18 — page 390 — #410
i i
How to Think About It. We warn the reader that new terms are going to
be introduced at a furious pace. You need not digest everything; if a new idea
seems only a little reasonable, continue reading nevertheless. One way to keep
your head above water is to see that definitions and constructions for groups
(subgroups, homomorphisms, kernels, normal subgroups, quotient groups) are
parallel to what we have already done for commutative rings (subrings, ho-
momorphisms, kernels, ideals, quotient rings). Your reward will be a better
appreciation of the beautiful results of Abel and Galois.
i i
“book2” — 2013/5/24 — 8:18 — page 391 — #411
i i
We can be more precise. If groups are denoted by .G; / and .H; ı/, where
and ı are binary operations, then a homomorphism 'W G ! H is a function
for which
Theorem 9.11. If a polynomial f .x/ 2 kŒx has n roots, then its Galois group If X and Y are sets
Gal.E= k/ is isomorphic to a subgroup of the symmetric group Sn . of n elements, then
SX Š Sn Š SY . Thus,
groups don’t care if you
Proof. By Theorem 9.4, elements of Gal.E= k/ permute the roots of f . Now are permuting n numbers,
see [26], p. 454. n roots, or n monkeys.
i i
“book2” — 2013/5/24 — 8:18 — page 392 — #412
i i
aN D fas W s 2 N g G:
i i
“book2” — 2013/5/24 — 8:18 — page 393 — #413
i i
Sketch of proof. Adapt the proof of the First Isomorphism Theorem for com-
mutative rings.
How to Think About It. Without a doubt, this section contains too much
new material; there’s too much to digest. Fortunately, you have seen analogs
of these definitions for commutative rings so, at least, they sound familiar. You
can now sympathize with the members of the Academy in Paris in 1830 as
they struggled, without benefit of ever having seen any abstract algebra at all,
to read such things in the paper Galois submitted to them!
Let us now see why normal subgroups are important for polynomials. If
k B E, then Gal.E=B/ is a subset of Gal.E= k/:
i i
“book2” — 2013/5/24 — 8:18 — page 394 — #414
i i
Lemma 9.15. Let k be a field containing all roots of unity. If f .x/ 2 kŒx is
solvable by radicals, then there is a chain of subgroups
G0 D Gal.K t = k/ G1 G2 G t D f1g;
G D G0 G1 G2 G t D f1g
Galois’ Theorem explains why the classical theorems hold for polynomials
of degree 4.
Proof. Since deg.f / 4, Theorem 9.11 says that Gal.f / is (isomorphic to) a
subgroup of S4 and, hence, it is a solvable group. Theorem 9.16 now says that
f is solvable by radicals.
i i
“book2” — 2013/5/24 — 8:18 — page 395 — #415
i i
f .x/ D x 5 4x C 2 2 QŒx
(see Figure 9.3) is not solvable by radicals because its Galois group is S5 ([26],
p. 469).
6
−2 −1 1 2
−2
−4
−6
Corollary 9.18 is often misquoted. It says the general quintic is not solvable
by radicals: there is no formula involving only addition, subtraction, multipli-
cation, division, and extraction of roots that expresses the roots of the general
quintic polynomial in terms of its coefficients. But it doesn’t say that roots of
quintics cannot be found. There are other kinds of formulas; for example, New-
ton’s method gives the roots as limn!1 xn , where xnC1 D xn f .xn /=f 0 .xn /.
Thus, it is not accurate to say that there is no formula finding the roots of a
quintic polynomial.
Exercises
9.4 Prove that every subgroup of an abelian group is abelian.
9.5 Let f .x/; g.x/ 2 QŒx be solvable by radicals.
(i) Show that f .x/g.x/ is also solvable by radicals.
(ii) Give an example showing that f .x/ C g.x/ need not be solvable by radicals.
9.6 Assuming that x n 1 is solvable by radicals, prove that x n a is solvable by
radicals, where a 2 Q.
9.7 Prove that S3 is a solvable group and that it is not abelian.
9.8 Recall Exercise 1.56 on page 35: if m 2 is an integer, gcd.k; m/ D 1, and
gcd.k 0 ; m/ D 1, then gcd.kk 0 ; m/ D 1.
Prove that
Um D fŒk 2 Zm W gcd.k; m/ D 1g
i i
“book2” — 2013/5/24 — 8:18 — page 396 — #416
i i
9.10 If R is a commutative ring, prove that R is an abelian group under addition. (Note
that 0 is the identity element and that a is the (additive) inverse of a.)
9.11 Let k be a field.
(i) Prove that k is an abelian group under multiplication, where k denotes the
set of nonzero elements of k.
(ii) Prove that GL2 .k/, the set of all 2 2 nonsingular matrices with entries in k,
is a group under matrix multiplication.
GL2 .k/ is called the (iii) Prove that the determinant function,
General Linear group, and
SL2 .k/ is called the Special detW GL2 .k/ ! k ;
Linear group.
is a surjective homomorphism of groups.
(iv) Prove that ker.det/ D SL2 .k/, the set of all 2 2 matrices over k having
determinant 1.
(v) Prove that GL2 .k/=SL2 .k/ Š k .
9.12 (i) Prove that R is an abelian group with addition as binary operation.
(ii) Prove that Q is an abelian group with addition as binary operation; indeed, it
is a subgroup of R.
(iii) Let R> be the group of positive real numbers. Show that R> is a group with
addition as as binary operation.
The “laws of exponents” 9.13 (i) Prove that expW R ! R> , defined by a 7! ea , is a group homomorphism.
from high school algebra (ii) Prove that logW R> ! R, defined by b 7! log b, is a group homomorphism.
preview the results of
Exercise 9.13. (iii) Prove that exp is an isomorphism by showing that its inverse is log.
9.14 (i) Prove that R>, the set of all positive real numbers, is an abelian group with
multiplication as binary operation, and prove that Q> , the set of all positive
rational numbers, is a subgroup of R> .
(ii) Prove that ZŒx is an abelian group under addition.
(iii) Use the Fundamental Theorem of Arithmetic to prove that the additive group
ZŒx is isomorphic to the multiplicative group Q> of all positive rational
numbers.
Hint: Define 'W ZŒx ! Q> by
e e
'W e0 C e1 x C C en x n 7! p00 p11 pnen ;
I was a ten year old and one day I happened to be looking in my local
public library and I found a book on maths and it told a bit about the
history of this problem and I, a ten year old, could understand it. From
that moment I tried to solve it myselfI it was such a challenge, such a
beautiful problem. This problem was Fermat’s Last Theorem.
and
i i
“book2” — 2013/5/24 — 8:18 — page 397 — #417
i i
There’s no other problem that will mean the same to me. I had this very
rare privilege of being able to pursue in my adult life what had been my
childhood dream. I know it’s a rare privilege, but I know if one can do
this it’s more rewarding than anything one can imagine.
1 C .dy=dx/2 dx:
i i
“book2” — 2013/5/24 — 8:18 — page 398 — #418
i i
Finally, we rewrite the last integral using the tangent half-angle formula t D
tan.=2/ in Chapter 1 (so that d D 2dt=.1 C t 2 / and sin D 2t=.1 C t 2 /).
We obtain
Z p Z p
2 2 g.t/
a 1 E sin d D 2a dt;
.1 C t 2 /2
where g.t/ is a quartic polynomial. A similar integral arises from the arclength
of the hyperbola x 2 =a2 y 2 =b 2 D 1.
i i
“book2” — 2013/5/24 — 8:18 — page 399 — #419
i i
i i
“book2” — 2013/5/24 — 8:18 — page 400 — #420
i i
Ru Ru
In 1797, Gauss considered the elliptic integrals 0 p dt 3 and 0 p dt .
1 t 1 t4
He saw an analogy (as, most likely, did Fagnano and Euler) with
Z u
dt
sin 1 u D p ;
0 1 t2
and he inverted many elliptic integrals; after all, sin x is the inverse function
of sin 1 x. Nowadays, inverse functions of elliptic integrals are called elliptic
functions. Just as sin x is periodic, that is, sin.x C 2 / D sin x for all x, so,
too, are elliptic functions f ; there is some number p with f .x C p/ D f .x/
Rz
for all x. Gauss then studied complex elliptic integrals I.z/ D 0 pd g./
; their
inverse functions f .z/ D I 1 .z/ are called elliptic functions of a complex
variable. Gauss saw that complex elliptic functions are doubly periodic: there
are (noncollinear) complex numbers p and q with
f .z C mp C nq/ D f .z/
for all complex z and all m; n 2 Z. This fact has important geometric conse-
quences, both for elliptic functions and for complex variables in general. Alas,
Gauss never published these ideas, and they became known only later.
In 1823, Abel investigated elliptic functions, rediscovered many of Gauss’s
theorems, and proved new beautiful results about them. For example, just as
Gauss had found all n for which one can divide the circle into n equal arcs
using ruler and compass (n D 2m p1 pk , where m 0 and the pi are dis-
t
tinct primes of the form 22 C 1), Abel obtained the same result (for exactly
the same n) for the lemniscate. At the same time, Jacobi began his investiga-
tions of elliptic functions, further explaining and generalizing work of Euler
by introducing theta functions and modular curves.
a2 C b 2 D c 2 and
1
2
ab D n:
Since ab D 2n > 0, we Let’s loosen the constraints a bit and allow a, b, and c to be negative rational
have a ¤ 0 and b ¤ 0. It numbers as well. We’d like to replace the two equations in four unknowns with
follows that c ¤ 0, too.
a simpler set of constraints. We’ll see that the solution can be realized as the
search for rational points on a polynomial curve.
In Theorem 1.9, we We now turn the pair of defining equations into a single equation in two
reduced the defining pair variables. The equation a2 C b 2 D c 2 can be written as
(for n D 2) to a degree 4
equation in three variables. b2 D c2 a2 D .c a/.c C a/:
We’ll do a little better here.
Let k D c a, so that we have
b 2 D k.c C a/:
i i
“book2” — 2013/5/24 — 8:18 — page 401 — #421
i i
–5 5
i i
“book2” — 2013/5/24 — 8:18 — page 402 — #422
i i
a2 C b 2 D c 2 and ab D 2n
Proof. The calculation preceding the statement of the theorem shows that a
triple produces such a point on the graph. Going the other way, if .x; y/ is a
point on the graph with y ¤ 0, we can solve the system
bn
xD
c a
2n2
yD
c a
c D a2 C b 2
2
x2 n2
aD
y
2nx
bD
y
x 2 C n2
cD :
y
It is easily checked that this produces a triple of rational numbers of the desired
type.
Example 9.21. The correspondence between rational right triangles with in-
teger area and cubic curves allows us to generate infinitely many congruent
triangles with the same area from a given such triangle.
i i
“book2” — 2013/5/24 — 8:18 — page 403 — #423
i i
For example, on page 18, we saw that there are two rational right triangles
with area 5. One comes from a scaled copy of 4.9; 40; 41/ whose area is 5 62 .
To find the second one, it would take a very long time (even with a computer)
to find 4.2420640; 2307361; 3344161/ whose area is 5 7473482. But we can
use an idea related to the “sweeping lines” method of Diophantus. The rational
right triangle with side-lengths 23 ; 20
3
; 41
6
corresponds, via the formulas in
25 75
Corollary 9.20, to the point P D 4 ; 8 on the curve C defined by
y2 D x3 25x:
dy 3x 2 25
D :
dx 2y
Using this, we find that the slope of the tangent to C at P is 59=12, and hence
the tangent line to C at P has equation
59 25 75
yD x C :
12 4 8
y2 D x3 25x
59 25 75
yD x C ;
12 4 8
we get P and
1681 62279
P0 D ; :
144 1728
i i
“book2” — 2013/5/24 — 8:18 — page 404 — #424
i i
Exercises
9.15 Show that there are no rational points .x; y/ with y ¤ 0 on the graph of
(i) y 2 D x 3 x
(ii) y 2 D x 3 4x
9.16 Show that a cubic equation with rational coefficients and two rational roots has,
in fact, three rational roots.
9.17 Find a third rational right triangle with area 5, different from the two we found in
Example 9.21.
Elliptic Curves
The curves defined by the equation in Theorem 9.19,
y 2 D a cubic polynomial in x
show up all across mathematics. We just saw how they can be used to generate
congruent numbers. R p
Before that, we saw that the integral defining arcsine, dt= 1 t 2 , sug-
gested studying elliptic functions, the inverse functions of elliptic integrals.
Just as the unit circle is parametrized by sine and cosine (it consists of the
points .sin ; cos /), Gauss, Abel, and Jacobi considered curves parametrized
by elliptic functions; that is, curves consisting of the points .f .u/; f 0 .u//,
where f is an elliptic function (cosine is the derivative of sine). What sort of
curves are these? Expand the integrand of an elliptic integral as a power series
(since it has a denominator, the series begins with a negative power), and then
integrate term by term. There results a differential equation involving x D f
and y D f 0 , which turns out to be a cubic in two variables (see [9], pp. 17–
19). After some manipulations, one obtains a Weierstrass normal form for the
points .x; y/ on the curve y 2 D ax 3 C bx 2 C cx C d (there is another, simpler,
Weierstrass normal form, y 2 D 4x 3 g2 x g3 , where g2 ; g3 are constants).
This definition is not Definition. An elliptic curve over a field k is a curve C k 2 with equation
quite accurate, for an
elliptic curve is really an y 2 D g.x/;
equivalence class of such
curves.
where g.x/ D ax 3 C bx 2 C cx C d 2 kŒx has no repeated roots.
Curves over C are two- The most interesting elliptic curves are over C (for complex variables) or
dimensional surfaces when over Q (for number theory), while elliptic curves over finite fields Fq give rise
viewed over R.
to public access codes that are more secure than the RSA codes we discussed
in Chapter 4.
Elliptic functions and elliptic curves, whose humble origins are in arclength
problems, occur in analysis, geometry, and complex variables. In the previous
subsection, we saw that congruent numbers lead to rational points on elliptic
curves. More generally, let’s now see the connection with number theory and
with Fermat’s Last Theorem in particular.
i i
“book2” — 2013/5/24 — 8:18 — page 405 — #425
i i
i i
“book2” — 2013/5/24 — 8:18 — page 406 — #426
i i
P*Q
Q
P
P+Q
then the point of intersection is another rational point; denote such a point by
P P
(thus, the tangent line T intersects C in another rational point, say Q, and the
two points P; Q determine a third rational point).
If we are considering cubic curves C in the plane, then it is possible that a
line meets C in only one point, not three (a cubic in RŒx always has a real root,
This is a good reason to but its other roots may not be real). To make all work smoothly, we enlarge the
consider cubics over the plane to the Riemann sphere C b D R2 [ f1g, where we regard 1 as a point
complex numbers.
“at infinity.” We agree that lines through 1 are precisely the vertical lines; we
declare that 1 lies on every cubic C and that 1 is a rational point on C . Given
two points P; Q on the curve, the line they determine meets the curve in a third
rational point P Q. Define P CQ to be the intersection of C with the vertical
line V through P Q; that is, V is the line determined by the two rational
points 1 and P Q (see Figure 9.8). The wonderful discovery is that this
allows us to “add” points P; Q on elliptic curves (indeed, the set of all rational
points on an elliptic curve is an abelian group under this binary operation). In
particular, if C is the elliptic curve arising from the lemniscate (or any of the
elliptic functions considered by Euler), then the limits of integration in Euler’s
Addition Theorem are given by the chord–tangent construction: for example,
Z P Z Q Z P CQ
dt dt dt
p C p D p :
0 1 t4 0 1 t4 0 1 t4
As we have seen, congruent numbers n arising from rational side-lengths
.a; b; c/ of a right triangle correspond to rational points on the elliptic curve
y 2 D x 3 n2 x. The binary operation shows how to construct new congruent
numbers from given ones. The importance of this operation is illustrated by
Theorem 9.19.
What has this discussion to do with Fermat’s Last Theorem? The abelian
group of rational points on elliptic curves, an example of complex multipli-
cation, is only the beginning of deep connections between Diophantine equa-
tions and elliptic curves. The following account by the number theorist Andrew
Granville summarizes the recent history.
i i
“book2” — 2013/5/24 — 8:18 — page 407 — #427
i i
It all began in 1955, with a question posed by the Japanese mathe- We refer the reader to
matician Yutaka TaniyamaW Could one explain the properties of ellip- [31] for more about elliptic
curves and Diophantine
tic curves, equations of the form y 2 D x 3 C ax C b with a and b
equations. We also rec-
given whole numbers, in terms of a few well-chosen curves? That is, ommend the expository
is there some very special class of equations that in some way encap- article of Cox, Introduction
sulate everything there is to know about our elliptic curves‹ Taniyama to Fermat’s Last Theorem,
was fairly specific about these very special curves .the so-called modu- Amer. Math. Monthly 101
(1994), pp. 3–14, for more
lar curves) and, in 1968, André Weil, one of the leading mathematicians
details.
of the twentieth century, made explicit which modular curve should de-
scribe which elliptic curve. In 1971, the first significant proven evidence
in favor of this abstract understanding of equations was given by Goro
Shimura at Princeton University, who showed that it works for a very
special class of equations. This somewhat esoteric proposed approach to
understanding elliptic curves is now known as the Shimura–Taniyama–
Weil conjecture. There the matter stood until 1986, when Gerhard Frey
made the most surprising and innovative link between this very abstract
conjecture and Fermat’s Last Theorem. What he realized was that if
c n D an C b n , then it seemed unlikely that one could understand the
equation y 2 D x.x an /.x C b n / in the way proposed by Taniyama.
It took deep and difficult reasoning by Jean-Pierre Serre and Ken Ribet
to strengthen Frey’s original concept to the point that a counterexam-
ple to Fermat’s Last Theorem would directly contradict the Shimura–
Taniyama–Weil conjecture.
This is the point where Wiles enters the picture. Wiles drew together a
vast array of techniques to attack this question. Motivated by extraor-
dinary new methods of Victor Kolyvagin and Barry Mazur, Wiles estab-
lished the Shimura–Taniyama–Weil conjecture for an important class of
examples, including those relevant to proving Fermat’s Last Theorem.
His work can be viewed as a blend of arithmetic and geometry, and
has its origins way back in Diophantus’s Arithmetic. However he em-
ploys the latest ideas from a score of different fields, from the theories
of L-functions, group schemes, crystalline cohomology, Galois repre-
sentations, modular forms, deformation theory, Gorenstein rings, Euler
systems and many others. He uses, in an essential way, concepts due to
many mathematicians from around the world who were thinking about
very different questions.
The work of Wiles is a tour de force, and will stand as one of the sci-
entific achievements of the century. His work is not to be seen in iso-
lation, but rather as the culmination of much recent thinking in many
directions. Wiles’ proof, starting from scratch, would surely be over a
thousand pages long.
The story of this important discovery is a tribute to the deeper and more
abstruse levels of abstract understanding that mathematicians have long
claimed is essential. Many of us, while hailing Wiles’ magnificent achieve-
ment, yearn for Fermat to have been correct, and for the truly marvel-
lous, and presumably comparatively straightforward, proof to be recov-
ered.
i i
“book2” — 2013/5/24 — 8:18 — page 408 — #428
i i
i i
“book2” — 2013/5/24 — 8:18 — page 409 — #429
i i
A Appendices
A.1 Functions
Pick up any calculus book; somewhere near the beginning is a definition of
function which reads something like this: a function f is a rule that assigns to
each element a in a set A exactly one element, denoted by f .a/, in a set B. Ac-
tually, this isn’t too bad. The spirit is right: f is dynamic; it is like a machine,
whose input consists of elements of A and whose output consists of certain
elements of B. The sets A and B may be made up of numbers, but they don’t
have to be.
There is a slight notational surprise. We are used to writing a function, not
as f , but as f .x/. For example, integrals are written
Z
f .x/ dx:
Logically, one notation for a function, say f , and another for its value at a
point a in A, say f .a/, does make sense. However, some notation is grand-
fathered in. For example, we will continue to write polynomials as f .x/ D
an x n C an 1 x n 1 C C a0 , trigonometric functions as sin x and cos x, and
the exponential as e x (but some authors denote the exponential function by
exp). Still, the simpler notation f is usually a good idea.
One problem we have with the calculus definition of function involves the
word rule. To see why this causes problems, we ask when two functions are
equal. If f is the function f .x/ D x 2 C 2x C 1 and g is the function g.x/ D
.x C 1/2 , is f D g? We usually think of a rule as a recipe, a set of directions.
With this understanding, f and g are surely different: f .5/ D 25 C 10 C 1 and
g.5/ D 62 . These are different recipes, but note that both cook the same dish:
for example, f .5/ D 36 D g.5/.
A second problem with the calculus definition is just what is allowed to be
a rule. Must a rule be a formula? If so, then f .x/, defined by
(
1 if x is rational
f .x/ D
0 if x is irrational,
is not a function. Or is it? The simplest way to deal with these problems is to
avoid the imprecise word rule.
If A is a set, then we write
a 2 A;
which abbreviates “a belongs to A” or “a is an element of A.”
409
i i
“book2” — 2013/5/24 — 8:18 — page 410 — #430
i i
i i
“book2” — 2013/5/24 — 8:18 — page 411 — #431
i i
Let’s look at more examples before resolving the first problem arising from
the imprecise term rule.
f W a 7! b
Let’s return to our first complaint about rules; when are two functions equal?
Proposition A.2 resolves the first problem arising from the term rule: if
f; gW R ! R are given by f .x/ D x 2 C 2x C 1 and g.x/ D .x C 1/2 , then
f D g because f .a/ D g.a/ for every number a.
Let us clarify another point. Can functions f W A ! B and gW A0 ! B 0 be
equal? Here is the commonly accepted usage.
i i
“book2” — 2013/5/24 — 8:18 — page 412 — #432
i i
We must analyze f to find its image, and this is no small task. But if targets
have to be images, then we couldn’t even write down f W R ! R without
See the discussion on having first found the image of f . Thus, targets are convenient to use.
page 437 for a more If A is a subset of a set B, the inclusion i W A ! B is the function given by
sophisticated reason why
i.a/ D a for all a 2 A; that is, i is the subset of A B consisting of all .a; a/
targets are important.
with a 2 A. If S is a proper subset of a set A, then the inclusion i W S ! A is
not the identity function 1S because its target is A, not S ; it is not the identity
function 1A because its domain is S , not A.
Instead of saying that the values of a function f are unique, we sometimes
say that f is single-valued or that it is well-defined.
p For example, if R de-
notes the set ofpnonnegative reals, then W R ! R is a function because
we agree that
p a 0 for every nonnegative number a. On the other hand,
g.a/ D ˙ a is not single-valued, and hence it is not a function. The sim-
plest way to verify whether an alleged function f is single-valued is to phrase
uniqueness of values as an implication:
a c ad C bc
C D :
b d bd
But fractions have many names. If a=b D a0 =b 0 and c=d D c 0 =d 0 , is .ad C
bc/=bd D .a0 d 0 C b 0 c 0 /=b 0 d 0 ? We verified that this formula does not depend
on the choices of names of the fractions on page 193. On the other hand, the
operation
a c aCc
D
b d bd
1
is not well-defined: 2
D 24 , but 1
2
3
4
D 48 , while 2
4
3
4
D 5
16
¤ 84 .
There is a name for functions whose image is equal to the whole target.
i i
“book2” — 2013/5/24 — 8:18 — page 413 — #433
i i
im f D B:
f .a/ D 6a C 4:
6a C 4 D b‹
6a C 4
f .a/ D :
2a 3
To see whether f is a surjection, we seek, given b, a solution a: can we
solve
6a C 4
b D f .a/ D ‹
2a 3
This leads to the equation a.6 2b/ D 3b 4, which can be solved
for a if 6 2b ¤ 0 (note that . 3b 4/=.6 2b/ ¤ 3=2). On the other
hand, it suggests that there is no solution when b D 3 and, indeed, there is
not: if .6a C 4/=.2a 3/ D 3, cross multiplying gives the false equation
6a C 4 D 6a 9. Thus, 3 … im f , and f is not a surjection (in fact,
im f D R f3g). N
i i
“book2” — 2013/5/24 — 8:18 — page 414 — #434
i i
i i
“book2” — 2013/5/24 — 8:18 — page 415 — #435
i i
Proof. If a 2 A, then
1B ı f W a 7! f .a/ 7! f .a/
and
f ı 1A W a 7! a 7! f .a/:
i i
“book2” — 2013/5/24 — 8:18 — page 416 — #436
i i
2 b
X 3 c Y
4 d
5 e
i i
“book2” — 2013/5/24 — 8:18 — page 417 — #437
i i
Proof. Suppose that f .a/ D f .a0 /. Apply g to obtain g.f .a// D g.f .a0 //;
that is, a D a0 (because g.f .a// D a), and so f is injective. If x 2 X, then
x D g.f .x//, so that x 2 im g; hence g is surjective.
Proof. If f has an inverse g, then Lemma A.8 shows that f is injective and
surjective, for both composites g ı f and f ı g are identities.
Assume that f is a bijection. Let y 2 Y . Since f is surjective, there is some
a 2 X with f .a/ D y; since f is injective, a is unique. Defining g.y/ D a
thus gives a (single-valued) function whose domain is Y (g merely “reverses
arrows;” since f .a/ D y, there is an arrow from a to y, and the reversed arrow
goes from y to a). It is plain that g is the inverse of f ; that is, f .g.y// D
f .a/ D y for all y 2 Y and g.f .a// D g.y/ D a for all x 2 X.
a ı ıa W r 7! a1 r 7! a 1a r D r I
f .n/ D n C 1I
(
0 if n D 0
g.n/ D
n 1 if n 1.
i i
“book2” — 2013/5/24 — 8:18 — page 418 — #438
i i
What are the domains and function. For example, if R> denotes the positive real numbers, letPus show
targets of log and exp? that the exponential function f W R ! R> , defined by f .x/ D e x D x n =nŠ,
is a bijection. A direct proof that f is injective would require showing that if
e a D e b , then a D b; a direct proof showing that f is surjective would involve
showing that every positive real number c has the form e a for some a. It is
simplest to prove these statements using the (natural) logarithm g.y/ D log y.
The usual formulas e log y D y and log e x D x show that both composites f ıg
and g ı f are identities, and so f and g are inverse functions. Therefore, f is
a bijection, for it has an inverse.
The next theorem summarizes some results of this section. If X is a nonempty
set, define the symmetric group on X:
SX D fbijections W X ! Xg:
Proof. Exercise A.12(iii) on page 420 says that the composite of two bijec-
tions is itself a bijection, and so composition has target SX . Part (i) is Proposi-
tion A.5, part (ii) is Proposition A.6, and part (iii) is Proposition A.9.
Exercises
A.1 True or false, with reasons.
(i) If S T and T X , then S X .
(ii) Any two functions f W X ! Y and gW Y ! Z have a composite f ı gW X ! Z.
(iii) Any two functions f W X ! Y and gW Y ! Z have a composite g ı f W X ! Z.
(iv) For every set X , we have X ¿ D ¿.
(v) If f W X ! Y and j W im f ! Y is the inclusion, then there is a surjection
gW X ! im f with f D j ı g.
(vi) If f W X ! Y is a function for which there is a function gW Y ! X with
f ı g D 1Y , then f is a bijection.
(vii) The formula f .a=b/ D .a C b/.a b/ is a well-defined function Q ! Z.
(viii) If f W N ! N is given by f .n/ D n C 1 and gW N ! N is given by g.n/ D
n2 , then the composite g ı f is n 7! n2 .n C 1/.
(ix) Complex conjugation z D a C i b 7! z D a i b is a bijection C ! C.
Hint: (i) True. (ii) False. (iii) True. (iv) True. (v) True. (vi) False. (vii) False.
(viii) False. (ix) True.
A.2 * Let A and B be sets, and let a 2 A and b 2 B. Define their ordered pair as
follows:
.a; b/ D fa; fa; bgg:
i i
“book2” — 2013/5/24 — 8:18 — page 419 — #439
i i
Hint: In any formal treatment, one is obliged to define new terms carefully. In
particular, in set theory, one must discuss the membership relation 2. Does x 2 x
make sense? If it does, is it ever true? One of the axioms constraining 2 is that
the statement a 2 x 2 a is always false.
A.3 Let L D f.x; x/ W x 2 Rg; thus, L is the line in the plane that passes through the
origin and makes an angle of 45ı with the x-axis.
(i) If P D .a; b/ is a point in the plane with a ¤ b, prove that L is the
perpendicular-bisector of the segment PP 0 having endpoints P D .a; b/
and P 0 D .b; a/.
Hint: You may use Lemma 3.16 and the fact that . 21 .a C c/; 12 .b C d // is
the midpoint of the line segment having endpoints .a; b/ and .c; d /.
f.b; a/ W .a; b/ 2 f g:
i i
“book2” — 2013/5/24 — 8:18 — page 420 — #440
i i
i i
“book2” — 2013/5/24 — 8:18 — page 421 — #441
i i
We now display the equivalence classes arising from the equivalence rela-
tions in Example A.14.
i i
“book2” — 2013/5/24 — 8:18 — page 422 — #442
i i
It is instructive to compare rational numbers and vectors, for both are de-
fined as equivalence classes. Every rational a=b has a “favorite” name—its ex-
pression in lowest terms; every vector has a favorite name—an arrow .O; Q/
with its foot at the origin O. Although it is good to have familiar favorites,
working with fractions in lowest terms is not always convenient; for example,
even if both a=b and c=d are in lowest terms, their sum .ad C bc/=bd and
product ac=bd may not be. Similarly, it is not always best to think of vectors
as arrows with foot at the origin. Vector addition is defined by the parallelo-
! ! !
gram law (see Figure A.2): OP C OQ D OR, where O, P , Q, and R are the
! !
vertices of a parallelogram. But OQ D PR, because .O; Q/ .P; R/, and it
! ! ! ! !
is more natural to write OP C OQ D OP C PR D OR.
R = (a+c,b+d )
P = (a,b)
Q = (c,d )
The next lemma says that we can replace equivalence by honest equality at
the cost of replacing elements by their equivalence classes.
We are now going to prove that equivalence relations and partitions are
merely different ways of viewing the same thing.
i i
“book2” — 2013/5/24 — 8:18 — page 423 — #443
i i
Example A.19. (i) If is the identity relation on a set X, then the blocks
are the one-point subsets of X.
(ii) Let X D Œ0; 2 , and define the partition of X whose blocks are f0; 2 g
and the singletons fxg, where 0 < x < 2 . This partition identifies the
endpoints of the interval (and nothing else), and so we may regard this as
a construction of a circle. N
Exercises
A.14 Let X D frock; paper; scissorsg. Recall the game whose rules are: paper domi-
nates rock, rock dominates scissors, and scissors dominates paper. Draw a subset
of X X showing that domination is a relation on X .
A.15 Which of the following relations are equivalence relations? State your reasons.
(i) The relation on R.
(ii) The relation R on Z given by m R n if m n is odd.
(iii) The relation R on Z given by m R n if m n is even.
(iv) The relation on a group of people of having a common friend.
A.16 Let f W X ! Y be a function. Define a relation on X by x x 0 if f .x/ D f .x 0 /.
Prove that is an equivalence relation. If x 2 X and f .x/ D y, the equivalence
class Œx is denoted by f 1 .y/; it is called the fiber over y.
A.17 (i) Find the error in the following argument that claims to prove that a symmet-
ric and transitive relation R on a set X must be reflexive; that is, R is an
i i
“book2” — 2013/5/24 — 8:18 — page 424 — #444
i i
The elements of V are called vectors and the elements of k are called
scalars. It is not difficult to prove that the vector v in the third axiom of
addition is equal to the scalar product . 1/v.
Etymology. The word vector comes from the Latin word meaning “to carry;”
vectors in Euclidean space carry the data of length and direction. The word
scalar comes from regarding v 7! av as a change of scale. The terms scale
i i
“book2” — 2013/5/24 — 8:18 — page 425 — #445
i i
and scalar come from the Latin word meaning “ladder,” for the rungs of a
ladder are evenly spaced.
i i
“book2” — 2013/5/24 — 8:18 — page 426 — #446
i i
Example A.21. (i) The extreme cases U D V and U D f0g (where f0g de-
notes the subset consisting of the zero vector alone) are always subspaces
of a vector space. A subspace U V with U ¤ V is called a proper sub-
space of V ; we may write U ¨ V to denote U being a proper subspace
of V .
(ii) If v D .a1 ; : : : ; an / is a nonzero vector in Rn , then the line through the
origin,
` D fav W a 2 Rg;
is a subspace of Rn .
Similarly, a plane through the origin consists of all vectors of the form
av1 C bv2 , where v1 ; v2 is a fixed pair of noncollinear vectors, and a; b
vary over R. It is easy to check that planes through the origin are sub-
spaces of Rn .
(iii) If k is a field, then a homogeneous linear system over k of m equations
in n unknowns is a set of equations
a11 x1 C C a1n xn D0
a21 x1 C C a2n xn D0
:: ::
: :
am1 x1 C C amn xn D 0;
3x 2y C z 1 mod 7
x C y 2z 0 mod 7
x C 2y C z 4 mod 7
i i
“book2” — 2013/5/24 — 8:18 — page 427 — #447
i i
can be regarded as a system of equations over the field F7. The system
can be solved just as in high school, for inverses mod 7 are now known:
Œ2Œ4 D Œ1; Œ3Œ5 D Œ1; Œ6Œ6 D Œ1. The solution is
'W f1; 2; : : : ; ng ! V;
with '.i / D vi for all i . Thus, the subset im ' is ordered in the sense that
there is a first vector v1 , a second vector v2 , and so forth. A vector may appear
several times on a list; that is, ' need not be injective.
Definition. Let V be a vector space over a field k. A k-linear combination of We often write linear
a list v1 ; : : : ; vn in V is a vector v of the form combination instead of
k-linear combination if it
v D a1 v1 C C an vn ; is clear where the scalar
coefficients live.
where ai 2 k for all i .
i i
“book2” — 2013/5/24 — 8:18 — page 428 — #448
i i
Proof. Part (i) is routine. For part (ii), let X D fv1 ; : : : ; vm g and let S denote
the family of all the subspaces of V containing X; we claim that
\
S D Span hv1 ; : : : ; vmi :
S 2S
See Exercise A.24 on page 440 to see other properties of a list that do not
depend on the ordering of its vectors.
T
If X D ¿, then Span hXi D S 2S S , where S is the T family of all the
subspaces of V containing X. Now f0g Span h¿i D S 2S S , for f0g is
contained in every subspace S of V . For the reverse inclusion, one of the sub-
spaces
T S of V occurring in the intersection is f0g itself, and so Span h¿i D
S V S f0g. Therefore, Span h¿i D f0g.
Example A.24. (i) Let V D R2 , let e1 D .1; 0/, and let e2 D .0; 1/. Now
V D Span he1 ; e2i, for if v D .a; b/ 2 V , then
v D .a; 0/ C .0; b/
D a.1; 0/ C b.0; 1/
D ae1 C be2 2 Span he1 ; e2 i :
The following definition makes sense even though we have not yet defined
dimension.
Part (ii) of Example A.24 shows that k n is finite-dimensional, while part (iii)
shows that kŒx is infinite-dimensional. Now C is a vector space over R, and it
i i
“book2” — 2013/5/24 — 8:18 — page 429 — #449
i i
1
X
vi D ai aj vj : (A.1)
j ¤i
Deleting vi gives a shorter list that still spans V : write any v 2 V as a lin-
ear combination of all the vj (including vi ); now substitute the expression
Eq. (A.1) for vi and collect terms.
i i
“book2” — 2013/5/24 — 8:18 — page 430 — #450
i i
Linear independence has been defined indirectly, as not being linearly de-
pendent. Because of the importance of linear independence, let us define it
directly.
Proof. These are just the contrapositives of (i) ) (iii) and (iii) ) (i) in Propo-
sition A.25.
Thus, bases are shortest spanning lists. Of course, all the vectors in a linearly
independent list v1 ; : : : ; vn are distinct, by Example A.26(iv).
c 1 e1 D .c1 ; 0; 0; : : : ; 0/
C c2 e2 D C .0; c2 ; 0; : : : ; 0/
:: ::
: :
C cn en D C .0; 0; 0; : : : ; cn /
0 D .c1 ; c2; : : : ; cn /:
i i
“book2” — 2013/5/24 — 8:18 — page 431 — #451
i i
v D a1 v1 C a2 v2 C C an vn ;
i i
“book2” — 2013/5/24 — 8:18 — page 432 — #452
i i
vector spaces also have bases. For example, it turns out that a basis of kŒx is
1; x; x 2; : : : ; x n ; : : : .
We can now prove invariance of dimension, perhaps the most important
result about vector spaces.
vi D ai1 u1 C C ai n un
Each vi0 is a linear combination of the u’s, and the coefficient of u1 is ai1
.ai1 a111 /a11 D 0. Since m 1 > n 1, the inductive hypothesis gives scalars
b2 ; : : : ; bm , not all 0, with
b2 v20 C C bm vm
0
D 0:
The following familiar fact illustrates the intimate relation between linear
algebra and systems of linear equations.
˛11 x1 C C ˛1n xn D 0
:: :: ::
: : :
˛m1 x1 C C ˛mn xn D 0
i i
“book2” — 2013/5/24 — 8:18 — page 433 — #453
i i
ˇ1 c1 C C ˇn cn D 0:
v D c1 v1 C C cn vn ;
where ci 2 k for all i . There are q choices for each ci , and so there are q n
vectors in V .
Example A.35. (i) Example A.28 shows that k n has dimension n, which
agrees with our intuition when k D R: the plane R R is 2-dimensional,
and R3 is 3-dimensional!
(ii) If V D f0g, then dim.V / D 0, for there are no elements in its basis ¿.
(This is another good reason for defining ¿ to be linearly independent.)
(iii) Let I be a finite set with n elements. Define
k I D ffunctions f W I ! kg:
f C f 0 W i 7! f .i / C f 0 .i /
af W i 7! af .i /
i i
“book2” — 2013/5/24 — 8:18 — page 434 — #454
i i
(see Exercise A.18(i) on page 439). It is easy to check that the set of n
functions of the form fi , where i 2 I , defined by
(
1 if j D i
fi .j / D
0 if j ¤ i
Proof. If v1 ; : : : ; vn is not a basis, then it does not span V , for this list is lin-
early independent. Thus, there is w 2 V with w … Span hv1 ; : : : ; vn i. But the
longer list v1 ; : : : ; vn ; w is linearly independent, by Proposition A.25, contra-
dicting X being a longest linearly independent list.
The converse of Lemma A.36 is true; bases are longest linearly independent
lists. This follows from the next proposition, which is quite useful in its own
right.
Proof. If the linearly independent list Z does not span V , there is vmC1 2 V
with vmC1 … Span hu1 ; : : : ; um i, and the longer list u1 ; : : : ; um ; vmC1 is lin-
early independent, by Proposition A.25. If u1 ; : : : ; um ; vmC1 does not span V ,
there is vmC2 2 V with vmC2 … Span hu1 ; : : : ; um; vmC1 i. Since dim.V / D n,
Lemma A.31 says that the length of these lists can never exceed n, and so this
process of adjoining elements vmC1 ; vmC2 ; : : : must stop. But the only reason
a list stops is that it spans V ; hence, it is a basis.
Proof. Lemma A.36 shows that longest linearly independent lists are bases.
Conversely, if X is a basis, it must be a longest linearly independent list: other-
wise, Proposition A.37 says we could lengthen X to obtain a basis of V which
is too long.
We now paraphrase Lemma A.31.
i i
“book2” — 2013/5/24 — 8:18 — page 435 — #455
i i
Proof. Otherwise, the list could be extended to a basis having too many ele-
ments.
Proof. (i) Were the list linearly dependent, then it could be shortened to give
a basis, and this basis is too small.
(ii) If the list does not span, then it could be lengthened to give a basis, and
this basis is too large.
Linear Transformations
Linear transformations are homomorphisms of vector spaces; they are really
much more important than vector spaces, but vector spaces are needed in order
to define them, and bases of vector spaces are needed to describe them by
matrices. (You are surely familiar with the next definition, at least for k D R.)
i i
“book2” — 2013/5/24 — 8:18 — page 436 — #456
i i
i i
“book2” — 2013/5/24 — 8:18 — page 437 — #457
i i
T .v/ D Av;
i i
“book2” — 2013/5/24 — 8:18 — page 438 — #458
i i
dim.V / D n D dim.V /:
Proof. A basis of V and its dual basis have the same number of elements.
Proof. We saw this in the proof of Proposition A.47, when showing that the
dual basis spans V .
The next result shows that dual spaces are intimately related to transposing
matrices. If A D Œaij is an m n matrix, then its transpose A> is the n m
matrix Œaj i whose ij entry is aj i . In words, for each i , the i th row ai1 ; : : : ; ai n
of A is the i th column of A> (and, necessarily, each j th column of A is the
j th row of A> ).
i i
“book2” — 2013/5/24 — 8:18 — page 439 — #459
i i
Exercises
A.18 (i) * If k is a field, c 2 k, and f W k ! k is a function, define a new function
cf W k ! k by a 7! cf .a/. With this definition of scalar multiplication,
prove that the commutative ring k k of all functions on k is a vector space
over k (see Example A.35(iii)).
(ii) Prove that Poly.k/, the set of all polynomial functions k ! k, is a subspace
of k k .
A.19 If the only subspaces of a vector space V are f0g and V itself, prove that
dim.V / 1.
A.20 Prove, in the presence of all the other axioms in the definition of vector space,
that the commutative law for vector addition is redundant; that is, if V satisfies
all the other axioms, then u C v D v C u for all u; v 2 V .
Hint: If u; v 2 V , evaluate Œ. v/ C . u/ in two ways.
i i
“book2” — 2013/5/24 — 8:18 — page 440 — #460
i i
`i D f˛vi C ui W ˛ 2 Rg;
fu C ˛w W ˛ 2 Rg R3 ;
where w is a fixed nonzero vector. Show that every line through the origin is
a one-dimensional subspace of R3.
(ii) In calculus, a plane in space passing through a point u is defined as the subset
fv 2 R3 W .v u/ n D 0g R3 ;
where n ¤ 0 is a fixed normal vector. Prove that a plane through the origin
is a two-dimensional subspace of R3.
If the origin .0; 0; 0/ lies on a plane H , then u D 0 and
H D fv D .x; y; z/ 2 R3 W v n D 0g;
where n D .˛; ˇ;
/ is a (nonzero) normal vector; that is, H is the set of all
vectors orthogonal to n.
A.27 If U and W are subspaces of a vector space V , define
U C W D fu C w W u 2 U and w 2 W g:
U ˚ W D f.u; w/ W u 2 U and w 2 W g;
with addition
.u; w/ C .u0 ; w0 / D .u C u0 ; w C w0 /
i i
“book2” — 2013/5/24 — 8:18 — page 441 — #461
i i
A.4 Inequalities
Many properties of inequality follow from a few basic properties. Denote the Recall that N is the set of
set of all positive real numbers by P (we do not regard 0 as positive). We all nonnegative integers,
assume the set P satisfies so that N D P [ f0g.
(i) a; b 2 P implies a C b 2 P
(ii) a; b 2 P implies ab 2 P
(iii) Trichotomy W If a is a number, then exactly one of the following is true:
a 2 P; a D 0; a 2 P:
The first two properties say that P is closed under addition and multiplica-
tion. We now define inequality.
Definition. Given real numbers a and b, we say that a is less than b, written Other notation: if a < b,
a < b, if b a 2 P ; we say that a is less than or equal to b, written a b, if we may write b > a and,
if b a, we may write
b a 2 N; that is, a < b or a D b.
a b.
Thus, a is positive if 0 < a (that is, a 2 P ), and a is negative if a < 0 (that
is, a 2 P ). Just to complete the
picture, a > b means
b < a (and a b means
Here are some standard properties of inequality.
b a).
Proof. We prove the first three parts; the last two proofs are similar and appear
in Exercise A.31 below.
(i) By definition, b < B means that B b 2 P.
Suppose that a > 0; that is, a 2 P . To show that ab < aB, we must
show that aB ab D a.B b/ 2 P , and this follows from Property (i)
of P .
If a < 0, then a 2 P . Therefore, . a/.B b/ 2 P , and so
. a/.B b/ D . 1/a.B b/ D a.b B/ 2 P:
i i
“book2” — 2013/5/24 — 8:18 — page 442 — #462
i i
(ii) The first part says that if b < B and both sides are multiplied by a positive
number, then the sense of the inequality stays the same. So, if B < 0, then
aB < a 0 D 0.
(iii) Suppose that b > 0. If b 1 < 0, then
1
1Dbb < b 0 D 0;
1
a contradiction. If b D 0 then
1
1 D bb D b 0 D 0;
1
another contradiction. Hence, Trichotomy gives b > 0.
Exercises
A.31 * Prove parts (iv) and (v) of Proposition A.51.
A.32 Prove, or disprove and salvage if possible. Suppose a, b, c, and d are real num-
“Disprove” here means bers.
“give a concrete counterex-
(i) If a < b, then a2 < b 2 .
ample.” “Salvage” means
“add a hypotheis to make it (ii) If a2 < b 2, then a < b.
true.” (iii) If a < b and c < d , then ac < bd .
(iv) If a3 > 0, then a > 0.
A.33 Does C have a subset P 0 like P ; that is, P 0 is closed under addition and multi-
plication, and it satisfies Trichotomy?
a .b c/ D .a b/ c:
Since we are told only how to combine two elements, there is a choice when
confronted with three elements: first combine b and c, obtaining b c, and
then combine this new element with a to get a .b c/, or first get a b
and then combine it with c to get .a b/ c. Associativity says that either
choice yields the same element of G. Thus, there is no confusion in writing
a b c without parentheses. In contrast, subtraction is not associative, for it
i i
“book2” — 2013/5/24 — 8:18 — page 443 — #463
i i
Proof. The proof is by induction on n 3. The base step holds because the
operation is associative. For the inductive step, consider two ultimate prod-
ucts U and V obtained from a given expression a1 a2 an after two series of
choices:
U D .a1 ai /.ai C1 an / and V D .a1 aj /.aj C1 an /I
the parentheses indicate the penultimate products displaying the last two fac-
tors that multiply to give U and V , respectively; there are many parentheses
inside each of the shorter expressions. We may assume that i j . Since each
of the four expressions in parentheses has fewer than n factors, the inductive
hypothesis says that each of them needs no parentheses. It follows that U D V
if i D j . If i < j , then the inductive hypothesis allows the first expression to
be rewritten as
U D .a1 ai / Œai C1 aj Œaj C1 an
i i
“book2” — 2013/5/24 — 8:18 — page 444 — #464
i i
Proof. In the first case, both elements arise from the expression having m C n
factors each equal to a; in the second case, both elements arise from the ex-
pression having mn factors each equal to a.
so, as long as your CAS can find the remainder when one polynomial is divided
by another, you can use it to perform “modular arithmetic” with polynomials
in Q.˛/.
If p is a prime in Z and D cos.2=p/ C i sin.2=p/, we know that
irr.; Q/ D 1 C x C x 2 C C x p 1
:
Phi(x,p):= sum(xˆk,k,0,p-1)
Suppose that your CAS command for polynomial remainder is pmod. For ex-
ample,
i i
“book2” — 2013/5/24 — 8:18 — page 445 — #465
i i
pmod(xˆ3+4xˆ2-3x+1,xˆ2+1)
The two functions, Phi and pmod, allow us to calculate in Q./. Let’s look
at some examples.
Eisenstein Integers
Arithmetic with complex numbers is built into most CAS environments, so that
you can do calculations with Gaussian integers right away. Arithmetic with
Eisenstein integers isn’t usually built in, but you can build a model of ZŒ! by
thinking of an Eisenstein integer as a congruence class mod x 2 C x C 1:
or even
add(f,g) = cl(f+g)
mult(f,g) = cl(fg).
So, now we can compute: to find, for example, 3! 5 ! 2 C 1, you want the
class of 3x 5 x 2 C 1 mod x 2 C x C 1
cl (3xˆ5-xˆ2+1)
> -2x-1
3! 5 !2 C 1 D 1 2!:
Your model can do generic calculations, giving the rules for addition and mul-
tiplication in ZŒ!:
add(a+b*x,c+d*x)
> a+c+(b+d)*x
mult(a+b*x,c+d*x)
> a*c-b*d + (a*d+b*c-b*d)*x
mult(3+2*x,3+2*x)
> 5+8*x
mult(5+x,5+x)
i i
“book2” — 2013/5/24 — 8:18 — page 446 — #466
i i
> 24+9*x
mult(4+3*x,4+3*x)
> 7+15*x
Symmetric Polynomials
In Example 9.6 on page 388, we derived the cubic formula via symmetric
polynomials. There, we defined
s D ˛1 C ˛2 ! C ˛3 ! 2
u D ˛1 C ˛2 ! 2 C ˛3!;
s 3 C u3 D 27˛1˛2 ˛3 :
add((a+b*x+c*xˆ2)ˆ3,(a+b*xˆ2+c*x)ˆ3)
> 2*aˆ3-3*aˆ2*b+aˆ2*c-3*a*bˆ2-12*a**b*c-3*a*cˆ2+2*
bˆ3-3*bˆ2*c-3*b*cˆ2+2*cˆ3
factor(2*aˆ3-3*aˆ2*b+aˆ2*c-3*a*bˆ2-12*a**b*c-3*a*
cˆ2+2*bˆ3-3*bˆ2*c-3*b*cˆ2+2*cˆ3)
> (a+b-2*c)*(a-2*b+c)*(2*a-b-c)
mod is the CAS built-in A CAS model follows the syntax pretty closely:
“mod” function.
n(e,k):=sum(xˆ(mod(3ˆ(k+e*j),17)),j,0,(16/e)-1).
i i
“book2” — 2013/5/24 — 8:18 — page 447 — #467
i i
n(2,0)
> xˆ16+xˆ15+xˆ13+xˆ9+xˆ8+xˆ)+xˆ2+x
n(2,1)
> xˆ14+xˆ12+xˆ11+xˆ10+xˆ7+xˆ6+xˆ5+xˆ3
x2 C x 4:
Exercises
A.34 Find a polynomial in QŒx that has roots 0;k ; 1;k ; 3;k ; 4;k :
i i
“book2” — 2013/5/24 — 8:18 — page 448 — #468
i i
i i
“book2” — 2013/5/24 — 8:18 — page 449 — #469
i i
References
i i
“book2” — 2013/5/24 — 8:18 — page 450 — #470
i i
450 References
[16] Heath, T. L., The Thirteen Books of Euclid’s Elements, Cambridge Uni-
versity Press, 1926; Dover reprint, Mineola NY, 1956.
[17] Ireland, K. and Rosen, M., A Classical Introduction to Modern Number
Theory, Springer-Verlag, New York, 1982.
[18] Koblitz, N., A Course in Number Theory and Cryptography, Springer-
Verlag, New York, 1987.
[19] ——, Introduction to Elliptic Curves and Modular Forms, Springer Ver-
lag, New York, 1993.
[20] Loomis, E. S., The Pythagorean Proposition, Edwards Brothers., Ann
Arbor, 1940.
[21] Montgomery, S., and Ralston, E. W., Selected Papers in Algebra, Ray-
mond W. Brink Selected Mathematical Papers, Vol. 3, Mathematical As-
sociation of America, Washington, 1977.
[22] Needham, T., Visual Complex Analysis, Clarendon Press, Oxford, 1997.
[23] Ribenboim, P., Thirteen Lectures on Fermat’s Last Theorem, Springer–
Verlag, New York, 1979.
[24] Rosen, K. H., Elementary Number Theory and Its Applications, 4th ed.,
Addison-Wesley, Reading, MA, 2000.
[25] Rotman, J. J., Advanced Modern Algebra, 2d ed., Graduate Studies
in Mathematics vol. 114, American Mathematical Society, Providence,
2010.
[26] ——, A First Course in Abstract Algebra, 3d ed., Prentice Hall, Upper
Saddle River NJ, 2006.
[27] ——, Galois Theory, 2d ed., Springer-Verlag, New York, 1998.
[28] ——, Journey into Mathematics, Prentice Hall, Upper Saddle River NJ,
1998; Dover reprint, Mineola NY, 2007.
[29] Samuel, P., Algebraic Theory of Numbers, Houghton-Mifflin, Boston,
1992.
[30] Siegel, C. L., Topics in Complex Function Theory Vol. I, Elliptic Func-
tions and Uniformization Theory, Wiley, New York, 1969.
[31] Silverman, J. H., and Tate, J., Rational Points on Elliptic Curves,
Springer–Verlag, New York, 1992.
[32] Stillwell, J., Mathematics and Its History, 3d ed., Springer, New York,
2010.
[33] Tignol, J.-P., Galois’ Theory of Equations, World Scientific Publishing,
Singapore, 2001.
[34] van der Waerden, B. L., Science Awakening, John Wiley, New York,
1963.
[35] ——, Geometry and Algebra in Ancient Civilizations, Springer–Verlag,
New York, 1983.
[36] Washington, L.C., Introduction to Cyclotomic Fields, Springer, New
York, 1982.
[37] Weil, A., Number Theory: An Approach Through History. From Ham-
murapi to Legendre, Birkhäuser, Boston, 1984.
[38] Weyl, H., Algebraic Theory of Numbers, 6th printing, Princeton Univer-
sity Press, Princeton, 1971; Princeton Landmarks in Mathematics and
Physics Series, 1998.
[39] Wilf, H., Generatingfunctionology, Academic Press, New York, 1994.
i i
“book2” — 2013/5/24 — 8:18 — page 451 — #471
i i
Index
i i
“book2” — 2013/5/24 — 8:18 — page 452 — #472
i i
452 Index
i i
“book2” — 2013/5/24 — 8:18 — page 453 — #473
i i
Index 453
i i
“book2” — 2013/5/24 — 8:18 — page 454 — #474
i i
454 Index
i i
“book2” — 2013/5/24 — 8:18 — page 455 — #475
i i
Index 455
i i
“book2” — 2013/5/24 — 8:18 — page 456 — #476
i i
456 Index
i i
“book2” — 2013/5/24 — 8:18 — page 457 — #477
i i
Index 457
UFD, 258
Uncle Ben, 176
i i
“book2” — 2013/5/24 — 8:18 — page 458 — #478
i i
i i
“book2” — 2013/5/24 — 8:18 — page 459 — #479
i i
Joseph Rotman was born in Chicago on May 26, 1934. He studied at the
University of Chicago, receiving the degrees BA, MA, and Ph.D. there in 1954,
1956, and 1959, respectively; his thesis director was Irving Kaplansky.
Rotman has been on the faculty of the mathematics department of the Uni-
versity of Illinois at Urbana-Champaign since 1959, with the following ranks:
Research Associate 1959–1961; Assistant Professor 1961–1963; Associate Pro-
fessor 1963–1968; Professor 1968–2004; Professor Emeritus 2004–present.
He has held the following visiting appointments: Queen Mary College, Lon-
don, England 1965, 1985; Aarhus University, Denmark, Summer 1970; He-
brew University, Jerusalem, Israel 1970; University of Padua, Italy, 1972; Tech-
nion, Israel Institute of Technology and Hebrew University, Jerusalem (Lady
Davis Professor), 1977–78; Tel Aviv University, Israel, 1982; Bar Ilan Univer-
sity, Israel, Summer 1982; Annual visiting lecture, South African Mathemati-
cal Society, 1985; Oxford University, England, 1990.
Professor Rotman was an editor of Proceedings of American Mathematical
Society, 1970, 1971; managing editor, 1972, 1973.
Aside from writing research articles, mostly in algebra, he has written the
following textbooks: Group Theory 1965, 1973, 1984, 1995; Homological Al-
gebra 1970, 1979, 2009; Algebraic Topology 1988; Galois Theory 1990, 1998;
Journey into Mathematics 1998, 2007; First Course in Abstract Algebra 1996,
2000, 2006; Advanced Modern Algebra 2002.
459
i i