Sylow Thms y Gset
Sylow Thms y Gset
Sylow Thms y Gset
The main topics considered here are group actions, the Sylow theorems, semi-direct
products, nilpotent and solvable groups, and simple groups.
See Preliminary remarks for some of the notation used here, especially regarding general
linear groups. Some further notation: [n] denotes the set of the first n natural numbers
1, 2, ..., n.
P k [n] denotes the set of k-element subsets of [n].
1 Group actions
1.1 Definition of a group action or G-set
Let G be a group, with identity element e. A left G-set is a set X equipped with a map
: G XX satisfying (i) (gh, x) = (g, (h, x)) for all g, h G and all x X, and (ii)
(e, x) = x for all x X. Usually we write either g x or simply juxtaposition gx for (g, x);
in the latter notation conditions (i) and (ii) become (gh)x = g(hx) and ex = x. We also call
this data a group action, or say that G acts on X (on the left).
Similarly a right G-set is a set X equipped with a map : X GX satisfying (in
the evident juxtaposition notation) x(gh) = (xg)h and xe = x. Of course the distinction
between left and right G-actions does not depend on whether we write the domain of as
G X or X G. The distinction is that in a left action gh acts by h first, then g, whereas
in a right action g acts first, then h.
Notice that up to this point, we havent even used the existence of inverses, so exactly
the same definition makes sense for left and right monoid actions. We will make little use
of monoid actions, however. One immediate advantage of the existence of inverses is that
any right action can be converted to a left action by setting g x = xg 1 ; similarly any left
action can be converted to a right action. Nevertheless it is important to pay close attention
to which side the group is acting on. If the side is not specified we always mean a left action
(an arbitrary choice on my part!). Any statement about left actions has a parallel statement
for right actions; we leave it to the reader to make the translation.
1
G-Set.
Finally, we recall a simple but very powerful counting formula. Suppose the group G
acts on the finite set X. Then
X X
|X| = |O| = [G : Gx ],
O xG\X
where the first sum is over the orbits O of the action and the second sum means, in a
mildly abusive notation, that we are taking a fixed representative x of each orbit. This
choice of x O is arbitrary, but the sum is nevertheless well-defined since [G : Gx ] = |O| is
independent of the choice.
2 Examples
1. The symmetric group Sn acts on the left of [n] := {1, 2, ..., n} by permutions. The action
is transitive, with the isotropy group of any point isomorphic to Sn1 .
More generally, if X is any set, we let P erm X denote the group of bijections XX.
Then by construction P erm X acts on the left of X by x = (x). It is a left action
because a composition acts by first, then .
2
3. If G is any group, the left conjugation action of G on itself is given by g x = gxg 1 .
Similarly right conjugation is defined by x g = g 1 xg. The fixed-point set of either action
is the center C(G) (Z(G) is another common notation for the center). The orbits are the
conjugacy classes of G. The isotropy group of x is CG x, the centralizer of x in G.
In this case the counting formula yields the class equation. To state it we need a notation
for conjugacy classes, andsadlywe have already assigned the letter C to centers and
centralizers. I will use the non-standard notation (x) to mean the conjugacy class of x, and
Conj G to mean the set of conjugacy classes. We then have
X X
|G| = |(x)| = [G : CG x].
xConj G xConj G
Once again the notation has the obvious interpretation: We are choosing one x from each
conjugacy class, and the choice doesnt matter.
4. If G is any group, let S(G) denote the set of subgroups of G. Then G acts on S(G)
by left conjugation: g H = gHg 1 (there is also a right conjugation, of course). The
fixed-points are the normal subgroups. The orbits are conjugacy classes of subgroups. The
isotropy group of H is the normalizer NG H of H in G.
5. Let X, Y be sets and let F (X, Y ) denote the set of functions XY . If Y is a left
G-set, we get a left G-action on F (X, Y ) by (g )(x) = g((x)). If X is a left G-set, we get
a right G-action on F (X, Y ) by ( g)(x) = (gx). Note carefully that this is a right action.
However, we can always convert it to a left action by (g ? )(x) = (g 1 x). If both X and
Y are G-sets, we can get a combined left action of G on F (X, Y ) by (g )(x) = g(g 1 x).
The fixed-point set of the combined left action gg 1 is the subset of G-equivariant
maps XY , as is easily checked.
6. Let X be a set, X n the n-fold Cartesian product of X with itself. Then Sn acts on
X n by permuting the coordinates. This is a right action, given explicitly by
7. Projective spaces. The purpose of this example is twofold. First of all, projective
spaces are ubiquitous in topology and geometryincluding especially algebraic geometry,
which leads me to discuss them in an algebra course. Second, in nature we are often first
confronted not with a group action or even a group, but with a set (or topological space,
etc.) X that may secretly be equipped with a useful group action and/or realization as the
orbit set of a group action. Its important to be able to recognize such structures.
Let F be a field, and let V be a finite dimensional vector space over F . (In fact the
finite-dimensionality isnt necessary, but I prefer to avoid distractions.) The projective space
P(V ) is the set of lines through the origin in V . Even though at the moment we are not giving
3
it any topology, we call it a space because that is the traditional term, and if you call it
the projective set you risk being sneered at as an ignorant yokel from the backcountry.
Thus P(V ) does not involve a group in its definition, but it is in fact crawling with groups.
First of all, it is the orbit set of the action of F on V {0} by scalar multiplication. This
is a very handy interpretation. Second, GL(V ) acts transitively on it: By elementary linear
algebra, any line can be moved to any other line by an invertible linear transformation.
This is a particularly important interpretation; it is often useful to recognize a set as a
transitive G-set (also known as a homogeneous space). To complete the picture we choose
a convenient point in the set and determine its isotropy group. Here there is no natural
choice of a line; we just pick one and call it L0 . The isotropy group H consists of invertible
transformations preserving L0 , i.e. the set of A GL(V ) such that A has an eigenvector in
=
L0 . Then there is a G-isomorphism GL(V )/H P(V ). If we want to be more explicit, we
choose a basis e1 , ..., en and take L0 = he1 i. Then H corresponds to the group of matrices
with ai1 = 0 for i > 1 (i.e. the first column is zero except for a11 ).
Example. X is itself a group, and G is acting on it via group automorphisms. In other words,
g (xy) = (g x)(g y) for all g G, x, y X. The action of G on itself by conjugation is an
action of this type. Actions via group automorphisms will be used to construct semi-direct
products later.
Example. We have a vector space V over a field F , and the G-action is linear: g (v + w) =
g v + g w, and g (cv) = cg v (c F ). This type of action is called a representation of
G over F. In this course representation will always be taken to mean finite dimensional
representation, unless otherwise specified. Note that GL(V ) acts linearly on V by definition;
we call this the standard representation of GL(V ).
Representation theory is one of the major branches of mathematics. Well consider
representation theory of finite groups in some detail, especially over C.
Example. Let F be a field, and suppose G acts on F via field automorphisms. This is
precisely the situation one studies in Galois theory.
4
There are many variants on this theme: groups acting on metric spaces by isometries,
smooth Lie groups actions on smooth manifolds (studied in our Manifolds course), sim-
plicial actions on simplicial complexes...all of these are dear to my heart, but lie outside the
scope an algebra course.
5
2. Suppose X1 , X2 are G-sets with the same underlying set X but with different G-actions.
Let 1 , 2 be the corresponding homomorphisms GP erm X. Then X1 is isomorphic to
X2 as a G-set if and only if 1 , 2 are conjugate as homomorphisms to P erm X. (Exercise.
By conjugate I mean there is a P erm X such that 1 1 = 2 . )
5 Transitive G-sets
Recall that a G-set X is transitive if there is only one orbit, and that in this case a choice
=
of x X yields an isomorphism of G-sets G/Gx X. Notice, however, that the choice of
x is arbitrary. This is true even for a free transitive action: then we get an isomorphism of
=
G-sets G X, where G has the left translation action, but there is no natural choice of
such an isomorphism; it depends on the choice of x.
The next result is basic.
Proposition 5.1 Let X be a transitive G-set. Then the isotropy groups form a complete
conjugacy class of subgroups of G.
Proof: Suppose x, y X. Choose g G with gx = y. Then gGx g 1 = Gy , as is readily
checked. Hence any two isotropy groups are conjugate. Conversely, let H be a subgroup
conjugate to Gx ; say gGx g 1 = H. Then H = Ggx , so every subgroup in the conjugacy class
occurs as an isotropy group.
Now suppose we have two subgroups K, H and we want to show that K is conjugate
to a subgroup of H, a problem that arises quite frequently. In the spirit of the previous
proposition we have at once:
Proposition 5.2 K is conjugate to a subgroup of H if and only if the left action of K on
G/H has a fixed point. More precisely, KxH = xH if and only if x1 Kx H.
Now lets consider the set of all orbits of the K-action on G/H. These are called the
(K, H)-double cosets, denoted K\G/H. We could just as well think of K\G/H as the H-
orbits of the right action on K\G, or more symmetrically as the orbits of the left KH-action
on G given by (k, h) g = kgh1 . More often, however, we stick with the first interpretation.
Here is an example known as the Bruhat decomposition:
Proposition 5.3 Let F be a field, and let B := Bn F . Then the (B, B)-double cosets of
GLn F are given by
a
GLn F = BwB,
wW
where W
= Sn is the Weyl group.
6
The proof is by old-fashioned row and column reduction, and is left to the reader. Whats
not obvious is why this particular way of arranging the row/column reduction is of interest. It
turns out that it plays an important role in the structure theory of a large class of interesting
groups (not just the general linear groups), and enters into algebraic geometry and topology
via Schubert varieties and Schubert cells. For example, when F = R or F = C then the
orbit set GLn F/B is a topological space known as a flag manifold or flag variety, and the
left B-orbitswhich by the proposition are indexed by elements of W are homeomorphic
to vector spaces over F and called Schubert cells. They are very useful for studying the
geometry and topology of flag manifolds, a subject which is in itself a major industry these
days. I mention all this just to pique your curiosity, and to suggest how the humble process
of row reduction connects with beautiful, deep mathematics.
Theorem 6.1 Let P be a finite p-group, S a finite P -set. Then |S| = |S P | mod p.
Proof: |S| = |O|, where O ranges over the P -orbits. Since P is a p-group, |O| is either
P
Corollary 6.2 If |S| is prime to p, then there is at least one fixed point.
The next application can be proved using the binomial theorem, but we can also deduce
it from Theorem 6.1.
n
Proposition 6.5 Let n = spk . Then pk
= s mod p.
7
Proof: Partition [n] into s disjoint subsets A1 , ..., As of size pk . Let Ci Sn be a cyclic
group of order pk that permutes Ai transitively and fixes the other Aj s pointwise, and take
P = C1 ... Cs . Then the action of P on P pk [n] has exactly s fixed points, namely
A1 , ..., As . Hence the proposition follows from the theorem.
Let G be a finite group, and write |G| = spk with s prime to p. A p-Sylow subgroup is
a subgroup P of order pk . The first Sylow theorem asserts that such subgroups always
exist.
Proof: The strategy is to look for a finite G-set S such that some isotropy group Gx is a
p-Sylow subgroup. Indeed, suppose we had a finite G-set S such that (i) |S| is prime to p;
and (ii) all isotropy groups Gx are p-groups. Then by (i) there is an orbit O with |O| prime
to p. Choose x O; then |Gx | = pi for some i by (ii). But |G| = |Gx | |O|, forcing i = k.
Hence Gx is a p-Sylow subgroup.
It remains to exhibit such an S. Take S to be the set of subsets of G of size pk , with
action induced by the left translation action of G on itself. Then (i) holds by Proposition 6.5.
Now let A S. Then GA acts freely on the elements of A (since the left translation action
is free), so |GA | divides pk , proving (ii).
The second Sylow theorem says that all p-Sylow subgroups are conjugate. More
precisely:
Proof: As discussed earlier, this is equivalent to saying that the action of H on G/P has
a fixed point: if HxP = xP , then x1 Hx P and conversely. But H is a p-group and
|G/P | = s is prime to p, so this follows immediately from Theorem 6.1.
The last item of business is to say something about the set of all p-Sylow subgroups of
G. How many such subgroups are there? By the second Sylow theorem, G acts transitively
on this set by conjugation. If we fix a p-Sylow subgroup P , the isotropy group of the action
is the normalizer NG P . Hence the number of p-Sylow subgroups is [G : NG P ]. This brings
us to the third Sylow theorem:
Theorem 6.8 Let np G denote the number of distinct p-Sylow subgroups of G. Then np G
divides |G| and np G = 1 mod p.
8
np G = |G/NG P | = (G/NG P )P = 1 mod p.
Many examples and applications of p-Sylow subgroups can be found in the exercises.
Products of G-sets. Again let X, Y be G-sets. The product X Y is a G-set via the diagonal
action: g (x, y) = (gx, gy). The product of any collection of G-sets is defined similarly,
and is the categorical product. The fixed-point functor takes products to products, e.g.
(X Y )G = X G Y G . Orbits, however, are another matter; there is no simple relationship
between G\(X Y ) and G\X, G\Y . We will see some examples later.
Balanced products. Suppose X is a right G-set, Y a left G-set. Define an equivalence relation
on X Y by (xg, y) (x, gy). The balanced product X G Y is the set of equivalence classes.
In fact this is just the orbit set of the left G-set X Y , where g acts by g (x, y) = (xg 1 , gy).
But the slight change in viewpoint can be useful and enlightening. For our immediate
purposes, the induced G-spaces below provide the most important example.
Induction. Suppose H is a subgroup of G and X is a left H-set. Then the balanced product
GH X is a left G-set with action g1 [g, x] = [g1 g, x]; we say that the G-action is induced from
the H-action. Here the brackets [] denote equivalence class in the balanced product. Thus
X 7 G H X defines the induction functor H-set G-set (the definition of the functor
on morphisms being obvious). Note that the map i : XG H X given by i(x) = [e, x] is
an H-map.
Induction has the following universal property (see the category theory notes for a general
discussion of such properties). We keep the above notation.
9
Proposition 7.1 Suppose Y is a G-space, X is an H-space and : XY is an H-map.
Then there is a unique G-map : G H XY such that the following diagram commutes:
i
X - G H X
p
p pp
p
p pp
pp
!
p
?
Y
Proof: There is no choice in the definition of : We must take ([g, x]) = g(x). Now check
that it works (in particular, check that is well-defined).
As usual, we give two alternate ways of thinking about the universal property:
Adjoint functor version. Induction H-set G-set is left adjoint to restriction G-set
H-set.
See the category theory notes for discussion of adjoint functors. In essence there is not
much to it in the present example; the universal property translates immediately to the
assertion that there is a bijection
HomG (G H X, Y )
= HomH (X, Y ).
To complete the proof that these are adjoint functors, one has to show that the above
bijection is natural in X and Y . This is easy once one has absorbed the definition of
natural transformation, but it is not essential to understand all this right away. Just use
the universal property.
Proposition 7.2 is bijective if and only if (i) for all y Y , there is a g G such that
gy X; and (ii) whenever x1 , x2 X and gx1 = x2 , we have g H.
The proof is a straightforward check; (i) gives the surjectivity and (ii) the injectivity.
Note that (ii) says that if g
/ H, then g moves every element of X to an element not in X.
Example. Let F be a field, G = GLn F , and Y the set of pairs (L, v) with L a line in F n and
v L. We have an evident G-action on Y given by g (L, v) = (gL, gv). Let L0 denote the
line spanned by the standard basis vector e1 , and let X = {(L, v) Y : L = L0 }. Then X
is invariant under H := {g GLn F : gL0 = L0 }. Now lets check conditions (i) and (ii) of
the recognition principle above: (i) is clear, since by linear algebra GLn F acts transitively
10
on the lines. And if (gL0 , gv) = (L0 , w) then g H. So we have a canonical isomorphism
=
of G-sets G H X Y . To complete this description, one should describe how H acts
on X. Identifying X with the vector space L0 , it is the linear action that pulls back scalar
multiplication along the homomorphism HF taking a matrix A to a11 , the upper left
entry: A (L0 , v) = (L0 , a11 v).
Examples of this type occur frequently in geometry and topology.
Proposition 8.1 Suppose H1 , ..., Hn are normal subgroups of G such that (i) Hi ( j6=i Hj ) =
Q
Proof: First note that m is a group homomorphism: For this, one needs to know that for
i 6= j the elements of Hi , Hj commute with one another. But if x Hi and y Hj , then
by normality xyx1 y 1 Hi Hj = {e}. Then m is injective by (i) and surjective by (ii),
completing the proof.
11
already know how to do this, in any group: kh = (khk 1 )k. The upshot of this discussion
is that in the semi-direct product, the automorphism (k) of H is the same thing as left
conjugation by k.
Note that by construction H is normal in H o K, and that H o K is the direct product
H K if and only if is trivial if and only if K is normal (check this!).
If is understood, we often omit it from the notation and simply write H o K. Needless
to say, abusive notation of this kind must be used with care.
Example. Suppose we ask: Are there non-abelian groups of order 21? With semi-direct
products in hand, it is easy to construct such groups explicitly: Aut C7 is cyclic of order
6, so we can choose an injective homomorphism : C3 Aut C7 (there are two such ho-
momorphisms) and form C7 o C3 . More generally, given primes p, q with p|q 1, we get
non-abelian groups of order pq this way. Indeed one can show that every group of order pq
has this form.
With this criterion in hand, you soon realize that semi-direct products are everywhere.
Here are a few important examples, with details and verifications left to the reader:
12
8.3 Group extensions
A group extension consists of group homomorphisms
i
H G K,
where is surjective and i is an isomorphism onto the kernel of . Thus without loss of
generality we can, if desired, assume that i is just an inclusion and K = G/H. In fact we
will often treat H as a subgroup and omit i from the notation.
Note that H and K alone do not determine G, even when G is abelian. For example, if
we are given a group extension C2 GC2 , then since |G| = 4 we know G is abelian, but
without further information there is no way to know whether G is C4 or C2 C2 .
The extension is central if H C(G). Note, for example, that if H = C2 then the
extension is automatically central. This leads to another example of the ambiguity inherent
in group extensions: If we have an extension C2 GC2 C2 , then it is a central extension
but G could be any of the five groups of order 8 except C8 : (C2 )3 , C2 C4 , the dihedral
group D8 , or the quaternion group Q8 . Each of these four groups fits into such an extension,
as you can easily check.
Note that a semi-direct product G := H o K fits into an extension HGK. In
fact we can characterize the semi-direct products in terms of extensions. A group extension
i
H G K splits if there is a homomorphism s : KG such that s = IdK . We
call s a splitting (or sometimes a section) of .
Proposition 8.3 If G := H o K is a semi-direct product, the extension HGK splits.
Conversely if HGK is a split group extension, then G = H o K with K acting on H by
conjugation. More precisely, if s : KG is a splitting, K acts on H by k h = s(k)hs(k)1 .
Proof: If G = H o K, define s by s(k) = (e, k). Conversely if HGK is split, choose a
splitting s : KG. Then the pair H, s(K) satisfies the recognition principle for semi-direct
products (an easy check).
Remark. Note that to give a splitting s : KG is the same thing as giving a subgroup
K 0 G such that : GK maps K 0 isomorphically to K.
Example. Let G be a group of order pq, where p, q are primes with p < q. I claim that G is
a semi-direct product of the form Cq o Cp . First of all, by the third Sylow theorem there is
a unique and hence normal q-Sylow subgroup, cyclic of order q. Hence there is an extension
Cq G Cp . Now choose a p-Sylow subgroup H. Then |H is injective, and hence
an isomorphism. This proves the claim. Note that Cp acts on Cq by some homomorphism
Cp Aut Cq = Cq1 , and hence for the action to be non-trivial we must have p|(q 1) or
equivalently q = 1 mod p. This fits with the third Sylow theorem because if H is not normal,
then there are q p-Sylow subgroups.
13
for is to find such an extension with both H and K abelian. Or if we are feeling especially
lucky, we might hope that in addition the extension is central. A more reasonable although
still optimistic hope is that we can build G by a finite iteration of such extensions. This
leads to the concepts solvable and nilpotent group in the respective cases.
Example. We can build S4 in two steps out of abelian groups. First we form the extension
C22 A4 C3 . Then we form the extension A4 S4 C2 .
But we need a smoother way to think about building up from extensions. This is the
subject of the next section.
{e} G1 G2 G3 ....
A decreasing filtration likewise consists of subgroups
G G1 G2 G3 ...
In either case the filtration stabilizes at Gn if Gk = Gn for all k n. An increasing (resp.
decreasing) filtration is finite if Gn = G for some n (resp. Gn = {e} for some n). For finite
filtrations there is no real difference between the increasing and decreasing cases, since one
could always reverse the ordering to convert from one to the other.
In fact this definition makes sense for any kind of object with subobjects: rings and
subrings, vector spaces and sub-vector spaces, topological spaces and subspaces, etc. In
group theory filtrations are classically known as series. I prefer the term filtration
because it has a verb, to filter, that goes with it, and because it is the more widely used
term across many different categories. But I will freely use both terms, just so you get used
to them.
Example. Let p be a prime. Any abelian group A has a natural decreasing p-adic filtration
A pA p2 A ... (which need not terminate; think of A = Z), as well as a natural p-torsion
filtration A[p] A[p2 ] ... (which need not terminate; think of A = Q/Z). Incidentally, it
isnt necessary for p to be a prime here, but the prime case is by far the most important.
Example. Note that according to our definition, a filtration of a finite group need not be
a finite filtration. For example, suppose p, q are distinct primes, and G is a finite abelian
q-group. Then pG = G and hence the p-adic filtration takes the form G G G ...; it
never reaches {e}. Similarly the p-torsion filtration takes the form {e} {e} ...; it never
reaches G. Thus a finite filtration is not merely one with a finite number of distinct terms,
but one that begins at the trivial subgroup and ends at the whole group (or vice-versa).
Usually the point of filtering a group (or anything else) is to arrange it in such a way
that the quotient objects Gk /Gk1 (increasing case) or Gk /Gk+1 (decreasing case) have
some simple form that we understand; then the hope is that we can recover, perhaps by an
14
induction argument, information about G itself. This is the meaning of the motto: Dont
fight it, filter it! In our category of groups, as it stands the quotients are in general only
sets, so we make the definitions:
Definition. An increasing filtration is subnormal if Gi is normal in Gi+1 for all i, and normal
if Gi is normal in G for all i. Thus normal implies subnormal but not conversely (for a
minimal counterexample to the converse, look at our old friend A4 ). Subnormal and normal
decreasing filtrations are defined similarly. The classical terminology is subnormal/normal
series.
It will be handy to have a term for the following simple construction: Suppose HG
K is a group extension, and we are given finite filtrations (which we can assume are increas-
ing) of H and K. Then we get a filtration of G by splicing them together:
Remark. By functorial we mean that any group homomorphism preserves the filtration. To
make this fit precisely into the framework of category theory, we would define a category of
filtered groups, with morphisms the filtration preserving homomorphisms, so that assigning
15
to G its commutator filtration is a functor to this new category. But theres no compelling
reason to do this at the moment.
Proof: Clearly (a) (b) (c). Now suppose G is solvable, and let G = H 0 H 1 ...
H m = {e} be a solvable filtration. Then G/H 1 is abelian, so [G, G] H 1 . Similarly, since
H 1 /H 2 is abelian, we have G(2) [H 1 , H 1 ] H 2 . Continuing in this manner, we find that
G(k) H k for all k. Hence G(m) = {e}, proving that (c) (a).
Proposition 9.2 Solvable groups are closed under taking subgroups, quotients, extensions,
and finite products.
Remark: It follows that the solvable groups can be described as the smallest class of groups
that contains the abelian groups and is closed under extensions.
Now, here is one of the most important solvable groups. We could work over a general
commutative ring, but to avoid distractions we will stick to the case of a field F . Recall that
Bn F GLn F is the Borel subgroup of upper triangular matrices.
16
subgroup with ann = 1 (the superscript u if for unipotent). One easily checks that RCnu
is isomorphic to the additive group F n1 , and in particular is abelian. So RCn F is solvable
and the proof is complete.
The finite p-groups are another important family of solvable groups. However, they
satisfy the even stronger property of nilpotence, as we will show in the next section.
There are a number of theorems showing that under certain restrictions on the prime
factors of n, every group of order n is solvable. Here are three such theorems, in increasing
order of difficulty (the first is by far the easiest, and is demoted to the status of proposition):
Proof: Exercise. Part (a) is trivial from things weve already proved. Part (b) is a little
more interesting.
Proof: 60 = 22 3 5 is the smallest number that is neither the product of three distinct
primes nor of the form pa q b . (Its also a nice exercise to prove the corollary directly, without
Burnsides theorem.)
The original proof is 200 pages long, and as far as I know has never been simplified.
17
Proof: Let G be a finite p-group. Then C(G) is non-trivial. By induction on order we can
assume G/C(G) has a finite central filtration; splicing with the one-step filtration {e}
C(G) yields the result.
In fact any group has a functorial, normal central filtration, sometimes called the as-
cending central series, defined recursively as follows: Let C1 = C(G). Having defined
the normal subgroup Ck , let : GG/Ck be the quotient homomorphism, and set
Ck+1 = 1 C(G/Ck ). It is clear that Ck+1 is normal.
The next proposition is analogous to Proposition 9.1.
Proof: Clearly (a) (b). To show that (b) (a), assume given a central filtration Gi with
Gm = G and show inductively that Gi Ci . Then Cm = G, as desired.
Example. Let G act on itself by conjugation. This action is nilpotent if and only if G is
nilpotent.
Example. Consider a field F and take K to be the group of upper triangular unipotent
matrices Un F . For G we take the additive group F n , with its standard left Un F action.
Filter F n by the F i s (where as always, our default inclusion F i F n is in the first i
coordinates). This is automatically a subnormal (indeed normal) filtration, since F n is
abelian, and satisfies the nilpotent action conditions (i)-(ii) by definition of Un F .
Note: From the point of view of this example, it would have made more sense to use the
term unipotent action in place of nilpotent action. But such terminology conflicts are
inevitable, and one just has to live with them.
Nilpotent groups are not closed under extensions (think of S3 , for example). Our next
definition compensates for this deficiency: A group extension HGK is nilpotent if the
conjugation action of G on H is nilpotent. Any central extension is nilpotent, for example,
or more generally any extension in which the conjugation action of G on H is trivial. The
extension C3 S3 C2 is not nilpotent.
Proposition 9.10 The class of nilpotent groups is closed under subgroups, quotients, nilpo-
tent extensions and finite products.
18
But H is invariant under the latter action, and we get a finite central filtration of H by
intersecting with a given such filtration for G. The case of quotients is also straightforward,
and left to the reader.
Now suppose HGK is a group extension with K nilpotent and G acting nilpo-
tently on H by conjugation (which implies that H is nilpotent). Choose a G-nilpotent
filtration of H and a nilpotent filtration of K; splicing these yields a nilpotent filtration of
G, as desired. Details are left to the reader.
If H, G are nilpotent, then HG HG is clearly a nilpotent extension, since the
conjugation action of G H on H factors through H. So H G is nilpotent. It then
follows by induction on the number of factors that any finite product of nilpotent groups is
nilpotent.
Next is one of the most important examples. Let F be a field, and recall that the
unipotent group Un F is the group of upper triangular nn matrices with 1s on the diagonal.
Since it is a subgroup of the solvable group Bn F , it is solvable. But more is true:
Proof: We follow closely the proof already given for solvability of Bn F . As in that case,
there is a group extension
RCnu F Un F Un1 F,
where the second map is projection on the upper left (n 1) (n 1) block and RCnu F is
again the unipotent right column group; for example RC3 F consists of matrices
1 0 a
0 1 b
0 0 1
By induction we can assume Un1 F is solvable, so it suffices to show that the action of Un F
on RCnu F by conjugation is nilpotent. Since RCnu F is abelian, the action of Un F factors
through Un1 F , so what we need to show is that the conjugation action of Un1 F on RCnu F
is nilpotent. But under the evident isomorphism RCnu F = F n1 (where the entries of the
right column are ordered from top to bottom), this action corresponds to the standard linear
action of Un1 F on F n1 . We saw earlier that this latter action is nilpotent, so the proof is
complete.
Here are two more interesting facts about nilpotent groups. Both are false for solvable
groups; the reader can easily supply examples.
Proposition 9.12 Let G be a nilpotent group, H a normal subgroup. Then H C(G) 6= {e}.
Proof: Since G acts nilpotently on itself by conjugation, and H is invariant, it acts nilpotently
on H. In particular H has a non-trivial subgroup H1 on which G acts trivially, so H1
H C(G).
19
Proposition 9.13 Let G be a nilpotent group, H a proper subgroup of G. Then H is a
proper subgroup of its normalizer NG H.
Theorem 9.14 A finite group G is nilpotent if and only if G is a product of p-groups (where
p ranges over the prime divisors of |G|).
Proof: We have shown that any p-group is nilpotent. Since any finite product of nilpotent
groups is nilpotent, this proves the if.
Conversely, suppose G is nilpotent, let p divide |G|, and let P be a p-Sylow subgroup.
Then NG (NG P ) = NG P (see the section on the Sylow theorems). By Proposition 9.13
this is impossible unless NG P = G, so P is normal. So if p1 , ..., pm are the prime divisors
of G, with corresponding unique pi -Sylow subgroups Pi , the natural multiplication map
: P1 ... Pm G is an injective group homomorphism. Comparing orders, we see that
it must be an isomorphism.
Proposition 9.15 G is nilpotent if and only if the descending central series is finite, i.e.
C n = {e} for some n.
We conclude by mentioning a class of finite groups lying between the nilpotent and
solvable groups. If G is a finite group, G is supersolvable if it has a finite normal filtration
with cyclic quotients.
The second implication is immediate, while the first is left as an exercise. Note that
S3 is supersolvable but not nilpotent, while A4 is solvable but not supersolvable. One can
also check that any solvable finite group admits a subnormal filtration with cyclic quotients;
hence the insistence on normal filtrations in the definition of supersolvable groups is key.
Supersolvable groups have nice representation-theoretic properties, as we will see in [Serre],
8.5.
20
10 Simple groups and perfect groups
10.1 Simple groups
A group G is simple if it has no non-trivial proper normal subgroups. If G is abelian, then G
is simple if and only if it is cyclic of prime order (an easy exercise). Non-abelian finite simple
groups were not completely classified until 2004; the proofs occupy thousands of pages. The
formidable Feit-Thompson theorem implies that every non-abelian finite simple group has
even order, but this is just the first little step!
Note that since every group of order < 60 is solvable, there are no non-abelian simple
groups of order < 60 (indeed these two statements are equivalent).
For a short and interesting proof, see Artins undergraduate algebra text (proofs can be
found also in [Hungerford] and [Dummit-Foote]). Another infinite family of simple groups
is given by the projective special linear groups P SLn F , F a field. These are defined by
P SLn F = SLn F/C, where C := C(SLn F ) is the center. Since C(GLn F ) consists of the
scalar matrices, it is easy to see that C(SLn F ) is also just the scalar matrices with deter-
minant 1, and hence is a finite cyclic group of order dividing n. It turns out that with just
two exceptions, P SLn F is simple for all n, F . Well prove this in a later section for n = 2,
and sketch the proof for general n. By taking F to be a finite field we get an infinite family
of finite simple groups. This is as far as well go.
Theorem 10.2 If G admits a Jordan-Holder filtration, then the list of simple groups occur-
ing as the quotients is independent of the choice of filtration, up to ordering.
We will rarelyif everuse this theorem, so we wont prove it here. For a proof and
further discussion see e.g. [Hungerford]. (We will, however, prove and use the analogous
theorem for modules over a ring.)
21
Any simple non-abelian group is perfect.
Perfect groups are closed under quotients, extensions, and arbitrary products (but not
under subgroups).
Note that a perfect group need not be simple. For example, any product of non-abelian
simple groups such as A5 A5 is perfect, but certainly not simple.
We will see below that with two exceptions, SLn F is perfect (but typically has non-trivial
center, hence need not be simple). Indeed for us this is the main reason to bother introducing
the concept of a perfect group; we will use it as a stepping stone toward proving P SLn F is
(almost always) simple.
SLn F GLn F F .
Many basic subgroups of SLn F are defined by simply intersecting with the corresponding
subgroups of GLn F . For example, define the Borel subgroup of SLn F by SBn F = Bn F
SLn F , the upper triangular matrices with determinant 1. Although one rapidly grows tired
of putting the S in SBn F and its cousins, well do so for a little while. Thus SDn F
denotes diagonal matrices of determinant 1, SUn F = Un F (since the unipotent subgroup
already consists of determinant 1 matrices), and SNn F = Nn F SLn F . Here one can check
that SNn F = NSLn F SDn F provided, as usual, that F 6= F2 .
The one significant difference to watch out for concerns the Weyl group. One might think
that we should just take permutation matrices of determinant 1, but this turns out to be
the wrong thing to do. The way to think of it is as follows: In GLn F we have the extension
Dn F Nn F Wn F,
where Wn F = Sn and a splitting of the extension is already given by using permutation
matrices. If F 6= F2 there is an analogous extension
22
this does not cause any difficulties. To illustrate, lets first clarify our definition of SWn F : If
char F = 2, it is just the permutation matrices as before, i.e. is equal to Wn F . If char F 6= 2,
SWn F = SNn F/SDn F . In all cases SWn F = Wn F = Sn . Now consider the SLn version of
the Bruhat decomposition.
Theorem 11.1 a
SLn F = SBn F wSBn F,
wWn F
where in the case char F 6= 2, w is any preimage of w in SNn F (in the characteristic 2 case
it is just a permutation matrix as usual).
Proof/Discussion: First of all, note that the expression on the right is well-defined. Any
two choices of w differ by an element of SDn F SBn F , so the double coset SBn F wSBn F
is independent of the choice. The theorem then follows easily from its GLn analogue; no
new fussing about with row reduction is needed. Certainly the indicated double cosets are
distinct and hence disjoint, by comparing with the GLn result. And if g SLn F , we can
write g = b1 wb2 with b1 , b2 Bn F by the GLn result, with det b2 = (det b1 )1 . We can write
b1 = b01 d with b01 SBn F and d Dn F . Then since w normalizes Dn F , we have
g = b01 w(d0 b2 ),
where det d0 = det d = det b1 and hence d0 b2 SBn F .
11.1 On SL2 F
The goal of this section is to study the structure of SL2 F in detail, and in particular to
prove:
23
checking the orders are not of the form n!/2. For example, P SL2 F7 is simple of order 168
and so is not an alternating group.
The group SL2 F is important for much more general reasons, as it is the most basic
example of a semi-simple algebraic group, or in the case F = R, C of a semi-simple Lie
group. It plays a central role in representation theory and Lie theory, as well as in certain
parts of algebraic topology, algebraic geometry, and combinatorics.
and y(t) =
!
1 0
t 1
If t F we set w(t) =
!
0 t
1
t 0
and h(t) =
!
t 0
0 t1
These satisfy:
1. w(1)x(t)w(1)1 = y(t)
2. h(t) = w(t)w(1)1
5. h(s)x(t)h(s)1 = x(s2 t)
Note that any of the last three formulas easily determines the other two, and that there
are analogues with x(t) replaced by y(t).
Let w = w(1).
24
11.1.2 Properties of SL2 F
Note that the center C := C(SL2 F ) is trivial if char F = 2 and is C2 (i.e. Id) otherwise.
`
The Bruhat decomposition SL2 F = B BwB immediately implies:
Proposition 11.4 SL2 F is generated by the subgroups U, U (together), i.e. by the elements
x(t), y(t), t F .
Proof: By the preceeding proposition, it suffices to show that x(a), y(a) are commutators
for all a F . By Formula (1), we need only consider x(a). By Formula (4) we see that x(a)
is a commutator provided there is an s F with s2 6= 1, and this latter statement is true
if and only if |F | > 3.
The two exceptional cases are not perfect (see the exercises).
25
11.2 Simplicity of P SLn F : a sketch
In this section we sketch how the results on SL2 extend to SLn . In particular we sketch a
proof that P SLn F is simple for all n 3 and all F .
We use the following notation: For each i, 1 i < n, we let Gi GLn F denote the
block diagonal subgroup consisting of just one 2 2 SL2 F block in the i, i + 1 position. For
example, if n = 4 then G2 is the subgroup
1 0 0 0
0 a b 0
0 c d 0
0 0 0 1
with ad bc = 1. In each such block we have copies of the elements x(t), y(t), h(t), w(t)
defined above for SL2 F , which we denote xi (t), yi (t), etc. Similarly we have subgroups
Ui , Bi , Di , Ni etc. of Gi corresponding to U, B, D, N in SL2 F . Caution: This conflicts with
our earlier notation,in which for example Bi denoted the upper triangular matrices in GLi F .
In the displayed matrix above, B2 is the subgroup defined by c = 0.
Proposition 11.7 SLn F is generated by the subgroups Gi , and hence is generated by the
elements xi (t), yi (t), t F .
Proof: The first statement follows by the usual row/column reduction (or Bruhat decomposi-
tion) argument, since all the elementary row/column operations can be realized by repeated
left/right multiplication by elements of the Gi s. The second statement then follows by what
we proved for SL2 .
Proof: The case n = 2 was proved earlier. If F 6= F2 , F3 , the general case follows immediately
from the fact that the Gi s are perfect together with the previous proposition. However, one
can give a uniform proof for all F and n 3 as follows: It suffices to show each xi (t) is a
commutator. These are all conjugate in SLn F (via a permutation of coordinates), so we can
assume i = 1, in which case it is enough to prove the result for n = 3. But x1 (t) is conjugate
to x13 (t), i.e.
1 0 t
0 1 0
0 0 1
Since [x1 (a), x2 (b)] = x13 (ab), x13 (t) is a commutator for all t and were done.
Now the other key ingredient in the P SL2 simplicity proof was that the Borel subgroup
B P SL2 F is a maximal subgroup. This is clearly not true for higher n, as there are
various block-triangular groups that contain B. Explicitly if a = (a1 , ..., ar ) is an ordered
partition of n, there is an associated parabolic subgroup Pa consisting of block triangular
26
matrices whose diagonal blocks have size a1 , ..., ar . For example if n = 6 and a = (3, 1, 2),
then Pa consists of matrices
a b c
d e f
g h i
0 0 0 j
0 0 0 0 k l
0 0 0 0 p q
where the entries are arbitrary and the three diagonal blocks have the product of their
determinants equal to 1. Note that B itself corresponds to the partition (1, 1, ..., 1), while
SLn F corresponds to (n). Note also that Pa is generated by B together with the wi s it
contains. The total number of parabolic subgroups is thus 2n1 . The surprising fact is:
In particular there are only finitely many subgroups containing B. Note that when n = 2
the proposition says that B is a maximal proper subgroup.
Proof: The case n = 2 was proved earlier, so assume n 3 and let H SLn F be a normal
subgroup. Then HB is a subgroup containing B, so HB = Pa for some partition a. If
Pa = B then H B, and as in the case n = 2 we find that H D, from which it follows
that in fact H C, C being the center (i.e. scalar matrices of determinant 1). The reader
can fill in the details of this step. If Pa = SLn F , then as in the case n = 2 we find that
SLn F/H is both perfect and solvable, hence trivial, so H = SLn F and were done.
The new step that remains is to show that the case Pa 6= B, SLn F cant occur. We
sketch briefly the ideas involved. Suppose Pa 6= B, SLn F . Then Pa contains some but
not all of the wi s. Hence there must exist i, i + 1 such that wi Pa and wi+1 / Pa .
Since Bwi B Pa = HB, we have (Bwi B) H 6= . Since H is normal, it follows that
1
(wi+1 Bwi Bwi+1 ) H 6= . From this one can show that wi+1 wi wi+1 HB = Pa . But this is
false, as one can check from the definition of Pa . So we have a contradiction and the proof
is complete.
Taking F to be a finite field, this yields a large family of finite simple groups. With a
very small number of exceptions, they are distinct from each other and from the alternating
groups (up to isomorphism). In fact if memory serves, the only (?) exception besides the
coincidence P SL2 F5
= A5 already mentioned is P SL2 F7 = GL3 F2 . Note these last two
groups have order 168. To get the isomorphism one can show that in fact there is only one
simple group of order 168 up to isomorphism; for an elaborate proof of this see [Dummit-
Foote].
27
12 Exercises
Note: Selected exercises will be assigned. Remember too that your mission is not merely
to find any old proof; always strive for a simple, elegant argument, and of course make full
use of the machinery that we develop. Needless to say, a simple, elegant argument may or
may not come to mind, or even be possible, but it should always be your goal.
Notice. The Feit-Thompson theorem is off-limits unless explicitly allowed!
A. G-actions.
A1. By Cayleys theorem (which is trivial, from a modern perspective), every finite group
is isomorphic to a subgroup of Sn for some n, where the n provided by the proof is n = |G|.
On the other hand, G may well be isomorphic to a subgroup of Sm for some much smaller
m (think of Sm itself, for example). Show, however, that for G = Q8 (the quaternion group
of order 8), n = 8 is the minimal n for which the conclusion of Cayleys theorem holds.
A2. Prove by first finding a suitable set on which the group in question acts:
a) GL2 F2 is isomorphic to S3 , and Af f2 F2 is isomorphic to S4 .
b) P SL2 F3 is isomorphic to A4 .
c) If G is a simple group of order 60, then G is isomorphic to A5 .
B. Wreath products.
product
R
H n o G, where G acts on the left of H n by permuting the factors. Thus the elements
of H G have the form (h1 , ..., hn )g, with multiplication determined by the formula
B1. Partition [mn] into n blocks (subsets) of equal size m, compatibly with the standard
order on [mn]. Let Smn denote the subgroup of all block-preserving permutations; by
this we mean that is allowed to permute the blocks as well as the elements within a
particular block. Then
R
= Sm Sn .
there is an isomorphism
Z Z Z Z
(H G) K
=H (G K).
R R R
Consequently expressions such as H G1 G2 ... Gr are unambiguous (assuming Gi is
given
Rr
as a subgroup of Sni ). In particular we can define the r-fold iterated wreath product
H of a subgroup H Sn . It is a subgroup of Snr .
28
B3. Wreath products at Wimbledon. The Wimbledon tennis tournament begins with a
draw of 128 players. Player no. 1 plays no. 2, no. 3 plays no. 4, the winner of 1-2 plays
the winner of 3-4, and so on. Some care is needed in determining the draw; for instance the
two (theoretically) best players should be placed in opposite halves, say one at position 1
and the other at Rposition 128, so that if they meet at all it will be in the finals. The iterated
wreath product 7 C2 S128 can be thought of as the group of permutations of the draw
leaving it essentially unchanged, and the number of essentially distinct possible draws
is (128!)/2127 . Explain.
C. Sylow subgroups of general linear groups of finite fields. In this exercise p is a prime and
q = pd for some d, while Fq denotes a field with q elements. For any prime `, ` n denotes
the exponent of ` in the prime factorization of n.
n Q
C1. Answer/show: a) |GLn Fq | = q ( 2 ) ni=1 (q i 1)
b) The unipotent subgroup U is a p-Sylow subgroup.
c) How many p-Sylow subgroups are there?
d) What is the order of SLn Fq ? Order of P SLn Fq ?
Note: Recall that F
p is a cyclic group. This is true for any finite field, so Fq is cyclic of
order q 1. We havent proved the general case yet, but you can assume it if necessary in
part (d).
Note: Some of the remaining C problems may require a little knowledge of finite fields
beyond what has been discussed in class, and therefore may be postponed.
C2. Now let ` be a prime 6= p. Assume that ` divides q1, and that if ` = 2 then 4|(q1).
Then DW = NGLn Fq D contains an `-Sylow subgroup
R
of GLn Fq . Hence if ` (q1) = a, GLn Fq
has an `-Sylow subgroup isomorphic to C`a L, where L is an `-Sylow subgroup of Sn (cf.
3.1).
Remarks: (i). If ` is odd and ` doesnt divide q 1, the `-Sylow subgroups are of a
similar nature but the details are more complicated. To pursue this point, think about the
extension of Fq obtained by adjoining an `-th root of unity.
(ii). Why the restriction when ` = 2? One of my favorite mottos is the doubly nonsensical
4 is an odd prime. The key, and elementary, number-theoretic fact behind this motto is
the following: Suppose ` (x 1) = a 1. Then ` (xn 1) = a + ` n provided that either `
is odd or a 2. When ` = 2 and a = 1 this fails, e.g. for x = 3, n = 2.
29
a) If q = 3 mod 4, the 2-Sylow subgroups of GL2 Fq are isomorphic to the semi-dihedral
group SDa+1 of order 2a+1 , defined for a 3 as follows: Let C2a , C2 have generators x, y
a
respectively, and let C2 act on C2a by y x = x2 1 . Then SDa+1 = C2a o C2 .
Suggestion: Identify F2q with Fq2 and consider the group of units of Fq2 together with the
Frobenius a 7 aq .
R
b) If q = 3 mod 4 and n = 2m, then GL2 Fq Sm (embedded in GLn Fq in the evident way)
R
contains a 2-Sylow subgroup P . Hence P = SDa+1 Q, where Q is a 2-Sylow subgroup of
Sm . What happens for n odd?
c) If q is any odd prime power, the 2-Sylow subgroups of SL2 Fq are isomorphic to the
generalized quaternion group Qa of order 2a . (Qa is defined as follows for a 3: Write the
quaternions as H = C Cj. Then Qa is the subgroup of H+ generated by j and the 2a1 -st
roots of unity in C.)
D1. Let F be a field, and let Xn Mn F denote the subset consisting of matrices with n
distinct eigenvalues, all of which lie in F . In this problem you will give two alternate ways of
thinking about Xn . Let Zn = Dn Xn , which we identify with the subset of (F )n consisting
of n-tuples with no repeated entries.
a) Let Yn denote the set of ordered n-tuples (L1 , ..., Ln ) of lines in F n such that Li = F n .
P
b) Note that GLn F acts on Xn by conjugation, and Zn is invariant under the restriction
of this action to Nn F . Use the recognition principle for induced G-sets to show that the
canonical map GLn F Nn F Zn Xn is a bijection.
Remark: For fans of topology, I note that for F = R, C, the set Xn is a subspace of Mn F
(with its usual topology). The other two spaces in (a), (b) have quotient topologies making
the above bijections homeomorphisms.
D2. Let H be a subgroup of G and let X be a G-set. Then there is a natural isomorphism
of G-sets G H X = (G/H) X, where the target has the product G-action. In particular,
the induced action g1 (g, x) = (g1 g, x) on G X is isomorphic to to the product action.
30
E2. Suppose G is nilpotent. Then:
a) Every minimal normal subgroup is central of prime order.
b) Every maximal subgroup is normal and has prime index.
E3. The converse of 2b holds in the form: If G is a finite group such every maximal
subgroup is normal, then G is nilpotent. (Suggestion: Show that every p-Sylow subgroup is
normal.)
Structure of Bn F .
1. If F 6= F2 , [Bn F, Bn F ] = Un F . Hence for all fields the commutator series of Bn F is
determined by that of Un F .
2. Determine the upper and lower central series of Bn F for F 6= F2 . (They wont get
far!)
M. Miscellaneous problems. These are mostly problems already suggested in the body
of the notes.
M1. Show that if either n = pq with p, q prime, or n = pqr with p, q, r distinct primes,
then every group of order n is solvable.
M2. Without using Burnsides pa q b theorem, show that every group of order < 60 is
solvable.
M4. Show that an abelian group admits a Jordan-Holder filtration if and only if it is
finite.
31