Академический Документы
Профессиональный Документы
Культура Документы
Contents
1 Topics in Group Theory
1.1 Groups . . . . . . . . . . . . . . . . . . . .
1.2 Examples of Groups . . . . . . . . . . . .
1.3 Elementary Properties of Groups . . . . .
1.4 Subgroups . . . . . . . . . . . . . . . . . .
1.5 Cyclic Groups . . . . . . . . . . . . . . . .
1.6 Cosets and Lagranges Theorem . . . . . .
1.7 Normal Subgroups and Quotient Groups .
1.8 Homomorphisms . . . . . . . . . . . . . .
1.9 The Isomorphism Theorems . . . . . . . .
1.10 Group Actions, Orbits and Stabilizers . . .
1.11 Conjugacy . . . . . . . . . . . . . . . . . .
1.12 The Class Equation of a Finite Group . . .
1.13 Cauchys Theorem . . . . . . . . . . . . .
1.14 The Structure of p-Groups . . . . . . . . .
1.15 The Sylow Theorems . . . . . . . . . . . .
1.16 Some Applications of the Sylow Theorems
1.17 Simple Groups . . . . . . . . . . . . . . .
1.18 Solvable Groups . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
1
1
2
3
4
5
6
7
10
12
13
14
14
15
16
17
18
21
27
Contents
2 Rings and Polynomials
2.1 Rings, Integral Domains and Fields .
2.2 Ideals . . . . . . . . . . . . . . . . .
2.3 Quotient Rings and Homomorphisms
2.4 The Characteristic of a Ring . . . . .
2.5 Polynomial Rings . . . . . . . . . . .
2.6 Gausss Lemma . . . . . . . . . . . .
2.7 Eisensteins Irreducibility Criterion .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
30
30
32
33
35
35
38
39
Contents
3 Introduction to Galois Theory
3.1 Field Extensions and the Tower Law . . . . . . . . .
3.2 Algebraic Field Extensions . . . . . . . . . . . . . . .
3.3 Algebraically Closed Fields . . . . . . . . . . . . . . .
3.4 Ruler and Compass Constructions . . . . . . . . . . .
3.5 Splitting Fields . . . . . . . . . . . . . . . . . . . . .
3.6 Normal Extensions . . . . . . . . . . . . . . . . . . .
3.7 Separability . . . . . . . . . . . . . . . . . . . . . . .
3.8 Finite Fields . . . . . . . . . . . . . . . . . . . . . . .
3.9 The Primitive Element Theorem . . . . . . . . . . . .
3.10 The Galois Group of a Field Extension . . . . . . . .
3.11 The Galois correspondence . . . . . . . . . . . . . . .
3.12 Quadratic Polynomials . . . . . . . . . . . . . . . . .
3.13 Cubic Polynomials . . . . . . . . . . . . . . . . . . .
3.14 Quartic Polynomials . . . . . . . . . . . . . . . . . .
3.15 The Galois group of the polynomial x4 2 . . . . . .
3.16 The Galois group of a polynomial . . . . . . . . . . .
3.17 Solvable polynomials and their Galois groups . . . . .
3.18 A quintic polynomial that is not solvable by radicals
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
41
41
42
45
45
50
53
54
56
59
60
62
64
64
66
68
70
71
75
Contents
4 Commutative Algebra and Algebraic Geometry
4.1 Modules . . . . . . . . . . . . . . . . . . . . . . .
4.2 Noetherian Modules . . . . . . . . . . . . . . . .
4.3 Noetherian Rings and Hilberts Basis Theorem . .
4.4 Polynomial Rings in Several Variables . . . . . . .
4.5 Algebraic Sets and the Zariski Topology . . . . .
4.6 The Structure of Algebraic Sets . . . . . . . . . .
4.7 Maximal Ideals and Zorns Lemma . . . . . . . .
4.8 Prime Ideals . . . . . . . . . . . . . . . . . . . . .
4.9 Affine Varieties and Irreducibility . . . . . . . . .
4.10 Radical Ideals . . . . . . . . . . . . . . . . . . . .
4.11 Commutative Algebras of Finite Type . . . . . .
4.12 Zariskis Theorem . . . . . . . . . . . . . . . . . .
4.13 Hilberts Nullstellensatz . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
77
77
79
81
84
87
90
92
94
95
98
100
101
104
1.1
Groups
1.2
Examples of Groups
The sets of integers, rational numbers, real numbers and complex numbers
are Abelian groups, where the group operation is the operation of addition.
The sets of non-zero rational numbers, non-zero real numbers and nonzero complex numbers are also Abelian groups, where the group operation is
the operation of multiplication.
For each positive integer m the set Zm of congruence classes of integers
modulo m is a group, where the group operation is addition of congruence
classes.
For each positive integer m the set Zm of congruence classes modulo m of
integers coprime to m is a group, where the group operation is multiplication
of congruence classes.
In particular, for each prime number p the set Zp of congruence classes
modulo p of integers not divisible by p is a group, where the group operation
is multiplication of congruence classes.
For each positive integer n the set of all nonsingular n n matrices is a
group, where the group operation is matrix multiplication. These groups are
not Abelian when n 2.
Let E n denote n-dimensional Euclidean space, so that E 2 denotes the
Euclidean plane, and E 3 denotes three-dimensional Euclidean space. A geometrical figure may be represented as a subset S of E n . A symmetry of S is a
transformation T : E n E n of E n which sends straight lines to straight lines,
preserves all lengths and angles, and has the property that T (S) = S. The
collection of all symmetries of a geometrical figure is a group, the symmetry
group of S, the group operation being that of composition of transformations.
For any natural number n greater than 2, the the dihedral group D2n of
order 2n is defined to be the symmetry group of a regular n-sided polygon
in the Euclidean plane. It consists of rotations though an angle of 2j/n
about the centre of the polygon for j = 0, 1, 2, . . . , n 1, together with the
reflections in the n axes of symmetry of the polygon.
The symmetries of a rectangle that is not a square constitute a group of
order 4. This group consists of the identity transformation, reflection in the
axis of symmetry joining the midpoints of the two shorter sides, reflection
in the axis of symmetry joining the two longer sides, and rotation though
an angle of radians (180 ). If I denotes the identity transformation, A
and B denote the reflections in the two axes of symmetry, and C denotes
the rotation through radians then A2 = B 2 = C 2 = I, AB = BA = C,
AC = CA = B and BC = CB = A. This group is Abelian: it is often
referred to as the Klein 4-group (or, in German, Kleinsche Viergruppe).
The symmetries of a regular tetrahedron in 3-dimensional space constitute
2
1.3
In what follows, we describe basic properties of a group G, using multiplicative notation and denoting the identity element of the group by the letter e.
Lemma 1.1 A group G has exactly one identity element e satisfying ex =
x = xe for all x G.
Proof Suppose that f is an element of G with the property that f x = x for
all elements x of G. Then in particular f = f e = e. Similarly one can show
that e is the only element of G satisfying xe = x for all elements x of G.
Lemma 1.2 An element x of a group G has exactly one inverse x1 .
Proof We know from the axioms that the group G contains at least one
element x1 which satisfies xx1 = e and x1 x = e. If z is any element of
G which satisfies xz = e then z = ez = (x1 x)z = x1 (xz) = x1 e = x1 .
Similarly if w is any element of G which satisfies wx = e then w = x1 . In
particular we conclude that the inverse x1 of x is uniquely determined, as
required.
Lemma 1.3 Let x and y be elements of a group G. Then (xy)1 = y 1 x1 .
Proof It follows from the group axioms that
(xy)(y 1 x1 ) = x(y(y 1 x1 )) = x((yy 1 )x1 ) = x(ex1 ) = xx1 = e.
Similarly (y 1 x1 )(xy) = e, and thus y 1 x1 is the inverse of xy, as required.
Note in particular that (x1 )1 = x for all elements x of a group G, since
x has the properties that characterize the inverse of the inverse x1 of x.
Given an element x of a group G, we define xn for each positive integer n
by the requirement that x1 = x and xn = xn1 x for all n > 1. We also define
x0 = e, where e is the identity element of the group, and we define xn to be
the inverse of xn for all positive integers n.
3
(x1 x2 )(x3 x4 ),
x1 ((x2 x3 )x4 ),
x1 (x2 (x3 x4 ))
all have the same value. (Note that x1 x2 x3 x4 is by definition the value of the
first of these expressions.)
Two expressions, each specifying a finite product of elements of a group G,
determine the same element of G if the same elements of G occur in both
expressions, and in the same order. This result can be proved by induction
on the number of elements of G making up such a product.
1.4
Subgroups
Lemma 1.5 Let x be an element of a group G. Then the set of all elements
of G that are of the form xn for some integer n is a subgroup of G.
Proof Let H = {xn : n Z}. Then the identity element belongs to H, since
it is equal to x0 . The product of two elements of H is itself an element of
H, since xm xn = xm+n for all integers m and n (see Theorem 1.4). Also the
inverse of an element of H is itself an element of H since (xn )1 = xn for
all integers n. Thus H is a subgroup of G, as required.
Definition Let x be an element of a group G. The order of x is the smallest
positive integer n for which xn = e. The subgroup generated by x is the
subgroup consisting of all elements of G that are of the form xn for some
integer n.
Lemma 1.6 Let H and K be subgroups of a group G. Then H K is also
a subgroup of G.
Proof The identity element of G belongs to H K since it belongs to the
subgroups H and K. If x and y are elements of H K then xy is an element
of H (since x and y are elements of H), and xy is an element of K, and
therefore xy is an element of H K. Also the inverse x1 of an element x of
H K belongs to H and to K and thus belongs to H K, as required.
More generally, the intersection of any collection of subgroups of a given
group is itself a subgroup of that group.
1.5
Cyclic Groups
1.6
1.7
and we can use analogous notation to denote the product of four or more
subsets of G.
If A, B and C are subsets of a group G, and if A B then clearly
AC BC and CA CB.
Note that if H is a subgroup of the group G and if x is an element of G
then xH is the left coset of H in G that contains the element x. Similarly
Hx is the right coset of H in G that contains the element x.
If H is a subgroup of G then HH = H. Indeed HH H, since the
product of two elements of a subgroup H is itself an element of H. Also
H HH since h = eh for any element h of H, where e, the identity element
of G, belongs to H.
Definition A subgroup N of a group G is said to be a normal subgroup of
G if xnx1 N for all n N and x G.
The notation N / G signifies N is a normal subgroup of G.
Definition A non-trivial group G is said to be simple if the only normal
subgroups of G are the whole of G and the trivial subgroup {e} whose only
element is the identity element e of G.
Lemma 1.12 Every subgroup of an Abelian group is a normal subgroup.
Proof Let N be a subgroup of an Abelian group G. Then
xnx1 = (xn)x1 = (nx)x1 = n(xx1 ) = ne = n
for all n N and x G, where e is the identity element of G. The result
follows.
Example Let S3 be the group of permutations of the set {1, 2, 3}, and let
H be the subgroup of S3 consisting of the identity permutation and the
transposition (1 2). Then H is not normal in G, since (2 3)1 (1 2)(2 3) =
(2 3)(1 2)(2 3) = (1 3) and (1 3) does not belong to the subgroup H.
Proposition 1.13 A subgroup N of a group G is a normal subgroup of G if
and only if xN x1 = N for all elements x of G.
Proof Suppose that N is a normal subgroup of G. Let x be an element
of G. Then xN x1 N . (This follows directly from the definition of a
normal subgroup.) On replacing x by x1 we see also that x1 N x N , and
thus N = x(x1 N x)x1 xN x1 . Thus each of the sets N and xN x1 is
contained in the other, and therefore xN x1 = N .
Conversely if N is a subgroup of G with the property that xN x1 = N
for all x G, then it follows immediately from the definition of a normal
subgroup that N is a normal subgroup of G.
8
B = {T1 , T3 },
C = {T2 , T4 }.
1.8
Homomorphisms
An isomorphism : G K between groups G and K is a homomorphism that is also a bijection mapping G onto K. Two groups G and K are
isomorphic if there exists an isomorphism mapping G onto K.
Example Let D6 be the group of symmetries of an equilateral triangle in
the plane with vertices A, B and C, and let S3 be the group of permutations
of the set {A, B, C}. The function which sends a symmetry of the triangle
to the corresponding permutation of its vertices is an isomorphism between
the dihedral group D6 of order 6 and the symmetric group S3 .
Example Let R be the group of real numbers with the operation of addition,
and let R+ be the group of strictly positive real numbers with the operation
of multiplication. The function exp: R R+ that sends each real number x
to the positive real number ex is an isomorphism: it is both a homomorphism
of groups and a bijection. The inverse of this isomorphism is the function
log: R+ R that sends each strictly positive real number to its natural
logarithm.
Here is some further terminology regarding homomorphisms:
A monomorphism is an injective homomorphism.
An epimorphism is a surjective homomorphism.
An endomorphism is a homomorphism mapping a group into itself.
An automorphism is an isomorphism mapping a group onto itself.
Definition The kernel ker of the homomorphism : G K is the set of
all elements of G that are mapped by onto the identity element of K.
Example Let the group operation on the set {+1, 1} be multiplication,
and let : Z {+1, 1} be the homomorphism that sends each integer n
to (1)n . Then the kernel of the homomorphism is the subgroup of Z
consisting of all even numbers.
Lemma 1.18 Let G and K be groups, and let : G K be a homomorphism
from G to K. Then the kernel ker of is a normal subgroup of G.
Proof Let x and y be elements of ker . Then (x) = eK and (y) = eK ,
where eK denotes the identity element of K. But then (xy) = (x)(y) =
eK eK = eK , and thus xy belongs to ker . Also (x1 ) = (x)1 = e1
K = eK ,
and thus x1 belongs to ker . We conclude that ker is a subgroup of K.
Moreover ker is a normal subgroup of G, for if g G and x ker then
(gxg 1 ) = (g)(x)(g)1 = (g)(g 1 ) = eK .
11
since ((xN
)(yN )) = (xyN
) = (xy) = (x)(y) = (xN
)(yN
).
Suppose now that N = ker . Then (x) = (y) if and only if xN = yN .
G/N K is injective. Conversely if :
G/N
Thus the homomorphism :
K is injective then N must be the kernel of , as required.
Corollary 1.20 Let G and K be groups, and let : G K be a homomorphism. Then (G)
= G/ ker .
1.9
12
1.10
13
1.11
Conjugacy
1.12
14
1.13
Cauchys Theorem
1.14
1.15
17
Proof Let K be a Sylow p-subgroup of G, and let X be the set of left cosets
of K in G. Let H be a p-subgroup of G. Then H acts on X on the left,
where h(gK) = hgK for all h H and g G. Moreover h(gK) = gK if and
only if g 1 hg K. Thus an element gK of X is fixed by H if and only if
g 1 Hg K.
Let |G| = pk m, where k and m are positive integers and m is coprime to
p. Then |K| = pk . Now the number of left cosets of K in G is |G|/|K|. Thus
the set X has m elements. Now the number of elements in any orbit for the
action of H on X divides the order of H, since it is the index in H of the
stabilizer of some element of that orbit (Lemma 1.24). But then the number
of elements in each orbit must be some power of p, since H is a p-group.
Thus if an element of X is not fixed by H then the number of elements in its
orbit is divisible by p. But X is a disjoint union of orbits under the action
of H on X. Thus if m0 denotes the number of elements of X that are fixed
by H then m m0 is divisible by p.
Now m is not divisible by p. It follows that m0 6= 0, and m0 is not divisible
by p. Thus there exists at least one element g of G such that g 1 Hg K. But
then H is contained in the Sylow p-subgroup gKg 1 . Thus every p-subgroup
is contained in a Sylow p-subgroup of G, and this Sylow p-subgroup is a
conjugate of the given Sylow p-subgroup K. In particular any two Sylow
p-subgroups are conjugate.
It only remains to show that the number of Sylow p-subgroups in G
divides the order of |G| and is congruent to 1 modulo p. On applying the
above results with H = K, we see that g 1 Kg = K for some g G if and
only if gK is a fixed point for the action of K on X. But the number of
elements g of G for which gK is a fixed point is m0 |K|, where m0 is the
number of fixed points in X. It follows that the number of elements g of
G for which g 1 Kg = K is pk m0 . But every Sylow p-subgroup of G is of
the form g 1 Kg for some g G. It follows that the number n of Sylow
p-subgroups in G is given by n = |G|/pk m0 = m/m0 . In particular n divides
|G|. Now we have already shown that m m0 is divisible by p. It follows
that m0 is coprime to p, since m is coprime to p. Also m m0 is divisible
by m0 , since (m m0 )/m0 = n 1. Putting these results together, we see
that m m0 is divisible by m0 p, and therefore n 1 is divisible by p. Thus
n divides |G| and is congruent to 1 modulo p, as required.
1.16
18
Proof Let G be a group of order pq. It follows from the First Sylow Theorem
that G contains Sylow subgroups Np and Nq of orders p and q respectively.
Now the number np of Sylow p-subgroups divides pq and satisfies np 1
(mod p), by the Second Sylow Theorem. Clearly np cannot be divisible by p,
and therefore either np = 1 or np = q. But q 6 1 (mod p). It follows that
np = 1. Thus the group G has just one subgroup of order p.
Now, given any element g of G, the subgroups Np and gNp g 1 are of
order p. It follows that gNp g 1 = Np for all elements g of G. Thus Np is a
normal subgroup of G.
A similar argument shows that Nq is also a normal subgroup of G, since
p < q, and therefore p 6 1 (mod q).
Now Np Nq is a subgroup of both Np and Nq . It follows from Lagranges
Theorem that the order of Np Nq divides both of the prime numbers p and
q, and therefore |Np Nq | = 1 and Np Nq = {e}, where e is the identity
element of G.
Let x Np and y Nq . Then yx1 y 1 Np and xyx1 Nq , since
Np and Nq are normal subgroups of G. But then xyx1 y 1 Np Nq ,
since xyx1 y 1 = x(yx1 y 1 ) = (xyx1 )y 1 , and therefore xyx1 y 1 = e.
Thus xy = yx for all x Np and y Nq . It follows easily from this that
the function : Np Nq G which sends (x, y) Np Nq to xy is a
homomorphism. This homomorphism is injective, for if xy = e for some
x Np and y Nq , then x = y 1 , and hence x Np Nq , from which it
follows that x = e and y = e. But any injective homomorphism between two
finite groups of the same order is necessarily an isomorphism. Therefore the
function : Np Nq G is an isomorphism, and thus G
= Np Nq .
Now any group whose order is prime number must be cyclic. Therefore
the groups Np and Nq are cyclic. Let x be an element of Np that generates
Np , and let y be an element of Nq that generates Nq . Then (x, y)n = (xn , y n )
for all integers n. It follows from this that the order of (x, y) cannot be equal
to 1, p or q, and must therefore be equal to pq. Thus Np Nq is a cyclic
group generated by (x, y), and therefore G is a cyclic group, generated by
xy, as required.
Example Any finite group whose order is 15, 33, 35, 51, 65, 69, 85, 87, 91
or 95 is cyclic.
Theorem 1.34 Let G be a group of order 2p where p is a prime number
greater than 2. Then either the group G is cyclic, or else the group G is isomorphic to the dihedral group D2p of symmetries of a regular p-sided polygon
in the plane.
19
Proof It follows from the First Sylow Theorem, or from Cauchys Theorem, that the group G contains elements x and y whose orders are 2 and p
respectively. The subgroup N generated by y is then a Sylow p-subgroup
of G. Now it follows from the Second Sylow Theorem that the number of
Sylow p-subgroups of G divides 2p and is congruent to 1 modulo p. There
can therefore be only one such Sylow p-subgroup, since 2, p and 2p are not
congruent to 1 modulo p. Now if g is any element of G then gN g 1 is a
Sylow p-subgroup of G, and therefore gN g 1 = N . We deduce that N is a
normal subgroup of G, of order p.
Now consider the element xyx1 of G. This must be an element of the
normal subgroup N of G generated by y. Therefore xyx1 = y k for some
integer k. Moreover k is not divisible by p, since xyx1 is not the identity
element e of G. Then
2
1.17
Simple Groups
|H| |K|
.
|G|
25
Proof We regard A5 as the group even permutations of the set {1, 2, 3, 4, 5}.
There are 60 such permutations: the identity permutation, twenty 3-cycles,
twenty-four 5-cycles, and fifteen permutations that are products of two disjoint transpositions. (Such a product of disjoint transpositions is a permutation (a1 a2 )(a3 a4 ) that interchanges a1 with a2 and a3 with a4 for some
distinct elements a1 , a2 , a3 and a4 of the set {1, 2, 3, 4, 5}.)
Now each 3-cycle in A5 generates a Sylow 3-subgroup of order 3, and these
subgroups are all conjugate to one another by the Second Sylow Theorem.
It follows that any normal subgroup of A5 that contains at least one 3-cycle
must contain all twenty 3-cycles, and thus its order must therefore be at
least 21 (since it must also contain the identity element). Similarly each
5-cycle in A5 generates a Sylow 5-subgroup of order 5, and these subgroups
are all conjugate to one another. Therefore any normal subgroup of A5 that
contains at least one 5-cycle must contain all twenty four 5-cycles, and thus
its order must be at least 25.
Now if A5 were to contain a subgroup of order 30, this subgroup would be
the kernel of a non-constant homomorphism : A5 {1, 1} from A5 to the
multiplicative group consisting of the numbers 1 and 1. But any 3-cycle
or 5-cycle would have to belong to the kernel of this homomorphism, and
therefore this kernel would contain at least 45 elements, which is impossible.
We conclude that A5 cannot contain any subgroup of order 30. It follows
from Lagranges Theorem that any normal subgroup of A5 that contains at
least one 3-cycle or 5-cycle must be the whole of A5 .
The group A5 contains 5 Sylow 2-subgroups, which are of order 4. One
of these consists of the identity permutation, together with the three permutations (1 2)(3 4), (1 3)(2 4) and (1 4)(2 3). (Each of these permutations fixes
the element 5.) There are four other such Sylow 2-subgroups, and all of the
Sylow 2-subgroups are conjugate to one another. It follows that A5 does not
contain any normal subgroup of order 4. Moreover A5 cannot contain any
normal subgroup of order 2, since any element of order 2 belongs to one of
the five Sylow 2-subgroups of order 4, and is therefore conjugate to elements
of order 2 in the other Sylow 2-subgroups.
Now any subgroup of A5 whose order is divisible by 3 must contain a
3-cycle by Cauchys Theorem. (Theorem 1.27.) Similarly any subgroup of
A5 whose order is divisible by 5 must contain a 5-cycle. It follows that the
order of any proper normal subgroup of A5 cannot be divisible by 3 or 5.
But this order must divide 60. Therefore the order of any proper normal
subgroup of A5 must be at most 4. But we have seen that A5 cannot contain
any normal subgroup of order 4 or 2. Therefore any proper normal subgroup
of A5 is trivial, and therefore A5 is simple.
26
1.18
Solvable Groups
=
=
Hi1
Gi1 (Gi H)
Gi1
by the First Isomorphism Theorem (Theorem 1.22), and thus Hi /Hi1 is
isomorphic to a subgroup of the Abelian group Gi /Gi1 . It follows that
Hi /Hi1 must itself be an Abelian group. We conclude therefore that the
subgroup H of G is solvable.
Now let N be a normal subgroup of G, and let Ki = Gi N/N for all i.
Then K0 is the trivial subgroup of G/N and Km = G/N . It follows from
Lemma 1.40 that Ki1 / Ki and Ki /Ki1 is isomorphic to the quotient of
Gi /Gi1 by some normal subgroup. But a quotient of any Abelian group
must itself be Abelian. Thus each quotient group Ki /Ki1 is Abelian, and
thus G/N is solvable.
Finally suppose that G is a group, N is a normal subgroup of G and
both N and G/N are solvable. We must prove that G is solvable. Now the
solvability of N ensures the existence of a finite sequence G0 , G1 , . . . , Gm of
subgroups of N , where G0 = {1}, Gm = N , and Gi1 / Gi and Gi /Gi1 is
Abelian for i = 1, 2, . . . , m. Also the solvability of G/N ensures the existence
of a finite sequence K0 , K1 , . . . , Kn of subgroups of G/N , where K0 = N/N ,
Kn = G/N , and Ki1 / Ki and Ki /Ki1 is Abelian for i = 1, 2, . . . , n.
Let Gm+i be the preimage of Ki under the the quotient homomorphism
: G G/N , for i = 1, 2, . . . , n. The Second Isomorphism Theorem (Theorem 1.23) ensures that Gm+i /Gm+i1
= Ki /Ki1 for all i > 0. Therefore
G0 , G1 , . . . , Gm+n is a finite sequence of subgroups of G, where G0 = {1},
Gn = G, and Gi1 / Gi and Gi /Gi1 is Abelian for i = 1, 2, . . . , m + n. Thus
the group G is solvable, as required.
Example The alternating group A5 is simple. It follows that A5 is not
solvable, since the definition of solvable groups ensures that that any simple
solvable group is cyclic, and A5 is not cyclic. Now if n 5 the symmetric
28
29
2.1
31
2.2
Ideals
2.3
33
(I + x)(I + y) = I + xy
for all I +x R/I and I +y R/I. One can readily verify that R/I is a ring
with respect to these operations. We refer to the ring R/I as the quotient of
the ring R by the ideal I.
Example Let n be an integer satisfying n > 1. The quotient Z/nZ of the
ring Z of integers by the ideal nZ generated by n is the ring of congruence
classes of integers modulo n. This ring has n elements, and is a field if and
only if n is a prime number.
Definition A function : R S from a ring R to a ring S is said to be a
homomorphism (or ring homomorphism) if and only if (x+y) = (x)+(y)
and (xy) = (x)(y) for all x, y R. If in addition the rings R and S are
unital then a homomorphism : R S is said to be unital if (1) = 1 (i.e.,
maps the identity element of R onto that of S).
Let R and S be rings, and let : R S be a ring homomorphism. Then
the kernel ker of the homomorphism is an ideal of R, where
ker = {x R : (x) = 0}.
The image (R) of the homomorphism is a subring of S; however it is not
in general an ideal of S.
An ideal I of a ring R is the kernel of the quotient homomorphism that
sends x R to the coset I + x.
Definition An isomorphism : R S between rings R and S is a homomorphism that is also a bijection between R and S. The inverse of an
isomorphism is itself an isomorphism. Two rings are said to be isomorphic
if there is an isomorphism between them.
34
2.4
Let R be a ring, and let r R. We may define n.r for all natural numbers n
by recursion on n so that 1.r = r and n.r = (n 1).r + r for all n > 0. We
define also 0.r = 0 and (n).r = (n.r) for all natural numbers n. Then
(m + n).r = m.r + n.r,
(mn).r = m.(n.r),
2.5
Polynomial Rings
such that h = f q + r and either r = 0 or else deg r < deg f . (Lemma 2.10).
But r I, since r = h f q and h and f both belong to I. The choice of f
then ensures that r = 0 and h = qf . Thus I = (f ).
Definition Polynomials f1 , f2 , . . . , fk with coefficients in some field K. are
said to be coprime if there is no non-constant polynomial that divides all of
them.
Theorem 2.12 Let f1 , f2 , . . . , fk be coprime polynomials with coefficients in
some field K. Then there exist polynomials g1 , g2 , . . . , gk with coefficients in
K such that
f1 (x)g1 (x) + f2 (x)g2 (x) + + fk (x)gk (x) = 1.
Proof Let I be the ideal in K[x] generated by f1 , f2 , . . . , fk . It follows from
Lemma 2.11 that the ideal I is generated by some polynomial d. Then d
divides all of f1 , f2 , . . . , fk and is therefore a constant polynomial, since these
polynomials are coprime. It follows that I = K[x]. But the ideal I of K[x]
generated by f1 , f2 , . . . , fk coincides with the subset of K[x] consisting of all
polynomials that may be represented as finite sums of the form
f1 (x)g1 (x) + f2 (x)g2 (x) + + fk (x)gk (x)
for some polynomials g1 , g2 , . . . , gk . It follows that the constant polynomial
with value 1 may be expressed as a sum of this form, as required.
Definition A non-constant polynomial f with coefficients in a field K is said
to be irreducible over K if it is not divisible by any non-constant polynomial
of lower degree with coefficients in K.
Any polynomial with coefficients in a field K may be factored as a product
of irreducible polynomials. This is easily proved by induction on the degree
of the polynomial, for if a non-constant polynomial is not itself irreducible,
then it can be factored as a product of polynomials of lower degree.
Lemma 2.13 Let K be a field. Then the ring K[x] of polynomials with
coefficients in K contains infinitely many irreducible polynomials.
Proof Let f1 , f2 , . . . , fk K[x] be irreducible polynomials, and let
g = f1 f2 fk + 1.
Then g is not divisible by f1 , f2 , . . . , fk , and therefore no irreducible factor
of g is divisible by any of f1 , f2 , . . . , fk . It follows that K[x] must contain
irreducible polynomials distinct from f1 , f2 , . . . , fk . Thus the number of irreducible polynomials in K[x] cannot be finite.
37
2.6
Gausss Lemma
i=0
i < j and p divides ci for all i < k. But p does not divide bj ck since p does
not divide either bj or ck . Therefore p does not divide the coefficient aj+k of
gh. This shows that the polynomial gh is primitive, as required.
Proposition 2.17 A polynomial with integer coefficients is irreducible over
the field Q of rational numbers if and only if it cannot be factored as a product
of polynomials of lower degree with integer coefficients.
Proof Let f be a polynomial with integer coefficients. If f is irreducible
over Q then f clearly cannot be factored as a product of polynomials of
lower degree with integer coefficients. Conversely suppose that f cannot be
factored in this way. Let f (x) = g(x)h(x), where g and h are polynomials
with rational coefficients. Then there exist positive integers r and s such that
the polynomials rg(x) and sh(x) have integer coefficients. Let the positive
integers u and v be the highest common factors of the coefficients of the
polynomials rg(x) and sh(x) respectively. Then rg(x) = ug (x) and sh(x) =
vh (x), where g and h are primitive polynomials with integer coefficients.
Then (rs)f (x) = (uv)g (x)h (x). We now show that f (x) = mg (x)h (x)
for some integer m. Let l be the smallest divisor of rs such that lf (x) =
mg (x)h (x) for some integer m. We show that l = 1. Suppose that it
were the case that l > 1. Then there would exist a prime factor p of l.
Now p could not divide m, since otherwise (l/p)f (x) = (m/p)g (x)h (x),
which contradicts the definition of l. Theorefore p would have to divide each
coefficient of g (x)h (x), which is impossible, since it follows from Gausss
Lemma (Lemma 2.16) that the product g h of the primitive polynomials
g and h is itself a primitive polynomial. Therefore l = 1 and f (x) =
mg (x)h (x). Now f does not factor as a product of polynomials of lower
degree with integer coefficients. Therefore either deg f = deg g = deg g, or
else deg f = deg h = deg h, Thus f is irreducible over Q, as required.
2.7
40
3.1
m
P
i=1
41
and therefore
m
P
i=1
independent over K. It follows that ij = 0 for all i and j. This shows that
the elements xi yj are linearly independent over K.
Now y1 , y2 , . . . , yn span M as a vector space over L, and therefore any
n
P
element z of M can be written in the form z =
j yj , where j L for
j=1
m P
n
P
m
P
ij xi , where ij K
i=1
i=1 j=1
3.2
42
Proof Let L: K be a finite field extension, and let n = [L: K]. Let L.
Then either the elements 1, , 2 , . . . , n are not all distinct, or else these
elements are linearly dependent over the field K (since a linearly independent subset of L can have at most n elements.) Therefore there exist
c0 , c1 , c2 , . . . , cn K, not all zero, such that
c0 + c1 + c2 2 + + cn n = 0.
Thus is algebraic over K. This shows that the field extension L: K is
algebraic, as required.
Definition A polynomial f with coefficients in some field or unital ring is
said to be monic if its leading coefficient (i.e., the coefficient of the highest
power of x occurring in f (x) with a non-zero coefficient) is equal to 1.
Lemma 3.3 Let K be a field and let be an element of some extension
field L of K. Suppose that is algebraic over K. Then there exists a unique
irreducible monic polynomial m K[x], with coefficients in K, characterized
by the following property: f K[x] satisfies f () = 0 if and only if m divides
f in K[x].
Proof Let I = {f K[x] : f () = 0}. Then I is a non-zero ideal of K[x].
Now there exists some polynomial m with coefficients in K which generates
the ideal I (Lemma 2.11). Moreover, by dividing m by its leading coefficient,
if necessary, we can ensure that m is a monic polynomial. Then f K[x]
satisfies f () = 0 if and only if m divides f .
Suppose that m = gh where g, h K[x]. Then 0 = m() = g()h().
But then either g() = 0, in which case m divides g, or else h() = 0, in
which case m divides h. The polynomial m is thus irreducible over K.
The polynomial m is uniquely determined since if some monic polynomial m also satisfies the required conditions then m and m divide one another
and therefore m = m.
Definition Let K be a field and let L be an extension field of K. Let be
an element of L that is algebraic over K. The minimum polynomial m of
over K is the unique irreducible monic polynomial m K[x] with coefficients
in K characterized by the following property: f K[x] satisfies f () = 0 if
and only if m divides f in K[x].
Note that if f K[x] is an irreducible monic polynomial, and if is a
root of f in some extension field L of K, then f is the minimum polynomial
of over K.
43
3.3
Definition A field K is said to be algebraically closed if, given any nonconstant polynomial f K[x] with coefficients in K, there exists some K
satisfying f () = 0.
The field C of complex numbers is algebraically closed. This result is the
Fundamental Theorem of Algebra.
Lemma 3.7 Let K be an algebraically closed field, and let L: K be an algebraic extension of K. Then L = K.
Proof Let L, and let m K[x] be the minimal polynomial of over
K. Then the polynomial m (x) has a root a in K, and is therefore divisible
by the polynomial x a. It follows that m (x) = x a, since m (x) is an
irreducible monic polynomial. But then = a, and therefore K. This
shows that every element of L belongs to K, and thus L = K, as required.
3.4
One can make use of the Tower Law in order to prove the impossibility of
performing a number of geometric constructions in a finite number of steps
using straightedge and and compasses alone. These impossible constructions
include the following:
the trisection of an arbitrary angle;
the construction of the edge of a cube having twice the volume of some
given cube;
45
Example We show that there is no geometrical construction for the trisection of an angle of 3 radians (i.e., 60 ) using straightedge and compasses
alone. Let a = cos 9 and b = sin 9 . Now the point (cos 3 , sin 3 ) (i.e, the
point ( 12 , 12 3)) is constructible. Thus if an angle of 3 radians could be trisected using straightedge and compasses alone, then the point (a, b) would
be constructible. Now
cos 3 = cos cos 2 sin sin 2 = cos (cos2 sin2 ) 2 sin2 cos
= 4 cos3 3 cos
for any angle . On setting = 9 we deduce that 4a3 3a = 12 and thus
8a3 6a 1 = 0. Now 8a3 6a 1 = f (2a 1), where f (x) = x3 + 3x2 3.
An immediate application of Eisensteins criterion for irreducibility shows
that the polynomial f is irreducible over the field Q of rational numbers, and
thus [Q(a): Q] = [Q(2a 1): Q] = 3. It now follows from Theorem 3.8 that
the point (cos 9 , sin 9 ) is not constructible using straightedge and compasses
alone. Therefore it is not possible to trisect an angle of 3 radians using
straightedge and compasses alone. It follows that there is no geometrical
construction for the trisection of an arbitrary angle using straightedge and
compasses alone.
Example It is not difficult to see
that if it were possible to
construct two
3
3
points in the plane a distance 2 apart, then the point ( 2, 0) would be
constructible.
But it follows from Theorem 3.8 that this is impossible,
3
since 2is a root of the irreducible monic polynomial x3 2, and therefore [Q( 3 2), Q] = 3. We conclude that there is no geometric construction
using straightedge and compasses alone that will construct from a line segment in the plane a second line segment such that a cube with the second
line segment as an edge will have twice the volume of a cube with the first
line segment as an edge.
Example It can be shown
that is not algebraic over the field Q of rational
numbers. Therefore is not algebraic over Q. It then follows from Theorem 3.8 it is not possible to give a geometrical construction for obtaining a
square with the same area as a given circle, using straightedge and compasses
alone. (Thus it is not possible to square the circle using straightedge and
compasses alone.)
Lemma 3.9 If the endpoints of any line segment in the plane are constructible, then so is the midpoint.
47
Proof Let P and Q be constructible points in the plane. Let S and T be the
points where the circle centred on P and passing through Q intersects the
circle centred on Q and passing through P . Then S and T are constructible
points in the plane, and the point R at which the line ST intersects the
line P Q is the midpoint of the line segment P Q. Thus this midpoint is a
constructible point.
Lemma 3.10 If any three vertices of a parallelogram in the plane are constructible, then so is the fourth vertex.
Proof Let the vertices of the parallelogram listed in anticlockwise (or in
clockwise) order be A, B, C and D, where A, B and D are constructible
points. We must show that C is also constructible. Now the midpoint E of
the line segment BD is a constructible point, and the circle centred on E
and passing though A will intersect the line AE in the point C. Thus C is a
constructible point, as required.
Theorem 3.11 Let K denote the set of all real numbers x for which the
point (x, 0) is constructible using straightedge and compasses alone. Then K
is a subfield of the field of real numbers, and a point (x, y) of the plane is
constructible using straightedge and compass
alone if and only if x K and
y K. Moreover if x K and x > 0 then x K.
Proof Clearly 0 K and 1 K. Let x and y be real numbers belonging to
K. Then (x, 0) and (y, 0) are constructible points of the plane. Let M be the
midpoint of the line segment whose endpoints are (x, 0) and (y, 0). Then M
is constructible (Lemma 3.9), and M = ( 21 (x + y), 0). The circle centred on
M and passing through the origin intersects the x-axis at the origin and at
the point (x + y, 0). Therefore (x + y, 0) is a constructible point, and thus
x + y K. Also the circle centred on the origin and passing through (x, 0)
intersects the x-axis at (x, 0). Thus (x, 0) is a constructible point, and
thus x K.
We claim that if x K then the point (0, x) is constructible. Now if x K
and x 6= 0 then (x, 0) and (x, 0) are constructible points, and the circle
centred on (x, 0) and passing through (x, 0) intersects the circle centred on
(x, 0) and passing through
(x, 0) in two
since it is the fourth vertex of a parallelogram which has three vertices at the
constructible points (x, 0), (0, y) and (0, 1) (Lemma 3.10). But the line which
passes through the two constructible points (0, y) and (x, y 1) intersects
the x-axis at the point (xy, 0). Therefore the point (xy, 0) is constructible,
and thus xy K.
Now suppose that x K, y K and y 6= 0. The point (x, 1 y) is
constructible, since it is the fourth vertex of a parallelogram with vertices
at the constructible points (x, 0), (0, y) and (0, 1). The line segment joining
the constructible points (0, 1) and (x, 1 y) intersects the x-axis at the point
(xy 1 , 0). Thus xy 1 K.
The above results show that K is a subfield of the field of real numbers.
Moreover if x K and y K then the point (x, y) is constructible, since it is
the fourth vertex of a rectangle with vertices at the constructible points (0, 0),
(x, 0) and (0, y). Conversely, suppose that the point (x, y) is constructible.
We claim that the point (x, 0) is constructible and thus x K. This result is
obviously true if y = 0. If y 6= 0 then the circles centred on the points (0, 0)
and (1, 0) and passing through (x, y) intersect in the two points (x, y) and
(x, y). The point (x, 0) is thus the point at which the line passing through
the constructible points (x, y) and (x, y) intersects the x-axis, and is thus
itself constructible. The point (0, y) is then the fourth vertex of a rectangle
with vertices at the constructible points (0, 0), (x, 0) and (x, y), and thus is
itself constructible. The circle centred on the origin and passing though (0, y)
intersects the x-axis at (y, 0). Thus (y, 0) is constructible, and thus y K.
We have thus shown that a point (x, y) is constructible using straightedge
and compasses alone if and only if x K and y K.
Suppose that x K and that x > 0. Then 21 (1 x) K. Thus if
C = (0, 12 (1 x)) then C is a constructible point. Let (u, 0) be the point at
which the circle centred on C and passing through the constructible point
(0, 1) intersects the x-axis. (The circle does intersect the x-axis since it passes
through (0, 1) and (0, x), and x > 0.) The radius of this circle is 12 (1 + x)),
and therefore 14 (1 x)2 + u2 = 14 (1 + x)2 (Pythagoras Theorem.) But then
2
u
= x. But (u, 0) is a constructible point. Thus if x K and x > 0 then
x K, as required.
The above theorems can be applied to the problem of determining whether
or not it is possible to construct a regular n-sided polygon with a straightedge
and compass, given its centre and one of its vertices. The impossibility
of trisecting an angle of 60 shows that a regular 18-sided polygon is not
constructible using straightedge and compass. Now if one can construct a
regular n-sided polygon then one can easily construct a regular 2n-sided
polygon by bisecting the angles of the n-sided polygon. Thus the problem
49
3.5
Splitting Fields
51
3.6
Normal Extensions
Proof Suppose that L: K is both finite and normal. Then there exist algebraic elements 1 , 2 , . . . , n of L such that L = K(1 , 2 , . . . , n ) (Corollary 3.5). Let f (x) = m1 (x)m2 (x) mn (x), where mj K[x] is the minimum polynomial of j over K for j = 1, 2, . . . , n. Then mj splits over L since
mj is irreducible and L: K is normal. Thus f splits over L. It follows that
L is a splitting field for f over K, since L is obtained from K by adjoining
roots of f .
Conversely suppose that L is a splitting field over K for some polynomial
f K[x]. Then L is obtained from K by adjoining the roots of f , and
therefore the extension L: K is finite. (Corollary 3.5).
Let g K[x] be irreducible, and let M be a splitting field for the polynomial f g over L. Then L M and the polynomials f and g both split over
M . Let and be roots of g in M . Now the polynomial f splits over the
fields L() and L(). Moreover if f splits over any subfield of M containing
K() then that subfield must contain L (since L is a splitting field for f over
K) and thus must contain L(). We deduce that L() is a splitting field for
f over K(). Similarly L() is a splitting field for f over K().
Now there is a well-defined K-isomorphism : K() K() which sends
h() to h() for all polynomials h with coefficients in K, since two such polynomials h1 and h2 take the same value at a root of the irreducible polynomial g if and only if their difference h1 h2 is divisible by g. This isomorphism
: K() K() extends to an K-isomorphism : L() L() between L()
and L(), since L() and L() are splitting fields for f over the field K() and
K() respectively (Theorem 3.15). Thus the extensions L(): K and L(): K
are isomorphic, and [L(): K] = [L(): K]. But [L(): K] = [L(): L][L: K]
and [L(): K] = [L(): L][L: K] by the Tower Law (Theorem 3.1). It follows
that [L(): L] = [L(): L]. In particular L if and only if L. This
shows that that any irreducible polynomial with a root in L must split over
L, and thus L: K is normal, as required.
3.7
Separability
54
n
P
jcj xj1 .
j=1
(The definition of formal derivative given above is a purely algebraic definition, applying to polynomials with coefficients in any field whatsoever,
which corresponds to the formula for the derivative of a polynomial with real
coefficients obtained by elementary calculus.)
Let K be a field. One can readily verify by straightforward calculation
that D(f + g) = Df + Dg and D(f g) = (Df )g + f (Dg) for all f K[x]. If
f is a constant polynomial then Df = 0.
Let K be a field, and let f K[x]. An element of an extension field L
of K is said to be a repeated zero if (x )2 divides f (x).
Proposition 3.18 Let K be a field, and let f K[x]. The polynomial f
has a repeated zero in a splitting field for f over K if and only if there exists
a non-constant polynomial with coefficients in K that divides both f and its
formal derivative Df in K[x].
Proof Suppose that f K[x] has a repeated root in a splitting field L.
Then f (x) = (x )2 h(x) for some polynomial h L[x]. But then
(Df )(x) = 2(x )h(x) + (x )2 (Dh)(x)
and hence (Df )() = 0. It follows that the minimum polynomial of over
K is a non-constant polynomial with coefficients in K which divides both f
and Df .
Conversely let f K[x] be a polynomial with the property that f and
Df are both divisible by some non-constant polynomial g K[x]. Let L be
a splitting field for f over K. Then g splits over L (since g is a factor of f ).
Let L be a root of g. Then f () = 0, and hence f (x) = (x )e(x)
for some polynomial e L[x]. On differentiating, we find that (Df )(x) =
e(x) + (x )De(x). But (Df )() = 0, since g() = 0 and g divides Df
in K[x]. It follows that e() = (Df )() = 0, and thus e(x) = (x )h(x)
for some polynomial h L[x]. But then f (x) = (x )2 h(x), and thus the
polynomial f has a repeated root in the splitting field L, as required.
Definition Let K be a field. An irreducible polynomial in K[x] is said to
be separable over K if it does not have repeated roots in a splitting field. A
polynomial in K[x] is said to separable over K if all its irreducible factors
are separable over K. A polynomial is said to be inseparable if it is not
separable.
55
3.8
Finite Fields
p
X
p
xj y pj , where
j=0
p
p
p(p 1) (p j + 1)
for j = 1, 2, . . . , p. The de= 1 and
=
j!
0
j
nominator of each binomial coefficient must divide the numerator, since this
coefficient is an integer. Now the characteristic p of K is a prime number.
Moreover if 0 < j < p then p is a factor of the numerator but is not a factor
of the denominator. It follows from the Fundamental Theorem of Arithmetic
56
p
that p divides
for all j satisfying 0 < j < p. But px = 0 for all x K,
j
since charK = p. Therefore (x + y)p = xp + y p for all x, y K. The identity
(xy)p = xp y p is immediate from the commutativity of K.
Let K be a field of characteristic p, where p > 0. The monomorphism
x 7 xp is referred to as the Frobenius monomorphism of K. If K is finite then
this monomorphism is an automorphism of K, since any injection mapping
a finite set into itself must be a bijection.
Theorem 3.22 A field K has pn elements if and only if it is a splitting field
n
for the polynomial xp x over its prime subfield Fp , where Fp
= Z/pZ.
Proof Suppose that K has q elements, where q = pn . If K \ {0} then
q1 = 1, since the set of non-zero elements of K is a group of order q 1
with respect to multiplication. It follows that q = for all K. Thus
all elements of K are roots of the polynomial xq x. This polynomial must
therefore split over K, since its degree is q and K has q elements. Moreover
the polynomial cannot split over any proper subfield of K. Thus K is a
splitting field for this polynomial.
Conversely suppose that K is a splitting field for the polynomial f over
Fp , where f (x) = xq x and q = pn . Let () = q for all K.
Then : K K is a monomorphism, being the composition of n successive
applications of the Frobenius monomorphism of K. Moreover an element
of K is a root of f if and only if () = . It follows from this that
the roots of f constitute a subfield of K. This subfield is the whole of
K, since K is a splitting field. Thus K consists of the roots of f . Now
Df (x) = qxq1 1 = 1, since q is divisible by the characteristic p of Fp . It
follows from Proposition 3.18 that the roots of f are distinct. Therefore f
has q roots, and thus K has q elements, as required.
Let K be a finite field of characteristic p. Then K has pn elements, where
n = [K: Fp ], since any vector space of dimension n over a field of order p must
have exactly pn elements. The following result is now a consequence of the
existence of splitting fields (Corollary 3.14) and the uniqueness of splitting
fields up to isomorphism (Theorem 3.15)
Corollary 3.23 There exists a finite field GF(pn ) of order pn for each prime
number p and positive integer n. Two finite fields are isomorphic if and only
if they have the same number of elements.
57
(d) = n.
d|n
satisfying 0 x < n for which (x, n) = n/d. Thus it suffices to show that
nd = (d) for each divisor d of n.
Let d be a divisor of n, and let a = n/d. Given any integer x satisfying
0 x < n that is divisible by a, there exists an integer y satisfying 0 y < d
such that x = ay. Then (x, n) = (ay, ad) = a(y, d). It follows that the
integers x satisfying 0 x < n for which (x, n) = a are those of the form
ay, where y is an integer, 0 y < d and (y, d) = 1. It follows that there
are exactly (d) integersX
x satisfying 0 x < n for which (x, n) = n/d, and
thus nd = (d) and n =
(d), as required.
d|n
The set of all non-zero elements of a field is a group with respect to the
operation of multiplication.
Theorem 3.25 Let G be a finite subgroup of the group of non-zero elements
of a field. Then the group G is cyclic.
Proof Let n be the order of the group G. It follows from Lagranges Theorem
that the order of every element of G divides n. For each divisor dX
of n, let (d)
denote the number of elements of G that are of order d. Clearly
(d) = n.
d|n
d|n
3.9
3.10
Theorem 3.31 Let (L: K) be the Galois group of a finite field extension
L: K. Then |(L: K)| divides [L: K]. Moreover |(L: K)| = [L: K] if and only
if L: K is a Galois extension, in which case K is the fixed field of (L: K).
Proof Let M be the fixed field of (L: K). It follows from Theorem 3.30
that L: M is a Galois extension and |(L: K)| = [L: M ]. Now [L: K] =
[L: M ][M : K] by the Tower Law (Theorem 3.1). Thus |(L: K)| divides
[L: K]. If |(L: K)| = [L: K] then M = K. But then L: K is a Galois
extension and K is the fixed field of (L: K).
Conversely suppose that L: K is a Galois extension. We must show that
|(L: K)| = [L: K]. Now the extension L: K is both finite and separable. It
follows from the Primitive Element Theorem (Theorem 3.27) that there exists
some element of L such that L = K(). Let f be the minimum polynomial
of over K. Then f splits over L, since f is irreducible and the extension
L: K is normal. Let 1 , 2 , . . . , n be the roots of f in L, where 1 = and
n = deg f . If is a K-automorphism of L then f (()) = (f ()) = 0, since
the coefficients of the polynomial f belong to K and are therefore fixed by
. Thus () = j for some j. We claim that, for each root j of f , there is
exactly one K-automorphism j of L satisfying j () = j .
Let g(x) and h(x) be polynomials with coefficients in K. Suppose that
g() = h(). Then g h is divisible by the minimum polynomial f of .
It follows that g(j ) = h(j ) for any root j of f . Now every element of
L is of the form g() for some g K[x], since L = K(). We deduce
therefore that there is a well-defined function j : L L with the property
that j (g()) = g(j ) for all g K[x]. The definition of this function ensures
that it is the unique automorphism of the field L that fixes each element of
K and sends to j .
Now the roots of the polynomial f in L are distinct, since f is irreducible
and L: K is separable. Moreover the order of the Galois group (L: K) is
equal to the number of roots of f , since each root determines a unique element
of the Galois group. Therefore |(L: K)| = deg f . But deg f = [L: K] since
L = K() and f is the minimum polynomial of over K (Theorem 3.4).
Thus |(L: K)| = [L: K], as required.
3.11
Proposition 3.32 Let K, L and M be fields satisfying K M L. Suppose that L: K is a Galois extension. Then so is L: M . If in addition M : K
is normal, then M : K is a Galois extension.
Proof Let L and let fK K[x] and fM M [x] be the minimum
polyomials of over K and M respectively. Then fK splits over L, since fK
62
3.12
Quadratic Polynomials
3.13
Cubic Polynomials
,
4
27
2
and the product of these roots is p3 /27. Thus if one of these roots is equal to
u3 then the other is equal to v 3 , where v = p/(3u). It follows that the roots
of the cubic polynomial f are
s
s
r
r
2
3
3 q
3 q
q
p
q2
p3
+
2
4
27
2
4
27
where the two cube roots must be chosen so as to ensure that their product
is equal to 13 p. It follows that the cubic polynomial x3 px q splits over the
1 3
field K(, , ), where 2 = 14 q 2 27
p and 3 = 12 q + and where satisfies
3
= 1 and 6= 1. The roots of the polynomial in this extension field are ,
and , where
=+
p
,
3
= + 2
p
,
3
= 2 + 3
p
.
3
Now let us consider the possibilities for the Galois group (L: K), where
L is a splitting field for f over K. Now L = K(, , ), where , and
are the roots of f . Also a K-automorphism of L must permute the roots
of f amongst themselves, and it is determined by its action on these roots.
Therefore (L: K) is isomorphic to a subgroup of the symmetric group 3
(i.e., the group of permutations of a set of 3 objects), and thus the possibilities
for the order of (L: K) are 1, 2, 3 and 6. It follows from Corollary 3.16 that
f is irreducible over K if and only if the roots of f are distinct and the
Galois group acts transitively on the roots of f . By considering all possible
subgroups of 3 it is not difficult to see that f is irreducible over K if and
only if |(L: K)| = 3 or 6. If f splits over K then |(L: K)| = 1. If f factors
in K[x] as the product of a linear factor and an irreducible quadratic factor
then |(L: K)| = 2.
65
3.14
Quartic Polynomials
We now consider how to locate the roots of a quartic polynomial with coefficients in a field K of characteristic zero. A substitution of the form x 7 xc,
where c K, will reduce the problem to that of locating the roots , ,
and of a quartic polynomial f of the form f (x) = x4 px2 qx r in some
splitting field L.
Now the roots , , and of the quartic polynomial
x4 px2 qx r,
must satisfy the equation
(x )(x )(x )(x ) = x4 px2 qx r.
Equating coefficients of x, we find that
+ + + = 0,
and
p = ( + + + + + ),
q = + + + ,
r = .
Let
= ( + )( + ) = ( + )2 = ( + )2 ,
= ( + )( + ) = ( + )2 = ( + )2 ,
= ( + )( + ) = ( + )2 = ( + )2 .
66
( + + )( + + )
2 + 2 + 2 + + + ,
( + + )( + + )
( 2 + 2 + 2 + 2 + 2 + 2 ) 2,
2 + 2 + 2 .
Then
2
2
2
+ + = ( + ) + ( + ) + ( + )
= 2 2 + 2 + 2 + + +
2
+ +
p2
= 2p,
= ( + )4 + ( + )4 + ( + )4
= 4 + 4 3 + 6 2 2 + 4 3 + 4
+ 4 + 4 3 + 6 2 2 + 4 3 + 4
+ 4 + 4 3 + 6 2 2 + 4 3 + 4
= 2( 4 + 4 + 4 ) + 4( 3 + 3 + 3 + 3 + 3 + 3 )
+ 6( 2 2 + 2 2 + 2 2 ),
= 4 + 4 + 4 + 3( 2 2 + 2 2 + 2 2 )
+ 4( 2 + 2 + 2 )
+ 2( 3 + 3 + 3 + 3 + 3 + 3 ).
Therefore
2 + 2 + 2 = 2p2 8( 2 + 2 + 2 )
= 2p2 8r.
But
4p2 = ( + + )2 = 2 + 2 + 2 + 2( + + )
Therefore
+ + = 2p2 21 (2 + 2 + 2 )
= p2 + 4r.
67
One can then verify that the roots of f take the form 12 ( + +
3.15
A straightforward application of Eisensteins Irreducibility Criterion (Proposition 2.18) shows that the polynomial x4 2 is irreducible over Q. Let be
4
the unique positive real number satisfying 4 = 2. Then the roots of x
2
in the field C of complex numbers are , i, and i, where i = 1.
Thus if L = Q(, i) then L is a splitting field for the polynomial x4 2 over
Q.
Now the polynomial x4 2 is the minimum polynomial of over Q, since
this polynomial is irreducible. We can therefore apply Theorem 3.4 to deduce
that [Q(): Q] = 4. Now i does not belong to Q(), since Q() R. Therefore
the polynomial x2 + 1 is the minimum polynomial of i over Q(). Another
application of Theorem 3.4 now shows that [L: Q()] = [Q(, i): Q()] = 2. It
follows from the Tower Law (Theorem 3.1) that [L: Q] = [L: Q()][Q(): Q] =
8. Moreover the extension L: Q is a Galois extension, and therefore its Galois
group (L: Q) is a group of order 8 (Theorem 3.31).
Another application of the Tower Law now shows that [L: Q(i)] = 4,
since [L: Q] = [L: Q(i)][Q(i): Q] and [Q(i): Q] = 2. Therefore the minimum
polynomial of over Q(i) is a polynomial of degree 4 (Theorem 3.4). But is
a root of x4 2. Therefore x4 2 is irreducible over Q(i), and is the minimum
polynomial of over Q(i). Corollary 3.16 then ensures the existence of an
automorphism of L that sends L to i and fixes each element of Q(i).
Similarly there exists an automorphism of L that sends i to i and fixes
each element of Q(). (The automorphism is in fact the restriction to L
of the automorphism of C that sends each complex number to its complex
conjugate.)
Now the automorphisms , 2 , 3 and 4 fix i and therefore send to
i, , i and respectively. Therefore 4 = , where is the identity
automorphism of L. Similarly 2 = . Straightforward calculations show
that = 3 , and ( )2 = ( 2 )2 = ( 3 )2 = . It follows easily from this
that (L: Q) = {, , 2 , 3 , , , 2 , 3 }, and (L: Q) is isomorphic to the
dihedral group of order 8 (i.e., the group of symmetries of a square in the
plane).
The Galois correspondence is a bijective correspondence between the subgroups of (L: Q) and subfields of L that contain Q. The subfield of L corresponding to a given subgroup of (L: Q) is set of all elements of L that
are fixed by all the automorphisms in the subgroup. One can verify that
the correspondence between subgroups of (L: Q) and their fixed fields is as
69
follows:
3.16
Subgroup of (L: Q)
Fixed field
(L: K)
{, , 2 , 3 }
{, 2 , , 2 }
{, 2 , , 3 }
{, 2 }
{, }
{, 2 }
{, }
{, 3 }
{}
Q
Q(i)
Q( 2)
Q(i
2)
Q( 2, i)
Q()
Q(i)
Q((1 i)/)
Q((1 + i)/)
Q(, i)
3.17
Lemma 3.40 Let f be a polynomial with coefficients in a field K of characteristic zero, and let K 0 = K(), where K 0 satisfies p K for some
prime number p. Then K (f ) is solvable if and only if K 0 (f ) is solvable.
Proof Let N be a splitting field for the polynomial f (x)(xp c) over K,
where c = p . Then N contains a splitting field L for f over K and a
splitting field M for xp c over K. Then N : K, L: K and M : K are Galois
extensions. The Galois correspondence (Theorem 3.35) ensures that (N : L)
and (N : M ) are normal subgroups of (N : K). Moreover (L: K) is isomorphic to (N : K)/(N : L), and (M : K) is isomorphic to (N : K)/(N : M ).
Now M and N are splitting fields for the polynomial xp c over the fields K
and L respectively. It follows from Lemma 3.39 that (M : K) and (N : L)
are solvable. But if H is a normal subgroup of a finite group G then G is solvable if and only both H and G/H are solvable (Proposition 1.41). Therefore
(N : K) is solvable if and only if (N : M ) is solvable. Also (N : K) is solvable if and only if (L: K) is solvable. It follows that (N : M ) is solvable if
and only if (L: K) is solvable. But (N : M )
= M (f ) and (L: K)
= K (f ),
since L and N are splitting fields for f over K and M respectively. Thus
M (f ) is solvable if and only if K (f ) is solvable.
Now M is also a splitting field for the polynomial xp c over K 0 , since
0
K = K(), where is a root of the polynomial xp c. The above argument therefore shows that M (f ) is solvable if and only if K 0 (f ) is solvable.
Therefore K (f ) is solvable if and only if K 0 (f ) is solvable, as required.
Theorem 3.41 Let f be a polynomial with coefficients in a field K of characteristic zero. Suppose that f is solvable by radicals. Then the Galois group
K (f ) of f is a solvable group.
Proof The polynomial f is solvable by radicals. Therefore there exist fields
K0 , K1 , . . . , Km such that K0 = K, the polynomial f splits over Km , and, for
each integer i between 1 and m, the field Ki is obtained on adjoining to Ki1
an element i with the property that ipi Ki1 for some prime number pi .
Now Km (f ) is solvable, since it is the trivial group consisting of the identity
automorphism of Km only. Also Lemma 3.40 ensures that, for each i > 0,
Ki (f ) is solvable if and only if Ki1 (f ) is solvable. It follows that K (f ) is
solvable, as required.
Lemma 3.42 Let p be a prime number, let K be a field whose characteristic
is not equal to p, and let L: K be a Galois extension of K of degree p. Suppose
that the polynomial xp 1 splits over K. Then there exists L such that
L = K() and p K.
73
Proof The Galois group (L: K) is a cyclic group of order p, since its order is
equal to the degree p of the extension L: K. Let be a generator of (L: K),
let be an element of L \ K, and let
j = 0 + j 1 + 2j 2 + + (p1)j p1
for j = 0, 1, . . . , p 1, where 0 = , i = (i1 ) for i = 1, 2, . . . , p 1,
and is a primitive pth root of unity contained in K. Now (j ) = j j
for j = 0, 1, . . . , p 1, since () = , (p1 ) = 0 and p = 1. Therefore
(jp ) = jp and hence jp K for j = 0, 1, 2, . . . , p 1. But
0 + 1 + 2 + + p1 = p,
since j is a root of the polynomial 1 + x + x2 + + xp1 for all integers
j that are not divisible by p. Moreover p L \ K, since L \ K and
p 6= 0 in K. Therefore at least one of the elements 0 , 1 , . . . , p1 belongs
to L \ K. Let = j , where j L \ K. It follows from the Tower Law
(Theorem 3.1) that [K(), K] divides [L: K]. But [L: K] = p and p is prime.
It follows that L = K(). Moreover p K, as required.
Theorem 3.43 Let f be a polynomial with coefficients in a field K of characteristic zero. Suppose that the Galois group K (f ) of f over K is solvable.
Then f is solvable by radicals.
Proof Let be a primitive pth root of unity. Then K() (f ) is isomorphic
to a subgroup of K (f ) (Lemma 3.36) and is therefore solvable (Proposition 1.41). Moreover f is solvable by radicals over K if and only if f is
solvable by radicals over K(), since K() is obtained from K by adjoining
an element whose pth power belongs to K. We may therefore assume,
without loss of generality, that K contains a primitive pth root of unity for
each prime p that divides |K (f )|.
The result is trivial when |K (f )| = 1, since in that case the polynomial f
splits over K. We prove the result by induction on the degree |K (f )| of the
Galois group. Thus suppose that the result holds when the order of the Galois
group is less than |K (f )|. Let L be a splitting field for f over K. Then L: K
is a Galois extension and (L: K)
= K (f ). Now the solvable group (L: K)
contains a normal subgroup H for which the corresponding quotient group
(L: K)/H is a cyclic group of order p for some prime number p dividing
|(L: K)|. Let M be the fixed field of H. Then (L: M ) = H and (M : K)
=
(L: K)/H. (Theorem 3.35), and therefore [M : K] = |(L: K)/H| = p. It
follows from Lemma 3.42 that M = K() for some element M satisfying
p K. Moreover M (f )
= H, and H is solvable, since any subgroup of
74
3.18
76
4.1
Modules
(r + s)x = rx + sx,
1x = x
78
4.2
Noetherian Modules
Indeed if a and b are elements of L then a and b both belong to Ln for some
sufficiently large n, and hence a + b, a and ra belong to Ln , and thus to L,
for all r M . But the submodule L is finitely-generated. Let {a1 , a2 , . . . , ak }
be a generating set of L. Choose N large enough to ensure that ai LN for
i = 1, 2, . . . , k. Then L LN , and hence LN = Ln = L for all n N . Thus
M must satisfy the Ascending Chain Condition, as required.
79
k
X
ri x i +
i=1
m
X
si y i .
j=1
(r + s)x = rx + sx,
(rs)x = r(sx),
1x = x
x(r + s) = xr + xs,
x(rs) = (xr)s,
x1 = x
4.3
Let R be a unital commutative ring. We can regard the ring R as an Rmodule, where the ring R acts on itself by left multiplication (so that r . r0
is the product rr0 of r and r0 for all elements r and r0 of R). We then find
that a subset of R is an ideal of R if and only if it is a submodule of R. The
following result therefore follows directly from Proposition 4.2.
Proposition 4.5 Let R be a unital commutative ring. Then the following
are equivalent:
(i) (Ascending Chain Condition) if I1 I2 I3 is an ascending
chain of ideals of R then there exists an integer N such that In = IN
for all n N ;
81
4.4
A monomial in the independent indeterminates X1 , X2 , . . . , Xn is by definition an expression of the form X1i1 X2i2 Xnin , where i1 , i2 , . . . , in are nonnegative integers. Such monomials are multiplied according to the rule
X1i1 X2i2 Xnin X1j1 X2j2 Xnjn = X1i1 +j1 X2i2 +j2 Xnin +jn .
A polynomial p in the independent indeterminates with coefficients in some
ring R is by definition a formal linear combination of the form
r1 m 1 + r2 m 2 + + rk m k
where r1 , r2 , . . . , rk R and m1 , m2 , . . . , mk are monomials in X1 , X2 , . . . , Xn .
The coefficients r1 , r2 , . . . , rk of this polynomial are uniquely determined,
84
j=1
i=1 j=1
where the product mi m0j of the monomials mi and m0j is defined as described above. The set of all polynomials in the independent indeterminates
X1 , X2 , . . . , Xn with coefficients in the ring R is itself a ring, which we denote
by R[X1 , X2 , . . . , Xn ].
Example The polynomial 2X1 X23 6X1 X2 X32 is the product of the polynomials 2X1 X2 and X22 3X32 in the ring Z[X1 , X2 , X3 ] of polynomials in
X1 , X2 , X3 with integer coefficients.
Lemma 4.11 Let R be an integral domain. Then the ring R[x] of polynomials in the indeterminate x with coefficients in R is itself an integral domain,
and deg(pq) = deg p + deg q for all non-zero polynomials p, q R[x].
Proof The integral domain R is commutative, hence so is R[x]. Moreover
R[x] is unital, and the multiplicative identity element of R[x] is the constant
polynomial whose coefficient is the multiplicative identity element 1 of the
unital ring R.
Let p and q be polynomials in R[x], and let ak and bl be the leading
coefficients of p and q respectively, where k = deg p and l = deg q. Now
p(x)q(x) = ak bl xk+l + terms of lower degree.
Moreover ak bl 6= 0, since ak 6= 0, bl 6= 0, and the ring R of coefficients is an
integral domain. Thus if p 6= 0 and q 6= 0 then pq 6= 0, showing that R[x] is
an integral domain, and deg(pq) = k + l = deg p + deg q, as required.
Let p be a polynomial in the indeterminates X1 , X2 , . . . , Xn with coefficients in the ring R, where n > 1. By collecting together terms involving Xnj
for each non-negative integer j, we can write the polynomial p in the form
p(X1 , X2 , . . . , Xn ) =
k
X
j=0
polynomial p uniquely determines and is uniquely determined by the polynomials p1 , p2 , . . . , pk . It follows from this that the rings R[X1 , X2 , . . . , Xn ] and
R[X1 , X2 , . . . , Xn1 ][Xn ] are naturally isomorphic and can be identified with
one another. We can use the identification in order to prove results concerning the structure of the polynomial ring R[X1 , X2 , . . . , Xn ] by induction on
the number n of independent indeterminates X1 , X2 , . . . , Xn . For example,
the following result follows directly by induction on n, using Lemma 4.11.
Lemma 4.12 Let R be an integral domain. Then the ring R[X1 , X2 , . . . , Xn ]
is also an integral domain.
A monomial X1i1 X2i2 Xnin is said to be of degree d, where d is some
non-negative integer, if i1 + i2 + + in = d.
Definition Let R be a ring. A polynomial p R[X1 , X2 , . . . , Xn ] is said to
be homogeneous of degree d if it can be expressed as a linear combination of
monomials of degree d with coefficients in the ring R.
Any polynomial p R[X1 , X2 , . . . , Xn ] can be decomposed as a sum of
the form
p(0) + p(1) + + p(k) ,
where k is some sufficiently large non-negative integer and each polynomial
p(i) is a homogeneous polynomial of degree i. The homogeneous polynomial
p(i) is referred to as the homogeneous component of p of degree i; it is uniquely
determined by p. A non-zero polynomial p is said to be of degree d if p(d) 6= 0
and p(i) = 0 for all i > d. The degree of a non-zero polynomial p is denoted
by deg p.
Lemma 4.13 Let R be a ring, and let p and q be non-zero polynomials
belonging to R[X1 , X2 , . . . , Xn ]. Then
deg(p + q) max(deg p, deg q), provided that p + q 6= 0,
deg(pq) deg p + deg q, provided that pq 6= 0.
Moreover if R is an integral domain then pq 6= 0 and deg(pq) = deg p + deg q.
Proof The inequality (p + q) max(deg p, deg q) is obvious. Also p(i) q (j) is
homogeneous of degree i + j for all i and j, since the product of a monomial
of degree i and a monomial of degree j is a monomial of degree i + j. The
inequality deg(pq) deg p + deg q follows immediately.
Now suppose that R is an integral domain. Let k = deg p and l = deg q.
Then the homogeneous component (pq)(k+l) of pq of degree k + l is given
by (pq)(k+l) = p(k) q (l) . But R[X1 , X2 , . . . , Xn ] is an integral domain (see
Lemma 4.12), and p(k) and q (l) are both non-zero. It follows that (pq)(k+l) 6= 0,
and thus deg(pq) = deg p + deg q, as required.
86
4.5
Lemma 4.14 Let V be an algebraic set in An , and let L be a one-dimensional affine subspace of An . Then either L V or else L V is a finite
set.
Proof The affine subspace L is a translate of a one-dimensional subspace
of K n , and therefore there exist vectors v and w in K n such that L =
{v + wt : t K} (on identifying n-dimensional affine space An with the
vector space K n ). Now we can write
V = {(x1 , x2 , . . . , xn ) An : f (x1 , x2 , . . . , xn ) = 0 for all f S},
where S is some subset of the polynomial ring K[X1 , X2 , . . . , Xn ]. Now either
each polynomial belonging to S is zero throughout L, in which case L V ,
or else there is some f S which is non-zero at some point of L. Define
g K[t] by the formula
g(t) = f (v1 + w1 t, v2 + w2 t, . . . , vn + wn t)
(where vi and wi denote the ith components of the vectors v and w for
i = 1, 2, . . . , n). Then g is a non-zero polynomial in the indeterminate t, and
therefore g has at most finitely many zeros. But g(t) = 0 whenever the point
v + wt of L lies in V . Therefore L V is finite, as required.
Example The sets
{(x, y) A2 (R) : y = sin x}
and
{(x, y) A2 (R) : x 0}
are not algebraic sets in A2 (R), since the line y = 0 is not contained in either
of these sets, yet the line intersects these sets at infinitely many points of the
set.
Given any subset S of K[X1 , X2 , . . . , Xn ], we denote by V (S) the algebraic
set in An defined by
V (S) = {x An : f (x) = 0 for all f S}.
Also, given any f K[X1 , X2 , . . . , Xn ], we define V (f ) = V ({f }).
Given any subset Z of An , we define
I(Z) = {f K[X1 , X2 , . . . , Xn ] : f (x) = 0 for all x Z}.
Clearly S I(V (S)) for all subsets S of K[X1 , X2 , . . . , Xn ], and Z
V (I(Z)) for all subsets Z of An . If S1 and S2 are subsets of K[X1 , X2 , . . . , Xn ]
satisfying S1 S2 then V (S2 ) V (S1 ). Similarly, if Z1 and Z2 are subsets
of An satisfying Z1 Z2 then I(Z2 ) I(Z1 ).
88
then
I
T
T V (I ). Thus
P
V
I
V (I ). Conversely if x is a point of
V (I ) then
fP(x) = 0 for allT and f P
I , andtherefore f (x) = T
0 for all f
V (I ) V
I . It follows that
V (I ) =
P I . Thus
V
I . This proves (ii).
Let I and J be ideals of R. Then I J I, I J J and IJ I J, and
thus V (I) V (I J), V (J) V (I J) and V (I J) V (IJ). Therefore
V (I) V (J) V (I J) V (IJ).
If x is a point of An which does not belong to V (I) V (J) then there exist
polynomials f I and g J such that f (x) 6= 0 and g(x) 6= 0. But
then f g IJ and f (x)g(x) 6= 0, and therefore x 6 V (IJ). Therefore
V (IJ) V (I) V (J). We conclude that
V (I) V (J) = V (I J) = V (IJ).
89
4.6
Let K be a field. We shall apply Hilberts Basis Theorem in order to study the
structure of algebraic sets in n-dimensional affine space An over the field K.
We shall continue to use the notation for algebraic sets in An and corresponding ideals of the polynomial ring that was established earlier.
The following result is a direct consequence of the Hilbert Basis Theorem.
Proposition 4.17 Let V be an algebraic set in An . Then there exists a finite
collection f1 , f2 , f3 , . . . of polynomials in n independent indeterminates such
that
V = {x An : fi (x) = 0 for i = 1, 2, . . . , k}.
Proof The set V is an algebraic set, and therefore V = V (I) for some
ideal I of K[X1 , X2 , . . . , Xn ]. Moreover it follows from Corollary 4.10 that
I is generated by some finite set {f1 , f2 , . . . , fk } of polynomials. But then
V = V ({f1 , f2 , . . . , fk }), and thus V is of the required form.
90
and thus
[
DC
D = A V (I) = A V
k
X
!
IDi
i=1
k
[
Di ,
i=1
as required.
We recall that a topological space is compact if and only if every open
cover of that space has a finite subcover. The following result therefore
follows directly from Proposition 4.19.
Corollary 4.20 Every subset of An is compact with respect to the Zariski
topology.
4.7
4.8
Prime Ideals
K = {ry : r R},
since R is unital and commutative (see Lemma 2.5). It follows easily that
JK = {rxy : r R}. Now xy I. It follows that JK I. But I is prime.
Therefore either J I or K I, and thus either x I or y I.
Example Let n be a natural number. Then the ideal nZ of the ring Z of
integers is a prime ideal if and only if n is a prime number. For an integer j
belongs to the ideal nZ if and only if n divides j. Thus the ideal nZ is prime
94
if and only if, given any integers j and k such that n divides jk, either n
divides j or n divides k. But it follows easily from the Fundamental Theorem
of Arithmetic that a natural number n has this property if and only if n is
a prime number. (The Fundamental Theorem of Arithmetic states that any
natural number can be factorized uniquely as a product of prime numbers.)
Lemma 4.25 An ideal I of a unital commutative ring R is prime if and
only if the quotient ring R/I is an integral domain.
Proof If I is a proper ideal of the unital commutative ring R then the
quotient ring R/I is both unital and commutative. Moreover the zero element
of R/I is I itself (regarded as a coset of I in R). Thus R/I is an integral
domain if and only if, given elements x and y of R such that (I +x)(I +y) = I,
either I + x = I or I + y = I. But (I + x)(I + y) = I + xy for all x, y R,
and I + x = I if and only if x I. We conclude that R/I is an integral
domain if and only if I is prime, as required.
Lemma 4.26 Every maximal ideal of a unital commutative ring R is a prime
ideal.
Proof Let M be a maximal ideal of R. Then the quotient ring R/M is a
field (see Lemma 4.21). In particular R/M is an integral domain, and hence
M is a prime ideal.
4.9
Definition A topological space Z is said to be reducible if it can be decomposed as a union F1 F2 of two proper closed subsets F1 and F2 . (A subset
of Z is proper if it is not the whole of Z.) A topological space Z is said to
be irreducible if it cannot be decomposed as a union of two proper closed
subsets.
Lemma 4.27 Let Z be a topological space. The following are equivalent:
(i) Z is irreducible,
(ii) the intersection of any two non-empty open sets in Z is non-empty,
(iii) every non-empty open subset of Z is dense.
Moreover a subset A of a topological space Z is irreducible (with respect to
the subspace topology) if and only if its closure A is irreducible.
95
Proof The topological space Z is irreducible if and only if the union of any
two proper closed subsets of Z is a proper subset of Z. Now the complement
of any proper closed set is a non-empty open set, and vica versa. Thus on
taking complements we see that Z is irreducible if and only if the intersection
of any two non-empty open subsets of Z is a non-empty subset of Z. This
shows the equivalence of (i) and (ii).
The equivalence of (ii) and (iii) follows from the fact that a subset of Z
is dense if and only if it has non-empty intersection with every non-empty
open set in Z.
Let A be a subset of Z. It follows directly from the definition of the
subspace topology on A that A is irreducible if and only if, given any closed
sets F1 and F2 such that A F1 F2 then either A F1 or A F2 . Now if
F is any closed subset of Z then A F if and only if A F . It follows that
A is irreducible if and only if A is irreducible.
It follows immediately from Lemma 4.27 that a non-empty irreducible
topological space is Hausdorff if and only if it consists of a single point.
Lemma 4.28 Any irreducible topological space is connected.
Proof A topological space Z is connected if and only if the only subsets of Z
that are both open and closed are the empty set and the whole set Z. Thus
suppose that the topological space Z were not connected. Then there would
exist a non-empty proper subset U of Z that was both open and closed. Let
V = Z \ U . Then U and V would be disjoint non-empty open sets. It would
then follow from Lemma 4.27 that Z could not be irreducible.
Lemma 4.29 Let V be an algebraic set, and let V1 be a proper algebraic
subset of V . Then there exists f K[X1 , X2 , . . . , Xn ] such that f (x) = 0 for
all x V1 but f 6 I(V ).
Proof The inclusion V1 V implies that I(V ) I(V1 ). Now V = V (I(V ))
and V1 = V (I(V1 )). Thus if V1 is a proper subset of V then I(V ) 6= I(V1 ),
and hence there exists f I(V1 ) such that f 6 I(V ). Then f is the required
polynomial.
Proposition 4.30 A non-empty algebraic set V in An is irreducible (with
respect to the Zariski topology) if and only if the ideal I(V ) is a prime ideal
of K[X1 , X2 , . . . , Xn ].
96
4.10
Radical Ideals
Lemma 4.35 Let I be an ideal of a unital commutative ring R, and let I
m
denote the set of all elements
x of R with the property that x Ifor some
natural number m. Then I is a radical ideal of R. Moreover I = I if and
only if I is a radical ideal of R.
(x + y)
m+n
X
i=0
m+n
i
xi y m+ni ,
R.
Thus
I
is
an
ideal
of
R.
Clearly
I is a radical ideal, and
I is the radical of the ideal I. Thus every algebraic set in An is of the form
V (I) for some radical ideal I of K[X1 , X2 , . . . , Xn ].
99
Proof The ideal I(V (S)) of K[X1 , X2 , . . . , Xn ] contains the set S. Therefore
S I I I(V (S)).
It follows that
4.11
100
4.12
Zariskis Theorem
102
4.13
Hilberts Nullstellensatz
Proposition 4.42 Let K be an algebraically closed field, let R be a commutative K-algebra of finite type, and let m be a maximal ideal of R. Then
there exists a surjective K-homomorphism : R K from R to K such that
m = ker .
Proof Let L = R/m, and let : R L denote the quotient homomorphism.
Then L is a field (Lemma 4.21). Now m = ker and 1 6 m, and therefore
|K 6= 0. It follows that m K is a proper ideal of the field K. But
the only proper ideal of a field is the zero ideal (Lemma 2.4). Therefore
m K = {0}. It follows that the restriction of to K is injective and maps
K isomorphically onto a subfield of L. Let K1 = (K), and let : K K1
be the isomorphism obtained on restricting : R L to K. Then L: K1
is a field extension, and L is a K1 -algebra of finite type. It follows from
Zariskis Theorem (Theorem 4.41) that L: K1 is a finite field extension. But
then L = K1 , since the field K1 is algebraically closed (Lemma 3.7). Let
= 1 . Then : R K is the required K-homomorphism from R to K.
Theorem 4.43 Let K be an algebraically closed field, and let R be a commutative K-algebra of finite type. Let a be a proper ideal of R. Then there
exists a K-homomorphism : R K from R to K such that a ker .
Proof Every proper ideal of R is contained in some maximal ideal (Theorem 4.22). Let m be a maximal ideal of R with a m. It follows from
Proposition 4.42 that m = ker for some K-homomorphism : R K.
Then a ker , as required.
Theorem 4.44 (Weak Nullstellensatz) Let K be an algebraically closed field,
and let a be a proper ideal of the polynomial ring K[X1 , X2 , . . . , Xn ], where
X1 , X2 , . . . , Xn are independent indeterminates. Then there exists some point
(a1 , a2 , . . . , an ) of An (K) such that f (a1 , a2 , . . . , an ) = 0 for all f a.
Proof Let R = K[X1 , X2 , . . . , Xn ]. Then R is a K-algebra of finite type. It
follows from Theorem 4.43 that there exists a K-homomorphism : R K
such that a ker . Let ai = (Xi ) for i = 1, 2, . . . , n. Then (f ) =
f (a1 , a2 , . . . , an ) for all f R. It follows that f (a1 , a2 , . . . , an ) = 0 for all
f a, as required.
Theorem 4.45 (Strong Nullstellensatz) Let K be an algebraically closed
field, let a be an ideal of the polynomial ring K[X1 , X2 , . . . , Xn ], and let f
104
r
X
j=0
105
gj (X1 , X2 , . . . , Xn )Y j ,
where g1 , g2 , . . . , gn a.
Let g a be defined by g =
r
P
gj f rj . Now g f r = g f r h + f r (h 1).
j=0
Also
r
gf h=
r
X
gj f rj (1 f j Y j ) (1 f Y ),
j=0
106