Curso Produto Tensorial de Modulos Conrad 2009

8/6/2019 Curso Produto Tensorial de Modulos Conrad 2009

1/71

TENSOR PRODUCTS

KEITH CONRAD

1. Introduction

Let R be a commutative ring and M and N be R-modules. Formation of their directsum M N is an addition operation on modules. We introduce now a product operation,called the tensor product M R N. To start off, we will describe roughly what a tensorproduct of modules looks like. The rigorous definitions will come in Section 3.

The tensor product M R N is an R-module spanned by symbols m n satisfyingdistributive laws:

(1.1) (m + m) n = m n + m n, m (n + n) = m n + m n.Also multiplication by any r R is associative with on both sides:(1.2) r(m n) = (rm) n = m (rn).(So the notation rm n is unambiguous: it is both (rm) n and r(m n).) The formulas(1.1) and (1.2) should be contrasted with those for the direct sum M N, where

(m + m, n) = (m, n) + (m, 0), r(m, n) = (rm,rn).

In M N, every (m, n) decomposes as (m, 0) + (0, n), but m n in M R N does notbreak apart in any general way. While every element of M N is a pair (m, n), there aremore elements of MR N than the products m n. All elements of MR N are R-linearcombinations1

r1(m1 n1) + r2(m2 n2) + + rk(mk nk),where k 1, ri R, mi M, and ni N. Since ri(mi ni) = (rimi) ni, we can renamerimi as mi and write the linear combination as a sum

(1.3) m1 n1 + m2 n2 + + mk nk.In M N, equality is easy to define: (m, n) = (m, n) if and only if m = m and n = n.

But when are two sums of the form (1.3) equal in MR N? This is not easy to say in termsof the description of the tensor product above, except in one special case: M and N arefinite free R-modules with bases {ei} and {fj}. In this case, MR N has basis {ei fj}:every element of MR N is a unique sum

i,j cijei fj and two sums of this special form

are equal only when the coefficients of like terms are equal.

To describe equality in M R N when there arent bases, we need to use a universalmapping property of the tensor product. In fact, the tensor product is the first fundamentalconcept in algebra which can be used (in the most general case) only through its universalmapping property, which is: MR N is the universal object that turns bilinear maps outof M N into linear maps.

1Compare with the polynomial ring R[X, Y], whose terms are not only the products f(X)g(Y), but sumsof such products like

Pi,j aijX

iYj . It turns out that R[X, Y] = R[X] R R[Y], so the comparison is not

merely an analogy but a special case (Example 4.10).

1


2/71

2 KEITH CONRAD

After a discussion of bilinear (and multilinear) maps in Section 2, the definition andconstruction of tensor products is presented in Section 3. Examples of tensor products arein Section 4. In Section 5 we will show how the tensor product interacts with some otherconstructions on modules. Section 6 describes the important operation of base extension,

which is a process of turning an R-module into an S-module where S is another ring.

2. Bilinear Maps

We described MR N as sums (1.3) subject to the rules (1.1) and (1.2). The intentionis that M R N be the freest object built out of M and N subject to (1.1) and (1.2).The essence of (1.1) and (1.2) is bilinearity, which well discuss before getting back to thetensor product.

A function B : M N P, where M, N, and P are R-modules, is called bilinear whenit is linear in each coordinate with the other one fixed:

B(m1 + m2, n) = B(m1, n) + B(m2, n), B(rm,n) = rB(m, n),

B(m, n1

+ n2

) = B(m, n1

) + B(m, n2

), B(m,rn) = rB(m, n).

So B(, n) is a linear map M P for each n and B(m, ) is a linear map N P for eachm. Here are some examples:

(1) The dot product v w on Rn is a bilinear function Rn Rn R.(2) For any A Mn(R), the function v, w = v Aw is a bilinear map Rn Rn R.

It reduces to the dot product when A = In.(3) The cross product v w is a bilinear function R3 R3 R3.(4) Multiplication R R R is bilinear.(5) For an R-module M, scalar multiplication R M M is bilinear.(6) The determinant det: M2(R) R is a bilinear function of the columns.(7) The dual pairing M M R given by (, m) (m) is bilinear.(8) The function M

N

HomR(M, N) given by (, n)

[x

(x)n] is bilinear.

(9) IfM N B P is bilinear and P L Q is linear, the composite M N LB Qis bilinear. (This is a very important example. Check it!)

(10) Iff: M HomR(N, P) is linear then B : MN P given by B(m, n) = f(m)(n)is bilinear.

(11) In our naive description of MR N, the expression m n is bilinear in m and n.That is, the function M N MR N given by (m, n) m n is bilinear.

Even though elements of MN and MN are written in the same way, as pairs (m, n),bilinear functions M N P should not be confused with linear functions M N P.For example, addition as a function R R R is linear, but as a function R R Rit is not bilinear. Multiplication as a function R R R is bilinear, but as a functionRR R it is not linear. Linear functions are generalized additions and bilinear functionsare generalized multiplications.

An extension of bilinearity is multilinearity. For R-modules M1, . . . , M k, a functionf: M1 Mk M is called multilinear when f(m1, . . . , mk) is linear in each miwith the other coordinates fixed. We use the label k-multilinear if the number of factorshas to be mentioned, so 2-multilinear means bilinear.

Here are a few examples of multilinear functions:

(1) The scalar triple product u (v w) is trilinear R3 R3 R3 R.(2) The function f(u, v, w) = (u v)w is trilinear Rn Rn Rn Rn.


3/71

TENSOR PRODUCTS 3

(3) The function M M N N given by (,m,n) = (m)n is trilinear.(4) If B : M N P and B : P Q T are bilinear then M N Q T by

(m,n,q) B(B(m, n), q) is trilinear.(5) Multiplication R

R

R with k factors is multilinear.

(6) The determinant det: Mn(R) R, as a function of the columns, is n-multilinear.(7) IfM1 Mk f M is multilinear and M L N is linear then the composite

M1 Mk Lf N is multilinear.The R-linear maps M N are an R-module HomR(M, N) under addition of functions

and R-scaling. The R-bilinear maps M N P form an R-module BilR(M, N; P) in thesame way. However, unlike linear maps, bilinear maps are missing some features:

(1) There is no kernel of a bilinear map M N P since M N is not a module.(2) The image of a bilinear map M N P need not form a submodule.

Example 2.1. Bilinear maps MN R can be created using M, N, and multiplicationin R: for M and N, set B, : M N R by B,(m, n) = (m)(n). EachB, is bilinear. Also B : M

N

BilR(M, N; R) given by B(, ) = B, is bilinear.(So B is a bilinear map whose values are bilinear maps.) The sum 1(m)1(n)+2(m)2(n)is bilinear in m and n but usually not expressible as (m)(n), so B is not surjective. AndB is generally not injective: for any u R, B(, ) = B(u, (1/u)).

Lets look at the special case that M and N are nonzero finite free R-modules, with re-spective bases {e1, . . . , ek} and {f1, . . . , f }. Any bilinear map B : MN R is determinedby its values on the basis pairs (ei, fj) since B(

i aiei,

j bj fj) =

i,j aibjB(ei, fj). Since

Bei ,fj on MN equals 1 at (ei, fj) and 0 at other basis pairs, B =

i,j B(ei, fj )Bei ,fj for

every B in BilR(M, N; R): both sides agree at basis pairs, so they agree everywhere sinceboth sides are bilinear. This shows the functions Bei ,fj = B(e

i , f

j ) span BilR(M, N; R).

They are also linearly independent: if

i,j cijBei ,fj = O then evaluating at the basis pair

(ei

, fj

) shows ci

j

= 0, so all coefficients are 0. Thus BilR(M, N; R) has basis {Be

i ,f

j }.Because B sends basis pairs {(ei , fj )} in M N to a basis of BilR(M, N; R), we

would like to say B is in some sense an isomorphism, but this makes no sense: the domainof B is not an R-module, and B is neither injective nor surjective. These defects willbe rectified by the tensor product in Example 4.11: B can be converted into a linearmap L : M R N BilR(M, N; R) which is an isomorphism for finite free M and N.The non-injective feature B(, ) = B(u, (1/u))), for instance, will wipe out because = u(1/u) in MR N by (1.2) even though (, ) = (u, (1/u)) in MN.

3. Construction of the Tensor Product

Any bilinear map M N P to an R-module P can be composed with a linear map

P Q to get a map M N Q which is bilinear.P

linear

M N

bilinear

;;vvvvvvvvv

composite is bilinear!##GGG

GGGG

GG

Q


4/71

4 KEITH CONRAD

We will construct the tensor product as a solution to the following universal mappingproblem: find an R-module T and bilinear map b : M N T such that all bilinear mapsout of M N are composites of the single bilinear map b with all linear maps out of T.

T

linear?

M N

b;;wwwwwwwww

bilinear##HHH

HHHH

HH

P

This turns the task of constructing bilinear maps out of M N to that of constructing lin-ear maps out of T. It is analogous to the universal mapping property of the abelianizationG/[G, G] of a group G: homomorphisms G A with abelian A are the same as homo-morphisms G/[G, G] A because every homomorphism G f A is the composite of thecanonical homomorphism G G/[G, G] with a unique homomorphism G/[G, G]

e

f A.G/[G, G]

ef

G

::vvvvvvvvv

f$$I

IIII

IIII

I

A

Definition 3.1. The tensor product MR N is an R-module equipped with a bilinear mapM

N

M

R N such that for any bilinear map M

N

B

P there is a unique linear

map MR N L P making the following diagram commute.MR N

L

M N

88qqqqqqqqqq

B&&MM

MMMM

MMMM

MM

P

While the functions in the universal mapping property for G/[G, G] are all homomor-phisms (out of G and out of G/[G, G]), functions in the universal mapping property for

M R N are not all of the same type: those out of M N are bilinear and those out ofMR N are linear.The definition of the tensor product involves not just a new module MR N, but also

a distinguished bilinear map to it: M N MR N. This is similar to the universalmapping property for the abelianization G/[G, G], which requires not just G/[G, G] but also

the homomorphism G G/[G, G] through which all homomorphisms from G to abelian

groups factor. The universal mapping property makes no sense without fixing this extrapiece of information.


5/71

TENSOR PRODUCTS 5

Before constructing the tensor product, lets show all constructions of it are essentially

the same. Suppose there are R-modules T and T and bilinear maps M N b T andMN b T with the universal mapping property of the tensor product. From universality

of M Nb

T, the bilinear map M Nb

T

factors uniquely through T: there is aunique linear map f: T T making(3.1) T

f

M N

b

;;vvvvvvvvv

b##HHH

HHHH

HH

T

commute. From universality of M N b T, the bilinear map M N b T factorsuniquely through T

: there is a unique linear map f

: T

T making(3.2) T

f

M N

b;;vvvvvvvvv

b$$H

HHHH

HHHH

T

commute. We combine (3.1) and (3.2) into the commutative diagram

T

f

M N b //

b##H

HHHH

HHHH

b

;;vvvvvvvvv

T

f

T

Removing the middle, we have the commutative diagram

(3.3) T

ff

M

N

b

;;wwwwwwwww

b##GGG

GGGG

GG

T

From universality of (T, b), a unique linear map T T fits in (3.3). The identity mapworks, so f f = idT. Similarly, f f = idT by stacking (3.1) and (3.2) together in theother order. Thus T and T are isomorphic R-modules by f and also f b = b, whichmeans f identifies b with b. So any two tensor products of M and N can be identified with


6/71

6 KEITH CONRAD

each other in a unique way compatible2 with the distinguished bilinear maps to them fromM N.Theorem 3.2. A tensor product of M and N exists.

Proof. Consider M N simply as a set. We form the free R-module on this set:FR(M N) =

(m,n)MN

R(m,n).

Let D be the submodule of FR(M N) spanned by all the elements(m+m,n) (m,n) (m,n), (m,n+n) (m,n) (m,n), (rm,n) (m,rn),

r(m,n) (rm,n), r(m,n) (m,rn).The quotient module by D will serve as the tensor product: set

MR N := FR(M N)/D.We write the coset (m,n) + D in M

R N as m

n.

(Notice how large FR(M N) can be. If R = M = N = R then FR(R R) is adirect sum ofR2-many copies of R, even though well see in Theorem 4.8 that the quotientR-module R R R is 1-dimensional!)

We need to write down a bilinear map M N MR N and show all bilinear mapsout of M N factor uniquely through this one. From the definition of D, we get relationsin FR(M N)/D like

(m+m,n) (m,n) + (m,n) mod D,which is the same as

(m + m) n = m n + m nin MR N. Similarly, m (n + n) = m n + m n and r(m n) = rm n = m rnin M

R N. This shows the function M

N

M

R N given by (m, n)

m

n is

bilinear. (No other function M N MR N will be considered except this one.)Now suppose P is any R-module and M N B P is a bilinear map. Treating M N

simply as a set, so B is just a function on this set (ignore its bilinearity), the universalmapping property of free modules extends B from a function M N P to a linearfunction : FR(M N) P with ((m,n)) = B(m, n), so the diagram

FR(M N)

M N

(m,n)(m,n)77ppppppppppp

B

''OO

OOOO

OOOO

OOO

P

commutes. We want to show makes sense as a function on MR N, which means showingker contains D. From the bilinearity of B,

B(m + m, n) = B(m, n) + B(m, n), B(m, n + n) = B(m, n) + B(m, n),

rB(m, n) = B(rm,n) = B(m,rn),

2The universal mapping property is not about modules T per se, but about pairs (T, b).


7/71

TENSOR PRODUCTS 7

so

((m+m,n)) = ((m,n)) + ((m,n)), ((m,n+n)) = ((m,n)) + ((m,n)),

r((m,n)) = ((rm,n)) = ((m,rn)).

Since is linear, these conditions are the same as

((m+m,n)) = ((m,n) + (m,n)), ((m,n+n)) = ((m,n) + (m,n)),

(r(m,n)) = ((rm,n)) = ((m,rn)).

Therefore the kernel of contains all the generators of the submodule D, so induces alinear map L : FR(M N)/D P where L((m,n) + D) = ((m,n)) = B(m, n), so thediagram

FR(M N)/D

L

M N

(m,n)m,n+D77nnnnnnn

nnnnn

B((PP

PPPP

PPPP

PPPP

P

commutes. Since FR(M N)/D = MR N and (m,n) + D = m n, the above diagram is

(3.4) MR N

L

M N

88qqqqqqqqqq

B&&MM

MMMMMMM

MMM

P

.

So every bilinear map B out of M N comes from a linear map L out of M R N suchthat L(m n) = B(m, n) for all m M and n N.

It remains to show the linear map MR N L P in (3.4) is the only one which makes(3.4) commute. We go back to the definition of MR N as a quotient of the free moduleFR(M N). From the construction of free modules, every element of FR(MN) is a finitesum

r1(m1,n1) + + rk(mk,nk).

The reduction map FR(M N) FR(M N)/D = MR N is linear, so every element ofMR N is a finite sum(3.5) r1(m1 n1) + + rk(mk nk),This means the particular elements m n in MR N span it as an R-module. Thereforelinear maps out of M R N are completely determined by their values on all m n. Sothere is at most one linear map MR N P with the effect m n B(m, n). Since wehave created a linear map out of MR N with this effect in (3.4), it is the only one.


8/71

8 KEITH CONRAD

Having shown a tensor product of M and N exists,3 its essential uniqueness lets uscall M R N the tensor product rather than a tensor product. Dont forget that theconstruction involves not simply the module M R N but also the distinguished bilinearmap M

N

M

R N given by (m, n)

m

n, through which all bilinear maps

out of M N factor. We call this distinguished map the canonical bilinear map to thetensor product. We call general elements of M R N tensors, and will denote them bythe letter t. Tensors of the form m n are called elementary tensors. (Other names forelementary tensors are simple tensors, decomposable tensors, and pure tensors.) Just asmost elements of the free module FR(A) on a set A are not of the form a but are linearcombinations of these, most tensors in MR N are not elementary tensors but are linearcombinations of elementary tensors. Actually, all tensors are sums of elementary tensorssince r(m n) = (rm) n. This shows all elements of MR N have the form (1.3).

The role of elementary tensors among all tensors is like that of separable solutionsf(x)g(y) to a PDE among all solutions. Even if not all solutions to a PDE are separa-ble, one first seeks separable solutions and then tries to form the general solution as a sum(perhaps an infinite series; it is analysis) of separable solutions.

From now on forget the explicit construction ofMR N as the quotient of an enormousfree module FR(M N). It will confuse you more than its worth to try to think aboutMR N in terms of its construction. What is more important to remember is the universalmapping property of the tensor product.

Lemma 3.3. LetM and N be R-modules with respective spanning sets {xi}iI and{yj}jJ.The tensor product MR N is spanned linearly by the elementary tensors xi yj .Proof. An elementary tensor in M R N has the form m n. Write m =

i aixi and

n =

j bjyj, where the ais and bj s are 0 for all but finitely many i and j. From thebilinearity of,

m

n = (i aixi) (j bjyj) = i,j aij xi yj

is a linear combination of the tensors xi yj . So every elementary tensor is a linearcombination of the elementary tensors xi yj, which means every tensor is such a linearcombination too.

Example 3.4. Let e1, . . . , ek be the standard basis of Rk. The R-module Rk R Rk is

linearly spanned by the k2 elementary tensors ei ej. We will see later (Theorem 4.8) thatthese elementary tensors are a basis.

Theorem 3.5. In MR N, m 0 = 0 and 0 n = 0.Proof. Since m n is linear in n with m fixed, m 0 = m (0 + 0) = m 0 + m 0.Subtracting m 0 from both sides, m 0 = 0. That 0 n = 0 follows by a similarargument.

3What happens ifR is a noncommutative ring? IfM and N are left R-modules and B is bilinear onMN then for any m M, n N, and r and s in R, rsB(m,n) = rB(m,sn) = B(rm,sn) = sB(rm,n) =srB(m,n). Usually rs = sr, so asking that rsB(m,n) = srB(m,n) for all m and n puts us in a delicatesituation! The correct tensor product MR N for noncommutative R uses a rightR-module M, a leftR-module N, and a middle-linear map B where B(mr, n) = B(m,rn). In fact MR N is not an R-modulebut just an abelian group! We will not discuss this further.


9/71

TENSOR PRODUCTS 9

Example 3.6. If A is a finite abelian group, Q Z A = 0 since every elementary tensor is0: for a A, let na = 0 for some positive integer n. Then in Q Z A, r a = n(r/n) a =r/n na = r/n 0 = 0. Every tensor is a sum of elementary tensors, so all tensors are0. In fact we dont need finiteness of A in this argument, but rather than each element of

A has finite order. The group Q/Z has this property, so Q Z (Q/Z) = 0. (For instance,(2/5) (1/3) = 0, so we can have m n = 0 without m or n being 0.)

By a similar argument, Q/Z Z Q/Z = 0.Since MR N is spanned additively by elementary tensors, any linear (or just additive)

function out of MR N is determined at all tensors from its values on elementary tensors.This is why linear maps on tensor products will in practice only be described by theirvalues on elementary tensors. It is similar to describing a linear map between finite freemodules using a matrix, which tells you the values of the map only on basis vectors, butthis information is enough to determine the linear map everywhere.

There is a key difference between elementary tensors and basis vectors: elementary tensorshave lots of linear relations. Using elementary tensors to describe all tensors is like describing

the vectors in R2

as linear combinations of (1, 0), (2, 3), (8, 4), and (1, 5). A linearmap out of R2 is determined by its values on these four vectors, but those values are notindependent: they have to satisfy every linear relation the four vectors satisfy because alinear map preserves linear relations. Using this linearly dependent spanning set requireskeeping track of linear relations to be sure constructions made in terms of this spanning setare well-defined. Nobody wants to do that, which is why using bases is so convenient. Butyou usually cant get away from the zillions of linear relations among elementary tensors,and that is why a random function on elementary tensors generally does not extend to alinear map on the tensor product.

For that matter, functions on elementary tensors themselves have a well-definedness issue:the function f(m n) = m + n, for instance, makes no sense since m n = (m) (n)but m + n is usually not

m

n. The only way to create linear maps on M

R N is

with the universal mapping property of the tensor product (it creates linear maps out ofbilinear maps), because all the linear relations among elementary tensors are built into theuniversal mapping property of MR N. There will be a lot of practice with this in Section4. Understanding how the universal mapping property of the tensor product can be usedto compute (nonzero) examples and to prove properties of the tensor product is about thebest way to get used to the tensor product; after all, if youre incapable of writing downfunctions out of MR N, you dont understand MR N.

Lets address a few beginner questions about the tensor product:Questions

(1) What is m n?(2) What does it mean to say m n = 0?(3) What does it mean to say M

R N = 0?

(4) What does it mean to say m1 n1 + + mk nk = m1 n1 + + m n?(5) Is there a way to picture the tensor product?

Answers

(1) A logically correct answer is: m n is the image of (m, n) M N under thecanonical bilinear map M N M R N that is part of the definition of thetensor product. That may seem like too formal an answer. Heres another answer,which is not a definition but is more closely aligned with how m n actually occurs


10/71

10 KEITH CONRAD

in practice: mn is that element ofMR N at which the linear map MR N Pcorresponding to a bilinear map M N B P takes the value B(m, n). Reviewthe proof of Theorem 3.2 and check this property of m n really holds.

(2) We have m

n = 0 if and only if every bilinear map out of M

N vanishes at

(m, n). Indeed, if m n = 0 then for any bilinear map B : M N P we have acommutative diagram

MR N

L

M N

88qqqqqqqqqq

B&&MM

MMMM

MMMM

MM

P

for some linear map L, so B(m, n) = L(m n) = L(0) = 0. Conversely, if everybilinear map out of M N sends (m, n) to 0 then the canonical bilinear mapM N M R N, which is a particular example, sends (m, n) to 0. Since thisbilinear map actually sends (m, n) to m n, we obtain m n = 0.

To show a particular elementary tensor m n is not 0, find a bilinear map B outof M N such that B(m, n) = 0. We will use this idea in Theorem 4.8.

(3) The tensor product MR N is 0 if and only if every bilinear map out of M N isidentically 0. First suppose MR N = 0. Then every elementary tensor m n is0, so B(m, n) = 0 for any bilinear map out of M N by the answer to the secondquestion. Thus B is identically 0. Next suppose every bilinear map out of M Nis identically 0. Then the canonical bilinear map M N MR N, which is aparticular example, is identically 0. Since this function sends (m, n) to m n, wehave m n = 0 for all m and n. Since MR N is additively spanned by all m n,the vanishing of all elementary tensors implies MR N = 0.

To show MR N = 0, find a bilinear map on M N which is not identically 0.(4) We have

ki=1 mi ni =

j=1 m

j nj if and only if for all bilinear maps B out of

M N, ki=1 B(mi, ni) = j=1 B(mj, nj ). The justification is along the lines ofthe previous two answers and is left to the reader.

(5) There are objects in physics that are described using tensors (stress, elasticity, elec-tromagnetic fields), but this doesnt really lead to a picture of a tensor. Tensorsarise in physics as multi-indexed quantities which are subject to multilinear effectsunder a change in coordinates.

The tensor product can be extended to allow more than two factors. Given k modulesM1, . . . , M k, there is a module M1 R R Mk which is universal for k-multilinear maps:it admits a k-multilinear map M1 Mk M1 R R Mk and every k-multilinearmap out of M1 Mk factors through this by composition with a unique linear map


11/71

TENSOR PRODUCTS 11

out of M1 R R Mk:

M1 R R Mk

unique linear

M1 Mk

55kkkkkkk

kkkkkkk

multilinear))TT

TTTTTT

TTTTTT

TTT

P

The image of (m1, . . . , mk) in M1 R R Mk is written m1 mk. This k-fold tensorproduct can be constructed as a quotient of the free module FR(M1 Mk). It can alsobe constructed using tensor products of modules two at a time:

( ((M1 R M2) R M3) R ) R Mk.

The canonical k-multilinear map to this R-module from M1 Mk is (m1, . . . , mk) ( ((m1 m2) m3) ) mk. This is not the same construction of the k-fold tensorproduct using FR(M1 Mk), but it satisfies the same universal mapping propertyand thus can serve the same purpose (all tensors products ofM1, . . . , M k are isomorphic toeach other in a unique way compatible with the distinguished bilinear maps to them fromM1 Mk).

As an exercise, check from the universal mapping property that m1 mk = 0 inM1 R R Mk if and only if all k-multilinear maps out of M1 Mk vanish at(m1, . . . , mk). The module M1 R R Mk is spanned additively by all m1 mk.Important examples of the k-fold tensor product are tensor powers Mk:

M

0

= R, M

1

= M, M

2

= MR M, M3

= MR MR M,and so on. (The formula M0 = R is a convention, like a0 = 1.)

4. Examples of Tensor Products

Theorem 4.1. For positive integers a and b with d = (a, b), Z/aZ Z Z/bZ = Z/dZ asabelian groups. In particular, Z/aZ Z Z/bZ = 0 if and only if (a, b) = 1.

Proof. Since 1 spans Z/aZ and Z/bZ, 1 1 spans Z/aZ Z Z/bZ by Lemma 3.3. From

a(1 1) = a 1 = 0 1 = 0 and b(1 1) = 1 b = 1 0 = 0,the additive order of 1 1 divides a and b, and therefore also d, so #(Z/aZ Z Z/bZ) d.

To show Z/aZZZ/bZ has size at least d, we create a Z-linear map from Z/aZ ZZ/bZonto Z/dZ. Since d|a and d|b, we can reduce Z/aZ Z/dZ and Z/bZ Z/dZ in thenatural way. Consider the map Z/aZ Z/bZ B Z/dZ which is reduction mod d in eachfactor followed by multiplication: B(x mod a, y mod b) = xy mod d. This is Z-bilinear, sothe universal mapping property of the tensor product says there is a (unique) Z-linear map


12/71

12 KEITH CONRAD

f: Z/aZ Z Z/bZ Z/dZ making the diagramZ/aZ Z Z/bZ

f

Z/aZ Z/bZ

66llllllllll

lll

B((RR

RRRR

RRRR

RRRR

Z/dZ

commute, so f(xy) = xy. In particular, f(x1) = x, so f is onto. Therefore Z/aZZZ/bZhas size at least d, so the size is d and were done.

Example 4.2. The abelian group Z/3Z Z Z/5Z is 0. Equivalently, every Z-bilinear mapZ/3Z Z/5Z A to an abelian group is identically 0.

In Z/aZ Z Z/bZ all tensors are elementary tensors: x y = xy(1 1) and a sum ofmultiples of 1 1 is again a multiple, so Z/aZ Z Z/bZ = Z(1 1) = {x 1 : x Z}.Notice in the proof of Theorem 4.1 how the map f: Z/aZ Z Z/bZ Z/dZ was createdfrom the bilinear map B : Z/aZ Z/bZ Z/dZ and the universal mapping property oftensor products. Quite generally, to define a linear map out of M R N that sends allelementary tensors m n to particular places, always back up and start by defining abilinear map out of M N sending (m, n) to the place you want m n to go. Make sureyou show the map is bilinear! Then the universal mapping property of the tensor productgives you a linear map out of MR N sending m n to the place where (m, n) goes, whichgives you what you wanted: a (unique) linear map on the tensor product with specifiedvalues on the elementary tensors.

Theorem 4.3. For ideals I and J in R, there is a unique R-module isomorphism

R/IR R/J = R/(I+ J)where x y xy. In particular, taking I = J = 0, R R R = R by x y xy.

For R = Z and nonzero I and J, this is Theorem 4.1.

Proof. Start with the function R/I R/J R/(I + J) given by (x mod I, y mod J) xy mod I+ J. This is well-defined and bilinear, so from the universal mapping property ofthe tensor product we get a linear map f: R/IR R/J R/(I+ J) making the diagram

R/IR R/J

f

R/I

R/J

77nnnnnnnnnnnn

(x mod I,y mod J)xy mod I+J ''PPPP

PPPP

PPPP

R/(I+ J)

commute, so f(x mod I y mod J) = xy mod I + J. To write down the inverse map, letR R/IRR/J by r r(11). This is linear, and when r Ithe value is r1 = 01 = 0.Similarly, when r J the value is 0. Therefore I + J is in the kernel, so we get a linearmap g : R/(I+ J) R/IR R/J by g(r mod I+ J) = r(1 1) = r 1 = 1 r.


13/71

TENSOR PRODUCTS 13

To check f and g are inverses, a computation in one direction shows

f(g(r mod I+ J)) = f(r 1) = r mod I+ J.To show g(f(t)) = t for all t

R/I

R R/J, we show all tensors are scalar multiples of 1

1.

Any elementary tensor has the form x y = x1 y1 = xy(1 1), which is a multiple of1 1, so sums of elementary tensors are multiples of 1 1 and thus all tensors are multiplesof 1 1. We have

g(f(r(1 1)) = rg(1 mod I+ J) = r(1 1) = r(1 1).

Remark 4.4. For two ideals I and J, we know a few operations that produce new ideals:I+ J, I J, and IJ. The intersection I J is the kernel of the linear map R R/I R/Jwhere r (r, r). Theorem 4.3 tells us I+J is the kernel of the linear map R R/IR R/Jwhere r r(1 1).

Theorem 4.5. For an ideal I in R and R-module M, there is a unique R-module isomor-phism

(R/I) R M = M/IMwhere r m rm. In particular, taking I = (0), R R M = M by r m rm, soR R R = R as R-modules by r r rr.Proof. We start with the bilinear map (R/I) M M/IM given by (r, m) rm. Fromthe universal mapping property of the tensor product, we get a linear map f: (R/I)RM M/IM where f(r m) = rm.

(R/I) R M

f

(R/I) M

77nnnnnnn

nnnnn

(r,m)rm ''PPPP

PPPP

PPPP

M/IM

To create an inverse map, start with the function M (R/I) R M given by m 1 m.This is linear in m (check!) and kills IM (generators for IM are products rm for r Iand m M, and 1 rm = r m = 0 m = 0), so it induces a linear map g : M/IM (R/I) R M given by g(m) = 1 m.

To check f(g(m)) = m and g(f(t)) = t for all m M/IM and t (R/I) R M, we dothe first one by a direct computation:

f(g(m)) = f(1 m) = 1 m = m.To show g(f(t)) = t for all t MR N, we show all tensors in R/IR M are elementary.An elementary tensor looks like r m = 1 rm, and a sum of tensors 1 mi is 1

i mi.

Thus all tensors look like 1 m. We have g(f(1 m)) = g(m) = 1 m. Example 4.6. For any abelian group A, (Z/nZ) Z A = A/nA as abelian groups bym a ma.


14/71

14 KEITH CONRAD

Remark 4.7. Saying R R M = M by r m rm is another way of saying the R-bilinear maps B out ofR M can be identified with the linear maps out of M, and thatsclear because B(r, m) = B(1, rm) when B is bilinear, and B(1, ) is linear in the secondcomponent.

The next theorem justifies the discussion in the introduction about bases for tensorproducts of free modules.

Theorem 4.8. If F and F are free R-modules then FR F is a free R-module. If{ei}iIand {ej}jJ are bases of F and F then {ei ej}(i,j)IJ is a basis of F R F.Proof. The result is clear if F or F is 0, so let them both be nonzero free modules (henceR = 0 and F and F have bases). By Lemma 3.3, {ei ej} spans FR F as an R-module.

To show this spanning set is linearly independent, suppose

i,j cij ei ej = 0, where allbut finitely many cij are 0. We want to show every cij is 0. Pick two basis vectors ei0 andej0 in F and F

. To create a coordinate function for ei0 ej0 in F R F, define thefunction F

F

R by (v, w)

ai0bj0, where v = i aiei and w = j bje

j . This function

is bilinear, so by the universal mapping property of tensor products there is a linear mapf0 : F R F R such that f0(v w) = ai0bj0 on all elementary tensors v w.

F R F

f0

F F

99rrrrrrrrrr

(v,w)ai0bj0 &&MMM

MMMM

MMMM

R

In particular, f0(ei0 ej0) = 1 and f0(ei ej) = 0 for (i, j) = (i0, j0). Applying f0 to theequation i,j cij ei ej = 0 in FR F tells us ci0j0 = 0 in R. Since i0 and j0 are arbitrary,all the coefficients are 0.

Theorem 4.8 has a concrete meaning in terms of bilinear maps out of F F. It saysthat any bilinear map out of F F is determined by its values on the pairs (ei, ej) andthat any assignment of values to these pairs extends in a unique way to a bilinear map.(The uniqueness of the extension is connected to the linear independence of the elementarytensors ei ej.) This is the bilinear analogue of the existence and uniqueness of a linearextension of a function from a basis of a free module to the whole module.

Example 4.9. Let R = 0 and F = Re1 Re2 be free of rank 2. Then F R F is free ofrank 4 with basis

e1 e1, e1 e2, e2 e1, e2 e2.

(The tensors e1 e2 and e2 e1 are not equal, since theyre different terms in a basis,but theres also an explanation of this using bilinear maps: if e1 e2 = e2 e1 then allbilinear maps out of F F takes the same value at (e1, e2) and (e2, e1), and that is nottrue. Consider the bilinear map F F R given by (ae1 + be2, ce1 + de2) ad. Soe1 e2 = e2 e1.)

The sum e1 e2 + e2 e1 in F R F is an example of a tensor that is provably not anelementary tensor. Any elementary tensor in F R F has the form(4.1) (ae1 + be2) (ce1 + de2) = ace1 e1 + ade1 e2 + bce2 e1 + bde2 e2.


15/71

TENSOR PRODUCTS 15

If this equals e1 e2 + e2 e1 thenac = 0, ad = 1, bc = 1, bd = 0.

Since ad = 1 and bc = 1, a and c are invertible, but that contradicts ac = 0. So e1e2+e2e1is not an elementary tensor.

4

Example 4.10. As an R-module, R[X] R R[Y] is free with basis {Xi Yj}i,j0, so thistensor product is isomorphic to R[X, Y] as R-modules by

cij (X

i Yj ) cij XiYj .More generally, R[X1, . . . , X k] = R[X]k as R-modules with Xi corresponding to the tensor1 X 1 where X is in the ith position. The difference between ordinary productsand tensor products is like the difference between multiplying polynomials as f(T)g(T) andas f(X)g(Y).

Example 4.11. We return to Example 2.1. For in M and in N, B,(m, n) =(m)(n) is a bilinear map M N R. Set B : M N BilR(M, N; R) by B(, ) =B,. This is bilinear, so we get a linear map L : M

R N BilR(M, N; R) whereL(

) = B,.

When M and N are finite free R-modules, lets show L is an isomorphism. We may sup-pose M and N are nonzero, with respective bases {e1, . . . , ek} and {f1, . . . , f }. By Theorem4.8, M R N is free of rank k with basis {ei fj }. By Example 2.1, BilR(M, N; R) isfree of rank k with basis {Bei ,fj }. Since Bei ,fj = L(ei fj ), L sends a basis to a basis,so its an isomorphism when M and N are finite free.

It is not true that L is an isomorphism for all M and N. When p is prime, R = Z/p2Z,and M = Z/pZ, show as an exercise that L : M R M BilR(M, M; R) is identically 0but M R M and BilR(M, M; R) both have size p.Theorem 4.12. LetF be a free R-module with basis {ei}iI. For any k 1, the kth tensorpower Fk is free with basis {ei1 eik}(i1,...,ik)Ik .Proof. This is similar to the proof of Theorem 4.8.

Theorem 4.13. If M is an R-module and F is a free R-module with basis {ei}iI, thenevery element of MR F has a unique representation in the form

iI mi ei, where all

but finitely many mi equal 0.

Proof. Using M as a spanning set of M and {ei}iI as a spanning set for F as R-modules,by Lemma 3.3 every element of M R F is a linear combination of elementary tensorsmi ei, where mi M. Since r(mi ei) = (rmi)ei, we can write every tensor in MR Fas a sum of elementary tensors of the form mi ei. So we have a surjective linear mapf:

iI M MR F given by f((mi)iI) =

iI mi ei. (All but finitely many mi are0, so the sum makes sense.)

To create an inverse to f, consider the function M F iI

M where (m,

iriei)

(rim)iI. This function is bilinear (check!), so there is a linear map g : MR F iI Mwhere g(m i riei) = (rim)iI.To check f(g(t)) = t for all t in MR F, we cant expect that all tensors in MR F are

elementary (an idea used in the proofs of Theorems 4.3 and 4.5), but we only need to checkf(g(t)) = t when t is an elementary tensor since f and g are additive and the elementary

4From (4.1), a necessary condition forP2

i,j=1 cijei ej to be elementary is that c11c22 = c12c21. When

R = K is a field this condition is also sufficient, so the elementary tensors in K2 K K2 are characterized

among all tensors by a polynomial equation of degree 2. For a generalization, see [2].


16/71

16 KEITH CONRAD

tensors additively span M R F. (We will use this kind of argument a lot to reduce theproof of an identity involving functions of all tensors to the case of elementary tensors eventhough most tensors are not themselves elementary. The point is all tensors are sums ofelementary tensors and the formula we want to prove involves additive functions.) Any

elementary tensor looks like m i riei, andf

g

m

iI

riei

= f((rim)iI)

=iI

rim ei

=iI

m riei

= m iI

riei.

These sums have finitely many terms (ri = 0 for all but finitely many i), from the definitionof direct sums. Thus f(g(t)) = t for all t MR F.For the composition in the other order,

g(f((mi)iI)) = g

iI

mi ei

=iI

g(mi ei) =iI

(. . . , 0, mi, 0, . . . ) = (mi)iI.

Now that we know MR F =

iI M, with

iI mi ei corresponding to (mi)iI, theuniqueness of coordinates in the direct sum implies the sum representation

iI mi ei is

unique.

Example 4.14. For any ring S R, elements ofSR R[X] have unique expressions of theform

n0 sn Xn, so SR R[X] = S[X] as R-modules by

n0 sn Xn

n0 snX

n.

Remark 4.15. Although you can check an additive identity f(g(t)) = t by only checkingit on elementary tensors, it would be really dumb to think you have proved injectivityof some linear map f: M R N P by only looking at elementary tensors.5 That is, iff(mn) = 0 mn = 0, there is no reason to believe f(t) = 0 t = 0 for all t MR N,since injectivity of a linear map is not an additive identity.6 This is the number one reasonthat proving that a linear map out of a tensor product is injective can require technique. Asa special case, if you want to prove a linear map out of a tensor product is an isomorphism,it might be easier to construct an inverse map and check the composite in both orders isthe identity than to show the original map is injective and surjective.

Theorem 4.16. If M is nonzero and finitely generated then Mk = 0 for all k.Proof. Write M = Rx1 + + Rxd, where d 1 is minimal. Set N = Rx1 + + Rxd1(N = 0 ifd = 1), so M = N+ Rxd and xd N. Set I = {r R : rxd N}, so I is an idealin R and 1 I, so I is a proper ideal. From the definition of I, the function Mk R/Igiven by

(n1 + r1xd, . . . , nk + rkxd) r1 rd mod I5Unless every tensor in MR N is elementary, which is usually not the case.6Heres an example. Let CR C C be the R-linear map with the effect z w zw on elementary

tensors. Ifzw 0 then z = 0 or w = 0, so z w = 0, but the map is not injective: 1 i i 1 0 but1 i i 1 = 0 since 1 i and i 1 belong to a basis ofCR C by Theorem 4.8.


17/71

TENSOR PRODUCTS 17

is well-defined. Its multilinear (check!), so there is a linear map Mk R/I such thatxd xd

k terms

1. That shows Mk = 0.

Tensor powers of non-finitely generated modules could vanish:Q

/Z

ZQ

/Z

= 0 (Ex-ample 3.6).The rest of this section is concerned with properties of tensor products over domains.

Theorem 4.17. Let R be a domain with fraction field K. There is a unique R-moduleisomorphism KR K = K where x y xy. In particular, Q Z Q = Q.Proof. Multiplication is a function m : K K K. It is R-bilinear, so the universalmapping property of tensor products says there is an R-linear function f: KR K Kwhere f(x y) = xy on elementary tensors. That says the diagram

KR K

f

K K

88rrrrrrrrrr

m&&MM

MMMM

MMMM

M

K

commutes. Since f(x 1) = x, f is onto.To show f is one-to-one, first we show every tensor in KR K looks like 1 y. For an

elementary tensor x y, write x = a/b with a and b in R where b = 0. Then x y =x by/b = bx y/b = a y/b = 1 ay/b = 1 xy. (We cant say x y = 1 xy frombilinearity of, as is R-bilinear and x K.) Thus all elementary tensors have the form1 y, so every tensor is 1 y for some y K since every tensor is a sum of elementarytensors. Now we can show f has trivial kernel: if f(t) = 0 then, writing t = 1 y, we gety = 0, so t = 1 0 = 0.

Remark 4.18. Multiplication K K K is also K-bilinear, not just R-bilinear, andthere is a K-vector space isomorphism KK K= K (tensor product over K, not R).

For any R-module M, every tensor in KR M is elementary from manipulations withtensors: any finite set of tensors of the form ai/bi mi can be provided with a commondenominator for the bis, say ai/bi = ai/b, so

i ai/bi mi =

i 1/b aimi = 1/b

(

i aimi), which is an elementary tensor. What we cant easily do by manipulations with

tensors alone is determine when an elementary tensor x m is 0 in KR M.Theorem 4.19. Let R be a domain with fraction field K and M be any nonzero R-modulein K. There is a unique R-module isomorphism K R M = K where x y xy. Inparticular, K

R I = K for every nonzero ideal I in R.

Proof. The proof is largely like the previous one, but it needs a few extra steps because thesecond module is an R-module in K rather than K itself.7 Multiplication gives a functionK M K which is R-bilinear, so we get an R-linear map f: K R M K wheref(x y) = xy. To show f is onto, we cant look at f(x 1) since 1 is usually not in M.Instead we can argue with a choice of nonzero m M: for any x K, f(x/m m) = x.

7Theorem 4.17 is just a special case of Theorem 4.19, but we worked it out separately since some of thetechnicalities are simpler.


18/71

18 KEITH CONRAD

To show f is injective, suppose f(t) = 0. All tensors in KR M are elementary, so wecan write t = x y. Then xy = 0 in K, so x = 0 or y = 0, so t = x y = 0.

Theorem 4.20. Let R be a domain and F and F be free R-modules. If x and x are

nonzero in F and F

, then x x

= 0 in F R F

.

Proof. Write x =

i aiei and x =

j a

je

j using bases for F and F. Then x x =

i,j aia

j ei ej in FR F. Since x and x are nonzero, they each have a nonzero coefficient,say ai and a

j. Then aia

j = 0 since R is a domain, so x x has a nonzero coordinate in the

basis {ei ej} of F R F (Theorem 4.8). Thus x x = 0.

Theorem 4.21. Let R be a domain with fraction field K.

(1) For any R-module M, KR M = KR (M/Mtor) as R-modules, where Mtor is thetorsion submodule of M.

(2) If M is a torsion R-module then KR M = 0 and if M is not a torsion modulethen KR M = 0.(3) IfN M is a submodule such that M/N is a torsion module then KRN = KRMas R-modules by x n x n.

Proof. (1) The map K M KR (M/Mtor) given by (x, m) x m is R-bilinear, sothere is a linear map f: KR M KR (M/Mtor) where f(x m) = x m.

To go the other way, the canonical bilinear map K M KR M vanishes at (x, m)if m Mtor: when rm = 0 for r = 0, x m = r(x/r) m = x/r rm = x/r 0 = 0(Theorem 3.5). Therefore we get an induced bilinear map K (M/Mtor) KR M givenby (x, m) xm. (The point is that an elementary tensor xm in KR M only dependson m through its coset mod Mtor.) The universal mapping property of the tensor productnow gives us a linear map g : KR (M/Mtor) KR M where g(x m) = x m.

The composites g f and f g are both linear and fix elementary tensors, so they fix alltensors and thus f and g are inverse isomorphisms.

(2) It is immediate from (1) that KR M = 0 ifM is a torsion module, since KR M =KR (M/Mtor) = KR 0 = 0. We could also prove this in a direct way, by showing allelementary tensors in KR M are 0: for x K and m M, let rm = 0 with r = 0, sox m = r(x/r) m = x/r rm = x/r 0 = 0.

To show KR M = 0 when M is not a torsion module, from the isomorphism KR M =KR (M/Mtor), we may replace M with M/Mtor and are reduced to the case when M istorsion-free. For torsion-free M we will create a nonzero R-module and a bilinear map ontoit from K M. This will require a fair bit of work (as it usually does to prove a tensorproduct doesnt vanish).

We want to consider formal products xm with x

K and m

M. To make this precise,

we will use equivalence classes of ordered pairs in the same way that a domain is enlargedto its fraction field. On the product set K M, define an equivalence relation by

(a/b, m) (c/d,n) adm = bcn in M.

Here a,b,c, and d are in R and b and d are not 0. The proof that this relation is well-defined(independent of the choice of numerators and denominators) and transitive requires M betorsion-free (check!). As an example, (0, m) (0, 0) for all m M.


19/71

TENSOR PRODUCTS 19

Define KM = (K M)/ and write the equivalence class of (x, m) as x m. Give KMthe addition and K-scaling formulas

a

b m + c

d n = 1

bd (adm + bcn), x(y m) = (xy) m.

It is left to the reader to check these operations on KM are well-defined and make KM intoa K-vector space (so in particular an R-module). The zero element of KM is 0 0 = 0 m.The function M KM given by m 1 m is injective, since if 1 m = 1 m then(1, m) (1, m), so m = m in M. So KM = 0 since M = 0.

The function K M KM given by (x, m) x m is R-bilinear and onto, so there isa linear map KR M f KM such that f(x m) = x m, which is onto. Since KM = 0we have KR M = 0, and in fact KR M = KM by f (exercise).

(3) Since N M, there is an obvious bilinear map K N KR M, namely (x, n) xn. So we get automatically a linear map f: KR N KR M where f(xn) = xn.(This is not the identity function: on the left x n is in KR N and on the right x n isin KR M.) To get an inverse map to f, let K M KR N by (x, m) (1/r)x rm,where r R {0} is chosen so rm N. There is such r since M/N is a torsion module.(We cant simplify (1/r)x rm by moving r through since rm is in N but usually m isnot.) To check this function is well-defined, if also rm N with r R {0}, then

1

rx rm = r

rrx rm = 1

rrx rrm = 1

rrx r(rm) = r

rrx rm = 1

rx rm.

So our function KM KR N is well-defined, and the reader can check it is bilinear. Itleads to a linear map g : KR M KR N where g(x m) = (1/r)x rm when rm N,r = 0. As an exercise, check f(g(x m)) = x m and g(f(x n)) = x n, so f g andg f are both the identity by additivity. Corollary 4.22. Let R be a domain with fraction field K. In KR M, x m = 0 if andonly if x = 0 or m Mtor. In particular, Mtor = ker(M KR M) where m 1 m.Proof. Ifx = 0 then x m = 0 m = 0. Ifm Mtor, with rm = 0 for some nonzero r R,then x m = (x/r)r m = (x/r) rm = (x/r) 0 = 0.

Conversely, suppose x m = 0. We want to show x = 0 or m Mtor. Write x = a/b,so (1/b) am = 0. If a = 0 then x = 0, so we suppose a = 0 and will show m Mtor.Multiply by b to get 1 am = 0. From the isomorphism K R M = K R (M/Mtor),1 am = 0 in KR (M/Mtor). Since M/Mtor is torsion-free, applying the R-linear mapKR (M/Mtor) K(M/Mtor) from the proof of Theorem 4.21 tells us that 1 am = 0 inK(M/Mtor). The function m 1 m from M/Mtor to K(M/Mtor) is injective, so am = 0,so am Mtor. Therefore there is nonzero r R such that 0 = r(am) = (ra)m. Sincera = 0, m Mtor. Example 4.23. The tensor product Q Z A is 0 when A is a torsion abelian group, so werecover Example 3.6.

5. General Properties of Tensor Products

There are canonical isomorphisms M N = N M and (M N) P = M (N P).We want to show similar isomorphisms for tensor products: M R N = N R M and(M R N) R P = M R (N R P). Furthermore, there is a distributive property overdirect sums: MR (N P) = (MR N) (MR P). How these modules are isomorphicis much more important than just that they are isomorphic.


20/71

20 KEITH CONRAD

Theorem 5.1. There is a unique R-module isomorphism M R N = N R M wherem n n m.Proof. We want to create a linear map MR N NR M sending m n to n m. Todo this, we back up and start off with a map out of M N to the desired target moduleN R M. Define M N N R M by (m, n) n m. This is a bilinear map sincen m is bilinear in m and n. Therefore by the universal mapping property of the tensorproduct, there is a unique linear map f: MR N NR M such that f(m n) = n mon elementary tensors.

MR N

f

M N

88qqqqqqqqqq

(m,n)nm &&MMM

MMMM

MMM

NR MRunning through the above arguments with the roles of M and N interchanged, there

is a unique linear map g : N R M M R N where g(n m) = m n on elementarytensors. We will show f and g are inverses of each other.

To show f(g(t)) = t for all t NR M, it suffices to check this when t is an elementarytensor, since both sides are R-linear (or even just additive) in t and N R M is spannedby its elementary tensors: f(g(n m)) = f(m n) = n m. Therefore f(g(t)) = t for allt N R M. The proof that g(f(t)) = t for all t MR N is similar. We have shown fand g are inverses of each other, so f is an R-module isomorphism.

Theorem 5.2. There is a unique R-module isomorphism (MR N)R P = MR (NR P)where (m

n)

p

m

(n

p).

Proof. By Lemma 3.3, (MR N) R P is linearly spanned by all (m n) p and MR(N R P) is linearly spanned by all m (n p). Therefore linear maps out of these twomodules are determined by their values on these8 elementary tensors. So there is at mostone linear map (MR N)R P MR (NR P) with the effect (mn)p m(np),and likewise in the other direction.

To create such a linear map (MR N) R P MR (NR P), consider the functionM N P M R (N R P) given by (m,n,p) m (n p). Since m (n p) istrilinear in m, n, and p, for each p we get a linear map fp : M R N M R (N R P)where fp(m n) = m (n p) on elementary tensors in MR N.

Now we consider the function (MR N) P MR (NR P) given by (t, p) fp(t).This is bilinear! First, it is linear in t with p fixed, since each fp is a linear function. Nextwe show it is linear in p with t fixed:

fp+p(t) = fp(t) + fp(t) and frp(t) = rfp(t)

8A general elementary tensor in (MR N)R P is not (m n) p, but t p where t MR N andt might not b e elementary itself. Similarly, elementary tensors in MR (NR P) are more general thanm (np).


21/71

TENSOR PRODUCTS 21

for any p, p, and r. Both sides of these identities are additive in t, so to check them itsuffices to check the case when t = m n:

fp+p(m n) = (m n) (p +p)

= (m n) p + (m n) p

= fp(m n) + fp(m n)= (fp + fp)(m n).

That frp(m n) = rfp(m n) is left to the reader. Since fp(t) is bilinear in p and t,the universal mapping property of the tensor product tells us there is a unique linear mapf: (MR N) R P MR (NR P) such that f(t p) = fp(t). Then f((m n) p) =fp(m n) = m (n p), so we have found a linear map with the desired values on thetensors (m n) p.

Similarly, there is a linear map g : MR(NRP) (MRN)RP where g(m(np)) =(mn)p. Easily f(g(m(np))) = m(np) and g(f((mn)p)) = (mn)p. Sincethese particular tensors linearly span the two modules, these identities extend by linearity

(f and g are linear) to show f and g are inverse functions.

Theorem 5.3. There is a unique R-module isomorphism

MR (N P) = (MR N) (MR P)where m (n, p) (m n, m p).Proof. Instead of directly writing down an isomorphism, we will put to work the essentialuniqueness of solutions to a universal mapping problem by showing (MR N) (MR P)has the universal mapping property of the tensor product M R (N P). Therefore byabstract nonsense these modules must be isomorphic. That there is an isomorphism whoseeffect on elementary tensors in MR (NP) is as indicated in the statement of the theoremwill fall out of our work.

For (M

R N)

(M

R P) to be a tensor product of M and N

P, it needs a bilinear

map to it from M(NP). Let b : M(NP) (MR N)(MR P) by b(m, (n, p)) =(m n, m p). This function is bilinear. We verify one part:

b(m, (n, p) + (n, p)) = b(m, (n + n, p +p))

= (m (n + n), m (p +p))= (m n + m n, m p + m p)= (m n, m p) + (m n, m p)= b(m, (n, p)) + b(m, (n, p)).

To show (M R N) (M R P) equipped with the map b has the universal mappingproperty of MR (N P) and its canonical bilinear map , let B : M (N P) Q beany bilinear map. We seek an R-linear map L making the diagram

(5.1) (MR N) (MR P)

L

M (N P)

b55jjjjjjjjjjjjjjj

B**TTT

TTTTTT

TTTTTT

TTTT

Q


22/71

22 KEITH CONRAD

commute. Being linear, L would be determined by its values on the direct summands, andthese values would be determined by its values on all pairs (m n, 0) and (0, m p) byadditivity. These values are forced by commutativity of (5.1) to be

L(m

n, 0) = L(b(m,(n, 0))) = B(m,(n, 0)) and L(0, m

p) = L(b(m,(0, p))) = B(m,(0, p)).

To construct L, we are inspired by these formulas to contemplate the maps M N Qand M P Q given by (m, n) B(m, (n, 0)) and (m, p) B(m, (0, p)). Both arebilinear, so there are R-linear maps MR N L1 Q and MR P L2 Q where

L1(m n) = B(m, (n, 0)) and L2(m p) = B(m, (0, p)).Define L on (MR N) (MR P) by L(t1, t2) = L1(t1) + L2(t2). (Notice we are definingL not just on ordered pairs of elementary tensors, but on all pairs of tensors. We need L1and L2 to be defined on the whole tensor product modules M R N and M R P.) Themap L is linear since L1 and L2 are linear, and (5.1) commutes:

L(b(m, (n, p))) = L(b(m, (n, 0) + (0, p)))

= L(b(m, (n, 0)) + b(m, (0, p)))= L((m n, 0) + (0, m p)) by the definition of b= L(m n, m p)= L1(m n) + L2(m p) by the definition of L= B(m, (n, 0)) + B(m, (0, p))

= B(m, (n, 0) + (0, p))

= B(m, (n, p)).

Now that weve shown (MR N) (MR P) and the bilinear map b have the universalmapping property of M R (N P) and the canonical bilinear map , there is a uniquelinear map f making the diagram

(MR N) (MR P)

f

M (N P)

b55jjjjjjjjjjjjjjj

))TTT

TTTTTT

TTTTTT

MR (N P)commute, and f is an isomorphism of R-modules because it transforms one solution of auniversal mapping problem into another. Taking (m, (n, p)) around the diagram both ways,

f(b(m, (n, p))) = f(m

n, m

p) = m

(n, p).

Therefore the inverse off is an isomorphism MR (N P) (MR N) (MR P) withthe effect m (n, p) (m n, m p). Theorem 5.4. There is a unique R-module isomorphism

MRiI

Ni =iI

(MR Ni)

where m (ni)iI (m ni)iI.


23/71

TENSOR PRODUCTS 23

Proof. We extrapolate from the case #I = 2 in Theorem 5.3. The map b : M(iI Ni) iI(MR Ni) by b((m, (ni)iI)) = (m ni)iI is bilinear. We will show

iI(MR Ni)

and b have the universal mapping property of MR

iI Ni and .

Let B : M

(iI Ni) Q be bilinear. For each i I the function M Ni Q where(m, ni) B(m, (. . . , 0, ni, 0, . . . )) is bilinear, so there is a linear map Li : M R Ni Q

where Li(mni) = B(m, (. . . , 0, ni, 0, . . . )). Define L :

iI(MR Ni) Q by L((ti)iI) =iI Li(ti). All but finitely many ti equal 0, so the sum here makes sense, and L is linear.

It is left to the reader to check the diagram iI(MR Ni)

L

MiI Nib

66lllllllllllll

B))RR

RRRR

RRRR

RRRR

RR

Q

commutes. Any linear map L making this diagram commute has its value on (. . . , 0, m ni, 0, . . . ) = b(m, (. . . , 0, ni, 0, . . . )) determined by B, so it is unique. Thus

iI(MR Ni)

and the bilinear map b to it have the universal mapping property of MR

iI Ni and thecanonical map , so there is an R-module isomorphism f making the diagram

iI(MR Ni)

f

MiI Nib

66lllllllllllll

((RR

RRRR

RRRR

RRR

MR iI Nicommute. Sending (m, (ni)iI) around the diagram both ways shows f((m ni)iI) = m (ni)iI, so the inverse of f is an isomorphism with the effect m (ni)iI (m ni)iI. Remark 5.5. The analogue of Theorem 5.4 for direct products is false. While there is anatural R-linear map

(5.2) MRiI

Ni iI

(MR Ni),

namely m (ni)iI (m ni)iI on elementary tensors, it is not an isomorphism ingeneral. Taking R = Z, M = Q, and Ni = Z/p

iZ (i 1), the right side of (5.2) is 0 sinceQ Z (Z/piZ) = 0 for all i 1 (Example 3.6). The left side of (5.2) is Q Z i1

Z/piZ,

which is not 0 by Theorem 4.21 since i1 Z/piZ is not a torsion abelian group.In our proof of associativity of the tensor product, we started with a function on a direct

product MNP and collapsed this domain to an iterated tensor product (MR N)R Pusing bilinearity twice. It is useful to record a rather general result in that direction, as atechnical lemma for future convenience.

Theorem 5.6. Let M1, . . . , M k, N be R-modules, with k > 2, and suppose

M1 Mk2 Mk1 Mk N


24/71

24 KEITH CONRAD

is a function which is bilinear in Mk1 and Mk when other coordinates are fixed. There isa unique function

M1 Mk2 (Mk1 R Mk) N

that is linear in Mk1 R Mk when the other coordinates are fixed and satisfies(5.3) (m1, . . . , mk2, mk1 mk) = (m1, . . . , mk2, mk1, mk).If is multilinear in M1, . . . , M k, then is multilinear in M1, . . . , M k2, Mk1 R Mk.Proof. Assuming a function exists satisfying (5.3) and is linear in the last coordinate,its value everywhere is determined by additivity in the last coordinate: write any tensort Mk1 R Mk in the form t =

pi=1 xi yi, and then

(m1, . . . , mk2, t) =

m1, . . . , mk2,

pi=1

xi yi

=

pi=1

(m1, . . . , mk2, xi yi)

=

pi=1

(m1, . . . , mk2, xi, yi).

It remains to show exists with the desired properties.Fix mi Mi for i = 1, . . . , k 2. Define m1,...,mk2 : Mk1 Mk N by

m1,...,mk2(x, y) = (m1, . . . , mk2, x , y).

By hypothesis m1,...,mk2 is bilinear in x and y, so from the universal mapping property ofthe tensor product there is a linear map m1,...,mk2 : Mk1

R Mk

N such that

m1,...,mk2(x y) = m1,...,mk2(x, y) = (m1, . . . , mk2, x , y).Define : M1 Mk2 (Mk1 R Mk) N by

(m1, . . . , mk2, t) = m1,...,mk2(t).

Since m1,...,mk2 is a linear function on Mk1 R Mk, (m1, . . . , mk2, t) is linear in t whenm1, . . . , mk2 are fixed.

If is multilinear in M1, . . . , M k we want to show is multilinear in M1, . . . , M k2,Mk1 R Mk. We already know is linear in Mk1 R Mk when the other coordinates arefixed. To show is linear in each of the other coordinates (fixing the rest), we carry outthe computation for M1 (the argument is similar for other Mis): is

(x + x, m2, . . . , mk2, t)?

= (x, m2, . . . , mk2, t) + (x, m2, . . . , mk2, t)

(rx,m2, . . . , mk2, t)?

= r(x, m2, . . . , mk2, t)

when m2, . . . , mk2, t are fixed in M2, . . . , M k2, Mk1 R Mk? In these two equations, bothsides are additive in t so it suffices to verify these two equations when t is an elementarytensor, say t = mk1 mk. Then from (5.3), these two equations are true since wereassuming is linear in M1 (fixing the other coordinates).


25/71

TENSOR PRODUCTS 25

Theorem 5.6 is not specific to functions which are bilinear in the last two coordinates:any two coordinates can be used when the function is bilinear in those two coordinates. Forinstance, lets revisit the proof of associativity of the tensor product in Theorem 5.3. Define

: M

N

P

M

R (N

R P)

by (m,n,p) = m (n p). This function is trilinear, so Theorem 5.6 says we can replaceM N with its tensor product: there is a bilinear function

: (MR N) P MR (NR P)such that (m n, p) = m (n p). Since is bilinear, there is a linear function

f: (MR N) R P MR (NR P)such that f(tp) = (t, p), so f((mn)p) = (mn, p) = m(np). The constructionof the functions fp in the proof of Theorem 5.3 are now seen to be special cases of Theorem5.6.

The remaining module properties we treat with the tensor product in this section in-

volve its interaction with the Hom-module construction, so in particular the dual moduleconstruction (M = HomR(M, R)).

Theorem 5.7. There are R-module isomorphisms

HomR(M, HomR(N, P)) = BilR(M, N; P) = HomR(MR N, P).Proof. A member of HomR(M, HomR(N, P)) is a linear map f: M HomR(N, P), andfrom f we can create a bilinear map Bf : MN P by Bf(m, n) = f(m)(n). In the otherdirection, from any bilinear map B : MN P we get a linear map fB : M HomR(N, P)by fB(m) : n B(m, n). The correspondences f Bf and B fB are obviously in-verses, and the reader can check they are linear as well, so HomR(M, HomR(N, P)) =BilR(M, N; P). The R-module isomorphism between BilR(M, N; P) and HomR(MR N, P)comes from the universal mapping property of the tensor product.

Heres a high-level way of interpreting the isomorphism between the two Hom-modulesin Theorem 5.7. Write FN(M) = HomR(N, M) and GN(M) = MR N, so FN and GNturn R-modules into new R-modules. Theorem 5.7 says

HomR(M, FN(P)) = HomR(GN(M), P).This is analogous to the relation between a matrix and its transpose inside the dot product:

v Aw = Av w.So FN and GN are transposes of each other. Actually, FN and GN are really calledadjoints of each other (FN is left adjoint to GN, and GN is right adjoint to FN) becausepairs of operators L and L in linear algebra that satisfy the relation L(v) w = v L(w)are called adjoints and the relation between FN and GN is formally similar.Corollary 5.8. For R-modules M and N, there are R-module isomorphisms

HomR(M, N) = HomR(N, M) = BilR(M, N; R) = (MR N).

Proof. Let P = R in Theorem 5.7: HomR(M, N) = BilR(M, N; R) = (M R N). By

Theorem 5.1, MR N = NR M, so (MR N) = (NR M), and the latter dual moduleis isomorphic to HomR(N, M

) by Theorem 5.7 with the roles of M and N there reversedand P = R. Thus we have obtained isomorphisms between the desired modules.


26/71

26 KEITH CONRAD

Explicitly, the isomorphism between HomR(M, N) and HomR(N, M

) amounts to real-izing linear maps in both Hom-modules are the same as bilinear maps B : M N R.

The construction of MR N is symmetric in M and N in the sense that MR N =N

RM in a natural way, but Corollary 5.8 is not saying Hom

R(M, N)

=Hom

R(N, M)

since those are not the Hom-modules in the corollary. For instance, if R = M = Z andN = Z/2Z then HomR(M, N) = Z/2Z and HomR(N, M) = 0.Theorem 5.9. For R-modules M and N, there is a unique linear map

M R N HomR(M, N)sending the elementary tensor n to [m (m)n]. This is an isomorphism if M andN are finite free. In particular, if F is finite free then F R F = EndR(F) as R-modules.

Concretely, the last part is saying (Rn)R Rn = Mn(R) as R-modules, in a natural way.Proof. We need to make an element of M and element of N act together as a linear mapM

N. The function M

N

M

N given by (,n,m)

(m)n is trilinear.

Here the parameter acts on m to give a scalar, which is then multiplied by n. ByTheorem 5.6, this trilinear map induces a bilinear map B : (M R N) M N whereB( n, m) = (m)n. For t M R N, B(t, ) is in HomR(M, N), so we have a linearmap f: M R N HomR(M, N) by f(t) = B(t, ). (Explicitly, the elementary tensor n acts on M by the rule ( n)(m) = (m)n.)

Now let M and N be finite free. To show f is an isomorphism, we may suppose M andN are nonzero. Pick bases {ei} of M and {ej} of N. Then f lets ei ej act on M bysending ek to e

i (ek)e

j = ike

j . So f(e

i ej) HomR(M, N) sends ei to ej and sends every

other member of the basis ofM to 0. Such linear maps are a basis of HomR(M, N), so f isonto.

To show f is one-to-one, suppose f(

i,j cije

i ej) = O in HomR(M, N). Applying both

sides to any ek

, we get i,j cijikej = 0, which says j ckj ej = 0, so ckj = 0 for all j andall k. Thus every cij is 0. This concludes the proof that f is an isomorphism.Lets work out the inverse map explicitly. For L HomR(M, N), write L(ei) =

j aji e

j ,

so L has matrix representation (aji). (The matrix indices here look reversed from usualpractice because we use i as the index for basis vectors in M and j as the index for basisvectors in N; review how linear maps become matrices when bases are chosen. If we hadindexed bases ofM and N with i and j in each others places, then L(ej) =

aij e

i .) From

the isomorphism f, let L correspond to

i,j cijei ej in M R N, with the coefficients

cij to be determined. Then

L(ek) =

i,j

cij(ei ej)(ek) =

j

ckj e

j,

so ajk = ckj . Therefore cij = aji , so L HomR(M, N) with matrix representation (aji )corresponds to i,j aji ei ej . Thats nice. It just says ei ej corresponds to the matrixunit Eji.

9

Example 5.10. For finite-dimensional vector spaces V and W over the field K, VKW =HomK(V, W) by ( w)(v) = (v)w. This is one of the most basic ways tensor productsoccur in linear algebra. Understand this isomorphism! When V = W = K2, with standard

9Notice the index switch: ei ej goes to Eji and not Eij .


27/71

TENSOR PRODUCTS 27

basis e1 and e2, the matrix ( a bc d ) M2(K) = HomK(V, W) corresponds to the tensore1

ac

+ e2

bd

since this tensor sends e1 to e

1 (e1)

ac

+ e2 (e1)

bd

=a

c

, and similarly

this tensor sends e2 to

bd

.

Remark 5.11. The linear map M

R N HomR(M, N) in Theorem 5.9 is not anisomorphism, or even injective or surjective, in general. For example, let p be prime,R = Z/p2Z, and M = N = Z/pZ as an R-module. Check as an exercise that M = M,M R M = M, HomR(M, M) = M, and the natural map M R M HomR(M, M) isidentically 0 (it suffices to show every elementary tensor in M R M acts on M as 0).Remark 5.12. We could have bypassed the injectivity part of the proof, for the followingreason. When M and N are finite free, M R N and HomR(M, N) are both finite free R-modules of the same rank, so they are abstractly isomorphic. Therefore a linear surjectionbetween them has to be an isomorphism (consequence of Cayley-Hamilton theorem), so fis an isomorphism.

When M and N are finite free R-modules, the isomorphisms in Corollary 5.8 and Theorem

5.9 lead to a description ofMR N that makes no mention of universal mapping properties.Identify M with M by double duality, so Theorem 5.9 assumes the form

MR N = HomR(M, N),where m n goes over to the linear map (m)n. Since N = N by double duality,HomR(M

, N) = HomR(M, (N)) = BilR(M, N; R) by Corollary 5.8. Therefore(5.4) MR N = BilR(M, N; R),where m n acts as the bilinear map M N R sending (, ) to (m)(n). Thedefinition of the tensor product of finite-dimensional vector spaces in [1, p. 65] and [4, p. 35]is essentially (5.4).10 It is a good exercise to check these interpretations of m n as amember of HomR(M

, N) and BilR(M, N; R) are identified with each other by Corollary

5.8 and double duality.But watch out! The formula (5.4) for the tensor product does not work for generalmodules M and N (where double duality doesnt hold). While there is always a linear mapMR N BilR(M, N; R) given on elementary tensors by mn [(, ) (m)(n)],it is generally not an isomorphism.

Example 5.13. Let p be prime, R = Z/p2Z, and M = Z/pZ as an R-module. The naturalmap MR M BilR(M, M; R) is identically 0 while both sides have size p.

6. Base Extension

In algebra, there are many times a module over one ring is replaced by a related moduleover another ring. For instance, in linear algebra it is useful to enlarge Rn to Cn, creating

in this way a complex vector space by letting the real coordinates be extended to complexcoordinates. In ring theory, irreducibility tests in Z[X] involve viewing a polynomial inZ[X] as a polynomial in Q[X] or reducing the coefficients mod p to view it in (Z/pZ)[X].We will see that all these passages to modules with new coefficients (Rn Cn, Z[X] Q[X], Z[X] (Z/pZ)[X]) can be described in a uniform way using tensor products.

10Using the last isomorphism in Corollary 5.8 and double duality, MR N= BilR(M,N;R) for finite

freeM and N, where mn goes to the (evaluation) function B B(m,n). This is how tensor products offinite-dimensional vector spaces are defined in [3, p. 40], namely VKW is the dual space to BilK(V, W;K).


28/71

28 KEITH CONRAD

Let f: R S be a homomorphism of commutative rings. We use f to consider anyS-module N as an R-module by rn := f(r)n. In particular, S itself is an R-module byrs := f(r)s. Passing from N as an S-module to N as an R-module in this way is calledrestriction of scalars.

Example 6.1. If R S, f can be the inclusion map (e.g., R C or Q C). This ishow a C-vector space is thought of as an R-vector space or Q-vector space.

Example 6.2. If S = R/I, f can be reduction modulo I (e.g., R = K[T] and S =K[T]/(m(T))). This is how a K-vector space V on which a linear operator A acts is turnedinto a K[T]-module by letting T act as A. First V becomes a module over the ring ofoperators K[A] acting on V. Since K[A] = K[T]/(m(T)), where m(T) is the minimalpolynomial ofA, V is a K[T]/(m(T))-module. Then V becomes a K[T]-module by pullingback the scalars from K[T]/(m(T)) to K[T] using reduction mod m(T).

Here is a simple illustration of restriction of scalars.

Theorem 6.3. LetN and N

be S-modules. Any S-linear map N N

is also an R-linearmap when we treat N and N as R-modules.

Proof. Let : N N be S-linear, so (sn) = s(n) for any s S and n N. For r R,(rn) = (f(r)n) = f(r)(n) = r(n),

so is R-linear.

As a notational convention, since we will be going back and forth between R-modulesand S-modules a lot, we will write M (or M, and so on) for R-modules and N (or N, andso on) for S-modules. Since N is also an R-module by restriction of scalars, we can formthe usual R-module MR N, where

r(m n) = (rm) n = m rn,with the third expression really being m f(r)n since rn := f(r)n.

We want to reverse the process of restriction of scalars. For any R-module M we wantto create an S-module of products sm which matches the old meaning of rm if s = f(r).This new module is called an extension of scalars or base extension. It will be the R-moduleSR M equipped with a new structure of S-module.

Since S is a ring and not just an R-module, we can make SR M into an S-module by(6.1) s(s m) = ss m.But does this S-scaling extend unambiguously to all tensors?

Theorem 6.4. The additive group S R M has a unique S-module structure satisfying(6.1), and this is compatible with the R-module structure in the sense that rt = f(r)t for allr R and t SR M.Proof. Suppose the additive group SR M has an S-module structure satisfying (6.1). Wewill show the S-scaling on all tensors in SR M is determined by this. Any t SR M isa finite sum of elementary tensors, say

t = s1 m1 + + sk mk.


29/71

TENSOR PRODUCTS 29

For s S,st = s(s1 m1 + + sk mk)

= s(s1 m1) + + s(sk mk)= ss1 m1 + + ssk mk by (6.1),

so st is determined, although this formula for it is not obviously well-defined. (Does adifferent expression for t as a sum of elementary tensors change st?)

Now we show there really is an S-module structure on SR M satisfying (6.1). Describingthe S-scaling on SR M means creating a function S (SR M) SR M satisfyingthe relevant scaling axioms:

(6.2) 1 t = t, s(t1 + t2) = st1 + st2, (s1 + s2)t = s1t + s2t, s1(s2t) = (s1s2)t.For each s S we consider the function S M SR M given by (s, m) (ss) m.

This is R-bilinear, so by the universal mapping property of tensor products there is anR-linear map s : S

R M

S

R M where s(s

m) = (ss)

m on elementary tensors.

Define a multiplication S (SR M) SR M by st := s(t). This will be the scalingof S on SR M. We check the conditions in (6.2):

(1) To show 1t = t means showing 1(t) = t. On elementary tensors, 1(s m) =(1 s) m = s m, so 1 fixes elementary tensors. Therefore 1 fixes all tensors byadditivity.

(2) s(t1 + t2) = st1 + st2 since s is additive.(3) To show (s1 + s2)t = s1t + s2t means showing s1+s2 = s1 + s2 as functions on

SR M. Both sides are additive functions on SR M, so it suffices to check theyagree on tensors s m, where both sides have common value (s1 + s2)s m.

(4) To show s1(s2t) = (s1s2)t means s1 s2 = s1s2 as functions on SR M. Bothsides are additive functions of t, so it suffices to check they agree on tensors s m,where both sides have common value (s1s2s) m.

Lets check the S-module structure on SR M is compatible with its original R-modulestructure. For r R, if we treat r as f(r) S then scaling by f(r) on an elementary tensors m has the effect f(r)(s m) = f(r)s m. Since f(r)s is the definition ofrs (this is howwe make S into an R-module), f(r)sm = rsm = r(sm). Thus f(r)(sm) = r(sm),so f(r)t = rt for all t in SR M.

By exactly the same kind of argument, MR S has a unique S-module structure wheres(ms) = mss. So whenever we meet MR S or SR M, we know they are S-modulesin a specific way. Moreover, these two S-modules are naturally isomorphic: by Theorem5.1, there is an isomorphism : SR M MR S ofR-modules where (s m) = m s.To show is in fact an isomorphism of S-modules, all we need to do is check S-linearity

since is known to be additive and a bijection. To show (s

t) = s

(t) for all s

and t,additivity of both sides in t means we may focus on the case t = s m:

(s(s m)) = ((ss) m) = m ss = s(m s) = s(s m).This idea of creating an S-module isomorphism by using a known R-module isomorphismthat is also S-linear will be used many more times, so watch for it.

Now we must be careful to refer to R-linear and S-linear maps, rather than just linearmaps, so it is clear what our scalar ring is each time.


30/71

30 KEITH CONRAD

Example 6.5. In Example 4.5 we saw (R/I)R M = M/IMas R-modules by rm rm.Since M/IM is naturally an R/I-module, and now we know (R/I)R M is an R/I-module,the R-module isomorphism turns out to be an R/I-module isomorphism too since it is R/I-linear (check!).

Example 6.6. In Theorem 4.21, we looked at KR M when R is a domain with fractionfield K. It was treated there as an R-module, but now we see it is also a K-vector spacewith the K-scaling rule x(y m) = xy m on elementary tensors.Theorem 6.7. If F is a free R-module with basis {ei}iI then SR F is a free S-modulewith basis {1 ei}iI.Proof. Since S is an R-module, we know from Theorem 4.13 that every element of SR Fhas a unique representation in the form

iI si ei, where all but finitely many si equal 0.

Since si ei = si(1 ei) in the S-module structure on SR F, every element of SR F is aunique S-linear combination

si(1 ei), which says {1 ei} is an S-basis ofSR F.

Example 6.8. As an S-module, S

R R

n has S-basis

{1

e1, . . . , 1

en

}where

{e1, . . . , en

}is the standard basis of Rn, so Sn = SR Rn as S-modules by(s1, . . . , sn)

ni=1

si(1 ei) =n

i=1

si ei

because this map is S-linear (check!) and sends an S-basis to an S-basis. In particular,SR R = S as S-modules by s r sr.

For instance,C R Rn = Cn, C R Mn(R) = Mn(C).

These are isomorphisms as C-vector spaces. For any ideal I in R, (R/I) R Rn = (R/I)nas R/I-modules.

Example 6.9. As an S-module, SR R[X] has S-basis {1 Xi}i0, so SR R[X] = S[X]as S-modules11 by i0 si Xi i0 siXi.

As particular examples, C R R[X] = C[X] as C-vector spaces, Q Z Z[X] = Q[X] asQ-vector spaces and (Z/pZ) Z Z[X] = (Z/pZ)[X] as Z/pZ-vector spaces.Example 6.10. If we treat Cn as a real vector space, then its base extension to C is thecomplex vector space C R Cn where c(z v) = cz v for c in C. Since Cn = R2n as realvector spaces, we have a C-vector space isomorphism

C R Cn = C R R2n = C2n.Thats interesting: restricting scalars on Cn to make it a real vector space and then ex-tending scalars back up to C does not give us Cn back, but instead two copies of Cn. Thepoint is that when we restrict scalars, the real vector space Cn forgets it is a complex vectorspace. So the base extension to C doesnt remember that it used to be a complex vectorspace.

Quite generally, ifV is a finite-dimensional complex vector space and we view it as a realvector space, its base extension C R V as a complex vector space is not V but a directsum of two copies ofV. Lets do a dimension check: as real vector spaces, dimR(CRV) =dimR(C)dimR(V) = 2(2dimC V) = 4d imC V and dimR(V V) = 2d imC(V V) =

11We saw SR R[X] and S[X] are isomorphic as R-modules in Example 4.14 when S R, and it holds

now for any Rf S.


31/71

TENSOR PRODUCTS 31

4dimC V, so the two dimensions match. This match is of course not a proof that thereis a natural isomorphism C R V V V of complex vector spaces. Work out such anisomorphism as an exercise.

To get our bearing on this example, lets compare an S-module N with the S-moduleSR N (where s(s n) = ss n). Since N is already an S-module, should SR N = N?If you think so, reread Example 6.10 (R = R, S = C, N = Cn). Scalar multiplicationSN N is R-bilinear, so there is an R-linear map : SR N N where (sn) = sn.This map is also S-linear: (st) = s(t). To check this, since both sides are additive in t itsuffices to check the case of elementary tensors, and

(s(s n)) = ((ss) n) = ssn = s(sn) = s(s n).In the other direction, the function : N SR N where (n) = 1 n is R-linear but isgenerally not S-linear since (sn) = 1sn has no reason to be s(n) = sn because wereusing R, not S. We have created natural maps : SR N N and : N SR N;are they inverses? Its unlikely, since is S-linear and is generally not. But lets workout the composites and see what happens. In one direction,

((n)) = (1 n) = 1 n = n.In the other direction,

((s n)) = (sn) = 1 sn = s nin general. So is the identity but is usually not the identity. Since = idN, is a section to , so N is a direct summand SR N. Explicitly, SR N = ker N bys n (s n 1 sn, sn) and its inverse map is (k, n) k + 1 n. The phenomenonthat SR N is typically larger than N when N is an S-module can be remembered by theexample C R Cn = C2n.Theorem 6.11. For R-modules {Mi}iI, there is an S-module isomorphism

SR iI

Mi = iI

(SR Mi).

Proof. Since S is an R-module, by Theorem 5.4 there is an R-module isomorphism

: SRiI

Mi iI

(SR Mi)

where (s (mi)iI) = (s mi)iI. To show is an S-module isomorphism, we just haveto check is S-linear, since we already know is additive and a bijection. It is obvious that(st) = s(t) when t is an elementary tensor, and since both (st) and s(t) are additivein t the case of general tensors follows.

The analogue of Theorem 6.11 for direct products is false. There is a natural S-linear

map SRiI Mi iI(SR Mi), but it need not be an isomorphism: QZi1 Z/piZis nonzero (Remark 5.5) but

i1(Q Z Z/piZ) is 0.

We now put base extensions to work. Let M be a finitely generated R-module, say withn generators. That is the same as saying there is a linear surjection Rn M. To say Mcontains a subset of d linearly independent elements is the same as saying there is a linearinjection Rd M. If both Rn M and Rd M, it is natural to suspect d n, i.e., thesize of a spanning set should always be an upper bound on the size of a linearly independentsubset. Is it really true? IfR is a field, so modules are vector spaces, we can use dimension


32/71

32 KEITH CONRAD

inequalities on Rd, M, and Rn to see d n. But if R is not a field, then what? We willsettle the issue in the affirmative when R is a domain, by tensoring M with the fraction fieldof R to reduce to the case of vector spaces. We first tensored R-modules with the fractionfield ofR in Theorem 4.21, but not much use was made of the vector space structure of the

tensor product with a field. Now we exploit it.

Theorem 6.12. LetR be a domain with fraction field K. For a finitely generatedR-moduleM, KR M is finite-dimensional as a K-vector space and dimK(KR M) is the maximalnumber of R-linearly independent elements in M and is a lower bound on the size of aspanning set for M. In particular, the size of any linearly independent subset of M is lessthan or equal to the size of any spanning set of M.

Proof. If x1, . . . , xn is any spanning set for M as an R-module then 1 x1, . . . , 1 xn spanKR M as a K-vector space, so dimK(KR M) n.

Let y1, . . . , yd be R-linearly independent in M. We will show {1 yi} is K-linearlyindependent in KR M, so d dimK(KR M). Suppose

di=1 ci(1 yi) = 0 with ci K.

Write ci

= ai/b using a common denominator b in R. Then 0 = 1/b

di=1 aiyi in KR M.By Corollary 4.22, this implies di=1 aiyi Mtor, so di=1 raiyi = 0 in M for some nonzeror R. By linear independence of the yis over R, every rai is 0, so every ai is 0 (R is adomain). Thus every ci = ai/b is 0.

It remains to prove M has a linearly independent subset of size dimK(KR M). Let{e1, . . . , ed} be a linearly independent subset of M, where d is maximal. (Since d dimK(K R M), there is a maximal d.) For every m M, {e1, . . . , ed, m} has to belinearly dependent, so there is a nontrivial R-linear relation

a1e1 + + aded + am = 0.Necessarily a = 0, as otherwise all the ais are 0 by linear independence of the eis. InKR M,

di=1

ai(1 ei) + a(1 m) = 0

and from the K-vector space structure on KR M we can solve for 1 m as a K-linearcombination of the 1 eis. Therefore {1ei} spans KR M as a K-vector space. This setis also linearly independent over K by the previous paragraph, so it is a basis and therefored = dimK(KR M).

While M has at most dimK(KRM) linearly independent elements and this upper boundis achieved, any spanning set has at least dimK(KR M) elements but this lower bound isnot necessarily reached. For example, ifR is not a field and M is a torsion module (e.g., R/Ifor I a nonzero proper ideal) then KR M = 0 and M certainly doesnt have a spanningset of size 0 if M

= 0. It is also not true that finiteness of dimK(K

R M) implies M is

finitely generated as an R-module. Take R = Z and M = Q, so Q Z M = Q Z Q = Q(Theorem 4.17), which is finite-dimensional over Q but M is not finitely generated over Z.

The maximal number of linearly independent elements in an R-module M, for R a do-main, is called the rank of M.12 This use of the word rank is consistent with its usagefor finite free modules: if M is free with an R-basis of size n then KR M has a K-basisof size n by Theorem 6.7.

12When R is not a domain, this concept of rank for R-modules is not quite the right one.


33/71

TENSOR PRODUCTS 33

Example 6.13. A nonzero ideal I in a domain R has rank 1. We can see this in two ways.First, any two nonzero elements in I are linearly dependent over R, so the maximal numberof R-linearly independent elements in I is 1. Second, KR I = K as K-vector spaces (inTheorem 4.19 we showed they are isomorphic as R-modules, but that isomorphism is also

K-linear; check!), so dimK(KR I) = 1.Example 6.14. A finitely generated R-module M has rank 0 if and only if it is a torsionmodule, since KR M = 0 if and only if M is a torsion module.

Since KR M = KR (M/Mtor) as K-vector spaces (the isomorphism between themas R-modules in Theorem 4.21 is easily checked to be K-linear check!), M and M/Mtorhave the same rank.

We return to general R, no longer a domain, and see how to make the tensor product ofan R-module and S-module into an S-module.

Theorem 6.15. Let M be an R-module and N be an S-module.

(1) The additive group M

R N has a unique structure of S-module such thats(m

n) =m sn for s S. This is compatible with the R-module structure in the sense thatrt = f(r)t for r R and t MR N.

(2) The S-module M R N is isomorphic to (S R M) S N by sending m n to(1 m) n.

The point of part 2 is that it shows how the S-module structure on M R N can bedescribed as an ordinary S-module tensor product once we know the idea of base extendingM to the S-module SR M.

Part 2 has both R-module and S-module tensor products. This is the first time that wemust decorate the tensor product sign explicitly. Up to now it was actually unnecessary, asall the tensor products were over R.

Proof. (1) This is similar to the proof of Theorem 6.4 (which is the special case N = S).We just sketch the idea.

Since every tensor is a sum of elementary tensors, declaring how s S scales elementarytensors in MR N determines its scaling on all tensors. To show the rule s(mn) = msnreally corresponds to an S-module structure, for each s S we consider the functionM N MR N given by (m, n

Documents

Curso Produto Tensorial de Modulos Conrad 2009