Академический Документы
Профессиональный Документы
Культура Документы
Scalar Multiplication
By a scalar we shall mean a number, as opposed to a
matrix or array of numbers. Let c be a scalar and A be an
𝑚 × 𝑛 matrix. The scalar multiple cA is the 𝑚 × 𝑛 matrix
whose (𝑖, 𝑗) entry is 𝑐𝑎𝑖𝑗 . Thus to form cA we multiply every
entry of A by the scalar c. The matrix (−1)𝑨 is usually
written –A; it is called the negative of A since it has the
property that 𝑨 + (−𝑨) = 𝟎.
Example
If
1 2 0 1 1 1
𝑨= and 𝑩 = ,
−1 0 1 0 −3 1
Then
5 7 3 −1 1 −3
2𝑨 + 3𝑩 = and 2𝑨 − 3𝑩 =
−2 −9 5 −2 9 −1
Matrix Multiplication
It is less obvious what the “natural” definition of the
product of two matrices should be. Let us start with the
simplest interesting case, and consider a pair of 2 × 2
matrices
𝑎11 𝑎12 𝑏11 𝑏12
𝑨= 𝑎 𝑎22 and 𝑩 = .
21 𝑏21 𝑏22
In order to motivate the definition of the matrix product 𝑨𝑩
we consider two sets of linear equations
𝑎11 𝑦1 𝑎12 𝑦2 = 𝑥1 𝑏11 𝑧1 𝑏12 𝑧2 = 𝑦1
+
𝑎21 𝑦1 𝑎22 𝑦2 = 𝑥2 and +
𝑏21 𝑧1 𝑏22 𝑧2 = 𝑦2
Observe that the coefficient matrices of these linear
systems are 𝑨 and 𝑩 respectively. We shall think of these
equations as representing changes of variables from 𝑦1 , 𝑦2
to 𝑥1 , 𝑥2 , and from 𝑧1 , 𝑧2 to 𝑦1 , 𝑦2 respectively.
Suppose that we replace 𝑦1 and 𝑦2 in the first set of
equations by the values specified in the second set. After
simplification we obtain a new set of equations.
𝑎11 𝑏11 + 𝑎12 𝑏21 𝑧1 + 𝑎11 𝑏12 + 𝑎12 𝑏22 𝑧2 = 𝑥1
𝑎21 𝑏11 + 𝑎22 𝑏21 𝑧1 + 𝑎21 𝑏12 + 𝑎22 𝑏22 𝑧2 = 𝑥2
This has coefficient matrix
𝑎11 𝑏11 + 𝑎12 𝑏21 𝑎11 𝑏12 + 𝑎12 𝑏22
𝑎21 𝑏11 + 𝑎22 𝑏21 𝑎21 𝑏12 + 𝑎22 𝑏22
and represents a change of variables from 𝑧1 , 𝑧2 to 𝑥1 , 𝑥2
which may be thought of as the composite of the original
changes of variables.
At first sight this new matrix looks formidable. However, it
is in fact obtained from A and B in quite a simple fashion,
namely by the row-times-column rule. For example, the
(1, 2) entry arises from multiplying corresponding entries of
row 1 of A and column 2 of B, and then adding the
resulting numbers; thus
𝑏12
𝑎11 𝑎12 ⟶ 𝑎11 𝑏12 + 𝑎12 𝑏22 .
𝑏22
Other entries arise in a similar fashion from a row of A and
a column of B.
Having made this observation, we are now ready to define
the product AB where A an 𝑚 × 𝑛 matrix and B is an 𝑛 × 𝑝
matrix. The rule is that the 𝑖, 𝑗 entry of AB is obtained by
multiplying corresponding entries of row 𝑖 of 𝑨 and column
𝑗 of B, and then adding up the resulting products. This is
the row-times-column rule. Now row 𝑖 of A and column 𝑗 of
B are
𝑏1𝑗
𝑏2𝑗
𝑎𝑖1 𝑎𝑖2 … 𝑎𝑖𝑛 and .
⋮
𝑏𝑛𝑗
Example
Let
0 2 0
2 1 −1
𝑨= and 𝑩 = 1 1 1
3 0 2
1 5 −1
Since A is 2 × 3 and B is 3 × 3, we see that AB is defined
and is a 2 × 3 matrix. However BA is not defined. Using the
row-times-columns rule, we quickly find that
0 0 2
𝑨𝑩 =
2 16 −2
Example
Let
0 1 1 1
𝑨= and 𝑩 = .
0 0 0 0
In this case both AB and BA are defined, but these
matrices are different:
0 0 0 1
𝑨𝑩 = = 022 and 𝑩𝑨 = .
0 0 0 0
Thus already we recognize some interesting features of
matrix multiplication. The matrix product is not
commutative, that is, AB and BA may be different when
both are defined; also the product of two non-zero matrices
can be zero, a phenomenon which indicates that any theory
of division by matrices will face considerable difficulties.
Conformable Matrices: The matrices A and B are
conformable if their product AB is defined, that is, if the
number of columns of A is equal to the number of rows of
B.
Powers of a Matrix
Once matrix products have been defined, it is clear how to
define a non-negative power of a square matrix. Let 𝑨 be an
𝑛 × 𝑛 matrix. Then the 𝑚 𝑡ℎ power of 𝑨, where 𝑚 is a non-
nagative integer, is defined by the equations
𝑨0 = 𝑰𝑛 and 𝑨𝑚 +1 = 𝑨𝑚 𝑨
This is an example of a recursive definition: the first
equation specifies 𝑨0 , while the second shows how to define
𝑨𝑚 +1 , under the assumption that 𝑨𝑚 has already been
defined. Thus 𝑨1 = 𝑨, 𝑨2 = 𝑨𝑨, 𝑨3 = 𝑨2 𝑨 etc… We do not
attempt to define negative powers at this juncture.
Example
0 1
Let 𝑨 = .
−1 0
−1 0 0 −1 1 0
Then 𝑨2 = , 𝑨3 = and 𝑨4 = .
0 −1 1 0 0 1
The reader can verify that higher powers of 𝑨 do not lead to
new matrices in this example. Therefore 𝑨 has just four
distinct powers, 𝑨0 = 𝐼2 , 𝑨1 = 𝑨, 𝑨2 and 𝑨3 .
𝑝
𝑨 = det 𝑨 ≡ −1 𝑎1𝑖1 , 𝑎2𝑖2 , … , 𝑎𝑚 𝑖𝑚
𝑨−1
for the unique inverse of an invertible matrix 𝑨.
Theorem
a. If 𝑨 is an invertible matrix, then 𝑨−1 is invertible and
𝑨−1 −1 = 𝑨
b. If 𝑨 and 𝑩 are invertible matrices of the same size, the
𝑨𝑩 is ivertible and 𝑨𝑩 −1 = 𝑩−1 𝑨−1
Proof:
a. Certainly we have 𝑨𝑨−1 = 𝑰 = 𝑨−1 𝑨, equations which
can be viewed as saying that 𝑨 is an inverse of 𝑨−1 .
Therefore, since 𝑨−1 cannot have more than one
inverse, its inverse must be 𝑨.
b. To prove the assertions we have only to check that
𝑩−1 𝑨−1 is an inverse of 𝑨𝑩. This is easily done:
𝑨𝑩 𝑩−1 𝑨−1 = 𝑨 𝑩𝑩−1 𝑨−1 , by two applications of the
associative law (we will discuss in the next module);
the latter matrix equals 𝑨𝑰𝑨−1 = 𝑨𝑨−1 = 𝑰. Similarly
𝑩−1 𝑨−1 𝑨𝑩 = 𝑰. Since inverses are unique, 𝑨𝑩 −1 =
𝑩−1 𝑨−1 .
0 𝑓𝑜𝑟 𝑖 ≠ 𝑗
𝑎𝑖𝑗 = 𝑐 𝑓𝑜𝑟 𝑖 = 𝑗 = 𝑘
1 𝑓𝑜𝑟 𝑖 = 𝑗 ≠ 𝑘