Section 3.10 Elementary matrices
Subsection 3.10.1 The three types of elementary matrices
There are three different elementary row operations: Table 2.4.4Interchanging two rows (Ri↔Rj)
Multiplying a row by a scalar (Ri←λRi where λ≠0)
Adding a multiple of one row to another (Ri←Ri+λRj)
Definition 3.10.1. Elementary matrices.
-
(Ri↔Rj)
E1=[1⋱10⋯⋯⋯1⋮1⋮⋮⋱⋮⋮1⋮1⋯⋯⋯01⋱]In this case ei,i=ej,j=0 and ei,j=ei,j=1. All other entries are the same as those in I.
-
(Ri←λRi where λ≠0)
E2=[1⋱1λ1⋱1]In this case ei,i=λ. All other entries are the same as those in I.
-
(Ri←Ri+λRj)
E3=[1⋱11⋯λ1⋮⋱1]In this case ei,j=λ. All other entries are the same as those in I.
A matrix E of any of the three types is called an elementary matrix.
Subsection 3.10.2 Elementary matrices and reduced row echelon form
Theorem 3.10.2. Carrying out row operations using matrix multiplication.
Suppose we start with a matrix A and carry out one elementary row operation to get the matrix B. Then there is an elementary matrix E1, E2, or E3 so that:
Interchange rows (Ri↔Rj) B=E1A
Multiply a row by a constant (Ri←λRi) B=E2A
Add a multiple of one row to another (Ri←Ri+λRj) B=E3A
Proof.
We use the elementary matrices given in Definition 3.10.1. The proof is is then a careful observation of the effect of the nonzero entries in each case.
Example 3.10.3. Multiplying by an elementary matrix.
Let
The entries in the elementary matrices in red are the only ones that differ from an identity matrix I.
-
Interchange rows 2 and 4 (R2↔R4)
E=[1000000010001000100000001]EA=[123456192021222324131415161718789101112252627282930] -
Multiply row 4 by 3 (R4←3R4)
E=[1000001000001000003000001]EA=[123456789101112131415161718576063666972252627282930] -
Subtract twice row 2 from row 4 (R4←R4−2R2)
E=[1000001000001000−201000001]EA=[123456789101112131415161718543210252627282930]
Example 3.10.4. Reduction to reduced row echelon form by matrix multiplication.
Let A=[122245011]. We put this matrix into reduced row echelon form:
Matrix | Elementary Row | Corresponding |
Operations | Matrix | |
A=[122245011] | R2←R2−2R1 | E1=[100−210001] |
E1A=[122001011] | R2↔R3 | E2=[100001010] |
E2E1A=[122011001] | R1←R1−2R2 | E3=[1−20010001] |
E3E2E1A=[100011001] | R2←R2−R3 | E4=[10001−1001] |
E4E3E2E1A=[100010001] |
There is a bonus from this computation: since (E4E3E2E1)A=I, we know that A(E4E3E2E1)=I and A−1=(E4E3E2E1). Indeed,
We shall see shortly that every elementary matrix has an inverse which itself is elementary, and so A=E−11E−12E−13E−14. Thus A is a product of elementary matrices.
Subsection 3.10.3 Inverses of Elementary Row Operation Matrices
Each matrix associated with the three elementary row operations has an inverse. While it is easy to define and verify the matrix in each case, it is useful to think of the effect of the inverse. If E is the matrix associated with an elementary row operation, then EA carries out that operation on A. Since E−1EA=A, the effect of E−1 is to undo the operation and return A to its original form.If E corresponds to interchanging two rows (Ri↔Rj), to undo the operation we just interchange them again. This means the E−1=E.
If E corresponds to multiplying a row by λ (Ri←λRi), then multiplying the same row by 1λ returns it to its original form. Hence E−1 is formed by replacing λ by 1λ in E.
If E corresponds to adding λ times row j to row i (Ri←Ri+λRj), then subtracting λ times row j from row i (Ri←Ri−λRj) returns A to its original form. Hence E−1 is formed by replacing λ by −λ in E. i
Theorem 3.10.6. Inverses of Elementary Matrices.
Every elementary matrix is invertible. In particular
- E−11=E1.
- E−12 has λ in E2 replaced by 1λ.
- E−13 has λ in E3 replaced by −λ.
Proof.
We give the inverse in each case:
Interchange rows (\(R_i\leftrightarrow R_j\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1\amp\amp\\ \amp\amp\amp0\amp\cdots\amp\cdots\amp\cdots\amp1\amp \\ \amp\amp\amp\vdots\amp1\amp\amp\amp\vdots \\ \amp\amp\amp\vdots\amp\amp\ddots \amp\amp\vdots\\ \amp\amp\amp\vdots\amp\amp\amp1\amp\vdots \\ \amp\amp\amp1\amp\cdots\amp\cdots\amp\cdots\amp0 \\ \amp\amp\amp\amp\amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\amp\amp\amp\amp\ddots \end{bmatrix}\) and \(E^{-1}=E\text{.}\)
Multiply a row by a constant (\(R_i\leftarrow \lambda R_i\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp\lambda \\ \amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix} \) \(E^{-1}= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp\tfrac1\lambda \\ \amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\)
Add a multiple of one row to another (\(R_i\leftarrow R_i + \lambda R_j\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp1\amp\cdots\amp\lambda \\ \amp\amp\amp\amp1\amp\vdots \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\) and so \(E^{-1}= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp1\amp\cdots\amp-\lambda \\ \amp\amp\amp\amp1\amp\vdots \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\)
Theorem 3.10.7. The Inverse of an Elementary Matrix is Elementary.
The inverse of an elementary matrix is elementary.
Theorem 3.10.8. An Invertible Matrix is a Product of Elementary Matrices.
Let A be an invertible matrix. Then A=F1F2⋯Ft where each Fi is an elementary matrix.
Proof.
Put \(A\) into reduced row echelon form. Suppose this takes \(t\) elementary row operations. Then this implies that \(E_tE_{t-1}\cdots E_2E_1A=I.\) Let \(F_i=E_i^{-1}\) for \(i=1,\dots,t.\) Then \(F_i\) is an elementary matrix, and \(F_1F_2\cdots F_t=F_1F_2\cdots F_tI=F_1F_2\cdots F_t E_tE_{t-1}\cdots E_2E_1A=A.\)