Processing math: 100%
Skip to main content

Section 3.10 Elementary matrices

We put matrices into reduced row echelon form by a series of elementary row operations. Our first goal is to show that each elementary row operation may be carried out using matrix multiplication. The matrix E=[ei,j] used in each case is almost an identity matrix. The product EA will carry out the corresponding elementary row operation on A.

Subsection 3.10.1 The three types of elementary matrices

There are three different elementary row operations: Table 2.4.4

  1. Interchanging two rows (RiRj)

  2. Multiplying a row by a scalar (RiλRi where λ0)

  3. Adding a multiple of one row to another (RiRi+λRj)

We now define an elementary matrix E=[ei,j] for each one of these operations:

Definition 3.10.1. Elementary matrices.
  1. (RiRj)

    E1=[110111101]

    In this case ei,i=ej,j=0 and ei,j=ei,j=1. All other entries are the same as those in I.

  2. (RiλRi where λ0)

    E2=[11λ11]

    In this case ei,i=λ. All other entries are the same as those in I.

  3. (RiRi+λRj)

    E3=[111λ11]

    In this case ei,j=λ. All other entries are the same as those in I.

A matrix E of any of the three types is called an elementary matrix.

Subsection 3.10.2 Elementary matrices and reduced row echelon form

We use the elementary matrices given in Definition 3.10.1. The proof is is then a careful observation of the effect of the nonzero entries in each case.

Example 3.10.3. Multiplying by an elementary matrix.

Let

A=[123456789101112131415161718192021222324252627282930]

The entries in the elementary matrices in red are the only ones that differ from an identity matrix I.

  1. Interchange rows 2 and 4 (R2R4)

    E=[1000000010001000100000001]
    EA=[123456192021222324131415161718789101112252627282930]
  2. Multiply row 4 by 3 (R43R4)

    E=[1000001000001000003000001]
    EA=[123456789101112131415161718576063666972252627282930]
  3. Subtract twice row 2 from row 4 (R4R42R2)

    E=[1000001000001000201000001]
    EA=[123456789101112131415161718543210252627282930]

Notice that Theorem 3.10.2 can be applied to a sequence of elementary row operations. If, for example, we have three elementary row operations whose corresponding elementary matrices are E1, E2 and E3, and we apply them in sequence to A, then the resulting matrix is the product E3(E2(E1(A))), or, more simply, E3E2E1A. Note that we are applying E1 first followed by E2 and finally E3.

Example 3.10.4. Reduction to reduced row echelon form by matrix multiplication.

Let A=[122245011]. We put this matrix into reduced row echelon form:

Table 3.10.5. Row reduction by matrix multiplication
Matrix Elementary Row Corresponding
Operations Matrix
A=[122245011] R2R22R1 E1=[100210001]
E1A=[122001011] R2R3 E2=[100001010]
E2E1A=[122011001] R1R12R2 E3=[120010001]
E3E2E1A=[100011001] R2R2R3 E4=[100011001]
E4E3E2E1A=[100010001]

There is a bonus from this computation: since (E4E3E2E1)A=I, we know that A(E4E3E2E1)=I and A1=(E4E3E2E1). Indeed,

[100011001][120010001][100001010][100210001]=[102211210]=A1

We shall see shortly that every elementary matrix has an inverse which itself is elementary, and so A=E11E12E13E14. Thus A is a product of elementary matrices.

Subsection 3.10.3 Inverses of Elementary Row Operation Matrices

Each matrix associated with the three elementary row operations has an inverse. While it is easy to define and verify the matrix in each case, it is useful to think of the effect of the inverse. If E is the matrix associated with an elementary row operation, then EA carries out that operation on A. Since E1EA=A, the effect of E1 is to undo the operation and return A to its original form.

  1. If E corresponds to interchanging two rows (RiRj), to undo the operation we just interchange them again. This means the E1=E.

  2. If E corresponds to multiplying a row by λ (RiλRi), then multiplying the same row by 1λ returns it to its original form. Hence E1 is formed by replacing λ by 1λ in E.

  3. If E corresponds to adding λ times row j to row i (RiRi+λRj), then subtracting λ times row j from row i (RiRiλRj) returns A to its original form. Hence E1 is formed by replacing λ by λ in E. i

We give the inverse in each case:

  1. Interchange rows (\(R_i\leftrightarrow R_j\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1\amp\amp\\ \amp\amp\amp0\amp\cdots\amp\cdots\amp\cdots\amp1\amp \\ \amp\amp\amp\vdots\amp1\amp\amp\amp\vdots \\ \amp\amp\amp\vdots\amp\amp\ddots \amp\amp\vdots\\ \amp\amp\amp\vdots\amp\amp\amp1\amp\vdots \\ \amp\amp\amp1\amp\cdots\amp\cdots\amp\cdots\amp0 \\ \amp\amp\amp\amp\amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\amp\amp\amp\amp\ddots \end{bmatrix}\) and \(E^{-1}=E\text{.}\)

  2. Multiply a row by a constant (\(R_i\leftarrow \lambda R_i\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp\lambda \\ \amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix} \) \(E^{-1}= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp\tfrac1\lambda \\ \amp\amp\amp\amp1 \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\)

  3. Add a multiple of one row to another (\(R_i\leftarrow R_i + \lambda R_j\)) \(E= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp1\amp\cdots\amp\lambda \\ \amp\amp\amp\amp1\amp\vdots \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\) and so \(E^{-1}= \begin{bmatrix} 1 \\ \amp\ddots \\ \amp\amp1 \\ \amp\amp\amp1\amp\cdots\amp-\lambda \\ \amp\amp\amp\amp1\amp\vdots \\ \amp\amp\amp\amp\amp\ddots \\ \amp\amp\amp\amp\amp\amp1 \end{bmatrix}\)

An easy observation: the inverse in each case of Theorem 3.10.6, the inverse is also an elementary matrix, and so we have proven the following theorem:

Put \(A\) into reduced row echelon form. Suppose this takes \(t\) elementary row operations. Then this implies that \(E_tE_{t-1}\cdots E_2E_1A=I.\) Let \(F_i=E_i^{-1}\) for \(i=1,\dots,t.\) Then \(F_i\) is an elementary matrix, and \(F_1F_2\cdots F_t=F_1F_2\cdots F_tI=F_1F_2\cdots F_t E_tE_{t-1}\cdots E_2E_1A=A.\)