Skip to main content

An Overview of Key Ideas

讲座摘要(lecture summary)

An overview of key ideas

This is an overview of linear algebra given at the start of a course on the mathematics of engineering.

Linear algebra progresses from vectors to matrices to subspaces.\text{Linear algebra progresses from vectors to matrices to subspaces.}

Vectors

What do you do with vectors? Take combinations.

We can multiply vectors by scalars, add, and subtract. Given vectors u\bm{u}, v\bm{v} and w\bm{w} we can form the linear combination x1u+x2v+x3w=bx_1\bm{u} + x_2\bm{v} + x_3\bm{w} = b.

An example in R3\mathbb{R}^3 would be:

u=[110],v=[011],u=[001]\bm{u} = \begin{bmatrix} 1 \\ -1 \\ 0 \end{bmatrix}, \bm{v} = \begin{bmatrix} 0 \\ 1 \\ -1 \end{bmatrix}, \bm{u} = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}

The collection of all multiples of u\bm{u} forms a line through the origin. The collection of all multiples of v\bm{v} forms another line. The collection of all combinations of u\bm{u} and v\bm{v} forms a plane. Taking all combinations of some vectors creates a subspace.

We could continue like this, or we can use a matrix to add in all multiples of w\bm{w}.

Matrices

Create a matrix AA with vectors u\bm{u}, v\bm{v} and w in its columns:

A=[100110011]A = \begin{bmatrix} 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & -1 & 1 \end{bmatrix}

The product:

Ax=[100110011][x1x2x3]=[x1x1+x2x2+x3]A\bm{x} = \begin{bmatrix} 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & -1 & 1 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2\\ x_3 \end{bmatrix} = \begin{bmatrix} x_1 \\ -x_1 + x_2 \\ -x_2 + x_3 \end{bmatrix}

equals the sum x1u+x2v+x3w=bx_1\bm{u} + x_2\bm{v} + x_3\bm{w} = b. The product of a matrix and a vector is a combination of the columns of the matrix. (This particular matrix AA is a difference matrix because the components of AxA\bm{x} are differences of the components of that vector.)

When we say x1u+x2v+x3w=bx_1\bm{u} + x_2\bm{v} + x_3\bm{w} = b we’re thinking about multiplying numbers by vectors; when we say Ax=bA\bm{x} = \bm{b} we’re thinking about multiplying a matrix (whose columns are u\bm{u}, v\bm{v} and w\bm{w}) by the numbers. The calculations are the same, but our perspective has changed.

For any input vector x\bm{x}, the output of the operation “multiplication by AA” is some vector b\bm{b}:

A[149]=[135]A\begin{bmatrix} 1 \\ 4\\ 9 \end{bmatrix} = \begin{bmatrix} 1 \\ 3\\ 5 \end{bmatrix}

A deeper question is to start with a vector b and ask “for what vectors x does Ax=bA\bm{x} = \bm{b}?” In our example, this means solving three equations in three unknowns. Solving:

Ax=[100110011][x1x2x3]=[x1x1+x2x2+x3]=[b1b2b3]A\bm{x} = \begin{bmatrix} 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & -1 & 1 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2\\ x_3 \end{bmatrix} = \begin{bmatrix} x_1 \\ -x_1 + x_2 \\ -x_2 + x_3 \end{bmatrix} = \begin{bmatrix} b_1 \\ b_2\\ b_3 \end{bmatrix}

is equivalent to solving:

x1=b1x2x1=b2x3x2=b3\begin{array}{rcl} x_1 & = & b_1 \\ x_2 - x_1 & = & b_2 \\ x_3 - x_2& = & b_3 \end{array}

We see that x1=b1x_1 = b_1 and so x2x_2 must equal b1+b2b_1 + b_2. In vector form, the solution is:

[x1x2x3]=[b1b1+b2b1+b2+b3]\begin{bmatrix} x_1 \\ x_2\\ x_3 \end{bmatrix} = \begin{bmatrix*}[r] b_1 \\ b_1 + b_2\\ b_1 + b_2 + b_3 \end{bmatrix*}

But this just says:

x=[100110111][b1b2b3]\bm{x} = \begin{bmatrix} 1 & 0 & 0 \\ 1 & 1 & 0\\ 1 & 1 & 1 \end{bmatrix} \begin{bmatrix} b_1 \\ b_2\\ b_3 \end{bmatrix}

or x=A1b\bm{x} = A^{−1}\bm{b}. If the matrix AA is invertible, we can multiply on both sides by A1A^{−1} to find the unique solution x\bm{x} to Ax=bA\bm{x} = \bm{b}. We might say that A represents a transform xb\bm{x} \to \bm{b} that has an inverse transform bx\bm{b} \to \bm{x}.

In particular, if b=[000]\bm{b}=\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix} then x=[000]\bm{x}=\begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}

The second example has the same columns u\bm{u} and v\bm{v} and replaces column vector w\bm{w}:

C=[101110011]C = \begin{bmatrix} 1 & 0 & -1 \\ -1 & 1 & 0\\ 0 & -1 & 1 \end{bmatrix}

Then:

Cx=[101110011][x1x2x3]=[x1x3x2x1x3x2]C\bm{x} = \begin{bmatrix} 1 & 0 & -1 \\ -1 & 1 & 0\\ 0 & -1 & 1 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2\\ x_3 \end{bmatrix} = \begin{bmatrix} x_1 - x_3 \\ x_2 - x_1 \\ x_3 - x_2 \end{bmatrix}

and our system of three equations in three unknowns becomes circular.

Where before Ax=0A\bm{x} = 0 implied x=0\bm{x} = 0, there are non-zero vectors x\bm{x} for which Cx=0C\bm{x} = 0. For any vector x\bm{x} with x1=x2=x3,Cx=0x_1 = x_2 = x_3, C\bm{x} = 0. This is a significant difference; we can’t multiply both sides of Cx=0C\bm{x} = 0 by an inverse to find a nonzero solution x\bm{x}.

The system of equations encoded in Cx=bC\bm{x} = \bm{b} is:

x1x3=b1x2x1=b2x3x2=b3\begin{array}{rcl} x_1 - x_3 & = & b_1 \\ x_2 - x_1 & = & b_2 \\ x_3 - x_2& = & b_3 \end{array}

If we add these three equations together, we get:

0=b1+b2+b30 = b_1 + b_2 + b_3

This tells us that Cx=bC\bm{x} = \bm{b} has a solution x\bm{x} only when the components of b sum to 0. In a physical system, this might tell us that the system is stable as long as the forces on it are balanced.

Subspaces

Geometrically, the columns of CC lie in the same plane (they are dependent; the columns of AA are independent). There are many vectors in R3\mathbb{R}^3 which do not lie in that plane. Those vectors cannot be written as a linear combination of the columns of CC and so correspond to values of b\bm{b} for which Cx=bC\bm{x} = \bm{b} has no solution x\bm{x}. The linear combinations of the columns of CC form a two dimensional subspace of R3\mathbb{R}^3.

This plane of combinations of u\bm{u}, v\bm{v} and w\bm{w} can be described as “all vectors CxC\bm{x}”. But we know that the vectors b\bm{b} for which Cx=bC\bm{x} = \bm{b} satisfy the condition b1+b2+b3=0b_1 + b_2 + b_3 = 0. So the plane of all combinations of u\bm{u} and v\bm{v} consists of all vectors whose components sum to 00.

If we take all combinations of:

u=[110],v=[011],andw=[001]\bm{u} = \begin{bmatrix} 1 \\ -1 \\ 0 \end{bmatrix}, \bm{v} = \begin{bmatrix} 0 \\ 1 \\ -1 \end{bmatrix}, \text{and}\> \bm{w} = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix}

we get the entire space R3\mathbb{R}^3; the equation Ax=bA\bm{x} = \bm{b} has a solution for every b\bm{b} in R3\mathbb{R}^3. We say that u\bm{u}, v\bm{v} and w\bm{w} form a basis for R3\mathbb{R}^3.

A basis for Rn\mathbb{R}^n is a collection of nn independent vectors in Rn\mathbb{R}^n. Equivalently, a basis is a collection of n vectors whose combinations cover the whole space. Or, a collection of vectors forms a basis whenever a matrix which has those vectors as its columns is invertible.

A vector space is a collection of vectors that is closed under linear combinations. A subspace is a vector space inside another vector space; a plane through the origin in R3\mathbb{R}^3 is an example of a subspace. A subspace could be equal to the space it’s contained in; the smallest subspace contains only the zero vector.

The subspaces of R3\mathbb{R}^3 are:

  • the origin,
  • a line through the origin,
  • a plane through the origin,
  • all of R3\mathbb{R}^3.

Conclusion

When you look at a matrix, try to see “what is it doing?”

Matrices can be rectangular; we can have seven equations in three unknowns. Rectangular matrices are not invertible, but the symmetric, square
matrix ATAA^TA that often appears when studying rectangular matrices may be invertible.