11.5. DIAGONALIZATION

Proof: First note that if $S$ is $n\times n$ and its columns are linearly independent, then these columns must also span all of $\mathbb{F}^n$, since otherwise there would be a vector $v$ not in the span, and adding it to the list would give $n+1$ vectors in a linearly independent set. This is contrary to Theorem 9.1.6. Thus $\operatorname{Im}(S)=\mathbb{F}^n$. Also, if $Sx=Sy$, then $S(x-y)=0$ and so $x=y$ ($S$ is one to one). Thus we can define $S^{-1}y$ to be that vector such that $S\left(S^{-1}y\right)=y$.

Then $S^{-1}$ is a linear transformation because of the following reasoning.

$$S\left(S^{-1}(ax+by)\right)=ax+by$$
$$S\left(aS^{-1}x+bS^{-1}y\right)=aS\left(S^{-1}x\right)+bS\left(S^{-1}y\right)=ax+by$$

Thus, since $S$ is one to one, as explained above, it follows that
$$S^{-1}(ax+by)=aS^{-1}x+bS^{-1}y$$

Therefore, there is a matrix, still denoted as $S^{-1}$, such that for any $x\in\mathbb{F}^n$, $S\left(S^{-1}x\right)=\left(SS^{-1}\right)x=x$. Hence $SS^{-1}=I$. By Problem 19 on Page 168, $S^{-1}S=I$ also. Alternatively, $S\left(S^{-1}S\right)=\left(SS^{-1}\right)S=S$. Hence for all $x$, $S\left(S^{-1}S\right)x=Sx$ and so $S\left(S^{-1}Sx-Ix\right)=0$; since $S$ is one to one, $S^{-1}Sx=Ix$ for all $x$, showing that $S^{-1}S=I$ also. ■

Thus if the columns of a matrix are linearly independent, then the matrix is invertible. On the other hand, if the matrix $S$ is invertible, then $Sx=0$ implies, after multiplying both sides by $S^{-1}$, that $x=0$, and so the columns of $S$ are linearly independent.
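This equivalence is easy to check numerically. The following is a minimal sketch, assuming NumPy is available; the particular $3\times 3$ matrix is an arbitrary illustration chosen here, not one from the text.

```python
import numpy as np

# An example 3x3 matrix whose columns are linearly independent
S = np.array([[1.0, 2.0, 0.0],
              [0.0, 1.0, 3.0],
              [4.0, 0.0, 1.0]])

# Linear independence of the columns: the rank equals n
assert np.linalg.matrix_rank(S) == S.shape[0]

# Hence S is invertible, and the inverse works on both sides
S_inv = np.linalg.inv(S)
print(np.allclose(S @ S_inv, np.eye(3)))  # True: S S^{-1} = I
print(np.allclose(S_inv @ S, np.eye(3)))  # True: S^{-1} S = I
```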

Theorem 11.5.3 An $n\times n$ matrix $A$ is diagonalizable if and only if $\mathbb{F}^n$ has a basis of eigenvectors of $A$. Furthermore, you can take the matrix $S$ described above to be given as
$$S=\begin{pmatrix} s_1 & s_2 & \cdots & s_n \end{pmatrix}$$
where the $s_k$ are the eigenvectors in the basis for $\mathbb{F}^n$. If $A$ is diagonalizable, the eigenvalues of $A$ are the diagonal entries of the diagonal matrix.

Proof: To say that $A$ is diagonalizable is to say that
$$S^{-1}AS=\begin{pmatrix} \lambda_1 & & \\ & \ddots & \\ & & \lambda_n \end{pmatrix}$$
the $\lambda_i$ being elements of $\mathbb{F}$. This is to say that for $S=\begin{pmatrix} s_1 & \cdots & s_n \end{pmatrix}$, $s_k$ being the $k^{th}$ column,
$$A\begin{pmatrix} s_1 & \cdots & s_n \end{pmatrix}=\begin{pmatrix} s_1 & \cdots & s_n \end{pmatrix}\begin{pmatrix} \lambda_1 & & \\ & \ddots & \\ & & \lambda_n \end{pmatrix}$$
which is equivalent, from the way we multiply matrices, to
$$\begin{pmatrix} As_1 & \cdots & As_n \end{pmatrix}=\begin{pmatrix} \lambda_1 s_1 & \cdots & \lambda_n s_n \end{pmatrix}$$
which is to say that the columns of $S$ are eigenvectors and the diagonal matrix has the corresponding eigenvalues down the main diagonal. Since $S$ is invertible, its columns, these eigenvectors, form a basis. Conversely, if there is a basis of eigenvectors, one can take them as the columns of $S$ and reverse the above steps, finally concluding that $A$ is diagonalizable. ■
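To illustrate the theorem concretely (an example chosen for this sketch, not one appearing in the text), consider
$$A=\begin{pmatrix} 3 & 1 \\ 0 & 2 \end{pmatrix},\qquad A\begin{pmatrix} 1 \\ 0 \end{pmatrix}=3\begin{pmatrix} 1 \\ 0 \end{pmatrix},\qquad A\begin{pmatrix} -1 \\ 1 \end{pmatrix}=2\begin{pmatrix} -1 \\ 1 \end{pmatrix}.$$
These two eigenvectors form a basis of $\mathbb{F}^2$, so taking them as the columns of $S$,
$$S=\begin{pmatrix} 1 & -1 \\ 0 & 1 \end{pmatrix},\qquad S^{-1}=\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix},\qquad S^{-1}AS=\begin{pmatrix} 3 & 0 \\ 0 & 2 \end{pmatrix},$$
the eigenvalues $3$ and $2$ appearing on the main diagonal in the same order as their eigenvectors appear among the columns of $S$.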
