5.8. SHUR’S THEOREM 87

be a basis for Fn. Then there exists an orthonormal basis for Fn,

{u1, · · · ,un}

which has the property that for each k ≤ n,

span(x1, · · · ,xk) = span(u1, · · · ,uk) .

Proof: Let {x1, · · · ,xn} be a basis for Fn. Let u1 ≡ x1/ |x1| . Thus for k = 1, span(u1) =span(x1) and {u1} is an orthonormal set. Now suppose for some k < n, u1, · · · , uk havebeen chosen such that (u j ·ul) = δ jl and span(x1, · · · ,xk) = span(u1, · · · ,uk). Then define

uk+1 ≡xk+1−∑

kj=1 (xk+1 ·u j)u j∣∣∣xk+1−∑kj=1 (xk+1 ·u j)u j

∣∣∣ , (5.8.24)

where the denominator is not equal to zero because the x j form a basis and so

xk+1 /∈ span(x1, · · · ,xk) = span(u1, · · · ,uk)

Thus by induction,

uk+1 ∈ span(u1, · · · ,uk,xk+1) = span(x1, · · · ,xk,xk+1) .

Also, xk+1 ∈ span(u1, · · · ,uk,uk+1) which is seen easily by solving 5.8.24 for xk+1 and itfollows

span(x1, · · · ,xk,xk+1) = span(u1, · · · ,uk,uk+1) .

If l ≤ k,

(uk+1 ·ul) = C

((xk+1 ·ul)−

k

∑j=1

(xk+1 ·u j)(u j ·ul)

)

= C

((xk+1 ·ul)−

k

∑j=1

(xk+1 ·u j)δ l j

)= C ((xk+1 ·ul)− (xk+1 ·ul)) = 0.

The vectors,{

u j}n

j=1 , generated in this way are therefore an orthonormal basis becauseeach vector has unit length.

The process by which these vectors were generated is called the Gram Schmidt process.Recall the following definition.

Definition 5.8.2 An n×n matrix, U, is unitary if UU∗ = I =U∗U where U∗ is defined tobe the transpose of the conjugate of U.

Theorem 5.8.3 Let A be an n×n matrix. Then there exists a unitary matrix, U such that

U∗AU = T, (5.8.25)

where T is an upper triangular matrix having the eigenvalues of A on the main diagonallisted according to multiplicity as roots of the characteristic equation.

5.8. SHUR’S THEOREM 87be a basis for F". Then there exists an orthonormal basis for F",{u; vo ju, }which has the property that for each k <n,span (x1,--+ ,X¢) = span(uj,--+ ,Ux).Proof: Let {x,,--- ,x,} be a basis for F”. Let uy = x; /|x;|. Thus for k = 1, span (u,) =span (x;) and {u;} is an orthonormal set. Now suppose for some k <n, uy, «++, uz havebeen chosen such that (u;- uy) = 6; and span (x),--- ,X;) = span(uj,--- ,ux). Then definekxX, _ -_1 (X, °u;)U;wes Vij eet Wj) Oy (5.8.24)Xk+1 Die (Xe1 Uy) Ujwhere the denominator is not equal to zero because the x; form a basis and soXz41 ¢ span (X1,--- ,X,) = span(uy,--- , Ux)Thus by induction,Ux+1 € span (Uy, +++ Ux, Xe¢1) = Span (X1, +++ XK, Xe +1)-Also, X41 € span (uj,--+ ,Uxz,Ux41) which is seen easily by solving 5.8.24 for x,41 and itfollowsspan (x1, ute Xk, Xk+1) = span (uy,- “ , Uz, Ug+1) .Ifl<k,Mellun(Xe ‘uj) -0))JM-== Ct (Xy1-W) —(Xiu 1)j= C((Xe¢1 Wy) — (Xep1 uy) = 0.(usi-uy) = C (0% “uy) —The vectors, {u isi , generated in this way are therefore an orthonormal basis becauseeach vector has unit length.The process by which these vectors were generated is called the Gram Schmidt process.Recall the following definition.Definition 5.8.2 Ann xn matrix, U, is unitary if UU* = I =U*U where U* is defined tobe the transpose of the conjugate of U.Theorem 5.8.3 Let A be ann x n matrix. Then there exists a unitary matrix, U such thatU*AU =T, (5.8.25)where T is an upper triangular matrix having the eigenvalues of A on the main diagonallisted according to multiplicity as roots of the characteristic equation.