Kenneth Kuttler

22.2. BASIC DEFINITIONS 711

Definition 22.2.1 A function g is o(v) if

lim||v||→0

g(v)||v||

= 0 (22.2.8)

A function f : U → Y is differentiable at x ∈U if there exists a linear transformation L ∈L (X ,Y ) such that

f(x+v) = f(x)+Lv+o(v)

This linear transformation L is the definition of Df(x). This derivative is often called theFrechet derivative.

In finite dimensions, the question whether a given function is differentiable is inde-pendent of the norm used on the finite dimensional vector space. That is, a function isdifferentiable with one norm if and only if it is differentiable with another norm. This isbecause all norms are equivalent on a finite dimensional space.

The definition 22.2.8 means the error,

f(x+v)− f(x)−Lv

converges to 0 faster than ||v||. Thus the above definition is equivalent to saying

lim||v||→0

||f(x+v)− f(x)−Lv||||v||

= 0 (22.2.9)

or equivalently,

limy→x

||f(y)− f(x)−Df(x)(y−x)||||y−x||

= 0. (22.2.10)

The symbol o(v) should be thought of as an adjective. Thus, if t and k are constants,

o(v) = o(v)+o(v) , o(tv) = o(v) , ko(v) = o(v)

and other similar observations hold.

Theorem 22.2.2 The derivative is well defined.

Proof: First note that for a fixed vector v, o(tv) = o(t). This is because

limt→0

o(tv)|t|

= limt→0||v|| o(tv)||tv||

= 0

Now suppose both L1 and L2 work in the above definition. Then let v be any vector and lett be a real scalar which is chosen small enough that tv+x ∈U . Then

f(x+ tv) = f(x)+L1tv+o(tv) , f(x+ tv) = f(x)+L2tv+o(tv) .

Therefore, subtracting these two yields (L2−L1)(tv) = o(tv) = o(t). Therefore, dividingby t yields (L2−L1)(v) = o(t)

t . Now let t → 0 to conclude that (L2−L1)(v) = 0. Sincethis is true for all v, it follows L2 = L1. This proves the theorem.