Manifolds, Tensor Analysis, and Applications, Second Edition (Applied Mathematical Sciences 75)

R. Abraham J.E. Marsden T. Ratiu Manifolds, Tensor Analysis, and Applications Second Edition Springer-Verlag New Yor...

Author: Ralph Abraham | Jerrold E. Marsden | Tudor Ratiu

18 downloads 576 Views 29MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

R. Abraham J.E. Marsden

T. Ratiu

Manifolds, Tensor Analysis, and Applications Second Edition

Springer-Verlag New York Berlin Heidelberg London Paris Tokyo

Ralph Abraham Department of Mathematics University of CaliforniaSanta Cruz Santa Cruz. CA 95064 U.S.A.

J.E. Marsden Department of Mathematics University of CaliforniaBerkeley Berkeley. CA 94720 U.S.A. and Department of Mathematics Cornell University Ithaca. NY 14853-7901 U.S.A.

Tudor Ratiu Department of Mathematics University of CaliforniaSanta Cruz Santa Cruz, CA 95064 U.S.A.

AMS Subject Classifications (l9l!0): 34, 58. 70. 76. 93

Library of Congress Cataloging-in-Publication Data Abraham. Ralph Manifolds. tensor analysis, and applications. Second Edition (Applied Mathematical Sciences; v. 75) Bibliography: p. 631 Indudes index. I. Global analysis (Mathematics) 2. Manifolds (Mathematics) 3. Calculus of tensors. I. Marsden. Jerrold E. II. Ratiu. Tudor S. Ill. Title. IV. Series. QA614.A28 1983514.382-1737 ISBN 0-387-96790-7 First edition published by Addison-Wesley Publishing Company «) 1983 ~) 1988 by Springer-Verlag New York Inc. All rights reserved. No pan of this book may be translated or reproduced in any fortn without written permission from Springer-Verlag. 175 Fifth Avenue, New York. New York 10010. U.S.A.

Printed and bound by R.R. Donnelley and Sons. Harrisonburg. Virginia Printed in the United States of America. 987654321 ISBN 0-387-96790-7 Springer-Verlag New York Berlin Heidelberg ISBN 3-540-96790-7 Springer-Verlag Berlin Heidelberg New York

Preface

The purpose of this book is to provide core material in nonlinear analysis for mathematicians, physicists, engineers, and mathematical biologists. The main goal is to provide a working knowledge of manifolds, dynamical systems, tensors, and differential forms. Some applications to Hamiltonian mechanics, fluid mechanics, electromagnetism, plasma dynamics and control theory arc given in Chapter 8, using both invariant and index notation. The current edition of the book does not deal with Riemannian geometry in much detail, and it docs not treat Lie groups, principal bundles, or Morse theory. Some of this is planned for a subsequent edition. Meanwhile, the authors will make available to interested readers supplementary chapters on Lie Groups and Differential Topology and invite comments on the book's contents and development. Throughout the text supplementary topics are given, marked with the symbols ~ and This device enables the reader to skip various topics without disturbing the main flow of the text. Some of these provide additional background material intended for completeness, to minimize the necessity of consulting too many outside references. We treat finite and intinite-dimensional manifolds simultaneously. This is partly for efficiency of exposition. Without advanced applications, using manifolds of mappings, the study of infinite-dimensional manifolds can be hard to motivate. Chapter 8 gives a hint of these applications. In fact, some readers may wish to skip the infinite-dimensional case altogether. To aid in this we have separated into supplements some of the technical points peculiar to the infinite-dimensional case. Our own research interests lean toward physical applications, and the choice of topics is partly molded by what is useful for this kind of research. We have tried to be as sympathetic to our readers as possible by providing ample examples, exercises, and applications. When a computation in coordinates is easiest, we give it and do not hide things behind complicated invariant notation. On the other hand, index-free notation sometimes provides valuable geometric and comPUtational insight so we have tried to simultaneously convey this flavor. The prerequisites required are solid undergraduate courses in linear algebra and advanced calculus. At various points in the text contacts are made with other SUbjects, providing a good way for students to link this material with other courses. For example, Chapter I links with point-set topology, parts of Chapter 2 and 7 are connected with functional analysis, Section 4.3 relates to ordinary differential equations, Chapter 3 and Section 7.5 are linked to differential topology and algebraic topology, and Chapter 8 on applications is connected with applied Illathematics, physics, and engineering.

tl::n.

This book is intended to be used in courses as well as for reference. The sections are, as far as possible, lesson sized, if the supplementary material is omitted. For some sections, like 2.5,4.2, or 7.5, two lecture hours are required. A standard course for mathematics graduate students could omit Chapter I and the supplements entirely and do Chapters 2 through 7 in one semester with the possible exception of Section 7.4. The instructor could then assign certain supplements for reading and choose among the applications of Chapter 8 according to taste. A shorter course, or a course advanced undergraduates, probably should omit all supplements, spend about two lectures on Chapter I for reviewing background point set topology, and cover Chapters 2 through 7 with the exception of Sections 4.4, 7.4, 7.5 and all the material relevant to volume elements induced by metrics, the Hodge star, and codifferential operators in Sections 6.2, 6.4. 6.5. and 7.2. A more applications oriented course could skim Chapter I. review without proofs the material of Chapter 2, and cover Chapters 3 to 8 omitting the supplementary material and Sections 7.4 and 7.5. For such a course the instructor should keep in mind that while Sections 8.1 and 8.2 use only elementary material. Section 8.3 relies heavily on the Hodge star and codifferential operators. and Section 8.4 consists primarily of applications of Frobenius' theorem dealt with in Section 4.4. The notation in the book is as standard as conflicting usages in the literature allow. We have had to compromise among utility, clarity. clumsiness. and absolute precision. Some possible notations would have required too much interpretation on the part of the novice while others. while precise. would have been so dressed up in symbolic decorations that even an expert in the field would not recognize them. In a subject as developed and extensive as this one, an accurate history and crediting of theorems is a monumental task. especially when so many results are folklore and reside in private notes. We have indicated some of the important credits where we know of them, but we did not undertake this task systematically. We hope our readers will inform us of these and other shortcomings of the book so that. if necessary. corrected printings will be possible. The reference list at the back of the book is confined to works actually cited in the text. These works are cited by author and year like this: deRham [19551. During the preparation of the book, valuable advice was provided by Malcolm Adams. Morris Hirsch. Charles Pugh, Alan Weinstein. and graduate students in mathematics, physics and engineering at Berkeley, Santa Cruz and elsewhere. Our other teachers and collaborators from whom we learned the material and who inspired. directly and indirectly, various portions of the text are too numerous to mention individually. so we hereby thank them all collectively. We have taken the opportunity in this edition to correct some errors kindly pointed out by our readers and to rewrite numerous sections. This book was typeset on a Macintosh using Mathwriter (Cooke Publications Inc, Ithaca, N. Y.): we thank Connie Calica. Dotty Hollinger, Mamie MacElhiny and Esther Zack for their invaluable help with the typing. We intend this book to be an evolving project. That is, we invite corrections and comments from our readers to be incorporated into future printings. We are

currently preparing some supplementary chapters and plan to include a differential topology and Lie groups chapter in the next printing-space permitting. Meanwhile, if you wish to see these chapters. we will be happy to send them to you in exchange for your comments. February. 1988 RALPH ABRAHAM JERROLD

E.

MARSDEN

TUDOR RATIL

Contents Preface Background Notation

v vii

CHAPTER I

Topology 1.1 1.2 1.3 1.4 1.5 1.6 1.7

Topological Spaces Metric Spaces Continuity Subspaces. Products. and Quotients Compactness Connectedness Baire Spaces

2 9 14 18 24 31 37

CHAPTER 2

Banach Spaces and Differential Calculus 2.1 2.2 2.3 2.4 2.5

Banach Spaces Linear and Multilinear Mappings The Derivative Properties of the Derivative The Inverse and Implicit Function Theorems

40 40 56 75 83 116

CHAPTER 3

Manifolds and Vector Bundles 3.1 3.2 3.3 3.4 3.5

Manifolds Submanifolds. Products. and Mappings The Tangent Bundle Vector Bundles Submersions. Immersions and Transversality

141 141 150 157 167 196

CHAPTER 4

Vector Fields and Dynamical Systems 4.1 4.2 4.3 4.4

Vector Fields and Flows Vector Fields as Differential Operators An Introduction to Dynamical Systems Frobenius' Theorem and Foliations

238 238 265 298 326

CHAPTER 5

Tensors 5.1 5.2 5.3 5.4 5.5

Tensors in Linear Spaces Tensor Bundles and Tensor Fields The Lie Derivative: Algebraic Approach The Lie Derivative: Dynamic Approach Partitions of Unity

338 338 349 359 370 377

CHAPTER 6

Differential Forms 6. I 6.1 6.3 6.4 6.5

Exterior Algebra Determinants. Volumes. and the Hodge Star Operator Differential Forms The Exterior Derivative. Interior Product. and Lie Derivative Orientation. Volume Elements. and the Codifferential

392 392 401 417 413 450

CHAPTER 7

Integration on Manifolds 7. I 7.1 7.3 7.4 7.5

The Definition of the Integral Stokes' Theorem The Classical Theorems of Green. Gauss. and Stokes Induced Flows on Function Spaces and Ergodicity Introduction to Hodge-deRham Theory and Topological Applications of Differential Forms

464 464 476 504 513 53X

CHAPTER 8

Applications 8.1 X.2 8.3 8.3 8.4

Hamiltonian Mechanics Fluid Mechanics Electromagnetism The Lie-Poisson Bracket in Continuum Mechanics and Plasma Physics Constraints and Control

560 560 5X4 599 609 614

References

63 I

Index

643

Supplementary Chapters-Available from the authors as they are produced S-I Lie Groups S-2 Introduction to Differential Topology S-3 Topics in Riemannian Geometry

Background Notation The reader is assumed to be familiar with the usual notations of set theory such as E, C, U, n and with the concept of a mapping. If A and Bare scts and if f: A~B is a mapping, we write a ~ na) for the effect of the mapping on the element of a E A; "iff' stands for "if and only if" (= "if" in delinitions). Other notations we shall use without explanation include the following:

•• IR,C

Z.O A X B IRn. Cn (x I • . . . , xn) E IRn

AcB A"B I or Id C'(B) fr = {(x.f(x)) inf A sup A e), ... ,

en

ker T, range T D,(m)

B,(m)

IxE

domain of f}

end of an example or remark end of a proof proof of a lemma is done, but the proof of the theorem goes on real. complex numbers integers. rational numbers Cartesian product Euelidean n-spaee, complex n-space point in IR n set theoretic containment (means same as A ~ B) set theoretic difference identity map inverse image of B under f graph of I' infinimum (greatest 'lower bound) of the set A c IR supremum (least upper bound) of A c IR basis of an n-dimensional vector space kernel and range of a linear transformation T open ball about m of radius r closed ball of radius r (also denoted O,(m)).

Chapter 1 Topology The purpose of this chapter is to introduce just enough topology for later requirements. It is assumed that the reader has had a course in advanced calculus and so is acquainted with open, closed, compact, and connected sets in Euclidean space (see for example Marsden [1974aJ and Rudin [1976]). If this background is weak, the reader may find the pace of this chapter too fast. If the background is under control, the chapter should serve to collect, review, and solidify concepts in a more general context. Readers already familiar with point set topology can safely skip this chapter. A key concept in manifold theory is that of a differentiable map between manifolds. However, manifolds are also topological spaces and differentiable maps are continuous. Topology is the study of continuity in a general context; it is therefore appropriate to begin with it. Topology often involves interesting excursions into pathological spaces and exotic theorems. Such excursions are deliberately minimized here. The examples will be ones most relevant to later developments, and the main thrust will be to obtain a working knowledge of continuity, connectedness, and compactness. We shall take for granted the usual logical structure of analysis without much comment, except to recall one of the basic axioms that is in common use and an equivalent resuit. These will be used occasionally in the text.

Axiom of choice If S is a collection of nonempty sets, then there is afunction X : S --+ USESS such that X(S) E S for every S E S. The function X chooses one element from each S E S and is called a choice function. Even though this statement seems self-evident, it has been shown to be equivalent to a number of nontrivial statements, using other axioms of set theory. To discuss them, we need a few definitions. An order on a set A is a binary relation, usually denoted by "~" satisfying the fOllowing conditions: a~a

(reflexivity)

a ~ b and b ~ a implies a =b

(antisymmetry), and

a ~ b and b ~ c implies a ~ c

(transitivity).

Chapter 1 Topology

2

An ordered set A is called a chain if for every a, b E A, a

*b

we have a:5 b or b:5 a.

The set A is said to be well ordered if it is a chain and every nonempty subset B has a first element; i.e., there exists an element bE B such that b:5 x for all x E B. An upper bound u E A of a chain C

C

A is an element for which c:5 u for all c E C. A maximal element m of an

ordered set A is an element for which there is no other a E A such that m:5 a, a

* m;

in other

words x:5 m for all x E A that are comparable to m. We state the following without proof.

Theorem Given other axioms of set theory, the following statements are equivalent: (i) The axiom of choice. (ii) Product Axiom If {Ai liE! is a collection of nonempty sets then the product space

IT iE! A; = {(xi) I Xi E A;} is nonempty. (iii) Zermelo's Theorem Any set can be well ordered. (iv) Zorn's Theorem If A is an ordered setforwhich every chain has an upper bound

(i.e., A is inductively ordered), then A has at least one maximal element.

~1.1

Topological Spaces

Abstracting ideas about open sets in JRn leads to the notion of a topological space.

1.1.1 Definition A topological space is a set S together with a collection 0 of subsets called open sets such that T1

0 E 0 and S EO;

T2 T3

if VI' V 2 E 0, then VI n V 2 EO; the union of any collection of open sets is open.

A basic example is the real line. We choose S =JR, with 0 consisting of all sets that are unions of open intervals. As exceptional cases, the empty set 0 E 0 and JR itself belong to 0. Thus T1 holds. For T2, let VI and V 2 EO; to show that VI n V 2 E 0, we can suppose that

V I n V2

* 0.

If x E V I

n V 2,

then x lies in an open interval lal' b l [

C

V I and also in the

V 2 • We can write lal' b l [ n l~, b 2 [ = la, b[ where a =max(ap~) and b = min(bl' b2 Thus x E la, b[ C VI n V 2 . Hence VI n V 2 is the union of such intervals, so is open. Finally, T3 is clear by definition. interval

l~,

».

b2 [

C

Similarly, JRn may be topologized by declaring a set to be open if it is a union of open rectangles. An argument similar to the one just given for JR shows that this the standard topology on JRn.

i~

a topology, called

The trivial topology on a set S consists of 0 = {0, S}. The discrete topology on S is defined by 0 = {A I A C S}; i.e., 0 consists of all subsets of S. Topological spaces are specified by a pair (S, 0); we shall, however, simply write S if there is no danger of confusion.

Section 1.1 Topological Spaces

1.1.2 Definition

3

Let S be a topological space. A set A c S will be called closed if its

complement S \ A is open. The collection of closed sets is denoted C.

For example, the closed interval [0, 1) ]-"", O[ U ]1,

C

lR is closed as it is the complement of the open set

00 [ .

1.1.3 Proposition The closed sets in a topological space satisfy: C1 0 E C and SEC; C2 if AI' A2 E C then Al U A2 E C; C3 the intersection of any collection of closed sets is closed. Proof C1 follows from T1 since 0

= S \ S, S = S \ 0.

The relations

for {BJiEI a family of closed sets show that C2, C3 are equivalent to T2, T3, respectively .• Closed rectangles in lRn are closed sets, as are closed balls, one-point sets, and spheres. Not every set is either open or closed. For example, the interval [0, 1[ is neither an open nor a closed set. In a discrete topology on S any set A C S is both open and closed, whereas in the trivial topology any A*,0 or S is neither. Closed sets can be used to introduce a topology just as well as open ones. Thus, if C is a collection satisfying C1-C3 and 0 consists of the complements of sets in Co then 0 satisfies T1-T3.

1.1.4 Definition An open neighborhood of a point u in a topological space S is an open set U E U. Similarly,for a subset A of s, V is an open neighborhood of A if V is open and A C V. A neighborhood of a point (or a subset) is a set containing some open neighborhood of the point (or subset). such that u

Examples of neighborhoods of x E lR are )x - 1, x + 3), ]x - E, X + E[ for any E> 0, and lR itself; only the last two are open neighborhoods. The set lx, x + 2[ contains the point x but is not one of its neighborhoods. In the trivial topology on a set S, there is only one neighborhood of any point, namely S itself. In the discrete topology any subset containing p is a neighborhood of the point PES, since {p} is an open set.

1.1.5 Definition A topological space is calledfirst countable iffor each u E S there is a sequence {VI' V 2 , .•• } = {Un} of neighborhoods of u such that for any neighborhood V of u, there is an integer n such that Un C U. A subset 'E of 0 is called a basis for the topology. if each open Set is a union of elements in 'E. The topology is called second countable ifit has a countable basis.

Chapter 1 Topology

4

Most topological spaces of interest to us will be second countable. For example IRn is second countable since it has the countable basis formed by rectangles with rational side length and centered at points all of whose coordinates are rational. Clearly every second-countable space is also first countable. but the converse is false. For example if S is an infinite noncountable set. the discrete topology is not second countable. but S is first countable. since {p} is a neighborhood of pES. The trivial topology on S is second countable (see Exercises 1.11, I.U for more interesting counter-examples). 1.1.6 Lindelofs Lemma Every covering of a set A in a second countable space S byafamily of open sets U. (that is

U. U. ::::l A)

contains a countable subcollection also covering A.

Proof Let r.B = {Bn} be a countable basis for the topology of S. For each pEA there are indices n and ex such that p E Bn c: Ua . Let r.B' = {Bn I there exists an ex such that Bn c: U a } . Now let Ua(n) be one of the Ua that includes the element Bn of r.B' . Since r.B' is a covering of A. the countable collection (Ua(n)} covers A. • 1.1.7 Definition Let S be a topological space and A c: S. The closure of A. denoted cl(A) is the intersection of all closed sets containing A. The interior of A. denoted int(A) is the union of all open sets contained in A. The boundary of A. denoted bd(A) is defined by

bd(A) = cl(A) n cl(S \ A). By C3. cl(A) is closed and by T3. int(A) is open. Note that as bd(A) is the intersection of closed sets. bd(A) is closed. and bd(A) = bd(S \ A) On R. for example. cl([O. I[) = [0. I], int([O. I[)

= ]0.1[.

and bd([O. I[)

= {O.

I}. The

reader is assumed to be familiar with examples of this type from advanced calculus. 1.1.8 Definitions A subset A of S is called dense in S if cl(A) = S. and is called nowhere dense if S \ cl(A) is dense in S. The space S is called separable if it has a countable dense subset. A point in S is called an accumulation point of the set A if each of its neighborhoods contains a point of A other than itself. The set of accumulation points of A is called the derived set of A and is denoted by der(A). A point of A is said to be isolated if it has a neighborhood in S containing no other points of A than itself.

The set A = [0.1 [ U {2} in IR has the element 2 as its only isolated point. its interior is int(A) int{p}

= ]0. 1[.

cl(A)

= [0. 1]

= cl{p} = {p}, for any

U {2} and der(A) pES.

= [0. 1].

In the discrete topology on a set S.

Since the set Q of rational numbers is dense in IR and is countable. IR is separable. Similarly IRn is separable. A set S with the trivial topology is separable since cl {p} = S for any pES. But S = lR with the discrete topology is not separable since cl(A) = A for any A c: S. Any second-countable space is separable. but the converse is false; see Exercises 1.11. I.U.

5


1.1.9 Proposition Let S be a topological space and A

(i) u

E

S. Then

C

'* 0;

cl(A) ifffor every neighborhood U of u. unA

(ii) u E int(A) iff there is a neighborhood U of u such that U c A; (iii) u E bd(A) ifffor every neighborhood U of u. unA Proof (i) u 11' cl(A) iff there exists a closed set C

=:J

'* 0

and un (S\A)

'* 0.

A such that u 11' C. But this is equivalent to

the existence of a neighborhood of u not intersecting A. namely S \ C. (ii) and (iii) are proved in a similar way. • 1.1.10 Proposition Let A. B and Ai' i E I be subsets of S. (i) A c B implies int(A) C int(B). cl(A) C cl(B), and der(A)

C

der(B);

(ii) S \ cl(A) = int(S \ A). S \ int(A) = cl(S \ A). and cl(A) = A U der(A); (iii) cl(0)

= int(0) = 0.

cl(S)

= int(S) = S,

= cl(A)

cl(cl(A»

and int(int(A»

= int(A);

(iv) cl(A U B) = cl(A) U cl(B). der(A U B) = der(A) U der(B). int(A U B)::> int(A) U int(B); (v) cl(A n B)

C

cl(A) n cl(B). der(A n B)

(vi) cl(U jEi A)::> U jEi cl(A j). int(U jEi A)::> U jEi int(A). Proof

C

der(A) n der(B). int(A n B) = int(A) n int(B);

cl(n jEi A)

C

int(njEiA j)

C

n jEi cl(Aj). n jEi int(A j).

(i). (ii). and (iii) are consequences of the definition and of Proposition 1.1.9. Since for

each i E I. Aj

C U jEi A j• by (i) cl(Aj) C cl(U jEi Aj) and hence U jE1 cl(Aj) C cl(U jEi A). Similarly. since n jE1 Aj C Aj C cl(Aj) for each i E I. it follows that n jEi cl(A) is a subset of the closet set njEicl(A); thus by (i) cl(n jEi A) C cl(n jE1 cl(A) = n jEi (cl(A).The other formulas of (vi) follow from these and (ii). This also proves all the other formulas in (iv) and (v)

except the ones with equalities. Since cl(A) U cl(B) is closed by C2 and A U B C cl(A) U cl(B). it follows by (i) that cl(A U B) C cl(A) U cl(B) and hence equality by (vi). The formula int(A n B) = int(A) n int(B) is a corollary of the previous formula via (ii) . • The inclusions in the above proposition can be strict. For example. if we let A =10.1 [ and B = [1.2[. then one finds cl(A) = der(A) = [0. IJ. cl(B) = der(B) = [1. 21. int(A) = 10. 1[, int(B) = 11. 2[, AU B = 10.2[. and An B = 0. and therefore int(A) U int(B)

'* 10. 2[ = int(A U B). and

cl(A n B) = 0

= 10.I[ U]1. 2[ = I-lin. I/n[. n = 1. int(nn~lAn) = 0 '* (OJ = nn~lint(An)'

'* {I} = cl(A) n cl(B).

Let An

2.... ; then nn~l An = {O}, int(An) = An for all n. and DUalizing this via (ii) gives Un~lcl(~ \An) = lR \ (OJ lR = cl(Un"'l(lR \ An»' If A C B. there is. in general. no relation between the sets bd(A) and bd(B). For example. if A = [0. 11 and B =: [0,21, A C B, yet we have bd(A) = (0. I} and bd(B) = (0.2}.

'*

1.1.11 Definition Let S be a topological space and {un} a sequence of points in S. The sequence is said to converge if there is a point u E S such that for every neighborhood U of u, there is an N such that n ~ N implies un E U. We say that un converges to u, or u is a limit POint of {un}'

Chapter I Topology

6

For example, the sequence (l/n) in

~

converges to O. It is obvious that limit points of

sequcnces un of distinct points are accumulation points of the set (un)' In a first countable topological space any accumulation point of a set A is a limit of a sequence of clements of A. Indeed, if (Un) denotes the countable collection of neighborhoods of a E der(A) given by dcfinition 1.1.5, then choosing for each n an clemcnt an E Un n A such that an'" a, we see that (an) converges to a. We have proved the following. 1.1.12 Proposition Let S be afirst-countable space and A

C

S. Then u E c1(A)

iff there is a

sequence of points of A that converges to u (in the topology of S). It should be noted that a sequencc can be divcrgent and still have accumulation points. For cxample (2, 0, 3/2, -1/2, 4/3, -2/3, ... ) does not convcrge but has both 1 and -1 as accumulation points. In arbitrary topological spaces, limit points of sequences are in general not unique. For example, in the trivial topology of S any sequence converges to all points of S. In order to avoid such situations several separation axioms have been introduced, of which the three most important ones will be mentioned. 1.1.13 Definition A topological space S is called Hausdorff if each two distinct points have

disjoint neighborhoods (that is, with empty intersection). The space S is called regular if it is Hausdorff and if each closed set and point not in this set have disjoint neighborhoods. Similarly, S is called normal if it is Hausdorff and if each two disjoint closed sets have disjoint neighborhoods. Most standard spaces in analysis are normal. The discrete topology on any set is normal, but the trivial topology is not even Hausdorff. It turns out that "Hausdorff' is the necessary and sufficient condition for uniqueness of limit points of sequences in first countable spaces (sec Exercise I.IE). Since in Hausdorff space single points arc closed (Exercise I.IF), we havc thc implications: normal

~

rcgular

~

Hausdorff. Counterexamples for each of the converscs of

these implications are given in Exercises 1.11 and 1.1J. 1.1.14 Proposition A regular second-countable space is normal. Proof

Let A and B be two disjoint closed sets in S. By regularity. for every point pEA

therc are disjoint open neighborhoods Up of p and U B of B. Hence c1(Up) n B = 0. Sincc {U pip E A} is an open covering of A, by the Lindelof lemma (1.1.6). there is a countable collection {Uklk =1.2 .... } covering A. Thus Uk~IUk~A and c1(U k )nB=0. Similarly. find a family (V k) such that Uk;,Q Vk ~ B and c1(Vk) n A = 0. Thcn the sets 0n+1 = Un+1 \ Uk = o.I ....ncl (V k). Hn = Vn \ Uk = o.I .... ,ncl(U k). 0 0 = Uo. are open and 0:: U n~OOn ~ A. H = U n~OHn ~ B are also open and disjoint. • In the remainder of this book Euclidean n-space IRn will be undcrstood to have the standard topology (unless explicitly stated to the contrary).


7

Exercises 1.1A Let A=(x,y,z)E:R 3 10o;;x O}

U

{p}, ifp = (x, 0).

Chapter 1 Topology

8

Prove the following: (i)

o/(p) = {U

S I there exists Bip) c: U} satisfies V1-V4 of Exercise 1.1H.

C

Thus S becomes a top logical space. (ii)

S is first countable.

(iii)

S is Hausdorff.

(iv)

S is separable. (Him: The set {(x. y)

(v)

S is not second countable (Hint: Assume the contrary and get a contradiction by

E

Six. y

E

Q. y > O} is dense in S.)

looking at the points (x. 0) of S.) (vi)

S is not regular. (Hint: Try to separate the point (xo.O) from the set { (x. 0)

1.U

I

x

E

lR} \ { (x o• O)}.)

With the same notations as in the preceding exercise. except changing B£(p) to D£(p) n S. if P = (x. y) with y> 0

B£(p) =

1

{(x. y)

E

D£(p) I y > O} u {pl. ifp = (x. 0).

show that (i)-(v) of 1.11 remain valid and that (vi) S is regular; (Hint: Use Exercise 1.lG.) (vii) S is not normal. (Hint : Try to separate the set {(x. 0) I x E Q} from the set {(x. 0) I x

E

lR \ Q } .)

l.lK Prove the following properties of the boundary operation and show by example that each inclusion cannot be replaced by equality.

Bd1 bd(A) = bd(S \ A); Bd2 bd(bd(A) c: bd(A); Bd3 bd(A U B) c: bd(A) U bd(B) c: bd(A U B) U A U B; Bd3 bd(bd(bd(A))) = bd(bd(A». Properties Bd1-Bd4 may be used to characterize the topology.

1.lL Let p be a polynomial in n variables zi' ...•zn with complex coefficients. Show that p-I(O) has open dense complement. (Him: If p vanishes on an open set of

en. then all

its derivatives also vanish and hence all its coefficients are zero.)

1.lM Show that a subset '13 of 0 is a basis for the topology of S if and only if the following three conditions hold: Bl 0 E '13; B2 B3

UBe~B=S; if Bi' B2 E '13. then BI n B2 is a union of elements of 'E.

§1.2 Metric Spaces One of the common ways to form a topological space is through the use of a distance function. also called a (topological) metric. For example. on

d(x, y)

between x

= (xl' ...• xn )

and y

= [

= (Yl' .... Yn )

t

~n

the standard distance

I/2

J (xi - Yi )2

can be used to construct the open disks and from

them the topology. The abstraction of this proceeds as follows.

1.2.1 Definition Let M be a set. A metric (also called a topological metric) on M is afunction d: M x M

~ ~

such that for all ml' m 2 • m3 E M.

M1 d(ml' m 2) = 0 iff m l = m 2 (definiteness); M2 d(ml' m 2 ) = d(m 2 • m l ) (symmetry); and M3 d(ml' m 3):O:;; d(ml' m 2) + d(m2 • m 3) (triangle inequality). A metric space is the pair (M. d); if there is no danger of confusion, just write M for (M. d). Taking m t = m3 in M3 shows that d is necessarily a non-negativc function. It is proved in advanced calculus courses (and is geometrically clear) that the standard distance on }In satisfies

M1-M3. The topology determined by a metric is defined as follows. 1.2.2 Definition For c > 0 and m

E

Dt(m)

M, the open c-ball (or disk) about m is defined by

= {m'E

Mld(m'.m) 0 such that D£(m) c U.

10

Chapter 1 Topology

Proof (i) T1 and T3 are clearly satisfied. To prove T2, it suffices to show that the intersection of two disks is a union of disks, which in tum is implied by the fact that any point in the intersection of two disks sits in a smaller disk included in this intersection. To verify this, suppose that p E De(m)

n Ds(n)

and let 0 < r < minCE - d(p, m), Ii - d(p, n»). Hence Dr(p)

n Ds(n), since for any x E Dr(p), d(x, m)

: 0 there is an integer N such that n ~ N implies d(un, u) < E. This follows readily from the definitions 1.1.11 and 1.2.2. A convergent sequence (un) is a Cauchy sequence. To see this, let E> 0 be given. Choose N such that n ~ N implies d(un, u) < E/2. Thus, n, m ~ N implies d(u n, urn) : 0 such that D,(m) c U}.

Show that 'Km) satisfies V1-V4 of Exercise 1.IH. This shows how the metric topology can be defined in an alternative way starting from neighborhoods.

1.2F

In a metric space show that cl(A)

= {u E

M I d(u, A)

= O}.

Exercises 1.2G-l use the notion of continuity from calculus (see Section 1.3).

1.2G Let M denote the set of continuous functions f: [0,1]

~

lR on the interval [0, 1]. Show

that d(f, g) = fal I f(x) - g(x) I dx is a metric. 1.2H Let M denote the set of continuous functions f: [0,1] d(f, g)

= sup{ I f(x) -

~

lR. Set

g(x) II 0 ~ x ~ I}

(i)

Show that d is a metric on M.

(ii)

Show that fn

(iii)

By consulting theorems on unifonn convergence from your advanced calculus text,

~

f in Miff fn converges uniformly to f.

show that M is a complete metric space. 1.21

Let M be as in the previous exercise and define T : M

T(f)(x)

=

a+

I

~

M by

K(x, y) f(y) dy ,

where a is a constant and K is a continuous function of two variables. Let

and suppose k < 1. Prove the following:

Section 1.2 Metric Spaces

(i)

T is a contraction.

(ii)

Deduce the existence of a unique solution of the integral equation

f(x) = a +

(iii)

I

K(x, y) f(y) dy .

Taking a special case of (ii), prove the "existence of e"."

13

§1.3 Continuity 1.3.1 Definition Let S and T be topological spaces and cp : S ~ T be a mapping. We say that cp is continuous at u

E

S iffor every neighborhood Vof cp(u) there is a neighborhood U of u

such that cp(U) c V. If,for every open set V of T, cp-I(V) = {u

E

S I cp(u)

cp is continuous. (Thus. cp is continuous if cp is continuous at each u

E

E

V} is open in S,

S). If the map cp: S

~

T is a bijection (that is. one-to-one and onto). and both cp and cp-I are continuous. cp is called a

homeomorphism and S and T are said to be homeomorphic. For example. notice that any map from a discrete topological space to any topological space is continuous. Similarly. any map from an arbitrary topological space to the trivial topological space is continuous. Hence the identity map from the set S topologized with the discrete topology to S with the trivial topology is bijective and continuous. but its inverse is not continuous. hence it is not a homeomorphism.

It follows from 1.3.1 by taking complements and using the set theoretic identity S\q>-I(A) = cp-I(1\A) that cp : S ~ T is continuous iff the inverse image of every closed set is closed.

1.3.2 Proposition Let S. T be topological spaces and cp: S ~ T. The following are equivalent: (i) cp is continuous; (ii) S \ Ui=I ..... rnUi for m ~ n. Choose Pn e S\Ui=I ..... nUi. If {Pn I n = 1. 2 .... } is infinite. by hypothesis is contains a convergent SUbsequence; let its limit point be denoted p. Then PES \ Ui=I .... nUi for all n. contradicting the fact that {Un I n = 1. 2 .... } covers S. Thus {Pn I n = 1.2.... } must be a finite set; i.e .• for all n ~ N. Pn = PN' But then again PN E S \ Ui=I ..... n Ui for all n. contradicting the fact that {Un I n = 1. 2 .... } covers S. Hence S is compact. Let S be a metric space such that every sequence has a convergent subsequence. If we

26

Chapter 1 Topology

show that S is separable. then since S is a metric space it is second countable (Exercise 1.2C). and by the preceding paragraph. it will be compact. Separability of S is proved in two steps. First we show that for any E > 0 there is a finite set of points {p\' ...•Pn} such that S = U i = 1•... ,nDt(p). If this were false. there would exist an E> 0 such that no finite number of E-disks cover S. Let PI E S be arbitrary. Since Dt(PI) *" S. there is a point P2 E S \ Dt(PI)· Since DipI) U Dt (P2) *" S. there is also a point P3 E S \ (Dt(PI) U Dt (P2»' etc. The sequence {Pn I n=1.2 •... } is infinite and d(Pi' Pj) ~ E. But this sequence has a convergent subsequence by hypothesis. so this subsequence must be Cauchy. contradicting d(Pi' Pj) ~ E for all i. j. Second. we show that the existence for every E> 0 of a finite set (Pl •...• Pn(t)} such that S = U i = 1,..n(tPt(Pi) implies S is separable. Let A.. denote this finite set for E = lin and let A = Un~A... Thus A is countable and it is easily verified that c1(A) = S. • A property that came up in the preceding proof turns out to be important. 1.5.5 Definition Let S be a metric space. A subset A c: S is called totally bounded iffor any E > 0 there exists afinite set {p\' ...• Pn} in S such that A c: U i = 1,..nDt(P)' 1.5.6 Corollary A metric space is compact iff it is complete and totally bounded. A subset of a complete metric space is relatively compact iffit is totally bounded. Proof The previous proof shows that compactness implies total boundedness. As for compactness implying completeness. it is enough to remark that in this context. a Cauchy sequence having a convergent subsequence is itself convergent. Conversely. if S is complete and totally bounded, let {Pn I n = 1.2 •... } be a sequence in S. By total boundedness. this sequence contains a Cauchy subsequence. which by completeness. converges. Thus S is compact by the Bolzano-Weierstrass theorem. The second statement now readily follows. • 1.5.7 Proposition In a metric space compact sets are closed and bounded. Proof This is a particular case of the previous corollary but can be easily proved directly. If A is compact. it can be finitely covered by E-disks: A = U i = l,..nDt(P). Thus n

diam(A) :5

L diam(Dt(Pi»

= 2nE

i=1

•

From 1.5.2 and 1.5.6. if S is compact and O. [-a. a]

[-

a. a] and k be

n and

k I'.

~

t
f- 1(f(S» = S, and r-I(U) n f-I(V) = r-1(0) = 0, thus contradicting connectedness of S by 1.6.3. Hence f(S) is connected. _ We shall prove that arcwise connected spaces are connected. We shall use the following. 1.6.5 Lemma The only connected sets of]R are the intervals (finite, infinite, open, closed, or hIllj-open). Proof Let us prove that [a, b[ is connected; all other possibilities have identical proofs. If not, [a, b[ = U U V with U, V nonempty disjoint closed sets in [a, b[. Assume that a E U. If x = sup(U), then x E U since U is closed in [a, b[, and x < b since V'" 0. But then lx, b[ c V and, since V is closed, x E V. Hence x E un V, a contradiction. Conversely, let A be a connected set of R We claim that [x, y]

C

A whenever x, y E

A, which implies that A is an interval. If not, there exists Z E [x, y] with z e A. But in this case ]-00, z[ U A and ]z, 00 [ U A are open nonempty sets disconnecting A. _ 1.6.6 Proposition If S is arcwise connected then it is connected. Proof If not, there are nonempty, disjoint open sets Uo and U I whose union is S. Let Xo E U o and XI E Uland let cp be an arc joining Xo to XI. Then Vo = cp-I(U O) and V I = cp-I(U I) disconnect [0, I]. _ A standard example of a space that is connected but is not arcwise connected nor locally connected, is {(X,y)E ]R21 x>O and y=sin(1/x)} U {(O,y)I-I
= a( el' e2 ).

denotes complex

Properties CI1 - CI3

are also known in the literature under the name sesquilinearity. As is customary, for a complex number a we shall denote by Re a = (a + 0.)/2,

1m a = (a - 0.)/2,

1a 1 =

(a 0.)1/2

its real and imaginary parts and its absolute value. The standard inner product on the product space

cn = C

X ...

x C is defined by n

( x, w) =

~>iWi i=

,

1

and C11-C14 are readily checked. Also C' is a normed space with n

II Z 112

=

L1

Zi

12

i= 1 In lItn or

cn,

property N3 is a little harder to check directly. However, as we shall show in

Proposition 2.1.4, N3 follows from 11-14 or CI1-CI4. In a (real or complex) inner product space E, two vectors e p e2 E E are called orthogonal and we write e l -L e 2 provided (el' e2 ) = O. For a subset ACE, the set A.L defined by A.L = (e EEl (e, x) =0 for all x E A} is called the orthogonal complement of A. Two sets A, B C E are called orthogonal and we write A -L B if (A, B) = 0; that is, e l -L e 2 for all e l E A and e 2 E B. 2.1.3 Cauchy-Schwartz Inequality In a (real or complex) inner product space,

Equality holds iff e l , e2 are linearly dependent.

43

Section 2.1 Banach Spaces

Proof It suffices to prove the complex case. If a.

~

E

C we have

If we set a = ( e z, ez )' and ~ = - ( e 1, e2 ). then this becomes

and so

If ez = O. equality results in the statement of the proposition and there is nothing to prove. If ez "Ie0, the tenn (ez' ez ) in the preceding inequality can be cancelled since (e z• "Ie- 0 by C14. Taking square roots yields the statement of the proposition. Finally, equality results if and

ez)

only if ae l + ~ez =( ez' eZ)el - ( el' e2 )ez

= O.

•

2.1.4 Proposition Let (E, (', . » be a (real or complex) inner product space and set II e II = ( e, e )IIZ. Then (E, II . 10 is a normed space. Proof N1 and N2 are straightforward verifications. As for N3, the Cauchy-Schwartz inequality and the obvious inequality Re(( e 1, ez » ~ I ( e 1, I imply

ez )

II e1 + ezll z

»

= II e1 112 + 2 Re« ~

el' ez + II ezll z ~ II e1ll z + 21 (el' ez ) I + II ezll z

II e 1 liZ + 211 e 1II II ezll + II ezll z = (II e1 II + II ez 10z . •

Some other useful facts about inner products are given next.

2.1.5 Proposition nonn. Then

Let (E,(·,

.»

be an inner product space and II· II the corresponding

(i) (Polarization)

4( el' ez)

=

j

II el + ez liZ - II el - ez liZ + i II el + ez liZ - i II el - iez liZ if E is complex , II el + ez liZ - II el - Cz liZ if E is real .

(ii) (Parallelogram law)

2 II e 1 liZ + 2 II ez liZ

Proof (i) II e 1+ ez If - II e 1 -

= II e1+ ez liZ + II e1-

e2 UZ .

ez 112 + i II e 1 + iez 112 - i II e1 - iezll2

= II e 1 112 + 2Re« el' ez

»+ II e2112 -II C1 112 + 2Re«

el' e2 » -II

ez 112 +

Chapter 2 Banach Spaces and DifferentiJJl Calculus

44

+ ill e l 112 + 2iRc« cl' ic2 » + i)l c 2 112-i II e l 112 + 2iRe« cl' ie2

»- i II c2112

= 4Rc« el' ~» + 4iRc(-i (el' e 2 » = 4Re«

el' e2

»+ 4i Im«

el' c2

»=4 ( el' e2 ).

The real case is proved in a simnar way. (ii) II e l + e 2 112 + II e l

-

e2 112 = II e l 112 + 2Re« el' e2

»+ II c2 112 + II e

l

If - 2Rc« Cl' c2

»c2 112

= 2 II e l 112 + 2 II C2 If. • Not all norms come from an inner product. For cxamplc in JRR, R

II) x III =

L

I xii

i=1

is a norm, but there is no inner product with this as norm, since this norm fails to satisfy thc parallelogram law (see Exercise 2.IA for a discussion). 2.1.6 Proposition Let

(E,II·II) be a rwnned (or a semirwnned) space and define d(c l , c 2)

=

II e) - e211. Then (E, d) is a metric (pseudometric) space. Proof The only non-obvious verification is the triangle ineqUality. By N3 we have d(el' c 3 )

= II e l - e 3 II = lI(e l - e2) + (e2 = d(el' c 2) + d(e2, e3 ) . •

e3 ) II S II e) - e211 + II e 2 - e3 II

Thus we havc the following hicrarchy of gcncrality:

inner product spaces

c

normcd spaces

C

mctric spaces

C

topological spaccs.

Since inner product and normcd spaccs arc mctric spaces, we can use thc conccpts from Chaptcr 1. In a normcd spacc, N1 and N2 imply that the maps (el' e2) 1-+ c 1 + c2' (a, c) 1-+ ac of Ex E

~

E, and ex E

o fixed, thc mappings

e

~

1-+

E, respectively, are continuous. Hence for eo E E, a o E C, a o ~ eo + c, e 1-+ aoc are homeomorphisms. Thus U is a neighborhood

of thc origin iff c + U = {c + x I x E U} is a ncighborhood of c E E. In othcr words, all thc neighborhoods of e E E are sets that contain translates of disks ccntcred at the origin. This constitutes a complete description of the topology of a normed vector space (E, II . II).


4S

Finally, note that the inequality III e) 11- II e2111 $ II e l - e2 II implies that the nonn is unifonnly continuous on E. In inner product spaces, the Cauchy-Schwartz inequality implies the continuity of the inner product as a function of two variables.

2.1.7 Definition Let (E, II . II) be a rwrmed space. If the corresponding metric d is complete, we say (E, II· II) is a Banach space. If (E, (.,.» is an inner product space whose corresponding metric is complete, we say (E, CO» is a Hilbert space. For example, it is proven in books on advanced calculus that JR." is complete. Thus, JR." with the standard nonn is a Banach space and with the standard inner product is a Hilbert space. Not only is the standard nonn on JR." complete, but so is the nonstandard one

"

IIl x lll=L IXil. i=l

However, it is equivalent to the standard one in the following sense.

2.1.8 Definition Two rwrms on a vector space E are equivalent if they induce the same topology on E.

2.1.9 Proposition Two rwrms II· II and III . ilion E are equivalent iff there is a constant M such that M-) III e III $ II e II $ Mill e III for all e E E.

Proof Let B:(x) = {y E E III y - x II $ r},

B~(x) = {y E E 1111 y - x III $ r}

denote the two closed disks of radius r centered at x E E in the two metrics defined by the nonns II· II and III . III, respectively. Since neighborhoods of an arbitrary point are translates of neighborhoods of the origin, the two topologies are the same iff for every R > 0, there are constants Ml' M2 > 0 such that

B~ 1(0) c Bk(O) c B~ 2(0). The first inclusion says that if III x III Thus, if e ~ 0, then

$

M I' then II x II

$

R, i.e., if III x III

$

1, then

II x II $ RlM l .

that is, II e II $ (R/M)1I1 e III for all e E E. Similarly, the second inclusion is equivalent to the assertion that (MzIR) III e III $ II e II for all e E E. Thus the two topologies are the same if there exist

46

Chapter 2 Banach Spaces and DiJJerentUd Calculus

constants N 1 > 0, N2 > 0 such that NI

III e III

II e II

~

~

N2111 e III

for all e E E. Taking M = max(N2, lIN I) gives the statement of the proposition. • If E and F are nonned vector spaces, the map II· II : E x F ~ lR defined by II (e, e') II = II e II + II e' II is a nonn on Ex F inducing the product topology. Equivalent nonns on Ex Fare (e, e') ...... max(1I e II, II e' II) and (e, e') ...... (II e 112 + II e' 112)1/2. The nonned vector space Ex F is

usually denoted by E ED F and called the direct sum of E and F. Note that E ED F is a Banach space iff both E and F are. These statements are readily checked.

2.1.10 Proposition. Let E be afinite-dimensional real or complex vector space. Then (i) there is a nonn on E; (ii) all nonns on E are equivalent; (iii) all nonns on E are complete. Proof Let e l ,

... ,

en denote a basis of E, where n is the dimension of E.

(i) A nonn on E is given, for example, by n

lie

II = Llail, i=1

where n

e =

L aiei i=1

(ii) Let

II· II' be any other nonn on E. If n

e=

L aiei

n

and f=

i=1

L biei' i=1

the inequality n

III e II' -II f II' I ~ II e - f II' ~

L I ai - bi III ei II' i=

shows that the map

1


47

is continuous with respect to the III· III -nonn on en. Since the set S = {x E en 1111 x 111= 1} is closed and bounded, it is compact. The restriction of this map to S is a continuous, strictly positive function, so it attains its minimum MI and maximum M2 on S; that is,

for all (x I, ... , xn)

E

= 1.

en such that III (x I, ... , xn) III

Thus

i.e., MIll e II' $; II ell'$; M2 II e II, where n

e =

L

xiei

i=1

Taking M =max(M 2, 1IM I), Proposition 2.1.9 shows that II· II and II· II' are equivalent. (iii) It is enough to observe that n

(xl, ... , xn)

E

en

1-+

L

xic;

E

E

i=1

is a nonn-preserving map (Le., an isometry) between (en, III . III) and (E, II . II). • The unit spheres for the three common nonns on ]R2 are shown in Figure 2.1.1.

y

y

y

x

x

max(x. yl

\.'C

+ r

Figure 2.1.1

Ixl + l,vl

48

Chapter 2 Banach Spaces and Differential Calculus

The foregoing proof shows that compactness of the unit sphere in a finite-dimensional space is crucial. This fact is exploited in the foIIowing supplement.

~

SUPPLEMENT 2.IA

A Characterization of Finite-Dimensional Spaces 2.1.11 Proposition A normed vector space is finite dimensional iff it is locally compact. (Local

compactness is equivalent to the fact that the closed unit disk is compact.) Proof If E is finite dimensional. the previous proof of (iii) shows that E is 10caIIy compact.

Conversely. assume the closed unit disk B1(0)

C

E is compact and let {D 1/2(x i ) I i = 1•...• n}

be a finite cover with open disks of radii 1/2. Let F = span {xl' ...• xn }. Then F is finite

dimensional. hence homeomorphic to Ck (or

]Rk)

for some k ~ n. and thus complete. Being a

complete subspace of the metric space (E. II . II). it is closed. We shaII prove F

=E.

Suppose the contrary. that is. there exists vEE. v II' F. Since F = cl(F). the number d = inf{1I v - e II leE F} is strictly positive. Let r > 0 be such that B/v) set Br(v)

nF

n

F

* 0. Note that the

is closed and bounded in the finite-dimensional space F. and thus is compact.

Since inf{ II v - e III e E F} = inf{1I v - e III e E B/v) C F} and the continuous function defined by e E Br(v) n F ~ II v - e II E 1o. 00 [ attains its minimum. there is a point eo E Br (v) n F such that d =II v - eo II. But then there is a point

Xi

such that

so that

Since eo + II v - eo II

Xi

E F. we get II v - eo - (II v - eo lI)x i II

~

d. which is a contradiction .

• {l::rJ 2.1.12 Examples

sUPx E

A Let X be a set and F a normed vector space. Define the set B(X. F) = (f: X ~ F I X II f(x) II < 00 } . Then B(X. F) is easily seen to be a normed vector space with respect to

the so-caIled sup-norm. II f II~ = sUPx E X II f(x) II. We prove that if F is complete. then B(X. F) is a Banach space. Let {fn} be a Cauchy sequence in B(X. F). i.e.. II fn - fm II~ < E for n. m ~ N(E). Since for each x E X. II f(x) II ~ II f II~. it foIlows that {fn (x)} is a Cauchy sequence in F. whose limit we denote by f(x). In the inequality II fn(x) - fm(x) 11< E for all n. m ~ N(E). let m ~ 00 and get II fn (x) - f(x) II ~ E for n ~ N(E). i.e.• IIfn - f II~ ~ E for n ~ N(E). This shows that fn - f E B(X. F). i.e.• that f E B(X. F). and that II fn - f II~ ~ 0 as n ~ 00. As a particular case. we get the Banach space cb consisting of all bounded real sequences {an} with the

49


norm II {an} II"" = sUPnl an I. B If X is a topological space, the space CB(X, F) = {f: X ~ F I f is continuous, f E B(X, F)} is closed in B(X, F). Thus if F is Banach, so is CB(X, F). In particular, C(X, F) = {f: X ~ F

I f continuous}, for

X compact and F Banach, is a Banach space. For example,

the vector space C([O, I], JR) = {f: [0, 1] ~ JR I f is continuous} is a Banach space with the norm IIfll",,=sup{lf(x)llxE [O,I]}. C (For readers with some knowledge of measure theory.) Consider the space of real valued square integrable functions defined on an interval [a, b]

C

JR, i.e., functions f that satisfy

The function II . II : f......

(

f. I

)1/2

b

f(x) 12 dx

is, strictly speaking, not a norm on this space; for example, if

f(x) = {

0 for x;to a , 1 for x = a ,

then II f II = 0, but f;to O. However, II· II does become a norm if we identify functions which differ only on a set of measure zero in [a, b], i.e., which are equal almost everywhere. The resulting vector space of equivalence classes [f] will be denoted L2 [a, b]. With the norm of the equivalence class [f] defined as b

II [f] II = ( 11 f(x) 12 dx

)1/2

,

L2 [a, b] is an (infinite dimensional) Banach space. The only nontrivial part of this assertion is the completeness; this is proved in books on measure theory, such as Royden [1968]. customary, [f] is denoted simply by f. In fact, L2 [a, b] is a Hilbert space with

( f, g) =

f

As is

f(x) g(x) dx .

If we use square integrable complex-valued functions we get a complex Hilbert space L2([a, bl, C) with (C, g) =

f

f(x) g(x) dx .

D The space LP([a, b)) may be defined for each real number p ~ 1 in an analogous

so


fashion to L2 [a, b]. Functions f: [a, b] ~ JR. satisfying

are considered equivalent if they agree almost everywhere. LP([a, b]) is then defined to be the vector spac~ of equivalence classes of functions equal almost everywhere.

II' lip: LP [a, b]

~ JR. given by If] ~ (

f.

b

I f(x)I P dx

)I/P

defines a norm, called the LP -norm, which makes LP [a, b] into an (infmite dimensional) Banach space. E

Denote by C([a, b],JR.) the set of continuous real valued functions on [a, b]. With the

O-norm, C([a, bJ,JR.) is not a Banach space. For example, the sequence of continuous functions fn shown in Figure 2.1.2 is a Cauchy sequence in the L I-norm but does not have a continuous limit function. On the other hand, with the sup norm

II f II

sup

= x

E

I f(x) I ,

[0.11

C([O, 1]) is complete, i.e., it is a Banach space, as in Example B.

•

y

_ - - - - - - - - y = fn(x)

--4-----L---~------------------- X

Figure 2.1.2. fn converges in L I, but not in C. As in the case of vector spaces, quotient spaces of normed vector spaces playa fundamental role. 2.1.13 Proposition Let E be a normed vector space, F a closed subspace, ElF the quotient

vector space, and It: E ~ ElF the canonical projection defined by (i) The mapping II· II : ElF ~ JR.,

It

(e) = leJ = e + F

E

ElF.

51


II [elll = inf{1I e + v III v E F} defines a norm on ElF. (ii) 1t is continuous and the topology on ElF defined by the norm coincides with the quotient topology. In particular, 1t is open. (iii) If E is a Banach space, so is ElF. Proof (i) Clearly II [elll ~ 0 for all [el E ElF and II [0111 = inf{ II v III v E F} = O. If II [elll = 0, then there is a sequence {V n } C F such that limn --> ~ II e + vn II = O. Thus limn --> ~ vn = - e and since F is closed, e E F; i.e., [el = O. Thus N1 is verified and the necessity of having F closed becomes apparent. N2 and N3 are straightforward verifications. (ii) Since

II [elll :511 e II, it is obvious that limn --> ~ en = e implies limn --> ~ 1t (en) =

limn --> ~[enl = [el and hence 1t is continuous. Translation by a fixed vector is a homeomorphism. Thus to show that the topology of ElF is the quotient topology, it suffices to show that if [01 E U and 1t -1(U) is a neighborhood of zero in E, then U is a neighborhood of [01 in ElF. Since 1t""1(U) is a neighborhood of zero in E, there exists a disk D/O)

C

1t""1(U).

But then 1t(Dr(O» C U and 1t (D/O» = {[ell e E Dr(O)} = {[ell II [elll < r }, so that U is a neighborhood of [01 in ElF. (iii) Let {[enl} be a Cauchy sequence in ElF. We may assume without loss of generality

that II [enl- [en+1lll :51/2n. Inductively, we find points e'n E [enl such that II e'n -e'n+l ll < 10.0. Thus {e'n} is Cauchy in E so it converges to, say, e E E. Continuity of 1t implies that limn-->~ [en 1 = [el . •

The codimension of F in E is defined to be the dimension of ElF. We say F is of jiniU codimension if ElF is finite dimensional. 2.1.14 Definition The closed subspace F of the Banach space E is said to be split, or complenrented, if there is a closed subspace GeE such that E = F E!:) G.

~

SUPPLEMENT 2.1B Split Subspaces

Definition 2.1.14 implicitly asks that the topology of E coincide with the product topology of FEEl G. We shall show in Supplement 2.2C that this topological condition can be dropped; i.e., F is split iff E is the algebraic direct sum of F and the closed subspace G. We note that if E = F EEl G then G is isomorphic to ElF. However, F need not split for

ElF

to be a Banach space, as we proved in 2.1.13. In finite dimensional spaces, any subspace is

closed and splits; however, in infinite dimensions this is false. For example, let E = LP(SI) and let F={fE Elf(n)=O forn 0 I II Ae II :5 Mile II for all e = sup{IIAelllllell:5l} = sup{1I Ae II

E

E}

I lIell=l}

1111 e II· L(E, F) and B E L(F, G), where E, F, and G are nonned spaces, then

In particular, II Ae II :5 II A

If A

E

II (B 0 A)(e) II = II B(A(e» II :5 II B 1111 Ae II :5 II B 1111 A 1111 ell, and so

II (B 0

A)

II :5 II B 1111 A II .

Equality does not hold in general. A simple example is obtained by choosing E = F = G = 1R2, A(x, y) = (x, 0), and B(x, y) = (0, y), so that BoA = 0 and II A

II = II B II =

1.

2.2.4 Proposition L(E, F) with the norm just defined is a normed space. It is a Banach space F is. Proof Clearly

if

II A II ~ 0 and II 0 II = O. If II A II = 0, then for any e E E, II Ae II :5 II A 1111 e II = 0,

so that A = 0 and thus N1 (see Definition 2.1.1) is verified. N2 and N3 are also straightforward to check. Now let F be a Banach space and {An} inequality II Ane - Ame II :5 II An - Am 1111 e

C

L(E, F) be a Cauchy sequence. Because of the

II for each e E E, the sequence {Ane} is Cauchy in· F

and hence is convergent. Let Ae = limn ..... ~ Ane. This defines a map A: E ~ F, which is evidently linear. It remains to be shown that A is continuous and II An - A II ~ O. If e> 0 is given, there exists a natural number N(e) such that for all m, n ~ N(e) we have

II ~ - Am II < e. If II e II :5 1, this implies II Ane - Ame II < e, and now letting m ~ it follows that II Ane - Ae II :;; e for all e with II ell:;; 1. Thus An - A E L(E, F), hence A E L(E, F) and II ~ - A II :;; e for all n ~ N(e); i.e., II An - A II ~ O. • 00,

If a sequence {An} converges to A in L(E, F) in the sense that II ~ -A II ~ 0, i.e., if ~ -+ A in the nonn topology, we say ~ ~ A in norm. This phrase is necessary since other

Chapter 2 Banach Spaces and DifferenJiol Calculus

58

topologies on L(E, F) are possible. For example, we say that An -+ A strongly if for each e E E. Since II Ane - Ae II

~

~e

-+ Ae

II An - A 1111 e II, norm convergence implies strong

convergence. The converse is false as the following example shows. Let

with inner product

({~}, {bn} )

=

L ~bn n=

1

Let en = (0, ... , 0, 1,0 ... )

E

E

F = IR , and

~

= (en") E L(E, F) ,

where the 1 in en is in the nth slot. The sequence {An! is not Cauchy in the operator nonn since II An - Am II = ";2, but if e = {am}, ~(e) = (en' e) = ~ -+ 0, i.e., An -+ 0 strongly. If both E and F are finite dimensional, strong convergence implies nonn convergence. (To see this, choose a basis el' ... , en of E and note that strong convergence is equivalent to Ake j ~ Ae j as k -+ 00 for i = 1, ... , n. Hence maxjll Aej II = III A III is a norm yielding strong convergence. But all norms are equivalent in finite dimensions.)

~

SUPPLEMENT 2.2A

Dual Spaces Recall from elementary linear algebra that the dual space of a finite dimensional vector space of dimension n also has dimension n and so the space and its dual are isomorphic. For general Banach spaces this is no longer true. However, it is true for Hilbert space. 2.2.5 Riesz Representation Theorem Let E be a real (resp. complex) Hilbert space. The map e .... ( " e) is a linear (resp. antilinear) norm-preserving isomorphism of E with E*;for short, E:; E*. (A map A: E -+ F between complex vector spaces is called antilinear ifwe have the identities A(e + e') = Ae + Ae' ,and A(ae) = ( f Ae.) Proof Let fe =(', e). Then IIfell=lIell and thus feE E*. Themap A:E-+E*, Ae=fe is clearly linear (resp. anti linear), norm preserving, and thus injective. It remains to prove surjectivity. Let f E E* and kerf = {e EEl f(e) = O}. kerf is a closed subspace in E. If kerf = E, then f = 0 and f = A(O) so there is nothing to prove. If ker f *- E, then by lemma 2.1.17 there exists e*-O such that e.l ker f. Then we claim that f = A(f(e)elll e 11 2 ). Indeed, any VEE can be written as f(v) f(v) f(v) v = v - f(e) e + f(e) e and v - f(e) e E ker f. •

Section 2.2 Linear and Multilinear Mappings

59

Thus. in a real Hilbert space E ev for some eo E E and II 2 II = II eo II· In a general Banach space E we do not have such a concrete realization of E*. However. one should not always attempt to identify E and E*. even in finite dimensions. In fact. distinguishing these spaces is fundamental in tensor analysis. We have a canonical map i: E ~ E ** defined by i(e)(2) = 2(e). (pause and look again at this strange but natural formula: i(e) E E** = (E*)*. so i(e) is applied to the element 2 E E*.) It is easy to check that i is norm preserving. One calls E reflexive if i is onto. Hilbert spaces are reflexive. by 2.2.5. For example. let V = L2( IRD) with inner product ( f. g > =

in

f(x) g(x) dx •

and let a: L2(IRD) ~ JR be a continuous linear functional. Then the Riesz representation theorem guarantees that there exists a unique g E L2(IRD) such that a(O =

r g(x)f(x) dx JRn

=

(g. f)

for all f E L2(JRD). In general. if E is not a Hilbert space and we wish to represent a linear functional a in the form of a(O = ( g. f >. we must regard g(x) as an element of the dual space E*. For example. let E = CO(n. lEt). where n C JRD. Each x E n defines a linear functional Ex: CO(n.lR) ~ IR; f ..... f(x). This linear functional cannot be represented in the form Ex(f) = (g. f> and. indeed. is not continuous in the L2 norm. Nevertheless. it is customary and useful to write such linear maps as if ( • > were the L2 inner product. Thus one writes. symbolically.

E"o(f) =

10

O(x - xo)f(x) dx.

which defines dIe Dirac delta/unction at xo; i.e .• g(x) = O(x - xo).

tl:n

Next we shall discuss integration of vector valued functions. We shall require the following.

2.2.6 Linear Extension Theorem Let E. F. and G be normed vector spaces where (i) FeE;

(ii) G is a Banach space; and (iii) T

E

L(F. G).

Then the closure cl(F)

0/ F is a normed vector subspace 0/ E and T can be uniquely extended

60


to a rruzp 'Ie L(cl(F). G). Moreover. we have the equality II T II = II 'I II. Proof The fact that cl(F) is a linear subspace of E is easily checked. unique by continuity. Let us prove the existence of

rz:

N~te

that if 'I exists it is

If e e cl(F). we can write c '"

limn ..... ~ cn' whcre cn e F. so that

which shows that the sequcnce {Ten} is Cauchy in thc Banach space G. Let 'Ie =limn ..... ~ Tcn . This limit is indcpendent of thc sequcncc {cn}. for if c = lim e'n' then

which provcs that limn ..... ~ (Tcn) = limn ..... ~ (Te'n)' It is simple to chcck the linearity of

rz:

Since

Te = 'Ie for e e F (because e = limn ..... ~ e). 'I is an extension of T. Finally.

shows that 'Ie L(cl(F). G) and

II 'I II ~ II T II. The inequality II T II ~ II 'I II is obvious sincc 'I

extends T. • As an application of this lemma we define a Banach space valued integral that will be of use latcr on. Fix the closed interval [a. bJ c: 1R and the Banach space E. A map f: [a. bJ called a step function if there exists a partition a = to < t1 < '-2 ... < tn

=b

~

E is

such that f is constant

on each interval [ti• ti+1 [. Using the standard notion of a refinement of a partition. it is clear that the sum of two step functions and the scalar multiples of step functions are also step functions. Thus the set 8([a. bJ. E) of step functions is a vector subspace of B([a. bJ. E). the Banach spacc of all bounded functions (see Example 2.1.12). The integral of a step function f is defined by

f L b

n

f =

(1; + 1 - t) f(t i )

.

i= 0

a

It is easily verified that this definition is independent of the partition. Also note that

II where

that is.

f II ~ f f

II f II

~

(b -

a)1I f

II~


r:

8([a. bJ. E)

~

61

E

is continuous and linear. By the linear extension theorem. it extends to a continuous linear map

r

E L(cl(8 ([a. b], E».E).

2.2.7 Definition The linear map

r

is called the Cauchy-Bochner integral. Note that

The usual properties of the integral such as

are easily verified since they clearly hold for step functions. The space cl(8([a.bJ. E) contains enough interesting functions for our purposes. namely cl([a. bJ. E)

C

cl(8([a. bJ. E»

C

B([a. bJ. E) .

The first inclusion is proved in the following way. Since [a.bJ is compact. each f E CO([a. bJ. E) is uniformly continuous. For E> O. let 0 > 0 be given by uniform continuity of f for £/2. Then take a partition a = to < ... < tn = b such that I ti+1 - tl I < 0 and define a step function g by g I[ti• ~+I[ = f(t;). Then the E-disk De(f) in B([a. bJ. E) contains g. Finally. note that if E and F are Banach spaces. A E L(E. F). and f E cl(8([a. bJ. E». we have AofE cl(8([a. bJ. F» since IIAofn-Aofll:5 IIAllllfn-fll~ where fn are step functions in E. Moreover.

since this relation is obtained as the limit of the same (easily verified) relation for step functions. The reader versed in Riemann integration should notice that this integral for E = IR is less general than the Riemann integral; i.e.• the Riemann integral exists also for functions outside of cl(8([a. bJ. lR». For purposes of this book. however. this integral will suffice. Next we turn to multilinear mappings. If E\, .... I\: and F are linear spaces. a map

Chapter 2 Banach Spaces and Differentiol Calculus

62

A : E) x·.. x

~ -t

F

is called k-multilinear if A(e), ... , 0 such that II A(e\, ... , Olllf(e)II$Mlleln = sup {lIf(e)lIll1ell$ I} = sup{llf(e)lIll1ell=i}

It is Complete

if F is.

(ii) IffE Sk(E, F) and gE sn(F, G), then gofE Skn(E,G) and IIgofll$lIglillfli. (iii) (Polarization). The mapping ': Lk(E, F) ~ Sk(E, F) defined by N(e) = A(e, ... , e)

restricted to L\(E; F) has an inverse ': Sk(E, F) ~ Lk,(E, F) given by

64

Chapter 2 Banach Spaces and DilJerentiJJJ Calculus

f(t l e l + ... + tke k) is a polynomial in tp ...• tk, so there is no problem in understanding what the derivatives on the right hand side mean).

(Note that

(iv) For A E Lk(E,F). II A' II ~ II A II ~ (\(k/k!) II A' II. which implies the mflPS

•

are

continuous.

Proof (i) and (ii) are proved exactly as for L(E. F) =SI(E. F). (iii) For A E Lks(E; F) we have

where each ei appears ai times. and

I

ak -'t

-'t

°1"'Ok

t=O

a

1

a

t I ... ti' =

j

1. if k = j

0, if k

*j

It follows that

and for j

*k.

This means that '(A') = A for any A E Lks(E. F). Conversely, if f E Sk(E. F). then 1 rO'(e) = , f(e •...• e) = -k' .

atl a... atk k

I t= 0

f(tle + ... + tke)

(iv) II A'(e) II = II A(e •... , e) II :0; II A 1111 e Ilk, so II A' II inequality, note that if A E Lk.<E; F), then

:0;

II A II.

To prove the other


6S

where the sum is taken over all the 2k possibilities el = ± 1•...• ~ = ± 1. Put II e l II = ... = 1I~1I=1 andget

II A'(ele l + ... + ~~) II

elel + ... + ~~ II

$

II A'

$

II A' II (I el III e l II + ... + I ~ III ~ Il)k

1111

k

whence

Le .•

where each e i appears ai times. If e = tIel + ... +

f(e) = '[(e •...• e) =

~en'

L

the proof of (iii) shows that

cal ... ant~l ... ~n;

8 1+···+an=k

i.e.. f is a homogeneous polynomial of degree k in t l •...• tn in the usual algebraic sense. The constant kk/k! in (iv) is the best possible. as the following example shows. Write elements of JR.k as x = (xl •...• xk) and introduce the norm III (xl •...• xk) III = I xii + ... + I xk I. Define A

E

Lks(JR.k, JR.) by

I""

A(xl.···. xk) = -kl• where xi = (Xi I, ...• Xik)

E

£..J

X; I ••• X; k , 1

k

JR.k and the sum is taken over all permutations of {I ..... k}. It is

easily verified that II A II = l/k!, and II A' II = l/kk; i.e., II A II = (kk/k!) II A'

II. Thus. except for k

== 1, the isomorphism ' is not norm preserving. (This is a source of annoyance in the theory of

formal power series and infinite-dimensional holomorphic mappings.)

$:n

~ SUPPLEMENT 2.2C The Three Pillars of Linear Analysis

The three fundamental theorems of linear analysis are the Hahn-Banach theorem, the open mapping theorem, and the uniform boundedness principle. This supplement gives the classical proofs of these three fundamental theorems and derives some corollaries that will be used later. In

66

Chapter 2 Banach Spaces and Dif{erenJiol Calculus

finite dimensions these corollaries are all "obvious." 2.2.12 Hahn-Banach Theorem Let E be a real or complex vector space, 11·11 : E

~ lR

a

seminorm, and FeE a subspace. If f E F* satisfies 1f(e) 1:511 e II for all e E F, then there exists a linear map f': E ~ JR (or

C) such that f' I F = f and 1f'(e) 1:511 e II for all e E E.

Proof Real Case. First we show that f E F* can be extended with the given property to FEEl span{ eo), for a given eo i!: F. For el' e2 E F we have

so that and hence

Let a E JR be any number between the sup and inf in the preceding expression and define f': F EEl span {eo} ~ JR by f'(e + teo) = f(e) + tao It is clear that f' is linear and that f' I F = f. To show that 1 f'(e + teo) 1 :511 e + teo II, note that by the definition of a,

so that by multiplying the second inequality by t ~ 0 and the first by t < 0, we get the desired result. Second, one verifies that the set S = {(G, g) I F C GeE, G is a subspace of E, g E G*, g IF = f, and 1g(e) 1:511 e II for all e E G} is inductively ordered with respect to the ordering

Thus by Zorn's Lemma there exists a maximal element (Fo, fo) of S. Third, using the first step and the maximality of (Fo, fo)' one concludes that Fo = E. Complex Case. Let f = Re f + i 1m f and note that complex linearity implies that (1m f)(e) = - (Re f)(ie) for all e E F. By the real case, Re f extends to a real linear continuous map (Re f) : E ~ JR, such that 1 (Re f)'(e) 1 :5 II e II for all e E E. Define f': E ~ C by f'(e) = (Re f)'(e) - i(Re f)'(ie) and note that f is complex linear and f' I F =f. To show that 1f'(e) 1:5 II e II for all e E E, write f'(e) = 1f'(e) lexp(i9), so complex linearity of f' implies f'(e· exp(- i9» E JR, and hence

1f'(e) I

= f'(e· exp(- is)) = (Re f)'(e . exp(- i9»

:5 II e . exp (- i9) II

= II ell·

•

2.2.13 Corollary Let (E, II . II) be a normed space, F c: E a subspace, and f E F* (the topological dual). Then there exists f' E E* such that f' I F = f and II f' II = II f II.

67


Proof We can assume f"'O. Then III e III = II filII e II is a norm on E and I f(e) I $11 fll·1I e I :: III e III for all e E F. Applying the preceding theorem we get a linear map f: E ~ lR (or C) with the properties f

IF

= f and I f '(e) I $ III e III for all e

since f extends f. it follows that II f II

E. This says that II f II $ II f II. and

E

II f II; i.e.. II f' II = II f II and f

$

E*. •

E

Applying the corollary to the linear function ae 1-+ a. for e E E a fixed element. we get the following. 2.1.14 Corollary Let E be a normed vector space and e '" 0. Then there exists f E E* such

that f(e) '" O. In other words if f(e) =

°

for all f E E*. then e = 0; i.e .• E* separates points of

E. 2.2.15 Open Mapping Theorem of Banach and Schauder Let E and F be Banach

spaces and suppose A E L(E. F) is onto. Then A is an open mapping. Proof To show A is an open mapping. it suffices to prove that A(cl(DI(O») contains a disk centered at zero in F. Let r> 0. Since E = Un~IDnr(O). it follows that F = Un~I(A(Dnr(O») and hence U n~lcl(A(Dnr(O») = F. Completeness of F implies that at least one of the sets cl(A(Dnr(O») has a nonempty interior by the Baire Category Theorem 1.7.2. Because the mapping e E E 1-+ ne C

E

E is a homeomorphism. we conclude that cl(A(Dr(O))) contains some open set V

F. We shall prove that the origin of F is in int{cl[A(D/O»]} for some r> O. Continuity of

(ei' e2 )

E

E x E 1-+ e l - e2

{e l - e2 1 el' e 2

E

U}

E

E assures the existence of an open set U

D/O). Choose r> 0 such that Dr«O»

C

C

C

E such that U - U =

U. Then cl(A(D/O»)::J

cl(A(U) - A(U» ::J cl(A(U» - cl(A(U» ::J V-V. But V - V = U e EV(V - e) is open and clearly contains 0 E F. It follows that there exists a disk D1(O)

C

F such that D1(O)

C

cl(A(Dr(O»).

Now let £(n) = 1/2n+l. n = 0.1.2•...• so that 1 = l:n~ £(n). By the foregoing result for each n there exists an 11(n) > 0 such that D'1(n)(O)

C

cl(A(DE(n)(O». Clearly 11(n) ~ 0. We

shall prove that D'1(O) C A(cl(DI(O»). For v E D'1(O)(O) C cl(A(DE(O)(O») there exists eo E DE(O)(O) such that II v - Aeo II < 11(1) and thus v - Aeo E cl(A(D E(1)(O»). so there exists e l E DE(I)(O) such that II v - Aeo - Ae l II < 11(2). etc. Inductively one constructs a sequence en E D'1(n) such that II v - Aeo - ... - Aen II < 11(n+ 1). The series l:n ~ 0 en is convergent because m

I

m

L ei II i=n+1 L 1d+ i=n+1

1•

$

and E is complete. Let e = l:n ~ 0 en

E

L- 1/2

n+ 1

n=O

E. Thus

Ae=LAen=v. n=O and

= 1,

68


II e II i.e., v E O,,(olO) implies v

-

~ ~ II en II ~ ~o 2n1+ 1 =

=Ae,

II e II ~

1; that is, 0,,(0)(0) c: A(c1(OI(O»). •

An immediate consequence is the following.

2.2.16 Banach's Isomorphism Theorem A continuous linear isomorphism of Banach spaces is a homeomorphism.

Thus, if F and G are closed subspaces of the Banach space E and E is the algebraic direct sum of F and G, then the mapping (e, e') E F x G ...... e + e' E E is a continuous isomorphism, and hence a homeomorphism; i.e., E = F e G; this proves the comment at the beginning of Supplement 2.1B. 2.2.17 Closed Graph Theorem Let E, F be Banach spaces. A linear map A: E

~

F is

continuous iffits graph r A = {(e, Ae) E Ex Fie E E} is a closed subspace of E CD F.

Proof It is readily verified that r A is a linear subspace of E E9 F. If A E L(E, F), then r A is closed (see Exercise l.4B). Conversely, if r A is closed, then it is a Banach subspace of E e F, and since the mapping (e, Ae) ErA ...... e E E is a continuous isomorphism, its inverse e E E ...... (e, Ae) ErA is also continuous by 2.2.16. Since (e, Ae) ErA ...... Ae E F is clearly continuous, so is the composition e ...... (e, Ae) ...... Ae. • The Closed Graph Theorem is often used in the following way. To show that a linear map A : E ~ F is continous for E and F Banach spaces, it suffices to show that if en ~ 0 and ACn ~ e', then e' = O. 2.2.18 Corollary Let E be a Banach space and F a closed subspace of E. Then F is split iff there exists P E L(E, E) such that PoP =P and F = {e EEl Pe =e}.

Proof If such a P exists, then clearly ker P is a closed subspace of E that is an algebraic complement of F; any e E E is of the form e =e - Pe + Pe with e - Pe E ker P and Pe E F. Conversely, if E = Fe G, define P: E ~ E by P(e) =el' where c =e 1 + e 2, e 1 E F, c2 E G. P is clearly linear, p2 = P, and F = {e EEl Pc =e}, so all there is to show is that P is continuous. Let en =e 1n + e2n ~ 0 and P(en) =e 1n ~ e'; i.e., - e 2n ~ e', and since F and G are closed this implies that e' E F G = (OJ. By the closed graph theorem, P E L(E, E). •

n

2.2.19 Fundamental Isomorphism Theorem Let A E L(E, F) be surjective where E and F are Banach spaces. Then E/ker A and F are isomorphic Banach spaces.

Section 2.2 Linear and M ultiJinear Mappings

Proof The map tel

f-+

Ae is bijective and continuous (since its norm is $

69

II A II ), so it is a

homeomorphism. • A sequence of maps

of Banach spaces is said to be split exact if for all i, ker Ai + 1 =range Ai and both ker Ai and range Ai split. With this terminology, 2.2.18 can be reformulated in the following way: If 0 ~

G

E

~

F

~

~

0 is a split exact sequence of Banach spaces, then E/G is a Banach space

isomorphic to F (thus F;:; G EB F).

2.2.20 Uniform Boundedness Principle of Banach and Steinhaus Let E and F be normed vector spaces, with E complete, and let (A)i E

(II Aie Ill i E

I

is bounded in F, then

Proof Let q>(e) = sup{11 Ai e

Sn

(II Ai II) i E

1

I C

L(E, F). Iffor each e E E the set

is a bounded set of real numbers.

III i E I} and note that

= {e EEl

q>(e) $ n}

=

niEI

{e E E

III ~e II .,; n}

= E. Since E is a complete metric space, the Baire Category Theorem 1.7.2 says that some Sn has nonempty interior; i.e., there exist r > 0 and eo E E such that q>(e)

is closed and Un" 1 Sn $

M, for all e E cl(D/e o))' where M > 0 is come constant. For each i E I, and II e II = 1, we have II Ai(re + eo) II $ q>(re + eo) $ M, so that

i.e.,

II Ai II.,; (M + q>(eo))/r for all i E I. •

2.2.21 Corollary If (An) C L(E, F) is a strongly convergent sequence (i.e., limn ..... ~ Ane = Ae exists for every e E E), then A E L(E, F). Proof A is clearly a linear map. Since {Ane} is convergent, it is a bounded set for each e E E, that by 2.2.20, (II ~ II) is bounded by, say, M > O. But then

SO

i.e., A

E

L(E, F)..

b

70

Chapter 2 Banach Spaces and DjfferentiIJI Calculus

Exercises 2.2A

If E =]Rn and F =]Rm with the standard norms. and A: E --+ F is a linear map. show that (i) II A II is the square root of the absolute value of the largest eigenvalue of AA T. where AT is the transpose of A and (ii) if n. m ~ 2. this norm does not come from an inner product. (Hint: Use Exercise 2.IA.)

2.2B Let E = lItn and F =]Rn with the standard norms and A. B E L(E. F). Let ( A. B ) = trace(ABT). Show that this is an inner product on L(E. F). 2.2C Show that the map L(E. F) x L(F. E) --+ lit;

(A. B)

1-+

trace(AB)

gives a (natural) isomorphism L(E. F)*;: L(F. E). 2.2D Let E. F. G be Banach spaces. A: DeE --+ F. and B : DeE --+ G two closed operators with the same domain D. (See Supplement 7.4A for a full account of closed operators; all that is needed here is the definition). Show there exist constants MI' M2 > 0 such that II Ae II ~ MI(IIBell+llell) and

II Bell

~

MiliAell+llelD

forall eE E. (Hint: Norm EeG by II (e.g) 11= lie 1I+lIgll and define T:rB--+G by T(e. Be) =Ae. Use the Closed Graph Theorem to show that T E L(rB' G).) 2.2E

Linear transversality Let E. F be Banach spaces. Fo C F a closed subspace. and T E L(E. F). T is said to be transversal to Fo. if il(Fo) splits in E and T(E) + Fo = {Te + fie E E. f E F} = F. Prove the following. (i) T is transversal to Fo iff 1t 0 T E L(E. FIFo) is surjective with split kernel; here 1t: F --+ FIFo is the projection. (ii) If 1t 0 T E L(E. FIFo) is surjective and Fo has finite codimension. then ker(1t 0 T) has the same codimension and T is transversal to Fo. (Hint: Use the algebraic isomorphism T(E)/(Fo T(E» ;: (T(E) + Fo)lFo to show Elker(1t 0 T);: FIFo; now use 2.2.18.) (iii) If 1t 0 T E L(E, FIFo) is surjective and if ker T and Fo are finite dimensional. then ker(1t 0 T) is finite dimensional and T is transversal to Fo. (Hint: Use the exact sequence 0 --+ ker T --+ ker(1t 0 T) --+ Fo T(E) --+ 0.)

n

n

2.2F

Let E and F be Banach spaces. Prove the following. (i) If f E cl(8([a. bl. L(E. F») and e E E. then

Section 2.2 Linear and MulJiJinear Mappings

ff(t)edt

71

=( ff(t) dt J(e).

(Hint: T 1-+ Te is in L(L(E. F), F).) (ii)

If f E cl(8([a, b], JR) and v E F, then

(Hint: t

(iii)

1-+

multiplication by t in F is in L(JR, L(F, F»); apply (i).)

Let X be a topological space and f: [a, b] x X --t E be continuous. Then the mapping

g : X --t E, g(x) =

f

f(t, x)dt

is continuous. (Hint: for t E JR, x' E X and E> 0 given, II f(s, x) - f(t, x') II < E if (s, x) E U I X U X ',!; use compactness of [a, b] to find Ux' as a finite intersection and such that II f(t, x) - f(t, x') II < E for all t E [a. b]. x E U x") 2.2G Show that the Banach Isomorphism Theorem is false for normed incomplete vector spaces

in the following way. Let E be the space of all polynomials over JR normed as follows:

II (i) (ii)

(iii)

aa + a l x + ... + a"XR II

= max {I

aa I•...• 1a" I}.

Show that E is not complete. Define A: E --t E by

and show that A E L(E, E). Prove that A-I : E --t E exists. Show that A-I is not continuous.

2.2H Let E and F be Banach spaces and A E L(E. F). If A(E) has finite codimension. show

that it is closed. (Hint: If Fo is an algebraic complement to A(E) in F, show there is a continuous linear isomorphism E/ker A ;: FIFo; compose its inverse with E/ker A --t A(E).)

2.21

Symmetrization operator Define Symk : Lk(E. F)

--t

Lk(E. F), by

Chapter 2 Banach Spaces and DjfferenJiol Calculus

72

SymkA = k!

LoA,

ae!\

where (oA)(el' ... , ~) = A(ecr(l)' ... , ecr(k». Show that: (i) Symk(Lk(E, F» = Lk.(E, F). (ii) (Symk)2 = Symk.

2.2J

II Symk II ~ 1.

(iii) (iv)

If F is Banach, then Lks(E, F) splits in Lk(E, F). (Hint: Use 2.2.18.)

(v)

(SymkA)' = A'.

Show that a k-multilinear map continuous in each argument separately is continuous. (Hint: for k = 2: If II e l II ~ 1, then II A(el' e 2) II ~ II A(·, e 2) II, which by the uniform boundedness principle implies that II A(el' . ) II

2.2K

~

M for II e l II

~

1.)

(i) Prove the following theorem of Mazur and Ularn [1932] following the steps below (see Banach [1955], p. 166): Every isometric surjective mapping O

e->O

2.3.4 Linearity of the Derivative Let f, g : U C E ~ F be r times differentiable mappings and a a real (or complex) constant. Then af and f + g: U C E ~ Fare r times differentiable

with D'(f + g)

= D'f + D'g

and D'(af)

= aD'"f

.

Section 2.3 The Derivative

77

Proof If u E U and e E E. then f(u + e) = f(u) + Df(u)·e + o(e)

and

g(u + e) = g(u) + Dg(u) . e + o(e) •

so that adding these two relations yields (f + g)(u + e) = (f + g)(u) + (Df(u) + Dg(u» . e + o(e) The case r> 1 follows by induction. Similarly af(u + e) = af(u) + aDf(u) . e + ao(e) = af(u) + aDf(u) . e + o(e). • 2.3.5 Derivative of a Cartesian Product Let fi : U

differentiable mappings.

Then f= fl x ... x fn : U c E (fl(u) •...• fn(u» is r times differentiable and

E

C

~

FI

~

Fi' 1:0::; i :0: ; n. be r times Fn defined by f(u) =

X ... X

Proof For u E U and e E E. we have f(u + e) = (fl(u + e) •...• fn(u + e» = (fl(u) + Dfl(u)· e + o(e) •...• fn(u) + Dfn(u) . e + o(e» = (fl(u) ....• fn(u» + (Dfl(u) •...• Dfn(u» . e + (o(e) •...• o(e» = f(u) + Df(u) . e + o(e) . the last equality follows using the sum norm in FI x ... x Fn :

II (o(e) •...• o(e» II = II o(e) II + ... + II o(e) II • so (o(e) •...• o(e»

=: o(e).

•

Notice from the definition that for L

E

L(E. F). DL(u) = L for any u E E. It is also clear

that the derivative of a constant map is zero. Usually all our spaces will be real and linearity will mean real-linearity. In the complex case. differentiable mappings are the subject of analytic function theory. a subject we shall not pursue in this book (see Exercise 2.3F for a hint of why there is a relationship with analytic function theory). In addition to the forcgoing approach. there is a more traditional way to differentiate a function f: U C JRn ~ JRm. We write out f in component form using the following notation: f(xl •...• xn) = (fl(x I •... xn) •...• F(x I •... xn» and compute partial derivaties. afi/ax i for j = 1•...• m and i = 1•...• n. where the symbol afi/ax i means that we compute the usual derivative of f.i with respect to xi while keeping the other variables x I....• xi-I. xi+l •...• xn fixed. For f: JR ~ JR. Df(x) is just the linear map "multiplication by df/dx." i.e.. df/dx = Df(x)·l. This fact. which is obvious from the definitions. can be generalized to the following theorem.

Chapter 2 Banach Spaces and Di/lerential Calculus

78

2.3.6 Proposition Suppose that U c: jRD is an open set and that f: U ~ jRm is differentiable. Then the partial derivatives ap/ax i exist. and the matrix of the linear map Df(x) with respect to the standard bases in jRD and jRm is given by afl axl ar axl

afl

afl

ax 2

ax D

ar

ar

ax 2

axD

where each partial derivative is evaluated at x = (xl •...• xD). This matrix is called the Jacobian matrix of f. Proof By the usual definition of the matrix of a linear mapping from linear algebra. the (j. i)th matrix element ai i of Df(x) is given by the jib component of the vector Df(x)· ei• where e l • ...• eD is the standard basis of JRD. Letting y = x + hei• we see that II f(y) - f(x) - Df(x)(y - x) II II Y_ x II

1

= liJ II f(x , •...• x'+h •...• x I '

D

I

) -

f(x •...• xD )

-

hDf(x)ei II

approaches zero as h ~ O. so the jib component of the numerator does as well; i.e.• lim h..... O

-Ih1-I

1 fj

. I (x I •...• xi +h •...• xD ) - f(x •...• xD ) - hal.i 1 = O.

which means that aii = ap/ax i. • In computations one can usually compute the Jacobian matrix easily. and this proposition then gives Df. In some books. Df is called the differential or the total derivaJive of f. 2.3.7 Example Let f:

jR2 ~ JR.3.

whose matrix in the standard basis is

f(x, y) = (x 2 • x3y. x4y 2). Then Df(x, y) is the linear map

79


afl

afl

ax ay ar al ax ay ar

ax

2x

0

3x2y

x

2x4y

4x 3l

ar

ay

3

One should take special note when m = 1, in which case we have a real-valued function of n variables. Then Df has the matrix

[4

'00.'

ax

afn ] ax

and the derivative applied to a vector e =(aI, ... , an) is n

Df(x) . e =

L i=1

~ ai ax'

It should be emphasized that Df assigns a linear mapping to each x E U and the definition of Df(x) is independent of the basis used. If we change the basis from the standard basis to another one, the matrix elements will of course change. If one examines the definition of the matrix of a linear transformation, it can be seen that the columns of the matrix relative to the new basis will

be the derivative Df(x) applied to the new basis in JRn with this image vector expressed in the new basis in JRm. Of course, the linear map Df(x) itself does not change from basis to basis. In the ease m = 1, Df(x) is, in the standard basis, a 1 x n matrix. The veetor whose components are the same as those of Df(x) is called the gradient of f, and is denoted grad f or Vf. Thus for f: U C JRn -t JR, grad f=[4 ax

,00',

afn ] ax

(Sometimes it is said that grad f is just Df with commas inserted!) The formation of gradients makes sense in a general inner product spaee as follows.

2.3.8 Definition (i) Let E be a normed space and f: U C E -t JR be differentiable so that Df(u) E L(E, JR) = E*. In this case we sometimes write df(u) for Df(u) and call df the differential of f. Thus df: U-t E*. (ii) If E is a Hilbert space, the gradient of f is the map grad f = Vf: U -t E

defined by

( Vf(u), e

>=

df(u)· e

80

Chapter 2 Banach Spaces and DilJerentiBJ Calculus

where df(u)· e means the linear map df(u) applied to the vector e. Note that the existence of Vf(u) requires the Riesz Representation Theorem (see 2.2.5). The notation Of/au instead of (grad f)(u)

= Vf(u)

is also in wide use, especially in the case in

which E is a space of functions. See Supplement 2.4C below.

,» be a real inner product and f(u) = II u 112. Since II u 112 = II Uo 112 II u - Uo 11 2, we obtain df(Uo)' e = 2( u o' e) and thus Vf(u) = 2u. Hence f is

2.3.9 Example Let (E, ( + 2( u o' u - u o ) + of class

ct.

But since Df(u)

=2( u, .)

E

E* is a continuous linear map in u

E

E, it follows

that D2f(u) = Df E L(E, E*) and thus I)kf = 0 for k ~ 3. Thus f is of class C"". The mapping f considered here is a special case of a polynomial mapping (see 2.2.10). • We close this section with the Fundamental Theorem of Calculus in real Banach spaces. First a bit of notation. If cp : U c: JR.

~

F is differentiable, then Dcp(t)

L(lR, F) is isomorphic to F by A t-+ A(l), 1 E JR.; note that

~;

cp'(t) =

= Dcp(t)· I, 1

E

E

L(JR., F). The space

II A II = II A(1) II. We denote JR..

cp'(t) = lim cp(t + h) - cp(t) h ..... O h and cp is differentiable iff cp' exists. 2.3.10 Fundamental Theorem of Calculus (i) If g: [a, bl ~ F is continuous,for F a real

normed space, the map f: (a, b)

~F

defined by

f(t)

= fg(S)dS

is differentiable and we have f' =g. (ii) If f: [a, bl ~ F is continuous, is differentiable on la, b[ and f' extends to a continuous map on [a, bl, then f(b)-f(a) =

Proof (i) Let to

E

f

f'(s)ds.

la, b[. Since the integral is linear and continuous,

II f(to + h) - f(to) - hg(to) II

=

II

J'o'o+h (g(s) - g(to» ds

:;> Ihl sup {II g(s)-g(to) II The sup on the right-hand side

~

II

:;>

I to:;> s:;> to+h}

0 as I h I ~ 0 by continuity of g at to.

81


(ii) Let

h(t) =

(f

f '(s) ds ) - f(t).

*

By (i). h'(t) =0 on lao b[ and h is continuous on [a. bl. If for some t E [a. bl. h(t) h(a). then by the Hahn-Banach Theorem there exists a E F* such that (a 0 h)(t) (a 0 h)(a).

*

Moreover. a 0 h is differentiable on lao b[ and its derivative is zero (Exercise 2.3D). Thus by elementary calculus. a 0 h is constant on [a. bl. a contradiction. Hence h(t) = h(a) for all t E [a. bl. In particular. h(a)

=h(b).

•

Exercises 2.3A Let B: E x F --t G be a continuous bilinear map of normed spaces. Show that B is COO and that DB(u. v)(e. f) = B(u. f) + B(e. v) . 2.3B Show that the derivative of a map is unaltered if the spaces are renormed with equivalent norms. 2.3C If f E Sk(E. F). show that

and IYf(O)

=0

for i

= 1•...• k -

I .

2.3D Let f: U C E --t F be a differentiable (resp. C,) map and A E L(F. G). Show that A 0 f: U C E --t G is differentiable (resp. 0) and D'(A 0 f)(u) = A 0 D'f(u). (Hint: Use induction.) 2.3E

Let f: U

C

E --t F be r times differentiable and A E L(G. E). Show that

exists for all i ~ r. where v is an affine map. 2.3F

(i) (ii)

E

A-I (U). and gl' .... gi E G. Generalize to the case where A

Show that a complex linear map A E L(C. C) is necessarily of the form A(z) = A.z. for some A. E C. Show that the matrix of A E L(C. C). when A is regarded as a real linear map in

82


(Hint: A = a + ib.) (iii)

Show that a map f: U

C ~ C. f = g + ih. g. h : U

C

C

]R2 ~ JR is complex

differentiable iff the Cauchy-Riemann equations ag ah ax=ay'

ag ah ay=-ax

are satisfied. (Hint: Use (ii) and 2.3.6.) 2.3G Let (E. (

.»

be a complex inner product space. Show that the map f(u) = II u 112 is

not differentiable. Contrast this with 2.3.9. (Hint: Df(u). if it exists. should equal 2Re( ( u •. ) ).) 2.3H Show that the matrix of D2f(x)

E

L2(JRR. JR) for f: U

a2 f

a2f

axiaxi

ax iax2

a2f

a2f

axRaxi

axRax 2

C

JRR -+ JR. is given by

(Hint: Apply 2.3.6. Recall from linear algebra that the matrix of a bilinear mapping B E L(JRR.]Rm; JR) is given by the entries B(ej (first index = row index. second index =

.9

column index). where {el' .... eR} and {fl' .... fm } are ordered bases of JRR and JRm • respectively.)

§2.4 Properties of the Derivative In this section some of the fundamental properties of the derivative are developed. These properties are analogues of rules familiar from elementary calculus. Let us begin by strengthening the fact that differentiability implies continuity. 2.4.1 Proposition Suppose U

C

E is open and f: U ~ F is differentiable on U. Then f is

continuolLS. InfactJor each \10 E U there is a constant M > 0 and a

00 > 0 with the property

that II u - Uo II < 00 implies II feu) - f(u o) II :S M II u - Uo II. (This is called the Lipschitz property.)

Proof

III feu) - f(uo) II -II Df(uo) . (u - \10) III :S II feu) - f(uO> - Df(uo) . (u - uo) II

= II o(u - uo) II :S II u -

Uo

II

for II u - U o II :S 00 , where 00 is some positive constant depending on uo; this holds since lim u --> "0

o(u - no) --;;"'=0.

lIu-no II

Thus,

II feu) - f(u o) II :S II Df(ua> . (u - uo) II + II u - U o II :S (II Df(uo) II + 1)11 u - Uo II

Perhaps the most important rule of differential calculus is the chain rule. To facilitate its statement, the notion of the tangent of a map is introduced. The text will begin conceptually distinguishing points in U from vectors in E. At this point it is not so clear that the distinction is important, but it will help with the transition to manifolds in Chapter 3. 2.4.2 Definition Suppose f: U Tf: U x E

~

C

E ~ F is of class Cl. Define the tangent of f to be the map

FxF

given by

Tf(u, e)

= (f(u), Df(u) . e)

where we recall that Df(u) . e denotes Df(u) applied to e E E as a linear map. If f is of class cr, define Trf = T(T'-l f) inductively.

From a geometric point of view, T is more "natural" than D. If we think of (u, e) as a vector with base point u, and vector part e then (f(u), Df(u) . e) is the image vector with its

84

Chapter 2 Banach Spaces and Dilferentiol Calculus

base point f(u), as in Fig. 2.4.1. Another reason for this is its simple behavior under composition,

as given in the next theorem.

l

E

f

~

U

F

\~(U) ff(U)'e

Figure 2.4.1 2.4.3 C r Composite Mapping Theorem Suppose f: U

E -+ V

C

are differentiable (respectively Cr ) maps. Then the composite g

F and g: V

C 0

f :U

C

C

F -+ G

E -+ G is also

differentiable (respectively 0) and T(g 0 f) = Tg 0 Tf,

(respectively, T(g 0 f)

=Tg

0

Tf). Theformula T(g 0 f) = Tf 0 Tf is equivalent to the chain rule

in terms of D: D(g 0 f)(u) = Dg(f(u»

Proof Since f is differentiable at u

E

0

Df(u).

U and g is differentiable at f(u)

f(u + e) = f(u) + Df(u) . e + o(e) for e

E

E

V, we have

E

and for v = f(u) we have g(v + w) = g(v) + Dg(v) . w + o(w). Thus (g 0 f)(u + e) = g(f(u) + Df(u) . e + o(e»)

=(g

0

f)(u) + Dg(f(u» . (Df(u) . e) + Dg(f(u»(o(e» + o(Df(u) . e + o(e».

For e in a neighborhood of the origin, II Df(u) . e + o(e) II II ell

;5;

(II Df(u) II +

lIo(l~~I)1I ) M

for some constant M > 0, and IIDg(f(u» . o(e)1I

;5;

IIDg(f(u)1I lIo(e)lI.

;5;

M

Section 2.4 Properties of the Derivative

85

Thus

II o(lli(u)· e + o(e» II lie II

II (o(Df(u) . e + o(e» II II Df(u)· e + o(e) II

II Df(u) . e + o(e) II II e II

------~M

II (o(Df(u) . e + o(e» II . I Df(u) . e + o(e) II

Hence we conclude Dg(f(u» . (o(e» + o(Df(u) . e + o(e» = o(e) and thus D(g 0 f)(u) . e = Dg(f(u» . (Df(u) . e). Denote by cp: L(F. G) x L(E. F) --7 L(E. G) the bilinear mapping cp(B. A) = BoA and note that cp

E

L(L(F. G). L(E. F); L(E. G») since II BoA II ~ II B II II A II; i.e .• II cp II ~ 1. Let

(Dg 0 f) x Df: U --7 L(F. G) x L(E. F) be defined by [(Dg 0 f) x Df](u) = (Dg(f(u». Df(u»; notice that this map is continuous if f and g are of class CI. Therefore the composite function cp 0 (Dg 0 f) x Df) = D(g 0 f) : U --7 L(E. G) is continuous if f and g are CI. that is. go f is CI.

Inductively suppose f and g are

cr.

Then Dg is

cr- I •

so Dg 0 f is Cr- I and thus the

map (Dg 0 f) x Df is Cr - I (see 2.3.5). Since cp is COO (Exercise 2.3A). again the inductive hypothesis forces cp 0 «Dg 0 f) x Df) = D(g 0 f) to be Cr- I ; i.e .• go f is cr. The formula T'(g 0 f) = T'g 0 Ff is a direct verification for r = 1 using the chain rule. and the rest follows by induction. • If E=JRm. F=JRD. G=JRP

and f=(fl •...• fD). g=(gl •...• gP).where P:U--7JR

and gi : V --7 JR. by 2.3.6 the chain rule becomes 1

a(g 0 f) (x)

af(x)

which. when read componentwise. becomes the usual chain rule from calculus: a(gof)i(x) =

i

axi The chain rule applied to B fOllowing.

k=1

E

agi(f(>..» af(x) t

al

i= 1,... , m.

axi

L(Ft' F2; G) and fl x f2 : U

C

E --7 FI

X

F2 yields the

86

Chapter 2 Banach Spaces and DijJerentiol Calculus

2.4.4 The Leibniz Rule Let fj: U

C

E ~ Fj' i = 1,2, be differentiable (respectively Cr ) 0 (fl x f2) : U C E ~ G is

maps and B E L(FI' F 2; G). Then the mapping B(fl' f2) = B differentiable (respectively 0) and

In the case FI = F2 =JR and B is multiplication, 2.4.4 reduces to the usual product rule for derivatives. Leibniz' rule can easily be extended to multilinear mappings (Exercise 2.4C). The first of several consequences of the chain rule involves the directional derivative. 2.4.5 Definition Let f: U direction e E E at u

C

E ~ F and let u E U. We say that f has a derivative in the

if ddt f(u+te)1 1=0

exists. We call this element of F the directional derivative of f in the diurection e at u.

Sometimes a function all of whose directional derivatives exist is called Gaeaux differentiable, whereas a function differentiable in the sense we have defined is called Frichet differentiable. The latter is stronger, according to the following. (See also Exercise 2.4J.)

2.4.6 Proposition If f is differentiable at u, then the directional derivatives of f exist at u and are given by ddt feu + te)

I

1=0

=

Df(u) . e .

Proof A path in E is a map from I into E, where I is an open interval of lR. Thus, if c is differentiable, for tEl we have Dc(t) E L(lR, E), by definition. Recall that we identify L(JR. E) with E by associating Dc(t) with Dc(t) . I (1 E JR). Let dc dt(t) = Dc(t)· I . For f: U rule that

C

E ~ F of class C l we consider f 0 c, where c : I ~ U. It follows from the chain d

dt (f(c(t)))

=

D(f

0

c)(t) . I

=

~

Df(c(t» . dt .

The proposition follows by choosing c(t) = u + te, where u, e E E, I = 1 - A, A[, and A is sufficiently small. • For f: U C JRn ~ JR, the directional derivative is given in terms of the standard basis {el' .... en} by

Section 2.4 Properties o/the Derivative

Df(u) . e = -

df

x

dX I where e

= x lei + ...

I

87

df n + ... + - - x , dx n

+ xne n • This follows from 2.3.6 and 2.4.6.

The formula in 2.4.6 is sometimes a convenient method for computing Df(u)· e. For example, let us compute the differential of a homogeneous polynomial of degree 2 from E to F. Let f(e) = A(e, e), where A

E

L2(E; F). By the chain and Leibniz rules,

Df(u) . e = dd A(u + te, u + te) t

I

1=0

= A(u, e) + A(e, u) .

If A is symmetric, then Df(u)· e = 2A(u, e). One of the basic tools for finding estimates is the following.

2.4.7 Proposition Let E and F be real Banach spaces, f: U

C

E ~ F a Cl-map, x, Y E U,

and c a C l arc in U connecting x to y; i.e., c is a continuous map c: [0, 1] ~ U, which is Cion ]0, 1[, c(O)

= x, and

c(1)

= y.

Then 1

f(y) - f(x) = 50 Df(c(t» . c'(t) dt .

If U is convex and c(t) = (1 - t)x + ty, then f(y)-f(x)

Proof If g(t)

= 50IDf(O-t)x+ty).(y-x)dt =

= (f

0

(fDf(O-t)X+ty)dt).(y-x).

c)(t), the chain rule implies g'(t)

= Df(c(t»

. c'(t) and the fundamental

theorem of calculus gives 1

g(I) - g(O) = 50 g'(t) dt ,

which is the first equality. The second equality for U convex and c(t) 2.2F(i). •

2.4.8 Mean Value Inequality Suppose U for all x, y E U

C

=(1 -

E is convex and f: U

II f(y) - f(x) II ~ [suPO:SI:SI II Df«l - t)x + ty)

C

t)x + ty is Exercise

E ~ F is C l . Then

liJ II x - y II.

Thus, if II Df(u) II is uniformly bounded on U by a constant M > 0, then for all x, y II f(y) - f(x) II ~ Mil y - x II. If F = JR, then f(y) - f(x) = Df(c)· (y - x) for some c on the line joining x to y.

E

U


88

Proof The inequality follows directly from 2.4.7. The last assertion follows from the intermediate value theorem as in elementary calculus. •

2.4.9 Corollary Let U C E be an open set; then the/ollowing are equivalent: (i) U is connected; (ii) every differentiable milP f: U C E ---+ F satisfying Df = 0 on U is constant. Proof If U = U 1 U U 2 , U 1 n U 2 = 0, where U 1 and U 2 are open, then the mapping

1

0, if u

f(u) =

E

U1

e, if u E U 2

where e E F, e"", 0 is a fixed vector, has Df = 0, yet is not constant. Conversely, assume that U is connected and Df = O. Then f is in fact C-. Let

Uo E U be fixed and consider the set S = {u E U I f(u) = f(u o)}' Then S"'" 0 (since Uo E S), S C U, and S is closed since f is continuous. We shall show that S is also open. If u E S, consider

v E Dr(u)

C

U and apply 2.4.8 to get

II f(u)-f(v) II

~ sup {

II Df«I-t)u+ tv) III t E [0, III II u-v II =0;

i.e., f(v) = f(u) = f(u o) and hence Diu)

C

S. Connectedness of U implies S = U. •

If f is Giiteaux differentiable and the Giiteaux derivative is in L(E, F); i.e., for each u there exists Gu E L(E, F) such that ddt f(u + te)

I

1=0

=

E

V

Gue ,

and if u ...... G u is continuous, we say f is Cl-Gateaux. The mean value inequality holds, replacing CI everywhere by "Cl-Giiteaux" and the identical proofs work. When studying differentiability the following is often useful. 2.4.10 Corollary If f: U

C

E ---+ F is CI-Gateaux then it is CI and the two derivatives coincide.

Proof Let u E U and work in a disk centered at u. Proposition 2.4.7 gives

and the sup converges to zero as, e ---+ 0, by uniform continuity of the map t E [0, 1] ...... Gu+1e L(E, F). This says that Df(u)· e exists and equals Gue. •

E

Section 2.4 Properties a/the Derivative

89

It will be convenient to consider partial derivatives in this context. We shall discuss only functions of two variables. the generalization to n variables being obvious.

2.4.11 Definition Let f: U -+ F be a mapping defined on the open set U C EI EEl Ez and let Uo = (uOI ' u02 ) E U. The derivatives of the mappings v I 1-+ f(v l' Uo2)' v 2 1-+ f(uol' v2). where vI E EI and v2 E Ez. if they exist, are called partial derivatives of f at Uo E U and are denoted by Dlf(u o) E L(EI' F). Di(u o) E L(E2, F). 2.4.12 Proposition Let U C EI EEl E2 be open and f: U -+ F. (i) If f is differentiable, then the partial derivatives exist and are given by

= Df(u) . (el' 0)

D/(u) . e l

and

D2f(u)· e 2 = Df(u) . (0.

~).

(ii) If f is differentiable, then

Df(u) . (el' e 2) = DI f(u) . e l + D2f(u) . e 2 . (iii) f is of class Cr

Proof (i) Let

j!

X:EI

iff Dl: U -+ L(Ei • F), i = 1,2 both exist and are of class Cr - I .

-+ EI EEl ~ be defined by .i.!(VI) = (vI' u2)' where u = (ul' U2). Then

is C~ and D.i.!(uI) =

]1 E

L(EI' EI EEl~) is given by ]I(el) = (el'O). By the chain rule, Dlf(u) = D(f ° j!)(uI) = Df(u) ° ]1'

which proves the first relation in (i). One similarly defines t']2 and proves the second relation. ]1°

(ii) Let Pi (el' e 2) = ei• i = 1.2 be the canonical projections. Then compose the relation PI +]2 ° P2 = identity on EI EEl E2 with Df(u) on the left and use (i).

(iii) Let «l>i E L(L(E I EEl Ez. F). L(Ei' F» and 'I'i E L(L(Ei, F). L(E I EEl E 2• F» be defined by «l>i(A) = A o]i and 'I'i (B i) = Bi ° Pi' i = 1.2. Then (i) and (ii) become

Dl=«l>ioDf.

Df='I'loDlf+'I'2oD2f.

This shows that if f is differentiable. then f is Cr iff Dlf and D2f are Cr - I. Thus to conclude the proof we need to show that if Dlf and D2f exist and are continuous, then Df exists. By 2.4.7 applied consecutively to the two arguments. we get f(u l + el' ~ + e2) - f(ul' u2) - Dlf(ul' u2) . e l - D2f(ul' u2) . e 2 = f(u l + el' u 2 + e 2) - f(ul'

~

+ e2) - Dlf(ul' u 2) . e l

+ f(ul' u2 + e2) - f(ul' u2) - D2f(ul' =

(f +

~)

. e2

(Dlf(ul + tel' U2 + e2) -Dlf(ul. u2» dt ) . el

(fal (D2f(ul' U2 + te2) - D2f(uI.u2)dt ) . e2.

90


II e l II ~ II e l II + II e 2 II - II(e l • e2)11.

Taking nonns and using in each tenn the obvious inequality we see that

II f(u l + e p u2 + e2) - f(u p

~

Uz) - Dlf(u p u 2)·e l - D2f(u p u 2) . e 2 II

( sup

OStSI

II Dlf(ul + tel' u2 + e2) - Dlf(ul' u2 + e2) II

Both sups in the parentheses converge to zero as (e l • e 2)

~

(0. 0) by continuity of the partial

derivatives. • If EI

=E2 = \R.

and {el' e 2 } is the standard basis in \R.2 we see that

af (x. y) -_ lim ...:f(_x_+_h.:....:. y'-!-)_-_f.o....(x..;....:.. :. y) = D I f (x. y) . el E F. a x h-->O h Similarly. (af!ay)(x. y) = DzC(x. y) . e2 E F. Define inductively highcr dcrivatives

2.4.13 Example As an application of the fonnalism just introduced we shall prove that for f: U \R. 2 ~

C

F 2

D feu) . (v. w) = v

I

I W

a 2f I 2 a 2f 2 I a2f 2 2 a 2f (u) + V W ~.ax (u) + v w a ~. (u) + v W - 2 (u) • ax V] XV] ay

-2

if

(vi. v2 )

a 2f ayax (u)

-(u) ax 2 a 2f axay (u)

a 2f -(u) al

(::)

E U. V. WE \R. 2• V = vlel + v 2e 2 • w = wle l + w 2 e 2 • and {e l • e 2 } is the standard basis of \R.2. To prove this. note that by definition.

where u

D2f(u) . (v. w) = D(Df)(·) . w)(u) . v Applying the chain rule to DfO' w = Tw : A

E

L(\R. 2 • F)

f-+

A· w

E

F. the above equals

D(DfO . w)(u) . v (by 2.4.12(ii»


91

D ( w Iaf - + w2af) (u)' v ax dy

ay

af ) (u)· v Iel + D2 ( ax af ) (u)· v2~ ] + W2 [ DI ( dy af) (u)· v Iel + D2 ( af) (u)· v2e2 ] WI[ DI ( ax I I a 2f 2 I a 2f I 2 a 2f 2 2 a 2f v w -2 (u) + v w a ::l,. (u) + v w ::l,.a (u) + v w -2 (u). • ax XV] V] x ay For computation of higher derivatives, let us note that by repeated application of 2.4.6,

In particular for f: U

C

Rm

~

RD the components of I>'"f(u) in terms of the standard basis are

Thus f is of class Cr iff all its r-th order partial derivatives exist and are continuous. 2.4.14 Proposition (L. Euler) If f: U

C

E

~

Cr, then Drf(u)

E

Us(E, F); i.e., Drf(u) is

symmetric. Proof First we prove the result for r =2. Let u E U, v, wEE be fixed; we want to show that D2f(u) . (v, w) = D2f(u) . (w, v). To this, define the linear map a: ~2 ~ E by a(e l ) = v, and a(e2) = w, where e l and e2 are the standard basis vectors of R2. For (x, y)

=xv + yw.

Now define the affine map A: R2 ~ E by A(x, y)

=u

E

]R2, then a(x,y)

+ a(x, y). Since D2(f 0

A)(x, y) . (el' e 2) = D2f(u) . (v, w) (Exercise 2.3E), it suffices to prove this formula: D2(f . (x,y) . (e l , e2) = D2(f 0 A)(x, y) . (e 2, e l ); i.e.,

(see 2.4.13). Let g

=f

0

A : V = A-I (U)

C

0

A)

R2 ~ F. Since for any A. E F*, a 2(A. 0 g)/axdy

=

A.
p=I ..... r

-

R:D~Us(E.F).

and

where U is some thickening of U. such that for all (u. h)

E

-

U.

Assume f is a Cr mapping. If r = I, then by 2.4.7

f(u + h) = f(u) +

(f

Df(u + th) dt) . h = f(u) + Df(u) . h +

(L

1 (Df(u + th) - Df(u» dt) . h

and the formula is proved. For general k ~ r proceed by induction choosing in the integration by

Chapter 2 Banach Spaces and Dif/erentiIJJ Calculus

94

parts formula EI = JR, E2 = E, B(s, e) = se, 'l'2(t) = J>kf(u + th)·hk , and 'I',(t) = - (1 - tnk!, and taking into account that ( (I - t)k 1 Jo -k-!- dt = (k + 1)1" Since Dkf(u)

E

L\(E, F) by 2.4.14, Taylor's formula follows. •

Note that R(u, h) . hr =o(h') since R(u, h) ~ 0 as h ~ O. If f is Cr+1 then the mean value inequality and a bound on Dr+lf gives R(u, h) . hr = o(hr+I). See Exercise 2.4M for the differentiability of R. The proof also shows that Taylor's formula holds if f is (r - 1) times differentiable on U and r times differentiable at u. The estimate R(u, h) . hr =o(h') is proved directly by induction; for r = 1 it is the definition of the Frechet derivative. If f is COO (that is, is Cr for all r) then we may be able to extend Taylor's formula into a convergent power series. If we can, we say f is of class em, or analytic. A standard example of a COO function that is not analytic is the following function from lR to lR (Fig. 2.4.3)

l

ex P{-l/(1 - x2)} , I x 1 11 0 ,

lim

E

by uniform continuity of f on I since

Since

(g, s) .... (0,0)

for all t

=0

DTg(t)'ST 1/ (g, s) liT

=

0

]0, 1[, by Taylor's theorem for g we get ev(f + g, t + s) = f(t + s) + g(t + s) T

=

L~

(Dif(t) , Si + Dig(t) . si) + R(t, s) . sr

I.

i=O T

= f(t) +

~

1 LJ 'T
i

r

t) . (g, s) + R«f, t), (g, s» . (g, s) ,

i=\

where r

R«f, t), (g, s»' «g\, s\), .. " (gr' ST» = R(t, s), (s\, ... , ST) +

L

Drgi(t)· (S\, ... , Sr),

i=\

which is symmetric in its arguments and R«f, t), (0, 0»

= O.

By the converse to Taylor's

theorem, the proposition is proved if we show that every
~~

Og(f + Eh)(x)

ut

I£=0

= dd g(f(x) + Eh(x)) E

I . £=0

By the chain rule this is Dg(f(x))· h(x). This shows that if Og is differentiable. then DOg must be as stated in the proposition. Proof Let f E CO(M. U). By continuity of g and compactness of M.

II Og(f) - Og(f') II

=

sup

II g(f(m)) - g(f'(m)) II

mEM

is small as soon as

be given by

II f - f II is small; i.e .• Og is continuous at each point

f. Let

102

Chapter 2 Banach Spaces and DifferentWlCalculus

for HE CaCM, U(E; F)), hi"'" hi E CaCM, E) and mE M. The maps Ai are clearly linear and are continuous with II Ai II .,:; 1. Since Dig: U ~ UseE; F) is continuous, the preceding argument shows that the maps

are continuous and hence

is continuous. The Taylor theorem applied to g yields r

I. ~ Dig(f(m))'h(m)i + R(f(m), h(m))'h(mi

g(f(m) + hem)) = g(f(m)) +

1.

i=1

so that defining and [R(f, h)·(h l ,... , hr)](m) = R(f(m), h(m))·(hl(m), ... , h/m)) we see that R is continous, R(f, 0)

=0, and r

ng(f +

kv = g

0

(f + h)

=g f + 0

I. ~ (Dig i=1

1.

Thus by the converse of Taylor's theorem Ding = Ai

0

0

f) . hi

+ R(f, h) . hi

nDig and ng is of class

C .•

This proposition can be generalized to the Banach space C(I, E), I = [a, b], equipped with the norm II· IIr given by the maximum of the norms of the first r derivatives; i.e., II f IIr = max OSi ';'- suptEIIi f{i)(t) II. If g is cr+q, then ng: C(I, E) ~ cr-k(l, F) is Cq+k. Readers are invited to convince themselves that the foregoing proof works with only trivial modifications in this case. This version of the omega lemma will be used in Supplement 4.1C. For applications to partial differential equations, the most important generalizations of the two previous propositions is to the case of Sobolev maps of class HS; see for example Palais [1968], Marsden [1974b] and Marsden and Hughes [1982] for proofs and applications.


103

~

Supplement 2.4C The Functional Derivative and the Calculus of Variations Differential calculus in infinite dimensions has many applications, one of which is to the calculus of variations. We give some of the elementary aspects here. We shall begin with some notation and a generalization of the notion of the dual space. Let E and F be Banach sp~ces. A continuous bilinear functional (x)) r-I 'l'(s) dn x.

.•

Suppose. more generally. that f is defined on a Banach space E of functions q> on a region fl in lR. n . The functional derivative

8f oq>

E

~

of f with respect to q> is the unique element

E • I'f"It eXists. suc hth at

Df(q» . 'l' =

< ~ . 'l' > =

fn (~ )<X)'l'(X) dnx

for all

E

E.

The functional derivative may be determined in examples by

5.n

8f d -r(x)'l'(x) dn x = -d uq> E

I £=0

f(q> + E'l') .

(2)

A basic result in the calculus of variations is the following.

2.4.22 Proposition Let E be a space of functions, as above. A necessary condition for a

differentiable function f: E ~ lR. to have an extremwn at q> is that

o. Proof If f has an extremum at q>. then for each 'l'. the function h(E) = f(cp + E'V) has an extremum at E = O. Thus. by elementary calculus. h'(O) = O. Since 'l' is arbitrary. the result follows from (2). • Sufficient conditions for extrema in the calculus of variations are more delicate. See. for example. Bolza (1904) and Morrey (1966).

105


2.4.23 Examples A Suppose that Q

C

JR. is an interval and that f, as a functional of 0

«

0) for all e

to

0, e E E, then

Uo

is a strict local minimum (maximum) of f. Proof

TDf(uo)(h, h)

(i) By Taylor's formula, in a neighborhood V of U o, 0 ~ f(uo + h) - f(uo) =

+ 0(h2) for all hE V. If e E E is arbitrary, for small t E JR, te E V, so that 0 ~ ..!..D 2f(Uo)(te, te) 2

+ 0(t 2e 2) implies D 2f(u o)(e, e) + 20(t2e 2)!t2 ~ O. Now let t ~ O. (ii) Denote by T: E ~ E* the isomorphism defined by e t--+ D2f(u o) . (e, .), so that there exists a > 0 such that


a II e II ~ II Te II

=

111

sup I (Te, e') I = sup I D2f(Uo) . (e, e')I.

lIe'II=1

Ile'II=1

By hypothesis and symmetry of the second derivative,

which is a quadratic form in s. Therefore its discriminant must be negative, i.e.,

and we get a II e II ~ sup I D2f(uo) . (e, e')1 ~ II D2f(uo) 111/2[D2f(uo) . (e, e)] 112. 110'11=1

Therefore, letting m = a2~1 D2f(u o) II, the following inequality holds for any e

E

E:

Thus, by Taylor's theorem we have f(uo + h) - f(uo)

=

'21 D2f(uo)

2 2 2 . (h, h) + o(h ) ~ m II h II (2 + o(h ).

Let E> 0 be such that if II h II < E, then I 0(h2) I ~ m II h 112/4, which implies f(uo + h) - f(uo) ~ m II h 112/4> 0 for h

* 0, and thus

Uo

is a strict local minimum of f. •

The condition in (i) is not sufficient for f to have a local minimum at uo' For example, = 0, Df(O, 0) = 0, D 2f(O, 0) . (x, y)2 = 2x2 ~ 0 and

f: JR.2 ~ JR., f(x, y) = x2 - y4 has f(O, 0)

in any neighborhood of the origin, f changes sign. The conditions in (ii) are not necessary for f to have a strict local minimum at u o' For example f: JR. ~ IR., f(x) = x4 has f(O)

= £'''(0) = 0,

= £'(0) = f'(O)

[ 0 and 0 is a strict global minimum for f. On the other hand, if the

conditions in (ii) hold and

Uo

is the only critical point of a differentiable function f:

U~

JR., then

Uo is a strict global minimum of f. For if there was another point u l E U with f(u l ) ~ f(uo) on the straight line segment (1 - t)u o + tu l , t E [0, 1] there exists a point u2 such that f(u 2) > f(uo)

;::. f(u l ) since by (ii) ~ uO'

Uo is a strict local minimum. Therefore, there exists u 3 on this segmenI, u3 u l such that f(u 3 ) = f(uo)' But then by the mean value theorem 2.4.8 there exists u 4 uO'

*

u3 such that Df(u 4) = 0 which contradicts uniqueness of the critical point. Finally, care has to be taken with the statemenI in (ii): non-degeneracy holds in the topology of E. If E is continuously embedded in another Banach space F and D2f(u o) is non-degenerate in F only, even be a minumum. For example, consider the smooth map

U

o need not


112

4

f: L ([0, 1)) ~ JR ,f(u) =

1 rl 2 '2J o (u(x) -

4

u(x) ) dx

and note f(O) = 0, Df(O) = 0, and D2f(0)(v, v) =

fol v(d dx

> 0 for vi' 0,

and that D2f(0) defmes an isomorphism of L4([0, 1)) with L 4/3([0, 1]). Alternatively, D 2f(0) is non-degenerate on L2([O, 1]) not on L 4([0,1]). Also note that in any neighborhood of 0 in L 4([0, 1)), f changes sign: f(lIn) = (n 2 - 1)!2n4 ~ 0 for n ~ 2, but f(u n) = -12/n < 0 for n ~ 1 if

1

2, on [0, lin)

~=

0, elsewhere

and both lin, un converge to 0 in L4([0, 1)). Thus, even though D2f(0) is positive, 0 is not a minimum of f. (See Ball and Marsden [1984) for more sophisticated examples of this sort.)

fl:n

Exercises 2.4A Show that if g : U

E

C

~

(g(u»(v), u E U, V E F is also the evaluation map.) 2.4B Show that if f: U h :U

C

C

E

~

L(F, G) is C r , then f: U x F

cr.

(Hint:

L(F, G), g: U

~

G, defined by feu, v) =

Apply the Leibniz rule with L(F, G) x F

C

E

~

~

G

L(G, H) are Cr mappings then so is

E ~ L(F, H), defined by h(u) = g(u) 0 feu).

2.4C Extend Leibniz' rule to multilinear mappings and find a formula for the derivative. 2.4D Define a map f: U C E ~ F to be of class TI if it is differentiable, its tangent map Tf : U x E ~ F x F is continuous and II Df(x) II is locally bounded. (i) For E and F finite dimensional, show that this is equivalent to C I. (ii) (project) Investigate the validity of the chain rule and Taylor's theorem for T' maps. (iii) (Project) Show that the function developed in Smale [1964) is T2 but is not C2. 2.4E Suppose that f: E ~ F (where E, F are real Banach spaces) is homogeneous uf degree k; i.e., fete) = tkf(e) for all t E JR, and e E E. (i) Show that if f is differentiable, then Df(u)· u = kf(u). (Hint: Let get) = [(tu) and compute dg/dt.)


113

(ii) If E = JRn and F = JR. show that this relation is equivalent to n

"£... xi a;.df

=

kf(x).

I

i='

Show that maps multilinear in k variables are homogeneous of degree k. Give other examples. (iii) If f is Ck show that f(e) = (Ilk!) Dkf(O)· ek • i.e .• f E Sk(E. F) and thus it is C~. (Hint:

f(O)

=0;

inductively applying Taylor's theorem and replacing at each step h by

tho show that

2.4F Let e, •...• en_,

E

E be fixed and f: U c: E ~ F be n times differentiable. Show that the

map g: U c: E ~ F defined by g(u) = Dn-'f(u) . (e" ...• en_i) is differentiable and Dg(u) . e = Dnf(u) . (c. e" ...• en_i)' 2.4G (i) Prove the following refinement of 2.4.14. If f is C' and D, D 2f(u) exists and is

continuous in u. then D2D,f(u) exists and these are equal. (ii) The hypothesis in (i) cannot be weakened: show that the function

I

xy(x2 - / )

f(x. y) =

2

x

2

+Y

. • If

O.

is

ct. has

(x. y) '" (0. 0)

if (x. y) = (0. 0)

d 2 f!dxdy. d2 f!dydx continuous on JR2\ (O. 0»). but that d2 f(0. O)/dxdy '"

d 2 f(0. O)ldydx. 2.4H

For f: U c E

~

F. show that the second tangent map is given as follows:

T2f: (U x E) x (E x E) ~ (F x F) x (F x F) (u. e" e2• e 3 )

t-+

(f(u). Df(u) . e" Df(u) . e 2• D2f(u) . (e,. e2) + Df(u) . e 3 ).

2.41 Let f: JR2 ~ JR be defined by f(x. y) = 2x 2 y/(x 4 + y2) if (x. y) '" (0. 0) and 0 if (x. y) =

(0. 0). Show that (i) f is discontinuous at (0.0). hence is not differentiable at (0.0); (ii) all directional derivatives exist at (0.0); i.e .• f is Gateaux differentiable. 2.4J Differentiating sequences Let fn : U c: E ~ F be a sequence of cr maps. where E and F are Banach spaces. If {fnl converges pointwise to f: U ~ F and if {]).ifn). O:::;j:::; r.


114

converges locally uniformly to a map gi: U ~

0 seE, F), then show that

f is C r , Dif = gi

and {fn } converges locally uniformly to f. (Hint: For r = I use the mean value inequality and continuity of gl to conclude that II feu + h) - feu) - gl(U) . h II ~ II feu + h) - fn(u + h) - [feu) - fn(u)]II

+ II fn(u + h) - fn(u) - Dfn(u) . h II + II Dfn(u) . h - gl(u) . h II ~

£ II h II.

For general r use the converse to Taylor's theorem.)

2.4K

a

lemma In the context of 2.4.18 let a(g) = g

0

f. Show that a is continuous linear and

hence is COO.

2.4L Consider the map : CI([O, I]) ~ CO([O, 1]) given by (f)(x) = exp[f'(x)]. Show that is COO and compute D.

2.4M (H. Whitney [1943a]) Let f: U

C

E ~ F be of class C k+p with Taylor expansion

1 k k feb) = f(a) + Df(a) . (b - a) + ... + k! D f(a) . (b - a)

+

{I

I 0

tl-

I (l } (k-_ I)! [Dkf«(l - t)a + tb) - Dkf(a)] dt . (b - at

(i) Show that the remainder Rk(a,b) is C k+p for b"" a and CP for a, bEE. If E

=F

=JR, Rk(a, a) = 0, and lim (I b - a Ii Di+PRk(a, b» = 0, 1 ~ i ~ k.

b......

(For generalizations to Banach spaces, see Tuan and Ang [1979].) (ii) Show that the conclusion in (i) cannot be improved by considering f(x) = I X Ik+p+1!2 2.4N (H. Whitney [1943b]). Let f: JR (resp. f(x» =

~

JR be an even (resp. odd) function; i.e., f(x)

= f( -x)

-fe-x»~.

(i) Show that f(x) = g(x2) (resp. f(x) = xg(x 2» for some g. (ii) Show that if f is C 2k (C 2k+1) then g is C k (Hint: Use the converse to Taylor's theorem). (iii) Show that (ii) is still true if k = 00. (iv) Let f(x) = I X 12k +I+I!2 to show that the conclusion in (ii) cannot be sharpened. 2.40 (M. Buchner, 1. Marsden, and S. Schecter [1983]). Let E = L 4([0, 1]) and let q>: JR ~ JR be a

C~

function such that q>'(A)

= 1,

if -1:'> A:'> 1 and q>'(A)

=

°

if I A I ~ 2. Assume

q> is monotone increasing with q> = -M for A:'> -2 and q> = M for A ~ 2. Define the map h: E ~ JR by

115


1

fI

'3 Jo

h(u) =

0-1 0'" from

L(E, F) to L(E, E) is continuous and GL(E, F) is the inverse image of GL(E, E). Let

II a II = sup II a(e) II eEE

lIell=1

be the norm on L(E, F) relative to given norms on E and F. For -I 0 (q> I - ~ is invertible whenever II ~

sequence called the Neumann series): ~o

I,

~I

I+~

~2

I +~ +~

~n

0

~

= I + ~ + ~ 0 ~ + ... + (~ 0 ~ 0 ... 0 ~) .

Using the triangle inequality and the foregoing norm inequality, we can compare this sequence to the sequence of real numbers, 1, 1 + II ~ II, 1 + II ~ II + II ~ 11 2, ... , which we know is a Cauchy sequence since

II ~ II < 1. Because L(E, E) is complete,

~n

must converge. The limit, say p, is

the inverse of I -~. Indeed (I - ~)~n = I - (~ 0 ~ 0 ... 0 ~), so the result follows.

•

2.5.5 Lemma Let I: GL(E, F) ~ GL(F, E) be given by q> 1-+ q>-I. Then I is of class C~ and

D I(-I 0 '" 0 q>-I, then it will follow from Leibniz' rule that

I is of class

C~.

Indeed D I

= B(J,

J) where B

E

L2(L(F,E); L(L(E, F), L(F, E») is defined by B(",\, "'2)(A) = - "'loA 0 "'2' where "'\, "'2 E L(F, E) and A E L(E, F), which shows inductively that if I is Ck then it is Ck+I. Since the mapping '" 1-+

-

q>-I

0'" 0(x) = Df(O)' x - f(x) =

fo

I

to obtain ~(x) = [Df(O)-I) . (y + q>(x»,

-1 J

I rl

Dq>(sx)' x ds =

2 o D f(tsx)· (sx, x) dt ds

II q>(x) II $ KII x 112 if II x II $ P, and

II ~(x) II $ M(II y II + KII x 112) . Hence for II y II $ P/2M, gy maps Bp(O) to B5 (0). Similarly we get II g/x l ) - ~(x2) II $ II xI -

Xz 1112 from the Mean Value Inequality and the estimate

if II x II $ P. Thus, as in the previous proof, r-I : B p!2M(O) ~ Bp(O) is defined and there exists an open set G

C Bp(O) diffeomorphic via f to the open ball D Pf2M (O). Taking the second derivative of the relation r- I 0 f= identity on G, we get

D 2f- l (f(x»(Df(x) . ul' Df(x) . u2) + Df-I(f(x») . D 2f(x)(ul' u 2) = 0 for any ul' u 2

E

E. Let Vj = Df(x) . uj, i = 1,2, so that

D 2r- I(f(x» . (v!' v 2 ) = - Dr-I (f(x» . D 2 f(x)(Df(x)-1 . VI' Df(x)-l . v 2) and hence

121

Section 2.5 The Inverse and Implicit Function Theorems

since x

E

G

C

Dp(O) and on Bp(O) we have the inequality

II Df(x)-l II :-:; 2M. Thus on

Bp12M (O) the following estimate holds:

By the previous argument with f replaced by f- I, R by P/2M, L by M, and K by N = 8M3 K, it follows that there is an open set He DQ(O), Q = mini I/2KM, Q/2L, Q} such that II : H ~ D Q/2L(O) is a diffeomorphism. Since II is a diffeomorphism on BQ(O) and H is one of its open subsets, it follows that BQ12L(O) C G. Finally, replacing R by Q/2L, we conclude the existence of a ball B S12M (O), where S = mini I/2KM, Q/2L, Q). on which II is a diffeomorphism. Therefore B S/2M (O)

C

H. •

fl:n

In the study of manifolds and submanifolds, the argument used in the following is of central importance. 2.5.7 Implicit Function Theorem Let veE, V

C

F be open and f: V x V

~

G be

cr,

r

~

V, Yo E V assume Di(x o' Yo) : F ~ G is an isomorphism. Then there are neighborhoods Vo of Xo and Wo of f(x o' Yo) and a unique cr map g: Vo x Wo ~ V such 1. For some Xo

E

thatJor all (x, w)

E

Vo x W o' f(x, g(x, w)) = w .

Proof Define the map q,: V x V

~

E x G by (x, y) ...... (x, f(x, y». Then Dq,(x o' Yo) is given

by

which is an isomorphism of Ex F with Ex G. Thus q, has a unique cr local inverse, say q,-I : Vo x Wo ~ V x V, (x, w) ...... (x, g(x, w». The g so defined is the desired map. • Applying the chain rule to the relation f(x, g(x, w», one can compute the derivatives of g: DIg(x, w) = - [D 2f(x, g(x, w))]-I

0

Dlf(x, g(x, w» ,

D 2g(x, w) = [D2f(x, g(x, w))]-I . 2.5.8 Corollary Let veE be open and f: V ~ F be

cr,

r ~ 1. Suppose Df(x o) is surjective

and ker Df(x o) is complemented. Then f(V) contains a neighborhood of f(x o)' Proof Let EI = ker Df(x o) and E = EI EEl E 2. Then D/(x o): E2 ~ F is an isomorphism, so the hypotheses of 2.5.7 are satisfied and thus f(V) contains Wo provided by that theorem. •

122


Since in finite-dimensional spaces every subspace splits, the foregoing corollary implies that if f: U

C

]R.n --t ]R.m, n

~

m, and the Jacobian of f at every point of U has rank m, then f is

an open mapping. This statement generalizes directly to Banach spaces, but it is not a consequence of the Implicit Function Theorem anymore, since not every subspace is split. This result goes back to Graves [1950]. The proof given in Supplement 2.58 follows Luenberger [1969]. E --t F is C l and Df(uo) is onto for some U o U, then f is locally onto; i.e., there exist open neighborhoods U I of U o and VI of f(uo)

2.5.9 Local Surjectivity Theorem If f: U E

C

such that fl U I : U I --t VI is onto. In particular, open mapping. ~

if

Df(u) is onto for all u

E

U, then f is an

SUPPLEMENT 2.SB

Proof of the Local S urjectivity Theorem Proof Recall from Section 2.1 that E/ker Df(uo) =Eo is a Banach space with nom1 lI[x]11 = inf{ II x + u III u E ker Df(u o)}, where [x] is the equivalence class of x. To solve f(x) = y we set up an iteration scheme in Eo and E simultaneously. Now Df(u o) induces an isomorphism T: Eo --t F, so j l E L(F, Eo) exists by the Banach Isomorphism Theorem. Let x =Uo + h and write f(x)

=y

as jl(y - f(uo + h»

=0

.

To solve this equation, define a sequence Ln

» and ~

of ker Df(uo

E

Ln

C

E E/ker Df(u o) (so the element Ln is a coset E inductively by Lo =ker Df(uo)' ho E Lo small, and

(1)

and selecting hn E Ln such that

(2) The latter is possible since II Ln - L n_1 II = jl(Df(uo) . hn_I), so

= inf{ II h -

hn_1 II

I

h

E

Ln)' Since hn_1 E Ln_l' Ln_1

Subtracting this from the expression for Ln_1 gives

For £ > 0 given, there is a neighborhood U of U o such that II Df(u) - Df(u o) II < £ for u E U, since f is C I. Assume inductively that U o + ~_I E U and U o + hn_2 E U. Then from the Mean Value Inequality,

(4)

Section 2.S The Inverse and Implicit Function Theorems

123

By (2).

Thus if

I':

is small,

TII ho II. Uo + ~ remains inductively in U since

Starting with ho small and II hI - ho II
0 such that II Df(u o) . e II ;:;Mil e II for all e E E. By continuity of Df, there exists r > 0 such that II Df(u) - Df(u o) II < M/2 whenever II u -

Uo

II < 3r. By the Mean Value Inequality. for el' e 2 E D/uo)

II f(el) - f(e2) - Df(Uo)(el - e2) II

sup II Df(el + t(e2 - el» - Df(uo) 1111 el - e21!

~ t E

since II Uo - e l

-

t(e2

-

e l ) II < 3r. Thus

[0. Il

Chapter 2 Banach Spaces and DUJerentUd Calculus

124

i.e. ,

which proves that f is injective on Dr(uO) and that II : f(D/Uo» -t U is Lipschitz continuous .

•

We now give an example of the use of the Implicit Function Theorem to prove an existence theorem for differential equations. For this and related examples, we choose the spaces to be infinite dimensional. In fact, E, F, G, ... will be suitable spaces of functions. The map f will often be a nonlinear differential operator. The linear map Df(x o) is called the linearization of f about xo' (Phrases like "first variation," "first-order deformation," and so forth are also used.) 2.5.11 Example Let E be the space of all CI-functions f: [0, 1] -t lR with the norm

II fli!

=

sUP I f(x) I + sup

XE

[0,1]

XE

[6,1]

df(x) I I -dx

and F the space of all CO-functions with the norm II f 110 = sUPx E

[0.1]

I f(x) I. These are Banach

spaces (see Exercise 2.1C). Let : E -t F be defined by (f) = df/dx + f3. It is easy to check that is

C~

and D(O) =d/dx : E -t F. Clearly D(0) is surjective (Fundamental Theorem of

Calculus). Also ker D(O) consists of EI

= all

constant functions. This is complemented

because it is fmite dimensional; explicitly, a complement consists of functions with zero integral. Thus Corollary 2.5.8 yields the following: There is an e >

°

such that if g : [0, 1] -t lR is a continuous function with

I g(x) I < e, then

there is a C I function f: [0, 1] -t lR such that

df ..3 dx + r(x) = g(x) . •

~ SUPPLEMENT 2.5e An Application of the Inverse Function Theorem to a Nonlinear Partial Differential Equation

Let

nC

lRn be a bounded open set with smooth boundary. Consider the problem


125

for given f and g. We claim that for f and g small, this problem has a unique small solution. For partial differential equations of this sort one ean use the Sobolev spaces HS(n, 1R) consisting of maps nl2 the map

is

c~

= H'(n, lit),

F

=

(use Supplement 2.4B) and the linear operator

is an isomorphism. The fact that D(O) is an isomorphism is a result on the solvability of the Dirichlet problem from the theory of elliptic linear partial differential equations. See, for example, Friedman [1969]. (In the Ck spaces, D(O) is not an isomorphism.) The result claimed above

;lJJ

now follows from the Inverse Function Theorem.

The following series of consequences of the Inverse Function Theorem are important technical tools in the study of manifolds. The first two results give, roughly speaking, sufficient conditions to "straighten out" the range (respectively, the domain) of f in a neighborhood of a point, thus making f look like an inclusion (respectively, a projection). 2.5.12 Local Immersion Theorem Let f: U

C

E

~

F be of class Cr , r

~

1,

110 E

U and

suppose that Df(u o) is one to one and has a closed split image Fl with closed complement F2. (If E = IRm and F = 1R", assume only that Df(u o) has trivial kernel.) Then there are two open sets U' C F and veE EEl F2 where f(u o) E U' and a C diffeomorphism : U' ~ V such that q>-' (q>

0

g)(e, 0)

= (e, 0).

=g I V.

Hence for (e, 0)

E

V, (q>

0

f)(e) =

•

Figure 2.5.2 2.5.13 Local Submersion Theorem Let f: U C E ~ F be of class cr, r 2" 1, U o E U and suppose Df(u o) is surjective and has split kernel E2 with closed complement E,. (If E = jRm and F =JRD, assume only that rank (Df(uo» = n.) Then there are open sets U' and V such that U o E U' cue E and V C F (JJ ~ and a Cr diffeomorphism '1': V ~ U' with the property that (f 0 'I')(u, v) =u for all (u, v) E V. The intuition for E,

= E2 = F = jR

is given in Figure 2.5.3, which should be compared to

Figure 2.5.2. Note that this theorem implies the results of 2.5.9, the Local Surjectivity Theorem, but the hypotheses are more stringent. Proof By the Banach Isomorphism Theorem (Section 2.2), D,f(u o) E aL(E" F). Define the map g: U C E, (JJ E2 ~ F (JJ E2 by g(u!' u2) = (f(u!, u 2), u2 ) and note that

so that Dg(u o) E aL(E, F (JJ E 2). By the Inverse Function Theorem there are open sets U' and V such that Un E U' cue E, V C F (JJ ~ and a Cr diffeomorphism '1': V ~ U' such !hat 'I' -, = g I U'. Hence if (u, v) E V, (u, v) = (g 0 'I')(u, v) = (f('I'(u, v», 'I'/u, v», where


\jf = \jfl

X

\jf2: i.e., \jf2(u, v) = v and (f 0 \jf)(u, v) = u.

E,

f

0

~ constant

'"

E,

1

f

127

•

~ constant

~~~ F

------'---F

E,

Figure 2.5.3

2.5.14 Local Representation Theorem Let f: veE ~ F be of class cr, r > 1, U o E V and suppose Df(u o) has closed split image FI with closed complement F2 and split kernel Ez with closed complement E I. (If E = jRffi, F= jRn, assume that rank (Df(uo)) = k, k ~ n, k ~ m,

so that F2 = jRn-k, FI = jRk, EI = the property that

Uo E

E2 =

jRk,

V' eve E, V

C

that (f 0 \jf)(u, v) = (u, TJ(u, v)), where TJ : V Proof Write f = fl

X

and thus there exists a fl

0

f 2, where f i : V

cr

~

jRffi-k.)

Then there are open sets V' and V with

FI E9 E2 and a ~

Fz is a

cr Cr

diffeomorphism \jf: V ~ V' such

map satisfying DTJ(uo) = o.

Fi , i = 1, 2. Then f satisfies the conditions of 2.5.13,

diffeomorphism \jf: V

\jf is given by (flo \jf)(u, v) = u. Let TJ = f2

C 0

FI E9 E2

~

V'

C

E such that the composition

\jf. •

To use 2.5.12 (or 2.5.13) in finite dimensions, we must have the rank of Df equal to the dimension of its domain space (or the range space). However, we can also use the Inverse Function Theorem to tell us that if Df(x) has constant rank k in a neighborhood of xo' then we can straighten out the domain of f with some invertible function \jf such that f 0 \jf depends only on k variables. Then we can apply Proposition 2.5.12. This is the essence of the following theorem. Roughly speaking, the theorem says that if Df has rank k on jRffi, then m - k variables are redundant and can be eliminated. As a trivial example, if f:

jRz

~

jR

is defined by

setting f(x, y) = x - y, Df has rank 1, and so we can express f using just one variable, namely, let \jf(x, y) = (x + y, y) so that (f 0 \jf)(x, y) = x, which depends only on x.

2.5.15 Rank Theorem Let f: veE ~ F be of class Cr , r;:' 1, U o E V and suppose Df(u o) has closed split image FI with closed complement F2 and split kernel E z with closed Complement E I. In addition, assume that for all u in a neighborhood of U o E V, Df(u)(E) is a closed subspace of F and Df(u) I EI : EI ~ Df(u)(E) is a Banach space isomorphism. (In case E:: jRffi and F = JRR, assume only that rank (Df(u)) = k for u in a neighborhood of uo') Then there exist open sets VI C FI E9 E2, V 2 c: E, VI C F, and V 2 C F and there are c r


128

diffeomorphisms q>:

VI ~ V 2

and'll: U I

~

U 2 such that (q>

0

f

0

'V)(x, e) = (x, 0).

The intuition is given by Figure 2.5.4 for E = ]R2, F =]R2 and k = 1.

F,

F,

£,

Figure 2.5.4 Remark It is clear that the theorem implies EI EEl ker(Df(u» = E and Df(u)(E) EEl F2 = F for u in U o in U, because q> 0 f 0'11 has these properties. These seemingly stronger conditions can in fact be shown directly to be equivalent to the hypotheses in the theorem by the use

a neighborhood of

of the openness of aL(E, E) in L(E, E). Proof By the Local Representation Theorem there is a Cr diffeomorphism'll: U I

C

FI EEl E2

~

'V)(x, y) = (x, ll(x, y». Let PI: F ~ FI be the projection. Since Df(x, y) . (w, e) = (w, Dll(x, y) . (w, e», it follows that PI 0 Df(x, y»(w, e) = (w, 0), U2

C

E such that f(x, y) := (f

0

for WE FI and e E E2. In particular PI 0 Df(x, y) I FI x (OJ = I\, the identity on FI' which shows that Df(x, y) I FI x {OJ: FI x (OJ ~ Df(x, y)(FI EEl Ez) is injective. In finite dimensions this implies that it is an isomorphism, since dim(F I ) = dim(Df(x, y)(FI EEl E 2». In infinite dimensions this is our hypothesis. Thus we get Df(x, y) 0 PI I Df(x, y)(FI EEl E 2) = identity. Let (w, Dll(x, y)(w, e» (Df(x, y)

0

E

Df(x, y)(FI EEl

Ez).

PI)(w, Dll(x, y). (w, e»

Since Df(x, y). (w, 0) (w, Dll(x, y) . (w, 0»

(w, D\ll(x, y) . w)

Section 2.5 The Inverse and ImpliciJ Function Theorems

we must have DTl(x. y) . e = 0 for all e

E

129

E2 ; i.e .• D 2Tl(x. y) = O. However. D 2 f(x. y) . e =

(0. D 2Tl(x. y) . e). which says that D 2 f(x. y) = 0; i.e .. f does not depend on the variable y E

E2. Define fix)

=f(x. y) =(f

0

'V)(x. y). so f: P,'(V)

C

F, ~ F where P,': F, EB E2 ~ F, is

the projection. Now f satisfies the conditions of 2.5.12 at P ,'('I"(u o)) and hence there exists a C r diffeomorphism 0 such that II Df(x)-l II < M for all x E E, then f is a diffeomorphism. The key to the proof of the theorem consists of a homotopy lifting argument.

If X is a

topological space, a continuous map q>: X -+ F is said to lift to E through f, if there is a continuous map \jf: X -+ E satisfying f 0 \jf = q>. 2.5.18 Lemma Let X be a connected topological space, q>: X -+ F a continuous map and

let f: E -+ F be a C l map with Df(x) an isomorphism for every x E E. Fix

Uo E

E, Vo E F,

and Xo E X satisfying f(uo) = Vo and q>(x o) = vO' Then if a lift \jf of q> through f with q>(x o) = uo exists, it is unique.

Proof Let \jf' be another lift and define the sets Xl X

I

X I q>(x)

= {x E

= \jf '(x)}

and X 2

= {x E

\jf(x)"* \jf '(x) }, so that X = Xl U X 2 and Xl () X 2 = 0. We shall prove that both Xl' X 2

are open. Since Xo E Xl' connectedness of X implies X2 = 0 and the lemma will be proved. If x

E

Xl' let

U

be an open neighborhood of \jf(x)

= \jf'(x)

on which f is a

diffeomorphism. Then \jIl(U) () 'I"-l(U) is an open neighborhood of x contained in Xl' If x

E

X 2 , let U (resp. U') be an open neighborhood of the point \jf(x) (resp. of 'I"(x))

on which f is a diffeomorphism and such that U () U' = 0. Then the set \jIl(U) () 'I"-l(U') is an open neighborhood of x contained in X 2. • A path y: [0, 1] -+ G, where G is a Banach space, is called C l if Y I] 0, 1[ is uniformly l C and the extension by continuity of y' to [0, 1] has the values y'(O), y'(1) equal to y'(O) =

lim I 0 such that [0, £[ x [0, 1]

c: H-l(U). Let K: I] (X

I

[0, £[ x [0, 1] -+ E be given by K = r-l

0

H. Consider the set A = {o

E

[0,

H: [0, o[ x [0, 1] -+ F can be lifted through f to E} which contains the interval [0, c[. If

= sup A

we shall show first that a

the lifting K.

E

A and second that a

= 1.

This will prove the existence of


132

E

f

[0.1] x [0. 1]

•

H

F

Figure 2.5.5 To show that ex E A. note that for 0 ~ t < ex we have f

0

K = H and thus Df(K(I. s»

0

aK/t

=

aH/at. which implies that

II a: II ~

M

sup I. s E

[0. II

II aa~ I

= N .

Thus by the Mean Value Inequality. if {tn} is an increasing sequence in A converging to ex.

which shows that (K(tn• s)} is a Cauchy sequence in E. uniformly in s E [0.1]. Let K(ex. s) =

lim K(tn. s) In i a

By continuity of f and H we have f(K(ex. s» =

lim f(K(tn. s» = Inia

lim H(tn. s) = H(ex. s) • In ia

which proves that ex E A. Next we show that ex = 1. If ex < 1 consider the curves s 1-+ K(ex. s) and s 1-+ H(ex. s)

=

f(K(ex. s». For each s

E

such that f I Us : Us

Vs is a diffeomorphism. By compactness of the path K(ex. s) in s. i.e.

~

[0. 1] choose open neighborhoods Us of K(ex. s) and Vs of I-I(ex. s)

of the set {K(ex. s) I s E [0. I]}. finitely many of the Us' say Ui' .... Un' cover it. Therefore the corresponding Vi' ...• Vn cover {H(ex. s) I s E [0. Ill. Since H-l(V) contains the point (ex. s). there exists E> 0 such that


133

where lSi - 0i' si + oJ = H(a, yl(V i) and in particular lSi - 0i' si + oil. i = 1, ... , n cover [0, 1]. Let e = min{e!, ... , en} and define K: [0, a + e[ x [0, 1] ~ E by

1

K(t, s), if (t, s)

K(t, s) =

E

[0, a[ x {(O, I)}

(f I Uifl(H(t, s», if (t, s)

E

[a, a + e[ x lSi - Oi' Si + Oil. i = 1, ... , n.

By 2.5.18, K is a lifting of H, contradicting the definition of a. Finally, K is C l in t for each s by the chain rule:

oK

at Proof of 2.5.17 Let Yo' y

E

oH

-I

Df(K(t, s»

0

at . •

F and consider the path y(t) = (1 - t)yo + ty. Regarding y as

defined on [0, 1] x [0, 1], independent of the second variable, the Homotopy Lifting Lemma guarantees the existence of a C l path 0: [0, 1] ~ E lifting y, i.e., f 00 = y. In particular,

=y(1) = y

f(o(1»

and thus f is surjective.

To show that f is injective, assume xI "# x2' f(x l ) = f(x 2), and consider the path o(t) = (1 - t)x l + tx 2· Then y(t) = f(o(t». By the Homotopy Lifting Lemma, there exists a lift K of H through f. From f

0

K = H it follows that the continuous curve s ...... K(O, s) is mapped by f

to the point f(x l ), thus contradicting the Inverse Function Theorem. Therefore f is a bijective map which is a local diffeomorphism around every point, i.e., f is a diffeomorphism of E with F. • Remarks (i) The uniform bound on

II Df(x)-I II can be replaced by properncss of the map, i.e.,

if f(x n) ~ y there exists a convergent subsequence {xm}, xm ~ x with f(x) = y (see Exercise 1.5J). Indeed, the only place where the uniform bound on II Df(x)-I II was used is in the Homotopy Lifting Lemma in the argument that a

= sup A E

A. If f is proper, this is shown in

the following way. Let (t(n)} be an increasing sequence in A converging to a. Then H(t(n), s)

~

H(a, s) and from f

H(a, s) uniformly in s

E

0

K = H on [0, a[ x [0, 1], it follows that f(K(t(m), s»

~

[0,1]. Thus, by properness of f, there is a subsequence (t(m)} such

that K(t(m), s) is convergent for every s. Put K(a, s) = limt(m) i a K(t(n), s) and proceed as before. (ii) If E and F are finite dimensional, properness of f is equivalent to: the inverse image of every compact set in F is compact in E (see Exercise 1.5J). (iii) Conditions on f like the one in (ii) or in the theorem are necessary as the following counterexample shows. Let f:]R2 ~]R2 be given by (eX, ye-X ) so that f(]R2) is the right open half plane and in particular f is not onto. However

134


Df(x, y)

~xl

X

= [

e _x -ye e

J

is clearly an isomorphism for every (x, y) E JR.2. But f is neither proper nor does the norm II Df(x, y)-I II have a uniform bound on JR.2. compact set [0, 1] x {OJ is ]-00,0] which is unbounded x

~

X

For example, the inverse image of the

(OJ and II Df(x, y)-I II = C[e- 2x + e2x + y2e- 2x ]1!2,

+=.

(iv) Sec Wu and Desoer [1972] and lchiraku [1985] for useful references to the theorem and applications.

•

If E = F = H is a Hilbert space, then the Hadamard-Levy Theorem has an important consequence. We have seen that in the case of f: JR.

~ JR.

with a uniform bound on 1/1 f '(x) I, the

strong monotonicity of f played a key role in the proof that f is a diffeomorphism. 2.5.20 Definition Let H be a Hilbert space. A map f: H

~

H is strongly monotone

if there

exists a> 0 such that (f(x) - fey), x-y) ~ all x _ y 112. As in calculus, for differentiable maps strong monotonicity takes on a familiar form. 2.5.21 Lemma Let f: H ~ H be a differentiable map of the Hilbert space H onto itself Then f is strongly monotone

if and only if ( Df(x) . u, u ) ~ all u 112 for some a> O.

Proof If f is strongly monotone, (f(x + tu) - f(x), tu ) ~ at 211 u 112 for any x, u E H, t E R Dividing by t2 and taking the limit as t ~ 0 yields the result. Conversely, integrating both sides of (Df(x + tu) . u, u) ~ all u 112 from 0 to I gives the strong monotonicity condition. • 2.5.22 Lax-Milgram Lemma Let H be a real Hilbert space and A E L(H, H) satisfy the estimate (Ae, e ) ~ all e 112 for all e E H. Then A is an isomorphism and II A-I II

$

l/a.

Proof The condition clearly implies injectivity of A. To prove A is surjective, we show first that A(H) is closed and then that the orthogonal complement A(H)-L is (OJ. Let fn = A(en) be a sequence which converges to f E H. Since II Ae II

~

all e II by the Schwarz inequality, we have

andthus (enJ is a Cauchy sequence in H. If e is its limit we have Ae=f and thus fE A(H). To prove A(H)-L = {O}, let u E A(H)-L so that 0 = ( Au, u ) ~ all u 112 whence u = O. By Banach's Isomorphism Theorem 2.2.16, A is a Banach space isomorphism of H with itself. Finally, replacing e by A-If in II Ae II ~ all e II yields II A-If II $11 f II/a, i.e., II A-I II $ l/a. •


135

Lemmas 2.5.21.2.5.22. and the Hadamard-Levy Theorem imply the following global Inverse Function Theorem on the real Hilbert space. 2.5.23 Theorem Let H be a real Hilbert space and f: H ~ H be a strongly monotone Ck

mapping k ~ 1. Then f is a Ck diffeomorphism.

~

b

SUPPLEMENT 2.5E The Inversion Map

Let E and F be isomorphic Banach spaces and consider the inversion map I: CiL(E. F)

~

CiL(F. E); I(O can be covered by a single chart. Define an equivalence relation on s2n+1 E

C

c 2(n+l) by x - y if Y = eiSx for some 8

R Show s2n+I/_ is homeomorphic to cpn.

(ii) Show that (a)

(u, v)

E

(b)

(u, v)

E

S3 c C 2 1-+ 4(- uv, 1V 12 -I U 12) E S2, and S3 c C 2 1-+ (uv- I , if v*" 0, and 00, if v = 0)

E

]R2c == S2 (see

Exercise 3.1E) induce homeomorphisms of S2 with Cpl. The map in (a) is called the classical Hopffibration; it will be studied further in Section 3.4.

3.tH Flag manifolds Let Fn denotethesetofsequencesofnestedlinearsubspaces VI C V2 C ... C V n_1 in ]R0 (or CO), where dim Vi = i. Show that Fo is a compact manifold and compute its dimension. (Flag manifolds are typified by Fn and come up in the study of symplectic geometry and representations of Lie groups.) (Hint: Show that Fo is in bijective correspondence with the quotient nL(n)/upper triangular matrices.)

~3.2

Submanijolds, Products, and Mappings

A submanifold is the nonlinear analogue of a subspace in linear algebra. Likewise, the product of two manifolds, producing a new manifold, is the analogue of a product vector space. The analogue of linear transformations are the cr maps between manifolds, also introduced in this section. We are not yet ready to differentiate these mappings; this will be possible after we introduce the tangent bundle in §3.3. If M is a manifold and A C M is an open subset of M, the differentiable structure of M naturally induces one on A. We call A an open submanifold of M. For example, Gn(E),

G"(E) are open submanifolds of G(E) (see Example 3.1.8G). We would also like to say that sn is a submanifold of ]Rn + 1, although it is a closed subset. To motivate the general definition we notice that there are charts in ]Rn + 1 in which a neighborhood of sn becomes part of the subspace ]Rn. Figure 3.2.1 illustrates this for n = 1.

R' R'

Figure 3.2.1

3.2.1 Definition A submanifo/d of a manifold M is a subset Be M with the property thatfor

each bE B there is an admissible chart (U, 1) 0 fep", 0 (q>' 0 q>-I)-I is also CT. Moreover, if q>" and '1'" are restrictions of q> and

('I"

Section 3.2 Submanifo/ds, Products, and Mappings

153

\jf to open subsets of U and V, then fq>''It, is also cr. Finally, note that if f is Cr on open submanifolds of M, then it is Cr on their union. That f is cr now follows from the fact that any chart of M can be obtained from the given collection by change of diffeomorphism, restrictions, and/or unions of domains, all three operations preserving the Cr character of f. This argument

also demonstrates the converse. _ Any map from (open subsets of) E to F which is Cr in the Banach space sense is Cr in C~

the sense of Definition 3.2.5. Other examples of

maps are the antipodal map x 1-+ - x of sn

and the transation map by (91" .. , 9 n) on yn given by (exp(ir l ), ... , exp(irn») ...... exp(i(r l + 9 1»), ... , exp(i(rn + 9n»). From the previous proposition and the Composite Mapping Theorem, we get the following. 3.2.7 Proposition If f: M ~ Nand g : N

~

Pare Cr maps, then so is go f.

~ N, where M and N are manifolds, is called a cr diffeomorphism if f is of class Cr, is a bijection, and f-I: N ~ M is of class cr. If a diffeomorphism exists between two manifolds, they are called diffeomorphic.

3.2.8 Definition A map f: M

It follows from 3.2.7 that the set Di.ffr(M) of Cr diffeomorphisms of M fOnTIS a group under composition. This large and intricate group will be encountered again several times in the book.

Exercises 3.2A Show that (i) if (U, cp) is a chart of M and

\jf:

cp(U)

~

V

C

F is a diffeomorphism, then

(U, \jf a cp) is an admissible chart of M, and

(ii) 3.2B

admissible local charts are diffeomorphisms.

A C l diffeomorphism that is also a

cr

map is a Cr diffeomorphism. (Hint: Use the

comments after the proof of 2.5.2.) 3.2C

Show that if Ni C Mi are submanifolds, submanifold of MI x ... x Mn'

i = 1, ... , n, then

NIx", x Nn is a

3.2D Show that every submanifold N of a manifold M is locally closed in M; i.e., every point n E N has a neighborhood U in M such that N n U is closed in U. 3.2E

Show that fi : Mi ~ Ni' i = 1, ... , n are all Cr iff fl x ... x fn : MI x ... x Mn ~ NIx ... X Nn is cr.

Chapter 3 Manifolds and Vector Buru/les

154

3.2F

Let M be a set and (M) j E

I

a covering of M. each M j being a manifold. Assume that

for every pair of indices (i. j). M j

n Mj is an open submanifold in both M j and Mj .

Show that there is a unique manifold structure on M for which the M j are open submanifolds. The differentiable structure on M is said to be obtained by the collation of the differentiable structures of M j • 3.2G Show that the map F 1-+ f'l

= {u E F* lui F = o}

of O(E) into O(E*) is a C~ map.

If E = E** (i.e .. E is reflexive) it restricts to a C~ diffeomorphism of on(E) onto On(E*) for all n = 1. 2. .... Conclude that lFJP'n is diffeomorphic to on(]Rn + 1). 3.2H Show that the two differentiable structures of ]R dermed in 3.1.8D are diffeomorphic. (Hint: Consider the map x 1-+ x 1/3.) 3.21

3.2J

(i)

Show that Sl and lFJP'1 are diffeomorphic manifolds (see Exercise 3.1F(b».

(ii)

Show that ClP'1 is diffeomorphic to S2 (see Exercise 3.1G(b».

Let MA = {(x. 1x IA) I x E ]R}. where

AE R

Show that

(i)

if A$ O. MA is a C'" submanifold of ]R2;

(ii)

if A> 0 is an even integer. MA is a C'" submanifold of ]R2;

(iii)

if A> 0 is an odd integer or not an integer. then MA is a CiA] submanifold of ]R2 which is not CiA] + I. where [A] denotes the smallest integer ~ A. i.e .. [A] $ A < [A] + I;

(iv)

in case (iii). show that MA is the union of three disjoint C~ submanifolds of ]R2.

3.2K Let M be a Ck submanifold. Show that the diagonal ~ = {(m. m) I m EM} is a closed Ck submanifold of M x M. 3.2L

Let

E

be a Banach space. Show that the map

x

1-+

Rx(R 2 -

II x 112)-112 is a

diffeomorphism of the open ball of radius R with E. Conclude that any manifold M modeled on E has an atlas {(V j • -l)«q> 0 c)(t» - ('V 0 q>-l)«q> 0 c)(a)) = D('I' 0 q>-l)«q> 0 c)(a» . v + o(

II v II)

whence ('V 0 c)(t) - ('V 0 c)(a) t- a

D('I'

0

q>-l)(q>

0

c)(a)· v

0( II v II)

-----------------+--a t- a t-

Since

lim __ v_ = (q> 0 c)'(a) d. t - a

fun O(lIvll)

and

d.

t-a

0,

it follows that [('V 0 c)(t) - ('V 0 c)(a)] fun - - - - : - - - - - = D('I'

d.

t-a

and therefore the map c: [a, b] -t M

0

q>-l)«q> 0 c)(a» . (q> 0 c)'(a)

is differentiable at a in the chart (U, q»

iff it is

differentiable at a in the chart (V, 'V). In summary, it makes sense to speak of differentiability of

curves at an endpoint of a closed interval. The map c: [a, b]-t M is said to be differentiable if c I]a, b[ is differentiable and if c is differentiable at the endpoints a and b. The map c: [a, b] -t M is said to be of class Cl ifit is differentiable and if (q> 0 c)': [a, b]-t E is continuous for any chart (U, q» satisfying

un c([a, bJ) ~ 0, where

E is the model space of M.

3.3.1 Definition Let M be a manifold and m E M. A curve at m is a C1 map c : I -t M from an interval I C lR into M with 0 E I and c(O) = m. Let c 1 and c2 be curves at m and (U, q» an admissible chart with mE U. Then we say c 1 and c2 are tangent at m with respect to q> if and only if (q> 0 c1),(O) = (q> 0 c2 )'(O). Thus two curves are tangent with respect to q> if they have identical tangent vectors (same direction and speed) in the chart q>; see Figure 3.3.1. The reader can safely assume in what follows that I is an open interval; the use of closed intervals becomes essential when defining tangent vectors to a manifold with boundary at a boundary point; this will be discussed in Chapter 7. 3.3.2 Proposition Let c 1 and c 2 be two curves at mE M. Suppose (Uj3' q>j3) are admissible

charts with mE Uj3' ~ = 1,2. Then c 1 and c2 are tangent at m with respect to q>l if and only if they are tangent at m with respect to q>2.

Section 3.3 The Tangent Bundle

159

R

o

Figure 3.3.1 Proof By taking restrictions if necessary we may suppose that VI = V 2. Since we have the 0 is a bijection, since q> is a diffeomorphism. Hence, on TM we

can regard (TU, Tq» as a local chart. 3.3.10 Theorem Let M be a C + I manifold and 5l an atlas of admissible charts. Then T 5l = {(TU, T
j

0

0

q>j-I can be formed by restriction of q>j

(Tq>j)-I

=T(q>i

0

0

q>j-I to q>/Uj n Uj)' The chart overlap map

-I )(u, e) =

Let us next develop some of the simplest properties of tangent maps. First of all, let us check that tangent maps are smooth. 3.3.11 Proposition Let M and N be Cr + I manifolds, and let f: M ~ N be a map of class CT+ I. Then Tf: TM ~ TN is a map of class

cr.

Proof It is enough to check that Tf is a Cr map using the natural atlas. For m E M choose charts (U, q»

and (V, \jI) on M and N so that mE U, f(m) E V and f 0 't TM 0 T(Tq>-I)) : (u, e, el' ( 2 ) the tangent of the map 't M : TM ~ M, we get T'tM : T(TM) T'tM is

t-+

(u, e). On the other hand, taking

~

TM. The local representative of

Applying the commutative diagram for Tf following 3.3.6 to the case f = 'tw we get what is


163

commonly known as the dual tangent rhombic: T(TM)

/~ ~/.

TM

TM

M

Let us now tum to a discussion of tangent bundles of product manifolds. Here and in what follows, tangent vectors will often be denoted by single letters such as v E TmM.

3.3.12 Proposition Let MI and M2 be manifolds and Pi: MI x M2 ~ Mi, i = 1,2, the two canonical projections. The map (TPI' Tp2) : T(M I x M2) ~ TMI x TM2 which is defined by (TPI' Tp2)(v) = (TPI(v), Tplv» is a diffeomorphism of the tangent bundle T(M I x M 2) with the product manifold TMI x

T~.

Proof The local representative of this map is (u I' u2' ei' e 2) E VI x V 2 X EI X E 2 ...... «ui' e l ), (~, e 2» E (VI x E I) X (V 2 x M2), which clearly is a local diffeomorphism. _ Since the tangent is just a global version of the derivative, statements concerning partial derivatives might be expected to have analogues on manifolds. To effect these analogies, we globalize the defmition of partial derivatives. Let M I, M2. and N be manifolds, and f: MI x M2 ~ N be a Cr map. For (p, q) xM 2, let ip: M2 ~ MI X M2 and iq : MI ~ MI X M2 be given by

E

MI

With these notations the following proposition giving the behavior of T under products is a straightforward verification using the definition and local differential calculus.

3.3.13 Proposition Let M I, M2, N, and P be manifolds, gi: P ~ Mi, i = 1,2, and f: MI x M2 ~ N be c r maps, r ~ 1. Identify T(M I x M2) with TMI x TM 2 . Then the following statements hold. (i) T(gl x gz) = Tg i x Tg z·

164

Chapter 3 Manifolds and Vector Bundles

(ii) Tf(u p' vq ) =Tlf(p, q)(U p) + T2f(p, q)(v q), for up E TpMI and Vq E T qM 2. (iii) Implicit Function Theorem If Ti(p, q) is an isomorphism, then there exist open

neighborhoods U of p in M I , W of f(p, q) in N, and a unique C r map g: U x W such that for all (x, w)

U

E

X

~

M2

W, f(x, g(x, w)) = w .

In addition, Tlg(x, w) = - (T2f(x, g(x, w)))-I

0

(Tlf(x, g(x, w)))

and T 2g(x, w) = (T 2f(x, g(x, w)))-I .

3.3.14 Examples A The tangent bundle TS1 of the circle Consider the atlas with the four charts {(Ui ±, 'vl) 1 i = 1,2} of Sl = {(x, y) E ]R21 x2 + y2 = I} from 3.1.4. Let us construct the natural atlas for TSI = {«x, y), (u, v)) E ]R2 x ]R21 x2 + y2 = 1. «x, y), (u, v) =O}. Since the map 'l'1 + : U I+ = {(x, y)

E

Sll x> O} ~ ]-1, 1[ is given by 'l'1 +(x, y)

= y,

by definition of

the tangent we have

Proceed in the same way with the other three charts. Thus, for example, T(x,y)'l'2- I(u, v) = (x, u) and hence for x E ]-1, O[ ,

This gives a complete description of the tangent bundle. But more can be said. Thinking of S I as the multiplicative group of complex numbers with modulus I, we shall show that the group operations are C~: the inversion I: s 1-+ S-I has local representative ('l'1 ±

0

10 ('l'1 ±)-I )(x) = - x

and the composition C: (sl' s2) 1-+ SI Sz has local representative

(here

±

can be taken in any order). Thus for each s

= ss', is a diffeomorphism.

E

Sl, the map Ls: Sl ~ Sl defined by

]R by A(V s) = (s. TsLs-l(v s))' which is easily seen to be a diffeomorphism. Thus TSI is diffeomorphic to Sl x lR. See Figure 3.3.2. Ls(s')

This enables us to define a map A: TSI ~ SI

X

165


N on trivial tangent bundle

Trivial tangent bundle

Figure 3.3.2

nrn

B TS I == SI C

The tangent bundle to the n-torus JR, it follows that TT" == T" x JRD.

Since T" =SI

X ... X

SI (n times) and

X

The tangent bundle TS2 to the sphere The previous examples yielded trivial

tangent bundles. In general this is not the case, the tangent bundle to the two-sphere being a case in point, which we now describe. Choose the atlas with six charts {(ut 'Vj±) I i = 1,2,3} of S2 that were given in 3.1.4. Since

we have T (x.1 x 2• x3 )'1'1+( v I ,v 2, v 3) -_ (2 x, x3, v2, v 3) , where x Iyl + x2y2 + x3v3 = O. Similarly, construct the other five charts. For example, one of the twelve overlap maps for x2 + y2 < I, and y < 0, is + -I )(x,y,u,v)= (T'V3- o (T'VI)

(.J I-x -y ,x, 2

-ux

2

-

vy

J 1 - x -l J 1 - x -l 2

,u ) .

2

One way to see that TS2 is not trivial is to use the topological fact that any vector field on S2 must vanish somewhere. We shall prove this in §7.5. •

Exercises 3.3A

Let M and N be manifolds and f: M (i)

~

N.

Show that (a) if f is C~, then graph(f) = {(m, f(m» E M x N I m E M} is a ~ submanifold of M x N and (b)

T(m. f(m))(M

X

N) == T(m. f(m)lgraph(f) Ef> Tf(m)N for all mE M.

166

(ii)


(c)

Show that the converse of (a) is false. (Hint: x E IR f-+ x 1/3

(d)

Show that if (a) and (b) hold, then f is

If f is

C~

E

JR.)

C~.

show that the canonical projection of graph(f) onto

M

is a

diffeomorphism. (iii)

Show that T(rn, f(rn))(graph(f)) == graph(Tmf) = {(v rn , Trnf(vrn» I vrn

E

TmM}

C

TmM x Tf(rn)N.

3.38

(i)

Show that there is a map sM: T(TM)

~

T(TM) such that SM

0

sM

= identity and

the diagram

,--

,ommutes. (Hint: In local natural coordinate charts sM(u, e, el' e 2) =(u, el' e, e2).) o e calls sM the canonical involution on M and says that T(TM) is a symmetric

r ombic. (ii)

Verify that for f: M ~ N of class C2, T2f 0 sM = sN

(iii)

If X is a vector field on M, that is, a section of 'tM : TM --t M, show that TX is a section of T'tM : T2M --t TM and Xl = sM 0 TX is a section of 't-rM : T2M --t TM.

0

T2f.

(A section () of a map f: A --t 8 is a map (): 8 --t A such that f 0

()

=identity on

B.)

3.3C

(i)

Let S(S2)

= {v E

TS 2 111 v II

= I}

be the circle bundle of S2. Prove that S(S2) is

a submanifold of TS2 of dimension three. (ii)

Define f: S(S2) --t 1RlP'3 by f(x, y, v) = the line through the origin in IR 4 (x, y, vI, v 2). Show that f is a

determined by the vector with components diffeomorphism.

3.3D Let M be an n-dimensional submanifold of IRN. Define the Gauss map r: M --t on. N-n by rem) = TrnM - m, i.e., rem) is the n dimensional subspace of IRN through the origin, which, when translated by m, equals TrnM. Show that r is a smooth map. 3.3E

Let f: T2 ~ IR be a smooth map. Show that f has at least four critical points (points where Tf vanishes). (Hint: Parametrize T2 using angles e, c:p and locate the maximum and minimum points of fee, c:p) for c:p fixed, say (emax(C:P)' c:p) and (emin(c:p), c:p); now maximize and minimize f as c:p varies. How many critical points must f: S2 --t IR have?)

~3.4

Vector Bundles

Roughly speaking. a vector bundle is a manifold with a vector space attached to each point. During the formal definitions we may keep in mind the example of the tangent bundle to a manifold. such as the n-sphere

sn.

Similarly. the collection of normal lines to

sn

form a vector

bundle. The definitions will follow thc pattern of those for a manifold. Namely. we obtain a vector bundle by smoothly patching together local vector bundles. The following terminology for vector space products and maps will be useful.

3.4.1 Definition Let E and F be vector spaces with U an open subset of E. We call the Cartesian product U x F a local vector bundle. We call U the base space. which can be identified with U x {O}. the zero section. For u E U, {u} x F is called the fiber of u, which we endow with the vector space structure of F. The map 1t: U x F ~ U given by 1t(u. f) = u is called the projection of U x F. (Thus, the fiber over u E U is 1t- I (u). Also note that U x F is an open subset of E x F and so is a local manifold.) Next. we introduce the idea of a local vector bundle map. The main idea is that such a map must map a fiber linearly to a fiber.

3.4.2 Definition Let U x F and U' x F be local vector bundles. A map 1 (u) 1 x F' and so restricted is linear. By Banach's Isomorphism Theorem it follows that a local vector bundle map Cj> is a local vector bundle isomorphism iff Cj>2(u) is a Banach space isomorphism for every u E U.

~

SUPPLEMENT 3.4A Smoothness of Local Vector Bundle Maps C~

In some examples, to check whether a map Cj> is a faced with the rather unpleasant task of verifying that Cj>2 : U

~

local vector bundle map, one is L(F, F) is

C~.

It would be nice

to know that the smoothness of Cj> as a function of two variables suffices. This is the context of the next proposition. We state the result for

C~,

but the proof contains a

cr

result (with an

interesting derivative loss) which is discussed in the ensuing Remark A. 3.4.3 Proposition A map Cj>: U x F

~

U' x F is a

~

local vector bundle map if Cj> is

C~

and

is of the form Cj>(U'~l(u), Cj>2(u), f), where ~ : U ~ U' and Cj>2: U ~ L(F, F') are C~. Proof (M. Craioveanu and 1r. Ratiu [1976]) The evaluation map ev: L(F, F) x F ~ F; ev(T, f)

= T(f) is clearly bilinear Jilld continuous. First assume Cj> is a cr local vector map, so Cj>2 : U ~ L(F, F) is cr. Now write Cj>2(u), f =(ev 0 (Cj>2 x I»(u, f). By the composite mapping theorem, it follows that Cj>2 is C as a function of two variables. Thus Cj> is Cr by 2.4.12 (iii). Conversely, assume Cj>(u, f) = (Cj>l(u), Cj>2(u) . f) is and Cj>z{u)· f are

~

suffices to prove the following:

for all u

E

C~.

Then again by 2.4.12 (iii), Cj>l(u)

as functions of two variables. To show that Cj>2: U

if

h :U x F

~

F is Cr, r

~

~

L(E, F) is

1, and such that h(u, .)

E

C~,

it

L(F, F)

U, then the map h': U ~ L(F, F), defined by h'(u) = h(u, .) is C-1. This will be

shown by induction on r.

=I

we prove continuity of h' in a disk around Uo E U in the following way. By continuity of Dh, there exists e> 0 such that for all u E DrCuo) and v E DE(O), 1\ D1h(u, v) 1\ .,:; If r

N for some N > O. The mean value inequality yields 1\ h(u, v) - h(u', v) 1\.,:; N

1\ h'(u) - h'(u') 1\ = sup IIvllS1

II u - u' II

II h(u, v) - h(u', v) 1\ < N 1\ u - u' 1\, e

proving that h' is continuous. Let r> 1 and inductively assume that the statement is true for r - 1. Let S : L(F, L(E, F» ~ L(E, L(F, F» be the canonical isometry: S(T)(e)· f = T(f) . e. We shall prove that

Section 3.4 Vector Bundles

169

(1)

where (D1h)'(u)· v = D1h(u, v). Thus, if h is Cr , Dlh is Cr - l , by induction (D1h)' is Cr - 2, and hence by (1), Dh' will be Cr - 2. This will show that h' is Cr - l . For relation (1) to make sense, we first show that D1h(u, .) E L(F, L(E, F'». Since lim [h'(u + tw) - h'(u)] . v I ....

0

---------- =

lim A.,v

for all v E F, where

it follows by the Unifonn Boundedness Principle (or rather its corollary 2.2.21) that D1h(u,·) . WE L(F, F). Thus (v, w) 1-+ D1h(u, v) . w is linear continuous in each argument and hence is bilinear continuous (Exercise 2.2J), and consequently v 1-+ D1h(u, v) E L(E, F) is linear and continuous. Relation (1) is proved in the following way. Fix

Uo E

U and let

E

and N be positive

constants such that

II D1h(u, v) - D1h(u', v) II

~

Nil u - u' II

(2)

for all u, u' E 02/uO) and v E 0,(0). Apply the mean value inequality to the Cr - l map g(u) = h(u, v) - D1h(u', v) . u for fixed u' E 02,(u O) and v EO/D) to get

II h(u + w, v) - h(u, v) - Dlh(u', v) . w II = II g(u + w) - g(u) II ~ II w II sup II Dg(u + tw) II IElO,l)

=

II w II sup II Dlh(u + tw, v) - D1h(u', v) II IE [0,11

for w E O,(uo)' Letting u' --+ u and taking into account (2) we get

II h(u + w, v) - h(u, v) - D1h(u, v) . w II ~ N II w 112; i.e.,

II h'(u + w) . v - h'(u)· v - [(S (D1h)' )(u) . w](v) II ~ N II W 112 0

for all v E 0,(0), and hence

II h'(u + w) - h'(u) - (S thus proving (1).

•

0

(D1h)')' w II

170


Remarks A If F is finite dimensional and if h : U x F L(F. F) for all u

E

~

F is

cr.

r

~

1. and is such that h(u •. ) E

U. then h': U ~ L(F. F') given by h'(u) = h(u .. ) is also

cr.

In other

words. Proposition 3.4.3 holds for Cr-maps. Indeed. since F = lRn for some n. L(F. F') ;: F' x ... X

F' (n times) so it suffices to prove the statement for F = R Thus we want to show that if h

cr

: U x lR ~ F is for all (u. x)

E

and h(u. 1) = g(u)

E

F'. then g: U ~ F is also

cr.

Since h(u. x)

= xg(u)

=g

is a Cr

U x lR by linearity of h in the second argument. it follows that h'

map. B If F is infinite dimensional the result in the proof of Proposition 3.4.3 cannot be

improved even if r = O. The following counterexample is due to A.J. Trunba. Let h: [0. 1] x L2[0. 1] ~ L2[0. 1] be given by

1o 1

hex. and q> is Ihe natural identification

by definition of Tq>, and this is exactly TmM. For the second assertion, determined by (O), a diffeomorphism. _

Thus, TmM is isomorphic to the Banach space E, the model space of M, M is identified with the zero section of TM, and

'tM is identified with the bundle projection onto the zero section. It is also worth recalling that the local representative 'tM is (q> 0 't M 0 Tq>-I)(u, e) = u, i.e., just the projection of q>(0) x E to q>(0).

174


3.4.10 Examples A

Any manifold M is a vector bundle with zero-dimensional fiber, namely M x {OJ.

B

The cylinder E = S I X lR is a vector bundle with 1t: E ~ B = S I the projection on the

first factor (Figure 3.4.3). This is a trivial vector bundle in the sense that it is a product. The cylinder is diffeomorphic to TSI by Example 3.3.14A. ~ Fiber~1I

1T =

t--

hase

=

prOjectIOn

Sl

Figure 3.4.3 C

The Mobius band is a vector bundle 1t: M ~ Sl with one dimensional fiber obtained

in the following way (see Figure 3.4.4). On the product manifold lR x lR, consider the equivalence relation defined by (u, v) - (u', v') iff u' = u + k, v' = (- ltv for some k E Z and denote by p: lR x lR~ M the quotient topological space. Since the graph of this relation is closed and p is an open map, M is a Hausdorff space. Let [u, v] = p(u, v) and define the projection 1t: M~SI by 1t[u,v]=e21tiu . Let V I =]O,l[xlR, V2 =]_1'2,1'2[xlR, UI=SI\{l} and U2 = Sl\{- 1} and then note that p I VI: VI ~ 1t- I (U I ) and pi V 2 : V 2 ~ 1t-I(U 2) are homeomorphisms and that M = n-I(U I ) U 1t- I(U 2). Let «UI' 1)' (U2' 2)} be an atlas with

two charts for Sl (see 3.1.2). Define 'Vj: 1t- I(U) ~ lR x lR by 'Vj = Xj 0 (p I V)-I and Xj : Vj ~ lR x JR by X/u, v) = (/e 21tiu ), (-l)i+lv), j = 1,2 and observe that Xj and 'Vj are homeomorphisms. Since the composition 'V20 'VI-I: (JR x JR)\«O) x JR) ~ (lR x JR)\«O) x JR) is given by the fonnula ('V2 0 'VI-I)(x, y) = «2 0 1-1)(x), -y), we see that ((1t- I (U I ), 'VI)' (1t- I(U 2), 'V2)} fonns a vector bundle atlas of M.

175

Section 3.4 Vector Bundles

Figure 3.4.4 D yn(E)

~

The Grassmann bundles (universal bundles) G n (E), yn(E)

~

Gn(E), and y(E)

~

We now define vector bundles

G(E) which play an important role in the

classification of isomorphism classes of vector bundles (see for example Hirsch [1976]). The definition of the projection p: Yn(E)

~

Gn(E) is the following (see Example 3.1.SG for

notations): recalling Yn(E) = {(F, v) I F is an n-dimensional subspace of E and v E F}, we set p(F, v) = F. The charts (p-l(U G), 'I'FG)' where E = F EB G, 'I'FG(H, v) = (GlFG(H), 1t G(H, F)(v», and 'I'FG: p-l(U G) ~ L(F, G) x F, define a vector bundle structure on Yn(E) since the overlap maps are

where T

E

L(F, G), f

E

F, and graph(T) denotes the graph of T in Ex F; smoothness in T

is shown as in 3.1.SG. The fiber dimension of this bundle is n. A similar construction holds for Gn(E) yielding yn(E); the fiber codimension in this case is also n. Similarly y(E) obtained with not necessarily isomorphic fibers at different points of G(E).

~

G(E) is

•

3.4.11 Definition Let E and E' be two vector bundles. A map f: E ~ E' is called a C' vector

bundle mapping (local isomorphism) whenfor each vEE and each admissible local bundle chart (V, 'V) of E' for which f(v)

E

V, there is an admissible local bundle chart (W, Gl) with feW)

W' such that the local representative f'l'ljI = 'V 0 f

0

C

Gl-1 is a C' local vector bundle mapping (local

isomorphism). A bijective local vector bundle isomorphism is called a vector bundle isomorphism. This definition makes sense only for local vector bundle charts and not for all manifold Charts. Also, such a W is not guaranteed by the continuity of f, nor does it imply it. However, if we first check that f is fiber preserving (which it must be) and is continuous, then such an open set W is guaranteed. This fiber-preserving character is made more explicit in the following. 3.4.12 Proposition Suppose f: E ~ E' is a C' vector bundle map, r ~ O. Then: (i) f preserves the zero section: feB) C B'; (ii) f induces a unique mapping f B : B ~ B' such that the following diagram commutes:

176


f

E

,I

..

E'

...

B'

B

I"

fB

that is,

1t'

0

fB = fB

1t.

0

(Here,

1t

and

1t'

are the projection maps.) Such a map f is called a

vector bundle map over f B. C~

(iii) A

such that

1t'

0

map g: E -+ E' is a vector bundle map iff there is a

g =gB

0

1t

c~

map gB : B -+ B'

and g restricted to each fiber is a linear continuous map into afiber.

Proof (i) Suppose bE B. We must show f(b) E B'. That is, for a vector bundle chart (V, 'JI) with f(b) E V we must show 'JI(f(b» f(W)

C

V, and and 'E' :: (E 'k)keIUJ of Banachable spaces. Let

ieI

jeJ

and let

i.e., Ai E L(Ei, E'i)' i E I, and Aj E L(E'j' Ej), j E J. An assignment n taking any family 'E to a Banach space n'E and any sequence of linear maps (Ak) to a linear continuous map n(Ak ) E L(n'E, n'E ') satisfying n(Il\) = IU'L' n«B k ) 0 (Ak» = n«B k » 0 n«A k » (composition is taken componentwise) and is such thatthe induced map n: L('E, 'E ')

~

L(n'E, n'E ') is

C~,

will be called a tensorial construction of type (I, J).

3.4.24 Proposition Let n be a tensorial construction of type (I, J) and 'E :: (Ek)ke IUJ be a family of vector bundles with the same base B. Let

Then n'E has a unique vector bundle structure over B with (n'E)b:: n~ and 7t: n'E ~ B sending n~ to bE B, whose atlas is given by the charts (n-I(U), '1'), where '1': n-1(U) ~ U' x n«p N is an injective immersion, f(M) is called an immersed submanifold of N. 3.5.9 Definition An immersion f: M -> N that is a homeomorphism onto f(M) with the relative

topology induced from N is called an embedding. Thus, if f: M -> N is an embedding, then f(M) is a submanifold of N. The following is an important situation in which an immersion is guaranteed to be an embedding; the proof is a straightforward application of the definition of relati ve topology. 3.5.9 Embedding Theorem An injective immersion which is an open or closed map onto its

image is an embedding. The condition "f: M -> N is closed" is implied by "f is proper", i.e. each sequence xn E M with f(x n) convergent to y N has a convergent subsequence xn(i) in M such that f(xn(i») converges to y. Indeed, if this hypothesis holds, and A is a closed subset of M, then f(A) is shown to be closed in N in the following way. Let xn E A, and suppose f(x n) = Yn converges to yEN. Then there is a subsequence (zml of {xn}, such that zm -> x. Since A = cl(A), x E A and by continuity of f, y = f(x) E f(A); i.e., f(A) is closed. If N is infinite dimensional, this hypothesis is assured by the condition "the inverse image of every compact set in N is compact in M". This is clear since in the preceding hypothesis one can choose a compact neighborhood V of the limit of f(x n) in N so that for n large enough, all xn belong to the compact neighborhood r-1(V) in M. The reader should note that while both hypotheses in the proposition are necessary, properness of f is only sufficient. An injective nonproper immersion whose image is a submanifold is, for example, the map f: la, =[ ->]R2 given by

f(t) = (t cos

T' t sin T).

This is an open map onto its image so 3.5.9 applies; the submanifold f(lO, =[) is a spiral around the origin. 3.5.10 Definition A Cr map f: M -> N, r;::: 1, is said to be transversal to the submanifold P of N (denoted f rf1 P) if either f-l(p) = 0, or iffor every mE f-I(P), T1 (T mf)(TmM) + Tf(mt = T f(m)N and T2 the inverse image (Tmf)-I(Tf(mt) of Tf(mt splits in T mM. The first condition T1 is purely algebraic; no splitting assumptions are made on (Tmf)(TmM), nor need the sum be direct. If M is a Hilbert manifold, or if M is finite dimensional, then the splitting condition T2 in the definition is automatically satisfied.

202

Chapter 3 Manifolds and Vector Bundks

3.5.11 Examples A If each point of P is a regular value of f, then f iii P since (Tmf)(T mM) = Tf(m)N in this case. B Assume that M and N are finite-dimensional manifolds with dim(P) + dim(M) < dim(N). Then filip ilT'plies f(M) n P = 0. This is seen by a dimension count: if there were a point m E il(P) n M, then dim(N) = dim«Tmf)(T mM) + Tf(ml) :£ dim(M) + dim(P) < dim(N) which is absurd. C Let M = 1Ft2, N = 1Ft3 , P = the (x, y) plane in 1Ft3 , a E 1Ft and define fa: M ~ N, by fa(x, y) = (x, y, x2 + y2 + a). Then f iii P if a # 0; see Fig. 3.5.4. This example also shows intuitively that if a map is not transversal to a submanifold it can be perturbed very slightly to a transversal map; for a discussion of this phenomenon we refer to the Supplement 3.6B. • Image

of

f.

x

a=O

0>0

0 then

206


which is the local version of the second statement. • Notice that

if rank(Tmt) is finite. The sub immersion theorem reduces to 3.5.4 when f is a submersion. The immersion part of the subimmersion f implies a version of 3.5. 7(iii). 3.5.18 Fibration Theorem Thefollowing are equivalentfor a

C~

map f: M

~

N:

(i) the map f is a sub immersion (if M or N are finite dimensional, this is equivalent to

the rank of Tmf being locally constant); (ii) for each m

E

M there is a neighborhood U of m, a neighborhood V of f(m), and

a submanifold Z of M with m

E

induces a diffeomorphism of f-I(V)

Z such that feU) is a submanifold of Nand f

nZ nU

onto feU)

n V;

(iii) ker Tf is a subbundle of TM (called the tangent bundle to the fibers) andfor each m E M, the image of Tmf is closed and splits in Tf(m)N. If one of these hold and if f is open (or closed) onto its image, then f(M) is a submanifold of N.

Proof To show that (i) implies (ii), choose for U and V the chart domains given by 3.5.16(i) and let Z = -I)(x, y) == x for all (x, y) E VI x VI' If f", == f 0 q>-I : VI x VI ~ JR, ~e have for all v E F, Di",(O, 0) . v == 0 since T nf I T nN == O. Thus, letting Il == DI f",(O, 0) E E ,


212

U E

E and v

E

F, we get

Df",(O, 0) . (u, v)

= Il(u) = (Il

Dg",'II)(O, 0) . (u, v); i.e., Df",(O, 0)

0

To pull this local calculation back to M and P, let A = Il

0

T p \jf

E

= (Il

0

Dg",'II)(O, 0).

T p *p, so composing the

foregoing relation with Tn
*.

i=1 where e l •...• eP is the standard dual basis in JR.p. such that n is a critical point of P

f - A og

=

f-

L

Ail

i=1

In calculus. the real numbers Ai are referred to as Lagrange multipliers. Thus. to find a critical point x = (x I •...• xm) ENe JR.m of fiN one solves the system of m + p equations af

- . (x) -

ax)

Li=1 Ai -agax).i P

o.

j = 1•...• m

gi(x) = O.

i = 1•...• P

(x) =

for the m + p unknowns x I •...• xm. AI' ...• \ . For example. let N = S2 C JR.3 and f: JR.3 ~ JR.; f(x. y. z) = z. Then f I S2 is the height function on the sphere and we would expect (0. O. ±1) to be the only critical poins of f I S2; note that f itself has no critical points. The method of Lagrange multipliers. with g(x. y. z) = x 2 + y2

+ z2 - 1. gives 0- 2xA = O. 0 - 2yA = O. 0 - 2zA = O. and x2 + y2 + z2 = 1. The only solutions are 1..= ±1/2. x = O. Y = O. z = ±1. and indeed these correspond to the maximum and minimum points for f on S2. See an elementary text such as Marsden and Tromba [1988] for additional examples. For more advanced applications. see Luenberger [1969]. The reader will recall from advanced calculus that maximum and minimum tests for a critical point can be given in terms of the Hessian. i.e .• matrix of second derivtives. For constrained problems there is a similar test involving bordered Hessians. Bordered Hessians are simply the Hessians of h = f - Ag + c(A - 1..0)2 in (x. A)-space. Then the Hessian test for maxima and minima apply; a maximum or minimum of h clearly implies the same for f on a level set of g. See Marsden and Tromba [1988. pp. 224-30] for an elementary treatment and applications.

{lJJ

Exercises 3.5A

(i) Show that the set 8L(n. JR.) of elements of L(JR.n. JR.n) with determinant 1 is a closed submanifold of dimension n 2-1; 8L(n. JR.) is called the special linear group. Generalize to the complex case. (ii) Show that O(n) has two connected components. The component of the identity 80(n)

214

Chapter 3 Manifolds and Vector Bundhs

is called the special orthogonal group. = {U E L(en, en) I UU* = Identity} be the unitary group. Show that U(n) is a compact submanifold of dimension n2 of L(en, en) and 0(2n).

(iii) Let U(n)

n SL(n,

(iv) Show that the special unitary group SU(n) = U(n)

C) is a compact n 2-1

dimensional manifold. (v) Define IT =

[~I ~J

L(lR 2n, lR 2n), where I is the identity of lR n. Show that

E

Sp(2n, lR) = {Q E L(lR2n ,lR2n ) I QlTQT = IT} is a compact submanifold of dimension 2n2 + n; it is called the symplectic group.

3.58 Define USt(n, n; k) and SpSt(2n, 2n; 2k) analogous to the definition of OSt (n, n; k) in example 3.S.SD. Show that they are compact manifolds and compute their dimensions. 3.SC

(i) Let P

C

0(3) be defined by P= {QE O(3)ldetQ=+I,Q=QT }\{IJ.

Show that P is a two-dimensional compact submanifold of 0(3). (ii) Define f: lRJP2 ~ 0(3), f(~) = the rotation through is 3.SD (i)

11

about the line

t Show that f

If N is a submanifold of dimension n in an m-manifold M, show that for each x E N there is an open neighborhood U ~ lRm - n, such that N n U = f-I(O).

(ii)

1t

diffeomorphism of lRJP2 onto P.

C

M with x E U and a submersion f: U

Show that lRlP I is a submanifold of lRlP 2,

C

M

which is not the level set of any

submersion of lRJP2 into lRJPI; in fact, there are no such submersions. (Hint.' lRJP] is one-sided in lRJP2). 3.SE

(i) Show that if f: M

~

a submersion, then g (ii) Show that if f; : M; then so is f]

N is a subimmersion, g: N 0

~

~

P an immersion and h: Z

~

M

f 0 h is a sub immersion. N;, i = 1,2 are immersions (submersions, subimmersions),

f2 : M] x M2 ~ N]

X N 2. (iii) Show that the composition of two immersions (submersions) is again an immersion X

(submersion). Show that this fails for subimmersions. (iv) Let f: M ~ Nand g: N ~ P be Cr , r ~ 1. If go f is an immersion, show that f is an immersion. If g

0

f is a submersion and if f is onto, show that g is a

submersion. (v)

Show that if f is an immersion (resp. embedding, submersion, subimmersion) then so is Tf.

Section 3.S Submersions, Immersions, and Transversality

3.SF (i)

21S

Let M be a manifold, R a regular equivalence relation and S another equivalence relation implied by R; i.e., graph R

C

graph S. Denote by S / R the equivalence

relation induced by S on M / R. Show that S is regular iff S / R is and in this case establish a diffeomorphism (M / R) / (S / R) (ii)

M / S.

-7

Let M j , i = 1,2 be manifolds and R j be regular equivalences on M j . Denote by R the equivalence on Ml x Mz defined by Rl x R z. Show that M / R is diffeomorphic to (Ml/Rl)x(Mz/R z).

3.SG The line with two origins Let M be the quotient topological space obtained by starting with (lR x {O}) n (lR x { I}) and identifying (t, 0) with (t, 1) for t

*" O.

Show that this is

a one-dimensional non-Hausdorff manifold. Find an immersion lR -7 M. 3.SH Let f: M

-7

N be C~ and denote by h: TM

-7

f*(TN) the vector bundle map over the

identity uniquely defined by the pull-back. Prove the following: h (i) f is an immersion iff 0 ~ TM ~ f*(TN) is fiber split exact; h (ii) f is a submersion iff TM ~ f*(TN) ~ 0 is fiber split exact; (iii) f is a subimmersion iff kerCh) and range(h) are subbundles. 3.S1 Let A be a real nonsingular symmetric n x n matrix and c a nonzero real number. Show that the quadratic surface {x

E

lRO I (Ax, x) = c} is an (n-l )-submanifold of lRo.

3.SJ Steiner's Roman Surface Let f: SZ -7lR4 be defined by f(x, y, z) (i) Show that f(p)

= f(q)

=(yz, xz, xy, xZ + 2yZ + 3zz).

if and only if p

= iq.

(ii) Show that f induces an immersion f': lRJP>z (iii) Let g: lRlP'z

-7 lR 3

-7

lR4.

be the first three components of f'.

Show that g is a

"topological" immersion and try to draw the surface g(lRJP>z) (see Spivak [1979] for the solution). 3.SK Covering maps Let f: M

-7

N be smooth and M compact, dim(M) = dim(N) < "". If

n is a regular value of f, show that f-l(n) is a finite set {m}> ... , mk } and that there exists an open neighborhood V of n in N and disjoint open neighborhoods U l'

... ,

Uk

of m}> ... , m k such that [-1 (V) = U l U ... U Uk' and f I Uk : U j -7 V, i = 1, ... , k are all diffeomorphisms. Show k is constant if M is connected and f is a submersion. 3.SL Let f: M

-7

N and g: N

-7

P be smooth maps, such that g iii V where V is a

submanifold of P. Show that f iii g-l(V) iff go f iii

v.

216


3.sM Show that an injective immersion f: M

~

N is an embedding iff f(M) is a closed ~

submanifold of an open submanifold of N. Show that if f: M

N is an embedding, f is

a diffeomorphism of M onto f(M). (Hint: See Exercise 2.SL.) 3.SN Show that the map p: St(n, n; k) = Gk(lR n) defined by p(A) = range A is a surjective submersion. 3.50 Show that f: lRpn x lRpm

~

lRpmn+m+n given by (x, y)

~

[xoYo' xoY l'

... ,

xiYj' ... ,

xnYm 1 is an embedding. (This embedding is used in algebraic geometry to define the product of quasi projective varities; it is called the Segre embedding.) 3.SP Show that {(x, y) E lRpn x lRpm I n ~ m, Li=O ..... n xiYi = O} 1)-manifold. It is usually called a Milnor manifold. 3.sQ Fiber product of manifolds Let f: M (f, g) : M x N

~

~

P and g: N ~ P be

C~

is an (m + n -

mappings such that

P x P is transversal to the diagonal of P x P. Show that the set defined

by M xp N = {(m, n)

E

M x N I f(m) = g(n)} is a submanifold of M x N. If M and N

are finite dimensional, show that codim(M xp N)

=dim P.

3.SR (i) Let H be a Hilbert space and aL(H) the group of all isomorphisms A: H

~

H that

are continuous. As we saw earlier, aL(H) is open in L(H, H) and multiplication and inversion are

C~

defined by O(H)

maps, so aL(H) is a Lie group. Show that O(H) c aL(H)

= (A E

aL(H)

I (Ax, Ay) = (x, y)

for all x, y

E

H} is a smooth

submanifold and hence also a Lie group. (ii)

Show that the tangent space at the identity of aL(H) (the Lie algebra) consists of all bounded skew adjoint operators, as follows. Let S(H) = (A

L(H, H) I A * = A} ,

E

where A * is the adjoint of A. Define f: aL(H) ~ S(H), by f(A) = A *A. Show f is C~, 11(1) = O(H), and Df(A)· B = B* A + A *B. Show that f is a submersion, and ker Df(A) = (B

E

L(H, H) I B* A + A*8

space (T E L(H, H) I T* A

=A*T}

= O},

which splits; a complement is the

since any U splits as

U =~ (U - A *-I U*A) + ~(U + A *-IU* A).

3.SS (i)

If f: M

~

N is a smooth map of finite dimensional manifolds and m

there is an open neighborhood U of m such that rank(TJ)

~

E

M, show that

rank(Tmf) for all x E

U. (Hint: Use the local expression of Tmf as a Jacobian matrix.)

(ii) Let M be a finite dimensional connected manifold and f: M ~ M a smooth map satisfying f 0 f = f. Show that f(M) is a closed connected submanifold of M. What is its dimension? (Hint: Show f is a closed map. For m E f(M) show that range(Tmf) = ker(ldentity - Tmf) and thus rank(Tmf) + rank(Identity - Tmf) = dim M. Both ranks can only increase in a neighborhood of m by (i), so rank(Tmf) is locally

Section 3.5 Submersions, Immersions, and Transversality

217

constant on f(M). Thus there is a neighborhood U of f(M) such that the rank of f on U is bigger than or equal to the rank of f on f(M). Use rank(AB)

~

rank A and

the fact that f 0 f = f to show that the rank of f on U is smaller than or equal to the rank of f in f(M). Therefore rank of f on U is constant. Apply 3.S.18(iii).) 3.ST (i)

Let ker

(ii)

(x,

p.

p : E ~ JR Show that

Let f, g : M

~

be continuous linear maps on a Banach space E such that ker and

(X

p are proportional.

(X

=

(Hint: Split E = ker (X EEl JR.)

JR be smooth functions with 0 a regular value of both f and g and

N = f-I(O) = g-I(O). Show that for all x

E

N, df(x) = A(X) dg(x) for a smooth

function A: N ~ JR. 3.5U Let f: M ~ N be a smooth map, peN a submanifold, and assume f iii P. Use 3.S.10 and Exercise 3.4J to show the vector bundle isomorphism v(rl(p»

= r*(v(P».

(Hint:

Look at Tf I f-I(P) and compute the kernel of the induced map TM I f-I(P) ~ v(P). Obtain a vector bundle map v(f-I(P» ~ v(P) which is an isomorphism on each fiber. Then invoke the universal property of the pull-back of vector bundles; see Exercise 3.4M.) 3.SV (i)

Recall (Exercise 3.2A) that So with antipodal points identified is diffeomorphic to

lRlP"'. Conclude that any closed hemisphere of So with antipodal points on the great circle identified is also diffeomorphic to lRlP". (ii)

Let BO be the closed unit ball in JRo. Map BO to the upper hemisphere of So by mapping x ....... (x, (1 -II X 11 2)112). Show that this map is a diffeomorphism of an open neighborhood of BO to an open neighborhood of the upper hemisphere So + = {x

I xo+1

E

So

~ OJ, mapping BO homeomorphic ally to SO+and homeomorphically to the

greatcircle {XE SOlxo+ 1 =O}. Use (i) to show that BO withantipodalpointson the boundary identified is diffeomorphic to lRlP". (iii)

Show that SO(3) is diffeomorphic to

JRllil; the diffeomorphism is induced by the map

sending the closed unit ball B3 in JR3 to SO(3) via (x, y, z) ....... (the rotation about (x, y, z) by the right hand rule in the plane perpendicular to (x, y, z) through the angle 1t(x2 + y2 + z2)1!2. 3.SW (i) (ii) (iii)

Show that So x JR embeds in JRo+l . (Hint: The image is a "fat" sphere.) Describe explicitly in terms o[ trigonometric functions the embedding of ']['2 into JR3. sa(1) x x sa(k), where a(1) + a(k) = n embeds in JR o+l . (Hint:

Show that

00.

00.

Show that its product with JR embeds in JRo+1 by (i).) 3.7X Let f: M ~ M be an involution without fixed points, i.e.,

[0 [

= identity and f(m) oFm

for all m. Let R be the equivalence relation determined by f, i.e., m l Rm2 iff f(m l ) = [(m 2 )·

(i) (ii)

Show R is a regular equivalence relation. Show that the differentiable structure of M! R is uniquely determined by the property: the projection 1t: M

~

M / R is a local diffeomorphism.


218

3.SY Connected sum of manifolds Let M and N to be two Hausdorff manifolds modeled on the same Banach space E. Let m E M. n E N and let (Uo' 0 and A E E*. where E is a Banach space whose norm is of class C r away from the origin and

r

~

1. Write E = ker A EEl JR.; this is always possible since any

closed fmite codimensional space splits (see §2.2). Define. for any t E JR.. fA,a,t: E

~

10 where u E ker A and x E R Show that fA,a,t satisfies fA,a,t(O. 0) = (0. 1) and fA,.,t I (E \ clB.(O» = identity. (Hint: Show that

E by fA,a,t(u.x) = (u. x + tx.(11 u

(iii)

fA .• ,t is a bijective local diffeomorphism.) Let M be a Cr Hausdorff manifold modeled on a Banach space E whose norm is C' on E \ {O}, r

~

1. Assume dim M

~

2. Let C be a closed set in M and assume that

M \ C is connected. Let {Pl' .. ·• Pk}, {ql' .... qk} be two finite subsets of M \ C. Show that there exists a cr diffeomorphism m, we may assume q

~

neighborhood V such that f(V Choose

Xo E

neighborhood V E

VI and x

E

= 1, ... , n-l.

Since Kq is empty for q

m. As before it will suffice to show that each point Kq has a

n Kq)

has measure zero.

K q. By the local representation theorem 2.5.14 we may assume that

=V I X V 2'

Xo

has a

where V I C ]Rq and V2 C ]Rm-q are open balls, such that for t

V 2 ' f(t, x) = (t, l1(t, x)). Hence 11 : VI x V2 ~ ]Ro-q is a C k map. For t

E

VI

define TIt: V 2 ~ ]Ro-q by Tlt(x) = TI(t, x) for x E V 2. Then for every t E V I'

This is because, for (t, x)

E

VI

X

V2 ' Df(t, x) is given by the matrix

Df(t, x) = [I*q

Hence rank Df(t, x)

=q

iff Dllt(x)

0]

Dllt(x) .

= O.

Now TIt is Ck and k ~ m - n = (m - q) - (n - q). Since q ~ 1, by induction we find that the critical values of TIt' and in particular TIt< {x E V2 I Dllt(x)

=O}),

has measure zero for each t

V 2. By Fubini's lemma, f(K q n V) has measure zero. Since Kq is covered by countably many such V, this shows th~t f(Kq) has measure zero. E

Proof of Step 3 To show f(C s \ Cs+I) has measure zero, it suffices to show that every x

E Cs \ Cs+ I has a neighborhood V such that f(C s n V) has measure zero; then since C s \ Cs+ I is covered by countably many such neighborhoods V, it follows that f(C s \ Cs+I) has measure zero.

Choose Xo E C s \ C s+ I. All the partial derivatives of f at Xo of order less than or equal to s are zero, but some partial derivative of order s + 1 is not zero. Hence we may assume that DI w(x o) "* 0 and w(x o) = 0, where DI is the partial derivative with respect to Xl and that w has the form w(x) Define h: U

~]Rm

=Di(l) ... Di(./(x).

by hex) = (w(x), x 2 '

... ,

xm),

where x = (xI' X2 ' ... , Xm) E U C ]Rm. Clearly h is Ck-s and Dh(x o) is nonsingular; hence there is an open neighborhood V of Xo and an open set W C]Rm such that h : V ~ W is a Ck-s diffeomorphism. Let A = C s n V, A' = h(A) and g = h- I . We would like to consider the function fog and then arrange things such that we can apply the inductive hypothesis to it. If k == 00, there is no trouble. But if k < 00, then fog is only Ck - s and the inductive hypothesis would not apply anymore. However, all we are really interested in is that some Ck function F: W ~ ]R0 exists such that F(x) = (f 0 g)(x) for all x E A' and DF(x) = 0 for all x

E

A'. The existence

Section 3.6 The Sard and Smale Theorems

223

of such a function is guaranteed by the Kneser-Glaeser rough composition theorem (Abraham and Robbin [1969]). For k = 00. we take F = fog. In any case. define the open set Woe jRm-1 by

Let S= {(x 2... ·.X m)E WoIDFo(x2.···.xm)=0}. By the induction hypothesis. Fo(S) has measure zero. But N = h(C s n V) cOx S since for x E N. DF(x) = 0 and since for x E Cs n v.

because w is an s-th deriviative of f. Hence f(Cs and so f(Cs

n V) = F(h(Cs n V))

C

F(O x S) = Fo(S).

n V)

has measure zero. As C s \ Cs+ I is covered by countably many such V. the sets f(Cs \ Cs+I) have measure zero (s = 1•...• k - I) . • The smoothness assumption k counterexample shows.

~

1 + max(O. m-n) cannot be weakened as the following

3.6.7 Example Devil's Staircase Phenomenon The Cantor set C is defined by the following construction. Remove the open interval ]-1/3.2!3[ from the closed interval [0.1]. Then remove the middle thirds ]1/9. 2/9 [ and ]7/9. 8/9 [ from the closed intervals [0.1/3] and [2/3.1] respectively and continue this process of removing the middle third of each remaining closed interval indefinitely. The set C is the remaining set. Since we have removed a (countable) union of open intervals. C is closed. The total length of the removed intervals equals (1/3)L,n;,o(2!3)n = 1 and thus C has measure zero in [0. 1]. On the other hand each point of C is approached arbitrarily closely by a sequence of endpoints of the intervals removed. i.e .. each point of C is an accumulation point of [0. 1] \ C. Each open subinterval of [0. 1] has points in common with at least one of the deleted intervals which means that the union of all these deleted intervals is dense in [0.1]. Therefore C is nowhere dense. Expand each number x in [0. 1] in a ternary expansion 0.a l a 2 ... i.e .• x = L,n;,o3- nan. where an = O. 1. or 2. Then it is easy to see that C consists of

all numbers whose ternary expansion involves only 0 and 2. (The number 1 equals 0.222 .... ) Thus C is in bijective correspondence with all sequences valued in a two-point set. i.e .• the Cardinality of C is that of the continuum; i.e.. C is uncountable. We shall construct a Cl function f: jR2 - t lR which is not C2 and which contains [0.2] among its critical values. Since the measure of this set equals 2 this contradicts the conclusion of

224


Sard's theorem. Note, however, that there is no contradiction with the statement of Sard's theorem since f is only C l . We start the construction by noting that the set C + C = {x + y I x, Y E C} equals [0,2]. The reader can easily be convinced of this fact by expanding every number in [0, 2] in a ternary expansion and solving the resulting undetermined system of infinitely many equations. (The number 2 equals 1.222..... ) Assume that we have constructed a Cl-function g: lR -t lR which is not C2 and which contains C among its critical values. The function f(x, y) = g(x) + g(y) is C l , is not C2 and if c l ' c 2 E C, then there are critical points xl' X 2 E [0, 1] such that g(x j ) = c j , i = 1,2; i.e., (xl' x2) is a critical point of f and its critical value is c l + c 2. Since C + C = [0, 2], the set of critical values of f contains [0, 2]. We proceed to the construction of a function g: lR -t lR having all points of C as critical values. At the k-th step in the construction of C, we delete 2 k- 1 open intervals, each of length 3-k. On these 2k- 1 intervals, construct (smooth) congruent bump functions of height 2-k and area (const.) 2-k3-k (Fig. 3.6.1).

k=2

o

It -1/9

t

1/4

+

__

--

1/9

--

Figure 3.6.1 to x of hk, so gk' = These define a smooth function hk; let gk be the integral from hk and gk is smooth. At each endpoint of the intervals, hk vanishes, i.e., the finite set of -00

endpoints occurring in the k-th step of the construction of C is among the critical values of gk' It is easy to see that h = Lk~l hk is a uniformly Cauchy series and that g = ~~lgk is pointwise Cauchy; note that gk is monotone and gk(l) - gk(O) = (const.) 3-k. Therefore g defines a C I function with g' =h. The reader can convince himself that h has arbitrarily steep slopes so that g is not C2. The above example was given by Grinberg [1981]. Other examples of this sort are due to Whitney [1935] and Kaufman [1979] .• We proceed to the global version of Sard's theorem on finite-dimensional manifolds. Recall that a subset of a topological space is residual if it is the countable intersection of open dense sets. The Baire category theorem 1.7.3 asserts that a residual subset of a a locally compact space or of a complete pseudometric space is dense. A topological space is called Lindelof if every open covering has a countable subcovering. In particular, second countable topological spaces are LindelOf. (See 1.1.6.)

225


3.6.8 Sard's Theorem for Manifolds Let M and N befinite·dimensional Ck manifolds, dim(M) = m, dim(N) = nand f: M -t N a Ck mapping, k ~ 1. Asswne M is Lindelof and k

> max(O, m - n). Then '.t r is residual and hence dense in N. Proof

Denote by C the set of critical points of f. We will show that every x

E

M has a

neighborhood Z such that '.t r I cl(Z) is open and dense. Then, since M is LindelOf we can find a countable cover {Zj} of X with '.t r I cl(Z) open and dense. Since '.t r = nj'.tr I cl(Zj)' it will follow that '.tr is residual. Choose x

E

M. We want a neighborhood Z of x with '.t r I cl(Z) open and dense. By

taking local charts we may assume that M is an open subset of

]Rm

open neighborhood Z of x such that cl(Z) is compact. Then C = {x

and N = E

Choose an

]R".

M I rank Df(x) < n} is

closed, so cl(Z) n C is compact, and hence f(cl(Z) n C) is compact. But f(cl(Z) n C) is a subset of the set of critical values of f and hence, by Sard's theorem in

]R",

closed set of measure zero is nowhere dense; hence '.t r I cl(Z) = dense . •

f(cl(Z)

]R" \

has measure zero. A

n C)

is open and

We leave it to the reader to show that the concept of measure zero makes sense on an n-manifold and to deduce that the set of critical values of f has measure zero in N. To consider the infinite-dimensional version of Sard's theorem, we first analyze the regular points of a map.

3.6.9 Lemma The set SL(E, F) of linear continuous split surjective maps is open in L(E, F). Proof Choose A -t

E

SL(E, F), write E = F EE! K where K is the kernel of A, and define A': E

F x K by A'(e) = (A (e), p(e)) where p: E = F EE! K -t K is the projection. By the closed

graph theorem, p is continuous; hence A' -t

L(E, F) given by T(B) = 1t 0 B

T is linear, continuous (1I1t 0 B II

E

uL(E, F x K). Consider the map T: L(E, F x K)

E

L(E, F x K), where 1t: F x K -t F is the projection. Then

:0;

111t 1111 B II), and surjective; hence, by the open mapping

theorem, T is an open mapping. Since uL(E, F x K) is open in L(E, F x K), it follows that T(uL(E, F x K)) is open in L(E, F). But A = T(A') and T(uL(E, F x K))

C

SL(E, F). This

shows that SL(E, F) is open. •

3.6.10 Proposition Let f: M -t N be a C 1 mapping of manifolds. Then the set of regular points is open in M. Consequently the set of critical points of f is closed in M.

Proof It suffices to prove the proposition locally. Thus, if E, F are the model spaces for M and N, respectively, and x E U C E is a regular point of f, then Df(x) E SL(E, F). Since Df: U -t L(E, F) is continuous, (Df)-I(SL(E, F)) is open in U by lemma 3.6.9 . • 3.6.11 CorOllary Let f: M -t N be Cl and P a submanifold of N. The set {m transversal to P at m } is open in M.

E

M I f is

226


Proof Assume f is transversal to P at mE M. Choose a submanifold chart (V, cp) at f(m) E P, cp : V ~ Fl X F2, cp(V n P) = Fl X (OJ. Hence if 1t : Fl x F2 ~ F2 is the canonical projection, vnp=cp-l(F1x{O})=(1tocp)-I{Oj. Clearly, 1tocp:vnp~F2 is a submersion so that by 3.5.4, ker T f(m)(1t 0 cp) = Tf(ml. Thus f is transversal to P at the poing f(m) iff Tf(p)N = ker Tf(m)(1t 0 cp) + Tf(Pl and (Tmo-1(Tf(m)P) = ker Tm(1t 0 cp 0 0 splits in TmM. Since cp 0 1t is a submersion this is equivalent to 1t 0 cp 0 f being submersive at m E M (see Exercise

cp 0 f is submersive is open in V, hence in

2.2E). From Proposition 3.6.10, the set where

1t

M, where V is a chart domain such that f(V)

v .•

3.6.12 Example assumptions.

C

0

If M and N are Banach manifolds, the Sard theorem is false without further

The following couterexample is, so far as we know, due to Bonic, Douady, and

Kupka. Let E = {x = (xl' x 2' ... ) I xi E lR,lI x 112 = Lj~I(X/ j)2 < oo}. which is a Hilbert space with respect to the usual algebraic operations on components and the inner product (x, y) =

Lj~IXjY/ p. Consider the map f: E ~ lR given by f(x) = Lj~I(-2x/ + 3x/)!2, which is defined since x E E implies I xi I < c for some c > 0 and thus

I- 2 Xj.2)+ 3Xj21 3

~

2c 3.3 ,.3 J + 3c 2.2 J < 2. 2) 2j ,

i.e., the series f(x) is majorized by the convergent series C'Lj~IP/2j. We have Df(x) . v = Lj~16(-x/ + x)Vj / 2j; i.e., f is C 1. In fact f is C~. Moreover, Df(x) = 0 iff all coefficients of Vj are zero, i.e., iff Xj = 0 or Xj = 1. Hence the set of critical points is {x EEl Xj = 0 or I} so that the set of critical values is

But clearly [0, 1] has measure one. • Sard's theorem holds, however, if enough restrictions are imposed on f. The generalization we consider is due to Smale [1965]. The class of linear maps allowed are Fredholm operators which have splitting properties similar to those in the Fredholm alternative theorem. 3.6.13 Definition Let E and F be Banach spaces and A E L(E, F). Then A is called a Fredholm operator if: (i) A is double splitting; i.e., both the kernel and the image of A are closed and have

closed complement;

(ii) the kernel of A is finite dimensional; (iii) the range of A has finite codimension.In this case, if n = dim(ker A) and p = codim(range(A», index(A): = n - p is the index of A. If


227

M and N are C 1 manifolds and f: M --t N is a C 1 map, we say f is a Fredholm map iffor

every x

E

M. T/ is a Fredholm operator.

Condition (i) follows from (ii) and (iii); see exercises 2.2H and 2.2N.

A map g between

topological spaces is called locally closed if every point in the domain of definition of g has an open neighborhood U such that g I cl(U) is a closed map (i.e .• maps closed sets to closed sets). 3.6.14 Lemma A Fredholm map is locally closed.

Proof By the local representative theorem we may suppose our Fredholm map ha~ the form f(e. x)

=(e.ll(e. x)). for

D 1, and x E D2• where f: Dl x D2 --t E x]R.P and Dl c E. D2 C ]R.n are open unit balls. Let U 1 and U 2 be open baIls with cl(Ul) C Dl and cl(U 2) C D2. Let U = e

E

U 1 X U 2 so that cl(U) = cl(U 1) x cl(U 2). Then f I cl(U) is closed. To see this. suppose A C cl(U) is closed; to show f(A) is closed. choose a sequence (e i • y)} such that (e i • y) --t (e. y) as i --t

00

and (e i • Yi)

f(A). say (ei • Yi) = f(e i • x). where (ei • Xi)

E

cl(U 2) is compact. we may assume (e, x)

E

E

A. Since Xi E cl(U 2) and

cl(U 2). Then (ei • Xi) --t (e. x). Since A is closed. f(A). Thus f(A) is closed. •

Xi --t X E

A. and f(e, x) = (e. y). so (e. y)

E

3.6.15 The Smale-Sard Theorem Let M and N be Ck manifolds with M LindeLOf and

assume that f: M --t N is a Ck Fredholm map, k ~ 1. Suppose that k> max(O. index(T/)) for every x E M. Then '1(f is a residual subset of N. Proof It suffices to show that every

Xo E

M has a neighborhood Z such that '1(f I Z) is open

and dense in Z. Choose

M. We shall construct a neighborhood Z of

Z E

Z

so that '1(f I Z) is open and

dense. By the local representation theorem we may choose charts (U. a) at z and such that a(U)

Ex Rn. ~(V)

C

C

Ex ]R.P and the local representative faP = ~

has the form fap(e. x) = (e. ll(e. x)) for (e. x)

E

a(U). (Here x E ]R.n. e

E

0

(V,~)

f0

a-I

at f(z) of f

E. and II : a(U) --t

]R.p.) The index of Tzf is n - p and so k > max(O. n - p) by hypothesis. We now show that 1\(f I U) is dense in N. Indeed it suffices to show that 1\(fap) is dense in Ex ]R.p. For e

E, (e. x)

E

E

a(U), defme lle(x)

=ll(e. x).

Then for each e, lle is a

Ck map defined on an open set of ]R.n. By Sard's theorem. 1\(lle) is dense in ]R.n for each e E. But for (e. x)

E

a(U)

C

E x ]R.n. we have

Dfap(e. x)

=

[*1

0 ] DTJe(x)

so Dfap(e. x) is surjective iff Dll/x) is surjective. Thus for e

E

E

E


228

and so ~faj3) intersects every plane {e} x lRP in a dense set and is, therefore, dense in Ex lR P, by 3.6.2. Thus 'l{(f I U) is dense as claimed. By lemma 3.6.14 we can choose an open neighborhood Z of z such that c1(Z)

C

U and

f I c1(Z) is closed. By Proposition 3.6.10 the set C of critical points of f is closed in M. Hence, f(c1(Z) 'l{(f I U)

C

n C)

is closed in N and so 'l{(f I c1(Z» = N\ f(c1(Z)

n C)

is open in N. Since

'l{(f I c1(Z», this latter set is also dense.

We have shown that every point z has an open neighborhood Z such that 'l{(f I c1(Z» is open and dense in Z. Repeating the argument of 3.6.8 shows that 'l(f is residual (recall that M is Lindelof) . • Sard's theorem deals with the genericity of the surjectivity of the derivative of a map. We now address the dual question of genericity of the injectivity of the derivative of a map. 3.6.16 Lemma The set IL(E, F)

0/ linear continlUJus split injective maps is open in

L(E, F).

Proof Let A E IL(E, F). Then A(E) is closed and F =A(E) EEl G for G a closed subspace of F. The map

r: E x G

--t

F; defined by r(e, g) = A(e) + g is clearly linear, bijective, and

continuous, so by Banach's isomorphism theorem --t

L(E, F) given by P(B)

=B I E

r

E uL(E x G, F). The map P: L(E x G, F)

is linear, continuous, and onto, so by the open mapping

theorem it is also open. Moreover P(r) = A and P(uL(E x G, x G, F) then F

= B(E) EEl B(G)

F»

C

IL(E, F) for if BE uL(E

where both B(E) and B(G) are closed in F. Thus A has an

open neighborhood P(uL(E x G, F» contained in IL(E, F). • 3.6.17 Proposition Let f: M --t N be a C1_map o/manifolds. The set P = {x E M I f is an immersion at x} is open in M. Proof It suffices to prove the proposition locally. If E and F are the models of M and N respectively and if f: U

--t

E is immersive at x E U

C

E, then Df(x) E IL(E, F). By lemma

3.6.16, (Df)-I(IL(E, F» is open in U since Df: U --t L(E, F) is continuous. • The analog of the openness statements in proposition 3.6.10 and 3.6.17 for sub immersions follows from definition 3.5.15. Indeed, if f: M

--t

N is a Cl map which is a subimmersion at x

E M, then there is an open neighborhood U of x, a manifold P, a submersion s: U an immersion j : P

--t

N such that flU = j

0

--t

P, and

s. But this says that f is subimmersive at every

point of U. Thus we have the following. 3.6.18 Proposition Let f: M --t N be a Cl-map o/manifolds. Then the set P = {x E M I f is a subimmersion at x} is open in M. If M or N are finite dimensional then P = {x E M I rank Txf is locally constant} by proposition 3.5.16. Lower semicontinuity of the rank (Le., each point x E M admits an open


~

neighborhood of U such that rank T /

rank T J for all y

E

229

U; see exercise 2.SJ(i)) implies

that P is dense. Indeed, if V is any open subset of M, by lower semicontinuity {x TJ

is maximal}

E

V I rank

is open in V and obviously contained in P. Thus we have proved the

following. 3.6.19 Proposition Let f: M ~ N be a Cl-map o/manifolds where at least one 0/ M or N are

finite dimensional. Then the set P = {x

E

M

If

is a subimmersion at x} is dense in M.

3.6.20 Corollary Let f: M ~ N be a C l injective map o/manifolds and let dim(M) = m. Then

the set P = {x E M I f is immersive at x} is open and dense in M. In particular, if dim(N) = n, then m::; n. Proof By 3.6.18 and 3.6.19, it suffices to show that if f: M ~ N is a Cl-injective map which is subimmersive at x, then it is immersive at x. Indeed, if flU = j

0

s where U is an open

neighborhood of x on which j is injective, then the submersion s must also be injective. Since submersions are locally onto, this implies that s is a diffeomorphism in a neighborhood of x. i.e .• f restricted to a sufficiently small neighborhood of x is an immersion. • There is a second proof of this corollary that is independent of proposition 3.6.19. It relies ultimately on the existence and uniqueness of integral curves of C l vector fields. This material will be treated in Chapter 4. but we include this proof here for completeness. Alternative Proof of 3.6.20 (D. Burghelea) It suffices to work in a local chart V. We shall use induction on k to show that c1(Ui(I). "', i(k») ::) V. where

Ui(I), "', i(k) = { x E V

I TxfCx~(I» ..... Txf (ax~(k»)

linearly independent} .

The case k = n gives then the statement of the theorem. Note that by the preceding proposition Ui(I),,,., i(k) is open in V. The statement is obvious for k = 1 since if it fails T J would vanish on an open subset of V and thus f would be constant on V, contradicting the injectivity of f. Assume inductively that the statement for k holds; i.e .• Ui(I), "', i(k) is open in V and cl(U i(1), "', i(k»)::) V, Define

and notice that it is open in Ui(I), "', i(k) and thus in V. It is also dense in Ui(I), ... , i(k) (by the case k = 1) and hence in V by induction. Define the following subset of Ui(l), ''', i(k+l)

230


U'i(l), ""i(k+l) = {

X E

U'i(k+l)

I TxfC:lx~(I)) ....• Txf (axi~+I)) linearly indepenent}.

We prove that U'i(l), ",.i(k+I) is dense in U'i(k+I)' which then shows that c1(U i(I). "'. i(k+I) ~ V. If this were not the case. there would exist an open set we U'i(k+I) such that

for some CI-functions a l ..... ak nowhere zero on W. Let c:

1-E. E[ -+ W

be an integral

curve of the vector field I a k a a a --+"'+a - - + - - axl(l) axi(k) axi(k+I)' Then (f 0 c)'(t) = Tc(t/(c'(t» = D. so f 0 c is constant on )-£. E[ contradicting injectivity of f. • There is no analogous result for surjective maps known to us; an example of a surjective function lR. -+ lR. which has zero derivative on an open set is given in Figure 3.6.2. However surjectivityof f can be replaced by a topological condition which then yields a result similar to the one in Corollary 3.6.20. y

----~-+~--+----1_-----~

x

Figure 3.6.2 Corollary 3.6.21 Let f: M -+ N be a CI-map of manifolds where dim(N) = n. If f is an open map, then the set {x E M I f is a submersion at x} is dense in M. In particular, if dim(M) = m. then m ~ n. Proof It suffices to prove that if f is a CI-open map which is subimmersive at x. then it is submersive at x. This follows from the relation flU = j 0 s and the openness of f and s. for then j is necessarily open and hence a diffeomorphism by theorem 3.5.7(iii) . •


231

~ SUPPLEMENT 3.6A An Application of Sard's Theorem to Fluid Mechanics

The Navier-Stokes equations governing homogeneous incompressible flow in a region n in JR.3 for a velocity field u are

au

Tt+(u

.V)u-v~u=-Vp+f

in n

div u = 0

(Ia)

(Ib)

where u is parallel to an (so fluid does not escape) and u =

TmM and

z2

zl

E TF(p,rn)S and define w = T(rn,pl(v, uI) +

E T F(P,m)~.such that w = T rnFp(u 2) +

z2'

zl'

By (b), there exist liz

Subtract these two relations to get

Chapter 3 Manifolds and Vector Burulles

234

i.e., T(m,pl(v, u l -ll:l) = z2 - zi

E

T F(m,p)S and therefore (c) holds. •

There are many other very useful theorems about genericity of transversal intersection. We refer the reader to Golubitsky and Guillemin [1974], Hirsch [1976], and Kahn [1985) for the finite dimensional results and to Abraham [1963b) and Abraham-Robbin [1968) for the infinite dimensional case and the situation when P is a manifold of maps.

Exercises 3.6A Construct a C~ function f:]R ~]R whose set of critical points equals [0, 1). This shows that the set of regular points is not dense in general. 3.6B Construct a

~

function f:]R

~]R

which has each rational number as a critical value.

(Hint: Since Q is countable, write it as a sequence {qo I n = 0, I, ... }. Construct on the

closed interval [n, n+ 1) a

C~

function which is zero ncar n and n+ 1 and equal to qo on

an open interval. Define f to equal fo on [n, n+ 1).) 3.6C Show that if m < n there is no C I map of an open set of ]Rm onto an open set of ]R0. 3.6D A manifold M is called ck-simply connected, if it is connected and if every Ck map f: S I ~ M is Ck-homotopic to a constant, i.e., there exist a Ck-map H: )-£, 1 + £[ X SI ~ M such that for all s E S I, H(O, s) = f(s) and H(l, s) = m o' where mo E M. (i) Show that the sphere So, n ~ 2, is Ck-simply connected for any k ~ 1. (Hint: By Sard, there exists a point x E So \ f(S I). Then use the stereographic projection defined by x.) (ii) Show that So, n ~ 2, is CO-simply connected. (Hint: Approximate the continuous map g: SI ~ So by a CI-map f: SI ~ So. Show that one can choose f to be homotopic to g.) (iii) Show that SI is not simply connected. 3.6E Let M and N be submanifolds of ]R0. Show that the set {x E ]R0 I M intersects N + x transversally} is dense in ]R0. Find an example when it is not open. 3.6F Let f:]R° ~]R be C2 and consider for each a E ]R0 the map fa(x) = f(x) + a • x. Prove that the set {a E ]R0 I the matrix [a2fa(xo)/axiihi) is nonsingular for every critical point Xo of fa} is a dense set in ]R0 which is a countable intersection of open sets. (Hint: Use Supplement 3.68; when is the map (a, x) 1-+ Vf(x) + a transversal to (O}?)


235

3.6G Let M be a C2 manifold and f: M -t lR a C2 map. A critical point mo of f is called non-degenerate, if in a local chart (U, 0,

Chapter 4 Vector Fields and Dynamical Systems

242

and a Cu

C~

mapping F: Vo x I ~ E, where I is the open interval ] - a, a[, such that the curve

: I ~ E, defined by cu(t)

=F(u, t)

is a curve at u E E satisfying the differential equations c'u(t)

= X(cu(t» for all tEl.

4.1.6 Lemma Let E be a Banach space, veE an open set, and X : veE

map; i.e. there is a constant K > 0 such that

II X(x) -

X(y)

II ~

=

~

E a Lipschitz

II x - y II for all x, y E U. Let E III x - Xo II ~ b} lies in V, and

K

Xo E V and suppose the closed ball of radius b, Bb(x O) {x E II X(x) II ~ M for all x E Bb(xO)' Let to E IR and let a. = min(1/K, bIM), Then there is a unique C 1 curve x(t), t E [to - A., to + a.] such that x(t) E Bb(x O) and

{

X'(t) = X(x(t» x(to) = Xo .

Proof The conditions x'(t) = X(x(t», x(to) = Xo are equivalent to the integral equation

x(t) = Xo +

f

X(x(s» ds .

10

Put xo(t) = Xo and define inductively

Clearly xn(t) E Bb(x O) for aU n and t E [to - a., to + a.] by definition of a.. We also find by induction that

Thus xn(t) converges uniformly to a continuous curve x(t). Clearly x(t) satisfies the integral equation and thus is the solution we sought. For uniqueness, let y(t) be another solution. By induction we find that MK n It - to In+1/(n + I)! ; thus,letting n ~

00

II xn(t) -

y(t)

II ~

gives x(t) = y(t). T

The same argument holds if X depends explicitly on t and/or on a parameter p, is jointly continuous in (t, p, x), and is Lipschitz in x uniformly in t and p. Since xn(t) is continuous in (x o' to' p) so is x(t), being a uniform limit of continuous functions; thus the integral curve is jointly continuous in (x o' to, p). 4.1.7 Gronwall's Inequality Let f, g: [a, b[ ~ IR be continuous and nannegative. Suppose

that for all t satisfying a ~

t ~

b,

243

Section 4.1 Vector Fields and Flows

f(t) ::; A + f f(s)g(s) ds, for a constant A

~ O.

Then f(t) ::; A exp(f g(s) dS) for all t E [a, b[ .

Proof

First suppose A > O. Let h(t)

=

A + f f(s)g(s) ds;

thus h(t) > O. Then h'(t) = f(t)g(l)::; h(t)g(t). Thus h'(t)/h(t) ::; get). Integration gives

h(t) ::; A exp(f g(s) dS) .

This gives the result for A> O. If A = 0, then we get the result by replacing A by E > 0 for every E > 0; thus h and hence f is zero. T

4.1.8 Lemma Let X be as in Lemma 4.1.6. Let Ft(x o) denote the solution (= integral curve) of x'(t) = X(x(t», x(O) = xo' Then there is a neighborhood V of Xo and a number E> 0 such that for every y t E [-E, E],

E

V there is a unique integral curve x(t) = Ft(y) satisfying x'(t) = X(x(t» for all

and x(O) = y. Moreover,

Proof Choose V = B b/2 (x O) and E = min(1/K, b/2M). Fix an arbitrary y Bb(x o) and hence II X(z)

II ::; M for all z E

E

V. Then B b!2(Y) c

Bb1iY). By lemma 4.1.5 with Xo replaced by y, b

by b/2, and to by 0, there exists an integral curve x(t) of x'(t) = X(x(t»

for t E [-E, E] and

satisfying x(O) = y. This proves the first part. For the second, let f(t) = II Ft(x) - Ft(y) II. Clearly f(t)

=

III:

[X(Fs(x» - X(Fs(y»] ds + x - y

II ::; IIx -

yll + K

I:

f(s) ds ,

so the result follows from Gronwall's inequality. T This result shows that Ft(x) depends in a continuous, indeed Lipschitz, manner on the initial condition x and is jointly continuous in (t, x). Again, the same result holds if X depends explicitly on t and on a parameter p is jointly continuous in (t, p, x), and is Lipschitz in x uniformly in t and p; (Ft,l)P(x) is the unique integral curve x(1) satisfying x'(t) = X(x(t), t, p) and x(A.)

= x.

By the remark following lemma 4.1.6, (Ft)P(x) is jointly continuous in the

variables (A., t, p, x), and is Lipschitz in x, uniformly in CA., t, pl. The next result shows that

244


Ft is Ck if X is, and completes the proof of 4.1.5. For the next lemma, recall that a Cl-function is locally Lipschitz. 4.1.9 Lemma Let X in Lemma 4.1.6 be of class Ck, 1$ k $ 00, and let Ft(x) be defined as before. Then locally in (t, x), Ft(x) is of class Ck in x and is Ck+1 in the t-variable.

Proof We define 'I'(t, x)

E

L(E, E), the set of continuous linear maps of E to E, to be the

solution of the "linearized" or "first variation" equations: d dt 'I'(t, x) = DX(Ft(x»

where DX(y) : E DX(Ft(x»

0

~

0

'I'(t, x), with '1'(0, x)

= identity,

E is the derivative of X taken at the point y. Since the vector field 'I' f-+

'I' on L(E, E) (depending explicitly on t and on the parameter x) is Lipschitz in '1',

uniformly in (t, x) in a neighborhood of every (to' xo)' by the remark following 4.1.8 it follows that 'I'(t, x) is continuous in (t, x) (using the norm topology on L(E, E». We claim that DFt(x)

= 'I'(t, x).

To show this, fix t, set 9(s, h)

= Fs(x + h)

- Fs(x), and

write 9(t, h) - 'I'(t, x) . h =

=

S: {X(Fs(x + h» - X(Fs(x»} ds - S: [DX(Fs(x)

0

'I'(s, x)] . h ds

S: DX(Fs(x»' [9(s, h) - 'I'(s, x) . h] ds +

S: {X(Fs(x + h» - X(Fs(x» - DX(Fs(x»

Since X is of class C 1, given I: > 0, there is a/» term is dominated in norm by

. [Fs(x + h) - Fs(x)]} ds .

0 such that II h II < /) implies the second

s: 1:11 Fs(x + h) - Fs(x) II ds , which is, in tum, smaller than AI: II h II for a positive constant A by lemma 4.1.8. By Gronwall's inequality we obtain II 9(t, h) - 'I'(t, x) . h II :5 (constant) I: II h II. It follows that DFt(x)· h = 'I'(t, x) . h. Thus both partial derivatives of Ft(x) exist and are continuous; therefore Ft(x) is of class C 1• We prove Ft(x) is Ck by induction on k. Begin with the equation defining Ft :

so


245

and

Since the right-hand sides are Ck -

I,

so are the solutions by induction. Thus F itself is

Ck . • Again there is an analogous result for the evolution operator (Ft,A)P(x) for a time-dependent vector field X(x, t, p), which depends on extra parameters p in a Banach space P. If X is Ck , then (Ft.,,)P(x) is Ck in all variables and is Ck+1 in t and A.. The variable p can be easily dealt with by suspending X to a new vector field obtained by appending the trivial differential equation

p'

=0;

this defines a vector field on Ex P and 4.1.5 may be applied to it. The flow on E x P

is just F/x, p)

= (Fl(x), p).

For another more "modem" proof of 4.1.5 see Supplement 4.1C. That alternative proof has a technical advantage: it works easily for other types of differentiability assumptions on X or on Ft, such as HOlder or Soholev differentiability; this result is due to Ebin and Marsden [1970] . The mapping F gives a locally unique integral curve

Cu

for each u

E

U o' and for each t E

I, F t = F I (U o x {t}) maps U o to some other set. It is convenient to think of each point u being allowed to "flow for time t" along the integral curve Cu (see Fig. 4.1.2 and our opening motivation). This is a picture of a U o "flowing," and the system (Uo' a, F) is a local flow of X, or flow box. The analogous situation on a manifold is given by the following.

Figure 4.1.2 4.1.10 Definition Let M be a manifold and X a

cr

vector field on M, r ~ 1. Aflow box of

X at m E M is a triple (U o' a, F), where (i) U o C M is open, m E U o' and a E lR., where a> 0 or a = + 00; (ii) F: U o x Ia ~ M is of class Cr, where Ia = j-a, a[;

(iii) for each u

E

U o'

Cu :

Ia ~ M defined by cu(t) = F(u, t) is an integral curve of X at


246

the point u; (iv)

if Ft : Uo ~ Ft is a

M is defined by F/u) = F(u, t), then for tEla' Ft(U O) is open, and

cr diffeomorphism onto its image.

Before proving the existence of a flow box, it is convenient first to establish the following, which concerns uniqueness. 4.1.11 Global Uniqueness Suppose c l and c2 are two integral curves of X at m E M. Then c l = c2 on the intersection of their domains.

Proof This does not follow at once from 4.1.5 for c l and c2 may lie in different charts. (Indeed. if the manifold is not Hausdorff, Exercise 4.1M shows that this proposition is false.) Suppose c I

n

: II ~ M and c2 : 12 ~ M. Let I = II 12, and let K = {t E I I cl(t) = c2(t)}; K is closed since M is Hausdorff. We will now show that K is open. From 4.1.5, K contains some neighborhood of O. For t E K consider CIt and c 2t. where des) = c(t + s). Then CIt and c2t are integral curves at c,(t) = c 2(t). Again, by 4.1.5 they agree on some neighborhood of O. Thus some neigborhood of t lies in K, and so K is open. Since I is connnected, K = I. • 4.1.12 Proposition Suppose (Uo' a, F) is a triple satisfying (i), (ii), and (iii) of 4.1.10. Thenfor t, sand t + s E la' Ft+, = Ft 0 F, = F, 0 Ft and Fo is the identity map, whenever the compositions above are defined. Moreover, if U t = Ft(U o) and Ut n Uo 'I- 0, then Ft I U_ t Uo : U_t Uo ~ Uo Ut is a diffeomorphism and its inverse is F_t I Uo

n

n

n

n

U t•

Proof Ft+S 0, < 0, or t E JR., respectively.

n

Let 'f+(m) (resp. l(m» denote the sup (resp. inj) of the times of existence of the integral curves through m; 'f+(m) resp. l(m) is called the positive (negative) lifetime of m. Thus, X is complete iff each integral curve can be extended so that its domain becomes = 00 and l(m) =- 0 0 for all m E M.

)-00, 00[; i.e., 'f+(m)

4.1.16 Examples A For M

=JR.2,

let X be the constant vector field, whose principal part is (0, 1). Then

X is complete since the integral curve of X through (x, y) is t B On M

=JR.2\ {OJ.

1-+

(x, y + t).

the same vector field is not complete since the inegral curve of X

through (0, -1) cannot be extended beyond t = 1; in fact as t ~ 1 this integral curve tends to the point (0,0). Thus 'f+(O, -1)

= I,

while 1(0, -1)

=

-00.

C On JR. consider the vector field X(x) = 1 + x2. This is not complete since the integral curve c with c(O) = 0 is c(a) = tan a and thus it cannot be continuously extended beyond -n12 and n12; i.e., T ±(O) = ± n!2 .• 4.1.17 Proposition Let M be a manifold and X E Xr(M), r ~ 1. Then (i) 'iJx:J M x {O}; (ii) 'iJx is open in M x JR.; (iii) there is a unique Cr mapping Fx: 'iJx ~ M such that the mapping t an integral curve at m for all m E M; (iv) for (m, t)

E

'iJx ' (Fx(m, t), s) E 'iJx iff (m, t + s) Fx(m, t + s) =Fx(Fx(m, t), s).

E

1-+

Fx(m, t) is

'iJx ; in this case


249

Proof. Parts (i) and (ii) follow from the flow box existence theorem. In (iii), we get a unique map Fx : 'Dx

~

M by the global uniqueness and local existence of integral curves: (m, t) E 'Dx when

the integral curve m(s) through m exists for s E [0, tl. We set Fx(m, t) = m(t). To show Fx is

cr,

note that in a neighborhood of a fixed mo and for small t, it is

cr

by local smoothness.

To show Fx is globally C r , first note that (iv) holds by global uniqueness.

Then in a

neighborhood of the compact set (m(s) I s E [0, tl} we can write Fx as a composition of finitely many

cr

maps by taking short enough time steps so the local flows are smooth. _

4.1.18 Definition Let M be a m1lnifold and X E Xr(M), r ~ 1. Then the m1lpping Fx is called

the integral of X, and the curve t

1-+

Fx(m, t) is called the maximal integral curve of X at m.

In case X is complete, Fx is called the flow of Thus, if

X

x.

is complete with flow F, then the set (Ft

ItE

lR.)

is a group of

diffeomorphisms on M, sometimes called a one-parameter group of diffeomorphisms. Since Fn

= (Fj)n (the n-th power), the notation P is sometimes convenient and is used where we use Fe 0 Fs = F t+ s wherever it is defined. Note that F/m) is The reader should write out similar definitions for the

For incomplete flows, (iv) says that F t defined for t E IT-(m), T+(m)[.

time-dependent case and note that the lifetimes depend on the starting time to. 4.1.19 Proposition Let X be Cr , where r

~

1. Let c(t) be a m1lximal integral curve of X

such that for every finite open interval la, b[ in the dOm1lin Ij(c(O», T+(c(O»[ of c, c(la, b[) lies in a compact subset of M. Then c is defined for all t

E

R

Proof It suffices to show that a E I, bEl, where I is the inteval of definition of c. Let Tn E la, b[. tn ~ b. By compactness we can assume some subsequence c(tn(k) converges, say, to a point x in M. Since the domain of the flow is open, it contains a neighborhood of (x, 0). Thus, there are e > 0 and

't

> 0 such that integral curves starting at points (such as c(tn(k)) for large

k) closer than e to x persist for a time longer than 'to This serves to extend c to a time greater than b, so bEl since c is maximal. Similarly, a E I. _ The support of a vector field X defined on a manifold M is defined to be the closure of the set (m E M I X(m) ;I'oO}. (4.1.20 Corollary. A Cr vector field with compact support on a manifold M is complete. In

particular, a Cr vector field on a compact m1lnifold is complete. Completeness corresponds to well-defined dynamics persisting eternally. In some circumstances (shock waves in fluids and solids, singularities in general relativity, etc.) one has to live with incompleteness or overcome it in some other way. Because of its importance we give two additional criteria. In the first result we use the notation X[f] = df . X for the derivative of f in

250

Chapter 4 Vector Fklds and Dynamical Systems

the direction X. Here f: E

~

lR and df stands for the derivative map. In standard coordinates

on lR n , n

af af) ~ iaf df(x) = ( - I ' ... , -n and X[f] = LJ X - i . ax ax i= 1 ax 4.1.21 Proposition Suppose X is a C< vector field on the Banach space E, k ~ I, and f: E ~

if {xn} is any sequence in E such that f(x n) ~ a, then there is a convergent subseqence (xn(i)}' Suppose there are constants K, L ~ 0 such that

lR is a C l proper map; that is,

I X[f](m) I ~ K I f(m) I + L for all

m E E.

Then the flow of X is complete. Proof From the chain rule we have (a/at)f(Ft(m»

=X[f](Ft(m»,

so that

f(Ft(m» - f(m) =

1: X[f](F~(m»

d't .

Applying the hypothesis and Gronwall's inequality we see that I f(Ft(m» I is bounded and hence relatively compact on any fmite t-interval, so as f is proper, a repetition of the argument in the proof of 4.1.19 applies . • Note that the same result holds if we replace "properness" by "inverse images of compacl sets are bounded" and assume X has a uniform existence time on each bounded set. This version is useful in some infmite dimensional examples. 4.1.22 Proposition Let E be a Banach space and X a Cr vector field on the Banach space E, r

~

1. Let cr be any integral curve of X. Assume II X(cr(t» II is bounded on finite t-intervals.

Then cr(t) exists for all t E R Proof. Suppose II X(cr(t» II < A for tEla, b[ and let tn

II

cr(~) - cr(~) II ~

tm II cr'(t)II dt

=

~

b. For tn < tm we have

It,,, II X(cr(t»11 dt ~ AI ~ - ~I .

Hence cr(tn) is a Cauchy sequence and therefore, converges. Now argue as in 4.1.19 .• 4.1.23

Examples A Let X be a Cr vector field, r ~ I, on the manifold M admitting aftrst integral, i.e. a function f: M ~ lR such that X[f] = O. If all level sets f -I(r), r E lR are compact, X is

251


complete. Indeed, each integral curve lies on a level set of f so that the result follows by 4.1.19. B Newton's equations for a moving particle of mass m in a potential field in JRO are given

by q(t).= -(l/m) V'V(q(t», for V: JRO ~ JR a smooth function. We shall prove that if there are constants a, b E JR, b ~ 0 such that (l/m)V(q) ~ a - bll q 11 2, then every solution exists for all time. To show this, rewrite the second order equations as a first order system -V'V(q) and note that the energy E(q, p) = (112m)

q= (l/m)p, p=

II p 112 + V(q) is a first integral. Thus, for any

solution (q(t), pet»~ we have ~ = E(q(t), pet»~ = E(q(O), p(O» ~ V(q(O». We can assume ~ > V(q(O», i.e. p(O) of. 0, for if pet) "" 0, then the conclusion is trivially satisifed; thus there exists a

to

for which p(t o) of. 0 and by time translation we can assume that to = O. Thus we have

II q(t) II .;:; II q(t) - q(O) II + II q(O) II';:; II q(O) II + J~ II q(s) II ds II q(O) II + J~ .;:; II q(O) II + J~

J1~ --k

V(q(S»] ds

.j 2(~ - a + b II q(s) 112)

ds

or in differential form :t

II q(t) II .;:;

J 2(~

- a+b

II q(t)1I 2)

whence

1

du

IIq(t) II

t

> -

IIq(O)1I

(I)

,J 2(~ -

a + bu 2)

Now let ret) be the solution of the differential equation d2r(t)

d

dt2

dr

-- = - -

2

(a - br let) = 2br(t) ,

which, as a second order equation with constant coefficients, has solutions for all time for any initial conditions. Choose reO)

= II q(O) II,

[r(O )]2

=2(~ -

a + bll q(O) 112)

and let ret) be the corresponding solution. Since

d (1. I ret) 2+ a - br(t)2)

dt

= 0 ,

it follows that (1/2) r(t)2 + a - br(t)2 = (112) r(0)2 + a - br(O)2 = [3, i.e.

252


t-l

whence

-

du

r(t)

IIq(O)1I

(2)

.J 2(~ - a + ~U2)

Comparing the two expressions (1) and (2) and taking into account that the integrand is > 0, it follows that for any finite time interval for which q(t) is defined, we have II q(t) II : 1. Let 9 z(s) = 9,(2 - s). so 9 2 is a function that is 1 if s < 1 and 0 if s > 3. Finally. let hex) = 9z(1I x II) . •

so 9 1 is a C~

The norm on a real Hilbert space is

C~

away from the origin. The order of differentiability

of the norms of some concrete Banach spaces is also known; see Bonic and Frampton [1966]. Yamamuro [1974]. and Supplement 5.5B. 4.2.14 Corollary Let M be a Cr manifold modeled on a C Banach space. If urn E Tm *M. then there is an f E .rr(M) such that df(m) = urn' Proof If M = E. so TmE ;: E. let f(x) = um(x). a linear function on E. Then df is constant and equals urn' The general case can be reduced to E using a local chart and a bump function as follows. Let
. g) + B(9l0. 9 1(g» + B(9 1(0.9 2 (g»

+ B(f. 9 1(9 2 (g») - B(92 (9 1(0). g) - B(9 1(0. 9 2 (g» - B(9l0. 9 1(g» - B(f. 9 2 (9 1(g))) = B([91' 92 l (0. g) + B(f. [91' 9 2 l (g» .•

Because of 4.2.16 the following definition can be given. 4.2.18 Definition Let M be a manifold modeled on a C~ Banach space and X. Y E *~(M).

Then [X. Yl is the unique vector field such that L[x. Yj = [Lx Lyl. This vector field is also denoted Lx Y and is called the Lie derivative of Y with respect to X. or the Jacobi-Lie bracket of X and Y. Even though this definition is useful for Hilbert manifolds (in particular for finite-dimensional manifolds). it excludes consideration of Cr vector fields on Banach manifolds modeled on nonsmooth Banach spaces. such as LP function spaces for p not even. We shall. however. establish an equivalent definition. which makes sense on any Banach manifold and works for cr vector fields. This alternative definition is based on the following result. 4.2.19 Lie Derivative Formula for Vector Fields

and let X have (local) flow Ft' Then

Let M be as in 4.2.18. X. Y E *(M).


278

(at those points where Ft is defined). Proof. If t = 0 this formula becomes

d -d t

It = 0 F *Y=LxY.

(4 )

t

Assuming (4) for the moment •

.! (F*Y) = dt t

I

I

d F*t+. Y = F*t-d d * * Y. -d F.Y=FtLx s ,=0 s .=0

Thus the formula in the theorem is equivalent to (4). which is proved in the following way. Both sides of (4) are clearly vector derivations. In view of 4.2.16. it suffices then to prove that both sides are equal when aeting on an aribtrary function f E C"'(M. F). Now

by 4.2.7(i). Using 4.2.10 and Leibniz' rule. this becomes X[Y[f]](m) - Y[X[f]](m) = [X. Y][f](m). _ Since the formula for

LxY

in Eq. (4) does not use the fact that the norm of E is

c~

away

from the origin. we can state the following definition of the Lie derivative on any Banach manifold

M. 4.2.20 Dynamic Definition of Jacobi-Lie bracket If X. Y

E

Xr(M). r ~ 1 and X hasflow

Ft' the Cr - 1 vector field Lx Y = [X. Y] on M defined by

d [X. Y] = dt

I

*Y)

t=O (Ft

is called the Lie derivative of Y with respect to X. or the Lie bracket of X and Y. From the point of view of this more general definition. 4.2.19 can be rephrased as follows. 4.2.21 Proposition Let X. Y

E

Xr(M). 1 :0::; r. Then

[X. Y] = Lx Y is the unique C r- 1 vector

field on M satisfying [X. Y][f] = X[Y[f]] - Y[X[f]]

Section 4.2 Vector Fields as Differential Operators

for all f

E

279

Cr+1(U, F), where U is open in M.

The derivation approach suggessts that if X, Y E Xr(M) then [X, Yj might only be Cr- 2 , since [X, Yj maps Cr + 1 functions to

cr- 1 functions, and differentiates them twice.

However

4.2.20 (and the coordinate expression (6) below) show that [X, Yj is in fact C r- 1• 4.2.22 Proposition The bracket [X, Yj on X(M), together with the real vector space structure X(M), form a Lie algebra. That is, (i) [, j is lR bilinear;

(ii) [X, Xj = 0 for all X

E

X(M);

(iii) [X, [Y, Zll + [Y, [Z, Xll + [Z, [X, Yjj = 0 for all X, Y, Z

E

X(M) (Jacobi identity).

The proof is straightforward, applying the brackets in question to an arbitrary function. Unlike X(M), the space Xr(M) is not a Lie algebra since [X, Yj Xr(M). (i) and (ii) imply that [X, Yj = -[Y, Xl. since [X

E

Xr-l(M) for

X, Y

E

+ Y, X + Yj = 0 = [X, Xj + [X, Yj +

[Y, Xj + [Y, Yj. We can describe (iii) by writing Lx as a Lie bracket derivation: Lx[Y' Zj = [ Lx Y, Zj + [Y, LxZj.

Strictly speaking we should be careful using the same symbol Lx for both definitions of Lxf and Lx Y. However, the meaning is generally clear from the context. The analog of 4.2.7 on the vector field level is the following. 4.2.23 Proposition (i)

Let
0 on M, and the series defining f is uniformly convergent, being majorized by L.n", n-2. If we can show that 13 can be taken inside

L

the sum, then 13(f) = O. This construction then produces f E ker 13, f> 0 and hence 1 = (lff)f E ker 13; i.e., ker 13 = .r(M), contradicting the hypothesis 13 '# O. To show that 13

can be taken inside the series, it suffices to prove the following

"g-estimate:" for any g E 1(M), II3(g)1 :::; sup{ I gem) II mE M}. Once this is done, then

as n

~

00

by uniform convergence and boundedness of all functions involved. Thus 13(f) =

L.n", l3(anXnfn 2). To prove the g-estimate, let A > sup {I gem) II m EM} so that A ± g '# 0 on M; i.e., A ± g are both invertible functions on M. Since 13 is an algebra homomorphism, 0 I3(A ± g) = A ± l3(g). Thus ± l3(g)

'#

'#

A for all A > sup {I gem) II m EM}. Hence we get the

estimate II3(g) I:::; sup{ Ig(m) II mE M}. T Remark For infinite-dimensional manifolds, the proof of the lemma is almost identical, with the following changes: we work with 13:

C~(M,

F)

~

F, absolute values are replaced by norms,

second countability is in the hypothesis, and the neighborhoods V m are chosen in such a way that fm I V m is a bounded function (which is possible by continuity of fm ) .• Proof of existence in 4.2.36 For each m E M, define the algebra homomorphism 13m : .r(M) lR. by

I3m (f) =

a(f)(m). Since a is invertible, a(1)

'#

~

0 and since a(1) = a(l2) = a(1) a(l),

we have a(1) = 1. Thus 13 m '# 0 for all mE M. By 4.2.37 there exists a unique point, which we call s and

lim Ft(V) = {m}, t---40+«>

i.e. Jar any neighborhood U of m. there is a T such that Ft(V)

C

U if t

~

T. (See

Fig. 4.3.1(b).) It is obvious that asymptotic stability implies stability. The harmonic oscillator x = -x giving a flow in the plane shows that stability need not imply asymptotic stability (Fig. 4.3.1(c». 4.3.4 Liapunov's Stability Criterion Suppose X is C 1 and m is a critical point of X.

Assume the spectrum of X'(m) is strictly in the left half plane. (In finite dimensions, the characteristic exponents of m have negative real parts.) Then m is asymptOlically stable. (In a similar way. if Re(l1i) > O. m is asymptotically unstable, i.e., asymptotically stable as t ~ -00.)

300

Chapter 4 Vector Fields and Dynamicol Systems

y

x

(b) Asymptotically stable

(a) Stable

(c) Harmonic oscillator

Figure 4.3.1 The proof we give requires some spectral theory that we shall now review. For the finite dimensional case, consult the exercises. This proof in fact can be adapted to work for many partial differential equations (see Marsden and Hughes [1983, Chapters 6, 7, and page 483 D. Let T: E

~

E be a bounded linear operator on a Banach space E and let o(T) denote its

{A Eel T - AI is not invertible on the complexification of E}. Then A E o(T), I A I ~ II T II. Let r(T) denote its spectral radius, defined by r(T) = sup { I A II A E o(T)} .

spectrum; i.e., o(T) =

o(T) is non-empty, is compact, and for

4.3.5 Spectral radius formula. r(T) = lim n-+oo

II ~ lI"n

The proof is analogous to the formula for the radius of convergence of a power series and can be supplied without difficulty; cf. Rudin [1973, p. 355]. The following lemma is also not difficult and is proved in Rudin [1973] and Dunford and Schwartz [1963].

4.3.6 Spectral Mapping Theorem. Let f(z) =

L 3nzn n=O

be an entire junction and define

Then o(f(T»

=f(o(T».

4.3.7 Lemma Let T : E

~

E be a bounded linear operator on a Banach space E. Let r be any

number greater than r(T), the spectral radius of T. Then there is a norm I I on E equivalent to the original norm such that I T I

~

r.

Proof From the spectral radius formula, we get sUPn ~ o( II Tn II Ir") < 00, so if we define I x I =

301

Section 4.3 An Introduction to Dynamical Systems

SUPD 2: O( II T D(X) II /rD), then I I is a norm and II x II ~ I x I ~ (SUPD 2: o( II TOil IrD» II x II. Hence I T(x) I = SUPD 2: o( II TD+I(x) II IrD) = r SUPD 2: o( II p+l(x) II I~+I) ~ r I x I .• 4.3.8 Lemma Let A: E ~ E be a bounded operator on E and let r > o(A) (i.e., Re(A.) > r). Then there is an equivalent norm

I I on E such that for t ~ 0, I etA I

Proof Using Lemma 4.3.6, ert is ~ spectral radius of etA; i.e., e rt ~ limn-->

00

if

A.

E

o(A),

~ e't.

II eDtA III/D. Set

and proceed as in Lemma 4.3.7.• There is an analogous lemma if r < o(A), giving

I etA I

~ e rt •

4.3.9 Lemma Let T: E ~ E be a bounded linear operator. Let oCT) oCT)

C

C

{z I Re (z) < O} (resp.

{z I Re (z) > 0 D. Then the origin is an attracting (resp. repelling) fixed point for the flow

0.

The Liapunov function L s said to be strict, if(ii) is replaced by (ii') X[L]
s, the inequality L(x(t» - L(x(s» = f X[L] (X(A» dA

~-

af

II X(X(A» II < 0,

implies that L(x(s» - L(x(t»

~

af

II X(X(A» II dA

~

a Ilf X(X(A» dA II = all x(t) - x(s)

II •

which together with the continuity of A ...... L(X(A» shows that (x(tn)} is a Cauchy sequence.

+00 such that x(~) --? Xo E D[(O), D[(O) being =O. Since L(x(t» is a strictly decreasing function of t by (ii'). L(x(t» > L(xo) for all t > O. If Xo;f. O. let c(t) be the solution of X starting at xo' so that L(c(t» < L(x o)' again since t ...... L(x(t» is strictly decreasing. Thus. for any solution c(t) starting close to xo. L(c(t» < L(x o) by continuity of L. Now take 7:(0) = x(~) for n large to get the contradiction L(x(tn + t» < L(xo)' Therefore Xo = 0 is the only limit Thus. in all three cases, there is a sequence tn

--?

given in the proof of 4.3.11. We shall prove that Xo


304

point of {x(t) I t ~ O} if x(O)

E

V', i.e., 0 is asymptotically stable. _

The method of Theorem 4.3.12 can be used to detect the instability of equilibrium solutions. 4.3.13 Theorem Let m be an equilibrium point of X

E

Xr(M), r ~ 1. Assume there is a

continuous function L: V ~ M defined in a neighborhood of V of m, which is differentiable on V\ {m}. and satisfyies L(m) = 0, X[L] ~ a > 0 (respectively ~ a < 0) on V\ {m}. [fthere exists a sequence mk ~ m such that L(mk ) > 0 (respectively < 0), then m is unstable. Proof. We need to show that there is a neighborhood W of m such that for any neighborhood V of m, V C V, there is a point mv whose integral curve leaves W. Since m is an equilibrium, by Corollary 4.1.25, there is a neighborhood WI C V of m such that each integral curve starting in WI exists for time at least l/a. Let W = {m E WI I L(m) < 1/2}. We can assume M

=E,

and m

=O.

Let cn(t) denote the integral curve of X with initial condition

~ E

W. Then

so that

i.e. cn(l/a) Ii!' W. Thus all integral curves starting at the points mn E W leave W after time at most l/a. Since mn ~ 0, the origin is unstable .• Note that if M is finite dimensional, the condition X[L]

~

a > 0 can be replaced by the

condition X[L] > 0; this follows, as usual, by local compactness of M. 4.3.14 Examples A The vector field X(x, y) = (-y - x5)(afi)x) + (x - 2y3)(a/ay) E X(lR2) has the origin as an isolated equilibrium. The characteristic exponents of X at (0, 0) are ± i and so Liapunov's Stability Criterion 4.3.4 does not give any information regarding the stability of the origin. If we suspect that (0, 0) is asymptotically stable, we can try searching for a Liapunov function of the form L(x, y) =ax2 + by2, so we need to determine the coefficients a, b, 0 in such a way that

*'

X[L] < O. We have X[L]

=2ax(-y -

x5 ) + 2by(x - 2y3)

=2xy(b -

a) - 2ax 6 - 4by4,

*'

so that choosing a = b = I, we get X[L] =-2(x6 + 2y4) which is strictly negative if (x, y) (0, 0). Thus the origin is asymptotically stable by Theorem 4.3.12. B Consider the vector field X(x, y) =(-y + x5 )(afi)x) + (x + 2y3)(a/ay) with the origin as an isolated critical point and characteristic exponents ± i. Again 4.3.4 cannot be applied, so that we search for a function L(x, y) = ax 2 + by2, a, b 0 in such a way that X[L] has a definite

*'


305

sign. As above we get X[L] = 2ax(-y + xs) + 2by(x + 2y3) = 2xy(b - a) + 2ax 6 + 4by 4,

so that choosing a = b = 1, it follows that X[L]

= 2(x6 + y4)

> 0 if (x, y) *- (0. 0). Thus by

Theorem 4.3.13, the origin is unstable. These two examples show that if the spectrum of X lies on the imaginary axis, the stability nature of the equilibrium is determined by the nonlinear terms.

C Consider Newton's equations in ]R3,

it = v, v= -(I/m)V'V(q)

q =-(1/m)V'V(q)

written as a first order system

and so define a vector field X on]R3 x ]R3. Let (qo' Yo) be an

. equilibrium of this system, so that Vo = 0 and V'V(qo) = O. In Example 4.1.23B we saw that the total energy E(q, v)

= (l/2)m II v 112 + V(q)

is conserved, so we try to use E to construct a

Liapunov function L. Since L(qo' 0) = 0, define

1 2 L(q, v) = E(q,v)-E(qo,O) = Imllvll +V(q)-V(qo),

which satisfies X[L]

=0

by conservation of energy. If V(q) > V(qo) for q *- qo' then L is a

Liapunov function. Thus we have proved Lagrange's Stability Theorem: an equilibrium point (qo' 0) of Newton's eqUfJtions for a particle of mass m, moving under the influence of a potential

V, which has a local absolute minimum at qo' is stable. D Let E be a Banach space and L: E ~]R be C2 in a neighborhood of

o. If DL(O) =0

and there is a constant c > 0 such that D2L(0)(e, e) > c II e 112 for all e, then 0 lies in a potential well for L (i.e. (ii) and (iii) of 4.3.10 hold). Indeed, by Taylor's Theorem 2.4.15,

Thus, if 0 > 0 is such that for all

II h II < 0, 10(h2) 1 ~ cll h 112/4, then L(h) - L(O) > cll h 112/4,

i.e. inf ll h II = e[L(h) - L(O)] ? cE/4 for £ < O.• In many basic infinite dimensional examples, some technical sharpening of the preceding ideas is necessary for them to be applicable. We refer therader to LaSalle [1976], Marsden and Hughes [1983, §6.6], Hale, Magalhaes, and Oliva [1984] and Holm, Marsden, Ratiu, and Weinstein [1985] for more information. Next we turn to cases where the equilibirum need not be stable. A critical point is called hyperbolic or elementary if none of its characteristic exponents has zero real part. A generalization of Liapunov's theorem called the Hartman-Grobman theorem shows that near a hyperbolic critical point the flow looks like that of its linearization. (See Hartman [1973, Ch.9] and Nelson [1979, Ch. 3]. for proofs and discussions.) In the plane, the possible hyperbolic flows near a critical point are summarized in the table below and shown in Fig. 4.3.2. (Remember that for real systems, the characteristic exponents occur in conjugate pairs.)

306


Eigenvalues

Real Jordan Form

Name

Phase portrait in a coordinate system (yl, y2) adapted to the Jordan form (Figure 4.3.2)

Al < A2 < 0 Al

= A2 < 0

Al

=~ < 0

Al

= a + ib, a < 0

~

= a-ib,

Al = ib, A2 b;toO

saddle

(a)

stable focus

(b)

stable node

(c)

[AI 0] 1 A2

stable improper node

(d)

[: ~b]

stable spiral sink

(e)

center

(f)

[AI 0] o A2

Al < 0 < A2

b;toO

=-

ib,

[~

-;]

Table 4.3.1

In cases 1 to 5, all arrows in the phase portraits are reversed and "stable" is replaced by "unstable," if the signs of AI'~' are changed. In the original coordinate system (x I ,x2) all phase portraits in Figure 4.3.2 are rotated and sheared.

(al

(el (dl

Figure 4.3.2


307

To understand the higher dimensional case, a little more spectral theory is required.

°

4.3.15 Lemma Suppose o(T) = 1 U 02 where d(0l' (2) > O. Then there are unique T-invariant subspaces E\ and Ez such that E = E\ EB E 2. o(Ti) = 0i' where Ti = T I Ei ; Ei is called the generalized eigenspace of 0i' The basic idea of the proof is this: let 'Yj be a closed curve with OJ in its interior and Ok' k +'= j, in its exterior; then

Note that the eigenspace of an eigenvalue A is not always the same as the generalized eigenspace of A. In the finite dimensional case, the generalized eigenspace of T is the subspace corresponding to all the Jordan blocks containing A in the Jordan canonical form.

4.3.16 Lemma Let T, 01' and 02 be as in Lemma 4.3.15; assume d(exp(o\), exp(02» > O. Thenfor the operator exp(tT), the generalized eigenspace of exp(tT;> is Ei. Proof Write. according to Lemma 4.3.15. E

_

-

=EI EB Ez.

Thus

[~ tnTn L.. ,e\. n=O

n.

From this the result follows easily .• Now let us discuss the generic nonlinear case; i.e. let m be a hyperbolic equilibrium of the vector field X and let Ft be its flow. Define the inset of m by In(m) = (m' E M I Ft(m') -t m as t -t + oo} and similarly, the outset is defined by Out(m) = (m' E M I Ft(m') -t m as t -t + -oo}.

In the case of a linear system,

x= Ax,

where A has no eigenvalue on the imaginary axis (so the

origin is a hyperbolic critical point), In(O) is the generalized eigenspace of the eigenvalues with

308


negative real parts, while Out(O) is the generalized eigenspace corresponding to the eigenvalues with positive real parts. Clearly, these are complementray subspaces. The dimension of the linear subspace In(O) is called the stability index of the critical point. The Hartman-Grobman linearization theorem states that the phase portrait of X near m is topologically conjugate to the phase portrait of the linear system = Ax, near the origin, where A = X'(m), the linearized vector field at m. This means there is a homeomorphism of the two domains, preserving the oriented trajectories of the respective flows. Thus in this nonlinear hyperbolic case, the inset and

x

CO submanifolds. Another important theorem of dynamical systems theory, the stable manifold theorem (Smale [1967]) says that in addition, these are smooth, injectively immersed submanifolds, intersecting transversally at the critical point m. See Fig. 4.3.3 for an illustration showing part of the inset and outset near the critical point.

outset are

It follows from these important results that there are 'up to topological conjugacy) only a few essentially different phase portraits, near hyperbolic critical points. These are classified by the dimension of their insets, called the stability index, which is denoted by S(X, m) for m an equilibrium. as in the linear case.

OUI(m o )

\

Figure 4.3.3 The word index comes up in this context with another meaning. If M is finite dimensional and m is a critical point of a vector field X. the topological index of m is + 1 if the number of eigenvalues (counting multiplicities) with negative real part is even and is -1 if it is odd. Let this index be denoted I(X, m). so that I(X, m) = (-l)s(X, m). The Poincare-HopI index theorem states that if M is compact and X only has (isolated) hyperbolic critical points, then

L

I(X. m) = X(M)

m is a critical

point of X


309

where X(M) is the Euler-Poincare characteristic of M. For isolated nonhyperbolic critical points the index is also defined but requires degree theory for its definition--a kind of generalized winding number; see Section 7.5 or Guillemin and Pollack [1974, p. 133]. We now illustrate these basic concepts about critical points with some classical applications.

4.3.17 Examples A The simple pendulum with linear damping is defined by the second-order equation

x+ c it + k sin x = 0

(c > 0).

This is equivalent to the following dynamical system whose phase portrait is shown in Fig. 4.3.4:

v= - cv - k sin x.

x=v,

The stable focus at the origin represents the motionless, hanging pendulum. The saddle point at (k1t, 0) corresponds to the motionless bob, balanced at the top of its swing.

Figure 4.3.4 B Another classical equation models the buckling column (see Stoker [1950, ch. 3, §IO]):

Or equivalently, the planar dynamical system 3

. cv a,x a3 x x=v, v = - - - - - - m m m

310


with the phase portrait shown in Fig. 4.3.5. This has two stable foci on the horizontal axis, denoted m) and m2, corresponding to the column buckling (due to a heavy weight on the top) to either side. The saddle at the origin corresponds to the unstable equilibrium of the straight, unbuckled column.

Figure 4.3.5

Figure 4.3.6 Note that in this phase portrait, some initial conditions tend toward one stable focus, while some tend toward the other. The two tendencies are divided by the curve, In(O, 0), the inset of


311

the saddle at the origin. This is called the separatrix, as it separates the domain into the two disjoint open sets, In(m o) and In(m 1). The stable foci are caIled attractors, and their insets are caIled their basins. See Fig. 4.3.6 for the special case + it - x + x 3 = O. This is a special

x

case of a general theory, which is increasingly important in dynamical systems applications. The attractors are regarded as the principal features of the phase portrait; the size of their basins measures the probability of observing the attractor, and the separatrices help find them .• Another basic ingredient in the qualitative theory is the notion of a closed orbit, also called a limit cycle ). 4.3.18 Definition An orbit )'(t) for a vector field X is called closed when y(t) is not afixed

point and there is a 't > 0 such that )'(t + 't) = )'(t) for all t. The inset of y, In(y), is the set of points m E M such that Ft(m) ~ y as t ~ +00 (i.e., the distance between Ft(m) and the (compact) set {y (t) I 0 ~ t ~ 't} tends to zero as t ~ 00.) Likewise, the outset, Out(y), is the set of points tending to y as t ~ -00. 4.3.19 Example One of the earliest occurrences of an attractive closed orbit in an important application is found in Baron Rayleigh's modelfor the violin string (see Rayleigh [1887, vol. 1, §68aj),

or equivalently,

with the phase portrait shown in Fig. 4.3.7.

Figure 4.3.7

312


This has an unstable focus at the origin. with an attractive closed orbit around it. That is. the closed orbit y is a limiting set for every point in its basin (or inset) In(y). which is an open set of the domain. In fact the entire plane (excepting the origin) comprises the basin of this closed orbit. Thus every trajectory tends asymptotically to the limit cycle y and winds around closcr and closer to it. Meanwhile this closed orbit is a periodic function of time. in the sense of Definition 4.3.6. Thus the eventual (asymptotic) behavior of every trajectory (other than the unstable constant trajectory at the origin) is periodic; it is an oscillation. This picture thus models the sustained oscillation of the violin string. under the influence of the moving bow. Related systems occur in electrical engineering under the name van der Pol equation. (See Hirsch and Smale [1974]. Chapter 10 for a discussion) .• We now proceed toward the analog of Liapunov's theorem for the stability of closed orbits. To do this we need to introduce Poincare maps and characteristic multipliers.

4.3.20 Definition Let X be a

cr

vector field on a manifold M, r ~ 1. A local transversal

section of X at m E M is a submanifold SCM of codimension one with m E S and for all s E S. X(s) is not contained in T.S.

Let X be a cr vector field on a manifold M with cr flow F: 1>x C M x lR 4 M, Y a closed orbit of X with period 't, and S a local transversal section of X at m E y. A Poincare map of y is a cr mapping 8: Wo 4 WI where: PM1 W 0' WI C S are open neighborhoods of m E S, and 8 is a Cr diffeomorphism; PM2 there is a cr function 15: Wo 4 lR such that for all SEW0' (S, 't - 15(s» E

1>x, PM3

and 8(s) =F(s, 't - 15(s»; and finally,

if t E ]0, 't - 15(s)[, then F(s, l) e W 0 (see Fig. 4.3.8).

Figure 4.3.8


313

4.3.21 Existence and Uniqueness of Poincare Maps (i) If X is a Cr vector field on M, and y is a closed orbit of X, then there exists a Poincare map of y. (ii) If 8 : W0 ~ WI is a Poincare map of y (in a local transversal section S at m E y) and 8' also (in S' at m' E y), then 8 and 8' are locally conjugate. That is, there are open neighborhoods W2 of m E S, W2' of m' E S', and a C'-difJeomorphism H : W2 ~ W2', such that W2 c Wo n WI' diagram commutes:

W' 2

Wz' CWo' n WI'

and the following

.. S'

*

Proof (i) At any point mE y we have X(m) 0, so there exists a flow box chart (U, cp) at m with cp(U) = V x I C lRo- 1 X lR. Then S = cp-I(V x (O}) is a local transversal section at m. If F: 'Dx C M x lR ~ M is the integral of X, 'Dx is open, so we may suppose U x [-'t, 'tJ C 'iJx' where 't is the period of y. As F~(m) = m E M and F~ is a homeomorphism, Uo = F~-I(U)

n

U is an open neighborhood of mE M with FiUo) C U. Let Wo = S

W2 = F~(Wo). Then W2 is a local transversal section at mE M and diffeomorphism (see Fig. 4.3.9).

s

F~:

Wo

n Uo and

~

W2 is a

u

lJo

Figure 4.3.9 If U2 = F~(Uo)' then we may regard Uo and U2 as open submanifolds of the vector

bundle V x lR (by identification using cp) and then F~: Uo ~ U2 is a cr diffeomorphism mapping fibers into fibers, as cp identifies orbits with fibers, and F~ preserves orbits. Thus W2

314


is a section of an open subbundle. More precisely, if 1t: V x I --+ V and p : V x I --+ I are the projection maps, then the composite mapping is

cr and

has a tangent that is an isomorphism at each point, and so by the inverse mapping theorem, it is a

cr

diffeomorphism onto an open submanifold. Let WI be its image, and 8

the composite

mapping. We now show that 8: Wo --+ WI is a Poincare map. Obviously PM 1 is satisfied. For PM 2, we identify U and V x I by means of q> to simplify notalions. Then 1t: W 2 --+ WI is a diffeomorphism, and its inverse (1t I W 2>-1

:

WI --+ W 2

C

WI

X

lR is a section corresponding

to a smooth function 0': WI --+ JR. In fact, 0' is defined implicitly by

or p 0 F~(wo» = 0' 0 1tF~(wo)' Now let Ii: Wo --+ lR be given by Wo 1-+ 0' 0 F t (w o) which is cr. Then we have

Finally, PM 3 is obvious since (U, q» is a flow box. (ii) The proof is burdensome because of the notational complexity in the defmition of local

conjugacy, so we will be satisfied to prove uniqueness under additional simplifying hypotheses that lead to global conjugacy (identified by italics). The general case will be left to the reader. We consider first the special case m assume sUS'

C

=m '.

Choose a flow box chart (U, q» at m, and

U, and that S and S' intersect each orbit arc in U at most once, and that

they intersect exactly the same sets of orbits. (These three conditions may always be obtained by shrinking S and S'.) Then let W 2 = S, W 2' = S', and H: W 2 --+ W2' the bijection given by the orbits in U. As in (i), this is seen to be a

cr

diffeomorphism, and H 08= 8' 0 H.

Finally, suppose m *- m'. Then Fa(m) = m' for some a E ]0, 't[, and as 'Dx is open there is a neighborhood U of m such that U x {a} C 'Dx. Then Fa(U n S) = s" is a local transversal section of X at m' E Y, and H = Fa effects a conjugacy between 8 and 8" = Fa 0 80 Fa-Ion S". By the preceding paragraph, 8" and 8' are locally conjugate, but conjugacy is an equivalence relation. This completes the argument. _ If Y is a closed orbit of X E ~(M) and m E 'Y, the behavior of nearby orbits is given by a

Poincare map 8 on a local transversal section S at m. Clearly Tm8 E L(TmS, TmS) is a linear approximation to 8 at m. By uniqueness of 8 up to local conjugacy, T m,8' is similar to Tm8, for any other Poincare map 8' on a local transversal section at m' E 'Y. Therefore, the spectrum of Tm8 is independent of me 'Y and the particular section S at m.


4.3.22 Definition

315

If y is a closed orbit of X E X(M). the characteristic multipliers of X at y

are the points in the spectrum of Tme. for any Poincare map

e

at any m

Another linear approximation to the flow near y is given by E y and 't is the period of y. Note that F/(X(m»

=X(m).

TmF~

E

y.

E L(TmM. TmM) if m

so TmF~ always has an eigenvalue

1 corresponding to the eigenvector X(m). The (n - 1) remaining eigenvalues (if dim(M)

=n)

are in fact the characteristic multipliers of X at y. 4.3.23 Proposition If y is a closed orbit of X E X(M) of period 't and cy is the set of

characteristic multipliers of X at y. then cyu {I} is the spectrum of

TmF~.

for any mE y.

Proof We can work in a chart modeled on E and assume m = O. Let V be the span of X(m) so E = T mM e V. Write the flow Ft(x. y) = (FtI(x. y). (Ft2(x. y». By definition. we have

Thus the matrix of TmF~ is of the form

where A

=DIF~2(m).

From this it follows that the spectrum of T mF~ is {I} U cy .

•

If the characteristic exponents of an equilibrium point lie (strictly) in the left half-plane. we

know from Liapunov's theorem that the equilibrium is stable. For closed orbits we introduce stability by means of the following definition. 4.3.24 Definition Let X be a vector field on a manifold M and y a closed orbit of X. An orbit Ft(mo) is said to wind toward y

if mo is + complete and for any local transversal section S to

X at mE y there is a teO) such that Ft(O)(mo) E S and successive applications of the Poincare map yield a sequence ofpoints that converges to m. If the closed orbit y has a neighborhood U

such that for any mo E U. the orbit through mo winds towards y. then y is called asymptotically stable. In other words. orbits starting "close to y. "converge" to y; see Fig. 4.3.10.

4.3.25 Proposition If y is an asymptotically stable periodic orbit of the vector field X and mo to> 0 Such that for all t ~ to. Ft(m) E v.

e U. the neighborhood given in 4.3.24, then for any neighborhood Vof y. there exists

316


Figure 4.3.10 Proof Derme mk = ek(IIlo), where 8 is a Poincare map for a local transversal section at m to 'Y containing mo' Let t(n) be the "return time" of n, i.e., t(n) is defined by Ft(o)(n) E S. If 't denotes the period of 'Y and ~ = t(mk), then since mk -+ m, it follows that 'tk -+ 't since t(n) is a smooth function of n by 4.3.21. Let M be an upper bound for the set {I'tkll kEN}. By smoothness of the flow, F.(mk) -+ F.(m) as k -+ 00, uniformly in s E [0, MJ. Now write for any t > 0, Ft(m o) = FT(t)(mk(t», for T(t) E [0, MJ and observe that k(t) -+ 00 as t -+ 00. Therefore, if W is any neighborhood of FT(t)(m O) contained in V, since Ft(mo) = FT(t)(mk(t» converges to FT(t)(m) as t -+ 00 it follows that there exists to > 0 such that for all t ~ to, Ft(m o) Ewe V. • 4.3.26 Liapunov Stability Theorem for Closed Orbits Let 'Y be a closed orbit of X E 'Y lie strictly inside the unit circle. Then 'Y is asymptotically stable.

*(M) and let the characteristic multipliers of

The proof relies on the following lemma. 4.3.27 Lemma Let T: E -+ E be a bounded linear operator. Let a(T) lie strictly inside the unit circle. Then limn -+ ~ TOe =0 for all e E E. Proof By lemma 4.3.7 and compactness of a(T), there is a norm I I on E equivalent to the original norm on E such that I T I :;;; r < 1. Therefore I TOe I :;;; rD I e I -+ 0 as n -+ 00• • 4.3.28 Lemma Let f: S -+ S be a smooth map on a manifold S with f(s) = s for some s. Let the spectrum of Tl lie strictly inside the unit circle. Then there is a neighborhood U of s such that if s' E U, f(s') E U and fY'(s') -+ s as n -+ 00, where fY' = f 0 f 0 . . . 0 f (n times).


317

Proof We can assume that S is a Banach space E and that s = O. As above. renorm E and find such that 1 T 1 :0:;; r. where T = Df(O). Let E> 0 be such that r + E < 1. Choose a neighborhood V of 0 such that for all x E V

o< r < 1

1 f(x)

- Tx

1 :0:;;

E 1x 1

which is possible since f is smooth. Therefore. 1 f(x) 1:0:;; 1 Tx 1

+ E 1 x 1 :0:;; (r + E) 1 x I·

Now. choose 0 > 0 such that the ball U of radius 0 at 0 lies in V. Then the above inequality implies 1 f"(x) 1 :0:;; (r + E)n 1 x 1 for all x E U which shows that f"(x) E U and f"(x) --+ O. • Proof of 4.3.26 The previous lemma applied to f = 9. a Poincare map in a transversal slice S to y at m. implies that there is an open neighborhood V of m in S such that the orbit through every point of V winds toward y. Thus. the orbit through every point of U = {Ft(m) I t ~ O} ::::> y winds toward y. U is a neighborhood of y since by the Sraightening Out Theorem 4.1.14. each point of y has a neighborhood contained in {Ft(9(U» It> -E. E> O} .• 4.3.29 Definition If X E *(M) and y is a closed orbit of X. y is called hyperbolic if none of the characteristic multipliers of X at y has modulus 1. Hyperbolic closed orbits are isolated (see Abraham-Robbin [1967. Ch. 5]). The local qualitative behavior near an hyperbolic closed orbit. y. may be visualized with the aid of the Poincare map. 9: Woe S --+ WI C S. as shown in Fig. 4.3.8. The qualitiative behavior of this map. under iterations. determines the asymptotic behavior of the trajectories near y. Let m E y be the base point of the section. and s E S. Then In(y). the inset of y. intersects S in the inset of m under the iterations of e. That is. s E In(y) if the trajectory Ft(s) winds towards y. and this is equivalent to saying that eI«s) tends to m as k --+ + 00. The inset and outset of m E S are classified by linear algebra. as there is an analogue of the linearization theorem for maps at hyperbolic critical points. The linearization theorem/or maps says that there is a CO coordinate chart on S. in which the local representative of 9 is a linear map. Recall that in the hyperbolic case. the spectrum of this linear isomorphism avoids the unit circle. The eigenvalues inside the unit circle determine the generalized eigenspace of contraction i.e.• the inset of m E S under the iterates of 9. The eigenvalues outside the unit circle similarly determine the outset of m E S. Although this argument provides only local CO submanifolds. the global stable manifold theorem improves this: the inset and outset of a fixed point of a diffeomorphism are smooth. injectively immersed submanifolds meeting transversally at m. Returning to closed orbits. the inset and outset of y c M may be visualized by choosing a section Sm at every point m E y. The inset and outset of y in M intersect each section Sm in submanifolds of Sm' meeting transversally at m E y. In fact. In(y) is a cylinder over y. that is.

318


a bundle of injectively immersed disks. So, likewise, is Out(y). And these two cylinders intersect transversally in y, as shown in Fig. 4.3.11. These bundles need not be trivial.

Figure 4.3.11 Another argument is sometimes used to study the inset and outset of a closed orbit, in place of the Poincare section technique described before, and is originally due to Smale [1967]. The flow F t leaves the closed orbit invariant. A special coordinate chart may be found in a neighborhood of y. The neighborhood is a disk bundle over y, and the flow Ft, is a bundle map. On each fiber, F t is a linear map of the form ZteRt, where Zt is a constant, and R is a linear map. Thus, if s =(m, x) is a point in the chart, the local representative of F t is given by the expression

called the Floquet normal/orm. This is the linearization theorem/or closed orbits. A related result, the Floquet theorem, eliminates the dependence of Zt on t, by making a further (time-dependent) change of coordinates (see Hartman [1973, ch. 4, §6], or Abraham and Robbin [1967]). Finally, linear algebra applied to the linear map R in the exponent of the Floquet normal form, establishes the CO structure of the inset and outset of y. To get an overall picture of a dynamical system in which all critical elements (critical points and closed orbits) are hyperbolic, we try to draw or visualize the insets and outsets of each. Those with open insets are attractors, and their open insets are their basins. The domain is divided into basins by the separatrices, which includes the insets of all the nonattractive (saddle-type) critical elements (and possible other, more complicated limit sets, called chaotic attractors, not described here.) We conclude with an example of sufficient complexity, which has been at the center of dynamical system theory for over a century.


319

4.3.30 Example The simple pendulum equation may be "simplified" by approximating sin x by two tenns of its MacLaurin expansion. The resulting system is a model for a nonlinear spring with

linear damping. X

. = v. v = - cv - kx

+"3k x3.

Adding a periodic forcing tenn. we have

X=

. k 3 v. v = - cv - kx + "3 x + F cos rot.

This time-dependent system in the plane is transfonned into an autonomous system in a solid ring by adding an angular variable proportional to the time. 9 = rot. Thus. x=v,

. V

=-

k 3 . cv - kx + "3 x + F cos 9. 9 = roo

Although this was introduced by Baron Rayleigh to study the resonance of tuning forks. piano strings. and so on. in his classic book Theory a/Sound [1877]. this system is generally named the

Duffing equation after Duffing who obtained the first important results (see Duffing [1908] and Stoker [1950]). Depending on the values of the three parameters (c. k. F) various phase portraits are obtained. One of these is shown in Fig. 4.3.12. adapted from the experiments of Hayashi [1964]. There are three closed orbits: two attracting. one of saddle type. The inset of the saddle is a cylinder topologically. but the whole cylinder revolves around the saddle-type closed orbit. This cylinder is the separatrix between the two basins. For other parameter values the dynamics can be chaotic (see for example. Holmes [1979a.b] and Ueda [1980]).

Figure 4.3.12


320

For further information on dynamical systems, see Abraham and Shaw [1985] and Guckenheimer and Holmes [1983].

Exercises 4.3A

(i)

Let E --+ M be a vector bundle and m E M an element of the zero section. Show

(ii)

If ~ : M --+ E is a section of E, and ~(m) =0, define ~'(m) : TmM --+ Em to be

(iii)

the projection of Tm~ to Em' Write out ~'(m) relative to coordinates. Show that if X is a vector field, then X'(m) defmed this way coincides with

that TmE is isomorphic to TmM E9 Em in a natural, chart independent way.

Definition 4.3.1. 4.38 Prove that the equation 9 + 2k 9 - q sin 9

9

=0 (q > 0, k > 0)

has a saddle point at 9

=0,

= O.

4.3C Consider the differential equations

r= ar3 - br,

{) = 1 using polar coordinates in the plane.

(i)

Determine those a, b for which this system has an attractive periodic orbit.

(ii)

Calculate the eigenvalues of this system at the origin for various a, b.

4.3D Let X E X(M), cp: M --+ N be a diffeomorphism, and Y =CP.X. Show that (i) m E M is a critical point of X iff cp(m) is a critical point of Y and the characteristic exponents are the same for each; (ii) y c: M is a closed orbit of X iff cp(y) is a closed orbit of Y and their characteristic multipliers are the same. 4.3E The energy for a symmetric heavy top is 2

1 2 2 2·2 P\j1 ft H(9, po) = - - 2 - {I'lv (b - cos 9) + AI sm 9} + -J- + Mg~cos 9 21 sin 9 where I, J > 0, b, P\j1' and Mgt> 0 are constants. The dynamics of the top is described by the differential equations {) = aH/dpo' Po = -aHJi)9. (i) Show that 9 = 0, Po = 0 is a saddle point if 0 < P\j1 < 2(MgtI)112 (a slow top). (ii) Verify that cos 9 = 1 - Ysech2«~y)1I2/2), where y =2 - b2~ and ~ =2Mgt/I describe both the outset and inset of this saddle point. (This is called a homoclinic orbit. ) (iii) Is 9 = 0, Po = 0 stable if P\j1> (MgtI)l12? (Hint: Use the fact that H is constant along the trajectories.)

321


4.3F Let A

E

L(lRn. JR.n) and suppose that a < Re(l..i ) < b. for all eigenvalues Ai' i = 1•...• n

«. »

of A. Show that JRn admits an inner product that a III x 1112 S

with associated norm

III· III

such

«Ax. x» S b III x IIf .

Prove this by following the outline below. (i)

If A is diagonalizable over C. then find a basis in JRn in which the matrix of A has either the entries on the diagonal. the real eigenvalues of A. or 2 x 2 blocks of the form

Choose the inner product

«.» on JRn such that the one and two-dimensional

invariant subspaces of A defined by this block-matrix are mutually orthogonal; pick the standard JR2-basis in the two-dimensional spaces. (ii)

If A is not diagonalizable. then pass to the real Jordan form. There are two kinds of Jordan k j x kj blocks: I..j 0 0 1 I..j 0 0 1 I..j

0 0 0

0 if I..j

E

0 0 0 I..j

JR. or

0

0

0

~j

0

0 12

~j

0 0

o.

~j

0

12

0

12 ~j

0 where ~. J

= [ a·J -bj].

bj

aj

12

=[~ ~J

.

if I..j = aj + ibj• aj• bj E JR. bj '" O. For the first kind of block. choose the basis e l ' ..... '1,;(j)' of eigenvectors of the diagonal part D. Then. for £ > 0 small. put e/ = e/fer - I • r = 1..... k(j) and define ( • )t on the subspace span{e l t ..... '1,;(j)t}

C

JRn to be the Euclidean inner product given by this basis. Compute the

marix of A in this basis and show that

322

Chapter 4 Vector Fkids and Dynamical Systems

( Ax. x)£ Dx . x ..;....-......::. -+ - - 2 - ' ( x. x)£ II x II Conclude that for

E

as E -+ O.

small the statement holds for the first kind of block. Do the

same for the second kind of block. 4.3G Let A

E

L(]RR.]RR). Show that the following are equivalent.

(i)

All eigenvalues of A have strictly negative real part (the origin is called a sink in

(ii)

For any norm I· I on ]RR. there exist constants k > 0 and E> 0 such that for all t ~ o. I etA I ~ ke-t£ .

(iii)

There is a norm III· ilion ]RR and a constant 15 > 0 such that for all t ~ o. III etA III ~ e-t&. (Hint: (ii) => (i) by using the real Jordan form: if every solution

this case).

of

x= Ax

tends to zero as t -+ + 00. then every eigenvalue of A has strictly

negative real part. For (i) => (iii) use Exercise 4.3E and observe that if x(t) is a

i = Ax. then we have

solution of

(d/dt) III x(t) III =

that we get the following inequality: at

~

«x(t). Ax(t) » / III x(t) III. so

log III x(t) III/log III x(O) III

a = min{Re (Ai) I i = 1•...• n} • and b = max{Re (Ai) -E =

I

~

bt. where

i = 1•...• n}. Then let

b.) Prove a similar theorem if all eigenvalues of A have strictly positive real

part; the origin is then called a source. 4.3H Give a proof of Theorem 4.3.4 in the finite dimensional case without using the variation of constants formula (Exercise 4.1E) and using Exercise 4.3F. (Hint: If A = X'(O) locally show Iimx -+ 0

«X(x) - Ax. x » / III x 1112 = O.

Since

«Ax. x» ~

-E

III x 111 2 ,

E

= max Re

{Ail i = 1, ... , n, Ai eigenvalues of A}, there exists 15 > 0 such that if III x III ~ 15, then

X(x), x» ~ - C III x

111 2 ,

«

for some C > O. Show that if x(t) is a solution curve in the

closed &-ball. t E [0. TJ. then dill x(t) 111/ dt ~ - CIII x(t) III. Conclude III x(t) III ~ 15 for all t E [0. TJ and thus by compactness of the 15-ball. x(t) exists for all t ~ O. Finally. show that III x(t) III ~ e-tE III x(O) III.) 4.31 An equilibrium point m of a vector field X

E

X(M) is called a sink, if there is a 15 > 0

such that all points in the spectrum of X'(m) have real part < -15. (i)

Show that in a neighborhood of a sink there is no other equilibrium of X.

(ii)

If M =]RR and X is a linear vector field. Exercise 4.3G shows that provided Iimt-+~

m(t) = 0 for every integral curve m(t) of X. then the eigenvalues of X

have all strictly negative real part. Show that this statement is false for general vector fields by finding an example of a non-linear vector field X on ]RR whose integral curves tend to zero as t -+ 00. but is such that X'(O) has at least one eigenvalue with zero real part. (Hint: See Exercise 4.3C). 4.3J Hyperbolic Flows An operator A E L(JRR. ]RR) is called hyperbolic ifno eigenvalue of A has zero real part. The linear flow x ~ etAx is then called a hyperbolicflow.


(i)

323

Let A be hyperbolic. Show that there is a direct sum decomposition jRD = E S EEl Ell, A(ES) C ES, A(Ell) C Ell, such that the origin is a sink on ES and a source on Ell; ES and Ell are called the stable and unstable subspaces of jRn. Show that the decomposition is unique. (Hint: ES is the sum of all subspaces defined by the real Jordan form for which the real part of the eigenvalues is negative. For uniqueness, if jRn = E's EEl E'll and vEE's, then v = x + y, X E E S, y E Ell with elAv ~

o

as t ~

00,

so that elAx ~ 0, elAy ~ 0 as t ~

source on E' ll,

II elAy II ~ etl: II y II for some

4.3G(ii).)

'* 0,

£

00.

But since the origin is a

> 0 by the analogue of Exercise

II elAx II ~

(ii)

Show that A is hyperbolic iff for each x

(iii)

Conclude that hyperbolic flows have no periodic orbits.

00

as t ~ ± 00.

4.3K Gradient flows (this continues Exercise 4.tH) Let f: jRD ~ jR be C I and let X = (af/ax I, 00.' af/ax D). Show that at regular points of f, the integral curves of X cross the level surfaces of f orthogonally and that every singular point of f is an equilibrium of X. Show that isolated maxima of f are asymptotically stable. (Hint: If Xo is the isolated maximum of f, then f(x o) - f(x) is a strict Liapunov function.) Draw the level sets of f and the integral curves of X on the same diagram in jR2, when f is defined by f(x I, x2) = (xl _ 1)2 + (x2 _ 2)2(x2 _ 3)2. 4.3L Consider

u+ u+ u3 = O. Show that solutions converge to zero like c/l't as H(u, u) = (u + u)2 + u 3 + u 4. (See Ball and Carr [1976]

considering

t~

00

by

for more

information.) 4.3M Use the method of Liapunov functions to study the stability of the origin for the following vector fields:

(iii)

X(x, y) = -(3y + x3)(a;ax) + (2x - 5y3) (a/ay) (asymptotically stable); X(x, y) = -xy4(a;ax) + x6y (a/ay) (stable; look for L of the form x4 + ay6); X(x, y) = (xy - x3 + y)(a/ax) + (x 4 - x2y + x3)(a/ay) (stable; look for L of the

(iv)

X(x, y) = (y + x7 )(a/ax) + (y9 - x) (a/ay) (unstable);

(v)

X(x, y, z) = 3y(z + l)(a;ax) + x(z + l)(a;ay) + yz (a/az) (stable); X(x, y, z) = (-x 5 + 5x6 + 2y3 + xz2 + xyz)(a/ax) + (-y - 2z + 3x6 + 4yz + xz +

(i) (ii)

form ax 4 + by2);

(vi)

xy2)(a/ay) + (2y - z - 2x 8 - y2 + xz2 + xy3)(a/az) L(x, y, z) = (l/2)x 2 + 5(y2 + z2».

(asymptotically stable; use

4.3N Consider the following vector field on jRD+I; X(s, x) = (asN + f(s) + g(s, x), Ax + F(x) + h(s, x» where s E jR, X E jRn, f(O) = 00. = r 0,

324

Chapter 4 Vector Fwlds and Dynamical Systems

then the origin is unstable; if N is odd and a < O. the origin is asymptotically stable. (Hint: for N even. use L(s. x) = s - a II x 112/2 and for N odd. L(s. x) = (s2 - a II x 112)/2; show that in both cases the sign of X[L] near the origin is given by the sign of a.) ~ L(E. E) a continuous map. Let Ft.s denote the evolution operator of the time--dependent vector field X(t. x) = A(t)x on E.

4.10 Let E be a Banach space and A: JR

(i) (ii)

Show that Ft.s E GL(E). Show that II Ft.s 11 = identity. Show that the eigenvalues of M(t) are independent of t. (v)

(Floquet) Show that each real eigenvalue A. of M(to) determines a solution. denoted c(t; A. to) of = A(t)x satisfying c(t + T; A.. to) = AC(t; A.. to)' and also each complex eigenvalue A. =a + ib. of M(to)' where a. b E JR. determines a pair of solutions denoted cr(t; A. to) and ci(t; A.. to) satisfying

x

[

cr(t + T; A.. to)] c'(t + T; A. to)

=

[a -bJ [cr(t; A. to)]. b a

(Hint: Let c(t; A. to) denote the solution of

c'(t; A. to)

x= A(t)x

with the initial condition

c(to; A.. to> = e. where e is an eigenvector of M(to) corresponding to A; if A is complex. work on the complexification of E. Then show that c(t + T; A.. to) Ac(t; A. to) satisfies the same differential equation and its value at to is zero since (vi)

c(to + T; A.. to) = M(to) c(to; A. to) = Ac(to; A.. Lo).) (Floquet) Show that there is a nontrivial periodic solution of period T of = A(t)x if and only if 1 is an eigenvalue of M(to) for some to E R (Hint: If c(t)

x

is such a periodic solution. then c(to) = c(to+ T) = M(to)c(to).) (vii) (Liapunov) Let P: JR ~ GL(E) be a C 1 function which is periodic with period T. Show that the change of variable y = P(t)x transforms the equation = A(t)x into the equation = B(t)y. where B(t) = (P'(t) + P(t)A(t»p(t)-1 If N(t) is the monodromy operator of y = B(t)y. show that N(t) = P(t)M(t)p(t)-I.

y

4.1P

(i)

x

(Liapunov) Let E be a finite dimensional complex vector space and let A: JR ~


325

L(E, E) be a continuous function which is periodic with period T. Let M(s) be

x

the monodromy operator of the equation = A(t) x and let B E L(E, E) be such that M(s) = eTB (see exercise 4.10). Define pet) = etBF s•t and put yet) = P(t)x(t). Use (vii) in the previous exercise to show that yet) satisfies y = By. Prove that pet) is a periodic CI-function with period T. (Hint: Use (iii) of the previous exercise and eTB = M(s) = Fs+T. s)' Thus, for complex finite dimensional vector spaces, the equation = A(t)x, where A(t + T) = A(t), can be transformed via yet) =P(t)x(t) into the constant coefficient linear equation =By. Since the general solution of y = By is a linear combination of vectors

x

y

(ii)

exp(tA)t'(i)u i• where AI' ... , Am are the distinct eigenvalues of B, ui E E, and 1 ~ ~(i) ~ multiplicity of J.-.. conclude that the general solution of = A(t)x is a linear combination of vectors exp(tAi)t'(i) p(t)-I ui where pet + T) =pet). Show

x

that the eigenvalues of M(s) =eTB are exp(TAJ Show that Re(A) < 0 (> 0) for

x

all i = 1. ..., m if and only if all the solutions of = A(t)x converge to the origin as t ~ +00 (-00), i.e. if and only if the origin is asympotically stable (unstable).

~4.4

Frobenius' Theorem and Foliations

The main pillars supporting differential topology and calculus on manifolds are the Implicit Function Theorem, the Existence Theorem for ordinary differential equations, and Frobenius' Theorem, which we discuss briefly here. First some definitions. 4.4.1 Definition Let M be a manifold and let E

C

TM be a subbundle of its tangent bundle;

i.e., E is a distribution (or a plane field) on M. (i) We say E is involutive iffor any two vector fields X and Y defined on open sets of M and which take values in E, [X, Yj takes values in E as well.

(ii) We say E is integrable iffor any

mE

M there is a (local) submanifold N

C

M, called

a (local) integral manifold of E at m containing m, whose tangent bundle is exactly E restricted to N.

The situation is shown in Figure 4.4.1.

Figure 4.4.1 4.4.2 Examples A Any subbundle E of TM with one dimensional fibers is involutive; E is also integrable, which is seen in the following way. Using local bundle charts for TM at III E M with the subbundle property for E, find in an open neighborhood of m, and a vector field that never vanishes and has values in E. Its local integral curves through m have as their tangent bundles E restricted to these curves. If the vector field can be found globally and has no zeros, then through

Section 4.4 Frobenius' Theorem and Foliations

327

any point of the manifold there is exactly one maximal integral curve of the vector field. and this integral curve never reduces to a point. B Let f: M ~ N be a submersion and consider the bundle ker Tf C TM. This bundle is involutive since for any X. Y E ~(M) which take values in ker Tf. we have Tf([X. Y]) =0 by 4.2.25. The bundle is integrable since for any m E M the restriction of ker Tf to the submanifold f-I(f(m» coincides with the tangent bundle of this submanifold (see §3.5). C Let 'Jl"I be the n-dimensional torus. n ~ 2. Let 1 '5 k '5 n and consider E = {(v I' ...• vn) E TIn I Vk + 1= ... = vn = OJ. This distribution is involutive and integrable; the integral manifold through (tl' ...• tn) is 'lI'k X (~+ I' ...• tn). D E =TM is involutive and integrable; the integral submanifold through any point is M itself. E An example of a noninvolutive distribution is as follows. Let M =80(3). the rotation group (sce Exercise 3.5S). The tangent space at I = identity consists of the 3 x 3 skew symmetric matrices. Let

a two dimensional subspace. For Q E 80(3). let

Then E = U {EQ I Q E 80(3)} is a distribution but is not involutive. In fact. one computes that the two vector fields with p = 1. q = 0 and p = O. q = 1 have a bracket that does not lie in E. Further insight into this example is gained after one studies Lie groups (a supplementary chapter). F Let E be a distribution on M. Suppose that a collection 'E of smooth sections of E spans E in the sense that for each section X of E there are vector fields Xi' ...• Xk in 'E and smooth functions a l ••.•• ak on M such that X = aiXi. Suppose 'E is closed under bracketing; i.e .• if X and Y have values in 'E. so does [X. YJ. We claim that E is involutive. To prove this assertion. let X and Y be sections of E and write X = aiXi and Y = biYj • where ai and bi are smooth functions on M and X; and Yj belong to 'E. We calculate "

where and Thus [X. YJ is a section of E. so E is involutive. • Frobenius' Theorem asserts that the two conditions in 4.4.1 are equivalent.

Chapter 4 Vector Fklds and Dynamical Systems

328

4.4.3 The Local Frobenius Theorem A subbundle E of TM is involutive if and only if it is integrable.

Proof Suppose E is integrable. Let X and Y be sections of E and let N be a local integral manifold through m E M. At points of N, X and Y are tangent to N, so define restricted vector fields XI N, YIN on N. By 4.2.25 (on cp-relatedness of brackets) applied to the inclusion map, we have

Since N is a manifold, [XI N, YIN] is a vector field on N, so [X, Y] is tangent to N and hence in E. Conversely, suppose that E is involutive. By choosing a vector bundle chart, one is reduced to this local situation: E is a model space for the fibers of E, F is a complementary space, and U x V c: E x F is an open neighborhood of (0, 0), so U x V is a local model for M. We have a map f: U x V -+ L(E, F) such that the fiber of E over (x, y) is E(x.y)

= {(u,f(x,y),u)luE E) C:ExF,

and we can assume we are working near (0, 0) and f(O, 0) =O. Let us express involutivity of E in terms of f. For fixed u E E, let Xu(x, y) = (u, f(x, y) . u). Using the local formula for the Lie bracket (see formula (5) in §4.2) one finds that [XUl ' Xu21(x, y) = (0, Df(x, y) . (ul' f(x, y) . ul) . u2 (1)

- Df(x, y) . (u2' f(x, y) . u2) . ul) By the involution assumption, this lies in description of

E(x, y)

E(x. y)'

Since the first component vanishes, the local

above shows that the second must as well; i.e., we get the following identity:

Df(x, y) . (up f(x, y) . u 1) . u 2

= Df(x, y) . (u2' f(x, y) . ~) . u 1 .

(1')

Consider the time dependent vector fields Xt(x, y)

= (0, f(tx, y) . x)

and

~.

u(x, y)

= (u, f(tx, y) . tu)

,

so that by the local formula for the Jacobi-Lie bracket, [~,

Xt. u](x, y) = (0, tD 2f(tx, y) . (f(tx, y) . x) . u - tDf(tx, y) . (u, f(tx, y) . u) . x - f(tx, y) . u) (0, - tD1f(tx, y) . x . u - f(tx, y) . u) ,

(2)

Section 4.4 Frobenius' Theorem and Foliations

329

where the last equality follows from (1'). But a~. jat equals the negative of the right hand side of (2). i.e .•

which by 4.2.31 is equivalent to

(3) where F t =Ft. 0 and Ft. s is the evolution operator of the time dependent vector field Xt. Since ~(O.

0) = O. it follows that F t is defined for 0:5 t :5 1 by 4.1.25. Since Fo is the identity. relation (3) implies that

(4) Let N = F I (E x {O

n. a submanifold of E x F. the model space of M. If (x. y) = F I(e. 0).

the

tangent space at (x. y) to N equals {T(e.oll(u.O»luE E} = (T(e.O)FI(Xo.u(e.O»I UE E} (by (4»

(XI. u(FI(e. 0» I u E E}

=

{(u. f(x. y). u) I u E E}

=E(x.y)'

•

The method of using the time-one map of a time-dependent flow to provide the appropriatlcoordinate change is useful in a number of situations and is called the method of Lie transforms. An abstract version of this method is given in 5.4.7; we shall use this method again in Chapter 6 to prove the Poincare Lemma and in Chapter 8 to prove the Darboux Theorem. Note: The method of Lie transforms is also used in singularity theory and bifurcation theory (see Golubitsky and Schaeffer [1985]). For a proof of the Morse Lemma using this method. see 5.5.S. which is based on Palais [1969] and Golubitsky and Marsden [1983]. For a proof of the Frobenius Theorem from the Implicit Function Theorem using manifolds of maps in the spirit of Supplement 4.1 C. see Penot [1970]. See Exercise 4.4F for another proof of the Frobenius Theorem in finite dimensions. The Frobenius Theorem is intimately connected to the global concept of foliations. Roughly speaking. the integral manifolds N can be glued together to form a "nicely stacked" family of submanifolds filling out M (see Figure 4.4.1 or Example 4.4.2A). 4.4.4 Definition Let M be a manifold and ell = {La} a E A a partition of M into disjoint connected sets called leaves. The partition ell is called afoliotion if each point of M has a chart (U. cp). cp : U ~ U' x V' C E EEl F such thatfor each La the connected components (U n La)P

330


of V n La are given by cp«V n La)~) =V' x {ca~}' where ca~ E F are constants f or each a. E A and ~. Such charts are called foliated (or distinguished) by «1>. The dimension (respectively, codimension) of the foliation «I> is the dimension of E (resp., F). See Figure 4.4.2.

Figure 4.4.2 Note that each leaf La is a connected immersed submanifold. In general, this immersion is not an embedding; i.e., the induced topology on La from M does not necessarily coincide with the topology of La (the leaf La may accumulate on itself, for example). A differentiable structure on La is induced by the foliated charts in the following manner. If (V, cp), cp: V ~ V' x V' C E E£l F is a foliated chart on M, and X: E E£l F ~ F is the canonical projection, then X 0 cp restricted to (V n La)~ defines a chart on La. 4.4.5 Examples

A The trivial foliation of a connected manifold M has only one leaf, M itself. It has codimension zero. If M is finite dimensional, both M and the leaf have the same dimension. Conversely, on a finite-dimensional connected manifold M, a foliation of dimension equal to dim(M) is the trivial foliaton. B The discrete foliation of a manifold M is the only zero-dimensional foliation; its leaves are all points of M. If M is finite dimensional, the dimension of M is the codimension of this foliation. C A vector field X that never vanishes on M determines a foliation; its leaves are the maximal integral curves of the vector field X. The fact that this is a foliaton is the Straightening Out Theorem (see §4.1). D Let f: M ~ N be a submersion. It defines a foliation on M (of codimension equal to dim(N) if dim(N) is finite) by the collection of all connected components of f-I(n) when n varies throughout N. The fact that this is a foliation is given by Theorem 3.5.4. In particular, we see that E EE> F is foliated by the family {E x (fll f E F. E In the preceding example, let M = JR3, N = JR, and f(x l , x2, x3) = cp(r2)exp(x3), where cp: JR ~ R is a C"" function satisfying cp(O) = 0, cp(1) = 0, and cp '(s) < 0 for s> 0,

Section 4.4 Frobenius' Theorem and Foliations and where r2

= (xl)2 + (x2)2.

Since df(x I, x2, x3)

331

= exp(x 3)[2 = {La}a EA' Because E I (U n La)/J = T«U n La)IJ, we also have T(M, «1» = E. •

There is an important global topological condition that integrable subbundles must satisfy that was discovered by Bolt [1970]. The result, called the Bott Vanishing Theorem, can be found, along with related results, by readers with background in algebraic topology, in Lawson [1977]. The leaves of a foliation are characterized by the following property. 4.4.8 Proposition Let «1> be afoliation on M. Then x and yare in the same leaf if and only if x and y lie on the same integral curve of a vector field X defined on an open set in M and which is tangent to the foliation «1>. Proof Let X be a vector field on M with values in T(M, «1» and assume that x and y lie on the same integral curve of X. Let L denote the leaf of «1> containing x. Since X is tangent to the foliation, X I L is a vector field on L and thus any integral curve of X starting in L stays in L. Since y is on such an integral curve, it follows that y E L. Conversely, let x, y ELand let c(t) be a smooth non-intersecting curve in L such that c(O) = x, c(1) = y, c'(t) ~ O. (This can always be done on a connected manifold by showing that the set of points that can be so joined is open and closed.) Thus c : [0, 1] ~ L is an immersion, and hence by compactness of [0, 1], c is an embedding. Using definition 4.4.4, there is a neighborhood of the curve c in M which is diffeomorphic to a neighborhood of [0,1] x {OJ x {OJ in lR x F x G for Banach spaces F and G such that the leaves of the foliation have the local

334


representation R x F x {w}, for fixed

WE

G, and the image of the curve c has the local

representation [0,1] x {O} x {O}. Thus we can find a vector field X which is defined by c'(t) along c and extends off c to be constant in this local representation. _ Let R denote the following equivalence relation in a manifold M with a given folation : xRy if x, y belong to the same leaf of . The previous proposition shows that R is an open equivalence relation. It is of interest to know whether M/R is a manifold. Foliations for which R is a regular equivalence relation are called regular foliations. (See §3.5 for a discussion of regular equivalence relations.) The following is a useful criterion. 4.4.9 Proposition Let be afoliation on a manifold M and R the equivalence relation in M

Lm of M TmLm ffi Tm(M, -I

=(r-I)*.

Proof For (i) we examine representatives of (g 0 f)* and g* 0 f*. These representatives are the same in view of 5.1.6. Part (ii) is clear from the definition. and (iii) follows from (i) and (ii) by the same method as in 5.1.6.• We now specialize to the case where 7t: E ~ B is the tangent vector bundle of a manifold. 5.2.9 Definition Let M be a manifold and 'eM : TM ~ M its tangent bundle. We call T'.(M) = T'.(TM) the vector bundle ojtensors contravariant order r and covariant order s. or simply of type (r. s). We identify TIO(M) with TM and call ~I(M) the cotangent bundle of M also denoted by 'e* M : T*M ~ M. The zero section of Tr.(M) is identified with M. Recall that a section of a vector bundle assigns to each base point b a vector in the fiber over b and the addition and scalar multiplication of sections takes place within each fiber. In the case of T'.(M) these vectors are called tensors. The C'" sections of 7t : E ~ B were denoted r"'(7t). or f""(E). Recall that 1" (M) denotes the set of mappings from Minto JR. that are of class C'" (the

standard local manifold structure being used on JR.) together with its structure as a ring; namely. f + g. cf. fg for f. g E 1" (M). c E JR. are defined by (f + g)(x) = f(x) + g(x). (cf)(x) = c(f(x» and (fg)(x) = f(x) g(x). Finally. recall that a vector field on M is an element of X(M) = r"'(TM). 5.2.10 Definition A tensor field ojtype (r. s) on a manifold M is a C'" section of T'.(M). We denote by 'I\(M) the set r"'(T'.(M» together with its (infinite-dimensional) real vector space structure. A co vector field or differential one-jorm is an element of X*(M) = TJI(M). If f E 1{M) and t E T.(M). let ft: M ~ T'.<M) be defined by m ~ f(m)t(m). If Xi E X(M). i = 1..... s. a) E X*(M). j = 1..... r. and t' E 'Tr'.. (M) define

t(a l ..... ar. Xl ..... X.) : M ~ JR. by m 1-+ t(m)(al(m) ..... X.(m» and

t ® t' : M ~ T'+*(dy) = dy, we have q>*t = 3(x+ 2y)q>*(

;J

® q> *(dy) + q> *( ;y) ® q>*(dy)

o ® dy + = 3(x + 2y) ox

(0 0) 0 0 - 2 ox + oy ® dy = (3x + 6y - 2) ox ® dy + ay ® dy.

C Let q>: JR.3 ~ lR.2, q>(x, y, z) = (2x + z, xyz) and t = (u + 2v)du ® du + (u)2 du ® dv E

tzi!2 (JR.2). Since q>*(du) = 2 dx + dz and q>*(dv) = yz dx + xz dy + xy dz, we have q>*t = (2x + z + 2xyz)(2 dx + dz) ® (2 dx + dz) + (2x + z)2 (2 dx + dz) ® (yz dx + xz dy + xy dz)

= 2[4x + 2z + 4xyz + (2x + z)2 yz] dx ® dx + 2(2x + z)2

XZ

dx ® dy

+ 2[2x + z + 2xyz + (2x + z)2 xy] dx ® dz + [4x + 2z + 4xyz + yz(2x + z)2]dz ® dx + xz(2x + z)2 dz ® dy + [2x + z + 2xyz + xy(2x + z)2 ]dz ® dz. D If q> : M

~

N represents the deformation of an elastic body and g is a Riemannian

metric on N, then C =q>*g is called the Cauchy-Green tensor, in coordinates

Thus, C measures how q> deforms lengths and angles. • Finally, we describe an alternative approach to tensor fields. Suppose .'J(M) and X(M) have been defined. With the "scalar multiplication" (f, X) 1-+ fX defined in 5.2.10, X(M) becomes an .'J(M)-module. That is, X(M) is essentially a vector space over .'J(M) , but the "scalars" .'1(M) form only a commutative ring with identity, rather than a field. Define

Section 5.2 Tensor Bundles and Tensor Fields

357

the .1(M)-linear mappings on X(M). and similarly T'.(M) = U+sreM) (X*(M) • .... X(M); J-""(M) the J-""(M)-multilinear mappings. From 5.2.10. we have a natural mapping 'I\(M) -+ Ts(M) which is J-""(M)-linear.

5.2.20 Proposition Let M be afinite-dimensional manifold or be modeled on a Banach space with norm C~ away/rom the origin. Then 'Tr,(M) is isomorphic to the J-""(M)-multilinear maps/rom X*(M) x ... x X(M) into J-""(M). regarded as spaces. In particular. X*(M) is isomorphic to x*(M).

J-""(M)-modules or as real vector

Proof Consider the map Ts(M) -+ U+s reM) (X*(M) • .... X(M); J-""(M» given by ~(a,I, ... , a,'. Xi' ...• Xs)(m) = ~(m)(a,I(m), ...• Xs(m». This map is clearly J-""(M)-linear. To show it is an isomorphism. given such a multilinear map ~. define t by t(m)(a,I(m) •...• Xs(m» = ~(a,I •...• X,)(m). To show this is well-defined we must show that. for each Vo E Tm(M). there is an X E X(M) such that X(m) = vO' and similarly for dual vectors. Let (V.
002

n

•

.... X.) +

L teal' .... ar' X, ..... nx

k .....

X.).

k-'

is local. or is natural with respect to restrictions. That is,for U eye M open sets,

and IE T.CY)

Chapter 5 Tensors

360

(D t) I V = D (t I V)

E

'T',(V)

or thefollowing diagram commutes

Iu .. Ts(U)

7\(V)

jv

vj 'TTS(V)

IV

.. T,(V)

Note that we do not demand that D . be natural with respect to push-forward by diffeomorphisms. Indeed. several important differential operators. such as the covariant derivative. arc not natural with respect to diffeomorphisms. although the Lie derivative is. as we shall see. 5.3.2 Theorem Suppose for each open set V C M we have maps Eu: ![ (U) ~ ![(V) and 1"u: X(V) ~ X(V). which are (lR-linear) tensor derivations and natural with respect to restrictions. That is

(i) :Eu(f ® g) =(:Euf) ® g + f ® Eug for f. g E ![ (V); (ii) for f E ![ (M). :Eu(f I U) = (:EMf) I V; (iii) 1"u(f ® X) = (:Euf) ® X + f ® 1"uX for f E ![(V). and X E X(V);

(iv) for X E X(M). 1"uf(X I V)

=(1"MX) I U.

Then there is a unique differential operator D on 'T(M) that coincides with :Eu on ![ (V) and with 1"u on X(U).

Proof Since D must be a tensor derivation. define D on X*(U) by the formula (Da)' X = D(a· X) - a· (DX) = :Eu(a . X) - a· 1"uX for all X E X(V). By properties (i) and (iii). Da is ![(M)-linear and thus by the remark following 5.2.20. D so defined on X*(V) has values in X*(V). Note also that D(f ® a) = (:Ef) ® a + f ® (Da) for any a E X*(U). f E ![(U). This shows that D exists and is unique on X*(V) (by the Hahn-Banach theorem). Define Dt.; on 'TTS(V) by requiring 001 to hold: (Dut)(al' ...• aT' Xl' ...• Xs) = :Eu(t(al' ...• aT' Xl' ...• Xs» s

T

-L ~I

t(al •...• Daj •...• aT' XI ....• Xs) -

L

t(al •...• aT' XI' ...• 1"U X k'

...•

Xs)·

k=1

From (i). (iii). and 001 for Du on X*(U). it follows that Dut is an ![ (M)-multilinear map. i.c .. that Dut E 'T".(V) (see the comment following 5.2.20). The definition of Du on 'T",(U) uniquely determines Du from the property 001. Finally. if V is any open subset of U. by (ii) and (iv) it follows that Dv(t I V) =(Dut) I V. This enables us to define D on ![(M) by (Dt)(m) = (Dut)(m). where V is any open subset of M containing m. Since Du is unique. so is D.

Section 5.3 The Lie Derivative: Algebraic Approach

361

and so 002 is satisfied by the construction of :0 .• 5.3.3 Corollary We have (i) :O(t\ ® It. ) = :Ot\ ® t2 + t\ ®:Olt. ,and (ii) :00 = 0, where 0 is Kronecker's delta.

Proof (i) is a direct application of 001. For (ii) let a

E

3£*(U) and X E 3£(U) where U is an

arbitrary chart domain. Then (:Oo)(a, X) = :O(o(a, X» - o(:Oa, X) - o(a, :OX) =:O(a· X)-:Oa· X-a·:OX =0.

Again the Hahn-Banach theorem assures that :00 = 0 on U, and thus by 002, :00 = O. • Taking :Eu and :F'u to be Lx Iu we see that the hypotheses of theorem 5.3.2 are satisfied. Hence we can define a differential operator as follows. 5.3.4 Definition 1/ X E 3£(M), we let Lx be the unique differential operator on 'T(M), called the Lie derivative with respect to X, such that Lx coincides with Lx as given on .'T(M) and 3£(M) (see 4.2.6 and 4.2.20). 5.3.5 Proposition Let cp : M ~ N be a diffeomorphism and X a vector field on M. Then Lx is natural with respect to push-forward by cp; that is,

or the/ollowing diagram commutes: T ' , ( M ) - - - - - - - - - T',(N)

Lx!

r~'

'I\(M»----------'Tr,(N)

Proof For an open set U

C

M define

where we use the same symbol cp for cp I u. By naturality on .'T(U) and X(U), :0 coincides

Chapter 5 Tensors

362

with LxIU on !F (U) and Jt(U). Next, we show that D is a differential operator. For 001, we use the fact that *(t(al' .... <X.' Xl' ... , Xs» =(*t)(",al' ... , ",<X.. ",X I..... *Xs)' which follows from the definitions. Then for X. X\, ... , Xs E Jt(U) and al' .... a r

E

Jt"'(U).

r

+

L (",t)(*a l , ... , Lcp.x ",aj' ... , *a

r,

",X I ..... ",X.)

j= I

•

+

L (*t)(",al' ... , ",a

r,

",X I , ... , Lcp.x"'X k ,

... ,

",X s )].

k=1

by 001 for Lx. Since * = (-1)* by 5.2.14. thls becomes r

(Dt)(a l , ... , a r , XI' .... X.) +

L

(aI' .... Da j .... , a r , XI' ... , X.)

j= I

+

L•

t(al, ... ,a r , XI' .... DX k ,

....

X.).

k=1

For 002, let t E 'Tr'<M) and write Dt I U = [(",r l Lcp.x",tll U = (",rl[Lcp.x",tll U -I

=

("')

Lcp.xlu ",t I U

=

D(t I U).

(by 002 for Lx)

The result now follows by 5.3.2. • Using the same reasoning. a differential operator that is natural with respect to diffeomorphisms on functions and vector fields is natural on all tensors. Let us now compute the local formula for Lxt where t is a tensor field of type (r, s). Let : U C M ~ veE be a local chart and let X' and t' be the principal parts of the local representatives, *X and ",t respectively. Thus X': V ~ E and t ': V ~ T'.(E). Recall from §4.2 that the local formulas for the Lie derivatives of functions and vector fields are:

Section 5.3 The Lie Derivative: Algebraic Approach (Lxf)'(x) = Df(x) . X'(x)

363

(1)

where f' is the local representative of f and (LxY)'(x) = DY'(x) . X'(x) - DX'(x) . Y'(x).

(2)

In finite dimensions these become (1')

and (2')

Let us first find the local expression for Lxa where a is a one-form. By 5.3.5. the local representative of Lxa is

which we write as Lx,a' where X' and a' are the principal parts of the local representatives. so X': V ~ E and a': V ~ E*. Let VEE be fixed and regarded as a constant vector field. Then as Lx is a tensor derivation. Lx, (a' . v) = ( Lx' a') . v + a'(Lx.v)· By (1) and (2) this becomes D(a'· v) . X' = (Lx' a') . v - a' . (DX' . v). Thus (Lx'(x') . v = (Da' . X') . v + a' . (DX'·v).

In the expression (Dcx'· X')· v. Da'· X' means the derivative of a' in the direction X'; the resulting element of E* is then applied to v. Thus we can write Lx,a' = Da'· X' + a'· DX'.

(3)

In finite dimensions. the corresponding coordinate expression is

. aai

..

axj

·

(Lxa)· VI = _ . XJVI + a· _ . VI; I axJ J axl

i.e.•

(3')

Now let t be of type (r. s). so t': V ~ L (E ........ E .... E •...• E; JR). Let a l •...• a r be (constant) elements of E* and v I' ...• V5 (constant) elements of E. Then again by the derivation

Chapter 5 Tensors

364

property. Lx' [t'(a' •...• a'. vI' ...• v.)] = (Lx,t')(a' •...• a'. vI' ...• v.)

.

,

+

L t'(a' •...• Lx,a

i •...•

a'. v, •...• v.) +

i='

L t'(a' •...• a'. v, •...• Lx,vj' ...• v.). j='

Now using the local fonnula (1)-(3) for the Lie derivatives of functions. vector fields. and one-fonns. we get (:I)t'· X')· (a' •...• a'. vI' ...• v.) = (Lx,t')(a' •...• a'. vI' ...• v.)

.

, + Lt'(a' •...• a i i =,

DX' •...• cl.v, . .... v,)+ Lt'(a' •...• a', v" ... ,-DX'·vj' ... , vs ). j='

.

Therefore, (Lx,t')(a', ... , a'. vI' ...• v s)

=(:I)t'· X')(a', ... , a', vI' ... , vs)

,

-L

s

t'(a', ... , a i

.

DX' , ... , a" v" ...• v.) +

i='

L

t'(a', ... , a', v, •... , DX' . Vj' ... , v s)·

j='

In components. this reads

i .. .i (Lxt) I '. . h···J,

=

k a i .. .i axil li .. .i . . X - p '. . - - - t 2 '. . - (all upper mdlces) axk J•...J. axl h···J,

aXm

..

+ -ax. - t'I···" mh. ...J,. + (all lower indices) JI

(4)

We deduced the component fonnulas for Lxt in the case of a finite-dimensional manifold as corollaries of the general Banach manifold fonnulas. Because of their importance, we shall deduce them again in a different manner, without appealing to 5.3.5. Let _ il .. .i,.J

t - t

.....£...... ®

J

I···. ax'i

... ®

....£...... ® ax',

_jl ... ® dxj, E L rrf dxs(U),

where U is a chart domain on M. If X = xka/axk, the tensor derivation property can be used to compute Lxt. For this we recall that


365

and that

by the general formula for the bracket components. The formula for Lx(dxk) is found in the following way. The relation Oki = dxk(d/dx i ) implies by 001 that

Thus

k (dXk) Lx(dx)= --. dx'..

so

dX'

Now one simply applies 001 and collects terms to get the same local formula for Lxt found in (4). Note especially that L

~ (~).=o ax'

dx)

and

L~ ax'

(dxi) = 0, for all i,j.

5.3.6 Examples A Compute Lxt, where

Solution. Metfwd 1 Note that (i) Lxt = La/ilx + xa/ay t = La/ax t + La/ilyt (ii) La/ax t = La/ax { x

= La/ax (x Now note

-#y ® dx ® dy + y ~ ~ ® dx ® dy) +

® dy ® dY}

La/ax( Y ~ ® dy ® dy) = ;y ® dx ® dy + 0 .

Chapter 5 Tensors

366

a = 0, LxiJ/iJy ax a = - LiJ/iJx(xa a LxiJ/iJy ay dy) = - { l'a ay + x . 0} = - ay LxiJ/iJy dx

= 0,

and LxiJliJy dy

,

= dx.

thus (iii) LxiJliJy t

= LxiJ/iJy { x

~ ® dx ® dy + y ~ ® dy ® dY}

= (O+O+O+X~®dX®dX) + (x

~

® dy ® dy + 0 + y

~

® dx ® dy + Y ;y ® dy ® dx ).

Thus, substituting (ii) and (iii) into (i), we find a

a

a

Lxt = dy ® dx ® dy + x ay ® dx ® dx + x dy ® dy ® dy + y = (y + 1)

a

a

dy ® dx ® dy + x dY ® dx ® dx + x

a a ay ® dx ® dy + Y dy ® dy ® (

a a ay ® dy ® dy + Y ® dy ® dx.

dY

Method 2 Using component notation, t is a tensor of type (1,2) whose nonzero components are t2l2 = x and t 222 = y. The components of X are Xl = 1 and X2 = x. Thus, by the component formula (4),

The nonzero components are (Lx t)2 12 = 1 - 0 + y + 0 = 1 + y;

(Lxt)\2 = X - 0 + 0 + 0 = x;

(LXt)211 =O-O+O+x=x;

(Lx t)221 = 0 - 0 + 0 + y = y,

and hence a a a a Lxt = (y + 1) ay ® dx ® dy + x ® dx ® dx + x ay ® dy ® dy + y dy ® dy ® dx.

ay

The two methods thus give the same answer. It is useful to understand both methods since they both occur in the literature, and depending on the circumstances, one may be easier to apply than the other. B In Riemannin geometry, vector fields X satisfying Lxg = 0 are called Killing vector fields; their geometric significance will become clear in the next section. For now, let us compute the system of equations that the components of a Killing vector field must satisfy. If X = Xii}Ii}x i , and g = &jdxi ® dxj, then


367

Lxg = (LXgi} dx i ® dxi + gij(LXdx i) ® dxj + gijdx i ® (Lxdx j) k ag··. . axi k . . axj k = X -') dx' ® dx-l + g .. dx ® dx-l + g··dx' ® dx aXk ') aXk ') aXk axk aXk} i j k agij = { X - + gk· --. + gik--. dx ® dx. ai ) ax' ax) Note that Lxg is still a symmetric (0. 2)-tensor. as it must be. Hence X is a Killing vector field iff its components satisfy the following system of n partial differential equations. called Killing's equotions

C In the theory of elasticity. if u represents the displacement vector field. the expression Lug is called the strain tensor. As we shall see in the next section. this is related to the Cauchy-Green tensor C = ,,(uo) = I. _

5.4.9 The Classical Morse Lemma Let h: U ~]R be Ck. k;?: 3. U open in ]Rn. and let u E U be a nondegenerate critical point of h. i.e., h(u) = O. Dh(u) = 0 and the symmetric bilinear form D 2h(u) on ]Rn is nondegenerate. Then there is a local Ck-2 diffeomorphism 'I' of]Rn fixing u such that h('I'(x»

={l/2)[(x 1 _ u 1)2 + ... + (x _ un- i)2 _ (x n- i+1 _ un- i+1)2 _ ... _ (xn _ un)2).

Proof. In 4.5.8 take ( .) to be the dot-product in ]Rn to find a local Ck-2 diffeomorphism on ]Rn fixing Uo such that h(q>(x))

= (1I2)D 2h(u)(x -

u. x - u). Next. apply the Gram-Schmidt

procedure to find a basis of ]Rn in which the matrix of D2h(u) is diagonal with entries ±I (see 6.2.9 for a review of the proof of the existence of such a basis). If i is the number of -I's (the

index). let q> be the composition of 'I' with the linear isomorphism detennined by the change of an arbitrary basis of ]Rn to the one above. _

Exercises 5.4A Verify Theorem 5.4.1 by a coordinate computation as follows. Let F,,(x) = (yl(A. x) •...• yn(A. x» so that ifyi/aA = Xi(y) and ayi/axi satisfy the variational equation

Then write

Chapter 5 Tensors

376

Differentiate this in A. at A. =0 and obtain the coordinate expression (4) of Seclion 5.3 for Lxt. 5.4B Carry oUl the proof outlined in Exercise 5.4A for lime-dependenl veclor fields. 5.4C Slarling with Theorem 5.4.1 as the definition of Lxt, check that Lx satisfies 001, 002 and the properties (i)-(iv) of 5.3.2. 5.4D Let C be a contraction operator mapping 'T's(M) to 'Tr-1s_1(M). Use both 5.4.1 and 001 to show that Lx(Ct) =C(Lxt). 5.4E Extend Theorem 5.4.1 to F-valued tensors. 5.4F Let f(y) = (1/2)y2 - y3 + y5. Use the Lie transform method to show that there is a local diffeomorphism 0 whenever m *- n. Let mE U

C

M where (U, 0 such that Y = hX is complete. (Hint: With f as in Exercise 5.5G put h = exp {_(X[f))2) so that I Y[f) I :s 1. Hence (f 0 c)(]a. b[) is bounded for any integral curvc c of Y and ]a. b[ in the domain of c.) 5.51 Let M be a paracompact. non-compact manifold.

(i) Show that there exists a locally fmite sequence of open sets {Vi lie Z} such that Vi

n Vi+1 '" 0

unless j = i-I. i. i + I. and each Vi is a chart domain diffeomorphic (by the chart map CPi) with the open unit ball in the model space of M. See Fig. 5.5.1. (Hint: Let '11 be a locally finite open cover of M with chart domains diffeomorphic by their chart maps to the open unit ball such that no finite subcover of '11 covers M. and no two elements of '11 include each other. Let VO' VI be distinct elements of 0/, Vo

n VI '" 0. Let V_I E '11 be such that V_I n Vo '" 0; V_I exists by local finiteness of 0/. Now use induction.)

n VI = 0;

such a V_I

(ii) Vse (i) to show that there exists an embedding of ]R in M as a closed manifold. (Hint: Let Co be a smooth curve in Vo diffeomorphic by CPo to )-1. 1[ connecting a point in V_I n Vo to a point in Vo n VI. Next. extend Co n VI smoothly to a curve c!' diffeomorphic by CPI to ]0. 2[ ending inside VI n V 2 ; show that c I n U o extends the curve Co n VI inside Vo n VI. Now use induction.) (iii) Show that on each non-compact paracompact manifold admitting partitions of unity there exists a non-complete vector field. (Hint: Embed lR in M by (ii) and on lR consider the vector field ~ = x2 . Extend it to M via partitions of unity.)

Figure 5.5.1

Section 5.5 Partitions o/Unity

391

5.5J Show that every compact n-manifold embeds in some IRk for k big enough in the following way. If {(Vj, n, Ak(E)

=

{O}, while for 0 < k ~ n, Ak(E) has dimension n!/(n-k)!k!. The exterior algebra over E has

dimension 2n. If {et' ... , en} is an (ordered) basis of E and {e l , of Ak(E) is

... ,

en} its dual basis, a basis

(4) Proof First we show that the indicated wedge products span Ak(E). If a e Ak(E), then from

5.1.2, a = a(ei l' ... , eik)eil ® ... ® e~, where the summation convention indicates that this should be summed over all choices of it' ... , ik between 1 and n. If the linear operator A is applied to this sum and (2) is used, we get a

· · 1 ·

.

ll ® ... ® elk) = a(ei ' ... , ei ) -kl ell /\ ... /\ elk = Aa = a(ei I ' ... , e;.)A(e -. I k.

The sum still runs over all choices of the il' ... , ik and we want only distinct, ordered ones. However, since a is skew symmetric, the coefficient in a is 0 if it' ... , ik are not distinct. If they are distinct and 0 e Sk' then

since both a and the wedge product change by a factor of sign 0. Since there are k! of these rearrangements, we are left with

a =

L

a(ei l' ... , eik)eil /\ ... /\ eik.

i 1 < ... dim(span{a l •...• a k )) and so ai, ... , a k are linearly dependent. • 6.1.11 Corollary Let 9 E AI(E) and a

E

Ak(E). Then 9/\ a = 0 iff there exists ~

E

Ak-I(E)

such that a = 9 /\ ~.

Proof Clearly, if a =9 /\~, then 9/\ a = O. Conversely, assume 9/\ a choose a basis {eJiEI of E such that for some k E I, ek =9. If

=0,

9"* 0 and

from 9 /\ a = 0 it follows that all summands not involving ek are zero. Now factor ek oul of the remaining terms and call the resulting (k-l)-form ~. •

400

Chapter 6 Differentiol Forms

6.1.12 Examples A Let E = JR2, {el' e 2} be the standard basis of JR2 and {e l , e2 } the dual basis. Any element ffi of A I (JR2) can be written uniquely as ffi = ffile l + ffi 2e 2, and any element ffi of A(JR2) can be written uniquely as ffi = ffi12 e l /\ e 2. 8

Let E=]R3, {el'e2'~} be the standard basis, and {e l ,e2,e3 } the dual basis. Any E AI(]R3) can be written uniquely as ffi = ffile l + ffi 2e 2 + ffi3e3. Similarly, any

element ffi

elements 11 E A2(lR3) and ~ E A3(JR3) can be uniquely written as 11 = 1l12e l /\ e2 + ll\3el /\ e 3 + 1l23e2 /\ e 3 and ~ = ~123el /\ e2 /\ e3. • Since JR3, A I(JR3), and A2(JR3) all have the same dimension, they are isomorphic. An isomorphism ]R3 ;: AI(]R3) = (JR3)* is the standard one associated to a given basis: ei ...... e i , i = 1,2,3. An isomorphism of AI(JR3) with A2(JR3) is determined by

This isomorphism is usually denoted by *: AI(JR3) 1-+ A2(]R3); we shall study this map in general in the next section under the name Hodge star operator. The standard isomorphism of JR3 with A I(JR3) = (JR3)* is given by the index lowering

action b of the standard metric on JR3; i.e., b (ei ) = ei . Then *

0

b :

JR3 -+ A2(JR.3) has the

following property:

(5) for all v, w

E

JR.3, where x denotes the usual cross-product of vectors; i.e.,

The relation (5) follows from the definitions and the fact that if a = aIel + ~e2 + ~e3 and ~ = ~Iel + ~2e2 + ~3e3, then

Exercises 6.1A Compute a /\ a, a /\~, ~ /\ ~,and ~ /\ a /\ ~ for a = 2e l /\ e 3 - e2 /\ e3 E A2(JR3) and ~=-el +e2 _2e 3 where {e l ,e2,e3} is a basis of (JR.3)*. 6.18 If k! is omitted in the definition of A (6.1.1), show that /\ fails to be associative. 6.1C Let v\, ... , vk be linearly dependent vectors. Show that a(vl' ... , vk ) = 0 for all a Ak(E).

E

Section 6.1 Exterior Algebra

401

6.1D Let E be finite dimensional. Show that Ak(E*) is isomoprhic to (Ak(E»*. (Hint: Define
for some constant b. clearly unique. 6.2.2 Definition Let dim(E)

=n

and
e N(E) is called the determinant of cpo The defmition shows that the determinant does not depend on the choice of basis of E. nor does it depend on a norm on E. To compute det cpo choose a basis {el' .... en} of E with dual basis {el ..... en}. Let cpe L(E.E) have the matrix [Aiil;thatis. cp(e i) = ~lsisnAiiei' By Example 6.1.6A.

Since (e l /\ ... /\ en)(el' .... en) = 1 we get det cp = det[Aiil. the classical expression of the determinant of a matrix with xl' .... xn as columns. where xi has components Ai i. Thus the definition of the determinant in 6.2.2 coincides with the classical one. From properties of pull-back. we deduce corresponding properties of the determinant. all of which are well known from linear algebra. 6.2.3 Proposition Let cpo '" e L(E. E). Then (i) (ii) (iii)

det(cp 0

"')

=(det cp)(det "');

if cp is the identity, det cp'= 1; cp is an isomorphism iff det cp *- O. and in this case det(cp-I)

Proof For (i). (cp

0

"')*0>

= det(cp

0

",)0>; but (cp

0

"')*0>

= (",*

0

=(det cp)-I.

Cp*)O>. Hence. ( = (det ",)(det and (i) follows. (ii) follows at once from the definition. For (iii). suppose cp is an isomorphism with inverse cp-I. Therefore. by (i) and (ii). 1 = det(cp

0

cp-I)

=

(det cp)(det cp-I). and. in particular. det
. we have (cp*o»(el' .... en) = 0>(0. cp(e2 ) ..... cp(en = O. Hence. det cp = O. •

»

In Chapter 2 we saw that if E and F are finite dimensional. one convenient norm giving the topology of L(E. F) is the operator norm:

1

11 cp(e) II II cp II = sup{1I cp(e)1I i II e II = I} = sup II ell where II e II is a norm on E. (See §2.2.) Hence. for any e

E

E.

Ie*-O)

Section 6.2 Determinants, Volumes, and the Hodge Star Operator

405

II q>(e) II ~ II cp 1111 e II. 6.2.4 Proposition The 17Ulp del: L(E, E) -+ lR is continuous. Proof This is clear from the component formula for det, but let us also give a coordinate free proof. Note that II ro II

= sup { I ro(e" ... , en) 1111 e l II = ... =II en II = 1} = sup { I ro(e l ,

... ,

en) 1111 e l II .. · II en lei' ... , en

is a norm on An(E) and I ro(e l , ... , en) I

~

'* o}

II ro 1111 e l 11 .. ·11 en II. Then, for cp, '" E L(E, E),

I det cp - det '" III ro II = II cp*ro- ",*ro II

= sup{

I ro(cp(e l ),

... ,

»- ro(\jf(e

cp(en

l ), ... ,

»

\jf(en 1111 e l II

= ... = II en II = I}

»

~ sup { I ro(cp(e l ) - ",(e l ), cp(e2 ), ... , cp(en 1+'"

»

+ I ro(\jf(e l ), \jf(~), ... , cp(en) - \jf(en 1111 e l II

= ... = II en II = I}

~ II ro 1111 cp - \jf II {II cp IIn-1 + II cp IIn- 2 II '" II + ... + II \jf lin-I} ~ II ro 1111 cp - \jf 11 0 such that COl = cco 2 . An equivalence class [co] of volume elements on E is called an orientation on E. An oriented vector space (E. [co]) is a vector space E together with an orientation [co] on E; [-co] is called the reverse omntation. A basis {el' .... en} of the oriented vector space (E. [co]) is called positively (resp. negatively) oriented, if ro(e l •.... en) > 0 (resp. < 0).

The last statement is independent of the representative of the orientation [co]. for if co'

E

[co], then co' = cco for some c > O. and thus co'(e l •...• en) and co(e l •.... en) have the same sign. Also note that a vcctor space E has exactly two orientations: one given by selecting an arbitrary dual basis {e l •...• en} and taking [e l /\ ... /\ en]; the other is its reverse orientation. This definition of orientation is related to the concept of orientation from calculus as follows. In JR.3. a right-handed coordinate system like the one in Fig. 6.2.1 is by convention positively oriented. as are all other right-handed systems. On the other hand. any left-handed coordinate system. obtained for example from the one in Fig. 6.2.1 by interchanging

XI

and x2 • is by

convention negatively oriented. Thus one would call a positive orientation in JR.3 the set of all right-handed coordinate systems. The key to the abstraction of this construction for any vector space lies in the observation that the determinant of the change of ordered basis of two right-handcd systems in JR.3 is always strictly positive. Thus. if E is an n-dimensional vector space. define an equivalence relation on the set of ordered bases in the following way: two bases {el' ...• en} and {e l ' •...• en'} are equivalent iff det
O. where
(v l ) •...• q>(v n =I det q> In p(vi' ...• vn),for all VI' ...• Vn E E and all q> E L(E. E). Let I A In(E) denote the a-densities on E. With a = 1. I-densities on E are simply called densities and I A II(E) is denoted by I A I(E).

»

The determinant of q> in this definition is taken with respect to any volume element of E. As we saw in 6.2.2. this is independent of the choice of the volume element. Note that I A In(E) is one-dimensional. Indeed. if p I and P2 then P2(e l •...• en)

= aPI(el'

q>(e). defining q>

L(E. E). Then

E

E

I A In(E). p I

;to

...• e n). for some constant a

O. and {ei' ...• en} is a basis of E. E

JR. If v\' ...• vn

E

E. let Vi

=

i.e .• P2 = aPI· Alpha-densities can be constructed from volume elements as follows. If ro E An(E). define I ro In

E

I A In(E)

by

I ro In(ei' ...• en) = I ro(e l •...• en) In where el' ...• en

E

E. This

association defines an isomorphism of An(E) with IAln(E). Thus one often uses the notation lrol n for a-densities. We shall construct canonical volume elements (and hence a-densities) for vector spaces carrying a bilinear symmetric nondegenerate covariant two-tensor. and in particular for inner product spaces. First we recall a fact from linear algebra.

6.2.9 Proposition Let E be an n-dimensional vector space and g = (.) E 'J".

(7)

(8)

(9) (10)

Proof Equation (7) follows from (5) by symmetry of (a, ~>. Equations (8) follow directly from (6), witb k = 0, n, respectively, and 0

=identity

(note tbat c 1 ... c n =(_l)Ind(g». To verify ('.I),

it suffices to take a = e°(1) /\ ... /\ e°(k). By (6), *(e0(k+I) /\ ... /\ eo(n»

=be0(I) /\ ... /\ e°(k)

for a constant b. To find b use (5) witb a = ~ = eo(k+I) /\ ... /\ eo(n) to give (see (4» " be0(k+I) /\ ... /\ eo(n) /\ e0(1) /\ ... /\ eo(k) -- co(k+I) ... co(n),""' Hence b

=co(k+I) ... co(n)(-lj'(n-k) sign 0. Thus (6) implies **(e°(1) /\ ... /\ eo(n» =CO(I) ... co(k) (sign 0) *(eo(k+l) /\

... /\ eo(n»

= cO(1)" ,cO(k)co(k+I) .. ,co(n)(sign 0)2(_Ij'(n-k)eO(I) /\ ... /\ e°(k)

=(-I )Ind(g)(-I )k(n-k)e0(1) /\

... /\ eo(k).

Finally for (10), we use (7) and (9) to give (*a, *~>~

= *a

/\ **~

= (_l)Ind(gl(_Ij'(n-k) * a

/\ ~

=(_l)Ind(g)(a, ~>~.

•

= (_l)Ind(g)~ /\ * a

6.2.14 Examples A

The Hodge operator on A I(JR3) where JR 3 has the standard metric and dual basis is given from (6) by * e l =e 2 /\ e3, * e 2 =-e l /\ e3, and * e3 =e l /\ e 2. (This is tbe isomorphism considered in 6.1.11B). BUsing (5), we compute * in an arbitrary oriented basis. Write i i i ... i

*(e1/\ ..• /\ek)

=

(sum over jk+1 < ... < jn) and apply (5) witb

c,l

-Jk+l

.. k.

In

_i

_i

l""k+l /\ .•• /\l""n

Section 6.2 Determinants, Volumes, and the Hodge Star Operator

413

where (jl' ...• jk} is a complementary set of indices to (jk+I' ...• jn}' One gets

Hence .

*(e'l

.

1\ .•• 1\

e'k) = I det [gijll

(1 ...

1I2~

n) .. ...

"",signlj I ... jn g"J, ... g'kJ~k+'

where the sum is over all (k. n - k) shuffies

G~ ... ...

1\ ••• 1\

.

do.

(11)

~) . In

C In particular. if k = I. (11) yields n

I .. I . * e'. = I det ([gij]) I1/2~' "",(-IY- g'Je 1\ ••. 1\ r} 1\ •.• 1\ en

j=1 since sign(jl' j2' ...• jn) = (-I~-I. for j2 < ... < jn' jl = j. and where deleted.

(12)

ej

means that ei is

D From B we can compute the components of * a. where a E Ak(E). relative to any oriented basis: write a = ~ , ... ikei, 1\ ••• 1\ e ik (sum over i l < ... < ik) and apply (11) to give

Hence

n)

~I ... . (*(l)iJ.+,... j•-_ I det[gijll 1/2 "'" £..P; ,... ikg i,~ ... gikjk'sIgn. I ... In

(13)

for jk+1 < ... <jn and where the sum is over all complementary indices jl < ... <jk' E Consider lR,4 with the Lorentz inner product. which in the standard basis [e\, e 2 • e 3 • ) e4 of R4 has the matrix

o

o o

0

o

0

0

0

0

0 0

0

-1

Let {el. e2• e3 • e 4 } be the dual basis. The Hodge operator on A 1(lR4) is given by

414

Chapter 6 Differential Forms

If ]R4 had been endowed with the usual Euclidean inner product, the formulas for * e4 , *(e l /\ e4), *(e z /\ e 4), and *(e3 /\ e4) would have opposite signs. The Hodge * operator on A3(L~4) follows from the formulas on AI(]R4) and the fact that for ~ E AI(]R4), ** ~ == ~ (from

formula (9». Thus we obtain

F

If ~ is a one form and v!' v Z' ... , vn is a positively oriented orthonormal basis, then

This follows from (5) taking ex = vI, the first element in the dual basis.

Exercises 6.2A

Let (e l , eZ, e 3) be the standard dual basis of ]R3 and ex = e l /\ e Z - 2e z /\ e 3 E AZ(]R3), ~ = 3e l - e2 + 2e 3 E Al (]R3), and q> E L(]Rz, ]R3) have the matrix

Compute q>*ex. With the aid of the standard metrics in ]R2 and ]R3, compute * ex, *~, *(q>*ex), and *(q>*~). Do you get any equalities? Explain. 6.2B A map q> E L(E, F), where (E, Ol),

(F,~)

are oriented vector spaces with chosen volume

clements, is called volume preserving if q>*~ = Ol. Show that if E and F have the same (finite) dimension, then q> is an isomorphism. 6.2C A map q> E L(E, F), where (E,

[~])

and (F, [Ol)) are oriented vector spaces, is called

orientation preserving if q>*~ E [Oll. If dim E = dim F, and q> is orientation preserving, show that q> is an isomorphism. Given an example for F = E = ]R 3 of an orientation-preserving but not volume-preserving map. 6.2D Let E and F be n-dimensional real vector spaces with nondegenerate symmetric two-tensors, g E T0 2 (E) and hE T0 2 (F). Then q> E L(E, F) is called an isometry if h(' ~ and B

=/\.

which is bilinear and continuous. Thus (a

/\ ~)q> is ~ by the Leibniz rule . • 6.3.7 Definition Let OeM) denote the direct sum of Ok(M). k = O. 1•...• together with its

structure as an (infinite-dimensional) real vector space and with the multiplication /\ extended componentwise to OeM). (If dim M = n < 00. the direct sum need only be takenfor k = O. 1•...• n.) We call U(M) the algebra of exterior differentialforms on M. Elements of Ok(M) are called

k-fonns. In particular. elements of

X*(M) are called one-fonns.

Note that we generally regard OeM) as a real vector space rather than an !reM) module (a~ with 'T(M». The reason is that !reM) = OO(M) is included in the direct sum. and f /\ a

=f ® a

= fa. 6.3.8 Remarks and Examples

e on a manifold

M assigns to each mE M a linear functional on T mM.

A

A one-form

B

A two-form c.o on a manifold assigns to each m E M a skew symmetric bilinear map

C

For an n-manifold M. a tensor field t E 'Ps(M) has the local expression t(u) =

j ... j

(l

, .....

Jt

J.

a

a

J'

j

(u)-. ® ... ® - . ® dx I ® ... ® dx' axIl axl,

where u E U. (U. q» is a local chart on M. and

Section 6.3 Differential Forms

t'i "'ir.

.

J,'" J,

i ... dx i r (u) = t( dx' '"

a

- - •••

axi, ,

419

-

a )(u) .

'axil

The proof of 6.1.8(ii) gives then the local expression for

CJ) E

Ak(M), namely

where

D

In Q(M), the addition of forms of different degree is "purely formal" as in the case M = E. Thus, for example, if M is a two-manifold (a surface) and (x, y) are local coordinates on U

C

M, a typical element of Q(M) has the local expression f + g dx +

h dy + k dx /\ dy, for f, g, h, k E

E

.1"(U).

As in §6.1, we have an isomorphism of vector bundles

* : A I(J!t3) ~ A2(J!t3) given by

On the other hand, the index lowering action given by the standard Riemannian metric on

J!t3 defines a vector bundle isomorphism £>: T(J!t3) ~ T\J!t3)

=A 1(J!t3).

These two

isomorphisms applied pointwise define maps

and Then 6.1.12C implies *[(X x Y?l

for any vector fields X, Y

E

=xl> /\ y&

X(J!t3), where X x Y denotes the usual cross-product of

vector fields on J!t3 from calculus. That is,

F

= Xi -a.

i

a .

and Y = Y - . , 1 = I, 2, 3. ax' ax' The wedge product is taken in Q(M) in the same way as in the algebraic case. For example, if M = J!t3, (X = dx 1 - x1dx 2 E Ql(M) and P= x2dx 1 /\ dx 3 - dx 2 /\ dx 3, then (X /\ J3 =(dx 1 - x 1dx 2 ) /\ (x2dx 1 /\ dx 3 - dx 2 /\ dx 3) for X

=0-

X lx 2 dx 2 /\

= (x 1x2

_

dx I

/\

dx 3 - dx I

l)dx 1 /\ dx 2 /\ dx 3. •

/\

dx 2

/\

dx 3 + 0

420

Chapter 6 D;gerentiJll Forms

6.3.9 Definition Suppose F: M ~ N is a C'" mapping of manifolds. For F*o> : M ~ Ak(M) by F*O>(m) = (TmF)* 00>0 F(m); i.e .•

0> E

nk(N). define

where vI' ...• vk E TmM; for g E nD(N). F*g =g 0 F. We say F*o> is the pull-back of F. (See Fig. 6.3.1.)

0>

by

6.3.10 Proposition Let F: M ~ N and G : N ~ W be C'" mappings of manifolds. Then (i) F*: nk(N) ~ nk(M); (ii) (G 0 F)* = F*

0

G*;

(iii) if H : M ~ M is the identity. then H* : Uk(M) ~ Uk(M) is the identity;

(iv) if F is a diffeomorphism. then F* is an isomorphism and (F*)-I = (F-I)*; (v) F*(a 1\ ~) =F*a 1\ F*~ for a E nk(N) and ~ E n'(N). F*

----

......

~~-

F'Ol

M

Figure 6.3.1 Proof Choose charts (U. cp). (V. 'II) of M and N so that F(U) C V. Then the local representative F",'I1 = 'II 0 F 0 cp-I is of class C-. as is 0>'11 = (T",)*o 0> 0 ",I. The local representative of F*o> is

which is of class C'" by the composite mapping theorem. For (ii). note that it holds for the local representatives; (iii) follows from the definition; (iv) follows in the usual way from (ii) and (iii); and (v) follows from the corresponding pointwise result. _ We close this section with a few optional remarks about vector-bundle-valued forms. As before. the idea is to globalize vector-valued exterior forms.

421

Section 6.3 Differentiol Forms

6.3.11 Definition Let

1t:

E ~ B, p: F ~ B be vector bundles over the same base. Define

the vector bundle with base B of vector bundle homomorphisms over the identity from Ak(E) to F. If E = TB, Ak(TB; F) is denoted by Ak(B; F) and is called the vector bundle of F-valued k-forms on M. If F = B x F, we denote it by Ak(B, F) and call its elements vector-valued k-forms on M. The spaces of sections of these bundles are denoted respectively by nk(E; F),

nk(B; F) and nk(B; F). Finally, n(E; F) (resp. n(B; F), n(B, F» denotes the direct sum of nk(E; F), k = 1,2, ... , n, together with its structure of an infinite-dimensional real vector space and ~(B)-module. Thus, a E nk(E; F) is a smooth assignment to the points b of B of skew symmetric k-linear Eb ~ F b. In particular, if all manifolds and bundles are finite dimensional, then a E nk(M, )RP) may be uniquely written in the form a = ~i = I, "', pa i ej , where aI, ... , a P E nk(M), and {el' ... , e p} is the standard basis of )RP. Thus a E nk(E, )RP) is written in

maps

~:

Eb x ...

X

local coordinates as

for i l < ... < ik. Proposition 6.3.10(i)-(iv) and its proof have straightforward generalizations to vector-bundle-valued forms on M. The wedge product requires additional structure to be defined, namely a smooth assignment b 1-+ gb of a symmetric bilinear map gb: Fb x Fb ~ Fb for each b E B. With this structure, 6.3.10(v) also carries over.

Exercises 6.3A Show that for a vector bundle 1t: E ~ B, Ak(E) is a (smooth) subbundle of 'flk(E). Generalize to vector-bundle-valued tensors and forms. 6.3B Let
0, Ft(u) = tu. Thus Ft is a diffeomorphism and, starting at t = 1, is generated by the time-dependent vector field ~(u)

=u!t;

that is, F1(u) =u and dFt(u)/dt =~(Ft(u». Therefore, since co is closed,

For 0 < to :5 1, we get

Section 6.4 The Exterior Derivative, Interior Product, and Lk Derivative

Letting

to --+ 0,

we get

0)

437

= d~, where

Explicitly,

(Note that this fonnula for

~

agrees with that in the previous proof.) _

It is not true that closed forms are always exact (for example, on ]R2\ {(O, 0») or on a sphere). In fact, the quotient groups of closed fonns by exact fonns (called the deRham cohomology groups of M) are important algebraic objects attached to a manifold; they are discussed in §7.6. Below we shall prove that on smoothly contractible manifolds, closed fonns are always exact. Let r ~ 1. Two C' maps f, g : M --+ N are said to be (properly) Cr-homotopic, if there exists e> 0 and a Cr (proper) mapping F: ] - e,l + e[ x M --+ N such that F(O, m) = f(m), and F(l, m) = g(m) for all mE M. The manifold M is called

6.4.15 Definition

C'-contractihle if there exists a point mo E M and Cr-homotopy F of the constant map m 1-+ mo with the identity map of M; F is called a C'-contraction of M to mo. The following theorem represents a verification of the homotopy axiom for the deRham cohomology.

a E nk(N) a closed k-form (with compact support) on N. Then g*a - r*a E nk(M) is an exact k-form on M (with compact support). 6.4.16 Theorem Let f, g: M --+ N be two (properly) Cr-homotopic maps and

The proof is based on the following. 6.4.17 Deformation Lemma For a C'-manifold M let the C' mapping it: --+ ]-e, 1 + e[ x M

be given by it(m) =(t, m). Define H: (lk+I(] -e, 1 + e[ x M) --+ (lk(M) by

Then doH

0

d = i : - i ;.

Proof Since the flow of the vector field a/at E Xr(]-e, 1 + e[ x M) is given by FA.(s, m) = (s + A., m), i.e., is+A. = FA. 0 is, for any fonn 13 E nt(l-e, 1 + e[ x M) we get


438

'*L A Is a/iltf'

.* d

= Is dA.

I 1..=0

F*A d "y = dA.

I 1..=0

'*F*A d "y = dA.

Is

I

.* A

d '*A

Is+f..f' = dIs f'. 1..=0 S

Therefore, since the integrand in the formual for H is smooth, d and the intcgral sign commute, so that by Cart an's formula (3) and the above formula we get

Proof of Theorem 6.4.16 Define G

=H

0

F*, where H is given in the Deformation Lemma

6.4.17 and F is the (proper) homotopy between f and g. Since F* commutes with d we get doG + God = g* - r*, so that if the term a. E Qi«N) is closed (and has compact support), (g*

- r*) (a.) = d(Ga.) (and Ga. has compact support). _ 6.4.18 Poincare Lemma for Contractible Manifolds Any closed form on a smoothly contractible manifold is exact. Proof Apply the previous theorem with g = identity on M, and f(m) =

mo. _

The naturality of the exterior derivative has been investigated by Palais [1959]. He proves the following result. Let M be a connected paracompact n-manifold and assume that A : np(M) ~ nq(M)

is a linear operator commuting with pull-back, i.e.,

diffeomorphism cp: M

~

A

0

cp* = cp* 0 A for any

M. Then

0, if 0 $; P $; n, 0 < q $; n, q;t p, p+ I,

A =

1

a (Identity), if 0 < q = P $; n,

bd, if 0

$;

P $; n-l, q = p + I,

for some real constants a, b. If M is compact, then in addition we have

I

0, if q = 0, 0 < p < n c (Identity), if p = q = 0

A=

0, if q = 0, p = n, M is non-orientable or orientable and reversible,

df M' if q = 0, p = n,

M is orientable and non-reversible,

for some real constants c, d. (Orientability and reversibility will be defined in the next section whereas integration will be the subject of Chapter 7.)

Section 6.4 The Exterior Derivative, Interior Product, and Lie Derivative

~

439

SUPPLEMENT 6.4B

Differential Ideals and Pfaffian Systems This box discusses a refonnulation of the Frobenius theorem in tenns of differential ideals in the spirit of E. Cartan. Recall that a sub bundle E c: TM is called involutive if for all pairs (X, Y) of local sections of E defined on some open subset of M, the bracket [X, Y] is also a local section of E. The subbundle E is called integrable if at every point m

E

M there is a local

submanifold N of M such that TmN = Em' Frobenius' theorem states that E is integrable iff it is involutive (see §4.4). Before starting the general theory let us show by a simple example how fonns and involutive subbundles are interconnected. Let

0) E

02(M) and assume that Em = {v E TM I ivO) = o} is a

subbundle of TM. If X and Y are two sections of Em' then

Thus, Em is involutive iff ixiydO) = O. In particular, if

0)

is closed, then Em is involutive. In

this supplement we shall express conditions such as ixiydO) = 0 in tenns of one forms for subbundlcs E of TM that are not necessarily defined by exterior fonns. For any subbundle E, the k-annihilator of E defined by

is a subbundle of the bundle Ak(M) of k-fonns. Denote by r(U, E) the C~ sections of E over the open set V of M and notice that

is an ideal of O)(M); i.e., if I(E).

0)1'

Wz

E

I(E) and P E O(M), then

0)1

+ Wz

E

I(E) and P 1\

0)1 E

6.4.15 Proposition The subbundle E of TM is involutive iffor all open subsets V of M and all 0) E r(V, E 0(1», we have dO) E reV, E 0(2». If E is involutive, then 0) E reV, E o(k» implies dO) E r(V, E o(k + 1). Proof For any a

E

reV, E 0(1» and X, Y E reV, E), 6.4.11(ii) yields

da(X, Y)

=X[a(Y)]- Y[a(X)]- a([X, YD =- a([X, YD.

Thus E is involutive iff da(X, Y) = 0; i.e., da

E

reV, E 0(2» . •

440

Chapter 6 DifferentWl Forms

The Frobenius theorem in terms of differential forms takes the following form.

6.4.20 Corollary The subbundle E c: TM is integrable iffor all open subsets U of M, ro

E

nU, E°(\» implies dro E nU, EO(2». The following considerations are strictly finite dimensional. They can be generalized to infinite-dimensional manifolds under suitable splitting assumptions. We restrict ourselves to the finite-dimensional situation due to their importance in applications and for simplicity of presentation.

6.4.21 Definition Let M be an n-manifold and I c: OeM) be an ideal. We say that I is locally generated by n - k independent one-/orms, if every point of M has a neighborlwod U and nk pointwise linearly independent one-forms rol' ... , ron-k E OI(U) such that: n-k

(i) if ro E I, then ro I U =

L9

i 1\

roi for some 9i E OeM);

i=1

(ii) if ro E OeM) and M is covered by open sets U such that for each U in this cover, n-k

ro I U =

L 9i

1\

roi for some 9i E OeM),

i=1

then ro E I. The ideal I c: OeM) is called a differential ideal if dI c: I Finitely generated ideals of OeM) are characterized by being of the form I(E). More precisely, we have the following.

6.4.22 Proposition Let I be an ideal of OeM) and let n = dim(M). The ideal I is locally generated by n - k linearly independent one-forms iff there exists a subbundle E c: TM with k-dimensionalfiber such that 1= I(E). Moreover. the bundle E is uniquely determined by I Proof If E has k-dimensional fiber, let Xn_k+I' ... , ~ be a local basis of nU. E). Complete this to a basis of X(U) and let roi E OI(U) be the dual basis. Then clearly rol' ... , ron _ k are linearly independent and locally generate I(E). Conversely, let rol' ... , ron _k generate lover U and define Em = {v E TmM I roi(m)(v) = 0, i = 1, ... , n - k}. Em is clearly independent of the generators of lover U so that E = nmeMEm is a subbundle of TM. It is straightforward to check that I= I(E). Finally, E is unique since E *- E' implies I(E) *- I(E') by construction. _ Differential ideals are characterized among finitely generated ones by the following.


441

6.4.23 Proposition Let J be an ideal of O(M) locally generated by n - k linearly independl1' ...• O>n_k

E

01(U). n

= dim(M). and

let 0>1

O>n_k

A ... A

= 0> E

on-k(u). Then the

following are equivalent: (i) J is a differential ideal;

(ii) do> = Ij':.70>ij A O>j for some O>;j

E

01(U) and for every U as in the hypothesis;

(iii) dO>i A 0> =0 for all open sets U. as in the hypothesis; (iv) there exists

eE

01(U) such that do> =e A 0> for all open sets U. as in the

hypothesis.

Proof That conditions (i) and (ii) are equivalent and (ii) implies (iv) follows from the definitions. Condition (iv) means that n-k

L (-l)id

0>;

A

0>1

A ••• A

&; A

••• A

O>n-k =

eA

0>1

A ••• A

O>n-k.

i=1 so that multiplying by O>i we get (iii). Finally. we show that (iii) implies (ii). Let 0>1' ...• O>n 01(U) be a basis such that 0>1' ...• O>n_k generates J over U. Then

dO>i = L<XijtO>j jt. where <Xijt

E

E

.'F(U).

But

o = d O>i A 0> =

L

<X;jtO>j A O>t

A

0>1

A ••• A

O>n-k

n-k<jd

and thus <Xijt

=0

for n - k <j
1' ... , O>n_k E 01(U); (iv) for every point of M there exists an open set U and 0>1' ... , O)n_k generating J(E) such that

E

01(U)

Chapter 6 Differentiol Forms

442

n-k

dOlj =

L Cl>ij /\ Cl>j for some Cl>ij

E

I

n (U);

j=1 (v) same as (iv) but with the condition on Cl>i being: dCl>i /\

Cl1 /\ ... /\ Cl>n_k = 0;

(vi) same as (iv) but with the condition on Cl>i being: there exists

= e /\ CI>,

where CI>

eE

nl(U) such that dCl>

=Cl>1 /\ ... /\ Cl>n_k'

6.4.25 Examples A In classical texts (such as Cartan [1945) and Flanders [1963)), a system of equations

Cl>1 = 0, ... , Cl>n-k = 0, where Cl>i E nl(U) and U

C

]Rn

is called a Pfaffian system. A solution of this system is a k-dimensional submanifold N of U given by xi = xi(u I,

... ,

uk) such that if one plugs in these values of xi in the system, the result is

identically zero. Geometrically, this means that Cl>1' ... , Cl>n_k annihilate TN. Thus, finding solutions of the Pfaffian system reverts to finding integral manifolds of the subbundle E = {v E TV I Cl>i(v) = 0, i = 1, ... , n - k} for which Frobenius' theorem is applicable; thus we must have dCl>i /\ Cl>1 /\ ... /\ Cl>n_k = O. This condition is equivalent to the existence of smooth functions a jj , bj on U such that n-k Cl>i = Lajjdbj. j=1 To see this, recall that by the Frobenius theorem there are local coordinates bl' ... , bn on U such

= constant, ... , bn_k = constant, so that db i, i = 1, ... , n - k annihilate the tangent spaces to these submanifolds. Thus the ideal I generated by

that the integral manifolds of E are given by b l

db l ' ... , dbn_k ; i.e., Cl>i = Lj=l ..... n-k aij db j for some smooth functions a ij on U. B Let us analyze the case of one Pfaffian equation in ]R2. Let CI> =P(x, y)dx + Q(x, y)dy

E

nlo~2) using standard (x, y) coordinates. We seek a solution to CI> = O. This is equivalent to

dy/dx = -P(x, y)/Q(x, y), so existence and uniqueness of solutions for ordinary differential equations assures the local existence of a function f(x, y) such that f(x, y) = constant give thc integral curves y(x). In other words, f(x, y)

= constant is an integral manifold of

CI>

= O.

The

same result could have been obtained by means of the Frobenius theorem. Since dCl> /\ CI>

E

n3(]R2), we get dCl> /\ CI> =0, so integral manifolds exist and are unique. In texts on differential equations, this problem is often solved with the aid of integrating factors. More precisely, if CI> is not (locally) exact, can a function f and a function g, called an integrating factor, be found, such that gCl> = df? The answer is "yes" by 6.4.24(iii) for choosing f as above, E Thus g is found by solving the partial differential equation,

= ker df locally.


443

This always has a solution and the connection between g and f is given by I df

g =

P

dX

f(x, y) = constant being the solution of ro = O. C Let us analyze a Pfaffian equation ro = 0 in jRn. As in 8, we would like to be able to

write gro = df with df;t 0 on U

C

jRn, for then f(xl, ... , xn) = constant gives the (n-

I)-dimensional integral manifolds; i.e., the bundle defined by ro is integrable. Conversely, if the bundle defined by ro is integrable, then by 8, gro = df. Integrability is (by the Frobenius theorem) equivalent to dro /\ ro = 0, which, as we have seen in 8, is always verified for n = 2. For n

~

3, however, this is a genuine condition. If n = 3, let ro = P(x, y, z)dx + Q(x, y, z)dy +

R(x, y, z)dz. Then droI\ro=[p(dR _

ay

dQ)+Q(dP _dR)+R(dQ _ dP)]dX/\dY/\dz; dZ dZ dX dX ay

so ro = 0 is integrable iff the term in the square brackets vanishes. D The Frobenius theorem is often used in overdetermined systems of partial differential

equations to answer the question of existence and uniqueness of solutions. Consider for instance the following system of A. Mayer [1872] in jRP-+
i=I, ... ,p,

a=I, ... ,q.

We ask whether there is a solution y = f(x, c) for any choice of initial conditions c such that f(O, c) = c. The system is equivalent to the following Pfaffian system:

Since the existence of a solution is equivalent to the existence of p-dimensional integral manifolds, Frobenius' theorem asserts that the existence and uniqueness is equvalent to dro u = ~=I ..... qroU13 /\ ro13 for some one-forms ro u13. A straightforward computation shows that U a j k dA u i i 13 dro = C jkdx /\ dx + --13- dx /\ ro , iJy

where a

C

jk =

iJA Uj iJx k

iJA U k

-

iJA Uj

13

iJA uk

13

a;;J + ay13 A k - ayll A J"


444

Since dxl •...• dx p • ro l •...• roq are a basis of nl(lRP+q). we see that dron = 1:13=1 •...• qron13 Thus the Mayer system is integrable

A

rob for some one-/orms ron13

iff

C"jk

=O.

iff

C"jk

= O.

•

In §S.4 and §S.S we shall give some applications of Frobenius' theorem to problems in constraints and control theory. Many of these applications may alternatively be understood in terms ofPfaffian systems; see for example. Hermann [1977. Ch. 181.

b

IDENTITIES FOR VECTOR FIELDS AND FORMS 1. Vector fields on M with the bracket [X. Yl form a Lie algebra; that is. [X. Yl is real bilinear. skew symmetric. and Jacobi's identity holds: [[X. Yl. Zl + [[Z. Xl. Yl + [[Yo Zl. Xl = O. Locally. [X. Yl = DY·X-DX·Y and on functions. [X. Y][f] = X[Y[f]l - Y[X[f]l. 2. For diffeomorphisms . ",. *[X. Yl = [*X. * Yl and (

0

",)*X = *"'*X.

3. The forms on a manifold are a real associative algebra with A as multiplication. Furthermore. a A 13 = (-lf l l3 A a for k and t-forms a and 13. respectively. 4. For maps . ",. *(a A 13) = *a A *13. ( 0 ",)*a = "'**a. 5. d is a real linear map on forms and dda = O. d(a A 13) = da A 13 + (-If a 6. For a a k-form and

"0..... Xk

A

dl3 for a a k-form.

vcctor fields:

k

da(X o..... Xk)= L(-I)iXi[a(x o..... Xi' .... Xk)l i=O

+

L

(-l)i+ja([x i• Xjl. Xo .....

~ .....

Xj' .... X k)

OSi<jSk

If M is finite dimensional and a = <Xj 1'" iIt.dx i1

A ••• A

dx\ i l < ... < ik • then

Section 6.4 The Exterior Derivative, Interior Product, and Lie Derivative k

(da)j j

I'" k+1

=

L

p-I

(-0

p=1

a

- . a J· a)k

X

I

... )·

)'

p-I p+1

"')'

)k

hi

a

+ (-1 . - aJ· ... )· a h+1 I k

•

445

for jl Qk~I(M) is a /\ antiderivation. (ii) ixf = 0 for f E ,'RM); (iii) ixw = w(X) for WE QI(M); (iv) ix is natural with respect to restrictions. Usc these properties to show i[X,YI = LXiy - iyLx. Finally, show ix

0

ix = O.

6.4F Show that a derivation mapping Qk(M) to Qk+I(M) for all k ~ 0 is zero (note that d and ix are antiderivations).

6.4G Let s: T2M --> T2M be the canonical involution of the second tangent bundle (see exercise 3.4D). (i)

If X is a vector field on M, show that s 0 TX is a vector field on TM.

(ii)

If F t is the flow of X, prove that TFt is a flow on TM generated by sO TX.

(iii)

If I-l is a one-form on M, I-l': TM --> TIt is the corresponding function, and w E T2M, then show that dl-l'(sw)

= dl-l(tTM (w), TtM(w)) + dl-l'(w).

6.4H Prove the following relative Poincare lemma. Let and let N

C

W

be a closed k-form on a manifold M

M be a closed submanifold. Assume that the pull-back of

Then there is a (k - 1)-form a vanishes on N. If

w to N is zero. w and a

on a neighborhood of N such that da =

w vanishes on N, then a can be chosen so that all its first partial

derivaties vanish on N. (Hint: Let SI. Define dw, a one-form on M by taking the TIt-projection of Tw. Show that (i) if w: M --> S I, then ddw is smooth, then r*(dw) = d(r*w), where r*w =

w

0

= 0; and (ii) if

f: M --> N

f.

6.4J Prove the identity

6.4K

(i)

Let X

= (Xl,

X2, 0) be a vector field defined on the plane S

TIt} in TIt 3. Show that there exists Y

E

= ((x, y, 0) I x, y E

XCR3) such that X = curl Y on S. (Hint:

Let Vex, y, z) = (zX2(x, y), -zXI(x, y), 0).) (ii)

XeS). Show that there exists Y E X(TIt 3) such that X = curl Y on S. (Hint: By 5.5.9 extend X to X E X(TIt3) and put (j) = * XE>. Locally find a such that da = w by (i). Use a partition of unity { 0 for all m or else f(m) < 0 for all m. Thus v is equivalent to 11 or -11. Conversely, if M is not connected,let U"* 0 or M, be a subset that is both open and closed. If 11 is a volume form on M, define v by letting v(m) = Il(m) if m E U and v(m) = -Il(m) if m Ii! U. Obviously, v is a volwne form on M, and v Ii! [Ill U [-Ill. _ A simple example of a nonorientable manifold is the Mobius band (see Figure 6.5.1 and Exercise 6.SL). For other examples, see Exercises 6.SK, M.

Figure 6.5.1

6.5.5 Proposition The equivalence relation in 6.5.3 is natural with respect to mappings and diffeomorphisms. That is, if f: M ~ N is of class C~, IlN and v N are equivalent volume forms on N, and r*(IlN) is a volume form on M, then r*(v N ) is an equivalent volume form. If f is a diffeomorphism and 11M and v M are equivalent volume forms on M, then f*(IlM) and f*(v M) are equivalent volume forms on N. Proof This follows from the fact that r*(gm) = (g

0

f)r*m, which implies r*(gm) = (g 0 [-I)r*m

when f is a diffeomorphism. _

6.5.6 Definition Let M be an orientable manifold with orientation [Ill. A chart (U, " ... g'kik,'A. . 1 det[g .. 11 112 k+ I 1) 1,', Ik'k t L..t f'lI···h" I)

ax

p=1

or as a contravariant tensor

where ~ = ~, ... , ' 1

k hl

dx" /\ ... /\ dx'k~.

(sum over r l < ... < rk + l ) is the usual coordinate expression for

~.

Formula (3) is messy to

prove directly. However it follows fairly readily from integration by parts in local coordinates and the fact that I) is the adjoint of d. a fact that will be proved in Chapter 7 (see 7.2.13 and Exercise

7.5G). We now shall express the divergence of a vector field X E X(M) in terms of

I).

We define

the divergence div g(X) of X with respect to a pseudo-Riemannian metric g to be the divergence of X with respect to the pseudo-Riemannian volume 11 = Il(g) of g; i.e .• Lx ll = divg(X)Il. To compute the divergence. we prepare a lemma.

6.5.22 Lemma ixll =

* xU.

Proof At points where X vanishes this relation is trivial. So let X(x) *" 0 and choose v2' ... , Vo E T"M such that {X(x)/g(X, X)(x). v2..... vol is a positively oriented orthonormal basis of T"M. Then by 6.2.14,

This may also be seen in coordinates using formula (12) of §6.2.

6.5.23 Proposition Let g be a pseudo-Riemannian metric on the orienta table n-manifold M. Then div /X) = -oxU. (4)

Section 6.5 Orientation, Volume Elements, and Codifferential

459

In (positively oriented) local coordinates (5)

Proof Let Lx ll = dixll = d * xf' by the lemma. But then (div g X)1l the definition of I) and the fonnula for **. Since *1

=11,

=- * I)xf' =_(I)Xb) * 1

by

we get (4). To prove fonnula (5),

write 11 = 1det[~j111/2dxl /\ ... /\ dx n and compute Lxll = dixll in these coordinates. We have A

ixll = 1det[gi!2Xk(-I)k dx l /\ ... /\ dxk /\ ... /\ dxn and so

k) 1

. a 1det [gij 111/2X dx /\ ... /\ dx n dlxll = (ai a

1

----:-I!2~ --k (I det[gij11

1det [gij11

1/2

k

X )11·

•

ax

6.5.24 Definition The Laplace-Beltrami operator onfunctions on a orientable pseudo-Riemannian

manifold is defined by V2 =div

0

grad.

Thus in a positively oriented chart, equation (5) gives af) V2 f=ldet[g·ll -1/2 - a ( g 'k Idet[gll 112 . 1J axk 1J ax'

(6)

Exercises 6.5A Let f: JR.n

~

JR.n be a diffeomorphism with positive Jacobian and f(O) = O. Prove that there

is a continuous curve ft of diffcomorphismsjoning f to the identity. (Hint: First join f to Df(O) by gt(x) = f(tx)/t.) 6.58 If t is a tensor density of M, that is, t = t' ® 11, where 11 is a volume fonn, show that

6.5C A map A: E ~ E is said to be derived from a variational principle if there is a function L: E

~

JR. such that dL(x)·v = (A(x), v),

where (,) is an inner product on E. Prove Vainberg's theorem: A comes from a variational principle if and only if DA(x) is a symmetric linear operator. Do this by applying the Poincare lemma to the one-fonn a(x)· v = (A(x) ,v) (see Hughes and

460


Marsden [1977]). 6.SD Show in three different ways that the sphere

sn

is orientable by using 6.5.2 and the two

charts given in 3.1.2, by constructing an explicit n-form, and by using 6.5.9. 6.SE Use formula (6) to show that in polar coordinates (r, e) in ]R2,

and that in spherical coordinates (p,

where Il

e, 0 on M; i.e., f is orientation preserving. 6.SH Let g be a pseudo-Riemannian metric on M and define g.. = Ag for A> O. Let *.. be the Hodge-star operator defined by g.. and set *1 A(n/2)-k

= *.

Show that if a

E

Qk(M), *.. a

=

* a.

6.51 In]R3 equipped with the standard Euclidean metric show that for any vector field F and any function f we have: div F

= -liP>,

curl F = (1i*P»#, and grad f = -(*Ii*f)#.

6.5.1 Show that if M and N are orientab1e, then so is M x N. 6.SK (i) Let 0: M

~

M be an involution of M, Le., 000 = identity, and assume that the

equivalence relation defined by 0 is regular, i.e., there exists a surjective submersion 1t: M ~ N such that 1t- 1(n)

= (m, o(m)},

where 1t(m)

= n.

Let Q±M)

= {a E

Q(M) I

0* a = ±a} be the ± 1 eigenspaces of 0*. Show that 1t* : Q(N) ~ Q /M) is an isomorphism. (Hint: To show that range 1t* = Q +(M), note that 1t 0 0 = 1t implies range 1t* C Q +(M). For the converse inclusion, note that Tm1t is an isomorphism, so a form at 1t(m) uniquely determines a form at m. Show that this resulting form is


461

smooth by working in a chart on M diffeomorphic to a chart on N.) (ii) Show that lRP' is orientable if n is odd and is not orientable if n is even. (Hint: In (i) take M = So C ]R0+I, cr(x) = x, and N =Un. Let CO be a volume element on So induced by a volume element of Ro+l. Show cr*co =(_1)0+I CO• Now apply (i) to orient RlP'° for n odd. If n is even, let v be an n-fonn on JRlP'0; then 1t*v = fco. Show that f(x) =-fe-x) so f must vanish at a point of So.) 6.5L In Example 3.4.8C, the Mobius band M was defined as the quotient of ]R 2 by the equivalence relation (x, y) - (x+k, (_1)ky) for any k E Z. (i) Show that this equivalence relation is regular. Show that M is a non-compact. connected. two-manifold. (ii) Define cr:]R2 ~ R2 by cr(x, y) = (x + 1, -y). Show 1t 0 cr = cr, where cr: R2 ~ M, is the canonical projection. If v E n2(M) define f E .1(]R2) by 1t*v = fco. where co is an area fonn on R2. Show that f(x + 1, -y) = -f(x, y). (iii) Conclude that f must vanish at a point of ]R2, and that this implies M is not orientable. 6.5M The Klein bottle K is defined as the quotient of R2 by the equivalence relation defined by (x, y) - (x + n, (-l)0y + m) for any n, m E Z. (i) Show that this equivalence realtion is regular. Show that K is a compact, connected.

smooth, two-manifold. (ii) Use 6.5L(ii), (iii) to show that K is non-orientable. 6.5N Orientation in vector bundles. Let 1t : E ~ B be a vector bundle with finite dimensional fiber modeled on a vector space E, and assume B is connected. The vector bundle is said to be orientable if the line bundle L = E* 1\ .•• 1\ E* (dim (E) times) has a global nowhere vanishing section. An orientation of E is an equivalence class of global nowhere vanishing sections of L under the equivalence relation: cr I - cr 2 iff there exists f E .1'(B), f> 0 such that cr2 = fcrl' (i) Prove that E is orientable iff L is a trivial line bundle. Show that E admits exactly two orientations. Showthat an orientation [crJ of E induces an orientation in each fiber of E. (ii) Show that a manifold M is orient able iff its tangent bundle is an oriented vector bundle. (iii) Let E, F be vector bundles over the same base. Show that if two of E. F and E EEl F are orientable, so is the third. (iv) Let E, F be vector bundles (over possibly different bases) Show that Ex F is orientable if and only if E and F are both orientable. Conclude that if M. N are finite dimensional manifolds, then M x N is orientable if and only if M and N are both orientable. (v) Show that E EEl E* is an orientable vector bundle if E is any vector bundle. (Hint:

Chapter 6 DigerenJiol Forms

462

Consider the section Q(x)«el' a. 1). (e2 • a. 2 » = (a.2 • e l )

-

(a. p e 2 ) of (E (B E*)

/\ (E (B E*).)

(vi)

Choose an orientation of the vector space E and assume B admits partitions of unity. Show that the vector bundle atlas all of whose change of coordinate maps have positive determinant relative to the orientation of E. when restricted to the fiber. (Hint: If E is oriented and '!f: rr-I(V) ~ V' x E is a vector bundle chart with V' open in the model space of B. and V is connected in B. define : :M ~ 'T is called covariant if for every diffeomorphism ( is differentiable and :M, 'T are given suitable Banach space topologies.)


463

(ii) Show that if M is oriented, then the map g>-+ l1(g), the volume element of g, is covariant. Is the identity in (i) anything interesting? 6.5P Let X be a vector field density on the oriented n-manifold M; i.e., X = F ® 11, where F is a vector field and 11 is a density. Use Exercise 6.58 to define div X and to show it makes intrinsic sense. 6.5Q Show that an orientable line bundle over a base admitting partitions of unity is trivial. (Hint: Since the bundle is orientable there exist local charts which when restricted to each fiber give positive functions. Regard these as local sections and then glue.) 6.5R Let 1t: E

~

SI be a vector bundle with n-dimensional fiber. If E is orientable show that

it is isomorphic to a trivial bundle over SI. Show that if p : F

~

SI is a non-orientable

vector bundle with n-dimensional fiber and if E is non-orientable, then E and F are isomorphic. Conclude that there are exactly two isomorphism classes of vector bundles with n-dimensional fiber over SI. Construct a representative for the class corresponding to the non-orientable case. (Hint: Construct a non-orientable vector bundle like the Mobius band: the equivalence relation has a factor (-I)2n-1 if the dimension of the fiber is 2n-1 or 2n. To prove non-orientability, proceed as in 6.5L.) 6.5S Let {el' ... , en+tl be the standard basis of ~n+1 and 0n+1 = el induced volume form. On Sn define (On E on(sn) by

1\ ... 1\

en+1 be the

for s E sn, vI' ... , vn E Tssn. (i) Use Proposition 6.5.8 to show that (On is a volume form on Sn; (On is called the (ii)

standard volume form on sn. ~ IRn+1 \ (OJ be given by f(t, s) = ts, where IR + is defined to be the set {t E IR It> O}. Show that if IR + is oriented by dt, Sn by (On' and ~n+1

Let f: IR + x Sn

by 0,,+1' then (JI)(t. s) = tn. Conclude that f is orientation preserving.

Chapter 7 Integration on Manifolds The integral of an n-form on an n-manifold is defined by piecing together integrals over sets in ]Rn using a partition of unity subordinate to an atlas. The change-of-variables theorem guarantees that the integral is well defined. independent of the choice of atlas and partition of unity. Two basic theorems of integral calculus. the change-of-variables theorem and Stokes' theorem. arc discussed in detail along with some applications.

~7.1

The Definition of the Integral

The aim of this section is to define the integral of an n-form on an oriented n-manifold M and prove a few of its basic properties. We begin with a summary of the basic results in ~n. Suppose f:]Rn ~]R is continuous and has compact support. Then f dx \ ... dxn is defined to

f

be the Riemann integral over any rectangle containing the support of f.

7.1.1 Definition Let U C]Rn be open and

0) E

nn(u) have compact support. If, relative to the

standard basis of ]Rn.

ro(x) =

I

, 0 ) ; ... ;

n.

where the components of

1

0)

; ' (x) dx I 1\ ... 1\ dx'u =

n

0)\ ...

n(x) dx

\

n

1\'"

1\

dx •

are given by 0);1 ... i,,(X) =

O)(x)(e;I' ...• e;).

then we define

Recall that if ~ is any integrable function and f:]Rn change o/variables theorem states that ~ 0 f is integrable and

~]Rn

is any diffeomorphism. thc

Section 7.1 The Definition ofthe Integral

465

where n = dx l /\ ... /\ dxn is Ihe standard volume form on JRn, and Jnf is Ihe Jacobian determinant of f relative to n. This charge of variables Iheorem can be rephrased in terms of pull backs in Ihe following form. 7.1.2 Change of Variables in Rn Let U and V be open subsets of JRn and suppose f: U

-+ V is an orientation-preserving diffeomorphism. If co E nn(v} has compact support, then r*co E

nn(u} has compact support as well and

(2) Proof If co = COl ... ndxl /\ ... /\ dxn, Ihen r*co = (COl ... n 0 f)(Jnf)n, where the n-form n = dx l /\ ... /\ dxn is Ihe standard volume form on JRn. As discussed in §6.5, Jnf> O. Since f is a diffeomorphism, Ihe support of r* co is r-I(supp co), which is compact. Then from (I),

Suppose Ihat (U, cp) is a chart on a manifold M and CO E nn(M}. If supp CO

C

U, we

may form co I U, which has Ihe same support. Then CP.(co I U} has compact support, so we may state Ihe following. 7.3.13 Definition Let M be an orientable n-manifold with orientation [n]. Suppose co E nn(M} has compact support C C U, where (U, cp) is a positively oriented chart. Then we define

1

co = cp.( coIU}.

(ep)

7.1.4 Proposition Suppose co E nn(M} has compact support C

C

U

n V, where

(U, cp), (V,

"') are two positively oriented charts on the oriented manifold M. Then

r J(ep)

Proof By 7.1.2

I CP.(co I U) = I ('"

[Recall that for diffeomorphisms f* Thus we may define

0

co-

r

co

J('I')

cp-I}.cp.(CO I U). Hence

= (11)*

and (f 0 g)*

Iv co = I(ep) co, where

= f*

0

I CP.(co I U) = I 'I'*(co I U).

g*.] •

(U, cp) is any positively oriented chart

containing the compact support of CO (if one exists). More generally, we can define co has compact support as follows.

IM co

where

Chapter 7 Integration on Manifolds

466

7.1.5 Definition Let M be an oriented manifold and 5'l an atlas of positively oriented charts. Let P = {(V u'
fQro.

have fpro= The common vaLue is denoted f~ ro, and is called the integral of ro E nn(M). Proof For any m E M, there is a neighborhood V such that only a finite number of gu are nonzero on U. By compactness of supp ro, a finite number of such neighborhoods cover the support of ro. Hence only a finite number of gu are nonzero on the union of these V. For (ii), let P = {(Vu'
and ~u~l3guhl3(m) = 1, for all mE M. Since ~l3hl3 = 1, we get

7.1.7 Change of Variables Theorem Suppose M and N are oriented n-manifolds and f: M

~

N is an orientation-preserving diffeomorphism. If ro E nn(N) has compact support, then

r*ro has compact support and

I I ro=

N

~

(4)

f*ro.

Proof First, supp r* ro = f-I(supp ro). which is compact. To prove (2). let {(V j•
= LflRn(
0

II) *(gj 0 II)ro =

L

ro. •

1

This result is summarized by the following commutative diagram:


.:-

nn(M) .::;;::::=======~

7.1.8 Definition Let (M,Il) be a volume manifold. Suppose f support. Then we call fll the integral of f with respect to 11.

467

nn(N)

E

.'r(M) and f has compact

fM

The reader ean check that since the Riemann integral is IR-linear, so is the integral just defined. The next theorem will show that the integral defined by (4) ean be obtained in a unique way from a measure on M. (The reader unfamiliar with measure theory ean find the necessary background in Royden [1968]; this result will not be essential for future sections.) The integral we have described can clearly be extended to all continuous functions with compact support. Then we have the following.

7.1.9 Riesz Representation Theorem Let (M, 11) be a volume manifold. Let '13 denote the Borel sets of M, the o-agebra generated by the open (or closed, or compact) subsets of M. Then there is a unique measure mJ.L on '13 such that for every continuous function of compact support

(5) Proof Existence of such a measure mJ.L is proved in Royden [1968]. For uniqueness, it is enough to consider bounded open sets (by the Hahn extension theorem). Thus, let U be open in M, and let C u be its characteristic function. We construct a sequence of C'" functions of compact support CPn such that CPn .J.. C u ' pointwise. Hence from the monotone convergence theorem, fCPnll = fCPn

dmJ.L ~

fCu dmJ.L = mJ.L(U). Thus, mJ.L is unique. •

The space LP(M, 11), p

E

integrable. For p ~ 1, the norm

IR, consists of all measurable functions f such that I f IP is

II flip

=Cf I f 1PdmJ.L)l/p

makes U(M, 11) into a Banach space

(functions that differ only on a set of measure zero are identified). The use of these spaces in studying objects on M itself is discussed in §7.4. The next propositions give an indication of some of the ideas. If F: M ~ N is a measurable mapping and m M is a measure on M, then F*m M is the measure on N defined by F*mM(A) = mM(r-I(A». If F is bijective, we set F*(m N ) = (r-I)*m N . If f: M ~ lR is an integrable function, then fmM defined by

is the measure on M

468


for every measurable set A in M. 7.1.10 Proposition Suppose M and N are orientable n-manifolds with volwneforms

~M

and Let F be an orientation preserving C1

~N and corresponding measures mM and mN . diffeomorphism of M onto N. Then

(6) Proof Let f be any

C~

function with compact support on M. By 7.1.7,

J/dmN = =

!/~N

=

LF*(f~N)

=

L(f O

F)(J("'M>!lNl)~M

L(f O F)(J("'M''''N)F)dmM'

As in the proof of 7.1.9, this relation holds for f the characteristic function of F(A). That is,

In preparation for the next result, we notice that on a volume manifold (M, of any vector field X is orientation preserving for each t

E

~),

the flow F t

lR (regard this as a statement on the

domain of the flow, if the vector field is not complete). Indeed, since Ft is a diffeomorphism,

J,.,(Ft) is nowhere zero; since it is continuous in t and equals one at t = 0, it is positive for all t. 7.1.11 Proposition Let M be an orientable manifold with volume form ~ and corresponding measure mw Let X be a (possibly time-dependent) C 1 vector field on M withflow Ft. The following are equivalent (if the flow of X is not complete. the statements involving it are understood to /wld on its domain):

=0;

(i)

div,.,X

(ii)

J,.,Ft

(iii)

(iv)

Ft*ml-' = m,., for all t E lR; F t *~ = ~ for all t E lR;

(v)

fMfdml-'= fM(foFt)dml-' for all fE

= 1 for all

t

E

lR;

Ll(M,~) andall tE lR.

Proof Statement (i) is equivalent to (ii) by 6.5.19. Statement (ii) is equivalent to (iii) by (6) and to (iv) by definition. We shall prove that (ii) is equivalent to (v). If JI-'Ft = 1 for all t E lR and f is continuous with compact support, then


469

Hence, by uniqueness in 7.1.9, we have fMf dmll = fM(f 0 Ft)dm ll for all integrable f, and so (ii) implies (v). Conversely, if

then taking f to be continuous with compact support, we see that

Thus, for every integrable f, fM(f implies (ii). •

0

Ft)dm ll = fM(f

0

Ft)(IIlFt)dmw Hence JIlFt = I, and so (v)

The following result is central to continuum mechanics (see below and §8.2 for applications). 7.1.12 Transport Theorem Let (M,I1) be a volume manifold and X a vector field on M with flow Ft. For f E .1"(M x IR) and letting ft(m) = f(m, t), we have

(7) for any open set U in M. Proof By the flow characterization of Lie derivatives and 6.S.17(ii), we have

= F;(:

= F;[

11) + F;[(Lx ft)11 + ft(divIlX)I1]

(~~ + divll(ft X»)11J.

Thus by the change-of-variables formula,

7.1.13 Example Let p(x, t) be the density of an ideal fluid moving in a compact region of 1R3 with smooth boundary. One of the basic assumptions of fluid dynamics is conservation of mass: the mass of the fluid in the open set U remains unchanged during the motion described by a flow Ft' This means that

:J

F,{Ul

p(x, t)dx = 0

(8)


470

for aH open sets U. By the transport theorem, (8) is equivalent to the equotion of continuity

a;

+ div(pu)

=

(9)

0,

where u represents the velocity of the fluid particles. We shaH return to this example in §8.2. As another application of 7.1.11, we prove the foHowing.

Poincare Recurrence Theorem Let (M, Il) be a volume manifold, mil the corresponding measure, and X a time-independent divergence-free vector field with flow Ft' Suppose A is a measurable set in M such that mll(A) < 00, Ft(x) exists for all t E R if x E A, and Ft(A) C A. Thenfor each measurable subset B of A and T ~ 0, there exists S ~ T such that B n Fs(B) T- 0. Therefore, a trajectory starting in B returns infinitely often to B. 7.1.14

Proof By 7.1.11, the sets B, FT(B), F2T(B), ... aH have the same finite measure. Since mll(A)

< 00, they cannot aH be disjoint, so there exist integers k > ~ > 0 satisfying FkT(B) n Fn(B) 0. Since FkT =(FTt (as X is time-independent), we get F(k_t)T(B)

T-

n B T- 0 . •

The Poincare recurrence theorem is one of the forerunners of ergodic thoery, a topic that wiH be discussed briefly in §7.4. A related result is the foHowing. 7.1.15 Schwarzschild Capture Theorem Let (M, Il) be a volume manifold, X a time -

independent divergence-free vector field with flow F t , and A a measurable subset of M with finite measure. Assume that for every x E A, the trajectory t ...... Ft(x) exists for all t E R Then for almost all x E A (relative to mil) the following are equivalent: (i) Ft(x) E A for all t ~ 0; (ii)

Ft(x)

E

A for all t:5; O.

Proof Let AI = nt~Ft(A) be the set of points in A which have their future trajectory completely in A. Similarly, consider A2 = nt~cft(A). By 7.1.11, for any

which shows, by letting

1: ~

00,

1:

~ 0,

that

Reasoning similarly for A 2, we get Il(A I) = Il(A I

n A2) =Il(A 2 ), so that

Section 7.1 The Definition ofthe Integral Let S Al \ S

= (AI \(AI n A 2» U (A2 \(AI n A2»; then ml!(S) = 0 and = Al n A2 = A2 \ S which proves the desired equivalence. •

471

seA. Moreover, we have

So far only integration on orientable manifolds has been discussed. A similar procedure can be carried out to define the integral of a one-density (see §6.S) on any manifold, orientable or not.

The only changes needed in the foregoing definitions and propositions are to replace the Iacobians with their absolute values and to use the definition of divergence with respect to a given density as discussed in §6.S. All definitions and propositions go through with these modifications. F-valued one-forms and one-densities can also be integrated in the following way. If co = Eti=ICOifi' where fp ... , fn is an ordered basis of F, then we set fM co = Eti=l( fM coi)fi

E

F. It

is easy to sce that this definition is independent of the chosen basis of F and that all the basic properties of the integral remain unchanged.

On the other hand, the integraL of

vector-bundLe-vaLued n-forms on M is not defined unless additional special structures (such as

triviality of the bundle) are used. In particular, integration of vector or general tensor fields is not defined.

Exercises 7.1A Let M be an n-manifold and J.I. a volume form on M. If X is a vector field on M with flow Ft show that

(Hint: Compute (d/dt)Ft*J.I. using the Lie derivative formula.)

7.1 B Prove the following generalization of the transport theorem

where co t is a time-dependent k-form on M and V is a k-dimensional submanifold of M. 7.1C (i) Let q> : SI ~ SI be the map defined by q>(e i9 ) = e2i9 , where e

E

[0,21t]. Let, by

abuse of notion, de denote the standard volume of SI. Show that the following identity holds:

ISl q>*(d9) = 2ISl de.

(ii) Let q> : M ~ N be a smooth surjective map. Then q> called a k-Jold covering map if

every n E N has an open neighborhood V such that q>-I(V) = U I U ... U Uk' are disjoint open sets each of which is diffeomorphic by q> to V. Generalize (i) in the q>*co = co. following way. If co E nn(N) is a volume form, show that

IM

kIN


472

7.ID Define the integration of Banach space valued n·forms on an n·manifold M. Show that if the Banach space is lR '. you recover the coordinate definition given at the end of this section. If E. F are Banach spaces and A E L(E. F). define A* E L(Q(M. E). Q(M. F» by (A*a)(m)

= A(a(m».

Show that

(fM)

A*

0

= A (fM) on 0

QD(M. E).

7.IE Let M and N be oriented manifolds and endow M x N with the product orientation. Let PM : M x N -+ M and PN: M x N -+ N be the projections. If a E Qdirn M(M) and ~ E Qdirn N(N) have compact support show that

has compact support and is a (dim M + dim N)-form on M x N. Prove Fubini's Theorem

7.IF Fiber Integral Let cp: M -+ N be a surjective submersion. dim M = m and dim N = n. The map cp is said to be orientable if there exists 11 E QP(M). where p = m - n. such that for each YEN. jy *11 is a volume form on cp-l(y). where jy: cp-l(y) -+ M is the inclusion. An orientation of cp is an equivalence class of p-forms under the relation: 111 Tt2 iff there exists f E !F(M). f> 0 such that Tt2 = f Tt I. (i) If cp : M -+ N is a vector bundle. show that orientability of cp is equivalent to

orientability of the vector bundle as defined in Exercise 6.SM. (ii) If cp is oriented by Tt and N by ro. show that cp*ro 1\ 11 is a volume form on M. The orientation on M defined by this volume is called the local product orientation of M (compare with Exercise 6.SM(vi». (iii) Let

Q is a projection and apply the theorem of smoothness of the integral with respect to parameters.) Show that if q> : M

~

N is a locally trivial fiber bundle

with N paracompact. ffib is surjective. (v) Let ~ e n'(N) have compact support and let a e n./+p(M). Show that q>*~ 1\ a e n cpk+t+P(M) and that

1.

fib

(Hint: Let E = Ty *N

~ 1\ 1.

(q> *~ 1\ a) =

1\ ... 1\

a.

fib

Ty *N (k times) and let F be the wedge product ~ + k

times. Define A e L(E. F) by A(y) = ~(y) 1\ Y and show that (q>*~ 1\ a)y = A*(~) using the notation of Exercise 7.1D. Then apply ffib to this identity and use 7.1D). (vi) Assume N is paracompact and oriented. q> is oriented. and endow M with the local product orientation. Prove the following iteraJed integration (Fubini-type)formula

by following the three steps below. Step 1: Using a partition of unity. reduce to the case M = N x P where q> : M

~

N is

the projection and M. N. P are Euclidean spaces. Step2: Use (v) and Exercise 7.1E to show that for ~ E nn(N) and ye np(p) with compact support.

r r (~x y)

JNJfib

=

r (~x y)

JM

Step 3: Since M. N. and P are ranges of coordinate patches. show that any ro e nm(M) with compact support is of the form ~ x y. (vii) Let q>: M ~ N and q>': M' ~ N' be oriented surjective submersions and let f: M ~ M'. and fo: N ~ N' be smooth maps satisfying fo 0 q> = q>'

0

f. Show that

Jfib r* = 0

f'

fo * 0 f'fib' where fib denotes the fiber integral of q>'. (viii) Let q>: M ~ N be an oriented surjective submersion and assume X e X(M) and Y X(N) are q>-related. Prove that

E

(For more information on the fiber integral see Bourbaki [1971) and Greub. Halperin. and Vanstone [1973). Vol. I.)

474

Chapter 7 IntegratiDn on Manifolds

7.1G Let
0 c)(t)/dt]I t = b in general

480

Chapter 7 IntegratiJJn on Manifolds

points out of U /• as in Fig. 7.2.4. Therefore. TxM is isomorphic to the model space E of M even if x E aM (see Fig. 7.2.5). It is because of this result that tangent vectors are derivatives of Cl-curves defined on closed intervals. Had we defined tangent vectors as derivatives of C1-curves defined on open intervals. TyM for y E aM would be isomorphic to ker A. and not to E. In Fig. 7.2.4. E = JRD. A. is the projection onto the n-th factor. and c(t) is defined on a closed interval whereas the Cl-curve d(t) is defined on an open interval.

.....

",

T,M

T,8M

Figure 7.2.4 Having defined the tangent bundle. all of our previous constructions including tensor fields and exterior forms as well as operations on them such as the Lie derivative. interior product. and exterior derivative carry over directly to manifolds with boundary. One word of caution though: the fundamental relation between Lie derivatives and flows still holds if one is careful to take into account that a vector field on M has integral curves which could run into the boundary in finite time and with fmite velocity. (If the vector field is tangent to aM. this will not happen.)

Figure 7.2.S

481

Section 7.2 Stokes' Theorem

Next. we turn to the problem of orientation. As-for manifolds without boundary a volume

form on an n-manifold with boundary M is a nowhere vanishing n-form on M. Fix an orientation on JRn. Then a chart (U. O. then [cr(v i •...• v n)] = [cr(v l '. v 2 •...• v n)] = [cr(v i + vI" v 2 •...• v n)]; (ii) a l = O. then [cr(vi' ...• vn)] = [cr(v i + vI" v 2 •...• v n)] and (A)(V I'. v 2 •...• v n) = 0; (iii) -1 < a l < O. then [cr(v i •...• v o)] = [-<J(v I'. v 2 •...• v n)] = [cr(v i + vI" v 2 •.••• v n)]; (iv) a l = -1. then [cr(v i •...• v n)] = [- cr(v l '. v 2 •...• v n)] and (A)(V I + VI" v 2 •...• v n) = 0; (v) a l < -1. then [cr(v i •...• v n)] = [- cr(v l '. v 2 •...• v n)] = [- cr(v i + VI" v 2 •...• vn)]. Additivity is now checked in all five cases separately. For example. in case (iii) we have

»

(A)(V I + VI" v 2 •...• v n) = (m. [cr(v i + VI" v 2 •...• vn]' A(V I + VI" ...• v n = (m. [cr(v i •...• v n)]. (1 + al)A(v l •...• vn»

= (m.

»

[cr(vi' ...• v n)]. A(Vi' ...• v n

»

+ (m. [- cr(v l '. V2 ••.•• v n»). -I a l I A(Vi' .... v n = (A)(Vi' ...• vn) + (A)(V I'. V2 •...• vn)·

Thus has values in A n~(M). The map is clearly linear and injective and thus is an isomorphism of

I A(M) I with

An~(M). Denote also by the induced isomorphism of

I Q(M) I

488


with nn~(M). The integral of p E nniM) is defined to be the integral of the density ell-I(p) over M. In local coordinates the expression for ell is

where I;no is the basis element of the space sections of o(U) given by I;no(u)(vl' ...• vn) =(u. [o(vl' ...• vn)l. sign(det(vim. where (vi) are the components of the vector Vj in the coordinates (x I•...• xn) of U. Therefore

and

for any smooth functions a. b : U -+ JR. Finally. for the formulation of Stokes' Theorem. if i : dM -+ M is the inclusion and p E nn-I~(M). the induced twisted (n-1)-form i*p on dM is defined by setting (i*p)(m)(vl' ...• vn_l ) = (m. [sign[llnl 0(- d/dXn• v\, ...• vn_I)]. p'(m)(vl' ...• vn_I if v\, ...• vn_1 are linearly independent and setting it equal to zero. if vi' ...• vn-I are linearly dependent. where (x I •...• xn) is a coordinate system at m with dM described by xn = 0 and p(m)(vl' ...• vn_l ) = (m.

».

sign[llml. p'(m)(vl' ...• vn_ I » with p'(m) skew symmetric; moreover sign[llml = +1 (respectively -1) if [Ilml and [O(-d/dX n• v\, ...• vn_l)l define the same (respectively opposite) orientation of TmM. If M =U. where U is open in JR\ and p =a a~ E nn-\(U). then i*p = (-l)ni*(aa)l;n-l o. In particular. if n

~= L~idxl

/\ ... /\ (dx i)" /\ ... /\ dxn.

i=1

we have

With this observation. the proof of Stokes' Theorem 7.2.8 gives the following. 7.2.15 Nonorientable Stokes' Theorem Let M be a paracompact rwrwrientable n-manifold with smooth boundary dM and pEnn-I ~(M). a twisted (n-1 )-form with compact support. Then

r i*p. J.Mdp = JaM

489

Section 7.2 Stokes' Theorem

The same statement holds for vector-valued twisted (n-I)-forms and all corollaries go through replacing everywhere (n-I)-forms with twisted (n-I)-forms. For example, we have the following. 7.2.16 Non-Orientable Gauss Theorem Let M be a nonorientable Riemannian n-manifold

with associated density ~M' Thenfor X

E

X(M) with compact support

f. div(X)~M f M

=

aM

(X· n)!laM

where n is the outward unit normal of aM, !laM is the induced Riemannian density of aM and LX!lM = (div X)!lM' For a concrete situation in lR3 involving these ideas, sec Exercise 7.31

$:n

~ SUPPLEMENT 7.2B Stokes' Theorem on Manifolds with Piecewise Smooth Boundary

The statement of Stokes' theorem we have given does not apply when M is, say a cube or a cone, sinee these sets do not have a smooth boundary. If the singular portion of the boundary (the four vertices and 12 edges in case of the cube, the vertex and the base circle in case of the cone), is of Lebesgue measure zero (within the boundary) it should not eontribue to the boundary integral and we can hope that Stokes' theorem still holds. This supplement discusses such a version of Stokes' theorem inspired by Holmann and Rummier [1972]. (See Lang [1972] for an alternative approach.) First we shall give the definition of a manifold with piecewise smooth boundary. A glance at the definition of a manifold with boundary makes it clear that one could define a manifold with

corners, by choosing charts that make regions near the boundary diffeomorphic to open subsets of a finite intersection of positive closed half-spaces. Unfortunately, singular points on the boundary-osuch as the vertex of a cone--need not be of this type. Thus, instead of trying to classify the singular points up to diffeomorphism and then make a formal intrinsic definition, it is simpler to consider manifolds already embedded in a bigger manifold. Then we can impose a condition on the boundary to insure the validity of Stokes' theorem. 7.2.17 Definition Let U C lR n- 1 be open and f: U ~ lR be continuous. A point p on the graph of f, r f = {(x, f(x» I x E U}, is called regular if there is an open neighborhood V of P such that V () r f is an (n - 1) dimensional smooth submanifold of V. Let Pf denote the set of regular points. Any point in <Jf = r f \ Pf is called singular. The mapping f is called piecewise smooth if Pf is Lebesgue measurable, 1t(<Jf ) has measure zero in U (where 1t: U x lR ~ U is the projection) and f !1t(<Jf ) is locally Holder; i.e.Jor each compact set K C 1t(<Jf ) there are

490


constants c(K) > 0, 0 < a(K) ~ 1 such that I f(x) - f(y) I ~ c(K) II x - Y 1I'1(K) for all x, y E K. Note that Pr is open in rr and that Int(rn U Pr' where rr-

= {(x, y) E

U x JR.I y ~

f(x)}, is a manifold with boundary Pr' Thus Pf has positive orientation induced from the standard orientation of JR.n. This will be called the positive orientation of r f. We are now ready to define manifolds with piecewise smooth boundary. 7.2.18 Definition Let M be an n-manifold. A closed subset N of M is said to be a manifold

with piecewise smooth boundary iffor every pEN there exists a chart (U, 0

II x - y lin for x. y E 7t(K}. We can assume k

~ 1 without loss of I generality. In each of the sets 7t- (C j} = C j x JR. choose a box Pj with base Cj and height

such that I f(x} - f(y} I':; k

(2k2}l/n such that 7t(K) is covered by parallelipeds Pj' with the same center as Pj and edge lengths equal to two-thirds of the edge lengths of Pj' Let V = U j=l ..... LPj' Then 7t(V} = U j=l ..... LCj and since at most 2n of the Pj intersect vol(V} and

= 2k22n vol(7t(V}} .:; 2n+1k20':; 2n+1ko

494


q(V

n Pr) ~ o.

By the previous lemma. for each Pi there is a COO function 0 are constants. Let U£ and 0 and a + xy = a' + x' y') or (y = y' = 0 and a = a', x = x')]. Show that P is a Hausdorff two dimensional manifold, ap oF

0, and ap is a disjoint union of uncountably many copies of lR. Show that P is not paracompact. 7.2U In the notation of Supplement 7.2C, verify that a

0

a=

o.

~7.3

The Classical Theorems of Green, Gauss, and Stokes

This section obtains these three classical theorems as a consequence of Stokes' theorem for differential forms. We begin with Green's theorem, which relates a line integral along a closed piecewise smooth cW"Ve C in the plane 1R2 to a double integral over the region D enclosed by C. (Piecewise smooth means that the cW"Ve C has only finitely many comers.) Recall from advanced calculus that the line integral of a one-form [a, b] ~ 1R2 is defined by

0)

=P dx + Q dy

along a cW"Ve C parametrized by y:

7.3.1 Green's Theorem Let D be a closed bounded region in 1R2 bounded by a closed positively oriented piecewise smooth curve C. (Positively oriented means the region D is on your left as you traverse the curve in the positive direction.) Suppose P: D CI. Then

~ IR

and Q : D

~ IR

are

Proof We assume the boundary C = dD is smooth. (The piecewise smooth case follows from the generalization of Stokes' theorem outlined in Supplement 7.28). Let 0) = P(x, y)dx + Q(x, y)dy E nl(D). Since dO) = (dQ/dx - dP/dy) dx 1\ dy and the measure associated with the volume dx 1\ dy on 1R2 is the usual Lebesgue measure dx dy, the formula of the theorem is a restatement of Stokes' theorem for this case. _ This theorem may be phrased in terms of the divergence and the outward unit normal. If C is given parametrically by t 1-+ (x(t), y(t», then the outward unit normal is n = (y'(t), - x'(t))/J x'(t)2 + y'(t)2

and the infinitesimal arc-length (the volume element of C) is ds =

(1)

Jx'(t)2 + y'(t)2 dt.

(See Fig.

7.3.1.) If X = Pd/dx + Qd/oy E 3t(D), recall that div X = *d*)(f> = dP/dx + dQ/dy. 7.3.2 Corollary Let D be a region in 1R2 bounded by a closed piecewise smooth curve C. If X 3t(D), then

E

Ie (X· n) ds = I ID (div X) dx dy.

Section 7.3 The Classical Theorems o/Green, Gauss, and Stokes

where

505

Ie f ds denotes the line integral of the function f over the positively oriented curve C and

X • n is the dot product.

/)

Figure 7.3.1 Proof Using formula (I) for n we have

fe

(X • n) ds =

f

[P(x(t)y(t»y'(t) - Q(x(t), y(t»x'(t)]dt =

f/

dy - Q dx

(2)

by the definition of the line integral. By 7.3.1, (2) equals

Taking P(x, y) = x and Q(x, y) = y in Green's theorem, we get the following. 7.3.3 Corollary Let D be a region in

JR.2

area of D is given by area(D) =

bounded by a closed piecewise smooth curve C. The

±Ie

x dy - y dx.

The classical Stokes theorem for surfaces relates the line integral of a vector field around a simple closed curve C in

JR.3

to an integral over a surface S for which C =

advanced calculus that the line integral of a vector field X in is defined by

LX. ds

=

f

JR.3

X(o(t» • o'(t)dt.

over the curve

as. 0:

Recall from [a, b] ~ JR.3

(3)

The surface integral of a compactly supported two-form co in JR.3 is defined to be the integral of the pull-back of co to the oriented surface. If S is an oriented surface, n is called the outward unit normal at XES if n is perpendicular to TxS and (n, el' e 2 ) is a positively oriented basis of JR.3 whenever (e l , e 2 ) is a positively oriented basis of TxS. Thus S is


506

orientable iff the normal bundle to S, which has one-dimensional fiber, is trivial. In particular, the area element v of S is given by 6.5.8. That is (4)

and ~

=0

IsW in a form familiar from vector calculus.

dx /\ dy /\ dz. We want to express

P dy /\ dz + Q dy /\ dz + R dx /\ dy so that w =0

x Recall that a /\ *P

=0

=0

* Xb,

Let

W =0

where

a a a Pax +Qay +R az '

(a, P> ~ so that letting a

=0

n b, and P =0 Xb, we get n b /\ *Xb

=0

(X • n)~.

Applying both sides to (n, vi' v2 ) and using (2) gives

(5) (the base point x is suppressed). The left side of (4) is *Xb( vi' v2 ) since n b is one on n and zero on

VI

and v 2 . Thus (4) becomes

* Xb Therefore

=0

(X • n)v .

(6)

Is w Is (X • n) dS Is X • dS, =

=

where dS, the measure on S defined by v, is identified with a surface integral familiar from vector calculus. A physical interpretation of

Is (X· n)dS

may be useful. Think of X as the velocity field

of a fluid, so X is pointing in the direction in which the fluid is moving across the surface S and X· n measures the volume of fluid passing through a unit square of the tangent plane to S in unit

Is (X· n)dS is the net quantity offluidflowing across the sUrface per unit time, i.e., the rate offluidflow. Accordingly, this integral is also called the flux of X across the time. Hence the integral

surface. 7.3.4 Classical Stokes Theorem Let S be an oriented compact surface in IR3 and X a C l

vector field defined on S and its boundary. Then

r (curl X) • n dS

Js

=0

J as

X· ds.

where n is the outward unit normal to S (Fig. 7.3.2). Proof First extend X via a bump function to all of 1R3 so that the extended X still has compact

Section 7.3 The Classical Theorems of Green, Gauss, and Stokes

support. By definition,

Ias X • ds = Ias Xb

by the standard metric in

]R3.

where b denotes the index lowering action defined

= *(curl

But dXb

507

X)b (sec 6.4.3(C» so that by (3), (6), and

Stokes' theorem,

f

as

X· ds

x

Figure 7.3.2 7.3.5 Examples A The historical origins of Stokes' formula (6) arc connected with Faraday's law, which is

discussed in Chapter 8 and examble B below. In fluid dynamics, (5) is related to Kelvin's circulation theorem, to be discussed in §8.2. Here we concentrate on a physical interpretation of the curl operator. Suppose X represents the velocity vector field of a fluid. Let us apply Stokes' theorem to a disk Dr of radius r at a point P

f

aD,

X· ds

=

f

aD,

E ]R3

(Fig. 7.3.3). We get

(curl X) • n ds

= (curl X • n)(Q)1tf2,

the last equality coming from the mean value theorem for integrals; here Q E Dr is some point given by the mean value theorem and 1tf2 is the area of Dr' Thus ((curl X) • n)(P)

=

lim _1_ r-->O 1tr2

The number

Ie X • ds

f

aD,

X· ds.

(7)

is called the circulation of X around the closed curve C. It

represents the net amount of turning of the fluid in a counterclockwise direction around C.

,~" r

Figure 7.3.3


508

Formula (7) gives the following physical interpretation for curl X, namely: (curl X) • n is the circulation of X per unit area on a sUrface perpendicular to n. The magnitude of (curl X) • n

is clearly maximized when n = (curl X)/II curl X II. Curl X is called the vorticity vector. B One of Maxwell's equations of electromagnetic theory states that if E(x, y, z, t) and H(x, y, z) represent the electric and magntic fields at time t, then V x E

=- aH/at,

where V x

E is computed by holding t fixed, and aH/at is computed by holding x, y, and z constant. Let us use Stokes' theorem to determine what this means physically. Assume S is a surface to which Stokes' theorem applies. Then

(The last equality may be justified if H is ct.) Thus we obtain (8)

Equality (8) is known as Faraday's law. The quantity

fas E • ds

represents the "voltage" around

as, and if as were a wire, a current would flow in proportion to this voltage. Also

fs H • dS

is

called theflux of H, or the magnetic flux. Thus, Faraday's law says that the voltage around a loop equals the negative of the rate of change of magnetic flux through the loop.

C Let X E X(1R 3 ). Since

is contractible, the Poincare lemma shows that curl X = 0

]R3

iff X = grad f for some function f

E

.'F(]R3).

This in tum is equivalent (by Stokes' theorem) to

fe

either of the following: (i) for any oriented simple closed curve C, X • ds = 0, or (ii) for any oriented simple curves C t , C2 with the same end points Ie 1X • ds = Ie 2 X • ds. The function f can be found in the following way:

f(x, y, z) =

I

x

0

X t (t, 0, O)dt +

lY X (x, t, O)dt + lZX (x, y, t)dt. 0

2

0

3

Thus, for example, if

X

then curl X =

°

a a a y ax + (z cos(yz) + x) ay + y cos(yz) az'

and so X = grad f, for some f. Using the formula (9), one finds f(x, y, z) = xy + sin yz.

D The same arguments apply in

X

]R2

by the use of Green's theorem. Namely if

(9)

Section 7.3 The Classical Theorems o/Green, Gauss, and Stokes

509

then X = grad f, for some f E .'J(JR.2) and conversely. E The following statement is again a reformulation of the Poincare lemma: let X then div X = 0 iff X = curl Y for some Y 7.3.6 Classical Gauss Theorem

E

E

X(JR.3) ,

X(lRh •

Let 0 be a compact set with nonempty interior in JR.3

bouruied by a surface S that is piecewise smooth. Thenfor X a C 1 vector field on 0 US,

fn (div

fs (X

X) dV =

0

(10)

n)dS,

where dV denotes the staruiard volume element (Lebesgue measure) in JR.3 (Fig. 7.3.4). Proof Either use 7.2.10 or argue as in 7.3.4. By (6),

By Stokes' theorem, this equals

since d*Xb = (div X) dx /\ dy /\ dz. •

X

n

Figure 7.3.4 7.3.7 Example

We shall use the preceding theorem to prove Gauss' law

1n

ron -dS r3

-

j

41t, if 0 E 0

0,

if 0

where 0 is a compact set in JR.3 with nonempty interior.

~0

ao

(11)

is the surface bounding O. which


510

is assumed to be piecewise smooth. n is the outward unit normal. 0 i"

n.

If 0 i"

o inside n

(In.

and where

apply 7.3.6 and the fact that div(r/r3) = 0 to get the result. If 0 E

n.

surround

by a ball DE of radius £ (Fig. 7.3.5). Since the orientation of (l0E induced from

\ DE is the opposite of that induced from DE (namely it is given by the

n

inward unit normal). Gauss'

theorem gives (12)

since 0 i"

n \ DE

and thus on

JaD since

J

n \ DE'

div(r/r3) = O. But on (l0E' r = £ and n = -r/£. so that

J

E

2 1 2 P3ndS=£4dS =--41t£ =-41t r aDE £ £2

dS = 41t£2. the area of the sphere of radius £.

aDE

aD,.--....-.---.•

~

•

. . . . . . t.:. . .. .•....

Figure 7.3.5 In electrostatics Gauss' law (II) is used in the following way. The potential due to a point E JR.3 is given by q/(4m). r = (x 2 + y2 + z2)l!2. The corresponding electric field is defined to be minus the gradient of this potential; i.e .. E = qr/(4m3). Thus Gauss' law states that charge q at 0

fanE. n

the total electricjlux dS equals q if 0 E n and equals zero, if 0 i" n. A continuous charge distribution in n described by a charge density p is related to E by P

fan

fn

= div E. By Gauss' theorem the electric flux E • n dS = p dV. which represents the total charge inside n. Thus. the relationship div E = P may be phrased as follows. The flux out of II swface of an electric field equals the total charge inside the swface.

Section 7.3 The Classical Theorems of Green, Gauss, and Stokes

511

Exercises 7.3A Use Green's theorem to show that (i) The area of the ellipse x 2/a2 + y2jb2 == 1 is 1tab. (ii) The area of the hypocycloid x == a cos 3 9, y == b sin 3 9 is 31ta2/8. (iii) The area of one loop of the four-leaved rose r == 3 sin 29 is 91t/8. 7.3B Why does Green's theorem fail in the unit disk for -y dx/(x 2 + y2) + X dy/(x 2 + y2)? 7.3C For an oriented surface S and a fixed vector a, show that

2

r a. n dS == Jras(a x r). dS.

Js

7.3D Let the components of the vector field X

E

3\(]R3) be homogeneous of degree one; i.e., X

satisfies Xi(tx, ty, tz) == tXi(x, y, z), i == 1,2, 3. grad f, where f == (xXi + yX 2 + zX3)/2. 7.3E Let S be the surface of a region

n

Show that if curl X == 0, then X ==

in ]R3. Show that

volume(n) = (1/3)

Lr . n dS.

Give an intuitive argument why this should be so. (Hint: Think of cones.) 7.3F Let S be a closed (i.e., compact boundaryless) oriented surface in ]R3. (i) Show in two ways that fs(curl X) • n dS == O. (ii) Let X = 3\(S) and f E :J(S). Make sense of and show that

Is (grad DX • dS -Is(f curl X)dS =

where grad, curl, and div are taken in ]R3. 7.3G Let X and Y be smooth vector fields on an open set DC ]R3, with aD smoolh, and c1(D) compact. Show that

r Y • curl X dV Jr X • curl Y dV + Jr (X x Y) • n

JD

=

D

dS.

aD

where n is the outward unit normal to dD and dS the induced surface measure on dD. (Hint: Show that y. curl X - X • curl Y == div(X x Y).)

512


7.3H If C is a closed curve bounding a surface S show that f/(grad g) • ds =

Is (grad f x grad g) • n dS

= -

fcg(grad 0 • ds

where f and g are C2 functions. 7.31 (A. Lenard) Faraday's law relates the line integral of the electric field around a loop C to the surface integral of the rate of change of the magnetic field over a surface S with boundary C. Regarding the equation V x E = - dH/dt as the basic equation, Faraday's law is a

consequence of Stokes' theorem, as we have seen in Example 7.3.5B. Suppose we are given electric and magnetic fields in space that satisfy the equation V x E = - dH/dt. Suppose C is the boundary of the Mobius band shown in Fig. 6.5.1. Since the Mobius band cannot be oriented, Stokes' theorem does not apply. What becomes of Faraday's law? Resolve the issue in two ways: (i) by using the results of Supplement 7.2A or a direct reformulation of Stokes' theorem for nonorientable surfaces, and (ii) realizing C as the boundary of an orientable surface. If dH/dt is arbitrary, in general does a current flow around C or not?

~7.4

Induced Flows on Function Spaces and Ergodicity

This section requires some results from functional analysis. Specifically we shall require a

knowledge of Stone's theorem and self-adjoint operators. The required results may be found in Supplement 7.4A and 7.4B at the end of this section. Flows on manifolds induce flows on tangent bundles, tensor bundles, and spaces of tensor fields by means of push-forward. In this section we shall be concerned mainly with the induced flow on the space of functions. This induced flow is sometimes called the Liouville flow. Let M be a manifold and Jl a volume element on M; i.e., (M, Jl) is a volume manifold. If F t is a (volume-preserving) flow on M, then F t induces a linear one-parameter group (of isometrics) on the Hilbert space H = L2(M Jl) by

The association of V t with F t replaces a nonlinear finite-dimensional problem with a linem

infinite-dimensional one. There have been several theorems that relate properties of Ft and V t . The best known of these is the result of Koopman [1931], which shows that V t has one as a simple eigenvalue for all t if and only if F t is ergodic. (If there are no other eigenvalues, then F t is called weakly mixing.) A few basic results on ergodic theory are given below. We refer the reader to the excellent texts of Halmos [1956], Arnold and Avez [1967], and Bowen [1975) for more information. We shall first present a result of Povzner [1966], which relates the completeness of the flow of a divergence-frce vector field X to the skew-adjointness of X as an operator. (The hypothesis of divergence free is removed in Exercises 7.4A-C.) We begin with a lemma due to Ed Nelson. 7.4.1 Lemma Let A be an (unbounded) self-adjoint operator on a complex Hilbert space '}{. Let Do

C

D(A) (the domain of A) be a dense linear subspace of '}{ and suppose V t = e itA (the

unitary one-parameter group generated by A) leaves Do invariant. Then Ao:= (A restricted to Do) is essentially self-adjoint; that is, the closure of

Ao

is A.

Proof Let A denote the closure of Ao. Since A is closed and extends A o' A extends A. We

need to prove that A extends A. For A> 0, A - iA is surjective with a bounded inverse. First of all, we prove that A - iAo has dense range. If not, there is a v

E

'}{

such that

(v, AX - iAox) = 0 for all x E Do.

Chapter 7 IntegratWn on Manifolds

514

In particular, since Do is VI-invariant,

so

Since Do is dense, this holds for all x E J{. Thus II VI II = 1 and A. > 0 imply v = O. Therefore (A. - iAu)-' makes sense and (A. - iAt' is its closure. It follows from Supplement 7.4A that A is the closure of Au (see 7.4.15 and the remarks following it) .• 7.4.2 Proposition

Let X be a

C~

divergence-free vector field on (M, 11) with a complete flow ~c

Fl' Then iX is an essentially self-adjoint operator on in the complex Hilbert space L 2(M, 11).

= ~ functions with compact support

Proof Let Vtf = f 0 F-I be the unitary one-parameter group induced from Fl' A straightforward convergence argument shows that V/ is continuous in t in L2(M,I1). In Lemma 7.4.1, choose Do = C~ functions with compact support. This is clearly invariant under VI' If f E Do' then

.!V dt If I

1=0

=.!f dt 0 F-I

I 1=0

= -

df· X '

so the generator of VI is an extension of -X (as a differential operator) on Do. The corresponding essentially self-adjoint operator is therefore iX . • Now we prove the converse of 7.4.2. That is, if iX is essentially self-adjoint, then X has an almost everywhere complete flow. This then gives a functional-analytic characterization of completeness. 7.4.3 Theorem Let M be a manifold with a volume element 11 and X be a c~ divergence-free vector field on M. Suppose that, as an operator on L2(M, 11), iX is essentially self-adjoint on the c~ functions with compact support. Then, except possibly for a set of points x of measure zero, the flow FI(x) of X is defined for all t E R

We shall actually prove that, if the defect index of iX is zero in the upper half-plane (i.e., if (iX + i)(C~c) is dense in L2), then the flow is defined, except for a set of measure zero, for all t> O. Similarly, if the defect index of iX is zero in the lower half-plane, the flow is essentially complete for t < O. The converses of these more general results can be established along the lines of the proof of 7.4.1. Proof (E. Nelson-private communication) Suppose that there is a set E of finite positive measure such that if x E E, FI(x) fails to be defined for t sufficiently large. Let Er be the set of x E E

Section 7.4 Induced Flows on Function Spaces and Ergodicity

for which Ft(x) is undefined for t ~ T. Since E Replacing E by

~,

515

=

UT:1:i Ep some ET has positive measure. we may assume that all points of E "move to infinity" in a time '5T.

If f is any function on M, we adopt the convention that f(Ft(x» = 0 if Ft(x) is undefined. For any x

M, and t < -T, Ft(x) must be either in the complement of E or

E

undefined; otherwise it would be a point of E that did not move to infinity in time T. Hence XE(Ft(x» = 0 for t < - T, where XE is the characteristic function of E. We now define a function on M by

Note that the integral converges because the integrand vanishes for t < - T. In fact, we have

Moreover, g is in L2. Indeed, because F t is measure-preserving, where defined, denoting by II 112 the L 2 norm, we have II XE ° F, 112 '5 II XE 11 2, so that

IIglb '5

i~

e--'tIiXEoF,lbd't '5 IIXElbeT .

-T

The function g is nonzero because E has positive measure. Fix a point x this case F/Ft(x»

E

M. Then Ft(x) is defined for t sufficiently small. It is easy to see that in

and Fnt(x) are defined or undefined together, and in the former case they are

equal. Hence we have XE(F,(Ft(x» = XE(Fnt(x»

for t sufficiently small. Therefore, for t

sufficiently small

Now if
O

f f~ g(x)

B then B*::::> A *. If A is everywhere defined and bounded, it follows from the Riesz representation theorem

(Supplement 2.2A) that A* is everywhere defined; moreover it is not hard to see that, in this case, IIA*II=IIAII. An operator A is symmetric (Hermitian in the complex case) if A * ::::> A; i.e., (Ax, y) = (x, Ay) for all x, y E DA' If A* = A (this includes the condition DA' = D A)' then A is called

self-adjoint. An everywhere defined symmetric operator is bounded (from the closed graph theorem) and so is self-adjoint. It is also easy to see that a self-adjoint operator is closed. One must be aware that, for technical reasons, it is the notion of self-adjoint rather than symmetric, which is important in applications. Correspondingly, verifying self-adjointness is often difficult while verifying symmetry is usually trivial. Sometimes it is useful to have another concept at hand, that of essential self-adjointness. First, it is easy to check that any symmetric operator A is closable. The closure A is easily seen to be symmetric. One says that A is essentially self-adjoint when its closure A is self-adjoint. Let A be a self-adjoint operator. A dense subspace C CHis said to be a core of A if C C D A and the closure of A restricted to C is again A. Thus if C is a core of A one can recover A just be knowing A on C. We shall now give a number of propositions concerning the foregoing concepts, which arc useful in applications. Most of this is classical work of von Neumann. We begin with the following. 7.4.12 Proposition Let A be a closed symmetric operator of a complex Hilbert space H. If A is self-adjoint then A + AI is surjective for every complex number A with 1m A of- 0 (I is the

identity operator). Conversely, if A is symmetric and A - iI and A + iI are both surjective then A is self-adjoint. Proof Let A be self-adjoint and A = a + i~,

~ of-

O. For xED A we have

II(A + A)X 112 = II(A + a)x 112 + i~(x, Ax) - i~(Ax, x) + ~2 II X 112 = II(A + a)x 112 + ~211 x If ~ ~2 II

X

11 2,

where A + A means A + AI. Thus we have the inequality II(A + A)X II

~

I 1m A III x II.

(1)

Since A is closed, it follows from (1) that the range of A + A is a closed set for 1m A of- O. Indeed, let Yn = (A + A)X n ~ y. By the inequality (1), II xn - xm 11:0; II Yn - Ym II II 1m AI so xn converges to, say x. Also AXn converges to y - Ax; thus x E DA and y - Ax = Ax as A is

522


closed. Now suppose y is orthogonal to the range of A + AI. Thus (Ax + Ax, y) = 0 for all xEDA' or (Ax, y) = - (x, Ay). By definition, yE DA* and A*y=- T y; since A=A*, yE DA and Ay=- Ty,weobtain (A + AI)y = O. Thus the range of A + AI is all of H. Conversely, suppose A+i and A-i are onto. Let yE DA*. Thus for all XE DN «A + i)x, y) = (x, (A * - i)y) = (x, (A - i)z) for some zED A since A - i is onto. Thus «A + i)x, y) = «A + i) x, z) and it follows that y = z. This proves that DA* C D A and so DA = D A*. The result follows . • If A is self-adjoint then for 1m A ~ 0, AI - A is onto and from (1) is one-to-one. Thus (AI - A)-I: H ~ H exists, is bounded, and we have

II (AI-Af 1 11$_1_. IImAI

(2)

This operator (AI - A)-l is called the resolvent of A. Notice that even though A is an unbounded operator, the resolvent is bounded. The same argument used to prove 7.4.12 shows the following. 7.4.13 Proposition A symmetric operator A is essentially self-adjoint iff the ranges of A + iI and A - iI are dense. If A is a (closed) symmetric operator then the ranges of A + iI and A - iI are (closed)

subspaces. The dimensions of their orthogonal complements are called the deficiency indices of A. Thus 7.4.12 and 7.4.13 can be restated as: a closed symmetric operator (resp., a symmetric operator) is self-adjoint (resp., essentially self-adjoint) iff it has deficiency indices (0,0). If A is a closed symmetric operator then from (I), A + iI is one-to-one and we can consider the inverse (A + il)-l, defined on the range of A + iI. One calls (A - iI)(A + il)-l the Cayley transform of A. It is always isometric, as is easy to check. Thus A is self-adjoint iff its Cayley transform is unitary. Let us return to the graph of an operator A for a moment. The adjoint can be described entirely in terms of its graph and this is often convenient. Define an isometry l: H (fJ H ~ H (fJ H by lex, y) = (-y, x); note that j2 =-1.


523

7.4.14 Proposition Let A be densely defined. Then (r A).l = J(r A+) and -r A+ = J(r A).l. In

particular, A * is closed, and if A is closed, then

where H EB H carries the usual inner product: «xI' x), (Yl' Y2» = (xl' YI) + (x 2' Y2)' Proof Let (Z,y)E J(rA+), so yE D A+ and z=-A*y. Let XE DN Wehave «x, Ax), (-A *y, y» = (x, -A *y) + (Ax, y) = 0, and so J(rA+)crA.l. Conversely if (z, y) E (rA).l, then (x, z) + (Ax, y) = 0 for all XED A' Thus by definition, y E D A+ and z = -A *y. This proves the opposite inclusion. _ Thus if A is a closed operator, the statement H EB H = r A EB J(r A+) means that given e, f E H, the equations x - A *y = e and Ax + y = f have exactly one solution (x, y). If A is densely defined and symmetric, then its closure satisfies A

C

A * since A * is closed. There are other important consequences of 7.4.14 as well.

7.4.15 Corollary For A densely defined and closeable, we have (i)

A =A**,

Suppose A: DA

and

C

(ii)

A*=A*.

H ~ H is one-to-one. Then we get an operator A-I defined on the

range of A. In terms of graphs:

where K(x, y) = (y, x); note that K2 = I, K is an isometry and KJ = -JK. It follows for example that if A is self-adjoint, so is A-I, since


524

Next we consider possible self-adjoint extensions of a symmetric operator. 7.4.16 Proposition Let A be a symmetric densely defined operator on H and A its closure. The following are equivalent: (i) A is essentially self-adjoint. (ii) A* is self-adjoint. (iii) A** ::J A*. (iv) A has exactly one self-adjoint extension. (v) A=A*.

Proof By definition, (i) means A* = A. But we know A* = A * and A = A ** by 7.4.15. Thus (i), (ii), (v) are equivalent. These imply (iii). Also (iii) implies (ii) since A cAe A * c A ** = A and so A * = A **. To prove (iv) is implied let Y be any self-adjoint extention of A. Since Y is closed, Y::J A. But A = A * so Y extends the self-adjoint operator A *; i.e., Y::J A. Taking adjoints, A * = A::J Y* = Y so Y = A. To prove that (iv) implies the others is a bit more complicated. We shall in fact give a more general result in 7.4.18 below. First we need some notation. Let 0+ = range(A + iI)-L CHand 0_ = range(A - ilf C H called the positive and negative defect spaces. Using the argument in 7.4.12 it is easy to check that

7.4.17 Lemma Using the graph norm on DA*' we have the orthogonal direct sum

°

° °

Proof Since 0+, 0_ are closed in H they are closed in A *. Also A c A * is closed since A * is an extention of A and hence of A. It is ea~y to see that the indicated spaces are orthogollal. Forexamplelct XE DA and yE 0_. Then using the inner product «x, y» = (x, y) + (A *x, A *y) gives «x, y» = (x, y) + (A *x, -iy) = (x, y) - i( A *x, y). Since x E DA = D A *, by 7.4.16(v), we get «x, y» = (x, y) - i(x, A *y) = (x, y) - (x, y) = o.

Section 7.4 Induced Flows on FunctUm Spaces and Ergodicity

525

To see that D A " = DA EEl D+ EEl D_ it suffices to show that the orthogonal complement of DA EEl D+ EEl D_ is zero. Let UE (DAEElD+EElD_).L, so «u, x» = «u, y» = «u, z» = 0 for all XE DA, yE D+, ZE D_. From «u,x»=O we get (u,x)+(A*u,A*x)=O or A*UE D A" and A * A *u = - u. It follows that (I - iA *)u ED +. But from «u, y» = 0 we have «(1iA *)u, y) = 0 and so (1- iA *)u = O. Hence u

E

D_. Taking Z = u gives u = O. ,.

7.4.18 Proposition The self·adJoint extensions of a symmetric densely defined operator A (if any) are obtained as follows. Let T : D+ ~ D_ be an isometry mapping D+ onto D_ and let

r TeD+

EEl D_ be its graph. Then the restriction of A * to D A EEl r T is a self·adJoint extension of A. Thus A has self-adjoint extensions iff its defect indices (dim D+' dim D_) are equal and these extensions are in one-to-one correspondence with all isometries of D+ onto D_. Assuming this result for a moment, we give the following. Completion of Proof of 7.4.16 If there is only one self-adjoint extension it follows from 7.4.18 that D+=D_= {OJ soby7.4.13 A is essentially self-adjoint. _ Proof of 7.4.18 Let B be a self-adjoint extension of A. Then A* = A*::::> B so B is the restriction of A * to some subspace containing D A' We want to show that these subspaccs are of the form D A EEl r T as stated. Suppose first that T: D+ ~ D_ is an isometry onto and let DA EEl

r T"

First of all, one proves that

A

A

be the restriction of A * to

is symmetric: i.e., for u, xED A and v, y E D+ that

(Ax + A *y + A *Ty, u + v + Tv) = (x + y + Ty, Au + A *v + A *Tv). This is a straightforward computation using the definitions. To show that

A

is self-adjoint, we show that DA"

a nonzero ZEDA" such that either

A

*z = iz or

C

DA' If this does not hold there exists

A *z = -iz.

This follows from Lemma 7.4.17

applied to the operator A. (Observe that A is a closed operator--this easily follows.) Now A ::::>A so A*::::>A*. Thus ZE D+ or ZE D_. Suppose ZE D+. Then z+TzE DA so as «DA' z» = 0 «(,» denotes the inner product relative to A),

o=

«z + Tz, z» =

«;~,

z» + «Tz, z» = 2(z, z),

since Tz E D_. Hence z = O. In a similar way one sees that if Z E D_ then z = O. Hence

A

is self-adjoint. We will leave the details of the converse to the reader (they are similar to the foregoing). The idea is this: if A is restriction of A* to a subspace DA ED V for V C D+ ED D_ and A is

526


symmetric, then V is the graph of a map T: W subspace

we

D+. Then self-adjointness of

C

D+

~

D_ and (Tu, Tv) = (u, v), for a

A. implies that in fact W = D+ and T is onto. _

A convenient test for establishing the equality of the deficiency indices is to show that T commutes with a conjugation U; i.e., an anti linear isometry U: H ~ H satisfying U 2 = I; anti linear means U(ux) = aUx for complex scalars u and U(x + y) = Ux + Uy for x, Y E H. It is easy to see that U is the isometry required from D+ to D_ (use D+ = range (A + iI)l.). As a corollary, we obtain an important classical result of von Neumann: Let H be L2 of a

measure space and let A be a (closed) symmetric operator that is real in the sense that it commutes with complex conjugation. Then A admits self-adjoint extensions. (Another sufficient condition of a different nature, due to Friedrichs, is given below.) This result applies to many quantum mechanical operators. However, one is also intcrested in essential self-adjointness, so that thc self-adjoint extension will be unique. Methods for proving this for specific operators in quantum mechanics are given in Kato [1966] and Reed and Simon [1974]. For corresponding questions in elasticity, see Marsden and Hughes [1983]. We now give some additional results that illustrate methods for handling self-adjoint operators. 7.4.19 Proposition Let A be a self-adjoint and B a bounded self-adjoint operator. Then A + B

(with donwin D A) is self-adjoint. If A is essentially self-adjoint on D A then so is A + B. Proof A + B is certainly symmetric on DA' Let y E D(A+B)' so that for all xEDA' «A + B)x, y) = (x, (A + B)*y). The left side is (Ax, y) + (Bx, y) =(Ax, y) + (x, By) since B is everywhere defined. Thus (Ax, y) = (x, (A + B)*y - By). Hence y E D A• = DA and Ay = A*y = (A + B)*y - By. Hence y E DA+B = DA. Let A be the closure of A. For the second part, it suffices to show that the closure of A + B equals A + B. But if x E DA there is a sequence xn E D A such that xn

~

x, and AXn

~

Ax. Then Bxn·-t Bx as B is bounded so x belongs to the domain of the closure of A + B. _ In general, the sum of two self-adjoint operators need not be self-adjoint. (See Nelson [1959] and Chernoff [1974] for this and related examples.)


527

7.4.20 Proposition Let A be a symmetric operator. If the range of A is all of H then A is self-adjoint.

Proof We first observe that A is one-to-one. Indeed Jet Ax = O. Then for any y E DA' 0 = (Ax, y) = (x, Ay). But A is onto and so x = O. Thus A admits an everywhere defined inverse A-I, which is therefore self-adjoint. Hence A is self-adjoint (we proved earlier than the inverse of a self-adjoint operator is self-adjoint). _ We shall use these results to prove a theorem that typifies the kind of techniques one uses. 7.4.21 Proposition Let A be a symmetric operator on H and suppose A 5 0; that is (Ax, x) 5

o for

xED A' Suppose I - A has dense range. Then A is essentially self-adjoint.

Proof Note that ((1- A)u, u) = (u, u) - (Au, u) ~

and so by the Schwarz inequality we have 11(1- A)u II

~

II U 112

II u II. It follows that the closure of I - A

which equals I - A has closed range, which by hypothesis must be all of H. By 7.4.20 I - A is self-adjoint and so by 7.4.21 A is self-adjoint. _ 7.4.22 Corollary If A is self-adjoint and A 5 0, then for any A. > 0, A. - A is onto, (A. - A)-l exists and

II (A. - A)-l II 5 IfA.

(3)

Proof As before we have «A. - A)u, u) ~ A. II U 112 which yields II(A. - A)u II ~ A. II u II. As A is closed, this implies that the range of A. - A is closed. If we can show it is dense, the result will follow. Suppose Y is orthogonal to the range: «A.-A)u,y)=O for all

UE

DA .

This means that (A. - A)*y = 0, or since A is sC\f-adjoint, y gives 0 = «A. - A)y, y) ~

Ail y 112

E

D A' Making the choice u = y

so Y = O. _

Note that an operator A has dense range iff A* is one-to-one; i.e., A*w = 0 implies w= O. For a given symmetric operator A, we considered the general problem of self-adjoint extensions of A and classified these in terms of the defect spaces. Now, under different

528


hypotheses. we construct a special self-adjoint extension (even though A need not he essentially self-adjoint). This result is useful in many applications. including quantum mechanics. A symmetric operator A on H is called lower semi-bounded if there is a constant such that (Ax.x)~cllxIl2 forall

XE

CElt

DA" Upper semi-bounded is defined similarly. If A is

either upper or lower semi-bounded then A is called semi-bounded. Observe that if A is positive or negative then A is semi-bounded. As an example. let A = _V2 + V where V2 is the Laplacian and let V be a real valued continuous function and bounded below. say V(x) ~ a. Let H = L2(IRn. C) and DA the C~ functions with compact support. Then - V2 is positive so (Af. f)

=(- V2 f. f) + (Vf. f) ~ a(f. f).

and thus A is semi-bounded. We already know that this operator is real so has self-adjoint extensions by von Neumann's theorem. However. the self-adjoint extension constructed below (called the Friedrichs extension) is "natural." Thus the actual construction is as important as the statement. 7.4.23 Theorem A semi-bounded symmetric (densely defined) operator admits a self-adjoint

extension. Proof After multiplying by -1 if necessary and replacing A by A + (1 - a)I we can suppose (Ax. x) ~ II X 112. Consider the inner product on DA given by «x. y» = (Ax. y). (Using symmetry of A and the preceding inequality one easily checks that this is an inner product.) Let HI be the completion of DA in this inner product. Since the HI-norm is stronger than the H-norm. we have HI C H (i.e .• the injection DA C H extends uniquely to the completion). Now let H-I be the dual of HI. We have an injection of H into H- I defined as follows: if y is fixed and x 1-+ (x, y) is a linear functional on H, it is also continuous on HI since I (x, y) I ::; II x 1111 y II::; III x 11111 y II,

where III· III is the norm of HI. Thus HI

C

H

C

Wi.

Now the inner product on HI defines an isomorphism B: HI ~ H-I. Let C be the operator with domain Dc = {x E HI I B(x) E H}, and Cx = Bx for x E Dc- Thus C is an extension of A. This will be the extension we sought. We shall prove that C is self-adjoint. By definition, C is surjective; in fact C: Dc ~ H is a linear isomorphism. Thus by 7.4.20 it suffices to show that C is symmetric. Indeed for x. y E Dc we have, by definition, (Cx, y)

=

«x, y»

=

«y, x»

=

(Cy, x)

=

(x, Cy). •

The self-adjoint extension C can be alternatively described as follows. Let HI be as before and let C be the restriction of A* to DA. n HI. We leave the verification as an exercise. b


~

529

SUPPLEMENT 7.4B Stone's Theorem

(written in collaboration with P. Chernoff) Here we give a self-contained proof of Stone's theorem for unbounded self-adjoint operators

A on a complex Hilbert space H. This guarantees that the one-parameter group eitA of unitary operators exists. In fact, there is a one-to-one correspondence between self-adjoint operators and continuous one-parameter unitary groups. A continuous one-parameter unitary group is a homomorphism t 1-+ VI from lR to the group of unitary operators on H, such that for each x

E

H the map t 1-+ Vlx is continuous. The infinitesimal generator A of VI is defined by

its domain D consisting of those x for which the indicated limit exists. We insert the factor i for convenience; iA is often called the generator.

7.4.24 Stone's Theorem. Let VI be a continuous one-parameter unitary group. Then the generator A of VI is self-adjoint. (In particular, by Supplement 7.4A, it is closed and densely defined.) Conversely, let A be a given self-adjoint operator. Then there exists a unique one-parameter unitary group VI whose generator is A. Before we begin the proof, let us note that if A is a bounded self-adjoint operator then one can form the series I + itA

+

;!

(itA)2

+

;!

(itA)3

+ ...

which converges in the operator norm. It is straightforward to verify that VI is a continuous one-parameter unitary group and that A is its generator. Because of this, one often writes eitA for the unitary group whose generator is A even if A is unbounded. (In the context of the so-called "operational calculus" for self-adjoint operators, one can show that eilA really is the result of applying the function e ilO to A; however, we shall not go into these matters here.) Proof of Stone's Theorem (first half) Let VI be a given continuous unitary group. In a series of lemmas, we shall show that the generator A of VI is self-adjoint.

7.4.25 Lemma The domain D of A is invariant under each VI' and moreover AVtx = VIAx for each XED. Proof Suppose xED. Then


530

which converges to Ut(iAx)

= iUtAx

as h

~

O. The lemma follows by the definition of A. _

7.4.26 Corollary A is closed. Proof If XED then, by 7.4.25

(1)

Hence

Now suppose that xn E D, xn

~

x, and AXn

~

y. Then we have, by (I),

(2)

Thus

(Here we have taken the limit under the integral sign because the convergence is unifonn; indccd IIU~Axn

-

U~YII

= IIAxn -

yll

~

0 independent of "t E [0, tJ.) Then, by (2),

dd UtX t

Hence xED and y

=Ax.

I

iy.

t=O

Thus A is closed. _

7.4.27 Lemma A is densely defined. Proof Let x E H, and let
- xq> , from which the assertion follows. • Thus far we have made no significant use of the unitarity of U t . We shall now do so. 7.4.28 Lemma A is symmetric. Proof Take x, yEO. Then we have

Tlddt

(Ax, y)

=

It=O =

(Utx, y)

+

1t (x, U_ty)

= - .;.. (x, 1

iAy)

=

It=O

Tlddt

= -

*

I

(x, U t y) t=O

+

I

1t (x, UtY) t=O

(x, Ay) . •

To complete the proof that A is self-adjoint, let y E 0* and x E O. By 7.4.25.28,

(UtY, x) = (y. U_tx) = (y. x) + (y, i Itu,AX d'!)

= (y.

x) - i I t (y. U,Ax)d'! = (y, x) - i I t (y, AU,x) d1:


532

1,-1 (U_, A * y, x) d't

= (y, x) - i 0

1,1 (0, A *y, x) d1:

= (y, x) + i 0

Because D is dense, it follows that

Hence, differentiating, we see that y E D and A *y = Ay. Thus A = A *. Proof of Stone's theorem (second half) We are now given a self-adjoint operator A. We shall construct a continuous unitary group UI whose generator is A.

7.4.29 Lemma If A> 0, then 1+ AA2: DA2 ~ H is bijective, (I + AA2)-1 : H ~ DA2 is bounded by I, and DA 2, the domain of A2 , is dense. Proof If A is self-adjoint, so is

...;T:A.

It is therefore enough to establish the lemma for A = 1.

First we establish surjectivity. By 7.4.14 and 7.4.26, if

E H is given there exists a unique solution (x, y) to the

Z

equations x - Ay = 0, Ax + y = z. From the first equation, x

= Ay.

The second equation then yields A2y + y = z, so 1+ A2 is

surjective. For xED A2, note that «I + A2)x, x) ~ II

X

11 2, so II (I + A2)x II ~ II x II. Thus 1+ A2 is

one-to-one and II (I + A2)-1 11$ 1. Now suppose that u is orthogonal to D A2. We can find a v such that u

=v + A2v.

Then 0= (u, v) = (v + A2v, v) = II

V

112 + II Av 11 2,

whence v = 0 and therefore u = O. Consequently D A2 is dense in H . • For A> 0, define an operator AI.. by AI.. = A( I + A A2)-I. Note that AI.. is defined on all of H because if x E H then (I + J...A2)-lx E D A2 C D, so A (I + J...A2)-lx makes sense.

7.4.30 Lemma AI.. is a bounded self-adjoint operator. Also AI.. and Proof Pick x E H. Then by 7.4.29

A~

commute for all A,

~

> o.


533

Ail AA.x 112 =(AA(I + AA 2)-lx, A(I + AA2)-lx) = (AA2(1 + AA2)-lx, (I + AA2)-lx)

:0; «I + AA2)(I + AA2)-lx, (I + AA 2)-lx) :0; 11(1 + AA2)-lx 112:0; so

II AA.II :0; 1/ v-:;:'

II x 11 2,

and thus AA. is bounded.

We now show that AA. is self-adjoint. First we shall show that if xED, then

Indeed, if xED we have AA.xED A2 by 7.4.29 and so

Now suppose xED and y is arbitrary. Then «I + AA2)-IAx, y) = «I + AA2)-IAx, (I + AA2)(I + AA2)-ly)

(Ax, (I + }.A2t 1y)

= (x, A.y).

Because D is dense and AI.. bounded, this relation must hold for all x E H. Hence AA. is self-adjoint. The proof that AA.~ = AJ.LAA. is a calculation that we leave to the reader. _ Since AA. is bounded, we can form the continuous one-parameter unitary groups UtA. = eitAA., A> 0 using power series or the results of §4.1. Since AA. and AJ.L commute, it follows that UsA. and

ut

commute for every s and t.

7.4.31 Lemma If XED then limA. -> OAA.x Proof If xED we have AA.x - Ax

=Ax.

= (I + AA 2t

1Ax - Ax

=- AA2(1 + AA2)-1 Ax.

It is therefore

enough to show that for every y E H, AA2(1 + AA2)-ly ~ O. From the inequality 11(1 + AA2)y 112 ~ II AAZy 11 2, valid for A ~ 0, we see that II }.A2(1 + }.A2)-1 II :0; I. Thus it is even enough to show the preceding equality for all y in some dense subspace of H. Suppose y E D A2, which is dense by 7.4.29. Then

which indeed goes to zero with A. _ 7.4.32 Lemma For each x E H, limA.-> oUtAx exists. If we call the limit Utx, then {Utl is a

continlWus one-parameter unitary group. Proof We have

534


whence (3)

II AAx - Af1x II ~ 0 as A,

Now suppose that xED. Then by 7.4.31, AAx ~ Ax, so that Il ~ O. Because of (3) it follows that {U,A~

h>0

is uniformly Cauchy as A ~ 0 on every

compact t-interval. It follows that lim A-> oU,Ax = U,X exists and is a continuous function of t. Moreover, since D is dense and all of the UtA have norm I, an easy approximation argument shows that the preceding conclusion holds even if x E' D.

It is obvious that each U t is a linear opeator. Furthermore,

so Ut is isometric. Trivially, U o = I. Finally, (UsU,x, y) = lim (UsAUtx, y) 1..->0

lim (U/x, U~sY)

1..->0

=

=

lim (U,x, U~sY)

1..->0

lim (U~+tx, y)

1..->0

=

(Us+tx, y),

so UsU, = Us+!' Thus, Us has an inverse, namely U_ s' and so Us is unitary. _

7.4.33 Lemma If xED, then , . Ux-x Iu n---

,->0

t

iAx.

Proof We have

(4) Now

uniformly for

1:

E [0, tl as A ~ O. Thus letting A ~ 0 in (4), we get

(5)

Section 7.4 Induced Flows on FunctWn Spaces and Ergodicily

535

for all xED. The lemma follows directly from (5) . • 7.4.34 Lemma If

Ux-x lim - ' - -

1-->0

iw

t

exists, then xED.

Proof It suffices to show that XE D*, the domain of A*, since D=D*. Let yE D*. Then by 7.4.33,

(x, iAy)

= lim

1-->0

<x, U-,y-y) -t

= - lim

,-->0

0, set

Ft(q, p) = q + tp if q > 0, q + tp > 0 and Ft(q, p) = - q - tp if q > 0, q + tp < 0 and set

What is the generator of the induced unitary flow? Is it essentially self-adjoint on the C'" functions with compact support away from the line q =O?

7.4E Let M be an oriented Riemannian manifold and L2(Ak(M» the space of L2 k-forms with

f

inner product

o such that

1/

f(x) - x II > K for all x E B.

Let e < minCK, 2) and choose 0 > 0 such that

20/0 + 0) < e; i.e., 0 < d(2 - e). By the Weierstrass approximation theorem (see, for example, Marsden [1974a]) there exists a polynomial mapping q : JRn -+ JRn such that 1/ f(x) - q(x) 1/ < 0 for all x E B. The image q(B) lies inside the closed ball centered at 0 of radius 1 + 0, so that p '" q/(l + 0) : B -+ B and

1/

f(x) - p(x)

1/

~

I

f(x) -

:~)o

"

+"

-

:~)o {~~ II ~

for all x E B. Since p is smooth by 7.5.14, it has a fixed point, say

o< K < 1/ f(x o) -

Xo 1/ ~ 1/

f(x o) - p(xO>

1/

+

1/

12: 0 < e

Xo E

p(x o) - Xo 1/

B. Then ~

e,

which contradicts the choice e < K. Brouwer's fixed point theorem isfaise in an open ball, for the open ball is diffeomorphic to JRn and translation provides a counterexample. The proof we have given is not "constructive." For example, it is not clear how to base a numerical search on this proof, nor is it obvious that the fixed point we have found varies continuously with f. For these aspects, see Chow, Mallet-Paret and Yorke [1978]. A third application of 7.5.12 is a topological proof of the fundamental theorem of algebra. 7.5.15 The Fundamental Theorem of Algebra Any polynomial p : C -+ C of degree n > 0 has a root. Proof Assume without loss of generality that p(z) = zn + an_1z n- 1 + ... + 30, where ai E C, and regard p as a smooth map from JR2 to JR2. If P has no root, then we can define the smooth map

Section 7.5 Introduction to Hodge-DeRham Theory

fez) = p(z)/1 p(z)

I whose restriction to S' we denote by g: S I

549

~ S' .

Let R > 0 and define for t E [0, 1] and z E 5',

5ince p,(z)/(Rz)n R~

00,

= 1 + t[an_,(Rz) + ... + ad(Rz)n]

and the coefficient of t converges to zero as

we conclude that for sufficiently large R, none of the p, has zeros on 5 '. Thus

F: [0, 1] x

s' ~ 5'

defined by F(t, z) =

is a smooth proper homotopy of dn(z) = zn with g(Rz) which in tum is properly homotopic to g(z). On the other hand, G : [0, 1] x 5' ~ S' defined by G(t, z) = f(tz) is a proper homotopy of the constant mapping c: 5' ~ 5', c(z)

= f(O)

with g. Thus dn is properly homotopic to a constant map and hence deg dn = 0 by corollary 7.5.12. However, if 5' is parameterized by arc length e, 0 ~ e ~ 21t, then dn maps the segment 0 ~ e ~ 21t/n onto the segment 0 ~ e ~ 21t since dn has the effect e ie ...... eine . If (() denotes the corresponding volume form on 5', the change of variables formula thus gives

1SI which for n

*0

~ (() =

n1 (() SI

=

21tn, i.e., deg

~ = n,

is a contradiction. _

The fundamental theorem of algebra shows that any polynomial p: C ~ C of degree n can be written as

where z" ... , zm are the distinct roots of p, k" ... , k m are their multiplicities, k, + ... + km = n. and c

E

C is the coefficient of zn in p(z). The fundamental theorem of algebra can be refmed to

take into account multiplicities of roots in the following way.

7.5.16 Proposition Let D be a compact subset of C with open interior and smooth boundary aD. Asswne !hat the polynomial p : C ~ C has no zeros on aD. Then the total number of zeros of p which lie in the interior of D, counting multiplicities, equals the degree of the map p/l pi: aD~ S'. Proof Let z" ... , zm be the roots of p in the interior of D with multiplicities k(1), .... k(m). Around each zi construct an open disk Di centered at zi' Di C D, such that aD n aDi = 0, and aDi n aDj = 0, for all i j. Then V = D \ (D I U ... U Dm) is a smooth compact two-dimensional

*


550

u aO I u ... aO m . The boundary orientation of aO j induced by V is opposite to the usual boundary orientation of aOj as the boundary of the disk OJ; see Figure

manifold whose boundary is aD

7.5.2. Since pI! p I is defined on all of V. corollary 7.5.11 implies that the degree of pI! pi: aV

--> Sl is zero. But the degree of a map defined on a disjoint union of manifolds is the sum of the individual degrees and thus the degree of pI! p I on aD equals the sum of the degrees of pI! p I on all aDj' The proposition is therefore proved if we show that the degree of pI! p I on aOj is the multiplicity k(i) of the root Zj'

0,

D

Figure 7.5.2

Let m

r(z) = c

II (z - Zj)

kG)

so p(z)

j=l.j;tj and the only zero of p(z) in the disk OJ is Zj' Then Zj + Rjz

aDj, where R j is the radius of OJ. is a diffeomorphism and therefore the degree of pI! pi: aO j --> SI equals the degree of (p 0 Sl of zk(j)arg r(z) with (p

0

E

O.

Since we can multiply 00 by any scalar, the integration map is onto. Any 00 with nonzero integral cannot be exact by Stokes' theorem. This last remark also shows that integration induces a mapping, which we shall still call integration,

fM : Hnc(M) ~ JR,

which is linear and onto. Thus,

in order to show that it is an isomorphism as (i) states, it is necessary and sufficient to prove it is injective, i.e., to show that if

fM 00 = 0

for 00 E n"(M), then 00 is exact. The proof of this will be

done in the following lemmas. 7.S.20 Lemma Theorem 7.S.19 holds for M = s!. Proof Let p : JR ~ SI be given by pet) = eit and 00 E nl(s!). Then p*oo = f dt for f E .'T(JR) a 21t-periodic function. Let F be an anti derivative of f. Since

o=

1 = f. 00

Sl

t+21t

f(s) ds = F(t + 21t) - F(t)

t

for all t E JR, we conclude that F is also 21t-periodic, so it induces a unique map G E 1(SI), determined by p*G = F. Hence p*oo = dF = p*dG implies 00 = dG since p is a surjective submersion. T 7.S.21 Lemma Theorem 7.S.19 holds for M = SO, s> 1. Proof This will be done by induction on n, the case n = 1 being the previous lemma. Write sn = NUS,

where N = {x E S" I x"+! ~ O} is the closed northern hemisphere and S = {x E S" I

x"+1 ~ O} the closed southern hemisphere. Then N n S = sn-I is oriented in two different ways as the boundary of N and S, respectively. Let

ON = {x E sn I x"+1 > - £},

Os = {x E S"

I

x"+1 < t} be open contractible neighborhoods of N and S, respectively. Thus by the Poincare lemma, there exist aN E nn-I(ON)' as E nn-I(Os) such that daN = 00 on ON' das = 00 on Os. Hence by hypothesis and Slokes' theorem,

Section 7.5 Introduction to Hodge-DeRham Theory

553

where i : sn-I ~ sn is the inclusion of sn-I as the equator of sn; the minus sign appears on the second integral because the orientations of Sn-I and as are opposite. By induction, i*(aN - as) E on-I(sn-I) is exact.

°

°

= ON n Os and note that the map r: ~ sn-I, sending each XES to rex) E Let sn-I, the intersection of the meridian through x with the equator sn-I, is smooth. Then r 0 i is

the identity on sn-I. Also, i 0 r is homotopic to the identity of 0, the homotopy being given by

°

sliding x E along the meridian to rex). Since deaN - as) = ffi - ffi = 0 on 0, by Theorem 6.4.16 we conclude that (aN - as) - r*i*(a N - ag) is exact on 0. But we just showed that i*(~ - ag) E on-I(sn-I) is exact, and hence r*i*(~ - as) E on-I(o) is also exact. Hence aN - as

E

on-I(o) is exact. Thus, there exists ~ E on-2(0) such that aN - as = d~ on 0. Now

use a bump function to extend ~ to a form y E on-2(sn) so that on 0, ~

V, where V is an open set such that cl(U)

C

=y

and y =0 on sn \

V. Then if x EN if

XES

is by construction COO and dA = ffi. " 7.5.22 Lemma A compactly supported n-form ffi

compactly supported (n - I)-form on ]Rn Proof Let

0:

sn

~]Rn

E

on(]Rn) is the exterior derivative of a

iff IRn ffi = O.

be the stereographic projection from the north pole (0, ... , 1) E sn onto

]Rn and assume without loss of generality that (0, ... , 1) eo- I (supp ffi). By the previous lemma, O*ffi

= do.,

for some

a

E

on-I(sn) since 0

= IRn ffi = Isn O*ffi

by the change-of-variables

formula. But 0* ffi = do. is zero in a contractible neighborhood U of the north pole, so that by the Poincare lemma, a

=d~

on-2(sn) such that ~

=y

on U, where ~

E

on-2(u). Now extend ~ to an (n - 2)-form

yE

on U and y = 0 outside a neighborhood of cl(U). But then 0*(0.-

dy) is compactly supported in ]Rn and do*(a - dy) = o*da = ffi. " 7.2.23 Lemma Let M be a boundaryless connected n-manifold. Then Wc(M) is at most

one-dimensional. Proof Let (UO' x

is a weak Riemannian (or pseudo-Riemannian) metric on Q. the smooth vector

t':

bundle map
is a strong Riemannian metric. then
-+ (x(t).

vet»~

be an integral curve of S. That is. x(t) = vet)

and

vet)

=y(x(t). v(t».

(2)

As we remarked. these will shortly be shown to be Hamilton's equations of motion in the absence of a potential. To include a potential. let V: M

~

lR be given. At each x. we have the

differential of V. dV(x) E Tx *M. and we define grad Vex) by (grad Vex). w >x = dV(x) . w.

(3)

(In infinite dimensions, it is an extra assumption that grad V exists. since the map T xM ~ T x*M induced by the metric is not necessarily bijective.) The equations of motion in the potential field V are given by x(t) = vet);

vet) =y(x(t). v(t» - grad V(x(t»).

(4)

The total energy. kinetic plus potential. is given by H(v x) = (1/2) IIvxW + Vex). The vector field XH determined by H relative to the symplectic structure on TM induced by the metric. is given by (4). This will be part of a more general derivation of Lagrange's equations given below.

I@'"

Supplement 8.1B Geodesics

Readers familiar with Riemannian geometry can reconcile the present approach to geodesics based on Hamiltonian mechanics to the standard one in the following way. Define the covariant derivative V: 3t(M) x 3t(M) ~ 3t(M) locally by (V x Y)(x) = yx(X(x). Y(x» + DY(x) . X(x). where X(x) and Y(x) are the local representatives of X and Y in the model space E of M

Section 8.1 Hamiltonian Mechanics

and Yx : E x E

-t

573

E denotes the symmetric bilinear continuous mapping defined by polarization of

the quadratic form y(x. v). In finite dimensions. if E

= ]Rn.

quadratic form on ]Rn determined by the Christoffel symbols

then y(x. v) is an JRn-valued

r jk'

The defining relation for Vx Y

becomes n Vx

y _ xiyk....i -

1

where locally i

a

a Xi ayk a + --. - k ax' ax] ax

jk - .

k

•

a

X=X - . and y=y - . ax' ai It is a straightforward exercise to show that the foregoing definition of V x Y is chart independent and that V satisfies the following conditions defining an qffine connection: (i) V is lR-bilinear.

= f Vx Y and VxfY = f Vx Y + X[f]Y. = DY(x) . X(x) - DX(x) . Y(x) = [X. Y](x).

(ii) for f: M -tlR smooth. VIX Y (iii) (V x Y - Vy X)(x)

by the local formula for the Jacobi-Lie bracket of two vector fields. (The equivalence of sprays and affine connections was introduced by Palais and Singer [1960].) If c(t) is a curve in M and X E X(M). the covariant derivative of X along c is defined by

where c is a vector field coinciding with c(t) at the points c(t). Locally. using the chain rule. this becomes DX d (it(c(t» = - Yc(t)(X(c{t)). X(c{t))) + dt X(c(t).

which also shows that the definition of DX/dt depends only on c(t) and not on how c is extended to a vector field. In finite dimensions. the coordinate form of the preceding equation is

where c(t) denotes the tangent vector to the curve at c(t). The vector field X is called autoparallel or is parallel-transported along c if DX/dt = O. Thus c is autoparallel along c iff in any coordinate system we have c(t) - Yc(t)( c(t). c(t» = 0

or. in finite dimensions

574

Chapter 8 Applications

That is, c is autoparallel along c

iff c

is a geodesic.

There is feedback between Hamiltonian systems and Riemannian geometry. For example, conservation of energy for geodesics is a direct consequence of their Hamiltonian character but can also be checked directly. Moreover, the fact that the flow of the geodesic spray on TM consists of canonical transformations is also useful in geometry, for example, in the study of closed geodesics (cf. Klingenberg [1978]). On the other hand, Riemannian geometry raises questions (such as parallel transport and curvature) that are useful in studying Hamiltonian systems.

~

We now generalize the idea of motion in a potential to that of a Lagrangian system; these are, however, still special types of Hamiltonian systems. We begin with a manifold M and a given function L: TM

~

lR called the LagrangWn. In case of motion in a potential, take

which differs from the energy in that -V is used rather than +V. The Lagranian L defines a map called thefiber derivative, IFL: TM ~ T*M as follows: let v, w

E

TxM, and set lFL(V)'W""ddt L(v+ tw)I

",0

.

That is, lFL(v)· w is the derivative of L along the fiber in direction w. In the case of L(v x) = (112) (v x' vX>x - V(x), we see that lFL(v x)' Wx = (vx' wX>x' so we recover the usual map

r/': TM

~ T*M associated with the bilinear form (, >x'

Since T*M carries a canonical symplectic form

(J),

we can use lFL to obtain a closed

two-form 0\. on TM:

0\. = (lFL)* (J) . A local coordinate computation yields the following local formula for COr.: if M is modeled on a linear space E, so locally TM looks like U x E where U E

C

E is open, then O\.(u, e) for (u, e)

U x E is the skew symmetric bilinear form on E x E given by (J)L(U,

e)· ((el' ez)' (fl' fz»

= D 1(DzL(u, e)· e 1)· fl - D 1(D zL(u, e)· f1)· e 1 + Dz(DzL(u, e) . e 1) • fz - DzCDzL(u, e) . f 1) • ez '

(5)

where Dl and Dz denote the indicated partial derivatives of L. In finite dimensions this reads

where (qi, qi) are the standard local coordinates on TQ. It is easy to see that

0\. is (weakly) nondegenerate if DzDzL(u, e) is (weakly) nondegenerate;


in this case L is called (weakly) nondegenerate.

In the case of motion in a potential,

nondegeneracy of ffiL amounts to nondegeneracy of the metric (, defined by A: TM

-7

575

>x.

The action of L is

lR, A(v) = JFL(v) . v, and the energy of L is E = A - L. In charts, E(u, e) = D2L(u, e) . e - L(u, e),

and in finite dimensions, E is given by the expression .) dL·i . E(q, q = - . q - L(q, q).

dei' Given L, we say that a vector field Z on TM is a Lagrangian vector field or a Lagrangian system for L if the Lagrangian condition holds: ffiL(V)(Z(V), w)

=dE(v) . w

(6)

for all v E TqM, and WE Tv(TM). Here dE denotes the differential of E. We shall see that for motion in a potential, this leads to the same equations of motion as we found before. If

~

were a weak. symplectic form there would exist at most one such Z, which would be

the Hamiltonian vector field for the Hamiltonian E. The dynamics is obtained by find the integral curves of Z; that is, the curves t 1-+ vet) ETM satisfying (dv/dt)(t) = Z(v(t». From the Lagranian condition it is easy to check that energy is conserved (even though L may be degenerate).

8.1.14 Proposition Let Z be a Lagrangian vector field for L and let vet) E TM be an integral curve of Z. Then E(v(t» is constant in t. Proof By the chain rule, d dt E(v(t)) = dE(v(t) . v'(t) = dE(v(t» . Z(v(t) = ffiL(V(t»(Z(v(t», Z(v(t» = 0

by skew symmetry of ffiL' • We now generalize our previous local expression for the spray of a metric, and the equations of motion in the presence of a potential. In the general case the equations are called Lagrange's equations. 8.1.15 Proposition. Let Z be a Lagrangian system for L and suppose Z is a second-order equation (that is, in a chart U x E for TM, Z(u, e) = (u, e, e, Z2(u, e» for some map Z2: U x E -7 E). Then in the chart U x E, an integral curve (u(t), vet)) E U x E of Z satisfies Lagrange's equations: that is,


576

(7)

for all

wEE.

If D2 D2L, or equivalently~, is weakly nondegenerate, then Z is automatically

second order. In case of motion in a potential, (7) reduces to the equations (4).

Proof From the definition of the energy E we have locally

(a term D2 L(u, e) . f2 has cancelled). Locally we may write Z(u, e) Using formula (5) for

(OL'

=(u, e, Y1(u, e), Y2(u, e».

the condition on Z may be written

D 1(D 2L(u, e)· Y1(u, e»· fl - D 1(D 2L(u, e)· f l )· Y)(u, e)

+ D 2(D 2L(u, e) . Y)(u, e»· f2 - D 2(D 2L(u, e) . f)· Y2(u, e) = D)(D2L(u, e) . e) . f) - D)L(u, e) . f) + DzeD2L(u, e) . e) . f 2. Thus if

~

o we get

(9)

is a weak symplectic form, then D 2 D 2 L(u, e) is weakly nondenerage, so setting f)

=

Y)(u, e) = e, i.e., Z is a second-order equation. In any case, if we assume that Z is

second order, then condition (9) becomes

for all f)

E

E.

If (u(t),

vet»~

differentiation, then ~ = v and

is an integral curve of Z

ii = Y 2(u, ~),

and using dots to denote time

so

by the chain rule. _ The condition of being second order is intrinsic; Z is second order iff T'tM where 'tM : TM

~

0

Z = identity,

M is the projection. See Exercise 8.1D.

In finite dimensions Lagrange's equations (7) take the form

1t (:~ )=

:>

i

=

I, ... , n .

Proposition Assume q> : Q ~ Q is a diffeomorphism which leaves a weakly nondegenerate Lagrangian L invariant, i.e., L 0 Tq> = L. Then c(t) is an integral curve of the

8.1.16


if and only if

Lagrangian vector field Z

T
0, x = 0. Since for any qo E Q and tangent vector Vo to Q at qo there exists a homography h such that h(iyo) = qo, and the tangent of h at iyo in the direction iuo is Va, it fOllows that the geodesics of g are images by h of thc semiaxis (0, y) I y > OJ. If c =

°

or d

= 0, this image equals the ray (bid, y) I y > OJ or the ray (alc, y) I y > OJ. If both c '" 0, d '" 0, then the image equals the arc of the circle centered at «ad + bc)!2cd, 0) of radius 1/(2cd). Thus

the geodesics of the Poincare upper halfplane are either rays parallel to the y-axis or arcs of circles centered on the x-axis.

(See Fig. 8.1.1). The Poincare upper half-plane is a model of the

Lobatchevski geometry.

Two gcodesics in Q are called parallel if they do not intersect in Q.

Given either a ray parallel to Oy or a semicircle centered on Ox and a point not on this geodesic, there are infinitely many semicircles passing through this point and not intersecting the geodesics, i.e., through a point not on a geodesic there are infinitely many geodesics parallel to it. •

578


y

--4---~--~----4-4-----4-~----~--~--~------~------~_x

Figure 8.1.1 Geodesics in the Poincare Upper Half Plane We close with a completeness result for Lagrangian systems generalizing Example 4.1.23B. 8.1.18 Definition. A C2 function Vo: [0, 00] -+ IR is called positively complete if it is

decreasing andfor any

e > sup{Vo(t)}, it satisfies l~O

l~[e X

-

Vo(t)r l12 dt dt

=

+ 00, where x ~ O.

The last condition is independent of e. Examples of positively complete functions are _t a , -t[log(1 + t)]a, - t log(l + t) [log(log(l + t) + 1)]a, etc.

Let Q be a complete weak Riemannian function and I et Z be the Lagrangian vector field for L(v) = : TQ -+ Q is the tangent bundle projection. Suppose there is a

8.1.19 Theorem (Weinstein and Marsden [1970])

manifold, V: Q -+ IR be a II v 112 (2 - V("t(v», where

C2 'I:

positively complete function Vo and a point q' Q. Then Z is complete.

Q such that V(q)

E

~

Vo(d(q, q'» for all q

E

Proof Let c(t) be an integral curve of Z and let q(t) = ('I: 0 c)(t) be its projection in Q. Let qo = q(O) and consider the differential equation on IR

f"(t) =

with initial conditions f(O) = d(q', qo), ['(0)

dV - -;

(10)

(f(t»

=,J2(~ -

Vo(f(O» , where

.

~ = E(c{t)) =

.

E(c(O»

~ V(qo).

We can assume ~ > V(CJo), for if ~ = V(qo)' then q(O) = 0; now if q(t) = 0, the conclusion is trivially satisfied, so we need to work under the assumption that there exists a La for which q(La) '" 0; by time translation we can assume La =

o.


We show that the solution f(t) of (10) is defined for all t

~

579 O. Multiplying both sides of

(10) by f'(t) and integrating yields 2 . 1 2['(t) = p - Vo(f(t)). I.e .• t(s)

=

JS dCq'. qo)

[2(P - Vo(u))]

By hypothesis. the integral on the right diverges and hence t(s) that f(t) exists for all t ~

~

+

00

-1/2

as s

du .

~

+

00.

This shows

o.

For t ~ O. conservation of energy and the estimate on the potential V imply

d(q(t). q') :::; d(q(t). qo) + d(qo. q') :::; d(q'. qo) + = d(q'. qo)

r'

f:

ilil(s)[[ ds

r'

+ Jo [2(P - Vo(q(s))] 1/2 ds:::; d(q'. qo) + Jo [2(P - Vo(d(q(s). q'))] 1/2 ds.

Since

r1

f(t) = d(q'. qo) + Jo [2(P - Vo(f(s))]

112

ds

it follows that d(q(t). q ') :::; f(t); see Exercise 4.II(v) or the reasoning in Example 4.1.238 plus an approximation of d(q(t). q ') by Cl

functions. Hence if Q is finite dimensional. q(t)

remains in a compact set for finite t-intervals. t

~

O. Therefore c(t) does as well. V(q(t)) being

bounded below on such a finite t-interval. Proposition 4.1.9 implies that c(t) exists for all t

~

O.

The proof in infinite dimensions is done in Supplement 8.1e. If F, is the local flow of Z. from 'r(F-tv)) = 'r(F,(-v)) (reversibility). it follows that c(t) exists also for all t:::; 0 and so the theorem is proved. _

(kiF Supplement s.l.e Completeness of Lagrangian Vector Fields on Hilbert Manifolds

This supplement provides the proof of theorem 8.1.19 for infinite dimensional Riemannian manifolds. We start with a few facts of general interest. Let (Q. g) be a Riemannian manifold and 'r; TQ ~ Q the tangent bundle projection. For vETqQ. the subspace Vv = ker Tv 'r = T veTqQ)

C

T v(TQ) is called the vertical subspace of

Tv(TQ). The local expression of the covariant derivative V' defined by g in Supplement 8.18 shows that V'yX depends only on the point values of Y and thus it defines a linear map (V'X)(q); v E TqQ ....... (V'yX)(q) E TqQ where Y E X(Q) is any vector field satisfying Y(q) = v. Let jv; TqQ ~ Tv(TqQ) = Vv denote the isomorphism identifying the tangent space to a linear space with the linear space itself and consider the map hv; TqQ

~

Tv(TQ) by

L

0

(V'X)(q) : TqQ ~ Vv. Define the horizontal map


580

where vET qQ and X E X(Q) satisfies X(q) = v. Loeally. if E is the model of Q. hy has the expression hy: (x. u) E V x E ..... (x. v. u - y/u. v» E V x E x Ex E. This shows that hy is a linear eontinuous injective map with split image. The image of hy is called the horizontal subspace of T/TQ) and is denoted by l\.. It is straightforward to check that Ty(TQ)=VyEllHy and that Ty'tIl\.:Hy~TqQ. jy:TqQ~Vy are Banach space isomorphisms. Declaring them to be isometries and Hy perpendicular to Vy gives a metric gT on TQ. We have proved that if (Q. q) is a (weak) Riemannian manifold, then g induces a metric gT on TQ. The following result is taken from Ebin [1970]. 8.1.20 Proposition If (Q. g) is a complete (weak) Riemannian manifold then so is (TQ. gT). Proof. Let (v n) be a Cauchy sequence in TQ and let qn = 't(v n)' Since 't is distance decreasing it follows that (qn) is a Cauchy sequence in Q and therefore convergent to q E Q by completeness of Q. If E is the model of Q. E is a Hilbert space. again by completeness of Q. Let (V. cp) be a chart at q and assume that V is a closed ball in the metrie dermed by g of radius 3£. Also. assume that TyT cp : T/TM) ~ Ex E is an isometry for all v E TqQ which implies that for £ small enough there is a C > 0 such that IITTcp(w)II :;; C IIwl! for all WE TTU. This means that all curves in TV are stretched by Tcp by a factor of C. Let V

C

V be the closed

ball ofradius £ centered at q and let n. m be large enough so the distance between vn and vrn is smaller than £ and vn' vrn E TV. If Y is a path from vn to vrn oflength < 2£. then 't 0 Y is a path from qn to qrn of length < 2£ and therefore 't 0 Y C V. which in turn implies that y C TV. Moreover. Tcp 0 Y has length < 2C£ and therefore the distance between Tcp(v n) and Tcp(v rn) in E is at most 2C£. This shows that (Tcp(v n») is a Cauchy sequence in E and hence convergent. Since Tcp is a diffeomorphism. {vn} is convergent. _ In general. completeness of a vector field on M implies completeness of the first variation equation on TM.

Proof of 8.1.19. Let c:]a.b[

~

TQ be a maximal integral curve of Z. We shall prove that

lil1\ibc(t) exists in TQ which implies. by local existence and uniqueness. that c can be continued beyond b. i.e .• that b = + 00. One argues similarly for a. We have shown that q(t) = ('t 0 c)(t) is bounded on finite t-intervals. Since V(q(t» is bounded on such a finite t-interval. it follows that = -

dU, then the conserved quantity is

594


Proof Since Lu(ub).u

= d(ub(u))·u,

for stationary ideal compressible or incompressible

homogeneous flows we have dub bIb dp 0 = - . u = - ( L u )·u + -d(u )) . u - - . u = u 2 p

at

1 - -2 (d

2

1

II u II ). u - -p dp . u ,

so that dub J,, as' u(x(s))ds '2

since x'(s)

=u(x(s)).

The two-form

=

0

• 0)

=dub

is called vorticity, which, in JR.3 can be identified with curl u. Our

assumptions so far have precluded any tangential forces and thus any mechanism for starting or stopping rotation. Hence, intuitively, we might expect rotation to be conserved. Since rotation is intimately related to the vorticity, we can expect the vorticity to be involved. We shall now prove that this is so. Let C be a simple closed contour in the fluid at t = 0 and let C, be the contour carried along the flow. In other words, Ct = = 0*'1'. (iii) Show that u is a Hamiltonian vector field (see Section 8.1) with energy 'V directly in ]R2 and then for arbitrary two-dimensional Riemannian manifolds M. (iv) Do stream functions exist for arbitrary fluid flow on li2? On S2?

598 (v)


Show that the vorticity is

(J)

= M",.

8.21 Clebsch Variables; Clebsch [1859]. Let ~ be the space of functions on a compact manifold M with the dual space ~ *, taken to be densities on M; the pairing between f E ~ and pE ~*is (f,p)=JMfp. (i) On the symplectic manifold ~x ~* x ~x ~* with variables (a, A, 1.1, p), show that Hamilton's equations for a given Hamiltonian H are . I5H a =

sr'

. I5H 1.1 =

8P'

j. _

I5H

- - l5a'

. _ I5H p - - 151.1 '

where 15H/I5A is the junctional derivative of H defined by I5H

.

.

(sr' A) = DH(A)'A . (ii) In the ideal isentropic compressible fluid equations, set M = pub, the momentum

density, where dx denotes the Riemannian volume form on M. Identify the density o(x)dx E ~ * with the function o(x) E ~ and write M = - (pdl.1 + Ada)dx. For momentum densities of this form show that Hamilton's equations written in the variables (a, A, 1.1, p) imply Euler's equation and the equation of continuity.

§8.3 Electromagnetism Classical electromagnetism is governed by Maxwell's field equations. The form of these equations depends on the physical units chosen, and changing these units introduces factors like

41t, c = the speed of light, simplest form; thus c

Eo = the dielectric constant and 110 = the magnetic permeability.

100 , ~o

discussion assumes that

This

are constant; the choice of units is such that the equations take the

= 100 = ~o = 1

and factors 41t disappear. We also do not consider

Maxwell's equations in a material, where one has to distinguish E from D, and B from H. Let E, B, and J be time dependent C1-vector fields on 1R3 and p: JR.3 x JR. ~ JR. a scalar. These are said to satisfy MaxweU's equatWns with charge density p and current density J when the following hold: div E

=

P

curl B -

aB at aE at

(1)

(no magnetic sources)

(2)

=

0

(Faraday's law o/induction)

(3)

=

J

(Ampere's law).

(4)

div B = 0 = 0 curl E +

(Gauss's law)

E is called the electricjield and B the magneticjield.

The quantity

In p dV = Q

is called the charge of the set

n

C JR.3. By Gauss' theorem, (1)

is equivalent to

(5) for any (nice) open set

n C JR.3; i.e., the electric flux out of a closed surface equals the total charge

inside the surface. This generalizes Gauss' law for a point charge discussed in Section 7.3. By the same reasoning, (2) is equivalent to

lan

(6)

B. n dS = O.

That is, the magnetic flUX out of any closed surface is zero. In other words there are no magnetic SOurces inside any closed surface. By Stokes' theorem, (3) is equivalent to

Ias for any closed loop

as

E. ds =

r(curl E) • n dS = - aadrs B· n dS

Js

bounding a surface S. The quanity

JasE' ds is called the voltage around

600

as.


Thus, Faraday's law of induction (3) says that the voltage around a loop equals the negative of

the rate of change of the magnetic flux through the loop. Finally, again by the classical Stokes' theorem, (4) is equivalent to

(8)

JsJ· n dS has the physical interpretation of current, this form states that if E is constant in

Since

time, then the magnetic potential difference JasB. ds around a loop equals the current through the loop. In general, if E varies in time, Ampere's law states that the magnetic potential difference around a loop equals the total current in the loop plus the rate of change of electric flUX through the loop. We now show how to express Maxwell's equations in terms of differential forms. Let M = jR4 = {(x, y, z, t)} with the Lorentz metric g on jR4 of diagonal form (1, 1, 1, - 1) in standard coordinates (x, y, z, t).

8.3.1 Proposition There is a unique twojorm F on lR4 , called the Faraday twojorm such that

(9)

Eb = - i a/ilt F ; Bb

(Here the b is Euclidean in jR3 and the

*

=- i a/ilt * F.

(10)

is Lorentzian in jR4.)

Proof If F = F xy dx

1\

dy + F zx dz

+FxI dx

1\

1\

dx + F yz dy 1\ dz

dt+ FYI dy I\dt + Fzt dz

1\

dt,

then (see Example 6.2.14E),

* F = Fxy dz 1\ - Fxl dy

dt + F zx dy 1\

1\

dt + F yz dx 1\ dt

dz - FYI dz 1\ dx - F zt dx

1\

dy

and so

- i a/ilt F = F xl dx + F yt dy + F zt dz and

- i alilt

* F = Fxy dz + F zx dy + Fyz dx.

Thus, F is uniquely determined by (9) and (10), namely F = El dx

1\

dt + E2 dy 1\ dt + E3 dz 1\ dt

+ B3 dx

1\

dy + B2 dz

1\

dx + B 1 dy 1\ dz. •

We started with E and B and used them to construct F, but one can also take F as the primitive object and construct E and B from it using (9) and (10). Both points of view are useful. Similarly, out of p and J we can form the source one-form j = - p dt + J 1 dx + J2 dy +

601

Section 8.3 Electromagnetism

13 dz; i.e., j is uniquely determined by the equations - ia/dtj relation, J is regarded as being defined on

=p

and i a/dt *j

=*.F;

in the last

]R4.

8.3.2 Proposition Maxwell's equations (1)-(4) are equivalent to the equations

of = j

dF = 0 and on the manifold

]R4

endowed with the Lorentz metric.

Proof A straightforward computation shows that

dF = (curl E +

~~)xdy A dz A dt + (curl E + ~~ ) ydz A dx A dt

+ (curIE+

~~)zdXAdYAdt+(diV

B)dxAdYAdz.

Thus dF =0 is equivalent to (2) and (3). Since the index of the Lorentz metric is 1, we have 0 =* d

*.

Thus

of = * d * F = * d( - Eldy A dz - E2dz A dx - E 3dx A dy + Bldx A dt + B2dy A dt + B3dz A dt) =

* [ - (div E)dx A dy A dz + ( curl B - ~ ) xdy A dz A dt + + (curl B(curlB-

Thus

~)ydzAdxAdt+(curIB- ~~)zdXAdYAdt]

~)xdx+(curIB- ~)ydY + (curIB- ~~)zdZ-(diVE)dt

of

= j iff (1) and (4) hold . • As a skew matrix, we can represent F as follows

F=

x

y

0

B3 _B2 EI

_B3

BI

E2

0

E3

0

B2 -BI

z

-EI _E2 _E3

X Y z

0

Recall from Section 6.5 and Exercise 7.SG, the formula

Since Idet [gkt 11 = I, Maxwell's equations can be written in terms of the Faraday two-form F in

602


components as Fij,k + Fjk~ + Fkij =

and

pk,k

°

(1)

=- ji,

(12)

where Fij,k = aF/dx k, etc. Since 02 = 0, we obtain

°=

02F = oj

= *[

= * d * j = * d( -

(~ + div J )dx

1\

dy

P dx 1\

1\

dy

1\

dz 1\ dtJ =

dz + (*

P) 1\ dt)

~ + div J;

i.e., ap/ at + div J = 0, which is the continuity equation (see Section 8.2). Its integral form is, by Gauss' theorem -dQ = -d

dt

for any bounded open set

n.

f.

dtQ

P dV =

I

an

J • n dS

Thus the continuity equation says that the flux of the current density

out of a closed surface equals the rate of change of the total charge inside the sUrface.

Next we show that Maxwell's equations are Lorentz invariant, i.e., are special-relativistic. The Lorentz group L is by definition the orthogonal group with respect to the Lorentz metric g,

i.e.,

L = {A E uL(JR.4)

I g(Ax, Ay) = g(x, y) for all

x, y

E

JR.4}.

Lorentz in variance means that F satisfies Maxwell's equations with j iff A *F satisfies them with A *j, for any A

E

L. But due to Proposition 8.3.1 this is clear since pull-back commutes with d

and orthogonal transformations commute with the Hodge operator (sec Exercise 6.2D) and thus they commute with O. As a 4 x 4 matrix, the Lorentz transformation A acts on F by F 1-+ A *F = AFAT. Let us sec that the action of A

E

L mixes up E's and B's. (This is the source of statements like: "A

moving observer sees an electric field partly converted to a magnetic field.") Proposition 8.3.1 defines E and B intrinsically in terms of F. Thus, if one performs a Lorentz transformation A on F, the new resulting electric and magnetic fields E' and B' with respect to the Lorentz unit normal A *(aJdt) to the image A(JR.3 x 0) in JR.4 are given by

(E'f

=-i A-a/at A *F,

(B'f

=- i A-a/at A *F.

For a Lorentz transformation of the form x' -

x-vt

-~'

Y '= y, z

= z,

t ' = ---,t""-""v",,x=-

~

(the special-relativistic analogue of an observer moving uniformly along the x-axis with velocity v) we get


603

and

[

2 3 B3+vE2J .

B'= BI, B -vE,

~~

We leave the verification to the reader. By the way we have set things up, note that Maxwell's equations make sense on any Lorentz

manifold; i.e., a four-dimensional manifold with a pseudo-Riemannian metric of signature (+, +, +, -). Maxwell's vacuum equations (i.e., j = 0) will now be shown to be conformally invariant on any Lorentz manifold (M, g). A diffeomorphism o -

df)

dt

.

= dlv

dq>O 2 An + at +V f-

d2f

dt2 ;

(18)

i.e.,

This equation is the calssical inhomogeneous wave equation. The homogeneous wave equation (right-hand side equals zero) has solutions f(t, x, y, z)

= 'I'(x -

t) for any function 'V. This

solution propagates the graph of 'V like a wave--hence the name wave equation. Now we we can draw some conclusions regarding Maxwell's equations. In tenns of the vector potential A and the function q>, (1) and (4) become

(19)

which again are inhomogeneous wave equations. Conversely, if A and q> satisfy the foregoing equations and div A + dq>/dt = 0, then E = - grad q> - dAldt and B = curl A satisfy Maxwell's equations. Thus in

]R4,

this procedure reduces the study of Maxwell's equations to the wave equation,

and hence solutions of Maxwell's equations can be expected to be wavelike. We now repeat the foregoing constructions on ]R4 using differential fonns. Since dF = 0, on

]R4

we can write F = dG for a one-fonn G. Note that F is unchanged if we replace G by

G + df. This again is the gauge fr'edom Substituting F = dG into of = j gives OdG = j. Since 8

=do + Od

is the Laplace-deRham operator in

]R4,

we get

8G =j - dOG.

°

Suppose we try to choose G so that OG = we can let G = Go + df and demand that

(20)

(a gauge condition). To do this, given an initial Go'

so f must satisfy 8f =- OG o. Thus, if the gauge condition (21)

holds, then Maxwell's equations become

607


L'.G =j.

(22)

Equation (21) is equivalent to (18) and (22) to (19) by choosing G = Ab +
( ad(

~~ ) *J.!. ad( ~: ) *J.! ) - D2G(J.!>( ad( ~: ) *J.!. ad( ~: ) *J.! ).

Section 8.4 The Lie-Poisson Bracket in Continuum Mechanics and Plasma Physics 615

The two cyclic penuutations in F, G, H added to the above fonuula sum up to zero: all six tenus involving second derivatives cancel and the three first tenus add up to zero by the Jacobi identity for the bracket of g. • 8.4.S The Free Rigid Body

The equations of motion of the free rigid body described by

an observer fixed on the moving body are given by Euler's equation (24)

TI =TIxw,

where TI, WE JR.3, TIi = Ii Wi' i = 1, 2, 3, I = (II' 12, 13) are the principal moments of inertia, the coordinate system in the body is chosen so that the axes are the principal axes, W is the angular velocity in the body, and TI is the angular momentum in the body. It is straightforward to check that the kinetic energy 1

H(n) = -TI· 2

(25)

W

is a conserved quantity for (24). We shall prove below that (24) are Hamilton's equations with Hamiltonian (25) relative to a

(-) Lie-Poisson structure on JR.3. The vector space JR.3 is in fact a Lie algebra with respect to the bracket operation given by the cross product, i.e. [x, y]

=x x

y. (This is the structure that it inherits from the rotation

group.) We pair JR.3 with itself using the usual dot-product, i.e., (x, y ) =x • y. Therefore, if F: JR.3 ~ JR., of/oTI = VF(TI). Thus, the (-) Lie-Poisson bracket is given via (19) by the triple

product {F, G }(n) = -TI • (VF(TI) x VG(TI».

Since oH/oTI = W, we see that for any F: JR.3

(26)

~ JR.,

d . . dt (F(n» = DF(n) • TI = TI • VF(n) = -TI • (VF(n) x W = VF(n) • (TI x w)

so that

F=

{F, H} for any F: JR.3 ~ JR. is equivalent to Euler's equations of motion (24).

8.4.6 Proposition (Pauli [1953], Martin [1959], Arnold [1966J. Sudarshan and Mukunda

[1974]) Euler's equations (24)for afree rigid body are a Hamiltonian system in JR.3 relative to the (-) Lie Poisson bracket (24) and Hamiltonian function (25). 8.4.7 Ideal Incompressible Homogeneous Fluid Flow In §8.2 we have shown that the equations of motion for an ideal incompressible homogeneous fluid in a region with smooth boundary

an

are given by Euler's equations of motion

n

C JR.3

616


at + (v av

• v)v = -vp

) (27)

div v=O v(t, x) E Tx(am for x E an

with initial condition v(O, x) = vo(x), a given vector field on n. Here v(t, x) is the Eulerian or spatial velocity, a time dependent vector field on n. The pressure p is a function of v and is uniquely determined by v (up to a constant) by the Neumann problem (take div and the dot product with n of the first equation in (27»

~p

= -div«v • V)V)

~ = Vp. n = --((v· V)V)· n

on an,

)

(28)

where n is the outward unit normal to an. The kinetic energy

f.

1 nllvll 2 dx 3 H(v)=1

(29)

has been shown in §8.2 to be a conserved quantity for (27). We shall prove below that the first equation in (27) is Hamiltonian relative to a (+) Lie-Poisson bracket with Hamiltonian function given by (29). ... Consider the Lie algebra *div(n) of divergence free vector fields on n tangent to an with bracket given by minus the bracket of vector fields, i.e., for u, v E *div(n) define [u, v] = (v • V)u - (u • V)v.

(30)

(The reason for this strange choice comes from the fact that the usual Lie bracket for vector fields is the right Lie algebra bracket of the diffeomorphism group of n.) Now pair *div(n) with itself via the L2-pairing. As in 8.4.2, using the Hodge-Helmholtz decomposition, it follows that this pairing is weakly non-degenerate. In particular

cSH Yv"=v.

(31)

The (+) Lie-Poisson bracket on *div(n) is

5.

{F,G}(v)= n v·

cSG V) [(8V.

Therefore, for any *div denotes the angle between the tangent line to the disk at Q and the x-axis, the position of the disk in space is completely determined by (x, y, e, q». These variables form elements of our configuration space M = IR2 X S t X st. The condition that there is no slipping at Q means that the velocity at Q is zero; i.e.,

(total velocity = velocity of center plus the velocity due to rotation by angular velocity de/dt).

Section 8.5 Constraints and Control

625

Figure 8.5.1 The expression of these constraints in terms of differential forms is ffi l

ffi l

= 0,

ffi2

= 0 where

= dx + a cos q> dB and ffiz = dy + a sin q> dB. We compute that ffi = ffi l /\ ffiz = dx /\ dy + a cos q> dB /\ dy + a sin q> dx /\ dB. dffi l = - a sin q> dq> /\ dB , dffi2 = a cos q> dq> /\ dB. dffi l /\ ffi = - a sin q> dq> /\ dB /\ dx /\ dy, dffiz /\ ffi = a cos q> dq> /\ dB /\ dx /\ dy.

These do not vanish identically. Thus, according to Theorem 6.4.20, this system is not integrable and hence these constraints are nonholonomic. A second example of constraints is due to E. Nelson [1967). Consider the motion of a car and denote by (x, y) the coordinates of the center of the front axle, q> the angle formed by the moving direction of the car with the horizontal, and B the angle formed by the front wheels with the car (Fig. 8.5.2).

L Figure 8.5.2


626

The configuration space of the car is

]R2

x

1r2 ,

parametrized by (x, y,

Manifolds, Tensor Analysis, and Applications, Second Edition (Applied Mathematical Sciences 75)

Manifolds Tensor analysis and Applications

Manifolds, tensor analysis, and applications