Global Analysis: Differential Forms in Analysis, Geometry, and Physics (Graduate Studies in Mathematics, V. 52)

Global Analysis Differential Forms in Analysis, Geometry and Physics Ilka Agricola Thomas Friedrich Graduate Studies ...

Author: Ilka Agricola | Thomas Friedrich

353 downloads 977 Views 7MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

Global Analysis Differential Forms in Analysis, Geometry and Physics

Ilka Agricola

Thomas Friedrich

Graduate Studies in Mathematics Volume 52

American Mathematical Society

Global Analysis

Global Analysis Differential Forms in Analysis, Geometry and Physics

Ilka Agricola

Thomas Friedrich Translated by Andreas Nestke

Graduate Studies in Mathematics Volume 52

American Mathematical Society Providence, Rhode Island

Editorial Board Walter Craig Nikolai Ivanov

Steven G. Krantz David Saltman (Chair) 2000 Mathematics Subject Classification. Primary 53-01; Secondary 57-01, 58-01, 22-01, 74-01, 78-01, 80-01, 35-01.

This book was originally published in German by Friedr. Vieweg & Sohn Verlagsgesellschaft mbH, D-65189 Wiesbaden, Germany, as "Ilka Agricola and Thomas Friedrich: Globale Analysis. 1. Auflage (1st edition)", ©F}iedr. Vieweg & Sohn Verlagsgesellschaft mbH, Braunschweig/Wiesbaden, 2001 Translated from the German by Andreas Nestke

Library of Congress Cataloging-in-Publication Data Agricola, I1ka, 1973(Globale Analysis. English)

Global analysis : differential forms in analysis, geometry, and physics / Ilka Agricola, Thomas Ftiedrich ; translated by Andreas Nestke. p. cm. - (Graduate studies in mathematics, ISSN 1065-7339 ; v. 52) Includes bibliographical references and index. ISBN 0-8218-2951-3 (alk. paper) 1.

Differential forms.

2.Mathematical physics.

1.

Friedrich, Thomas, 1949

ll. Title.

111. Series.

QA381.A4713 2002 2002027681

514'.74- dc2l

Copying and reprinting. Individual readers of this publication, and nonprofit libraries acting for them, are permitted to make fair use of the material, such as to copy a chapter for use in teaching or research. Permission is granted to quote brief passages from this publication in reviews, provided the customary acknowledgment of the source is given. Republication, systematic copying, or multiple reproduction of any material in this publication is permitted only under license from the American Mathematical Society. Requests for such permission should be addressed to the Acquisitions Department, American Mathematical Society, 201 Charles Street, Providence. Rhode Island 02904-2294, USA. Requests can also be made by e-mail to reprint-peraissionlaes.org. © 2002 by the American Mathematical Society. All rights reserved. The American Mathematical Society retains all rights except those granted to the United States Government. Printed in the United States of America.

The paper used in this book is acid-free and falls within the guidelines established to ensure permanence and durability.

Visit the AMS home page at http://wv.aas.org/

10987654321

070605040302

Preface

This book is intended to introduce the reader into the world of differential forms and, at the same time, to cover those topics from analysis, differential geometry, and mathematical physics to which forms are particularly relevant. It is based on several graduate courses on analysis and differential geometry given by the second author at Humboldt University in Berlin since the beginning of the eighties. From 1998 to 2000 the authors taught both courses jointly for students of mathematics and physics, and seized the opportunity to work out a self-contained exposition of the foundations of differential forms and their applications. In the classes accompanying the course, special emphasis was put on the exercices, a selection of which the reader will find at the end of each chapter. Approximately the first half of the book covers material which would be compulsary for any mathematics student finishing the first part of his/her university education in Germany. The book can either accompany a course or be used in the preparation of seminars. We only suppose as much knowledge of mathematics as the reader would acquire in one year studying mathematics or any other natural science. From linear algebra, basic facts on multilinear forms are needed, which we briefly recall in the first chapter. The reader is supposed to have a more extensive knowledge of calculus. Here, the reader should be familiar with differential

calculus for functions of several variables in euclidean space R", the Riemann integral and, in particular, the transformation rule for the integral, as well as the existence and uniqueness theorem for solutions of ordinary differential equations. It is a reader with these prerequisites that we have in mind and whom we will accompany into the world of vector analysis, Pfaf flan

v

vi

Preface

systems, the differential geometry of curves and surfaces in euclidean space, Lie groups and homogeneous spaces, symplectic geometry and mechanics, statistical mechanics and thermodynamics and, eventually, electrodynamics. In Chapter 2 we develop the differential and integral calculus for differential forms defined on open sets in euclidean space. The central result is Stokes' formula turning the integral of the exterior derivative of a differential form

over a singular chain into an integral of the form itself over the boundary of the chain. This is in fact a far-reaching generalization of the main theorem of differential and integral calculus: differentiation and integration are mutually inverse operations. At the end of a long historical development mathematicians reached the insight that a series of important integral formulas in vector analysis can be obtained by specialization from Stokes' formula. We will show this in the second chapter and derive in this way Green's first and second formula, Stokes' classical formula, and Cauchy's integral formula for complex differentiable functions. Furthermore, we will deduce Brouwer's fixed point theorem from Stokes' formula and the Weierstrass approximation theorem.

In Chapter 3 we restrict the possible integration domains to "smooth" chains. On these objects, called manifolds, a differential calculus; for functions and forms can be developed. Though we only treat submanifolds of euclidean space, this section is formulated in a way to hold for every Riemannian manifold. We discuss the concept of orientation of a ma.nifold, its volume form, the divergence of a vector field as well as the gradient and the Laplacian for functions. We then deduce from Stokes' formula the remaining classical integral formulas of Riemannian geometry (Gauss-Ostrogradski formula, Green's first and second formula) as well as the Hairy Sphere theorem, for which we decided to stick to its more vivid German name, `'Hedgehog theorem". A separate section on the Lie derivative of a differential form leads us to the interpretation of the divergence of a vector field as a measure for the volume distortion of its flow. We use the integral formulas to solve the Dirichlet problem for the Laplace equation on the ball in euclidean space and to study the properties of harmonic functions on R". For these we prove, among other things, the maximum principle and Liouville's theorem. Finally we discuss the Laplacian acting on forms over a Riemannian manifold, as well as the Hodge decomposition of a differential form. This is a generalization of the splitting of a vector field with compact support in R" into the sum of a gradient field and a divergence-free vector field, going back to Helmholtz. In the final chapter we prove Helmholtz' theorem within the framework of electrodynamics.

Preface

vii

Apart from Stokes' theorem, the integrability criterion of Frobenius is one of the fundamental results in the theory of differential forms. A geometric distribution (Pfafflan system) is defined by choosing a k-dimensional subspace

in each tangent space of an n-dimensional manifold in a smooth way. A geometric distribution can alternatively be described as the zero set of a set of linearly independent 1-forms. What one is looking for then is an answer to the question of whether there exists a k-dimensional submanifold such that, at each point, the tangent space coincides with the value of the given geometric distribution. Frobenius' theorem gives a complete solution to this question and provides a basic tool for the integration of certain systems of first order partial differential equations. Therefore, Chapter 4 is devoted to a self-contained and purely analytical proof of this key result, which, moreover, will be needed in the sections on surfaces, symplectic geometry, and completely integrable systems. Chapter 5 treats the differential geometry of curves and surfaces in euclidean space. We discuss the curvature and the torsion of a curve, Frenet's formulas, and prove the fundamental theorem of the theory of curves. We then turn to some special types of curves and conclude this section by a proof of Fenchel's inequality. This states that the total curvature of a closed space curve is at least 27r. Surface theory is treated in Cartan's language of moving frames. First we describe the structural equations of a surface, and then we prove the fundamental theorem of surface theory by applying Frobenius' theorem. The latter is formulated with respect to a frame adapted to the surface and the resulting 1-forms. Next we start the tensorial description of surfaces. The first and second fundamental forms of a surface as well as the relations between them as expressed in the Gauss and the Codazzi-Mainardi equations are the central concepts here. We reformulate the fundamental theorem in this tensorial description of surface theory. Numerous examples (surfaces of revolution, general graphs and, in particular, reliefs, i.e. the graph of the modulus of an analytic function, as well as the graphs of their real and imaginary parts) illustrate the differential-geometric treatment of surfaces in euclidean space. We study the normal map of a surface and are thus lead to its Gaussian curvature, which by Gauss' Theorema Egregium belongs to the inner geometry. Using Stokes' theorem, we prove the GaussBonnet formula and an analogous integral formula for the mean curvature of a compact oriented surface, going back to Steiner and Minkowski. An important class of surfaces are minimal surfaces. Their normal map is always conformal, and this observation leads to the so-called Weierstrass formulas. These describe the minimal surface locally by a pair of holomorphic functions. Then we turn to the study of geodesic curves on surfaces, the integration of the geodesic flow using first integrals as well as the investigation

of maps between surfaces. Chapter 5 closes with an outlook on the geometry of pseudo-Riemannian manifolds of higher dimension. In particular, we look at Einstein spaces, as well as spaces of constant curvature.

Symmetries play a fundamental role in geometry and physics. Chapter 6 contains an introduction into the theory of Lie groups and homogeneous spaces. We discuss the basic properties of a Lie group, its Lie algebra, and the exponential map. Then we concentrate on proving the fact that every closed subgroup of a Lie group is a Lie group itself, and define the structure of a manifold on the quotient space. Many known manifolds arise as homogeneous spaces in this way. With regard to later applications in mechanics, we study the adjoint representation of a Lie group.

Apart from Riemannian geometry, symplectic geometry is one of the essential pillars of differential geometry, and it is particularly relevant to the Hamiltonian formulation of mechanics. Examples of symplectic manifolds arise as cotangent bundles of arbitrary manifolds or as orbits of the coadjoint

representation of a Lie group. We study this topic in Chapter 7. First we prove the Darboux theorem stating that all symplectic manifolds are locally equivalent. Then we turn to Noether's theorem and interpret it in terms of the moment map for Hamiltonian actions of Lie groups on symplectic manifolds. Completely integrable Hamiltonian systems are carefully discussed. Using Frobenius' theorem, we demonstrate an algorithm for finding the action and angle coordinates directly from the first integrals of the Hamilton function. In §7.5, we sketch the formulations of mechanics according to Newton, Lagrange, and Hamilton. In particular, we once again return to Noether's theorem within the framework of Lagrangian mechanics, which will be applied, among others, to integrate the geodesic flow of a pseudoRiemannian manifold. Among the exercises of Chapter 7, the reader will find some of the best known mechanical systems.

In statistical mechanics, particles are described by their position probability in space. Therefore one is interested in the evolution of statistical states of a Hamiltonian system. In Chapter 8 we introduce the energy and information entropy for statistical equilibrium states. Then we characterize Gibbs states as those of maximal information entropy for fixed energy, and prove that the microcanonical ensemble realizes the maximum entropy among all states with fixed support. By means of the Gibbs states, we assign a thermodynamical system in equilibrium to a Hamiltonian system with auxiliary

parameters satisfying the postulates of thermodynamics. We discuss the

Preface

ix

role of pressure and free energy. A series of examples, like the ideal gas, solid bodies, and cycles, conclude Chapter 8. Chapter 9 is devoted to electrodynamics. Starting from the Maxwell equations, formulated both for the electromagnetic field strengths and for the dual 1-forms, we first deal with the static electromagnetic field. We prove the solution formula. for the inhomogeneous Laplace equation in three-space and obtain, apart from a description of the electric and the magnetic field in the static case, at the same time a proof for Helmholtz' theorem as mentioned before. Next we turn to the vacuum electromagnetic field. Here we prove the solution formula for the Cauchy problem of the wave equation in dimensions two and three. The chapter ends with a relativistic formulation of the Maxwell equations in Minkowski space, a discussion of the Lorentz group. the Maxwell stress tensor and a thorough treatment of the Lorentz force.

We are grateful to Ms. Heike Pahlisch for her extensive work on the preparation of the manuscript and the illustrations of the German edition. We also thank the students in our courses from 1998 to 2000 for numerous comments

leading to additions and improvements in the manuscript. In particular, Dipl.-Math. Uli Kriihmer pointed out corrections in many chapters. Not least our thanks are due to M. A. Claudia Frank for her thourough reading and correcting of the German manuscript with regard to language. The English version at hand does not differ by much from the original German edition. Besides small corrections and additions, we included a detailed discussion of the Lorentz force and related topics in Chapter 9. Finally, we thank Dr. Andreas Nestke for his careful translation.

Berlin, November 2000 and May 2002 Ilka Agricola Thomas Friedrich

Contents

v

Preface

Chapter 1.

Elements of Multilinear Algebra

Exercises

Chapter 2. Differential Forms in R" §2.1. Vector Fields and Differential Forms §2.2. Closed and Exact Differential Forms §2.3. Gradient, Divergence and Curl §2.4. Singular Cubes and Chains §2.5. Integration of Differential Forms and Stokes' Theorem §2.6. The Classical Formulas of Green and Stokes §2.7. Complex Differential Forms and Holomorphic Functions §2.8. Brouwer's Fixed Point Theorem Exercises

Chapter 3. Vector Analysis on Manifolds §3.1.

Submanifolds of lR'

Differential Calculus on Manifolds §3.3. Differential Forms on Manifolds §3.4. Orientable Manifolds §3.5. Integration of Differential Forms over Manifolds §3.6. Stokes' Theorem for Manifolds §3.7. The Hedgehog Theorem (Hairy Sphere Theorem) §3.2.

1

8 11 11

18

23 26 30 35 36

38 43 47 47 54

67 69 76 79 81 xi

Contents

xii

The Classical Integral Formulas 82 The Lie Derivative and the Interpretation of the Divergence 87 §3.10. Harmonic Functions 94 100 §3.11. The Laplacian on Differential Forms Exercises 105 §3.8. §3.9.

Chapter 4. Pfaffian Systems §4.1. Geometric Distributions §4.2. The Proof of Frobenius' Theorem §4.3. Some Applications of Frobenius' Theorem Exercises

Chapter 5. Curves and Surfaces in Euclidean 3-Space §5.1. Curves in Euclidean 3-Space §5.2. The Structural Equations of a Surface §5.3. The First and Second Fundamental Forms of a Surface §5.4. Gaussian and Mean Curvature §5.5. Curves on Surfaces and Geodesic Lines §5.6. Maps between Surfaces §5.7. Higher-Dimensional Riemannian Manifolds Exercises

Chapter 6. Lie Groups and Homogeneous Spaces §6.1. Lie Groups and Lie Algebras §6.2. Closed Subgroups and Homogeneous Spaces §6.3. The Adjoint Representation Exercises

111 111

116 120

126 129 129 141

147

155 172

180 183 198

207 207 215 221

226

Chapter 7. Symplectic Geometry and Mechanics §7.1. Symplectic Manifolds §7.2. The Darboux Theorem §7.3. First Integrals and the Moment Map §7.4. Completely Integrable Hamiltonian Systems §7.5. Formulations of Mechanics Exercises

229

Chapter 8. Elements of Statistical Mechanics and Thermodynam: .es §8.1. Statistical States of a Hamiltonian System

271

229 236 238 241

252 264

271

Contents

§8.2.

xiii

Thermodynamical Systems in Equilibrium

Exercises

Chapter 9. Elements of Electrodynamics §9.1. The Maxwell Equations §9.2. The Static Electromagnetic Field §9.3. Electromagnetic Waves §9.4. The Relativistic Formulation of the Maxwell Equations §9.5.

The Lorentz Force

283 292

295 295

299 304 311 317

Exercises

325

Bibliography

333

Symbols

337

Index

339

Chapter 1

Elements of Multilinear Algebra

Consider an n-dimensional vector space V over the field K of real or complex

numbers. Its dual space V' consists of all linear maps from V to K. More generally, a multilinear and antisymmetric map,

wk: Vx...xV-+K. depending on k vectors from the vector space V, is called an exterior (multilinear) form of degree k. The antisymmetry of wk means that, for all k vectors vl, ... , vk from V and any permutation a E Sk of the numbers { 1, ... , k}, the following equation holds:

wk(VQ(1), ....Va(k)) = sgn(a) i.Jk(VI, -..,Vk). Here sgn(a) denotes the sign of the permutation a. In particular, wk changes sign under a transposition of the indices i and j:

wk(i1i ...,Vi, ...,Vj, ...,Vk) = -wk(v1, ...,v ....,VI.....Vk). The vector space of all exterior k-forms will be denoted by Ak( V*). Furthermore, we will use the conventions A°(V') = K and A1(V*) = V. Fixing an arbitrary basis e1, ... , e in the n-dimensional vector space V, we see that each exterior k-form wk is uniquely determined by its values on all k-tuples of the form el,,... 441 where the indices are always supposed to be strictly ordered, I = (i1 < ... < ik). On the other hand, a k-form can be defined by arbitrarily prescribing its values on all ordered k-tuples of basis vectors and extending it to all k-tuples of vectors in an antisymmetric and I

1. Elements of 1llultilinear Algebra

2

multilinear way. The number of different k-tuples of n elements is equal to Thus we conclude (k) = k.

Theorem 1. If k > n, Ak(V*) consists only of the zero map. For k < n the dimension of the vector space Ak(V*) is equal to

dim (nk(V`)) _ (k) Exterior forms can by multiplied, and the product is again an exterior form.

Definition 1. Let wk E Ak(V*) and 7/ E A1(V') be two exterior forms of degrees k and 1. respectively. Then the exterior product wk n 171 is defined as a (k + 1)-form by the formula 1

wknll!(t'1....L'k+l) = k!l! oESk+j

Esgn(v)wk(v,(1),...Vo(k))11l(z'o(k+l)....vn(k+l))

Obviously. wk n,1 is a multilinear and antisymmetric map acting on (k + 1) vectors, i. e. of degeree k+l. The following theorem summarizes the algebraic rules governing computations involving the exterior multiplication of forms.

Theorem 2. The exterior product has the following properties:

(1) (wi +w2)AY/=w1 nrft+w2n (2)

(3) (awk)A1)l=wkA(ar/)=a(wkAr/)foranyaEIfs: (4) (wkAgl)A m=wkA(r11A '); (5) wk A 111 = (-1)k'711 n wk.

Proof. Only the last two formulas require a proof, in which we shall omit some of the upper indices for better readability. First (k+l)!m! (wk A11l) Aµ/'"(v1, ...,vk+l+m)

1:

sgn(a)(wk A 711)(v,(1), ... , v,(k+1))1f (vn(k+1+1).... , vs(k+l+m))

f7E Sk+l+m

Decompose the permutation group Sk+l+m into residue classes with respect to the subgroup Sk+l C Sk+1+m formed by all permutations acting as the

identity on the last m indices {k + 1 + 1, ..., k + l + m}. Each residue class R thus consists of all permutations a E Sk+l+m with fixed values Q(k + I + 1), ..., v(k + I + m). Fixing any permutation ao E R. all the remaining elements a E R are parametrized by the elements in Sk+l: or = a0 0 7r,

7r E Sk+l

3

1. Elements of Multilinear Algebra

Hence,

E sgn(o)(wk A rll)(Va(l),

--- ,

Vo(k+l))11m(Va(k+1+l), - - - , Va(k+l+m))

oER

rsgn(oo)sgn(ir)(w A r/)(Vaoo,r(1), --Vooo,,(k+())1L(Vao(k+t+1), ..Vao(k+t+m)) ,rESk+i

= sgn(ao)(k + l)!(w A 17)(vao(1), ... , Voo(k+1))1J(VCo(k+1+1), ... , Vao(k+1+m))-

Using now the definition of the exterior product wk A 711, this leads to the formula k! 1!

ERsgn(a)(w A i))(va(1),

vo(k+l+m))

(k + l)! a

=

sgn(o)w(va(1),

... va(k))1T(t'a(k+1) ... V0(k+l))1-(Va(k+1+1), - - -Va(k+l+m))-

aER

In order to compute the sum over the whole group Sk+l+m, we sum over all the residue classes R and, simplifying the scalar factors, we obtain the equation (k! l! m!) - (wk A ill) A pm (v1, ... , tk+l+m )

E sgn(a)w(va(1), ... Vo(k))17(Vo(k+1),

--

Vo(k+l))1(Va(k+1+1), .

Vo(k+l+m))-

Sk+I+m

This shows the associativity of the exterior multiplication of forms. The last 0 formula (5) is proved analogously.

Definition 2. The exterior algebra A(V') of the vector space V is formed by the sum of all exterior forms

A(V*) = k=0 E endowed with the exterior multiplication A of forms as multiplication.

Next we will construct an explicit basis of the vector spaces Ak(V*). To

do so, start from any basis el, ... e of V and denote by a, ... , o the dual basis of the dual space V' _ A' (V*). For an ordered k-tuple of indices (k-index for short) I = (i1 < ... < ik) let of denote the k-form defined by the formula

of := ail A ... A o;k . Obviously, for a fixed k-index J = (j1 < ... < jk),

ol(ejl, --.,eik)

J0

if 196 J,

t 1 if I = J.

In particular, the k-forms of are linearly independent in Ak(V'). For dimensional reasons, this immediately implies


4

Theorem 3. Let e1, ... , en be a basis of the vector space V, and denote by

ol.... , o its dual basis in the dual space V. Then the forms o,,1 = (i1 < ... < ik), are a basis for the vector space Ak(V*). Exterior forms can be pulled back under a linear map. In fact, if L : W -+

V is a linear map from the vector space W to the vector space V, and Wk E A(V') is an exterior k-form in V, then the formula

(L*wk)(wl, ...,Wk) := wk(L(wl),... , L(wk)) defines an exterior k-form (L'wk) E Ak(W`) in the vector space W. Passing from the form wk to the induced form L*(wk) is compatible with all algebraic operations. In particular, the following formula holds:

L' (wk n,/) = (L*wk) A (L5,1) Furthermore, a vector can be inserted into an exterior form, and the result is an exterior form of one degree less. Let wk E Ak(V*) be a k-form on V and vo E V any vector. Define a (k- 1)-form (voJ wk) E Ak-1(V*) by the formula

(voJ wk)(v1, ...,vk-1) :=

wk(vp,V1,

.... Vk-1)

The (k -1)-form vo J wk is called the inner product of the vector vo with the k form wk, and will also be denoted by The antisymmetry of the k-form wk leads to the following relation for (k - 2)-forms:

vl J (vo J wk) = - vo J (vl J wk) . From now on, let V be a real vector space equipped with a non-degenerate scalar product g. This is a symmetric bilinear form,

g : V x V ---- R,

with the property that the linear map g# : V - V` from V to the dual space V' defined by

g#(v)(w) := g(v,w) is bijective. For a given basis e1, ... , e,, of V, the matrix

M(g) _ (g(e{, ei))ij-1 is symmetric and invertible. For brevity, its entries will be denoted by gig := g(ei, e,), the entries of the inverse matrix (M(g))-1 by g'j. Recall the following result, which goes back to Lagrange and Sylvester:

Theorem 4. Let g be a non-degenerate scalar product on the real vector space V. Then there exists a basis e1, ...,en in V such that the matrix


5

M(g) is diagonal, i. e. 1

0 1

M(g) = 0

-1 J

L

The number p of (+1)-entries as well as the number q of (-1)-entries in this diagonal matrix are independent of the particular basis. The pair (p, q) is called the signature of the scalar product g, and the number q is called its index.

First we extend the scalar product g to the spaces Ak(V*) of k-forms, keeping the same symbol for its extension. This continuation is done, relative to an orthonormal basis, by means of the formula k k = ilil k k ...g ikik wi(ei,, ...,eikw2(ei ...,eik)

g(wl,w2) -

g

it (wj(x) - O . wl(0)) dxl = >2wl(x)dx'. Poincare's lemma is but an existence result for the desired form 77k-1, and does not claim its uniqueness. For example, if 77k-1 is a solution of the equation d77k-1 = wk, the sum 77k-1 +dCk-2 also solves the equation for any (k - 2)-form k-2. Conversely, for any two forms ii-i and ,jr' satisfying d77i-1

= d772-1 = wk, we have d(771-1

- 772 1) = 0.

Then, by Poincare's lemma, there exists a (k-2)-form do-2 such that ii 772-1 = duo-2. In other words, 77i-1

-

-1-

772-1 +

and we conclude that the general solution can always be expressed as the sum of a particular solution and the derivative of an arbitrary (k - 2)-form. Example 7. Consider on 1R3 the closed 2-form

w2 = xydxAdy+2xdyAdz+2ydxAdz. We will determine, in two different ways, a 1-form whose derivative coincides with w2. Let us first explain the Ansatz method to be used. The 1-form 771

will be taken as 771 = f (x, y, z) dx + g(x, y, z) dy + h(x, y, z) dz,

2. Differential Forms in IR"

22

R still have to be determined. The

where the functions f, g, h : R3 exterior derivative is easily computed:

d'l'

ay] dx A dy + [82

[ax

az] dx A dz + I ah - ag] dy A dz. y

Hence the functions have to satisfy the conditions of

ag

Oh - Of X Y,

2

ah - Og y and anay Oz

2x

y' az 49Z Integrating, e.g., the first two with respect to x yieldss

ay

OX

g=

2 x2y

+ ,l

dx,

h = 2xy + , J

Inserting the result into the last condition,

2x = 2x + /

dx. 49Z

/

Oy dx - J a N dx,

we see that it is satisfied for function f; in particular, we may choose f = 0. Then g = 2x2y, h = 2xy, and hence, using 171

=

21

x2ydy+2xydz,

we obtain a solution, as is easily checked. The integration method computes the "primitive form" by means of the map P(w2) introduced in the proof of Poincare's lemma. In the example, this turns out to be the sum of 6 terms, namely

f

1

o t(tx)(ty)dt/J xdy - (J1 t(tx)(ty)dt) ydx l +2 (1 t(tx)dt J ydz - 2 (J 1 t(tx)dt I zdy

P(w2) = + C

ro

1

+2 Uo t(ty)dt I x dz - 2

(jI

t(ty)dt I z dx.

Each of the summands can easily be computed:

2-

1

I

2

2

2

2

2

P(w) = 4x ydy- 4xy2dx+ 2xydz - 2xzdy+ 2xydz - 2yzdx 1

2

2

2

2

4

(4xy + 2yz dx + 4x y- 2xz dy + 2xydz. Which of these two methods leads to a solution more quickly depends on the particular situation; since an integration has to be carried out in any case, it might not be possible to find an explicit elementary solution (just. as not every continuous function is elementary integrable).

2.3. Gradient, Divergence and Curl

23

2.3. Gradient, Divergence and Curl Each tangent space TpRn of the coordinate space is an oriented, euclidean vector space, and hence there is the volume form dRn(p)

E Ap(R")

as well as the Hodge operator /\p-k(Rn).

A (Rn) ---,

This allows us to associate with every differential form wk of degree k on Rn a corresponding (n - k)-form *wk defined by applying the *-operator pointwise, i.e., at each point p E Rn to wk(p). Consider an orthonormal basis e1, . . . , en of the space Rn and the corresponding coordinate functions x 1, ... , x". For a k-form wk = F w1dxr expressed in these coordinates the associated form is thus determined by the following formula: sgn

k *W

l... n

wt dxj.

J

Here, J = (jl < ... < jn_k) is the complementary multi-index to I = (i1 < < ik). The volume form dRn itself is simply

dRn = dx1 A ... A dxn . In addition, we have the possibility to pass from a vector field V to a 1-form wv, and vice versa. This is accomplished by

Definition 9. For a vector field V the dual 1 -form wv is defined by the equation

*wv := VJ dRn

.

If the vector field V = E V'8/8x' is expressed in cartesian coordinates, then the corresponding representation for w11 is obtained as follows: wV =

(-l)n-I *(V1dx2A...Adxn-V2dx1Adx3A...Adxnf...)

= V1dx1+...+Vndxn. This transition from vector fields to 1-forms will be used now to introduce the gradient of a function and the divergence and curl of a vector field in an invariant way.

Definition 10. Let f : U --, R be a C'-function defined on an open subset U C Rn. The vector field grad(f) associated with the 1-form df is called the gradient of the function f. The defining equation for the vector field grad(f) thus reads as follows: ' Wj1r.d(!) = grad(f) J dRn = *df .

2. Differential Forms in Rn

24

In the chosen coordinates we have

Of 0-+ -Of

a grad(f) = aT1 art + ... + cle axn Definition 11. The divergence of a C'-vector field V is the function determined by the equation

d(*;.,) = d(V J dRn) := div(V) - dRn . The formula

n

div(V) =

10 ax=

expresses the divergence of V through its components.

The next theorem contains a few simple properties of this operation.

Theorem 4. (1) Let f and g be C' -functions. Then

grad(f - 9) = f - grad(9) + 9 - grad(f) (2) For a function f and a vector field V of class C' the following -

identity holds:

div(f - V) = f div(V) + df (V) Proof. We only prove the second formula. By definition

div(f V) dRn = d(f (V J dRn)) = df A (V J dRn) + f d(V J dRn) = df A *Wv + f div(V) dRn = [41(V) + f div(V)J dRn. The last step makes use of the following equation, valid for every vector field V and any 1-form n1:

171 A *4 = n1(V) (W. Definition 12. Let f be a C2-function defined on an open subset U C Rn.

The divergence of the gradient of f is called the Laplacian o(f) of the function f :

0(f) = div(grad(f))

-

In the chosen coordinates the Laplacian is computed by the formula

o(f) _

02f

An immediate consequence of the identities in the preceding theorem is the formula

A(f -9) = 0(f)-9+f -A(9)+2(grad(f),grad(9))

2.3. Gradient, Divergence and Curl

25

In particular, there exists an additional operation acting on vector fields in dimension n = 3, the so-called curl. For a vector field V and the corresponding 1-form wv in R3, the form *d(wy) again is a 1-form and hence in turn defines a vector field, the curl curl(V). In short, we have the following

Definition 13. The curl of the vector field V is the unique vector field determined by the following condition:

dwy =: Curl(V) J dR3 = *'''curl(V) The definition of the exterior derivative &-'y dwy =

av2 _

avl

a21

Ox

dx1 ndx2+

Iav3 _

aV 1

dx1 ^d3+

a23

a21

[aV3 _ av21 dX AdX a22

ax3

leads immediately to the formula

curl(V) =

av3 _ av21 a [aV1 _ aV31 a [aV2 _ aV1 ]] a22 a23 J a21 + a23 all 5x2 + 5X1 aX2

a

The properties of the curl of a vector field are summarized in the following theorem.

Theorem 5. If the function f and the vector field V are of class C2 and defined on an open subset U C R3, then

(1) div(curl(V)) = 0 and curl(grad(f)) = 0;

(2) curl(f V) = f curl(V) + grad(f) x V, where x denotes the vector product in R3;

(3) if the curl of V vanishes, curl(V) = 0, and U is star-shaped, then there exists a function f such that V = grad(f); (4) if the divergence of V vanishes, div(V) = 0, and U is star-shaped, then there exists a vector field W such that V = curl(W). Proof. The first equations immediately follow from the fact that the square of the exterior derivative vanishes, dd = 0. In fact, we obtain

div(curl(V)) dIR3 =

ddwv = 0

and

wcurl(grad(f))

_

*dwgrad(f)

_ *ddf =

0

Claims (3) and (4) are special cases of Poincare's lemma. For curl(V) = 0 the associated form wv is closed. By Poincare's lemma there exists a function f

such that wv = df. Hence V is the gradient of the function f. Property (4) is proved similarly.

2. Differential Forms in It"

26

2.4. Singular Cubes and Chains We intend to develop the integral calculus for differential forms, and to do so, we need suitable sets as integration domains. First, in this chapter, we will only allow for subsets of W' which are higher-dimensional analogues of parametrized curves, so-called singular chains. The k-dimensional unit cube will be denoted by [0, 1]k,

[0, 1]k = {(x'....,xk)ERk:

05 x1,...,xk, lj

rwk

in U by

1kWk

Example 9. For k = 1 a singular k-cube is simply a parametrized Clcurve c : (0, 1] -i U C R'. In this case, the integral of the 1-form wl = p1dyl +... + p"dy" is called the line integnzl of wl along c. If cl, ..., c" are the component functions of c, the pullback of the form is written as

c'w'(t) =

pl(c(t)).

d dtt) dt + ... + pn(c(t)) -

dcn dtt)

dt,

and hence, in this situation, we obtain the following general formula for the line integral: dctt)1 dt. w' = 1 pi(c(t)). J

J

If the 1-form wl is the differential of a smooth function f, then by

c*(wl) = c*(df) = d(f o c) we obtain the following value for the integral over df :

J

df =

jdt)dt

=

In particular, the line integral of an exact 1-form depends only on the end points of the curve and not on its shape. It vanishes for a closed curve.

2.5. Integration of Differential Forms and Stokes' Theorem

31

Example 10. We will use the last remark to show that the winding form,

cal =

x2

+Y22 dx + x2 x y22 dy,

is not exact on R2 - {0}. To this end, consider the parametrization c(t) _ (cos 21rt, sin 21rt) of the circle by the interval [0.1]. Then

c'wl = 27r dt and

= 27r.

jw1

The integral of a k-form over a singular k-cube is, up to sign. independent of the parametrization of the cube. This is a consequence of a well-known transformation rule for higher-dimensional integrals: Let y, : U -' V be a diffeomorphism, and let f : V -+ R be integrable; /then

f(f o

)(y) . I det(D(y))I dy

JV

f (x)dx .

and 4 : [0, 1]A U of one and the same point Two parametrizations set in U differ by a diffeomorphisin cp : [0,11k [0, 1]k of the unit cube. 4 = ci o gyp. The determinant of the differential Dp(x) has constant sign, which will be denoted by e(,p). Theorem 7. In the above situation we have

Jk =

f wk cz"

Proof. By Theorem 1 the pullback of the form

( i)"wk =

F*(ci)"(wA)

= cp`(f dxlA...Adx") = (foip)'4p'(dx'A...ndx")

coincides with (c2)*wk = (f o cp)

dxl A ... A dx".

Thus the statement follows from the quoted transformation rule for n-dimensional integrals. 0

In case k = 1, changing the direction of a curve results in a change of sign for the line integral. Now we prove the essential result of this chapter, Stokes' theorem. This is a far-reaching generalization of the Fundamental Theorem of Calculus and comprises the classical integral formulas of the 19-th century as special cases. The proof to follow will, however, reveal to the careful reader that the core of the matter is the fact that integration and differentiation are mutually inverse operations.

32

2. Differential Forms in 1R"

Theorem 8 (Stokes' Theorem). Let wk be a differential form defined on the open subset U C IR", and let sk+l : [0, 1]k+l - U be a (k + 1)-chain. Then

wk = r

dk

Jgk+1

f.9$k+l

Proof. Since the integral is additive, it suffices to prove the formula for singular (k+1)-cubes. First we consider the standard cubelk+l : [0, 1]k+l R k+1 and represent the k-form wk on Rk+l as

_

k+l

fi d21 / ... / dxi A ... A dxk+1

wl~ = i=1

The derivative is then k+l

k = [(_1)1_1L]i dxl A ... A dxk+1 8xi

i=1

Hence, by the definition of the integral,

f

J

k+1

dwk

a

i-1

jk+1

xi dx

[0,11k

On the other hand, applying the maps IV Ql parametrizing the different parts of the boundary of the unit cube leads to the formula * ( Ik+l J.a ) >

wk--

f,(x , ..., X j 1

f

1

, a, Xj. . . ., xk) dx1 A . . . A dsk

from which we conclude that

(Ii I/

J

Ù)_

[0.1[k

`

k J

10111k

J J afi (xl, ..., dxj r

fl

J-1, t, xJ, ... , xk)dt

dxl ... dxk

o

[o.1[k!

adx. dxi [O,1Jk+l

By the definition of the boundary of Ik+1, this implies that

rf L L. kr+l

Jk w= ajk+l

J-1 Q-o'1Ik+1 li.Q)

kr+1

j= l

f dfx)'dx = [o.llk+l

fk lk+1

and hence we have verified Stokes' formula for the standard cube. For an arbitrary singular (k + 1)-cube ck+1 : [0, 1]k+1 U C 1R" we now use the

2.5. Integration of Differential Forms and Stokes' Theorem

33

fact that the exterior derivative commutes with the pullback of forms. We have

J

dWk

=

f (ck+ll*( Jk+1

Ck+1

l

l`

J

k+1

__

d((Ck+l)'Wk)

k)

J

J

jkJ+1

k+1

f

wk _ j=1 Q=0.1

ajk+1

wk

k+1 ` Wk

(c

+1

.)

k+1

wk j=1 a=0,1Ck+lolk+1

Jk.

= ask+l

U.u)

We already emphasized that the line integral of an exact 1-form is independent of the particular shape of the curve. As a first application of Stokes' theorem, we will prove that the line integrals of a closed 1-form along two different curves coincide if there exists a continuously differentiable deformation of one curve into the other leaving their initial and end points fixed. This leads to the notion of homotopy, which is fundamental in topology.

Definition 18. Two Cl-curves co, cl : [0,11 U C R" are called homotopic if there exists a C1-map F : [0,1] x [0,11 - U with the properties

F(t, 0) = co(t), F(0, s) = co(0) = c(0).

F(t,l) = cl(t), F(1,s) = co(1) = cl(1). The map F is called a homotopy between the curves co and cl.

Theorem 9. Let co, cl : [0,1] - U be two homotopic C1-curves, and let w1 be a closed 1 -form on U. Then the line integrals along co and cl coincide:

Lw1 = jwl l

Proof. Choose any homotopy F : [0,1]2 -> U between co and c1. This map F is, at the same time, a singular 2-cube, and hence the 2-form dw1 can be

2. Differential Forms in IR

34

integrated over F. By assumption we have dwl = 0, and the corresponding integral vanishes:

dwl. 0=IF

On the other hand, by Stokes' theorem, we have for the right-hand side

r JF

dw1 fJ w1 + J F

(t,O)

_ J (t,I) _ I O,s)

(1,s)

(

These four integrals are computed using the homotopy property of F. First, F(t, a) = ca(t) immediately implies, for a = 0,1, FlZt,a)w1

= cowl.

Moreover, F(a, s) is independent of s, and hence

Fl*(a,,)wl = 0. Lastly, these relations combine to yield the equation 1

0=

J0

1

cowl

- J0 cwl ,

and by the definition of the integral this concludes the proof.

0

We will now generalize these observations concerning line integrals to the higher-dimensional case. Let sk lj ch- be a singular k-chain with image set A C IR", and suppose that Oak = 0. The set A can, e.g., be a k-A defined on dimensional sphere Sk in R". Consider a smooth map f : A a neighborhood of A which is homotopic to the identity IdA. This is again supposed to mean that there is a smooth map defined on a neighborhood U of A x (0.11 in R"+1

F : A x (0,11 -+ A such that F(a, 0) = f (a) and F(a,1) = a. Then

Fo(skxIdlo,11) :=E is a singular (k + 1)-chain in R", and, because

Oak

= 0, the boundary is

OF o (sk x Id10,11) = sk - f o sk.

For a k-form wk defined on an open neighborhood of the set A C W', the following holds.

Theorem 10. If Osk = 0 anldf is homotopic to the identity, then I Jsk

k=J jock

wk

2.6. The Classical Formulas of Green and Stokes

35

Proof. We compute the difference using Stokes' theorem: f Wk

-

fk

k= J fosk

sk

f

=

OFo(skxId1o,11)

k

Fo(skxld[O,j])

The (k + 1)-form (sk x Id[o,I))`F*(dwk) vanishes. This follows from the implicit function theorem together with the assumption that A is the image of a k-dimensional chain. But this immediately implies the statement. 0

2.6. The Classical Formulas of Green and Stokes In this section, we will discuss the classical two-dimensional special cases of

Stokes' formula. Let D C R2 be a subset of R2 which can be represented as the image of a CI-map f defined on the standard cube [0, 112. By OD we denote the boundary of this set, considered as a singular 1-chain. The derivative of the 1-form w1 := (x dy - y dx)/2 is the volume form on R2. Hence

vol(D)

fdwl

2 aD

The formula transforms the calculation of a two-dimensional volume into the evaluation of a line integral, and this turns out to be a special case of Green's formula to be discussed now. Consider the 1-form wI

and compute its derivative:

dw I =

I

-

-

dx A dy. J

Thus we arrive at Green's first formula.

Theorem 11. Let P(x, y) and Q(x, y) be functions of class CI. Then

I

IOQ-8PJdxAdy. fD

D

L Ox

ay

Suppose now that P(x, y) and Q(x, y) are of class C2, and consider the 1-forms wl

P [-LQ

dx+ LQ dy]

and

qI := Q I-5y . dx + 5P dy]

The derivative of the difference wI - 771 is easily computed to be

d(w'-1 ) = Applying Stokes' formula leads to Green's second formula.

2. Differential Forms in W'

36

Theorem 12. Let P(x, y) and Q(x, y) be functions of class C2. Then

[PO(Q) - QO(P)]dx n dy =f [QLP - P

l dx + f P Q - Q J

8D

L

Pl dy. 1

Stokes' formula concerning certain surface integrals is, in a similarly simple way, a special case of the above general integral formula. In fact, let F C R3

be a subset of IIt3, which can be represented as the image of a Cl-map f defined on the standard cube [0,1]2 (a surface piece). By 8F we denote the boundary of this set considered as a singular 1-chain in JR3. Consider a vector field V defined on an open neighborhood of F. Its curl is defined by the following condition:

dwy := curl(V) J dllt3 = *wcurl(V)

Integrating the 2-form sweurj(V) over the surface piece F, we obtain Stoke theorem in its classical form.

Theorem 13. Let V be a smooth vector field on a neighborhood of F. Then

J

(V1dx1+V2dx2+V3dx3) =

OF

8V2

8V11

axl - axe I

dx Adx 2+IIr8V3 1

/

F 8V11

&,;y = IOV3

8V21

dx ndx3+ axe - 'ax-3 dx2nda3. axl - 5x3 I

F

Remark. Using the volume form dF of a regular surface piece yet to be discussed, the classical Stokes formula can be written more concisely as

Lcun1'1) dF. Here N denotes the normal vector to the surface in R3. We will return to this in Chapter 3.

2.7. Complex Differential Forms and Holomorphic Functions The complexification of the vector space of all real-valued forms is called the space of complex-valued forms on an open subset of R". Such a form can be split into its real and imaginary parts;

wk = wo + i Wk and differentiation as well as integration are defined with respect to this decomposition: d,jk

dwo + i

dwi, LkjkOLkI

.

2.7. Complex Differential Forms and Holomorphic Functions

37

In a similar way we extend the exterior product to complex-valued forms: W k A 771

:= (WO A 170 - W I A 911) + i (,oo A 7l1 + W I A 770) .

Then the previous computational rules and Stokes' theorem still hold. Now we want to apply complex-valued forms to study holomorphic functions and, to do so, first identify the real vector space R2 with the complex numbers C. From z := x + i y and z := x - i y we obtain the differential forms

dz :=

dz :=

and dzndz =

Let f be a complex-valued function with real and imaginary parts u and v of class C1, f (z) = u + i v. Denote by u=, uy, vv, vy the partial derivatives with respect to the corresponding variables. Then f (z) dz is a complex-valued differential form. We compute its differential: Now let f : U --+ C be a complex-differentiable function defined on an open subset of C. Elementary complex analysis starts by proving that its real and imaginary, parts are smooth functions in the sense of real analysis (Goursat's theorem). Furthermore, the Cauchy-Riemann equations hold:

ux=vy and uy= -vx. Theorem 14. If f (z) is a complex-differentiable function, then the 1-form f (z) dz is closed,

d(f (z) dz) = 0. An immediate consequence is Cauchy's theorem.

Theorem 15. Let U be an open subset of C, and let -y be a closed curve in U, which is the boundary of a singular 2-cube. Then the integral 0

vanishes for each complex-differentiable function f .

In a similar way we derive Cauchy's integral formula. To do so, we assume that f (z) is a complex-differentiable function on a neighborhood of the disc

K(zo,M) = {zEC:IIz-zoII Z - zo - zol12

-

=

e2

fK(zo (z - zo)f(z) dz. 0

2. Differential Forms in IR"

38

We compute the derivative of the 1-form (z - zo) f (z) dz: 2i -

Thus

J

= K(zo,M)

2i. E2

Z - ZO

f(x) dandy. K(zo.e)

The mean-value theorem of integral calculus states that there is a number ze in K(zo, e) for which

f (z) dx n dy = f (ze)vol(K(zo, e)) = 7r `2 f (ze) K(zo.e)

If e tends to zero, then f (ze) converges to f (zo), and we arrive at Cauchy's integral formula:

Theorem 16.

f(zo) =

1

27ri JaK(zO,jvf)

z - zo

This formula is fundamental for the theory of functions. It implies, e.g., that every function which is complex-differentiable in the neighborhood of a point can be expanded into a power series (i.e. is an analytic function).

2.8. Brouwer's Fixed Point Theorem A fixed point of a map f : X --+ X from a set to itself is defined to be a point xo which is not moved by f, f (xo) = xo. In topology, several fixed point theorems are known. They state that certain continuous maps from a metric space to itself necessarily have at least one fixed point. If, e.g., X is a complete metric space, and f : X -p X is a contracting map, then by the Banach fixed point theorem the map f has at least one fixed point. There are, of course, (non-contracting) maps from a complete metric space to itself without fixed points; translations in R" are examples for this. A topological space X is said to have the fixed point property if every continuous map f : X - X from X to itself has a fixed point. This is obviously a topological property, i. e., homeomorphic spaces either all have the fixed point property or none of them has it. The unit circle X = S' is compact; nevertheless it does not have the fixed point property; rotations are continuous maps from the unit circle to itself without stationary points. Brouwer's fixed point theorem states that the closed n-dimensional ball of radius R,

D"(R) = {xER": IIxii IIx - f(x)II

shows that F is smooth. Moreover. F acts on the boundary of the ball as

the identity, F(x) = x for all x E S. Let Fl.... , F" be the components of F. Differentiating the following relation, which is valid for all x E

Dn.

n

E(F'(x))2 = 1, i=1

yields

(Fiaa*5) Idxi = 0,

2

ij=1

i=1

and hence for each index j j:Fi(x)8F11'(x) 11T1

= 0.

i=1

Therefore, the system of equations

i=o

8P (x) = 0 Ox)

has a non-trivial solution (al, ... , an) = (Fl (x), ..., F"(x)) j4 (0.... , 0). Hence the determinant of the matrix det

0

vanishes. Now we apply this observationto the differential form

Wn-' = F1 A dF2 n ... A dF" and conclude that its differential vanishes: i

\

&,n-1 = dFlAdF2A...AdF" = det(aa2 )dx'A...Adx" = 0.

2. Differential Forms in 1t"

40

By Stokes' theorem the integral of the form w' of the singular cube D" is equal to zero:

0=

-

1

do-1 =

n-1

jn-1

JD-

On the other hand, F acts on the sphere wn-1ISn-I

over the boundary Sn-1

S"-1

as the identity; hence

= x1 dx2 A ... A dxnlSn-i .

This implies

0=J

x1 dx2 A

r ... A din = J dxl A ... A dx" = vol(D"), Dn

n -I

0

and we arrive at a contradiction.

Theorem 18 (Brouwer's Theorem). Every continuous map f : D"

Dn

from the n-dimensional ball to itself has at least one fixed point.

Proof. The proof will be reduced to the case of a C1-map by applying the Stone-Weierstrass approximation theorem (see, e.g., [Rudin, 19981, Theorem

7.32). The ball D" is compact. Consider the ring C°(D") of all real-valued continuous functions on it, as well as the subring R of those functions which are the restriction to D" of a C1-function with strictly larger, open domain of definition. Obviously, the subring R contains the constant functions and separates points. By the Stone-Weierstrass theorem, it is dense in C°(U). Applying this to the components f 1, ... , f" of f, we conclude that for each e > 0 there exists a C1-map

p : D' - R" such that IIf (x) - p(x)II < e for all x E D'. Consider the renormalized map P(x) := p(x)/(1 + s). Since

IIP(x)II - IIf (x)II 5 IIP(x) - f (x)II < e and IIf (x)II < 1, we have IIP(x)II 5 1 + e; hence IIP(x)II 5 1. Therefore, P is a map from the ball to itself. Moreover, P can be estimated against f : IIf (x) - P(x)II 5 IIf (x) - P(x)II + IIP(x) - P(x)II 5 e + IIP(x)II 11

- 1+

< e+(1+e)1+e < 2e. Summarizing, we have proved that for each e > 0 there exists a C1-map P : D" D" satisfying for every x E D" the estimate

IIf(x)-P(x)II 0.

i=1 Then the system of equations

91(x) = 92(X) _ ... = 9n (x) =

0

has at least one solution in D'(R). Proof. We combine the functions to define a map g : Dn(R) --+ Rn, g(x) := (91(x), .. , 9n(x)). If g(x) 0 0 holds for all points x E Dn(R), we can consider the map f : Dn(R) -p Sn-1(R),

f (x) := -R

g(x) 119(x)11

whose image lies in the sphere Sn-1(R). By the fixed point theorem, f has

a fixed point in S"-'(R). Hence there exists a point xo E Sn-1(R) such that

xo = -R 9(xo)

119(xo)11

This implies R2 II9(xo)II = -R (g(xo), xo), contradicting the assumption of the theorem.

In particular, the assumption of the theorem is satisfied for gi(x) = xi + hi(x) if the functions hi : R' -+ R grow more slowly than linear forms, Ihi(x)I < Ci 11x11" Under this condition the sum n

n

E 9i(x)xi = R2 + c` hi(x)x' i=1

i=1

2. Differential Forms in R"

42

behaves like R2 on the sphere S"-'(R) and becomes positive for sufficiently large radii.

Corollary 1. The system of equations hl(x) = x', ..., h"(x) = x"

has at least one solution f o r arbitrary continuous functions bounded by I at infinity.

I

I'

Example 11. The system of equations x = ' 1 + x2 + y'2. y = cos(x + y) has in l 2 at least one solution, e.g.,

x = 1.2758079,

y = 0.14722564.

Example 12. Consider in 1R2 the following system of equations:

= 0, 92(x, y) = y + e-(I-U)2 = 0. 91(x, y) = x + The picture below diaplays the graph of the function g, (x, y) x + 92(x. y) y e-(s+y)2

over the set [-2.2] x [-2,2]. It shows that this function is positive on the circle S' (2) of radius 2. Hence the above system of equations has at least one solution in the disc D2(2). A numerical computation of the solution leads to the values

x = -0.303122,

y = -0.789407.

43

Exercises

Exercises 1. Let f :

1R2 - {0} R2 - {0}, f (r, 0) = (r cos 0, r sin 0), be the polar coordinate map on the "punctured" plane. Prove:

a) The winding form satisfies f

dx) = d9;

b) the radial form x dx + y dy satisfies f ` (x dx + y dy) = r dr.

2. Consider on R2 - {0} the winding form wl = 114.-1., as well as the following family of curves depending on the integer parameter n E Z:

c:

[0, 11 -

. R2 - {0},

(cos2ant,sin2nrnt).

Compute the line integral of the winding form along the curve c,,, and conclude that the curves c,l are not homotopic in R2 - {O} for different values of the parameter n. 3. Compute the exterior derivative of the following differential forms:

a) xydxAdy+2xdyAdz+2ydxAdz; b)

z2dxAdy+(z2+2y)dxAdz;

c) 13 x dx + y2dy + xyz dz;

d) e' cos(y) dx - e' sin(y) dy;

e) xdyAdz+ydxAdz+zdxAdy. 4. Consider 1R2n with coordinates x1, ... , x2n and the following differential form of degree 2: w2 = dx' n dxn+l + dx2 A dxn+2 + ... + dxn n dx2n . Prove:

a) The form w2 is closed;

b) the n-th exterior power of w2 is related to the volume form via the formula w2 n ...

n w2 = (-1)n(n-1)/2n! dxl A ... A dx2n .

5. The subject of this exercise is to explain why it makes sense to denote the vector fields x -+ (x, e;), defined using the standard basis e1, . . . , e,, of

2. Differential Forms in R"

44

R", by a/8x'. Writing these vector fields for the moment as E;, every vector field can be written in the form n

V= For each function f : U

R on an open set U of R" we define a new

function. the derivative of f in the direction of the vector field V, by

(V(f))(x) := (Df=)(V(x)) Prove the formula

n

V(f) _i=108xi (fl which by omitting the argument f provides the explanation asked for. 6. Prove the following rules for vector fields on )1t3:

a) div(V1 x V2) _ (curl(Vl),V2) - (VI,curl(V2)); b) curl(curl(V)) = grad(div(V)) - 0(V), where the Laplacian is to be applied componentwise to V.

7. Compute the line integral

(x- 2xy)dx + (y2 - 2xy)dy JC2 along the curve C = {(x, y) E R2 : x E [-1.1), y = x2}.

8. Compute the line-integral sin(y)dx + sin(x)dy, IC where C is the segment joining the points (0, x) and (n. 0). 9. Consider on R3 the differential form w2 = y dxAdy. Determine all 1-forms 17 1 = p dx + q dy satisfying dal = w2.

10. Prove the following converse to one of the statements of Example 9: If wl is a 1-form defined on the open set U, and if the line integral of wl is independent of the curve, then wl is exact. Hint. Prove this by explicitly constructing a "primitive function". In physics, the vector field V corresponding to the 1-form wy is called conservative, if it does no work along any closed curve -y. 11 wy = 0. Thus, V is conservative if and only if w1, is exact.

45

Exercises

11. Consider the singular 2-cube, f : [0, 2ir] x [0, 27r] -r S2 C R3 - {0}, f (u, v) _ (cos u sin v, sin u sin v, cos v),

as well as the 2-form w2 = (xdyAdz+ydz Adx+zdxndy)/r3 on 1R3 - {0}, where r = (x2 + y2 + z2)1/2 denotes the distance from the origin. a) Prove that w2 is closed; b) compute the integral of w2 over f ; c) conclude from the properties just proved that w2 is not exact on 1R3-{0}, and that there is no singular 3-chain c3 in 1R3 - {0} whose boundary

equals f, 0c3 = f. 12. Prove that the integral defines a unique bilinear map HDR(U) X Hkub(U)

R,

([WI], [s'])

-- J

Wk.

gk

13. Let w1 = f (x) dx be a 1-form on the interval [0, 1] with f (0) = f (1). Prove that there exist a real number p and a function g with g(0) = g(1) by means of which wl can be written as

W = pdx+dg. 1

14 (Continuation of 13). Let 771 be a closed 1-form on R2 - {0} and wl the winding form. Prove that there exist a number p as well as a function g : 1R2 - {0} -+ R for which 77 1 =

pwl +dg.

Consequently, the winding form is the generating element of the first de Rham cohomology of R2 - {0}. Hint. Consider the polar coordinate map f from Exercise 1 and its pullback f `wl. This can be written as f `wl = A(r, O)dr + B(r, B)dO; here B(r, O)dO is a 1-form on [0, 27r] (depending on the parameter r) to which the previous exercise applies. 15. Consider on 1R3 the following exact differential form known from 7:

W2 = xydxAdy+2xdyAdz+2ydxAdz, and the upper half-ball A C S2:

A = {(x,y,z)EIR3: x2+y2+z2=1, z>0}. Prove that the integral of w2 over A vanishes.

2. Differential Forms in R°

46

16. Let C be the circle in R2 with the equation x2 + (y - 1)2 = I in its standard parametrization. Compute the line integral

f xy2 dy - yx2 dx a) directly;

b) using Green's formula.

17. Let E be the ellipse with the equation x2/a2 + y2/b2 = 1 (a > 0, b > 0) in its standard parametrization. Compute by means of Cauchy's integral formula the integral

rJE dz

z

and obtain from this the value of the integral 12 dt Jp

a2 cos2(t) + b2 sin2(t)

18. Let 7-1 be an infinite-dimensional Hilbert space, and D = {x E l I IxI I < 11 its unit ball. Does D have the fixed point property?

19. A subset A C X of a metric space X is called a retract of X, if there is a continuous map r : X --. A such that r(a) = a for all points a E A. Prove that if X has the fixed point property, then so does every retract A of X.

20. The set A = { (x, y) E [-1,1]2: xy = 0 } has the fixed point property.

Chapter 3

Vector Analysis on Manifolds

3.1. Submanifolds of Rn In Chapter 2 we introduced an integration method for differential forms over sets which can be represented as images of singular chains. These sets, however, may be quite irregular, and it is rather difficult to develop a differential calculus for functions defined on them. Further notions like tangent space, vector field, etc., are not available either in their context. Hence we will now restrict the possible subsets of Rn to a class for which a differential as well as an integral calculus can be established in a satisfactory manner. These sets are called manifolds, and they are-intuitively speaking-characterized by the fact that their points can be defined locally in a continuous (differentiable) way by finitely many real parameters, that is, locally these sets look just like euclidean space. It was the fundamental idea of B. Riemann in his Habilitationsvortrag (1854) to introduce the notion of a manifold as the new basic concept of space into geometry. In physics, manifolds occur as configuration and phase spaces of particle systems as well as in field theory. The precise description of what a submanifold of euclidean space is supposed to be is the content of the following definition.

Definition 1. A subset M of Rn is called a k-dimensional submanifold without boundary if, for each point x E M, there exist open sets x E U C R"

and V C R' as well as a diffeomorphism h : U

V such that the image

h(U n M) is contained in the subspace IRk C R :

h(UnM) = Vn(IRkx{O}) = {yEV: yk+1-

=yn=0}. 47

3. Vector Analysis on Manifolds

48

The set U* := UnM together with the map h' := hlu.: U* -- V' := VnRk is called a chart around the point x of the manifold. The sets U* and V' are open subsets of M and Rk, respectively (see next page). V Rn

v n (Rk x {o})

h-1

A family of charts covering the manifold M is called an atlas. The notion "diffeomorphism" can be understood in the sense of an arbitrary C'regularity (1 > 1). Correspondingly, we have manifolds of regularity class CL. For simplicity, we suppose in this chapter that all maps, manifolds, etc., are of class CO°, and for this reason we will simply talk about smooth maps, manifolds, etc. Note that, without any change, all the statements also hold assuming only C2-regularity.

For any two charts (h', U') and (hi, Ur) of the manifold for which the intersection u* n u; is not empty, one can ask how they are related. The map h'o(hi)-1:

is called the chart transition from one chart to the other. The sets h1(U` n Ul) and h' (U' n Ul) are open subsets of the coordinate space Rk, and the transition function h' o (hi)-1 is obviously a diffeomorphism.

The notion of dimension also needs to be explained for manifolds. If a subset of Rk is mapped homeomorphically onto an open subset of the space R, the dimensions of both coincide, k = 1. Under the additional assumption of

49

3.1. Submanifolds of 1R

differentiability, which will always be made here, this is easy to prove: the differential of a diffeomorphism at an arbitrary point is a linear isomorphism between the tangent spaces T1IRk and TyR', and this immediately implies k = 1. The corresponding fact for homeomorphisms is a deep topological result going back to Brouwer (1910). In any case, the number k occurring in the definition of a manifold is uniquely determined and will be called the dimension of the manifold. Sometimes we will write the dimension of a manifold as an upper index, i.e., denote the manifold also by Mk. In the first theorem we will prove that, under certain conditions, subsets of 1R' defined by equations are submanifolds. This will give rise to plenty of examples.

Theorem 1. Let U C lRn be an open subset, and let f : U - Rn-k be a smooth map. Consider the set

M = {x E U : f (x) = 0} . If the differential D f (x) has maximal rank (n - k) at each point x E M of the set M, then M is a smooth, k-dimensional submanifold of 1R' without boundary.

Proof. The proof is based on a straightforward application of the implicit function theorem. If xo E M is a point from M, then there exist an open neighborhood xo E Uxo C U and a diffeomorphism hxo : Uxo -' hxo(Uxo) C Rn such that the map f o hzp : hxo (Uxo) - Rn-k is given by the formula

f oh=o (x1, ...,xn) = (xk+1, ...,x'). This implies Uxo n M = h=o (hxo (Uxa) n ]R' ), and hence the chart around the 0 point xo E M we asked for is constructed.

Example 1. Every open subset U C Rn is an n-dimensional manifold without boundary.

Example 2. The sphere Sn = {x E Rn+1 : IIxUI = 1} is an n-dimensional manifold. To see this, we consider the function f : Rn+1 -, R defined by f (X) = I Ix1I2 - 1. Then we have D f (x) = (2x', ..., 2xn+1), and the rank of the (1 x n)-matrix D f (x) on Sn is equal to 1. Theorem 1 implies that Sn is an n-dimensional manifold.

Example 3. Consider a smooth map f : Rn ]Rm and its graph G(f) = {(x, f (x)) : x E Rn} C 1Rn+m G(f) is the zero set of the map 1 : IRn+m = lRn x 1R1Rm defined by O (x, y) = f (x) - y. The differential D4 has maximal rank equal to m at each point. Therefore, G(f) C Rn+m is an n-dimensional manifold.


50

Example 4. The torus of revolution is the surface in R3 described by the equation (0 < r2 < rl ) (

x2 + y2 - rl )2 = r2 - z2 .

A parametrization can be obtained by the formulas x = (r1 + r2 cos cp) cos V,,

y = (r1 + r2 cos cp) sin 0,

z = r2 sin cp

with parameters 0 < cp, ' < 2ir. The partial derivatives of the function

f(x,y,z) = ( x2+y2-rl)2-r2+z2 are

Ox -

2x(1

- x-+y2 ), 21

of - 2y(1 y

+y

VI'X21

2),

Oz

= 2z,

and it is obvious that the vector D f does not vanish at any point of the torus of revolution. Hence Theorem 1 applies and shows that the above equation defines a manifold.

Example 5. Not every set defined by an equation is a manifold. For example, consider the set in R2 described by the equation x4 = y2:

Near the point (0, 0) E ]R2 this set is not a manifold. In fact, after having deleted this point, any neighborhood of the set splits into four components and hence cannot be homeomorphic to an interval. Indeed, the assumption of Theorem 1 concerning the differential of f (x, y) = x4 - y2 is not satisfied at the point (0, 0) E R. 2

3.1. Submanifolds of R"

51

Example 6. On the other hand, there exist manifolds in R" which cannot be described by systems of equations satisfying the assumptions of Theorem 1. Later we will see that every manifold defined as in Theorem 1 has a particular property-it is orientable. An example of a non-orientable manifold is the so-called Mobius strip. One of its parametrizations is

x = cos(u)+vcos(u/2)cos(u), y = sin(u)+vcos(u/2)sin(u), z = vsin(u/2) with parameters 0 < u < 2ir, -7r < v < 7r.

Apart from equations, manifolds can also be defined by prescribing their local parametrizations (charts). Let us explain this construction principle. Theorem 2. Let M be a subset of R" and assume that for each point x E M there are an open set U, x E U C R", an open set W C Rk, and a smooth map f : W U such that the following conditions are satisfied:

(1) f(W) = M n U; (2) f is bijective;

(3) the differential Df(y) has rank k at each point y E W;

(4) f'1 : M n U W is continuous. Then M is a k-dimensional submanifold without boundary in R.

Proof. For an arbitrary point x E M we choose a map f : W -i U with the stated properties and denote by y the pre-image of x, f (y) = x. The differential has rank k, and hence we can assume without loss of generality

that

1r 9k 0.

det

4

ak

Consider the map defined by g(a, b) := f (a) + (0, b) with g : W x R"-k -+ R". Then the determinant of the differential of g coincides with the determinant

above, and hence, in particular, it is different from zero. Applying the


52

inverse function theorem of differential calculus, we obtain two open sets V2 V1 and V2. with (y, 0) E V1 and x E V2 in R", for which g : V1 is a diffeomorphism. We invert this map and denote the resulting inverse diffeomorphism by h := g-' : V2 -+ V1. By assumption f is continuous, and hence there exists an open set 0 C R" such that

{f(a):(a,O)EV1} = f(w)no. Consider now the sets V2 := V2 nO and V1 = g-'(V2). Then we have

V2nM = V2nOnM = {g(a,0):(a,0)EVII, and thus we obtain

h(V2nAf) = g-1(V2 nM) = V1 n(Rk x 101) Therefore, the condition to be satisfied for each point of a manifold holds for A1.

Now we will extend the notion of manifold, taking into account also bound-

ary points. We confine ourselves to the case that the boundary itself is a smooth manifold without boundary (no corners or edges). We define the k-dimensional half space Hlik to be the set Hk

= {xERk:xk>O}.

Definition 2. A subset M of R" is called a k-dimensional submanifold (with boundary) if for each point x E M one of the following conditions is satisfied:

(1) There exist open sets U and V, with x E U C R", V C R", and a diffeomorphism h : U -+ V such that

h(U n M) = V n (Rk x {0}) . (2) There exist open sets U and V, with x E U C R", V C R", and a diffeomorphism h : U -+ V such that

h(UnM) = Vn(Hk X {O}) and the k-th component hk of h vanishes at the point x, hk(x) = 0.

Conditions (1) and (2) cannot be satisfied at the same time for one and the same point x E M, for otherwise there would exist diffeomorphisms h1 :U1 -iV1, h2 : U2 --+ V2 such that

h1(UInM) = VlnRk and h2(U2nM) = V2nlEllk, h2(x)=0.

3.1. Subinanifolds of R"

53

The set hl (U1 n U2) then would be an open subset in IRk mapped diffeomorphically onto h2(Ul n U2) by the chart transition map h2 o h, 1. Since h2(x) = 0, the set h2(U, nU2) would thus contain a point from the boundary 8Hk = Rk-4 of the half-space. Consequently, it could not be open in Rk. Altogether this contradicts the inverse function theorem. This observation justifies the following

Definition 3. Let M C 1[t" be a manifold. A boundary point of M is a point x E M for which condition (2) of Definition 2 is satisfied. The set of all boundary points is denoted by 8M and called the boundary of M. Theorem 3. Let AEI be a k-dimensional manifold. Then its boundary OM is either empty or a smooth (k-1)-dimensional manifold without boundary,

88M=0. Proof. Fix a boundary point x E OM and choose open sets U C R n. X E U, and V C Ht" with

h(U n m) = V n (Hk x (01). For every other boundary point x' E U n OM the k-th component of h has to vanish at x` by the preceding observation, hk(x*) = 0. Hence we have

h(U n 8M) = V n (IItk-' x {0}), and thus (h IunaAf, U n OM) is a chart for the boundary 8M.

Example 7. The boundary of the Mobius strip is a closed space curve.

Example 8. The (n - 1)-dimensional sphere Sn-' is the boundary of the n-dimensional ball D".


54

3.2. Differential Calculus on Manifolds When a manifold is covered by charts, every chart range is an open subset of R" and hence a set on which differential calculus is familiar from analysis. In this way it is possible to develop a differential calculus on manifolds. As in the preceding section, we will now start from a k-dimensional manifold Mk and a chart h : U V around a point x and denote by y := h(x)

the image of x under this chart map. Then h-1 : V -{ U is smooth and (Dh-')y = (h-1).,y is a linear map between the tangent spaces to R" (compare Definition 2, Ch. 2)

(h-1).,y:

TyR" - T=R" .

Definition 4. The tangent space of the manifold Mk at the point x is defined to be the image of TVRk under the map (h-1).,y: Trllfk

(h-1).,y(Ty %)

C .,R" .

The tangent space T?Mk is a k-dimensional vector space, since the differential of the diffeomorphism h-1 is injective. Moreover, we have to check that the tangent space of the manifold just defined does not depend on the choice of the chart. But this is an immediate consequence of the equivalent description for the tangent space that is to follow next. Theorem 4. The tangent space TTMk consists of all vectors (x, v) E T=R" for which there exists a smooth curve y : [0, ±E) - Mk C R" such that -y(0) = x and y(0) = v. Proof. A vector v = (x, v) E TXMk in the tangent space can be represented

as v = (Dh-'),(w) for a certain vector w E Rk. The image under h' of the straight line in Rk through y in the direction of the vector w is the curve 7(t) we were looking for: the equality

-t(t) = h-1(y + tw) immediately implies -y(O) = h-1(y) = x, and from the chain rule we obtain for the tangent vector

dt (h-1(y + tw)) l t=o = (Dh-'),(w) = v.

ddtt) = The converse is proved analogously.

If the manifold Mk is defined by (n - k) equations, and if, in addition, the differentials of the defining functions are linearly independent, then the tangent space has a simple description.

3.2. Differential Calculus on ?Manifolds

55

Theorem 5. Let fl, ... , fn_k : 1R" , 1R be smooth functions and suppose that

00.

df1 A ... A

Then the tangent space TTMk of the manifold Mk

= {xER":

f1(x)=...=f,,-k(:r.)=0}

consists of all vectors v E T11R" satisfying

df1(v) = ... = dfn-k(v) = 0. In particular, the euclidean gradient fields grad(f1), ... , grad(fk) are perpendicular to the tangent space of the manifold at each point of :11k.

Proof. Taking a curve

r)

Alk in 11k and differentiating the

equation

f1('r(t)) = ... = f"-k(r(t)) = 0 with respect to the curve parameter t yields

df1(i(t)) _ ... = df"-k(i(t)) = 0. The tangent space TA1k is thus the subspace of all those vectors v E T IR" on which all the differentials df1, . df"-k vanish. Comparing the dinlensions of these two vector spaces shows that they have to coincide.

The set of all tangent spaces to the manifold is called the tangent bundle of Alk and denoted by TMk. It is a manifold of dimension 2k. In fact, at least in the case that A1k is determined by equations, f1 = . . . = f"_k = 0, the set TAlk is in turn defined as a subspace of 1R" x 1R" by the equations

fl (:r) = ... _ .fn-k(x) = 0 and df1(x. v) = ... = df"-k(x, v) = 0. These are 2(n - k) conditions in R2". The corresponding functional determinant (Jacobian) does not vanish, since the differentials df1 are linearly independent. The formula (x, v) := x defines a projection 7r : TMk . Alk on the tangent bundle of the manifold assigning to every vector its base point.


56

Example 9. Consider the sphere S' = {x E R"+1 :IIxII = 1}. The differential of the function IIXI12-1 is 2(x1,

xn+1) and hence the tangent space to the sphere at any point consists of all vectors perpendicular to this point: =

TS" = {(x, v) E R"+1 x Rn+1: IIxII =1, (x, v) = 01.

Definition 5. Let Mk C R' and N' C R' be two manifolds, and let f : Alk - N' be a continuous map. We call f a differentiable map if for each chart h-1 : V - Mk of the manifold Mk the resulting map f o h-1 V -. N' C R' defined on the open subset V C Rk is differentiable. As in euclidean space, the differential of a smooth map can be introduced as a linear map between tangent spaces. For a tangent vector (x, v) E TZMk we choose a curve y : [0, e) - Mk with 7(0) = x and y(O) = v. The composition f o y(t) is a curve in N1 passing through f (x) E N', and its tangent vector

describes the result of applying the differential of f to the tangent vector (x, v),

f.,x (X, V) :=

(f(x),

dtf o y(0)

I

The differential of a smooth map between two manifolds has all the properties which are familiar from euclidean space.

Theorem 6. The differential f.,2 : TXMC -+ Tj(t)NI of a smooth map is a linear map between the tangent spaces, and the differential of the superposition of two smooth maps f and g is equal to the superposition of their differentials,

(g o f)..

= 9.,f1=) ° f*.x

The last formula is the generalized chain rule.

Definition 6. A vector field V on a manifold Mk assigns to every point x E Alk a vector V(x) E ,,Mk in the corresponding tangent space.

If the map V : Mk - TRn = Rn x R" is smooth, then we will speak of a smooth vector field on the manifold. Vector fields can again be added and multiplied by functions, so that the vector space of smooth vector fields is a module over the ring Coo(Mk) of all C°°-functions on Mk.

Example 10. The formula V(x) = (x, (x2,-x1,0)) defines a vector field on the 2-dimensional sphere (see the figure on the next page).

3.2. Differential Calculus on Manifolds

57

Mk C R', which this time and and sometimes For a chart map h : V also later will be denoted by h instead of h-1, h.(a/ayi) are vector fields tangent to Mk defined on the subset h(V) C Mk, and they provide a basis in each tangent space. For simplicity and as long as it is clear to which chart we refer, these vector fields on the manifold will also be denoted by a/ayi. On the subset h(V) C Mk. every other vector field V can be represented as their linear combination k

V(y) _

Vi(y)5 y ii

i=1

Here V2(y) are functions defined on the set h(V); using the chart map. now and then they will also be considered as functions on the parameter set V. These functions are called the components of the vector field V with respect to the fixed chart. Example 11. In euclidean coordinates on 1R2, consider the vector field V=x15x2-x25x1

depicted on the next page. Introducing in R2 - (0} polar coordinates by the formula

h(r,p) = (r cos V, r sin so),

0 < r < oo, 0 <
ro the set V is disjoint from the boundary, V n OMk = 0. From >i vi = 1 we obtain >i dcpi = 0 and use this to rewrite the exterior derivative of k-1

=

wk-1:

k-1 i=1

-

Pidw k-1 i=1

+

dipi n i=1

wk-1

i=1

80


This implies the equation r

JM' dwk-1 =

d (hi (Piwk-1))

V is an open subset of Rk for rp + 1 < i < r, and h; is a k-form on V1 whose support is completely contained in V. For each of these indices i we choose a k-chain c in Rk for which supp h, (Wiwk-1) C Int ck C ck C V j.

Applying now Stokes' theorem for chains (Theorem 8, Chapter 2) to ck, we obtain

Jd(h(iwk_1))

wA-1))

=

f

h' (Viwk-1) = 0,

d (ht = Jask since the form vanishes on the boundary of the chain. Now we consider the indices i between 1 and ro. For any of these, V is an open subset of the halfspace Hk, and as before we obtain chains ck in lHlk with the same properties of the supports with respect to h; Applying Stokes' theorem to these as well, we obtain 1.

1v;d (hi Now hi

(,p1wk-1) does

hi (ca1wk-1)

J

.

not necessarily vanish any more on Ock n

(Rk- l x {0} ),

but only at the points of 8c; belonging to the interior of Hk: d (h;

(Vjwk-1))

_

I

VflRk-1

V;

hi

(`Piwk-l).

The pairs (V n Rk-1, h1IRk) with i = 1, ..., ro form a covering of the boundary BAIk. Hence, by the definition of the integral, ro

wk-1

-

t-1 knRlshowing the equation we set out to prove. Jamk

h i (ViwR-1)

,

1

O

In the sections to follow we will be dealing with various applications of Stokes' theorem. As a generalization of the discussion in §2.5, we will first study line integrals and prove an analogue-only for 1-forms, however-of Poincare's lemma. This holds for manifolds in which every closed path can be contracted to a point. Definition 19. A connected manifold Mk is called simply connected if every two C'-curves co, cl : [0, 1] , Mk with coinciding initial and end points are homotopic.

3.7. The Hedgehog Theorem (Hairy Sphere Theorem)

81

By Theorem 9 in §2.5, whose proof immediately carries over to the case of a manifold, on a simply connected manifold the line integral of a closed 1-form w1 depends exclusively on the end points of the curve. Having fixed a point xo E Mk, the line integral along a curve joining the points xo and x, w lox uniquely defines a function on the manifold. Its differential df coincides with w1, and we obtain f(x) =

Theorem 22. Every closed 1-form on a simply connected manifold is exact.

Example 31. The winding form defined on 1R2 - {0} is closed, but not exact. This shows that R2 - {0} is not simply connected.

3.7. The Hedgehog Theorem (Hairy Sphere Theorem) Consider two oriented compact manifolds Mk and Nk without boundary and of equal dimension. Two maps fo, fl : Mk - Nk between them are called homotopic if there exists a smooth map

F : Mk x [0,1] -+ Nk such that F(x, 0) = fo(x) and F(x,1) = fl (x). We prove

Theorem 23. Let wk be a k-form on Nk and let fo, fl : Mk

Nk be

homotopic maps. Then

JMk0 (wk) = fm k fl (wk) . Proof. The oriented manifold Mk x [0, 1] has boundary

8(Mk x [0,1]) = Mk x {1} - Mk x (0}, where the minus sign indicates that the orientation is reversed. Therefore, Stokes' theorem implies

Jf

k

fl (w) -

JMk

f(wk) =

F*(wk) = 8J(Mkx[O,1])

JMk x [OI]

dF(wk) .

But the form dF*(wk) = F'(dwk) = 0 vanishes, since the k-dimensional manifold Nk carries no non-trivial (k + 1)-forms.

0

Theorem 24. The antipodal map from the sphere to itself, A : Sn - Sn, A(x) = -x, is homotopic to the identity Ids. only for odd dimensions n. Proof. Consider on Rn+1 the form n+1

wn =

_

-1x' dx1 A ... A dx' A ... A dx"+1 i=1


82

whose restriction to the sphere Sn is its volume form dSn (Example 28, equation (17)). If A is homotopic to the identity Ids.., then the previous theorem implies

Jis.

A*(wn) = / n w" = vol(Sn) . S

The induced form A* (,n) = (-1)n+lwn is proportional to the form

n.

Thus we obtain the condition (_1)n+lvol(Sn) = vol(S"), i. e., (n + 1) has to be an odd number.

Theorem 25 (Hedgehog Theorem). A sphere S2k of even dimension has no nowhere vanishing, continuous tangent vector field.

Proof. Suppose that there exists such a vector field on the n-dimensional sphere S". We first approximate this vector field by a smooth vector field V (Stone-Weierstrass theorem), and then normalize it so that the vector V(x) has length one at each point. Next we consider the resulting smooth tangent vector field as a vector-valued function V : Sn - Rn+I satisfying the following two conditions:

(x, V(x)) = 0,

IIV(x)II = 1.

Define the homotopy F : Sn x [0, 1] - Sn from the sphere to itself by the formula

F(x, t) = cos(irt) x + sin(irt) . V(x). The length of F(x, t) is equal to one everywhere, since x and V(x) are perpendicular. Moreover, F(x, 0) = x and F(x, 1) = -x, i. e., F is a homotopy between the identity and the antipodal map of the sphere Sn. But then the dimension n has to be an odd number. In the German mathematical literature, this result is known as the "Hedgehog Theorem" (,,Satz vom Igel"), since its contents can be expressed figuratively by saying that a hedgehog cannot be combed in a continuous way. Because it is so vivid, we prefer this to the name "Hairy Sphere Theorem", which seems to be more common in the Anglo-Saxon world.

3.8. The Classical Integral Formulas Now we will discuss the classical integral formulas already treated in §2.6 for chains. Compared to the preceding case, in this new formulation we benefit

from having notions like divergence, gradient and Laplacian as developed in the differential calculus on manifolds at our disposal. At the same time, the orientation of the manifold and the exterior unit normal vector field on the boundary will play a special role. We will prove these integral formulas

3.8. The Classical Integral Formulas

83

for arbitrary compact and oriented manifolds (in R'). This will be the final formulation of the classical integral formulas as they are needed in many branches of mathematics as well as theoretical electrodynamics and hydrodynamics. We start with the Ostrogradski formula relating the divergence of a vector field to its flow across the surface.

Theorem 26 (Ostrogradski Formula). Let Mk be an oriented, compact manifold, and let N be the exterior unit normal vector field to its boundary. Then, for every vector field V : Mk Tlllk on Mk,

div(V)dMk =

(V, N)

d(OMk).

nik

lL

Proof. We know from Theorem 17 that the divergence and its inner product with the volume form are related by the formula

div(V) dMk = d(V J dMk) . A straightforward application of Stokes' theorem implies

J div(V)dMk = lk

J

d(V i dMk) = f V nl k

k

Let x E 8Mk be a point of the boundary. We decompose the vector V(x) into one part that is proportional to the exterior normal vector, and a vector W(x) that belongs to the tangent space T,ZBMk to the boundary:

V(x) = (V(x), N(x)) N(x) + W(x) . Moreover, note that the restriction of the inner product W J dMk to the boundary 8Mk vanishes identically. This implies that for the inner product of V with the volume form dMk. on the boundary 8Mk

V _j dMk = (V, N) N j dMk + W i dMk = (V, Al) N(x) _j dMk . Hence, we obtainJ

div(V)dMk = ik

f

Mk

(V, N) Ni dll'ik .

By Theorem 19, the inner product N(x) . dMk coincides with the volume form of the boundary, d(8Jik). 0 As a direct application of the Ostrogradski formula we obtain Gauss' theorem.

Theorem 27 (Gauss' Theorem). Let V be a vector field, let f be a function on the oriented, compact manifold Mk, and let N be the exterior unit normal vector field of the boundary. Then

J l (V, grad(f )) dMk + f l

k

f . div(V)dMk = J

alk f

(V, N) d(81bik) .


84

Proof. This equation immediately follows from the Ostrogradski formula together with the rule from Theorem 11

div(f V) = f div(V) + V(f) = f div(V) + (V, grad(f)) . In a similar way we derive Green's formulas in versions that are not confined to 12. First we generalize Green's first formula.

Theorem 28 (Green's First Formula). Let f, g : Mk

R be smooth func-

tions on the compact, oriented manifold Mk. Then

f

f'O(g) dMk +

g'ad(g)) dMk =

ff

(grad(.9)N) d(8Mk).

aMk

A1k

Proof. By the definition of the Laplacian, we have

f O(g) dMk = J %rk

f

'k f div(g'ad(g)) dAik .

Now apply Gauss' theorem. Applying Green's first formula twice leads to Green's second formula.

Theorem 29 (Green's Second Formula). Let f, g : Mk - III be two smooth functions. Then 1 [g.

(f)-f. (g)] dMk = f

N) ] d(aMk).

aMk

Ark

1-1

Remark. The scalar product (grad(f ), N) defined only on the boundary is

iaN aN

often denoted by the symbol Of ION, since it is the derivative of the function

f in the direction of the exterior normal vector. This leads to a different formulation of Green's second formula:

f [g

(f) - f . %(g)] dMk =

.

fMk

[g.-f.

]

Corollary 1. Let Mk be a compact, oriented manifold without boundary. Then (1)

f

div(V) dMk = 0 for every vector field V;

Mk

(2)

f go(f) dMk = f fo(g) dMk = - f (grad(f ). grad(g)) Alk

Mk

dMk

Mk

for any two functions f, g E CO°(Mk).

0

3.8. The Classical Integral Formulas

85

Hence the Laplacian is symmetric with respect to the L2-scalar product. Moreover, the choice of sign we adopted implies that it is non-positive. Corollary 2 (Hopf's Theorem). Let Mk be a compact, connected, oriented manifold without boundary and assume that the function f : Mk R sat-

isfies at each point the condition 0(f)(x) > 0. Then the function f is constant.

Proof. Integrating the assumption s(f)(x) > 0 over Mk and applying the symmetry of the Laplacian just proved, we first obtain

0< f 1-o(f)-dMk = J f-j(1)-dMk = 0, Mk fk

i.e., the Laplacian of f vanishes identically, A(f) = 0. Inserting f = g into Green's first formula (Theorem 28) and taking into account that the boundary integral vanishes by the assumption concerning Mk, this implies

J

grad(f)I2-dMk

= -Jntkf

0,

and hence grad(f) = 0. Thus f is constant, since Mk is connected.

0

Concluding this section, we formulate Stokes' theorem in its classical form on 1R3. Contrary to the preceding theorems involving the generalizations of divergence and gradient to manifolds as introduced in the second section of this chapter, this only involves the notion of curl on open subsets of 1R3 from §2.3.

Theorem 30 (Stokes' Theorem-Classical Version). Let M2 C R3 be a compact, oriented surface, let V be a vector field defined on an open subset M2 C U C 1R3, let N : M2 S2 be the exterior unit normal field to the surface M2, and let T : aM2 - T(aM2) be the unit tangent vector field on the boundary curve aM2 with the induced orientation. Then

f (curl(V), N) dM2 = f MZ (V, T) d(0M2). Proof. Consider the 1-form 4 := V1dx1 + V2dx2 + V3dx3

on U associated with the vector field V = V la/axl + V2a/axe + Via/ax3 as explained in §2.3 and its derivative

d`4 =

av2

aV1

axl - axe

,

I dx ndx2 +

, aV3 aV2 J dx2 ndx 3. ax, - avl ] dx ndx3 + [ ax3 ax2 - ax3

aV3


86

Recall that the curl of V corresponds to the 1-form *dwv. If, on the other hand. It : W -+ M2 is a parametrization of the surface with components h', h2, h3 and coordinates yl, y2 from W, then for the exterior normal vector N to the surface we have the relation 09

h x 09h

yl

/ II A

y2

09y1

y2 x ah

ll

Two arbitrary vectors v, w E R3 satisfy the identity 1Iv

x wII2 = det

(V, V)

(v, w)

(v, w) 1 (w, w) J

and hence the preceding equation implies

f.

x

ayl aye II For the first component of the normal vector N written in the coordinates II

y1, y2 this reads, e.g., as

N'dM2 = N' f dy' n dye =

ahe Oh3 _ah2 ah3 , dy' A [ay' aye aye ayl

1

dye .

On the other hand, it is easy to compute the pullback by h of the forms dx2 and dx3:

h*(dx2) =

OhY12

+ A2dy2, h`(dx3) =

ldy' + Oh3dy2.

A direct comparison implies the following formula, which is independent of the coordinates y', y2:

N'dM2 = dx2 A dx3 . Similarly one proves

N2dM2 =

- dx' A dx3,

N3dM2 = dx' A dx2 .

The scalar product of the curl of V with the unit normal field N multiplied by the volume form dM2 is thus simply the differential of wv: (curl(V), N) dM2

aV' 2 = [ aV3 ] N dM + 109x3 axe 09x3 = L aV2

1

dwv.

aV2

9V3

09x1, N2 dM2 + 09x1 - axe, N 3dltil 2 L aV'

Therefore, Stokes' theorem can be applied in the format

J (curl(V),N) dM2 = %f2

J Af2

&4 =

y=J[V1dx1 8M2

8M2

+ V2dx2 + V3dx3].

3.9. The Lie Derivative and the Interpretation of the Divergence

87

If, however, T is the unit tangent vector field to the curve 9M2, then d(8M2)(T) = 1, and hence

T'd(aM2) = dxl, T2d(8M2) = dx2, T3d(8M2) = dx3. Now we can rewrite the line integral above as

J [V ldxl + V2dx2 +V 3dX3]

= J1L12

(V,

T) d(8M2),

and, summarizing, we arrive at Stokes' classical integral formula.

0

3.9. The Lie Derivative and the Interpretation of the Divergence The aim of this section is to interpret the divergence of a vector field geometrically as the infinitesimal volume distortion of its flow. First we recall some results from the local theory of ordinary differential equations and introduce the flow on a manifold as well as the Lie derivative of forms. Then we compute this Lie derivative by means of the exterior derivative, which, in a special case, leads to the interpretation of the divergence mentioned in the title.

Let V be a vector field on the manifold Mk. An integral curve of V is a Mk whose tangent vector -y(t) = ry,(8/8t) at each point curve ry : (a, b) coincides with the value of the vector field there:

?(t) = V('Y(t)) The well-known existence theorem for autonomous differential equations states that for every initial point x E Mk there exists a maximal integral


88

curve ryx

:

(ax, bx) - Mk defined on an interval, containing the number

0 E R, satisfying

'Y.,(0) = X. Moreover, this maximal integral curve is uniquely determined by the initial condition. Denote by EV the set

EV = {(t,x)E]RxMk:ax) = 4).,m[Vl, V2]m(9)

,

i.e., the vector fields [WI, W2] and [VI, V2] are, in fact, 4)-related.

The commutator of two vector fields measures the extent to which their flows do or do not commute. This explains the name for the vector field [V, W].

Theorem 36. Let V and W be two complete vector fields on the manifold Mk and denote by 4't and X8, respectively, their flows. Then the commutator [V, W] vanishes if and only if 4it o 4' = %P, o tt for all -oo < s, t < oo.

Proof. Because d

= tim (4'-h-h).W - (4'-t1) / = (4'-t1).([V, W]),

the commutator [V, W] vanishes if and only if the vector field W is invariant under the flow 4it, (4)1).W = W. This condition is in turn equivalent to the commutativity 4't o T, = T. o 4it of the diffeomorphism 4't with the flow ID, of W.

3.10. Harmonic Functions A function f : Mk - R is called harmonic if it is a solution of the homogeneous Laplace equation 0(f) = 0. As a special case of Hopf's theorem (Corollary 2), we have the following:

Theorem 37. Every harmonic function on a compact, connected, and oriented manifold without boundary is constant. If the boundary of the manifold Mk is not empty, there exist two particularly important boundary value problems for harmonic functions.

3.10. Harmonic Functions

05

The Dirichlet Problem: Assume that a function V : OMk - IR is given. We IR whose values on the boundary look for a harmonic function f : Mk aMk coincide with those of cp:

0(f) = 0 in Mk and

f IaaMk = cp .

The Neumann Problem: Assume that a function cp : OMk -+ IR is given. We R whose normal derivative on the boundary coincides with yp:

look for a harmonic function f : Aik

0(f) = 0 in Mk and eiv

= cp on aMk.

A solution of the Neumann problem is never unique. For each solution f the sum f + C is, for an arbitrary constant C, a solution of this problem, too. This is the only degree of freedom, since we have

Theorem 38. Let Mk be a compact, connected, and oriented manifold in R", and let cp : aMk -+ IR be a smooth function.

(1) The Dirichlet problem has at most one solution.

(2) If f,, f2 both are solutions of the Neumann problem, then f, - f2 is constant.

(3) The vanishing of the mean value of cp is a necessary condition for the solvability of the Neumann problem: JOMk

p d(aMk) = 0 .

Proof. If fl, f2 both are solutions of the Dirichlet problem, then the difference u := f, - f2 satisfies the equations

0(u) = 0

and u IBMk = 0.

From the first Green formula we obtain

0 = J fk

-Lk

U. 85Jk

a" aN Nd(aMk),

and hence the gradient of u vanishes. Since we have u IBMk= 0, the function u has to vanish identically. In the case of the Neumann problem the argument runs along the same lines. i.e., starting from the equations

0(u) = 0

and

au

= 0 on aMk

and the Green formula, we again conclude that grad(u) = 0. Thus the difference u = fl - f2 is constant. If the Neumann problem has at least one


96

solution for any given function cp :8Mk integral formulas, we obtain 0

R, then, applying the classical

r L . d(Wk) = J

f O(f) dMA =

d(OMk).

Mk

160

.

The Dirichlet and the Neumann problems have, for a given boundary condition cp : 8Mk R (satisfying f cp = 0 in the case of the Neumann problem) a solution. We will not prove this existence result here for general mani-

folds, but confine ourselves to the case of the ball D"(R) C R' of radius R > 0. For these spaces, there is an explicit classical solution formula which will be derived here. For the sake of simplicity we only discuss the case of

dimensions n > 3 and leave it to the reader to complete the discussion in dimension n = 2, which differs only slightly from the one below. We start with a few preparations. By

r=

(x1)2 +

... + (xn)2

we denote the distance from the point (x', ... , x") E Rn to the point 0 E Rn.

Lemma 3. Let u(x) be a harmonic function defined on the set Rn - {0}, and assume that u depends only on the radius r. Then there exist constants C1 and C2 such that C2

u(x) = C1 + -n-2 Proof. By assumption there exists a smooth function h : (0, oo) -' R satisfying u(x) = h(r(x)). Differentiating this, we obtain the formulas n( )(' i( )r2 - (xi)2 x' Ou OZU

h(r)r, ax = h

112

r \r/

+h

r r3 8x and the latter implies the following differential equation for the function h(r): _

n

0

O(u)

52x1

lh(r).

=

For n > 3 the function h(r) = C1 + C2 r2-n is the solution of this ordinary differential equation.

Let y E Rn be a fixed point. The function u(x) = JJx - y112-n defined on the set Rn - {y} is a translate of r2-n, and hence a harmonic function. On the other hand, the partial derivatives of a harmonic function are harmonic functions as well. Thus

u(x) - 2 .=1

au y ax'

I Ix112

Ilx

I lyl l2

yl In


97

is a harmonic function defined on iRn - {y}. We will use this family of harmonic functions in the observations to follow.

Lemma 4. Let IIxii < 1 be any point in the interior of the unit ball D". Then

I

IIXII" n

1

IIx

dSn-1(y) = vol(Sn-').

Proof. First we prove that the function

f n-' IIx_ IIyIIn 2

h(x)

.

dSni-1(J)

depends only on the radius r = r(x). In fact, for a linear orthogonal map T : 1R" -+ R" we obtain, from I det(DT)I = I det(T) I = 1 and the corresponding transformation formula for the volume form of the sphere,

T*(dSn-1) = T*(NJ

d1Rn)

= NJ (T'(d1Rn)) = NJ dRn =

dSn-1,

the property to be proved:

h(Tx) = J _ I

dSn-1(y) n-1 IITxIITyIIn

1- IIXII2

= J n-s lixl

TII'(Iy)Iin

dSn-1(y)

. dSn-1(z) = h(x).

Jsn-' IIx - zlln If, in addition, h(x) is a harmonic function, then

z (h)(x)

IS, f

A. (1_ilx2

dSn-1(y))

= 0.

= yIIn The first lemma implies that there are constants C1, C2 with h(x) = C1 + C2 r2-". At the point x = 0 the function h(x) is a regular function and satisfies h(O) = vol(Sn-1). Thus the constant C2 vanishes, and C1 is equal to vol(Sn-'). 1

The solution of the Dirichlet problem for the unit ball D' C ]R" together with an explicit formula are the subjects of the next theorem.

12 is.,

Theorem 39. For every continuous function cp : S"-1 - 1R, the formula _ 1 yII" cp(y) dS"-'(y) AX) = vol(Sn-1) II-

IIx

defines a harmonic function in the interior of the unit ball, and at a boundary point z E Sn-1 lim f (t z) = cp(z).

t-1-


98

is harmonic with respect to the Proof. The function (1 - IIxII2)IIx variable z, and hence the function f (x) is harmonic, too. We prove that f(z) coincides with ;p(z) on the boundary Sn-1 in the way stated above. y E Sn-1} the maximum of the modulus of p Denote by in := and fix a positive number e > 0. The continuous function p is uniformly continuous on the compact set Sn-'. Hence there exists a number b > 0 such that for any two points y, z E Sn-1 in the sphere, i i - z < 6 implies yll-,,

the estimate I :p(y) - cp(z) I < e. We decompose the sphere Sn-1 = D1 u D2 into the parts

- zII > a} . D1 = {y E S" : IIy - ziI < b}, D2 = {y E Sn-1 For y E D2 and 0 < t < 1 we estimate the distance from y to the line :

II

segment between 0 and z E S"-': fly - tz1I2

=

1 - 2t(y,z) + t2 = (t - (y,z))2 + 1 - (y,z)2 > 1 - (y.

>

1 - (y, z) = 2-(2- 2(y, z)) = -Ily - zII2 >

1

1

Z)2

b2 .

2

We then split the difference _

P ' z) - Y (z) = vol(Sn-1) J$^

2

dSr

Y

Iltz

(?/)

into the integrals over D1 and D2. The modulus of the first integral can be increased to

1-IItzII2

dS"

n

1

1

S^-' Iltz - ylln Di We treat the second integral using the inequality obtained before:

D2

< 2m -

(1-t2)(

a2)n

vol(Sn-').

In summary, for every positive number E > 0 there exists a number 6 > 0 such that for all 0 < t < 1 the following inequality holds:

- t2).

E+2mb Hence the upper limit is bounded by E,

lim suplf(t' z) - V(z)I < E The last inequality holds for all positive numbers E > 0, and this in turn implies lim f (t tt-

z) = p(z)

.

0

99


Next we will discuss several consequences of the solution formula for the IR defined on the Dirichlet problem. For a harmonic function f : D"(R) closed ball of radius R > 0, the function j (z) := f (R z) is also harmonic on the unit ball. Applying the previous theorem and returning to the vari-

able r = R z E D"(R) finally leads to the Poisson formula for harmonic functions: 2

2

-

2

f(x) = vo1Sn-1) J "-' IIR- RI yl l1"f(R y) dS"-1(y)

Evaluating the Poisson formula at the point x = 0 leads to Gauss' mea value theorem for harmonic functions.

Theorem 40. The value of a harmonic function f at the center of the ball coincides with the mean value of the harmonic function on the boundary of the ball:

We will use Gauss' mean value theorem in the proof of the maximum principle for harmonic functions.

Theorem 41. Every harmonic function f on the connected manifold Mn C R" attaining its maximal value in the interior of Mn is constant.

Proof. Denote by m the maximal value of f and by Q = {x E Al"\8AI" : f (x) = m} the set of all inner points of M" at which f attains this maximal value. By assumption 1 is a non-empty closed set in M"\811". Hence it suffices to prove that 1 is an open subset of M"\8M". Choose a point xo E S2 and a radius Ro such that the ball D"(xo, R0) with center x0 and radius Ra is completely contained in Mn\8M". By Gauss' mean value theorem

f (x0) =

VOl(S1

n-1)

Jsn_l

f (x0 + Ro y) dSn-1(y)

m. = .f (xo)

This implies that f is constant and equal to m on the sphere with center x0 and radius Ra. This observation can also be applied to each radius RR < Ro below R0. Together this implies that f - m is constant on the ball D"(x0i R0), i. e., D" (x0, Ra) is contained in Q. Thus Il is an open subset. 0

Eventually we will prove one more application of the Poisson formula, Liouville's theorem for harmonic functions.

Theorem 42. Every harmonic function f : (above) is constant.

1[P"

R bounded from below


100

Proof. Changing f, if necessary, by adding a constant, we can suppose without loss of generality that the function f is non-negative. Fix a point xo E Rn and choose the radius R such that xo lies in the ball D"(0, R). By the Poisson formula, Gauss' mean value theorem, and the assumption f > 0 we have Rn 2

f (xo) =

vol(Sn-1) Jsn

Rn-2

R' - II

2

IIo-R

yll In

R2 - I IxOI I2

< vol(Sn-1)

Jn-i IIIxoII - IIR

_

2 - II oIIRIn 2

2(

vo Sn-RIIlxoll

-2(R2 _

RSn

)IIIxoII

Isn

- iRI"

y) dS"-' (y)

. f (R

ylll

n

.f (R y)

dSn- ' (y)

f(R y) dSi-1(y)

f(0)

Taking the limit for R -* oo yields for all points xoi E R" the estimate

f(xo) 0.

19. Prove the orientability of the sphere S", using stereographic projection and Theorem 14.

20. Let AI' be a manifold, h : U _ M' a chart, and -y [a, b] --. h(U) C 111' a curve in Afk completely contained in the image set of h. Represent the curve 7 in the coordinates (h, U) as h-1 o ry(t) = (y1(t), ... , y'(t)). Prove that the length of the curve can be computed by means of the coefficients of the Riemannian metric g1, in the chart (h, U) via the formula :

b

l(7) = f a

1/2

9ti(7(t))d dtt) d dtt)

dl.

21. Let f : [a, b] , R+ be a positive function and let Ale = {(x. y, z) E R3 : y2 + z2 = f2(x)} be the corresponding surface of revolution in R3. Prove the volume formula /b

vol(M2) = 2n / f (x) 1 + (f'(x))2 dx. a

22. Compute the following surface integrals: for M2 = (x, y, z E R3) : x2+y2+:2 = a2. z > 01:

a) ty

-

Exercises

b)

109

/ (x2 + y2)

012.

hr

where M2 is the boundary of the subset of 1R3 x2 + y2 < z < 1.

described by the inequality

23. Compute the following surface integrals:

a) f2(R) (xdyAdz+ydzAdx+zdxAdy); [(y - z)dy A dz + (z - x)dz A dx + (x - y)dx A dy], where M2 is the

b) nJt2

boundary of the subset of R3 described by the inequalities x2 + y2
Oji A wj(V, W) = 0. j=1

and hence all 1-forms w1, ... , w,,,_k vanish on the commutator [V, WJ. There-

fore, this vector field takes values in £k, i.e.. the distribution £k is involutive.

Proof of the implication (2) ; (4). Let £k be a k-dimensional involutive distribution. The form dwi A (wl n ... Aw,,,_k) has degree (m - k + 2). Inserting (m - k + 2) vector fields into this form, we can assume, without loss of generality, that two of these vector fields have values in £k. Denote them by V and W. Because of the involutivity of £k, we have

w;(V) = w;(W) = 0 and dwi(V,W) = -wi([V,W]) = 0. Therefore, the exterior product dwi A (wl A ... A wm-k) vanishes on every (m - k + 2)-tuple of vector fields.

Proof of the implication (1) : (4). The proof of this implication proceeds like the one before. Let £k be an integrable distribution, xo E All a fixed point, and h : W -+ All a parametrization of an integral manifold through this point. Then we have h*(wi) = 0 (1 < i < m - k), and this implies h`(dwi) = 0. Thus the 2-form dwi vanishes on the k-dimensional subspace £k(xo), dwi IEk(zo)XEk(zo)

= 0.

Inserting again (m - k + 2) vectors into the form dwi A (wi A ... Awm_k), we

conclude that at least two of these vectors-call them V and W-lie in the subspace Ek(xo). Hence wj (V) = wj (W) = 0 and &&,i (V, W) = 0. As above, we conclude that the form dwi A (wl A ... A Wm_k) = 0 vanishes.

4. Pfaffian Systems

116

4.2. The Proof of Frobenius' Theorem The implication (3) = (1), which is the core of Fobenius' theorem, is a local statement. Hence, without loss of generality, we can suppose that the manifold Mm is an open subset of R. We will prove a slightly more general result from which the proof of this implication immediately follows.

Theorem 2. Let w1i ... , w,,,_k be linearly independent 1 forms on an open subset M"' C Rm such that m-k

dwi = E B.ij A wj j=1

for certain 1 forms Oij. Then there exist at each point x E Mm a neighborhood U C Mm of x and functions hj and fj defined on the set U satisfying m-k

w= = j=1

Proof of the implication (3)

(1). Let £k be a distribution with the

property stated in condition (3). By Theorem 2, we can represent the forms w; in a neighborhood U of an arbitrary point zo E M' as m-k j=1

for certain functions. By assumption, the 1-forms w1, . . . , w,,,_k are linearly independent. Thus the differentials dfli ... , df,,,_k are linearly independent as well, and the set

Nk = (X E U : f1(x) = fi(xo), ..., fm-k(X) = fm-k(XO) } is a submanifold containing the point xo E M. At an arbitrary point x E Nk, we determine the tangent space:

TxNk = {v E TM' : df1(v) dfm_k(v) = 0 } C {v E TM' : w1(v) _ ... = w,,,-k(v) = 0) = Ek(x). For dimensional reasons, the vector spaces coincide, i.e., Nk is an integral manifold of the distribution £k through the point xo E Mm, and thus the integrability of the distribution £k is proved. 0

Proof of Theorem 2. We proceed in two steps. First we reduce the proof to the case of a system of 1-forms w1, ... , wm_k in a special normal form. To this end, we represent the euclidean space R n as a product RI = Rk X am-k and denote its points accordingly by y = (y', ... , yk) E Rk and z =

4.2. The Proof of Frobenius' Theorem

117

... , zm-k) E Rm-k. It is sufficient to prove the claim in Theorem 2 for 1-forms of the following special type:

(z1,

k

wi = dz`-1: Aii(y,z)'dy'

(1 0 if and only if it satisfies

_I + K2

,i

2

= R2

rc2T

in the natural parametrization. Proof. Differentiating the equation II-'(S)II2 ° R2, we obtain (t(s), -Y(s)) _ 0, and hence y(s) is a linear combination of the vectors h(s) and 6(s),

y(s) = a(s) h(s) +,13(s) b(s) . Furthermore, IIy(S)112 - R2 implies a2(s) + f32(s) (t(s), y(s)) - 0 again leads to K(s) (h(s), y(s)) + 1

R2. Differentiating

0, and thus

a(s) = We differentiate the equation c(s) (h(s), y(s)) + 1 - 0 and obtain the following relation by a simple transformation:

2s) ic'(

a(3)

I£ (S)T(S)

The asserted necessary condition for a spherical curve then immediately follows: R2

=

1

a2(s) + $2(s)

=

a2(s) +

2

r

( K2(3)T(S)

/

5. Curves and Surfaces in Euclidean 3-Space

136

If, conversely, this relation between curvature and torsion holds for a C4curve, then we first differentiate it and obtain the equation

r(s)

d ds

/G(S)

_

KG(s)

!GZ(s)r(s)

0.

Now consider the vector

a(s) := 7(s) +

.6

h(s)

,z() (s)

;R-S)

(s).

Using the Frenet formulas and the preceding relation between curvature and torsion, we compute the derivative of the latter, and find that

id(s) = 0. Hence d:= a(s) is constant, and 11-y(S)

-

aI1z

=

az(s) + \az(s)(s)/z = Rz'

i. e., the curve -y(s) lies on the sphere of radius R with center d.

0

Next we turn to plane curves. Note first that these can be described as the curves with vanishing torsion. Theorem 6. A curve of class C3 with nowhere vanishing curvature, k(s) 0, lies in a plane in R3 if and only if its torsion r(s) - 0 vanishes identically.

Proof. Let a' be a vector perpendicular to the plane in iR3 containing the curve. All the tangent vectors t(s) lie in this plane; hence (t'(s), a) - 0. Since ac(s) # 0, we immediately obtain (h'(s), a") - 0 by differentiating this equation. Thus d coincides with the binormal vector 6(s). In other words, the binormal vector 6(s) is constant. Then 0 = 6'(s) = -r(s) h(s) implies that the torsion vanishes, r(s) - 0. The converse is proved analogously. 0 The curvature of a plane curve can be ascribed a sign. In fact, the principal normal vector is proportional to the vector obtained by rotating the tangent vector through the angle 7r/2 in the positive sense. The curvature of a plane curve is ascribed the positive sign if the corresponding factor is positive. Identifying R2 with the complex numbers, the rotation through 7r/2 in the positive sense corresponds to multiplication by the number i E C. Using the multiplication of complex numbers, this leads to

Definition 7. Let -y : [0, L] - C = R2 be a plane curve in its natural parametrization. The plane curvature k(s) is the function k : [0, L) -' R defined by the equation ast(s) = k(s)

i

t(s) .

5.1. Curves in Euclidean 3-Space

137

The absolute value jk(s)j of the plane curvature coincides with the curvature K(s) of the curve viewed as a space curve.

Example 2. -y(t) = (t, ±t2)

.

A closed curve i' : [0, L] - C = R2 is one that starts and ends at the same point and whose tangent vectors at this point coincide as well, i'(0) = -Y(L) and t(0) = F(L).

Theorem 7. Let

C be a closed curve. Then the integral

[0, L]

r

2 J7

k(s)ds =

2

ILL

is an integer, called the winding number of the closed curve.

Proof. Consider the map t' : [0, L]

t* (s) = exp

C defined by

[k(u)du]

Then dt'(s)/ds = i k(s) - t*(s). and hence (t(s)/t*(s))' = 0. The tangent vectors t(s) are thus described by the formula

F(s) = C exp

Lk(u)duj fL

for a certain constant C. Since t(0) = t(L), the number J k(s)ds is an integral multiple of 27r.

o

0

We conclude the section on curves with the discussion of the Fenchel inequality, which claims that the total curvature of a closed space curve is


138

bounded from below by 2;r. We start by considering plane curves, and then generalize the result to space curves. First we need an auxiliary observation. Lemma 1. Let cp : [a, b] lR be a real function of class C', and suppose that the function f (t) := ew 0.

Hint: The function f : M2 -+ R, f (x) = (x, x), has to have a critical point on Ate.

23. Let the group G = SL(2,R) act on the hyperbolic plane ?{2 by _ a b .z _ az+b

cd

6F

cz+d'

Verify that the image point g z actually does belong to ?{2, and that superposition of two of these transformations corresponds to matrix multiplication in G. Show. moreover, that each g E G leaves the metric invariant; hence G is an 3-dimensional group of isometries from N2 onto itself.

24. Let AI'; = 1R3 and denote by DyX the directional derivative of the R3 in the direction of the vector X. Prove that VXY := DXY+2X xY vector-valued function X : R3

defines a covariant derivative having all the properties (1)-(4) from Theorem 42. but violating property (5).

25. Consider on the set {(x,y) E R2 : -7r/2 < y < 7r/2} the pseudoRiemannian metric

1 _ dx2-dye x'2

- y'2

COS2

x'

a) Prove that E _2(y) and P =2(y) are first integrals of the cos

geodesic flow, satisfying, in addition, the inequality p2 - E > 0. b) Discuss the geodesic lines on M2. To do so, assume that y is a function of r and integrate the resulting ordinary differential equation.

c) On Al" there exist points that cannot be joined by a geodesic line.

203

Exercises

26. Prove that every three-dimensional Einstein space is a space of constant curvature.

27. Let M' be a non-flat Einstein space (for example, S"'). Show that MI x M"' with the product metric is an Einstein space, but not a space of constant curvature.

28. It is well-known that a symmetric bilinear form h.(x, y) on a vector space is completely determined by its quadratic form q(x) := h(x, x) (via the polarization formula: 2h(x, y) = q(x + y) - q(x) - q(y)). Prove, in a similar way, that the Riemannian curvature tensor 1 (U. V, W1, W2) is completely determined by the quadratic form K(U, V) := R(U, V, V, U) corresponding to the sectional curvature. 29. Prove that a four-dimensional Riemannian manifold is an Einstein space if and only if for every 2-plane E2 C TM4 and its orthogonal complement (E2)1 the corresponding sectional curvatures coincide, K(E2) = K((E2)1).

30. Because of its symmetry properties, the curvature tensor of a pseudoRiemannian manifold can be interpreted as a transformation R : A2(M'') A2 (M7°) on 2-forms, n 1

R(ai Aaj) = 2 > Ro3i . a, Aa,3. Q,:3=1

In dimension m = 4, this gives rise to an endomorphism R : A2(A14) A2(M4) A2(M4). On the other hand, the Hodge operator * : A2(A14) acts on A2(M4), and its square depends only on the index k of the metric (see

Theorem 5, Chapter 1): ** = (-1)k. In the cases k = 0 (positive definite metric) and k = 2 (neutral metric), the Hodge operator decomposes the real bundle A2(M4) into the corresponding eigenspaces A2 (M4) (see Exercise 8, Chapter 1). Prove that the block representation of the curvature tensor.

R _

R++ R-+

R+- R--

with respect to this decomposition of A2(M4) has the following properties:

a) 7Z is symmetric, i.e., R++ = R++, R__ = R__ and R+_ = R_+.

b) The traces of R++ and R__ coincide, tr(R++) = tr(R_-) = -r/12. c) M4 is an Einstein space if and only if R+_ vanishes.


204

Literature: Th. Friedrich, Self-duality of Riemannian manifolds and connections, in: Riemannian geometry and instantons, Teubner-Verlag, Leipzig, 1981, 56-104.

The Einstein equation in the general theory of relativity combines the geometric curvature quantities of a four-dimensional pseudo-Riemannian manifold of signature (1, 3) with its physical properties encoded in the energymomentum tensor T,

Ric-

rcT.

Here Ric. g. r are the Ricci tensor, the metric, and the scalar curvature; K is a constant depending on the chosen system of units. Already for the vacuum, T = 0, there are non-trivial, physically very interesting solutions of this equation, among others the Schwarzschild metric to be discussed now. Obviously, a vacuum solution of the Einstein equation has to be an Einstein space with vanishing Ricci tensor in the sense of the definition given before. 31 (Schwarzschild metric). On a spherically symmetric and static space-time manifold M4 (this is a pseudo-Riemannian manifold of signature (1, 3) with

isometry group SO(3, R) and a distinguished time direction), it is always possible to introduce coordinates from R x R+ x S2 with respect to which the metric can be written as g = e2a(r) dt2 - [e 2b(r) dr2 + r2 (d92 + sin2 O dV2) J .

Here, the functions a(r) and b(r) asymptotically tend to zero for r - co (the metric is "asymptotically flat"). We introduce the following basis of 1-forms:

ao = ea dt, Cl = eb dr,

a2 = rdO, 03 = r sin 9 dcp .

a) Check that the metric satisfies g = 0,02 - 0i - 02 - a3 . b) Compute the forms dai and show, using the first structural equation, that the connection forms are given by wo, = -a'e-bcO,

e-b

WO2 = W03 = 0,

a-b

W12 = -r 0`2, w13 = r- a3, w23 =

cot 9

r

a3-

c) We introduce the notation lid := 2 E.4 Rtasia0 A0,3. (f is the so-called curvature form). Show, using the second structural equation, that

110 = e-2b(a'b'-a"-a2)aoAal, f20 = -a'er"aoAa2, -a'er6

03O =

ao A 03,

5231 = b'erbal A a3,

1121 = bier

6a1

A02,

X32 = '-'-'012 A a3,

Exercises

205

and compute from these formulas the components R kl of the curvature tensor in this basis. d) Compute the components of the Einstein tensor G := Ric - 1g ,r: ) , Gi l = ,1-s _e-2b(77 Goo= I e-2b (13 _ 2b F 7-

-

+),

G22 = G33 = -e-2b (a'2 - a'N + a-+

=d) r

e) Solve the vacuum equation by means of this Ansatz. The result is (M is a constant of integration) r

g = 11-

1

2M dt2 -

r

J

r

11I

2M

r

1_1

dr2 J

- r2 (d92 + sin2 0 d 2)

32. Restrict the Schwarzschild metric g to the two-dimensional submanifold

defined by 0 = x/2 and t = const. Prove that this yields the metric of a paraboloid of revolution (a "lying" parabola!) with the equation z2 = 8M(r - 2M) (see the picture). 33. Light moves along geodesic lines whose tangent vectors have length zero. Making use of the fact that the equations defining a geodesic are precisely

the Euler-Lagrange equations of the length function G = ta gjjx'ia, prove the following assertions:

a) A particle moving at t = 0 in the equatorial plane 0 = x/2 stays there forever.

b) The quantities L := r2cp and E := t(1-2M/r) are first integrals. Moreover, the second Kepler law holds: The orbit ray covers equal areas in equal times.


206

c) Set r = r(W) and derive the equation describing the motion of light rays.

Result: It is reasonable to set u := 1/r: u" + u = 3Mu2. d) Solve this differential equation approximately up to second order in V. Result:

uo =

1

sin V +

[1

+

cos 2W

.

Interpret the constant of integration 2 0 ro as 3the scattering length. Which asymptotic value arises for cp if r tends to oo and sp is supposed to stay small? Twice this value, denoted by 5 in the picture, is the relativistic deviation of light in the gravitational field of a very large miss, which can be described by the Schwarzschild metric.

Chapter 6

Lie Groups and Homogeneous Spaces

6.1. Lie Groups and Lie Algebras In the preceding chapter Noether's theorem showed that symmetry considerations simplify the study of geometric problems, and sometimes it is only by symmetry considerations that a solution is possible at all. In fact, beginning in the 1870s, the conviction grew that the basic principle organizing geometry ought to be the study of its symmetry groups. In his inaugural lecture at the University of Erlangen, which later became known as the "Erlanger Programm", Felix Klein said, in 1872, "Es ist eine Mannigfaltigkeit and in derselben eine Transformationsgruppe gegeben; man soil die der Mannigfaltigkeit angehorigen Gebilde hinsichtlich solcher Eigenschaften un-

tersuchen, die durch die Transformationen der Gruppe nicht geandert werdenl.

One has to distinguish whether the groups under consideration are discrete (for example permutation groups) or continuous (for example, oneparameter groups of isometries). The latter were systematically introduced by the Norwegian mathematician Sophus Lie (1842-1899). which is why they bear his name today.

1 "Let a manifold and a transformation group in it be given; the objects belonging to the manifold ought to be studied with respect to those properties which are not changed by the transformations of the group."-quoted from F. Klein, Des Erlanger Programm, Ostwalds Klassiker der exakten Wissenschaften, Band 253. Verlag H. Deutsch, Frankfurt a. M., 1995, p. 34. 207

6. Lie Groups and Homogeneous Spaces

208

The fundamental idea of a Lie group is a very simple one. It ought to be a group which is at the same time a manifold, and hence allows a differential calculus. Moreover, the manifold structure has to be compatible with the group structure, i. e., the product is a differentiable map.

Definition 1. A Lie group is a group G which, at the same time, is a differentiable manifold such that the map is (infinitely often) differentiable.

Remark. Obviously, the last condition is equivalent to requiring that the product map and the inversion (9, h) -. g . h,

g'-' g-1 ,

be differentiable maps.

Example 1. Every finite-dimensional vector space V is an abelian Lie group whose group product is exactly the addition of vectors (v, w) H v + w.

Example 2. The unit circle S' = {z E C : IzI = 1} is an abelian Lie group with the usual multiplication of complex numbers as product.

Example 3. Most Lie groups can be realized as matrix groups. The set of all invertible matrices with entries in K = R or C is an open subset of IK"z and hence a manifold. Endowed with matrix multiplication as product, it forms a Lie group, the general linear group

GL(n,K) := {A E .M"(K) : det A 540} . More generally, GL(V) is meant to denote the group of invertible endomorphisms of the vector space V.

Example 4. The vector space K" and the general linear group GL(n, K) combine to form a new Lie group, the affine group Aff (K") := GL(n, K) x IIS"

with multiplication

(A, v) (B, w) := (AB, Aw + v). This product rule arises in a natural way by defining an action of the affine group on the vector space K" through

Aff(K") x K' V. (A, v)x := Ax + v. and then applying the transformations determined by (B, w) and (A, v) on a vector x one after the other. Each Lie group G acts on itself by means of the left and the right translation with a fixed element g E G,

L9, R9 : G -. G, L9(h) = g h and R9(h) = h g.

6.1. Lie Groups and Lie Algebras

209

The corresponding differentials are the following maps in the tangent bundle of G: Th9G. (dLg)h: ThG - TghG, (dR9)h : ThG To avoid double indices, in this chapter we will use the notation (df)h instead of f.,h for the differential of a map f.

Definition 2. A vector field X on G is called left-invariant (or rightinvariant, respectively) if it is transformed into itself by dL9 (or dR9, i.e., dL9X = X. At a point h E G, this means

(dLg)hX(h) = X(g - h). Since left translation is obviously a diffeomorphism of G, Theorem 35 in §3.9 can be applied to yield, for the commutator of two left-invariant vector fields, the formula

dLg[X,Y] = [dL9X,dLgy] = [X,Y1. This property, together with the fact that vector fields satisfy the Jacobi identity (Theorem 34, §3.9), endows the vector space of left-invariant vector fields with the structure of a Lie algebra, the Lie algebra g of the Lie group G.

Theorem 1. The vector space of left-invariant vector fields on a Lie group G is canonically isomorphic to the tangent space at the neutral element,

g='TeG. Proof. With each left-invariant vector field X, we associate its evaluation at the neutral element e, X - X(e) =: X E TOG. Conversely, every element X E TeG determines a vector field Xx on G by setting

Mx(g) := (dL9)e(X) This satisfies the relation

Xx(gh) = (dLgh)e(X) = (dLg)h(dLh)e(X) = (dLg)hXX(h)

0

hence Xx is left-invariant.

Remark. Because of this fact, we will no longer distinguish between the Lie algebra of left-invariant vector fields and the tangent space to the group

at the neutral element. Its elements will be denoted by upper case Latin letters X, Y, ... E 9. Choosing a basis X1, ..., Xr of 9, we can again write their commutators as linear combinations of the basis elements, r

[Xi,Xj] = k=1

&Xk-


210

The antisymmetry of the commutator and the Jacobi identity imply that the constants C have to satisfy the relations

Cij - - Cii '

CijCk,n + Cj' "ki + CrniCkj = 0.

Following E. Cartan, the numbers C are called the structure constants of the Lie group G, since Cartan's structural equations are simple to formulate in their terms. To see this, we agree to call a differential form w on G left-invariant if it satisfies the condition L*9w = w.

Following the argument in the case of vector fields, it is easy to see that the r-dimensional vector space g* of left-invariant 1-forms is canonically isomorphic to Te G. Now let al, ...,o-, be the basis of g* dual to X1. ..., Xr. Theorem 2 (Maurer-Cartan Equations). Let C be the structure constants of a Lie group G with respect to the basis X1, ... , Xr of its Lie algebra g. Then the exterior derivatives of the forms in the dual basis al, ... , or of g* are given by

do, _ -r k of Aak. j 0, and there would exist integers k and I such that

I=

q = Va.

But this would imply q = Ilk E Q. Hence we arrive at a contradiction. As the closed subgroups of IR are precisely the cyclic groups and R itself, only this last possibility remains for Z + qZ. In Example 4, §3.1. we encountered a parametrization of a torus of revolution

by means of S' x S'. The following pictures show the trace of a curve yy in this parametrization for two close rational and irrational values of q, respectively. In Example 8, §7.4, we show that the motion of a spherical pendulum can be parametrized precisely by such a curve on a torus.

The main objective of this section is to prove that a closed subgroup H of a Lie group G is a submanifold of it. Consequently, it is itself a Lie group, and the quotient space C/H carries the structure of a manifold with a smooth G-action. As a preparation, we need a few technical lemmas. Let II - II be any norm on the vector space g. Lemma 3. Let H be a closed subgroup of G. and let Xn # 0 form a sequence converging to zero in 9 such that exp(X,,) belongs to H and Xn/IIXnII tends to an element X E g. Then

exp(tX) E H for all t E R. Proof. For a fixed t > 0 we define a sequence of natural numbers m by

mn := max{kEN: Then the following estimate holds:

mnIIXnII < t < (mn + 1)IIXnII = mnIIXnII + IIXnII But on the right-hand side, the sequence IIXnII tends to zero, and hence

lim mnIIXnII = t. n-or,


218

This implies

limn rnXn =

hn-oo m m HIX,,II

IlXnll

= tX,

and, since the exponential map is continuous,

exp(t X).

Iim

n-oo Each term in the sequence exp(m,,Xn) = [exp(X,,)]"'" is an element of H. As H is closed by assumption, the limit of this sequence, exp(t X), has to lie in H, too. The proof for the case t < 0 proceeds along the same lines.

Lemma 4. For every closed subgroup H C G, the set

b :_ {XEg: exp(tX)EHforalltElR} is a linear subspace of g.

Proof. For any vector X E h, its scalar multiples a X also belong to h. It thus suffices to show that h is closed under addition. To this end, let X, Y be elements of h, and suppose that X + Y 36 0. In any case, the product exp(tX) exp(tY) belongs to the subgroup H, and for sufficiently small t we have, by Lemma 1,

exp(tX) exp(tY) = exp(t(X +Y) + 0(t2)). Then Z(t) := O(t2)/t apparently converges to zero for t - 0, and we can rewrite the preceding equation as

exp(tX) exp(tY) = exp (t(X +Y + Z(t))) E H. Choose a sequence of positive numbers that converges to zero, t - 0, and define Xn := tn(X + Y + Z(tn)). Each term exp(XX) lies in H, and --:in X+Y = lim X+Y+Z(t,) lim n-- IIX + Y + Z(t,a)[I = Fix -+Y11 11X-11 Obviously, we have Xn # 0 and Xn -' 0. Thus, Lemma 3 applies, and we can conclude that for every t E R

exp(t

IIX+YII

is an element of H. Hence X + Y belongs to Fj. Suppose that H is, in addition, a closed subgroup of G, and let h be defined as in the preceding lemma. We then choose any linear complement h' of h in 9,

9 = h+h'. Lemma 5. There exists a neighborhood V' C 4' of 0 such that, for every X' 34 0 in V', the element exp(X') does not belong to H.

6.2. Closed Subgroups and Homogeneous Spaces

219

Proof. If the assertion were false, there would exist a sequence X;, E h' converging to zero and satisfying exp(X,,) E H. Now consider the compact set K :_ {X' E h': 1 < JJX'II < 2} and choose natural numbers p,, such that p ,,X,, E K. Since K is compact, we may assume that the sequence p,X;, converges to some 0 9& X' E K. Again, [exp(X;,)]P is an element of H and lim

pnX' 11p-Xnii

_

X' 11X'6

Then Lemma 3 implies that X'/IIX'II E h, contradicting 0 0 X' E '.

0

Now we can turn to the main theorem of this section.

Theorem 7. Let G be a Lie group, and let H be a closed subgroup. Then: (1) H is a submanifold of G and thus itself a Lie group. (2) There exists precisely one differential structure on G/H such that (a) the projection 7r : G - G/H is smooth, (b) for every p E G/H there exist a neighborhood Wp C G/H of p and a smooth map w : Wp G such that 7r o p = Idw,,, (c) the action of G on G/H defined by (g. kH) gkH is smooth. Proof. It obviously suffices to show that there exists a neighborhood W C G of e for which H n W is a submanifold (left translation is a diffeomorphism of G). As before, ddecompose the Lie algebra into g = 1) + h' and consider the map corresponding to this decomposition, 4D :

g = h + 4' ---i G, 4(X + X') = exp(X) exp(X').

In h' we choose a neighborhood V' C h' as in Lemma 5. as well as a subset V C h so small that the exponential map still is a diffeomorphism on V +V'. The image W of V + V' under -ID is an open neighborhood of e E G, and

HnW= by the definition of b and Lemma 5. The set H fl W is thus parametrized by the chart (V, -D Iv+(o}), and hence a submanifold of G.

Now we turn to the proof of the second assertion. Let 7r : G - G/H denote the projection. We define a topology on G/H by the condition

A C G/H is open :a 7r-' (A) C G is open. It is called the quotient topology on G/H, and it is designed to render the map 7r continuous. Endowed with this topology, G/H is a Hausdorff space (see Duistermaat/Koik, Lemma 1.11.3). To verify the properties a manifold


220

has to satisfy, consider the distinguished point xo := e H E G/H together with the sets V, V' introduced in the first part of the proof. The map

' : V' - G/H, X'

a(exp X')

,

is continuous and maps V' onto an open neighborhood U of xo. Moreover, V, is injective, since v(X') = O(Y') implies the existence of an element h E H

such that exp(X') = exp(Y') h. Hence

h = exp(X') exp(-Y') = b(0 + (X' - Y')) . Thus h also belongs to the set W, which was defined as the image of V + V

under C Since we already proved H fl W = 4(V + {0}), this implies X' = Y'. In summary, the map 1/i : V' U is continuous and bijective. For an arbitrary point gH E G/H, consider the left translation by g E G on

G/H. L9 : G/H

G/H, kH

gkH

and introduce a chart around gH E G/H by L9(U),

U9H

09H: V' - UgH,

Z'gH := Lg o TV .

For two points gH and kH, the chart transition can be rewritten as follows, kH o y9H =

=

v-1

o Lk-1 o L9 0 V1 = exp-1 o(ir-I o 4-1 0 L9 0 a) o exp

exp-1 oLk-1h

oexp .

Therefore. as a superposition of smooth maps, the chart transition is also smooth. Hence we have proved that G/H is a differentiable manifold, the projection ;r : G - G/H is smooth, and C acts smoothly from the left on G/H. It remains to show (b). For the distinguished coset p = .ro = e H, define y; for each x in U =: Wp by

cp(x) = exp(Vi-1(x)) = 7r-1(x). For an arbitrary point p = gH one again uses the left translation L9.

0

Definition 5. The action of a Lie group G on a manifold Al is called transitive if, for two arbitrarily given points x and y in Al. the one can always be written as the image of the other under the action of G. i. e., there exists a g E G such that y = g x. An equivalent formulation of this requirement is to say that M consists of a single G-orbit, G x = Al. A manifold together with a transitive group action is also called a homogeneous space.

Obviously, the left translation on the quotient Al = G/H is a transitive group action, and thus G/H is a homogeneous space. Theorem 7 can be applied to show that some well-known matrix groups are Lie groups: The following groups apparently are closed subgroups of GL(n, K).

6.3. The Adjoint Representation

221

Example 8. The subgroup of GL(n, K) consisting of all matrices with determinant 1 is a Lie group, the special linear group,

SL(n,K) :_ {A E Mn(K) : det A = 1} .

Example 9. Let H

I =: h u, v E c } be the vector space of {[. Hamilton's quaternions with standard basis V

111

1

110

11'

oil' J Lof OJ, K = Lo i 01 and norm N(h) := uu + vv. The group of all quaternions with norm 1 is

E=

1=

LO

isomorphic to the Lie group

SU(2) := JA E GL(n, IC) : AAt = 12 and det(A) = 1). Example 10. The preceding example can be generalized as follows. The unitary group is embedded into the space of complex matrices as

U(n) :_ {A E

AAt = 1n}

.

The condition AAt = 1n immediately implies I det Al = 1, hence det A E Sl; the special unitary group is defined as the group of all unitary matrices A satisfying det A = 1:

SU(n) := {A E U(n) : det A = 1} . Example 11. The orthogonal group O(n, K) consists of the matrices A E M ,,(K) leaving the euclidean standard scalar product of Kn invariant,

(Ax, Ay) _ (x, y) Realizing the scalar product as (x, y) = xty, we see that this condition is equivalent to AAt = ln. Hence we obtain

O(n, K) = JA E Mn(K) : AA' = 1n} . Obviously, an orthogonal matrix has determinant + 1 or -1. The subgroup of all orthogonal matrices with determinant +1 is called the special orthogonal group SO(n, K),

SO(n,K) = {AEMn(K): AAt=lnand detA=1}.

6.3. The Adjoint Representation Definition 6. Let G be a Lie group with Lie algebra g, and let V be a finite-dimensional vector space.


222

(1) A representation of the Lie group C on V is a smooth group homomorphism e : G GL(V), i.e., a smooth map compatible with the group structure,

e(g h) = e(g) e(h) (2) A representation of the Lie algebm g on V is a homomorphism of Lie algebras, p : g - gl(V), i. e., a linear map compatible with the commutator,

e([X, Yl) = [e(X ), e(Y)] = e(X) e(Y) - e(Y), e(X) Sometimes, V is then also called a G-module or a g-module, respectively.

Example 12. The trivial representation of a Lie group G is the group homomorphism that maps every element g E G to the neutral element in GL(V): p(g) = 1V; the trivial representation of g associates the zero map with every element X, o(X) = 0v. Example 13. Matrix groups are defined by means of one of their representations, often called the defining representation. In fact, we introduced the groups GL(n, R), SL(n, R) and SO(n, R) in a way endowing them naturally with a representation on R". A simple example illustrates that these matrix groups and their Lie algebras have many more representations. The Lie algebra sl(2, R), for example, has representations in all dimensions: For every natural number n, define e : s((2, R) - gl(n + 1, R) by

g(H) = diag(n, n - 2, ..., -(n - 2), -n), 0

0 n

1

0

2

e(F) =

e(E) 0

0

n-1

0

nL

1

0

These matrices satisfy the commutator relations of sl(2, R), [e(H), e(E)1 = 2,o (E),

[e (H), e(F)l = -2e(F), [e(E), e(F)] = e(H), and hence form an (n + 1)-dimensional representation of sl(2, R). Properties

that cannot be expressed by the Lie bracket do not have to be preserved under a representation: For example, we have E2 = 0, but e(E)2 0 0. Nevertheless, the property that g(E) is a nilpotent matrix is preserved. There is also a a representation of the Lie group SL(2, R) corresponding to this representation of the Lie algebra; this will be the subject of Exercise 4.

Apart from left and right translation, there is a third remarkable action of a given Lie group G on itself, the so-called conjugation action,

ag : G - G, a9(h) := ghg-' = L9R9-, h.


223

It is smooth and satisfies ag(e) = e, and in contrast to left and right translation, it is far from being transitive. In the case G = GL(V), it decomposes the invertible matrices precisely into their similarity classes. In addition, the relation ag(e) = e implies that its differential at e is a map from g to g, d(ag)e: TG 5--- g ----+ TG 2--- 9,

which is obviously invertible, since d(Lg)e and d(Rg-i)e are invertible. We define the adjoint representation of G on g by Ad : G ---+ GL(g),

Ad(g) = d(ag)e E GL(g).

Before we verify that this actually is a representation, recall the definition of the center of a group. It consists of those elements which commute with all the others:

ZG = {gEG: gh=hgdhEG}. Theorem 8. The map Ad : G - GL(g) is a representation of G on the vector space g. The center ZG of G is contained in its kernel, ZG C ker Ad,

and equality holds if and only if G is connected.

Proof. First we check the homomorphism property:

Ad(gh) = d(LghRgh )e = d(LgLhRh-, Rg-i)e = d(agah)e = d(ag)ed(ah)e = Ad(g)Ad(h). Let z belong to the center Z. Then aZ = IdG, and hence Ad(z) = IdGL(9), i.e., z is in the kernel of Ad. Now suppose that G is connected and that

Ad(g) = Ide. Since ag : G - G is a group homomorphism, the map t ' -+ ag (exp tX) is a one-parameter subgroup for every X E g, and, by Theorem 5, there exists an element Y E g such that

exp(tY) = ag(exptX). Differentiating this equation with respect to t, we obtain Y(e) = dt (ag(exp tX )) I e=o = Ad(g) (X (e)) = X (e) , which proves X = Y. For the one-parameter group defined above this means

that exp(tX) = ag(exptX) for all t E R and X E 9. The exponential map is a local diffeomorphism g - G; hence ag = IdG on an open neighborhood W of e. For a connected

224


Lie group G this implies ay = IdG, since G can be represented as the union of all powers of W (with respect to the group products), 00

G = UW'. As a9 = Idc is equivalent to g E ZZ, everything is proved.

0

The differential of the adjoint representation of G (in the sense of Definition 4) is a representation of the Lie algebra g which can now be expressed by the commutator.

Theorem 9. The differential ad := Ad. : g -. gl(g) of the adjoint representation is a homomorphism of Lie algebras determined by the formula

ad(X)(Y) = [X, YJ. Proof. By the definition of the differential, we have

Ad(exptX) = exp(tAd.(X)) = 1 + tAd.(X) + ... ; hence

Ad.(X)(Y) =

Xd

Ad(exptX)(Y) - Y

li.o The flow corresponding to the vector field -X is (bt = R P(_tX), since d4ie(e)

dexp(-tX) I

dt LO t=o Applied to a left-invariant vector field Y, however, its differential coincides with Ad(exptX)(Y),

Ad(exptX)(Y) = dL P(tx)dR P(-ex)(Y) = dR p(_tx)(Y) = d4it(Y). Thus the original identity can be rewritten as

Ad.(X)(Y) = li o

`ht(Y) - Y

The right-hand side is precisely the definition of the commutator [Y, -X] _ O

[X, Y].

This representation will also be called the adjoint representation (this time of the Lie algebra g). In case of doubt, the context has to decide whether the representation of the Lie group or that of its Lie algebra is meant. Remark. The definition of the differential immediately implies the identity

Ad(expX) = exp(adX).


225

This has to be understood as an identity of operators. Applied to an element Y, it means ad(X)3i3(Y)

exp(X) Y exp(-X) = 1 + ad(X)(Y) +

+ ...

= 1 + [X, Y1 + [X, [2I Y11 + [X, [X 3'XI Y111 + ... .

Example 14. Let g be the three-dimensional Lie algebra which is abstractly defined by the following commutator relations for a basis el, e2, e3: [el, e21 = e3,

[e3, ei1 = e2.

[e2, e31 = el,

The representing matrices of the adjoint representation with respect to this basis can be computed from them. For the operator ad(ei ), we obtain 0

ad(ei)

e2 e3

e3

i

l

=

rO

-e2

0

1

-1

0

e2 e3

Ll

e2 e3

and similarly for the other two operators,

0-1 L2 := ad(e2)

0 rol

0

0 0

10

0 L3 := ad(e3) _

,

1

0

0 0

00

Let us look, on the other hand, more closely at the orthogonal group O(n, R).

It was defined as the set of matrices satisfying f (A) = AAt - 1, = 0. The differential of this map at the point X is

df(A)x = AXt+XAt, and hence, according to Theorem 5 in §3.2, the Lie algebra of O(n, R) is

o(n, R) = Te O(n, R) = {A E M,(R) : A + At = 01. The Lie algebra of the orthogonal group consists precisely of the skewsymmetric matrices, which for n = 3 is apparently spanned by Li, L2 and L3. This proves that the three-dimensional defining representation of o(3, R) is isomorphic to the adjoint representation. In higher dimensions, this fact no longer holds as a simple dimensional consideration shows: A skew-symmetric matrix has exactly as many degrees of freedom as entries above the diagonal. Therefore, 22

dim o(n, R) = and this is equal to n only for n = 3.

- n = n(n2 1)


226

Exercises SL(2, R) is not surjective. Hint: 1. The exponential map exp : si(2, R) What are the values that tr exp(A) can attain for A E al(2, R)?

2. Hamilton's quaternions H (Example 9) form not only a vector space, but also a (non-abelian) division algebra, i. e., an associative algebra in which each non-trivial element is invertible. Prove that the standard basis E, I, J, K of Hamilton's quaternions obeys the following algebra relations:

I.K=-J, and compute the inverse of the quaternion h =

0.

3. Identify the quaternions of trace 0,

Ho = {xi . I + x2 J + x3 K j xi, x2, x3 E R}, with the 3-dimensional euclidean space R3. Prove that, for every U E SU(2), the map UxU-1, Ho - Ho, x

defines a special orthogonal transformation of R3. The resulting map e SU(2) - SO(3,R) is a representation which is not injective ("faithful"). 4. The defining representation of SL(2, R) on R2 is the usual matrix action on vectors,

gV

[cx + dy] Ic d] [y] Let V,, be the (n + 1)-dimensional vector space of homogeneous polynomials of degree n in the variables x and y. Define an action of g E SL(2, R) on the polynomial p E V,, by

P(g) ' P ([X])

= P\g_1

[;]).

Prove that B is a representation of SL(2, R).

5. Let G be a Lie group, and let H be a discrete, normal subgroup of G. Prove that H is necessarily contained in the center of G.

Exercises

227

6. Let (p, V) be a representation of the group G. A subspace W of the representation space V is called invariant if. for every g E G. the relation p(g)W C W holds. For trivial reasons, the subspaces W = V and W = {0} are invariant: if the representation has no further invariant subspaces, it is called irreducible. Consider the following two-dimensional representation of the additive group R:

tl

rrI LOW = L0

1

11

Prove that this representation is not irreducible. Does the invariant subspace have an invariant complement?

7. By Theorem 6. the differential of a representation (p. V) of the Lie group G is a representation (Lo., V) of its Lie algebra g. The tensor product of two representations (p, V) and (µ, W) of G is defined by (p ®Fr)(9)(v $ w) := p(9)v (9 p(g)w, g E G. V E V. W E W . Prove that this determines a representation of G on V O W with differential

(e® i),(X)(v ®w) := p,(X )v ®w + v ®p.(X)w, X E g. 8. In order to describe the hyperbolic plane as a homogeneous space. it is useful to introduce a new model for it, the open unit disc.

a) Let D = {z E C I lzj < 1} be the open unit disc with metric

9 = (1_1212)2 I0 Show that the Cayley transform.

I.

`

-i`+i=:x+iy,

c(z) =

c:

i

z-i

is an isometry between D with the metric above and the upper half-plane ?{2 with the fourfold of the hyperbolic metric. b) Let the Lie group

SU(1.1) :_ { act on D via the formula a

b

lb

b

z

:

1a12

- 1b12 = 1}

_ az+b az+b

a Prove that this action is transitive, and that the isotropy group of zero, b

Go := {g E SU(1,1) : g 0 = 0}, is isomorphic to SO(2,IR). Hence D 5 SU(1,1)/SO(2,1R).

Chapter 7

Symplectic Geometry and Mechanics

7.1. Symplectic Manifolds Riemannian geometry is the geometry of a symmetric, bilinear form depending on the point of a manifold. The curvature is a measure of how far two symmetric bilinear forms differ locally. Contrary to this, symplectic geom-

etry is that of an antisymmetric bilinear form depending on the point of a manifold-hence the geometry of a 2-form w. It turns out that all symplectic manifolds are locally equivalent: there cannot be any concept similar to curvature in the sense of Riemannian geometry. Symplectic structures differ, if at all, only globally. Historically, the formulation of mechanics in the sense of Hamilton led to symplectic geometry, hence its essential role in modern mathematical physics.

Definition 1. A symplectic manifold is a pair (1112., w) consisting of a manifold 1112ni of even dimension together with a closed non-degenerate 2form w,

d w = 0 and

A

called the symplectic form or symplectic structure. By Theorem 16 in §3.4, every symplectic manifold is orientable. The volume form is understood to be the 2m-form dM2m

= (-1)

-(--1)/2 '

win

.

m! 229

7. Symplectic Geometry and Mechanics

230

Example 1. In 1R2"' with coordinates {q1, ..., q,", pi, ..., p,,,), the formula m

E dpi A dqi i=1

defines a symplectic form with highest power

w"' = m! -

(-1)-(--1)/2

- dpl A ... A dpm A dq1 A ... A dq,,, .

The volume form in the sense of symplectic geometry is the ordinary volume form of ]R2i'. This symplectic structure is called the canonical symplectic structure.

Example 2. Define a 1-form 0-the so-called Liouville form--in the cotangent bundle T'X"' of an arbitrary m-dimensional manifold as follows: Let V E T,, (T' X ') be a tangent vector at q E T' X' and represent it by a curve

V : (-e, e) - T* X' such that

V(0) = q,

V(0) = V.

Project this curve first by means of the projection 1f : T'X"' - X"' to the manifold, and, after that, apply the 1-form q to the tangent vector of the projected curve:

9(V) := n

d d (T ° V (t)) I=o

The 2-form w := dO is a symplectic structure on T'X'. Any system, {q1, ... , qm }, of coordinates in X' determines-representing a 1-form q as q = >2 pi dqi-coordinates {qt, ... , 9m, pi, ... , p,n } in TX". By the definition of the Liouville form 0, we have m

0= and the 2-form

pi dqi

m

w=d9=EdpiAdgi i=1

is non-degenerate. In particular, the (co-)tangent bundle of every manifold is an orientable manifold (see Exercise 9 in Chapter 3). Further examples of symplectic manifolds arise as the orbits of the coadjoint

representation of a Lie group G. Starting from the adjoint representation, GL(g), of the group G and passing to the dual of the linear Ad : G operator, Ad'(g) := (Ad(g-1))* : g' - g', we obtain a representation

Ad` : G -b GL(g')

231

7.1. Symplectic Manifolds

of the group G in the dual space g' of the vector space g. Through each functional F E g' passes an orbit 01(F) := {Ad*(g)F : g E G}, on which the group G acts transitively. The isotropy group

GF := {g E G : Ad'(g)F = F) is a closed subgroup of G, and 0 *(F) is diffeomorphic to the homogeneous space G/GF. Its Lie algebra can be characterized by a similar condition:

Theorem 1. The Lie algebra OF C g of the isotropy group GF C G is equal to

OF = {XEg :F([X,Y])=0 for all YEg}. Proof. Suppose that F([X,YJ) = 0 holds for all elements Y E g. Then from

(Ad*(exp(t X))F)(Y) = F(Ad(exp(- t X))Y) = F(exp(- t ad(X))Y) =

F(Y - t

z

we immediately obtain Ad*(exp(t X))F = F. The one-parameter group exp(t X) is a subgroup of GF, and hence its tangent X belongs to the Lie algebra OF. This proves one inclusion, the converse is proved analogously.

0 If a Lie group G acts smoothly on a manifold Mm, we can associate with each element X E g of the Lie algebra the unique vector field k on Mm whose integral curves coincide with the trajectories of the one-parameter transformation group exp(t X):

X(x) _

d

The vector field X is called the fundamental vector field corresponding to the element X E g of the Lie algebra. If G acts transitively on Mm, every tangent vector V E TXM'" at a fixed point x E Mm can be realized by a fundamental vector field. This general construction will now be applied to an orbit O' of the coadjoint representation. First, realize a given vector V E TFG' as the value of a

fundamental vector field, f(F) = V. For a further element Y E g of the Lie algebra such that k(F) = V, the equality (X-- Y)(F) = 0 immediately implies

0. e=0


232

Theorem 1 implies X - Y E OF, and the resulting map is injective,

9/9F3X- X(F)ETFO'. For dimensional reasons, it is bijective: the tangent space TFO* to an orbit 0* at F E O' can be identified with the vector space 9/9F. We now define

a symplectic structure wo on each orbit O' C g'.

Definition 2. Let V, W E TFO' be two tangent vectors to the orbit at F E O', and choose elements X, Y E g with f ((F) = V, Y(F) = W. The value of the Kirillov form wo on the vectors V, W is determined by the formula

wo (V, W) := F([X, Y])

Theorem 2. The pair (O',

is a symplectic manifold, and the 2 form

wo. is G-invariant.

Proof. Note first that the 2-form wo is uniquely defined. If the elements X, X1 E g realize the vector V at F, then the difference X - X1 belongs to the Lie algebra OF, and Theorem 1 implies

F([X,Y1) = F([X -X1,Y])+F([X1,Y]) = F([X1,Y]). Moreover, wo is a non-degenerate 2-form. If, in fact, wo (V, W) = 0 for

every tangent vector W E TFO', then we obtain F([X,Y]) = 0 for all elements Y E g. By Theorem 1, X lies in the Lie algebra OF, and hence V = f(F) = 0. It remains to show that wo is a closed form. For two elements X, Y E 9, the function wo (X, k) : O' R is determined by the formula

F([X,YI) Differentiate this relation in the direction of a third fundamental vector field:

2(wo.(X,Y))(F) = d [Ad'(exp(-t.Z)F)[X,Y]]It_o = dtF(Ad(exp(tZ))([X,Y]))It=o = F([Z, [X,YII) Then the expression for the exterior derivative dwo of the 2-form vanishes identically:

dwo. (X, Y, Z) = X (wo (Y, Z)) - Y(wo (X, Z)) + Z(wo (X, f))

-wo.([Y,Z1,X), since it reduces to the Jacobi identity of the Lie algebra g.

0

Corollary 1. Each orbit O' C g' of the coadjoint representation of a Lie group is a manifold of even dimension.

7.1. Sylnplectic Manifolds

233

Example 3. The affine group of R has the matrix representation

G=

{ [0

1 1

a>0,bER}

with Lie algebra 9

=

1[Y l

0

0

:x ,y ER}

.

J11

The computation r[0 0]'[100a -b/a 1 _ 10 ay0bx1 Ad [0 1] [0 1] [0 0] implies that g has one-dimensional orbits. To determine the orbits of the

coadjoint representation, write any element of g' as a pair (a, 0) of real numbers, whose evaluation on the element (x, y) E g is ((a, /3), (x, y)) = ax + /3y

By definition, the group element g =

10

.

1] acts as follows:

(Ad* (g-1)(a, 3), (x, y)) = ((a, R), Ad(g)(x, y)) = ((a, Q), (x, ay - bx)) = ax + /3(ay - bx) = ((a - fib, /3a), (x, y)) . Summarizing, we have Ad*(g-1)(a,0) = (a - f3b,,Oa), and hence for 3 96 0 the coadjoint orbit through (a,,3) is two-dimensional. This example shows that the adjoint and the coadjoint representation of a group G are, in general, not equivalent.

After having discussed examples of symplectic manifolds, we now want to introduce the symplectic gradient, which is the analogue of the gradient of a function on a Riemannian manifold. In this situation, we will make use of the fact that the non-degenerate 2-form w also provides a linear bijection between the tangent bundle TA12m and the cotangent bundle T* M2",.

Definition 3. Let H : M2,

IR be a smooth function on a symplectic manifold. The symplectic gradient s-grad(H) is the vector field on M2in defined by

w(V,s-grad(H)) := dH(V). Example 4. Let {q1, ..., q,,,, p1i ... , pm } be coordinates on M2'" such that the symplectic form w can be written as w = E dpi A dq;. Then

s-grad(H) =

(aH 0

aH

a

This formula immediately follows from the equation defining the symplectic gradient. A curve -y(t) in the symplectic manifold M2rn, represented in the


234

fixed coordinates y(t) = {q1(t), .... q,n(t), pl (t), ... ,p,n(t)}, is thus an integral curve of the vector field s-grad(H) if and only if the so-called Hamilton equations hold:

aH

9i = api

aH and Pi=-aqi

Theorem 3 (Liouville's Theorem). Let H be a function on a symplectic manifold (M2m, w), and suppose that s-grad(H) is a complete vector field with flow 4Dt : M2m , M2m. Then: (1) The Lie derivative of w vanishes,

Gs-g,ad(H)(w) = 0. (2) The flow 't preserves the symplectic volume,

J

dM2m =

dm2m.

J

Proof. From dw = 0 and Theorem 32 in §3.9, we conclude that

Gggrad(H)(w) = d(s-grad(H) J w) = - dd H = 0. ,11m

do not alter the symplectic structure, i.e., 4 (w) = w, and so both assertions are proved. 0 Hence the diffeomorphisms 4)t : A f2m

The existence of this invariant measure has consequences for the dynamics of symplectic gradient fields.

Theorem 4 (Poincare's Return Theorem). Let (M2,, w) be a symplectic manifold of finite volume, and let 4bt : M2m _ hf2m be the flow of the symplectic gradient of a smooth function H. For any set A C M2m of positive measure, the set

B = {x E A : tn(x) 0 A for all n = 1, 2, ...} has measure zero.

Proof. Note first that the intersections 4>_n(B) fl B are empty. Any point

xE

fl B would be a point x E B such that 4n(x) E B C A,

contradicting the definition of the set B. This immediately also implies that

the intersections ' _n(B) fl 4'_m(B) are empty for n

m, and, from the

invariance of the measure, we obtain

x

oc

Evol(B) = Evol((Dn(B)) < Vol(M2m) < x, n=1

n=1

i. e., the measure of the set B has to vanish.

0

This result has several famous generalizations, as the only example of which we quote Birkhoff's ergodicity theorem (without proof).

?.1. Symplectic Manifolds

235

Theorem 5 (Birkhoff's Ergodicity Theorem). Let f be an integrable function on the symplectic manifold (M2,, w), and let Ot be the flow of a symplectic gradient. Then the following limit exists almost everywhere: lim 1 t

Jo

t

f o .0,(x) =: f` (x) .

Furthermore, the function f' is also integrable, and its integral coincides with that of f. Lastly, f* is invariant under the flow fit. Definition 4. The Poisson bracket of two functions f and g on a symplectic manifold is the function

{f,9} = w(s-grad(f),s-grad(9)) = dg(s-grad(f)) = -df(s-grad(g)). Example 5. In the {q, p}-coordinates, we have

{f, 9} _

m Of a9

_ Of a9

5q1

aq1 Opi

i=1 \ apt

In the next theorem, we summarize the properties of the Poisson bracket.

Theorem 6. The ring C'°(M2,) endowed with the Poisson bracket is a Lie-Poisson algebra: for constants cl, c2 E R;

(1)

(2) { f, g} = -{g, f }; (3) { f, {g, h}} + {g, {h, f }} + {h, { f,g}} = 0

(Jacobi identity);

(4) {f,g-h} =g {f,h}+h- {f - g}; (5) s-grad({f, 9}) = [s-grad(f ), s-grrd(9)J.

Proof. The two first identities result immediately from the definition of the Poisson bracket, and the fourth follows from d(g - h) = g - dh + h dg. We prove (5). We insert the vector fields V := s-grad(f ), W := s-grad(g) together with an additional vector field Y into the equation defining the 3-form dw,

0 = dw(V,W,Y) = V(w(W, Y)) - W(w(V, Y)) + Y(w(V, W)) - w([V, W], Y) + w([V, YJ, W) - w([W, Y), V) .

Applying the definition of the symplectic gradient as well as that of the Poisson bracket, we can rewrite this equation as

0 = -V (Y(9)) + W (Y (f )) + Y({ f, 9}) - w([V, W], Y)

+ [V,Y)(9)-[W,Yj(f) = -Y({f,9})+w(Y,[V,WI)


236

Hence s-grad({ f, g}) = [V, W] = [s-grad(f ), s-grad(g)J. The Jacobi identity is a consequence of formula (5). In fact, we obtain

{f. {g,h}}+{g.{h, f}}+ {h.{f,g}} = s-grad(f)(s-grad(g)(h)) - s-grad(g)(s-grad(f)(h)) -s-grad({f,g})(h) = [s-grad(f ), s-grad(g)] (h) - [s-grad(f ), s-grad(g)] (h) = 0. A Hamiltonian system consists of a symplectic manifold (111211, W, H) to-

gether with a function H. The integration of the corresponding Hamilton equation relies on determining the integral curves of the vector field s-grad(H). For this, there exists an analogous notion of first integrals as in the Riemannian case.

Definition 5. A function f : 1112" -. R is called a first integral of the Hamilton function if it is constant on each integral curve of the vector field s-grad(H).

Theorem 7. (1) A function f is a first integral of the Hamilton function H if and only if its Poisson bracket with H vanishes,

{ f, H} = 0. (2) The set of all first integrals of a Hamilton function is a Lie-Poisson algebra.

Proof. We compute the derivative of a function f along an integral curve y(t) of s-grad(H):

dt f o -y(t) = df (j(t)) = df (s-grad(H)) = {H, f } . This implies the first assertion. The second follows from the Jacobi identity for the Poisson bracket.

7.2. The Darboux Theorem The Darboux theorem states that all symplectic manifolds are locally equivalent.

Theorem 8 (Darboux Theorem). Near each point x E 1112n of a symplectic manifold (M2i/.w), there exists a chart h : U C 1112m - R2m in which the symplectic form w is the pullback of the usual symplectic form.

w = h*

dpi n dqi) e=1

7.2. The Darboux Theorem

237

Coordinates with these properties are called symplectic (canonical) coordinates.

Proof. In the cotangent space TTM2," to the manifold at the point x E M2",, we choose a basis al, ... , o , ILl...... m in which the symplectic form at this point is represented in normal form, Iii

w(x) = E ai A pi . i=1

Consider, moreover, a chart (D : V - 1R 2»i around x such that

4t(x) = 0 and w(x) = 4D` (dPAdQi(0)) Denoting the corresponding symplectic form on V by

Wi := fi' I

dpi A dqi I

,

i=1

we see that there exists a neighborhood U C V of x such that for all parameters t E [0,1] the form

wt :_ (1 - t)w + twl is a symplectic structure on U. In fact, dwt = 0, and since wt(x) = w(x) for all t, a compactness argument shows that all the forms wt (t E [0,11) do not degenerate at the same time in a neighborhood of x. Since d(wl - wo) = 0, Poincare's lemma shows the existence of a 1-forma such that the difference

w, -wo = da is the exterior derivative of this 1-form. By subtracting locally, if necessary, a 1-form with constant coefficients from a, we may assume that a vanishes at the point x, a(x) = 0. Dualizing a by means of the symplectic forms wt, we obtain a family Wt of vector fields on U parametrized by t,

wt W, Wt) = a(V) .

Let cp(y, t) E Mgr" be the solution of the (non-autonomous) differential equation Ve(t) = Wt('p(t)), V(0) = y All the vector fields W1(x) 0 vanish at the point x, and the solution

corresponding to the initial condition x is constant, p(x, t) - x. Hence there exists a neighborhood U1 C U of x such that, for every initial condition y E U1, the corresponding solution p(y, t) is at least defined in the interval


238

[0, 1]. Let t : Ul -+ A12°' be the corresponding map. The formula for the Lie derivative of a differential form (Theorem 32, §3.9) implies

(L)

+ 0i ('Ca , ,a (Wt)) = V P1 - w) + Vi (Gww, ((WO) dt 4 (wt) _ 'pi = ipi (wi - w + d(Wt J wt) + W1 J dwt) = tipi (wt - w - da) = 0.

Thus Spi (wl) = pp(w) = w, and 4 o cpl is the chart we were looking for,

(DoVI)" Edpindyi

w.

O

7.3. First Integrals and the Moment Map As in the Riemannian case, some first integrals can be derived from symmetry considerations. The isometries (which are not available on symplectic manifolds) giving rise to these first integrals will be replaced by symplectic diffeomorphisms which, however, satisfy a compatibilty condition with respect to the Hamilton function under consideration. We will describe this in detail in the case of an exact symplectic manifold, i. e. , a symplectic manifold whose symplectic form is the exterior derivative of a 1-form, w = d9.

Suppose that a Lie group G acts from the left on M2" in such a way that each diffeomorphism 1g : M2i' -+ M2ni leaves the form 9 invariant,

l9(0) = 9. These diffeomorphisms 1g are then symplectic, i.e., they preserve the sym-

plectic structure w. Now let X E g be an element of the Lie algebra of the group G, and let k be the fundamental vector field corresponding to X under the G-action on M2in. The evaluation of the 1-form 9 on X is a function,

.6(X) := 0(.k). This construction determines a linear Map '1 : g _ Coo(M2,n) from the Lie algebra g to the space of functions COD(M2in) on the symplectic manifold. Its properties are the subject of the following symplectic variant of Noether's theorem.

Theorem 9 (Noether's Theorem). (1) $ : g - C, (M2,) is a homomorphism of Lie algebras, 'DQX, Y]) = (4'(X ), CY)} . (2) s-grado4' corresponds to the transition to fundamental vector fields,

s-grad(qD(X)) = f(.

239

7.3. First Integrals and the Moment Map

(3) If the Hamilton function H is G-invariant, then

is a first

integral of H, 0.

Proof. Fix an element X E g in the Lie algebra and consider the oneparameter group of diffeomorphisms corresponding to the group elements exp(-t X). Its generating vector field is the fundamental vector field X. The relation l9 (0) = 0 implies that the Lie derivative of 0 with respect to X vanishes,

0 = cX(0) = X-jd0+d(XJ0) = X_jw+d(-t(X)). Thus, for every vector field V, we have the equation

-w(X,V) = V(4(X)) = w(V,s-grad(4'(X))) as well as X = s-grad(4(X)). Using this formula, we compute the difference

{4(X),4(Y)} - .0([X, Y]) = w(X,Y) - 4,([X, Y]) = dO(X,Y) - 4,([X, Y])

= X(4(Y)) -Y(,t(X)) -24([X,Y]) = 2({'F(X), 4'(Y)} - -NX,Y])) , and this yields

{4(X), F(Y)} = C[X, Y]) . If, finally, the Hamilton function is G-invariant, we obtain

-X(H) = 0,

{H,fi(X)} =

0

i.e., 4(X) is a first integral.

The elements of the Lie algebra g provide first integrals for every G-invariant Hamilton function. These first integrals can be combined into a single vector-valued first integral by passing to the dual space g'.

Definition 6. The moment map of a Hamiltonian system with symmetry group G is the map IF : M2m -+ g' from the symplectic manifold to the dual of the Lie algebra defined by

CX)(m)

IF(m)(X)

Theorem 10. (1) The map %P is Ad'-equivariant, i. e., the following diagram commutes: M2m

19

M2.


240

(2) %V is a first integral of H.

Proof. The fundamental vector field X of a G-action has the following invariance property: d X (1,(x)) = dt [exp(-tX) g x] Jt_o _

d lg

d

dt

[exp(-t Ad(g-1)X)

xJ I t-0

= dlg(Ad(g-1)X(x)).

N

N

The 1-form 0 is G-invariant by assumption, and from this we obtain

'P(lgx)X = 8(X(lg.x)) = B(Ad(g-1)X(x)) = '(x)(Ad(g-1)X). 0 In Exercise 11, we discuss the case M'' = T'R3 with symmetry group SO(3, R) and its usual representation on R3. In particular, it is shown that the moment map P : T`R3 --+ so(3, R) = R3 coincides with classical angular momentum, hence justifying its name. Closely related to this situation is the following example:

Example 6. Consider the 2-dimensional representation of G = SL(2, R) on

M2 = V = R2. Its cotangent bundle is T'M = V x V' ^' V x V, since the representation V is self-dual. A group element g E SL(2, R) acts on an element (p, q) E V x V of the cotangent bundle by g - (p, q) = (gR gq)We call two elements (p, q) and (p', q') equivalent if they lie in the same G-orbit, (p, q) - (p', q'). Let p = (pl, p2) and q = (ql, q2) be the components of the vectors p and q, respectively. One easily computes that the moment map is given by V x V - + s1(2, R),

2 (g1P2

(q, p) --

+ g2P1)

q2P2

[

1

-gipi

- 2(g1p2 + g2Pl)

In particular, this map is equivariant with respect to the adjoint action of SL(2, R) on sl(2, R). The moment map is best studied by examining its action on SL(2, R)-orbits. For this, observe that the quantity

det(q, p) := det [qi pi g2

= g1P2 - 92P1

P2

is SL(2, R)-invariant. It thus allows to parametrize the orbit space; we omit the easy proof here'.

1For details on this SL(2,R)-action, we refer to §1.4. of the book by Hanspeter Kraft, Ceometnsche Afethoden in der Invariantentheorie. Vieweg, 1985.

7.4. Completely Integrable Hamiltonian Systems

241

Theorem 11. (1) If det(q,p) _: A 56 0, then

(q, p) -

0 A ( (1)1(0))

(2) det(q,p) = 0 if and only if p and q are linearly dependent. In this case, there exist infinitely many G-orbits, for which one can choose the following representatives: 0 0 0 0 0 0 ( (0)1(0)), ((,U) W) ((1)1(0)),

with AERR.

Thus, the moment map acts on G-orbits as follows:

((01),(A0))

LA02

-A/2J'

\(0)' 0// ~

[0

0

J

((0) ' (1)) ' (CO) ' Co)) [0 0 In particular, the generic orbits with parameter A 54 0 are mapped to semisimple elements of the Lie algebra sl(2, R). Their orbits are 2-dimensional closed submanifolds of sl(2, J). 7.4. Completely Integrable Hamiltonian Systems In this section we will make use of the following fact concerning the structure of discrete subgroups IF of the additive group IRA.

Theorem 12. Let t C ]RI be a discrete subgroup. Then there exist linearly independent vectors vl, ... , vk such that

r=

k

m; vi

:

m; an integer

i. e., F is the lattice generated by the vectors vi, .... vk.

Proof. If I' # {0} is not trivial, we choose a vector ryl E r such that I I7i I I 0 0 and consider the ball D" (0; I I7i I I) The intersection D" (0; I I71 I I) nr

is a compact and discrete subset of IR^, hence finite. Thus, on the straight line generated by 71i there exists a vector 7i E D° (0; 117, 1I) n r realizing the minimum of the distance to 0 E lR'. For this vector we have Ht

7i n r = {m 7j : m an integer),

since any vector x 36 m7i belonging to the intersection (1R 7i) n r would have to lie in one of the segments and then (m+1)7i -x would be a vector on the line through 7t with a smaller distance to 0 than 7l*. If the group r contains only integer multiples of -y,*, the proof is completed.

7. Syrnplectic Geometry and Mechanics

242

Otherwise, there has to exist a vector 1'2 E r\{m - ryi : man integer}. We project ry2 orthogonally to the straight line passing through ry1 and denote by Y2 the resulting vector. It lies in one of the half-closed segments y2 E [m - 'y , (m + 1) ryi ). Let E be the cylinder with axis [m' . (m + 1) - 7i) whose radius is equal to the distance from ry2 to the line through 'Y1. In this cylinder, there are again only finitely many elements of the group r. Let rye be the vector in r n E whose distance to the axis of the cylinder is minimal and which is not a multiple of ryi . Then we have 2

r n {R 7i ED R7;)

mi ry,

:

mi an integer

.

i=1

In fact, if there were a point x 0 miel + m2e2 in r n {Rryi ® R-y }, then x would belong to the interior of a parallelogram in the {'y , 7s }-plane. Taking the difference with a vertex of this parallelogram, we obtain a vector in r lying closer than -y; to the axis of the cylinder. Repeating this construction finitely many times proves the assertion. 0

Corollary 2. Let F C R" be a discrete subgroup. Then R"/r is diffeomorphic to the product of a k-dimensional torus Tk with R"-k,

R"/r

Tk x

Rn-k

Theorem 13 (Arnold-Liouville Theorem). Let (M2m, w, H) be a Hamiltonian system, and let fl = H, f2, ..., fm be m functions with the following properties:

(1) all functions fi are first integrals of H: (2) the functions fi commute, { fi, f;} = 0; (3) the differentials dfl, ..., df,,, are linearly independent at each point; (4) the symplectic gradients s-grad(fi) are complete vector fields.

For a given point c = (cl, ... , c,,) E R'", we consider the level manifold

Al, = {x E M2m : fl(x) = C1, ..., fm(x) = C,n}. Then:

a) The connected components of Al, are diffeomorphic to Tk x

R'-k

b) The vector field s-grad(H) is tangent to Mc. In particular, each integral curve of this vector field is completely contained in one of the level manifolds. c) If Mc is compact and connected, angle coordinates y91, ... , cp,,, can be introduced in Mc -- T' so that the integral curves of s-grad(H)


243

are described by the system of differential equations

y , = vi,

v; = constant.

Proof. Consider the flows 4i , ... , 4if

:

M2m

M2, of the symplectic

gradients s-grad(fi). Since

0 = s-grad{fi,f3) = [s-grad(fi),s-grad(f3)1, all these flows commute with one another (Theorem 36, §3.9). This determines an action of the additive group R' on the manifold M2'":

(tl, ... , tm) x = .01 0 ... The orbits of this R'-action coincide with the connected components of the level manifolds. In fact, since

0 = {ff,fi} = s-grad(fi)(f3), each function fi is constant on every orbit. Hence, the orbits of the R'"action are contained in the level manifolds. On the other hand, both are m-dimensional submanifolds of M2in, since the differentials dfl, ..., dfm are linearly independent. The isotropy group r(xo) = {t E Rm : t xo = xo} of a point xo E M2rn for the R'-action is discrete. Hence each component of a level manifold is diffeomorphic to the product of a torus and euclidean space:

Rm/r(xo) = T" x ][ Assertions a) and b) are proved, so we turn to the remaining one. Suppose that a level manifold Mc is compact and connected. Choose a point xo E Mf and denote by v1, ... , v,1 E Rm a basis of the isotropy group r(xo). Representing the basis vectors {v;) of the vector space I(tm as linear combinations of the vectors in the standard basis e1, ... , em of R.. M

vi = 1: ajaeo , a=1

we obtain a quadratic matrix A := (aid). Let m

r(m) _

{ni.ei

:

ni an integer

i=1

denote the standard integral lattice in Rm. Then m

0: Rm/r(m) -. Rm/r(xo) _ -

m

o Exi. ei) = E xi . Vi, i=1

defines a diffeomorphism. The inverse of this map, 4D-1 : MM -+ Rm/r(m) = S1 x ... x S',

i=1


244

as well as its components, 4b-1 = (WI, ..., cp"), lead to the angle coordinates

for the level manifold M, In fact, if yl, ... , y' are the coordinates in R"'/r(xo) = MM determined by 1

m

M. VM,

1

then, by the construction of the R'-action on Af,

s-grad(fi) = y; With respect to the

gyom}-coordinates, this yields m a aj,

s-grad(fi) = a=1

awa

In particular. the symplectic gradient s-grad(H) is a vector field with constant coefficients on the torus MM = T', and the third assertion results by taking v. := alb. W e want to discuss more closely h o w the angle coordinates ( 1, ... , m) of a compact and connected level manifold can be determined directly from the commuting first integrals. This will lead to an explicit algorithm for the integration of a Hamiltonian system (M2m, w, H) provided with sufficiently many commuting first integrals. Because of this procedure, these systems are called completely integrable (or integrable by quadrature). Denote by wl (c), ... , ul,,, (c) the frame of 1-forms on Al, dual to the vector fields s-grad(fl ), ..., s-grad(f,,,). The representation of the vector fields s-grad( f;) in terms of the vector fields 8/&pj immediately implies the following formula for the differentials: M

d'pi =

=1

aia wa(c)

Let ik be the closed curve in MM corresponding to the parameter values rpm=0. Then m

aik = f dcpi = F, ai0 f wn(c) 'Y k

a=1

k

Hence, first the coefficients aid and then the angle coordinates can be computed directly from the first integrals. We summarize this in the form of an algorithm comprising five steps.

Step 1. Fix c = (cl, ... , c,,,) E IIt"' and let Al, be compact and connected. Choose a homology basis y', ... , y", for the first homology group H, (Al,: 7L).


245

Step 2. Compute the symplectic gradients s-grad(fi) of the first integrals

f1=H,f2.....fm Step 3. Determine the frame of 1-forms w1 (c), ..., wm(c) dual to the frame of vector fields s-grad(f l ), ... , s-grad (f.. ) on M.

Step 4. Compute the line integrals frk w0(c), and invert the resulting (m x m) matrix. This yields the matrix A = (aij(c)). Step 5. Compute the angle coordinates (pi(c) on the level set MM from the equations m

dvi = E ai0(c) wa(c), 1 < i < m. Q=1

This procedure computes the angle coordinates Vi(c) on one level manifold Mc. Note that, according to Step 5, these are only determined up to constants. As we vary the parameters c = (cl, ... , c,,,), the Cpl, ..., cp,,, become functions on an open neighborhood of a level manifold Mc C M2m. Since the symplectic gradients are tangent to Mc, the Poisson bracket with the original functions f1, . . . , fm is computed by

IVi,fjI = dpi(s-gradfj) = aij(fl, . . . , fm). Moreover, it is a function exclusively depending on fl, . . . , fm. Similarly, we prove that the functions {Vi, cpj } are also constant on the level sets.

Lemma 1. The Poisson brackets {cpj,Vj} = bij(fl, ...,fm) are functions depending only on fl, ... , fm. In particular, they are constant on each level set 't1c.

Proof. We compute the derivative of {vi, wj } with respect to the vector field s-grad(fk):

{{Vi,'j},fk} _ {Wj,fk},'Pi} - {{fk,'ci},'pj} _ -{aik(fl, ...,fm),'Pi} + {aik /lfl, ...,fm),pj} m 49

m COajk a=1 m

Oya

8ajk

fta

}

a_I

as -

aaEk

aj.)

Thus all the derivatives cpj} (1 < k < m) are constant on every torus T' = Afc. But then the Poisson bracket {Vi, oj} itself is constant on every level manifold Mc. 0


246

Now we alter the angle coordinates, which up to now were considered only on a single level manifold, by a suitable constant on adjacent level manifolds. The aim of choosing these constants of integration for the angle coordinates is to obtain functions cpi commuting on M2n'.

... , B," (yl, ... , y') such

Lemma 2. There exist functions Bl (yl, that the angle coordinates

:= Wi+Bi(fl,...,fm)

Bpi

commute on the symplectic manifold M2m, {(pi , Vj*) = 0.

Proof. Using the notation {cpi, f;} = aij and {Wi,co } = bij, we apply the Jacobi identity to the triples (cpi, ip,, fk) and (ipi, Wj, cpk), and take into account the fact that the Poisson brackets { fi, fl) vanish. Thus we obtain the relations m aaik Oa,j m

E

E

a;a-

Obi;

ab;k

y u 'aka +

ft la

= 0,

aia

1

'

afa +8bki d ajQ J = 0 .

Inserting also the coefficients air of the inverse of the matrix A = (aid), we consider the 2-form m m

Il :_

> bijai°a'p dy° A dyO. i,j=1 a,p=1

The above relations say that fl is a closed 2-form. In fact, the first relation means that the 1-forms m

E a° dy°

of

a=1

are closed, doi = 0. Hence we can choose coordinates z1, ... , z'" such that of = dzi, and 1 becomes M

fl = > bij

dzi Adzj .

i,j=1

Since

Obi; Ozk

_

m

1

Obi,

ay°

Oya

8zk

8bij a= 1

or or aka

the second relation then precisely expresses the vanishing of the exterior derivative dfl = 0. The angle coordinates we set out to find are now taken to have the form m Bpi

aiaBa

Wi + O=1

247


with functions Bl*, ..., Bm depending only on fl, ... , f,,,. Then m

ai.

aY

a;l_ M

B.

syv

L

aB;

a (aiaaj3 - ajasid)

+ a.8=1

Taking into account the first of the relations above, the condition 0 turns out to be equivalent to

bij =

1:

".

a.3=1

a 33

(aiflaja - ai.aj,3)

This system of differential equations can be reformulated as

aB M 8ii - aya a

r3

m

[: bijaiaajZ, i;=1

and, by Poincare's lemma, it has a solution, since the 2-form f) considered above is closed.

Thus we can determine the constants of integration occurring in the transition from the differentials m

d'pi(c) = E afa(r) wa(c) a=1

to the angle coordinates on the individual level manifolds in such a way that the functions Vi, defined in a neighborhood of a level manifold, commute

on M2'". Now we add so-called action coordinates J1, ..., J,, and thereby bring the symplectic structure near a level manifold into normal form. The Hamilton function H = f, is itself constant on the level manifolds and only depends on the action variables, H = H(J1, ...,J,,,).

Theorem 14 (Action and Angle Coordinates). In a neighborhood of any compact, connected level manifold M,: C At" of a Hamiltonian system determined by m commuting integrals fl, ... , fm, there exist angle coordinates Cpl, ... , v,,, and first integrals J1, .... J,,, such that in

w = dindJi. i=1

In particular, this implies

Vj} = 0 = {Ji, Jj} and {y'i, Jj} = bij.

Proof. First, we determine the angle coordinates near the compact, connected level manifold MM such that

J(pj, ypj} = 0 and

{tpi, fj} = aij(fi,

fm)

248


We look for the functions J1, ..., J,,, using the Ansatz J, = Ai(fl, and compute the Poisson bracket m

8Aj ai. OY.

The condition {,..1j} = 8ij leads to the system of equations OA

ayj

= aij

where a'j is the inverse of the matrix aij. By Poincare's lemma, the integrability condition is

8aij

8aik

8yk = $yjj

On the other hand, we obtain from the Jacobi identity 0 = {`r'k,{iPi,.fj}}+{Vi.{fj, 7k}}+{.f .{Y"'k,Y^'i}}, and, taking into account {j,9i.k} = 0, this immediately yields

8aq

E fta aka

= L 8ukj

8 a aip..

a=1

Q=1

y

A simple computation shows that this relation is equivalent to the integrability condition for the coefficients aij of the inverse matrix. 0

Example 7 (Two-dimensional Hamiltonian System). Consider in R2 with the symplectic structure w = dp A dq a Hamilton function H(q, p) for which the level curves (q, P) E 1R2

H(q. p) = c}

:

are closed. The action variable J = J(H) is a function of H, and, together with the angle coordinate , we have

dpAdq = dV AV = Applying the Hodge operator * of R2, we see that dy: is proportional to the 1-form *dH, * dH .

dV

J'(H) IIdHII2 The integral of d,o over each level curve is an integer, which we take to be equal to -1. This condition is called the classical Bohr-Sommerfeld condition. The equation J'(c)

1

nor IIdHII2

* dH


249

uniquely determines the action variable J = J(H) in terms of the Hamilton function H. Consider the domain bounded by the level curve A4,,

Q, = {(q, p) E R2 : H(9, p) 5 C1. The vector field W := grad(H)/Ilgrad(H)II2 satisfies W(H) - 1; hence its flow +t maps the set flc onto tl +t. We compute the change of the area of the domain: d

(vol(S2c)) =

r

Ji1c

tli o

t (J$2) -J t k

d(W i dlt2) =

st4

J

W I dR2 =

lo

2)

If

,1 4

=

Js24 Gbb'(dR2)

IIdHII2 *

dH = X(c)

.

Thus the action variable J = J(H) can be interpreted as follows: J(c) is the volume vol(Q,) of the domain bounded by the level curve hf, _ {(q, p) E R2 : H(q,p) = c}.

Example 8 (Spherical Pendulum). Consider spherical coordinates on the sphere S2\{N, S} with the north and south pole deleted, h(4,) = (cos cp cos ti, sin yp cos 0 , sin

g) .

The Riemannian metric of the sphere is then described by the matrix (see Example 14 in §3.2) 9

_

1

L

0

O cost'

Denote the coordinates in the cotangent bundle by (cp, o, pw, p v) and consider the Hamilton function

H = 2v+2 H describes the motion of a pendulum of length one suspended in the center of the sphere. The meaning of the angles tp and t'} can be seen in the picture to follow. For simplicity, the gravitational constant was taken to be one.

250


Z

The variable (p does not explicitly occur in the Hamilton function; hence P:= p,, = Ocos2 iii is a first integral, {H, P} = 0. The Hamiltonian system (T*S2, H) is thus completely integrable. The level manifold

M2(ci,c2) :_ {pw,po) E T* S2 : P = c1, H = C2} is empty for negative values of the parameter c2, and it consists of the south pole remaining at rest in the case c2 = 0. Hence we suppose that the parameter is positive, c2 > 0. The equations describing the manifold A12(cl,c2) are

C1 = pr,,

c2 =

2 + 2 cs2 i + 1 + sin ?P.

The relation cl = 0 implies that cp has to be constant. In this case, the second equation of motion reduces to that of a planar pendulum, so we will henceforth exclude this case. Depending upon the sign of c1, the function is monotone increasing or decreasing, i. e., the pendulum does not change its direction of motion. Rewriting the second equation and setting z = sin 4i, we obtain

p=

c2-1-z- 2(1

C- 2

z2)

U(z)

and see that the function U(z) thus defined cannot be negative. The limiting

case p = 0 corresponds to the pendulum moving in a fixed plane, hence on a meridian. To see where the function U(z) can be strictly positive, we multiply it by the denominator 1- z2 and look for the zeroes of the resulting cubic polynomial,

V(z) = (1 - z2)U(z) =

c2 (c2 - 1 - z)(1 - z2) - 2

.


251

At the boundaries of the interval, V(±1) = -c/2 < 0 is negative, and at +oe the function V diverges to +oc. Hence one of the three zeroes has to lie above 1, and, since it cannot correspond to any angle 1;i, this zero has no physical relevance. In the generic case, the other zeroes belong to the interval (-1, 1); between them V, and hence also U, is positive. We conclude that V(z) has the qualitative behavior of the graph on the previous page. The mass point can only move between the two meridians corresponding to the zeroes in the interval (-1, 1). The boundary values t ,'4'2 defined this way are actually reached at the end of every up or down swing. Summarizing,

the level manifold can be parametrized by the two parameters 4z = V and

0=

via ( P Cl!

cos

- 2 - 2sini ) .

It consists of 2 two-dimensional tori, where ' only takes values in (>G1, 021. The corresponding coordinate vector fields are a a a a apti; a

a = TT' a = a + o '

ap

,

We express the symplectic gradients of the first integrals in the coordinates of the level manifold: P;, a a d P,2. a s-grad(H)

= cost ;, a + p av - a (2 cost V, + 1 + sin V) aplo Pro

a

a

= cost' ap +Pva +Pv a i a a _ 2 Zi app. + Pv a,. pw

s-grad(P) = a =

ate,

.

"PV


252

The dual forms w1(cl , c2) and w2(cl, c2) are thus 1

wl(Cl,C2) =

de*,

PO

C12

w2(cl,c2) = dw - p o Cos '+G dip

.

We compute the periods with respect to the homology cycle 'rl which is parametrized in M2(c1i C2) by 't()'. The factor 2 is a consequence of the fact that the boundaries, y1,'Y2, of the interval correspond to the minimum and the maximum of the motion, whereas a cycle is meant to be a motion between two extremal points of the same kind:

b1l =

f wl = 2 ti

J

%

c>>

Py

b12 =

f

w2 = -2c1

7i

r 02 diP Jv Pb oos2

Let, similarly, 72 denote the homology cycle determined by 0 < gyp` < 2Tr. Then b21 =

Jwi = 0,

Jw2

b22 =

= 27r .

The matrix occurring in the algorithm for computing the angle coordinates is now easily calculated:

-l [b21

- [a21

b221

a22]

1

-j w2

f7twl

2vfnwl

0

1/27r

The resulting quotient of the basic frequencies is

V1 =all =_1f1-'2= 21r V2

a12

,

C1

7r

112

.//ol

di&

cost tai ' PW

This is nothing but the perihelion precession, i.e. the total variation of the angle V for a complete cycle: O +a

dye = 2

d` d1/b = 2

02

. dpi = 2c1

d

27r L1 .

COS2t' ' PG = .1,1 J 'rI J d it fo , fo, I In general, the motion is quasi-periodic. The trajectory is closed if and 1

only if vl /v2 is rational; otherwise, it is dense on the torus. The transition to action angle coordinates allows us to determine the physically relevant basic frequencies of the system without having to explicitly perform the integration. This accounts for their importance in astronomic perturbation theory.

7.5. Formulations of Mechanics Newton's equations describe the motion of a mechanical system under the impact of a force. The latter is understood as a vector field depending upon position and velocity, and is central for Newton's formulation of mechanics.

7.5. Formulations of Mechanics

253

During the 18th century, this view changed in that Lagrange considered the action integral as the fundamental quantity for the description of dynamics. Newton as well as Lagrange formulated mechanics within the tangent bundle of configuration space. In the 19th century, by transition to the cotangent bundle, Hamilton succeeded in formulating the dynamics of mechanical systems within the framework of symplectic geometry. The aim of this section is to explain these fundamental ideas of mechanics and the related mathematical structures.

Newton (1643-1727)

Newtonian systems

Lagrange (1736-1813)

Hamilton (1805-1865)

Lagrangian systems Hamiltonian systems

i Newtonian systems with potential energy

1

hyper-regular Lagrangian systems

Legendre transformations

Mathematical Contents: Riemannian geometry

Finsler geometry

symplectic geometry

Formulation of Mechanics According to Newton In Newtonian mechanics, the state of a mechanical system is described by finitely many real parameters. This leads to the notion of configuration space. which is a smooth and finite-dimensional manifold Mm. A motion of the mechanical system is a curve 'y : (a, b) - M' in configuration space.

Its tangent-the velocity-is then a curve ' : (a, b) - TM' in the tangent bundle. and this space is called the phase space. According to Newton, the forces acting on the mechanical system are described by vector fields depending only on position and velocity, that is, vector fields X on TM"'. However, not all vector fields are allowed, since a force can act only in space.


254

The force vector field X has to satisfy the condition

drr o X = Id. Here 7r : TM" - Mm denotes the projection of the tangent bundle, and dir : TTM' --+ TM' is its differential. Summarizing, we arrive at the notion of a Newtonian system.

Definition 7. An autonomous Newtonian system is a triple (Mm, g, X ) consisting of a manifold M'", a Riemannian metric g, and a vector field on the space TM'" such that d7r o X = IdTMm .

The function T : TM'" -+ R defined by T(v) := 2 g(v,v)

is called the kinetic energy.

Definition 8. A motion of the Newtonian system (M"', g, X) is understood to be a curve y : (a, b) - M' in configuration space whose curve of tangents ry : (a, b) - TM'" is an integral curve of X,

'1'(t) = X MW This is an invariant formulation of Newton's equation.

Example 9. Consider R with the coordinate x and identify TR = R2 with R2. Here the coordinates are denoted by {x, i}. The vector field 8 1 0 2

X=

wax+m(-k x-P)8i

on TR has the required projection property, and a curve x(t) in R is a motion in this Newtonian system if x(t) is a solution of the oscillator equation

ml(t) = -k2x(t) - pe(t) . Example 10. Let (Mm,g) be a Riemannian manifold. We define a vector field S : TM°1 - TTMm on its tangent bundle-the so-called geodesic spray-as follows: If v E TAM' is a tangent vector, then there exists precisely one geodesic line y,,,,, : (-e, c) -> Mm such that

7'r,.(0) = X, % AO) = v. Consider its tangent curve, %,v : (-e, e) - TM'", and set S(v)

dt (%"(t)) t=o

The relation 7r o ryx,,, (t) = ry v (t) implies dir o S = IdTMm. Hence (M', g, S) is a Newtonian system, and the motions of this system are the geodesic lines


255

of the Riemannian manifold (Mm, g). In the coordinates {x', ii} of the tangent bundle, the geodesic spray is given by the formula

S=

x' i=1

axi

- E r;k x'xA axi i,j,k=1

a straightforward consequence of the system of differential equations describing geodesic lines-see §5.7. Many Newtonian systems have a potential energy. This is a smooth function

V : Mm --+ R defined on configuration space. The gradient grad(V) is a vector field on Mm locally determined by grad(V)

'"

av a

- ì,j=1 g'' (x) axi axj

In the sequel we will need, however, a different vector field, denoted by grad(V). This will be a vector field on phase space. At the point v E TM'", it is defined by the following equation: d

grad

(v + t - grad(V)(ir(v))) e-o

(V) (v) = dt In the manifold Mm, the curve v + t - grad(V)(a(v)) projects to the base point rr(v) E M' of the vector v E TM'. Hence the vector field grad(V) projects to zero under the differential dir, d7r o grad(V) = 0,

and, for any potential energy V, the vector field X := S - grad(V) is an admissible vector field on TM'" in the sense of Newtonian mechanics. In local coordinates, we obtain the formula in

av a

E e(x) axi aij i,j=1 Definition 9. A Newtonian system with potential energy is a triple (M, g, V)

consisting of a Riemannian metric g and a potential energy V : M' -+ R. The corresponding force vector field is

X = S-grad(V). A motion in a Newtonian system with potential energy is defined to be a Mm whose tangential curve (a, b) -+ TM'" is an curve 7 : (a, b) integral curve of X.

Definition 10. The energy of a Newtonian system (Mm, g, V) with potential energy is the sum of the kinetic and the potential energy,

E:TM' R, E=T+Voir.


256

Theorem 15 (Energy Conservation for Newtonian Systems). Let (Mm, g, V) be a Newtonian system with potential energy, and let X = S - grad(V) be the force vector field. Then dE(X) = 0.

In particular, E(y(t)) is constant for every motion-y(t) of the system.

Proof. In local coordinates, the energy E and the force vector field are given by the formulas I M

E = 9 E 9ij(x)xY +V(x), i,j=1

X =

i=1

x' a ii -

k=1

I'v (x) +

8i 9ikW I i=1

82k

.

Using the expression for the Christoffel symbols riki from §5.7, 2

gkQ (x) (Lgii (x) + 8x (x)

-ij axQ

(X))

we obtain dE(X) = X (E) = 0 by an elementary calculation.

,

0

Motions with large energy in a Newtonian system (M'", g, V) with potential energy are-up to a change of parametrization-geodesic lines with respect to a new Riemannian metric. This construction will lead to the MaupertuisJacobi principle. Suppose that the potential energy V : Mm - R is bounded from above by Eo,

sup{V(x): xEMm} < Eo. Then g' = (Eo-V) -g is a Riemannian metric on the manifold Mm. Consider a motion y : (a, b) - MI of the Newtonian system (MI, g, V) with energy Eo,

Eo = 29('Y(t),'Y(t))+V(7(t)) Since Eo > V(y(t)), the tangent vector y(t) vanishes nowhere, and the function

s(t) := f r' (Eo - V(7(µ)))dp a

becomes a strictly monotone function a : (a, b) - (0, b* ). We invert this function and thus view t E (a, b) as a function of the parameter s E (0, b*), t = t(s). Let the curve .y*(s) be the initial curve y in this new parametrization.


257

Theorem 16 (Maupertuis-Jacobi principle). Let -y(t) be a motion of the Newtonian system (Mm, g, V) with energy E0. Then ry' (s) is a geodesic line

in Mm with respect to the Riemannian metric g' = (Eo - V) g.

Proof. The Christoffel symbols1) r and 't V of the metrics g and g' are, in local coordinates, related by the formula k

rtj

=

1

k

_ 8V

1

8V

r'3 + 2 (Eo - V)

ajk

C7xi

8V ak

-&k +

02-a

9 9ij

We write the motion -y(t) _ (x' (t), ... , x'(t)) in local coordinates. Then dxk

_

ds d2xk

dsk dt

_

dt

_

- V) dt

dxk m 8V dxa 2(Eo - V)3 dt Ox- dt '

d2xk

1

1

1

2(Eo - V)2 dt2

ds2

dxk

1

1

72= (Eo

Ws-

and, using the equation of motion, d2xk dt2

--

m

rk Ix' dx'

m

ij dt i,j=1

OV ka

dt - a=1 8xa 9

as well as the energy condition m

dxi dxj

E i,j=i gij dt

dt

2(Eo - V),

we obtain the claimed result: d2xk WS-2 +

k dx' dx'

m

I'i ds ds ij=1 1

+ `1(Ec, __V) 3

m

a=1

=

8V ka 8xa9

8V

1

ka

-2(Eo - V)2 a=1 8xa9 in

dx' dx-I

ij=1 go dt dt

= 0.

Formulation of Mechanics According to Lagrange The transition to Lagrangian mechanics proceeds by considering the Lagrange function of a Newtonian system with potential energy.

Definition 11. Let (M'", g, V) be a Newtonian system with potential energy. The Lagrange function (or Lagrangian) L : TM' R is the difference of kinetic and potential energy,

L = T-Voir.


258

Theorem 17 (d'Alembert-Lagrange). A curve 7 : (a, b) -+ Mm is a motion of the Newtonian system (Mm, g, V) with potential energy if and only if the Euler-Lagrange equations hold: dt

(ex (y(t))) = 8x (7(t))

Proof. We prove this theorem again using local coordinates. The curve 7(t) _ (x1(t), .. . , xm (t)) is a motion of the Newtonian system with potential energy if and only if it solves the system of differential equations

ik

=-

»i

m

I

jiìj

ij=1

- a=1 E

Va9ak

For brevity, we denoted by V. the partial derivative of the potential function, 4G, := 8V/8x°. The Lagrange function is

L('y) =

2

gijii2j - V(1'), i,7=1

and this leads to the difference d dt

8L

m

8L

(8zi

a,;3=1

d9ia

1 99. Q

ax-*l

2

axi)

axa +

m

9iaya+ V; . a=1

Multiplying the Euler-Lagrange equations by gik and summing over the index i, this system of equations turns out to be equivalent to m m a9ia 109.e M 0 = xk + M V 'k +

E i-l

i9

E ` 8x3 i=1 a.3= 1

2 axi

9

The claim now immediately follows from the formula for the Christoffel symbols r 13.. o Thus the equations of motion of Newtonian mechanics are, in the case of a potential force, equivalent to the Euler-Lagrange equations. For the latter, it does not matter that the Lagrange function arises as the difference of a kinetic and a potential energy. Hence we define:

Definition 12. An autonomous Lagrangian system is a pair (Mm, L) con-

sisting of a manifold Al" and a smooth function L : TMm - R. A Lagrangian motion is a curve -y : (a, b) -- M" which solves the system of Euler-Lagrange equations Wt

(er ('i (t))) = dL ('Y(t))

259


Example 11. Let (M'", g) be a pseudo-Riemannian manifold and A a 1form on it. The Lagrange function

L(v) =

g(v, v) - A(v)

2 generalizes, in a sense to be discussed in Chapter 9, the motion of a charged particle of mass m under the influence of the Lorentz force of the electromagnetic field generated by A. Motions in a Lagrangian system can be understood from the point of view that they are critical points of the action integral L. This measures for a curve -y : [a. b] -p Mm the mean value of the Lagrange function on this curve, b

L(y) :=

Ja

Theorem 18 (Principle of Least Action). A curve -y

:

[a. b]

Mm is a

motion of the Lagrangian system (Mm, L) if and only if the variation of the action integral vanishes for every variation y1, of the curve with fixed initail and end points, y1,(a) = y(a), -y. (b) = y(b): d dµ

(jb)

0. LO =

Proof. We compute the derivative of the action integral with respect to the parameter p in coordinates -t,, (t) = (x1(µ, t), ... , x'"(µ, t)) by partial integration: =Jab

d (,C(-t,.)) 1,0

[aL((t))

- d (a (7(t)))

(0, t)dt.

The functions 8x'(0, t)/8µ are arbitrary functions vanishing at the end points of the interval [a, b], and hence the Euler-Lagrange equations are equivalent to the condition dµ (L(yµ)) Iµ=o = 0.

For general Lagrangian systems, there exists a notion of energy which, on the one hand, generalizes the energy of a Newtonian system with potential energy, and is, on the other hand, a preserved quantity. This Lagrangian energy is obtained by first introducing the Legendre transformation. Let

L : TM'

R be a Lagrange function, and let v E .,Mm be a vector at the point x E Mm. Now restrict L to the tangent space .,Aim and consider the differential D(LIT=SIm)(v) at the point v E TTMm. This is a linear map TTll1"' IR. hence a covector in T *M'.


260

Definition 13. The Legendre transformation C : TM'

T'M' of an

arbitrary Lagrangian system is the map ,C(v) := D(LIT=Mm)(v).

Example 12. If the Lagrange function L = Zg - V is the difference of a

kinetic and a potential energy, the following relation holds for v, w E TM':

G(v)(w) = 2 . D(g)(v)(w) = &,w). Hence the Legendre transformation L : TM"' T* M' is simply the identification of the tangent bundle with the cotangent bundle via the metric.

Definition 14. The energy of a Lagrangian system (MI, L) is the function E : TM' -' R on the tangent bundle defined by

E(v) = L(v)(v) - L(v). In the case of a Newtonian system with potential energy, we have

E(v) = L(v)(v) - L(v)

2g(v, v) + V(r(v)) .

This shows that the energy in the sense of Lagrangian mechanics coincides with the Newtonian energy.

Example 13. We compute the Legendre transformation for the Lagrange function from Example 11. Let (x, y) E TTM" with local coordinates

xl...

, xm

and y', ... , y'. Then, m

...,y"') = 2

m Egjjy'1!r - EAiy', i

1,3

and, as an element of T,M, its differential is m

m

D(LITTMm)(yl,...,ym) = mEgify'dhe -EAidy'.

i,j

i

We evaluate this map at (x, v), v = (v', ... , v'"): G(y)(v) = D(LI T:MO1) (y)(v) = m . g(y, v) - A(v) . By definition, this yields for the energy the value

E(v) = C(v)(v) - L(v) =

2 g(v, v),

which has the remarkable property of not depending on A. A more careful physical analysis shows that the energy of a charged particle indeed only

depends on the electric, not on the magnetic field. This is related to the fact that the magnetic field does no work on the particle (see §9.5).


261

Remark. Denoting the coordinates on TM' by {x', , X-, x', ...'e } and the coordinates on T` Mm by {qj, ... , qm. pi, .... p,"}. we see that the Legendre transformation f- is given by

qi = x'

and

aL pi =5p,

and the expression for the energy E takes the following form:

E(x,

_ "' OL xi - L(x, x) x) - ==Y 8ii

.

Theorem 19 (Conservation of Energy for Lagrangian Systems). The energy E(y' (t)) of each motion ti(t) of a Lagrangian system (Mm, L) is constant.

Proof. The energy of a curve is

E(7(t)) _ t-1

8L

dxi

f72i

dt

- L(ti(t))

and by differentiation we obtain "'

d

dt E(Y(t))

_

ij=1

8L

dx' dx-

8L

dx'

1

C 8ii8xj dt dt + C7x'&i dt dtZ J -

' &L dx' i=Y

dxi dt

Using the Euler-Lagrange equation m

dx' "` c7L OL d2xi EY Ox-. = O±'Oxi dt + axiaxj dt2 i,7=Y ij= i3L

we immediately obtain the assertion.

0

Thus the energy is a first integral for any motion in a Lagrangian system. As in §7.3. further first integrals can be derived from symmetries of the Lagrange function. To this end, consider a one-parameter group of diffeomorphisms ,P6 : Al' M' of the configuration space as well as its generating vector field, d

V(x) := d('ts(x))j8=0 on Al'. The differentials d(4 Q : TM' TM"' are diffeomorphisms of the phase space TM"' into itself.

Theorem 20 (Noether's Theorem). Let the Lagrange function L be invari-

ant under the action of a one-parameter group of diffeomorphisms, L(d(4s)(v)) = L(v). Then the function fv : TM' --+ R defined by

fv(w) = lim

µ is a constant of motion for the Lagrangian system (Mm, L). µ-.O


262

Proof. Let the one-parameter group of diffeomorphisms $t be determined in coordinates by

4tt(xil ...,xm) = ('Ft(x1, ...,X"), .... Dm(x'....,xm The invariance of the Lagrange function implies that m

m aL a(De

ax= as

I

aL O4

dx'

+ E axj . ax=as

and the vector field V has the components m

V=E =1

dt

= 0,

a

s

s=IUC7-

x

a3

'

i

Thus we obtain

fv(7(t)) = 1 8x as Is.o' and from the Euler-Lagrange equation we conclude that d

d

m

m IL 81' ax= WS=0 I8=0

+

aL 02V ax' ax_as o

dxj

dt = o. o

Example 14. To each transformation group'Ft : Mm - Mm preserving the metric g and the potential function V there corresponds a first integral,

fv(w) = g(w,V), which is linear in every fiber of the tangent bundle Tlblt. We made use of this first integral in Theorem 37, Chapter 5, to integrate the geodesic flow on surfaces of revolution (Clairaut's theorem). Hence first integrals of the geodesic flow which are linear in the fibers arise from isometries of the metric.

Example 15. If 't preserves the pseudo-Riemannian metric g as well as the 1-form A from Example 11, then the Lagrangian of a charged particle in an electromagnetic field is invariant, and hence Noether's theorem can be applied to yield the invariant

fv(w) = m g(w, V) - A(V).

Formulation of Mechanics According to Hamilton We will arrive at the formulation of mechanics according to Hamilton by starting from a Lagrangian system (M, L) with bijective Legendre transformation C : TM' T'Mm. These Lagrangian systems are called hyperregular. A regular Lagrangian system is one whose Legendre transformation is locally invertible. For simplicity, we confine ourselves here to the case


263

of a globally invertible Legendre transformation. For example, Newtonian systems with potential energy or the system of a charged particle moving in an electromagnetic field (Example 11) are always hyper-regular.

Definition 15. Let (Mm, L) be a hyper-regular Lagrangian system with Legendre transformation C : TM' --+ T* M' and energy E : TM' --+ R. The function

H := EoL

1

defined on the cotangent bundle is called the Hamilton function (or Hamiltonian) of the system.

Example 16. In the case of a Newtonian system (Mm, g, V) with potential energy, the Legendre transformation is determined by the formulas m

qi =

(9L and pi= axi = E gi.±

xi

a=1

Inverting this transformation leads to m

x' = qi

and

i' = E gtnPa , U=1

and the formulas for the energy E, the Lagrange function L, and the Hamilton function H read as follows:

(1) E = 2

i j=1

gijx'i +V(xl, ...,x'"),

m

(2) L = 2 E gijz'xi - V(xl, ...,xm), ij=1

(3) H = 2 E g''rpipj+V(gl,...,gm). ij=1 Through this change of phase space-i. e., replacing the tangent bundle TM"' by the cotangent bundle T'M'-we enter the realm of symplectic geometry, since T*M'" always carries the symplectic form w = d9.

Theorem 21 (Hamilton's Theorem). Let (M'", L) be a hyper-regular Lagrangian system. A curve -y : (a, b) -' AM"' is a Lagrangian motion if and only if the curve G(ry) : (a, b) -. T' M"' is an integral curve of the symplectic gradient s-grad(H) of the Hamilton function.

Proof. The Legendre transformation C is defined by i

OL

qi = x, Pi = axi . For its inverse map, we introduce the notation

x' = qi

and

i = I (ql, ... , qm, Pi, ... , pm).


264

The Hamilton function is then m

H= a=1

and we compute its partial derivatives:

+

1: (P. -

apt OH

a

M

a=l

E51-aL aV _ m

a

aqt

_ OL axt

OL

axt =

.

(i+)

Thus we obtain a formula for the symplectic gradient:

s-grad(H) = E t=l

A curve

G(?'(t)) = 1 21(t), ... , 2m(t), 'oil (fi(t)),

--

,

OL

is an integral curve of the vector field s-grad(h) if and only if

it = V and

Wt

(5p IL

(ti(t))) = axt ('Y(t))

The first equation is trivialy satisfied by setting it = fit, and this proves the 0 assertion.

Exercises 1. In R4 with the symplectic structure w = dxl Adx3 +dx2 Adx4. we choose the following four diffeomorphisms a, b, c, d:

a(xl,x2,x3,x4) = (21,x2+1,x3,x4), b(xl, x2, x3, x4) = (xl, x2, x3, x4 + 1), C(xl, x2, x3, x4) = (xl + 1,x2, x3, x4) , d(xl, x2, x3, x4) = (x1, 22 + 24,x3 + 1,x4 ),

and denote by r the group of motions of R4 generated by them. Prove that w is a F-invariant 2-form. Hence w induces a symplectic form on the manifold M4 = R4/F. Prove that M4 is compact. Finally, denote by [F, F) the commutator group of F. Then F/[t, rJ is a free abelian group of rank three (Thurston, 1971).

Exercises

265

2. Prove that the symplectic form w of a compact symplectic manifold M2m can never be an exact differential form. Hence the second de Rham cohomology HDR(M2m) is non-trivial. In particular, for m 1, even-dimensional

spheres St' have no symplectic structure.

3. Consider M = R2\{0} with the symplectic form w = dx n dy and the vector field

V_

x

8

y

8

x2+y2 8x+x2+y2 8y

a) Let cp = arctan(y/x) be the polar angle defined in every sufficiently small neighborhood of (x, y)

(0, 0). Compute grad(W) and

b) Conclude from this that V is no Hamiltonian vector field on all of M.

4 (Continuation of Example 3). Prove that the Liouville form on the twodimensional coadjoint orbit through the element (a, Q) E g` (3 54 0) is given by the expression

w=

daAdf

.

Hint: Show first that the fundamental vector fields corresponding to the Lie algebra elements (1, 0) and (0, 1) are

aa, and - Qom. 5. Let V be a vector field on a manifold Mm and 1 = V(x) the associated differential equation. A first integral of V is a smooth function h : Mm - R which is constant along every solution of the differential equation. a) Prove that h : Mm -p JR is a first integral of V if and only if dh(V) = 0.

b) Prove that the set Cv (M'n; R) of all first integrals of V is a subring of

C' (M'; R). c) Show that h(x, y) = y2 - 2x2 + x9 is a first integral of the vector field

V = 2y 8/8x + (4x - 4x3) 8/ay in R2. Describe the geometric shape of the integral curves in the (x, y)-plane by means of h.

d) Find a vector field in the plane which has no non-trivial first integral.

6. Let f = a + b i : U -p C be a holomorphic function on an open subset U of JR2 ^_' C. Consider the vector field V = grad (a).

a) Prove that b : U -. R is a first integral for V.

266


b) Describe the geometric shape of the integral curves of this vector field for the functions f(z) = zk and f(z) = z + 1/z. 7. Consider on ]R3 a vector field B (a magnetic field) and its 2-form

B= as well as the symplectic form on the phase space R3 x JR3 with coordinates (9, ii) = (x, y, z, Vx, Vy, Vz),

wB = m(dxAdvx+dyAdvy+dzAdvz) - - B. C

As Hamilton function, we choose the kinetic energy

Ho = 2 (vZ+vy2+vZ). a) Show that the defining equation for the Hamiltonian vector field associated with Ho is equivalent to the Lorentz equation dv

_

e

c.vxB.

mdt

(*)

b) If B = dA is exact and A denotes the associated vector field, show that the map

f:

1R3 x JR3 -R 3 x 1R3,

(4> ) ' (q, mii + A)

(q, p)

is a canonical transformation, i.e., for the canonical symplectic form wo and the Hamilton function HB,

wo = dxndpx+dyAdpy+dzndpzi HB =

-lip-

-All

the following equations hold:

f*wo = 'B, f*HB = Ho, and equation (*) does not change.

c) If B is constant, any particle moves on a helix. Hint: Interpret torsion and curvature as first integrals. If both these quantities are constant for a curve, then this curve has to be a helix (Exercise 6 in Chapter 5). 8 (Plane Toda Lattice). Consider in JR2 the second order differential equations

i=

-ex-y,

y=

ex-y

which in phase space (x, y, i, y) E JR4 are equivalent to t; = X

a a

X(x,y,i,y) = a TX + y ay -

ex-y a +

ai

The energy E = (i2 + y2)/2 + ex-y is a first integral.

ex-y

ay

with

267

Exercises

a) Show that this system has further first integrals, for example P = i + y or K = (i - 2y)(y - 2i)/9 - e---y. These quantities are related by E + K = 5P2/18.

b) The set M2 (E, P)

(x, y, i, y) E 11 P4 :

(i2 + y2)/2 + eZ-v = E, ±+y=P}

is not empty if and only if 4E - P2 > 0. In this case M2(E, P) is a smooth two-dimensional submanifold of R4 without boundary . This leads to a decomposition of ]R4 into a family of submanifolds, and every integral curve of X lies completely in one of them. c) Show that each of the submanifolds M2(E, P) lies in an affine subspace of dimension three and is diffeomorphic to R2.

d) If {(t) = (x(t), y(t), i(t), y(t)) is an integral curve of X in M2(E, P), then

x y i+y = P and f 4E-P -4ez-V =

t

Show that (A > 0) dz

A -Bez

_=

1

In

v/-A

vA- A -Bee ( I + A -Bee

'

and use this formula to integrate the equations for the integral curves in M2(E, P) completely.

9 (Euler Equations). Let I : ]R3 - R3 be a symmetric positive definite operator. Consider the differential equation I(i) = 1(w) x w, where x denotes the vector product.

a) Prove that this differential equation has two first integrals, the energy 2E = (I(w),w) and the momentum M2 = (I(m), I(w)). b) Conclude from this that the integral curves of the differential equation are intersections of two ellipsoids with center 0 E IR3 (I 0 IdR3). In particular, all integral curves are closed. Let Il be the smallest and 13 the largest eigenvalue of I. Then there exists an integral curve only for 2EIl < M2 < 2E13. . 10 (Motion in a Central Force Field). Let a potential force F act on a point of mass one in R3, dU

-grad(U(r)) F drr) r with a function U(r) depending on the radius r = IIxil only. The energy E = 11±112/2 + U(r) is a first integral.


268

a) Show that M = x x . is a further first integral (M is the angular momentum), and conclude from this that every trajectory of the point lies in a plane in R3. b) Determine for which values of the parameter M E JR3 the level surface

A3(M) = {(x,x)ER6:xx:i=M} is a three-dimensional submanifold of phase space R6.

c) Let r(t) be the distance to the origin of a motion whose an;;ular momentum is M. Then d2r

_ _dr dU

dt2

1IM112

+ _73-

i.e., r(t) describes the motion of a point in R1 under the effective force F2 = -grad(V) with effective potential V(r) = U(r) + IIMII2/2r2. The energy of this motion is

E* = 2 + U(r) +

I I2r2Iz

Prove that this energy E* coincides with the energy E (for fixed angular momentum M).

Hence, if x(t) E R3 is the trajectory of a point moving under a force F and r(t) is its distance to 0 E K3, then

V2E -

IIMII2

- 2U(r) = r',

i.e.

r

J

-dt = t

11 (Classical Moment Map). Consider K3 with the defining representation of SO(3, K).

a) Prove that this representation is equivalent to the adjoint representation of SO(3, K) on its Lie algebra so(3, K), if R3 and so(3, K) are identified via the map vl R3

so(3, f8),

v = v2 u.- v =

0

-V3

v::

v3

0 vi

-V1 0

-V2

V3

Moreover, the adjoint and the coadjoint representation of SO(3, K) are also equivalent.

b) Prove that this identification satisfies the equation

v(w) = v x w, [v, w] = [v, w],

(v, w)

Ztr (vv-) .

269

Exercises

c) By a), the moment map' : T'R3 - so(3,R)* can be interpreted as a map from T*R3 to R3. Show that it can then be written in the form

W(9,P) = 9XP, and hence it coincides with the classical angular momentum.

Chapter 8

Elements of Statistical Mechanics and Thermodynamics

8.1. Statistical States of a Hamiltonian System The Hamiltonian formulation of mechanics starts from a configuration space

X"' and makes use of the phase space T'X' with its canonical symplectic structure. A state of the mechanical system under consideration is a point in the phase space T`X'", and the motions of states are the integral curves of the symplectic gradient s-grad(H) of a Hamilton function H : T* X' --+ R. In this formulation of mechanics, the only essential data to be given are a symplectic manifold M2rn and a function H. The point of view of statistical mechanics is based on the idea that, for instance because of the size of the mechanical system or as a consequence of the imprecision of measurements, the state of the mechanical system cannot be determined precisely by fixing 2m real parameters. Instead, to each open set U C M2"', we can only ascribe the probability p(U) that the state belongs to the set U. This leads to the concept of considering mechanical states no longer as points in the phase space M2, but as probability measures on M2'".

Definition 1. Let a Hamiltonian system (M2"', w, H) consisting of a symplectic manifold and a Hamilton function be given. A statistical state is a probability measure p defined on the a-algebra of all Borel sets on M2'".

271

8. Elements of Statistical Mechanics and Thermodynamics

272

Example 1. For each point x E M2n', consider the measure 62 concentrated at this point,

a:(U)

1 0 ifx¢U, l 1 if a E U.

Therefore, classical mechanical states are particular statistical states.

Let -tt M2" -+ M2in be the flow of the symplectic gradient s-grad(H). The motion of the classical state x E MZ"' is the trajectory 0t(x). For the probability measure b+,(s) corresponding to the point it(x), we have :

t

xE (U) 1 = bx(-tt I(u)), 60&)(U) = 1 0, 1 , x E t 1(U ) and this formula leads to the following definition.

Definition 2. Let a Hamiltonian system (M2-, w, H) and a statistical state p be given. The motion of µ under the impact of the Hamiltonian system is the curve At of measures

At (U) := µ(4 1(U)) Definition 3. An equilibrium state of a Hamiltonian system (M2m,W, H) is a state A which does not change in the course of the motion of s-grad(H), At = P.

Definition 4. A statistical state p of a Hamiltonian system (M2in,W, H) has a stationary terminal distribution if the equation

'U-(U) := lim t 00 MO) defines a Borel measure on M2"'.

Theorem 1. A stationary terminal distribution µ«, is always an equilibrium state. Consider the volume form (-1)m(m-1U2

dM2m _

Wm

m!

of the symplectic manifold and the Borel measure induced from it (see the final remark at the end of §3.5). If the measure p is absolutely continuous with respect to the volume measure,

µ(U) := je(x).dM2m(x), then a is called the density function of the state µ.

8.1. Statistical States of a Hamiltonian System

273

be a state with density Theorem 2 (Liouville's Equation). Let p = function, and let at be the motion of this state in a Hamiltonian system. Then the measures pt = pt dM2" are also states with density function, and

d

dtet =

-{H,e}o$_t.

Proof. The flow ibt consists of symplectic transformations and preserves, in particular, the symplectic volume form dM2in, 4 (dM2i`) = dM2m. Transforming the integral, we see that pt = P o &_t is the density function of the state µt. This implies dt of = dt

P o D_t = do o dt

-t = -

o

-t = -{H, p} o O_t.

0 Corollary 1. A statistical state µ with density function P is an equilibrium state if and only if a is a first integral of the Hamiltonian H, (H, g} = 0. If x E M2m is a classical state in Hamiltonian mechanics, the value H(x) of the Hamilton function H is the energy E of the state,

E(x) = H(x) = JM2rn The definition of the energy of a statistical state generalizes this relation:

Definition 5. The energy of a statistical state p is the integral

E(µ) =

JMom

H(x) dp(x),

if this integral exists.

Theorem 3 (Conservation of Energy in Statistical Mechanics). If u t is the motion of a statistical state in a Hamiltonian system, the energy E(pt) is constant.

Proof. After transformation of/ the integral, we immediately obtain

E(pt) = l

H o I 1(x) dtt(x),

J M2m

and differentiating this with respect to the parameter t yields the formula

d E(µt) = fM2m {H, H}(x) du(x) = 0.

0

Now we want to study how the probability 1At(N2m) that the state µ = B dM2ri is in the compact subset N2ni C M21 at time t changes in time. The following lemma serves as a preparation.


274

Lemma 1. Let (M2i', w) be a symplectic manifold.

Then, for any two

functions f, g : lbl2in --+ R, the following formula holds:

df AdgAw,,,1 =

1 If, M

Proof. To prove this formula, we choose on 1112»' local syinplectic coordinates (ql, ..,P.),

w=

dp0 A dqq ,r= I

Setting Ai := dpi Adgi, we see that the exterior product satisfies the relations

AiAAj = AjAAi and AiAAi = 0. Since the forms Ai commute with one another, we can use the binomial formula to compute the exterior power wk, m

wn, _

(A1)'

>

1

m.

Aa' A

A A"

Because Ai A Ai = 0, only summands with rri < 1 occur in this sum, and hence it reduces to a single term:

writ = in!.AlA...AAna = m!.(dpiAdgi)A...A(dpmAdq»,) We compute WI-1 in a similar way, and obtain ",

(m-1)!

AlA...A,;...A

Thus, the following exterior products vanish:

(1) dpi Adpj Aw"'-1 = 0 = deli Adgj

Awl"-

= 0 if i # j

(2) dpi A dqj A w'

and we obtain fit

df A dg

Awrr-1 =

(.!m- dg dp, A dqi + 2f ag dq, Adpi) api Oqi OCli dpi rn

('m - 1 ) !

Of yg pi

arr.-1 mi

.

Tqi-

Of 09

di

w"'

A I A ... A A,,,

CdR

{f,g} wr

The relations m. d(f dg A w'"-1) = in df A dg A w"'- I = {f, gj w"' together with Stokes' theorem imply, after integration.


275

Theorem 4. Let (A12mw) be a compact symplectic manifold with boundary. Then

J12in

f.dg nwm-1

{f,g} w"r i)At 2°

.

In particular, the integral

J r2" {f,9} w"` = 0 vanishes for any compact symplectic manifold without boundary.

Corollary 2. If the Poisson bracket if, g} of two functions defined on a compact symplectic manifold 1112" without boundary does not change sign, then it vanishes identically, If, g} = 0.

We will apply these formulas to a statistical state with density function e.

Definition 6. The (2m. - 1)-form

At)

A'dHAw"'-1

(m-1)!

is called the probability current of the statistical state it = e dM2"' in the symplectic manifold (Al2m,w) with Hamilton function H.

Theorem 5. Let lit be the motion of the statistical state p = e dA12"' in the Hamiltonian system (1112"', W, H), and let N2", C AI2si be a compact submanifold. Then (µr(N2-)) (r=o

Wt

= JNFIJ

Proof. We compute the derivative with respect to time at t = 0 and use Lionville's equation as well as the preceding integral formulas: d N2", {He} dM2"r dt r=e dM2", o-

_J

J

N2m

N2s'

(m - 1)!

J

e dH n w"'-i =

UN2"4

f

j(Ic) .

aN2"

The probability current j(µ) is a hypersurface measure on all (2m - 1)dimensional submanifolds of the symplectic manifold M21. It expresses the infinitesimal change of the probability for the state it to be in the set N2i' C A12"' as an integral over the boundary of N2'".

Example 2. If the subinanifold N2i" C M2" is described by inequalities of the Hamilton function,

N2n = {x E M2" : C1 < H(x) < C21,


276

then the differential dH vanishes on the tangent bundle T(8N2m) of the boundary, and so does the form j(p). We hence obtain, for the motion At of every statistical state p = p dM2,, dtpt({x E M2in :

C1 < H(x) < C2}) = 0.

Thus the probability for a state p to be in N2in at time t is constant.

Example 3. In case m = 1, the formulas for the probability current of a two-dimensional Hamiltonian system (M2, w, H) simplify:

j(p) = e dH and dpt(N2) = JaN2 p dH. t Now we turn to the notion of information entropy for a statistical state. As a motivation, we first recall the notion of information of an event introduced by C. E. Shannon (1948). The heuristics is the following: If, in a series of experiments, an event occurs with probability close to one, then the "information" contents of this event is small. Conversely, if the probability of this event is close to zero, the occurrence of this event contains a large amount of "information". Modeling the probability computation by a triple (i, 21, p) consisting of a set i, a or-algebra 21 of subsets of S2, and a measure p defined on 21 such that A(fl) = 1, we arrive at the following definition for the Shannon information.

Definition 7. Let (S2, 21,,u) be a probability space. The amount of information contained in an event A E 21 is

1(A) := - log(p(A)). For a finite set S2, one forms the mean of these information amounts and gets in this way the so-called information entropy.

Definition 8. Let (12, p) be a finite probability space. The information entropy of this space is S(Q, p)

f

t

1(w)

. dp(w) = - E pi

log(pi) ,

i=1

where the set 11 consists of the elements {w1, ...,wn}, and pi := p({wi}) is the probability for the event wi. The information entropy is a measure for the uncertainty of the probability space. In fact, if St = {wl, ... , w } consists of n points and we denote by p' the measure corresponding to the equidistribution, p'(wi) := 1/n, then the following holds.


277

Theorem 6. The information entropy of a finite probability space (S),µ) does not exceed the information entropy of the equidistribution: S(Q, Ft)

0

for all x E M2s` .

Assumption 2. For every positive number B > 0, the following integral exists:

Z(B) :=

r

e-N(x)/B. dM2m(x)

JM

The function Z(B) is called the partition function of the Hamiltonian system.


279

Assumption 3. For every positive number B > 0, the following integral exists: H(x) . e-H(=)lB . dM2m(W ) . bl2m

Remark. The parameter B will later be identified with absolute temperature (multiplied by the Boltzmann constant k). Its appearance here indicates the exceptional role that temperature plays among all thermodynamic parameters. The function

E(B) :=

I

H(z)e-H(:)/B , dA12-(z)

Z(B) M2m is called the inner energy of the Hamiltonian system (M2m, w, H); its inner entropy is E(B) S(B) := log(Z(B)) + B Finally, the free energy F(B) is defined by

F(B) := -B log(Z(B)) = E(B) - B - S(B). First we note that the inner energy E(B) is a non-decreasing function on the interval (0, oo). Its derivative is

dE _ TB

1

Z2(B) B

[(I

M2m

e-

dM2m(x)) (ML H2( l2

_

- (ML H(x )e

z)e-dM2m(x))

dM2m(x))

and the Cauchy-Schwarz inequality shows that this derivative is positive

(H j4 const). Denote by En and Em., respectively, the bounds of the range of the inner energy E : (0, oo) - R. We calculate the derivative of the partition function in a similar way: dZ

1

dB = B2

JM2m

BZ Z(B) E(B).

Hence the inner energy as well as the inner entropy can be expressed in terms of the partition function. We summarize these formulas. Theorem 8 (Simple Equation of State for a Hamiltonian System).

(1) E(B) = B2gR (log(Z(B)));

(2) S(B) = W(B log(Z(B)));


280

(3) in the sense of 1 forms on the one-dimensional manifold (0, oo),

dS = B dE. Proof. The third equation is a straightforward consequence of the first two.

a The Gibbs state (canonical ensemble) is distinguished by the property that it realizes the maximum of the information entropy among all stag of fixed energy.

Theorem 9 (Gibbs State, Canonical Ensemble). Let an energy value Eo between the minimum and the maximum of the energy function E(B) for the Hamiltonian system (M2m, w, H) be given. Among all statistical states

p = P dM2' with density function of energy E(p) = Eo, there exists precisely one state, PGibbs = PGibbs dM2m, of maximal information entropy

S(pcibb.). The density function of this state is 1 -H(x)/Bo , PGibbs(x) = Z(Bo)e

where the parameter Bo is determined as a solution of the equation E(Bo) _ Eo. The value of the maximal entropy S(pCibbs) is

S(pGibbs) = S(Bo) = log(Z(Bo)) +

E(BO) o)

Proof. Choose Bo as a solution of the equation E(Bo) = E0, and )et e-H(x)/Bo dM2m 1 Z(Bo) be the corresponding Gibbs state. Its inner energy is pGibbs =

E(pcibb.) = Z(B0)

H(x)e-H(-)/Bo

JM2m

. dM2m = E(Bo) = Eo,

and hence UGibbe is a statistical state of energy Eo. For every oti er state p = P dM2m with energy

E(IL) = fM2m H(x)P(x) dM2'n = E0, we consider the function

f (t) = S((i - t)pGibbs + tp)

,

the information entropy of the statistical state (1 - t)pGibbs + tp. Differentiating f (t) with respect to the parameter t, we obtain

d2f(t) dt2

- _ JU2-

(e - PGib.)2 (1 - t)PGib, + tP

dM2'
f (1), we obtain

S(P)
0 such that Ilnl12

- (n,

)2 = k2

and

11,7112 = k2+(S+ a)2.

A short calculation for the curvature of the particle's curve of motion yields

K = Itrill =

1117 x ells

=

2TmvJ lot I

1-ln,n>2/1117112

__

sin(n,il) 2'rmvl

In11s

=

1 -oos (2, _ 27mvI InI

111711'-117,17)2

__

k

2 ymvllnl l2

2-tmvl ln113 2'rrmll,II3 If k = 0, the curvature vanishes and the curve is a straight line, i. e., there exist vectors v", w' E R3 such that n(s) = s v + v7. The equation of motion (*) then implies v' x u7 = 0, i. e., v' and 0 are linearly dependent, so thet we finally obtain n(s) = (s + A)t7, A E It , which is the equation of a line through the origin. Let us assume from now on that k > 0. We shall derive the expression for the torsion of the curve. By Frenet's formulas, the principal normal vector hh is

h=-=- k7xil. 1

The binormal vector then becomes

b = ,7xh = -i4x(rlx1), with derivative

=

-kn x (n x ii) =

n

2kmv7II,II3 )) Using Grassmann's identity u7 x (u' x ) = u (v, w-) - v' (u, m'), one shows that 6 x (u" x (i x ii)) = - (u, v3) i x *7, so that the former expression can further dsb

323

9.5. The Lorentz Force

be simplified to dg _

ds

(n, n) 2mvry11i1113

__

(-k r) X rl

2mvryIIi11I3h.

The torsion is thus equal to

tail)

r=

(b, h) _ -2mvryll71113

__

s+a 2mvry

k

+(a+a)23

In particular, the quotient of torsion and curvature satisfies the simple identity r/ic = -(s + a)/k. We prove next that the curve lies on a cone, that is, that there exists a constant vector ii whose angle p with 71 is also constant.

Define the functions

s+a

_ f(s)

2km yv

_

k + (a + a)

'

g(s)

1

2m yv

k + -(s+ a)2

They are primitives of K and -T and have the same quotient, d f (s) f (s) _ k dg(s) r ds = K' ds g(s) Frenet's formulas thus imply immediately that the vector iZ :=

is constant. To compute its angle V with rl, we derive an exact expression for q by applying Grassmann's identity to the binormal, b

1

1

= -krl x (71 x n) _ -k [rl -il(s+a)], q = (s+a)t-k6,

and remark that ii can be rewritten 11= rl/2mkryvllr7l l + h. For the opening angle of the cone we obtain

cos() =

(fi, u')

110 141

-

1

1 + (2mkyv)

9. Elements of Electrodynamics

324

By definition, a curve is a geodesic on a given surface if and only if its principal normal vector is orthogonal to the tangent plane of the surface at any of its points. Denoting by 7P the angle between h and i , we get (h, u)

-

1

Iluli

_

2mkvy 1 + (2mkvy)2

and we see that this is equal to 1 - cos2(cp) = sin(e). Hence, W and V are related through V-V = it/2, and we conclude that h is indeed perpendicular to the tangent plane of the curve. The particle moves along a geodesic of the cone, as claimed.

Exercises

325

Exercises 1 (Kirchhoff Formula). Let u(x, t) be a function defined on JR3 x Ht and S C JR3 a compact domain with smooth boundary. Prove, for each point (Xo, to) E SZ x JR, the Kirchhoff formula:

Jsl

u(xo, to) =

Ou(a,to+

cr 8t

u(1l, to - r/c) or

or(a,to)

- u(a'to

eN

c

r f au (or, to -

dy + J

r

IL

r 8N

8r-1(a,to) C

OM

where r := Ilxo - yII is the distance to the spatial point xo E R3, N is the normal vector to the boundary 00, and = Ox - 1/c28it is the wave operator. Deduce from this the solution formulas for the Cauchy problem of the wave equation by choosing for H a 3-dimensional ball. 2. Under the assumptions of Helmholtz' theorem, the electric and the magnetic field E and B, as well as the current density field J can be written as the sum of a divergence-free and a curl-free vector field,

E = Ediv + Ecuri, B = Bdiv + Bcuri and J = Jdiv + Jcurl a) Prove that the Maxwell equations can then be written as follows: curl(Ed;v) = curl(Bd;Y) =

1

c

-

18Bdiv

8t

8Ed;v

at +

47r

Jai,,,

chv(Bcuri) = 0 div(Ecuri) = 47r LO.

The continuity equation then becomes div(Jcuri) +

OLO

= 0.

Hint: In the proof of Theorem 2, we proved that, under the assumptions made here, any vector field which is at the same time divergence- and curl-free, has to vanish identically. b) On the other hand, for the electric field E we already know the decomposition

E_

c

+grad(-O).


326

Prove that this is precisely the Helmholtz decomposition of E if the magnetic potential satisfies the condition called Coulomb gauge:

div(A) = 0. 3. Deduce from Theorem 4 that if Eo(x) and Bo(x) are C3-functions on R3 such that divE0 = divB0 = 0, then the Cauchy initial value problem for an electromagnetic wave

curl(B) = c 5 ,

curl(E)

c at , div(B) = 0, div(E) = 0, E(x,O) = Eo(x), B(x, 0) = Bo(x) has the unique solution E(x, ct)

4act

f f

curl Bo(y)dy +

B(x, ct)

4act

f Eo(y)dy

4ac 8t

,

S2(x,d)

S2(x,d)

curl Eo(y)dy + 41

S2(x,d)

f Bo(y)dy

8t

1S2(x,ct)

4. Using Theorems 1 and 2, determine the electric and the magnetic field in 1R3 which is generated in the following situations:

a) a homogeneously charged ball of radius R with constant charge density P;

b) a charged spherical shell of radius R with constant surface charge density a; c) an infinitely extended straight wire of radius R through which a current with constant current density j flows. 5. Describe the solution of the classical Lorentz equation for a homogeneous, static electric field (E = const) and vanishing magnetic field (B = 0). What happens in the non-relativistic limit of small velocities v 0,

atu(x,0) = u1(x)

Part 1. General Shape of the Solution. Prove that there exist two C2functions f and g satisfying

u(x,t) = f(x+ct)+g(x-ct). The solution is hence the superposition of a wave traveling to the left and one traveling to the right. Hint: Introduce the new coordinates xt = x ± ct and show that the wave equation is equivalent to the differential equation a2u

ax+(9x- -

0.

Part 2. Solution of the Cauchy Problem. With the above Ansatz for the solution, the following relations have to hold: uo(x) = f(x) + g(x),

u1 (x) = c(f'(x) - g'(x))

By integration of the second equation over the interval [0, x], prove that the general solution has to be given by

Ed

u(x,t) = 2[uo(x + ct) + ua(x - ct)) + 2cu1(s)ds.

In particular, this argument shows the uniqueness of the solution. How can the result be understood qualitatively by means of the light-cone ?


328

9 (One-sided Infinitely Extended Oscillating String). An oscillating string extending infinitely in the positive x-direction is modeled by the one-dimensional wave equation,

attu = c2axxu,

t > 0,

x > 0.

In addition to the initial conditions,

u(x, O) = uo(x), 8tu(x, 0) = u, (x), one has to impose boundary condition which describe the behavior of th "wall" at x = 0 for all times t: u(0, t) = cp(t).

Here uo, 4o E C2(R+) and ul E C'(R+) are required. Prove (using a similar Ansatz as in the preceding exercise) that the solution is x+d

2cjx-d

[uo(x + ct) + uo(x - ct)] + -L

u(x,t) =

2 [uo(x +

ui(s)ds,

d+x

Ct) - up(ct - x)] + p(t - x/C) + ZJ

d-x

ul (s) ds,

where the upper line is to be taken for points (x, t) in the region I, i. e. below

the line x = ct, and the lower line for points (x, t) in the region II, above the line x = ct. What can be said concerning the behavior of the lightcone, in particular in II? Describe, moreover, the regularity properties of the solution on the line x = ct, and explain carefully under which additional conditions it is of class C2 there. x=ct /1

I x

10 (Oscillating String Fixed on Either Side). For the wave equation on bounded domains a separation Ansatz going back to Bernoulli proved to be successful, in that it reduces the problem to one for Fourier series. We are looking for a solution of the one-dimensional wave equation on the interval [0, 1],

attu = c28xxu,

t>0, 0<x 0 necessarily lead to the trivial solution u = 0; in the case A < 0 there has to exist a natural number k such that A = -k27r2 and uk(x, t) = Xk(x)Tk(t) = sin kirx (ak sin klrct + bk cos kirct) . The general solution is then obtained as the series 00

u(x, t) _

uk(x, t) _ E sin kirx (ak sin k7rct + bk cos klrct) . k=1

k=1

c) Find an integral formula for the coefficients ak, bk depending on the initial conditions uo and u1. Solution: ak

lore

1

1

2

ul (x) sin k7rx dx,

bk = 2 fo uo(x) sin k7rx dx .

0

11 (Validity of the Fourier Method). By a theorem of Dirichlet, the Fourier series of a function f E C1([O,1]) satisfying f (0) = f (1) = 0 is uniformly convergent and tends pointwise to f. Prove the following lemma:

For a function f E Ck([O,1]) such that f(0), ..., f(k-1) vanish at 0 and 1, there exists a constant A for which the Fourier coefficients of f satisfy the following inequality:

j f(x)sinn7rxdx

Global Analysis: Differential Forms in Analysis, Geometry, and Physics (Graduate Studies in Mathematics, V. 52)

Global analysis: Differential forms in analysis, geometry, and physics

Analysis (Graduate Studies in Mathematics)

Differential geometry, analysis, and physics

Global Differential Geometry and Global Analysis 1984

Global Differential Geometry and Global Analysis

Fourier Analysis (Graduate Studies in Mathematics)

Fourier Analysis (Graduate Studies in Mathematics)

A Course in Differential Geometry (Graduate Studies in Mathematics 27)

Manifolds and Differential Geometry (Graduate Studies in Mathematics)

Manifolds and Differential Geometry (Graduate Studies in Mathematics)

Manifolds and Differential Geometry (Graduate Studies in Mathematics 107)

Manifolds and Differential Geometry (Graduate Studies in Mathematics 107)

Global analysis in mathematical physics

Discrete Differential Geometry: Integrable Structure (Graduate Studies in Mathematics 98)

Discrete Differential Geometry: Integrable Structure (Graduate Studies in Mathematics 98)

Discrete Differential Geometry: Integrable Structure (Graduate Studies in Mathematics 98)

Differential Forms in Algebraic Topology (Graduate Texts in Mathematics)

Differential Geometry in Physics

Differential Geometry in Physics

Trends in differential geometry, complex analysis and mathematical physics

Differential Geometry in Physics

Differential Geometry in Physics

Differential Geometry in Physics

Differential Geometry in Physics

Differential Geometry and Analysis on CR Manifolds (Progress in Mathematics)

Global Calculus (Graduate Studies in Mathematics)

Differential Forms in Algebraic Topology (Graduate Texts in Mathematics)

Elements of Functional Analysis (Graduate Texts in Mathematics) (v. 192)

Analysis Now (Graduate Texts in Mathematics)

Matrix Analysis (Graduate Texts in Mathematics)

Global Analysis: Differential Forms in Analysis, Geometry, and Physics (Graduate Studies in Mathematics, V. 52)