Problems in Analysis. A Symposium in Honor of Salomon Bochner

PROBLEMS IN ANALYSIS A Symposium in Honor if Salomon Bochner ROBERT C. GUNNING GENERAL EDITOR PRINCETON, NEW JERSEY...

Author: Robert C. Gunning

8 downloads 634 Views 19MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

PROBLEMS IN ANALYSIS A Symposium in Honor

if

Salomon Bochner

ROBERT C. GUNNING GENERAL EDITOR

PRINCETON, NEW JERSEY PRINCETON UNIVERSITY PRESS 1970

Copyright

©

1970, by Princeton University Press All Rights Reserved L. C. CARD 76-106392

1. S. B. N. 0-691--08076-3

A. M.S. 1968: 0004

Printed in the United States of America by Princeton University Press, Princeton, New Jersey

Foreword A symposium on problems in analysis in honor of Salomon Bochner was held in Fine Hall, Princeton University, April 1-3, 1969, to celebrate his seventieth birthday, which took place on August 20, 1969. The symposium was sponsored by Princeton University and the United States Air Force Office of Scientific Research; the organizing committee consisted of W. Feller, R. C. Gunning, G. A. Hunt, D. Montgomery, R. G. Pohrer, and W. R. Trott. This volume contains some of the papers delivered by the invited speakers at the symposium, together with a number of papers contributed by former students of Professor Bochner and dedicated to him on this occasion. The papers were received by June I, 1969.

Contents PART I: LECTURES AT THE SYMPOSIUM

On the Group of Automorphisms of a Symplectic Manifold, by EUGENIO CALABI On the Minimal Immersions of the Two-sphere Constant Curvature, by SHllNG-SHEN CHERN

In

a Space of 27

Intersections of Cantor Sets and Transversality of Semigroups, by HARRY FURSTENBERG

41

Kiihlersche Mannigfaltigkeiten mit hyper-q-konvexem Rand, by HANS GRAUERT and OSWALD RIEMENSCHNEIDER

61

Iteration of Analytic Functions of Several Variables, by SAMUEL KARLIN and JAMES MCGREGOR

81

A Class of Positive-Definite Functions, by J. F. C. KINGMAN

93

Local Noncommutative Analysis, by IRVING SEGAL

III

PART II: PAPERS ON PROBLEMS IN ANALYSIS

Linearization of the Product of Orthogonal Polynomials, by RICHARD ASKEY

131

Eisenstein Series on Tube Domains, by WALTER L. BAILY, Jr.

139

Laplace-Fourier Transformation, the Foundation for Quantum Information Theory and Linear Physics, by JOHN L. BARNES

157

An Integral Equation Related to the Schroedinger Equation with an Application to Integration in Function Space, by R. H. CAMERON and D. A. STORVICK

175

A Lower Bound for the Smallest Eigenvalue of the Laplacian, by JEFF CHEEGER

195

ix

x

CONTENTS

The Integral Equation Method in Scattering Theory, by C. L. DOLPH

201

Group Algebra Bundles, by BERNARD R. GELBAUM

229

Quadratic Periods of HypereIIiptic Abelian Integrals, by R. C. GUNNING

239

The Existence of Complementary Series, by A. W. KNAPP and E. M. STEIN

249

Some Recent Developments in the Theory of Singular Perturbations, by P. A. LAGERSTROM

261

Sequential Convergence in Lattice Groups, by SOLOMON LEADER

273

A Group-theoretic Lattice-point Problem, by BURTON RANDOL

291

The Riemann Surface of Klein with 168 Automorphisms, by HARRY E. RAUCH and J. LEWITTES

297

Envelopes of Holomorphy of Domains in Complex Lie Groups, by O. S. ROTHAUS

309

Automorphisms of Commutative Banach Algebras, by STEPHEN SCHEINBERG

319

Historical Notes on Analyticity as a Concept in Functional Analysis, by ANGUS E. TAYLOR

325

d-Almost Automorphic Functions, by WILLIAM A. VEECH

345

Problems in Ana!ysis A SYMPOSIUM IN HONOR OF SALOMON BOCHNER

On the Group

if Automorphisms if a

Symplectic Manifold EUGENIO CALABP

1. Introduction

Let X be a connected, differential manifold of 2n dimensions. A symplectic structure on X is the geometrical structure induced by a differentiable exterior 2-form w defined on X, satisfying the following conditions: (i) The form w is closed: dw = 0; (ii) It is everywhere of maximal rank; this means that the 2n-form w n (nth exterior power of w) is everywhere different from zero, or equivalently, the skew-symmetric (2n) x (2n) matrix of coefficients of w, in terms of a basis for the cotangent space, is everywhere nonsingular. A classical theorem, ordinarily attributed to Darboux, states that a 2n-dimensional symplectic manifold (i.e., a manifold with a symplectic structure) can be covered by a local, differentiable coordinate system {U; (x)} where (x) = (Xl, ... , X2n): U-+R2n, in terms of which the local representation of the structural form w becomes

wlu = (1.1)

dx l

/\

dx 2 + dx 3

/\

dx 4

+ ... + dx 2n - l

/\

dx2n

n

=

L dX

2i - l

/\

dX 2i ;

j=l

such a system of coordinates is called a canonical system. The purpose of this study is to describe the group G of automorphisms of a symplectic manifold, i.e., the group of all differentiable automorphisms of X which leave the structural 2-form w invariant, and the invariant subgroups of G. The group G can also be characterized as mapping canonical coordinate systems into canonical systems. 1 The research reported here was supported in part by the National Science Foundation.

2

EUGENIO CALABI

Two normal subgroups of G are distinguished immediately as follows: DEFINITION 1.1. Let (X, w) be a 2n-dimensional symplectic manifold and let G be the group of all symplectic transformations of X. If X is not compact, we denote by Go the subgroup of G consisting of all symplectic transformations of X that have compact support; that is to say, a symplectic transformation g E G belongs to Go if and only if g equals the identity outside a compact region of X. DEFINITION 1.2. Let (X, w) be a 2n-dimensional symplectic manifold and let G be the group of all symplectic transformations of X. We denote by G o.o the subgroup of G called the minigroup generated by the so-called locally supported transformations, defined as follows: a transformation g EGis called locally supported if there exists a canonical coordinate system {U; (x)} defined in a contractible domain U with compact closure, such that the support of g lies in U.

The minigroup G o.o and its corresponding Lie algebras are introduced here merely for expository convenience. In Section 3 it will be shown that the commutator subgroup of the arc-component of the identity in Go coincides either with Go.o or with a normal subgroup of codimension 1 in Go.o (see Theorem 3.7, Section 3). An elementary example of a locally supported transformation is the following: let {U; (x)} be a canonical coordinate system in X; let its range V

= (x)(U)

C

R2n

contain the ball

{(t)IJ1 (t l)2 < A} for some A > 0;

choose a real-valued differentiable function {f(r) of a real variable r ?; 0 with support contained in a closed segment [0, A'] with A' < A. Then it is easily verifiable that the transformation f in R2n (with the 2-form w = n

L:

1=1

dt 21 -1

"

dt 2i),

(t)

~

(1')

= f(t) with

t'21- 1

=

1 21 - 1

cos (}{r) -

1'2;

=

t 2i - 1

sin{}{r)

+

sin {fer), t 21 cos {f(r), t 2;

~ (11)2; 1 ~ j ~ n), (r 1=1 =

is a symplectic transformation which equals the identity for r ?; A'. Therefore its restriction to V defines via the coordinate map (x) a symplectic transformation in U that can be trivially extended by the identity map in . X - U to a locally supported symplectic transformation in X. We shall state here the main results of this study in a preliminary form; more precise and stronger versions of these are repeated as theorems in the later sections.

AUTOMORPHISMS OF A SYMPLECTIC MANIFOLD

3

STATEMENT 1. The groups G, Go, and G o.o are infinite dimensional Lie groups in terms of the Whitney Coo topology (in the case of G) and the compactly supported Coo topology (in the case of Go and G o.o). STATEMENT 2. The minigroup G o.o is a closed, normal subgroup of Go; the quotient group Go/G o.o is locally isomorphic to the de Rham cohomology group Hli(X, R), the first cohomology group of X with real coefficients and compact support. STATEMENT 3. If X is not compact, the completion G 1 • O of G o•o in the compact-open topology of pres heaves (i.e., the group obtained by adjoining to Go.o the infinite products of sequences of g. E Go.o, where for each compact K c X only finitely many of the g. differ from the identity in K) is a closed, normal subgroup of G; the quotient group G/G 1 • O is locally isomorphic to the de Rham group Hl(X, R), i.e., to the first cohomology group with closed support. STATEMENT 4. The group G 1 • O has no connected, closed, normal subgroups other than the identity; in particular its commutator subgroup is an open subgroup. The same is true of G o•o, of course, if X is compact (in which case G o•o = G1 • O)' On the other hand, if X is not compact, the commlltator subgroup G~.o of G o.o is normal in Go, has codimension equal to 1, relative to Go.o, and has no connected, closed, normal subgroups other than the identity. The next two sections deal with the Lie group structure of the groups G and Go, emphasizing the relationship with the corresponding Lie algebras; the main tools used here are due to J. Moser [1]. In Section 3 we prove the four main statements just given at the Lie algebra level, and in Section 4 we expand the results at the group level, trying as far as we have succeeded to obtain results on the global structure of these groups and their closed, normal subgroups. 2. Infinite dimensional Lie groups of differentiable transformations

We shall summarize in this section some of the known facts about infinite dimensional Lie groups or pseudogroups of differentiable transformations, especially with regard to their relationships with the corresponding Lie algebras of tangent vector fields. We denote by G, H, and so forth, groups of differentiable transformations of a manifold; the corresponding pseudogroups of local transformations are denoted by G, H, and so forth; the associated Lie algebras of globally defined vector fields will be denoted by German capitals &, ,\;), and so forth; the corresponding pres heaves of local vector fields will be denoted by small German letters, such as g, 1), etc.

4

EUGENIO CALABI

An infinite dimensional Lie group G of global, differentiable transformations in a manifold X (or alternately a pseudogroup G of local transformations) is an infinite dimensional, differential manifold (naturally we do not exclude from this notion finite dimensional manifolds), in the following sense: for any finite dimensional, differential manifold M a map cp: M -+ G is defined to be differentiable, if and only if the corresponding evaluation map ij;: M x X -+ X, where ij;(t, x) = (cp(t»(x), is differentiable (in the case of a pseudogroup, one requires also that the domain of definition of ij; be open in M x X). For our present purposes, it seems irrelevant to fix any topology on G; we regard it rather as a compactoid, that is to say, we allow ourselves to consider the equivalence class of topologies that are compatible with the category of differentiable maps cp: M -+ G just defined. The Lie algebra @ associated to G is then the set of tangent vector fields (respectively presheaves of local vector fields) each defined as the equivalence class of differentiable paths in G originating at the identity with the obvious equivalence relation. The question that ordinarily arises here is how to recapture the arc component of the identity in G from the sheaf of germs g of Lie algebras determined by OJ. Any local cross section {} in g (that is to say, each local vector field belonging to g) defines a one-parameter subpseudogroup of differentiable transformations by integrating the vector field to the corresponding autonomous flow. The composition of such one-parameter local flows defines a pseudogroup r G in the pseudogroup G associated to G. If G is finite dimensional, it is known classically that r G defines an open subpseudogroup of G. In the infinite dimensional case the same conclusion holds, if G acts real analytically, at least by considering first local cross sections of g and collating the resulting local transformations. In the COO case for infinite dimensional Lie groups, several authors have shown examples to the effect that one-parameter subpseudogroups are not dense in a neighborhood of the identity, and the details of some of these examples give a strong indication that even the composition of the elements of such one-parameter systems may not fill out any neighborhood of the identity in G. Some authors have suggested a method based on affine connections; this method permits one to attach to each local vector field in @ a differentiable path in G originating at the identity but not constituting, in general, a oneparameter pseudogroup, so that the union of such paths fill out a neighborhood of the identity in G. This method is satisfactory in the case of differentiably acting pseudogroups definable in terms of a first-order, integrable differential system in the coordinate transformations but would require higher order connections for pseudogroups of a more complicated nature.

AU'J'OMORPHISMS OF A SYMPLECTIC MANIFOLD

5

The correspondence between presheaves of Lie algebras of local vector fields and pseudo groups of local differentiable maps can be established in a natural way only as a local one-to-one correspondence between differentiable paths. Thus the fundamental theorem on existence, uniqueness, and continuity with respect to initial data for ordinary differential equations establishes a one-to-one, locally biregular correspondence between differentiable, one-parameter families of local vector fields in 9 (i.e., paths in g) and local flows in X belonging to G (i.e., paths in G): any topological structure in the stalks of the sheaf of germs of one-parameter families of vector fields in X belonging to 9 (provided that its definition includes minimal regularity conditions) yields a well defined topology on the stalks of the corresponding sheaf of germs of G-flows. The corresponding topology on the sets of germs of elements of G is then obtained by passage to the quotient, assigning to each path in G (originating at the germ of the identity) the germ of the terminal element E G of the path. Thus one obtains from the sheaf of germs of Lie algebras 9 a sheaf G of germs of diffeomorphisms. The group G of global cross sections in G can be obtained without any difficulty if the manifold X is compact. In the case where X is not compact, the corresponding sheaves 9 of germs of Lie algebras of vector fields and G of germs of pseudogroups of transformations lead to the corresponding global Lie algebras and groups in many ways, of which two are the most important: the ones with unrestricted support and the ones with compact support. They are obtained from the topologies of the corresponding stalks by "globalizing" them, in the former case either by the compact-open extension of a uniform topology on the stalks (compact-open topology) or by a Whitney topology and in the latter case by a uniform topology over uniformly compactly supported cross sections. It is worthwhile noting that the global groups obtained from the path wise-connected sheaf of groups is not necessarily connected, as we know well in the case of the celebrated group of all differentiable, orientation-preserving automorphisms of spheres. 3. The Lie algebra of symplectic vector fields We apply the concepts reviewed in the previous section to the case of the Lie algebra associated with the group G or Go of automorphisms of a symplectic manifold (X, w). If cP is any exterior p-form and g is a vector field in a manifold, we denote by g L cp the interior product of g with cp; this is the (p - I)-form (identically zero if p = 0) whose value at a (p - I)-vector "II 1\ ... 1\ 7JP-I is given by (3.1)

(g L CP)(7JI

1\ •.• 1\ 7JP-I)

= cp(g

1\ "II 1\ ... 1\ 7JP-I)'

6

EUGENIO CALABI

The Lie derivative of cp with respect to

g is then obtained from the formula

[g, cp] = d(g L cp) +

g L dcp.

It is well known that any differentiable path in the sheaf of germs of diffeomorphisms, originating at the germ of the identity at any x E X, leaves the form cp invariant along the orbit of x, if and only if the path in the sheaf of germs of vector fields given by the differential of the given path yields a one-parameter family of germs of vector fields g satisfying [g, cp] = O. Thus the Lie algebra of the symplectic group G or Go is described locally by the vector fields g satisfying, since w is closed,

(3.2)

d(g L w) = O.

We now introduce the Lie algebras corresponding to the groups G, Go, and G o•o previously given in Definitions 1.1 and 1.2. PROPOSITION 3.1. The Lie algebras @, @o, and @o.o corresponding, respectively, to the groups G, Go, and G o.o are given by the global vector fields g on X satisfying (3.2) and, in addition,

(i) satisfying no further conditions in the case of @; (ii) having compact support in the case of @o; (iii) generated, in the case [email protected], by vector fields g., where each gv has compact support contained in a contractible domain Uv admitting a canonical coordinate system (xl, ... , x 2n ), that is to say, such that there exists a function U v with compact support in U v satisfying

gv L w = du v, or equivalently

(3.3)

gv = ~ (;:1

(ou ox

v 2j

_0_ _ ~~). OX 2i -1 OX 2i -1 OX 2i

The proof of this proposition is clear. Since the form w is everywhere of maximal rank, the bundle map from the tangent to the cotangent vector bundle defined by assigning to g the I-form g L w is bijective. Thus, for instance, the Lie algebra @ is isomorphic, as a vector space over the real numbers, to the set of all closed Pfaffian forms on X. This isomorphism induces a Lie algebra structure on the set of all Pfaffian forms called the "Poisson bracket"; more precisely, for any I-form ex we denote byex# the uniquely defined tangent vector g such that g L w = ex (at the bundle, sheaf, local, or g\oballevel); then the Poisson bracket of two Pfaffian forms ex and {J is defined to be {ex, {J}

=

[ex#, (3#] L w.


7

The Lie algebra of all COO I-forms (globally defined on X) with the Poisson bracket is isomorphic under the map {a --+ a#} to the Lie algebra of all tangent vector fields; the vector subspace consisting of all closed I-forms is a Lie subalgebra. It follows from (3.2) that this subalgebra is isomorphic to the algebra @ of all vector fields g such that [g, cu] = O. The subalgebras @o and @o,o of @ are similarly characterized. We shall now define some vector subspaces of these Lie algebras that will, in fact, turn out to be ideals. DEFINITION 3.2. We denote by @' the vector subspace of @ consisting of all germs of vector fields g E @ such that g L cu is exact; that is to say, @' consists of all the vector fields (du)# where u is an arbitrary differentiable function on X. Similarly we denote by @~ the vector subspace of @o consisting of all the vector fields (du)# where u is an arbitrary function on X with c:>mpact support. Finally, we denote by @~ the vector subspace of @~ consisting of the vector fields g = (du)#, where u is a function with compact support on X satisfying, in addition,

(3.4) REMARKS. In the case of the unrestricted Lie algebra @, one cannot define the algebra that corresponds to @~ in the case of @o; clearly, if X is a compact manifold, then @o = @ and, in defining @~ = @' the function u is determined by the vector field g = (du)# only up to an additive constant; therefore one can always choose u so as to satisfy (3.4); thus we have @~

=

@~

=

@'.

On the other hand, if X is not compact, the function u generating an element g E @~ is uniquely determined by g, since it is required by definition to have compact support; therefore the condition (3.4) becomes meaningfully restrictive, and indeed

Similarly, using the theorems of de Rham, for all symplectic manifolds

where b1(X) denotes the first Betti number of X with respect to real coefficients (and homology with compact support) while in the case of noncompact X

where b~ denotes the first Betti number for homology with closed support.

8

EUGENIO CALABI

We now show that the vector subspaces (W, algebra ideals of @ and @o, respectively.

@~,

and

@~

are indeed Lie

PROPOSITION 3.3 (R. S. Palais). The commutator ideal of [@, @] = @2 is contained in @', while the commutators @5 = [@o, @o] and @g = [@o, @5] are contained, respectively, in @~ and @~. PROOF. Let g = a#, 7] = fYI be arbitrary elements of @, that is to say, let ex and fJ be closed I-forms. Then {ex, f3} = [g, 7]] L w is not only closed but indeed exact, for, with no assumption on g, 7], and w, we have the following identities:

(3.5)

[g,7]] L w = [g,7] L w] - 7] L [g, w] = d(g L (7] L w» + g L d(7] L w) - 7] L [g, w] = d(g L (7] L w» + g L [7], w] - 7] L [g, w] - g L (7] L dw) = dw + g L [7], w] - 7] L [g, w] - g L (7] L dw),

where

w=

g L (7]

L w);

thus, under the assumptions dw = 0, [g, w] = [7], w] = 0, we have

[g, 7]] L w = dw. This shows that [@, @] c @' and, obviously, also [@o, @o] c @~, if g and 7] (and hence w) have compact support. In order to show that @3 c @~, we assume that g = a#, 7] = f3# belong to @o; if w is defined by (3.6), we have an elementary identity. (3.6)

ww n =

a L (7] L w»wn = -n(g L w) = -na

A (7] L

w) A wn -

1

A f3 Awn-I.

It follows that, if either of the two closed forms with compact support a or f3 is exact (say a = du, where u has compact support),

Ix ww

n

= -n

Ix d(uf3

Awn-I)

= 0.

This completes the proof of this proposition. The next results show that the continuation of the commutator sequences yields no new ideals. For this purpose we recall the definition of the Lie algebra @o.o of the minigroup (Definition 1.2 and Proposition 3.1). Clearly, since @o.o c &~, we have [&0.0, &0.0] c &~.o c &~, where &~.o· is defined to be &0.0 (J &~; in other words (Sj~.o is generated by the vector fields (du)#, where u is a function which has compact support in a contractible domain U admitting a canonical coordinate system and satisfies (3.4).


9

3.4. The Lie algebras @o.o and @~.o coincide, respectively, with @~ and @~. The algebra <M' in the case of a noncompact manifold is generated by infinite sums with locally finite supports of elements of @~ = @~.o. LEMMA

PROOF. Let g E @~. This means that g = (du)#, where u is a differentiable function on X with compact support. Let {UV}VEJ be a locally finite, open cover of X by contractible open sets U., each of which admits a canonical coordinate system (U., .(x)) and let (9'v) be a differentiable partition of unity, where each 9'v has compact support contained in U v ; set U v = 9'v u; then (du v )# E @o.o and all but a finite number of them vanish, since u has compact support. This shows that (du)# E @~ can be expressed as a sum of elements in @o.o. Now suppose that g = (du)# E @~: since in this case X is assumed not to be compact and is connected, condition (3.4) is equivalent to the existence of a (2n - I)-form 0/ on X with compact support, such that do/ = uw n • Let o/v = 9'vo/ be a decomposition of 0/ by the partition of unity (9'v), and define U v by the condition

uvw n = do/v = d(9'vo/).

Then clearly (du v )# E @~.o, showing that @" is generated additively by @~.o. The final assertion about (W now has to be proved only in the case where X is not compact. In this case an element g = (du)# is defined by a function u on X whose support is unrestricted. Since X is connected and not compact, the 2n-dimensional real cohomology group of X with unrestricted support is trivial; therefore the 2n-form uw n can be represented by do/, where 0/ is a (2n - I)-form. Decomposing 0/ by means of the partition of unity 9'v as in the case of @~, we show that g can be expressed as an infinite sum of elements gv E (~j~.o with compact, locally finite supports; this completes the proof of the assertion. LEMMA

3.5.

The Lie algebra

(.I)~.o

coincides with its own commutator.

PROOF. Let g = (du)# be a vector field in @o.o defined from a function u with compact support K c U where U is a contractible domain in which

canonical coordinates (x) = (xl, ... , x2n) can be defined. Expressing u in terms of these coordinates, the vector field g can be described by (3.3). In terms of these same coordinates, we have

Since g E @o.o, the function u satisfies (3.4); in other words we have (3.7)

r JX)(U)

u(x) dx' ... dx 2n =

o.

EUGENIO CALABI

10

We shall show that there exists a set of 4k functions (Vi> Wi) (i = I, 2, ... , 2n) each with compact support contained in U and satisfying, like the function u, equation (3.7) and satisfying 2n

du

=

L: {dVi' dwi} t=1

or equivalently 2n

g = (du)#

=

L:

[(dv i)#, (dwt)#].

i= 1

This means that u is determined from the other functions (using 3.3 and 3.5) by the formula (3.8)

=

n

L: L: 2n

u(x)

.=11=1

(ov. owi

OX~i OX2i -1

-

OV OWi) OX2/-1 OX2i .

Since the function u satisfies (3.4), there are, by virtue of Gauss' theorem, 2n functions zl, ... , z2n in X each with compact support in U such that

(3.9)

u(X) =

~ OZi(~). i=1

ox

As a matter of fact, the functions Zi can be chosen so that their support is contained in an arbitrarily small, connected, open neighborhood of the compact support K of the function u; in particular we let each of the functions Zi have support contained in a compact K1 with K c K1 C U. Now consider the functions Vi = -( -1)iZ(i-(-1)') and Wi = Xi; disregarding the bars, they satisfy (3.8). In fact we see that, for each i,

Consequently, summing both members of this equation, we have a solution of (3.8); however, the functions Vi do not, in general, satisfy (3.4), nor do Wi = Xi even have compact support. This can be remedied in a second stage, as we shall now proceed to do. Choose a non-negative valued, differentiable function hex) which is identically equal to I in K1 and whose support is a compact domain K2 c U. We can then replace the functions Wi = Xi by h(x)· Xi; the new choice of Wi will not affect equation (3.8) and the resulting functions h· Xi have compact support, so that (d(h· x i ))# E @o.o. Next, pick two more nonnegative, nonzero, differentiable functions h' and h" with compact supports K3 and K4 mutually disjunct and contained in U - K 2. Since


r

fuh" w

(i

= 1,2, ... , 2n) such that

n .u h' w and

Jr (-VI U

-

n

are both positive, there exist real constants c; and

'h') W n =

Ci

11

Jru (h . w

-i

-

"h") W n = 0

(l

Ci

~

c;

i ~ 2n).

Thus we can set (l

~

i ~ 2n).

The replacement of the 4n functions Vi and Wi by Vi and Wi, respectively, does not affect the computation of the Poisson brackets {dvi' dw i}, so that (3.8) is still satisfied, while each of the functions Vi and Wi satisfies, as does the function u, equation (3.4). Hence (dv i )# and (dwh)#

c

@~.o

and 2n

(du)# =

2: [(dvI)#, (dw )#]. i

i=l

This completes the proof of Lemma 3.5. Before stating the first main result on the Lie algebras need one more notion, that of the symplecting pairing.

@

and

@o,

we

DEFINITION 3.6. In a symplectic manifold (X, w) we define the symplectic pairing to be the alternating, bilinear map of HMX, R) (first cohomology group with compact support and real coefficients) into the real numbers, denoted by (u, v) as follows: let a, f3 be closed I-forms representing by means of de Rham's theorem the cohomology classes u, v, respectively. Then the symplectic pairing is given by (3.10)

(u,v) =

fxa

1\

f3

1\ w n -

1•

It is well known that the symplectic pairing is nonsingular in the case of the symplectic structure subordinate to a compact Kahler manifold, while it is identically zero in the case of the natural symplectic structure of the cotangent bundle of any differential manifold or, more generally, in any (necessarily noncompact) symplectic manifold with n ~ 2, where c.o n - 1 is cohomologous to zero. THEOREM 3.7. Let (X, w) be a symplectic manifold, let @ be the Lie algebra of all differentiable, symplectic, global t'ector fields, and let @o be the Lie subalgebra of @ consisting of vector fields with compact support. Then the commutator algebra @2 = [@, @] coincides with @' and @/@2 is isomorphic to Hl(X, R); furthermore, if X is not compact, the commutator

EUGENIO CALAM

12

algebra OJ~ = [(5j o, Ojo] coincides with either @~ or @~ depending on whether the symplectic pairing of H6(X, R) is, respectively, nontrivial or identically zero; the third derived algebra @g = [@o, @2] coincides in all cases with @~. The Lie algebras OJ' and OJ~ are each equal to their own commutator algebras. PROOF.

From Proposition 3.3 we have the inclusion relations

[OJ, (SJ] c

(Ij',

In Lemma 3.5 it was established that [(SJ~, (SJ~] ::J @~. Since dim R «%/(~~) = I, this means that @~ coincides with either (SJ~ or OJ~. It follows from equation (3.6) in the proof of Proposition 3.3 that @~ = (SJ~ if and only if the symplectic pairing is nontrivial, and otherwise @~ = OJ~ (this will be the case, in particular, if H6(X, R) = 0 or if w n - 1 is cohomologous to zero). The corresponding conclusions in the case of (15 can be obtained by showing that (If is equal to its own commutator subalgebra. This can be done by the following argument. There exists an open cover {UV}VEI of X by contractible canonical coordinate domains with the following additional property: there exists a partition of the indexing system J into a finite collection N

of subsets, J

= U J il such that, for each IL, the open sets (UV)VEI. are pairwise #=1

disjunct. Let (CPv) be a partition of unity subordinate to {UV}VEI and let ell = L CPv; then (ell)~=l is a finite partition of unity; any vector field g E OJ' veJIJ.

can be expressed as (du)# for some function u such that (after adding a suitable constant to u, if X is compact) uw n = difi for some (2n - I)-form ifi, and let ifill = ell' ifi. Let Ull be defined by the equation ullw n = difill and consider the vector fields gil = (du ll )#. By using the arguments of Lemma 3.5 in each Uv for v E J Il we see that, since the domains {UV}VEI. are pairwise disjunct, gil (IL = I, 2, ... , N) can be expressed as a sum of commutators of 2n-pairs of vector fields with support in U U.; the sum of these veJ/.l

N sums of commutators reproduces the given g E @' represented as a finite sum of commutators of vector fields belonging to @'. This completes the proof of the asserted commutator relations. This concludes the proof of the theorem. An immediate application of Theorem 3.7, to be used in the next section, is that we can obtain, using de Rham's theorem, a complete set of Abelian representations of the Lie algebras (SJ and (SJ o as follows. For any differentiable, compact, integral I-cycle y and any g E (SJ, we define

(3.11 )

JW(y)

=

Lg

L w.

Similarly, if g E (~o then, for any integral, locally finite, differentiable, integral I-cycle y we define J(y)(g) by the same formula. It is clear that the


13

value of the integral depends only on the homology class of y: it establishes .the homomorphism of @ (respectively @o) into hom (oH1 (X, R), R) = Hl(X, R) [respectively into hom (H1 (X, R), R) = H~(X, R)] . . We have seen that these two homomorphisms are surjective and are Lie algebra homomorphisms, if we regard the cohomology groups as Abelian Lie algebras. We can also construct geometrically a Lie algebra representation J' of @~ with kernel (5j~ onto the additive group R as follows. If X is not compact, for each point x E X let Yx be an infinite, locally finite, differentiable I-chain whose boundary oYx equals x. If ~ E @~, define the function !{I( 0: X ~ R by the equation (3.12)

!{I(~)(x) =

r ~ L w. JyX

Since Yx is defined modulo the group of I-cycles and ~ E @~, the function ¢Sa) is independent of the choice of the chain; in fact it coincides with the function u with compact support such that (du)# = ~. The representation of @~ into R with kernel @~ then is obviously given by the integral

We have at the moment no satisfactory, faithful geometrical representation of the non-Abelian nilpotent Lie algebra (3jo/@~ in the cases where [@o, @o] = @~, i.e., where the symplectic pairing is not identically zero. The next result concerns the structure of Lie algebra ideals of @ and @o that give rise to normal subgroups. It is clear that any normal subgroup H of any (finite or infinite dimensional) Lie group G of differentiable transformations has as its Lie algebra an ideal in the Lie algebra of G. The converse, however, is false, unless the germs of the actions of G at each point are uniquely determined by their local power series expansion; this would happen, for instance, when G is finite dimensional or if its action is real analytic. In the case at hand, however, when G is the group of all automorphisms f a symplectic manifold (X, w), we know that there are nontrivial elements of G that act trivially in a nonempty open set, and nontrivially in another. In particular let U be an open subdomain of X with U oF 0, V oF X; then the Lie subalgebra @u of @ consisting of all vector fields ~ E @ that vanish identically in U is locally transitive in X - V; clearly it is an ideal in @; however, the group G u of global transformations of X defined by @u is not a normal subgroup of X, since U is not invariant under G. This fact justifies the restrictive condition imposed below. DEFINITION 3.8. Let H be a Lie group of differentiable transformations of a manifold X and Sj its Lie algebra of vector fields. A Lie algebra ideal

14

EUGENIO CALABI

S)' c S) is called stable, if it is invariant under the adjoint representation of H in S).

It is easy to verify that the local isomorphism classes of normal Lie subgroups of H are in one-to-one correspondence with the stable ideals of the Lie algebra of H.

THEOREM 3.9. Let eX, w) be a 2n-dimensional symplectic manifold, and let G be the group of all global, symplectic automorphisms of (X, w}; let Go be the subgroup of G consisting of the transformations with compact support and G~ its third derived group. Then every nontrivial ideal of the Ue algebra @ of G (or @o of Go) that is stable under the adjoint action of Go (indeed even ofG~ alone) contains the Ue algebra @~ ofG~. The proof of this theorem requires a lemma which is the analogue of the lemma of Palais-Cerf in the case of symplectic transformations. It is formulated here in a form somewhat stronger than strictly necessary because it is of independent interest, and the proof given here is independent of that of the Palais-Cerf Lemma for the sake of completeness. LEMMA 3.10. The identity component G~ of the third derived group of Go is strongly transitive on X in the following sense. Let g be a germ of a symplectic transformation at a point Xo E X with target g(x o} = Xl> let y be a simple, differentiable path from Xo to Xl parametrized by t (0 ;£ t ;£ 1, y(0) = Xo, y(I) = Xl} and let Iyl be the corresponding point set. Then for any neighborhood U of Iyl there exists a global transformation g E G~ that represents the given germ g at Xo and equals the identity outside of U. Indeed g can be achieved by a G~-flow that is trivial outside U. PROOF. The proof consists of constructing a sequence of three successive differentiable flows in X (trivial outside of the given neighborhood U of lyJ); the composition of successive differentiable flows into a differentiable flow is elementary. The first stage will bring the point Xo to Xl along the path y; the second one, leaving Xl fixed, will rotate the tangent space XXI of X at Xl so that the composition of this flow with the first stage will match the first order jet of the given germ g of a symplectic map; the third stage will be a symplectic flow, trivial as the previous ones outside U which leaves Xl and each tangent vector at Xl fixed and, after composing it with the two previous stages, matches the germ g itself. Stage 1. The flow carrying Xo to Xl can be actually imbedded in a one~ parameter group, by constructing a symplectic vector field g E C;Sj~ on X with compact support in U whose restriction to Iyl coincides with the tangent vectors dy(t) along the path. Since g has compact support, it generates a one-parameter subgroup of G~ acting globally on X and


15

trivially outside U, whose action on Xo is an extension to all of R of the parameter domain of the path y. The tangent vector dy(t) at each point yet) of iyi yields, by contraction with w, a I-form dy(t) L w = aCt) which is orthogonal to dy(t). Let u be a function on X with the following properties: (i) At each point yet) of iyi, du(y(t» = aCt). (ii) The function u has compact support contained in U. (iii) The integral

Ix uw

n

vanishes.

[Note that property (i) implies that u is constant along iyi.] It is clear that such a function u exists and that the vector field g = (du)# has all the required properties to achieve the flow carrying Xo to Xl. This shows that G~ is pointwise transitive on X. Stage 2. We show now that G~ is transitive on the bundle of symplectic tangential frames. We are given a germ gl of a symplectic transformation at a point Xl with gl(X I) = Xl and we want to exhibit a G~-fIow (g(t»O:::it:::i1 in X leaving Xl and everything outside a neighborhood of U of Xl pointwise fixed, such that g(O) is the identity and dg(I)(xI) = dgl(XI). The fibre map dg(XI) of the tangent space XXI at Xl belongs to the 2n-dimensional, real symplectic group Sp(XXI' W(XI» which is known to be arcwise connected. Let Set) (0 ;;;; t ;;;; I) be a differentiable path in this group from the identity to dgl(X I) and let s'(t)

=

d~~t)

0

S -let) its logarithmic deriva-

tive, so that s'(t) belongs to the Lie algebra 5.)J (XXI' W(XI». This means that the bilinear form W(XI) 0 s'(t) on XXI defined by (W(XI) 0 s'(t»(v, v') = W(XI)(S'(t)(v), v') is symmetric for each t (0 ;;;; t ;;;; 1). We construct now a differentiable function u: [0, I] x X --+ R, i.e., a differentiable oneparameter of functions (ut)O:::it:::i I on X (Ut{x) = u(t, x» with the following properties for each value of the parameter t: (i) The function Ut has compact support, contained in U. (ii) The differential dUt of Ut vanishes at Xl. (iii) The Hessian HxJutl of Ut at Xl equals the symmetric bilinear form W(XI) 0 s'(t). (iv) The integral utw n vanishes.

Ix

The construction of the family of functions (Ut) with the four properties just described is a simple matter. We now consider the one-parameter family of vector fields gt = (dut)# (0 ;;;; t ;;;; I) and look at the differential equation.

(3.13)

og~; x) =

g(t)(g(t, x»;

g(O, x)

=

x.

16

EUGENIO CALABI

Property (i) of Ut and the definition of

~(t)

(gt)O~t~l

imply that

(gt(x) =

g(t, x» has a global solution on [0, I] x X, describing a G~-flow in X that is trivial outside V; property (ii) implies that Xl is fixed throughout the

flow. Property (iii) means that, at the fixed point Xl of this flow, the oneparameter family (dgt(x l » of actions of (gt) on the tangent space XXI at Xl satisfies the same differential equation with the same initial condition as the path Set) in the linear symplectic group Sp(XXI' w(x l therefore dgl(X l ) = S(I) = dg(x l ); the second stage of the process is then completed merely by remarking that property (iv) of Ut implies that the flow is actually in the commutator subgroup G~ of G~. Stage 3. In order to complete the proof of the lemma, we now suppose that the given germ g of a symplectic transformation leaves Xl fixed and acts as the identity on the tangent bundle XXI of Xl and try to extend it to a global, symplectic G~-flow on X, trivial outside a given neighborhood V of Xl. Let {V', (x)} be a local, canonical coordinate system in a neighborhood V' of Xl, V' C V, with XI(X l ) = 0 (l ~ i ~ 2n) and including in its range

»;

2n

the closed ball

2:

(Xi )2

~ a~

for some ao > 0; let Vn for any positive

1=1 2n

r

~

2:

ao, denote the subdomain of V' defined by

(X i )2 < r2 and let the

1=0

germ g be represented by a local symplectic map Xl -+ yi = gi(Xl, ... , x2n) (1 ~ i ~ 2n) in a neighborhood Va, of Xl (0 < al ~ ao). We have, thus, by assumption: n

(3.14)

L: dy

n

2i - 1

1\

dy 2j =

i=l

L: dx

2j - 1

1\

dx 2j

j=l

and (3.15)

gl(X)

2n

L: (X )2.

= Xl + 0(IxI 2 ), where Ixl 2 =

I

1= 1

Let h(s) be a monotone decreasing COO function of s ~ 0 which equals identically I for 0 ~ s ~ b, is strictly positive for b < s < 2b, and vanishes for all s ~ 2b, for some positive b < -tal, sufficiently small for later purposes. Consider the map of f: [0, 1] x X -+ X defined as follows: (3.16)

f(f, x)

=

X

if either f = 0 or x¢: V 2b ;

otherwise, in terms of the local coordinates (xI, ... , x 2n ), i.e., setting fti(Xl(X), ... , x 2n(x» = xi(f(f,

x»,

(3.17) ftl(X) = (t·h(lxj)-lgi(t·h(lxj)x l , ... , t·h(lxj)x2n ),

(i

= 1, ... , 2n).


17

One verifies first that, because of (3.15), equations (3.16) and (3.17) match each other in a C way; furthermore, if the constant b is sufficiently small, the map f t : X -i>- X (ftCx) = fU, is a diffeomorphism for each t (with fo equal to the identity), whilef1 coincides with g in Vb and each f t is symplectic in Vb' Thus the diffeotopy (ft)o ~t ~ 1 is symplectic for each t except in the interior of V 2b - Vb for t > O. Furthermore, one verifies cYJ

x»

directly that in Vb we have

~i

=

O(lxI2) uniformly in t, so that the flow

acts as the identity on the tangent space at Xl' In order to replace (ft ) by a diffeotopy that is symplectic everywhere in X and for aII t, we shaII construct another diffeotopy (g;)o ~ t ~ 1 acting as the identity for t = 0, and, for t > 0, outside Db' - Dbl2 for some b' (2b < b' < a 1 ) so that the composite diffeotopy (gt)o ~ t ~ 1 with gt = It 0 (g;) -1 satisfies W 0 gt = W for each t everywhere in X. Let Wt = W f t ; then Wt = Wo = W except in the spherical sheII V 2b - Db' Let b 1 be arbitrary constant with 2b < b 1 < a 1 ; then Wt - W is closed and cohomologous to 0 with compact support in the open submanifold V b, - Db12 . One can therefore find a differentiable, oneparameter family of I-forms at with compact support in V bl - Dbl2 satisfying for each t (0 ;;;; t ;;;; 1) 0

-1a t U

v

v

vvwn = d1fv, "Iv = (dv v)#. Then each "Iv E showing that each TJv E~.

@~.

We can prove the theorem by


19

We first look at the corresponding (2n - I)-form !fv' In terms of the local coordinates .(x) we have 2n !f. = C-I)i+1!f~Ux))d(.x1) 1\ ... 1\ d(vxi-1) 1\ dC.xi+1) 1\ ... 1\ dC.x2n).

2:

i= 1

Each of the 2n components !f~Ux)) of !f. (extended by the constant zero outside of V v) gives rise to a vector field ~~ = (d!f~(vCx)))# E @~; for each v E J the relationship between the 2n vector fields ~~ (1 ~ i ~ 2n) and 7J. can be described by the equation 7J. =

2n i~

[8 'J 8(.xt)' ~~ .

We now apply to each term the mapping f v ,;, which sends restriction of local vector field

8~i

8C~Xi) into the

to some subdomain of U1 , i.e., into a

restriction of the given vector field g, and ~~ into a vector field df.,iC~v,i) = with compact support in fv,iCV.) c U 1. In other words we have the representation of 2n

rv,t

7J =

2: 2:

[d(fv~/)W, ~~],

vel i= 1

where each ~~ E @~. We can modify each ~~ by adding to it a suitably chosen vector field 7J~,i E @~ with support disjunct 2 from the compact support of d(f;}) a g, so that the sum ~~ + 7J~,i belongs to @~; this modification of ~~ does not affect the Lie product. This shows that the given vector field 7J E @~ belongs to the G~-stable ideal generated by g, thereby completing the proof of the theorem. An analogous result on the commutator algebra @' without restriction at infinity would be desirable. This can be achieved in an elementary way by making suitable assumptions of completeness in an appropriate topology, using the results just established, as follows. COROLLARY. Let eX, w) be a noncompact symplectic manifold, @ the Lie algebra of all differentiable automorphisms of ex, w) and @' its commutator algebra. Then any nonzero ideal in @' that is both: (i) invariant under the adjoint action of G', (ii) either closed under infinite sums with locally finite support or complete under a compact-open extension of a function-space topology on the stalks of the sheaf of germs, must be all of@'. 2 The case where X is compact and the support of f is all of X does not present any essential difficulty. since f could be replaced without loss of generality by its Lie product (assumed # 0) with a vector field in @~ with smaller support.

20

EUGENIO CALABI

The proof of this corollary can be omitted. Summarizing the conclusions of this section, we see that the finitedimensional representations of G and Go are very limited. They were described completely at the end of the proof of Theorem 3.7.

4. Representation of the automorphism groups In this section we shall show how and to what extent the finite-dimensional representations of the Lie algebras @, @o, and @~ can be "lifted" to representations of the corresponding Lie groups G, Go, and G~, respectively. Since these representations are either Abelian or (in the case of @oj@~ when X is not compact and the symplectic pairing in HJ(X, R) is nontrivial) nilpotent, it is clear that they give rise naturally to representations of the respective universal covering groups (analytic groups, in the terminology of Chevalley) (;, Go, and G~ into quotient groups of the corresponding Lie groups Hl(X, R), HJ(X, R) R (this is the semidirect product defined by means of the symplectic pairing) and R. However, since we have at present little or no knowledge about either the group of components (zero-dimensional homotopy group) of G, Go, or G~, such information about their universal covering is not especially interesting except as an intermediate step in attacking the representation problem of the groups themselves. In the remarks after the proof of Theorem 3.7 we exhibited the geometric realization of some A bel ian representations of the Lie algebras @, @o, and @~, using de Rham's theorem. We shall apply these geometric realizations to obtain the corresponding Abelian representations at the first of the analytic groups.

+

DEFINITION 4.1. Given a Lie group L (finite- or infinite-dimensional) we denote by Q(L) the right path-semigroup of L; this is the semigroup whose elements are equivalence classes under right translations of piecewise differentiable, oriented paths in L, composed by composition of paths, after a right translation to match points of transition from each path to the next. We denote by Qo(L) the normal sub-semigroup of L represented by closed paths that are nomotopically trivial.

It is clear that the quotient semigroup Q(L)jQo(L) is naturalIy isomorphic to the universal covering group L of the identity component of L. LEMMA 4.2. Given a (finite- or infinite-dimensional) Lie group L of differentiable transformations of a manifold X and a differentiable representation of Q(L) into a Lie group H, this representation is trivial on Qo(L) (and thereby induces a representation of L), if and only if the associated


21

linear map d$ of the Lie algebra .£! of L into that of H is a Lie algebra homomorphism. PROOF. It is understood that a representation $: neLl ~ H is differentiable, if and only if any (piecewise) differentiable, finite-dimensional parametric family of paths in L has as its image in H under $ a corresponding (piecewise) differentiable family of elements in H. Given a differentiable representation $, then to any (piecewise) differentiable path a: [0, a] ~ Lin L we associate the one-parameter family al[O.tJ: [0, t] ~ L (0 :;:;; t :;:;; a) of paths. The image of this family is a (piecewise) differentiable path in H, originating at the identity, and whose initial tangent

$0

=

d$(a llo•tJ

dt

)! t=o

depends only on the initial tangent 0- 0

=

dal[o.tJ!

dt

t=o

of the given path a. The map of Lie algebras thus obtained is clearly linear. The conclusion of the proof is now reduced to a repetition of the standard proof of the first fundamental theorem of Lie groups, i.e., to the existence and uniqueness theorems for initial value problems in ordinary differential equations. We shall now construct some specific representations, when L is either the group G, Go, or G~ of automorphisms of a symplectic manifold (X, w), of neLl, respectively, into the following Abelian Lie groups, respectively, Hl{X, R), HJ(X . R), and R (written additively), satisfying the conditions of the lemma. We consider, in their respective order, the groups: (i) G, (ii) Go, and (iii) G~ (in case X is not compact) of automorphisms of a symplectic manifold (X, w) and let a = {a(t)} describe a differentiable path in each of these groups, parametrized by t (0 :;:;; t :;:;; I) with a(O) = identity. In each case we consider also, respectively: (i) any finite singular (piecewise differentiable) I-cycle y with integer coefficients in X; (ii) any locally finite, integral, singular I-cycle y in X; (iii) for any given point x any choice of a singular, locally finite, infinite I-chain Yx in X whose boundary 8yx is the O-cycle x. In each case we introduce the finite 2-chains a(y) [respectively u{Yx) in case (iii)] obtained by the homotopy operator on X represented by a and calculate the functionals ii>(a, y) in cases (i) and (ii) and ii>(u, Yx) in case (iii), defined respectively by (4.1)

ii>(a, y) =

i

u(y)

w,

22

EUGENIO CALABI

4.3. Let (X, w) be a symplectic manifold and consider, respectively: (i) the automorphism group G of (X, w), (ii) (if X is not compact) the compactly supported automorphism group Go, and (iii) (if X is not compact) the subgroup G~ of Go. Let is a constant depending only on the system (35). With Zo E D, =I 0, we integrate this inequality with respect to dg dYJ/I~ - zol over D. Remarking that

Zo

1

I(z - n- rp*

(45)

H~2) -'>- H~2)*

= ei'rp, =

e2i 0. Consideration of the terms in ell in jj2E1 gives

rp

A 11> =

0,

SHIING-SHEN CHERN

36

so that is of the form

=

..!.. cp(2: H~3)eu) . k1

u

It should be remarked that with this notation, Re (H~3)q;3) are the third fundamental forms of the surface. The form of degree 6:

(54) is independent of the choice of the frame field and, defined to be zero at a singular point, is well defined over the whole surface S2. With the local isothermal coordinate z it can be written as g(z) dz. 6 We shall show that g(z) is a holomorphic function. This follows immediately from the structure equations. In fact, consideration of the term in E2 in jj2E1 gives

{dk1

+

ik1(2w12 - W34)}

cp =

1\

0.

We can therefore put (55) where 81 is a real one-form. Substituting back, we obtain

or (56)

81

Since D2E2 (57)

=- id log k1'

mod

cpo

= 0, the terms involving eu give, on using (55) and (56),

dH(3) Il

+ '" H(3)w v ~

VJ,l

- 3,·w 12 H(3) #

=

°

-,

mod

m, T

5

~ v ~

N,

V

from which it follows that

Using (32), we derive, as before, that g(z) is holomorphic. Thus the form (54) is an Abelian form of degree 6 and it must be zero. We can therefore define (59) such that

MINIMAL IMMERSIONS OF THE TWO-SPHERE

37

Continuing in this way, we can define a local frame field eA at a regular point of S2, such that, if (60)

we have

DEs = -ks- 1rpE.- 1 - ;W2.-1,2.Es + k.ipES+l, 1 ;;;:; s ;;;:; m, ko = k m = 0.

(61)

The integer m is the smallest integer such that DEm is a linear combination of E m- 1 and Em. Equations (61) will be called the Frenet-Boruvka Formulas for the minimal immersion of a two-sphere in X. Under the change (44), E. will be changed according to (62)

Generalizing (55) we set

2;;;:; s;;;:; m.

(63)

Then the one-forms 8s - 1 , 2 ;;;:; s ;;;:; m, remain invariant under a change of the frame field in the tangent plane. Computing jj2Es and using (41) and

JJ2ES = 0,

(64)

s> I,

we obtain k~

(65)

~

= t(e - K)

°

and

{dk. (66)

dBs

+ i{k~

-

+

k~+l

ik.(8s - Bs- 1)} - t(s + l)K}rp

0, ip = 0,

1\ rp = 1\

Bo = 0, I;;;:;s;;;:;m-l.

These are the integrability conditions of (61), and they can be simplified. In fact, relative 'to the complex structure on S2 we use the operators 8, 0, and (67)

de

=

;(0 - 8).

From the fact that d log ks and B. - Bs -1 are real one-forms, we obtain from the first equation of (66), 1 ;;;:; s ;;;:; m - 1, which gives (68)

B. = de log (k 1 •

••

k.),

l;;;:;s;;;:;m-l.

38

SHIING-SHEN CHERN

To a real-valued smooth function u let its Laplacian b.u be defined by (69)

ddcu =

~ b.uq;

II

cpo

Then, in view of (68), the second equation of (66) gives

(70)

1b.log(k 1 ···ks )

+ k~

- k~+l -1(S

+

I)K=

°

or (71) We summarize our results in the following theorem: THEOREM. Let S2 -+ X be a minimal immersion of the two-sphere into a Riemannian manifold of constant curvature. There is an integer m such that the osculating spaces of order m (and dimension 2m) are parallel along the surface. The singular points are isolated. At a regular point a complete system of local invariants is given by the quantities ks > 0, 1 ~ s ~ m - 1, which satisfy the conditions (71).

The simplest case is when the Gaussian curvature K is constant. If = c, the tangent plane is parallel along the surface and the surface is totally geodesic. If K < c, it follows from (65) and then recursively from (71) that ks are all constants. The same relations give K

(72)

K=

2

m(m

+

c

1) ,

and all the ks can be expressed in terms of c and m. The surface is thus determined up to an isometry in the space.

6. The case when the ambient space is the N-sphere Consider the special case, studied by Calabi, of the minimal immersion (73)

where SN is the sphere of radius 1 in R N + 1 (so that c = 1). The preceding theorem has the consequence that the surface must lie on an even-dimensional great sphere S2m. Without loss of generality we suppose N = 2m. In . this case the tangent vectors of S2m can be realized in R 2 m + 1, and the vectors Es by vectors in the complex number space C2m + 1. If x is a point on the surface, we can write (74)

MINIMAL IMMERSIONS OF THE TWO-SPHERE

39

Moreover, the Levi-Civita connection is defined by orthogonal projection into the tangent spaces of S2m, and we have

(75)

DEI

= -rpX

DEs

=

dE.,

+ dEl, 1 < s.

We interpret Es as homogeneous coordinate vectors in the complex projective space P2m(C), Equation (61) with s = m shows that Em is an algebraic curve. It will be called the generating curve of the minimal surface. From the other equations of (61) we see that EmEm-l is the line tangent to the generating curve at Em, etc., and EmEm -1' .. El is its osculating space of order m - I. Since 1 ;:;; r, s ;:;; m,

the generating curve has the property that its osculating spaces of order Q2m-l in P2m(C), the equation of Q2m-l being

m - 1 belong completely to the nonsingular hyperquadric Z~

(76)

+ zr + ... + z~

=

0,

where Zo, z 1> ••• , Zm are the homogeneous coordinates in P 2m( C). It is a classical result (see [4], p. 235) that the linear subspaces of dimension m - 1 on Q2m-l form an irreducible algebraic variety of (complex) dimension m(m + 1)/2. To construct the minimal surface from the generating curve, we take its conjugate complex curve Em. The corresponding osculating spaces of order m are conjugate complex and therefore meet in a real point x. The minimal surface is the locus of such points of intersection. Boruvka showed that if the generating curve is a normal curve, the resulting minimal surface has constant Gaussian curvature and is given by the spherical harmonics (see [1]). In general, the surface described by x may have isolated points without tangent plane, so that we obtain only a generalized surface. By (65) we can relate the area of a minimal S2 with a projective invariant of the generating curve, thus giving a geometrical interpretation of Calabi's result that this area is an integral mUltiple of 27T. UNIVERSITY OF CALIFORNIA BERKELEY

REFERENCES 1.

0., "Sur les surfaces representees par les fonctions spheriques de premiere espece," J. de Math. Pures et Appl., 1933, pp. 337-383.

BORUVKA,

40

SHIING-SHEN CHERN

2. CALABI, E., "Minimal immersions of surfaces in Euclidean spheres," J. Diff. Geom., 1 (1967), pp. 111-125. 3. CHERN, S., Minimal Submanifolds in a Riemannian Manifold. Mimeographed lecture notes, University of Kansas, 1968. 4. HODGE, W. V. D., and D. PEDOE, Methods of Algebraic Geometry, Vol. II. New York: Cambridge University Press, 1952.

of Cantor Sets Transversality of Semigroups

Intersections and

HARRY FURSTENBERG

1. Introduction Let X be a compact metric space endowed with a notion of dimension for subsets of X (analogous to Hausdorff dimension for subsets of a manifold). Two closed subsets A, B c X will be called transverse if

(I)

· A d 1m

n

B < {dim A

--

0

+ dim B

- dim X

when this is ;;; 0 . ot h erWlse.

Let Sl and S2 be two semigroups (or groups) acting on X (i.e., each Sj is a semigroup of continuous transformations of X into itself). Sl and S2 will be called transverse if whenever A is a closed Sl-invariant set and B a closed S2-invariant set, A and B are transverse. Finally, two transformations Tl and T2 of X ~ X are transverse if the semigroups they generate are transverse. The investigations to be described arose in an attempt to verify that certain pairs of endomorphisms of a compact Abelian group are transverse. Before discussing these we shall say something of the background of the problem. If S is a semigroup acting on a space X one can frequently verify that for almost all points x E X (in an appropriate sense) the orbit Sx of x is dense in X. When the exceptional set is not empty, one would like to have conditions to ensure that a given point is not exceptional. The following is a hypothetical situation where this is realized. We assume that . A. (2)

(i) S' is another semi group acting on X, and for every x EX

dim Sx

+ dim S'x

;;; dim X.

(ii) If A is a closed S-invariant set with dim A = dim X, then A = X. Under A(i), if the semigroup S' attaches a "small" orbit to a point x, then the semigroup S will have a "large" orbit at x. In particular, if dim S'x = 0, 41

42

HARR Y FURSTENBERG

then necessarily dim Sx = dim X and, by (ii), Sx is dense in X. We have then a sufficient condition which ensures that a point has a dense orbit under S. Now the condition A(ii) is actually met in many situations. Condition A(i) is, however, a very special one, and the notion of transversality was introduced as a less delicate notion which might provide evidence for the validity of (2). The relationship between the two is this. Suppose A is a closed S-invariant set, B a closed S'-invariant set with dim A + dim B < dim X. According to A(i), A n B must be empty. If, on the other hand, the semi groups are transverse, what can be inferred is that dim A n B = O. In many cases transversality of Sand S' may be seen to imply that (2) is valid for all x outside a set of dimension O. Actually we have demanded too much in A(i), and the best one can hope for is that (2) be valid for all x outside of a small "constructible" set. For example, there are usually points x for which both Sx and S'x are finite. These periodic points can be explicitly determined. Let us consider still another hypothesis regarding transformation semigroups SI and S2: B. If A is a proper closed subset of X invariant under both SI and S2, then dim A = O. Note that if SI and S2 are transverse and, in addition, A(ii) is valid, then B will be verified. For d· A = d· A A < 1m 1m n =

{20,dim A -

dim X,

2

if dim A ~ dim X if 2 dim A < dim X.

The first alternative leads to dim A = dim X whence A = X and the second alternative leads to dim A = O.

2. Examples For a first example we take X = the circle group K = R/Z. The integers act as endomorphisms on K, and if S is any multiplicative semigroup of integers, it may be thought of as acting on K. In [l] we obtained the following theorems: THEOREM 1. Let p and q be two integers > 1, with both not powers of the same integer, and denote by S1' and Sq the semigroups generated respectively by p and q. If A is a proper closed subset of K invariant under both S1' and Sq then A is finite.

INTERSECTIONS OF CANTOR SETS

43

Theorem 1 may be regarded as a theorem in diophantine approximation. For if S is a semigroup containing both Sp and Sq and a is an irrational, then by Theorem I, Sa = K. In particular we have THEOREM 2. If S is a multiplicative semigroup of integers 0 not consisting only of powers of a single integer and a is irrational, then for every € > 0

there exist s

E

Sand r E Z with

la - ~I < ~.

The condition that the semigroup not consist only of powers of a single integer is necessary for the conclusion. For if S c {an}, a > 0, and we take a

=

i a1

2n

+

i a-

n2 ,

then

a

is irrational and cannot be "well appro xi-

1

mated" by fractions with denominator in S. Theorem I is a sharpening of B. However it is also a consequence of B. For it can be shown without much difficulty (see [I)) that if A has the property in question, then the set of differences A - A is either all of K or it has 0 as an isolated point. The first alternative is impossible if dim A = 0, and the second implies that A is finite. Thus Theorem 1 would be a consequence of the following conjecture: Conjecture 1. The semigroups Sp and Sq of Theorem 1 are transverse. Related to this is the following: Conjecture 2. For every irrational x E K, dim Spx

+ dim SqX

~

I.

Of course when x is rational then Spx and SqX are finite. If we associate with a real number its expansion to the base p, then mUltiplying by p corresponds to shifting the expansion. A subset of K invariant under Sp corresponds to a family of one-sided infinite sequences with entries 0, ... , p - I and shift invariant. In particular, if we consider the set of all sequences whose entries belong to a proper subset of {O, ... , p - I}, these will determine a closed Sp-invariant set. We will call such a set a Cantor p-set. Conjecture I implies that if p and q are not powers of the same integer, then each Cantor p-set is transverse to each Cantorq-set. We note that the Cantor p-set corresponding to the subset {aI' ... , aT} c {O, ... , p - I} has Hausdorff dimension log rjlogp (see [I], p. 35). Moreover if A is any proper closed Sp-invariant set, then A is contained in some proper Cantor pm-set for some m. It follows that dim A < dim K unless A = K. This verifies condition A(ii). An interesting consequence of Conjecture 2 is the following: Conjecture 2'. Suppose that p and q are not powers of the same integer. Then in the expansion of pn to the base pq every digit and every combination of digits occurs as soon as n is sufficiently large.

44

HARR Y FURSTENBERG

For concreteness let us take p = 2 and q = 5 so that pq = 10. Suppose that some combination of digits blo ... , bl was missing from the expansions of infinitely many 2n. We do not lose generality by supposing that b1 -# 0, b l -# O. Choose a subsequence of these n which increases very rapidly, say 00

{n;}. Form the number

g = L 5 -n,. If nl+ 1

__

-

nl --+ 00

then S5g is countable

1

and by Conjecture 2, dim SlOg = I. Hence SlOg is dense in K. We consider the decimal expansion of g, First, to obtain the decimal expansion of 5- n , we take the expansion of 2n , and move the decimal point over ni places: 5- n, = .00·· ·Oalla·· ·aij'

Here ail a· .. aij is the expansion of 2n, so that b1 ••• bl does not occur in the foregoing. We now assume the nj to increase so rapidly that none of the blocks of aik overlap. Then b1 ••• bl does not occur in the decimal expansion of g, and SlOg cannot be dense. Thus Conjecture 2' is a consequence of Conjecture 2. To find other candidates for transversality we exhibit another pair of transformation semigroups for which hypothesis B is valid. Let r be an integer > I. Denote by IT the ring of r-adic integers. IT is the completion of the integers Z in the non-Archimedean metric for which the distance between two numbers goes to 0 as their difference is divisible by high powers of r. Each element of IT has a unique expansion as an infinite series: 00

to, ... , r -

= L wnrn

Wn

E

operation x

--+

[~]

x

o

I}. If p divides some power of r then the

([y] denoting the greatest integer in y) is uniformly

continuous on the integers and extends to a transformation Dp on IT' One verifies that Dpq = DpDq. We now take X = IT' We shall see presently that as a sequence space IT is endowed with a natural notion of dimension for which dim IT = log r. We now have THEOREM 3. Let r = pq where p > I and q > I are not powers of the same integer. Let S~ and S~ denote, respectively, the semigroups of transformations of IT generated by Dp and Dq. If A is a proper closed subset of IT invariant under both S~ and S~, then dim A = O.

Theorem 3 is a non-Archimedean analogue of Theorem 1 but it is also a consequence of the latter. We shall sketch a proof. Let OT denote the space of all doubly infinite sequences W = (w n ) with entries in {O, ... , r - I}. T: OT --+ OT denotes the shift operation. We have a map rp+: OT --+ Ir with rp+(W) =

00

L wnrn o

as well as rp-: OT --+ K with rp-(w) =

-1

L wnrn.

Suppose

-00

now that A is a closed subset of IT invariant under Dp and Dq. Then it is ,/


45

invariant under Dpq = Dr which is simply the shift operation on one-sided sequences. Let A be the set of sequences w in Q r satisfying cp+(Tmw} E A for each m. It can be shown that A is infinite if dim A > O. The operations Dp and Dq may be extended to Qr and one finds that they leave A invariant. For

with

w~

(3)

= [;n]

+ qWn+l (mod r),

and (3) defines an operation on Qr. Now take B = cp-(A). It follows from -1

the foregoing and the shift invariance of A that if L ~nrn

-1

E

B so is

-00

L

t~rn,

-co

where

t~

= [~~-1]

+ qtn

(mod r).

But this means that if x E B, so is qx. Similarly B is invariant under x -+ px. By Theorem I, B is therefore either finite or all of K. One sees that this implies that A is either finite or all of Qr. But the former implies that dim A = 0 and the latter implies that A = I r • In analogy with the Archimedean case we formulate the following: Conjecture 3. The operations Dp and Dq are transverse on I r. For a final example we take X = Ip where p is a prime. Let 7r be a p-adic integer divisible by p but not by p2. In addition to the standard p-adic representation, each x

E

Ir has a representation x =

co

Lo Wn7rn

with

Wn E

{O, ... , p - I}. The shift operation on sequences now induces a transformation Dn on Ir with Dnx = 7r- 1 (x - hex»~ where hex) E {O, ... , p - I} and hex) x (mod p). Now let 7r1 and 7r2 be two such p-adic numbers. Conjecture 4. Dn, and Dn2 are transverse on Ip provided 7r27rl1 is not a root of unity. Of course 7r27rl1 is, in any case, a p-adic unit. In case 7r1 = P and 7r2 = pq where q is relatively prime to p, then it may be shown that Conjecture 4 is a consequence of Conjecture 3. We omit the proof.

=

3. Strong transversality the present investigation is devoted to a result which lends further plausibility to the conjectures of the preceding section. To formulate this we shall identify subsets of the circle K with subsets of the real line. A closed subset A c [0, 1] will be called a p-set if pA c A U (A + I) U

HARR Y FURSTENBERG

46

+ 2) u ... U

+p

- I). This corresponds to an Sp-invariant subset of K. Now if A and B are subsets of the line, it is easy to see that

(A

(A

dim A

II

B = dim A x B

II

10 ,

where 10 represents the diagonal line x = y of the plane. Similarly if I is an arbitrary line of the plane, I: y = ux + t, then dim (uA + t) II B = dim A x Bill. The remainder of this paper is devoted to a proof of the following property of p-sets. THEOREM 4. Let A be a p-set and B a q-set where p and q are not powers of the same integer, and let C = A x B. Let S be an arbitrary number. If there is some line I with positive, finite slope which intersects C in a set of dimension > S, then for almost every u > 0 (in the sense ofLebesgue measure) there is a line of slope u intersecting C in a set of dimension> S.

We can reformulate this using a variant of transversality. We say that two closed subsets A, B of the line are strongly transverse if every translate A + t of A is transverse to B. We remark that for arbitrary closed A, B, almost every translate of A is transverse to B. From Theorem 4 we deduce the following theorem: THEOREM 5. Let A and B be as in Theorem 4. Assume that for a set of u of positive Lebesgue measure the dilation uA of A is strongly transverse to B. Then A and B are strongly transverse.

For, if not, there is a line with positive slope that intersects C in a set of dimension > dim A + dim B-1 (resp. 0). Then for almost all directions this will be the case and dim (uA + t) II B > dim A + dim B-1 (resp. 0), and since dim A = dim uA, uA and B are not strongly transverse. In this connection let us mention a further conjecture: Conjecture 5. For arbitrary compact subsets A, B of the line, almost all dilations of A are strongly transverse to B. The reason for expecting almost all dilations of a set to be well behaved with respect to another set is the following result (for which I am grateful to J.-P. Kahane): THEOREM 6. If A and B are arbitrary compact subsets of the line, for almost all dilations uA of A we have

· (B d1m +uA) = {dim A + dim B, I,

if this is. ~ I, otherWIse.

Here B + uA denotes the set of sums ux + y, X E A, Y E B. The foregoing is a special case of a more general result asserting that if C is a compact set in the plane with dim C = y, y ~ I, then in almost every /


47

direction C projects onto a linear set of dimension y. This is easily proven using the characterisation of dimension in terms of capacities (see [2]). Clearly, Conjecture 5, together with the theorem we are about to prove, imply the validity of Conjecture 1. In the non-Archimedean case a similar result is true. Specifically consider subsets A, B of I 11 . I11 is contained in the field of all p-adic numbers and we may speak of lines in the p-adic plane intersecting A x B. We shall be interested in lines of the form y = ux + t where the "slope" u is a p-adic unit. THEOREM 7. Let A be a closed subset of I11 invariant under D"l' B a closed subset invariant under D"2' and assume 7T27Tl1 is not a root of unity. There is a subgroup U of finite index in the group of p-adic units such that if a single line y = uox + to intersects A x B in a set of dimension > 8, where Uo E U, thenfor almost every u E U (with respect to Haar measure on U) there is a line with slope u intersecting A x B in a set of dimension > 8. The proofs of Theorems 4 and 7 are very similar and we shall confine our attention to the former. 4. Trees and dimension Let A be a finite set with r elements. Denote by Q A the product A x A x A x . .. endowed with the usual topology that renders it a compact Hausdorff space. We shall denote by A * the free semigroup generated by 00

A: A*

=

U o

An where AO consists of the empty word which we denote

by 1, and where multiplication is by juxtaposition. If a E An we shall write /(a) = n. We shall use the termfactor to denote left factor: a is a factor of T if T = ap for some p. DEFINITION 1.

A subset d c A * is called a tree if

(i) 1 Ed. (ii) If a E d then every factor of a Ed. (iii) If a E d then some aa Ed for a E A.

---

The following example will justify the terminology:

_ _ aaa _ _

a~ 1/

'"

aa

--aab--

~ab-aba-b ............... ba _ __ _ baa :::::::::. bab--

48

HARRY FURSTENBERG

One can set up a correspondence between closed subsets of Q A and trees in A*. If A c Q A let A* denote the set of all initial segments (including 1) occurring in sequences of A. Clearly A* is a tree. Conversely if Ll is a tree, we associate to it the set of all infinite sequences all of whose initial segments are words in Ll. It is easily seen that this is a one-to-one correspondence. (If, however, A is not required to be closed, then the set associated to A* is the closure of A.) DEFINITION 2. A section of a tree Ll is a finite subset II c Ll satisfying the following conditions:

(i) With finitely many exceptions, each element of Ll has a factor in II. (ii) If p is a factor of u and both p, u E II then p = u. If aI, ala2, ... , ala2, ... , an is an increasing sequence of elements of Ll, then one of these must eventually possess a factor in II by (i), and by (ii) it follows that the foregoing sequence intersects II in exactly one element. We see from this that a section of the tree A* corresponds to an irredundant open covering of the compact set A. If II is a section we denote by /(11) the minimum of /(u), u Ell. DEFINITION 3. The dimension of a tree Ll is defined as the g.l.b. of the set of A with the property: 3 sections II c Ll with /(11) arbitrarily large and

2: e-

1I1 (a)

< 1.

aen

If A is a closed subset of Q A we set dim A = dim A*. The connection with Hausdorff dimension is given in the following: LEMMA

1.

If A = {O, ... , r - I} and A is a closed subset ofQA' set

A

=

{~wnrn:

W

= (w n) E

A}

c [0, 1].

Then

dim A = log r x Hausdorff dim (A). The proof is straightforward. One notices as a consequence that the dimension of a set depends only on the "geometry" of the associated tree and not upon how one labels its vertices. Thus if in Ll, each u Ellis followed by exactly m successors uah then dim Ll = log m. We introduce into the space of trees a compact Hausdorff topology by setting D(Ll L1') ,

=m _1_ if Ll n +1

Am

= L1' n

Am but Ll

n

Am+l =I- L1'

n

Am

If Ll is a tree and if p Ell, then {ulpu Ell} is a tree which we denote LlfJ.

49


4. If ~ is a tree we denote by ~(~) the closure of the set in the space of trees. The trees of ~(~) are called derived trees

DEFINITION {~P, p E~} of~.

Just as trees correspond to sets in !1A we can define objects corresponding to measures on !1A as follows: DEFINITION

A real-valued function 8 on A * is a T-function if

5.

(i) 0 ~ 8(a) ~ 1, (ii) 8(1) = 1, (iii) 8(a) = L 8(aa). aeA

The set of T-functions will be denoted by TA • The support 181 of a T-function is always a tree. Conversely every tree is the support of some T-function. If p. is a probability measure on !1A and we denote by P!1A the set of sequences that begin with p, then the function 8(p) = p.(p!1 A) clearly represents a T-function. The space TA is a compact metric space where we set ~(8, 8')

=

L:

2- 1(a)18(a) - 8'(a)l·

t7eA·

The convergence of this series follows from the fact that

L

8(a)

=

1.

l(a)=n

REMARK.

Quite generally, if II is a section of

181

we have

L

aeIT

8(a) = 1.

This is a special case of a still more general assertion: if p is a factor of an element of II then L 8(a) = 8(p), the sum being extended over those a E II of which p is a factor. To see this, note that it is obviously true if p E II. If p ~ II then all its successors pa, a E A are factors of elements of II and it suffices to establish the assertion for these. But then our assertion follows by induction on max lea) - l(p). If

8 is a

aeII

T-function, then for each pEl 81 we may set 8P (a)

8p is again a T-function and 18P I =

=

8(pa). 8(p)

IW.

DEFINITION 6. If () is a T-function we denote by ~«(}) the closure in TA of the set {8 a : (}(a) > O}. The functions of ~«(}) are derived T-functions of ().

50

HARRY FURSTENBERG

We can define a notion of dimension for T-functions: DEFINITION

If 0 is a T-function we set

7.

d·

r

0-

1m

.

L

f

na~~ti~~oiIOI

-

1 O(a) logO(a) L O(a)/(a)

"ell

"ell

LEMMA 2. (i) dim 0 ~ dim 101. (ii) If ~ is a tree, dim ~ = sup {dim 0:

PROOF.

101

c ~}.

If

then

L O(a)e-ill("l+

log 1/0(,,)

< 1,

"ell

whence by Jensen's inequality (using

L O(a) =

L O(a) log (e- ill

(,,)+

1):

dim 0, and hence dim 101 ~ dim o. This proves (i). To prove (ii) we need only prove that for A < dim~, 3 0 with 101 c ~ and dim 0 ~ A. We use [2, Theoreme II, p. 27], which asserts that if E is a compact subset of the line with Hausdorff dimension > {3, then there exists a probability measure fL on E with fL([a, b] < Clb - aiR). If ~ = A* then A c OA corresponds to a subset of [0, I] obtained by viewing OA as expansions of real numbers to the base r (= card. A). This set has Hausdorff dimension > ,\flog r. Choose {3 = A/log r and let fL be a measure satisfying the foregoing inequality. We lift fL to a measure p, on A and we find that P,(pOA) = fL(I) where I is an interval of length ,-I(p). Thus if O(p) = P,(pOA), O(p) ~ e,-RI(p), or 1

log O(p) ~ -log

e + {3/(p) log r = -log e + M(p)

so that 1

I·

. f

1m. In. I(IT) .... ao

L O(p) log O(p ) pell "O( L., p)/( p)

> \ =

pell

Here we use the fact that

2: O(p)/(p) ~ /(ll).

pell

This completes the proof of Lemma 2.

1\.


51

5. Markov processes on Til. The space of T-functions, Til., is endowed with a natural set of transition probabilities which induce a family of Markov processes on Til.. If (J ETA, the numbers (J(a), a E A constitute a probability distribution on A, and we may define transition probabilities on Til. by assigning probability (J(a) to the transition (J --+ (Ja. We call these the canonical transition probabilities on Til.. Fixing an initial distribution for a TII.-valued variable zo, there is determined a unique process {zn} in accordance with these transition probabilities. Inasmuch as (J(a) is a continuous function of (J, it follows that the induced Markov operator transforms continuous functions to continuous functions. There will exist stationary measures; these constitute a compact convex set, and the extremals of this set determine ergodic stationary processes {zn}. DEFINITION 8. A C-process is a stationary Markov process on Til. with the canonical transition probabilities. The main step in the proof of Theorem 4 is provided by the following theorem. THEOREM 8. If!l is a tree with dim!l > S, then there exists an ergodic C-process {zn} such that with probability I, IZnl is contained in trees of~(!l), and such that with probability I, dim IZnl > S. The purpose of this theorem is to exhibit an abundance of trees of dimension > S once we have a single tree with dimension > S. The following shows how a single T-function (Jo may generate a C-process: DEFINITION 9. A measure p. on Til. is compatible with aT-function (Jo if there exists a sequence Nk --+ 00 such that for each continuous function fon Til., (4)

rINk JTAf«(J) dp.«(J) = !~n;, Nk + I n~ tEII.t;1 9 0I (Jo{r)f«(Jb).

LEMMA 3. If p. is compatible with (J then p. has its support on ~«(J). In addition p. is a stationary measure for the canonical transition probabilities. PROOF. The first statement is obvious and the second involves a straightforward verification. If p«(J, (J') denotes the canonical transition probabilities, then

HARR Y FURSTENBERG

52

To show that

p.

is stationary we must show that

But

DEFINITION

LEMMA

4.

10.

If p. is a stationary measure on TA we set

If p. is compatible with 00 then E(p.)

~

dim 00.

PROOF.

Now let ilk denote the section 1001 n the proof just given is the same as

ANk+1.

Then the last expression in

which proves the lemma. LEMMA 5. For any T-function 00 there exists an ergodic stationary measure p. for the canonical transition probabilities with support on 2}( (0) and with E(p.) ~ dim 00.

PROOF. To begin with, there exist measures p.' compatible with 80 since we can choose {Nk } so that (4) converges for every function in p. If we set g(x) = [px] and h(y) = [qx], then A is closed under x -r px - g(x), and B is closed under y -r qy - h(y). Let us define two transformations of the unit square: (7)

1(X, y) 2(X, y)

= (px - g(x), y) = (px - g(x), qy - h(y».

Then A x B is invariant under both transformations 1 and 2. Each ; transforms a line into a finite number of line segments, and if 1 is a line with slope u, then each line of 1(/) has slope ulp and each line of 2(1) has slope quIp. Now suppose that 1 is a line that intersects A x B in a set of dimension > 8. The same will be true of at least one of the lines of 1(1) and of one


55

of the lines of S. If the first of these lines had slope u, we will find all the slopes u' = qmu/pn, n > m, represented. Note that the set of u' of this form is dense in (0, (0) precisely when p and q are not powers of the same integer. For that is equivalent to log q/log p being irrational, which implies that log u'

=

log p(m Ii0g q - n ogp

+ liog U) ogp

is dense in the reals. Thus under the hypotheses of the theorem, we will obtain a dense set of slopes with the desired property. The machinery of the preceding section will be invoked to extend the conclusion to almost all u'. Note, however, that the present argument shows that it suffices to establish the assertion of the theorem for slopes in some finite interval. We shall do so for I ~ u ~ q. We now introduce a number of spaces that will playa role in proving Theorem 4. We denote the subset A x B of the unit square in the plane by C. L will denote the set of lines Y = uX + I with I ~ u ~ q and which intersect the unit square. We will speak interchangeably of the line I and the pair (u, I). We denote by W the set of ordered pairs: a point of C, a line of L through the point. Thus W = {(x, y, u, I):(X, y) E C, Y = ux + I}. We define a transformation dim D(lo), and let n be a section of D(lo) with L e-Bl(a) < 1. If (fell

(x, y) E 10 n C then y(x, y, 10) begins with some sequence of n. If (x', y'), (x", yO) are two points corresponding to the same element u of n then, by

HARRY FURSTENBERG

58

Lemma 9, lx' - x"1 < p-l((1). Hence II determines a covering of 10 n C by intervals of respective length p-l((1). But we have

2 {p

-1«(1)}Il/IOg p

< 1,

(1

so that f3 ~ logp x dim (10 n C). It follows that dim D(lo) ~ logp x dim (10 n C). Conversely suppose a > dim (10 n C) and that {Jj } is a covering of 10 n Cwith ~ IJjia < H-1p -a. Here IJI denotes the length of the interval J. Define nj by p-",-1 ~ IJd < p-"'. Next take the set of all segments in D(lo) of length n which occur in ')I(x, y, 10 ) where (x, y) is a point of J j • Denote this set II j • By Lemma 10, there are at most H elements in III. Now U III = II' is a finite set which clearly contains a section of D(lo). We have 2 e-alogpl«(1) ~ H2

e-alogpn, =

H2P-an, ~ Hpa 21Jlla < 1,

uen'

so that a log p ~ dim D(lo). This proves the lemma. One can also introduce the notion of T-functions in the context of L-trees. 00

DEFINITION 12.

A TL-function is a function () on

U L"

satisfying

1

(i) () has its support in some D(lo), (ii) ~ ~ 1, (iii) ()(/o) = 1, (iv) ()(/o, It, ... , 1m) = ()(lo, It, ... , 1m, I').

° ()

2 I'

We denote the compact space of all TL-functions by T L • On this space we have a system of canonical transition probabilities which may be described as follows. If () has support in D(lo) consider the set of Ii with loll in D(lo) for which ()(lo/D > 0. This is a finite set. We then set ()1(1t, 12, ..• , 1m)

=

t

o,

if It i' Ii ()(l I I I ) 0, 1, 2,···, m ·f I - II ()(/o, Ii) ,1 1 1·

The transitions for the canonical transition probabilities are from () to ()I with probability ()(/o, Ii). Note that here 10 plays the role of the empty word in the case of T-functions. But for this modification the theories are perfectly analogous. In analogy with Theorem 8 one has the following theorem: THEOREM 9. If D is an L-tree with dim D > 8', then there exists an ergodic stationary Markov process {z,,} with state space TL , with the canonical transition probabilities, and with dim Iz,,1 > 8 (with probability 1).


59

We are now in a position to prove Theorem 4. Suppose dim (10 n C) > 8. By Lemma 11, dim D(lo) > 8jlogp. Now apply Theorem 9 and let {zn} denote the stationary process provided by the theorem. For any T-function, denote by /(8) the line determined by 181 c D(/(8)). Consider now the stationary, line-valued process /(zn). Since with probability I, dim IZnl > 8jlogp, a fortiori, dim D(I(zn)) > 8jlogp. By Lemma 11, we have, with probability 1, dim (I(zn) n C) > 8. We now determine the distribution of the slopes of l(zn). From the form of the canonical transition probabilities we see that if Un is the slope of /(zn), the slope of /(Zn+ 1) is given by

Un +1

Let

! { =

Un

1 - Un p

if Un < p, . If Un > p.

denote the distribution of IIOg u. Then v is a measure on the unit ogq interval invariant under the transformation t - ? t - IIog p (modulo 1). But ogq this is a version of the irrational rotation of the circle and Lebesgue measure is the unique invariant measure. It follows that the distribution of U is a measure equivalent to Lebesgue measure. The assertion of Theorem 4 is now a consequence of the Fubini theorem. V

HEBREW UNIVERSITY JERUSALEM

REFERENCES 1.

H., "Disjointness in ergodic theory, minimal sets, and a problem in diophantine approximation," Mathematical Systems Theory, I (1967),1-49. 2. KAHANE, l.-P., and R. SALEM, Ensembles Parfaits et Series Trigonometriques. Paris: Hermann & Cie., 1963. FURSTENBERG,

Kahlersche Mannigfaltigkeiten mit hyper-q-konvexem Rand HANS GRAUERT UND OSWALD RIEMENSCHNEIDER

Einleitung 1953 zeigte K. Kodaira [3], daB die Kohomologiegruppen HS(X, F) fUr s < n verschwinden, wenn X eine n-dimensionale kompakte kahlersche Mannigfaltigkeit und Fein negatives komplex-analytisches Geradenbiindel auf X ist. Hierbei bezeichnet F wie iiblich die Garbe der Keime von lokalen holomorphen Schnitten in F. Kodaira verwandte beim Beweis dieses Ergebnisses neben der harmonischen Analysis wesentlich eine Ungleichung, die zuerst von S. Bochner angegeben wurde (man vgl. [2]). Bis heute ist es nicht gelungen, diesen Satz fUr vollstandige projektiv-algebraische Mannigfaltigkeiten auf algebraischem Wege herzuleiten. Man gelangte jedoch zu mehreren Verscharfungen und Verallgemeinerungen. Von Y. Akizuki und S. Nakano wurde bewiesen [1], daB in der obigen Situation sogar HS(X, F 0 or) = 0 ist fUr r + s < n, wenn or die Garbe der Keime von lokalen holomorphen r-Formen bezeichnet. Nakano [6] gewann danach eine Aussage iiber negative Vektorraumbiindel V auf kompakten kahlerschen Mannigfaltigkeiten X: Es gilt stets HS(X, Y) = 0 fUr s < n. SchlieBlich iibertrug E. Vesentini [8) 1959 einige Resultate auf semi-negative Geradenbiindel. Da kompakte komplexe Mannigfaltigkeiten mit solchen Geradenbiindeln nicht mehr notwendig kahlersch sind, stellte sich die Frage, ob die "Vanishing"-Aussagen wesentlich an der KahlerStruktur hiingen. Diese Frage wurde, ebenfalls von Vesentini, im positiven Sinne beantwortet. Ein Geradenbiindel FheiBt negativ, wenn es eine offene, relativ-kompakt in F gelegene Umgebu.ng der Nullschnittftache gibt, die in jedem Randpunkt streng pseudokonvex ist. Ein Vektorraumbiindel mit der entsprechenden Eigenschaft nennen wir schwach negativ. Es hat sich gezeigt, daB negative Vektorraumbiindel (im Sinne der Definition von Nakano) 61

62

GRAUERT AND RIEMENSCHNEIDER

aueh sehwaeh negativ sind. Das Umgekehrte gilt jedoeh nieht. Es gibt sogar, wie wir zeigen werden, sehwaeh negative Vektorraumbiindel V iiber dem p2 mit H 1 (P2, y) # O. Wir zeigen dies aueh fUr beliebige projektivalgebraisehe Mannigfaltigkeiten. In der vorliegenden Arbeit sollen die Siitze von Kodaira, Akizuki und Nakano weiter verallgemeinert werden. Zu diesem Zweeke miissen Ergebnisse von 1. Kohn [4] wesentlieh herangezogen werden. Wir betraehten eine n-dimensionale komplexe Mannigfaltigkeit X, die relativ-kompakter offener Teilbereieh einer komplexen Mannigfaltigkeit X ist. Auf X sei eine kahlersehe Metrik gegeben. Ferner sei der Rand oX von X glatt und beliebig oft differenzierbar. 1st oX streng pseudokonvex (wir sagen aueh l-konvex), so zeigen wir, daB Hg(X, (!) fiir s < n und HS(X, Qn) fUr s > 0 versehwindet. Dabei ist (!) = QO die Strukturgarbe von X und Hg(X, (!) die s-te Kohomologiegruppe mit kompaktem Trager. Ersetzt man die l-Konvexitat dureh die sehwaehere q-Konvexitat, q > I, so sind im allgemeinen Hg(X, (!) fUr s ~ n - q und HS(X, Qn) fUr s ~ q von Null versehieden, obgleieh man zunaehst das Gegenteil vermuten moehte. Ein Beispiel hierzu kann aus einem sehwaeh negativen Vektorraumbiindel iiber dem p2 gewonnen werden, fiir das nieht das Vanishing-Theorem von Nakano gilt. Wir miissen daher die q-Konvexitat fUr q > I durch eine starkere Forderung ersetzen, die wir als Hyper-q-Konvexitat bezeichnen. In Teil I der Arbeit werden bekannte Ergebnisse zusammengestellt, die spater benotigt werden. Der Teil2 enthiilt den Beweis der Hauptaussage, und in Teil 3 wird gezeigt, wie dieser Beweis in Beziehung steht zu den Vanishing-Theoremen von Kodaira, Akizuki und Nakano. Ferner werden dort Gegenbeispiele gebracht. 1. Ergebnisse von J. Kobo A. Wir stell en zunachst einige bekannte Tatsachen aus der Theorie der kahlerschen Mannigfaltigkeiten zusammen. Es sei X eine iiberall n-dimensionale komplexe Mannigfaltigkeit; es bezeichne A'den C-Vektorraum der (beliebig oft) differenzierbaren komplexen Differentialformen vom Grade I auf X und Ar,s den Vektorraum der Formen yom Typ (r, s). Es gilt also A' = EEl Ar,s. Die totale Ableitung d: A'--+A'+1 laBt sich in T+s=l

der Form d = d' + d" schreiben, wobei d': Ar,s --+ Ar + l,s die totale Ableitung nach den Variablen Z1> ••. , Zn und d": Ar,s --+ Ar.s + 1 die totale Ableitung nach den 21>' •• , 2n bezeichnet. Es gilt: d 2 = d'2 = d"2 = 0, d'd" + d"d' = O,dip = drp,d'ip = d"rpundd(rp I\.p) = drp I\.p + (-I)'rp 1\ d.p, wenn rp E Al ist (die letzte Formel gilt analog auch fUr d' und d").

KAHLERSCHE MANNIGFALTIGKEITEN

63

Es sei auf X eine (positiv-definite) hermitesche Metrik ds 2 = L gvtl dz v dZIl gegeben. Dann ist der Metrik ds 2 ein Anti-Isomorphismus i: Ar,s -+ An-T,n-s, d.h. eine R-lineare bijektive Abbildung mit iiiccp = C*cp, C E C, zugeordnet, so daB iii 2cp = (_l)lcp flir cp E Al gilt. Eine positivdefinite hermitesche Metrik auf X heiBt kiihlersch, wenn die zugeordnete B.

(reelIe) Form w: =

~ L gwl dzv 1\

dZIl geschlossen ist, d.h. wenn dw = 0

gilt. I. Es sei ds 2 = L gvtl dz v dzu eine hermitesche Metrik in X und Xo E X ein Punkt. Unter einem geodatischen Koordinatensystem zu Xo und ds 2 versteht man ein System von lokalen komplexen Koordinaten Zh ... , Zn in einer Umgebung U(xo), so daB Xo die Koordinaten zv(xo) = 0, v = I, ... , n, besitzt und flir aIle v, /L = I, ... , n gilt: gv,.{xo) = avu , dgvtl(xo) = o. Geodatische Koordinaten heiBen in der angelsachsischen Literatur auch "normal coordinates". Es gilt: DEFINITION

Xo

Eine hermitesche Metrik ist genau dann kiihlersch, wenn es zujedem Punkt E X ein geodiitisches Koordinatensystem gibt.

Hat man in Xo ein geodatisches Koordinatensystem, so gilt flir cp = L a V, · OV.,tl, °tls dZV1 1\. 1\ dZ vr 1\ dzu, 1\ 1\ dzu, = L avil d3v 1\ d~u EAT,s im Punkte Xo: 0

(1)

0

"icp b(r, s, n)

=

=

0

0

•

0

••

L

b(r, s, n) sign (v, *v) sign (/L, */L)ii vtl d3*v (_l)n(n-l)/2+S(n-T)2 T+s- nin,

1\

d3*u,

wobei *v ein (n - r )-tupel bezeichnet, in dem genau aIle naturlichen Zahlen von 1 bis n stehen, die nicht in v = (Vh ... , vr ) vorkommen. Wir setzen wie ublich a = - iiidiii, a' = - iiid'iii, a" = - *d"*. Man hat 8 = 8' + a", a2 = a'2 = a"2 = 0, a'a" + a"a' = 0, ai> = acp, a'i> = a"cp. 8' und a" sind homogene Operatoren yom Typ (-1,0) bzw. (0, -I), d.h. 8': Ar,s -+ AT -1,8, a": AT,s -+ AT,s -1. Wir definieren nun den reellen LaplaceOperator ~ und den komplexen Laplace-Beltrami-Operator D durch ~:= da + adund D:= d"a" + a"d". 1st w

o

=

~ L gw' dz v 1\

dzu die der hermiteschen Metrik ds 2 zugeordnete

(1, I)-Form, so definiert man schlieBlich noch Operatoren L: Al-+ Al+2 und A: Al-+ AI-2 durch Lcp: = cp 1\ w und A:= (-I)I*L*. Eine Form cp heiBt primitiv, wenn Acp = 0 ist. Es gilt flir primitive (r, s)-Formen die Gleichung (2)

*Lecp = (_1)1(1+1)/2

e!

(n - I - e)!

is-rLn-'-ei>


64

fUr e = 0, I, ... , n - I und I = r + s. 1st cp E Al mit I ~ n, und gilt p-Icp = 0, so folgt cp = 0. Falls die gegebene Metrik kiihlersch ist, gilt D = !-A = d'S' + S'd'; insbesondere ist dann 0 ein reeller Operator: D
••• , Wm) --+ V IU eine weitere normale Trivialisierung, so ist 7 -1 0 7* in Xo eine unitiire Transformation, und es ist dort dhk = 0, wennhk = hk(X) die Komponentenfunktionen der Abbildungsmatrix sind. SATZ

BEWEIS. Es sei hik(XO) = Sik und 7- 1 0 7* von der Form: Wi = 2:hkWZ = wf + 2: aikvwtzv + Glieder hoherer Ordnung. Wir setzen weiter voraus, k.v daB die Koordinaten Z1, ... , Zn von Xo Null sind. Dann folgt:

=

? hik(WZ + ? akiv w1zv + ... ) (Wf + L ai;vwtzv + ... ) t.k

=

j,v

J,V

L (hik + L (aikVzV + akiVzV) + ... ) wtwt, i.k

v

d.h.

h:k{z) = hik(Z)

°

+ L(aikVzV + akiVzv) + .. '. v

Es gilt somit dh:k{xo) = genau dann, wenn hik.v(Xo): = hik.Z.(XO) = - aikv ist. 1st 7 schon eine normale Trivialisierung, so muB also dhk = gelten. Das ist auch richtig, wenn hk(Xo) nicht gleich Sik ist; denn die Transformation (hk(XO» -1 0 7 -1 0 7* ftihrt ebenfalls eine normale Trivialisierung in eine normale tiber und hat die vorausgesetzte Form. Urn die Existenz zu zeigen, dtirfen wir annehmen, daB schon hik(XO) = Sik gilt; denn das kann man ja durch eine lineare Transformation in e m(W1' ... , wm) erreichen. Es ist dann y: Wi = wf - 2: hik.v(XO)wZ· Zv fUr k.v hinreichend kleines U = U(xo) eine fasertreue biholomorphe Abbildung von U x em(wt, .. . , w~) nach U X em(w 1 , ••• , Wm), so daB 7* = 70 Y eine Trivialisierung mit den gewtinschten Eigenschaften ist. Auf Grund der Eindeutigkeitsaussage von Satz I sind Differentialoperatoren von hochstens erster Ordnung auf AT.S(V) schon dann eindeutig definiert, wenn sie in jedem Punkte Xo E X in Bezug auf eine normale . Trivialisierung invariant gegentiber unitaren Transformationen im em angegeben sind. Sei also 7: U X e m--+ VIU eine in Xo E U normale Trivialisierung, so setzen wir fUr rp E AT.S(V) und rpl U = (rpb ... , rpm) in Xo: *rp: = (iiirp1' ... , iiirpm), d'rp : = (d'rp1' ... , d'rpm) und erhalten somit globale Operatoren *: AT.S(V)--+ An-T.n-S(V*) und d': AT.S(V)--+ Ar+1.S(V). Wir definieren weiter analog zu den e-wertigen Formen: Arp = (- I )l*Dirp, rp E A'(V), S" = -*d"iii und 0 = d"S" + S"d".

°

66


1st T: U, X em ~ Vi U, eine beliebige Trivialisierung, rp E Ar.s(v) und = rpi U, = (rpl> ... , rpm), so sei d~rp,: = (d'rpy>, ... , d'rp~». 1st 0, = h,-ld~h" so wird durch (orp),: = d~rp, + 0, 1\ rp, ein globaler Operator 0: Ar.s(v) ~ Ar+l.s (V) definiert (vgl. Nakano [6]), der in normalen Trivialisierungen mit d' iibereinstimmt. Da weiter in normaler Trivialisierung S"L - LS" = id' gilt, so folgt allgemein rp,

d' = 0 = i(LS" - S"L).

1m Faile des trivialen Biindels V = X x e hat man als hermitesche Metrik die euklidische Metrik auf den Fasern und die Gleichheit Ar.s(v) = AT.S. Es stimmen aile in diesem Abschnitt definierten Operatoren mit den entsprechenden Abbildungen aus Abschnitt B iiberein. D. Wir bezeichnen mit (!) die Strukturgarbe von X und mit OT die Garbe der Keime von lokalen holomorphen r-Formen rp = L a V1 •. 'Vr dZ V1 1\ ... 1\ dz v,. Ferner bezeichne, falls V ~ X ein komplex-analytisches Vektorraumbiindel iiber X ist, Y die Garbe der Keime von lokalen holomorphen Schnitten in V und Lll(V) bzw. Llr.s(v) die (feine) Garbe der Keime von lokalen differenzierbaren Formen mit Werten in Vvom Grade I bzw. vom Typ (r, s). Wegen des Lemmas von Dolbeault ist die Sequenz d" 0-+ Y ® or -+ Llr.O(V) -+ Llr.l(V) -+ ... -+ Llr.n(v) -+ 0

fUr aile r = O, ... ,n exakt. Setzt man zr.s(v):= {rpEAr.s(V):d"rp = O} und BT.S(V):= d"Ar.s-l(V), so sind die de Rhamschen Gruppen definiert durch Hr.s(v) : = ZT,s(V)/Br.s(v), und wegen der obigen exakten Sequenz von feinen Garben gilt: Hr.s(v)

Setzt man B~'S(V),

= = O},

A~'S(V)

{rpEA~·S(V):d"rp

~

HS(X,

Y

Q9 or).

{rp E Ar.s(V):Tr rp ist kompakt in X}, Z~·S(V) = B~'S(V)

=

d"A~·S-l(V)

und

H~'S(V)

=

Z~·S(V)/

so gilt entsprechend H~'S(V) ~

Hg(X,

Y ® or),

wobei H~(X, Y Q9 or) die s-te Kohomologiegruppe von X mit kompaktem Trager und Werten in Y Q9 or bezeichnet. 1st V = X x e das triviale Biindel, so schreiben wir Hr.s und H~'s anstelle von Hr.s(x x C) und H~'S(X x C). Man hat in diesem Fall Hr.s

~

HS(X, or),

H~'s ~ H~(X,

OT).

E. 1m folgenden sei X eine n-dimensionale komplexe kahlersche Mannigfaltigkeit und V ~ X ein komplex-analytisches Vektorraumbiindel fiber X. Ferner sei X c X ein relativ-kompakter Teilbereich; der Rand ,

KAHLERSCHE MANNlGFALTlGKEITEN

67

oX von X sei glatt und beliebig oft differenzierbar. Dariiber hinaus gebe es zu jedem Punkt Xo E oX eine Umgebung U = U(xo) und eine in U beliebig oft differenzierbare streng q-konvexe Funktion p(x) mit dp(x) # 0 fUr aIle x E U, so daB X n U = {x E X:p(x) < O}. Wir setzen X also als streng q-konvex voraus. Jede andere Funktion Pl(X) mit diesen Eigenschaften unterscheidet sich von p(x) in einer Umgebung von Xo nur durch einen positiven, beliebig oft differenzierbaren Faktor. Ein komplexer Tangentialvektor im Punkte Xo E X an oX ist ein kontravarianter Vektor t = La. -;,0 + La. -;,~ , so daB fUr aIle at: = vZ. vZ.

L aa v -;,0 + L aa v -;,~ im Punkte Xo gilt: (at)(p) = O. Diese Definition ist vZv vZv unabhangig von der spezieIlen Auswahl von p. Die Menge Txo(oX, C) der komplexen Tangentialvektoren im Punkte Xo E X an oX ist ein komplex(n - l)-dimensionaler Vektorraum, der (als reeIler Untervektorraum) in dem reeIl-(2n - l)-dimensionalen Vektorraum Txo(oX) der reeIlen Tangentialvektoren im Punkte Xo E X an oX enthalten ist. Es sei schlieBlich Txo = TxoCX) der reelle Vektorraum der Tangentialvektoren in Xo an X. Dann ist eine Differentialform q> yom Grade I auf X im Punkte Xo eine alternierende I-Form auf Txo' Wir bezeichnen nun mit AT.S(V) den Untervektorraum derjenigen (r, s)-Formen auf Xmit Werten in VIX, die noch aufdem Rand oXvon X beliebig oft differenzierbar sind, d.h. es ist AT.S(V) = {cpIX:cp eine (r, s)Form auf X mit Werten in V}. 1st q> E AT.S(V), so ist-da die Einbettung id: oX eine Coo-Form yom Typ (r, s) mit Werten in VloX. Wir setzen q>loX: = id*q>. 1m Falle des trivialen Biindels schreiben wir wieder AT.S fUr AT.S(X x C). Wir sagen, daB die Beschrankung einer Form q> = L dZ V1 1\ . .. 1\ dZ vr 1\ q>Vl" 'Vr E AT.S{V) mit q>Vl" 'Vr E AO.S(V) auf dem komplexen Rand von X verschwindet, wenn fUr alle Xo E oX und alle tl> ... , ts E Txo{o X, C) der Wert q>Vl" . vr(XO)(gl , ... , ts) = 0 ist, und schreiben dafUr auch q>1 oaX = O. Es gilt fUr O-Formen q>loaX = 0 genau dann, wenn q>loX = 0 ist. Die eben definierte Eigenschaft stimmt mit dem Begriff "complex normal" in Kohn-Rossi [5] iiberein. Es gilt namlich SATZ 2. q> E AT.S(V) verschwindet genau dann al{f dem analytischen Rand Von X, wenn es zu jedem Punkt Xo E oX eine Umgebung U = U(xo) c X, eine reelle Coo-Funktion p{x) auf U mit dp(x) # 0 fUr aile x E U und X n U = {x E U: p{x) < O} und Formen zPl und zP2 aUf U vom Typ (r, s - 1) bzw. (r, s) gibt, so daft q>1 U = d"p 1\ zPl + PzP2 ;sl.

BEWEIS. Es sei Xo E oX. Nach Voraussetzung gibt es eine Umgebung U = U(xo) und eine reelle Coo-Funktionp(x) in Umit X () U = {p(x) < O}.

68


Wir k6nnen U so klein wahlen, daB man in U komplexe Koordinaten Z 1, .•. , Zn mit den folgenden Eigenschaften einfiihren kann: Xo hat die Koordinaten Zl = ... = Zn = 0, und p(z) ist (eventuell nach Multiplikation mit einer positiven Konstanten) von der Form p(z) = Zl + ZI + Glieder h6herer Ordnung. In Xo gilt cploaX = 0 fUr cp E Ar,s(v) genau dann, wenn aile Komponenten von cp(x o) den Faktor dZ I enthaIten, da die Ebene {ZI = O} die komplexe Tangente an ax in Xo bildet. Da in Xo auBerdem d"p = dZ I gilt, kann man in Xo aus cp den Faktor d"p eindeutig ausklamauf U vom Typ (r, s), so daB mern. Auf diese Weise bestimmt man ein auf Un ax die Gleichheit cp = d"p 1\ besteht. Mit = (lIp) x (cp - d"p 1\ "'1) folgt die Behauptung. Gilt umgekehrt cp = d"p 1\ "'1 + P"'2, so ist in Xo: cp = dZI 1\ "'10 d.h. cploaX = O. Man beweist weiter leicht: Fur cp E Ar,s(v) gilt cploaX = 0 genau dann, wennjur aile ex E An-r,n-s-l(V*) das Integral

"'1 "'1

r

Jox

"'2

cpl\ex

verschwindet. Hierbei bedeutet 1\ das Produkt Ar,s(v) x Ar·,S*(v*) ~ Ar+ro,s+so, das wegen der Paarung V x V* ~ C wie bei C-wertigen Formen definiert werden kann.

F.

\

Aus dem letzten Kriterium folgt unmittelbar, daB die Raume ~r,s(v): = {cp

Ar,s(v):cp = d""', '" E Ar,s-I(V)} 'Il"'S(V):= {cp E Ar,s(v):cp = 8""" '" E Ar,s+I(V), *"'loaX = O} ~r,s(v):= {cp E Ar,s(V):d"cp = 8"cp = 0, *cploaX = O} E

in Bezug auf das hermitesche Skalarprodukt (cp, "'):=

Ix

cp

1\

*"',

paarweise orthogonal sind. Wir setzen weiter Ar,s(v):= {cpEAr,s(V):*cp!oaX= *d"cploa X = O}.

Da X streng q-konvex ist, gibt es nach Kohn [4] fUr s ~ q einen beschrankten Operator N: Ar,s( V) ~ Ar,s( V), so daB fUr cp E Ar,s( V) gilt: h : = cp ONcp E ~r,s(v). Setzen wir CPI:= 8"Ncp und CP2:= d"Ncp, so folgt cp = d"CPl + d"CP2 + h mit d"CPI E ~r,s(v) und 8"CP2 E 'JY'S(V). Man hat also das Zerlegungstheorem Ar,s(v) = ~r,s(v) E8 :£l"'S(V) E8 ~r,s(v) fUr s ~ q. 1st nun cp E Ar,s(v) mit d"cp = 0 und s ~ q, so gestattet cp die Zerlegung + h mit CPo E'I)r,s(v) und h E ~r,s(v). Die harmonische Form h

cp = CPo

69

KAHLERSCHE MANNIGFALTIGKEITEN

hiingt nur von der de Rhamschen Klasse von cp abo Man erhiilt so einen Isomorphismus s

~

q.

Zur Untersuchung der Kohomologie mit kompaktem Triiger setzen wir

Der *-Operator gibt einen Isomorphismus .\)~.S(V) --+ .\)n-T.n-S(V*). AuBerdem hat man nach Serre [7] fUr q-konvexe Riiume einen Isomorphismus Hn-s(x, y* ® on-T) --+ H~(X, Y 0 OT) fUr s ;;::; n - q. Also gilt auch s;;::; n - q.

1m niichsten Teil werden wir zeigen, daB .\)~.S( V)

°

=

fUr s ;;::; n - q gilt, sofern X hyper-q-konvex und V semi-negativ ist. Insbesondere ist dann gezeigt: H~(X,

(!7)

=

0,

s;;::; n - q,

HS(X, on) = 0,

s

q.

~

2. Beweis des HauptresuItates

A. Es sei X im folgenden eine n-dimensionale komplexe kiihlersche Mannigfaltigkeit, V ~ X ein komplex-analytisches Vektorraumbtindel tiber X vom Range m, und auf den Fasern von V sei eine hermitesche Form gegeben. Ferner sei X c X ein offener relativ-kompakter Teilbereich von X mit glattem, beliebig oft differenzierbarem Rand ox. Wir wollen untersuchen, unter welchen Voraussetzungen an V und oX die Riiume .\)T.S(V) verschwinden. Dazu leiten wir zuniichst eine wichtige Gleichung her. Es sei cp E .\)T.S(V); dann folgt d'cp = i(Lo" - o"L)cp = -io"Lcp, und wegen *o"Lcp = (-IYd"*Lcp, I = r + s, d(d'cp 1\ *Lcp) = d"(d'cp 1\ *Lcp) = d" d' cp 1\ *Lcp + (- I Y+ Id' cp 1\ d"iiiLcp ergibt sich mit Hilfe des Stokes'schen Satzes (d'cp, d'cp) = i(d'cp, o"Lcp) = i

=- i

r d'cp

Jax

1\

Ix d'cp

iiiLcp

+

i

1\

J x

iiio"Lcp d"d'cp

1\

iiiLcp.

70


1 . d"0, so ergibt sich d"d'cp = Definiert man noch wie ublich X : = -2 7r1

d"(d~cp

+

0 /\ cp) = d" 0 /\ cp = 27riX /\ cp und so mit unsere

Haupt-

gleichung (3)

-(d'cp, d'cp)

=

+

27r(X /\ cp, Lcp)

°

i

r

Jox

d'cp /\ "*Lcp.

Es gilt (X /\ cp, Lcp) ~ fUr cp E Ao.S( V), s < n, wie wir im niichsten Abschnitt zeigen werden, wenn VI X ein semi-negatives Vektorraumbundel im Sinne von Nakano ist. 1m Abschnitt C untersuchen wir die Bedingungen

°

an ax, unter denen auch das Randintegral i fox d'cp /\ *Lcp ~ wird. Da stets (d'cp, d'cp) ~ gilt, mu13 dann bei semi-negativen Biindeln sogar fox d'cp /\ *Lcp = gelten. Daraus wird dann in Abschnitt D unter den folgen. gegebenen Voraussetzungen SX,S( V) =

° °

°

B. Es sei cp E AO,S( V), s < n, und if: = (X /\ cp) /\ *Lcp E An,n. 1st Xo E X ein Punkt, so konnen wir in einer Umgebung U von Xo eine Trivialisierung von V finden, die in X o normal ist. Gilt beziiglich dieser Trivialisierung cp = (CP1' ... , CPm), X = (Xii), so folgt in Xo:

if

=

2: (X /\ cp)j /\ *Lcpj 2: (Xij /\ cpj) /\ *LCPi, =

i

ifj

und daraus wegen *Lm,.

=

(_I)s<s+1)/2.

T

if

=

(_1)S<S+1)/2.

is (n - s - I)!

is Ln-s-1- . (n - s _ I)! cpj.

2: X" j,1

/\ CP' /\ ... , CPm), und i

es gilt dort: ACPi = 0 und cp;joaX = 0 fiir i das folgende Problem zu behandeln:

= ], ... ,

m. Es geniigt also,

Es sei cp E Ar.s, 1 = r + s < n, Xo E oX, und in Xo gelte Acp = 0 und cploaX = O. Unter welchen Voraussetzungen an oX gilt dann y:= i·d'cp 1\ *LcploX ~ 0 in Xo ?

Es sei peine reelle differenzierbare Funktion in einer Umgebung U von Xo mit dp # 0 und Un X = {x E U:p(x) < O}. Dann gilt Un oX = {x E U:p(x) = O}, und es ist dploX = d'ploX + d"ploX = O. Da cp auf dem analytischen Rand von X verschwindet, gibt es nach Satz 2 in einer Umbegung von Xo eine Darstellung cp = d"p 1\ if; + pif;o. Damit folgt d'cp 1\ ip = (d'd"p 1\ if; - d"p 1\ d'if; + d'p 1\ if;o + pd'if;o) 1\ (d'p 1\ ;p + p;Po), das heiBt: d'cp 1\ iploX = d'd"p 1\ if; 1\ d'p 1\ ;PloX = (-])l+ld'p 1\ d'd"p 1\ if; 1\ ;PloX, und wegen Formel (2) hat man in Xo: is - r+1 y = id'cp 1\ *Lcp = (_1)1(1+1)/2. d'cp 1\ ip 1\ W n - 1- 1 (n - 1 - I)! (5) _ is - r+1 = - ( _1)1 O.

L~1{l

Da fUr eine positive differenzierbare Funktion a die Gleichheit d'd"(ap)loaX = ad'd"ploaX besteht, ist die obige Definition unabhiingig von der speziellen Wahl von p. Aus (7) folgt unmittelbar SATZ 4. Hat X die Eigenschaft B(r, s) (d.h. gilt B(r, s) injedem Punkt Xo E oX), so ist fur jede primitive Form p E AT.S(V) mit r + s < n und ploaX = 0, deren KoejJizienten nicht siimtlich auf ax verschwinden:

i

r

Jax

d' p

/\

*Lp >

o.

Wir wollen im folgenden die Eigenschaft (iii) aus der Definition 4 niiher untersuchen. Es sei I{l = 2~Vl 2~Ul

mit A'1{l Us+1 ~

=

2:

a Vl •• 'V"P,

.. 'P

s dZVl

/\ • • • /\

dz v,

/\

dZUl

PT+1 ~

n, 2

/\ • • • /\

dzus

< ... 0.

Diese Bedingung ist jedoch nicht notwendig, wie wir jetzt am Fall (r, s) = (0, q) zeigen wollen. Da jede (0, q)-Form primitiv ist, sind die Zusatzbedingungen leer, und wir erhalten

L:

o< 2~(Jl

epla~~l" .a.12

2~p~n

< ...

0,

DEFINITION 5.

Xo E

oX genau

2 =PI O.

D .• Es sei nun g; E ~~.S(V), 1= r + s < n, g; sei primitiv, und X besitze die Eigenschaft B(r, s). 1m Faile r = 0 sei V als semi-negativ vorausgesetzt, im Faile r > 0 sei V = X x C das triviale Bunde!. Dann folgt aus un serer Hauptgleichung und den Abschnitten B und C: 0;;:; (d'g;,d'g;)

=

-21T(X 1\ g;,lip) - i

d.h. (d'g;, d'g;) = 0 und i Jox d'g;

1\

*Lg;

r d'g;

Jox

1\

*Lg;;;:; 0,

= O. Wir erhalten also

= 0,

(12)

d'g;

und wegen Satz 4 gilt mit g;; =

L iav./ldlJv

1\

df,1J,:

(13) fUr aile 1 ;;:; VI < ... < vT ;;:; n, 1 ;;:; ILl < ... < ILs ;;:; n und 1 ;;:; i ;;:; m. SchlieBlich folgt wegen d"g; = Ag; = 0 und 8' = i(d" A - Ad") noch

= O. Wir wollen zeigen, daB dann schon g; = 0 gelten muB. Dazu fUhren wir in Xo geodatische Koordinaten Zl, . . . , Zn ein, und es gelte wieder p(z) = (14)

Xl

+ Glieder

8'g;

hoherer Ordnung. Wegen (13) sind dann aile Ableitungen

o

j

= 2, .. . ,n,

j

= 1, ... , n,

(15)

76


in Xo gleich Null. Definiert man entsprechend wie im Abschnitt B die Ausdriicke la~)...•T,I'2' . ·Il., so folgt wegen fJ" q; = 0 in Xo:

d.h. flir aIle la.1 .. '.T,IIl2" ·Il., 2 ;;£ 11-2 < ... < I1-s ;;£ n, 1 ;;£ V1 < ... < Vr ;;£ n, i = 1, ... , m, verschwinden samtliche partieIlen Ableitungen in Xo. Daraus schIieBt man, da auch d"q; = 0 in Xo gilt, flir 2 ;;£ 11-1 < ... < I1-s ;;£ n:

Also verschwinden neben den Koeffizienten la.,/i auch aIle ersten partieIlen Ableitungen dieser Koeffizienten auf ax. Da das System la.,1l einem eIliptischen Operator zweiter Ordnung geniigt, namlich D, miissen aIle ,a.,1l = 0 sein, d.h. q; verschwindet identisch auf X. Wir haben also den folgenden Satz be wiesen : SATZ 6. Es sei q; E Sj~,S( V), I = r die Eigenschaft B(r, s). Ferner sei

+s

< n, eine primitive Form, X besitze

(i) V semi-negativ, falls r = 0, (ii) V = X x C,falls r > O. Dann gilt q; = O. 1st insbesondere X hyper-q-konvex, so folgt flir aIle q; E Sj~,S(V) mit s ;;£ n - q:q; = O. Zusammen mit den Ausfiihrungen in Teil 1 erhalt man das in der Einleitung angegebene Vanishing-Theorem: SATZ 7. Es sei X eine kiihlersche Mannigfaltigkeit und X ein relativkompakter offener Teilbereich mit hyper-q-konvexem Rand, q ~ I. Ferner sei Vein semi-negatives komplex-analytisches Vektorraumbundel uber X. Dann gilt

Hg(X, Y) = 0, HS(X, y* ® Qn) = 0,

s ;;£ n - q, s

~

q.

Insbesondere erhiilt man Hg(X, (9) = Ofiir s ;;£ n - q und HS(X, Qn) = 0 filr s ~ q. E.

Wir wollen Satz 6 noch etwas verallgemeinern:

SATZ 8. Es sei q; E Sj~,s, und die Bedingungen B(e - '\, s - ,\), ,\ = 0, 1, ... , min (r, s), seienfor X erfollt. Ferner geltefor alle'\: N'loaX = O. Dann ist q; = O.

KA'HLERSCHE MANNIGFALTIGKEITEN

77

BEWEIS. Wegen 8"cp = 0 ist auch 8"A"cp = N'8"cp = O. AuBerdem foIgt mit D!p = 0:8"d"A"cp = DN'!p = A"D!p = 0, und hieraus ergibt sich d(N'!p 1\ *d"A"!p) = d" A"cp 1\ *d" N'cp + (-I)IN'!p 1\ d"*d" A"cp, 1= r + s. Da der zweite Ausdruck verschwindet, erhalt man durch Integration

Nun gilt A"cploaX = O. FoIglich ist das Randintegral wegen des Kriteriums aus Abschnitt D von Teil 1 gleich NuIl, und es ist deshalb d" A"cp = 0, d.h. A"cp E Sj~-"'s-" flir aIle A. Wir fiihren nun voIlstandige Induktion nach l. Fiir I = 0 und list cp primitiv und der Satz schon bewiesen. 1st nun I > lund der Satz flir Formen yom Grade ~ I - I schon hergeleitet, so ist wegen Acp E Sj~-l.S-l die Form!p primitiv, und die Behauptung folgt aus Satz 6. Es ist den Verfassern nicht bekannt, ob Satz 8 anwendbar ist. 3. Die Sitze von Akizuki und Nakano. Beispiele A. Es sei X = X eine kompakte kahlersche Mannigfaltigkeit und Vein negatives komplex-analytisches Vektorraumbtindel tiber X. Aus unserer Hauptgleichung folgt dann -(x

1\

cp, Lcp)

~

0

fiir aile cp E SjT.S(V) = {cp E AT,S(V):d"cp = 8"cp = O}. Andererseits wurde in Abschnitt B von Teil 2 gezeigt, daB (x 1\ !p, Lcp) ~ 0 ist flir cp E AO.S(V), s < n. Da auf Grund der Gleichung (4) (x 1\ !p, Lcp) = 0 nur dann gilt, wenn!p = 0 ist, erhalten wir das Vanishing-Theorem von Nakano:

HS(X, y) = Sjo,S(V) = 0,

s < n.

B. 1st Fein negatives Geradenbiindel tiber der kompakten kahlerschen Mannigfaltigkeit X = X, so ist X eine positiv-definite globale (1, I)-Form auf X, die geschlossen ist. Man kann daher X als die einer kahlerschen Metrik auf X zugeordnete Form betrachten und erhalt mit Lcp : = X 1\ cp stets (X 1\ !p, Lcp) ~ O. Andererseits ist wegen der Hauptgleichung -(x 1\ !p, Lcp) ~ 0 flir Formen cp E SjT,S(F), d.h. L!p = O. Daraus folgt im FaIle r + s < n schon cp = O. Damit ist auch das Vanishing-Theorem von Akizuki und Nakano bewiesen: r

+ s < n.

C. Wir geben in diesem Abschnitt das in der Einleitung angekiindigte Beispiel e~nes komplex-analytischen Vektorraumbtindels an, das schwach negativ, jedoch nicht negativ ist.

78


Es sei Y eine n-dimensionale komplexe projektiv-algebraisehe Mannigfaltigkeit, 11 ~ 2. F bezeiehne das zu einem Hyperebenensehnitt geh6rende komplex-analytisehe Geradenbiindel; Fist somit positiv. Es gibt dann in der I-ten Tensorpotenz r von F, I ~ 10, schon n + 1 Sehnittflaehen So, Sb ... , Sn, die keine gemeinsame Nullstelle in Y haben. Man erhaIt deshalb einen Epimorphismus (n + 1)(!7 ----+ ----+ 0, wobei (!7 die Strukturgarbe von Y bezeiehnet. Wir tensorieren mit F- 1 und bekommen so einen Epimorphismus E: (n + l)F- 1 ----+ -1. Es sei Y der Kern von E. Die Garbe Y ist lokal frei und wird von einem n-rangigen Untervektorraumbundel V von (n + I)F - 1 geliefert. Da F -1 und mithin aueh (n + I)F - 1 negativ ist, muB V sehwaeh negativ sein. Mit HO(y, (n + l)f-l) = Hl(y, (n + l)F- 1 ) = 0 folgt aus der exak1 ----+ 0: ten Garbensequenz 0 ----+ Y ----+ (n + I)F- 1 ----+

r

r

r-

Da sehliel3lieh HO( Y, groBes, 10 erhalt man

r

-1) von Null versehieden ist fUr hinreiehend

Nach dem Satz von Nakano miiBte jedoeh Hl( Y, Y) = 0 gelten, wenn V negativ ware. / D. Zum SchluB entwiekeln wir aus dem vorigen Absehnitt ein Beispiel dafUr, daB unser Vanishing-Theorem nieht mehr richtig bleibt, wenn man die Hyper-q-Konvexitat im Faile q > I dureh die sehwachere q-Konvexitat ersetzt. Es seien Y und V wie in Absehnitt C gewahlt. Es gibt eine hermitesehe Form 2: hikWkWt auf den Fasern von V, so daB dureh {2: hikWkWi < I} eine relativ-kompakte streng pseudokonvexe Umgebung der Nullsehnittflache definiert wird. Auf den Fasern des dualen Bundels V* ist dann ebenfalls eine hermitesehe Form 2: kikw~wt gegeben, wobei (/iik) = (h t )-1, h = (h ik ), ist. Obwohl in normaler Trivialisierung beide Formen iibereinstimmen, ist die Nullumgebung {2: hikwtWi* < I} nieht mehr l-konvex. SehlieBt man jedoch die Fasern von V* zu komplex-projektiven Raumen ab und bezeiehnet man mit V* das auf diese Weise erhaltene projektive Bundel, so ist das Komplement X der oben definierten Umgebung der Nullsehnittflache beziiglich V* eine n-konvexe Umgebung der unendlich fernen. Pllnkte V = V* - V*. Wir zeigen, daB H;(X, (1}) of. 0 ist. Es ist dann das gewunsehte Beispiel in X gefunden. Es sei U = {U,:t E I} eine endliehe Steinsehe Oberdeekung von Y. Naeh Konstruktion gibt es einen Kozyklus g E ZI(U, Y), der nieht kohomolog <Xl

KA'HLERSCHE MANN/GFALT/GKEITEN

79

zu Null ist. Da die Elemente von Vy , Y E Y, Linearformen auf Vy* sind, erhalt man durch g einen Kozyklus l E Zl(U, l!J), wobei ft das Urbild der Oberdeckung U beziiglich der Projektion y* ~ Y bezeichnet. 1st l = all"}' so sind die Funktionen l'"'2 auf den Durchschnitten 0'"'2 holomorph und entIang der Fasern von y* linear; sie besitzen deshalb im Unendlichfernen eine Polstelle erster Ordnung. Es sei W R = {L,hikwtWt* > R} c c X eine Umgebung von Voo. Die Kohomologieklasse von II V* - WR UiBt sich dann nicht nach y* fortsetzen. Ware dies namlich doch der Fall, so gibt es wegen O. = U. X pn und H1( 0" l!J) = 0 ein y E Zl(U, l!J) und ein 7] E CO(UW* - WE, l!J), so daB y = l - 87] in y* - WR ist. Entwickelt man dann l, y und 7] in eine Potenzreihe nach den Koordinaten auf den Fasern und bezeichnet man mit l(l), y(1) und 7](1) den linearen Anteil, so gilt l(l) = g, y(l) = 0 und mithin g = 87](1) im Gegensatz zu der Voraussetzung. Aus dem Vorhergehenden folgt weiter, daB auch llX - W R nicht nach X hinein fortgesetzt werden kann. Wir stellen nun die Kohomologieklasse von llX - V beziiglich der Dolbeault-Isomorphie durch eine (0, I)-Form cp dar. Es sei £ eine beliebig oft differenzierbare positive Funktion auf X, die in X - WR identisch 1 ist und in einer Umgebung von V identisch verschwindet. Offenbar hat .p = d"(£cp) kompakten Trager. Gabe es eine (0, I)-Form a in X mit kompaktem Trager und d"a = .p, so hatte man d"(£cp - a) = O. 1st R > 1 hinreichend klein, so bestehtjedoch in X - W R die Gleichheit cp = £cp - a. cp ware also nach X hinein fortsetzbar. Da dies nicht der Fall ist, wie eben gezeigt wurde, reprasentiert d"(£cp) nicht die Nullklasse von H;(X, l!J). 00

00

UNIVERSITAT GOTTINGEN

LITERATUR 1. AKIZUKI, Y., and S. NAKANO, "Note on Kodaira-Spencer's proof of Lefschetz theorems," Proc. Japan. Acad., 30 (1954), 266-272. 2. BOCHNER, S., "Curvature and Betti numbers I, II," Ann. of Math., 49 (1948), 379-390; 50 (1949), 77-93. 3. KODAIRA, K., "On a differential geometric method in the theory of analytic stacks," Proc. Nat. Acad. Sci., 39 (1953), 1268-1273. 4. KOHN, J. J., "Harmonic integrals on strongly pseudo-convex manifolds, I and II," Ann. of Math., 78 (1963),112-148; 79 (1964), 450-472. 5. KOHN, J. J., and H. KOSSI, "On the extension of holomorphic functions from the boundary of a complex manifold," Ann. of Math., 81 (1965),451-472. 6. NAKANO, S., "On complex analytic vector bundles," J. Math. Soc. Japan, 7 (1955), 1-12. 7. SERRE, J.-P., "Un theoreme de dualite," Comment. Math. He/v., 29 (1955), 9-26. 8. VESENTlNI, E., "Osservazioni sulle strutture fibrate analitiche sopra una varieta kiihleriana compatta, I, II," Alii Accad. Naz. Lincei Rend. ct. Sci. Fis. Mat. Natur. (8), 23 (1957), 231-241; 24 (1958),505-512.

Iteration

c!f Analytic

Functions

c!f

Several Variables SAMUEL KARLIN AND JAMES McGREGORl

Professor Salomon Bochner has contributed notably in many areas of mathematics including the theory of functions of several complex variables and the theory of stochastic processes. In line with these interests, this paper reports results on iteration of holomorphic functions of several complex variables motivated by investigations pertaining to multi-type branching Markoff processes. Apart from its intrinsic importance and independent interest, iteration of holomorphic mappings plays a fundamental role in celestial mechanics, population genetics, numerical analysis, and other areas. Consider a vector-valued mapping fez)

(I)

z

= (fl(z)'/2(z), ... ,/p(z», = (Zlo Z2,"" zp) EZp ,

where Zp denotes the space of p-tuples of complex numbers. Suppose the mapping has a fixed point in its domain of definition and that at the fixed point the Jacobian matrix is nonsingular and all its eigenvalues are of modulus less than 1. One of the main objectives is to determine a canonical representation for the iterates (2)

fez; n)

= f(f(z;

n -

1), I),

n = 1,2, ...

fez; 0) = z

from which the complete structure and the asymptotic behavior of iterates of high order is easily ascertained. The case p = 1 has a long history and an extensive literature and has been the subject of considerable recent research, e.g., see Jabotinsky [6], Baker [1], Karlin and McGregor [8], [9] and Szekeres [17]; the last named 1 Research supported in part at Stanford University, Stanford, California, under contract NOO14-67-A-OI12-0015.

81

82

KARLIN AND McGREGOR

work contains a substantial bibliography of earlier writings on this topic. Consider a complex-valued function of one complex variableJ(z) analytic in the neighborhood of the origin and satisfying J(O) = 0 so that the origin is a fixed point. It was already known in the nineteenth century that if c = 1'(0) satisfies 0 < iel < 1, then the limit function A(z)

(3)

= lim J(z~ n) n,-..oo

c

exists, is holomorphic at z = 0 and satisfies (4)

A(f(z» = cA(z).

The function A(z) may be characterized as the unique solution of the functional equation (4) holomorphic at 0 with prescribed initial conditions A(O) = 0, A'(O) = I. If the mapping inverse to w = A(z) is z = B(w), then, from A(f(z, n»

= cnA(z),

n = 0, ± 1, ± 2, ... ,

we obtain the representation of the iterates (5)

J(z, n) = B(cnA(z».

The formula (6)

J(z; t) = B(etIOgCA(z»,

-00

< t
IA11 ;?; IA21 ;?; ... ;?; IApl > O. (Repeated A'S are to appear consecutively.) Algebraic complications may develop which do not arise in the case p = 1. These are of two kinds: (i) The canonical form must necessarily be recast whenever C possesses nonsimple elementary divisors. This situation can occur only when p ;?; 2. (ii) A more substantial modification is required whenever algebraic relations among the eigenvalues {A1> A2 , ••• , Ap} exist. An algebraic relation is an identity of the form (9)

84

KARLIN AND McGREGOR p

where r1> r2, ... , rp are positive integers, and

L

rlJ. ~ 2. When algebraic

1J.=1

relations are not present and C admits no elementary divisors, a simple diagonal canonical form prevails. Even under these circumstances a further technical obstacle pointed out by Bellman [2] and other authors is that the most straightforward generalization of the limit formula (3) is no longer forthcoming. It is correct that the first component function A 1 (z) can, in fact, be constructed in the spirit of (3), but the other component functions Av(z), v = 2, ... , p, cannot be so simply determined. The form of the general canonical representation is the content of the following basic theorem (we write THEOREM

A(z)

1. 2

UI =

vt

Ijvl for j

E

~).

Let the assumption (8) hold. There exists a mapping

= (A1(z), A 2(z), ... , Ap(z)) holomorphic at the origin such that

(10)

A(O)

=

0,

GAv' GzlJ. 2=0

= S

uv

and Av(z) satisfies afunctional relation (I I)

Av(f(z; n))=

,,~{ AvCz) + 8~V qv.in)Aiz) + Ifl~2~=AV Rv.tCn)[A}z)]f}, Ap=A v

II = 0, ± I, ± 2, ... , (v = I, 2, ... , p),

where qv.p(n) and Rv,ln) are polynomials in the variable n, [A(z)]f for j = Ul, j2, ... , jp) is an abbreviation of

The equations (l I) can be equivalently written for fez; n) by applying the inverse transformation B = A -1. Bounds on the degrees of the polynomials qv.8 and R v•f are also available. The first sum on the right of (I I) appears if C possesses elementary divisors. The second sum generally occurs in the presence of algebraic relations. The significance of Theorem 1 can be succinctly stated in the following terms. The mapping fez) satisfying (8) is conjugate to a polynomial mapping (12) 2

See the last paragraph of this paper.

ITERATION OF ANAL YTIC FUNCTIONS

85

where (13) and thus f(z) can be written in the form f(z) = A -l(h(A(z))). It is not difficult to prove that the map ~ -+ h(z) is a univalent holomorphic map of Zp onto Zp. If no algebraic relations are present then h(z) is linear and f(z) is conjugate to a linear transformation and even more specifically f(z) is conjugate to the linear mapping h(z) = Cz.

An important corollary of Theorem I is that the iterates of the mapping (2) can be embedded in a continuous one-parameter group f(z; t) of holomorphic mappings satisfying

+

= f(f(z; t), T), fez; I) = fez).

fez; t

T)

The parameter t assumes all complex values and f(z; t) is a holomorphic function of the p + I variables Zl, . . . , Zp, t and for each t is a function of z vanishing at z = 0 and holomorphic in a O-neighborhood. The embedding is not unique, but in many cases it can be made unique by imposing simple and natural auxiliary conditions. For example, in many applications the power series fv(z) of the given mapping have real (or non-negative) coefficients and the eigenvalues Av are non-negative. The embedding is then unique if it be required that the corresponding power series for f(z; t) have real coefficients for -00 < t < 00 (non-negative for 0 ;;i; t < (0). The main ideas in our method of proof will be sketched very briefly. After a preliminary linear change of variables we can assume the matrix (7) is in JOI:dan canonical form, and then it follows that If(z) I < Izl if Izvl < Pv, IJ = I, ... , P where Pv are small positive numbers in a suitable geometric progression. Then after a further linear change of variables we can assume f(z) maps the closed unit polydisc {z; Izl ;;i; I} into its own interior (contraction property). Let .?/t' be the Hilbert space consisting of all power series

g(z) =

L lU)zj jet;.

with

IIgl12

=

L IluW
0 for all i. We can now prove PROPOSITION I. Under the conditions stated above, the eigenvalues of the matrix C = I (Of./8zu)(TT) II satisfy 0 ~ lAd < 1 (i = 1,2, .. . ,p). PROOF. Introduce the vector v:;;: = (y:;;:;-, tion of the Schwartz inequality yields

y:;;:;, ... , ~). An applica-

(17) Strict inequality prevails because of the stipulations 0 (dw),

for totally finite (positive) measures 1> on the Borel subsets of the real line. In the case of a real-valued functionJ(necessarily even), (2) takes the form (3)

J(t)

=

f

cos wt 1>(dw),

and 1> may then be regarded as concentrated on the interval [0, co). In the theory of probability these functions arise most commonly as the autocovariances of stationary stochastic processes. Th us let Z = (Zt; - co < t < !Xl) be a stationary process, with E(Zf) finite, and let J(t) be the covariance of Zs and Zs+t (which is necessarily independent of s). Then J is a positive-definite function, and 1> is the "spectral measure" of the process. A particular case arises in the theory of Markov chains (for which we adopt the notation and terminology of Chung [I D. Suppose that X = (Xt ; -co < t < co) is a stationary Markov chain on a countable state space S, with (4)

7Ti

=

pilt)

=

P(Xs = i), P(Xs+t = jlXs 93

=

i),

94

J. F. C. KINGMAN

for I > 0, i,j E S. That is to say, X is a stochastic process such that, for i1> i2 , ... , in E Sand 11 < 12 < ... < tn> n

(5)

P{Xta = ia(a = 1,2, ... ,n)} =

7Til

TIPia-l,ia{ta - t a- 1 ). a=2

As is usual, the chain is assumed to be standard in the sense that, for each i E S, (6) Let a be a particular state in S (with, to avoid triviality, define a stationary process Z by

Zt (7)

1 if X t = a, 0 if X t =f. a.

= =

7Ta

> 0), and

and

The autocovariance function f of Z is clearly given by (8) Substituting this into (3), we obtain (9)

Pait)

=

7Ta

+

Jcos wt;"(dw),

with ;.. = 7T;; I. Hence the function Paa, which element of the transition matrix (10)

IS

a typical diagonal

P t = (Pij(t); i,j E S),

is positive-definite. Not every Markov transition matrix P t can arise from a stationary Markov chain; this will be possible if and only if (for some a), (II)

lim Pail) > O.

t-oo

However, Kendall [6] has shown that conclusion (9) holds quite generally even without this condition and, furthermore, that the measure ;.. is absolutely continuous. Kendall's argument uses sophisticated Hilbert space techniques, but simpler arguments exist (cf. [15], [10)). The diagonal elements PH of Markov chain transition matrices therefore form a subclass (denoted by PJ>Jt) of the class of real, positive-definite functions, and the problem arises of characterizing PJ>Jt. In a sense the answer is given by Markov chain theory, for a function P belongs to PJ>Jt

A CLASS OF POSITIVE-DEFINITE FUNCTIONS

95

if and only if there exists a family (Pjj; i, j = 0, I, 2, ... ) of functions on (0, (0), satisfying Q()

2 pj;(t) ;;; I,

j=O

(12)

and such that P = Poo. But this, although it (apparently) removes the problem from the province of the theory of probability, cannot be said to provide more than a "solution in principle." To proceed further, observe that the function Paa satisfies functional inequalities similar in form, but in addition to those defining the positivedefinite functions, which can be written as (13)

det (f(lta

-

tpl)) ~ O.

For example, if s, t > 0,

o ;;;

P(X. i= a, XHt = a) = P(XHt = a) - P(X.

= Paa(s +

=

X.+t

=

a)

t) - Paa(s)Pait),

and

o ;;; =

P(X. i= a, X.+t i= a) P(X. i= a) - P(X. i= a, X.+t = a)

= I Thus every P E (14)

fjJvi!

Paa(s) - Pais

+ t) + Paa(s)Paa(t).

satisfies

p(s)p(t) ;;; p(s

+ t)

;;; p(s)p(t)

+

1 - pes).

These inequalities, when combined with the regularity condition (15)

limp(t)

=

I,

t~O

which corresponds to (6), already imply a great deal about the function p. It follows quite easily [see 10] from (14) and (15) that P is strictly positive and uniformly continuous and that the (possibly infinite) limit (16)

q

=

-p'(O)

= lim t -1{l -

pet)}

t~Q()

exists. It may be noted that not all functions of the form (3) with/CO) = 1 satisfy (14), which is not therefore a consequence of positive-definiteness. The inequalities (14), which have often been used in Markov chain theory, are only the first of an infinite family of inequalities. For t1 < t2 < ... < tn, the probability

P{Xta i= a (a = 1,2, ... , n - I), Xta = a}

96

J. F. C. KINGMAN

may be expressed as a polynomial

in the values of Paa at the points ta - Ip (0: > (3), the exact form of which is n-l

(17)

F(t1,t 2,· .. ,ln;p) =

L= (_1)1e

Ie

0

Ie

L

np(tVj+l - IvJ 0 = Vo < v, < ... < Vk = n f =0

Moreover, n

L F(/

P{Xta of- a (0: = 1,2, ... ,n)} = 1 -

b

/2

,···,I ;Paa) T

T=l

Since these probabilities must be non-negative, it follows that every P E ;?jJj{ satisfies the inequalities n

L F(t 1, 12, ..• , IT; p) ;:;;;

(18)

1.

T=l

It should be noted that these inequalities are consequences of the fact that there exists a process Z, defined by (7), taking the values and I, and having finite-dimensional distributions determined by

°

n Paa(ta n

P{Zt" = 1(0: = 1,2, ... , n)IZo = I} =

la-1)"

a=l

°

for = 10 < 11 < ... < In. They comprise, moreover, a sufficient as well as a necessary set of conditions, in the sense of the following theorem [see 10]. THEOREM I. In order that there should exist a stochastic process (Zt; I > 0) taking the l'Olues and I and satisfying

°

n

P{Zta = 1(0: = 1,2, ... , n)} =

(19)

°

[1 ,,(ta

- la-1)

a=l

for all = to < t1 < ... < tn> it is necessary and sufficient that the function P should satisfy the inequalities (18). It therefore becomes important to study the functions satisfying the inequalities (18), which, for want of a better name, are called p-functions. If the set of all functions from (0,00) into [0, I] is given its product topology (compact by Tychonov's theorem), then (18) defines a closed subspace, so that the set of p-functions has a natural compact Hausdorff topology. A ,,-function satisfying (15) is said to be standard, and the set of standard ,,-functions is denoted by:~ It follows from the derivation of (18) that, in


97

any (standard) Markov chain, the function Paa is a standard p-function, so that (20)

It will become clear that!!J'J! is a proper, but dense, subset of &. For any P E!!J' and any h > 0, write (21)

in =in(h)

F(h,2h, ... ,nh;p),

=

so that, from (18), (22)

in ~ 0

(n ~ 1),

It follows after a Iittle algebra from (I7) that, for n

~

1,

n

(23)

penh)

=

L J,.(h)p(nh T=

rh),

1

a relation equivalent to the power series identity (24)

n~o p(nh)zn =

{I - n~/n(h)Zn}

-1,

Izl < 1 and p(O) = 1. For any sequence Un) satisfying (22), the sequence (un) defined recursively by

if

n

(25)

Uo = 1,

Un =

L j~Un-T

(n ~ 1),

T=l

is called the renewal sequence generated by (in), and the class 8f of all renewal sequences is the discrete analogue of &. Indeed, the preceding argument has, almost in its entirety, a discrete-time analogue in Feller's theory of recurrent events (see [4] and [5]) and in its application to discretetime Markov chains. From that theory we draw one result due to Chung to the effect that, for any renewal sequence (un), there exists a Markov chain (Xn; n = 0, I, 2, ... ) and a state a such that, for all n ~ 0, (26)

Un = P(Xn = alXo = a).

The corresponding assertion in continuous time is false. Comparing (23) with (25), we see that any function p E;:Y> has the property that the sequence (p(nh» belongs to !jf for all h > O. Conversely, it is possible to show that a function continuous in [0,00] with (p(nh» E!3f. for arbitrarily small h > 0 belongs to ~ This means that some properties of functions in 9! notably those referring to behavior for large t, can be

98

J. F. C. KINGMAN

deduced from the extensive theory of renewal sequences. For example, a celebrated result of Erdos, Feller, and Pollard [3] states that, if (un) is any renewal sequence for which the set {n ~ 1; Un > O} has 1 as its greatest common divisor, then lim Un exists. Hence, if p E &! lim p(nh) n~

co

exists for all h > O. Together with the uniform continuity of p, this implies that

p(oo) = lim p(t)

(27)

t~co

exists. This is a straightforward deduction, but a systematic technique for deriving asymptotic properties by the "method of skeletons" has been elaborated in [9]. If in (24) we set z = e- O\ multiply by h, and let h ~ 0 (keeping 8 > 0 fixed), the left-hand side converges to the Laplace transform

r(8) =

(28)

f"

p(t)e- ot dt

of p. The limiting behavior of the right-hand side of (24) may be ex,amined with the aid of Helly's compactness principle, and the result is the following fundamental characterization of &J: p.

THEOREM 2. If P belongs to on (0, 00] such that

(29)

{

J(O,CO]

&! there exists a unique (positive) measure

(1 - e-X)p.(dx) < 00

and such that, for all 8 > 0, (30)

{CO p(t)e-Ot dt = Jo

{8 + J(o.CO] { (1 _ e-OX)p.(dX)}-l.

Conversely, if p. is any measure satisfying (29), there exists exactly one continuous function p satisfying (30), and that function belongs to 9. In other words. (30) sets up a one-to-one correspondence between the class &J and the set of measures satisfying (29). It is important to note that (29) does not imply that p. is totally finite, although it must have p.(€, (0) < 00 for all € > O. I ndeed, an easy Abelian argument from (30) identifies the total mass of p. with the limit q in (16):

(31)

q

=

p.(0, 00].


99

Letting 0 -+- 0 in (30) gives

f'

(32)

p(t) dt = lP{oo}]-1,

so that I-'{oo} > 0 implies p(oo) = O. On the other hand, if I-'{oo} = 0, it may be shown that (33) In case q is finite, (30) may be written

r(O)={O+q- fe-Oxl-'(dx)r1 =

n~ (0

= n~ (0

+ q)-n-l{f e-Oxl-'(dX)r

+ q)-n-l f

e-oxl-'n(dx),

where I-'n is the n-fold Stieltjes convolution of I-' with itself. This inverts to give (34) where

7Tn

denotes the Poisson probability

(35) This formula may be given a probabilistic interpretation as follows. Let To < Tl < T2 < ... be random variables whose differences Tn = Tn+ 1 - Tn are independent, with distributions given by

o=

P(T n

~

x)

= I - e- qX = q-11-'(0, x]

(n even), (n odd),

for x > O. Define a process (Zt; t > 0) to be equal to I on the intervals [T2m , T 2m +d and 0 elsewhere. Then routine calculations show that Z (which is a special type of alternating renewal process) has finite-dimensional distributions given by (19), where co

p(t) =

(36)

L

P(T2m ~

t

~ T 2m +1 ).

m=O

Moreover,

P(T2m

~t~

T 2m +1)

so that (34) and (36) coincide.

=

J:

7T

m{q(t - x)}q-ml-'m(dx),

J. F. C. KINGMAN

100

The question then arises of giving a similar construction for the case q = fL(O, ro] = 00. This is a more complex problem, but the key to its solution lies in the observation that expressions similar to (30) arise in the theory of additive processes (processes with stationary independent increments). Indeed, for any fL satisfying (29), there exists an additive process ("It; t ;;; 0) (nondecreasing, and increasing only in jumps) such that (37) where

(38) Adding a deterministic drift, we obtain a process

with (39)

It has been observed by Kendall (the proof being easiest if a strong Markov version of "I ~s used) that the process defined by (40)

Zt

= =

if ts = t for some s, otherwise,

I 0

satisfies (19), where p is the function corresponding to fL in (30). Our starting point was the positive-definiteness of functions in :?/'.d, and it is therefore pertinent to remark that, as implied by the title of this article, the functions in :?/' are also positive-definite. Indeed, it is proved in [10] that, if p E ~ there exists a non-negative integrable function f such that (41)

pet)

=

p(ro)

+ LX> few) cos wI dw.

No very simple proof of this result appears to be known. Although:?/' is therefore a subset of the class of real positive-definite functions, it differs from that class in failing to be convex. This fact is important as the source of many of the difficulties encountered by the explorer of 9. It will perhaps help to consider a few examples of functions in :?/'. Suppose for instance that the measure fL is concentrated in an atom of . mass q at a single point a. Then (34) shows that [tfal

(42)

pet)

=

L

n=O

7T n

{q(t - na)}.


101

This is an oscillating function, converging to the limit p(oo) = (1

+ qa)-l.

It is differentiable except at the point a, where it has left- and rightderivatives:

According to a famous theorem of Austin and Ornstein (see [I], Section

n.I2) the functions Pij in a Markov chain are all continuously differentiable in (0,00). It therefore follows that every function in:?JJuIt is so differentiable, and therefore that (42) cannot define a member of :?JJuIt. Thus :?JJuIt is a proper subset of ~ The local behavior of the particular p-function (42) is quite typical. For any /L satisfying (29) the function (43)

met)

=

/L(t, 00]

is non increasing, right-continuous, and integrable over (0, T) for any finite T. Equation (30) may be thrown into the form (44)

r(e) = e- 1

{I + f" m(t)e- ot dt}

-1.

Expanding this formally, we obtain (45)

pet)

=

1-

s:

b(s) ds,

where 00

(46)

b(s) =

2: (_I)n-1mn(S), n=l

and mn is the n-fold convolution of m with itself. It is shown in [12] that this formal expression is always valid, that the series (46) is uniformly absolutely convergent in every compact subinterval of (0, (0), and that mn (n ;:;: 2) is continuous. Since m is continuous except for jump discontinuities at the atoms of /L, it follows that every function in :?JJ has leftand right-derivatives in (0, (0), and that (47) for all t > 0. In particular, p is continuously differentiable in any interval free from atoms of /L. Reversing the logic, the Austin-Ornstein theorem on the continuous differentiability of mem bers of 9'uIt is seen to be equivalent to the assertion that, if p E :?JJuIt, the corresponding measure /L is nonatomic in (0, (0). In

J. F. C. KINGMAN

102

fact more is true, for a result of Levy (see [13], I) implies that fL is absolutely continuous (except perhaps for an atom at 00). We shall return later to the problem of describing the measures fL which correspond to members of g.$!. For another kind of example, consider any renewal sequence (un). By the theorem of Chung quoted previously, there exists a discrete-time Markov chain satisfying (26). Denote its transition matrix by P. If e is any positive number, and I denotes the identity matrix, then Q = e(P - J)

(48)

defines the infinitesimal generator of a q-bounded Markov chain, with transition matrix Pt

=

exp {e(P - I)t}

=

e- ct exp (etP)

=

e- ct

i

(et)n pn n!

n=O 00

L

=

7T

n(et)pn,

n=O

where

7T n

is given by (35). Hence, from (26) 00

Paa(t)

L

=

7T

n(et)un.

n=O

Thus, for any (un)

E

&£ and any e > 0, the function 00

(49)

p(t) =

LU

n7T n (et)

n=O

belongs to .cJjJ.$!, and so also to &P. Not every PEg.$! is expressible in the form of (49), since (49) implies that, for instance, p'(O)

=

-e(1 -

U1)

> -00.

If, therefore, f2 denotes the class of functions (49) with (un) we have the strict inclusions

E ~

and e > 0,

(50) For any p E &l! and any integer k, the sequence (p(nk-l); n belongs to &£, and hence the function 00

(51)

Pk(t) =

L p(nkn=O

1 )7T n

(kt)

=

0, 1,2, ... )


belongs to f!JJ. If ~1' then

~2'

••.

103

are independent Poisson variables with mean t,

Pk(t) = E{p[k-l(~1

+

~2

+ ... +

~k)]}'

and the weak law of large numbers therefore implies that (52)

lim Pk(t) = pet)

k ... 00

for all t ~ O. Thus fl is dense in gJ (in the product topology), and so afortiori is gJ.,({. It is the fact that gJ.,({ is not closed in gJ that makes its identification such a delicate problem. It is of course possible that interesting topologies for gJ exist in which gJ.,({ is closed, but such topologies would need to be strong enough to prevent Pk converging to p, for all P E gJ - 9"({. If PI and P2 are two p-functions, construct as in Theorem 1 processes ZI' Z2 on distinct probability spaces 01> O2 to satisfy (19) with P = PI and P = P2, respectively. On the product space 0 = 0 1 X O2 define a new process Z by (53)

Using the product measure on 0, it is immediately apparent that Z satisfies (19), with (54)

Thus the product of two p-functions is itself a p-function, and consequently the product of two members of fY' is in f!JJ. It follows that, under the operation of pointwise multiplication, fY' is a commutative Hausdorff topological semigroup with identity. The arithmetical properties of this semigroup have been studied by Kendall [8] and Davidson [2], who have shown that it exhibits many of the properties classically associated with the convolution semigroup of probability distributions on the line. The latter is of course isomorphic to the semigroup of positive-definite functions (under pointwise multiplication) of which fY' is a subsemigroup. The fact that fY' is closed under pointwise multiplication permits the construction of further examples of p-functions. A particularly important use of this device is due to Kendall [7]. Let,\, a be positive numbers, and construct a Poisson process n of rate ,\ on (0, (0). Define Zt to be equal to 1 if no points of n lie in the interval (t - a, t), and 0 otherwise. Then, for o = to < tl < t2 < ... < tno P{Zta = 1 (a = 1, 2, ... , n)}

=

p{no points of n in Ql (ta -

a, t a)}

104

J. F. C. KINGMAN

where L is the measure of the set n

U (tlX -

a, tlX ) n (0, (0).

1X=1

It is not difficult to check that n

L

=

L min (tlX -

tlX-I> a),

1X=1

so that Z satisfies (19) with (55)

pet)

=

exp { - A min (t, a)}.

°

Thus the function (55) belongs to !1jJ for all ,\, a > (though not to !1jJj( because it is not differentiable at t = a). Since!1jJ is closed under multiplication, it contains every function of the form

pet)

=

exp {

-i~ Ai min (t, ai)}

with Ai' ai > 0. The compactness of the space of p-functions then implies that, if A is any measure on (0, 00] with

f

min (I, x)A(dx) < 00,

=

exp { -

then (56)

pet)

f

min (t, X)A(dX)}

defines an element of ~ But the functions of the form (t)

=

f

min (t, x)A(dx)

are exactly the continuous, non-negative, concave functions on [0, 00) with (0) = 0. We have therefore proved that, for all such , the function (57)

pet) =

e-(t)

belongs to ~ Notice that the set of functions of this form is a convex subset of the nonconvex set ~ The p-functions (57) playa central role in Kendall's theory, for they are the infinitely divisible elements of ~ those possessing nth roots for all n. That they have this property is obvious; to see that they are the only ones


105

let f be any function with the property that I« E 9 for arbitrarily small values of a. Then f > 0 and we may define 1> = -Iogf From (18) with n = 3,

o ~ F(tl, t 2, t 3;1«) =1«(t3) - l«(t 1 )f"(t3 - t 1 ) - l«(t 2)I«(t3 - t 2) + l«(tl)f"(t2 - t 1 )f"(13- t2) = a{ -1>(t3) + 1>(t3 - t 1 ) + 1>(t2) - 1>(t2 - t 1 )} + O(a 2). Since this holds for arbitrarily small a, we must have

1>(t3) - 1>(t3 - t 1 )

~

1>(t2) - 1>(t2 - t1 ),

which implies that 1> is concave. Among the functions of the form (57) are the completely monotonic functions, which can be written in the form (58)

p(t) =

f

e-tYII(dy)

for probability measures II on [0, (0). It is shown in [13, II] that these belong not merely to 9 but also to 9.d; they are indeed exactly the functions Paa which can arise from reversible Markov chains. Thus far the only p-functions discussed have been standard, but there are interesting examples of p-functions which fail to satisfy (15). For example, if the variables Zt are independent, with P(Zt = I) = b, then (19) is satisfied with (59)

p{t) = b

(t > 0).

Hence (59) defines a p-function whenever 0 ~ b ~ I, and this is not standard unless b = I. Since the product of p-functions is a p-function, the function (60)

p{t) = bp{t)

is a nonstandard p-function for 0 ~ b < 1 and p E 9. These nonstandard p-functions are continuous except at the origin, but there are more irregular ones. Suppose for instance that G is any proper additive subgroup of the line, and define Zt to be I with probability one if t E G, and 0 otherwise. Then (19) is satisfied with p the indicator function of G () [0, (0). If G is measurable, it necessarily has measure zero, so that p = 0 almost everywhere. In a sense these are the only two possibilities for measurable p-functions. More precisely, the following result is proved in [14]. THEOREM 3. Let p be any p-function. Then exactly one of the following statements is true: (i) p is standard,

106

(ii) pet) = (iii) (iv)

J. F. C. KINGMAN

there is a standard p-function ji and a constant 0 < b < 1 such that bji(t), p is almost everywhere zero, p is not Lebesgue measurable.

It is also worth remarking that, in case (ii), as, more obviously, in case (iv), the process Z in Theorem 1 cannot be chosen to be measurable. It has already been noted that fl' is not closed as a subset of the compact space of p-functions, and it becomes relevant to identify the closure i!) of f!IJ. Since b = lim Pn(t), n-+ co

where Pn{t) = exp {-n min (t, -n-1Iog b)}

is of the form (55), the constant p-function (59) belongs to general p-function bji(t) = lim Pn{t)ji(t) n-+

&i; as does the

00

of type (ii). The important question is therefore which p-functions of type (iii) belong to #. If in (42) we set a = I - q-1a and let q ~ 00, we obtain the p-function (61)

p(t) = 7Tt(at)

=0

(t integral) (otherwise),

which is thus a p-function of type (iii) belonging to #. On the other hand, not every p-function belongs to fl'. To see this, it suffices to note the inequality, discovered independently by Freedman and Davidson, which states that for every p E fl' and s < t, (62)

p(s) ~

t + {pet)

-

-iY /2 ,

so long as p(t) > i. This must therefore hold also for all p ticular, any p E & of type (iii) must satisfy (63)

p(t) ~

E

if In par-

i,

for all t > O. This is certainly not true of all p-functions of type (iii). It seems very likely that the upper bound i in (63) can be improved considerably. The maximum value of (61) is e- 1 (when a = t = I), and' all the evidence suggests that this is the sharp upper bound to pet) for p E & of type (iii). It will be seen, therefore, that the identification of ~ is closely involved with establishment of inequalities between the values of p, true for all p E f!IJ. These problems are extremely difficult, mainly because


107

of the lack of convexity of fJ}'; the exact form of the set {(p(l),p(2»;PEfJ}'} £ [0,1] x [0,1]

(64)

is not even known. The most important unsolved problem of the theory remains, however, the determination of the class fJ}'A of those standard p-functions which can arise from Markov chains. In the one-to-one correspondence (30), fJ}'A (as a subset of &) corresponds to a subset A' of the class of measures satisfying (29). The elements of A' are measures on (0, 00], but it is easy to show that the value of the atom at infinity is irrelevant to the question of membership of JI'. Thus attention may be restricted to those JL with JL{oo} = (that is, to recurrent states); the corresponding subset of A' is denoted by A. There are of course many ways of combining Markov chains to form new ones, and these may be used to establish structural properties of the class A. Thus it is shown in [13, I] that A is a convex cone, closed under countable sums, where these are consistent with (29). There are of course measures in A having infinite total mass, but these cause no real difficulty, since it can be shown (see [13, IV)) that every measure in A is the sum of a countable number of totally finite measures in A. The convolution of a finite (or even infinite, under suitable convergence conditions) number of finite members of .,It is again in A. At the same time it must be stressed that A is not closed in any obvious topology, since &.,11 is dense in .9. Using these properties of .A, we can at once exhibit large classes of elements of A. It is, for instance, immediately apparent that A contains any measure with density of the form

°

(65) with a mn ~ 0, so long as (29) is satisfied. On the other hand, we have already seen that measures in A must be absolutely continuous (with respect to Lebesgue measure). Moreover, it is a consequence of another result of Austin and Ornstein ([1], p. 149) that the density of any nonzero measure in A must be everywhere strictly positive, and this density may also be shown to be lower-semicontinuous. These facts go a long way toward determining the class A, and thus fJ}'A, but a complete solution remains elusive. The class fJ}'A consists, of course, of all diagonal Markov transition functions Pii> and it may be asked whether corresponding results are available for the nondiagonal functions Pij (i "# j). To describe the theory which has been developed to answer this and related questions, which depends on the concept of a quasi-Markov chain (see [11], [13]), would

108

J. F. C. KINGMAN

take us too far afield, but the answer may be stated thus. The nondiagonal Markov transition functions PH are exactly those which may be expressed as convolutions (66) where PI and P2 belong to f!JJA, a =

f'

Pl(t) dt
.•• , Xn is an arbitrary finite set of mutually orthogonal vectors in H, then the random variables .p(x l ), . • . , .p(xn ) are mutually independent. This is equivalent to the property that for every finite set of vectors Yl> ••. , Ym in H, the random variables .p(YI)' ... , .p(Ym) have ajoint normal distribution, and that E(.p(x» = 0, E(.p(x).p(y» = c<x, y) for arbitrary X and Y in H,

IRVING SEGAL

114

where c is a constant. It will be no essential loss of generality for present purposes to assume that c = 1. It may be suggestive, although entirely unnecessary mathematically, to employ the symbolism ¢>(x) ""

f

!p(a)x(a) da,

where the purely symbolic function !p corresponds to the intuitive white noise, i.e., the !p(a) are for different a, mutually independent, identically distributed, normal random variables, of mean 0, and of variance of such "infinity" that the L 2 -norm of ¢>(x) in L2 over the probability space is the same as the L 2-norm of x in L2(M). It should perhaps be emphasized that, a priori, neither !p(') nor !p(x) exists in a strict mathematical sense; !p(') is a "generalized random function" or "stochastic distribution." Nevertheless, as is well known, formal linear operations on the !p(.), including differential and integral operators, can be given mathematically effective formulation and treatment by their transformation into suitable mathematical operations on the unexceptionable object ¢>(.). These methods of treatment of linear aspects of "weak" functions break down when it is attempted to apply them to nonlinear considerations. Indeed, one might be tempted to believe that no effective notion of nonlinear function of a white noise can be given, since the appare'ntly simpler concept of a nonlinear function of a Schwartzian distribution has proved incapable of effective nontrivial development. The fact is, however, that there is an additional structure which is frequently present in the case of weak stochastic or operational functions, which is lacking in the familiar case of a numerical function, on which such a concept may be based, and indeed, the function and its powers may be represented by mathematical, i.e., nonsymbolic, strict (i.e., nongeneralized) functions on M, in these cases.

B. The linear theory of weak processes can to a large extent be pursued independently of the precise space M on which the process is a generalized function. Thus the white noise, as defined earlier, is primarily a pure Hilbert space construct, and its basic theory can proceed quite independently of whether this space is a function space, as in Wiener's theory of the "homogeneous chaos," and if so, of how it is represented. The nonlinear theory, however, depends in an essential way on the multiplicative structure' in the algebra of numerical functions on M. One could deal with this in part by introducing suitable topological linear algebras, but it is simpler, particularly in connection with the determination of an actual function on M representing the processes in question, to give a concrete treatment for

LOCAL NONCOMMUTATIVE ANAL YSIS

115

functions on the given space M, endowed with appropriate additional structure to be indicated later. The problem is that of legitimizing in a mathematically effective and formally valid way expressions of the form F(cp(a», where 'P(a) is the previously indicated symbolic white noise, and F is a given function of a real variable. It is too well known that this cannot be done in line with the usual mathematical ideas to require elaboration here; even the simplest nontrivial case F(>.) = >.2 is totally elusive, for the square of the white noise on a measure space without atoms appears as an infinite constant on the space. I shall show that if one gives up for the moment the literal interpretation of the application of the nonlinear function F to a weak function as the limit of its applications to approximating strict functions, and insists only that the resulting generalized function o/(a) '" F('P(a» should be a "local" function of 'P(a) , in an intrinsically characterized sense, whose transformation properties under vector displacements infunction space are in formal agreement with those of the intuitive function F('P(a», then one obtains essentially unique results in the crucial cases of the powers, F(>.) = >,p. These powers behave in many ways as do conventional powers, and are the limits of renormalized powers of strict functions, in a way which clarifies their nature and illustrates the natural occurrence of additive "divergences," or apparent infinities. My method is a development from one used earlier in connection with quantum fields and Brownian motion. In terms of the conventional mathematical representation x(t) of the latter, the situation may be described as follows. It was shown by Wiener that a suitable version of x(t) has the property of possessing fractional derivatives x<e)(t) of all orders e < -!-; for orders e ~ -!-, the fractional derivative exists only in a generalized sense, as a stochastic distribution for example; for e = I, the derivative is simply white noise on RI. Thus there is no problem in the definition of a power of x<e)(t) if e < !; when e = -t, a literal interpretation leads to the result that X(I/2)(t)2 = 00 with probability one; but with a suitable "renormalization," the powers of X(1/2)(t) are well-defined, finite, nontrivial stochastic distributions. For! < e < I, the powers cannot exist in the same sense as in the case e = -t (see [4]). However, they exist essentially as generalized functions ("distributions") on the probability space. This conclusion does not appear valid in the still more singular case e = I. It is shown here, by a method applicable to a variety of processes similar to white noise, that with a suitable increase in the regularity of the test functions employed, a similar conclusion holds. Due, however, to the failure of the Soboleff inequalities in an infinite-dimensional space, it is convenient to emphasize the use of Hilbert space methods, and to treat the generalized functions as sesquilinear forms on a dense domain of regular vectors in L 2 (0), where 0

116

IRVING SEGAL

denotes the probability space in question, rather than as linear functionals. The formal correspondence is this: if j is a function, the associated sesquilinear form F is given by the equation: F(g, h) = jgh. The case e = t has essentially the same singularity with respect to the formation of renormalized powers as a so-called scalar relativistic quantum process in a two-dimensional space time, as a process in space at a fixed time. The cases t < e < I similarly involve singularities quite comparable to those of n-dimensional such processes for n > 2. White noise is thus more singular than any such quantum process, as a process in space at a fixed time.

f

C. A natural mathematical category in which to treat renormalized powers of generalized stochastic processes is that of the ergodically quasiinvariant (EQI) processes. A linear map cf> from a real (topological) linear vector space L to the random variables on a probability measure space, say n, will here be called a process (it is also known as a "weak distribution," or as a "generalized random process," in the dual space to L); more precisely, it is a version of a process, the process itself being the class of all equivalent versions, relative to the usual equivalence relation (see [5]) of agreement of probabilities defined by finite sets of conditions. It is no essential loss of generality to suppose that the random variables cf>(L) form a separating collection for the space n, i.e., the minimal sigt?a-ring with respect to which they are all measurable is, modulo the ideal of all null sets, the full sigma-ring of n; and this supposition will be made throughout. Such a process is called quasi-invariant if any of the following equivalent conditions holds: (i) For every y E L* (the dual space), there exists an invertible measurable transformation on n, carrying null sets into null sets, and such that cf>(x) ~ cf>(x) + <x, y) (for suitably chosen n; otherwise the measurable transformation need not exist as a point transformation on n, but only as an automorphism of the Boolean ring of measurable sets modulo the ideal of null sets). (ii) For every y E L*, there exists a unitary operator U on the Hilbert space K = L2(n), such that U'Y(x)U -1 = 'Y(x) + <x, y)I, where 'Y(x) denotes the operator in K consisting of multiplication by cf>(x) (in general, an unbounded but always normal operator), I being the identity operator onK. (iii) There exists an automorphism of the algebra of measurable functions on n, modulo the ideal of null functions, which carries (the equivalence class of) cf>(x) into (that of) cf>(x) + <x, y).

These are various ways of asserting the "absolute continuity" of the transformations z ~ z + y in the dual space L* relative to the weak

LOCAL NONCOMMUTATIVE ANALYSIS

117

probability measure in L * corresponding to cfo; when the process arises from a conventional countably additive probability measure in L* in the natural fashion they are equivalent to absolute continuity in the conventional sense; it was in this form that absolute continuity was first shown in function space, in the work of Cameron and Martin [2]. More specifically, this work shows that the transformation g --+ g + I is absolutely continuous in C [0, 1] with respect to Wiener measure, provided I satisfies a certain regularity condition. This result is closely related to (and follows from) the absolute continuity of arbitrary vector displacements in Hilbert space, relative to the isonormal process. When L is finite-dimensional, every process corresponds to a probability measure on L*, which is quasiinvariant if and only if it is absolutely continuous with nonvanishing density function. Ergodicity is similar generalization of the ergodic property of vector displacements in Rn, i.e., the nonexistence of invariant measurable sets other than null sets or their complements. More specifically, an EQI process is a quasi-invariant one such that only the constants are invariant under all the automorphisms of the algebra of random variables induced from vector displacements in L*. Again, the ergodicity of Wiener space relative to vector translations was shown by Cameron and Martin; and this is closely related to (and deducible from) the ergodicity of the isonormal process in a Hilbert space relative to vector displacements in the space. Suppose then that cfo is an ergodic quasi-invariant process on a space L of functions x(·), so that symbolically cfo(x) '" (Xl/2 112 ), 0 may be chosen as the probability space associated with white noise in such a way that respective expectation values, on the one hand for operators in K and on the other for random variables on 0, are equal. NOTATION. For any positive operator A in a Hilbert space H, Dco(A) denotes the common parts of the domains of the e"A (n = 1,2, ... ); while [Dco(A)] denotes this set as a linear topological space in the topology in which a generic neighborhood of 0 consists of all x such that IIAkXll < E for k < ko, for some E and ko. THEOREM 2.1. Let G be a given locally compact Abelian group and B a given real, positive, translation-invariant operator in L2(G) such that B(· )-1 E Lp(G*) for all sufficiently large p, where G* denotes the dual (character) group of G. Let ('Y, K, v, r) denote the standard normal symmetric process over the Hilbert space H = L2(G). Let H denote dr(B). Then for arbitrary a E G and j = 0, I, 2, . . . there exists a sesquilinear form cp(f)(a) on [Doo(H)] such that:

(i) The map (a, u, u') ~ cp(jl(a)(u, u') is continuousfrom G x [Dco(H)] x [Dco(H)] into C 1. (ii) cp(Ol(a)(u, u') = (u, u'), cp(jl(a)(v, v) = 0 (for all a and nonzero j). (iii) If x is in Dco(B), then, denoting the sesquilinear form: (u, u') ~ cp(jl(a)(et'P(Xlu, et'P(Xlu'), as Fla, x), thefollowing relations hold when x is real and in Dco(B): F;(a, x) = F;(a, 0) Fj(a, ix)

=

k~ (i)Fj_k(a, O)x(a)k.

Furthermore these conditions uniquely characterize the cp(f)(a).

120

IRVING SEGAL

REMARK. It may be useful to indicate once again the connection with the intuitive ideas concerning the present situation. The form cp(j)(a) may be thought of as the result of "renormalizing," by suitable subtractions of "infinite quantities," the formal but mathematically nonexistent power cp(a)j. Thus cp(1)(a) is merely the white noise cp(a) at the point a, and thus in the first instance a generalized function; this generalized function is, however, when regarded as a sesquilinear form on a suitable domain in L2(Q), a strict function. Condition (i) is to the effect that this strict function as well as the strict functions corresponding to higher renormalized powers than the first, is continuous. Condition (ii) asserts, first, the foregoing interpretation of cp(j)(a) in the case j = 0, and gives, second, the key normalization condition that the expectation values of all the renormalized powers vanish. Condition (iii) then asserts, first, that the cp(j)(a) are indeed in a generalized sense functions of the white noise in that multiplication by them, acting on L2(Q), commutes with multiplication by the white noise; and, second, that these functions are indeed renormalized powers in the sense that they have exactly the same transformation properties under vector displacements as do the formal powers (i.e., the conclusion of the binomial theorem is essentially applicable to the renormalized powers). PROOF OF THEOREM 2.1. The proof is parallel to the proof of Theorem 4.1 in [7] and is based on the idea that the operator B which, occurs in that theorem plays two separate roles, one of which is here taken over by the operator I, the covariance operator of the white noise process. As shown in [II], any element g of D",(B) differs on a null set from a (unique) continuous function, and it is this continuous function which enters into condition (iii). The uniqueness part of the theorem is a corollary to results in [II] and is proved by the same method as the uniqueness for the cited theorem in [II]. The proof of the existence then proceeds by the establishment of the existence and properties of the limit of (xa)n: u, u'), as x converges to the "delta-function" on G, where xaCb) = x(a- 1 b) and u and u' are arbitrary elements ofD",(H). Lemmas 4.1 and 4.2 of [II] and the associated notation will be employed, except that the and cp of [II] have been replaced by 'Y and .p, for the present 'Y, unlike the and'Y of [II], has no special connection with the operator B. (Colloquially speaking, the 'Y is an isonormal process and has no special dynamics, while is the quantized scalar field corresponding to the differential equation " + B2 = O. In addition, at a fixed time 'YIHr has covariance operator I, while has covariance operator B-1.) The general idea of the proof may be considered to be to begin with an examination of the case of sufficiently smooth vectors in Wiener's polynomial chaos, as adapted by Anzai and Kakutani, as test functions in L2(Q). The main lemma is then the following:

and Ii - jl > r, to assume that WE Kn and w' E Kn' for certain nand n'. In this case w may be represented in the form


127

and w' may be similarly represented in terms of a symmetric function F' on G*n'. It is necessary for this reduction that c(w) satisfy the inequality: c(w) ~ const 11(1 + H)awll, for a suitable constant and exponent a, both of which must be independent of n. Since the action of ret) on w is to replace the corresponding function F by its product with exp (it L: B(p,», the computation given as Lemma 4.1 j

in [II] shows that M (w, w') is a sum of integrals of the following type, and that it suffices to bound this typical integral in the indicated fashion:

On introducing the square-integrable functions F1 and F{ as in [II] (differing from F and F' by appropriate power-products of the B(p;) and B(kl this integral takes the form

»,

Since WE D",(H), KaF1 E L 2, where K(q1,"" qn) = I + B(q1) + ... + B(qn) , for arbitrary real a. Now replace F1 by KaF1 and introduce a compensating factor of K - a at the beginning of the integrand; the integral resulting has a form to which Lemma 4.2 of [II] is applicable, with x = (k b . . . , k n- s), Y = (kn-s+ b · · · , k r ), and z = (P1'" .,Ps)' It follows that the integral in question exists if the following expression is finite and is then bounded by this expression: IIKaFl 11211F{112 x suP P1 .... PS x

[f(1 + B(P1) + ... + B(ps) + B(k1) + ... + B(kn_s»r a

[Q B(ki)r1j;C~

B(ki) -

i=n~+l B(k )1\~(k1 + ... + k r)i2dk. l)

The quantities IIKaFd2 and IIF{112 are identical with

11(1 + H)awll and

I w' II, within factors dependent only on nand n', which scale as in

[II] in such.a way that it remains only to show that the indicated supremum over Pb ... , Ps is finite and bounded independently of nand n'. Since B(p) ~ 0

128

IRVING SEGAL

for all p, this supremum is bounded by the corresponding integral with all B(p) replaced by O. The resulting integral

f(l +

B(k 1 )

+ ... + B(k n _ s»-2a[Q B(kl)r 1 x

j;C~ B(k

i=n~+l B(k »)1

i) -

2

l

Ii(k 1

+ ... + kT)i2 dk

is no longer dependent on n or n', so that it suffices to show that it is finite. To treat this integral, set n-s

d(kl' ... , kT) =

L B(k

i= 1

e(kb ... , k T) =

l ),

I

B(ki ),

i=n-s+ 1

and consider separately the integrals over the two regions, I: Ie - dl ~ e12, and II: Ie - dl < e12. Since / is bounded and 1/(1)1 = O(I/I-T) for III ~ 00, 1/(/)1 ~ C(l + II/)-T for a suitable constant C. In particular, I/(d - e)1 ~ C(l

+

Id - el)-T ~ 2Ce- T in region I ~ C in region II.

The integral over region I is then bounded by 2C

f(l +

B(k 1 )

+ ... + B(k n _ s»-2a[DB(ki x

)r

(=n~+l B(ki»)

1

-2T

li(kl

+ ... + kT)i2 dk.

Since the arithmetic mean dominates the geometric mean,

and (1 + B(k 1 ) + ... + B(k n _ s»-2a may be similarly dominated. Thus the integral in question is dominated by

f( 0 B(k

)-[2(aIT)+ll(

n-s

C"

i)

T

i=JJ+l B(k i )

)-3 li(kl + ... + kT)i2 dk.

If a ~ r, this integral is finite; for, taking, as suffices, the case a = r, it has the form (*)

f

By Fubini's theorem, this is the same as C" (N * N * ... * N)lgI2, the inner product of the r-fold convolution of N with itself, with Ig12, in the


129

sense that if this convolution is finite a.e. and the indicated inner product with Igl2 is finite, then the two integrals are the same. By hypothesis, N E Lp(G*) for all p > 1, so that the indicated convolution exists and is again in this space, by the Hausdorff-Young Theorem. On the other hand, Igl2 is also in all Lp(G*), under the indicated assumption on g, so that the inner product is indeed finite. It remains to consider the integral over region II. In this region, e < 2d; the integration over k n- H 1> ••• ,kr therefore contributes at most const.dconst., if 1. Igl, and the B(k j ) -1 for j ~ n - s + 1 are replaced by constants which bound them. The resulting integral over the k1' ... , k n - s is then bounded by one of the form: const. d- 2(alr)+const. dk 1 •• ·dkn _ .. which is finite if a is chosen sufficiently large. Taking the greater of the two a's involved, the required inequality has been established.

J

REMARK 3.1. The exponent r in the required decay for 1. while possibly best possible in the generality of Theorem 2, is not best possible in the relativistic case. Indeed, an appropriate specialization of the foregoing argument shows that in the case r = 2, the conclusion of Theorem 3.1 is valid if only 1/(1)1 = O{llI-l). More specifically, the argument is the same except that because of the slower decay for 1. N(k) must be taken as B(k)-2. In this relativistic case, this is in Lp for p > t; the two-fold convolution of such a function with itself is in Lq for q > 3 and hence has finite inner product with Ig12. This shows that :4>(x, t)2:g(X) dx is, as a function of t, and as a Hilbert-space operator, a distribution of first order; and it is known that it is not a distribution of zero order, i.e., a strict function. Similar considerations are applicable to higher powers in the relativistic case. If, for example, r = 3 and 1/(/)1 = 0(1/1- 2), the method given shows that the conclusion of Theorem 3.1 is implied by the convergence of the integral (*) with N(k) taken as B(k)-7/3. In the relativistic case, this function is in Lp for p > ~ (in four space-time dimensions); the three-fold convolution with itself of a function in this case is in Lq for q > 3; it therefore has a finite inner product with Ig12.

J

COROLLARY 3.1. If, in addition, f is a real even function (or a translate of an even function), then the operator indicated in Theorem 3.1 has a selfadjoint extension. PROOF. The case of a translate of an even function reduces to that of an even function via transformation with ret'), t' being the translation in question. It is known (and easily seen) that there exists a unique conjugation J on K such that Jv = v and J4>(x, t)J- 1 = 4>(x, -t); whenfis even and g is real, the cited operator is real relative to J, and, being densely defined, has a self-adjoint extension.

130

IRVING SEGAL

REMARK 3.2. In colloquial language, Corollary 3.1 and Remark 3.1 imply that if rp and .J; are independent scalar quantum fields (in fourdimensional space-time), with periodic boundary conditions in space, and if J(t) = I - Itiel for It I < e and is 0 otherwise, then J:rp(x, t).J;(x, t)2: J(t) dx dt has a self-adjoint extension as a conventional operator on the domain Doo(H). The same is perhaps true for "quantum electrodynamics," which has a somewhat similar "interaction Hamiltonian," but this has not yet been verified. The cited functions J(t) are particularly convenient for generalized Riemann product integration of the type indicated earlier, for a simple sum of translates can represent a function identically one in an arbitrarily large interval; similar functions are available for higher values of r. MASSACHUSETTS INSTITUTE OF TECHNOLOGY

REFERENCES I. BOCHNER, S., Harmonic Analysis and the Theory of Probability. Berkeley, Cal.:

University of California Press, 1955. 2. CAMERON, R. H., and W. T. MARTIN, "Transformations of Wiener integrals under translations," Annals of Mathematics (2), 45 (1944), 386-396. 3. KRISTENSEN, P., Tempered Distributions in Functional Space, Proceedings of the Conference on Analysis in Function Space, W. T. Martin and I. Segal, eds. Cambridge, Mass.: M.I.T. Press, 1964, Chap. 5, pp. 69-86. 4. SEGAL, I., "Transformations in Wiener space and squares of quantum fields," Advances in Mathematics 4 (1970), 91-108. 5. - - - , "Tensor algebras over Hilbert spaces, II," Annals of Mathematics, 63 (1956), 160-175. 6. - - - , "Tensor algebras, I," Trans. Amer. Math. Soc., 81 (1956), 106-134. 7. - - - , "Nonlinear functions of weak processes, 1 and II," Journal of Functional Analysis 4 (1969), 404-456, and in press. (These articles are referred to as "I" and "II".) 8. - - - , Local Nonlinear Functions of Quantum Fields, Proceedings of a Conference in Honor of M. H. Stone. Berlin: Springer (in press).

Linearization

if the Product if

Orthogonal Polynomials RICHARD ASKEYl

In [3] Bochner showed that there is a convolution structure associated with ultraspherical polynomials which generalizes the classical V convolution algebra of even functions on the circle. Bochner uses the addition formula to obtain the essential positivity result. In [18] Weinberger shows that this positivity property follows from a maximum principle for a class of hyperbolic equations. Hirschman [8] has dualized this convolution structure and has proven the required positivity result by means of a formula of Dougall which linearizes the product of two ultraspherical polynomials. We will prove a theorem which gives most of Hirschman's results as well as other positivity results which were not previously known. In particular we obtain a positivity result for most Jacobi polynomials. In view of the known duality for the classical polynomials as functions of n and x, this suggests strongly that the Bochner-Weinberger result can be extended to Jacobi polynomials. This is the only missing step in proving the positivity of some Cesaro mean for most Jacobi series, and this result can then be used to solve some D' convergence problems for Lagrange interpolation. The theorem we prove is concerned with the problem of when an orthogonal polynomial sequence Pn{x) satisfies (1)

Pn{X)Pm{X) =

nim

(XkPk{X),

k=ln-ml

Any sequence of orthogonal polynomials satisfies a recurrence formula (2)

XPn(X) = (XnPn+1(X) + fJnPn{X) + YnPn-l(X),

P-l{X) = 0, fJn is real, (XnYn+l > O. We normalize our orthogonal polynomials Pn(x) by Pn{x) = xn + .. '. Then (2) takes the form (3) 1

Pl(X)Pn(x) = Pn+1(x) + anPn(x) + bnPn-l{X), Supported in part by NSF grant GP-6764. 131

132

RICHARD ASKEY

In order to have (1) we must have an ~ O. b n > 0 is a general property of orthogonal polynomials. Our main theorem is 1. If(3)holdsforn = 1,2, ... , an bn, then (1) holds for n, m = 0, 1, ... .

THEOREM

bn+l

~

~

O,b n > Oandan+1

~

an,

By symmetry we may assume m ;:;; n and we will prove the theorem by induction on m. Since it holds for m = 1, n = 1,2, ... we may assume it holds for m = 1,2, ... , I and prove it for m = 1+ 1, 1< n. We have Pl+l(X)Pn(X) = Pl(X)pzCX)Pn(x) - aIPI(x)PnCX) - bIPI-l(X)Pn(x) = PI(X)Pn+ leX) + anPI(X)Pn(X) + bnPI(X)Pn -leX) - aIPzCX)Pn(X) - bIPI-l(X)Pn(X) = PI(X)Pn+l(X) + (an - al)PI(X)Pn(X) + (b n - bl)PI(X)Pn-l(X) + hz[PI(X)Pn-l(X) - PI-l(X)Pn(X)]' Since an - al ~ 0 and bn - b l ~ 0 and b l > 0 by the induction assumption we are finished if we can take care of the last term. Using (3) again, we see that PI(X)Pn-l(X) - PI-l(X)Pn(x) = Pn-l(X)[P1(X)PI-l(X) - al-1PI-l(X) - bl- 1PI-2(X)] I

- PI-l(X)[Pl(X)Pn-l(X) - an-lPn-leX) - bn- 1Pn-2(X)]

= (an-l - al-l)Pn-l(X)PI-l(X) + bl- l x [PI-l(X)Pn-2(X) - Pn-l(X)PI-2(X)]

+ (b n - l

-

bl - l)PI-l(X)Pn-2(X),

Continuing in this fashion we have terms that have positive coefficients except possibly for the last one Pl(X)Pn-I(X) - Pn-l+l(X), But we use (3) again to see that this term is an-lPn-leX) + bn-IPn-l-l(X) and an-I ~ 0, b n -I > O. This completes the proof of Theorem 1. The Charlier polynomials (see [6]) normalized in this way satisfy

a> 0, and Theorem 1 is immediately applicable. Similarly, Theorem 1 is applicable to Hermite polynomials since (5)

Here Hn(x) = 2nx n + ... ; thus the normalized polynomials Kn(x) satisfy (6)

ORTHOGONAL POLYNOMIALS

For Laguerre polynomials recurrence formula (7) (n

+

I)L~+1(x)

- (2n

L~(x)

we let

+a+

I -

133

Q~(x) = (-l)nL~(x)/n!

x)L~(x)

Then the

+ (n + a)L~_l(X)

=

0

becomes Q~(x)Q~(x) =

(8)

+ 2nQ~(x) + n(n + a)Q~_l(x),

Q~+1(x)

and Theorem 1 applies if a > - I. The Meixner polynomials CPn(x) o < y < I satisfy (9)

-x(l - y) CPn(x) = (n y

=

+ f3)CPn+l(X)

F( -n, -x, f3; I - I/y), f3 > 0,

- (n

+ ~y +

f3)CPn(X)

+ ~CPn-l(X), y

See [12]. If we normalize these polynomials accordingly

( ) _ n+ K n(X) -- [I _(f3)n (I/y)]n CPn x - X .. " then (9) becomes

Kl(X)Kn(x) = Kn+1(x)

(10)

+

n(l

+ y)

(l _ y) Kn(x)

+

ny(n + f3 - I) (I _ y)2 Kn- 1 (x)

and the assumptions of Theorem I are satisfied for 0 < y < I, f3 > O. We n9w consider the most important special case, that of the Jacobi polynomials p~CX.8)(X). They satisfy

P 0, I-'n+ 1 > 0, n = 0, I, ....

From (19) it is easy to see that ( _l)n Qn(X) = A A An o

thus

1···

-1

Xn

+ ... ;

ORTHOGONAL POL YNOMIALS

137

is normalized in the same way we normalized our polynomials. (19) then becomes R1{x)Rn{x) = Rn+l{x) + (An +

(20)

P-n

+ Ao)Rn{x) + P-nAn-1Rn-l{X),

A sufficient condition that the coefficients in (20) satisfy the hypothesis of Theorem 1 is that An+l ~ An > 0, P-n+l ~ P-n > O. In terms of birth and death processes this condition says that the rate of absorption from state n to state n + 1 which is given by An does not decrease with n nor does the rate of absorption from state n to state n - 1 which is given by /Ln. We conclude with some references to earlier work on Theorem 1. For Legendre polynomials (1) was stated by Ferrers [7] and proofs were given shortly thereafter by a number of people. Dougall [4] stated (I) for ultraspherical polynomials and a proof was first given by Hsii [9]. For Hermite polynomials (1) was given by Nielson [15] and for Laguerre polynomials the ak were first computed by Watson [17] and given in a different form so that it was obvious that (1) holds by Erdelyi [5]. Hylleraas [10] proved (l) for Jacobi polynomials for a = 13 + 1 and in unpublished work Gangolli has remarked that (1) holds for a = k, 13 = 0; a = 2k + 1, 13 = 1; a = 7,13 = 3, k = 1,2, .... The case 13 = -t, a ~ 13 follows from the case a = 13 in a standard way. Thus we have new proofs of (1) in all the cases that have previously been established except a = 13 + 1, -1 < 13 < -t and -t < a = 13 < t. However, in all the cases except the ones considered by Gangolli, ak was explicitly found, and it is sometimes necessary to have it exactly to use (I). ak can be computed explicitly in the case of Jacobi polynomials, but it seems impossible to use it in the form that it has been found [14, (3.7)]. In a preprint just received, G. Gasper has proved (1) for Jacobi polynomials, PAa,{J){x), a + 13 + 1 ~ 0, (X.

~

13.

UNIVERSITY OF WISCONSIN MADISON

REFERENCES 1. ASKEY, R., "On some problems posed by Karlin and Szego concerning orthogonal polynomials," Boll. U.M.I., (3) XX (1965), 125-127. 2. ASKEY, R., and S. WAINGER, "A dual convolution structure for Jacobi polynomials," in Orthogonal Expansions and Their Continuous Analogues. Carbondale, III.: Southern Illinois Press, 1968. pp. 25-26. 3. BOCHNER, S., "Positive zonal functions on spheres," Proc. Nat. A cad. Sci., 40 (1954),1141-1147. 4. DOUGALL, J. "A theorem of Sonine in Bessel functions, with two extensions to spherical harmonics," Proc. Edinburgh Math. Soc., 37 (1919),33-47. 5. ERDELYI, A., "On some expansions in Laguerre polynomials," J. London Math. Soc., 13 (1938), 154--156.

138

RICHARD ASKEY

6. ERDELYI, A., Higher Transcendental Functions, Vol. 2. New York: McGraw-HilI, Inc., 1953. 7. FERRERS, N. M., An Elementary Treatise on Spherical Harmonics and Subjects Connected With Them. London, 1877. 8. HIRSCHMAN, T. I., Jr., "Harmonic Analysis and Ultraspherical Polynomials," Symposium on Harmonic Analysis and Related Integral Transforms. Cornell, 1956. 9. HsO, H. Y., "Certain integrals and infinite series involving ultraspherical polynomials and Bessel functions," Duke Math. J., 4 (1938), 374-383. 10. HYLLERAAS, EGIL A., "Linearization of products of Jacobi polynomials," Math. Scand., 10 (1962), 189-200. 11. KARLIN, S., and J. L. MCGREGOR, "The differential equations of birth and death processes and the Stieltjes moment problem," Trans. Amer. Math. Soc., 86 (1957),489-546. 12. - - - , "Linear growth, birth and death processes," J. Math. Mech., 7 (1958), 643-662. 13. KARLIN, S., and G. SZEGO, "On certain determinants whose elements are orthogonal polynomials," J. d'Anal. Math., 8 (1961), 1-157. 14. MILLER, W., "Special functions and the complex Euclidean group in 3-space, II," J. Math. Phys., 9 (1968),1175-1187. 15. NIELSEN, N. "Recherches sur les polynomes d'Hermite," Det. Kgl. Danske Viden. Selskab. Math. fys. Medd. I, 6 (1918). 16. SZEGO,G. Orthogonal Polynomials. Amer. Math. Soc. Colloq. Pub. 23. Providence, R.T.: American Mathematical Society, 1959. 17. WATSON, G. N., "A note on the polynomials of Hermite and Laguerre," J. London Math. Soc., 13 (1938), 29-32. 18. WEINBERGER, H., "A maximum property of Cauchy's problem," Ann. of Math., (2) 64 (1956),505-513.

Eisenstein Series on Tube Domains WALTER L. BAILY, JR.!

Introduction We wish to prove here that under certain conditions the Eisenstein series for an arithmetic group acting on a tube domain [8], [9] in em generate the field of automorphic functions for that group. The conditions are that the domain be equivalent to a symmetric bounded domain having a O-dimensional rational boundary component (with respect to the arithmetic group) [3, Section 3] and that the arithmetic group be maximal discrete in the (possibly not connected) Lie group of all holomorphic automorphisms of the domain. More precisely, under the same conditions, we prove that certain types of polynomials in certain linear combinations of the Eisenstein series generate a graded ring ffi such that the graded ring of all automorphic forms of weights which are multiples of a certain integer with respect to the given arithmetic group, satisfying a certain "growth condition at infinity," is the integral closure of ffi in its quotient field. The objective in proving this result is that by using it one can prove that the field generated by the Fourier coefficients of certain linear combinations of Eisenstein series is also a field of definition for the Satake compactification of the quotient of the domain by the arithmetic group [3]; when that field is an algebraic number field, and especially when it is the rational number field, one may hope for arithmetic results to follow. Our methods here have been adapted largely from the proof of this result in a special case (see [12]). We now describe our results in more detail. Let G be a connected, semisimple, linear algebraic group defined over the rational number field Q. Let R be the real number field. We assume at the outset that Gis Q-simple, that G~ (the identity component of GR ) is centerless (i.e., has center reduced to {e}) and has no compact simple factors, and that if K is a maximal compact subgroup of G~, then l = K\G~ is a Hermitian symmetric space. (Later it will be easy to relax 1 The author wishes to acknowledge support for research on the subject matter of this paper from NSF grant GP 6654.

139

140

WALTER L. BAIL Y, JR.

these assumptions somewhat.) Then Gis centerless (see [3, Section 11.5]), the absolutely simple factors of G (all centerless) are defined over the algebraic closure of Q in R, and we may write (see [3, Section 3.7]) G = fJlk/QG', where G' is absolutely simple and k is a totally real number field. Moreover, x is isomorphic to a symmetric bounded domain D in em. We make the further assumption that x is isomorphic to a tube domain (1)

where sr is a homogeneous, self-adjoint cone in Rm; according to [8, Sections 4.5, 4.9, and 6.8 (Remark 1)], this can also be expressed by saying that the relative R-root system R~ of G does not contain the double of any element of R~' i.e., that R~ is a sum of simple root systems of type C. In [3, Section 3], we have defined the concept of "rational boundary component of D" and have proved that a boundary component F of D is rational if and only if the complexification P of N(F) = {g E G~IFg = F}

is defined over Q. We now add the assumption that there exists a rational boundary component Fo of D such that dim Fo = O. Then x may be identified with the tube domain :t in such a way that every element of N(Fo) acts by a linear affine transformation of the ambient'vector space em of:t and such that every element of the unipotent radical U = U(Fo) of N(Fo) acts by real translations. If F is any boundary component of D, let Nh(F) be the normalizer of N(F) in the group Gh of all holomorphic automorphisms of x; in particular, let Nh(Fo) = N h. Let r be an arithmetic subgroup of Gh ; i.e., r n Gz is of finite index in r and in Gz . Put r' = G~ n r and let r 0 = r n N h • Clearly r' is a normal subgroup of r. If g E Gh and Z E :t, letj(Z, g) denote the functional determinant of gat Z. We shall see that for y E r o,j(z, y) is a root of unity. Let GQ = GQ n G~ and if a E GQ, let r o•a = r n aNha- l and let la be the least common multiple of the orders of all the roots of unity which occur in the form j(*, y) for y E a-l r O.aa. Then for any sufficiently large positive integer I, divisible by la, the series (2)

EI.aCZ)

=

2:

j(Z, yaY

vef'/f'o.a

converges absolutely and uniformly on compact subsets of:t and represents there an automorphic form with respect to r. Let G~ be the normalizer in Gh of GQ. We shall see that r c G~. Let NQ = Nh n G~. Then we shall also see that G~ is the disjoint union of a finite number of double cosets raNQ , where a runs over a finite subset A

EISENSTEIN SERIES ON TUBE DOMAINS

141

of G~. For each a E A, let Ca be a complex number and denote the assignment a f-+ Ca by c. We put (3)

El,c =

2: c~, El,a'

aEA

Our main interest will be for the case when Ca # 0 for all a EA. Let :iff' and let $* be the Satake compactification (see [3]) of~. The main result of this paper is the following theorem:

~ =

THEOREM. Let f' be a maximal discrete arithmetic subgroup of Gh , let other notation be as above, and fix the mapping c such that Ca # 0 for all a E A. Then every meromorphic function on ~* can be expressed as the quotient of two isobaric polynomials in the Eisenstein series {El,choll> I > 0, where 10 is the I.c.m. of all la for a E A. If B is a basis for the module of isobaric polynomials of sufficiently high weight in these, then the elements ofB may be used as the coordinates of a well defined mapping 'Y of~* into complex projective space. The variety;ru = 'Y(~*) is birationally equivalent to ~*, ~* is the normal model of;ru, and if k is a field of definition for '.ill (e.g., the field generated by all Fourier coefficients of all elements of B), then there exists a projective variety defined over k which is biregularly equivalent

to~*.

1. Notational conventions

In what follows, if H is a group, g, an element of H, and X, a subset of H, let 1/ X = gXg-1. If P is a subgroup of H, the phrase "P is self-normalizing" will be used to mean that P is its own normalizer in H. If H is topological, HO will denote the component of H containing the identity. Use C and Z to denote the complex numbers and rational integers, respectively. 2. Tube domains

In this section only, G is a centerless, connected, simple linear algebraic group defined over R. With notation as in the introduction, the space £ = K\G~ is isomorphic to a tube domain given by (l). The cone ~ may be described as the interior of the set of squares of a real Jordan algebra J with Rm as underlying vector space. We denote by N the Jordan algebra norm in J. We have [Gh:G~] = 1 or 2, and the domains included under each case are described in [3, Section 11.4]. In every case, G~ contains an element 0' which acts on ~ by a(Z) = -Z -1 (Jordan algebra inverse), and j(Z,O') = ± N(Z)-V for a certain positive integer v. In each case where [Gh : G~] = 2, Gh contains an element T of order two, not in G~, such that

142

WALTER L. BAILY, JR.

T operates on :t by a linear transformation of em. Hence, j(Z, T) = ± I (a constant function of Z). Let RT be a maximal R-trivial torus of G with relative R-root system R~ and simple R-root system Rd. Then R~ is of type C and we may speak of the compact and noncompact roots in R~ (see [3, Section I D. Precisely one simple root ct is noncompact. Let S be the I-dimensional subtorus of RT on which all simple R-roots vanish except ct. The centralizer .?l'(S) of S and the positive R-root groups in G generate a maximal R-parabolic subgroup P of G. Let P l = P n G~. Then P l = N(Fo) for some O-dimensional boundary component Fo of (the bounded realization of) I, and I may be identified with :t in such a way that every element of P l acts by a linear affine transformation on the ambient affine space em of :t and the unipotent radical U of PI> by real translations. Using the Bruhat decomposition relative to R, one may prove that if g E G~, then g E P l is also necessary in order for g to act by a linear transformation of em. Direct calculation shows that T normalizes P l in each case where [Gh:G~] = 2, and since P l is self-normalizing in G~ (see [3, Section 1.5(1)]), it follows that T and P l generate the normalizer Nh of P l in G h and that Nh = P l U TP l • Hence, j(Z, g) is a constant function of Z if and only if g E N h ; and N(Fo) = Nh n G~. Moreover, U E f(RT), and viewed as an element of the relative R-Weyl group of G, sends every R-root into its negative. Then if g E .?l'CaT)u, we have j(Z, g) = c·N(Z)-V, where c is a constant, since .?l'CaT) c: .?l'(S) acts linearly on :t and in particular each element of it mUltiplies the norm function N by a constant.

3. Relative root systems We now lift the assumption that G be absolutely simple and return to the general assumptions of the introduction. Then G = ~k/QG', where k is a totally real algebraic number field and G' is simple and defined over k. According to [3, Section 2.5], we may choose a maximal Q-trivial torus QT, a maximal R-trivial torus RT, and a maximal torus Tin G such that T is defined over Q and QT c: RT c: T. Then for suitable maximal k-trivial torus kT', R-trivial torus RT', and maximal k-torus T' in G' we have kT' c: RT' c: T', QT c: ~k/Q(kT'), T = ~k/QT'. We take compatible orderings (see [3, Section 2.4]) on all the root systems. If P is a Q-parabolic subgroup of G and U is the unipotent radical of P, then P = ~k/QP' and U = ~k/QU', where P' is a k-parabolic subgroup of G' and U' is its unipotent radical; if P is maximal Q-parabolic, then P' is maximal k-parabolic, and even maximal R-parabolic in G'. If ex E R~' then the restriction of ex to QT is either zero or a simple Q-root; in the latter case, ex is called critical; each simple Q-root is the restriction of precisely one


143

critical R-root for each irreducible factor (see [3, Sections 2.9, 2.10]). If F is a rational boundary component, then N(F)c = P is a maximal Qparabolic subgroup of G (see [3, Section 3.7]). Until further notice, let F be the O-dimensional rational boundary component Fo of the introduction, let P = N(F)c, and let V be the unipotent radical of P. We may assume the relationship between P, QT, and QLl to be that P contains the minimal Q-parabolic subgroup of G generated by fZ(QT) and the unipotent subgroup of G generated by the positive Q-root spaces. By [5, Section 4.3], this determines P within its GQ-conjugacy class. Since dim F = 0, it follows that the noncompact simple root (Ii in each irreducible factor Gi of G is critical, that Q~ is of type C, and, using the latter fact to define the compact and noncompact Q-roots, that each (lj restricts onto the noncompact, simple Q-root ~. It follows from this that the noncompact positive (respectively noncom pact negative, respectively compact) R-roots are those in whose restriction to QT, expressed as a sum of simple Q-roots, ~ appears with coefficient + I (respectively - I, respectively 0). If S denotes the I-dimensional subtorus of QT annihilated by all simple Q-roots except ~, then P is generated by fZ(S) and U. We have RT c fZ(QT) c fZ(S), and all elements of fZ(S) n G~ act by linear transformations on %. Let g E .AI(QT)R represent the element of the relative Weyl group QW which sends every Q-root into its negative. Then g normalizes fZ(QT) and sends RT into another maximal R-trivial torus of fZ(QT). Hence, we may find h E fZ(QT)R such that gh E .AI(RT), and we may assume gh E G~ because RTR meets every component of GR (see [5, Section 14.4]). Replace g by gh. By our preceding considerations, it is clear that Ad g transforms every positive noncompact R-root into a negative one. On the other hand, for each irreducible factor Gj of G, let O'i be the element of GPR sending Zj into - Zj- 1, and let 0' = O'j. Then Ad 0' sends

n i

every R-root into its negative. Hence, O'g normalizes and so belongs to Pl' It follows thatj(Z, g) = const· Ni(Zj) -Vi, where Ni is the Jordan algebra

n i

norm for the ith irreducible factor, Zi is the component of Z in the ith irreducible factor of %, and Vi are certain strictly positive integers.

4. The Bruhat decomposition

. Retaining the notation of Section 3, let N + denote the unipotent subgroup of G generated by the subgroups of G associated to the positive Q-roots. Let N- be the similarly defined group associated to the set of negative Q-roots. Of course, N~ is connected and therefore Net c G~. Let G&

144


denote the subgroup of GQ generated by all the groups gNet for g E GQ (see [13]). Obviously G& c G~. The main facts we need are the following (see [13, Section 3.2 (18) et at]): (i) The group GQ can be written as the disjoint union of the sets (4)

(Bruhat decomposition),

where w runs over a complete set of representatives of the co sets of .2'(QT)Q in %(QT)Q, and (ii) GQ = .2'(QT)Q· G&. It follows from (ii) that each of the elements w appearing in (i) can be taken in G& c G~. Since Po = .2'(QT)· N+ is a minimal Q-parabolic subgroup of G, it follows from the remark just made and from [5, Section 4.13] that if two Q-parabolic subgroups of G are conjugate, then they are conjugate by an element of G& c GQ. Let r be an arithmetic subgroup of G~. Since G~ has no compact simple factors, r is Zariski-dense in G. Let d(r) be the algebra of all finite linear combinations with rational coefficients of elements of r (viewed as matrices from the matrix representation of G). In the Bruhat decomposition (4), if w E %(QT) is such that Ad weN + ) n N - has dimension strictly smaller than that of N +, then the Zariski closure of Netw.2'(QT)QNet in G is a proper algebraic subset of G. Since Q~ is of type C, there is one element Wo of QWQ which, sends every positive Q-root into its negative; thus, Ad wo(N+) n N- has the same dimension as N+; and Wo is the only element of QWQ for which this is true. Therefore, since r' is Zariski-dense in G (see [4, Theorem 1]), we have that

r' n Net wo.2'(QT)QNet is nonempty. Every element of Net can be written in the form u·n', where u E UQ and n' E N~ n .2'(S). Since Wo normalizes .2'(S) (by taking every compact Q-root into another), it follows that if Yo E r' n Net wo.2'(QT)QNet , then we may write Yo = uWoP, where P E P Q , and we may assume wo, P E GQ, adjusting by an element of .2'(QT)Q if necessary. In the notation of the introduction, we see that r' c GQ , since G is centerless (see [4, Theorem 2]). Also r normalizes r' and hence normalizes the group algebra d(r') (over Q) of r'. That group algebra contains GQ • In fact, consider the left regular representation of Ge on its group algebra over C, let V be the vector space over C spanned by all the matrices in r', and let W be the vector space over C spanned by all the matrices in GQ ; clearly, VQ = d(r') and V c W; since r' c GQ , r'· VQ = VQ , and since r' is Zariski-dense in G, it follows that G· V = V; since the identity matrix e E V, we have Gc eVe, hence W = V and GQ c W Q = VQ • Moreover, since the adjoint representation of G on V is faithful (G is centerless) and defined over Q, it is an isomorphism of algebraic groups, defined over Q,


145

between G and its image. Hence, GQ may be identified with the set of Q-rational points in the image of G. Therefore, GQ is its own normalizer in Gc . From the foregoing, we also see easily that r normalizes G~, i.e., r c G~, and that G~ n G~ = G~. Hence, GQis of finite index in G~. If P * is any Q-parabolic subgroup of G and if H is any arithmetic subgroup of G, then GQ is the union of finitely many disjoint double cosets HaP *Q' Thus in particular we may write GQ =

(5)

U r'aP

Q,

aeAl

from which we have GQ =

U

r'aN(Fo)Q. Since P is self-normalizing, we

aeAl

see from property (ii) of G& that obvious that we have (6)

G~

=

G~

U raNQ

= G&· NQ = GQ· NQ • Then it is

(disjoint union)

aeA

for some finite subset A of G&. 5. Eisenstein series We know that Gh/G~ is a finite Abelian group of type (2, 2, ... , 2), if not trivial. With notation as before and a E G~, define ro,a = r naN", r~,a = r' n aNh , ro = ro,e, r~ = r~,e. We know that ar' c GQand we also know (see [3, Section 3.14]) that there exists a positive integer da such that if y E a- 1 r' a n N h , then j(Z, y)cl a = 1, and by Section 2, if pEN", then j(Z, p) is constant as a function of Z. Hence, since r o,a/r~,a is Abelian of type (2,2, ... ,2), if not trivial, we have j(Z, y)2cl a = 1 if yEa -lr O,a' Let the set A be as in (6) and let d be the I.c.m. of all da , a E A. Let I be a positive integer divisible by d. For each a that we have chosen, select a complex number Ca, and then form the series E"c given in (3). By our choice of I, it is clear that the values of the individual termsj(Z, yay do not depend on the choice of representatives y of the cosets of r O,a in r. If this series converges, it clearly defines a holomorphic automorphic form on :t with respect to r. If b E r, write b = gp, with g = gb E G&, p = Pb E NQ • Write r as a disjoint union of double cosets r'bro,a, bE B. Since r' is normal in r, we have r'bro,a = brTo,a and (7)

E"a(Z)

=

L( L

beB

j(Z, bY'a)')'

Y'er'/r'o,a

Using the cocycle identity (see [3, Section 1.8 (1)]) for j, the convergence of E"a for sufficiently large I then follows from known convergence results on Eisenstein series [3, Section 7.2]. Writing b = gp as above, we have j(Z, bya) = j(Z, g(pyp-l)(pap-l)p) = c-j(Z, gylal),

146


where c is the constant value ofj(Z, p), aI, g E GQ(since NQ normalizes the latter), and Y1 Erb = pr'p-l c GQ. Let rb,al = a,Nh n r b. Then the inner sum in (7) becomes

2:

(8)

j(Z, gY1 a1)1 = Ea"biZ ),

Yl Er"b/r"b,al

which is an automorphic form of weight I with respect to the arithmetic group gr b. Therefore, cDFEl,a may be calculated for any rational boundary component F by applying information already available (see [3, Sections 3.12, 7.7]) on the limits of Poincare-Eisenstein series for arithmetic groups contained in GQ. If g is a holomorphic automorphism of a domain Deem, thenj(Z, g) is a nowhere vanishing holomorphic function on D. Using this fact and the information just referred to, one sees easily that for any rational boundary component F, cDFEl,a, if not =0, is the sum of a normally convergent (in general, infinite) series of lth powers of holomorphic functions of which not all are identically zero, such that each term not identically zero is a nowhere vanishing holomorphic function onF, If Fa = Fo·a-I, a E A, we have for a' E A cDF (El a') a

t

=

lim j(Z, a- )1El a,(Za- 1 )

1 Z-Fo'

= lim Z ....

(9)

Fo

= lim

2: 2:

\

(j(Z, a- 1 )j(Za-I, ya'»!

1Er"f['o,a'

j(Z, a- 1 ya')I,

Z .... F o 1Er"/r"o,a'

and all terms on the right go to zero except those for which a- 1 ya' E N h ; the existence of such terms implies a = a' and y E r O,a, which leaves. in case a = a', only one term with nonzero limit; it follows that we have (10)

(Kronecker delta symbol).

Therefore, (10')

Since each orbit of zero-dimensional rational boundary components under r contains Fa for precisely one a E A, we see that if the mapping c: A ~ e is nonzero on all of A, then cDFlEl,c =1= 0 for every O-dimensional, rational boundary component Fl' By the transitivity of the cD-operator (see [3, Section 8]), it follows that for any rational boundary component F,' cDFEl,c is not identically zero, Consequently, by the previous remarks, cDFEl,c is the sum of a series of lth powers of nowhere-vanishing holomorphic functions on F, The following is easily proved (see [1, Lemmas V, VI]):


147

Let 2: ai be an absolutely convergent series of complex numbers, not all zero. Then there exists a positive integer m such that 2: ar' #- O. LEMMA

1.

If f is an integral automorphic form (see [3, Section 8.5]) with respect to r, thenfmay be viewed as a cross-section of an analytic coherent sheaf on ~ = '1:/r, such that that sheaf has a unique prolongation to an analytic coherent sheaf on the Satake compactification ~* of~, and such thatfhas a unique extension, which we denote by f*, to a cross-section of the extended sheaf (see [3, Section 10.14]). We know from [3, Section 8.6] that each of the series E,.c is an integral automorphic form with respect to r. From this, from Lemma 1, and from the discussion preceding the lemma, we obtain at once: PROPOSITION 1. There exists a finite set !£' of positive integers I such that the members of the family {E"~·a},E.!l',aEA have no common zero's on ~*. If c: a --+ Ca is given such that Ca #- 0 for all a E A, then, by increasing the size of !£' if necessary, we may assert that the members of the family {E'~C}'E.!l' (fixed c) have no common zero's on ~*.

The proof is obvious. Referring to the first paragraph of this section, let 10 be a positive integer divisible by d such that for all positive integers I divisible by 10 the series E,.c converge. Let n be a positive integer divisible by 10 and let An be the linear space over C spanned by all expressions of the form (11) where ai E A, such that Ii > 0, loll;, i = 1, ... , /L, and 11 + ... + l/.t = n. An element of An will be called an isobaric polynomial (in the Eisenstein series) of degree n. Denote by A~ the linear space of the extensions to ~* of the elements of An. We make the same set of definitions with the functions E"a replaced by E,.c for a fixed, nowhere zero c and denote the spaces corresponding in this way to An and to A~ by An.c and A~,c, respectively. Clearly A~.c c A~. Then we have from the proposition immediately:

If loin and if n is sufficiently large, then the elements of (respectively of A~.c) have no common zero's on ~*.

COROLLARY. A~

Clearly, if p, q E A~, q #- 0, then p/q is a merom orphic function on ~*; or it may be identified with a meromorphicfunction on % invariant under r.

7. Projective imbeddiogs Let A * be any subspace of A~ such that the elements of A * have no common zero's on ~* and let Ao, ... , AM be a basis of A *. Then the assignment


148

determines well defined hoI om orphic mappings of % and of ~* into CpM, which we denote by ¢(h) and !{I(,,), respectively. Let ~ = !{I(h)(~*); then ~ is a (complete) projective variety. The functions "M induced on :t by Ao, ... , AM are automorphic forms of weight n. If k is a suitably large positive multiple of n, then a basis of automorphic forms of weight k affords an injective holomorphic imbedding of ~* in some projective space such that ~* is mapped biregularly onto its image. It follows (see [2, pp. 353-354]) that since the AI are nonconstant and without common zero's, the mapping !{I(,,) has the property that for every x E~, !{I(hi(x) is a finite set. Hence, we may speak of the degree of the mapping !{I(,,): ~* -+)lli, which is equal to the degree of the algebraic extension of the field of rational functions on m! by the field of those on ~*. Moreover, if loll, I > 0, then the image of E/~a or of EI~C under restriction to the image of any rational boundary component F in ~* is a nontrivial cross-section (if not identically zero) of an ample sheaf, and is hence nonconstant. Furthermore, we see at the same time that ¢(") can be extended to a mapping of the union of all rational boundary components which is continuous in the Satake topology (see [3, Section 4.8]). We now let A * = A~,c for fixed, nowhere-vanishing c. Let !{In be the mapping associated to a basis of A *, let m!n = !{In(~*)' and let dn be the degree of !{In. It is clear that dn ;?: dn+ 10' Therefore, there ex~sts an n' such that dn = dn , for all n ;?: n'. Also it is easily seen (by the Jacobian criterion) that if p is a regular point of~ c ~* and if !{Inl induces a biregular mapping of a neighborhood of p onto a neighborhood of a regular point of m!nl' then the same is true for all n ;?: nl' We wish to prove the following proposition:

"0, ... ,

PROPOSITION

2.

The degree dn , is equal to one.

We proceed to the proof of this by stages. If g E Gh , for any real number r, let Lg.r

= {Z E :t11j(Z, g)1 =

r};

this is a real analytic subset of:t and is different from :t, and hence is of measure zero if j(Z, g) is not constant as a function of Z, i.e., if g ¢ N h • Since the series (2) converges uniformly on compact sets, it follows that the number of the surfaces Lya,r passing through any compact subset of :t for fixed r is finite. If Ij(Z, ya)1 == d·lj(Z, 'Y'd) I, where d is a positive real· constant, 'Y, 'Y' E r, a, a' E A, then we see easily from the cocycle relation that a = a' and 'Y = 'Y" 'Ya, 'Ya E r O,a' From the term wise majoration of the series (2) in a truncated Siegel domain, afforded by [3, Section 7.7(i)], we conclude that on an Fo-adapted


149

truncated Siegel domain 6 in %, there exists a series of positive constants which majorizes the series

2:

j(Z, b -lya)'

(/011, I > 0)

YEI'/I'o.a

on 6, for any a, b E A (to reduce this to the form considered in [3, Section 7.7(i)], write b-1ya = b-1a(a-1ya)). Moreover, from this we conclude that, if 6 is suitably chosen, then we have Ij(Z, b-1ya)1 < 1 if b #- a or if b = a and y rt r O,a' Define the subset g;. of % by g;.·a

= {Z E %llj(Z, b-1ya)1 < 1 if bE A - {a} or if b = a,

y

rtro.a}.

Then from this discussion and from properties of the Satake topology (see [3, Section 4]) we have at once the following lemma: There exists a dense subset C of % such that % - C is of measure zero and such that for Z E C, the terms in the series for E I •C are all distinct. For each a E A, the set g;. is open, and nonempty, and contains an Fa-adapted Siegel domain. The closure of g;. in the Satake topology contains a neighborhood of Fa in that topology. LEMMA

2.

Consider only n > 0 divisible by 10 , Let Ll n be the inverse image under of the diagonal of cpU n x Cpu n , where /Ln + 1 = dim c' Clearly Ll n contains the diagonal of ~* x ~*, and Ll n :::::> Ll n + lo ' Without loss of generality, we may assume n' > 0 chosen such that Ll n = Lln' for all n ~ n' (because {Ll n} is a decreasing sequence of algebraic sets). Consider now only n ~ n'. We have seen that rp;;l(X) is finite for all x E:lli. We now prove the following lemma:

rpn )( rpn

A:.

LEMMA 3. If Xo = if';n(Fo), then Xo = if';n(F1) for every O-dimensional, rational boundary component Fl and if';;; l(XO) is precisely the set of all O-dimensional, rational boundary components.

PROOF. It is sufficient to consider the case Fl = Fa for some a E A. Since the result is evidently independent of the choice of basis for A:,c, we may assume the basis to consist of monomials in the series E I •c ' Then by (10'), the image under if';n of Fa is the point in projective space represented by [1:···: 1]; thus if';n(Fo) = if';n(Fa) for all a. The converse requires further considerations. Let F be any rational boundary component of % (we may have F = :t) with dim F > 0, and let Z E F. The function (J)FEI •C on F is an automorphic form of positive weight on F with respect to the homomorphic image r F of Nh(F) n r in the full group of holomorphic automorphisms of F, and r F is an arithmetically defined discontinuous group acting on F. Without loss of generality, we may assume that Fa is a rational boundary

150


component of F. We may view F (see [9, Section 4.11]) as a tube domain on which NhF = Nh(F) n Nh acts by linear transformations; with a certain abuse of notation, let r F,a = r F n aNhF . Then by [3, Section 7] it is easily seen that Cf)FE"c is the Eisenstein series

L:

(12)

L:

c~·

aeA"

jF(Z, ya)qFI,

ZEF,

yePFIPF ,a

where qF is a positive rational number (not a multi-index!), jiZ, g) is a nonzero constant e g times the functional determinant of g as a transformation of F, and a runs over the set AF of elements of A such that Fa is r-equivalent to a rational boundary component of F. By assuming ab initio suitable divisibility properties for 10 , we may assume that qFI has the necessary divisibility properties for the tube domain F. This having been said, we now prove that r'I;; l(XO) n % is empty and note that the same argument will prove that r'I;; l(XO) n F is empty by merely supplying the subscript F where needed. If Z E % and r'ln(Z) = Xo, then we have E,.C(Z) of 0 and fk(Z) = Ekl.c(Z)!E"C(Z)kWill have the same value for all k (because Xo = [1 : ... : 1]). Following [12, p. 126], we introduce the function

Mz(A)

=

L: L: aeA

yeP/Po.a

(A - E,,c(Z)(Ca-j(Z, ya»-I)-l. \

This series converges if A is not one of the discrete set of points E"c(Z), (ca-j(Z, ya»-I ofC, because of the convergence of the series for E1,c itself, and uniformly so on any bounded subset of C not meeting that discrete set. Hence, M z is a meromorphic function on C with infinitely many distinct poles. On the other hand, M z is holomorphic at the origin and its power series expansion is - ~ fk(Z)Ak -1. By hypothesis, all fk(Z) are k=l equal, hence Mz(A) = -fk(Z)/(l - A) has at most one simple pole-which is absurd. This completes the proof of the assertion that r'I;; l(XO) is simply the set of O-dimensional rational boundary components. It follows from Lemmas 2 and 3 that there exists a neighborhood 9l of Xo

such that % n r'I;;1(9l)

c

(y 9;.). r. Let ~ = (y 9;.).

Suppose now that the degree dn' is greater than 1. We assume 10 chosen such that E1o,c is not identically zero. Henceforth, fix c and let E1,c = E1. Since dn' > 1, one sees easily that there exist points Zl, Z2 E .9'*, say,. i = 1,2, in distinct orbits of r such that Zj E

9;..,

(i) E1o(Zj) of 0, i = 1,2; (ii) the canonical images of Zl and Z2 are regular points of ~ c ~*; (iii) r'ln(Zl) = "'n(Z2) and their common value Wn is a regular point of


151

~n' and ,'fin is a biregular mapping of a suitable neighborhood of Zi onto

a neighborhood of W n , i = 1,2; (iv) Wn ¢ tPn{'iS* - 'is). Defining 8 n = tPn{'iS* - 'is), we may find a small neighborhood iRi of Zi contained in 9';." i = 1, 2, such that iRl n iR 2 ' r is empty, and such that ,'fin is a biregular analytic mapping of iRi onto a neighborhood iR of Wn in ~n - 8 m i = 1,2. Let rp denote the composed mapping {r.bnliR2)-lo (r.bnl iR 1) of iRl onto iR 2; because of the properties of r.bn and tPn which are stable for large n (q.v. supra), rp is independent of the choice of n for sufficiently large n, say n ~ n'. Then rp is biregular and we may assume E10 =I- 0 on iRl U iR 2. Let I = 10 , With liZ) = Ez(Z) -k. E1k{Z) as before, our conditions imply thatlk{Z) = Ik{rp(Z», k = 1,2, ... , Z E iR 1 • 8. Extension of the mapping cp

We have the following lemma: LEMMA 4. phism 01'1:...

The mapping rp may be extended to a holomorphic automor-

PROOF. The main ideas here are those of [12, pp. 126-130]. Define M z{>") as before, when Ez(Z) =I- O. By hypothesis, El =I- 0 in iRl U iR 2. If Z E iRl U iR 2, then the set of points {EI{Z){C a ){Z, ya»-l} is a discrete set. Moreover, in any region of '1:.. x C avoiding poles of M z{>"), M z{>") is an analytic function, and its poles for any fixed Z E '1:.. such that EI(Z) =I- 0 are the points of that discrete set, each of which is a simple pole with residue equal to the number of times a term of given value occurs in the 00

Eisenstein series. As before, -MzC>..)

2:

Ik(Z)>..k-l. Then the equations k=l liZ) = Ik(rp(Z», k = 1,2, ... , imply that the sets of poles of M z and of M fP(Z) are the same. If Z E 9';." then by the cocycle identity =

and jj{Z;, ai- 1 yb)1 < 1, unless b = ai and yE ro,a .. in which case jj{Zh aj-lyai)1 = 1; therefore, (ca,)(Z, a,»-IEI{Z) is the pole of MzC>..) having the smallest absolute value, i = 1,2. Hence, we have

The equality of the other poles of M z and M fP(Z) gives the system of equations

152


Dividing (13) by (14) and using the cocycle identity gives j(g;(Z)·a 2, ai 1ya 2)'

(15)

=

(Cbzj(Z, yzhz)(ca1j(Z, a 1»-1)!,

where hz, yz E G~ depend, at first, on Z. However, the number of possibilities for each Z is at most countable, and using the fact that an analytic function not identically zero vanishes on at most a set of measure zero, we conclude that there exists a function f from the set of cosets rjr O,a, for each a, into G~ such that for suitable constants d y we have j(g;(Z)a2' ai 1ya 2)1

(16)

== dy(j(Z,J(y»j(Z, a 1)-1)I,

Since the set of lth roots of unity is finite and the functionj(Z, a), for fixed a, is a nonvanishing holomorphic function on all of'r and in particular on the neighborhood 91 1 , we may conclude that (17) where h y is a nonvanishing holomorphic function on all of'r. We now proceed to solve the system (17), The group ail ra 2 is also arithmetic. We let X be the one-to-one analytic mapping of 91 1 into 'r obtained by first applying g; and then translating by a 2 • Our main problem becomes that of solving the equations j(a(Z), y) -1 == hlZ),

(18)

for an analytic mapping a of'r into itself such that a = X in a neighborhood of Z1> where y runs over an arithmetic group r 1 and h y are nonvanishing holomorphic functions on cr. We now choose Yo = UWoP E r 1 as in Section 4 and let y run over the elements '\Yo, where ,\ runs over the lattice A = Un r 1 in UR . Identifying UR with the group of real translations in 'r, the system (18) for these y becomes (19)

j(a(Z)

+

u

+ '\, WOp)-l

= g~(Z),

where g~ is holomorphic in ;t, By Section 3, this system becomes

n 91l(a(Z) + u + '\)i)V,

(20)

= gll.(Z),

i

cr.

where gil. = const. g~ is holomorphic in Since we know that a solution X = a of the system (18) exists in a neighborhood of Z1> it will follow that if a subsystem of the system (18) has a unique analytic solution for all Z E 'r, then this solution must be an analytic continuation of x, and hence must be a solution of the full system (18) for all Z E cr. Let n1, ... , nk be positive integers and m = n1 + ... + n k , If Zi E en" write Zi = (Zij)j~l, ... ,ni' and if Z E em = f1 en" write Z = (ZI)i~l, ... ,k' If j

also z'

E

em, define Z +

z' by the usual component-by-component addition,


153

In what follows, let pj be a homogeneous polynomial of degree d j > 0 in nj indeterminates with complex coefficients, viewed as a function on c n , in an obvious way. We assume that the first partial derivatives

~pj, j

=

1,

uXjj

... , nj are linearly independent polynomials for each i. Let VI, . • . , Vk be positive integers and put p = pII ... p~k. Let A be a Zariski-dense subset ofC m • LEMMA 5. There exists afinite number of points r a E A c cm, where ex runs over a finite indexing set B', such that if D is a connected open subset of c m and if {fa}aEB' are analytic functions on D for which the system

p(a(z)

(21)

+ ra)

=

fiz)

has an analytic solution a mapping some open subset of D into cm, then the system (21) has a unique solution a defined on all of D, a: D --i>- cm. PROOF. View the polynomials PI> ... , Pk as polynomials in independent sets of indeterminates {Xjj}j=I ..... n,. i=I ..... k. It is an elementary exercise to prove that if n denotes the linear span, in the vector space of polynomials with complex coefficients, of the set of higher (i.e., of order higher than I) partial derivatives of p and if 1TI> ... , 1TN is a basis of n, then 1TI' ••• , 1TN,

~P

uXjj

, j = 1, ... , nj, i = I, ... , k are linearly independent polynomials. It

follows from Taylor's expansion that if y (22)

p(z

+ y)

=

p(y)

+

2: :p. i.j

Xl]

E

cm, then

(Y)Zjj

+

2: Pp(y) Qp(z),

OEB

where B is some finite indexing set, Po and Qo are homogeneous polynomials such that deg Po + deg Q/3 = deg p and Po, f3 E B, are linearly independent elements of n. Therefore we may find raE A, a E B', card B' = m + card B, such that the determinant of the matrix

is nonzero (because the product of Zariski-dense sets is Zariski-dense in the product variety). Then it is possible to solve uniquely the system of equations (23)


154

for the "unknowns" ailz) and ap(z), and it is clear that a!j(z) is analytic in all of D. Since a solution does exist in an open set and since D is connected, it follows that a = (au) is analytic in all of D and is the unique solution there to the system (21). This proves the lemma. To demonstrate the applicability of Lemma 5 to our situation, one notes the systems of equations (18) and (21) and observes (e.g., by an easy case-by-case verification) that the Jordan algebra norm Wi always satisfies the requirements (i.e., that PI be homogeneous and that the polynomials

:XP.i. be linearly independent) imposed upon each of the polynomials Pi in 11

Lemma 5. This computation is left to the reader. Finally we note that any translate A + u of the lattice A of UR is Zariski-dense in em (the complexification of UR ). Thus we have obtained an analytic mapping a of % into the ambient space em of % such that a coincides with X in a small neighborhood of Zl' Now view the noncompact Hermitian symmetric space % as imbedded in its compact dual %C (see [14, Section 8.7.9]). Then 02 E G~ extends to a holomorphic automorphism of the compact, complex manifold %C, and then Oil a = a' is an analytic mapping of % into %C which coincides with cp in a small neighborhood WI of Zl' Our next step is to show that a' maps % into the closure %* of% in %c. It is easy to see that for n I;;;; n'there exists a proper Zariski-closed subset %;: of )lin such that Sn C %,:' and such that if %~ = "';; 1(%;:) and %h = ls;; 1(%;:), then % - %n is a covering manifold of ~* - %~ and the latter is a covering manifold of )lin - %;:. (It is sufficient that %,:' include the singular points of )lin> the set Sn, the images of the singular points of ~*, and the image of the set of regular points x of~* at which is not locally biregular-the intersection of the last set with ~ is, modulo singular points of~, a proper Zariskiclosed subset of~ by the Jacobian criterion.) Let % = %n. The set % - % is connected and open and contains Zl' Let Zo be any point of % - % and join Zl to Zo by a path cpo Let CPa' be the image of cP under a'. We have lsn·a' = lsn in a neighborhood of Zl' It is clear that lsn(CP) is a path in)lin - %;: and by the homotopy covering theorem (see [14, Sections 1.8.3-4]) can be raised to a unique path in % - % beginning at Z2, since lsn(Zl) = lsn(Z2)' By analytic continuation of the relation lsn a' = lsn it is clear that this covering path beginning at Z2 can be no other than CPa" Hence, a'(Zo) must be a point of%. Therefore, a'(% - %) c % - % c %. Viewing % as the bounded domain D, we have a'(D) c 15,' since a' is continuous in D and % - % is dense in %. Bya known property (see [9, Section 4.8]) of the boundary components of D, since a'(D) n D is not empty, a'(D) n (15 - D) must be empty; thus a'(D) c D. Let a" be the similarly defined mapping associated to cp-1. Since, also, a"(D) c D 0

"'n

0


155

and a'a" = a"a' = identity (because this is true in a small open set, 91 1 or 91 2 , to begin with), we see that both a' and a" are bijective maps of D onto itself. This completes the proof of Lemma 4. 9. Conclusions

Thus we see that there exists a biholomorphic map a: D --+ D, a E Gh - r such that ~n 0 a = ~n' Let r* be the subgroup of Gh generated by a and r. Clearly r !i r*, so that r* cannot be a discrete subgroup of Gh • By construction, ~n is constant on any orbit of r and also on any orbit of the group generated by a and hence is constant on any orbit of r*. Since r* is not discrete, it is not property discontinuous on :t, and thus there is an orbit w of r* in :t with a limit point Zoo Hence, in every neighborhood of Zo, there exist infinitely many points mapped by ~n onto the same point of ~n' But this contradicts what we already know about tPn. The contradiction having come from the assumption that d n , > 1, we conclude that d n = 1 for n ~ n'. This proves Proposition 2. Therefore, tPn is a proper birational mapping of the normal variety )B* onto m;n> n ~ n', and for w E~n' tP;;1(W) is finite. Since dn = 1, we conclude from Zariski's main theorem (see [10, p. 124]) that )B* is simply the normal model of~n' and hence there exists a projective variety biregularly equivalent to )B* and defined over any field of definition for~n' Since by Chow's theorem (see [6]), any meromorphic function on ~n is a rational function expressible as the quotient of two isobaric polynomials of like weight (see [3, Section 10.5]), the proof of the theorem stated in the introduction is complete.

COROLLARY 1. Let G be a connected, semisimple algebraic group defined over Q such that GR has no compact simple factors. Let K be a maximal compact subgroup ofG'it and let r be a maximal discrete arithmetic subgroup ofG'/i. Assume that;! = K\G'it is Hermitian symmetric, and assume that Gh is connected. Then the same conclusions as those in the theorem hold. PROOF. Let Z be the center of G (which is not assumed to be centerless), and let 17: G --+ G' = G/Z. Then 17(r) is maximal, discrete, arithmetic in G'it = Gh , and we may apply the theorem. The cases where Gh is not connected are described in [3, Section 11.4]. COROLLARY 2. Let G, ;!, and r be either as in the statement of the theorem or as in the statement of Corollary 1. Then the conclusions of the theorem, or of Corollary 1, respectively, are true if the module of isobaric polynomials in {E1,c} of sufficiently high weight is replaced by the module of isobaric polynomials in {E1,a}loll, 1>0, aEA of sufficiently high weight. PROOF. The proof is trivial because the first module mentioned is a submodule of the second.

156


The group r is called unicuspidal if all minimal Q-parabolic subgroups of G are r-conjugate. Then, if P is as before and r is unicuspidal, we have GQ = r,PQ = PQ • r, and thus the set A may be chosen to consist of e alone. Let E, = E"l (i.e., c. = 1). We then have the following corollary: COROLLARY 3. Ifr is unicuspidal, the cross-sections Et have no common zero's on m*, and the isobaric polynomials of sufficiently high weight in these induce a mapping ,p ofm* onto a complete projective variety of which m* is the projective normal model. THE UNIVERSITY OF CHICAGO

REFERENCES 1. BAILY, W. L., Jr., "On Satake's compactification of Vn ," Amer. J. Math., 80 (1958), 348-364. 2. - - - , "On the theory of 8-functions, the moduli of Abelian varieties, and the moduli of curves," Ann. of Math., 75 (1962), 342-381. 3. BAILY, W. L., Jr., and A. BOREL, "Compactification of arithmetic quotients of bounded symmetric domains," Ann. of Math., 84 (1966), 442-528. 4. BOREL, A., "Density and maximality of arithmetic groups," J. f. reine u. ang. Mathematik, 224 (1966),78-89. 5. BOREL, A., and J. TITS, "Groupes reductifs," Publ. Math. I.H.E.S., 27 (1965), 55-150. I 6. CHOW, W. L., "On compact complex analytic varieties," Am. J. Math., 71 (1949), 893-914. 7. HELGASON, S., Differential Geometry and Symmetric Spaces. New York: Academic Press, Inc., 1962. 8. KORANYI, A., and J. WOLF, "Realization of Hermitian symmetric spaces as generalized half-planes," Ann. of Math., 81 (1965), 265-288. 9. - - - , "Generalized Cayley transformations of bounded symmetric domains," Am. J. Math., 87 (1965), 899-939. 10. LANG, S., Introduction to Algebraic Geometry. New York: Interscience Publishers, Inc., 1958. 11. RAMANATHAN, K. G., "Discontinuous groups, II," Nachr. Akad. Wiss. Gottingen Math.-Phys. Klasse, 22 (1964), 154-164. 12. SIEGEL, C. L., "Einfiihrung in die Theorie der Modulfunktionen n-ten Grades," Mathematische Annalen, 1/6 (1938-1939), 617-657. 13. TITS, J., "Algebraic and abstract simple groups," Ann. of Math., 80 (1964), 313-329. 14. WOLF, J., Spaces of Constant Curvature. New York: McGraw-Hili, Inc., 1967.

Laplace-Fourier Traniformation, the Foundation Jor Qyantum lriformation Theory and Linear Physics JOHN L. BARNES 1. Two-dimensional choice information in a discrete function For clarity this paper begins with the very simple examples shown in Figs. la, I b, and 2. These illustrate the idea that a simple function can contain choice information (see [2]) in two ways. In these figures the elements chosen are shaded. To construct messages they are placed in positions located in intervals on the scale of abscissas called cells and on the scale of ordinates called states. It is assumed that they are chosen independently and that the binary locations are chosen with equal statistical frequency. In Fig. la the information is contained in the height, i.e., ordinate of the function. In Fig. I b the information is contained in the position, i.e., the first or second place, in the cell. For the transmission of information the function illustrated in Fig. la would be said to be heightmodulated. Sometimes "amplitude" or envelope modulation are the terms if the carrier is a sinusoidal wave. Fig. I b would be said to be position- or time-modulated. Instead of position, time rate of change of position is often used. The terms are then pulse position or pulse rate modulation. Pulse radar, sonar, and neural axon spike transmission, as well as phase or frequency modulation in radio are in this class. For height modulation over a 2-space-dimensional field one can select "black and white" still photographs. Here shades of gray give the height and are levels of energy. Both height and position modulation may be used jointly as illustrated in Fig. 2. Here the message function has independent choice information carried in the height and horizontal position of the selected elements. If a coherent source such as a laser beam is used to form a 2-D space interference pattern on a photographic film, then called a hologram, by 157

JOHN L. BARNES

158

Height States nSH 2~----~~~~~----~

o ........... ••••••••••••••••••••••••••••••••••••••••••

o 11.1.1.1.1.1.1.1.1.1.

I

•••••••••••••••••••••4

nCH Cells

mH = 16 messages

dH

=4

bits of information

Fig.la

Height modulation

Position States

nsp ..

o

I

20

2

I ••••••••••••••••••••••

o

•••••••••••••••••••••2

••••••••••••••••••••••••••••••••••••••• 3

mp = 16 messages

dp = 4 bits of information Fig. 1b Position modulation

4· nCp Cells

LAPLACE-FOURIER TRANSFORMATION

Height States nSH

159

Position States nS~

20

o 2,0

2,0

........... ·· .. .. .. .. .. .. .. .. .. .. ........... ........... ........... ........... ........... · ... .... . . . ........... ...........

o

2

•••••••••••••••••••••

o

••••••••••••••••••••••

:t -IN 0

•••••••••••••••••••• 2

mHfP

= 256

dHfP

= 8 bits of information

3 •••••••••••••••••••••••••••••••••••••••••

messages

Fig. 2 Joint height and position modulation

summing exposure from a direct beam and from one reflected from a 3-D space object; and if then the direct beam is subtracted out by passing it through the hologram, there will result a presentation which retrieves the 3-D space picture from the 2-D space hologram. In this holographic process the retention of position, as well as height, modulation provides the depth information missing from an ordinary photograph. In line with this terminology, ordinary photography could be called "halfography" since it uses only half (approximately) the information available in the 3-D space physical world. It will be seen later that ordinary probability and statistics correspond to "halfography" since they often present approximately half the information available. By the use of (usually uniquederivative) complex analysis (see [9]) in place of real analysis, probability and statistics can be made whole. 2. Laplace-Fourier transformation as a path to the analytic complex domain While a single set of symbols would suffice to show the mathematical relations to be discussed in what follows, particular symbols carry associated physical meaning to engineers and physicists beyond the form of the mathematical relations (see [10]). Hence the same mathematics will frequently be stated in several sets of symbols.

JOHN L. BARNES

160

Professor Bochner's deep understanding of functional transform theory first influenced my thinking through his 1932 book [7] and his supervision of my graduate research at Princeton University. 3. Uncertainty theorems from the !l'-~ transformation The fundamental uncertainty principle was known intuitively by the German composer Johann Sebastian Bach (see [2]) and used by him about 1730. Through the intervening years it has become known in successively more precise forms. A modern version given by Norbert Wiener at a 1924 G6ttingen seminar could be expressed in our form as follows (see [3]): If J(t),

WIENER'S UNCERTAINTY THEOREM.

d~~), dTt~)

exist and

EL2

on R, and if the bilateral Fourier-integral transform ~II[f(t)] £ F(iw), then (U 1 ) (EJ

(a) F(iw) E L2 on I, (b) 1 ; : ; M· ~Iiwl, and (c) 1 = ~t· ~Iiwl for J(t) a Gaussian pulse,

in which ~t £ 2ab ~w £ 2a w

==

aliwl,

where energy

e(t) £ i:=_ooJ 2(t1)dt1, E(iw) £

tw

IF(iw1W

e

(

00

)

1

I Nk

(il3 k )1

iO k= 1,2,3

hypotheses of Wiener's Uncertainty Theorem, Theorem (Th l ), (I), and (A), the conclusions are: (Th 2 )

(HI) (H 2 ) (D)

nn nn nn

~ ~t,,; ~IEk(iwk)l,

~ ~Xk· ~IPk(i.Bk)l,

k

=

1,2,3,

~ ~CPk· ~I Qk(itPk)l·

In conclusions (HI) and (H 2 ), Theorem (Th 2 ) gives a precise meaning to Heisenberg's (1927) Uncertainty Principles [14], and, in conclusion (D), to Dirac's (1931) Uncertainty Principle [12]. In [3] the simple deduction of Planck's and de Broglie's Laws was given. ASYMPTOTIC UNCERTAINTY THEOREM. An example of using Wiener's Basic Uncertainty Theorem for the asymptotic case, n ---0> 00, i.e., for classical physics and engineering, is the following. Multiply (U I ) by the velocity of electromagnetic waves in free distance-space, c, to obtain

c~

cM·~liwl. ~r =

But r

=

ct,

c~t,

in which ~r is the 2a y uncertainty in r, the radar pulse distance traveled in time t in free distance-space. Then, under the hypotheses of Wiener's Uncertainty Theorem, using (R) in (U 3 ), the conclusion is (Th 3 )

(W)

c

~

M· ~Iiwl.

JOHN L. BARNES

168

Theorem (Th3)'s conclusion (W) gives a precise meaning to the Radar Uncertainty Principle of Woodward (1953) [I8]. INVARIANT METRICS UNDER ANALYTIC COORDINATE TRANSFORMATIONS. 3-D complex (see [8]) vectors and their !l'-ff transforms will be used in the following model for (special) relativistic physics (see [I]). ORIGINAL DOMAIN. The distance-space-time displacement vector in C 3 is of the across type, and is k = 1,2,3 (summation convention),

(V")

in which complex distance-displacement zl< ~ xl< + ictl< with II< a real unit vector in the k-direction; xl< is a real distance-space coordinate; c is the velocity of electromagnetic waves in free distance-space, ctl< is the imaginary part of the complex distance-displacement, and tl< is time coordinate in the k-direction. Then the complex distance-displacement metric is given by the inner product as

k, I = 1,2,3,

(M")

in which gl O. It is clear that if nn(F) exists it is unique. We also make a sequential definition which in a large class of cases

180

CAMERON AND STORVICK

coincides with Ir(F). Let us consider functionals defined on the larger space B[a, b]: B[a, b] = {x(·)lx(t) defined and continuous on [a, b] except for a finite number of finite jump discontinuities}.

Let a be any subdivision of [a, b], a: a = to < tl < t2 < ... < tn = b, and let norm a = max (t; - t;-I). For any x E B[a, b], let (4.2) Xa

( ) = {X(l;_l) t x(b)

if t;-1 ~ if t = b.

1

0 let

where if; E L 2 ( -co, co) and vo (4.5)

== g. We now define

neq(F) = w lim

I~(F),

a-O

where w lim denotes weak limit. In Theorem I of [I] we have shown that if F(x) is bounded and continuous in the uniform topology on B[a, b), I~eq(F) exists and equals I,..{F) for real positive A. We have shown in [I] that for a large class of functionals F, H:D(F) and I~eq(F) exist and are equal for Re (A) > o. When I~D(F) and I~eq(F) exist and are equal for Re A > 0, we shall call their common value I,,(F). Let F be a functional for which I~eq(F) and I~D(F) exist and are equal for Re A > O. Then if the following weak limit exists, we define (4.6) In order to obtain the existence of the weak limit JiF), we shall first prove a theorem showing that certain functions defined by cluster sets satisfy our integral equation.

THE SCHROEDINGER EQUATION

181

5. The integral equation for cluster values

Let OCt, u) be continuous almost everywhere in the strip < u < 00, let IO(t, u)1 ;;; M for (t, u) E R, and let q be any real number, q f= O. Let !fo E L 2( - 00, (0) and let THEOREM 2. R: 0 ;;; t ;;; to,

-00

Ft(x) = exp

(5.1)

{E 8(t -

s,

xes»~ dS}.

Let G(~, .\) == G(t, ~, .\) == (I,,{FM)(g), where ~ = (t, ~). Then every element == r(t,~) ofCC(G(·, .\), -iq) satisfies the integral equation:

rm

ret, ~) = ( ~ ) 1/2 WJ'" !fo(u) exp (iq(~ - U)2) du 2mt _ '" 2t (5.2)

+ ( 2~i ) 1/21twJ'" 0 _

<Xl

for almost all (t,

~) E

(i (~- U)2) (t - s)- 1I2 O(s, u)r(s, u) exp q2(t _ s) duds

R.

The proof of this theorem parallels the proof of Theorem 9 of [1], but it differs at significant points. For the convenience of the reader the entire proof is presented. PROOF. Let t and q be given real numbers, t E (0, to) and q f= O. By applying Lemmas 4 and 5 of [1, p. 527] to 8*(s, xes»~ = 8(t - s, xes»~ we observe that 8(t - s, xes»~ is measurable for almost every x E Co[O, t], and is measurable in the product space. Thus FtCx) is Wiener integrable on Co[O, t] and hence also on Co[O, to]. By Theorem 4 of [I], Ih(Ft) exists and is bounded for Re.\ > 0, II Ih(Ft) II ;;; eMto = M'. Hence we obtain

II Ih(FM II ;;; M'II!fo11 and

Upon integration with respect to t we obtain

f: I~", I(Ih(Ft)!fo)(~)12 d~ o

Ifwe set (Ih(Ft)!fo)(O == G(t, Fubini theorem

and

~,

dt ;;; M'2toll!fo11 2.

.\) == Ga, .\) where

~

= (t,

~)

we have by the

182

CAMERON AND STORVICK

From Lemma 2 of Section 2, we conclude that the cluster set C = 'i&'g( G( ., A), - iq) is a nonempty set. Here the Hilbert space H is understood to be L 2 (R), and the domain n is the right half plane, Re A > O. Let An be a sequence such that Re An > 0, An ---'>- - iq, and G(~, An) ---'>weakly, i.e.,

rm

(5.3) for all CfJ E L2(R). Our function G(t, Theorem 8 of [1]:

G(t, g, An) = (5.4)

g,

An) satisfies the integral equation of

(2~tf'2 J:oo ifJ(u)e-An(~-U)2/2t du A )1/2 + ( 2. 217

it 0

(t -

I S )1/2

Ioo08(s u)G(s u A 0' , , _

)e-An(~-U)2/2(t-S)

n

duds

•

We now multiply this equation by CfJk(g) == Hk(g)e-~2/2 (where Hk(g) is essentially the Hermite polynomial of degree k, i.e., Hk(g) = (_I)ke~2/2. (dk/dgk)e-~2/2), and integrate with respect to g over (-00,00):

J:oo CfJkWG(t, g, An) dg = J:oo CfJk(g)(2~t) J:oo ifJ(u)e- An«-U)2/ 2t du dg 1/2

+ J:oo

G:f'2

/

CfJk(O

J~ (t _IS)1/2

.J:", 8(s, U)G(S, U, An)e-An(~-U)~/2(t-S) du ds dg.

(5.5)

Let us begin by considering the second term on the right-hand side of the integral equation (5.5) which we write as follows: (5.6) where (5.7) g(s,

g, A)

=

(217(t~

S)f/2

Loooo 8(s, u)G(s, u, A)e-("'2)[(~-U)2/(t-S)1 duo

Then by Lemma I and Theorem 4 of [1], the following estimates hold:

IIG(s, ., A) I

~

I ifJ I eMto ,

and

Ilg(s, ., A) I ~ ~

(5.8)

~

118(s, u)G(s,·, A) I MIIG(s,·, A) I MllifJlleMto,

THE SCHROEDINGER EQUATION

and

f~",

u: /g(s,~,

183

dsf d~ ~ f~", s: /g(s,~, f: /g(s',~, = f: s: t"'", ~, ~ s: J:

,\)/

,\)/ ds

/g(s,

,\)/ ds'

'\)//g(s',

~, ,\)/ d~

ds

ds'

ds

//g(s, " ,\)//. //g(s', " ,\)// ds'

~

s: ds s: M2//ifl//2e2M1o ds'

~

(2M2//ifl//2e 2M1o .

d~

This enables us to conclude that

and thus by applying the Schwarz inequality we obtain for an arbitrary function rp E L 2 ( - 00, (0),

We now apply the Fubini theorem to interchange the order of integration

'" f:

f~ rp(~)

g(s,

~, ,\) ds d~ = =

(5.9)

f: t"'", rp(~)g(s, ~, (2:)1/2 f:

,\) d~ ds

(1 _IS)1/2

.

f~", f~", rp(~)8(s, u)G(s, u, '\)e-("/2)[(~-U)2f(1-')1 du d~ ds.

We now consider the double integral:

f~ J~", rp(~)h(u)e-(A/2)(~-U)2 du d~, <Xl

where h(u) = 8(s, u)G(s, u,'\) and By Lemma 1 of [1], since I/hl/
and M 2. Clearly if, say, V(M I ) ~ V(M 2), then hI ~ h. It will then suffice to prove the estimate for the submanifold with boundary MI and, moreover, the same argument will work for any manifold with boundary. Now the regions of MI lying between the critical levels of f2 have a natural product structure L x I given by the level surfaces and their orthogonal trajectories. We introduce product coordinates (x, t) by choosing local coordinates {Xi} on some L and setting t = f2. Since dt is orthogonal to dXh the volume element dv may be written in coordinates as (5)

II:tll, we have Ilgradf 11·v (t, x) = grad t, :JII:tll ·11:tll = dt (!) = 1.

SinceJ2 = dt, VI(t, x) = (6)

2

(7)

Let V(t) denote the volume of the set {x continuous and differentiable.

E

MIIJ2(x) ~ t}. V(t) is

(8) By (7) this is equal to

=

f' (L

(10)

=

f'

(11)

=

-hI

(9)

L(f'

Moreover, (12)

V2.dt) dx

V2.dX) dt

A(Lt) dt

roo

Jo

~

hI

1"" V(t)·dt

t.dV(t).dt. dt

198

JEFF CHEEGER

and t = J2. Thus (1 1) becomes

hI

f"

t{L VI(t, x)· v2(t, x) dX} dt

(13)

= hI

f'

{L t·VI(t, X)·V2(t, x) dX} dt

(14)

Squaring the inequality (8)-(14) and dividing through by

(fM

(IS)

(fM1 J2t yields

IlgradJ2II)2 > h2 > h 2

(fM1 J2)2

=

I

=

.

Combining (15) with (2)-(4) completes the proof.! In dimension 2, it is relatively easy to see that h is always strictly greater than zero. In fact, let V(M) = V, and let c be such that a metric ball of radius r < c is always convex. Then, if A(S) < c min V(M1) = V'

(16)

it follows that each component of S must lie in a convex ball. On such balls the metric g satisfies k· E

~

g

~ ~E

where E is the Euclidean metric in

normal coordinates. Hence h > 0 is implied by the usual isoperimetric inequality in the plane. Now, according to a theorem of the author (see [3]), c may be estimated from below by knowing a bound on the absolute value of the sectional curvature SM, an upper bound for the diameter d(M), and a lower bound for the volume. Once this is done, it is elementary that k may be estimated from ISMI. This yields COROLLARY. that

If dim M = 2,

8M =

iff + d(M) + ISMI < 8, then ,\ >

cp,

then given 8 there exists

B

such

B.

In case dim M > 2 the situation is not so elementary but 6.1 and 6.2 of [4], or [5], will still imply that h > 0. 2 Actually the results of [4] and [5] show the existence of an integral current T whose boundary in S, such that A(S) divided by the mass of the current is always bounded away from zero independent of S. However, since T is of top dimension, it is known that T 1 The equality SM II grad f" I dV = S;;' A(L t} dt is actually a special case of the "coarea formula" (see [6]). 2 Thanks are due to F. Almgren for supplying these references.

THE SMALLEST EIGENVALUE OF THE LAPLACIAN

199

may be taken to be either Ml or M 2 • If 8M =F 0 may also be deduced from Theorem 1 of [1] without too much difficulty. It would be of interest to generalize the argument given here to Il.2 acting on k-forms. Singer has pointed out that this would give a new proof of the fact that the dimension of the space of harmonic forms is independent of the metric and that the techniques might be applicable to other situations. To date, we have not been able to accomplish this except in the case dim M = 2. The essential point here is that for any eigenvalue of /),..2 on I-forms one can find an eigenform of the form dj, where/is an eigenfunction corresponding to A. This observation is probably of little help if n > 2. UNIVERSITY OF MICHIGAN

and SUNY, Stony Brook REFERENCES 1. ALMGREN, F., "Three Theorems on Manifolds with Bounded Mean Curvature," B.A.M.S., 71, No.5 (1965), 755-756. 2. BERGER, M., Lecture notes, Berkeley Conference on Global Analysis, 1968. 3. CHEEGER, J., "Finiteness Theorems for Riemannian Manifolds," Am. J. Math. (in press). 4. FEDERER, H., and W. FLEMING, "Normal Integral Currents," Ann. of Math, 72 (1960). 5. FEDERER, H., "Approximating Integral Currents by Cycles," AMS Proceedings, 12 (1961). 6. - - , "Curvature Measures," Trans. A.M.S., 93 (1959), 418-491.

The InteBral Equation Method In ScatterinB Theory C. L. DOLPHl

1. Iatroduction

The subject of this essay seems singularly appropriate to this occasion for several reasons. Much of the material on which it depends stems from the Berlin and Munich schools where Salomon Bochner spent many of his early mathematical years. The foundations of the theory are perhaps best expressed via the Bochner integral (see Wilcox [I] and Dolph [2]). Finally, the subject has reached some sort of culmination for the direct problems with the as yet unpublished work of Shenk and Thoe [3], while the inverse problem has achieved a significant new impetus from the recently published work of Lax and Phillips [4] and that of Ludwig and Morawetz [5]. In this review I hope to give some indication of the history and status of this subject in the hope that perhaps a new generation of mathematicians can be interested. For this reason very few theorems will be proved in detail, and the discussion will be limited to the appropriate operators with independent variables restricted mainly to two- and three-dimensional spaces, although almost all that will be said will be valid, with modifications, for n ~ 2. At the outset it is necessary to make a clear distinction between the basic physical problems which will be treated and the method of the title, since, 1 The author wishes to express his great gratitude to P. Lax and R. S. Phillips for making their new unpublished work [4) available to him as soon as possible; to S. Sternberg [421 for the same reason; to P. Werner for making the dissertations of all of his recent students including those of Kussmaul [52) and Haf [63) available; to C. H. Wilcox, who furnished me a Xerox copy of his Edinburgh lecture notes; and in particular to N. A. Shenk II, and D. Thoe for their unremitting efforts both by mail and personal contacts to keep me informed of their progress. I would also like to express my thanks to H. Kramer, R. Lane, and R. R. Rutherford for the stimulating summer they provided at G. E. Tempo 1968. I have taken the liberty to reproduce some things from those in Section 7; see also [82). Sponsored in part by NSF grant GP-6600.

201

202

C. L. DOLPH

in many cases, the physical problems can be treated by other methods, the most extensive such treatment of which is to be found in the work of Lax and Phillips [6]. The mathematical problem is as follows: Let n in Rn with n = 2 or 3 be an unbounded domain with a compact C 2 hypersurface r as boundary, where r may have several components or be empty. In the former case r is considered as the disjoint union r 1 u r 2 with each rj either empty or consisting of some of the components of r. Let v be the unit normal to r which points into n, r i the interior bounded by r, and a a real-valued Holder continuous function on r 1> and let q(x) be a real-valued Holder continuous function on n u r which satisfies (1.0)

q(x) = O[exp (-2alx!)]

as Ixl--+ 00

for some a > O. It should be noted that this latter assumption is more general than that used by Lax and Phillips [6] who require a q of compact support and a domain which is star-shaped. Further let K denote the subset {k; 1m k > - a} of the complex plane if n is odd and the portion {k =F Ollm kl < a and

-00

< arg k < oo} u {k; 1m K > 0 and 0 < arg k, < 7T}

of the logarithmic Riemann surface {k =F 0, -00 < arg k < oo} if n is even. This is a type of normalization and, to obtain that used by Lax and Phillips [6] and Newton [7], one should reflect about the real k-axis. One now considers the reduced wave equation (1.1)

[-Ll

+ q(x)

- k 2 ]c/>(x) = 11

in

n

and we desire outgoing solutions satisfying (1.2) (1.3)

(:V-

a )c/>(X+OX)=/2

c/>(x

+

ox) =/3

0nr on

l

r2

for k E K and with function [h] which satisfy for some 0 < f3 < 1, j; = O(ealxl) as Ixl --+ 00, 11 E CP(n u r), 12 E C(r 1), 13 E Cl + (r 2) andj; = O. In addition one needs an appropriate outgoing radiation condition whose history and various formulations will be commented upon subsequently. There are four main areas of interest for problems of this type, all mathematical, but all strongly influenced by both classical and modern

SCAITERING rHEOR Y

203

physics: (i) to establish a suitable Hilbert space, define an appropriate self-adjoint operator in it and, in terms of "distorted plane waves," develop a spectral theory for real k by considering the overall problem as a perturbation on those which can be solved by Fourier analysis; (ii) to define the so-called wave and scattering operators, decide when the latter is unitary and relate it to the asymptotic form of the solution for the so-called integral equation of scattering (for the Schrodinger equation it is usually called the Lippman-Schwinger equation by physicists); (iii) to discuss the analytic properties of the scattering operator and its analytic continuation as well as that of, it turns out, not the resolvent but the resolvent kernel (or its transform, for n > 3); and (iv) to use the knowledge gained by some or all of this to obtain asymptotic solutions in time which, while they are the cause of the so-called exponential catastrophe, can be discussed in suitably enlarged spaces. Once this has been accomplished the inverse problem can be faced. A most promising beginning on it has been made by Lax and Phillips [4].2 The areas of (i) and (ii) are eminently associated with Carleman [8], Kato [9], Pozvner [10], Ikeba [11], Lax and Phillips [6], Shenk [17], Thoe [18], and Kuroda [19]. An excellent summary of results up to the recent collaboration of Shenk and Thoe can be found in Shenk [20]. Thus we shall limit our remarks concerning them to a brief updating of the spectral results achieved by Shenk and Thoe [3]. Area (iii) is the one the author has been most interested in and the only one to which he has occasionally contributed. It will be the one reported most fully here. While Lax and Phillips [4] and Ludwig and Morawetz [5] have made recent contributions to area (iv), these, unlike the recent results of Shenk and Thoe [3], have just been published, so that only hints of them will be reported here. To date, the integral equation method in this area has produced only fragmentary results (compare Dolph [21], Thoe [22]), although it is clear that it will eventually succeed and is part of the overall plan of Shenk and Thoe. To orient the reader, a semichronological approach, in which at the end several difficulties of the method of this title will be indicated, will be used. Physical problems of the type described, with some or most of the conditions (l.l)-(1.3) present, occur frequently in acoustics and electromagnetic and quantum scattering, as reference to Frank von Mises [23] or Morse and Feshbach [24] clearly show. Initially work was limited to the case where the variables separate and where q is identically zero, and if condition (1.2) occurs it is most often with a = identically zero. Most 2 This approach to the inverse problem differs from the method of GelfandLevitan [I2] and Faddeev [I3]. [14]. and [IS] in that the hope is to use knowledge of the poles in 1m k < 0 to deduce the perturbation for a body. For an introduction. see Dolph [16].

C. L. DOLPH

204

importantly of alI, initially k was assumed real. The earliest (1881) such solution known to me is due to Lord Rayleigh [25] for the case of a cylinder or circle of radius r = a for n = 2 with Dirichlet boundary conditions. The total field takes the form

in which the first terms are the well-known expansion of an incident plane wave while the second terms represent the scattered field. All subsequent remarks will be built around this solution since it explicitly illustrates alI that is known about the problem formulated for the case at hand. The first thing to be noted is the type of condition the scattered field satisfies as r --+ 00. Abstracted, it leads to the justly celebrated Sommerfeld [26] radiation condition introduction by him in 1912, which states that for uniqueness with the exterior problem for real k one must require that the limit

limr(n-l)/2[~ur

-

ik]4>--+O.

It was not until 1943, however, that

T-OO

Rellich [27] showed that this was alI that was necessary (Sommerfeld had required more) and also demonstrated that any solution of the reduced wave equation with q = 0 that went to zero faster than I/r, r = Ixl must vanish identically. This last result has played a vital role in 'many subsequent developments (see, for example, Kato [28], Ikebe [II], and others). Loosely speaking, the Sommerfeld radiation condition says that there can be no sources at infinity and it is sufficient to make a certain surface integral vanish on a large sphere as r --+ 00. For the case n = 3, Stratton [29, p. 486] gives a heuristic development on which I cannot improve. Returning to the Rayleigh solution, one notices that there are "eigensolutions" of the exterior problem at the zeros of H~l)(ka) (the Hankel function of the first kind) which occur for discrete values of k with 1m k < O. These are the so-called "complex eigenvalues" first introduced by Lord Kelvin [30] in 1884 for the case of a perfectly conducting sphere and are similar to those introduced by Gamow [31] in 1928 for the case of a-decay. Their introduction was criticized and improved by Lamb [32] in 1900 and Love [33] in 1904, and in my judgment their best interpretation in the Gamow case was by van Kampen [34] in 1953. Perhaps the most informative historical discussion of them is to be found in Beck and Nussenzveig [35], Nussenzweig [36], [37], and Petzold [38] for simple. cases. They are the subject of the book by Lax and Phillips [6] referred to earlier and will playa vital role in our subsequent discussion. With these preliminary remarks, attention will now be directed to the integral equation method in scattering theory itself.

SCATTERING THEORY

205

2. The integral equation method for obstacle scattering While I cannot be absolutely certain, it appears that Kupradze [39], [40] in the period 1934-1936 was the first to apply the Fredholm integral equation method to an arbitrary exterior problem under Dirichlet boundary conditions, and he appears to have been the first to state a radiation condition valid for all k (see, for instance, Schwarz [41]). As in the Sternberg paper [42] for the similar problem with complex k, there were lacunae, at least according to Freundenthal [43]. In 1947 Maue [44] applied the same method and actually treated the Rayleigh case described. In the simplest case for that problem, his integral equation took the following form for the cylinder:

Using expansion techniques from special function theory, it can readily be shown that the corresponding homogeneous integral equation has nontrivial solution only at the zeros of J;,(ka) and of Hp)(ka), i.e., at the zeros of the derivative of a regular Bessel function or (in the normalization used here) at the zeros of a Hankel function of the first kind. The first type of zeros correspond to eigenvalues of the associated interior Neumann problem for the cylinder in a way completely analogous to that which occurs in ordinary potential theory. Examination of Rayleigh's solution shows that they do not, however, affect either the scattered or the total field and thus in some sense they are spurious, because of the method, and should be proven so. This will be done for a quite general case in Section 4. The second set of zeros all occur for 1m k < 0 and imply that the exterior problem for the circle has so-called "complex eigenvalues" which are not removable. This is indeed the case in general and is the subject of areas (iii) and (iv) defined above. Weyl [45] considered the Dirichlet problem quite generally via a doublelayer assumption, the same one that leads to the Maue equation. The difficulty of the interior eigenvalues of the Neumann problem he overcame by the addition of a suitably chosen single layer, mimicking the methods of potential theory, even under the assumptions that nonsimple elementary divisors occurred and, as I have remarked before (see Dolph [21]), he thought they might. Muller [41] soon gave an argument which showed that they did not and by so doing considerably simplified Weyl's argument. The next step which yielded analyticity in 1m k ~ 0 was taken by Muller's student Werner [47], who replaced the simple double-layer Ansatz by one

206

C. L. DOLPH

of the form tfo(x, k) = k n -

2

{L O~1I Fk[(x -

S)fL(S) dSs]

+

(2.2)

L.

Fk[(x - Y)T(Y) dVII ] } ,

where, in n-dimensional Euclidean space,

n-2

p = -2-;

+ +--+ H~l),

(2.3)

_ +--+

H~2)

and in which T is an unknown volume potential. Under this assumption the difficulties associated with the interior Neumann problem disappear and one has merely to deal with the first part of the Fredholm alternative to get existence and uniqueness since one can set up a vector integral equation involving a compact operator in a suitable Banach space. One obtains, in addition, analyticity in k for the solution for all of k, 1m k ~ 0 and this method was used subsequently by Shenk [17] and is part of the technique in the current version of Shenk and Thoe [3] which will be discussed in Section 6. From a practical point of view the presence of an additional volume integral makes numerical calculations immensely more ,difficult. As a result several people-Leis, another student of MUlier [48], Werner and Birkhage [49], and Pannic [50]-replaced the double-layer assumption by one which reads, for both the Dirichlet and Neumann problems, as follows:

where 7J = 7J(k) = 1

for Re k ~ 0 = - 1 for Re k < O.

Under this assumption one needs only the first part of the Fredholm alternative. Pannie also went on to treat the corresponding exterior electromagnetic problems although here I find the treatment somewhat obscure. As Werner and Greenspen [51] have shown for the Dirichlet problem and Kussmaul [52] for the Neumann problem, it is possible to obtain excellent results numerically for at least the cylinder problem and certainly the absence of the volume integral is of tremendous help. I fully' anticipate that a subsequent version of Shenk and Thoe [3] will employ it also. As sometimes happens in mathematics, good mathematics results, ironically enough, come from a method failure rather than from the true

SCATTERING THEOR Y

'21)7

physical situation. To return to the Maue integral equation (2.1), I conjectured that what was true there was true in general. This led Wilcox and me to suspect that a technique of Tamarkin's [53], combined with a theorem of Rellich [27], could be used to prove that this was always the case. This we were able to establish. Subsequently, for his Edinburgh address of 1967, Wilcox developed an extensive Fredholm-type treatment valid for the entire complex plane. Since it seems unlikely that these results will ever be published I shall use this occasion to state them in some detail. Before doing this, a few comments on the radiation condition are in order.

3. The radiation condition There are many possible forms for a radiation condition and even today new ones are being devised for special purposes (compare Ludwig and Morawetz [54]). In 1956, Wilcox [56] gave a particularly elegant reformulation of that of Sommerfeld, in which for any solution c/>(x, k) in C 2 he showed that it was sufficient to require that lim

R .... <x>

J. /(88r - ik)c/>/2 dS~O. T=R

Earlier, in 1950, Kupradze [56] gave the following conditions (compare the 1956 German translation [57]):

r(:r - ik)c/>

=

e±lkTo(1)

rc/>(r)

=

e±tkTo(I),

where the plus sign holds for 1m k < 0 and the minus sign for 1m k ~ O. Subsequently, most workers, including Shenk and Thoe [3], have used a 1960 reformulation due to Reichardt [58] for the plane and a 1965 generalization of Schwartz [41]. If Fk+(x - y) is any fundamental solution of (1.1), these conditions are merely the requirement that the surface integral arising from Green's formula, namely:

i [

c/>(s)

B=r

88

Vs

Fk+(x - s) - Fk+(x - s)

88

Vs

c/>(s)] dSs

converge to zero as r ~ 00. For k =f 0 and 0 ~ arg k ~ 7T this condition is equivalent to the Sommerfeld radiation condition if one takes Fk+(x - y) to be that given in (2.3). The elegant formulation of a radiation condition due to Wilcox [59] in 1959 seems somehow to have been overlooked, probably since he did not

208

C. L. DOLPH

explicitly state its validity for the entire complex plane at the time of its introduction. To state it let k E [K - [0)) and let v be a solution of j

(3.1) for

Ixl

~ c.

Then he observes that the mean,

r

MxAv] = 41

vex + rw) dw,

TT)I"'I=1

is in fact equal to lkT relkT + v_ex) e--r-'

MX.T[V] = v+{x)

r = rex),

where v±{x) = ± 8elkTi ·k TTl

1"'1 =1

(8-8 + ik)rv(x + rw) dw r

is defined for all x E R3 (independently of r ~ ro(x» and v±(x) are entire solutions of (k). v is a k-radiation function for, if v E C2[O], v is a solution of (3.1) for x such that

r

)1"'1=1

(88r - ik)rv(x + rw) dw = o.

For 1m k ~ 0 these conditions are equivalent to those of Sommerfeld. With this result we can return to the Kupradze-Maue-Weyl integral equation. 4. The Fredholm integral for the exterior Dirichlet problem Following Wilcox [60], we assume that r is a closed bounded surface in R3 of class C 2 , that 0 is the connected exterior of r, and that r' is the interior of r. The problem to be solved is that of finding a solution of

/).if> + Pif>

=

p(x)

for x

E

0

for x

E

r.

and if>(x) = 0

Here p(x) is a prescribed function-the source-and k is the wave number in the set {K - (O)} but is otherwise an arbitrary complex number. Let I if>o(X, k) = - %3

I Ix _ yl

eiklx-YI

p(y)dy

SCATTERING THEORY

be the solution for the case in which case, let c/>(x, k)

n=

209

R3 and

= c/>o(x, k) + v(x, k) for x

where v(x, k) is a k-radiation function for that v(x, k)

E

n in the

r = E

O. In the general

n,

sense of Wilcox such

Cl(Q)

and v(s, k)

= -c/>o(s, k) for x E r.

As in the Kupradze-Maue-Weyl equations, the double-layer assumption implies a representation for v(x, k) in the form (4.1)

v(x, k) =

i

a

ellkllx-sl

"2 I [' UPs TT X

-

S

I p.(s) dS.,

where P is a unit exterior-directed normal on r. This leads to the following integral equation (in view of the well-known jump relation): for s, S'

(4.2)

E

r,

when

a

I

K(s,s'; k)

e1lkl Is-s'l

= -2TT " I UPs' s -

S

'1'

Then in terms of iterates of the kernels, one can represent the resolvent kernel of (4.2) in the form R(s,s '., k)

= N(s, s'; k) D(k)

,

D(k) 1= 0,

where R(s, s'; k)

and R(s, s'; k)

L +L

= K(s, s'; k') +

= K(s, s'; k)

K(s, s"; k)R(s", s'; k) dSs•

R(s, s"; k)K(s", s'; k) dSs·'

for all k such that D(k) "# O. This in turn implies that the double-layer density admits the representation p.(s, k)

= v(s, k) +

L

R(s, s'; k)v(s', k) dS.,

210

C. L. DOLPH

and that the total field admits the representation

In

.p(x, k) =

G(X, y; k)p(y) dy.

Here the resolvent Green's function has the following explicit expression: G(x, y; k)

1

=

-47T

elll

=

In

(y)!f!(y) e-2alYI dy

and a norm The introduction of this space avoids the so-called "exponential catastrophe" which results if one works in a time-independent way in the usual Hilbert space appropriate for the region 1m k ~ o. When, in this enlarged Hilbert space, under the assumption (1.0), k is restricted to the region 1m k ~ -a, the integral operator (5.1)

I Tk = - 41T

r eilkllx-yl

Jo Ix _ yl

p(y)(y) dJL(Y),

where p(y) = e2alxlq(x) is an analytic compact family of operators of the Hilbert-Schmidt class with the property that IIT(k)IIHl = O[(lm kF/2]

as 1m k

--+ 00.

Moreover, [J - T(k)]-l is a meromorphic operator-valued function for 1m k ~ -a and has a pole at ko if and only if there exists a such that = T(k) in H 1 • Explicitly, it was the resolvent kernel of the scattering integral equation (see Dolph et al. [61] or Ikebe [11]) which could be continued since it is not apparently possible to continue the resolvent relations of Hilbert. We also showed that the poles were symmetrically placed with respect to the imaginary k-axis and that those in the region 1m k ~ 0 all occurred on Re k = o. We also discussed the scattering operator in the form given by Ikebe [70] and showed (i) that under these same conditions it could be continued analytically through the region 11m kl < a; (ii) that it was meromorphic in this region with poles confined to the subregion -a < 1m k < 0 and to the non-negative imaginary axis; and (iii) that the poles in the next-to-Iast region were also symmetrically placed with respect to the imaginary axis. We did not, however, relate the singularities of the integral operator and/or resolvent Green's kernel to 3 Similar results were announced by Ramm [681. I have just received a reprint of his paper [69] containing the full details.

c.

214

L. DOLPH

those of the scattering operator. However we did give an example which indicated that the region of continuation to 1m k > - a was in agreement with the known results of radially symmetric potentials and was the best possible. This example led us to suspect, but this has not been verified as yet, that under the hypothesis (1.0), in contrast to that of compact support for q(x), there will always be a contribution from infinity in the lower halfplane due to the occurrence of a branch cut along 1m k = - a. Subsequent to our joint work, Thoe [22] gave a proof under a differentiability condition on q(x) that there were only a finite number of poles in any strip in the region and made a beginning on the associated asymptotic series for the wave equation 1m k > O. (There is a known simple relation between the reduced wave equation with potential when derived from the wave equation and when derived from Schrodinger's equation.) This relation is, for example, explicitly spelled out in Shenk [20]. Independently, McLeod [71] introduced ellipsoidal coordinates and in a manner reminiscent of the way physicists prove the convergence of the Born series (see Newton [72], Zemach and Klein [73], and Khuri [74]), he was able to obtain Thoe's result without the additional differentiability condition. One essentially integrates by parts in this new coordinate system and then makes appropriate estimates. The results on the meromorphic nature of a family of compact operators have subsequently been considerably generalized by students of Phillips and Werner, respectively, namely Steinberg [62] and Haf [63]. Steinberg obtains his results by working with the projection operator defined by the contour integral involving the resolvent (see, for example, Riesz-Nagy [75]) while Haf bases his on the Schmidt procedure of approximation by degenerate kernels. Since Steinberg's results are the most general and will be used in the next section, we shall state them explicitly. From the viewpoint of one who teaches applied analysis, however, the method used by Haf is probably more easily understood today by graduate engineers and physicists. Haf's thesis also contains several detailed applications of his results to various scattering problems. Following Steinberg, we let B be a Banach space and O(B) be the bounded operators on B. Let Kl be a subset of a complex plane which is open and connected. A family of operators T(k) is said to be merom orphic in Kl if it is defined and analytic in Kl except for a discrete set of points where it is assumed that in the neighborhood of one such point ko co

(5.2)

T(k)

=

2

Tlk - ko)i,

i= -N

where the operator T; E O(B), N ~ 0, and where the series converges in the uniform operator topology in some deleted neighborhood of k o• The

SCATTERING THEORY

215

family T(k) is said to be analytic at ko if N = 0 in (5.2). Then one has the following theorems: THEOREM 5.1. If T(k) is an analytic family of compact operators for k E Kl> then either [/ - T(k)] is nowhere invertible in K1 or [/ - T(k)]-l is meromorphic in K 1 • THEOREM 5.2. Suppose T(k, x) is afamily of compact operators analytic in k and jointly continuous in (k, x) for each (k, x) E n x R (R is the set of real numbers) then if [/ - T(k, x)] is somewhere invertible for each x, then its inverse is meromorphic in k for each x. Moreover, if ko is not a pole of [/ - T(k, xo)]-\ then the inverse is jointly continuous in (k, x) at (ko, xo) and the poles of the inverse depend continuously on x and can appear and disappear only at the boundary of n, including the point at infinity.

In addition to these results, Steinberg also gives sufficient conditions for poles to be of order one. Again these results illustrate the usefulness of the interplay between mathematics and physics.

6. Intimations concerning the unified Shenk-Tboe theory All of what precedes has been unified for the problem given by the relations (1.0)-(1.3) under the hypotheses stated below them. To state some of these results we rewrite the Werner relation in the form - a and this in turn yields meromorphic continuation of the outgoing solutions from 1m k > 0 to all of 1m k > -a, the only poles occurring at the poles of the last inverse. Except for these poles, these analytically continued solutions are unique and satisfy the outgoing radiation condition. At the poles the "complex eigenvalues" occur and the outgoing solutions are not unique. However, as in the usual Riesz-Fredholm theory, a solution will exist if and only if the Lt;] involved are in the null space of the "adjoint" of [/ + M(k)]. As in Werner's work [47] and that discussed in Section 4, this approach eliminates the (removable) singularities of the corresponding interior problems. The resolvent Green's operator can be constructed and used to build a spectral theory for real k completely analogous to that of Ikebe [11] and Shenk [17] for this general problem, but its description is too difficult and lengthy to present here. Perhaps more importantly, the poles of the operator [/ + M(k)]-1 can be related to those of the scattering operator in a very. precise sense, thus answering the question left open by Dolph et al. [61] about the relationship of singularities of the scattering operator and the nontrivial solution of the extended integral operator discussed there. If one lets A denote the self-adjoint extension of the operator defined by

SCATTERING THEORY

217

equations (l.l)-(1.3) in L 2 [n), then the scattering operator S(k) for the "Schrodinger wave equation" applicable to this problem, taken in the form i dU(t) = AU(t) dt '

is a unitary operator on L 2[sn-l] for each positive k, where sn-l is the unit hypersphere in n dimensions. As such, S(k) has a meromorphic extension (with a branch point at k = 0) if n is even to the entire complex k-plane and has no nonzero real poles. The poles of S(k) in the upper halfplane are simple, lie on the imaginary axis, and have as their square the nonzero eigenvalues of A. Shenk and Thoe now define a resonant state as a nontrivial solution of

.Irr u(s) 0Vs Fk+(x 8

u = Tk+U =

-In

s) - Fk+(x - s) ';\0 u(s) dSs uVs

Fi%(x - y)q(y)u(y) dVy

in H~OC[D) satisfying the boundary condition (1.2) and (1.3). Such a function will automatically satisfy [-~

as a distribution in

n, since for [-~

+ q - k 2 )u = 0 H~OC[D]

one has

- k2]Tlv = 0

as distributions in n. Suppose now for simplicity that one considers the poles of S(k) with 1m k tt = 0

4>(0, t)

c

=0

4>(x,O) = 0 4>t(x, 0)

= f(x) =

for 0

~

0 for x > a,

where

and take its Laplace transform via the usual relation: {J(x, s)

=

f"

e- st4>(x, t) dt.

x
{!fiW= 1) =

=

Sa I~ tu,(!fi(X)y(X»

dx

Sa I~ !fi(X)tu,(y(x» dx,

I ...

where dx represents Bochner integration with respect to /L. If {Vj}7'= 1 c oft is a second covering of K and if {gj}7'= 1 is a corresponding partition of unity subordinate to {V j}7'= h then we shall compare the preceding approximation with I(y; {Vj}7'=h {gj}7'=1)' To do this we write n

L tu,(!fi(X)y(X» i=1

m

n

m

n

=

L L UX)!fi(X)tU.(y(x» j=11=1

=

L L glX)!fi(X)gu,4x )tvh(x». i= 1= 1

1

Let K c K be a (possibly empty) compact set such that for U ' , U" and x E U ' n U" n K, gu'u·(x) = e = identity of the group d.

E

oft

GROUP ALGEBRA BUNDLES

231

Then, since gu,Vj(x) = eon K,

Lt~

tul"';Cx)y(x» dx

Ii j~ t~ + L'i j~ t~ = Ii j~ + L'i j~ i~ =

UX)"'i(x)tvh(x» dx UX)"'t(x)gu,vlx)tvh(x» dx

tvMlx)y(x» dx

However, since glx), "'i(X) isometry, II

~

glX)"'i(X)gU,Vj(x)tv/y(x» dx.

0 for all i, j, and since each gu,Vj is an

~1 ~ UX)"'i(X)gu,vix)tvh(x» II ~ ~1 t~ glX)"'i(X)llyll

where

Ilyll

ly(x)1 =

Ii j~

is the norm of

Iltu(y(x»IIA' x

tv/Ux)y(x» dx

=

Lj~

E

+

y in

UE

ro(C) (see [3]):

x

ly(x)l,

+

L~1

UX)"'t(x)gu,vlx)tvlY(x» dx.

glx)tvh(x» II

I(y; {Vj}1'=1, {gj}1'=1)IIA < 'T

=

K,

Ux)tvh(x» dx

L'i j~ t~

Because of the obvious inequality:

where

glX)"'i(X)gu,vlx )tV,(Y(x» dx

L'i j~

glx)tvh(x» dx -

Consider the set of all triples

sup

cw. We now write, since supp y =

L'i j~1 ~

III(y; {Ui}f=1, {"'t}f=1) -

Ilyll =

= Ilyll,

~ Ilyll, we find

2fL(K\K)llyll·

(fJI, {Ui}f= 1> {"'i}f= 1) where

(i) fJI is a coordinate bundle in the equivalence class of C; fJI is engendered by a refinement Cilt of cw; the coordinate maps of fJI are found by restricting those of !iJ. (ii) {Ui}f= 1 C Cilt is a covering of K. (iii) {"'t}f= 1 is a partition of unity subordinate to {Ui}f= 1. We partially order {T} by the inclusion order among the associated {Cilt}. We now assume that the measure fL on G is regular and that for all N in some neighborhood base % = {N} for G, fL(oN) = 0, where oN = N n (G\N) = boundary of N. If G is a locally compact group, these conditions

232

BERNARD R. GELBAUM

°

obtain (see [3], [6]). According to the main result in [3], for e > there is a refinement 0110 of cit, a coordinate bundle 11$0 in the equivalence class of (S such that 0110 is the associated covering and such that the associated coordinate maps are derived from those of!iJ by restriction and there is a compact K c K such that

(a) fL(K) > fL(K) - e/21Iyll, (b) for V~, V; E 0110, x E V~ n V; n

K there

obtains guouo(x)

=

e.

Furthermore, if 11$ is any coordinate bundle whose associated covering 011 is a refinement of 0110 and whose associated coordinate maps are derived from those of 11$0 by restriction, then (b) holds with "V~, V; E 0110," etc., replaced by "V', V" E 011," etc. In consequence, for any such 11$, 011 we find

III(y; {V;}f=b {tP;}f=l) - I(y; {Vj}7'=b {gj}7'=11IA < e. We shall write Ily) for I(y; {Vi }f=l{tP;}f=l). These remarks are the basis for the following theorem: THEOREM I. Let y E r 0(0") and let 11$ be a coordinate bundle (in the equivalence class of g) with associated covering 011. Then for e > there is a T(e) such that if Tb T2 ~ T(e) then

°

PROOF. Let T(e) = TO == (11$0, {V w}f=l, {tf;W}f=l} where 11$0 is chosen as in the argument above, Vw E 0110, etc. If T1, T2 ~ T(e), let T3 arise from any common refinement 0113 of the coverings 0111 and 0112 associated to T1 and T2. In other words, T3 = (813, {Wk}~ = b {7]k}%'= 1), where, in particular, the open covering associated to 11$3 is some common refinement 0113 of 0111 and 0112. We let T1 = (11$1' {Vi }f=l, {tP;}f=l), T2 = (11$2' {Vj}7'=b {gj}7'=1) and we calculate

=

Li~ k~l

tPi(xhk(X)tUj(y(x)) dx -

Lk~l

'1Jk(X)tWk (y(x)) dx.

We show now that gUjWk(X) = e on Vi n W k n K. Indeed we recall that implicit in the definition of each T is the fact that the associated coordinate maps are restrictions of those given for Pi. This means that for TO, Tb T2, T3 there are maps aT: OlIT~ cit such that aT(V) ::::> V and such that by definition g>[j 1 = g>;;/U) on p -1( V), r = 0, I, 2, 3. Let us consider t u, and tWI

K 3 • For such

T

it is clear that

BERNARD R. GELBAUM

234

On the other hand if Tj are associated to Yi, let the underlying coverings be 0/11> i = I, 2. We can then construct in an obvious fashion a T whose underlying covering is a refinement of 0/11 and 0/12 and whose {UjW= 1 covers Kl U K 2 • For such a T the equation just given holds. The existence of I(y) for all Y E r o(C) implies that ultimately I,(Yj) and I,,(Yi) are close. Thus for T associated to K we find lim , Il(XlYl

+ (X2Y2) =

I«(XlYl

+ (X2Y2).

For T arising from Tl and T2 as described, we note that they are cofinal in the set of T associated to K and that I,«(XlYl

+ (X2Y2) =

(Xlf.(Yl)

+ (X2 I,(Y2) =

(XlI,,(Yl)

+ (X2 I'2(Y2).

Passage to the limit now yields the required linearity of I. (ii) For all

T,

IIIh)II ;£ Ilyll/L(supp y). Hence the conclusion follows.

Since d consists of isometries, for Y E ro(C) and x E G, Iy(x) I == Iltu(y(x))IIA is independent of U. Furthermore Iy(x) I is continuous on G, supp Iy(x) I = supp y(x). We define lyl =

L

Iy(x) I dx,

and thereby define a norm on r o(C). The completion of r o(C) in this norm will be denoted by £l(G, C). We observe that £l(G, C) is a Banach space insensitive to the bundle structure of C. For Y E r o(C) we show III(y)IIA ;£ lyl· Indeed Ilf.(y)IIA ;£ lyl by virtue of the definition of I •. The result follows since I(y) = lim , fh). 2. LI( G, If) amd LI( G, A)

By £l(G, A) we mean the set of Bochner-integrable A-valued functions on G. We shall establish an isometric isomorphism between £l(G, C) and £l(G, A). For this purpose we denote by CoCA) the set of continuous compactly supported A-valued functions on G. Let !j be an arbitrary coordinate bundle in the equivalence class of C. Let eft be the underlying open covering of G, {CPu} the set of coordinate maps, and {gu'v'} the set of coordinate transformations (transition functions) defined by {CPu}. For Y E r o(C) , let K be the support of Y and let

235


{D;}f=1 C cit be a covering of K with an associated subordinate partition of unity {!f.;}f= 1. As we did earlier, we denote by 7 a triple (311, {U;}f= 1> {!f.t}f= 1) consisting of a coordinate bundle 311 in the equivalence class of iff, an open covering {Ut}f= 1 of Ky where {Di}f= 1 c cit, Ol/ is a refinement of cit and {9?u} for 311 arise from {9?o} for iiJ by restriction. For each 7 let n

T.(y)(x)

=

L !f;(x)tuh(x»

E

Co(A).

1=1

Clearly T, is linear. For e > 0 there is a K c K such that p.(K\K) < 70 such that for 7 ~ 70

e/211yll

and there is a

XEK. This follows from the argument used in the establishment of I. Thus, we see that for 71> 72 ~ 70

Thus {T.(y)} is a Cauchy net in V(G, A) and we denote its limit by T(y). From the construction of T(y) we see that T(y)(x) = 0 if x ¢: K and for each e > 0 and corresponding K, 70, T,(y)(x) = T(y)(x), x E K. If en -+ 0 we may choose corresponding Kn, 7 n so that Kn c Kn+ 1> and p.(K\Kn) < en. Thus

Hence T(y)(x) is continuous on

UKn and thus a.e. on K. Since all T, are

n=1

linear, so is T. Next we note that

fa IIT(y)(x)IIA dx = Sa Iy(x) I dx = Iyl· Hence T is an isometry on r 0(0") (in particular T is one-to-one). Furthermore, if H(x) E Co(A), we may write n

H(x) =

L !f.;(x)H(x) 1=1

and then find Yt(x) (see [2]) such that tu.(Yt(x» = !f;(x)H(x). Thereupon, if y(x) =

n

2:

1=1

Yi(X) then

236

BERNARD R. GELBAUM

If supp H = K, then supp y c K and by arguments used earlier, for e > 0, there is a compact K. c K, p,(K\K.) < e so that for all T ;;; TO and XEK. n

T.(y)(x) =

n

L: L: ifi;tu,(YI(x»

n

=

1=1;=1

L: tu,(Yi(X»

=

H(x).

i=l

Hence T(y)(x) = H(x) a.e. and thus, viewed as a subspace of U(G, A), CoCA) is the complete image by T of r 0(6"), viewed as a subspace of U(G, 6"). Thus we have U(G, 6")

::::>

ro(6")~Co(A)

c U(G,A),

where T is an isometric isomorphism between dense sets. Clearly T is extendable to U(G,6") and thus this extension, again denoted by T, implements an isometric isomorphism between U(G, 6") and U(G, A). The map I: ro(@")-+ A may be extended uniquely to a map denoted again by I: U(G, 6") -+ A. In terms of this map I we may define a multiplication in U(G, 6") when G is a locally compact group and p, is Haar measure. The natural definition extends classical convolution. We begin by fixing a coordinate bundle fj and all the associated apparatus used in the definitions of T, and I" T and I. Since n

T.(y)(x) =

L: ifii(X)tu,(Y(x», i=l

we see that I.(y)

=

Sa T.(y)(x) dx,

and, in consequence of earlier inequalities, we conclude that I(y) =

Sa T(y)(x) dx.

°

We show that, if y, E r 0(6"), then T(yo) = T(y)T(o). Indeed, if = (f!JJ, {Ui}f= 1, {ifii}f= 1) is given we observe that {ifiiifi;}f.;= 1 is also a partition of unity subordinate to {Ui}f=l. We let T' = (f!JJ, {Ui}f=lo {ifiiifiAf.;=l). T

The set of all

T'

is cofinal in the set of all n

T,,(yo) =

T.

Then

n

L: L: ifii(X)ifiix)tu.cy(x»tuio(x» l=lj=l n

=

n

=

n

L: L: ifii(X)ifiix)gu,uix)tuiy(x»tui°(X» 1=1 j=l 2 ifii(X)gu,uix)tuiY(x» j=l 2 ifi;(x)tuio(x». 1=1 n


As in earlier arguments, for suitable and S and for suitable T we find

K in

237

the union of the supports of y

= T.(y)(x)T.(S)(x) for x E K.

T.,(yS)(x)

Using the cofinality of {T'} in {T} we conclude that

= T(y)(x)T(S)(x) a.e. in x.

T(yS)(x)

Thus we see that if for any 1'J E ro(C) we write 1'Jx(y) == 1'J(y- 1 x), then

=

l(ySx)

=

L

T(ySx)(y) dy

L

T(y)(y)T(Sx)(Y) dy

=

L

=

T(y)*T(S)(x)

T(y)(y)T(S)(y-lX) dy E

CoCA).

Thus we define (y*S)(x)

=

T- 1 [I(ySJl

T(y*S)(x)

=

T(y)*T(S)(x).

and conclude that

It is now clear that T is an algebraic isometric isomorphism between £leG, C) and £leG, A). UNIVERSITY OF CALIFORNIA IRVINE

REFERENCES 1. 2.

3. 4. 5.

6. 7. 8.

B. R., "Tensor products and related questions," Trans. Amer. Math. Soc., 103 (1962), 525-548. - - , "Banach algebra bundles," Pacific Journal of Mathematics, 28 (1969), 337-349. - - , "Fibre bundles and measure," Proceedings of the American Mathematical Society, 21 (1969), 603-607. - - , "Q-uniform Banach algebras" Proc. Amer. Math. Soc., 24 (1970),344-353. GROTHENDIECK, A., "Produits tensoriels topologiques et espaces nuc\eaires," Mem. Amer. Math. Soc., No. 16 (1955). HERZ, C. S., "The spectral theory of bounded functions," Trans. Amer. Math. Soc., 94 (1960), 181-232. JOHNSON, G. P., "Spaces of functions with values in a Banach algebra," Trans. Amer. Math. Soc., 92 (1959),411-429. STEENROD, N., The Topology of Fibre Bundles. Princeton, N.J.: Princeton University Press, 1960. GELBAUM,

QEadratic Periods

if Hyperelliptic

Abelian Intearals R. c. GUNNINGl

1.

Consider a compact Riemann surface M of genus g > 0, represented in the familiar manner as the quotient space of its universal covering surface M by the covering translation group r. For the present purposes the only thing one needs to know about the surface M is that it is a simply connected noncompact Riemann surface. The group r is properly discontinuous and has no fixed points; and the quotient space Mjr is analytically equivalent to the Riemann surface M. The image of a point z E M under an automorphism T E r will be denoted by Tz. Some fixed but arbitrary point Zo E M will be selected as the base point of the covering space M. The Abelian differentials on the surface M can be viewed as the r-invariant complex analytic differential forms of type (1,0) on the surface M. Since M is simply connected, any such differential form 0 can be written as 0 = dh, for some complex analytic function h on the surface M; the function h is then uniquely determined by the normalization condition hCzo) = 0. Since 0 is r -invariant, it is apparent that hCTz) = hCz) - OCT) for some complex constant OCT), for any element T E r; and clearly OCST) = OCS) + OCT) for any two elements S, T E r. The mapping T --+ OCT) is thus a homomorphism from the group r into the additive group C of complex numbers; this homomorphism will be called the period class of the Abelian differential O. As is well known, the period classes of the Abelian differentials on the Riemann surface M form a g-dimensional subspace of the 2g-dimensional complex vector space Hom cr, C) consisting of all group homomorphisms from r into C. Now suppose that 01 = dhl and O2 = dh2 are two Abelian differentials on M, where the functions hI' h2 on M are both normalized by requiring 1

This work was partially supported by NSF Grant 3453 GP 6962. 239

R.

2~

~

GUNNING

that h 1(zo) = h 2(zo) = O. The product a = h 102 = hI dh2 is an analytic differential form of type (1,0) on the surface if and clearly satisfies the functional equation (1) for any element T E r. Again, since M is simply connected, this differential form can be written as a = ds for some complex analytic function on the surface if; the function s is uniquely determined by the normalization condition s(zo) = O. It follows from (1) that the function s satisfies the functional equation (2)

s(Tz)

= s(z) - 01(T)h 2{z) - aCT)

for some complex constant aCT), for any element T E r. The mapping T --+ aCT) will be called the period class of the quadratic expression a = hI dh2 in the Abelian integrals on the surface M; the set of all such mappings will be called, for short, the quadratic period classes of the Abelian differentials on M. The aim of this paper is the explicit determination of these quadratic period classes for the special case of a hyperelliptic Riemann surface; in this case, the quadratic periods are quadratic expressions involving the ordinary period classes of the Abelian differentials on M. In general, the quadratic period classes may be transcendental functions of the Abelian period classes. Section 2 of this paper is devoted to a general discussion of the properties of these quadratic period classes, for an arbitrary Riemann surface; Section 3 contains the explicit calculations for a hyperelliptic surface; and Section 4 provides the motivation for studying these quadratic period classes, with a discussion of their role in the general theory of Riemann surfaces.

2. A quadratic period class is not a homomorphism from the group r into the complex numbers; but it follows readily from (2) that it does satisfy the relation (3)

a(ST)

= a(S) + aCT) - 01(S)02(T)

for any two elements S, T E r. This sort of relation is a familiar one in the cohomology theory of abstract groups (see [2]). In general, for any two group homomorphisms 01> O2 E Hom (r, C), the function 01 U O2 defined on r x r by (0 1 U 02)(S, T) = 01(S)02(T) satisfies the relation

+ (0 1 u 02)(R, ST) - (0 1 U 02)(R, S) = 0 for any three elements R, S, T E r; that is to say, in the cohomological language, 01 u O2 is a 2-cocycle of the group r with coefficients in the (0 1 U 02)(S, T) - (0 1 U 02)(RS, T)

HYPERELLIPTIC ABELIAN INTEGRALS

241

trivial r -module C. Equation (3) is simply the assertion that this cocycle is cohomologous to zero, indeed, that it is the coboundary of the l-cochain 0'. Perhaps it should be noted in passing that requiring the cocycle 81 v 82 to be cohomologous to zero does impose nontrivial restrictions on the homomorphisms 81 and 82 ; indeed, these restrictions are precisely the Riemann bilinear equalities on the periods of the Abelian differentials 81 and 82 , This can be verified by a straightforward calculation, choosing the canonical presentation for the group r and examining the restrictions imposed by (3) and the group relations in r. However, it is more instructive to note that the correspondence 81 x 82 -+ 81 V 82 can be viewed as a bilinear mapping

since Hom (r, C) ~ H1(r, C). Since M is contractible, there are further canonical isomorphisms Hq(r, C) ~ Hq(Mjr, C) ~ Hq(M, C), for q = 1, 2, and with these isomorphisms, the bilinear mapping just given can be identified essentially with the usual cup product operation in the cohomology of the space M. The Riemann bilinear equalities amount precisely to the conditions that the cup products of any two analytic cohomology classes in H1(M, C) are zero (see [1D, which yields the desired assertion. The collection of all the l-cochains 0' E C1(r, C) satisfying merely the coboundary relation (3) form a complex linear set of dimension 2g; for this set is nonempty, and any two elements of the set differ by a l-cocycle, that is, by an element of Hom (r, C). The problem is to select the unique quadratic period class in this set; this appears to be a nontrivial problem in general. A much simpler problem that one might imagine tackling first is the problem of determining the analytic l-cochains in this set, where a l-cochain 0' satisfying (3) is called analytic if there is some complex analytic function s on the Riemann surface M for which the functional equation (2) holds. Note that the analytic l-cochains form a complex linear set of dimension g; for it is clear from (2) that two analytic I-cochains differ by an analytic class in Hom (r, C), that is, by the period class of an Abelian differential on the surface M. It is actually quite easy to determine the analytic l-cochains, using the Serre duality theorem and the familiar technique for calculating that duality (see [I D. The details will be given elsewhere. In at least one special case it is very easy to calculate the quadratic periods for an arbitrary Riemann surface; for when 81 = 82 it is clear that s = hV2, and hence that O'(T) = - 81 (T)2j2 for any element T E r. This

242

R. C. GUNNING

observation can be extended to yield a symmetry principle for the quadratic periods in general. Let 8i = dh h i = 1, ... , g, be a basis for the Abelian differentials on M; and let T -+ ailT) be the period class of the quadratic expression ali = h/J" i,j = 1, ... , g. Then since, clearly, Sij + Sj; = hih" it follows immediately that (4)

for any element T E r. Finally, it should be noted in the general discussion that the quadratic periods, unlike the ordinary Abelian periods, depend on the choice of the base point Zo EM. To examine this dependence, select another base point zt; and indicate the various expressions just considered, renormalized to vanish at zt, with an asterisk. Thus clearly hNz) = h;(z) - h;(zt) and st(z) = St;(z) - h;(zt)h;(z) - s;lzt) + h;(zt)hlzt); and it follows immediately for the quadratic periods that (5)

for any element T E r. Note that the change is skew-symmetric in the indices i,j, as of course it must be in consequence of the symmetry principle (4).

3. Suppose now that the Riemann surface M is hyperelliptic, so that it can be represented as a two-sheeted branched covering of the Riemann sphere P, branched over points to, tlo ... , t 2 g+ 1 in P; and for simplicity, suppose further that the base point Zo of the universal covering surface M is chosen to lie over the point to E P. Select disjoint smooth oriented paths Tk in P from the point to to the various points tk> k = 1, ... , 2g + 1. Each path Tk in P lifts to two separate paths TL T~ in M; these two paths are disjoint except for their common initial point, the single point Po E M lying over to, and their common terminal point, the single point Pk E M lying over t k • In turn, each path Tk lifts to a unique path :r~ in M with initial point the base point Zo E M and terminal point denoted by z~. Since the points z~ and z~ lie over the same point of M, they are necessarily congruent under the covering translation group r; therefore, there is a uniquely determined element Tk E r such that z~ = TkZ~. Note that the paths :r~ and Tk:r~ have the same terminal point; hence the path (:rD, (Tk:r~)-l, obtained by traversing first the path :r~ with its given orientation and then the path Tk:r~ with the opposite of its given orientation, is a connected piecewise-smooth oriented path from Zo to TkZO in M.


243

I. For a hyperelliptic Riemann surface represented as above, the quadratic periods for a basis 8i = dh i of the Abelian differentials on the surface are given by THEOREM

(6) for i, j = I, ... , g and k = I, ... , 2g

+

I.

PROOF. Introduce the hyperelliptic involution P: M --+ M, the analytic automorphism of the Riemann surface M corresponding to the interchange of the sheets in the covering M --+ P; this automorphism has period 2, and the quotient space M/P under the cyclic group generated by P is analytically equivalent to the Riemann sphere. The key to the proof is the observation that 8i(z) + 8;(Pz) = 0 for any Abelian differential 8i on the surface M. To see this, note that 8i(z) + 8i(Pz) is an analytic differential form on M which is invariant under the automorphism P, hence corresponds to an Abelian differential on the quotient space M/P ~ P, and thus must be identically zero. Now let z~(t) E T~ be the point lying over t E Tk, so that the functions z~ parametrize the paths T~ by the coordinates along the paths Tk. Then 8;(zW» + 8;(z~(t» = 0 for all t E Tk, as above, and consequently hi(zW» + hi(zW» = constant, for all t E Tk; since zWo) = z~(to) = zo, it follows from the normalizations adopted that actually hb~(t» + hi(Z~(t)) = 0 for all t E Tk. For the quadratic expressions UiJ = h;8j it then follows also that Ui;(ZW» = Ui;(ZW» for all t E Tk. Applying these and the earlier observations, note that

= (. [Ui;{Z) - 8;(Tk)8j (z)] J~

=

f.~

=-

[Ui;(ZW») - u;;(zW»] - 8i(Tk)

2 (Tk 0

8;(Tk) = -

f.

J20

8;(z) =

i

Tki~-i~(zJ

8;(z)

[8 j (zW» - 8lzW»] = 2 (. 8;(z)

tic

=

(2 8;(z)

J~

8i(Tk)hlz~).

Similarly note that

=

(1 Ui;(Z)

J~

Jilc

2h;(z~).

The desired result follows immediately from these two formulas. To see that the preceding theorem gives a complete description of the quadratic periods of the Abelian differentials on a hyperelliptic Riemann

244

R. C. GUNNING

surface, it is necessary to show that the elements Tk generate the entire covering translation group r. It is quite easy not only to do this, but also to determine the relations between these generators; the results are well known, but since it appears hard to find a sufficiently explicit reference for them, the details will be included here. Matters are somewhat simplified if a bit more care is taken with the notation. In particular, suppose that the branch points tk E P are so numbered that the paths Tk occur in counter2g+I

U

clockwise order around the point to. The domain P -

Tk

is simply

k=1

connected; thus the portion of the Riemann surface lying over it consists of two disjoint sets each homeomorphic to that domain. These two sets will be labeled sheets I and 2 of the covering, in some fixed but arbitrary order. This having been done, let T~ be that path in M lying over Tk which has sheet v along its right-hand side, for v = I, 2. Having fixed a base point Zo E Ai lying over the point Po E M, there is a canonical isomorphism between the covering translation group r and the fundamental group 7Tl(M, Po). Specifically, for an element T E r, select any path in Ai from Zo to Tzo; this path covers a closed path at Po in M, representing the element in 7Tl(M, Po) associated to T under this isomorphism. It is clear that the element Tk E r is associated to the element of 7Tl(M, Po) represented by the path (TD( T~) -1 obtained by traversing first the path T~ in the direction of its orientation and then the path T~ in the opposite direction to its orientation; this element of 7TI(M, Po) will also be denoted by Tk • The problem is now to show that these elements generate the entire fundamental group 7TI(M, Po). Note that any closed path based at Po in M can be deformed continuously into a path contained in the 2g+ 1

subset K = U

k=1

2g+ 1

(T~ U T~);

for clearly the set Ko = U

Tk

is a deformation

k=1

retraction of P - t for any point t E P - K o, and the lifting of this retraction to M accomplishes the desired result. Since K is a wedge of circles, 7Tl(K, Po) is the free group generated by the paths tracing out these component circles; but these are precisely the paths representing the elements T k, hence these elements Tk generate the group 7Tl(M, Po), as asserted. Moreover, since the Tk are free generators for the group 7Tl(K, Po), the relations between these elements in 7Tl(M, Po) all follow from the exact homotopy sequence of the pair (M, K), which clearly has the form f)

7T2(M, K, Po) ---+ 7TI(K, Po) ---+ 7Tl(M, Po) ---+ o.

The pair (M, K) is evidently I-connected; thus it follows from the Hurewicz isomorphism and excision that, with the notation of [4],


245

noting that M/Kis a wedge of2-spheres. Thus the full group 7T2(M, K,po) is generated by two elements ill, il 2, together with the natural action of the fundamental group; and since this action of the fundamental group commutes with the exact homotopy sequence, all relations among the generators of T" of 7Tl(M, Po) are consequences of the relations derived from the elements ail l , ail 2 of 7Tl(K, Po). It is clear geometrically that ill, il2 correspond to the obvious mappings from the closed unit disk to sheets I and 2 of the covering respectively, with their common boundary K; and that ail l = T 1 T2· .. T 2g+ 1 and ail 2 = T l- 1T2- l ... T2~~ 1. Therefore the relations in 7Tl(M, Po) are consequences of the following two relations: (7) It must be pointed out that the quadratic periods are not quite so simple as they might first appear to be from the symmetry formula (4) and formula (6) of Theorem 1. These periods are essentially non-Abelian in nature, and for a general element of the group r they have a rather more complicated form than they have for the special generators T". For instance, one may wish to calculate the quadratic periods for a canonical set of generators of the fundamental group 7Tl(M, Po). Introducing the elements A", B" E 7Tl(M, Po) defined by

A" = T2"T2"+1" ·T2g+1T21/, B" = T2k~lTii/, fork = l, ... ,g, a simple calculation shows that these are also generators for 7Tl(M, Po), and that the relations between them are consequences of the single relation

A1B1AllBllA2B2AilBil ... AgBgA;;lB;; 1 = 1. Thus these elements can be viewed as a canonical set of generators of the fundamental group of the surface M. It then follows by a straightforward calculation from formulas (3) and (6) that the quadratic periods for these generators have the form

+ - 2UiiB,,)

=

9

2: [Bt(AI)BiBI) 1="

B;(B,,)BiB,,) -

+

Bi(BI)BiA I)]

Bi(B,,) BiA"B" + 1 . . . Bg)

Bt(A"B" + 1 . . . Bg)BJCB,,).

In each case the first term, which is symmetric in i and j, can be determined directly from the symmetry formula (4); the interest actually lies in the remaining terms, which are skew-symmetric in i and j, and which reflect the non-Abelian nature of the quadratic periods.

246

R. C. GUNNING

4. That all the quadratic Abelian periods with base point a Weierstrass point can be expressed so simply in terms of the usual period matrix of the Riemann surface reflects the particular simplicity of hyperelliptic surfaces. Even allowing an arbitrary base point, the expressions remain quite simple, in view of (5). For a general Riemann surface the situation is rather more involved. Even sidestepping the matter of the choice of base point, by considering only those linear combinations of the quadratic Abelian periods which are independent of the base point, there remains the question whether the resulting expressions are actually functions of the period matrix of the surface; that is to say, any expressions in the quadratic Abelian periods which are independent of the base point can be viewed as functions on the Teichmuller space, but need not be functions on the Torelli space (see [3]). The detailed investigation of this matter is of some interest, involving as it does the relatively lightly explored area of nonAbelian problems on Riemann surfaces, but will be reported elsewhere. The mere fact that these quadratic periods are of a non-Abelian character is, of course, an insufficient justification for considering them extensively. However, on the one hand, the quadratic periods do arise naturally in studying some problems on compact Riemann surfaces. For example, any finitely sheeted topological covering space of a compact Riemann surface has a natural Riemann surface structure; fixing the topological type of the covering, the period matrix of the covering space is a well defined function of the period matrix of the base space, and it is of some interest to know this function more explicitly. Equivalently, given a compact Riemann surface M of genus g, the problem is to determine explicitly the period matrix of the unique compact Riemann surface M' of the same genus g such that a fixed subset GeM x M', the graph of the topological covering, is homeomorphic to an analytic subvariety of the manifold M x M'. The Abelian part of the problem can be handled quite easily, for Lefschetz' theorem determines the period matrices of the surfaces M' such that the graph G is homologous to an analytic cycle on the manifold M'. There are numerous surfaces M' admitting correspondences onto M having the topological type of the covering mapping, however. A detailed examination of the analogue of Riemann's bilinear relations picks out the period matrix of the unique covering space among all such surfaces M' in. terms of the quadratic periods. In order for this approach to be very useful, more detailed knowledge of the quadratic periods on arbitrary Riemann surfaces seems necessary. On the other hand, the quadratic periods do have some relevance to the old problem of determining which Riemann matrices are the period


247

matrices of Riemann surfaces. The Riemann bilinear equalities correspond to the vanishing of the cup product of any two analytic period classes, or equivalently, to the fact that the product 81 A 82 of any two Abelian differentials is cohomologous to zero. Actually, of course, the product 81 A 82 vanishes identically; this is equivalent to the fact that the differential form a = hI 82 is closed, that is, that the quadratic periods are well defined. Note that the Abelian differentials 8 = dh have simple critical points in the s~nse that h is a finite branched covering map at any zero of 8; and conversely, if 8 = dh is any closed complex-valued differential form with simple critical points on a two-dimensional differentiable manifold, the manifold has a unique complex structure for which 8 is an Abelian differential form. Now if 81> ... , 8g are any closed complex-valued differential forms with simple critical points on a two-dimensional differentiable manifold, and if 8, A 8j = 0 for all i,j, it follows immediately that all the forms determine the same complex structure; that is, that these are precisely the Abelian differential forms on some Riemann surface. Thus one is led to suspect that the period conditions beyond the Riemann bilinear relations may involve the quadratic periods; but again, some more detailed knowledge of the quadratic periods on arbitrary Riemann surfaces seems necessary before pursuing this line of investigation further. PRINCETON UNIVERSITY

REFERENCES 1. GUNNING, R. C., Lectures on Riemann Surfaces. Princeton, N.J.: Princeton University Press, 1966. 2. HALL, MARSHALL, The Theory of Groups. New York: The Macmillan Company, 1959. 3. RAUCH, H. E., "A transcendental view of the space of algebraic Riemann surfaces," Bull. Amer. Math. Soc., 71 (1969), 1-39. 4. SPANIER, EDWIN H., Algebraic Topology. New York: McGraw-Hill Book Co., Inc., 1966.

The Existence

if Complementary Series

A. W. KNAPP!

AND

E. M. STEIN2

1. Introduction Let G be a semisimple Lie group. The principal series for G consists of unitary representations induced from finite-dimensional unitary representations of a certain subgroup of G. These representations are not all mutually inequivalent, and their study begins with a study of the operators that give the various equivalences-the so-called intertwining operators. For G = SL(2, R), these operators are classical transformations. The principal series can be viewed conveniently as representations on L 2 of the line or L 2 of the circle. In the first case, the operators are given formally by scalar multiples of (Lla) and (Llb)

f(x)---+

f~oof(X

- y)(signy)lyl- 1Ht dy.

The operator (l.la) is fractional integration of the imaginary order it and is also known as a Riesz potential operator of imaginary order; for t = 0, the operator (Ll b) is the Hilbert transform. If the principal series instead is viewed on the circle, the operators are less familiar analogs of these, given formally in the case of (Lla) by (1.2)

f(8) ---+

Jor2" f(8

- '1')(1 - cos 'P)- O. It is (2.3)

A(z)f(k o) =

t

e(I-2)P!Oga(km')a(m')a- 1 (m(km'»f(kk o) dk,

where the notation on the right refers to the decomposition of G into MANN, namely km' = m(km')·a(km')·n·n;

this decomposition exists uniquely for all k not in M. In short, for (a, z) to give a representation of the complementary series, we expect that a must be equivalent with am' and that z must be real. In this case if z > 0, the inner product should be a multiple of (A(z)J, g). That is, a multiple of A(z) must be a positive Hermitian operator. A(z) is always Hermitian for z real, and it is positive if and only if its kernel is a positive-definite function. For z > 1, we can settle the question of positivity immediately. For such a value of z, the kernel vanishes at the identity and is continuous and bounded on K; since it is not identically 0, it is not positive-definite. Thus there should be no complementary series for z > 1.

THE EXISTENCE OF COMPLEMENTAR Y SERIES

253

Our approach to the question of positivity when 0 < z < involves complex methods. To begin with, z -J>o A(z)Jis analytic for Re z > 0, and, if J is smooth, we show that this function of z extends to be meromorphic in the whole plane. Denoting the new operators, defined for Re z ~ 0, by A(z) also, we shall see that z -J>o A( - z)A(z)J is meromorphic in the whole plane. For z purely imaginary and not 0, A( -z)A(z) is an intertwining operator for the unitary representation V"'z, which is irreducible by Bruhat's theorem. Thus A( -z)A(z)J = c(z)Jwith c(z) scalar for z imaginary. If we introduce a suitable normalization B(z) = y(z)-lA(z), we shall obtain B( -z)B(z)J = J for imaginary z, hence for all z by analytic continuation. Suppose B(O) is the identity. Then for B(zo) to fail to be positivedefinite for some positive zo, the equality B( -z)B(z)J = Jsays that either B(z) or B( -z) must have a singularity for some z with 0 < z < zoo Thus an investigation of the singularities of B(z) will be the key to the whole problem of the existence of complementary series associated with a.

3. The existence theorem We continue to assume that G has real-rank one. The representations Vu,z(x) of the nonunitary principal series, which was defined in Section 2, are parametrized by the finite-dimensional irreducible unitary representations a of M and by the complex number z, which corresponds to the character a -J>o ezPH(a) of A. The space H" on which V"'Z operates is a subspace of U(K) ® V" that depends on a but not on z, and the action of K, by right translation, is independent of z. The space of COO vectors for V"'Z is the subspace of Coo functions in H"; thus, it too is independent of z. We denote this subspace by Coo(a). We shall say that V"'Z is a member of the complementary series if there exists a positive-definite continuous inner product O. Vnless a is equivalent with am' and z is real, the only continuous linear operator Lon C"'(a) satisfying (3.2) is O. If a is equivalent with am' and z is real, then the continuous operators on C"'(a) satisfying (3.2) are exactly the scalar multiples of A(z). A(z) is bounded and Hermitian.

Before we pass to a study of the analyticity of A(z), let us observe that the A(z) have a common finite-dimensional resolution. Specifically, let H'b be the subspace of H" of functions that transform under K according to the equivalence class D of irreducible representations of K. H'b is finitedimensional since H" ~ U(K) ® V"' and it is independent of z. Then each A(z) maps each H'b into itself, by equation (3.4) for x in K. If f is in C"'(a), the mapping z ----';0 A(z)f is an analytic mapping of {Re z > O} into C"'(a). We shall be concerned with extending this mapping to a meromorphic function defined in the whole complex plane. It will be enough to consider the simpler function z ----';0 A(z)f(I), where I is the identity of K, provided we prove joint continuity of this function in z and! Since the singularities of the kernel (3.3) occur only for k in M, we can suppose thatfis supported near M, particularly away from M' - M.

THE EXISTENCE OF COMPLEMENTARY SERIES

255

This turns out to mean that we can transform the whole problem to a problem about the simply connected nilpotent group N. In fact, using the change-of-variables formula of [2, p. 287], we find that (3.5)

A(z)f(l) =

IN e(1-2)P

log

a(ym')a(m')a-1(m(ym')){e(l+Z)PH(YY(K(y))} dy.

The notation here is the same as in formulas (2.1) and (2.3). The ingredients of this formula are technically much simpler than those of formula (2.3), and we consider them one at a time. First we make some comments about 1'1. The restricted roots of the Lie algebra 9 of G are either 2a, a, 0, -IX, -2a or a, 0, -a. In this notation, 1'1 = exp (9-0: EEl 9-20:)' Let p = dim 9-0: and q = dim 9-20:' The group A acts on 1'1 by conjugation; geometrically this action looks like dilations, except that the 9 _ 20: directions are dilated twice as fast as the 9 - 0: directions. Now consider the first factor in the integrand of (3.5). Although it is not necessary to do so for the present problem, one can compute this factor explicitly. If y = exp (X + Y) with X E 9-0: and Y E 9-20:' then 3 (3.6)

e(l-Z)ploga(ym')

=

(te 211X114

+ 2ell YI12)-(P+2 )(1-2l/4, Q

where the norm is that induced by the Killing form of 9 and where e = (2p + 8q)-1. Put Iyl = e-ploga(ym'). Then the function (3.6) has an important property of homogeneity relative to A: if b is in A, then Ibyb-11

=

e-2PH(bllyl·

Next we consider a(m')a-1(m(ym')), which we shall denote aCYl. This is a matrix-valued function defined everywhere but at the identity and satisfying the homogeneity property a(byb- 1) = aCYl for all bin A. Finally we consider the factor e(1+2lPH(YY(K(y)). This function is a smooth function of compact support in 1'1 because f is assumed to be supported away from M' - M. The function depends on the complex parameter z but is entire in the variable z since pH(y) has no singularities. To see that (3.5) extends to be merom orphic in the whole z-plane, we choose a continuous function rp(r) of compact support on [0, (0) so that rp{lyl)f(K(Y» = f(K(Y)), we expand e(l+ZlPH(YY{K{y)) about y = identity in a finite Taylor series with remainder term, we collect the polynomial terms of the same homogeneity relative to A, we multiply both sides of the expansion by rp{lyl), and we substitute into (3.5). The terms of the expansion can be computed well enough to conclude the following: each term but the remainder has a merom orphic extension with at most one pole, that one simple and occurring at an integral multiple of z = -{p + 2q)-I, and the 3 That this explicit formula holds might be guessed from an earlier formula that S. Helgason had derived for exp { - 2pH(y)}.

256

KNAPP AND STEIN

remainder term gives a contribution analytic in a large right-half plane. Collecting these results, we have the following theorem: THEOREM 3.2. Let f be in COO(a). As a mapping into COO(a), the function z -'? A(z)f has a meromorphic extension to the whole complex plane with singularities only at the non-negative integral multiples of - (p + 2q) -1. The singularities at these points are at most simple poles. The poles can occur only at integral multiples of - 2(p + 2q)-1 if a(y) = a(m')a- 1(m(ym')) satisfies (3.7)

a(exp (-X

+

Y))

=

a(exp (X

+

Y))

for X E 9-a and Y E 9-2a. Moreover, the mapping (z,f) -'? A (z)f for z in the regular set andf in COO(a) is a continuous mapping to COO(a). REMARKS. Condition (3.7) holds for all a for the Lorentz groups SOeCn, I), the Hermitian Lorentz groups SU(n, I), and the symplectic Lorentz groups Sp(n, I), but it fails for the spin groups Spin(n, 1). In any case, the parameter z is normalized so that z = 1 corresponds to p; therefore, z = 2(p + 2q)-1 corresponds to the restricted root cx. The result for SOe(n, 1) that the only poles of A(z)fare simple and are at multiples of-cx was obtained by Schiffmann [7]. Using Theorem 3.2, we can now define A(z) for all z. To proceed further, however, we need more information about a. It is possible to show, under the additional assumptions on G that G is simple and has a faithful matrix representation, that some representation D of K, when restricted to M, contains a exactly once. 4 By the reciprocity theorem, this means that K acts irreducibly on some HZ =1= o. Fix such aD = Do, and let v(k) be a nonzero member of HZ o• Since A(z) commutes with K, we obtain (3.8)

A(z)v

=

y(z)v

for a complex-valued meromorphic function y(z). Define

B(z)

=

y(z)-1A(z).

As we shall see in Section 4, there is no complementary series associated with a near z = 0 unless the unitary representation U,,·o is irreducible. [And for G = Spin(n, 1) or SU(n, 1) there is no complementary series for any z unless this condition is satisfied.] We therefore assume now that U,,·o is irreducible. A necessary and sufficient condition for this irreducibility is given as Theorem 3 of [3]. The condition implies that y(z) does have a pole at z = 0, which implies that the operators B(z) are uniformly bounded on compact subsets of 0 ~ Re z ~ c if c is sufficiently small. The irreducibility and equation (3.4) then imply that B(O) = I. 4 Independently J. Lepowsky has obtained this result and a generalization in his thesis at the Massachusetts Institute of Technology.


257

The definition of y(z) is arranged so that B( - z)B(z)f = f for all f in C<Xl(a) and for all z. In fact, Theorem 3.3 implies that B( -z)B(z)f is merom orphic in the whole plane. But B( -z)B(z) for purely imaginary z l' 0 intertwines U"'z with itself. By Bruhat's irreducibility theorem in [I], B( - z)B(z) = c(z)I with c(z) scalar for z imaginary. Applying both sides to v, we see that c(z) = 1 for z imaginary. That is, B( -z)B(z)f = f for z imaginary. By analytic continuation, B( -z)B(z)f = ffor all z. B(z) preserves each H~, and B(O) = I. Fix D, and suppose B(zo)ID is not positive-definite for some Zo > O. Then either B(z) has a pole nearer 0, or some B(z)fhas a 0, in which case B( -z) has a pole, because B( -z)B(z) = I. The poles of B(z) (and similarly for B( -z)) arise when A(z)fhas, for some J, a pole of higher order (possibly negative) than does y(z). These are the ideas behind the main theorem: THEOREM 3.3. Suppose that G is simple, that G has a faithful matrix representation, and that dim A = I. Let a be an irreducible finite-dimensional unitary representation of M satisfying the necessary conditions above, namely, that

(i) a is equivalent with am', where m' is a member of M that is not in M,and (ii) the unitary representation U",O is irreducible. Define A(z) by (2.3) and y(z) by (3.8). Let Zo be the least number ~O such that, for some f E C <Xl (a), Z -+ A(z)f has a pole at - Zo and y(z) does not or such that y has a zero at Zo or at - zoo Then Zo > 0 and the parameters (a, z) give rise to representations of the complementary series for 0 < z < Zo with inner product <J, g) = Y(Z)-l

t

(A(z)J, g).·u dk

for f and gin C<Xl(a). It is a simple matter to see also that the parameters (a, zo) give rise to a representation of the quasi-complementary series. It can happen that this representation is the trivial representation of G. We should emphasize why the number Zo in the theorem is strictly positive. The set whose least member is Zo consists at most of the positive nonzero integral multiples of (p + 2q)-1 and the non-negative values z such that one of the merom orphic functions y(z) and y( -z) vanishes. This set is discrete, and it does not contain z = 0 because y(z) has a pole at z = o.

258

KNAPP AND STEIN

4. Further investigation of the singularities of B(Z)

For the case that a is the trivial representation of M, we can choose the eigenvector v of the A(z) to be a constant function. The associated function y(z) is closely related to Harish-Chandra's c function (see [2]), and we can obtain very explicit results as a consequence. For G of general real rank, the c function can be defined as the analytic continuation in p. to the whole complexified dual of the Lie algebra of A of the function c(p.)

=

IN e(iU+O)H(x) dx.

Let us return to the real-rank one case. PROPOSITION 4. I. Let a be the trivial representation, and choose v to be a constant function. Then the function y(z) is given by y(z) = c( -izp). Therefore r(!{p + q + I) )r(t(p + 2q)z) , (z) - 2(1' + 2q)(1 - z)f2 y r(t(p + 2q)(l + z»r(t(p + 2 + (p + 2q)z»

where p = dim g-a and q = dim g-2a' PROOF.

y(z)

=

IK e(l-z)Ologa(km') dk = IK

=

r

r

e(l-z)Ologa(k)

dk

e(l-z)ologa(k) dk = e(1-Z)Ologa(1«x))e2oH(x) dx, J~M IN the last equality following from [2] (see p. 287). If x = ank E ANK, then k = a- 1(an- 1a- 1)x E MANN. Hence a(K(x» = exp (-H(x», and we obtain y(z) = fN e(l + z)oH(x) dx, as required. The formula for y(z) then follows from [2] (see p. 303). In [4], B. Kostant obtained the existence of complementary series for a trivial and G of any real rank. For a first application of Proposition 4.1, we shall compare his results in the rank-one case with what we can prove from Theorem 3.3 and Proposition 4.1 when a is trivial. Using the explicit value of y(z) in the proposition, we see that y(z) is nonvanishing for -I < z < I and that the only poles of y(z) for -I < z ~ 0 occur at the non-negative integral multiples of -2(p + 2q)-I. If q = 0, there is a pole at every multiple of - 2p -1 less than 1. If q > 0, then p is even and a pole occurs at every multiple satisfying -(p + 2)(p + 2q)-1 < Z ~ O. By Theorem 3.2 the poles of A(z)f occur only at multiples of -2(p + 2q)-1. Thus by Theorem 3.3 there is a complementary series for if q = 0 I { (4.1) 0 < z < Zo = p + 2 'f 0 p+2q l q > .


259

Kostant has this estimate (in [4], see Section 3.1 and Theorem 10). Kostant shows further that there is no (positive-definite) complementary series to the right of the point Zo in this inequality. We can obtain this result by our method if q = 0, 1, or 3. But if q = 7, we obtain only the weaker result that there is no complementary series immediately to the right of Zoo This weaker result comes from comparing the signs near Zo of y(z) and (A(z)J,f) for anfsuch that A( -zo)fhas a pole. (Such anfexists.) We turn to other applications of Proposition 4.1. When q = 0 or I, we have Zo = 1 in (4.1). If exp {(1 - Zl)P log a(km')}a(m')a-1(m(km- 1)) is a positive-definite function and if 0 < z < Zl ;;; I, then the product with exp [{I - (1 - Zl + z)}p log a(km')] is also positive-definite. We obtain the following corollary: COROLLARY 4.2. Let G = SOe(n, 1), Spin(n, 1), or SV(n, 1). Let a be an irreducible unitary representation of M such that a is equivalent with am'. Then the positive Z such that va .• is in the quasi-complementary series form an interval with 0 as left endpoint. COROLLARY 4.3. Let G = SOe(n, 1), Spin(n, 1), or SV(n, 1). Let a be an irreducible unitary representation of M such that a is equivalent with am' and such that va.O is reducible. Then there is no positive z such that va .• is in the quasi-complementary series. For the proof of the second corollary, it is possible to use Theorems 1 and 3 of [3] to show from the reducibility of va.O that B(O) is unitary and not scalar. But if va·· o is in the quasi-complementary series, A(z) is semidefinite for 0 < z < Zo, by Corollary 4.2; this fact implies that B(O) is semidefinite, which is a contradiction. THE INSTITUTE FOR ADVANCED STUDY PRINCETON UNIVERSITY

REFERENCES 1. BRUHAT, F., "Sur les representations induites des groups de Lie," Bull. Soc. Math. France, 84 (1956), 97-205. 2. RARISH-CHANDRA, "Spherical functions on a semisimple Lie group, I," Amer. J. Math., 80 (1958), 241-3lO. 3. KNAPP, A. W., and E. M. STEIN, "Singular integrals and the principal series," Proc. Nat. Acad. Sci., 63 (1969),281-284. 4. KOSTANT, B., "On the existence and irreducibility of certain series of representations," Bull Amer. Math. Soc., 75 (1969),627-642. 5. KUNZE, R., Analytic Continuation of Intertwining Operators: Construction of Complementary Series for Complex Semis imp Ie Groups (unpublished manuscript). 6. KUNZE, R., and E. M. STEIN, "Uniformly bounded representations, III," Amer. J. Math., 89 (1967), 385-442. 7. SCHIFFMANN, G., "Integrales d'entrelacement: cas des groupes de Lorentz," C. R. Acad. Sci. Paris, Serie A, 266 (1968), 859-861.

Some Recent Developments in the Theory

oj Singular

Perturbations

P. A. LAGERSTROMl 1. Historical introduction In the nineteenth century celestial mechanics played an essential role in developing perturbation methods and asymptotic theory. This work culminated in Poincare's great treatise, Les Methodes Nouvelles de la Mechanique Celeste. In the twentieth century fluid mechanics has played a somewhat similar role. The present paper attempts to draw mathematicians' attention to some important ideas in the theory of singular perturbations developed recently by workers in applied fields. The ideas will be illustrated by simple model equations; the references to fluid mechanics are merely historical and are not needed for the mathematical discussion which starts in Section 2. We first give a brief historical sketch. The equations of motion for stationary viscous flow may be written (in suitable nondimensional variables )2 (1.1 a)

ex

i

j= 1

i

(l.lb) where the (I.Ic)

Vj aVk aXj

aVj j=1 aXj Vj

= ~ a2v~ j= 1

aX j

_

ap, aXk

=0 ,

are the velocity components, p is the pressure, and ex

length x velocity = R eyno ld s num ber = .. VISCOSIty

In a typical problem one wants the solution outside a closed (n - 1)dimensional surface S. Typical boundary conditions are then (1.2a)

on S:

(1.2b)

at infinity:

1 2

Vj = VI

0,

= I;

Vj

= 0 for j =I- 1.

Work on this paper was supported by NSF Grant No. GP 9335. For details concerning fluid mechanics, see [3]. 261

262

P. A. LAGERSTROM

In 1850 Stokes treated the case of small a by simply putting a = 0 in (l.la). While this worked for n = 3, it was found that for n = 2 any nonzero solution grows logarithmically at infinity, thus violating (1.2b). This is the famous Stokes paradox. It was later found that an attempt to find a second approximation for n = 3 also led to logarithmic divergence (Whitehead's paradox). The contribution to asymptotic theory was entirely of a negative nature. It was the problem of large values of a which led to significant progress in asymptotic methods. In 1904 Prandtl introduced his famous boundarylayer theory to find approximate solutions for a large. His method of using different length-scales and different approximations in different regions of space has had a great although belated influence on what is now known as the theory of singular perturbations. In the 1950's the range of application of Prandtl's technique was extended to many fields of applied mathematics; at the same time efforts were made to understand and enlarge the technique by fitting it into some orderly asymptotic scheme. The ideas advanced by Saul Kaplun 3 about 1955 were crucial. Kaplun made a very profound analysis of the ideas underlying the Prandtl technique (developed for large a) and showed that the same ideas may be used to get asymptotic solutions of the same equation with small a. This not only essentially solved the Stokes paradox (after more than a hundred years) and the Whitehead paradox but also greatly increased the understanding of singular perturbations and advanced the technique for handling them. Concepts and techniques such as "matching," "overlap," and "intermediate limit," which are now in standard use, are due to Kaplun. Two recent books on perturbation methods ([I] and [4]) give abundant evidence of the increase in techniques (Prandtl-Kaplun technique as well as many others) and the extension of the range of applications. But while mathematicians have given the older Prandtl technique some mathematical respectability by rigorously proving its correctness in special cases (see, for example, [5]), the more recent and much deeper Kaplun ideas are practically unknown among pure mathematicians. The first model equation which follows is an illustration of Prandtl's technique (the parameter E corresponds to a- 1 / 2 in Prandtl's case). The second model example illustrates some of the important ideas introduced by Kaplun, in particular his method of clarifying the Stokes paradox. 2. Posing the problem: Prandtl's method

In mathematical physics one frequently encounters the problem of constructing an asymptotic expansion of a function, with the requirement that 3

The late Saul Kaplun's work is collected in [2].

THEORY OF SINGULAR PERTURBATIONS

263

it be uniformly valid over a given region in space and time as certain parameters tend to limiting values. Thus (restricting ourselves to one coordinate and one parameter) we seek to construct functionsfk(x, t-} such that, for a givenf(x, ,,), n

L

f(x, ,,) -

lim

(2.1)

fk(X, ,,) =

k=O

fLn(")

0

,

as " tends to zero; the limit is required to be uniform for all values of x which lie in a given closed set. The fLn(") are gauge functions which form an asymptotic sequence

(2.2) as " tends to zero. The physical situation may be such that" is always real and positive. In fact, in the type of problem to be discussed here f(x, ,,) often has an essential singularity in " at" = 0; hence the path of approach to zero must be specified. We shall always assume that "lim" means "limit as " tends to zero through positive values." In a typical situation the functionf(x, ,,) is defined implicitly by a differential equation and boundary conditions. The fk are then found by solving approximating equations. In order to find these equations one normally makes specific assumptions about how" occurs in thefk. (We shall see at the end of Section 4 that in a certain sense this procedure should be reversed.) The simplest assumption is, of course,

(2.3) (in particular fLk may be "k). A problem for which this assumption works is called a regular perturbation problem. A trivial example illustrates how a regular perturbation method may fail. Let f be defined by (2.4)

"f"

+ f' =

a;

f(O) = 0;

where a = constant #- I. Puttingf '"

L

f(l) = 1,

"kgk(X) we find that the equation

k=O

for go is g~ =

(2.5)

a.

Only one boundary condition may be imposed. Standard methods show that (since" > 0) we should retain the condition at x = 1. Hence (2.6)

go

=

ax

+ (l

- a).

Since a#-1 this approximation is not uniformly valid in [0, 1]. (Such a solution is usually called an outer solution.) In some sense, to be discussed later, it is valid if we exclude the origin. A solution valid near the origin

264

P. A. LAGERSTROM

(called an inner solution) may be obtained by introducing a new variable (a "stretched coordinate" or "inner variable"), defined by (2.7)

£X

= x.

Putting (2.8)

f(x, £) = I(x, £),

we seek an expansion (2.9)

1,..,

L: £khk(x).

k=O

The equation for ho is then (2.10) We retain the boundary condition at x

= 0 which gives

(2.11) The constant A cannot be determined by the boundary condition at x = 1 since we expect ho to be valid only near the origin. Instead we determine A by matching the inner solution ho with the outer solution go, using the matching condition (2.12) which gives (2.13)

A

=

1 - a.

Condition (2.12) may seem surprising; for the time being we note only that a comparison with the exact solution shows that it is correct in the present case. Although the example is trivial, it illustrates a situation typical for a singular perturbation problem: one constructs (at least!) two asymptotic expansions valid in different regions. They are not constructed independently; in order to determine the terms of both expansions one needs to compare them with the aid of matching principles. The main purpose of this article is to point out the underlying ideas which lead to a matching condition of the type (2.12). These ideas also show how to do the matching correctly in cases for which a simple matching condition of the type (2.12) does not hold. As a preliminary, the concepts of valid "near the origin" or "away from the origin" have to be defined. The basic reference for the following sections is [2].


265

In passing we note that by adding the approximations go and ho and then subtracting their common part, one obtains an approximation fa which is uniformly valid in [0, I]:

fo(x, €) = (I - a){l - e- XIE )

(2.14)

Actually it is valid to all orders

En,

+ ax.

the mistake being of order e -liE.

3. The domain of validity: overlap matching ORDER SPACES. Comparison with the exact solution, or with (2.14), shows that go is a uniformly valid approximation in an interval [xo, 1], for any Xo > o. This, however, is insufficient for matching. We need in a sense its maximal domain of uniform validity. As is easily seen, go is uniformly valid in any interval [-1)(€), I], where 1](€) is >0 for € > 0 and € = o(1](€)). Thus we may characterize the domain of validity by a function class. The class of functions ~(€), defined on some open interval 0 < € < EO, which are positive and continuous will be denoted by ff. Only the behavior of ~ as € tends to zero will matter; hence we may group the elements of § into equivalence classes. The relation between ~ and 1],

~ tends to a

(3.1)

1]

nonzero finite limit,

is an equivalence relation; the corresponding class, called an o-c1ass, is denoted by "ord 1]." We may make a broader identification, into O-c1asses, by the equivalence relation: (3.2)

There exist constants such that 0 < C l
0 < C2 for 0 < € < EO.

C l , C2

~/1]

The corresponding class is denoted by "Ord 1]" and the space of such classes will be denoted by~. A partial ordering may be introduced. Let Z and Y be two classes. Then (3.3)

Z < Y

means

~ =

0(1])

for

~

in Z and 1] in Y.

(The same relation could be defined for o-c1asses.) In the obvious sense we may define intervals; (Z, Y], [Z, Y], etc. The space of O-c1asses (or of o-c1asses) has an interesting topology, with open intervals as basic neighborhoods, which does not seem to have been sufficiently explored. DOMAIN OF VALIDITY. Let "f/ be a set of O-c1asses and let F("f/) be the set of all elements of § whose O-c1asses are in ''f/. Denoting the difference

266

P. A. LAGERSTROM

I/(x, ~) - g(X, 10)1 by W(X, E) we say that g is an approximation to 1 uniformly valid to order unity on the domain "I'" if " 'YJ in F("I'") implies that w(x, E) tends to zero uniformly on the interval 4 '(E) ~ x ~ 'YJ(E);

(3.4)

"I'" is then called a domain of validity of the approximation g. To give examples of domains of validity we return to the functions discussed in Section 2. By comparing go with the exact solution [for which we may substitute/o as defined by (2.14)], we see that the half-open interval (Ord E, Ord I] is a set of validity of go. It is in fact the maximal set of validity if we assume that go and 1 are defined only for 0 ~ x ~ I; similarly [Ord 10, Ord I) is a set of validity of ho. OVERLAP MATCHING. We notice that the domains of validity of go and ho have a common part, namely the open O-interval

oF = (Ord E, Ord I).

(3.5)

It is the existence of a domain oloverlap which makes matching possible. If there is an O-set where both go and ho are valid,5 then on this set Igo - hoi tends to zero with E. This may be conveniently formalized with the aid of a limit process. We define xn by (3.6) and the 'YJ-limit of a function a(x, E) as lim a(x, E) = lim a(x, E)

(3.7)

n

as E

t

0, xn fixed.

In other words lim means that we approach E = 0 along the curves 71

xn = constant in the (x, E)-plane. It is clear that if b(x, E) and c(x, E) are two approximations to d(x, E) which have an overlap domain 2), then (3.8)

Ord'YJ in

2)

implies

lim n

Ib - cl =

O.

Such a limit is called an intermediate limit. We shall consider (3.8) as the fundamental matching principle. For the example in Section 1 we find

go (x) - ho(x) = (a'YJxn + 1 - a) - A(l - e-nX.'E). Applying lim for any 'YJ such that Ord 'YJ is in the overlap domain we find 71

(1 - a) - A = 0, 4 When such an interval is empty we say that w = 0 for all x in this interval. Also, one should of course consider only the intersection of the interval and the domain of definition of w(x, E). 5 Throughout this paper we restrict ourselves to the lowest order and assume 1'0(-) = 1.

267


which is nothing but the matching principle (2.12). However, the validity of (2.12) is rather restricted; it is a special case of the basic matching principle (3.8). The clarification of the nature of matching has proved of great importance in asymptotic constructions. However, we are still faced with the problem how to determine the domain of validity a priori. This will now be discussed with the aid of a second model equation. 4. Maximal domains of validity. Extension theorem. Stokes paradox STRUCTURE OF DOMAINS OF VALIDITY. We first discuss some properties of domains of validity which are independent of the special nature of the functionj(x, E). Let l ' be a domain of validity. We shall see how it may be enlarged. It may be easily proved that if ~ and 't'; are domains with an O-class in common, then the union of ~ and 't'; is a domain of validity. Thus one may form maximal domains of validity. In practical cases there will generally be one such maximal domain. A maximal domain l ' has the obvious property that if X and Z are in l ' and X < Y < Z then Y is in "Y. Various other properties may be found although the intrinsic characterization of ~ maximal domain of validity remains an unsolved problem. (For comments and speculation, see [2], Chapter VI.) One property, however, will later be of essential use; it is expressed by Kaplun's extension theorem (see [2]): (4.1)

Let l ' be a maximal domain of validity containing the O-class Y. Then there exists a class X in l ' such that X < Y.

PROOF. Assume first that Y = Ord 1. Then, by assumption, given a constant C > 0 and a 0 > 0, one may find an £6 > 0 such that w(x, £) < 0 for C ~ x ~ 1, 0 < £ ~ £6. The trick of the proof is to use this property for a sequence of constants Cn and a sequence On> both tending to zero. One may, for instance, put On = Cn = !. Since all constants! belong to Ord (1)

n

we may for each n pick an

£n

n such that w < ! for ! ~ x n n

We now define a function 1](£) by the requirement 1](£n+1)

~

1; 0
O. k=I •...

,n

Otherwise fJ(gl) = ... = fJ(gn) = 0 and the sequence [1;] defined by ft = gk for i == k (mod n) fJ-converges to O. But then (i) would imply fJ(f) = 0, contradicting our assumption. Thus

~f E S.

~ fJ(f) =

Hence

fJa f ) ;;; c,

which gives (4.4). Thus (i) implies (ii). Let (ii) hold. Now (ii) with n = 1 in (4.4) is simply (ii) of Theorem 2. Hence fJ is equivalent to a Riesz seminorm. Thus we may assume without loss of generality that fJ is a Riesz seminorm. Givenfin L denote by G, any finite subset {gh ... , gn} of L + such that If I ;;; gl v ... V gn' Define (4.5) From (4.5) and (4.4) we conclude that fJ ;;; ca. For G, consisting of the singletonf(4.5) implies a ;;; fJ. We contend that a is an M-seminorm. (3.1), (3.2), (3.4), and (4.3) clearly hold for a. To obtain (3.5), simply note that gl V ... V gn ~ Ig I implies gl V •.• V gn ~ If I whenever If I ;;; Ig I· To obtain (3.3), consider any G, = {ft};= 1, .,m and Gg = {gj}j=l,oo.,n' Then If + gl ;;; If I + Igl ;;; V j,j (ft + gj). Thus by (4.5) 00

a(f + g) ;;; Max fJ(ft + gj) ;;; Max fJ(ft) + fJ(gj) i,;

i,;

=

Max fJ(ft) + Max fJ(gj). i

i

Taking infimums on the right over all Gr and Gg , we obtain (3.3) for a from (4.5). That (iii) implies (i) is clear.

SOLOMON LEADER

282

In connection with the proof of Theorem 4 we remark that for any seminorm 13 on a lattice group L there exists a largest M-seminorm such that 13M ~ 13, namely

13M(f) = Inf Max f3(g)

(4.6)

G

geG

whereGisanyfinitesubset{gh ... ,gn}ofLsuchthatlgll V"'V Ignl ~ If I· Note also that if 13 is a Riesz seminorm then (4.4) in Theorem 4 can be replaced by f3(Ul V ..• V un) ~ c Max f3(Ui) for all finite subsets I=l ..... n

{Ul"'" un} of L+. THEOREM 5. In any Banach lattice norm super convergence has a monotone base. PROOF. Let [fn] be norm super convergent to 0 in the Banach lattice L under the norm 13. By Proposition 9 and the completeness of Lunder 13 there exists for each n some Un which is the f3-limit of Ifni V Ifn+11 V ••. V Ifn+kl as k ~ 00. Then Ifni V •.. V Ifn+kl ~ Un by (ii) of Proposition 4. Therefore Un = Ifni V Un+1 by Proposition 2 and (ii) of Proposition 1. Thus Un+1 ~ Un' Now in the notation of (4.2), f3(u n) = lim a(n, k) by k ... 00

continuity of norm. Hence f3(u n) ~ 0 by (4.1) of Proposition 6. COROLLARY 5(a). order convergence. PROOF.

In any Banach lattice norm super convergence implies

Apply Theorem 5 and (iv) of Proposition 4.

COROLLARY 5(b).

In any Banach lattice the following are equivalent:

(i) Norm super convergence equals order convergence. (ii) The norm is monotone-continuous with respect to order convergence: If Un -l.(!i 0 then f3(u n) -l. O.

PROOF.

Apply Theorem 5 and Corollary 5(a).

COROLLARY 5(c). M-norm if and only PROOF.

The norm in a Banach lattice is equivalent to an

if norm convergence has a monotone base.

Apply Theorems 4 and 5.

COROLLARY 5(d).

In any Banach lattice the following are equivalent:

(i) Norm convergence equals order convergence. (ii) The norm is equivalent to a monotone-continuous M-norm. PROOF.

Apply Corollaries 5(b) and 5(c).

Given a Banach lattice L let % be norm convergence, gence, and tfI relative uniform convergence. Clearly

(4.7)

(!)

order conver-

SEQUENTIAL CONVERGENCE IN LATTICE GROUPS

283

Now %. s; @ by Corollary 5(a). Starring and applying Proposition 8 we obtain .AI" = .Al"s* s; @*. Operating on this with d we obtain .AI" = .Al"d S; @*d which, together with (4.7), yields.Al" S; Ol/*. On the other hand Ol/ S; .AI". Thus Ol/* S; .AI"* =.AI". Thus we obtain the basic result .AI" = Ol/* of G. Birkhoff [I] for Banach lattices. 5. L-seminorms A Riesz seminorm {3 on a lattice group L is an L-seminorm if (see [2], [8])

(5.1)

(3U+g)={3U)+{3(g) 6.

THEOREM

forallf,ginL+.

For (3 a seminorm on a vector lattice L the following are

equivalent:

(i) Given sequences Un] and [g,cl in L and a strictly increasing sequence [nk] of positive integers such that (3(gk) ----»- 0 and L: Ifni ~ Igkl, then

L:

lim

k-+oo nk~n 0 choose k large enough so that a(j; - fj)
n(x)} is a sequence of Coo radial functions having compact support in R2 - {O}, and converging, except at the origin, to the indicator function of the unit disk, in the manner indicated cross-sectionally in Fig. I. Define ¢>n,r(x) =

¢>n(~)'

and let ¢>r(x) be the indicator function of the

disk Ixl < r. Then ¢>n.r(x) ~ ¢>r(x) monotonically, except at the origin. Define Nn.r(g) = L ¢>n.r{g(p)}. Then it follows from Beppo Levi's PEP theorem that

But by (I),

r

JGIr'

(2)

INn •r (g)12 dg

=

(2i~(2))-1

.

r

L"'n.,(2 - 2s)L"'n.,(2s) ds

JRes=c

r

+ (2I{(2))-1 JRes=c

~(2

- 2s) ~(2s) L$n,,(2 - 2s)L"'n.,(2s) ds.

Now L", (z) is an entire function of z, inasmuch as ¢>n rex) is COO with compact s~pport in R2 - {O}. Moreover, repeated integration by parts

} W1 Fig. 1

A GROUP-THEORETIC LATTICE-POINT PROBLEM

293

shows that Ln)z) is of rapid decrease in any vertical strip. The Fourier transform of - -oS, and '(0) = -1, the residue of

any strip

=

1~

I tends to

1,

'TTr4

n(2)" Thus, by moving the line of integration to

it follows from (2) that

/Nn.r(g)1 2 dg

=

Cn + (2i,(2))-1

+ (2m2))-1

r

JRes=1/2

r

JRes=1/2

Ln,r(2 - 2s)Ln,r(2s)ds

n~(; ~s) L,pn./2 -

2s)Ln)2s) ds,

S

where C n -i>- (,(2))-2'TT 2r 4. Now by an exponential change of coordinates, the Plancherel theorem for R\ and the fact that 2s = 2 - 2s on Re s = 1, we find that in general

fRes=1/2lLl2sWldsl (3)

= = =

tes=1/21Ll2 - 2sWIdsi

L'" If(p)12p dp 2 r If(x)/2 dx. JR2 4'TT

BURTON RANDOL

294

From this it is evident that in the sense of L2 convergence on the line Re s =

-t,

L"'n.,(2s) ~ L",,(2s)

r =s' 2s

=

and L"'n.,(2 - 2s) ~ L",,(2 - 2s)

2-2s

~ _ s' Moreover, since ¢n.,(x) ~ ¢,(x) in L2(R2), it follows in a similar way that Lq, (2 - 2s) and L$ (2 - 2s) are L2 on Re s = -t, and that L$n,,(2 - 2s)' ~ L$,(2 - 2s) inn'the sense of L2 convergence on Re s = -t. This implies that

r

JG/r'

INr (g)i2 dg = (~(2»-27T2r4 + (2i~(2»-1 + (21~(2» .

i

-1

r (lr~ ) ds n2~(2) - 2s) L$,(2 - 2s) -r

JRes=1/2S

Res=1/2

S

S

2s

S

ds.

Now

so

r

(~(2»-27T2r4 + (~(2»-17Tr2

INr(g)12 dg =

JG/r'

+ (4~(2»

-1

~(l - it)

fOO

r _ ~(1 00

+ it) Lq,,(1

.

- It) 1

rit

+ it dt.

But Lq,,(z) = r 2- zL$1(z), and the mean of Nr(g) is m2»-17Tr 2, so we conclude that

Now for any positive integer N, •

~(1 ± It) =

2: n 1 N

Hit

n=1

+ (1 ±

. If)

i

oo

N

[x] - x

x

+t

2±it

dx

N±lt

+ --. ± It -

N±lt

2N

(see [5], 3.5.3), and by differentiating this formula k times, and then setting N = [t] in the result, we find that ~(k)(l ± it) = O(logk+1 t). This, combined with the fact that {W + it)} -1 = O(log7 t) (see [5], 3.6.5), shows that any fixed derivative of

ig +- ~t~

-1 1 . is in L2( -00, (0). It + It On the other hand, ¢1(X) = Ixl- 1J 1(27Tlxi) = O(lxl- 3/2 ), and since

; : {L$l(l - it)}

=

(-i)kLil - it), where g(x) = ¢1(X) logk lxi, it is

A GROUP-THEORETIC LATTICE-POINT PROBLEM

evident that g(x) is in L2(R2), and hence, by (3), any derivative of 4 is in U( -00, (0). It follows that we can integrate the expression (4~(2»

f

ex>

-lr 2

_ ex>

W - it) W + it) 4 10 -

295

,0 -

it)

r21t it) 1

+ it dt

by parts, writing r21! = e2ltlOgr, and this shows that for any integer k, + O(r2 log-k r).

v(r) = a R 2, R 3. These triangles form a non-Euclidean plastering or tesselation of the interior of the unit circle. The union of t and its image under R2 is a fundamental domain (with suitable conventions about edges) for the triangle group (2, 3, 7), which is the group generated by T = R3Rl' V = RaR2' U = R2Rl with the relations T7 = V3 = U2 = T-IVU = 1. T, V, U are, respectively, non-Euclidean rotations of angles 2'TT/7, 2'TT/3, 'TT counterclockwise about the vertices of t belonging to the angles 'TT/7, 'TT/3, 'TT/2. Now the uniformizing Fuchsian group of S is a normal subgroup N c (2, 3, 7) of index 168. The quotient group r 168 = (2, 3, 7)/N is Klein's simple group of automorphisms of S and is obtained from (2, 3, 7) by imposing the relation (UT-4)4 = I. A convenient fundamental domain for N is the circular arc (non-Euclidean) 14-gon d shown in Fig. 1. It will be noticed that t appears as the un shaded triangle immediately below and to the right of po. There are 168 un shaded triangles, which are the images of t under r 168 , in d and 168 shaded triangles, which are the images of t under anticonformal elements of (2,3, 7)'. We let r~68' the extended Klein group, be the group of 336 proper and improper (sense reversing) non-Euclidean motions obtained by adjoining, say, Rl to r 168. Then d can be viewed either as the union of 336 shaded and unshaded triangles consisting of t and its transforms by r~68 or as the union of 168 double triangles (one unshaded, one shaded) consisting of the union of t and its image under Rl [thus a fundamental domain for (2, 3, 7)] and the transforms of this by r 168. If the sides of d are labeled as in Fig. 1, then, Klein has given the side identifications induced by N in Table 1. d with those identifications is Klein's model of his surface S. TABLE

1

1~6 3~8 5~10

7~ 9~

11

12 14

~2

13~4

300

RAUCH AND LEWI1TES

p

Fig.l

THE RIEMANN SURFACE OF KLEIN

301

One sees that each odd-numbered side is identified with the side 5 units around counterclockwise. We need to label certain points in tl for the sequel. The 14 external vertices, those in which pairs of sides meet, are identified under N in two sets of 7, which alternate. We have exhibited the labels of two successive vertices in Fig. 1 as P and Q. The remainder we think of as labeled P, Q, P, etc., in counterclockwise order. P and Q are 7-vertices (after identification), i.e., vertices where 7 double triangles meet. There are 22 other 7-vertices of which 7 have two representatives each on tl. We label them as follows. po is the center of tl (the origin). Two units (triangles) out from po is a layer of7 which we label PI, Pi, etc., counterclockwise, where the first two are actually shown in Fig. 1. Two units farther out is another layer of 7, Pr, P~, etc., counterclockwise, where again only the first two are shown. Finally the 14 vertices (representing 7 when identified) Pr, Pg, etc., counterclockwise, are on the sides of tl, the subscript denoting the side on which the vertex lies, and again only the first two are exhibited. In addition, it is necessary to label certain 2-vertices (4-vertices under identification) on the sides of tl. There are two on each side; we call them Xl' X{, etc., the subscript denoting the side and the unprimed point lying closer to P. The first two are shown in Fig. 1. We call attention to the fact that Llo L 3 , L2 appear in Fig. 1 as the two diameters and the circular arc, parts of which form the boundary of t. 3. The homology basis on S We now use a standard technique in combinatorial surface topology (see, for example, [20]) to compute a canonical homology basis for S. We make each side of tl into an oriented path by assigning to it the counterclockwise direction as the positive orientation. We introduce the surface symbol 123451-173-195-12-17-14-19-1 deduced from Fig. 1 and Table 1. We observe that 1 and 2 are linked. Write B = 2345 or 2- 1 = 345B- 1 • One obtains I B 1- 1 73- 1 95- 1 345B- 1 7- 1 4- 1 9- 1. From B to B- 1 one has A-I = 1- 1 73- 1 95- 1 345 or 1 = 73- 1 95- 1 345A. One obtains 73- 1 95- 1 345 [A, B] 7- 1 4- 1 9- 1 , where [A, B] = ABA- 1 B- 1 • Observing that 7 and 9 are linked we write C-1 = 3- 1 95- 1 345 [A, B] or 9 = 3C- 1 [B, A] 5- 1 4- 1 3- 1 5 to obtain C-1 7- 1 4- 1 5- 1 345 [A, B] C 3- 1 7. Set D = 3- 1 7 or 7 = D- 1 3- 1 to obtain 3- 1 4- 1 5- 1 345 [A, B] [C, D]. 3 and 5 are linked, and so we set E- 1 = 45 [A, B] [C, D] or 4- 1 = 5 [A, B] [C, D] E to obtain 3- 1 5 [A, B] [C, D] E 5- 1 3 E- 1 • Thus, finally, setting F = 5- 1 3, the surface symbol reduces to [A, B] [C, D] [E, F], which is the desired canonical form.

RAUCH AND LEWITTES

302

Summarizing, we have A = 5-14-13-159-137-11 B = 2345 C = [B, A] 5- 1 4- 1 3- 1 59- 1 3 D = 3- 1 7 E = [D, C] [B, A] 5- 1 4- 1 F = 5- 1 3 If we look at Fig. 1, we find that all six paths are closed and begin and end at P. Hence, when we Abelianize paths to obtain chains, the six paths become the cycles of a canonical homology basis. For any oriented path M on a let a(M) denote the chain obtained by Abelianizing. Then we have TABLE

2

= a(A) = 1 - 4 - 7 - 9 = a(C) = -4 - 9 Y3 = aCE) = -4 - 5 81 = a(B) = 2 + 3 + 4 + 5 82 = a(D) = -3 + 7

Y1 Y2

83 = a(F) = 3 - 5, where KI(Yi> Yi) = KI(8 h 8j ) = 0, KI(y" 8j ) = 8ii , i,j = 1,2,3, KI(y, 8) being the intersection number of y and 8. 4. Action of r~68 and

r 168

on homology basis

We restrict ourselves to the action of R 1, R 2 , R 3, the generators of (2, 3,7), and a/ortiori r~66' One sees immediately the symmetry of a under R1 and R 3 , and their action is analyzed in a few strokes. Reference to Fig. 1 gives the following table of the actions of R1 and R3 on the sides of a. TABLE

R 1:

1-+-14 = 9 2-+-13=4 3 -+-12 = 7 4-+-11 = 2 5-+-10 = 5 7-+-8=3 9-+-6= 1

3 R3:

1-+-2 2-+-1 3 -+-14 = 9 4-+-13 = 4 5-+-12 = 7 7-+-10=5 9-+-8=3


303

From Tables 2 and 3 one can read off the actions of Rl and R3 on the homology basis of Section 3. TABLE

R1 :

4

Yl ~-3 - 2 - 1 + 9 = -Yl - 81 - 82 - 83 Y2 ~ -1 - 2 = -Yl + Y2 - Y3 - 81 - 82 Y3 ~ - 5 - 2 = -Y3 - 81 + 83

81 ~ 4

+ 7 + 2 + 5 = 81 + 82 + 3 = -82 5 + 7 = 82 + 83

82~-7

83 R3:

~

-

Yl ~ - 5 - 4 - 3 - 2 = - 81 Y2 ~ - 3 - 4 = Y3 - 83 Y3 ~ - 7 - 4 = Y3 - 82 - 83 81 ~ -1 + 9 + 4 + 7 = -Yl 82 ~ - 9 + 5 = Y2 - Y3 83 ~ - 7 + 9 = - Y2 + Y3 - 82

83

-

Now 11 is not symmetric under R2 although S is. The difficult part of this paper is, therefore, to compute the action of R 2 • The difficulty is due to a certain complexity and tediousness in tracing the images of certain chains under R2 and then reducing them homologically to chains on the boundary of 11. We now write a subdivided surface symbol for 11: abe d efg h ij a-I k I m f - l e- 1 d- 1 nr 1 i - I e- 1 b- 1 m- 1 1-1 k- 1 1 g-1 n-l, where the subsymbols are explained by

r

TABLE

5

a = P Q = I, b = QX2, e = X 2P, be = 2, d = PN, e = P:X~, f= X~Q, def= 3, g = QX4 , h = X 4 P, gh = 4, i = PPl, j = PlQ, ij = 5, a-I = QP = 6, k = PP~, I = P~X;, m = X;Q, kim = 7, f- 1 = QX~, e- 1 = X~Pl, d- 1 = PlP, f- 1e- 1d- 1 = 8 = 3-\ n = PQ = 9, j - l = QP~o, i - I = P?oP, j- 1 i - 1 = 10 = 5-\ k- 1 = P?2P, 1 1 1 m- / - k= 12 = 7-\ h- 1 = PXI3 , g-1 = X I3 Q, 1 1 h- g- = 13 = 4-\ n- 1 = QP = 14 = 9- 1 In Table 5 each ordered pair of points denotes the oriented circular arc joining them. Table 6 is constructed in the following way: the image of the arc in question is found by marking its end points and appropriately chosen (not unique) circular arcs ,\ joining them to L 2 , if possible, and observing how many units, i.e., edges of triangles, the points are from L 2 , and then marking the arcs corresponding to ,\ under R2 and going out on the other side

RAUCH AND LEWIITES

304

of L2 the same number of units. In the course of doing so it often becomes necessary to use the identification of sides of ~ in Table 1 to re-enter ~ because of its lack of symmetry with respect to R 2 • The interested reader will be able with perseverance and patience to verify the entries. TABLE

R 2:

6

1 ---+ NPg 2 ---+ Pgx~ + X~P~ 3 ---+P~Pl + PlX2 + XllNo - 13 = 4 ---+ PrOX{2 + x;Pi 5 ---+ NQ7 + Q5Pg - 12 = 7 ---+ NN + Pl2X 1a + x 4 Pg - 14 = 9 ---+ PiPlo,

where Q5, Q7 denote the representatives of Q on sides 5 and 7. From Tables 2, 5, and 6 and by the use of Fig. 1 to replace certain chains by homologous chains we have constructed Table 7 giving the action of R2 on the homology basis. To avoid excessive length of the paper we give all the details for the first entry only, leaving the verification of the remainder again to the interested reader's perseverance and patience. TABLE

7

R 2: Yl ---+ N X 4 + X 1a N2 + P?Pi + Pi X; + PloN + P~Pg '" -i - h + h + k + 1+ m + 2 + i + j - 9 + 3 - 7 + 1 - j = 2 - 9 + 3 + 1 = Yl + a 1 + a 2 + aa Y2 ---+ ProN + Pr X; + X{2PrO '" j - 9 - 8 - m + m - 11 + i = 5 - 9 + 3 + 2 = Y2 + a1 Ya ---+ Q7Pi + Prx; + X{2No + PgQ5 '" -m + i - 11 + m + j = 5 + 2 = Ya + a 1 - aa a1 ---+ Pg X~ + X~Pr + PrPl + P3X2 + XllNo + PrOX{2 + x;Pi + PiQ7 + Q5Pg '" -i - 4 - f - e - c - d + c + i - i + 11 - m + m - j = - 2 - 3 - 4 - 5 = - a1 a2 ---+ PrOXll + X2Pl + PlPi + NN + Pr2 X la + x 4 Pg '" -i - c + c + d - 1- m + f + e - k - h + h + i = 3 - 7 = -a2 aa ---+ Q7Pi + NPl + NX2 + XllNo + NQ5 '" - f - e - d - c + c + i + j = - 3 + 5 = - aa

From Tables 2 and 6 we see that R2 takes Yl = 1 - 4 - 7 - 9 into PlX4 + X 1a N2 + NPi + P'fX; + X{2No + NoPi + PiN, where we have reversed the order of some pairs to have all positive signs and


305

rearranged terms so that, with identifications, the terms fall in natural order, end-to-end. We next observe from Fig. 1 and Table 6 that P~ X 4 '" P~P + P X 4 = - i - h, X13N2 '" X 13N2 '" X 13P + PPr2 = h + k, P~F't + Pi x; '" P~ x; = I, X{2No '" X{2Q - 11 + PPro = m + 2 + i, PfoF't + P'tP~ '" p{oQ - 9 - 8 - 7 - 6 - PlQ = j - 9 + 3 - 7 + 1 - j. Hence R2 Yl '" - i - h + h + k + 1+ m + 2 + i + j - 9 + 3 - 7 + 1 - j =

k+l+m+2-9+3-7+I=7+2-9+3-7+I=2-9+3+ 1 = Yl + 81 + 82 + 83 , In Table 7 '" means "homologous to." Using Tables 4 and 7 to construct matrix representations Jl1 , Jl2 , Jl3 of the actions of Rh R 2, R3 on the homology basis of Table 2 we now write down by composition the matrix representations Jli 1, Jlv , JI u of the generators T-l = R 1R 3, V = R3R2' U = R2Rl of r 168 (we prefer for technical reasons to use T- 1 instead of T as a generator). Observe that, consistent with the practice of [17], Lemma 8 and Proposition 9 (note the corrections in the reference in the bibliography), R 1 R 3 , for example, will be represented by Jl3J11 , i.e., the reverse order of the geometric transformations.

(1)

;;-1 _ v'l'tT -

o o o 1

o o o o

-1

1

1

-1

-1

-1

-1

-1

-1

-1

-1

0

1

1

o o

-1

-1

1

0

-1

-1

o o o o

-1

1

0

0-1

-1

o o

-1

1

o

0

0

0

1

0

0

o o

-1

1

o o o o o o

o o

o o

o o o

o o o o

-1

(2)

-1

1-1 0

-1

(3)

o o o

0

o o o o

0 -1

o o o o

-1

o o o

-1

-1

o o

-1

0 -1

RAUCH AND LEWI1TES

306

5. Computation of the normal period matrix We can now use the results of Section 4 to compute the period matrix 7T over 151> 15 2 , 15 3 of the normal Abelian integrals of the first kind on S with respect to the canonical homology basis Yl, Y2, Y3; 151> 15 2, 15 3 of Section 3 (see [17], p. 12 if.). The technique is to observe that 7T is fixed under the transformations of the inhomogeneous Siegel modular group .A3 corresponding to the matrices (I), (2), (3) in Sp(3, Z) of Section 4 (see [19] and [17], Section 2, C, p. 16). Thus we have

7T

=

+ B)(Cn +

(An

D)-l

or (4)

7T(Cn

+

D)

A7T

=

+ B,

where

.A

I :)

= (;

is any of the matrices in (I), (2), and (3). If we set

n

= (: : :) e

(7T

e

f

is symmetric), and .A = .Au (here B = C = 0) we find from (4)

b = -b - d -b - e = -e - e -d - e = e or

2b =-d b=e -d = 2e.

(5)

Similarly, choosing .A = .Av, one finds

(6)

(i) (ii) (iii) (iv) (v)

+ ab + ae + a + be + b + e + 1 = + ad + ae + be - e = 0 + b2 + be + b + cd + d + e = 0 + bd + be + de - e - 1 = 0 f = eb + cd + ce + e 2 + d + e + 1. a2 ab ab b2

0


307

Setting (5) in (6, iv) one obtains (7)

or

e=

(S)

-1

± v'7 i 4

.

We shall decide the choice of sign later. Substituting (5) in (6, ii), (6, iii), and (6, v), we obtain c = e2

(9)

ae - e 3

+ e2 =

0

a = e2

or

-

(e of- 0)

e

J=e 2 -e+1. But from (7) one finds e 2 = - -t - eJ2 and hence e 2 and e 2 - e + 1 = 1- - 3e/2, so that

7T

=

c~-~ e 1

e -2e

e

-~e1

e

-2-2

-

e = -

-t -

3eJ2

i)

3e

"2 -2

Now 7T as a normal period matrix must have a positive-definite imaginary part. Reference to (S) shows that we must pick the minus sign in e so that we have finally 1

30 i

-8 + - S (10)

7T

=

v'7 i -4-~

3

-8 +

v'7 i

-S-

v'7 i 4 1

4

3

v'7 i

2+

-2-

-4 -

-4-

v'7 i

0

i

-8 + -Sv'7 i -4-~

7

3V7 i

8+ -S-

We observe that the entries of 7T all lie in the field k( v' - 7), which is the field generated over the rational field k by the character of the representation (irreducible) induced on the differentials of first kind of S by r I6S (see [IS]). NOTE ADDED IN PROOF (May 1970): Macbeath's surface of genus 7 in [14] was anticipated by Fricke in [22]. In pp. 265-270 of [1] Baker discusses Klein's surface without, as far as we can see, arriving at anything like our results. As he says, he does not construct any explicit homology basis, and

308

RAUCH AND LEWITTES

the period matrix he obtains is not proved to correspond, and probably does not correspond, to a canonical homology basis. THE CITY COLLEGE OF THE CITY UNIVERSITY OF NEW YORK THE HERBERT H. LEHMAN COLLEGE THE GRADUATE CENTER OF THE CITY UNIVERSITY OF NEW YORK

REFERENCES 1. BAKER, H. F., Multiply Periodic Functions. Cambridge, 1907. 2. - - - , "Note introductory to the study of Klein's group of order 168," Proc. Cambridge Phi/os. Soc., 31 (1935),468-481. 3. BURNSIDE, W., Theory o/Groups 0/ Finite Order. Cambridge, 1911. 4. EDGE, W. L., "The Klein group in three dimensions," Acta Math., 79 (1947), 153-223. 5. GORDAN, P., "Ober Gleichungen siebenten Grades mit einer Gruppe von 168 Substitutionen," Math. Ann., 20 (1882), 515-530; 11, ibid., 25 (1885), 459-521. 6. HURWITZ, A., "Ober algebraische Gebilde mit eindeutigen Transformationen in sich," Mathematische Werke, Bd. I. Basel: Birkhliuser, 1932, pp. 392-430. 7. - - , "Ober einige besondere homogene lineare Differentialgleichungen," ibid., pp. 153-162. 8. KLEIN, F., and R. FRICKE, Vorlesungen iiber die Theorie der elliptischen Modul/unctionen, Bd. I. Leipzig, 1890. 9. KLEIN, F., "Ober die Auflosung gewisser Gleichungen vom siebenten und achten Grade," Math. Ann., 15 (1879), 251-282. 10. - - - , "Ober die Transformation siebenter Ordnung der elliptischen Funktionen," Math. Ann., 14 (1879), 428-471. 11. LEECH, J., "Generators for certain normal subgroups of (2, 3, 7)," Proc. Cambridge Phi/os. Soc., 61 (1965), 321-332. 12. LEECH, J., and J. MENNICKE, "Note on a conjecture of Coxeter," Proc. Glasgow Math. Assoc. (1961), 25-29. 13. MACBEATH, A. M., Fuchsian Groups, mimeographed notes, Queen's College, Dundee, University of St. Andrews. 14. - - - , "On a curve of genus 7," Proc. Londoll Math. Soc., 15 (1965), 527-542. 15. - - - , "On a theorem of Hurwitz," Proc. Glasgow Math. Assoc., 5 (1961), 90-96. 16. POINCARE, H., Sur I'integration algebrique des equations lineaires et les periodes des integrales abeliennes, Oeuvres, 1. III. Paris: Gauthier-Villars, 1934, pp. 106-166. 17. RAUCH, H(ARRY) E., "A transcendental view of the space of algebraic Riemann surfaces," Bull. Amer. Math. Soc., 71 (1965), 1-39, Errata, ibid., 74 (1968), 767. 18. - - - , "The local ring of the genus three modulus space at Klein's 168 surface," Bull. Amer. Math. Soc., 73 (1967), 343-346. 19. - - - , "Variational methods in the problem of the moduli of Riemann surfaces," in Contributions to Function Theory. Bombay: Tata Institute of Fundamental Research, 1960, pp. 17-40. • 20. SPRINGER, G., Introduction to Riemann Sur/aces. Reading, Mass.: AddisonWesley, 1957. 21. WEBER, H., Lehrbuch der Algebra, Bd. II. Braunschweig, 1908. 22. Fricke, R. "Ueber eine einfache Gruppe von 504 Operationen," Math. Ann., 52 (1899), 319-339.

Envelopes

cif Holomorphy

cif Domains

in Complex

Lie Groups O.

s.

ROTHAUSl

There are two well known results in the literature concerning envelopes of holomorphy that we want to single out here, namely the ones describing the completion of Reinhardt domains and tube domains. The statement for the latter type we owe to Salomon Bochner. There is a feature common to both the results which is worth noting. Roughly speaking it may be described as follows: a domain in a complex Lie group which is stable under a real form of the group may be completed by forming certain "averages" in the complex group. In this note we shall give one reasonably general form of this phenomenon, which more or less includes the case of Reinhardt domains. We shall follow the usual convention of letting German alphabetic characters denote the Lie algebra or its elements corresponding to the group denoted by the associated ordinary alphabetic character. Let G be a connected complex Lie group, @ its Lie algebra. Let j, p = - identity, be the endomorphism of @ giving the complex structure. Let S) be a real form of @, so that @ = S) ffijSj, and H the corresponding analytic subgroup of G. We want to investigate the envelope of holomorphy of domains D contained in G which are H stable; i.e., Dh = D for all hE H. We make the following assumption governing the situation: H is a maximal compact subgroup of G. This assumption has a number of useful consequences, mostly well known, which we now present. For these purposes, let G be the universal covering group of G, r the fundamental group of G, so that G ~ G/r. Let rr be the natural map from G to G, let Ii be rr- 1(H) and lio be the connected component of the identity in Ii. 1

This research was partially supported by NSF GP 8129. 309

O. S. ROTHAUS

310

There are natural homeomorphisms of the identification spaces as indicated: G

G

G lio

H~Ii~Ii'

lio By a theorem of E. Cartan and others, G/ H is Euclidean; on the other hand Ii/lio is discrete. Since Euclidean space is simply connected, we have LEMMA 1.

lio = Ii.

Clearly Ii is then a covering space for H. But G, viewed as a fibre bundle over GIIi, fibre Ii, is equivalent to a trivial bundle, since G/Ii is Euclidean. G being simply connected, it follows that LEMMA

2.

Ii is the universal covering space of H.

There is an involutory automorphism 0' of @ given by O'(lh + i~2) = j~2 for ~l' lJ2 E ~. This automorphism induces an involutory automorphism of G, whose subgroup of fixed points has connected component of identity equal to Ii. Since Ii ::> r, the automorphism, still denoted 0', may be viewed as an automorphism of G/r ~ G, fixing lijr ~ H. Since H also is compact, we have

lJl -

LEMMA

3.

G/H is a Riemannian symmetric space (see [1]).

The transvections of G/H are given, of course, by expj~, ~ E ~, and every element in G may be written uniquely as a transvection times an element of H. H being compact, it is well known that H decomposes into the direct sum of a semisimple algebra lEi, and Q;, the center of ~. Q; is the Lie algebra of C, the connected center of H, which must be a torus, and lEi is the Lie algebra of S, a connected semisimple subgroup of H. Every element in H may be written, though not uniquely, in the form sc, s E S, C E C. For future purposes, we shall suppose that a basis Cl> c2 , .•• ,C, of Q; has been chosen so that exp L ,\ci = identity implies and is implied by ..\, E Z Vi. Now let h -+ r(h) be a finite d-dimensional representation of H; for our purposes we suppose the representation is already unitary, so that r(h) E U(d). There is induced a representation of the algebra ~, which we still denote by ~ -+ r(~). The representation of ~ can be extended to give a representation R of @ by setting

ENVELOPES OF HOLOMORPHY IN LIE GROUPS

311

for fJl' fJ2 E ~. The representation R of @ now induces a representation R of R, restricted to fI, is simply the lift of r to fl. Since a representation of a connected group is completely determined by the corresponding representation of the Lie algebra, it follows immediately that R is trivial on r. Hence R may be viewed as a representation of air ~ G, and thus we have:

a.

LEMMA

4.

Every representation of H extends to a representation of G.

It is worth noting that the entries in the representation Rare holomorphic functions on G. In fact R is easily seen to be the unique holomorphic extension of r. Also note that in the extension, transvections are represented by Hermitian definite matrices. Now let r be a representation of H in U(d) and R the representation of G in GL(d, C) extending r. GL(d, C) has an analytic structure J arising from the fact that U(d) is a real form. The associated involutory automorphism T of GL(d, C) is the familiar one mapping a matrix to its conjugate transpose inverse. We have by definition the equations

R(jg) == JR(g) and R(ag) == TR(g).

The first of these equations asserts that R is a holomorphic map of G to GL(d, C). From the second equation we can prove LEMMA

5.

R(G) is closed in GL(d, C).

Let kv == R(gv) be a sequence of elements in GL(d, C) converging to k in GL(d, C). Write gv == Pvhv with apv == p;;l, ah v == hv; i.e., Pv is a transvection and hv E H. Write bv == avbv with Ta v == a;; 1, Tb v == bv. Then R(pv) == av and R(hv) == bv. And it follows that the sequences av and bv converge separately, to a and b, respectively. Since H is compact, R(H) is compact, and it follows that bE R(H) c R(G). Let Pv == exp Vv, so av == exp R(Vv). It follows that R(Vv) converges to an element in the Lie algebra, say u. Since the map of Lie algebras is a surjection, there is av Ej~ such that R(V) == u; then R(exp V) == exp u == a, completing the proof. Now let r- be a faithful representation of H in U(d). We claim that LEMMA

6.

R is afaithful representation ofG.

Suppose R(g) == identity. Write g == pk with ap == p-l, ah == h. Then each of R(p) and R(h) are the identity. Since R(h) == r(h) we have h == identity. Now letp == expjfJl; R(p) == exp ir(1h). ir(fJl) is Hermitian, and

312

O. S. ROTHAUS

since exponentiation is one-one on Hermitian matrices, implies that ~1 = 0, implying p is the identity. By assembling the last few results we can show THEOREM

1.

r(~l)

= 0, which

G is a Stein manifold.

Let R be the extension to G of a faithful representation of H. By Lemma 6, the entries of R give sufficient holomorphic functions to separate points. Let lh, ~2' . . . , ~n be a basis of .~ and put Zv = Xv + iyv, Z = (Zl> Z2, . . . , zn) E C n. For Z in a sufficiently small neighborhood N of the origin in C n , the map of N to G given by Z -+ g = exp

L: (xv + yvj)~v v

gives a complex analytic chart at the identity in G. We have

R(exp L: (xv + yvj)~v) =

exp

L: zvr(~v)

= (f"P{Z»".8·

L: zvr(~v) belongs to M(d, C), the ring of d-dimensional matrices over the complex numbers, which may be identified as before with the Lie algebra of GL(d, C). Since the exponential map of M(d, C) into GL(d, C) has a holomorphic inverse in a neighborhood of the identity, and since the representation r is faithful, it follows readily that for z sufficiently close to the origin, z is a holomorphic function of the entriesf"p{z). Thus a subset of the f"p{z) give holomorphic coordinates near the identity in G. An analogous argument takes care of arbitrary points in G. Finally, the topology of GL(d, C) may be given as the topology it inherits as a subspace of M(d, C). Since R(G) is closed in GL(d, C), it follows that R(G) topologized as a subspace of GL(d, C) is homeomorphic to G. Let K be a compactum in G. Let ex be an upper bound for the moduli of all of the entries of R(g) and det

~(g)

for g

E

K. Then the set A of

= (u ij ) E GL(d, C) such that IUij I ~ ex and de~ U ~ ex is compact in GL(d, C). A n R(G) is also compact. Hence the holomorphically convex

I

U

I

hull of K is compact, proving that G is holomorphically convex, and completing the proof that G is Stein. REMARK. It may be shown that if G is a closed subgroup of GL(d, C), G stable under T, then H = G n U(d) is maximal compact in G, and H. is a real form of G. Let d be the Casimir operator of S, and put

V

=

-~22>r.


313

Both a and V are in the center of the universal enveloping algebra of H or G. We shall regard a and V as differential operators for the manifold H by viewing the elements of the Lie algebra as left-invariant vector fields. Both operators are then formally self-adjoint. Let h ---+- r(h) now be an irreducible representation of H, inducing representation lj ---+- r(lj) of S). Restricted to 6, the representation remains irreducible, since the elements of the center ~ of S) are all represented by scalar matrices. While restricted to ~, the representation is simply the representation of~ induced by a representation of the torus C. The representation r of S) is completely determined by its restriction to 6 and~, but not every representation of .\J gives rise, of course, to one of H. A representation r of C is completely determined by a i-tuple K = (aI' a2, ... , al) of integers, namely: r (exp

2: AvCv) = exp (27Ti ~ Avav) .

On the other hand, a representation of S, if S is of rank I, is determined by an I tuple L = (bb b2 , ••• , bl ) of non-negative integers as follows. Every irreducible representation is determined by its highest weight w, and there exist I fundamental weights WI, W2, ... , WI such that every highest weight W is of form W = 2:bjWj. Thus to every irreducible representation of H we may assign uniquely a pair (L, K) consisting of an I-tuple of non-negative integers, and a i-tuple of integers. We shall parametrize the representations of H by all such pairs, and remind the reader to suppress in ensuing computations the pairs which do not correspond to any representation. Now let h ---+- r(h) correspond to pair (L, K). First of all we see that Vr{h)

=

(2: a~)r(h) = x(K)r{h)

and as a straightforward computation shows arCh)

= r(a)r(h),

where rCa) is the matrix representing a in the extension of the representation to the universal enveloping algebra. rCa) is clearly a scalar matrix, rCa) = 7J identity, where 7J = 7J{w) is a constant depending on the highest weight. 7J has been computed by Freudenthal (see [2]) as follows. Let p = t sum of the positive roots of H. Then A{W)

= (w, w) + 2{w, p),

where the symmetric bilinear form (.,' ) is positive-definite. If W = 2: hjwj, = (hI, b2, ... , bl), then we denote A{W) by A(L) and note that A(L) is an

L

O. S. ROTHAUS

314

inhomogeneous quadratic form in the hi whose part of weight 2 is positivedefinite. Moreover, it is known that A(L) ~ 0, and A(L) = 0 only for L

=

o.

The degree d = dL of the representation corresponding to the weight w . Il(a p + w) has been glven by Weyl: d = riCa, p) ,where the products are taken over the positive roots a. d is a polynomial in L. Let D be an H-stable domain in G. Then E = 7r(D) is a domain in G/H, and D = 7r-1(E). We call E convex ifitcontains the geodesic segment connecting any two of its points. D is called convex if 7r(D) is convex. If E is an arbitrary domain in G/ H, the convex hull P of E is the intersection of all convex domains containing E. The convex hull of D is defined to be /'-.

7r- 1(7r(D)). The convex hull of E may be constructed as follows. Let En+ 1 be the union of all geodesic segments connecting points of En and put Eo = E. Then the En are an increasing sequence of open sets whose union is P. There is an analogous construction for the hull of an open, H-stable, set in G. Now letfbe a real function defined on G/H.fis called convex iffor any geodesic segment x(t), 0 ~ t ~ 1, we have f(x)

~

(1 - t)f(x(O))

+ if(x(l)).

If f is defined on G, we say f is convex if f(gh) = f(g) for hE H, and viewed as a function on G/ H it is convex as above. Clearly, iffis a continuous convex function on G, the set {glf(g) < I} is a convex domain. Our first basic result on convexity is as follows. Let r be a representation of H in U(d) and R its extension to G. Let A be an arbitrary complex matrix of dimension d. By A* we denote the conjugate transpose of A. Then LEMMA

7. f(g)

= tr AR(g)[AR(g)]* is a convex function on G.

It is clear thatf(gh) = f(g) for h E H. Now let A > 0 and 0 ~ p ~ I, q = 1 - p. By the inequality of the arithmetic and geometric mean

AP. 1q

=

AP

~

q

+ pA.

If B is a Hermitian definite matrix, then B = exp C, for unique Hermitian C, and we denote by Bt the matrix exp tC, for any real t. The inequality just given implies that q/ + pB - BP is Hermitian definite. Thus if P is also Hermitian positive-semidefinite, we have tr P·BP

~

qtr P + ptr P·B.


315

Let a and f3 be transvections in G, and put in the last P = R(a)A* AR(a), B = R2(f3). Wis a well defined transvection for any real p. Thus tr R(a)A* AR(a)R 2 P(f3) ;;; q tr R(a)A* AR(a)

+ p tr R(a)A* AR(a)R 2 (fJ);

this may be rewritten

We have now only to note that the geodesic segment connecting the points aH and (afJ 2a)112H in G/H is (af32 Pa)1/2H, 0 ;;; p ;;; I, to see that/is convex as asserted. If F is a continuous function on H, then F has a Fourier expansion F(h) ~

L dL tr AL,KrL,K(h),

L,K

where rL,K is the irreducible representation of H parametrized by pair (L, K), dL is its degree, and AL,K = f F(h)rL,K(h- 1 ) d/l,

where d/l is Haar measure on H. If the series is uniformly convergent, then it is convergent to F. We always have Plancherel's theorem, flF(hW d/l

=

L dL tr AL,KAf,K'

L,K

If we suppose F is a COO function, then since the operators ~ and V are self-adjoint, it follows that the Fourier coefficients of ~a Vb Fare Aa(L) Xb(K) AL,K'

Let D be an H-stable connected domain in G, and let Fbe a holomorphic function on D. For fixed g E D, we consider F(gh) as a function on H, and take its Fourier series F(gh) ~

L dL tr [AL,K(g)rL,K(h)].

Now AL,K(g) has entries which are readily seen to be holomorphic functions on D, and satisfies in addition AL,K(gh) = AL,K(g)rL,K(h) for h E H. This suggests: LEMMA 8,

There is a constant matrix AL,Ksuch that AL,K(g) = AL,KRL,K(g).

A function holomorphic on D is completely determined by its value on the set goH for any go ED. Consider then B(g) = AL,K(go)Ri,"k(go)RL,K(g), B(goh) = AL,K(goh) for any h E H, so that B(g) = AL,K(g) everywhere. Let U be an open, relatively compact, H-stable set in D.

O. S. ROTHAUS

316

9. The Fourier series for F(gh) is absolutely uniformly convergent 0, h E H.

LEMMA

for g

E

The Fourier coefficients of /:I,.a\/bF(gh) are /..a(L)Xb(K)AL,KRL,K(g). Since /:I,.a \/bF(gh) is uniformly bounded for g E U, we have, by Plancherel's theorem, for suitable constant m,

dL/.. 2a(L)x2b(K) tr [AL,KRL,K(g)Rf,K(g)Af,K]

~

m,

true for arbitrary L, K and g E U. From Lemma 7 it follows that the same inequality is true for g E O. Then with the aid of the Cauchy-Schwarz inequality we obtain m 1/2 Itr AL,KRL,K(g)1 ~ /..a(L)xb(K)

(To take care of the terms with X(K) = 0, or /"(L) = 0, the same treatment but with b =1= 0, or a =1= 0, is used.) Now by selecting a and b sufficiently large, it is easy to see that the series

2:

dLltr AL,K(g)1 L,K is uniformly convergent by Weierstrass's M-test. From the last result, we can readily derive our principal result: THEOREM

2.

F(g) extends to be a holomorphicfunction on D.

By the previous result F(g) extends to a holomorphic function on O. Let U o c U 1 c ... C Un C . . . be an increasing sequence of open, relatively compact, H-stable, open sets converging to D. For any H-stable open set M we denote by M O = M, Ml, M 2 , • •• the increasing sequence of open sets, as described earlier, converging to M. With this notation, we claim that the increasing sequence of open sets U;: converges to D. What we prove, by induction on v, is that if g E DV, then g E U;: for some n. This is obvious for v = 0. Suppose g E DV+ 1. Then 1T(g) is on the geodesic segment joining 1T(p) and 1T(q) for some p and q E DV. By the induction both p and q E U;: for some n. But then g E U;: + 1 C U;:n, completing the induction. Now let g E D. Then g E U;: for some n, and since U;: is open, a neighborhood of g is c U;: C On- Since F extends to be holomorphic on On> it extends to be holomorphic at g, completing the proof of the theorem. It is tempting to conjecture that if E is convex, then E is holomorphically convex. Since G is a Stein manifold, it would be sufficient to prove that E is pseudoconvex. A possible procedure might be as follows. If R is any holomorphic representation ofG in GL(d, C) thenf(g) = tr AR(g)R*(g )A* is plurisubharmonic, since it is the sum of squared moduli of holomorphic


317

functions. The set {glf(g) < I}, which we call a half-space, is then holomorphically convex. Hence it would suffice to prove that every convex set is the intersection of the half-spaces containing it. CORNELL UNIVERSITY

REFERENCES 1. HELGASON, S., Differential Geometry and Symmetric Spaces. New York: Academic

2.

Press, Inc., 1962. N., Lie Algebras. New York: Interscience Publishers, Inc., 1962.

JACOBSON,

Automorphisms

if Commutative

Banach A18ebras STEPHEN SCHEINBERG

This presentation consists of a few observations related to joint work with Herbert Kamowitz. In [2] we showed that if T is an automorphism of a semisimple commutative Banach algebra and if Tn ¥- I (all n), then the spectrum of T, aCT), must contain the unit circle. Examples were given to show that this containment can be proper. Section 1 that follows contains more complicated examples of such aCT) plus the theorem that aCT) is necessarily connected. In [1] we studied derivations and automorphisms of a particular radical algebra. Section 2 concerns another class of radical algebras, those of formal power series. These algebras behave like semisimple algebras in that all derivations and automorphisms are necessarily continuous. 1.

Let R be a region which is the interior of its closure and which is bounded away from 0 and 00. Assume that both {lzi < I} - Rand {Izl > I} - Rare semigroups under multiplication. Finally suppose R contains an annulus adjacent to the unit circle. THEOREM 1. There is an automorphism T of a semisimple commutative Banach algebra for which aCT) = R.

PROOF. It is easy to see that R = URn' where each Rn satisfies the requirement just given and, further, oRn is rectifiable. If Tn can be constructed for R n, then T = L EB Tn on the direct sum of the algebras will have aCT) = R. Thus we may assume oR is rectifiable. The amlulus that R contains separates the plane into two components no and n oo , containing o and 00, respectively. Let r 0 = oR n "and roo = oR n noo.

no

319

STEPHEN SCHEINBERG

320

Let A be the family of all bounded analytic functions on R. The multiplication on A will be the Hadamard product; each member of A has a Laurent series on the annulus:

To see that A

*A

~

A, write 00

1=10

+ 100

=

-1

2:o + -2:

00

for each f in A. Clearly 10 is analytic on Ro =

no U

R and sup 1/01 ~ Ro

const sup III. The analogous statement applies to 100' R

Now 1* g = 10 * go + IX) * goo and a standard (easy) computation shows that 10 * go(z) = (27Ti)-1 1'., 10(z/w)go(w)w- 1 dw and 100 * goo(z) = (27Ti)-1 r loo(z/w)goo(w)w- 1 dw, with suitable orientation on r o, roo. In

f

J1'o

these integrals go(w) and g",(w) are defined a.e.; the semigroup condition is just what is needed to ensure z/w E Ro when zERo and WE roo, and similarly with 0 and 00 interchanged. It is now evident that A becomes a Banach algebra when 11/11 is defined to be (a suitably large constant) sup III. It is also clear that A is semiR

simple, since the homomorphisms CPk(J. anz n) = ak separate the members of A [i.e., CPk(f) = (27Ti)-1 f/Z/=1 / (Z)Z I}. Put S = (2rri)-1 Sy ,x(,x - T)-1 d,x. Then a(S) = aCT) n (inside of y) and aCT - S) = aCT) n (outside of y). For any fE A, x EX, Sf(x) = (2rri)-1

f (1 y

- ,x-1T)-Y(X) d,x

= (2rri)-1 fy L ,x-nf(Tnx ) d,x = ~f(rnx). 0

[(2rri) -1 Jy ,x -n d,x]. This is justified since l,xl is uniformly > 1 for ,x E y. If y surrounds 0, this string of equalities gives Sf(x) = f(Tx), implying S == T. If y does not surround 0, we have Sf(x) = 0, implying S == 0. Either case contradicts the assumption that aCT) meets both the inside and outside of y.

2. Given a sequence an > 0, let

Aa is a Banach space isomorphic to [1. It becomes a Banach algebra under formal power-series multiplication iff a n+ m ~ anam. Aa is a radical Banach algebra iff also a~/n --? o. Since z generates A a , many questions are easy to handle. Recall that a derivation of an algebra is a linear operator D such that D(xy) = (Dx)y + x(Dy). It is easy to see that a continuous (i.e., bounded) derivation D is determined, once Dz is known, by Dx = (Dz)x', where x' is the differentiated formal power series. (It is obvious for polynomials, which are dense.)

THEOREM 3 (Loy [4]). necessarily bounded.

A derivation D on the Banach algebra Aa is

PROOF (see [4]). By the closed graph theorem, it is enough to show that (Df)m, the mth coefficient of Df, is continuous inf, for each m. By induction, if m is the first integer for which this fails, fn can be found for which

322

STEPHEN SCHEINBERG

Ilfn I ~ 0 and (Dfn)m ~ 00 very fast. From this one can, therefore, contradict D(L. znfn) E A a , because its coefficients will be too large.

THEOREM 4.

These are equivalent for any A a , where k is any integer

~

I:

(i) There is a derivation D of Aa such that Dz = CkZk + ... (Ck # 0). (ii) There is a derivation D of Au such that Dz = Zk. (iii) an+(k-l) = O(anln) as n ~ 00. PROOF. (ii) => (i) trivially. (i) => (iii) by simply looking at the norms of Dzn D is bounded. (iii) => (ii) by direct examination of the norm.

=

nzn-l(Dz), since

COROLLARY I. Let an = lin! Then Dx = z 2 x' defines a derivation on Au. If we put an = e- ncn , then Cn ~ 00 is the condition that Au be radical. The condition an+m ~ ana m becomes the "convexity" condition n m < - - C n + - - Cm = n+m n+m

C n + m•

Thus Cl ~ C2 ~ • •• ensures that Au is a Banach algebra. Condition (3) will fail for every k if, say, Cn+1 - Cn ~ lin and kCn+k ~ log n for n ~ n(k). This follows by a straightforward calculation which is omitted. For example, Cn = log log n will give the following corollary: COROLLARY 2. Let an

=

(log n)-n. If D is a derivation on Au, then

D =0.

D. J. Newman [3] proved this result for bounded derivations. Tox(z) = x(efOz) defines an automorphism of Aa. The proof of Theorem 3 works for automorphisms as well and shows that every automorphism of Au is bounded. It is then clear that any automorphism T is determined by Tz, by the formula Tx(z) = x(Tz), since this holds for polynomials. Let Tz = az + ... ; it is easy to see that lal = I, since lal > I makes T unbounded and lal < I makes T-l unbounded. If D is a derivation, then of course eD is an automorphism and eD # any To, unless D = o. Indeed, Dz = CkZk + . .. implies eDz = z + CkZk + ... , since k ~ 2 by condition (iii) of Theorem 4. THEOREM 5. Au admits an automorphism other than the T;s a derivation other than O. PROOF.

Aa admits

is trivial, as noted previously.

=> By replacing T by ToT we may assume Tz = z + C~k + ... , Ck # o. Then TZR = zn + nc~n+k-l + .... Boundedness of Timplies boundedness of no:n+k-l/an, which is exactly condition (iii) of Theorem 4.

AUTOMORPHISMS OF COMMUTATIVE ALGEBRAS

323

In the ring A of all formal power series without constant term, every automorphism is of the form Tx(z) = x(Tz), and Tz = az + ... (a '" 0). When a = 1, T is the exponential of a unique derivation D; namely, D

=

log T

=

log {J

+ (T -

J)}

co

=

1

2:I (_I)n-l -n (T -

I)n, the series con-

verging in the strong operator topology (convergence in A is, of course, convergence in each coefficient). In fact, the series applied to any element of A is eventually constant in each coefficient. D is a derivation by the proof of Theorem 6 of [I]. On Aa any T = eD must send z to z + ... , since Dz = CkZk + . . . with k ~ 2. The series for D = log T need not converge in the strong topology; e.g., let D be as in Corollary 1 of Theorem 4. The question of whether every automorphism T of Aa sending z to z + . . . is the exponential of a derivation is exactly the question of whether the formal D = log T, defined on all formal power series, sends Aa into Aa. This may not always be the case. Theorem 2 has been obtained independently by R. J. Loy (personal communication). MASSACHUSETTS INSTITUTE OF TECHNOLOGY

REFERENCES 1.

H., and S. SCHEINBERG, "Derivations and automorphisms of L1(O, 1)," Trans. Amer. Math. Soc., /35 (1969), 415-427. 2. - - , "The spectrum of automorphisms of Banach algebras," J. Funct. Anal., 4 (1969), 268-276. 3. NEWMAN, D. J., "A radical algebra without derivations," Proc. Amer. Math. Soc., 10 (1959), 584-586. 4. LoY, R. J., "Continuity of derivations in topological algebras of power series," Bull. Austral. Math. Soc., 1 (1969),419-424. KAMOWITZ,

Historical Notes on Analyticity as a Concept in Functional Analysis ANGUS E. TAYLOR

1. Introduction This is an essay-one of a projected series-on certain aspects of the history of functional analysis. The emphasis of this essay is on the way in which the classical theory of analytic functions of a complex variable was extended and generalized and came to playa significant role in functional analysis. One can perceive two lines of development: (i) the extension of the classical theory to cases in which the function of a complex variable has its values in a function space or in an abstract space, and (ii) the development of a theory of analytic functions from one general space to another (where by a "general space" we mean a function space or an abstract space). The essay also seeks to place the history of these developments in proper perspective in the narrative of the development of functional analysis as a whole. Before dealing explicitly with the main subject of the essay it is desirable to sketch some relevant details of the history of functional analysis. The explicit emergence of the subject as a distinct and separate branch of mathematics may perhaps be considered as beginning with a series of five notes published by Vito Volterra in 1887 (see Volterra [55], [56], [57], [58] and [59]). The ideas underlying functional analysis were of course much older. At the 1928 International Congress in Bologna Jacques Hadamard mentioned Jean Bernoulli's problem of the curve of quickest descent and the pioneering work of the BernoulJis and Euler in the calculus of variations as the true and definitive foundation of "Ie calcul fonctionnel" (see Hadamard [24]). Volterra took the significant step of focussing attention on functions for which the independent variable was a function or a curve. This is not to say that Volterra was the first to study operations by which functions are transformed into other functions (or other entities). He was not. The "functional operations" considered by mathematicians prior 325

326

ANGUS E. TA YLOR

to the work of Volterra were studied, however, from a somewhat different point of view, more algebraic than analytic. Mathematicians examined the manipulations which could be performed with functional operations, viewed as symbols. (See Pincherle [2] and references cited therein.) In particular, Volterra seemed to be making a new venture in subjecting his "functions of lines" to analysis comparable to that applied in calculus and the classical theory of functions. In our current terminology, Volterra considered the domain of one of his functions to be a class of functions (in the classical sense) or a class of curves or surfaces. Volterra's functions were numerically valued; thus the ranges were sets of numbers. Volterra drew his examples from boundaryvalue problems for partial differential equations and from the calculus of variations. In Volterra's time the name "functional" was still in the future. In the first of his 1887 notes, he referred to "functions which depend on other functions." The title of one of the later notes in this series was "Sopra Ie funzioni dipendenti da linee." This gave rise to the term "fonctions de lignes" which persisted in its Italian, French, and English forms for a considerable period. It is asserted by Maurice Frechet and Paul Levy that the noun "functional" (fonctionelle) originated with Hadamard (see Frechet [14], p. 2, and Levy [32], p. 8). Since Hadamard's pioneering work [22] on a general method of representing linear functionals, the term "functional" has rather generally been reserved for a numerically valued function whose argument varies over a class of functions or some abstract set. However, such names as "operazioni funzionali" and "Ie calcul fonctionnel" were being used before 1900 in connection with operations which map functions into other functions (see Pincherle [41] and [42]). In [41], Pincherle has the following to say with reference to "Ie calcul fonctionnel": "On reunirait sous ce titre les chapitres de I'analyse ou l'element variable n'est plus Ie nombre, mais la fonction consideree en elle-meme." The name "analyse fonctionnelle" was introduced by Levy, according to Frechet (see [14], p. 3). In the early work on functional analysis certain algebraic and analytical operations were available in the nature of the explicit situation, there being no "abstract space" under consideration. It was possible to imitate, to a degree, the formulation of concepts such as continuity of a functional and uniform convergence of a sequence of functionals without a fully explicit treatment of metric and topological notions in the underlying class of functions. It was even possible to calculate such things as "variations" or differentials and functional derivatives without a fully explicit truly general definition. Most commonly these things were done merely by using absolute values and uniformity ideas, assuming the members of the underlying class

HISTORICAL NOTES ON ANAL YTICITY

327

of functions to be bounded and continuous. It was the work of Frt!chet which led to abstraction and the explicit introduction of metrical and topological concepts into the general setting. With the greater generality and abstraction introduced by Frechet and by E. H. Moore the name "general analysis" gained some currency. Many writers have tried to maintain a restriction on terminology, using "functional" always, as Hadamard had done, for a numerically valued function. In spite of this, "functional analysis" has come to mean not only functional calculus in the sense of Hadamard (I'etude des fonctionnelles), but also general analysis in the sense of Frechet and Moore-that is, the study offunctions (transformations) of a very general character, mapping one set onto another. The sets may be abstract or they may be composed of mathematical objects having a certain amount of structure: continuous functions, linear operators, matrices, measurable sets, and so forth. These historical notes present some of the results of an attempt to search out the pioneering work on analytic functions in general analysis, that is, of functions from one set to another which are analytic in a suitable sense as an extension of the concept of complex analytic functions of a complex variable. The essay is divided into four parts, dealing respectively with generalizations of the concept of a polynomial, analytic functions of a complex variable with values in a general space, analytic functions from one general space to another, and the role in spectral theory of abstractly valued ;' analytic functions of a complex variable.

2. Polynomial operations One line of development of the generalization of the notion of analyticity follows the Weierstrassian point of view, which places the power series at the center of the theory. For this it is necessary to have a functional analysis counterpart of the monomial function defined by the expression anzn (an and z complex, n a natural number). The initial steps in generalizing the concept of a polynomial suitably for functional analysis seem to have been taken by Frechet. His first application was not to a generalization of analytic functions, but to a generalization of the Weierstrass theorem on approximation of continuous functions by polynomials. In a paper published in 1909 [8] he considers how ordinary real-valued polynomials of one real variable may be characterized as continuous functions such that the application of certain differencing operations to these functions leads to an identically vanishing result. The

328

ANGUS E. TAYLOR

starting point is the observation thatf(real and continuous) is a first degree polynomial in one real variable if

f:.d = f(x + y) - f(x) - fey) + f(O) ==

o.

For a polynomial of degree n the corresponding identity is f:.n+lf == 0, where f:.n+lf is a certain sum involving terms ±f(Xil + ... + X'k) and ±f(O), where (il, ... , i k) is a combination of integers chosen from the aggregate (1,2, ... , n + I) and k = 1,2, ... , n + 1. For example,

f:.af = f(Xl + X2 + X3) - f(X2 + X3) - f(X3 + Xl) - f(Xl + X2)

+ f(Xl) + f(X2) + f(X3)

- f(O).

In this 1909 paper Frechet extends the considerations to real functions of several real variables and then to real functions (i.e., functionals) of an infinite sequence of real variables. For this latter case the nature of continuity and of the domain of definition of the functionf are not discussed very clearly or generally. Here a continuous functional for which f:.n+ d== 0 for some n is called a "fonctionnelle d'ordre entier n." In the following year (l91O) Frechet published a paper [9] on continuous (real-valued) functionals defined on the class of real continuous functions which we now denote by C[a, b], where [a, b] is a finite real interval. Here he carries over to this different setting the concept of a "fonctionnelle d'ordre entier n" from the 1909 paper. The definition of such a functional U calls for U to be continuous and for f:.n+lU to vanish identically. Frechet generalizes Hadamard's 1903 theorem on the representation of linear functionals by showing that if U is a continuous functional on C[a, b] it can be represented in the form U(f)

= lim

[u~O)

+

U~l)(f)

+ ... +

u~rn)(f)],

n-+ 00

where

Here u~O) is a number and Ul.'"k)(Xl> ••• , x rk ) is a continuous function depending only on U and the indices (but not on f); it may be taken to be a polynomial in Xl> ••• , x rk • The convergence to U(f) of the sum is uniform on sets in C [a, b] which are compact (in the sense of the term "compact" in use at that time as introduced by Frechet). The functional UAr k ) is "entier, d'ordre Tk." In this paper Frechet also shows that if U is entire and of order n, then U(yr/l + ... + ypfp) is a polynomial of degree at most n in the real


329

variables Yl> ... , Yp. Here iI, ... ,fp are members of C[a, b]. He also observes that U is representable (uniquely) in the form U(f)

=

Uo + U1(f)

+ ... +

Un(f),

where Uo is constant and Uk is entire and of order k as well as homogeneous of degree k. In the last part of this 1910 paper Frechet turns to a definition of holomorphism. Suppose g E C[a, b]. Then a functional U is called holomorphic atf = g if there is a representation (necessarily unique) U(f) = Uo

+

U1(f- g)

+

Ulf- g)

+ ... +

UnCf- g)

+ ...

converging suitably under certain restrictions, where Un is entire, of order n, and homogeneous of degree n. The representation is to be valid when max if(x) - g(x)i < E, for a certain positive E, and the convergence is to be uniform with respect to fwhenfis confined to a compact subset of the functions which satisfy the inequality. Frechet observes that Volterra had already considered particular instances of series representations of this type. Frechet observes that if U is holomorphic atf = g, then U(g + tf) is a holomorphic function of t at t = O. Here he comes close to an alternative approach to the establishment of a theory of analytic functionals. He notes, however, that U can have the property that U(tf) is holomorphic in tat t = 0 without U being holomorphic atf = O. The example he gives is this: U(f) = maxf(x) + minf(x). This functional U is homogeneous of the first degree: U(tf) == tU(f). But U is not entire of order one, and hence is not holomorphic atf = O. Frechet is here considering real scalars exclusively. The next stage in the development is apparently due to R. Gateaux, who was clearly strongly influenced by both Frechet and Hadamard. Gateaux was killed in September 1914, soon after the beginning of World War I, and his manuscript work remained unpublished until 1919. He made decisive contributions to the theory of analytic functions in the framework of f.unctional analysis. At this point I take note only of his work in polynomial operations. This work is presented in two different contexts (see [17] and [18]). In [17], the manuscript of which dates from March 1914, Gateaux considers the space (which he calls E~) of all complex sequences (Xl> X2, ••• ) with ecart E(x, x') =

i: ~n. 1 +iX IXn -x~ixni n -

I

•

n=l

A "polynomial of degree n" is a functional P which is defined and continuous on a certain set Din E:" and such that P(>.z + ILt) is an ordinary

330

ANGUS E. TAYLOR

polynomial of degree n in .\ and Fk for each z and tin D. Here Gateaux is using as a defining property something which Fn:chet had observed as a property possessed by his entire functionals of order n. In another posthumous paper [18] we find Gateaux considering functionals defined on a space of continuous functions. Citing Frechet, he speaks of a functional as entire of order n if its differences of order n + 1 vanish identically while some difference of order n does not so vanish. He then points out that this definition is not satisfactory in the complex case. As an example he cites U(z) = x, where z = x + iy (x and y real) for which /).2U == 0, /).1 U ¥= O. Here U is continuous, but it is not suitable to regard it as a "polynomial" in z. In view of this, Gateaux uses the following definition: a continuous functional U is called entire, of order n, if U(.\z + Fkt) is a polynomial of degree n in .\ and Fk whenever z and t are in the given space of continuous functions. At this point Gateaux introduces the concepts of variations for a functional: oU(z, t)

=

[~ U(z + At)L=o'

02U(Z, t)

=

[~ oU(z + At, t)L=o'

These are what have subsequently become known as Gateaux differentials. We know that oU(z, t) is not necessarily continuous or linear in t, but Gateaux does not discuss these points carefully. Instead, he says with reference to oU(z, t 1 ), "C'est une fonctionnelle de z et de 11 qu'on suppose habituellement lineaire, en chaque point z, par rapport it t1'" He is discussing the case of real scalars at this point. Again, citing Frechet, Gateaux obtains the following formulas when U is an entire functional which is homogeneous and of order n: U(z

+

At)

=

U(z)

.\k

+ ... + k!

OkU(Z, t)

.\n

+ ... + n!

onu(z, t)

and U(t)

=

~ Onu(Z, t).

n.

In a 1915 paper Frechet [10] points out that the problem of determining the representation of a functional of the second order is reducible to the problem of representing a bilinear functional. This is an early allusion to the relation (which emerges in later studies by various authors) between entire functionals of order n and multilinear funetionals. The next major step, and a definitive one, was taken by Frechet. In 1925 he published a note [13], later expanded into a paper [15] published in


331

1929, entitled "Les polynomes abstraits." The paper is a natural and direct outgrowth of his papers of 1909 and 1910, but the setting is more general. Frechet is now considering functions from one abstract space to another. His spaces are of a general type which he calls "espaces algebrophiles." These form a class more extensive than the class of normed linear spaces, but less extensive than the class of topological linear spaces as currently defined. The scalars are real, and the characterization of a function from one such space to another as an abstract polynomial is by means of continuity and the identical vanishing of differences, just as in the 1910 paper. The principal result is expressed thus: "Tout polynome abstrait d'ordre entier nest la somme de polynomes abstraits d'ordre h = 0, I, ... , n, chaque polynome d'ordre h etant homogene et de degre h, et la decomposition etant unique." By 1929, of course, the theory of abstract linear spaces was much further advanced than in 1910. The difference between the 1910 and the 1929 papers is not in the basic characterization of abstract polynomials, but in the development of concepts and a framework and technique for dealing suitably with linearity and continuity as they enter into this particular problem. The next six years saw the completion of the theory of abstract polynomials. This completion consisted in tidying up the proofs, putting them into their ultimate general form, and completing the linkage between several different ways of founding the theory. In particular, the difference between the complex and the real case was clarified. In his 1932 California Institute of Technology doctoral dissertation R. S. Martin [34] used the following definition: A functionf(x) from one normed vector space to another is a polynomial of degree n if it is continuous and if p(x + AY) is a polynomial of degree at most n in A (with vector coefficients) 'and of degree exactly n for some x, y. This definition applies for the case of either real or complex scalars. Martin was interested in a general theory of analytic functions. He was the student of A. D. Michal. At this time Michal was giving lectures on abstract linear spaces, and he developed an approach to polynomials by using multilinear functionals. These lectures were not published, but references to the work of this period are found in Michal [37]. In 1935 the Polish mathematicians S. Mazur and W. OrIicz published an important paper [36] systematizing the theory quite completely for the case of linear spaces with real scalars. The first of their results was announced to the Polish Mathematical Society in 1933. Their work was independent of that of Martin, Michal, and his group. They separated out the part of the development of the theory which is purely algebraic. In the algebraic portion of the theory they begin with multi-additive operations. If U*(Xl> ... , Xk) is defined for Xl> ... , Xk in X, with values in Y, and is

332

ANGUS E. TAYLOR

additive in each X;, U(x) = U*(Xh ... ' Xk) is said to be rationally homogeneous of degree k (provided it does not vanish identically). An operation U of mth degree is one which has a representation U(x) = Uo(x)

+

Ulx)

+ ... +

Um(x),

where Um(x) ¢ 0 and each Uk is either identically zero or is rationally homogeneous of degree k. It is then shown how an operation U which is rationally homogeneous of degree k determines and is uniquely determined by a symmetric k-additive operation u* such that U*(x1 , . • . , Xk) becomes U(x) if one puts Xl = X2 = ... = Xk. U* is obtained from U by differencing, and there is a "multinomial theorem" expressing U(tIXI + ... + fkXk) as a sum of terms in which the coefficient of each term of the form t1'.l . .. t~k is a multiple of U*(Xl, ... , Xl>

••• ,

Xk, ... , Xk).

~~

Finally, operations of degree m are characterized by both the Frechet approach and the Gateaux approach. The case of complex scalars is not considered. Passing beyond the purely algebraic part of the theory, Mazur and Orlicz deal with operations mapping one space of type (F) into another. (Here the nomenclature is that of Banach (see [1], p. 35). It is important for some but not for all of the results that the spaces be complete as metric spaces.) They add the requirement of continuity: a continuous k-additive operation is called k-Iinear, a continuous operation which is rationally homogeneous of degree k is called a homogeneous polynomial of degree k, and a continuous operation of degree m is called a polynomial of degree m. It is then shown that this concept of a polynomial coincides, for mappings from one (F)-space to another, with the concepts as variously introduced by Frechet, Gateaux, and Martin. Finally, for spaces with complex scalars, I. E. Highberg [25], another student of A. D. Michal, showed that for a mappingffrom one complex algebrophile space to another the following two sets of conditions are eq ui valen t : (i) f is continuous; f(x + ,\y) is a polynomial of degree at most n in "for each X, y and of degree exactly n for some X, y. (ii) fis continuous ;fhas a Gateaux differential at each point; ~n+ Ii == 0 and ~nf¢ o.

The requirement of Gateaux differentiability in the second condition is superfluous in the real case but not in the complex case.


333

3. Analytic functions of a complex variable

As I pointed out earlier, Frechet's work of 1910 includes consideration of the notion of a functional U defined on a set in the space era, b) and holomorphic at a point g of that space. In particular he noted that, for such a U, U(g + If) is a holomorphic function of tat t = o. Frechet was dealing with the case of real scalars, and thus in this situation t and U(g + if) are real. The essence of the situation is that U(g + if) is expressible as a power series in t. The concept of an analytic function of a complex variable, with values in a function space, does not appear to come in for consideration at this period either in Frechet's work or in that of Gateaux. In this essay I cannot deal fully with the question of just how this concept developed. Analytic dependence on a complex parameter appears at many places in the study of differential and integral equations. In Ivar Fredholm's famous 1903 paper [16], for instance, the solution of the equation f(s) - .\

f

k(s, t)f(t) dt = g(s)

is presented in the form f(s)

=

g(s)

.\ i

+ d(.\)

b

a

D(s, t; .\)g(t) dt,

provided d(.\) =F O. This is in a context where f and g are members of C[a, b). Here d(.\) is an entire analytic function of .\ and D(s, t; .\) is a continuous function of s, t as well as being an entire analytic function of .\. However, there is no suggestion at this stage of regarding the analytic dependence on .\ in terms of the conceptualization of an analytic mapping from the complex plane into a function space. If we turn to the work of F. Riesz [43] ten years later, we find explicit recognition of what was latent in the work of Fredholm, though not in precisely the same context. Riesz was studying "linear substitutions" in the theory of systems of linear equations in an infinite number of unknowns. He considered, in particular, substitutions of the type called completely continuous, acting in the class of infinite sequences which later came to be denoted by f2 (the classical prototype of a Hilbert space). If A is such a substitution and E is the identity substitution, Riesz's studies led him to the assertion (see [43], p. 106) that the inverse substitution (E - '\A) -1 is a meromorphic function of .\. In this conclusion we perceive Reisz's conception of (E - '\A) -1 as a function of the complex variable .\ with values in

334

ANGUS E. TAYLOR

the class of bounded linear substitutions on 12. However, there is no explicit discussion here of exactly what it means in general for such a function to be analytic at a particular point or to have a pole at a particular point. There is only a discussion of the particular function (E - AA)-l. This is couched in terms of what occurs when one looks at certain related linear substitutions (and their inverses) involving only a finite number of unknowns. These systems are dealt with by using determinants of finite order. A bit further on (see [43], pp. 114-119) Riesz examines (E - AA)-l for the case of an arbitrary bounded linear substitution A (i.e., not merely the completely continuous case). Riesz shows that the class of regular points A (those for which E - AA is appropriately invertible) form an open set in the complex plane and that (E - AA) -1 is analytic at each regular point /L in the following sense: (E - AA) -1 can be represented as a power series in A - /L with certain coefficients which are bounded linear substitutions. The series converges in a well defined sense when A is sufficiently close to /L. The mode of convergence is that of what is sometimes called the uniform topology of the bounded linear substitutions. Further references to the work of Riesz will be made later when we come to the consideration of the role of analyticity as a concept in spectral theory. However, it should be mentioned at this point that Riesz observed (see [43], pp. 117-119) that it is possible to integrate (E - AA) -1 and other such functions along contours in the complex plane and to make effective use of the calculus of residues in the study of linear substitutions. In a paper published in 1923 Norbert Wiener [60] pointed out that Cauchy's integral theorem and much of the classical theory of analytic functions of a complex variable remain valid for functions from the complex plane to a complex Banach space (which did not then regularly carry that name). Wiener made a few observations about applications. In particular, he pointed out that some of the work of Maxime Bocher [2], on complex-valued functions f(x, z) of a real variable x and a complex variable z, can be regarded as a study of an analytic function of z with values in the function space C [a, b]. (See Taylor [51], p. 655, for a slight refinement of this.) In another paper Taylor [50] extended Wiener's work to obtain some rather surprising things in connection with an analytic function of z with values in U(a, b). In the 1930's A. D. Michal and his students began to use Wiener's observations in the study of analytic mappings from one complex Banach space to another. In 1935 A. E. Taylor discovered that the following definition leads to a satisfactory general theory of such mappings: a function f with values in a complex Banach space Y is called analytic in a neighborhood N of a point Xo in the complex Banach space X if f is continuous in N and if, for each x in N and each u in X, f(x + Au) is


335

analytic (i.e., differentiable) as a function of the complex variable A in some neighborhood of A = O. This definition and the theory flowing from it are natural generalizations of the work of Gateaux [18]. The details of Taylor's work are given in [45] and [47]. That Gateaux's work could be generalized in this manner was also observed by L. M. Graves (see [20], pp. 65 I-653). Taylor also studied some of the divergences from the classical theory which appear in the case of vector-valued analytic functions of a complex variable. During the 1930's the research of American mathematicians was increasingly directed to the study of Banach spaces. The theory of analytic functions began to play a systematic role in spectral-theoretic studies of linear operators. This will be dealt with separately in a later part of this essay. Another interesting and important development occurred as a result of studying the concept of analyticity in the context of various alternative modes of convergence. In 1937 Nelson Dunford and Taylor independently discovered very closely related results, both of which depend essentially on what has come to be known as the principle of uniform boundedness. Dunford's result, published in 1938 (see [3], p. 354) is that, if / is a function from an open set D in the complex plane to a complex Banach space X, then/is analytic on D, in the sense of being differentiable at each point of D with respect to the strong topology of X, provided that x*(f{z» is analytic on D for each continuous linear functional x* defined on X. This result, which has enormous usefulness, was initially very surprising because this is a case in which the weak convergence of difference quotients implies their strong convergence. The result of Taylor relates to the dependence of a bounded linear operator on a complex parameter. If AA is such an operator (from one complex Banach space X to another such space Y) for each complex A in the open set D, Taylor showed (see [50], p. 576) that A" is differentiable on D as an operator-valued function provided that A"x is' differentiable on D as a vector-valued function for each vector x in X. That is, convergence of the difference quotients in the strong topology of Y implies convergence in the uniform topology of the space of operators. This result was also unexpected and surprising. The result of Dunford can be deduced from that of Taylor, and vice versa. The use in functional analysis of analytic functions from the complex plane to a function space or an abstract space has developed and ramified enormously since the 1930's. Since this essay treats only the early stages of the subject I forego further details and examples. For a report on the subject as of 1943 see Taylor [51]. For a striking application to obtain a famous classical theorem by recognizing it as a functional analysis instance of another classical theorem see [52] (this is also dealt with in [54], pp. 211-212).

ANGUS E. TAYLOR

336

4. Analytic functions of a vector variable

In a 1909 paper David Hilbert sketched the start of a theory of numerical analytic functions of a countable infinity of complex variables (see [26], pp. 67-74). He used the Weierstrassian approach; that is, he dealt with functions represented by a "power series" the successive terms of which are: a constant, a linear form, a quadratic form, a ternary form, and so on. A definition of analytic functions based on this notion of a power series appears in the work of Helge von Koch [30] in 1899. This work deals with infinite systems of differential equations; von Koch imitates the classical pattern of argument used to obtain analytic solutions of differential equations. Hilbert considers analytic continuation and the composition of analytic functions. He gives an example to show that analytic continuation can give rise to uncountably many branches of a function. As was pointed out earlier in this essay, Frt!chet in 1910 introduced a definition of what it means for a functional to be holomorphic at a point in the space C[a, b]. The basic idea is that of an expansion in a series of homogeneous polynomials of ascending degree. Frechet's work is much clearer than that of Hilbert, perhaps mainly because the questions surrounding proper definitions of convergence and of domains of definition of forms and power series are obscure when one deals with a countable infinity of complex variables without an adequate consideration of the structure of the space composed of the sequences (Xl> X:h ... ). Frechet's 1910 paper marked out quite clearly the lines along which the theory was to develop, and the work of Gateaux exploited Frechet's beginning in brilliant fashion. Doubtless there were other forerunners of the definitive work of these pioneers. For example, Volterra had considered infinite series in which integrals of the form

f· .. f

K(xl> ... , Xn)f(Xl)' . I(xn) dXl' .. dX n

play the role of the homogeneous polynomial of degree n on C [a, b]. At this stage there was lacking to Frechet and Gateaux something fully comparable to the Cauchy point of view, according to which a function is characterized as analytic on a suitable set if it is differentiable on each point of the set. The early workers in functional analysis borrowed the concept of a "variation" from the calculus of variations. Gateaux used variations in his theory of analytic functionals (see [18]) to characterize a functional U as being holomorphic on a certain domain of definition if it is continuous and admits a first variation SUez, t) at each point z of the domain. (The definition of this variation was given earlier in this essay, in the discussion of polynomials.)


337

The variation oU(z, t) came to be called a Gateaux differential. However, it lacked certain properties which are desirable in a differential. Frechet made a definitive contribution with his 1925 paper [12] in which he shows how to define the concept of a differential of a function which maps one abstract space (of suitable type) on another. Frechet's differential has strong properties; it has become the standard instrument of differential calculus in modern Banach space theory. In his paper Frechet ascribes to Hadamard a basic principle: the fundamental fact about the differential of a function in calculus is that it depends linearly on the differentials of the independent variables. He refers to Hadamard's book [22] on the calculus of variations, where Hadamard asserts that the methods of differential calculus can be extended to functionals U(y) for which the first variation is a linear functional of the variation of y. Actually, Frechet's concept of the differential is a direct extension to the abstract situation of the well known definition introduced into calculus by O. Stolz. The line of Frechet's thinking as early as 19 I 2 with respect to the Stolz definition is clearly shown in [24] and in other papers mentioned therein. However, the tools were not yet available to formulate the definition adequately in the abstract space context which Frechet came to in 1925. The Frechet concept of the differential is stronger than the Gateaux concept. More precisely, if/is a function from a normed linear space X to a normed linear space Y, if/is defined in a neighborhood of the point Xo and has a Frechet differential at xo, then/has a Gateaux differential at Xo and it coincides with the Frechet differential. This proposition becomes false, however, if we interchange the names Frechet and Gateaux in it. Some of the disparity between the two concepts is revealed by the observation that/may have a Gateaux differential and yet be discontinuous, while / must be continuous at a point if it has a Frechet differential at that point. Because of this great difference between the two differentials it is remarkable that the following theorem is true. (It was discovered independently by L. M. Graves and Angus E. Taylor, but Graves has priority in the time of discovery. (See [20], as well as [45] and [46]. Both Graves and Taylor knew the work of Gateaux.) Suppose that X and Yare complex Banach spaces and that/is a function with values in Y which is defined on an open set D in X. Then/has a Frechet differential at each point of D if and only if it is continuous and has a Gateaux differential at each point of D. Moreover, under these conditions / has Frechet differentials of all orders at each point of D, / is holomorphic at each point of D, and the power series expansion of/in the neighborhood of a point Xo in D is convergent within the largest spherical neighborhood centered at Xo which lies wholly in D. The homogeneous polynomials in x - Xo which form the terms of the power series are expressible as multiples of the Frechet differentials of

338

ANGUS E. TAYLOR

fat Xo according to the appropriate generalization of the Taylor series. The proof of this theorem utilizes and exposes the core of the theory of analytic mappings from one complex Banach space to another. Gateaux's work, building on the original power series conception of Frechet, set the whole development in motion, but Gateaux's theory of analytic functionals made no reference to or use of the Frechet concept of a differential. One consequence of all this is that there is a simple definition of analyticity completely analogous to the classical definition from the Cauchy standpoint: f is analytic on an open set if it has a Frechet differential at each point of the set (both domain and range in complex Banach spaces). A few years prior to the discoveries by Graves and Taylor the power series approach to abstract analytic functions was under intensive study by Michal and his students, especially Clifford and Martin. They did not use the methods based on complex variables and Cauchy's formulas, as Gateaux had done, but relied entirely on the power series approach. (See [34], [38], and [39], especially p. 71 in [39].) In a posthumously published monograph by Michal are given some historical notes about his involvement with the theory of polynomials and abstract analytic functions (see [37], pp. 35-39). Taylor was Michal's student. His work, originally motivated by an interest in a long paper of Luigi Fantappie [7] on a rather different theory of analytic functionals, led into the consideration off(x + AY) as a function of the complex variable A, and from this, using Cauchy's integral formulas, Taylor discovered the linkage between the Gateaux and Frechet differentials. It was apparent that Fantappie's work did not lend itself to treatment in the framework of Banach spaces. It has subsequently become clear that a more general theory of topological linear spaces is necessary, and there have been extensive developments based on the foundations laid by Fantappie. (See, for instance, [31], pp. 375-381, and references therein to Grothendieck, Sebastiao e Silva, and others.) The state of subsequent development of the theory of analytic functions of a vector variable is partially indicated, with many references, in the massive book of Hille and Phillips ([28], Chapter 3, especially pp. 109-116). See also Dunford and Schwartz ([6], pp. 520-526) for an important application. Significant contributions were made in three papers by Max Zorn [61], [62], [63]. There are great untouched areas in the theory of analytic functions of a vector variable, especially in the study of such things as manifolds of singularity, convergence sets of power series, and envelopes of holomorphy. This is not surprising, since these subjects are so deep and complicated in the case of functions of a finite number of complex variables.


339

5. The role of analytic functions in spectral theory

I made reference earlier in this essay to the work of Riesz in showing that (£ - AA)-1 depends analytically (and in some cases meromorphically) on A. This work of Riesz foreshadows a wealth of important developments,

but the full scope of what was latent in Riesz' book did not become evident for many years. Taylor, in a paper [49] published in 1938, showed that if T is a closed linear operator in a complex Banach space, the inverse (U - T) -1, if it exists in a suitable sense for at least one A, is an operatorvalued analytic function on the (necessarily open) set of A'S for which it is defined. He also showed that if T is bounded and everywhere defined there must be some value of A for which (U - T)-1 does not exist; that is, the spectrum of T is not empty. This result hinges on the use of Liouville's theorem for vector-valued analytic functions. The special case of this theorem on the spectrum, for operators in a Hilbert space, had been known earlier. Marshall Stone [44] obtained the result without using vectorvalued analytic functions. He did this by applying linear functionals to (U - T)-1 X •

At about this same time and a few years later work was going on in the study of complete normed rings (later to be known as Banach algebras) with the use of analytic functions as a tool. Ring elements Ae - a and their inverses were investigated by Gelfand [19], Lorch [33], Mazur [35], and Nagumo [40]. It is interesting to note that, although Gelfand defined the concept of an analytic function of the complex variable A with values in a complete normed ring, he developed the theory of such functions by reducing the arguments to the numerical case through the use of linear functionals. All of this work in normed rings has many parallels with the more general spectral theory of linear operators. Spectral theory of linear operators is the functional analysis counterpart of the theory of eigenvalues of linear transformations in spaces of finite dimension. The spectrum of a linear operator T acting in a Banach space corresponds to the set of eigenvalues of the linear transformation in the finite-dimensional case. The spectrum of T consists of all values of A for which U - Tfails to have an inverse in a suitable sense. When the Banach space is finite-dimensional, the spectrum is a finite set of eigenvalues and (U - T) -1 is a rational function of A. Much of spectral theory, for a particular operator acting in a Banach space, may be viewed as the study of the operator-valued function (U - T)-l, called the resolvent of T, and of the behavior of this function of A both globally and locally. A great deal of information about T itself can be extracted from this study. The calculus of residues and Cauchy's integral formula play important

340

ANGUS E. TAYLOR

roles in spectral theory. Integrals of the form

r

1 . f(>')(AI - T)-l d>' -2 m Jc over suitable closed contours C in the plane, where the complex-valued function f is analytic on a neighborhood of the spectrum of T, are used to define operators which can be used in significant ways. The heuristic basis for the definition of an operator denoted by f(T) is the symbolic "Cauchy's formula"

r

f(T) = ~ f(>.) d>'. 2m Jc >. - T

For the finite-dimensional case (i.e., for the study of finite square matrices) there are nineteenth-century anticipations of this sort of thing in the work of Frobenius and others, most explicitly in an 1899 paper by Poincare. For references to this early work see Dunford and Schwartz ([6], pp. 606-607) and Taylor ([51], p. 662, [53], p. 190). Systematic exploitation of the calculus of residues and of the symbolic operational calculus based on the Cauchy integral formula, as applied to general spectral theory, starts with Riesz and picks up again in the years around 1940. The work of investigators ofnormed rings, referred to earlier, is part of this general development. Dunford [4], [5] and Taylor [51], [53], [54] dealt explicitly with operator theory. (See also Einar Hille [27]. For further references see Hille and Phillips [28], pp. 164-183.) The subsequent development and application in functional analysis of the ideas and methods of classical analytic function-theory have been varied and rich. I conclude this essay by mentioning just one example of such development. Analytic function-theory methods have been used to deal with perturbations of operators and their spectra. Perturbation theory goes back a long way in mathematics, of course. The pioneering work in studying perturbations of operators in Hilbert space seems to be that of F. Rellich, dating from 1937. Numerous subsequent investigators have made use of the symbolic operational methods of Dunford and Taylor. For references to the work of Rellich, B. v. Sz. Nagy, F. Wolf, and others see Kato [29]. UNIVERSITY OF CALIFORNIA BERKELEY

REFERENCES l. 2.

BANACH, STEFAN, Theorie des operations Iineaires, Warsaw, 1932. BacHER, MAXIME, "On semianalytic functions of two variables,"

Mathematics, (2), 12 (1910), 18-26.

Annals of


341

3. DUNFORD, NELSON, "Uniformity in linear spaces," Transactions of the American Mathematical Society, 44 (1938), 305-356. 4. - - - , "Spectral theory," Bulletin of the American Mathematical Society, 49 (1943),637-651. 5. - - - , "Spectral theory, I: Convergence to projections," Transactions of the American Mathematical Society, 54 (1943),185-217. 6. DUNFORD, N., and J. T. SCHWARTZ, Linear Operators, Part I: General Theory. New York: Interscience Publishers, Inc., 1958. 7. FANTAPPIE, LUIGI, "I funzionali analitici," Memorie della R. Accademia Nazionale dei Lincei, (6) 3 fasc., 11 (1930), 453-683. 8. FRlkHET, MAURICE, "Une definition fonctionnelle des polynomes," Nouvelles Annales de Mathematiques, (4) 9 (1909), 145-162. 9. - - - , "Sur les fonctionnelles continues," Annales Scientifiques de l'Ecole Normale Superieure, (3) 27 (1910), 193-216. 10. - - - , "Sur les fonctionelles bilineaires," Transactions of the American Mathematical Society, 16 (1915), 215-234. II. - - - , "Sur la notion de differentielle totale," Comptes Rendus du Congres des Societes Savantes en 1914, Sciences, 5-8, Paris, 1915. 12. - - - , "La notion de differentielle dans I'analyse generale," Annales Scientifiques de l'Ecole Normale Superieure, (3) 42 (1925), 293-323. 13. - - - , "Les transformations ponctuelles abstraites," Comptes Rendus A cad. Sci. Paris, 180 (1925), 1816. 14. - - - , Les Espaces Abstraits et leur theorie consideree comme introduction a l'analyse generale. Paris: Gauthier-Villars, 1928. 15. - - - , "Les polynomes abstraits," Journal de Mathematiques Pures et Appliquees, (9) 8 (1929), 71-92. 16. FREDHOLM, IVAR, "Sur une classe d'equations fonctionnelles," Acta Mathematica, 27 (1903),365-390. 17. GATEAUX, R., "Fonctions d'une infinite des variables independantes," Bulletin de la Societe Mathematique de France, 47 (1919),70--96. 18. - - - , "Sur diverses questions de calcul fonctionnel," Bulletin de la Societe Mathematique de France, 50 (1922), 1-37. 19. GELFAND, I. M., "Normierte Ringe," Matem. Sbornik, 9 (51) (1941),3-24. 20. GRAVES, L. M., "Topics in the functional calculus," Bulletin of the American Mathematical Society, 41 (1935), 641-662. 21. HADAMARD, JACQUES, "Sur les operations fonctionnelles," Comptes Rendus Acad. Sci. Paris, 136 (1903),351. 22. - - - , Lec:ons sur Ie calcul des variations. Paris: Hermann, 1910. 23. - - - , "Le calcul fonctionnel," l'Enseignement Mathematique, 14 (1912), 1-18. 24. - - - , "Le developpement et Ie role scientifique du calcul fonctionnel," in Atti del Congresso Internazionale dei Matematici, Bologna, 3-10 Settembre 1928 (VI) Torno I, pp. 143-161. 25. HIGHBERG, IVAR, Polynomials in Abstract Spaces, unpublished doctoral dissertation, California Institute of Technology, 1936. 26. HILBERT, DAVID, "Wesen und Ziele einer Analysis der Unendlichvielen unabhiingigen Variabeln," Rendiconti del Circolo Matematico di Palermo, 27 (1909,59-74. 27. HILLE, EINAR, "Notes on linear transformations, II: Analyticity of semigroups," Annals of Mathematics, (2) 40 (1939), 1-47. 28. HILLE, EINAR, and R. S. PHILLIPS, Functional Analysis and Semi-Groups, rev. ed. Providence, R.I.: American Mathematical Society, 1957. 29. KATO, TOSIO, Perturbation Theory for Linear Operators. Berlin: Springer-Verlag, 1966. 30. VON KOCH, HELGE, "Sur les systemes d'ordre infini d'equations differentielles," Ofversigt af Kongl. Svenska Vetenskaps-Akademiens Forhandlingar, 61 (1899), 395-411.

342

ANGUS E. TAYLOR

31. KOTHE, GOTTFRIED, Topologische Lineare Riiume, I. Berlin: Springer-Verlag, 1960. 32. LEVY, PAUL, "Jacques Hadamard, sa vie et son oeuvre-Calcul fonctionnel et questions diverses," in Monographie N° 16 de {' Enseignement MathematiqueLa Vie et {'oeuvre de Jacques Hadamard. Geneve, 1967, pp. 1-24. 33. LORCH, E. R., "The theory of analytic functions in normed Abelian vector rings," Transactions of the American Mathematical Society, 54 (1943), 414-425. 34. MARTIN, R. S., Contributions to the Theory of Functionals, unpublished doctoral dissertation, California Institute of Technology, 1932. 35. MAZUR, S., "Sur les anneaux lineaires," Comptes Rendus Acad. Sci. Paris, 207 (1938),1025-1027. 36. MAZUR, S., and W. ORLICZ, "Grundlegende Eigenschaften der polynomischen Operationen" (erste Mitteilung), Studia Mathematica, 5 (1935), 50-68. 37. MICHAL, ARISTOTLE D., Le Calcul Differentiel dans les espaces de Banach. Paris: Gauthier-Villars, 1958. 38. MICHAL, A. D., and A. H. CLIFFORD, "Fonctions analytiques implicites dans des espaces vectoriels abstraits," Comptes Rendus A cad. Sci. Paris, 197 (1933), 735-737. 39. MICHAL, A. D., and R. S. MARTIN, "Some expansions in vector space," Journal de Mathematiques Pures et Appliquees, 13 (1934), 69-91. 40. NAGUMO, M., "Einige analytische Untersuchungen in linearen metrischen Ringen," Japanese Journal of Mathematics, 13 (1936), 61-80. 41. PINCHERLE, SALVATORE, "Memoire sur Ie calcul fonctionnel distributif," Mathematische Annalen, 49 (1897), 325-382. 42. - - - , "Funktional-Gleichungen und Operationen," in Encyklopiidie der Mathematischen Wissenschaften mit Einschluss ihrer Anwendungen, Ill, Heft 6, Leipzig, 1906, pp. 761-817. 43. RIESZ, F., Les Systemes d'equations lineaires a une infinite d'inconnues. Paris: Gauthier-Villars, 1913. 44. STONE, M. H., Linear Transformations in Hilbert Space. New York: American Mathematical Society, 1932. 45. TAYLOR, ANGUS E., Analytic Functions in General Analysis, unpublished doctoral dissertation, California Institute of Technology, 1936. 46. - - - , "Sur la theorie des fonctions analytiques dans les espaces abstraits," Comptes Rendus Acad. Sci. Paris, 203 (1936), 1228-1230. 47. - - - , "Analytic functions in general analysis," Annali della R. Scuola Normale Superiore di Pisa, 6 (1937), 277-292. 48. - - - , "On the properties of analytic functions in abstract spaces," Mathematische Annalen, 115 (1938), 466-484. 49. - - - , "The resolvent of a closed transformation," Bulletin of the American Mathematical Society, 44 (1938),70-74. 50. - - - , "Linear operations which depend analytically on a parameter," Annals of Mathematics, 39 (1938), 574-593. 51. - - - , "Analysis in complex Banach spaces," Bulletin of the American Mathematical Society, 49 (1943), 652-669. 52. - - - , "New proofs of some theorems of Hardy by Banach space methods," Mathematics Magazine, 23 (1950), 115-124. 53. - - - , "Spectral theory of closed distributive operators," Acta Mathematica, 84 (1951),189-224. 54. - - - , Introduction to Functional Analysis. New York: John Wiley & Sons, Inc., 1958. 55. VOLTERRA, VITO, "Sopra Ie funzioni che dipendono da altre funzioni," Nota I. Rendiconti della R. Accademia dei Lincei, Series IV, vol. III (1887), pp. 97-105. 56. - - , Nota II, ibid., pp. 141-146. 57. - - - , Nota III, ibid., pp. 153-158. 58. - - - , "Sopra Ie funzioni dipendenti da 1inee," Nota I, ibid., pp. 225-230. 59. - - - , Nota II, ibid., pp. 274-281.

HISTORICAL NOTES ON ANALYTICITY

343

60. WIENER, NORBERT, "Note on a paper of M. Banach," Fundamenta Mathematica, 4 (1923),136-143. 61. ZORN, MAX, "Characterization of analytic functions in Banach spaces," Annals of Mathematics, (2) 46 (1945), 585-593. 62. - - - , "Gateaux differentiability and essential boundedness," Duke Mathematical Journal, 12 (1945),579-583. 63. - - - , "Derivatives and Frechet differentials," Bulletin of the American Mathematical Society, 52 (1946), 133-137.

..N -Almost Automorphic Functions WILLIAM A. VEECHl

T will denote a locally compact, a-compact, Abelian group, and ~ = ~(T)

will be the algebra of bounded, complex-valued, uniformly continuous functions on T. Bochner defines f E ~ to be almost automorphic if from every sequence {fJm} = fJ S; T may be extracted a subsequence {an} = a (an = fJmn) such that (a) the limit Ta!Ct) = limf(t + an) n exists for each t, and (b) the equation LaTa!Ct) = lim Taf(t - an) n

(1)

=f(t) holds identically. The limits in (a) and (b) are not required to be uniform (but they are automatically locally uniform), and, in fact, because T is a-compact, (a) places no restriction at all on/, so long asfE~. With condition (b) it is a different matter altogether, and the functions which satisfy it are precisely the functionsfE ~ which are continuous in the "Bohr topology" on T, the topology on T induced by the algebra d of continuous Bochner-von Neumann almost periodic functions (see [6]). Here we shall consider a condition which is weaker than (1). We say fE ~ is d-almost automorphic if the totality S = {LaTa!} is norm precompact (for I gil = sup Iget)!) and if a second more technical condition 00

t

is satisfied. Theorem 2 which follows gives at least the coarse structure of the class (actually algebra) of d-almost automorphic functions. Proposition 1 which follows enables us to reformulate both Bochner-von Neumann almost periodicity and d-almost automorphy in such a way that there is no explicit requirement of compactness, and this may be of independent interest. We begin with some notation. A set B S; ~ is a T-algebra if it is a uniformly closed, self-adjoint, translation-invariant algebra containing the 1

Research supported by NSF grants GP-5585 and GP-7952X. 345

346

WILLIAM A. VEECH

constants. Each/E '6' is contained in a smallest T-algebra which we denote by Bf • Bf is generated by polynomials in the translates of / and its conjugate. Given / E '6', !t is to be the tth translate of /:ft(s) = /(s + t). Let Xfo = {!t}tET, and note that X? is precompact precisely when / Ed. Xl is always precompact in the topology of local uniform convergence, and we let Xf S; '6' be its closure in this weaker topology. Because T is a-compact X j is metrizable. There is an isometric isomorphism between Bf and C(Xf ) given by h ~ H, where h(t) = H(!t), t E T. Xf is a translation invariant subset of'6', and therefore there is a natural action of Ton Xf yielding a flow (T, Xf)' We say lis minimal if the associated flow is minimal (every point has a dense orbit) or, equivalently, if for every T,x/ there is a sequence f3 such that ToTal = f If T,x/ exists, it is easy to see that Tah exists for each h E B f , and, moreover, if ToTa/ = f, then ToTah = h, h E Bf . We conclude therefore that each h E B f is minimal if/is minimal. In general we say that a T-algebra is minimal if each of its elements is minimal. We remark also that this property of hE B f characterizes B f . If h is such that Tah exists whenever T,x/ exists, then hE Bf . [For then H(!t) = h(t) extends to be continuous on Xf.] LEMMA 1. Suppose / E '6' is such that whenever Ta/ exists there is a sequence f3 such that / E BT aTaf' Then / is minimal. PROOF. Bya well known Zorn's lemma argument there exists a minimal set M S; Xf for (T, Xf)' If m E M, say m = Taf, then (T, X Taf ) is simply (T, M). Thus Ta/ is minimal, as is ToTa/ whenever the limit exists. Thus, by our assumption on f, / E Bg for some minimal function g, and / is minimal. If IX is a sequence such that Ta/exists for somef, then Ta may be regarded as a linear operator on Bf • Evidently the range of this operator is contained in BTaf and contains a dense subset of the latter. Generally it is true that IITahiloo ~ Ilhll oo , but if/is minimal, equality holds. (For then there exists f3 with ToTah = h, hE B f , and the reverse inequality follows from this.) It follows in particular that Ta is onto when/is minimal.

LEMMA 2. Let /E '6' be minimal. If /E BTaf for some sequence exists a sequence f3 such that TaTo/ = f

IX,

there

By the remark preceding the lemma, Ta is onto, and therefore Tag for some g E Bf • Since g is then minimal, we can find f3 with Tnf = TpTag = g, and a fortiori TaTnf = f. The lemma is proved. PROOF.

/

=

347

&I-ALMOST AUTOMORPHIC FUNCTIONS

We now fixf E 'if and consider the significance of the existence in Xr of a set S with the following properties: (a) (b) (c) (d) (e)

S is closed. fE BTar if TafE S. For each TJE Xr there exists a sequence f3 such that TpTafE S. If TafE S, and if f3 is a sequence such that TaTpf = I, then TpfE S. If Tal, TpfE S, and if TaTpfexists, then TaTpfE S.

LEMMA 3. Suppose f and S

Problems in Analysis. A Symposium in Honor of Salomon Bochner

Problems in Analysis. A Symposium in Honor of Salomon Bochner

Problems in Analysis. A Symposium in Honor of Salomon Bochner