Stochastic Analysis: Proceedings of the Taniguchi International Symposium on Stochastic Analysis, Katata and Kyoto, 1982 (North-Holland Mathematical Library)

North-Holland Mathematical Library Board of Advisory Editors: M. Artin, H. Bass, J. Eells, W. Feit, P. J. Freyd, F. W...

Author: Kiyosi Ito

26 downloads 508 Views 6MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

North-Holland Mathematical Library Board of Advisory Editors:

M. Artin, H. Bass, J. Eells, W. Feit, P. J. Freyd, F. W. Gehring, H. Halberstam, L. V. Hormander, M. Kac, J. H. B. Kemperman, H. A. Lauwerier, W. A. J. Luxemburg, F. P. Peterson, I. M. Singer and A. C. Zaanen

VOLUME 32

NORTH-HOLLAND PUBLISHING COMPANY AMSTERDAM NEW YORK OXFORD

Stochastic Analysis Proceedings of the Taniguchi International Symposium on Stochastic Analysis, Katata and Kyoto, 1982 Kiyosi It8 Department of Mathematics Gakushuin University Tokyo, Japan

1984

NORTH-HOLLAND PUBLISHING COMPANY AMSTERDAM NEW YORK OXFORD 9

@ Kinokuniya Company Ltd., I984

All rights reserved. N o part of this pirblication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mecharrical, photocopying, recording or otherwise, without the prior permission of the copyright owner.

ISBN: 0 444 S75SS 3

Published by:

NORTH-HOLLAND PUBLISHING COMPANY AMSTERDAM -NEW YORK OXFORD

-

Distributed in Japan b y :

KINOKUNNA COMPANY LTD.

PRINTED IN JAPAN By Tokyo Press Co., Ltd.

Preface Stochastic analysis is a new branch of probability theory stemming from the theory of stochastic differential equations and is becoming more and more important in close connection with partial differential equations, non-linear functional analysis, control theory and statistical mechanics. For the purpose of promoting the activity in this field we held the International Workshop on Stochastic Analysis at Katata from July 1 through July 7, 1982 under the.auspices of Taniguchi Foundation. Fifteen mathematicians, eight from abroad and seven from Japan, participated in the Workshop to discuss stochastic analysis from various angles. As an extension of the Workshop we held the International Symposium on Stochastic Analysis at Kyoto on July 8 through July 10 with about 150 participants. The papers contributed to the Workshop at Katata and the Symposium at Kyoto are contained in these Proceedings. For the workshop and the symposium Taniguchi Foundation kindly gave us a full financial support and provided us with facilities at Kyuzeso, Katata. We would like to express our sincere gratitude to Mr. T. Taniguchi and Professor Y. Akizuki in this respect. We hope that this symopsium has contributed not only to promotion of scientific activities but also to international mutual understanding, the ideal of Mr. Taniguchi in supporting us. Also we would like to thank the Research Institute for Mathematical Sciences at Kyoto University for providing us with lecture halls. Professors N. Ikeda and S. Watanabe made painstaking efforts to organize the workshop and the symposium and edit these proceedings. Mr. S. Okuda, Taniguchi Foundation, effectively made administrative businesses. Mr. S. Okada, Kinokuniya Publishing Company, eagerly cooperated with us to publish these proceedings. We are very grateful to all of them. March 18, 1984

CONTENTS Preface N. IKEDA & S. WATANABE -

An introduction to Malliavin's cal-

~m

I. M. BISMUT - Jump processes and boundary processes R. FIGARI. E. ORLANDI & G. PAPANICOLAOU - Diffusive behavior of a random walk in a random medium T. FUNAKI - Random motion of strings and stochastic differential equations on the space C([O, 1], Rc!) B. GAVEAU - An example of a stochastic quantum process: interaction of a quantum particle with a boson field R. HOLLEY - Convergence in L2 of stochastic Ising models: Jump processes and diffusions N. lKEDA - On the asymptotic behavior of the fundamental solution of the heat equation on certain manifolds K. Ito - Infinite dimensional Ornstein-Uhlenbeck processes S. KOTANI- Ljapunov indices determine absolutely continuous spectra of stationary random one-dimensional SchrOdinger operators H. KUNITA - First order stochastic partial differential equations S. KUSUOKA & D. STROOCK - Applications of the Malliavin calculus, Part I Y. LE JAN & S. WATANABE - Stochastic flows of diffeomorphisms P. L. LIONS - Some recent results in the optimal control of diffusion processes P. MALLIAVIN - Implicit functions in finite corank on the Wiener ~~

D. MICHEL - Conditional laws and Hormander's condition I. SHIGEKAWA - Transformations of the Brownian motion on the Lie group M. SUZUKI - Asymptotic behavior of nonlinear Brownian motion near the instability point Y. TAKAHASHI - Entropy functional (free energy) for dynamical systems and their random perturbations H. T ANAKA - Limit theorems for certain diffusion processes with interaction

1 53

105 121 135 149 169 197

225 249 271 307 333 3$ 387

409 423 437 469

Taniguchi Symp. SA Katata 1983,pp. 1-52

An Introduction to Malliavin’s Calculus Nobuyuki IKEDAand Shinzo WATANABE

6 1.

Introduction

The purpose of the present exposition is to give an introductory survey on the Malliavin calculus, an attractive topic in stochastic analysis and to which several works in these proceedings are related. It is an infinite dimensional differential calculus for functions on a Wiener space. Since functions on the Wiener space are functions of paths, this calculus is also called a stochastic calculus of variation for Wiener functionals. The first important event in stochastic analysis is, without doubt, the introduction of the Wiener measure by N. Wiener in 1923 ([96]). Since then many attempts have been made to develop an infinite dimensional analysis on a Wiener space. But they are unsatisfactory because few effective applications could be supplied to those Wiener functionals, most interesting and important in probability theory, which are products of It6’s calculus such as stochastic integrals or solutions of stochastic differential equations ([36]) and [37]). The first main task of Malliavin’s calculus is to establish a differential calculus well-adapted to these functionals and in doing this, one is naturally led to the notion of Sobolev spaces of Wiener functionals and the weak derivatives defined on them. Malliavin’s first approach in [58] and [59], followed by Stroock [82] and [83], was based on the stochastic calculus of Omstein-Uhlenbeck processes on the Wiener space and an equivalent but more analytical approach was given by Shigekawa [78]. One of the most important contributions on this line is due to Meyer [70] in which the equivalence of various Sobolev norms are established: This result of Meyer simplified the former approaches and a more natural and manageable theory of Sobolev spaces is now available (1891 and [94). We will discuss this topic in detail below. Here we would like to remark an infinite dimensional characteristic feature of our calculus. In the classical theory of Sobolev spaces and weak derivatives on a finite dimensional space, the Lebesgue measure plays a fundamental role. In the infinite dimensional case there is nothing like a Lebesgue measure, however, and a natural substitute is a Gaussian measure. Such a measure p, when the mean is 0, is determined by a Hilbert space H in the sense that the characteristic functional of /I is given

N. IKEDAAND S. WATANABE

2

-

by exp {-[I &/2] through its norm. But p is not carried on H , actually p ( H ) = 0, and ,ti should be carried on a larger space W 3 H. In the following, we choose such a Banach space W and discuss the derivatives of functions on W. Thus we adopt Gross’s scheme of an abstract Wiener space { W , H, p}. It should be noted, however, that there is no intrinsic choice of the space W. In the case of the Wiener measure, one usually takes W to be the Banach space of continuous functions with the supremum norm but one may choose a smaller Banach space endowed with, for example, the Holder norm of the order a! < 112. Thus t o discuss the derivatives of functions on W in connection with the measure p, the Frkchet differential calculus based on the norm of W is inadequate: This is also clear from the fact that p-measurable functions are transformed to p-measurable functions under the translation by elements in and only in H . As we shall see, the definition of Sobolev norms will be given intrisically only in terms of H. Once Sobolev spaces are defined, we can speak of a smooth function on W in the sense of weak derivatives. Many interesting Wiener functionals obtained by It& calculus are shown to be smooth in this sense. It is a remarkable fact that there exists a smooth function on Win this sense which can not have a continuous modification on W i n the sense of Banach-norm (see Example 5.1). This shows that a theorem like “Sobolev’s imbedding” is no longer valid in the infinite dimensional case. Recently, Malliavin introduced the notion of capacities connected with the Sobolev norms and a class of exceptional sets in W called dim sets (see [a]). The above fact can be rephrased as follows: A single point is always slim in the infinite dimensional case. The Malliavin calculus could not have attracted so much attention, if there had not been interesting and significant applications. Malliavin first considered its application to the regularity of the heat kernel for degenerate second order differential operators of Hormander-type ([58] and [59]). His idea is to use the differential calculus on the Wiener space: Related analytical quantities are expressed as integrals (= expectations) of Wiener functionals obtained by It6’s calculus and we apply to them the integration by parts on the Wiener space so as to obtain necessary estimates. This idea was refined and expanded by Kusuoka and Stroock ([86] and [SS]) and these works can be regarded as a great success of the Malliavin calculus in the problem of analysis. A similar idea was also used in a regularity problem arising from the filtering theory (Michel [72], Bismut and Michel[14], Kunita [MI, Kusuoka and Stroock [57]). Bismut (171) gave another approach to the integration by parts for Wiener functionals and his method, based on the Girsanov theorem, can be applied to more general cases than those of Gaussian measures, cf. Bismut [ll], [12] and [13] and Bichteler and Jacod [4]. In this exposition, however,

3

A n Introduction to Malliavins Calculus

we confine ourselves to an analysis of Gaussian measures.

0 2. An elementary study of the finite dimensional case As stated in Section 1, the first main aim of the Mailiavin calculus is to establish a differential calculus based on a Gaussian measure. In this section we shall explain, in the simple case of finite dimensional spaces where the proof is easily provided by the classical analysis, notions and formulas which we are aiming at extending in the general case of infinite dimensional spaces. Let V be a real vector space of the finite dimension n and ,u be a mean 0 Gaussian measure on it with the support on the whole space V. p is determined by a Euclidean norm cr*(E) on V*, the dual space of V, by the relation

where ( E , x } is the canonical bilinear form on V* x V. a*(E) induces a Euclidean norm a(x) on V by

a(x)* =

2(ei, x>2 i=l

where {ei} is an orthonormal base (ONB) in V * relative to a*. If {el} is the dual base of Vrelative to {ef;}, i.e. (ef;, e,) = a;, then {e,} is an QNB in V relative to a and the canonical identification V* = V through the Riesz theorem is provided by identifying e, with e i for each i = 1, 2, - .,n. Then we can write a ( x ) = a*(x) = ( x , x)"' and (x, y) is the inner product of the Euclidean space V. In the following, these norm and inner product are denoted by I I v and ( ., *}v,respectively.

-

1

Remark 2.1. If V = Rn and /I = N(0, u); the mean 0 Gaussian measure with the covariance u = (u") where u is a symmetric strictly positive-definite matrix, i.e. JRmxip(dx)= o and

i , j = 1, 2,

..

then the above intrinsic Euclidean norm I . I v and the inner product (. ,*)v corresponding to p are given by


4

for x = (x') and y = (us)>, where u-' = (u~,). Before proceeding further, we prepare several necessary notions and notations. Let V be another Euclidean space with the norm 1 - I*. and the inner product ( ., *)*.. We denote by A(V, V ) the Euclidean space of all linear mappings I : V - + Y endowed with the Hilbert-Schmidt inner product ( I , l')Hs = (Z(eJ, Z'(e5))r where {e,} is an ONB in V. For each k = 0, 1, ., A,( V, V ) is defined successively by A,( V, V )= V and Y )= A(V, Ax-l(Vy Y ) ) . Thus A,(v,Y ) = A(v,V ) . If V = R, db(V,Y ) is denoted simply by A,(V). Then A ( V ) = V* which can be identified with V as above and clearly the norms coincide under the identification. A(V, V )is also denoted by V @ Y (the tensor product of V and V )and A,(V) by V @ V @ . 8 V (k-fold tensor product of V).

-

x:=l

.

'I

-

P

For x E Vand E V ,x @ E E V Q ' k is defined by [x (23( ] ( y ) = ( x , y),.E, , trace of 1 is an element y E V. For 1 e A,(V, V )(= A(V, A(V, V ) ) )the in V defined by n

trace 1 =

C (Z[e,])[e,] i=l

where {e'] is an ONB in V. Finally we denote by L,( V-+ V ,p), 1 5 p < 00, the usual L,-space of all V-valued, p-th p-integrable functionsf(x) on V with the norm

L,( V + R, p) is denoted simply by L,( V, p). For a differentiable function$ V - t V ,its derivative DJ':V-t A( V, V ) is defined, as usal, by

Further, Dkf: V - t A,(V, V )is defined by Dkf = D(Dk-'f). Iff: V-+ V is only p-measurable and locally p-integrable, Df and Dkfare defined in the Schwartz distribution sense. Next we shall introduced another operator 6 in the following way: For a differentiablef: V+ V or, more generally, V-+ A(V, V ) , we define 9f: V += R (resp. V+ V ) by the relation

An Introduction to Malliavin's Calculus

5

-

where ( ,* ) H S is the Hilbert-Schmidt norm in A( V, V ) )for all smooth (= C - ) g : V - t R (resp. V - t V ) with compact support. Since p(dx) is N(0, I) (I is the identity matrix) in the Euclidean coordinate of V relative to an ONB {et},the integration by parts yields at once

(2.3) . ,

= trace Df(x) - f(x)[x]

in the case f: V - t V and, neglecting the middle, (2.3) still holds in the general case f:V - t A(V, V ) . If f:V - t V or V - t A(Vy V ) is only plocally integrable, S f is defined in the Schwartz distribution sense. For a twice differentiable function f: V + R or V + Vydefine Lf: V 4R (resp. V - t 9 '") by (2.4)

J?f(X)

=

W f )(4

= trace D2f(x) - Df(x)[x].

Note that Df(x)[x]= (Df(x), x ) , in the case V = R. Again Lf is defined in the Schwartz distribution sense iff is only p-locally integrable. The operator L is called the Ornstein-Uhlenbeck operator. By (2.2) and (2.4), we see at once that

iff, g : V + R are C2and square p-integrable together with their first and second derivatives. Remark 2.2. In Remark 2.1 of the case V = Rnand p = N(0, u), Lf(x), forf: R" + R,is expressed as

Note also that

and hence

N.IKEDAAND S. WATANABE

6

Let (2.6)

D" = {f: V 3 R, smooth (= C'") and D Y E L,(v+ A,(v),PI,k = 0, I , 2,

n

- . .I.

P>l

-

For a mapping F = ( F l yFZ, . -,F'): V - + Rd with F iE D", i = 1, 2, . ., d, we set

-

a(x) = (oij(x))

where at'(x) = ( D P ( x ) , DFj(x)),,

(2.7)

i , j ' = 1,2,

.- d . a,

We shall assume the following: (A. 1) For a.a. x (p), U(X) is strictly positive definite. Under this assumption, the inverse a-'(x) = (T@) of a(x) can be defined for a.a. x (p). We say that F satisfies the assumption (A. 2)p, 1
0, W'o" is the Banach space formed of all continuous functions w: [0, T] -+ R" such that w(0) = 0 with the norm 11 wI(= max,,,., Iw(t)I. Let H = {h = (h*((t))G,E W;; each of h, is absolutely continuous with square integrable derivative}. H is a separable Hilbert space with inner product (hl, h2)H =

2 J: I;i(s)h:(s)ds

h, E H , i = I , 2?

where

If P is the Wiener measure on W;, then it is easy to see that { W;, H, P"} is an abstract Wiener space which we call the n-dimensional Wiener space. For 0 < oc < 1/2, set W;,a= {w E W ; ; \\wl\,, < m}, where

Then W;,, is a Banach space with the norm 11 . I), and it is easy to see that { W;,e,H , P w Iw0" } is also an abstract Wiener space. 1U

Example 3.2. (n-dimensional pinned Wiener space). For a fixed T > 0, W ; and H are as in Example 3.1 and consider their subspace @, = {w E W;; w(T) = 0) and ~, = H. Let PrTbe the ndimensional pinned Wiener measure on W;,T, that is, the Gaussian measure with the mean 0 and the covariance

n

1

w'(t)wj(s)PzT(dw)= &p(T - t ) ,

*0".T

0

<s 5 t 5 T ,

f i T , PET} is an abstract Wiener (cf. [5]). It is easy to see that space which we call the n-dimensional pinned Wiener space. From now on, we fix an abstract Wiener space { W, H, p} and discuss a differential calculus for p-measurable functions on W. Our first aim is to introduce a similar class of functions as D" in Section 2. Before proceeding further, however, we prepare several notions and notations. In the following, E, El, . . . are separable Hilbert spaces with norms and and ( ., * ) E , ( . , * ) E l , . .. inner products denoted by I . IE, I . IEl , ' A(El, E,), denoted also by El 0E,, is the Hilbert space of all linear mappings A : E, -+ E, such that

-

11


for some (= any) ONB {e,} in El, which is endowed with the HilbertSchmidt inner product m

( 4 ,A P ) H S : =

C (Al(ed, i=l

Az(e,))E2

'

As in Section 2, for k = 0, 1, . . we define Ak(El,E,) successively by Ed = A(& Ak-SEi, Ed). If Ez = R, AdEi, Ez) -'!,(El, Ez) = Ez, is denoted simply by A,(E,). A,(E,) is also denoted by El 0E, 0 - . . 0El \

7-

/

k

and called the n-fold tensor product of El. Note that Al(El) = E f = El and the norms coincide under the identification. For x E El and y E E,, x 0y E El 0E, (= A(El, E,)) is defined by ( x 0y)[z] = (x, z),, .y, z E El. In the following, E Dc E, E! c El, . . . are some dense subspaces of Hilbert spaces E, El, . . . . A typical example is the case E = H and Eo= W*. It is convenient to introduce a dense subspace AO(E!,E;) of 44, Ed by

Ao(E;,E;) =

(.E A(El,Ez);

A is of the form A =

5e, 0fi

i=l

for some n, e, E Et and f i E E:, i = 1,2, . . ., Note that, in the case El = H and E; = W * , A

E

do(W*,E;) C A(H, E,) n

extends to a continuous linear mapping A : W 3 E: by A[x] = C e,(x). fi. i=l

For k = 0, 1, . . ., A",E;, E:) is defined successively by AXE!, E:) = E,O and Al(E!, Ei) = do(E&A;-](E;, E,O)). If Ei = R, we write simply .4i(E!, E;) = Ai(E;). If A E A:(E!, E:), then A is of the form of a finite

sum A

n

=

C

e, 0[el O h l ]with e, E E! andfi, E E ; and the trace of A

i,j=l

is an element in E: defined by

where { C k } is an ONB in the linear span of {el, e,, . . ., en}. We denote Lp(W 3 E, p) := L,(E), 1 p < 03, the usual L,-space of p-measurable functions f: W-+ E with

12

N. IKEDA AND S. WATANABE

L,(W-+ R,p) is denoted simply by Lp(W, p):= L,. Finally we shall introduce a convenient class of functions on W called polynomials on W. We set 9 = {F;F W + R which is expressed in the form F(w) = p(l,(w), Iz(w), . . ln(w)) for some It, I,, l,, - - I, E W* and a real polynomial p(tl, t2, - ., t,)} and B(Eo)= {F; F: W+ E o which is expressed in the form F(w) = C;=lF,(w)e, for some n, F, E B and e, E E" c E } . a ,

+,

Proposition 3.1. l I p < m .

B ( E o ) is a dense subspace of L,( W -+ E, p) for any

Let 8, be the set of E-valued polynomials on W of degree ) ~ ( w ) p ( d=w ) ~ ( w ) ~ ~ ( w ) p ( d w ) W

J

= -

W

( D F ( ~ )~, ~ ( w ) ) , p ( d w )

+

for every F E Di and G E Di such that l/p I/q 1, (see [78], [89] and [94]). Futhermore we note that there exists a stationary Gaussian diffusion on W with the semigroup {T,},,, given by (3.13) and the invariant measure p, called the Ornstein- Uhlenbeck process. As stated in Introduction, in [58] Malliavin defined the notion of derivatives for Wiener functionals in terms of Ornstein-Uhlenbeck processes over Wiener spaces, (also see [82]).

0 4.

The pull-back of Schwartz distributions and the regularity of image measures

Let { W, H , p} be an abstract Wiener space as in the previous section and F(w) = (F’(w),F2(w), . . ., Fd(w)) be a mapping W - t Rd. If F is “smooth” and “non-degenerate”, we can expect as in the finite dimensional case that F has such nice properties as ( i ) every smooth vector field X on Rd can be lifted to a “smooth” vector field W +. H such that D F [ f w ] = X F ( w ) , (ii) the image measure F*(p) on Rd of p has a smooth density. We shall study these properties of the mapping F by defining the pullback or lifting of Schwartz distributions on Rd as elements in D-” of the previous section. This approach has some advantage: If we can give a correct mathematical sense to 6,(F), the pull-back of the Dirac &function 6, at x E Rd by the mapping F, then we can justify a formal expression

s:

18

N.IKEDAAND S. WATANABE

like EP[G,(P)Gl for the density Ep[G(w): P(w) E dx]/dx, often used intuitively, and the smoothness of the density is a by-product of this justification. Here EP[ -1 denotes the expectation with respect to the probability p. Furthermore, in this framework, the various properties of Schwartz distributions can be lifted to those of generalized Wiener functionals. First, the above smoothness of F: W + Rd is defined in the sense of Sobolev spaces: We assume that (4.1)

F E D"(R')), equivalently, Ft E D", i = 1,2, .

a ,

d.

Then a(w) = (oij(w)) is defined by (4.2)

= (DFi(w), DF'(W))H, i,.f = 1, 2,

8 ( W )

* * *,

d

and u'f E D". The matrix o is called the Malliavin covariance of F. As in Section 2, we consider the following non-degeneracy assumption (A. 2), for F: We say that I; satisfies (A. 2), ( p > 1) if a(w) is strictly positive definite for almost all w ( p ) (this assumption is refered to as (A. 1)) and if

co

det (rtj(w)) E LJW,

(4.3)

where a-'(w) = (j' 1. Using the identity d

DTfj = - C Y;J;ZDO:',

(4.4)

where

k,l=l

+

= uL1 edLzand (?;')

Proposition 4.1. 1. fiedforp > k

+

Ti,

E

E

> 0,

= (utl)-', it is easy to verify the following:

09 for all 1 I r

O

9"(Rd):=the Schwartz space of tempered distributions on Rd.

Then we have the following (see [93] and [94]):

Theorem 4.2. Let k be a positive integer. If F: W--t Rd satisfies (A. 2), for some p > 4k and if we take q > 1 satisfying (4.12)

l/q

+ 4k/P < 1

Y

then the mapping Y ( R d )3 4 -+ q5 0 F E D" is continuous with respect to the norm 11. ll-2k on Y ( R d )and the norm 11 .IIpn, - 2 L on D" where

(4.13)

I/Po

+ l/q

=

1.

More precisely, we have

i

(4.14) where

~ / ~ ° F ~ ~ P n , - Z kK k , q l I $ I I - Z k ~

Kk,*

is given by (4.10).

Proof. We have

$ EY(Rd),

An Introduction to Malliavin’s Calculus

21

and by (4.9)

I lI(1 =

+ 1x12 - ~)-k$lI~IIi71~(~~8>111

ll$ll-2k

ll~2k(+v;g)ll*.

This, combined with (4.10), concludes the proof. Corollary 4.1. 4

Under the same assumption as Theorem 4.2, Y ( R d )3 9

9 F E D” extends to a unique continuous linear mapping qwzk 3 T 0

---f

T ( F ) E D;:k.

Since we can choose any q such that (4.12) is satisfied, we can choose any p , such that

1 1. In particular, Y ’ ( R d )3 T -+ T ( F ) E D-” is dejned and

This T ( F ) E D-” is called the composite of the Schwartz distribution T and the Wiener functional F or the pull-back of the distribution T on Rd under the mapping F: W-+ Rd. T ( F ) coincides with T o F if T is given by a bounded continuous function on Rd. Now the above result can be applied to the existence of smooth First, we note the following facts easily verified by density of F&). Fourier analysis: If 6, E Y ’ ( R d )is the Dirac &function at y, then 6, E %?-2, if and only if m > d/2. Furthermore, Rd E y --+ 6, E V-,,-, is 2ktimes continuously differentiable for k = 0, 1,2, . . .,where (4.15)

WZ =,

[d/2]

+ 1.

Therefore, we can conclude the following (see Watanabe [93] and [94]): Theorem 4.3. Let k = 0, 1,2, . . . and suppose that Fsatisjes (A.2), for p > 4(m, k). Then for every 1 < po < p/4(mo k), 6,(F) E D;tmo

+

+

22

N.IKEDA AND S. WATANABE

for every y E Rd and Rd 3 y +-6,(F) t D;:mo-2kis 2k-times continuously diferentialbe. In particular, y 4(6,(F), g ) E R is 2k-times continuously diferentiable for any g t DiY+2k,l/p, 114, = 1, where (., *) is the x D:Y+2k. canonical bilinear form on D;a2mo-2k

+

--

Corollary 4.3. IfFsatisjies (A. 2)-, then fbr any k = 0, 1,2, . and po > 1, Rd 3 y -+ 6,(F) E D;tmo-2kis 2k-times continuously direrentiable and hence Rd 3 y -+ (S,(F), g ) is 2k-times continuously direrentiable for every

In particular, it is infinitely differentiable i f m

In Theorem 4.3, if g E DiY then y -+ (6,(F), g } is continuous and it is easy to show by the Riemann sum approximation that

J

(4.16)

Rd

f(y)(G,(F), g)dY = ( f o F, g> = J w f o F(w)g(w),u(dw)

for every continuous f: Rd -+ R with a compact support. This shows that (6,(F), g) coincides with the density F,(,ug)(dx)/dx of the measure F*(,ug)(A) =

[ g(w)l,{F(w))p(dw), W

for Bore1 A c Rd ,

with respect to the Lebesgue measure dx on Rd. In particular (6,(F), 1) = F,(,u)(dx)/dx where F&) is the image measure of ,u by the mapping F and

( w 9 9

g> = Er(g I F

= Y)1)

by using the well-known notion of conditional expectation. Thus we have

Corollary 4.4. Let k = 0, 1, 2, . . . and F satisfy (A. 2), for p > 4(m0 k). Then Y = F*(,u) has a C2*-density Y(X) with respect to the Lebesgue measure dx and Er[g I F = X ] Y ( X ) has a C2k-versionif g E DiY+” with l/q, 4(m0 k)/p < 1. In particular, i f F satisjies (A.2)m, then v = F,@) has a C”-density Y(X) with respect to the Lebesgue measure h and for any

+

+

+

23

A n Introdirction to Malliavin's Calcidirs

Ep(g IF = X)Y(X) has a CZk-versionwith respect to the Lebesgue measure dx for every k = 0, 1 , 2 , - ... Example 4.1. Let { W ; , H, P"} be the d-dimensional Wiener space (Example 3.1) and for fixed t > 0 and x E Rd, let F: W -+ Rd be defined by F(w) = x w ( t ) . Then F E D"(Rd)with utf(w) = &,t. Hence Y,, = &,/t E L, for all p 2 1 and therefore (A. 2)- is satisfied. Let k(x) E P ( R d )such that I k(x) I 5 K ( l [ x I) for some positive constant K and derivatives are all polynomial growth order. Then it is easy to verify that

+

+

g(w) = exp

for all p have JRd

> 1.

Hence y

-+

[Jl

1

+ w(s))ds

k(x

E D : ~

(6,(F), g ( w ) ) is C Z kif k = m - m, > 0. We

0 and x E Rd be fixed. Then ( i ) X i = F ( t , x, w) E Dm,i = 1,2, . d, (ii) The Malliavin covariance ad' = ( D X ; , O X ; ) , is given by

-

(5.4)

a'? =

I ,

i , j = 1, 2, . . ., d. 2 s' (Y,Y,'Vo(X,))i(Y,Y;'Vo(Xs))'ds, 0

n=l

Here (Ax)' = C;=lA:xj $ A = (A:.) is a d X d-matrix and x = (x') E Rd.

A rough idea of the proof is as follows: Since

a DX:[h] = -Xi(t, x, w a&

+~ h ) ,

h E H,

IL0

we obtain from (5.2) DXf[h]= (5.5)

2 5 s' 3, I/,i(X,)DX:[h] dw"(s) + 2 f V:(X,)k(s)ds + 5 s' 3,V6(Xs)DX:[h]ds, 0

o=l k=l

0

a=l

0

k=l

0

where h = (hn(s)). By the method of variation of constants, we obtain

DXI[h] =

(5.6)

2 s' [r,Y;' V,(X,)]"(S)dS,

o=l

0

or, if one likes, it is easy to verify by the It8 formula that the right-hand side of (5.6) actually satisfies (5.5). Similarly, we obtain the equations for DLXt[h,,h,, .,h,] successively and we see easily D k X E Lp(W;+ Ak(H),P") for all p 2 1. These formal arguments can be made rigorous by the standard Cauchy polygonal approximations, cf. [34] for details. For a vector field V(x) on Rd identified with the Rd-valued function V ( x ) = (V'(x)), define a function&: RdX GL(d, R)+Rd by

--

(5.7)

f T ( r ) = Y - I V ( x ),

r = (x, Y ) E Rd x GL(& R) ,

(see [34]). Now (5.4) can be written also as

Set (5.9)

6'' =

2 rfGa(rs).fJa(r,)ds, o=l

i , j = 1,2, . . .,d .

0

Our next aim is to study the conditions for "non-degeneracy" of the Malliavin covariance a = (a"). Noting the well-known fact (cf. [34])that

30

N.IKEDA AND S. WATANABE

E(ll Y#) < co and E(lI Y;'Il*) < co for allp 2 1, we can replace u = (at') by 2 = (P)in this study, that is, (A. 1) is satisfied if and only if det 2 > 0 a.e. (P") and further (A. 2)m is satisfied if and only if

(5.10)

E((det 6)-")

6). Lemma 5.6. (cf. [34], Lemma V-8.6). Brownian motion on s E [O, 61. Then (5.17)

P"[V,o,al(B)< el

Let B ( s ) be a one-dimensional

I 2 exp [-6/(2'~')1

for every positive E and 6. Lemma 5.7. Let B ( s ) be a one-dimensional Brownian motion on [0,a] where a is a positive constant. Then for every 0 < Y < 112, there exist positive constants a, and a, such that

This is a consequence of well-known Fernique's theorem (cf. Kuo [47], Theorem 111-3.1). In Example 3.1, we remarked that for 0 < 'i < 1/2, {Wi,r,H, P"} is an abstract Wiener space and Fernique's theorem

35

A n Introduction to Malliavin's Calculus

implies that E[exp [ E l [ wli:]]< 03 for some E > 0. It is easy to conclude (5.18) from this. For details, see [86]and [94]. Let ~ ( s )be an It6 process with bounded characteristics vo(s). As is well-known, there exists a one-dimensional Brownian motion B ( t ) such that

where A ( s ) = C J Tn(u)2du. n=l

0

Hence, by Lemma 5.7, we have the following

Lemma 5.8. Let ~ ( s )be an Ztd process with bounded characteristics. Then for any 0 < Y < 1/2, ci,i = 0, 1, 2 exist such that (5.20)

Pw[ sup O<s ~ * 2 - ~( ba ) - ] 2 48-'2(b

(S

- tl),ds

- a)-'lZ13

which completes the proof. Proof of the key lemma. Suppose t = 1 for simplicity. In proving (5.15), we may clearly assume n 2 2. In the following a,, i = 1,2, . . . and d,, i = 1,2, are positive constants independent of n. First we note that a one-dimensional Brownian motion B ( t ) with B(0) = 0 exists such that

-

where

Set

and

Here c, is given and c, will be determined later.

Set

and Wl,z= [A(&) 2 l/nal].

Then (5.23)

Wl C Wl,l U W1.:

if a, 2 c,

+ 1,


since then l/nal 5

l/nC4+l

l/(2nC4)for n 2 2. Set

w,= [& - & = l/n] w,= [ sup -Ieo(u) - €o@)I opz3 >

We say that x E Rd satisfies the assumption (A. 3) i f there exist A4 0 and A , , A,, . . ., A , E ,f" such that A,(x), A,(x), . A d ( x ) are linearly Clearly x satisfies (A. 3) if and only if there exists A4 0 independent. such that

-

a ,

(5.38) If (5.38) is satisfied, then we can find > 0 and bounded neighbourhoods U ( x ) in Rd and U(I) in GL(d, R ) of x and I respectively such that (5.39)

C

inf 2ESd-I

(f.fA)'(r)

2 E,

if r

E

U ( x )x U ( I ) .

ACfx

Set u = inf {s;re 4 U(x) x U(1))

where re = (Xe, YJ is the solution of (5.2) and (5.3) put together with ro = (x,I ) . Then, by Lemma 5.5, for some c, > 0, P"[u

Set a: = 0 and (5.40)

C$

< t/n]

exp [-c,n],

= u A (t/n),n = 1,2,

n = 1, 2, . . . .

. -. Then

41

A n Introduction to Malliavin’s Calculus

and P”[W;] I exp [ -cln]. Let # (JM)= N, say. Then for every 1 E Sd-’ we can find ao,al, ., a,,0 I k I M such that 1 I a. I r, 0 I a‘,a,, ...,aI,< rand

-

(5.41) where

Noting (5.35), we can apply the key lemma successively to conclude the following: for each j = 0, 1, . ., k, we can find positive constants c{, i = 1 , 2 , 3 , 4 , independent of n and Z, such that

-

In particular, we can conclude that sup P w [ k Z€Sd-l

a=l

J’[l~fva(rs)]2ds < l/ncl] I c, exp [-cgnc4] 0:

for some ci,i = 1,2, 3 , 4 . By Lemma 5.3, we can conclude that (5.42)

E[(det & ) - P I

0, F: W:

Let 9” = {x E R d ;x does not satisfy (A. 3)). By a slight modification of the proof, we can show that the conclusion of Theorem 5.2 still holds if x E 3’ but the set 3 is “thin” at x in a certain probabilistic sense, (cf. Malliavin [59] and [34]). For a further study on this line, especially applications to hypoellipticity and global hypo-ellipticity of the operator

c va, + vo

1 ‘ A =2 a=l

and to an ergodic theorem of A-diffusion, cf. Kusuoka and Stroock [56]. For some examples to which Malliavin’s method can be applied, see Stroock [82]. [34] and [86]. In [63], Malliavin also discussed an application to estimate the resolvent.


42

9 6.

Several topics related to the Malliavin calculus

(A) Stochastic oscillatory integrals In Example 5.1, LCvy's stochastic area was introduced: It is a stochastic line integral (cf. [33]) S ( t , x, w) = with respect to the 2-dimensional Wiener process X,(t) starting at x and a is a differential 1-form on R2given by (Y

It is easy to see that for R

=

E

1 -(x'~x'

2

-

x'dx').

(0, co)

and by the Feynman-Kac formula and a formula on Hermite polynomials, we obtain

m x , y ; 4=

Rt 2 sinh 2

(6.3)

(cf. 1211 and [34]). Then there exists a positive a(t; 2) =

and hence

TRt ~

Rt sinh 2

to such

for t 2

to,

that

An Introduction to Malliavin’s Calculus

1

- lim - log a(t; 1) =

(6.4)

cr-

t

R

-

2

43

> 0.

In [62], Malliavin used the stochastic calculus of variation to study similar asymptotic properties of stochastic oscillatory integrals. Let, for a fixed t > 0, { 6 ‘& f i t , pTt} be the d-dimensional pinned Wiener space (Example 3.2) and set X ( s ) = X ( s , x, y , w) = x

+ ( y - x)s/t + w ( s ) ,

s E [O, t l .

Let

c d

cr =

crY,(X)dXZ

i=1

be a smooth I-form on Rd where q ( x ) are smooth with bounded derivatives of all orders 2 1. We set

and

K,(x, y ; R, a) = E[exp

(6.6)

{mW t , w ;x , Y , or>}]

where E [ . ] denotes the expectation with respect to the probability Then it is easy t o see that ( i ) for x , y E R d a n d O I u , s I t ,

(6.7)

~d

K J x , z ; 2, cr)K,(z, y ; 1, a)(&)d’2exp

[--Id2 2s

We now assume that

Malliavin’s result is as follows: Take a 1e (0, m) and set

Prt.

44


Then there exists a positive constant a(,?)such that - lim

(6.9)

tt-

1 log a(t; I , a) = a(I) t

He used the integration by parts on the Wiener space to obtain necessary estimates. In fact, by using (4.6) and (4.7), we can rewrite K,(x, y ; 2, or) in the following form : m x ,y ;I,@)

where K is a positive constant independent o f t , x,y and I . Hence the proof of (6.9) can be reduced to estimate the right hand side of (6.10). For details, see Malliavin [62]. Asymptotic results of this type are related to some problems in the theory of quantum physics and stochastic holonomy in the representation theory of semisimple Lie groups, cf Gaveau [22], Gaveau and Vauthier [27], Malliavin [%I, [60] and [61], M. P. Malliavin and P. Malliavin [67]. (B) Capacities In his paper [64] in these proceedings, Malliavin introduced a family of capacities Cp3,,1 0 on the Wiener space and discussed applications of this notion: Using the same notations as in Section 3, let, for an open set 0 c W, Cp,,(8)= inf{llullp,r;u E D;, u 2 0 and u 2 1 on O } ’

and call it C,,,-capaciry of 8. For any A c W, its (outer) capacity is defined as usual by

crJm=

inf

A C B : open

CP,,(O)

-

A n Introduction to Malliavin’s Calculus

45

A set A C W is called slim if C,,,(A) = 0 for all 1 0. C,,,-capacity (corresponding to the equivalent norm ~ ~= IIu11, u ~ /I IDu Ix11J coincides with that in Fukushima’s sense of the Dirichlet space associated with the Ornstein-Uhlenbeck process on W ([19]) and a set of C,,,-capacity 0 coincides with a polar set of the Ornstein-Uhlenbeck process on w,(cf. [20]). A slight different but equivalent and more manageable notion of C,,,-capacities was introduced by Takeda [90]. The notion of C,,,-capacities has been used to obtain an implicit function theorem on W by Malliavin. For various applications of this notion, see [64] and [65]. Also see Kusuoka [51] for some topics related to the notion of C,,,-capacity on W. Recently many attempts have been made to measure by the capacity a set of p-measure 0: In the case of the ordinary Wiener space, some classical theorems like the law of iterated logarithm or LCvy’s Holder continuity are shown to hold not only almost everywhere but also everywhere except a set of C,,.-capacity 0 for all co >p > 1 and r > 0, while some typical subsets of W with the Wiener measure 0, related to the recurrence of paths or multiple points of paths are shown to have positive C,,,-capacity for some p > 1 and r > 0. For details, see [69], [20], [40], [41], [42], [90] and [go].

+

(C) Applications to filtering theory and the partial Malliavin calculus The Malliavin calculus has been applied to obtain regularity results in filtering theory by, e.g., Michel [72], Bismut-Michel [14], Kunita [44] and Kusuoka and Stroock [57]. The problem is to show the regularity of certain conditional distributions and therefore, in the study of laws of Wiener functionals some components are supposed to be fixed. Such a kind of calculus is formulated and called the partial Malliavin calculus in [57]. Also see Michel’s paper [73] in these proceedings. (D) Applications to infinite dimensional diJEusions A theorem in Stroock [82] enables us to deal with the finite dimensional marginal distributions of some infinite dimensional interacting diffusion. For related topics, see also Holly and Stroock [31].

(E) Bismut’s approach to the Malliavin calculus Bismut [6] obtained the following formula of integration by patrs. Under the same situation as in Section 5, let g($) be a real, bounded, FrCchet differentiable function on the Banach space C([O, TI -+ Rd) with the supremum norm and its FrCchet derivative

~

~

,

46


be given by

with signed measures dgi(. ; $) on [0, TI. Let W ; 3 w + X ( w ) E C([O, TI + R d ) be defined by the solution X,(w) = X ( t , x,w) to SDE (5.2). Let u = (ul(t),uz(t), . . ., u , ( t ) ) be a bounded and {g,}-adapted process on W;. Then

where E [ - ]denotes the expectation with respect to Pw. We shall comment this Bismut formula in our context: Since

by (5.6), the right hand side of (6.11) coincides with E [ ( D ( g ( X ) ) ,u),] where u, identified with

E

H , is an element in D:(H). Thus

(6.1 1) amounts to (6.12)

-c

6u=

This is seen most simply by the Girsanov theorem: For any F E D:,

c

E [ F ( w + E L I ) ] = E exp

C 1: K1 E:

u,(s)dw"(s) -

-E'

2

'St

a-1

u,(s)'ds

I I F(w)

and by differentiating in E , we have

proving (6.12). The formula (6.12) suggests a generalization of stochastic integrals t o a class of anticipative integrands, cf. Skorohod 1811, Shigekawa [77], Gaveau and Trauber [26], Ramer [75] and Taniguchi [92]. In a dual argument, we see also that for any F E Di, its predictable representation

A n Zntroduction to Malliavin's Calculus

47

is given by

and this is closely related to the results of Haussmann [30]. The method based on the Girsanov theorem can be applied to more general cases than Wiener measures, cf. Bismut [l I] and [I21 and Bichteler and Jacod [4]. Also see Bismut's paper [13] in these proceedings.

(F) The Malliavin calculus and boundary value problems Bismut [I21 discussed the regularity of boundary semigroups by obtaining a very elaborate formula of integration by parts. Cf. also his paper ([13]) in these proceedings. Arous, Kusuoka and Stroock [l] discussed the regularity of harmonic measures for operators of the Hormander type as an application of the partial Malliavin calculus. (G) de Rham complex of Wiener functionals The following results are due to Shigekawa [79]. Using the same notations as in Section 3, let the alternation A : A,(H) + A , ( H ) be defined as usual by

and let d ; ( H ) = { T Ed,(H); A(T) = T}. In particular, &( H) = R and & ( H ) = H. Then the Sobolev space D;(d;(H)) are defined and we set

Define

d,: A"(T*(W)) by setting d, = A o D and = (n 1)s. Then

+

a,+,

0 -+ R

2:

o

A" +' (T* (W))

d"+'(T*(W)) --+ d"(T*(W)) by s.tting d, = 0, 6,o 8,+, = 0 and both do

i

{constant functionals} +k'(T*( W ) )-+ A1(T*(W ) ) da 5 AZ(T*(W))+ .

*

.

48

N.

0cR

IKEDA AND

li"AO(T*(W ) )

s. WATANABE

A*(T*(W ) )

are exact. Here i is the injection and i*(F(w)) =

A,; A"(T*(W))

-+

-

A2(T*(W ) )

83

F(w)p(dw).

... Let

A"(T*(W)), be defined by

Then A , = L - n I where L is the Ornstein-Uhlenbeck operator of Section 3. From these results, we obtain the following de Rham-Kodaira decomposition: A"(T*(W))= 8% 0CJ~+~(A"+'(T*(W))) 0&-I(A"-'(T*(W)))

where 2'"= { F E A"(T*(W));A,F = 0) and it holds that Zn = {constant functionals} if n = 0 and 2, = (0) if n 2 1.

(H) The law of a system of multiple Wiener intergrals In case of a one-dimensional Wiener space, it is well-known that the Wiener-It8 decomposition is realized by the multiple Wiener integrals (Ita [38]). We consider an Rd-valued Wiener functional F = (F', F2, . -,F d ) has a finite order expansion by multiple Wiener integrals. By where using the Malliavin calculus, Shigekawa [78] obtained a sufficient condition for the absolute continuity of the law of F. Kusuoka [52] gave a necessary and sufficient condition by using an algebraic method. In concluding this note, we would refer the reader to the following interesting and important expository papers on the Mallianvin calculus : Williams [95], Bismut [9], Kusuoka [54], Malliavin [65] and Stroock [87]: The last three papers will give more complete and up-to-date information on the topics in the Malliavin calculus and its applications.

References G . B. Arous, S. Kusuoka and D. W. Stroock, The Poisson kernel for certain degenerate elliptic operators, preprint. [ 2 ] P. Baxendale, Stochastic flows and Malliavin calculus, to appear in Proceedings of 20th IEEE conference on Decision and Contl., 1981, 127152. [ 3 1 Ya. I. Belopol'skaya and Yu. L. Daleckii, It6 equations and differential geometry, Russian Math. Surveys, 37 (1982), 109-163. 141 K. Bichteler and J. Jacod, Calcul de Malliavin pour les diffusions avec sauts: Existence d'une densitt dans le cas unidimensionnel, Lecture [ 1]

An Zntroduction to Malliavin’s Calculus

49

Notes in Math, 986 (1983), 132-157, SCminaire de Probabilitts XVII, ed. par J. Azima et M. Yor, Springer-Verlag, Berlin. 1 5 1 P. Billingsley, Convergence of Probability measures, John Wiley and Sons, New York, 1968. 161 J. M. Bismut, Martingale, the Malliavin calculus and Hypoellipticity under general Hormander’s conditions, Z. Wahr. verw. Geb., 56 (19811, 469505.

171 181 191

-, Martingales,

the Malliavin calculus and Hormander’s theorem, in “Stochastic Integrals” ed. by D. Williams, Lect. Notes in Math., 851 (1981), 85-109, Springer-Verlag, Berlin. -, MBcanique Altatoire, Lect. Notes in Math., 866 (1981), SpringerBerlag, Berlin. , An introduction to the stochastic calculus of variations, in Stochastic Differential Systems, ed. by M. Kohlmann and N. Christopeit, Lect. Notes in Control and Inform. Sci., 43 (1982), 33-72, Springer-Verlag, Berlin. -, MCcnique albatoire, Ecole dCtB de Probabilites de Saint Flour, Lect. Notes in Math., 929 (1982), 1-100, Springer-Verlag, Berlin. -, Calculus des variations stochastique et processus de sauts, Z. Wahr. verw. Geb., 63 (1983), 147-235. -, The calculus of boundary processes, to appear in Ann. Ec. Norm. sup.. -, Jump processes and boundary processzs, in these Proceedings, 63-104. J. M. Bismut and D. Michel, Diffusions conditionnelles, I, 11, J. Fuc. Anal. 44 (1981), 174-211, 44 (1982), 274-292. R. H. Cameron and W. T. Martin, Transformation of Wiener integrals under translations, Ann. Math., 45 (1944), 386-396. Ju. L. Daleckii and S. N. Paramonova, Integration by parts with respect to measures in a functions space, I, 11, Theor. Prob. Math. Statist. 17 (1979), 55-56, 18 (1979), 39-46. R. M. Dudley, J. Feldman and L. Le Cam. On seminorms and probabilities and abstract Wiener spaces, Ann. Math., 93 (1971), 3 9 0 4 0 8 . K. D. Elworthy, Stochastic differential equations on manifolds, London Math. SOC.,Lecture Notes Series 70, Cambridge Univ. Press, Cambridge, 1982. M. Fukushima, Dirichlet forms and Markov processes, Kodansha/NorthHolland, Tokyo/Amsterdam, 1980. , Basic properties of Brownian motion and a capacity 03 the Wiener space, J. Math. SOC.,Japan, 36 (1984), 161-176. B. Gaveau, Principle de moindre action, propagation de la chaleur et estim k s sous elliptiques sur certain group nilpotents, Acta Math., 136 (1977), 95-153. -, Systtmes dynamiques associes a certain opkrateur hypoelliptiques, Bull. SOC.Math., 102 (1978), 203-229. -, An example of a stochastic quantum process: interaction of a quantum particle with a boson field, in these Proceedings, 135-147. B. Gaveau and J. Moulinier, GBomCtrie differentielle stochastique et inttgrales stochastiques non anticipantes de bruits blancs B plusieurs paramkires, to appear in C. R. Acad. Sci. Paris. -, IntCgrals oscillantes stochastiques: Estimation asymptotique de fonctionnelles caracteristiques, J. Func. Anal., 54 (1983), 161-176. B. Gaveau and P. Trauber, L‘intkgrale stochastique comme optrateur de divergence dans l’espace functionnel, J. Func. Anal., 46 (1982), 230-238. B. Gaveau and J. Vauthier, Intkgrales oscillantes stochastiques I’iquation de Pauli, J. Func. Anal., 44 (1981), 388-400. I. M. Gelfand and G. E. Shilov, Generalized functions, Academic Press, New York, 1964, Vol. 2. L. Gross, Abstract Wiener spaces, Proc. Fifth Berkeley Symp. Math. Statist.

-

-

1251 1261 I271 1281 1291

50

N. IKEDAASD S. WATANABE

Prob. 11, Part 1, 31-41, Univ. Calif. Press, Berkeley, 1965. U. Haussmann, On the integral representation of functionals of It8 processes, Stochastics, 3 (1979), 17-27. R. Holley and D. W. Stroock, Diffusions on an infinite dimensional torus, J. Func. Anal., 42 (1981), 29-63. L. Hormander, Hypoelliptic second order differential equations, Acta Math., 119 (1967), 147-171. N. Ikeda and S. Manabe, Integral of differential forms along the path of diffusion processes, Publ. R.I.M.S. Kyoto Univ., 15 (1979), 827-852. N. Ikeda and S . Watanabe, Stochastic differential equations and diffusion processes, Kodansha/North-Holland, Tokyo/Amsterdam, 1981. -, Stochastic flows of diffeomorphisrns, to appear in Adv. in Probability, Vol. 7, ed. by M. Pinsky, Marcel Dekker, Inc. New York, 1984. K. It6, Differential equations determining Markov processes, Zenkoku Shij6 Siigaku Danwakai, 244 (1942), No. 1077, 1352-1400, (in Japanese). -, On stochastic differential equations, Mem. Amer. Math. SOC.,4 (1951). -, Multiple Wiener integral, J . Math. SOC.Japan, 3 (1951), 157-169. A. N. Kolmogoroff, Zufallige Bewegungen, Ann. Math., 11, 35 (1934), 116117. T. Komatsu and K. Takashima, Hausdorff dimension of quasi all Brownian path in the d-dimensional Euclidean space, to appear in Osaka J . Math., 21 (1984). N. Kono, Proprittts quasi-partout de fonctions altatoires Gaussiennes, ExposCs du 29 Avril au 20 Mai 1983, Stminaire d’Analyse des Fonctions Altatoires, preprint. N. Kono, 4-dimensional Brownian motion is recurrent with positive capacity, preprint. H. Kumano-go, Pseudo-Differential Operators, The MIT Press, Cambridge, 1982. H. Kunita, Densities of a measure-valued process governed by a stochastic partial differential equation, Sys. Cont. letters, 1 (1981 ), 100-104. -, Stochastic partial differential equations connected with non-linear filtering, Proc. C.I.M.E. Session on “non-linear filtering and stochastic control”, ed. by K. Mitter and A. Moro, L x t . Notes in Math., 972 (1982), 100-168, Springer-Verlag, Berlin. H. H. Kuo, Integration by parts for abstract Wiener measures, Duke Math. J., 41 (1974), 373-379. -, Gaussian measures in Banach spaces, Lect. Notes in Math., 463, Springer-Verlag, Berlin, 1975. --, Uhlenbeck-Ornstein process on a Riemann-Wiener manifold, Proc. Intern. Symp. SDE, Kyoto, 1976, ed. by It6, 187-193, Kinokuniya, Tokyo, 1978. -, Donsker’s delta function as a generalized Brownian functional and its application, Theory and application of random fields, Proc. IFIP-WG 7/1 Working conf. at Bangalore, ed. by G. Kallianpur, Lect. Notes in Cont. and Inform. Sci., 49 (1982), 167-178, Springer-Verlag, Berlin. S . Kusuoka, Dirichlet forms and diffusion processes on Banach spaces, J. Fact. Sci. Univ. Tokyo, 29 (1982), 79-95. -, Analytic functionals of Wiener process and absolute continuity, in Functional analysis in Markov process ed. by M. Fukushima, Lect. Notes in Math., 923 (1982), 1-46, Springer-Verlag, Berlin. -, On the absolute continuity of the law of a system of multiple Wiener integral, J. Fact. Sci. Univ. Tokyo, 30 (1983), 191-197. -, The Malliavin calculus and hypoellipticity of second order degenerate elliptic differential operators, Probab. theory and Math. Statist. Proc. Fourth USSR-Japan Symp., ed. by K. It8 and J. V. Prohorov, Lect. Notes

A n lritrodiiction to Malliavin’s Calcirliis

51

in Math., 1021 (1983), 365-371, Springer-Verlag, Berlin. The Malliavin calculus and its applications, to appear in Sfigaku, 1984, (in Japanese). S. Kusuoka and D. W. Stroock, Applications of the Malliavin calculus, Part I, in theses Proceedings, 271-306. -, Apllications of the Malliavin calculus, Part I1 and 111, in preparation. -, The partial Malliavin calculus and its application to non-linear filtering, to appear in Stochastic Processes. P. Malliavin, Stochastic calculus of variation and hypoelliptic operators, Proc. Intern. Symp. SDE, Kyoto, 1976, ed. by K. ItB, Kinokuniya, 1978. -, Ck-hypo-llipticity with degeneracy, Stochastic Analysis, ed. by A. Friedman and M. Pinsky, 199-214, 327-340, Academic Press, New York,

-,

1978.

-,

GBometrie diffkrentielle stochastique, Les Presses de I’UniversitC de MontrBal, Montrkal, 1978. -, Stochastic Jacobi fields, Partial differential equations and geometry, Proceedings of the Park City Conference, 203-235, Lect. Notes in pure and applied Math., 48, Marcel Dekker, Inc., New York, 1979. -, Sur certaines inttgrales stochastiques oscillantes, C. R. Acad. Sci. Paris, 295 ( 1982), 295-300.

-,

Calcul des variations stochastiques subordonnt au procesm d: la chaleur, C. R. Acad. Sci. Paris, 295 (1982), 167-172. -, Implicit functions in finite corank on the Wiener space, ia thes- Proceedings, 369-386. -, Analyse diffkrentielle sur l’espace de Wiener, Proc. ICM, 1983. -, Diffusion on the loops, to appear. M. P. Malliavin and P. Malliavin, Factorisations et lois limites de la diffusion horizontale au dessus d’un espace riernannien symetrique, Lect. Notes in Math., 404 (1974), 166-217, Springer-Verlag, Berlin. P. A. Meyer, Demonstration probabilite de certaines inkgalitis de Littlewood-Paley, SCminaire de Prob. X, ed. par P. A. Meyer, Lect. Notes in Math., 511 (1976), 125-141, Springer-Verlag, Berlin. -, Notes sur les processus d’ornstein-Uhlenbeck, Siminaire de Prob., XVI, ed. par J. Az6ma et M. Yor, Lect. Notes in Math., 920 (1982), 95133, Springer-Verlag, Berlin. -, Quelques resultats analytiques sur le semigroupe d’ornstein-Uhlenbeck en dimension infinie, Theory and application of random fields, Proc. IFIP-WG 7 / 1 Working conf. at Bangalore, ed. by G. Kallianpur. Lect. Notes in Cont. and Inform. Sci., 49 (1983), 201-214, Springer-Verlag, Berlin. A hand-written manuscript of Meyer distributed in the seminars at r711 -, Paris and Kyoto. [721 D. Michel, Regularit6 de lois conditionnelles en thkorie du filtrage nonlin6aire et calcul des variations stochastique, J. Func. Anal., 41 (1981),

-,

8-36.

Conditional laws a i d Hormander’s condiiion, in t h x e Procezding, 387408.

M. A. Piech, A fundamental solution of the parabolic equation on Hilbert spaces, J. Func. Anal., 3 (1969), 85-1 14. R. Ramer, On nonlinear transformations of Gaussian mearures, J. Func. Anal., 15 (1974), 166-187. I. Shigekawa, Absolute continuity of probability laws of Wiener functionals, Proc. Japan Acad., 54 (1978), 230-233. -, Analysis on abstract Wiener spaces, Master t h s i s of Kyoto Univ., 1979, (in Japanese). -, Derivatives of Wiener functionals and absolute continuity of induced

52

N. IKEDA AND S. WATANABE measure, J. Math. Kyoto Univ., 20 (1980), 263-289. de Rham-Hodge-Kodaira’s decomposition on abstract Wiener spaces and its application to infinite dimensional diffusion processes, preprint. -, On a quasi everywhere existence of the local time of the 1-dimensional Brownian motion, to appear in Osaka J. Math. V. Skorohod, On a generalization of a stochastic integral, Theor. Prob. Appl. 20 (1975), 219-233. D. W. Stroock. The Malliavin calculus and its auulications to second order parabolic differential operators, I, 11, Math. System Theory, 14 (19811, 25-65, 141-171. -, The Malliavin calculus, a functional analytic approach, J. Func. Anal., -U - ( 1 9 8 1 ) , 217-257. The Malliavin calculus and its auulications. in “Stochastic Integrals” ed: by D. Williams, Lect. Notes in Math., 851 (1981), 394-432, SpringerVerlag, Berlin. -, Lectures on topics in stochastic differential equations, noted by S. Karmaker, Tata Inst. Fund. Res., 1982. -, Some applications of stochastic calculus to partial differential equations, Ecole d’6t6 de ProbabilitCs de Saint Flour, ed. par P. L. Hennequin, Lect. Notes in Math., 976 (1983), 268-382, Springer-Verlag, Berlin. -, Stochastic analysis and regularity propxties of certain partial differential operators, Proc. ICM, 1983. D. W. Stroock and S . R. S. Varadhan, On the support of diffusion processes with applications to the strong maximum principle, Proc. Sixth Berkeley Symp. Math. Statist. Prob., 111, 333-359, Univ. Calif. Press, Berkeley, 1972. H. Sugita, Sobolev spaces of Wiener functionals and Malliavin’s calculus, to appear in J. Math. Kyoto Univ. M. Takeda, (r, p)-capacity on the Wiener space and properties of Brownian motion, preprint. S. Taniguchi, Malliavin’s stochastic calculus of variations for manifoldvalued Wiener functionals and is applications, Z. Wahr. verw. Geb., 65 (1983), 260-290. -, The adjoint operator of weak derivative as It6 integrals, preprint. S . Watanabe, Malliavin’s calculus in terms of generalized Wiener functionals, Theory and application of random fields, Proc. IFIP-WG 7/ 1 Working conf. at Bangalore, ed. by G. Kallianpur, Lect. Notes in Cont. and Inform. Sci., 49 (1983), 284-290, Springer-Verlag, Berlin. -, Lectures on stochastic differential equations and Malliavin calculus, noted by M. Gopalan Nair and B. Rajeev, Tata Inst. Fund. Res., 1984. D. Williams, “To begin at the beginning. .”, in “Stochastic Integrals”, ed. by D. Williams, Lect. Notes in Math., 851 (1981), 1-55, Springer-Verlag, Berlin. N. Wiener, Collected Works, Vol. 1, ed. by P. Masani, MIT Press, Cambridge.

-,

[831 I841 [851 1861 1871 1881

1891 1901 1911

~

-.

.

IKEDA NOBUYUKI DEPARTMENT OF MATHEMATICS OSAKA UNIVERSITY TOYONAKA, OSAKA560 JAPAN

SHINZOWATANABE OF MATHEMATICS DEPARTMENT KYOTO606 JAPAN

Taniguchi Symp. SA Kyoto 1982, pp. 53-104

Jump Processes and Boundary Processes Jean-Michel BISMUT

9 0. Introduction In this paper, we will present the results which we have recently obtained in our two papers [4] and [6]. Since the starting point of these two papers has been the development of the Malliavin calculus by Malliavin, we start by giving a brief history of this method. Consider the stochastic differential equation in Stratonovitch form m

dx

=

X,(x)dt

+ C X,(X) dwt , *

1

x(0) = x,

where X,, XI, . . . ,X , are smooth vector fields, and w = (w’, . . . , w m ) is a Brownian motion. (0.1) defines a Markov continuous diffusion whose generator 9 is given by

The smoothness of the transition probabilities for the diffusion (0.1)-which define the semi-group etY-is usually studied via Hormander’s theorem on hypoelliptic second order differential operators [lo]. In particular we know from Hormander’s theorem that under conditions on X,, Xi, . -,X , and their Lie brackets, the operator a/at 64 is hypoelliptic, and that the transition probabilities are smooth. In [20] and [21], Malliavin developed a purely probabilistic method of proof for the existence of smooth transition probabilities. The idea in [20] and [21] was to use the stochastic differential equation (0.1) itself to get a direct proof for smoothness. To do this, Malliavin showed that it was possible to integrate by parts on the Wiener space, and that a wide class of functionals of x.(o) given by (0.1) could be submitted to such a calculus of variations. To prove that integration by parts is indeed possible, Malliavin used the Ornstein-Uhlenbeck operator d which is an unbounded self-adjoint operator acting on the L, space (for the Wiener measure), and the corresponding infinite dimensional Ornstein-Uhlenbeck

-

+

54

J.-M. BISMUT

process whose generator is d . Still using the Ornstein-Uhlenbeck operator, Shigekawa [24], Stroock [26], [27] and Ikeda-Watanabe [12] simplified and extended Malliavin’s original approach. In particular the estimates which prove the smoothness of the transition probabilities were proved in Malliavin [21], Ikeda-Watanabe [121 and improved in Kusuoka-Stroock [19], [28] where the full Hormander’s theorem was in fact obtained. In [3], we suggested a different approach to the Malliavin calculus, based on the quasi-invariance of the Wiener measure, which is expressed by the well-known Girsanov transformation. A formula of integration by parts was then obtained in [3], which was in fact deeply related to a result of Haussmann [9] on the representation of certain FrCchet differentiable functionals of the trajectory x.(o) as stochastic integrals with respect to the Brownian motion w. In [4], we developed a calculus of variations on jump processes. Our motivation was : a) To try to exploit the resources of the stochastic calculus on jump processes, and in particular the existence of a Girsanov transformation on jump processes (see Jacod [14]) in a framework where no clear-cut extension of the Ornstein-Uhlenbeck process exists. b) To understand better how the calculus of variations on diffusions works, in particular in its relation to the classical It6 calculus and martingale theory. c) To obtain specific analytic results on a class of jump processes. In [4], the computations seem to be difficult. One of the key reasons (which may appear in Section 1) is that in spite of all the randomness of Brownian motion, all the a-variations of the Brownian motion are deterministic processes, which is not the case for jump processes. The calculus of variations developed in [4] consists in doing an “elementary” integration by parts at the level of each jump, so that in the end an infinite number of classical integration by parts has to be done. In the limiting Brownian motion case, many complications are smoothed out due to continuous stochastic integration, which makes the unpleasant variation terms disappear. In Section 1 of this paper, we present another approach to the calculus of variations on jump processes, which is based on more elementary arguments than in [4], and does not rely on the Girsanov transformation on jump processes. Section 1 should make the reading of [4] easier although much of the technical work is done in [4]. The estimation techniques are briefly indicated. In our later paper [6], we still focused on a special sort of jump processes, which are the boundary processes of continuous diffusions. Our motivations were:

Jump Processes and Boundary Processes

55

a) To understand better the relation between certain pseudodifferential operators [30] and the stochastic calculus. b) To exhibit the interplay between the continuous diffusion (and its continuous martingales) and the discontinuous boundary process (and its discontinuous martingales). In particular, we felt that the It6 theory of excursions (It6-McKean [13], Ikeda-Watanabe [12]) could be a powerful analytic tool to study the boundary semi-groups. c) To understand the possible interplay between the calculus of variations on the continuous diffusion and the calculus of variations on the jump boundary process. d) To try to exhibit some “Hormander-like” interaction between the “drift” and the LCvy kernel of the boundary processes or between two Ltvy kernels. e) To find degeneracy conditions on the continuous diffusion so that the boundary process would exhibit a slowly regularizing behavior which is typical of some jump processes. Section 2 gives a simplified account of our results in [6]. Proofs are in general briefly indicated. The proofs exhibiting the interaction described in d) are given in detail. Relations with the techniques of enlargement of filtrations [15], [16], [17], and [38] are exhibited.

0 1. The calculus of variations on jump processes The purpose of this section is to present some of the methods and results which we obtained in [4] on the calculus of variations for jump processes (these results were announced in [5]). Recall that in [3], we had given an approach to the Malliavin calculus using the quasi-invariance of the Brownian measure which is expressed through the Girsanov exponential formula. In [4], our idea was to explore if the Girsanov transformation on jump processes (see Jacod [14]) could be the starting point for the development of another calculus of variations whose purpose would be to study the transition probability laws for Markov jumps processes. An integration by parts formula was proved in [4] using such arguments. In particular it appeared that such a formula could be obtained as the consequence of an infinite number of integration by parts in the LCvy kernel of the considered jump processes. In [4], we applied this technique to study the transition probabilities of a special class of pure jumps processes whose construction was elementary using auxiliary independent increment jump processes. The Malliavin calculus of variations on diffusions is based on certain stochastic differential equations. The Brownian motion model is still important for two reasons:

56

J.-M. BISMUT

a) It is essential to build explicitly the solutions of stochastic differential equations, which can then be submitted to the calculus of variations on the Brownian motion space. b) The necessary estimates are obtained by reference to the Brownian motion model, [12], [19], [21], [28]. Of course, if the Malliavin calculus of variations is applied on the Brownian motion itself, it gives-not unexpectedly-essentially trivial results (for an illuminating discussion of this case, see Williams [36]). As it appears in [4], this is not the case on jump processes, even when the calculus is applied to independent increment jump processes. Moreover although computations are elementary in their principle, the resulting formulas are extremely heavy to manipulate. In this section, we will try to present the calculus of variations on independent increment jump processes in an elementary way, i.e. based on the most elementary aspects of their structure. Once this is done, the reader can at least have an intuition for how to study more complex Markov jump processes, constructed by means of such independent increment processes, as in [4]. Note that we will only briefly address the question of knowing what is the "right" formulation for expressing a Markov jump processes in terms of independent increment jump processes. In a) an integration by parts formula is proved on the probability space of an independent increment jump process. In b) this formula is used to obtain an integration by parts formula on the semi-group of the considered process. In c) the estimates which are necessary to make such a formula valid are derived. In d), the application of such methods to more general processes is briefly considered along the lines of [4]. Finally e) is devoted to some geometrical considerations.

a) Integration by parts on independent increment jump processes Let g(z) be a function defined on R"/{O}with values in R' which has the following properties. a) g is differentiable with a continuous derivative g,. b) g is such that

Note that since g is 2 0, if g(z) = 0, then also gz(z) = 0. For t 2 0, a E R", set

57


D(R") denotes the space of functions defined on R' with values in denotes R" which are right-continuous with left-hand limits. {Fl}ttO ro the canonical filtration of D(R") (here F,= 9(z,Is 5 t)). { S t } lwill eventually be made right-continuous and complete as in Dellacherie-Meyer [8] without further mention. B is the predictable a-field on R' xD(R") [8], [14]. Let z, be the independent increment right-continuous process whose characteristic function is given by (1.2). Let I7 be the probability law of z. on D(R"). If &(o,z) is a function defined on R' X SZ X R"/{O} with values in R such that a) 17, is B 0B"(R"/{O})-measurableand b) for any t 2 0 (1.3) (resp. (1,3')

00

),

then we denote by SsSt2, S:
A> Is]

NT

=

C E"C(dzf(ZT),A$"(SI, . .

A ; ~ ( S , ., Sn-1, AZ,,,

' *

' 3

Sn-1, A Z S , , . . ' 7 Az~n-,, AZ,S,))JSI

., ~ z , , - , z, ) ) g ( z ) d z ~ s .]

Since A(z) has compact support, it is feasible to integrate by parts in the variable z in each of the terms of the sum in the right hand side of (1.1 I), so that (1.11) is equal to

Observe that (1.12) is well defined even if g may be 0 at some points since g(z)dz gives measure 0 to these points. Now clearly (1.12) is equal to (1.13)

-

This equality also holds if NT = 0. Using (1.1 I), (1.13) and integrating in the variables S,, . . ., S,, . . ., we get

Now clearly since A(z) has compact support (1.15)

so that

(1.16) (1.5) is proved when (1.6) is verified. Let p be a C” function defined on R with values in [0, I] which is equal to 1 for I t ] > 1, and to 0 for It]< 1/2. Let z:, z:I be the independent increment processes associated to the LBvy measures p(lz)/c)g(z)dz and (I - p(izI/E))g(z)dz, and let 17” be their corresponding probability laws. On (D(R”)xD(R”),17’X17”), the law of the process z, given by

n’, 2,

= 2;

+

2;

is exactly 17. Now by (l.l), we have (1.17) Moreover 17/63li”’ as., z’ and z” do not have the same jump times. Using (1.17), and reasoning on z’ as previously for each fixed z”, it is not hard to obtain EU‘@lI‘’

[(dzf(zT), SsO,(1.5) still holds. Proof. (1.5) still makes sense. In fact, if (1.19) holds, then, if < 2(z), and so using (1.1) and the boundedness of A, we

1z1;I 1, [ Ag(o,z) I

1

see that IA8(w,z)lg(z)dz is uniformly bounded. Similarly if (1.19f) holds, by using (1.1) and Schwarz's inequality, [ A8(o,z)]g(z)dz is still bounded, and so SSsTAexists and is integrable. Similarly, if (1.21) (resp. (1.21') holds, S:,T(div,g(z)A(z)/g(z)) is a martingale (resp. a square-integrable martingale). We now prove (1.5). Assume M > 0 exists such that if IzI 2 My A(z) = 0. Take p as in the proof of Theorem 1.1. For 0 < E < My set

J

(1.22)

(1.5) applies for As. Now we make E + 0. Clearly

J.-M. BISMUT

62

and so

S,,,A"

(1.24)

+ SsSTA

L,(n).

in

Assume that (1.19) and (1.21) hold. Then E"

S

J:

ds Idiv,gAc - div,gA/dz

(1.25)

I E " r 0d s S Izl M, A(z) = 0.

Remark 2. In general, if (1.19') and (1.21') hold, the equality (1.28) does not make sense any more, since the sum S,,,(div,gd/g) is not well A(z) exists. defined. Also observe that since A , is bounded, A(0) = limE40 If (1.19) or (1.19') hold,

J

IA(z)\g(z)dz is bounded.

J g(z)dz = +

CQ

So if

,

it is clear that A(0)=0. Moreover if grad,gA/g is bounded, (1.21) implies (1.219. Observe that by (1. l), we may take in (1.19') 1(z) = z

In this case the constraint (1.20) reads (1.32) This is no accident, since (1.32) shows that the perturbation S,vlTAcan at most be of the order of the quadratic variation of the process z,.

Remark 3. If n = 1, it is easy to prove that (1.33)

A = lim (2 - 7)p.p.

2

rTt2

~

dz IzI'+r

in

g'(~)

Let u, be a bounded predictable process. Let z E R -+ Y(Z)be a nonnegative C" function with compact support such that for Iz] 2 1, Y(Z) = z2. For 0 < Y < 2, set g,(z) =

-7 ~2(zI'+r - - - .

J.-M. BISMUT

64

Let 1 7 r be the probability law of the process z. associated to gr. Set

As(a,z ) = usY ( Z ) . If 2(z) = C[zl, (1.19') holds, and moreover for C large enough (1.20) holds. It is trivial to check that (1.21') also holds. For a n y f e C:(R), we have

Now make Y t t 2 in (1.34). Using (1.33), it may be proved that {IP} is tight on 9 ( R ) (endowed with the Skorokhod topology) and moreover that as r +. 2, li'r converges to the Brownian measure P on %'(R). We will take a formal limit in (1.34) without too many justifications. Observe that if for 0 2 s 2 T, I Az,]

1, then SssTu,~ ( d z , = )

JOr

u, d[z, z],.

Now

as Y + 2, the jumps of IIr become "smaller and smaller". The quadratic variation of z, for P being equal to t, as Y t t 2 , the first

[

term in (1.5) "tends" to E P f ' ( z J have

(&)'(z)zI+r

sr

u ds].

= (1 -

Moreover for Iz/ I 1, we

Y)z ,

so that S;sTu , [ ~ Y / z ~ + ~ ] ' (isz )the z ~ +compensated ~ sum of jumps, which, when I LIZ, I 5 1, are exactly usLIZ,( 1 - 7). It is then reasonable to expect that as l'tf2, the limit of Sg,,u,[~/z'+r]'(z)z'+rwill be the It6 integral -

uSz. So as Yf72, (1.34) becomes formally

It is gratifying that (1.35) holds, and this is shown by a rigorous argument given in [3],[9]. (1.5) then appears to be the natural extension of integration by parts on elementary functionals of Brownian motion to jump processes.

b) Integration by parts on the semi-group of an independent increment process We will now extend Theorem 1.2 in order to obtain a formula of integration by parts on the semi-group of z,, i.e. we will prove that for adequately chosen T > 0 we have that for f E C;(R"),and 1 5 i I n

65

Jump Processes and Boundary Processcs

(1.36)

;:[

E" - ( z T )

1+

E U [ f ( z T ) D $= ] 0

As previously pointed out, formula (1.36) itself is not very interesting, since in this case, we know the characteristic function +T(ac) explicitly. However the method to obtain (1.36) is closely related to our work [4], where the calculus of variations is still performed on independent increment jump processes, but the considered functionals are much more complicated. R(z) is a measurable function R"/{O} R + such that ---f

(1.37)

J

R2(z)g(z)dz< 12151

+

03

.

v(z) is a function defined on R"/{O}with values in R+ having the following

properties. a) Y ( Z ) is bounded and differentiable, and has a bounded differential VZ

.

b) C > 0 exists such that if Iz/ 5 1

(1.38)

YlZl

I CR(z)lzl.

c) The following inequality holds (1.39) Choose p as in the proof of Theorem 1.1, and for (1.40)

p,(t> = p ( t M

7jr

> 0 set

'

We now have

Theorem 1.3. For any T > 0, 7jr > 0 , f ~C;(Rn),and i, (1 5 i 5 n),

Proof. (1.41) makes sense. First note that p,(S,,,v) and ~ ' ( S , , , V ) so that p q ( s 8 5 T v ) / s 8 < T Y, p , ( s 8 < T Y ) / ( s S < T ')', p : ( s s 5 T v ) / S , < T are ~ bounded. Since av/az, is bounded, S,,,(av/az,)v exists and is integrable for the same reason as SsSTuin the proof of Theorem 1.2.

are if S s < T Y 5

66

J.-M. BISMUT

To prove (1.41), the easiest way is to go back to the assumptions in the proof of Theorem 1.1. Namely, we assume that E, M, 0 < E < M, exist such that if Iz/ 5 E , or Iz] 2 M , Y ( Z ) = 0. We also temporarily assume that (1.6) holds. Using the notations in the proof of Theorem 1.1, we have if NT # 0,

Set for n (1.43)

< NT NT

K, =

C Y(Az~,,,),

Z, =

zT - Az,,

I

n'#n n'=1

(1.42) is then equal to

(1.44)

If N , = 0, pll(S,,,v) = 0, so that equality between (1.42) and (1.44) still holds (if NT = 0, Cy.. . is taken to be 0). By integrating (1.44) in all variables, we find easily that (1.41) holds. Assumption (1.6) is released by using the same argument as we used in the proof of Theorem 1.1. The support condition on Y is released as in the proofs of Theorems 1.1 and 1.2, using in particular (1.37), (1.38) and (1.39). We now make 7 -+ 0 in (1.41). To obtain (1.36), we need that (1.45)

P,(S,,T

Y)

+

and so we need that SssTv

> 0 as.

(1.46)

J

as.

1

A necessary condition is that

g(z)dz =

+ . 00

Jiinip

67

Processes and Bouridary Processes

(for ( I .36) to hold (1.46) is needed since otherwise a Dirac mass is left in the law of z.!). From Theorem 1.3, we get Theorem 1.4. If T > 0 and u are such that I/S,,,u is in a given L,(I7) with p > 2, then for any f ' C;(R"), ~ (1.41) still holds with pq replaced by 1. For any k E R', t 2 kT, Ictlk$,(ct) is a bounded function. For any l? E N , t > (l? n)T, the probability law of z , is given by ql(y)dy, where q,( E Ci(R"). In particular, if for any T > 0, l/SIIsTu is in a given L,(n) with p > 2, then for any t > 0, the law of z , is given by qt( .)dy, where q,( .) E C;(R").

+

a )

Proof. We make 7 -+ 0 in (1.41). Clearly (I .46) holds so that the first term in the left hand side of (1.41) is taken care of. It is not hard to prove, using (1.38) and the boundedness of (au/az,)u that S,sT(av/az,)~is in all the L,(I7) (1 < q < co). S,s.(au/az,)u/(S,..u)2 is then in L,(Z7). Moreover

+

For t 2 7, pi(t) = 0, so that if t

> 0, p:(t) + 0 as 71 +0.

Clearly

(1.47)

Finally S,C,T(a(gu)/az,)/gis in L2(17). Using (1.47) we see that the dominated convergence theorem applies in the three last terms of the left hand side of (1.41). Using (1.41) with f ( z ) = e-i(".z), we see that for any i ( l 2 i I n), Ia'I+.(a) is a bounded function. Since +,(a)= [+T(c~)]"~, the result on function $, follows. The results on the law of z , are then standard results on Fourier transform. Remark 4.

s

+

A result of Tucker [31] states that if g(z)dz = 03, for any t > 0, the law of z, has a density with respect to the Lebesgue measure. Although the proof in [31] is probabilistic, this result admits an easy analytic proof by differentiating qt(cx). Remark 5. We use the same assumptions and notations as in Remark In particular, assume temporarily that n = 1. As we shall see later (in Theorem 1.6) if u is C" with compact support and is such that u(z) = z2 for IzI 1, then for every 'i < 2, the conditions of Theorem 1.4 are 3.

J.-M. BISMUT

68

verified. Now when all the jumps of z are in size 5 1, then S s a T u = [z, zIT. As TT72, it is reasonable to replace SsaTu by T. Moreover (av/az)u = 2z3. The 3-variation of z for the Brownian measure P is 0. so for P, we may formaZZy cancel SZsT*u. aZ

"

For Iz]< 1, -(g,u)

g, = (I - T)z.

aZ

to become - z,. Taking the

As 7772, we may expect S:,,

formal limit in (1.41) (with ps = l ) , we get that for T > 0, f~ C;(R),

[;

(1.48)

EP

~

(z,)

1

[

- EP f(zT)-

ITTI=O

(1.48) is of course trivially true.

Remark. 6 . As should be expected, formula (1.41) with ps replaced by 1 can be directly obtained by non trivial manipulations on the characteristic functions (see [4]). c) Some estimates on independent increment processes We now are left to find sufficient condition under which the assumptions of Theorem 1.4 are verified. As we shall see, the effect of the calculus of variations is to transfer an estimate on a Fourier transform to an estimate on a Laplace transform. Let 7, be the non-negative measure on 10, w[ which is the image measure of g(z)dz by the mapping z -+ ~ ( z ) .Since [ vg(z)dz < 03, we

+

s

J

+ +

+

have xdp.(x) < co. More generally, let m be a non-negative o-finite measure on 10, w[, such that x A l dm(x) < 03. For /3 2 0, (Y E

R, set (1.49)

ir,(B)

=

s

s

- I)dm(x) , M,((Y)

(e-j'

+

=

s

(cos ax - l)dm(x),

Clearly, for p 2 1 1 (1.51)

E"[l-sz

For (1.51) to be finite for at least one p

> 2, it suffices that E > 0,

69


C 2 0 exist such that for ,~9E R’

(1.52) or equivalently, that for j3 large enough

(1.53)

Tr,LP) 5 - (2

+ €1Log P . +

We will now sufficient give conditions on a measure m on 10, m[ so that T,@) behaves adequately as P -+ + co. The following is proved in [4]. Theorem 1.5. The following two conditions are equivalent a) A s x + O

m(]x, b) A S P + +

1 + co[) - C Log . X

m

r,(P)

-

-

c Log P .

A suficient condition for a)-b) to hold is that

+

- - CLogjaj.

C) AS -+ 00, Mm(a) Either of the two conditions d) l hmlx 2t-4 5-0 log l/x

=

C, or

implies f) lim

P-+-

~

T,(P)

LogP

2 - c.

Proof. The proof relies on standard Abelian and Tauberian techniques. For the full proof, see [4]. Corollary. Assume that D (0

< D 2 + m), exists such that

(1.54)

Then for T > 210, l/S,,,u belongs to one given L,(I7) with p

> 2.

Proof. This is obvious using (1.54) and the implication d) Theorem 1.5.

+ f) in

J.-M. BISMUT

70

Remark 7. In general the function R appearing in Section 1 b) is a bounded function, so that for C > 0 v(z) 5 Clzl. If (1.54) holds, then also

If n = 1, it is crucial to remark in Theorem 1.5 that in general a) or b) do not imply c), and d) or f) do not imply e). Otherwise using (1.551, we would find that

I d

(1.56)

-

j (cos ax

20

L o g LI

I.l-+m

and so for E

- l)g(x)dx

> 0, as a -+ + co

(1.57)

which would make the conclusions of Theorem 1.4 trivially true without any calculus of variations! The counter example is as follows. Let m be given by

'Ix.

A s x + O , m ] x , +co[--

Log 2

Moreover for k E N

c(1 cos 211.2X-n)= C (I cos 2n2"-") c(1 - cos 2n2-") Mm(27C), m

- Mm(2n2k)=

n

-

-

n=k+1

+m

=

= -

1

+

and so lM,(a)I does not tend to +co as la1 + 00. In this case, the probability law whose Fourier transform is

is singular with respect to the Lebesgue measure. So if n = 1, at the critical logarithmic concentration, the calculus of variations gives non trivial results for independent increment jump processes. In a private communication, Prof. H. Delange has shown us how to construct a function g 2 0 and C" on 10, co[ such that as x + 0

+


71

(1.58) and that if

M(a) =

(1.59)

so +"

(cos a x - l)g(x)dx ,

IM(a)I has an arbitrarily slow growth at infinity. It is then clear that even when g is C", (1.58) is not enough to imply a logarithmic behavior of (1.59) as la1 --f 00. Note also the following result of [4].

+

Theorem 1.6. Let Y be such that 0 < i' < 1. Then the following conditions are equivalent: a) As x -+ 0+, mflx, m[) (3x7. - CT(1 - Y)(sinlr/2(1 - Y ) ) l a l r . b) As IayI + + m , M,(a) C) AS p -+ m, ~,(p) - c r ( i - r)p.

+

-

+

-

-

If the equivalent conditions of Theorem 1.6 are verified, then for

T>O

(s

+=(a)= exp T (e-"""

-

I)dm(x))

is such that for any n E N , lal"+r(a)is bounded. So qT(a)is the Fourier transform of a probability measure which has C" density with respect to the Lebesgue measure. In this case, it is the concentration of m which determines the regularity of the corresponding probability. d) The calculus of variations on general jump processes As we have already said, the previous method is not devised to be applied to independent increment jumps processes, but can be used on processes which are constructed by using independent increment jump processes as an instrument, in the same way as continuous diffusions are constructed using Brownian motion, which is plugged into a stochastic differential equation. I n [4],we have treated the case where the process x, with values in R" is given by the solution of

(1.60)

s, = x,

+ +!

X,(x,)ds

+ vt + . . + Y!

where y', . . .,yq are (mutually independent) independent increment jump processes, where the probability law is modified by using the Girsanov transformation on jump processes [14]. To treat this case, the previously developed calculus of variations

72

J.-M. BISMUT

must be applied to functionals of y’, . . ., y* which are much more complex than those which we previously considered. Namely, functionals of process x, which involve jump martingales constructed by means of x , must be submitted to the calculus of variations. In the same way as in the calculus of variations on diffusions, a random flow vt : x, -+ x , must be considered and lifted to various bundles, and these lifts must also be submitted to the calculus. In [4], we have treated the case where the jump process is “elliptic”, i.e. the support of the jump measures for arbitrarily small jumps spans the whole space R”. Let us note that in the proof of Theorem 1.4, as soon as we are able to control the differential of order 1 of the law of z and so prove that lal+T(a)is bounded, the boundedness of [a]”+,(a) as long as t 2 nT is obvious. For the case of the process x given by (1.60), no such argument exists a priori. However, it is possible to mimic the previous argument by using a step by step integration by parts procedure, i.e. to make a variation of the processes y’, . . -,y 4 first on [0, TI, then on [T, 2T], [2T, 327, . . ., [(n - 1)T, nT], so that at each step, a control is obtained for the differentials of higher and higher order of the law of x n T . This procedure is fully developped in [4], and avoids the iteration of the calculus on the same interval [0, TI which would require: more differentiability on the Ltvy kernels. frightening computations. Note that this procedure can also be applied to ordinary diffusions whose generator is everywhere hypoelliptic, and can be localized using the localization procedure of Stroock [26]. e) Some geometrical considerations The structure of equation (1.60) is not completely satisfactory. In fact it makes full use of the vector space structure of R” since the various jumps are “added” to each other. In particular, there is no interplay between the jumps of y l , . .,y 4 which would be similar to the interaction of vector fields by the bracketing in ordinary continuous diffusions. A natural idea would be to replace (1.60) by more general stochastic differential equations with jumps (see Jacod [ 141). However technical difficulties do arise, essentially because contrary to (O.l), such general equations do not define flows of diffeomorphisms i.e. trajectories starting from different points may collide. As will appear on an example later, it seems that for general jump processes, the Ltvy kernel gives an analytically useless description of the process (except in the case of independent increment processes). Namely, it is very hard to describe explicitly how does a LCvy kernel M(x, dy) vary with x.

73


Although we now know how to describe in a geometrically invariant way a much larger class of jump processes, we will concentrate on the boundary processes of certain continuous diffusions, where, hopefully, our point will clearly appear.

8 2.

The calculus of boundary processes

In this section we report on some results which we have obtained in our forthcoming paper [6] on the calculus of boundary processes. Assume that z is a reflecting Brownian motion on [0, m[ ([13], p. 40, [12], p. 119), L its standard local time at 0, w = (w’, -,w m ) a Brownian motion independent of z. Consider the stochastic differential equation in Stratonovitch form

+

+ cXi(X, z) .dwt + D(x)dL m

dx (2.1)

= Xo(x, 2)dt

1

x(0) = x,

where X,, . . ., X,, D are smooth vector fields. A drift b(x, z) is introduced on z using a Girsanov transformation. If A , is the right-continuous inverse of L, we study in [6] the transition probabilities of the Markov process ( A , , x A J . Of course ( A t , xAt)is a jump process, whose jumps correspond to the excursions of z out of 0. In [6], we use the fact that the LCvy kernel of the jump process is itself the image of the excursion measure of (z, w ) corresponding to the excursions of z out of 0 through the solutions of a stochastic differential equation. We will essentially focus on some aspects of our work [6], and insist on some possible connections with other recent developments in probability. In a) the main notations and assumptions are given. In b) a stochastic flow is associated to the considered stochastic differential equation. In c) the Girsanov transformation is briefly introduced. In d) the boundary process is defined. In e) a partial calculus of variations on w is presented along the lines of Bismut-Michel [7]. The key quadratic form C y is introduced as in [20], [21]. In f) some simple considerations relating the calculus of variations to the method of enlargement of filtrations (see Jeulin [15], Jeulin-Yor [16], [17], Yor [38]) are developed. In particular the “non differentiability” of local time L with respect to any natural differential structure on the space C(R+;R’)associated to z forces us in [4] to use the calculus of variations on jump processes to study the component A , in ( A t , XJ. This is briefly done in g ) . In h), the key problem of the a s . invertibility of C z is studied. As in [20], we know a priori from g) that if this is the case, the law of ( A t , x A Chas ) a density

J.-M. BISMUT

74

with respect to the Lebesgue measure. Sufficient conditions under which this is the case are proved. The consequences are interesting since they ) the vector field D show that the LCvy kernel of the process ( A t , x A C and may interact through some sort of Lie bracketing which is precisely expressed through true Lie brackets of (Xo,X,, . . ., X,, d/az, 0). In i), j ) the so called ''localizable'' and "non localizable" cases for regularity of the transitions probabilities for the boundary semi-group are considered. In particular, in the non localizable case, conditions are given under which the boundary semi-group is slowly regularizing. In k), z is now a standard Brownian motion, and x is driven by the vector fields (Xo,XI,. ., X,) when z > 0, by (XA, X : , ., X;) when z < 0, and D for z = 0. Existence of densities for the transitions probabilities of the boundary process is proved under conditions which still exhibit interactions between all the considered vector fields. In l), regularity results for the two-sided case are briefly presented.

-

Assumptions and notations m is a positive integer. Q(resp. Q') is the space C ( R + ;R") (resp. C ( R +; R + ) ) . The trajectory of o E 8 (resp. o' E Q') is written as w, = (wt,. . -,wp) (resp. z). The o-field 9, (resp. 9;) in Q (resp. 8') is de= 9?(z,Is < t ) ) . is the space fined by .F,= 9?(w,Is I t ) (resp. 9; D x 8',endowed with the filtration {.F,}trO, where $, = .Ft09;. All filtrations will be made right-continuous and complete as in Dellacherie-Meyer [8], without further mention. P is the Brownian measure on 8, such that P[wo= 01 = 1. For z E R + , PI is the probability measure on Q' associated to the reflecting Brownian motion on [O, co[, starting at z, i.e. P;[zo= z] = 1. For notational convenience we will write P' instead of PL. On (Q', PL),L, denotes the local time at 0 of z. By [12], p. 120, we know that B, = z, - z - L, is a Brownian martingale with Bo = 0. We also know that if zo = 0 a)

+

A, is the right continuous inverse of L, i.e. A,

(2.3)

=

inf{A 2 0; L,

>t } .

d is a positive integer. Y = ( x , z) is the standard element in Rd+',with x E Rd and z E R. Rd will be identified to Rd x (0). Xo(x,z), . -, X,(x, z ) are m 1 vectors fields defined on Rd+' with values in Rd, whose components are in C;(R'+'). D(x) is a vector field defined on Rd with values in Rd whose components are in C:(Rd). b(x, z) is a function

+

-

75


defined on Rd+',with values in R, which is in C;(Rd+'). If X , is a continuous semi-martingale, dX denotes its differential in the sense of Stratonovitch, and 6X its differential in the sence of It6 [22]. If h is a C" diffeomorphism of Rd onto R d , and if K ( x ) is a tensor field on Rd, (h*-'K)(x) denotes the pull-back of K(h(x)) to x through the differential ah/ax(x) (see [l]). If Y(x) is a vector field, we see that

.

( h * - ' Y ) ( x ) = [?]-'Y(h(x))

b) The reflecting process and its flow We now build a reflected process as in Ikeda-Watanabe [12], p. 203. Take (x,, z,). E Rd X R'. On P 0Pi,) consider the stochastic differential equation

(a,

dx = X,(X, z)dt x(0) = x,

(2.4)

(the summation sign form of (2.4) is

Cy=lis systematically omitted).

X,(X, Z )

(2.5)

+ D(x).d L + Xi(x, 2).dwt

I ax. + --'X 0, and any p 2 1, I A t S T B ris in Lp(P(8 P'). b) for any f E C;(R X Rd),i f x , is the process y8(u,x,), then

+

(2.20) Proof. For the complete proof, see [6]. The partial calculus of variations is very close to Bismut-Michel [7]. Remark 6. Because of Remark 5, it is essential that f has compact support. However, by using (2.18), this assumption may be released, so that smoothness of the law of xAcwill follow (the component A , is no longer needed). Theorem 2.9. Assume that t' > 0 is such that a) for every x E Rd,C;t, is P 0P' a.s. invertible. b) for every T 2 0, there is q > 2 such that for any x E Rd,

is in L,(D, P 0P') with a norm which is bounded independently of x E Rd. Then for any x, E Rd,any multiindex m, any t 2 [mit', on (B,P (8 P') there exists a random variable 0 7 having the following properties: a) For any T 2 0, IA,t

By the zero-one law, we know that P 0P' as., V,t is a fixed space, not depending on a. Let us assume that V; # T,,(Rd). If S is the {gc}tro stopping time (2.38)

S

=

inf{t

> 0; V, f

V,t}.

then S is > 0 a.s.. Let f be a non-zero element in T,*,(Rd)orthogonal to V;. Then

(2.39)

(f, (~~-*X~)(xo)) =0 for

Now from [11-Theorem IV. 1.1, we know that

s

<S.

J.-M. BISMUT

86

or equivalently

Now (2.41) gives the Its-Meyer decomposition of the { g t } t r semi-martino gale (f, p,*-'X,), which is 0 for s 4 S. By canceling the martingale terms, we find that for s 5 S

J: (f, p:-'[XJ, Xt1)6w' (2.42)

so'(f,

1

=09

< i, j < m ,

X*])6B = 0 .

p:-1[$

An elementary reasoning on the quadratic variation of the local martingales (2.42) and the continuity of the processes p:-'[X,, X,], p:-'[a/az, Xi] (see [3]-Theorem 5.2) show that P 0P t a.s., for s 5 S 1

(f, p:-lPj, X i ] ) = 0 , (f, v:-'[T$ a Xi])= 0 .

(2.43)

< i, j < m

Reapplying (2.41) on (2.42), we find that for s (f, p,*-"X,, [X,, Xill)

(2.44)

(f, P:-"az,

a

=

0

9

S

9

a [a ,, Xi]])= 0 .

We now cancel the bounded variation process in the Meyer decomposition of (f, p:-'Xi) (s < S), i.e. using (2.41)-(2.43), we get for s 4 S (2.45) Since P

[

(f,p:-l[X0, xild.>

+ ( L j:p:-'[D,

0P' a s., the support of the measure d L is exactly the closed set


87

(z, = 0) which is negligible for the Lebesgue measure ([13], p. 44), from (2.45) we deduce that for s S

By a result of Yamada (see Ikeda-Watanabe [12],p. 168), we know that (2.69) has a unique strong solution. Using (2.68) and the comparison theorem of Yamada [37](Ikeda-Watanabe [12],p. 352), we find that p: 2 r:

(2.70)

for

t

0, the Hausdorff-Besicovitch dimension of (r, = 0, 0 < t E ) is the constant 1 - d/2. Using this result with d = 1, we find that a.s. the dimension of (z, = 0, 0 5 t < S ) is 1/2. Because of (2.72), it is clear that a.s. the dimension of (zdt= 0, t < 7,) is 1/2. Now (2.69) shows that for 0 5 t I Y,, r: is the square of a Bes (1 C) process and moreover 0 < C < 1. We then know that as.,

+


91

the dimension of ( r : = 0,O 5 t < 7,) is (1 - C)/2 < 1/2. This is a contradiction to (2.71). So we find that M, = 0. Assume that the second line in (2.51) does not hold. By using the stopping time optional selection Theorem [8]-IV-84, we can find a {.F;t}t20 T such that (P6 I")(T< co) > 0, and moreover if T < 03, then T < S , Z, = 0, MT f 0. Now using the strong Markov property of (w,z), the whole reasoning can be restarted after time T (instead than after time 0); we still arrive at a contradiction. The second line in (2.51) holds. By iterating the whole procedure on (2.51) we find in particular that f is orthogonal to U:"(E,UFJ(x,, O)(by taking s = 0 in (2.51)). By the assumption in the Theoremfis 0. This is a contradiction to S > 0.

+

+

Remark 9. Instead of using the dimension properties of the set of zeros of a Bes2(d) process rt (0 < d < 2) starting at 0, we could as well use the fact (It6-McKean [13], p. 226) that if Ld is the local time of r at 0, and Ad is its inverse, then Ad is a stable process with exponent 1 - d/2, so that its characteristic measure will be proportional to I,to(dx/x2-d/2)). By proceeding as in It6-McKean [13], p. 43, this shows that if N f ( t )is the total length of the intervals in (r > 0) fl[0, t ] whose length is 5 L , then a.s., for any t > 0, as E J 0 (2.75)

Nf(t)

- Cd&d'2Ld(t)

where Cd is > 0. Using (2.75), a contradiction is easily obtained from the assumption M, # 0 in the proof of Theorem 2.14.

Remark 10. Theorem 2.14 exhibits clearly that the excursions of the process (x, z) out of z = 0 can interact with the process when it stays on z = 0 through D, so that the probability law of ( A t , xAt)has densities, although the LCvy kernel of this process may be degenerate. i) Regularity of the boundary semi-group: the localizable case W e h o w give sufficient conditions under which the assumptions of Theorem 2.10 are verified. Definition 2.15. For d

(2.76)

k'(x, z) =

E

N , the function ke(x,z ) is defined by

C (f, Y(x,2))'.

inf

f€R~,llfll=lj = 1 T E E /

We then have Theorem 2.16. I f x ,

(2.77)

E

Rd is such that for a given 8 E N , 0 > 0

lim zLog inf kC(x,z) 2>O,z-O

15--101

0, T

2 0, IAtsTI[C2;]-'} is in all the L,(PO P') (1 0; z,

(2.78)

=

7')

then (2.79) j) Regularity of the boundary semi-group :the non-localizable case We now give conditions under which the assumptions of Theorem 2.1 1 are verified. For 4 E N , define me = 6 x 20E-'.

Theorem 2.17. Assume that for a given 4 such that

E

N, there exists C

>0

lim z Log inf ke(x,z ) = - C .

(2.80)

z>o,r-o

xERd

For any t > 1 6 n m , C , T 2 0, there exists q > 2 such that for any xoE Rd,IAILTIIC$]-'lis in L,(P P') with a norm bounded independently of xo. For any t 0, the law of ( A , , xAt)is given by p,(a, y)dady andp,(a, y ) is such that a) it is C" on 10, w [ X Rd, b) if t > ( k d 2) 1 6 f l m , C , p,(a, y ) E C * ( R X Rd).

>

+ + +

Proof. See [6]. Remark 11. The condition (2.77) is a local one, while (2.80) is a global condition, which justifies the terminology which we have used. Moreover, in [6],we show how instead of assuming that (2.77) is verified on a neighborhood of the starting point xo, it may be verified on a neighborhood of the final point y as well (for such a problem on standard diffusions see Stroock [26]). Remark 12. The conditions of Theorem 2.17 give exactly the analytical conditions under which the boundary semi-group is slowly regularizing in the sense of [4](also see Theorem 1.4). Moreover it is shown in [6] that under conditions like (2.80), the generalized symbol of the


93

generator of the boundary process exhibits a logarithmic behavior (see [6], Section 6, Remark 5). The conditions of Theorem 2.17 are minimal. In fact consider the stochastic differential equation

dx

=

exp{

(2.81)

- -Id$. 1 22,

x(0) = 0 .

Conditionally on z, the law of x A tis clearly a centered Gaussian whose variance is (2.82) Now, if n+ and u are defined as in Section 2, g),

(2.83)

sup 2, 2 ___ I c + n+[O<s<s Logl l / a l =

c + Log l / a

(the last equality in (2.83) is classical [12]). From (2.83), we see that

and so

Since (2.86) we see from (2.85) that for t small enough, (2.86) is of xdcunder P 0P" is h,(x)dx, where (2.87)

+ co.

Now the law

J.-M. BISMUT

91

+

We then find that for t small enough, h,(O) = 03. h, is not even continuous at 0. It should be pointed out that this result has nothing to do with the fact that we are considering xAt instead of ( A t , x A J , since differentiation in x as in (2.7) is irrelevant here.

k) Two sided boundary processes: existence of a density Let Xi(x, z), . . ., X’,(x, z) be another family of vector fields having the same properties as X,,. X,. z now denotes a standard (i.e. non reflecting) Brownian motion. The corresponding probability space is still written as Q’ and the probability law of z’ is written as P:,. L,is the local time at 0 of IzI (i.e. L is twice the standard local time at 0 of z), so that a ,

(2.88) z:,

lz,l

=

lzol

+ Stsgnz,az, + L , .

z; are defined by z;

= 2,

v0,

z; = z, A 0 .

A, is still the inverse of L as in (2.3). Consider the stochastic differential equation dx = I,>,[X,(X, z)dt

(2.89)

+ X,(x, z) dwi] *

+ I,,,[XXx, z)dt +

x(0) = x,

XKX,

z).dwil

.

+ D(x)dL

3

A flow vt(a,x,) can be associated to (2.89) in the same way as in Theorem 2.1. If b’(x, z) is a function having the same properties as b(x7z), the Girsanov exponential is changed into n/r, = exp[Jl (2.90)

IZ*>,(NXY,

I z,)6z, - 2 bZ(X,,ZJdS

11

1 + J: I*s~((v~-1~~)(x~, zY), $. J: ‘3,

(xo,z

~)(vb*-l~t)

r2 0, C:50 is invertible.

Proof. U, is the vector space spanned by Iz,,o(y~~’Xi)(xo) (I t

We then proceed as in the proof of Theorem 2.14. Namely assume that Vo+(which is a non random vector space) is # T,,(Rd). Then if S is the stopping time

S = inf{t

(2.94)

> 0; Vt f

Vo+}.

S is positive a s . Let f be a non-zero element of Tz0(Rd)orthogonal to Vl. Then (2.95)

(A

=

0

(L ( v : - ~ ~ x ~=D0

on (zU> 0 )n 10, SI, on (zEl < 0) n [o, SI .

Using the optional selection Theorem [8]-IV-84, it is easily proved that > 0) and (z < 0). (2.95) can be replaced by (z = 0) is included in both closures of (z

(2.96)

(L ( v : - ~ ~ x ~=D0 ( A (P,*-~X:)(X~)> =0

From (2.95), we find that

w7

on (zU 2 0) n S [ , on (z, I 0 ) 10, s[

n

.

J.-M. BISMUT

96

From It6-Tanaka's formula, we know that

(2.98)

(there is no integral

s:

. . - d L because the support of d L is ( z = 0), and of

(2.96)). From (2.97), we find easily that for 1 < j 5 m

( A P:-i[x,7 (2.99)

xi])= 0

on

4)

( L c p ~-I[$, *

0

=

> o m 10, SI , on (z, > 0) n LO, SI . (2,

By iteration, using (2.99) again as in (2.44), and reasoning as in (2.45), we find that for 0 j m.

0, the sum is (f, &lH) .

if t

If z, = 0, and if S is a left cluster point of ( z = 0), the sum is still 0, and moreover by (2.103), (f, y$-'H) = 0. If z, = 0 and z is positive on a left neighborhood of S, the sum is (f, & - l H ) , and G, = 1. If z, = 0 and if z is negative on a left neighborhood of S, both sides of (2.107) are

J.-M. BISMUT

98

0. Finally if z, < 0, the left hand side of (2.107) is 0 and G, = 0. Let E,rand E: be the processes Et =

1:

Iz>o((f,P:-'(Wo,

HI

+ $[4,[XI3H11)))d.

respectively. By using the first line in (2.101) as well as (2.107), we find that for any t 2 0, a s . (2.109)

EL,,

=

G,,s(f, P ? / . H ) *

Now the process G,,,(f, cpfiiH) is continuous. This is clear if t < S, by using (2.103). If S is a left cluster point of (z = 0), (f, pZ-lX,) = 0 and continuity at S still holds, while if S is isolated on the left from (z = 0), G will be continuous at S. From (2.109), we find that a.s.

Et

(2.1 10)

=

G , ( f , p?-'H)

on [0,S ] .

Similarly (2.111)

E: = G : ( f ; pf-'H)

We claim that for t (2.1 12)

on

[0, S ] .

S

(G,

+ G X f , lo?-'H) = (f; &'W

*

We only need to prove (2.1 12) if z, = 0. If t < S, (f, y$-lH) = 0 and (2.112) is true. If t = S, and S is a cluster point on the left of (z = 0), the same reasoning applies. If S is not a cluster point on the GL = 1, and (2.112) still holds. From (2.110)left of (z = 0), G, (2.112), we see that

+

(2.113)

(f, g ~ ? - l H )= E, + E:

on [0, S ]

.

Comparing with (2.109, we find that (2.1 14)

(f, Jl p Y [ D , H ] d L

=0

on [0,S]


99

so that

(2.1 15)

on (z=O)n[o,s[.

(f,y;-'[D,H])=O

The first line in (2.104) has been proved. Now from (2.103)-(2.110) it is clear that (2.1 16)

E,

=

0

on (z, = 0) fl[0, S [

.

Set ps = 2

fils =

+ E:,

2 (f,y:-"x,, j=1

Hl(x0

o))2.

From (2.1 16), we see that (2.1 18) c,,

lit

on

p, = 0

(z, = 0)

n [0, S[.

have been defined in (2.102). Set

We know that z is a reflecting Brownian motion, and that (w', . ., wm,B ) is a {g.,}t20 Brownian martingale. Moreover cs is a {,F.t},20 stopping time. Using (2.118), we have

Pt

on

=0

(2, =

n

0) [O, cs[ .

Moreover using standard results on semi-martingales we know that E,, and hence p t is a continuous process. Using (2.108), we find that (2.120)

E,

=

Jl ES

ds

+ J: (f, yz-yx,, H ] ) 6 w j .

Assume that Mo # 0. By renormalizing f, we may assume that Mo = 1. Let g be a positive constant such that g < 1/4. Since Eo = 0, we may suppose that S > 0 has been chosen to be small enough so that ifs<S

J.-M.

100

so that if t (2.122)

BiSMUT

< cs 1-T71 0. Remark 13. Since under the assumptions of Theorem 2.19, the boundary semi-group has densities, we see that unexpected interaction may occur at the boundary between the two sides (z > 0 and z < 0) of the process (x,, z,) as well as with the vector field D. This is a clear example that Levy kernels do interact. The two sided boundary process: regularity of the semi-group The assumptions to get regularity for the boundary semi-group are in general much stronger than for one-sided processes. The following counter example is developed in [6]. Consider first the stochastic differential equations 1)

(2.124)

dx

=

I,,,dwi ,

~ ( 0=) x ,

which can be put in the equivalent form (2.125)

dx = Z,,,dwt, dh = dt ,

~ ( 0 )= 0

h(0) = 0 .

The calculus of variations applies to the component x A t in (2.125). In fact conditionally on z, the law of x g e is a centered Gaussian, whose variance is CAIhTwhere C , = s. By [13], p. 26, the law of A , is


101

One finds that l / d G is in all L,, and the law of xAcis proved to be smooth. Consider now the system (2.126)

dx = Iz>O I h < T dwt dh = I,>,dt I,,,dt, 9

+

x(0) = x, h(0) = 0 .

(2.126) appears as a two-sided perturbation of (2.125). We claim that now the law of xAt is not smooth. In fact conditionally on z, the law of xAtis a centered Gaussian whose variance is C(PtAT, where (2.127) Clearly, C;,,,

C: =

< C&.

(2.128)

s:

I,,,dS

By LCvy’s Arcsine law ([I31 p. 57), the law of C$ is IO<s 0). In the two-sided case there is in general no “localizable” result, except when both sides are equally regularizing. We prove however a regularity result in [6]. Recall that the families of vector fields E,, E: have been defined in Definition 2.18. Definition 2.20.

For 4 E N , X‘(x) is the function defined by

We then have Theorem 2.21. Assume that 4 E N , 7 > 0 exist so that for any x (2.130)

X“x) 2 ’I

E

Rd,

J.-M. BISMUT

102

Then for any xoE Rd, any T 2 0, Z A t S T ] [ C ~is] ~in' all 1 the L p ( P @P'). Proof. Note that (2.130) is a global assumption on Rd. (2.130) indicates in particular that UI(EnU E@( x , 0) spans Rd for each x . The techniques of estimations of Theorem 2.16 based on classical stochastic calculus do not work any more. Apparently, no single side ( z > 0) or (z < 0) is enough by itself to get the desired result. Assume for instance that C:=l - Pa(x,*

'Psl$we get llve+P611; 2 E-2k"llP,ll:+41d

(2.5)

with some positive constant k" depending only on the dimension. (Note that the best p so it is within the integration limits at least for E sufficientlysmall). Using (2.5) we get

-

2 2~11~''P.llaZ2 C,~-eIlP,1122+"d for c, > 0 depending only on the dimension and on the ellipticity bound for the u,'s. From (2.6)

d

-(llp,/l~)-~/d

dt

2 cg2

or

(llp611;)-2/d2 C,t €1

+1

114

Rodolfo FIGARI, Enza ORLANDI AND George PAPANICOLAOU

which implies

Finally the Chapman-Kolmogorov equation and (2.7) provides us with the pointwise bound P i x , x', t ) =

cP

k Y , t/2)P,(Y, x', t/2)

Y€ZP

The estimate a) now follows easily

Proof of b). Because min, [a log a

+ la] = -e-2-1

C,(P,(x, t ) lOgp,(x, t ) + 4xIP,(x,

t)

XEZ,

for any real 2, we have

+ bp.(x, t ) ) 2 --e-"-'

C

XEZf

or

Choosing b such that e-b-l

=

+

(1 1 - e-ar

and a = l / ( M e ( t ) we ) have -Q,(t>

+ 1 +b 2 -1.

Hence

Because 2/a

+ 1 2 (1 + e-.)/( 1 - e-.)

for any positive a, we have

e-'lzl

DiffusiveBehavior of a Random Walk

115

which is what we wanted to prove.

0 3. Outline of the convergence proof Let ~ ' ( x 0 , ) be the solution of the elliptic problem: (3.1)

(L,u')(x,0 )

+

(YU'(X, 0 ) = f ( x )

x E Zt,

f E 1'(Z;).

A heuristic argument based on a multiple scales [4]suggests that the continuum limit u(y), y E Rd, of uais the solution of the equation

Where qiJ is defined by

and the XJ are formally solutions of d

(3.4)

-

.z[V;(u,(x,

J=1

O)r:)]xk(x,0 ) = V;(u,k(x,0))

x E Z? .

The precise definition of the X's will be given below. Let us indicate briefly the physical interpretation of the quantities which appear in (3.3). (3.4) identifies Xk(x) as the steady temperature distribution due to a uniform heat flux that is generated by an applied unit temperature gradient in the k-direction. 6k,J V;Xk is then the temperature gradient in the J-direction induced by the applied unit temperature gradient in the kdirection. (3.3) identifies then qrJ as the average heat flux for unit gradient which is, by definition, the average conductivity of the medium. If the solution U' of (3.1) (for a! = 0) is going to become smooth in the microscopic scale we will have around each point a real temperature gradient {au@/ax,}. In this scheme we expect that C JqtJ (aue/axJ)is the effectiveheat flux and (3.2) the effective heat equation. To make the argument above rigorous one must first prove the existence of a unique solution for the heat flux induced by a unit temperature gradient. We state this, which is entirely analogous to the one in [4],as follows. There exists a unique set of functions @(o) belonging to L*(sd)satisfying

+

Rodolfo FIGARI, Enza ORLANDI AND George PAPANICOLAOU

116

5

(3.5)

E{GJ(J(0)(BkJ

J=l

+

fi)(Tej

- I)!&U)}

=

(3.6)

E($:(o)) = 0

(3.7)

E(+2CJ(Tei - I)$) = E($Z(TeJ - I>&

4

for any E L2(Q),J,k = 1, . . ., d. We want now to define the non-stationary potential P ( x , o) corresponding to the field (-J."J. Putting P(0, o) = 0, Vo, we can define Xk(x,o)by just adding the corresponding to a path of bonds having 0 as the starting point and x as the end point. Condition (3.7) just means that this definition is path-independent.

(4:)

Let us for example fix a specific order of the coordinates and choose a path reaching x going parallel to each axis i until reaching x1

with this definition

so that there is a constant B such that

TeJbeing unitary the von Neumann ergodic theorem implies that

converges strongly in L2(Q)and because of (3.7) it is Te,invariant for any k = 1, . - .,d and has average value equal to 0. We have then (3.9)

1 lim -E{(Xk(mx, 0))2}'/~ m

=0

.

m-m

Choosing for the path going to mx the m-times iterate of the path chosen from 0 to x, we have also m

Xk(mx,o) =

C TkXk(x,

0).

1=0

DiffusiveBehavior of a Random Walk

117

Because X*(x, o)is in L'(Q), the individual ergodic theorem tells us that X'"(mx, o)/m converges for a.e. o to an L'(sZ)function. Because we have L*(Q)-convergenceto 0 we get the result that for almost each o,Xk(mx,o)/m converges to 0. From here on the proof goes exactly as in [4] or [2]. It is first shown that the following is true. Let u'(x, o)be the unique solution of the variational problem

) of the equation Let ?i be the unique f 2 ( Z t solution

for all 6 E / * ( Z : ) where , the qrJ are defined by (3.3). Then lim

(3.12)

C

E{Iu'(x, w ) - ti'(x)r} = 0

r I O XEZ!

where +:(x, o) = (TX$9(o). Denote by Re(,?, o) E (LI")- A)-' and R,(R) = (La- A)-' the resolvents of L: and Et, where R $ R + . We can then restate the previous result as

For each also have (3.15)

E

and o,with the notation TJt,o) = etL: and T,(t) = etLiwe lim EII(T,O, 0 ) - w)fll:*cz.d) = 0 *lo

uniformly for t over compact sets. Let Q:( -,o) be the measure on D([O, m), Rd) corresponding to the Markov process x,(t) defined by the transition probabilities p,(x,y , t, a),x,y E Zf and by the initial condition

118

Rodolfo FIGARI, Enza ORLANDI

AND

George PAPANICOLAOU

P{x,(O) = x} = 1. Let us define in the same way Qf(-) as the measure on D([O, m), R d )corresponding to the Markov process whose infinitesimal generator is L, and subject to the same initial condition. One can rewrite (3.15) in the form

We have then shown that the difference of the one point distributions of the two processes goes to 0 when E goes to 0. This result extends immediately to all finite point distributions of the two processes. We now prove that both the Q:(. ,0)and the Q;( .) lay in a compact set of measures on D([O, m), Rd). (3.16) will then imply that they have the same limit points in the weak topology of measures, and if one of them has a unique limit, so does the other. A sufficient condition for compactness is (see [l], Chapter VI, Section V, Chapter 4) that lim lim sup {

(3.17)

h10

8 1 0 O 0, a, r E [O, 11) is the fundamental solution of the operator (1;/2)4 with the boundary conditions :

[

$(t,

i

0, r ) = -aP (t,

aa

p ( t , 0 , r ) = -(t, aP ao

1, r ) = 0

1, r ) = 0

1 p ( t , 0, r ) = p ( t , 1, r ) = 0

in the case (I) , in the case (11) , in the case (111) ,

and the second term of the right hand side of (2.7) is defined by a stochastic integral with respect to the cylindrical Brownian motion B,(a). We see that the equation (1.2) has a unique solution X,=X,(-) which is a %'-

[f

valued continuous process satisfying ~ u p ~ E ~ ~ ,iX,(a)Ydo] , ~ , each T

0 ) of independent %?-valued random variables such that each Y , has the same distribution as Y. Theorem 3. For each 0 < t, < t, < . . < t,, the joint distribution of { Y J - ;K ) } ; = ~ on the space V" tends weakZy to that of { Y t $ (. )} :=as, K-+

00.

Remark 2. In the case (11), the random variable Y( .) has a realization: Y(a) = a(Ao)w, with a d-dimensional Brownian motion w, starting from the origin with the time parameter E [O, 11.

0 4. Brownian strings in a potential field In this section we discuss the equation of a Brownian string in a potential field U ( x ) ( x E Rd),that is, the equation (1.2) with a(x) = 1, and b(x) = - (1/2)FU(x):

We give a stationary measure of the equation (4.1) explicitly and also investigate the equation in the case when the function U ( x ) diverges on some region in R d . (1) The stationary measure. The Hamiltonian of an elastic string X E %? in the potential field U with modulus x of elasticity is given by

Random Motion of Strings

127

With this H ( X ) , the equation (4.1) is rewritten in the following form:

where 6H/6X denotes the functional derivative of H ( X ) . This is an infinite-dimensional analogue of Einstein-Smoluchowski equation and is sometimes called Ginzburg-Landau equation in physical literatures (see [31). T o give the stationary measure of the equation (4.1), we set p the standard Wiener measure on the space C([O, K-'I, R d ) which satisfies the following conditions according to the cases (1)-(111). ( I ) p(X(0) E dx) = dx (.= Lebesgue measure on Rd), (11) p ( X ( 0 ) = A,) = 1 (i.e., the probability law of the Brownian motion starting from A,), (111) p(X(0) = A,, X(c-') = A,) = 1 (i.e., the probability law of the pinned Brownian motion). We note that p is an infinite measure in the case (I). Let p. be a measure on the space %? defined by p,(B) = p ( X ( r - ' .) E B ) for every topological Bore1 subset B of V. We introduce a non-negative measure v G us on the space %? by (4.3)

{

d v ( X ) = exp -

s'

U(X(o))do}dp.(X).

To show that v is a stationary measure of the equation (4.1), we assume the following.

Assumption 1. The function U is bounded from below and belongs to C1(Rd). The function V U is Lipschitz continuous. We denote by P,, the probability distribution on C([O,co), %?) of the the expectation solution X,of (4.1) with initial state X,E %? and by Ex*[-] with respcet to Pxo. Let Cb(%?)be the space of real valued bounded continuous functions on %? and let Co(%?) be the family of all f E C,(%?)which IX(o)l 2 M with some M = M ( f ) < 00. satisfy that f ( X ) = 0 if sup,,E.co,ll Theorem 4. Under Assumption 1, in each of three cases (I)-(III), the measure v is a stationary reversible measure of the solution X,of (4.1), that is, we have

T. FUNAKI

128

where ?hefunction space @(V)

Co(V)in the case (I) and C,(@ in the cases

=

(11) and (111).

This theorem is shown by using Theorem 1.

(2) The case with a divergent potential. The discussions here are limited to the case (111) which is interesting from the topological view point. We can define the measure v by (4.3) for a divergent potential U too, namely, U is a continuous function from Rd into (- 03, co]. We = 1. We set denote by Y again normalizing the measure to be ~(9)

D = {X E R d ; U ( X )< m} , D, = {X E R d ; U ( X )< N } ,

N

2

1,2,

* * *

,

and assume the following. Assumption 2. The function U is bounded from below and belongs to Cl(0). The function U is Lipschitz continuous on D, for each N = 1,2, . . .. Two points A , and A , belong to the region D and can be connected with a continuous curve in D. We also assume the following technical condition. Assumption 3. There exists a sequence { U(N'};=,of functions on Rd which satisfy the following four conditions. U ( N )satisfies Assumption 1 for each N = 1, 2, . . . . U")(X) = U ( x ) for x E D,. U(")converges non-decreasingly to V ( x )as N-+m for each x E Rd.

sup Je N

f j L'V(")(X(a))

(lo

dadP"(X)

0. U(x) = U(r) = with some 0

03

{(r - ro)-c

for r

ro ,

r

> ro ,

for

< r, < co and c > 2 where r = 1x1.

Random M o t i o n of Strings

129

Let D be the closure of the domain D and let 4 = CAo,A1([O, I], D) be the space of all X ( E % satifying X ( 0 ) = A,,, X(1) = A , and X ( a ) c D for every (i E [0, 11. Since the support of the measure Y is included in the space 8,the process X , introduced by Theorem 5 is a @-valued continuous process. The space @ has a decomposition according to the relative fundamental group ?cl of the connected component of B which include A,, and A , : qi. a )

Q=u zExl

This gives the following decomposition of the probsbility distribution P on C((- 00, w), 8)of the stationary process X,. Theorem 6.

We have

and P , is a conditional probability measure P(. I

where a$ = Y(%J

a(-

03, 0O>,

%,)I.

Since P , depends on the parameter

K,

we denote it by Pi,+.

( 3 ) ,The limit of P i , =as li -+ 00. The results of S. A. Molchanov enable us to investigate the limit of P 6 , Kas li -+ 00. We assume that d = 2 and D is a non-convex connected open polygon in R2. For each i E z,, let Ti( -)be a shortest continuous path in 9, which has a representation in terms of length. We also assume that the function U and the region D satisfy the following condition. U ( x ) 2 C (dis ( x , aD))-., x

Assumption 4. ( i )

E

D, with some C

> 0 and 0 < c < 1 + f l ,where aD is the boundary of D. (ii) For each i E r,, {u E (0, 1); T,((i)

Theorem 7.

8 5.

aD} is a finite set.

Under Assumptions 2, 3 and 4, we have that

Pi,+ -+ 6,;pszi where X$)((i)

E

= T,(a),

t

E

as

K -+ 03

,

R', (i E [0, I] and 6 denotes the Dirac's 8-measure.

Two dimensional Brownian strings

In this section we discuss the recurrent property of 2-dimensional strings with free edges. We assume that d = 2, K = 1 , a(x) = 12,b(x) = 0 and discuss the case (I) so that the equation is

dX,((i) = dBt((i)

+ 'dX,((i)dt . 2

T.

130

FUNAKI

This is a sort of Langevin equation and we can easily show that the solution X,is recurrent as an L2([0,I], R2)-valuedstochastic process. We get stronger results on recurrence stated as follows. Theorem 8. (i) The process X, is recurrent as a %-valued process, that is, for each X,E $fand non-empty open subset 0 of V, we have

P,,(X,,

E

0 for some t , f

03,

n

=

1,2, . . .) = 1 .

(ii) The string X , sweeps away allpoints in R', that is,for every x and X,E %, we have Px,(Xt,(u,) = x for some u,

E

[0, I] and t ,

co, n =

1,2,

-.

a )

E

R2

=1

.

To prove the theorem, we use a decomposition of the process X , : X,(O)

where

+ y , + Z,(fJ)

= X,t(fJ)

7

X,(u)is a non-random function introduced in

§ 2 and Y, =

J : ~ , ( u ) d u .We see that two processes Y , and Z , ( . ) are mutually inde-

pendent, Y , is a 2-dimensional Brownian motion and that Z , ( + )has the following property. Lemma. Let { v , ; t > 0) be a family of probability measures on the space %' = C([O, 11, RZ)induced by 2, and let v be that induced by

where w(u) is a 2-dimensional Brownian motion with a time parameter t sE [0, I]. Then v , tends weakly to Y as t + 03.

We can prove the assertion (i) by using the strong Markov property of the proeess X,and the recurrent property of Y , as an R2-valuedprocess. The assertion (ii) follows by noting the assertion (i). Remark 4. When d 2 3, since the process Y , on Ra is not recurrent, we can show that the solution X , of (5.1) is also not recurrent as a %'valued process.

0 6. Ornstein-Uhlenbeck theory for the string We consider the following equation:

13 1


I

UE[O,l],

tzo,

a>O,

where B,(a) is the cylindrical Brownian motion on L2([0,11, R d )and b(x) is assumed to be an Rd-valued Lipschitz continuous function defined on Rd. This equation represents the Ornstein-Uhlenbeck theory which describes the dynamics of the string with friction intensity a forced by white noise. We discuss only the cases (11) and (111) assuming A, = A , = 0. The operator - A with boundary condition (11) or (111) defined on the space L2([0,11, Rd)has pure point spectrum repeated according to multiplicity. Let $, = #,(a) be the normalized eigenfunction corresponding to 2,. Setting x , ( t ) = ( X t ( a ) ,$,(a)) and u,(t) = (V,(u), $i(u)) formally, the equation (6.1) turns into the following system, where, ( ., ) stands for the inner product in the space L2([0,11, Re).

jdx,(r) = u,(t)dt (6.1)’ {du,(t)

=

adw,(t)

+ ab,(x(t))dt - ER,x,(t)dt - avi(t)dt, 2

= 2 R;x: < -},

x = {xi};=l;x, E R’, Ilxll:

i=l

n = 0 , -1

we can show the following existence and uniqueness theorem of the solution of the equation (6.1)’. Theorem 9. For every initial state {x(O), ~ ( 0 )E)H, X H-’, there exists a unique solution { x ( t ) , u ( t ) } of the equation (6.1)’ such that SUP

tEC0,TI

E[IIx(t)IIi

and { x ( t ) , v(t)} E C([O,

+ Ilv(t>lI~ll
0) .

The parameter Q, > 0 represents the inverse temperature of the system. Physicists [3] are studying this type of equation with 3-dimensional para~ and with d replaced by the Laplacian ~ ~ = 1 2 / i 3 u ~ meter o = { u ~ } ; E= R3 in order to investigate the kinetic drumhead model of interface. The problem is to look for the limit of the solution of (7.1) as ,k?+ 03 tie., low temperature limit). The first step to answer this problem is to study the asymptotic behavior of the invariant measure Y()) of (7.1) as p ---f 03. By the result of 0 4, Y()) is given by d v ( b ) ( ~= ) 2;' exp

{ - B, 1;~ ( ~ ( o ) ) d o } d , u ( x ) x E v , )


133

where 2) is a normalizing constant and ,u is a probability distribution on %? = C([O, 11, R’) of the pinned Brownian motion X( .) satisfying X(0) = --A and X(1) = 2. Introducing a class { X e ( . ) ;f E [0, I]} of functions on 10, 11 by

we get the following partial result on the asymptotic behavior of dp).

Theorem 11. For every 6 lim P--

P( J 1

~ ( a)

> 0, we have

~ ( ( 0rda 1 > 6 f o r every f

E [0, I]

)=0 .

It seems to be an interesting but difficult problem to give the limit process of X , as p-+ 03. Remark 5 . The asymptotic behavior as E -+ 0 of the following equation was discussed by Faris and Jona-Lasinio [I].

i

dX,(O) = €dB,(O) - U’(X,(a))dt

+ dX,(O)dt ,

XJO) = X,(l) = 0 ,

where U is the function given by (7.2).

References W. G. Faris and G. Jona-Lasinio, Large fluctuations for a nonlinear heat equation with noise, J. Phys. A: Math. Gen. 15 (1982), 3025-3055. [ 2 ] T. Funaki, Random motion of strings and related stochastic evolution equations, Nagoya Math. J., 89 (1983), 129-193. [ 3 ] K. Kawasaki and T. Ohta, Kinetic drumhead model of interface I, Prog. Theoret. Phys., 67 (1982), 147-163. [ 1]

OF MATHEMATICS DEPARTMENT NAGOYA UNIVERSTY NAGOYA 464, JAPAN

Taniguchi Symp. SA Katata 1982, pp. 135-147

An Example of a Stochastic Quantum Process: Interaction of a Quantum Particle with a Boson Field Bernard GAVEAU

5 1. The study of field theories leads immediately to the study of infinite dimensional partial differential equations of Schrodinger type (the so-called Schwinger equations). These kinds of equations appear also in quantum statistical mechanics for systems which are not at thermodynamical equilibrium; in this context, a new class of problems has been introduced under the name of “quantum stochastic processes” but unfortunately, there are very few rigorously treated examples and models of such processes. I should like to develop, in this lecture, an example of a quantum stochastic process, which can be also considered as an elementary field theory. The general idea of a stochastic model is the following: we start with a purely deterministic model 2 with an infinite number of degrees of freedom, but we are interested only in a small part X I of this system. The evolution of I,, is controlled by the other part 2, of the system, about which we have only partial knowledge: thus, in a vague sense, X 2can be considered as “random” and 1,has some kind of random behavior, it exchanges energy with 2, (it can receive energy, and also loses energy, and in this last case, we speak of “friction”) and ZZcan be thought as a heat bath. In classical statistical mechanics, the situation is more or less clear (at least near equilibrium) and, in general it is useless to come back to an exact mechanical model of a heat bath; for example, in the Langevin equation, we have a heavy particle interacting with very light particles; in time At (which is short time for the observer, but a long time for the particle), the heavy particle loses momentum by friction at a rate -kpAt where k is a constant and p its momentum, and also the momentum varies by all the collisions with the light particles; if, in time At, the number of such collisions is very big, we can assume that the variation of momentum is given by d B ( t ) , (i. e. by a white noise) and in this way, we get a Langevin equation (1)

dp

=

-kpdt

+ d B ( t ).

Here we have not described the “heat bath” (i.e. the light particles) by

136

B. GAVEAU

classical mechanics but a rigorous theory should do so. Also, we have not considered the precise dynamics of the collision.

0 2. In quantum statistical mechanics, in non equilibrium situation, these questions are far from being clear and in fact not at all understood, in fact, if we try to adapt (1) in quantum mechanics, p becomes an operator and dB(r) should also be a kind of stochastic operator, but it is not clear what statistics we should impose on d B ( t ) ;the major problem is that (1) is already a macroscopic description, but quantum mechanics is, by its very construction, a microscopic description, even though the quantum statistics has finally macroscopic effects (for example, in conductivity problems, in laser physics and non linear optics, in some “chemical” models. . .). So, the idea is to try to come back to an exact mechanical system 2 and to look at the behavior of a small part by 8,forgetting the other bigger part Zz that we want to treat as a “heat bath”, or as a random system. The model that I should like to discuss here, is the following: El is a point quantum particle and we are interested in its behavwe suppose that 2, is in a ior; Z2is a boson field which interacts with Z,; state of equilibrium; here, to avoid difficulties, we will not consider thermodynamical equilibrium of 2, (so our model is totally caricatural); 2, has some kind of evolution under the influence of 8, and we are only interested in the behavior of 2, so we shall integrate on the degrees of freedom of XZ,or, in probabilistic language, take a conditional expectation with respect to 2,; in the case of a boson field these two operations are equivalent, but it is not, if we take for Zza fermion field or even a boson field in representation different from Fock representation. What is the interpretation of this model? From the point of view of statistical mechanics, we can think of it, as a motion of a particle in a crystal, the particle interacting with the phonon field which describes the quantization of lattice vibrations; in same sense, it is a polaron model [l]. From the point of view of field theory, we could think of a non relativitic particle with an electric charge, interacting with a quantized electromagnetic field (so the number of photons can vary because the particle reacts with the field, but the particle can never disappear as in true quantum electrodynamics) [21. Moreover we shall not be interested in renormalization problems. 0 3. Let us first recall how one can describe the interaction of a point charge with an exterior electromagnetic field. The Schrodinger equation is

137

An Example of a Stochastic Quantum Process

Here, m is the mass of the particle, e is its electric charge, his Planck’s constant, t is time, B is (J/ax, a/ay, a/az), is the wave function (I take a scalar particle), A is the external vector potential and V is the scalar potential. We can assume that we are in Lorentz gauge, so that

+

(3)

div A = 0

and if we write Fi = 1, m = 1, we get for the Hamiltonian

We shall use Kac’s idea to study this Hamiltonian; namely we shall study the heat equation

It is well known that we can write a probabilistic formula for this equation: call Ptq0the heat semi-group solution of (5); we have for V 0

=

In this formula, b,(t) is the 3-dimensional Brownian motion, Ep, is the mathematical expectation under the condition that b,(O) = p o ; in the Feynman-Kac exponential we see that there is an It6-stochastic integral (not a Stratonovich one); the fact that we use It6 intergral takes into account the term - (e2/2)I A 1’ of (4); moreover from the quantum electrodynamical point of view it is more natural to use It6 integral (because of a causality condition). It is rather difficult to get sensible estimate of (6); such estimates were given in [4] in the case of a particle with spin 1/2 and some symmetry condition on A ; in this conference, Professor Malliavin has given estimates on (6) for a scalar particle which are based on his stochastic calculus of variation; in fact, we can think of (6) as a Fourier stochastic integral operator, and the idea is to get an estimate of (6) just as in usual theory of Fourier integral, by integrating by part in Wiener space, using the Ornstein-Uhlenbeck operator for which the Gaussian measure Epois an invariant measure (see also [7], [S]). In this model we have a particle, and an external field acting on the particle, but the particle does not react on the field. We want now to consider the possibility of the reaction of the particle on the field. To

B. GAVEAU

138

do that we shall have to quantize the electromagnetic field.

9 4. a). Classically the electromagnetic field can be expanded in Fourier integrals (or Fourier series if the system is in a parallelepiped periodic boundary condition). We shall write

s

+

A(x) = p(k)-1/2(a(k)e-'k'x a*(k)e"k'x)dk

(1)

where the sign

s

denotes either a Fourier integral or a Fourier series, p(k)

in a given function of k , related to dispersion relations; moreover

where the e(')(k),i = 1,2 are orthogonal, length 1 and orthogonal to k and the a(Q(k)are the polarisation components. We shall group the terms in A(x) in the following way: let us note to

J

s*

an integral analogous

but where we identify k and - k and we write 4 w =

1 ~

p(k)l/z

(a@)

+ a*(k) + 4 - k ) + a*(-k))

p(k) = -iF(k)'/'(a(k) - a*(k) P(k) = p(k)'''(a*(k)

+ a(-k)

- a*(-k))

+ ~ ( k-) ~ * ( - k )- ~ ( - k ) )

(see Landau-Lifschitz [6]). With these notations we obtain (2)

[q(k)cos (kax)

Moreover we also obtain

and here, a*(k).a(k)is a scalar

+ Q(k) sin (kex)]dk

A n Example of a Stochastic Quantum Process

139

b). Let us now pass to the quantization of electromagnetic field; in are the annihilation (resp. creation) operator this case u("(k)(resp. a(*)*(k)) of a photon (or a phonon, or more generally a boson) of momentum k and polarisation e(')(k)(transverse vector to k) and the quantity given by the right hand side of (3) is the Hamiltonian of the free field; in fact for each i = 1, 2 and each k, (q(*)(k), p(*)(k))(resp. (Qct)(k),P(t)(k)))are set of two canonically conjugate operators satisfying

[q'"(k),q'J'(k')]= 0 [p")(k),p("(k')] = 0 [q@)(k),p(j)(k')] = isij8(k - k') and the same for Q(*)(k), P(j)(K). If we replace formally in the right hand side of (3) thep, q and P, Q by these operators we get 1 F(k)(a*(k).a(k) -

2

+ a(k).a*(k))dk

(because u and u* do not commute). Unfortunately this operator has a lowest eigenstate which has infinite energy and which is

J p(k)dk = + m

(in general) .

The idea is that we subtract this "quantity" from the right hand side of (3) and doing so, we get only

This is a trivial renormalization, even though the point 0 energy that we have subtracted is really useful from other points of view. Let us now fix i E {1,2} and k and introduce (5)

+

p[*'(k)* p(k)2q'"(k)2 - p(k)

9

P ( ~and ) qci)satisfying the commutation relations given above. The von

Neumann theorem about representation of commutation relation tells us that we can represent this operator as an operator of Ornstein-Uhlenbeck type

Here we have an infinite number of such operators ( 5 ) and the von Neu-

140

B. GAVEAU

mann theorem is false for such objects. But, nevertheless, we shall assume that the Hamiltonian (4) can be represented by a formal functional differential operator: namely we introduce for each k, variables q ( ( ) ( k )and Q(*)(&) and we define

This is the free field Halmiltonian and it acts on functionals T(g(k),Q(k)).

0 5. a). We consider now the interaction of a non relativistic quantum particle in R" with the boson field described above. The wave function of the total system is a functional Y(x, g(&), Q(&),t ) depending on x E R3, and of the coordinates of the field. Its temporal evolution is given by (7)

A is given by formula (2) and H F p by formula (6). In the case where x is particle in a crystal and A is a phonon field describing the quantization of lattice vibration, (7) is justified because the generalized momentum of the particle is p - eA instead of p . b). If we want to study the stationary states of (7), we have just to study the Hamiltonian on the right hand side of (7). Using the ideas of Kac [5] it is then easier to study the heat semi-group of (7) (or to go to imaginary time or to pass to so called Euclidean version); in this case we can write a probabilistic formula for this heat semi-group. More precisely let us introduce: (1) The classical Brownian motion b,(t) in R3 (generator 4 2 ) (2) For each k and each i let us introduce an Ornstein-Uhlenbeck process q:)(k, s) the generator of which is the Ornstein-Uhlenbeck operator

and also we suppose that q?)(k,s) are independent for i f j and & # k' and also independent of Brownian motion. Let us also introduce a second set of Ornstein-Uhlenbeck processes


141

independent of the preceding processes and independent between themselves for i # j and k # k'. Let us finally denote

(we suppress the sample a). (3) Let us denote by E the mathematical expectation on the Brownian motion, E" the mathematical expectation on the degrees of freedom of the field, with the invariant Gaussian measure, namely the integration over 4(k, s), Q(k, s), these Ornstein-Uhlenbeck processes being distributed at time s = 0 according to the invariant Gaussian measures of their generators (and then at all time s). (4) Let us suppose that at time t = 0, the system is in the following state : the Ornstein-Uhlenbeck processes are distributed according to their invariant distribution and the particle according to some wave function T(x). Then at time t the particle has a wave function F(t, x) which is given by

I

(

F(t, x) = E,E exp -ie

.

J:

A(b,(s)).db,(s)

because of 5 3, but here, A contains in its expression the Ornstein-Uhlenbeck processes (see formula (2)). We can rewrite this as

+

Q 6. In formula (9), E" is a Gaussian expectation, do not depend on the degrees of freedom of E" and then we have an expectation E" of an exponential of a linear functional of a Gaussian process; we then have the general well-known rule

But the q(t)(k,s), Q@)(k,s) are independent

142

B. GAVEAU

E((J:S*. . .)’)

=

J* E((J~.. .)‘) .

In our case this quantity is exactly

where we denote

Moreover it is well known that

and the same with Q instead of q ; then the last computation gives

This is an evolution equation on the wave function p for the quantum particle in R3which we are studying. But this evolution contains a double Kac functional [3], so that this evolution is non Markovian. U p to now the computations are exact.

5 6. a). Let us first remark that e(‘)(k)depends only on the direction of k, and that in general we can suppose y(k) depends only on the modulus 1 k I.

Let us replace S*dk by

A n Example of a Stochastic Quantum Process

143

b). Let us also suppose that .\G. is bounded; we have then to study the quantity

X exp (- p(1 k \)Is - s")db?)(k, s)db?)(k,s'))]

.

If p is only in L2(R3)by Cauchy-Schwarz inequality we have to study the same quantity J ( t ) but with e2/4instead of e2/8. Because exp is a convex function we can put out of it the

[

du(k) and interchange this sign with

s

s2

the sign E and we can write that J ( t ) < do(ko)J(ko,t ) where J(ko,t ) depends only on k, in S2and is

(here bc)(k,s) = e@)(k) .b,(s) depends only on the direction of k). (c) But in the preceding formula for J(ko,t ) , b2)(ko,s), b?(k0, s) and ko.b,(s) are the three orthogonal projections of b&) on the three orthogonal directions eCt)(ko),e(')(k0),k,; these three projections are then three independent Brownian motions in R, so that the stochastic integrals in (15) are stochastic integrals of certain functions knowing ko.b,(s). By changing the frame of orthogonal coordinate, we can then suppose that ko is in direction 3, e(')(ko)and e(')(k0)in directions 1 and 2. Let us define the stochastic kernel

Then (15) becomes, using the independence of dbf)(s)and dbf)(s)

(17)

J(ko,t ) = E(exp ( - 1

where R = -(e2/4).

f f R,(s, s')db~')(s)dbt')(s'))) 0

0

B. GAVEAU

144

d). To obtain an estimate we shall compute (16) by discretizing the time, we shall also denote

So (oo +m

Ro =

p2dp

assuming that the classical dispersion relation p of the boson field is such that R, < m. We have then to study in the discretized version

+

Because this is a quadratic exponential in bj:) - b:iLl and because R,(st, s,) is independent of the first component of the Brownian path, we can partially integrate the first component of the Brownian path. Denote by 92 the n x n matrix the elements of which are R(s,, s,) for i # j and R, on the diagonal and denote As = si - si-l. The integral (18) is

where Ec3) denotes the expectation of the third component of Brownian path. The result is then (19)

J(k,, t ) = F3)((det(I

+ 2 1 A ~ 9 ? ) ) - ’ /.~ )

Let us now pass to the limit for As-+ 0; this means the following: we consider on [0, t ] the following integral equation depending on the random o ;

This integral equation has a symmetric, Hilbert-Schmidt kernel, so that its eigenvalues are discrete 1 p,(t) and

+

It is also clear that this kernel R is positive by its construction; in fact, we have


=

145

+ sin pb(s) sin pb(s'))f (s)f(s')dsds'

1;"

P'dP."((J:aPY s) cos (Pb(s))f(s)ds)z

+

(s:

>

!XP, s) sin (Pb(s))f(s)ds)2) 09

where Q(p, s) is an Ornstein-Uhlenbeck process which is independent of b(s) and E" is its expectation. e). From this last fact we see immediately that J(k0Y

t)

< E((1

Now +==

C pt(t) = R i=l so that (22)

s:

+ EPa(t))-li2). i=l

R(s, s)ds = RtRo ,

J(k,, t ) < (1

+ RtRo)-1/2

from which we also deduce (23)

J(t)

< (1 + RtR0)-'/*

which means the mapping +(x) -+ +(t, x) given by ( 9 ) is a linear mapping from L" to L" or from L2 to L" and that its norm is decreasing like t - ' I 2 when t + + m .

5 7. Let us now describe some ameliorations of the preceding results: a). The mapping +(x) -+ +(t, x) is not a semi-group and the functional (9) is not a Markov functional; its memory is due to the degrees of freedom of the boson fields that we have integrated by saying that at time 0, there are no bosons around (vacuum states or integration with respect to the invariant Gaussian measure of the Ornstein-Uhlenbeck processes). b). Let us now give a slightly better estimate of the norm of the mapping (9) in a particular case. This particular case is the situation in

B. GAVEAU

146

which the quantized boson field has only degrees of freedom in the direction ko (and polarisation orthogonal to ko). This means that the sign

1*

in

(9) has to be replaced by

(so that we have a Dirac measure on the unit spahere instead of the uniform measure). We can suppose that ko is the z-axis and we are studying the scattering of a particle by a boson field having a given polarization. We replace +(x) by a Fourier integral +(x> =

J exp i(l,x)$(l)cil.

In (9) we can integrate first on the components x and y of the Brownian path and we obtain: $(l)E,E"(exp

[-4[ds

x exp (il,z&>) where z,(s) denotes the z component of the Brownian path and I = (Iz, I,, l,); finally q ( " ) ( p s) , . . - denote the degree of freedom of the boson

field for x-polarization and momentum p , E" is the expectation of the field and E, the expectation on z,(s) starting at time 0 at z. In this situation we can perform the E" expectation, in fact at that point we have reduced the problem to an ordinary Kac exponential in one variable but in a random potential depending on time given by the following formula:

Using this formalism, it is possible to obtain an exponential decrease in t of the norm of the mapping +(x) -P +(t, x) but the details are complicate and will be given elsewhere.

A n Example of a Stochastic

Qiraritiirii

Process

147

0 8. The preceding text is a detailed version of the second part of my talk at Katata, the first part was concerned with fluctuation for a chemical reaction and has already appeared in Journal of Statistical physics, 1982. Finally, the lecture at the Kyoto symposium was concerned with various problems on some points of field theory and has already appeared in other journals, one part of this lecture was concerned with the quantization of Yang Mills field with cut off in a non perturbative way (J. Funct. Analysis, 38, 1980, 324 and 42, 1981, 356); another part explained a new way of doing a renormalization by changing the Lagrangian frame in infinite dimension and is directly inspired by the work of Professor Leray and Professor Maslov (J. Funct. Analysis, 1982, and C. R. Acad. Sci. Paris, 295, 1982, 189 and C . R. Acad. Sci. Paris, 293, 1981, 469). I should like to express my deepest gratitude to M. Taniguchi and to Professor It6 for their kind invitation to attend this symposium. References [11 [2] [3] [4] [5 ] [6] [7 ]

181

R. P. Feynman, Statistical mechnics, Benjamin, 1972. Quantum electrodynamics, Benjamin, 1961. B. Gaveau, Estimation de fonctionnelles de Kac et de certaines fonclionnelles non markoviennes singulikres, C.R. Acad. Sc. Paris, 292 (198 1 ),

-,

577-580. B. Gaveau and J. Vauthier, Intkgrales stochastiques oscillantes: 1'6quation de Pauli, J. Funct. Analysis, 44 (1981), 3 8 8 4 0 0 . M. Kac, On some connections between differential and integral equations and theory of probability, Proc. Second Berkeley Symp. Math. Statist. Prob., 189-215, Univ. California Press, Berkeley, 1951. L. D. Landau and E. M. Lifschitz, Theorie quantique relativiste, Mir, Moscou, 1963. P. Malliavin, Stochastic calculus of variation and hypoelliptic operators, Proc. Intern. Symp. SDE Kyoto 1976, ed by K. It8, 195-263, Kinokuniya, Tokyo, 1978. -, Sur les intkgrales stochastiques oscillantes, C.R. Acad. Sc. Paris, 295 (1982), 295-300.

D~PARTMENT DE MATH~MATIQUES TOUR45-46,s ETAGE U N I V E R S PARIS I~ VI 4 PLACE JUSSIEU,75230, PARISCEDEX 05 FRANCE


Convergence in L2 of Stochastic Ising Models: Jump Processes and Diffusions Richard HOLLEY')

0 0.

Introduction

The purpose of this paper is to investigate the convergence of the semigroups of stochastic Ising models in the L2 spaces of their Gibbs (equilibrium) states. In particular we are interested in finding conditions which guarantee that the convergence takes place exponentially fast. Exponentially fast Lz convergence has several interesting consequences. In the first place it means that the system returns to equilibrium very quickly if it is displaced from equilibrium by a small amount. Secondly it gives quite a bit of information on the space time correlations when the system is in equilibrium. This information can in turn be used to study renormalization and the resulting limiting process. Finally, if the interaction is attractive (ferromagnetic) then it is sometimes possible to draw conclusions about the rate of convergence in the uniform norm from information about the rate of convergence in L2. Section two is devoted to a more detailed explanation of these consequences of exponentially fast convergence in L2. We are primarily interested in stochastic Ising models in which changes take place via continuous diffusion with reflection at the ends of a bounded interval, and we choose the diffusion and drift coefficients to minimize the technical difficulties in the construction of these processes. Section one contains the description of these processes and the corresponding Gibbs states. The semigroup of a stochastic Ising model converges exponentially fast in L2if and only if the generator has 0 as a simple eigenvalue and 0 is an isolated point of the spectrum. The former condition is equivalent to the tail field of the Gibbs state being trivial, so we concentrate on the isolation of 0 in the spectrum. Our tool for attacking this problem is a relationship between the spectrum of the diffusing stochastic Ising model and the spectrum of a stochastic Ising model in which all changes are made by jumps. The latter seems to be more tractable. Section three ')

Research supported in part by N.S.F. Grant MCS 80-07300.

R. HOLLEY

150

contains a description of the jump process and explains the relationship between the spectrum of the diffusion and jump generators. Finally in sections four and five we obtain sufficient conditions for 0 to be an isolated point of the spectrum of the generator of the jump process. The method in section four involves a study of the resolvent using the explicit form of the generator, and in section five we take an indirect route by finding sufficient conditions for the semigroup to converge exponentially fast in the uniform norm uniformly on a dense set of functions. The conditions we obtain involve considerable notation, so we postpone giving them precisely; however, basically what they say is that at high temperatures there is exponentially fast convergence in L2for both the diffusion and jump processes. $j1.

The diffusion stochastic Ising model

The stochastic Ising models with which we will be dealing are Markov processes on the state spaces E = l-IkEZdI,, where = [a,b], a k e d closed bounded interval for all k. We denote elements of E by x and xk E I, denotes the value of the configuration x at site k. The specific stochastic Ising model is determined by an interaction or potential which we will take to be a pair potential (@(s,j)} of finite range, R, (i.e. @ ( k , , ) ( xxk j, ) = 0 if Ik - j l > R). In addition we assume is translation invariant (i.e. @ ( k , J ) ( x k , x j ) = @ ( , , , , - k ) ( x , , , X j - k ) that {CD(~,~)} if x , = x, and x , = x , - k ) and that

This last assumption is made to minimize technical difficulties. An example of such an interaction is (1.2)

@(k,j)(xk>

=

-sin

(xk)

sin ( x j ) X ( l ) ( l

- j I),

where X, is the indicator function of A and we take each Ik = [ -(n/2), n/2]. The Gibbs states determined by the interaction {@(k,j)} at temperatures 1//3 are probability measures on the Bore1 sets of E ( E is given the product topology) which satisfy the following conditions: For A c Z d define a-algebras

F A= u { y , : k

E

A} and g A= u { x k :k $ A } .

Let PA(Xk:

I j l x ) = exp [-p(

@(k,j)(xk, k,j€A

xj)

+

kEA

ieA

@(k,j)(xk>

xj))1/z(n9

151

Convergence in L3 of Stochastic lsing Models

where Z(A, x) is the normalizing constant needed to make ,on(.Ix) a probability density in the variable . with respect to the normalized product Ik. Then ,u is a Gibbs state for the interacof Lebesgue measures on tion 0 at temperatures I/! (we write p E Yo,@) if ,u(. I g,,)(x) has density ,on(. 1 x)

(1 -3)

.

As will be seen below the elements of Y o , are @ stationary measures for the stochastic king model determined by { @ ( k , j ) } and 8. In order to define the diffusion stochastic Ising model we first let

{

9 = f : f E Cm(E),f depends on only finitely many coordinates,

and

af- (x) = 0

-

1.

if x, is an endpoint of Ik

In order to simplify the notation we set Uk(xk,x) = C jO ( * , J x Lx, j ) . For f E 59 define

In our example (1.2) this becomes (1.5)

9 f ( x )=

x (-((x)ax; azf + ,8 cos (x,) x

sin ( x j )

lj-kl=l

kEZd

~

af

-

(x))

ax,

.

Given x E E it can be shown that there is a unique measure Px on C([O,a), E ) such that for all f

E 9,f ( x ( t ) ) -

s:

9f(x(s))ds is a P, mar-

tingale (with respect to the natural a-algebras) and such that

P, (Lebesgue measure It: x k ( t ) = a or b} = 0)

= 1

for all k

and P,(x(O) = x) = 1. (See Quinn [lo]). The Markov family {Px:x E E } is the diffusion stochastic Ising model (see [12]). We let Tt be the semigroup of this Markov process, and it is clear from the definition that the generator of the semigroup, when restricted to 9 is given by 9. Let ,u E Yo,, and define P, =

1

P,,u(dx).

We conclude this section

E

by showing that P, is reversible, i.e. that (1.6)

P,W

E

r,,x(s) E r,)= ~ , ( x ( t )E rz,x(s) E r,)

R.HOLLEY

152

for all 0 5 s, t

0 such that forfE L2(p) (2.1)

IlTtf -

< I l f - P.(f)llLzip) e - ‘ ,

P(f)llLY,81

7

An immediate consequence of this is that if the system is perturbed slightly away from p then it returns to p very quickly. To see this suppose that Y is a probability measure with Y 0 (depending on {@,,,,)} and E ) such that for all A, c A, c Z d and a l l y e B which are 2FAomeasurable and g E

Convergence in L2of Stochastic Ising Models

153

Lz@)which are PAlmeasurable there is a constant A , such that (2.2)

IEJ"(~(t>lg(x(o)>l - , 4 f ) A g ) I 0 such that for all n

Convergence in L2of Stochastic lsing Models

< Ilf - pCln+(f)llLn(p:)

f 2 - c ( t - 1 l)g;

llLZ~t$)

155

7

where g,' is the Radon-Nikodym derivative of (T(")+)*6, with respect to p,'. It can be shown (see [lo]) that there is a R < 03 such that llg; 1 J L 2 ( r ~ ) I elna. Hence the first term on the right side of (2.4) is bounded by 2 sup,If(x)l e 8 e - ( 6 t - 2 n. d )Taking n = ( ( ~ / 2 2 ) t ) 'we / ~ see that the first term (2.4) will go to zero exponentially as fast as t + 00. on the right side of That leaves the other two terms. The second term involves only the equilibrium states, and whether or not it goes to zero and if so how fast has nothing to do with the stochatic Ising model. We postpone our discussion of it for a moment. The third term on the right side of (2.4) causes us the most trouble, and we have been unable to handle it in general. The problem is that even though we know that T?)+f(x) -+ T,f(x) as n -+ 03 for fixed t and x , we have already decided to let t go to infinity with n. The general theorem which we can prove in this situation requires that n goes to infinity at least as fast as a constant (>e) times t . This is clearly not the case for our choice of n. We get around the difficulty by adding another hypothesis which will allow us to forget about the third term. j , } is attractive, The additional hypothesis is that the interaction {a(,, or ferromagnetic. Specifically we assume that

This has the following consequences (see [9] Chapter 9). For all f e 9, which are increasing i.e. satisfy (2.8)

f ( x ) 2 f ( y ) if x , 2 y ,

for all k

E

(here a, = a and b, = b for all k E Z d ) and for all p

E

Zd

we have, for all t 2 0 and x E E,

(2.10) The latter consequence is a standard result about ferromagnetic systems (see [9]). The former, (2.9), is proved in [lo]; however it is intuitively

R. HOLLEY

156

plausible since (2.7) implies that if x, 2 y r , k E Z d and x ( t ) = y and d R ) + ( t= ) x then the drift at every site in A(n) is larger in the x ( ~ ) + ( - ) process than it is in the x( .) process. Thus since the diffusion coefficients are always constant, the two processes can be coupled in such a way that if xp)+(O) 2 x,(O) for all k then the same inequality holds for all time. We now take an f E 9 satisfying (2.8) and argue as in (2.4). (2.9) implies that

As in (2.6) the first term on the right side of (2.12) is bounded by ( f ( b ) - f(a))eae-('t-.'"d), and we denote the second term by F,(n). Then taking n = ((c/2R)t)'ld we see that for all increasingfc 9 there is a constant A , such that

In some cases (see Remark (5.16) below) it is possible to show that there is a constant ?' > 0 such that for all f~ .9 there is a constant B, such that (2.14)

I p m - p,(f)l I B F r n*

In this case we see from (2.13) that there is a number 8 all increasingfe 9 there is a constant C, such that (2.15)

sup I T t f ( x )- p ( f > I < C, exp.(-6t l / d ) X

> 0 such that for .

It is well known that (2.14) holds if d = 1. Thus in one dimension the semigroup converges exponentially fast in the uniform norm on the increasing elements of 93 if it converges exponentially fast in L2. § 3. Jump stochastic Ising models

In the jump stochastic Ising model each site jumps at the times of a Poisson process with intensity one. The Poisson processes used by different sites are assumed to be independent, and when a jump occurs at . where site k the distribution of the new position has density P ( ~ , (Ix(t)),

Convergence in L2of Stochastic Ising Models

157

~ ( tis) the configuration at the time of the jump. To describe the generator, 8, of this process we let 9 be as before and define 8 on 9 by (3.1)

=

cs1 k

(f(ZkY

a

u) - f(x))pk(Y I x)dy

*

Here (Zk, JJ) is the element of E which is equal to x at every site except

k, where it is equal to y. Let { S t :t 2 0) be the unique semigroup whose generators, when restricted to 9,is given by (3.1) (see [7]). By a computation very similar to (1.7) we see that if p E Y,,,, then forf, g E 9

x

k("3

v) - g(x))pk(Y 1 x ) d y d d x )

Hence again using the results of [ l l ] we see that Also

SZP

*

is selfadjoint in L2(p).

But

pk(zIx)-Vz. Since Uk(z,x) is bounded, c > 0. From (3.4) we obtain

R. HOLLEY

158

Integrating both sides of (3.5) with respect to p, summing over k and using (1.7) and (3.3) we obtain

Note that the inequality in (3.6) still holds for processes which have only finitely many sites such as those in the previous section. Our goal is to prove, under certain conditions, that for some E > 0 and for all f E L2(p) (3.7) This is equivalent to

In view of this equivalence and (3.6) it seems plausible that to prove (3.7) it should be enough to show that there is a constant E~ > 0 such that

J

-J f ~ ~ d2 pgo (f - p ( f ) ) ' d p

(3.9)

for all f e 9 .

That proving (3.6) is enough to prove (3.7) has been proved by Quinn [lo]. In turn (3.9) is equivalent to

and the last two sections are devoted to proofs of (3.10) at high temperatures.

3 4.

L2 convergence for jump processes

be fixed. In this section L2 always means L'@) and all Let p E expectations are computed with respect to p. Our first observation is that in L2, S is just a sum of projectives. To see this note that

is a projection, and that if we set Qk = I - Pg, then (4.2)

Qf(.>=

-C Qef(*) -

Convergence in L2of Stochastic lsing Models

159

(4.3) Remark. Throughout this section we will work with infinite sums even in some situations where it is not clear that they make any sense. To make all of this rigorous one should work with finite sums, get bounds which are uniform in the number of terms, and then pass to the limit. See [7], which is the basis for the ideas in this section, where this type of argument is done. We leave it to the reader to supply these steps. FOrfe Lz we define lllSlll= Clr II Q n f l l m Note that {f:lllflll < m) is a subset of the domain of W) and that

We begin by finding sufficient conditions to guarantee that (4.5)

IIlSJIII I e - t lllflll ’

In order to state these conditions we first define

and set

x Pa,’(% (4.8) Lemma.

U

v)dudv}

.

160

R. HOLLEY

where

). We now choose ha(?) and h,(v) to minimize M h o , h l ( ~(i.e.

and hI(v) = 1 - ha(7).) Call the resulting minimum M(7). Note that the factor not involving Mho,hl(ljl) on the right side of (4.10) is Jw,g - PkgY 1 5=Wl(.I). Thus

Convergence in Lp of Stochastic Ising Models

161

Therefore applying Lemma (4.8) we have (4.16)

(2

+ 1)II Q&ll

< II

Qgf

I1 + jC A,,j(Il Q4l+ /I Qjgll) +k

Summing (4.16) over k and recalling that g = R2f yields (4.17)

(2

+ 1 - 4llIRAfllI Il l l f 111 -

Finally the theorem is proved by using the formula

st

--

lim &A(AEA-C

(see [I] Chapter 13, Section 10).

2-m

(4.18) Corollary. If A < 1 and ,u has trivial tailjield (i.e. is an extremal point of .40m,s),then for all f E L2 (4.19) Proof.

IIStf

- P(flIIL.1I I l f - P(flll e-cl-A)t

Let 0 5 t, < tz. Then forfE 9,

*

R. HOLLEY

1 62

Hence SJconverges to something, say

4, in L2 and

(4.20) IlIS,flll = 0. i.e. Pk# = 4 for all Also 11141I1 = ll[limt4mStfill 2 ht-m k , and hence q5 is tail measurable. By hypothesis 4 must be constant and the constant is clearly p(f>. Thus we have (4.19) for all f e 9. Since 9 is dense in L2 the result follows (see [lo]).

(4.21) Example. In example (1.2) if d = 1 one can improve the above results to conclude that (4.19) holds if A = 3 4 , < 1. In example (1.2) with d = 1, 3 4 , < 1 if ,Q < 0.47. We don’t pursue this here though because the results in the next section allow us to conclude that (4.19) holds for Q, < 1 in this example. (4.22) Remark. Note that the results in this section make no assumption of attractiveness even though our principle application is to that case. In the next section we take another approach which relies heavily on attractiveness.

Q 5. The attraction case: Jump processes In this section we take an approach which has no hope of working if there is more than one Gibbs state. Nevertheless in the attractive case it seems to yield better results than those obtained in the last section. The general method that we use here would yield something for arbitrary finite range interactions ;however, we add the assumption of attractiveness in order to interchange an integral and an absolute value and thus obtain better results. The idea is to couple two jump processes in such a way that at any given site all jumps in both processes occur simultaneously and to show that no matter what the initial configurations are, the difference between the two processes at any given site goes to zero exponentially fast. We begin by constructing the coupled process. First note that if

and

163

Convergence in L' of Stochastic lsing Models

The coupled process is now constructed as follows. Let E 2 = E X E be the state space. We denote elements of E 2by (x('),x(')). Now define an operator b on the functions on E 2 which depend on only finitely many coordinates by (5.5)

Of(X('),

x(2)) =

cs' (f(PX,I x(')),Pn, I $b,(U

$b&4

k

0

- f ( x ( ' ) X'2'))dU ,

I f f really only depends on one variable, say x('), then bf(x('),d2)) = Q f ( x ( l ) )and hence the semigroup gt generated by b satisfies gtf ( x ( ' ) , = S,f ( x ( ' ) )for such functions f. (see [8] Section 2.1). We now assume that the interaction {@(*,,)} is attractive (or ferromagnetic). i.e. that (2.7) holds. It is easily checked that !.7) implies that if xp) 2 xy) for a l l j then for all y and z

From [9] we know that (5.6) implies that + k ( y \ x ( ' ) ) +*(y x@))for all y , hence (5.7) if xy) 2 xy) for allj, then for all u E [0, 11, $bk(uI x(')) 2 #&(u I x@)).

It follows that if xY'(0) 2 xY'(0) for all j , then xY)(t) 2 xiz)(t)for all j and all t 2 0 (see [3]). For a general review of coupling arguments see Liggett [8]. (5.8) Lemma.

If the interaction is attractive and

then for all f E 9 and all x('),x(') E E (5.10)

;:1

[ S t f ( x ( ' )) S,f(x")')[5 (b - a ) C sup -(x) k

x

I

e-('-a)t.

Proof. By the monotonicity mentioned above we see that

R. HCLLEY

164

To complete the proof it suffices to show that

+ (1 - R)xfZ)(t)for some R E [0, 11.

where z is of the form Rx(')(t)

(5.12)

X E,,,[xY)(t) = -(1

Thus

- xY'(t)] - E,,.[~f)(t) - xh"'(t)]

- CY)E,,,[XA''(~) - ~:'(t)],

since Eb,,[xY)(t)- xp)(t)] = E,,,[xk)(t) - x:)(t)] for all j by the translation invariance of the interaction. Finally (5.1 1) follows immediately from (5.12). (5.13) Theorem. rfthe interaction is attractive and (5.9) holds then there is only one p E Yo,) and for all f E L2(p), (5.14)

IIStf-

P(f)IILa(p'

I Ilf-

pU(f)IILl(p)e-(l-"'t

*

Convergence in L2of Stochastic Zsing Models

165

Proof. Since all elements of Y,,, are stationary for S, the uniqueness of ,u follows from Lemma (5.8). Since 9 is dense in L2(,u)(5.14) follows from (5.10) and the spectral theorem (see Quinn [lo]).

(5.15) Example. Again we consider the example @W(Xk,x,) = --in (x*>sin (x,)ql)(lk -ill with I* = [-(n/2), n/2]. In this case

where I, is the modified Bessel function of order 0. In order to get functions which we recognize we note that if in Lemma (5.8) we replace

and differentiate with respect to sin (xj) instead of x,, then the conclusion (5.10) still holds except the constant depending on f is different. The result is that if

then (5.14) holds. Thus we need to find ,8 so that sup, (d/da)(Il(~a)/Io(~a)) < 1/2d. = /Isup, (d/du)(Zl(u)/Io(a)). First note that sup, (d/da)(Il(pu)/Io(pa)) Using the relations (d/dz)I,(z) Thus we need to find sup, (d/da)(Il(a)/Io(a)). = Il(z), and 2(d/dz)I,(z) = Iv-l(z) Iv+l(z),and (d/dz)I,(z) = Iv-l(z) (v/z)Iv(z),we proceed as follows.

+

_d_ _ Il(4 _ --

4(Io(4

da lo@)

+ Iz(a>>Io(a)- I1(aIZ Z0(a>Z

=-(11 2

2Ida)Z - Io(4Iz(4 10(4z

Note that since 11(0)= 12(0)= 0 and Io(0)= 1, we have sup, (d/du)(Il(u)/Io(a)) 2 $. To obtain the opposite inequality it suffices to show that 211(a)22 Io(a)12(a)for all a. Consider 2Z,(a)z - Io(a)12(u). At a = 0 it is equal to 0 and its derivative is

R. HOLLEY

166

From the power series expansion of I , it is easily seen that this has the same sign as a, and hence

Finally we conclude that in this example if /3

< l/d then (5.14)holds.

(5.16) Remark. What lemma (5.8) shows is that if its hypotheses are , A , is satisfied then for all f e 9 sup,( S,f(x)- ,u(f)l 2 A , e - ( l - a ) twhere a constant depending onf. Just as in [5]it can be shown that this implies that there is a 7' > 0 such that (2.14) holds. Thus when the interaction is attractive and (5.9) holds we may conclude that (2.15) holds for the diffusion stochastic Ising model.

References 1 1 1 W. Feller, An introduction to probability theory and its applications vol. 2 John Wiley and Sons, New York, 1971. 121 D. Forster, Hydrodynamics fluctuations, broken symmetry and correlation functions, Benjamin Reading, Mass, 1975. t 3 1 R. Holley, An Ergodic theorem for interacting systems with attractive interactions, Z. Wahrscheinlichkeitstheorie verw. Geb., 24 (1972), 325-334. 1 4 1 R. Holley and D. Stroock, L, Theory for the stochastic Ising model, Z. Wahrscheinlichkeitstheorie verw. Geb., 35 (1976), 87-101. Applications of the stochastic Ising model to the Gibbs states, Commun. 1 5 1 -, Math. Phys., 48 (1976), 249-265. Rescaling short range interacting stochastic processes in higher dimen 161 -, sions, Colloquia Mathematica Societatis Janos Bolya, 27 Random Fields, Esztergom (Hungary), 1979. T. M. Liggett, Existence theorems for infinite particle systems, Trans. Amer. Math. SOC.,165 (1972), 471-481. -, The stochastic evolution of infinite systems of interacting particles, Lecture Notes in Mathematics, 598 (1977), 187-248, Springer-Verlag, Berlin-Heidelberg-New York. 191 C. Preston, Random fields, Lecture Notes in Mathematics, 534 (1976), Springer-Verlag, New York. T. Quinn, Ph. D. Thesis, University of Colorado, 1983. D. Stroock and M. Fukushima, Reversibility of solutions to martingale problems, Adv. in Math., to appear.

Convergence in L2 of Stochastic Ising Models

167

[12] D. Stroock and S. R. S. Varadhan Multidimensional diffusion processes, Springer-Verlag,New York, 1979. [13] W. G. Sullivan, A unified existence and Ergodic theorem for Markov evolution of random fields, Z. Wahrscheinlichkeitstheorie verw. Geb., 31 (1974), 47-56. DEPARTMENT OF MATHEMATICS COROLADO, BOULDER BOULDER,COROLADO 80309 U.S.A.

UNIVERSITY OF


On the Asymptotic Behavior of the Fundamental Solution of the Heat Equation on Certain Manifolds Nobuyuki IKEDA') Dedicated to the memory of Dr. Hitoshi Kumano-go.

0 1. Introduction Let M + be an open d-dimensional manifold with (d- 1)-dimensionaI smooth boundary N endowed with a smooth Riemannian metric g +. We consider the case where there exists a symmetric double M = M +U N U M - of the Riemannian manifold ( M + ,g + ) . Let g be the Riemannian metric of M and A be the associated Laplace-Beltrami operator, i.e., in local coordinates (2,x2,. . .,xd)

where g&) = g,((a/ax,),, @/ax,),), G = det k i j ) and (g") = (gtj)V'. certain cases, the coefficients G-'/2a(gtjG'/2)/ax~ occurring in A jump as x crosses the submanifold N, but the heat equation

au ~

at

=

1 -Au, 2

( t , x) E (0, w ) x M ,

still has the continuous minimal fundamental solution p of (1. l), (see [25] and Section 3). In this paper, we are going to study short time asymptotics of the minimal fundamental solution p of (1.1). Following Varadhan [25],we have the following asymptotic formula: if the Riemannian manifold ( M , g) is complete, then for fixed x,y E M ,

x # Y,

where p(x, y ) is the distance between x and y induced by the Riemannian metric g . In [3], Buslaev took up the matter of finding corrections to (1.2) related to the "short-wave asymptotic behavior of diffraction" on This work was supported in part by the Grant-in Aid for Scientific Research.

170

N. IKEDA

smooth convex surfaces.') The problem is to find how the asymptotic behavior of the minimal fundamental solution p of (1.1) reflects the shape of the hypersurface N of M (also, see [17]). To illustrate the situation, we first introduce some geometric notions. We first impose the following : Assumption (i). N endowed with the induced Riemannian structure is a totally geodesic hypersurface of M , i.e., every geodesic of N is also a geodesic of M. If the metric tensor is smooth, it is well known that N is a totally geodesic hypersurface of M if and only if its second fundamental form a vanishes identically (e.g., see [l] and [16]). However, in case where the Riemannian metric g is not smooth, although N is a totally geodesic hypersurface of M , we may assume the following: Assumption (ii). For every V E T,(N) such that g,(V, V) # 0, a,( V , V ) < 0 for every x E N. As usual, a geodesic joining x and y is called an enveloping ray if it is a curve having more than one point in common with boundary N of M ' , (see Buslaev [3]). Under the assumptions (i) and (ii), Buslaev [3] was led to the following conjecture: for a wide class of manifolds it holds that if the geodesic #,,*(t), 0 5 t 5 1, joining x and y , (x,y E M' U N ) , is an enveloping ray, then as t 4 0 , where C is a non-negative constant depending on the shape of the hypersurface N , the arc-length of #z,u([O,11) n N and p(x, y). Although the intuitive background of his considerations is transparent, it is heuristic. Buslaev [3] used the concept of continuum product integral to show (nonrigorously) the asymptotic formula (1.3). Our main aim is to give a rigorous proof of the asymptotic formula (1.3) for a special class of manifolds, (see Theorem 1 in Section 3). The constant C in the right hand side of (1.3) will also be calculated explicitly. We restrict ourselves to the case where x, y E N and x # y . Under the assumptions mentioned in the following sections, by using skew products of diffusion processes, we will reduce the problem to the Laplace asymptotic formula for Wiener functionals similar to Schilder's one in [21] (also see Donsker-Varadhan [5], [6] and Dubrovskii [7], [S]). For details, see Lemma 4.5. As a result, we finally arrive at the calculation of the Wiener *) For a deep connection between the various types of second order partial differenal equations, see [IS].

On the Asymptotic Behavior of the Fundamental Solution

171

integral E"[exp

{- z j :

Iw(s)lds)/w(l)

for

= 01

z

>0 ,

where E W [ . / w ( l )= 01 denotes the expectation with respect to the Wiener measure with the fixed endpoint 0. As proved by Kac [13], the explicit expression for this integral could be obtained in terms of Bessel functions of order 1/3 (or the Airy function), (see also [14], [I51 and [23]). The organization of the paper is as follows. In Section 2 we prepare some geometric notions which will be needed later. The assumptions and main results are stated in Section 3. In Section 4, by using skew products of diffusion processes and the Feynman-Kac formula, we give a sketch of the proof of main results. Section 5 contains the details of the proof. In Section 6, we will give some comments.

3 2.

Preliminaries

First we will give some notations and notions which will be needed later. Throughout this paper we assume that manifolds are connected and a-compact. In general, for every mapping we denote by the Let $ be a curve in a manifold. Then we denote by 4 differential of the tangent vector field of the curve $, i.e., 4 = &(d/ds) where d/ds denotes the standard vector field on the real numbers. Let a : R' -+ R' be a positive continuous function satisfying the conditions :

+

+.

+*

(A.1): ( i ) a([) = a(lE]) for s' E R'. of the function a to [0, co) is a non(ii) The restriction increasing C"-function. (iii) a(0) = 1 and a+(O)< 0 where a + ( [ )is the right derivative of a at [. Let us fix a positive integer d 2 2. Let S be a ( d - 1)-dimensional smooth complete Riemannian manifold with metric g. Consider the product differentiable manifold M = R' X S with its projections rl : R' X s-+ R' and z2:R' x S -+ S. From now on we use the following notations x' = n,(x) and

X = z,(x)

for x E M .

We define a Riemannian metric g on M by gzv, Y)=

(2.1)

szl((d*(x), (rJ*(Y)) + &'>-'&((d*(m,(x*)*(Y)) for X , Y E T z ( M ) and

x = (x', X) E M

172

N. IKEDA

where g is the standard flat Riemannian metric on the 1-dimensional Euclidean space R'. The Riemannian manifold (M, g) is called the warped product of R' and S by the function a-' (cf. [2]). Then it is easy to see that the Laplace-Beltrami operator A on M has the form

A =L

(2.2)

+ a(x')iI

where

L

=

a ax'

A(xl)-l-

(A(x')-a

3

,

and 0 is the Laplace-Beltrami operator on S. Let us consider the submanifolds M " and N given by

M + = {x;x = ( X I , x) E M , x' > O}, M - = {x;x = (XI, x) E M , x' < 0 ) and N = {x;x = ( X I , x) E M , x' = 0 ) . Let g + and g - be the restrictions of g to M + and M - respectively. It is easy to see that ( M , g ) is a symmetric double of the Riemannian manifold ( M + g, + ) . Although the Riemannian metric g is not C', we can still define the notions of the arc-length Z(q5) of a piecewise smooth3) curve q5: [a, b] + M and the global Riemannian distance p(x, y ) between two points x and y in M as in case of smooth Riemannian manifolds. As well known it is easy to see that the arc-length I(#) is independent of the parametrization of the curve. For details, see Varadhan [25], [26] and Milnor [18].

Lemma 2.1. Take any two points x,y E N . Then, for every piecewise smooth curve T,!P : [0, 11 + A4 joining x and y , there exists a piecewise smooth curve q5: [0, 11 -+ N joining these points such that

I(+) L I(#) where the equality can hold only if+([O, 11) c N. Furthermore

p(x, y ) = p(X, 7 )

for

x = (0, x)

and y

=

(0,p)

where p is the global Riemannian distance on (S, g). By piecewise smooth we mean 4 is continuous and piecewise Cm.


173

This is an immediate consequence of (2.1) and the assumption (A.l) and so the proof is omitted. Take two points x and y in M. As usual, we can also define the notion of geodesics joining x and y. Take a local coordinate (x2,x3,. . .,x") in 5'. Let Ttj be the coefficients of the Riemannian connection P in (S, g) with respect to the local coordinate (2,xs,. . ., x"). Now, keeping Lemma 2.1 in mind, we can show the following.

-

Lemma 2.2. Let us consider a local coordinate (2,x2, , x") such that x' E R' and X = (x2,x3, . .,xd) E S. Let # ( I ) = ( @ ( t ) ,$ ( t ) ) , 0 5 t 1, be a geodesic. Then (i) for every t such that # ( t ) E M\N,

for i

=

2, 3, . . ., d ,

where $ ( t ) = (#'(t), #'(t), . ., #"(t)) E S and a'(t) = da(E))/dCfort # 0, (ii) for every t E (a, b) such that #((a, b)) c N , #'(t) = 0

(2.5)

3 % dt2

-

Now let # ( t )

=

2

for i = 2, 3, . ., d

f:,($(t))&t)$X(t)

+

j,x=2

(#'(t), $ (t)), 0

5 t 5 1, be a geodesic of M and set

where k ( t ) is the inverse function of h ( t ) defined by h(t) = c

j' a(#'(s))ds ,

c =

([

a(#'(s))ds)-I

.

Then the curve $ ( t ) , 0 t 5 1, is a geodesic of S such that $(O) = $(O) and $(1) = $(1). This is an easy consequence of Lemma 2.2 and the proof is omitted. Before closing this section we give some remarks on assumptions (i) and (ii) in Section 1. We note that the Riemannian connection V + on M' can be naturally extended to the Riemannian connection V; on p.

N. IKEDA

174

Let a:T ( N ) x T ( N ) + T ( N ) I be the second fundamental form of the hypersurface N in ( Fg,) , i.e., for every X , Y ET J N ) and x E N , a,(X, Y ) is the normal component of (P:Y), (see [16]). Since, under the assumptions mentioned above, it is possible to choose a field of unit normal vectors globally on N , we can regard a as a mapping from T ( N )x T ( N ) to the space of C"-functions on N ([16]). Combining the considerations mentioned above we have the following: Proposition 2.1. 1) The hypersurface N in ( M , g ) is totally geodesic, i.e., every geodesic of N is a geodesic of M . 2) For every X , Y ET J N ) and x E N ,

where K

=

-a+(O).

The proof is straightforward and is omitted. Throughout this paper we denote by K the positive constant -a+(O).

0 3.

Main results

We consider the heat equation

with the initial condition (3.2)

u(O+,x)=f(x),

XEM.

Let p : (0, m) x M XM -+ (0, m) be the minimal fundamental solution of (3.1) with respect to the Riemannian volume m(dx), i.e., p is the fundamental solution satisfying the condition: Let f be a non-negative continuous function with compact support and u be a solution of (3.1) with (3.2). Then setting

we have

On the Asymptotic Behavior of the Frrndamental Solution

I75

From now on, throughout this paper, we assume the following: (A.2): There exists a bounded non-negative function b defind on [0, w) such that

(3.3)

1 - a(S) = KS - b(E)Ez

for

E E [O,

m)

I

Roughly speaking, the use of the stationary phase method combines the contributions to the asymptotic behavior of p from the various parts of the geodesic in a inultiplicative manner. Then, using the considerations of Varadhan [25], [26] and Molchanov [20], we can restrict ourselves to the calculations in a neighbourhood of the hypersurface N. Hence the assumption (A.2) plays a similar role to the following assumption: the function aIco,.,,is convex in a neighbourhood of the origin. Let 2, be the first eigenvalue of the eigenvalue problem (3.4) with the boundary condition (3.5)

d -u(A, dx

Of)

= 0.

It is well known that 1, > 0 (see 1121, 1141 and [23]). Let Q(3,p) be the set of all minimal geodesics of S joining X and 7,(n, 7 E S ) and set n(x, p) = #Sa(x,y ) , i.e., the number of elements of Q(X, 7). We are now ready to state the main results of this paper. Theorem 1. Let us assume (A.l) and (A.2). Take two points x = (0, X) and y = ( 0 , ~ )in N . V X and 7 are non-conjugate points along each element o f s Z ( ~7) , and I 2 n(X. 7)< CO, then

as t 5.0. Theorem 2. Let us assume (A.l) and (A.2). For every compact subset D of S, there exist positive numbers pt, i = 1,2, 0 < p, < pa < 00 such that

N. IKEDA

176

uniformly in X, J

E

D with pl 5 p(x, J )

p2.

Remark 3.1. If M is a smooth Riemannian manifold, the second term in the right hand side of (3.6) does not appear in the asymptotic behavior of p . For details, see Molchanov [18] and MinakshisundaramPleijel [19]. Therefore the second term is a correction depending on how smooth the Riemannian metric g is. Remark 3.2. Let us consider the heat equation with the Neumann boundary condition

where ajan denotes the differention in the direction of the normal of N . Let p + be the minimal fundamental solution of (3.8). Since ( M , g ) is a symmetric double of the Riemannian manifold ( M + ,g'), we have

__

where x* = (-XI, X) for x = (XI, x) E M + . Combining this with Theorem 1, we can obtain the asymptotic formula o f p + ( t x, , y ) as t J 0. Before turning to the proof of Theorem 1 we give a typical example. Example 3.1. Let us consider the Euclidean space Rd endowed the standard flat Riemannian metric g and we denote by Sd-' the (d - 1)sphere of radius 1 endowed the induced Riemannian metric g from Rd. Let M + be the exterior domain of Sd-' in R d . Let (r, 01,02,. . ., t 9 - l ) be the standard polar coordinate in Rd and we define a local coordinate (XI,x2, . ., xd) in M' by

Then, in the local coordinate (x',x2, . . -,xd),the metric tensor g + = (g;) of M is expressed in the following form : +


177

g:, = 1, g&(x) = u(x1)-1g&)

for x

=

(XI,X) E M' , and

for j

g&(x) = 0

where u(E) = (1

=

i, j = 2, 3,

. . ., d

2, 3, . . . , d

+ lE1)-z and

g&)

=

g3(a/axf,a/axj) ,

i,j = 2, 3,

. . -,d .

It is easy to see that the function u satisfies (A.l) and (A.2) with K = 2 and b(E) = (3 + 2E)/(1 + E)z. Let p' be the fundamental solution of (3.8). Now, by (3.9), we arrive at the following asymptotic formula: if X and J are not antipodal points on the Sd-' and x = (0, X), y = (0,J ) ,

as

t40.

We will again return to this example in the final section.

9 4.

Skew products of diffusion processes and the Feynman-Kac formula

Let W' be the space of all real continuous functions defined on [0, w). There exists a minimal one-dimensional diffusion measure { Q E ;E E R'} generated by L/2 (see [lo] and [ll]). We denote by Ef the expectation with respect to the measure Q , on W'. We also denote by E f [ - / w ( t )= 71 the conditional expectation with respect to the measure Q , on W' under the condition w ( t ) = 7 . Roughly speaking, this is defined by the usual formula

In Molchanov [20], a rigorous justification of various definitions of conditional processes with fixed endpoints was discussed in detail (cf. [22]). Let p be the minimal fundamental solution of

_a _u _- 1- - 2-4 , at

2

(tyX)€(O,w)XS

with respect to the Riemannian volume FFZ on S. We set

N. IKEDA

178

where $;'(w) is the inverse function of q 5 r ( ~ ) . Then, letting + ; I ( W ) inverse function of +,(w), we have

be the

Combining this with the formula (2.2) and using skew products of diffusion processes, we have

where k(t, E, 7 ) is the minimal fundamental solution of

(4.3)

-au - - L-u ,I at 2

(t,X)E(O,W)XR1

with respect to the measure A(C))df, (see [lo] and [ll]). Combining (4.2) with Molchanov's results (Theorem 2.1 and Theorem 3.1 in [20]) we have the following lemmas.

Lemma 4.1. Assume (A. 1) and (A.2). For every compact set D there exists a positive constant po such that for every p, with 0 pl as t -1 0,

c S,

< < po,

where Hl(f):[0, w) -+ R' is a positive continuous function with Hl(0) = 1.

0

for

t 2 to.

To avoid non-essential complexities, from now on, we assume (A.3). For the simplicity of notations, we set

N. IKEDA

180

We now consider the l-dimensional Wiener measure P y on W' starting at E and denote by EY the expectation with respect to P y . Then, setting

for p

>0,

K(d - l)Jt .t(l, w;O)}/w(I)

=

X k(t, 0, 0)

we obtain the following Lemma 4.3.

For every p

> 0,

(4.9)

+

4

01

where t(t, w;z) is the local time at z of the l-dimensional Brownian motion { w ( t ) , t 2 0 ) and E r [./w(l) = 01 denotes the conditional expectation with respect to P r under the condition w(1) = 0. Proof. By using the transformation of drift by C and (2.3) it holds that for every bounded continuous function j

Setting u ( t ) = (log A(E))/2 and using Tanaka's formula (see III-(4.1) in [lo]), we can rewrite (4.10) in the following form


181

qs'

- 2t

0

(1 - a(w(ts)))ds)z(s' a(w(ts))ds)-l}

Using the scaling property of the Brownian motion, we can again rewrite this in the equivalent form

(f)

(d-l)/Z

exp { - p ' / 2 t } E , W [ ( A 1 ( J f w ) ) - ( d - 1 ) / i

{

x exp - $ ~ ( 1 / tw)

+ a ( ~ , ( ~ ~ I) t . ) ) ) j

which completes the proof. The Feynman-Kac formula allows one to obtain the asymptotic formula related to the Wiener integral in the right hand side of (4.9). Let {An}, 0 < 2, < A, < . . . and {q5n} be the eigenvalues and the normalized eigenfunctions of the eigenvalue problem (3.4) with (3.5) respectively. Then we have

N. IKEDA

182

For details, see [9], [12], [13], [14] and [23]. The following is an easy consequence of (4.1 1).

Lemma 4.4. As t

4 0 ,for every p > 0,

Furthermore, for every 0 in p with C, 2 p 2 C,.

< C, < C,
0,

Since u and C are bounded functions, it is easy to show that for every p > 0, as t .10, Er[exp { -p2A,(d tw)/2t}Gj1)(,lt w ; p)/w(l) (4.14)

=

0]

= E,W[exp { -pzA2(d 7 w)/2t}Gt2)(47w;p)/w(l) = 0](l

+ O(t))

, uniformly in p. In the following section, by using direct asymptotic evaluations of (4.15)

E,W[exp { -p2A,(dJt w)/2t}GjZ)(z/fw ;p)/w(l) = 01 ,

we will prove the following:

Lemma 4.5. As t J 0,fur everyp

> 0,

(4.16)

E,WMP {-pZA,(J7w)/2t}Gt2’(J -~ t w ; P)/W(I) = 01 E,W[exp { -P‘A,(Jt w ) / 2 t } ~ t ~ ywd;tp)/w(l) = 01

(4.17)

Er[exp { - p z A ~ ( ~ t ~ w ) / 2 t } G ~ 3p)/w(l) ~ ( ~ / t= w ;01 +I, E,W[exp { - p 2 A , ( d 7 w ) / 2 t } / w ( l ) = 01

~

~

1,


183

and

(4.18)

where

(4.19) Furthermore, for every 0 < C, < C,< 00, (4.16), (4.17) and (4.18) hold uniformly in p with C, p 5 C,.

On the other hand, it follows from Lemmas 4.2 and 4.3 that under the assumptions of Theorem 1

(4.20)

x EY[exp (-p(x, y)2A2(Jfw)/2t}GI’)(J~i w ;p(x, r))/w(l) = 01 x Hz(P(4JMl + o(1)) *

The proof of the theorems. Combining (4.14), ‘(4.20) and Lemmas 4.4 and 4.5 we can easily complete the proof of Theorem 1. Furthermore, by using Lemmas 4.1, 4.4 and 4.5 and (4.14) we arrive at the conclusion of Theorem 2.

It should be noted that it is sufficient for the proof of (3.6) to show

0.

By the Markovian property of the Brownian motion, we have Pow"; Iw(s)lds 5 Azt1/6,€t-'/6 5 M,(w) 5 A,t-'/"lw(l)

5 Pow[aaZ(w)- a,(w) (= A2P, a,(w) < l / w ( l ) = 01 =

(5.14)

J-; PF[a,(w)

E

=

1

0

d s ] P L [ o , ( w )I (A#) A (1 - s)/w(l - s)

=

01


x exp

= Jz(t, a),

I89

[- 2(1 -(t")2 ]dud% s - u)

(say).

E;

Since 3Aztd

< ( ~ t - "-~ t'>z

for every

t, 0

< t < tr ,

we have

Combining this with the inequality (t"Y

4 2 4 s - u) exp

[- 2 ( 1 - s - T ) I % z F for u

1

1

< (Aptd)A (1

- s),

we have

Hence, for every t with 0 equal to the following

< =

< t < t,,

8A2 d2nA,E

t3d/2

the last term of (5.14) is less than or

exp [ -A,/t 'I3]

5 exp [ -A s / t 1 / 3.]

This completes the proof. We now turn to the proof of Lemma 4.5. Take two any positive numbers 0 < C, < C, < 00 and fix an any positive number E . Take sufficient large constants A,, i = 1 , 2 , 3. Then, by combining Lemma

N. IKEDA

190

5.3 with Lemma 5.4, we can conclude that there exists a positive number t, independent of p with C, p 5 C, such that for every t, 0 < t < t,,

+ exp [ -A 3 / t 1 q By combining this with Lemma 4.4, we get

Since e is any positive number, this implies (4.18). Similarly, using Lemmas 4.4, 5.1, 5.2, 5.3 and 5.4, we obtain (4.16) and (4.17).

0 6.

Remarks

In this section, we give some remarks on the asymptotic behavior of p ( t , x,y) in the case where x = (0, X) E N a n d y = (y’, p) 4 N . For the sake of simplicity, we assume that there exists a unique minimal geodesic #(t) = ($(t), &t)), 0 5 t 5 1 , joining x and y. We also assume that the geodesic 4 is an enveloping ray. Then, as stated in the section 2, there exists tl, 0 t , 1 such that

<
:Xt(f) =

*

This is clearly an OU 9(g)-process with characteristic norm pa = p. Since X , : (g,p") + (L,11 llL) is isometric, it can be uniquely extended to It is easy to check that B = an isometric map Xt: (E,p) 4(L, /I \IL). { X t , t E T } is an OU 9(E)-process with charactristic norm p H = p. Since ( E , p ) is isomorphic to (Xt(E),11 [IL), our separability assumption of (L, 11 / I L ) implies that (E,p) is a separable Hilbert space. Using the natural injection 8 : E -+E we obtain X from X as follows:

XU>= mY)*

(2.7)

Definition 2.5. X is called the completion of X . If X = X, then X is called complete. Theorem 2.3.

X is a complete OU L?(E)-process, then

Ornstein-Uhlenbeck Processes

(2.8)

X,(E> = M 4 8

20 1

*

Proof. Since X = X and so ( E , p ) = (E,p), the argument above implies that ( E , p ) is a separable Hilbert space. Being isomorphic to (E,p), (X,(E), 11 llL) is also a separable Hilbert space. Hence X,(E) is a closed linear subspace of (L, (1 \IL). Thus we obtain

M,Q

= CLS(X,(E))= X,(E)

.

In the subsequent sections except in section 9 we assume that X is a complete OU P(E)-process, so that (E, p ) is a separable Hilbert space and X , ( a = M,(X).

9 3.

The characteristic operator

Let X = { X , } be a complete OU 9(E)-process, and p its characteristic norm. Then (X,(E), 11 IIL) is isomorphic to (E,p) and X,(E) = M,(X), as we mentioned in the previous section. Since Xis centered Gaussian and Markov, we obtain

m,(f 1I M,(X))

= -&X,(f

1I Mo(X)) E Mo(X) = Xo(E)

9

for t 2 0, so we can find a unique g = g ( t ,f ) E E R X d f 1I MIAX)) = Xo(g(t,f

Y

which implies that (3.1)

QXt

+B(f

1I M ; ( X ) ) = Xs(g(t9 f 1)

Y

Yt s 2 0

3

by the stationarity of X . Defining an operator S , : E -+ E by S,f = g ( t , f ) for each t 2 0, we can rewrite (3.1) as follows: (3.2)

~‘(X,+,(f)IM,(X))=Xs(S,f),

t 2 0 ,S

E T .

Theorem 3.1. { S t , t 2 0) is a strongly continuous semigroup of linear contractions on the separable Hilbert space (E, p ) . Proof: Sincef- ,!?(X,(f)lM;(X)) is linear inf, (3.2) (s=O) implies that S, is linear. Since P ( S , f ) = llXO(~tf)IlL= l

5 IIX(f)llL

l ~ ( x ( f ) l ~ m ) l l L

=P ( f )

Y

S, is a linear contraction in ( E , p ) . To prove the semigroup property, observe

K. IT6

202

( M i = M8 (r n ) (by M ; 2 Mi) = &jw,+,(f)lM;)IMo) = &~,(Stf)IMi) (by (3.2)) = XO(S8Stf) (by 3.2)) *

XO(S8.J) = &K+t(f)IM,)

So = I is obvious. Hence {S,, I 2 0) is a semigroup of linear contractions in (E, p ) , which is strongly continuous, because

P ( S , f - f ) = llXo(Stf) - XO(f)llL =

I1m 3 f ) - Xo(f>I MdX)) IIL

5 IIXt(f>- XO(f)llL

+

0

( t -1 0)

-

Now we will apply the Hille-Yosida theory [5] to {St}. Let A be the infinitesional generator of {St}. Since (E,p ) is a separable Hilbert space, A is characterized by the following three conditions : (A.l) A is a closed linear operator in E and D = 9 ( A ) is dense in E. (A.2) a(1- A ) = E. p ( A f , f)5 0, f e D. (A.3) (dissipativity) Also fED===+S,fED,

d

-Slf dt

=

AS,f = S,Af,

where the differentiation is taken in the sense of norm convergence in ( E ,p ) for each f. Definition 3.1. A is called the characteristic operator of the OU 2’(E)process X . Since X = { X t } is centered Gaussian, the finite joint distributions { X t ( f ) } t , , are determined by its covariance function, which is given as follows : Theorem 3.2.

203

Omstein-Uhlenbeck Processes

Similarly for the case s 2 t.

5 4.

Three Hilbertian seminorms on D

We define two Hilbertian norms q and r and a Hilbertian seminorm b on D = 9 ( A ) as follows:

(4.1) (4.2) (4.3)

+

q ( f , g) = P ( A s) P V f , A d . r ( f , g) = P ( ( 1 - 4 A (1 -

*

b ( A g) = - P M A g) - P ( A A g ) *

It is obvious that q is a Hilbertian norm and r and b are Hilbertian seminorms by the dissipativity of A . We can see that r is a Hilbertian norm by observing

Thus we obtain the following: Theorem 4.1.

(4.4) Theorem 4.2.

(D, q ) and (D, r ) are separable Hilbert spaces.

Proof. Let G be the graph of A , i.e.

(4.5)

G:

=

{(A A f ) : f ~D}.

Since A is a closed linear operator, G is a closed linear subspace of the separable Hilbert space ( E , p ) @ ( E , p ) . Hence ( G , p O p ) is also a separable Hilbert space. Being isomorphic to this space by the projection (f,Af) H A (D, q ) is a separable Hilbert space and so is (D,r ) by (4.4).

0 5.

The innovation process

Let D: = 9 ( A ) as in the previous sections. We define an LF(D)process B = {B,} whose increments {Bat}are given as follows:

where the integral is the Bochner integral. Definition 5.1. The S(D)-process B = {B,) is called the innovation process of X . This name will be justfied at the end of 8 8.

K. IT&

204

Theorem 5.1. The innovation process is a Wiener P(D)-process with characteristic norm b (b(f)2 = - 2p(Af,f),5 4). Proof. B = {B,}is obviously centered Gaussian. It is obvious that B J D ) c M:(x)

(5.2)

nM ; ( x ) ,

s

) L = m , , ( f ) m L , ( g ) IM;(X)N = 0 * Next we will check that

(5.5)

IIB8,(f)ll~= (t

- s>b(f)2*

To do this we will use Theorem 3.2: ( X s ( f > , X t ( d ) L = P ( f , St-sg)

9

t 2s*

Since there is no possibility of confusion, we will omit L and p in the proof below.

205


- 2(f, Af) ( t - s) = (t - s ) b ( f y .

=

This proves (5.5). Now it is easy to derive the following from (5.4) and (5.5) :

( ~ ~ ~B(u. tm- )L= ~ KS,

(5.6)

ti n (u,~ I I ~ (g), . LL g E D

I

Hence B = {B,}is a Wiener 9(D)-process with characteristic seminorm b by Theorem 2.1. We denote by M;(dB)(resp. M:(dB)) the following subspace of

(L?II lid:

< u 5 2 , f €D} (resp. CLS{B,,(f): t 5 u < u , f e D } ) . CLS{B,,(f):

U

Similarly we define the a-algebras a;(dB) and a:(dB).

Theorem 5.2.

( i ) M;(dB) c M ; ( X ) ,

u,(dB) c a;(X) ,

M;(X) ,

u:(dB) 1a;(X) ,

(ii) M:(dB) where

1means orthogonality in (L, 11 /IL)

and IL means independence.

Proof. The statements for the subspaces of L are obvious by (5.2) and (5.3). Hence the statements for the a-algebras follow at once, because X is centered Gaussian.

K. IT8

206

Definition 6.1.

X d is called the deterministic part of X.

Hence X d is centered Gaussian. Proof. Use (6.3) and (3.2).

Proposition 6.4.

Hence X d is stationary by the centered Gaussian property of X d . Proof. Obvious from (6.7) and the definition of p d .

207


Proposition 6.5. Mt(Xd)C M ( X d ) = M - , ( X ) = M , ( X d ) = M - _ ( X d ) .

(6.9)

Proof. The first inclusion relation is trivial. M t ( X d )C M - , ( X ) by the definition of X d . Hence M ( X d )C M-,(X), SO

c M ( X d )c M-,(X).

M - _ ( X d ) c M,(Xd)

Suppose that Y E M-,(X).

Then Y E M : , ( X ) , so we have

Projecting this vector to M - , ( X ) , we obtain

This implies that we can find Y, E M:.(Xd) such that J J Y- Y,JI Since Yk+, E M:.(Xd) for every k,

< n-'.

Y = lim Yk+, E M:,(Xd) k-m

for every n, so Y E M - , ( X d ) . Hence we have M-,(X) C M _ , ( X d ) , which completes the proof. Proposition 6.6. X d is Markov.

Proof. Because of the centered Gsussian property of X d it suffices to observe that for t > s

mxf)IM Q ( X d ) )

=X U ) =X3St-J) E

(by (6.9)) (by (6.7))

M,(Xd).

Proposition 6.7.

(6.10) (6.1 1)

Pd(StL Stg)

= P d ( Ag )

- w : ( f ) X : ( g ) ) = Pd(S,+uf,S,+ug)

for every u 2 max( - s, - t ) . Proof. By the definition of p d (Proposition 6.2) we obtain the first equality, which, combined with Proposition 6.4, implies the second equality.

K. IT6

208

Proposition 6.8. (6.12) llX1Z+h(f) - X?(f)IIL

5 IlXt+hW - X,(f)/lL + 0

(h

+

0;.

Proof. Obvious by the definition of X d . By Proposition 6.1, 6.4, 6.6, 6.8 and 6.2 we obtain the following: Theorem 6.1. X d is an OU 9’(E)-process with charactreistic seminorm p d and Xf( f)is a-,(X)-measurable for every t E T and every f E E. And so u(Xd)is independent ofa(dB) (Theorem 5.2 (ii)).

0 7. A special Wiener integral Let B = {B,} be the Wiener 9(D)-process with characteristic norm b introduced in 5 5. We denote its increments by {BSt},=

(7.1)

Y(Q,f>

We want to define an integral of the type:

First consider the case where Q, is a step operator-valued function of t vanishing outside of a bounded interval [a, b). Then there exists a decomposition of [a, b)

d:a=a, : = - (4g ) - (f,A g )

defines a Hilbertian seminorm on D = B(A). Hence we can construct a Wiener g(D)-process B with characteristic seminorm b (Theorem 2.1). Also we can assume that B and X d are independent. Define (9.7) We can easily prove that X = { X t } is an OU Y(E)-process whose deterministic part is X d and whose innovation process is B. Restoring the bar we denote this process by X. X is an OU 8 ( E ) process. Defining

K. IT6

214

X( f ) : = x(8f ) (8: the canonical injection: E -+ E), we obtain the process X which is to be constructed. Proof of the uniqueness. Let X be any OU P(E)-process with characteristic norm p and characteristic operator A . p determines (E,p). A determines the semigroup { S t } , { S , } determines the covariance functional Vs. of the completetion X. V,, determines the finite joint distributions of X and so those of X .

9 10.

Continuous regular versions

In the previous sections we observed an OU P(E)-process X = { X , ( f ) } and its innovation process B = { B , ( f ) } which is a Wiener 9 ( D ) process. In this section we define their continuous regularizations to discuss the properties of these processes more thoroughly. c, cl, c2, stand for positive constants throughout this section. (a) A Gelfand triple {(&, rg), (D, I ) , ( K II ID}. In 8 4 we introduced a separable Hilbert space (D, r ) :

- -

D = N4, r ( f ) = P ( ( 1 - A ) f )

(10.1)

-

Let {d,} be an orthonormal base (abbr. ONB) in (D, r ) such that d, E 9 ( A 2 )for every n. Note that such an ONB exists because D(AZ)is dense in (D, I ) . Let {An} be a sequence such that

lAl":=cA;.

L>O,

(10.2)

n

Then K : = C Andn@d,

(10.3)

n

is a strictly positive definite Hilbert Schmidt operator in (D, r ) . Define a separable Hilbert space ( D K ,r,) as follows:

D,: = K(D) , r K ( f ) := r(K-I f ) =

(cr(f, dn)2A;2)1'2, n

rK(J;d:= r ( K - ' f , K-'g) =

C ~ ( fd ,M g , d n X 2. 7l

It is easy to check that (10.4)

The sequence

r(f)S

~rK(f),

c = SUP

{An}

. * >

The triple (10.5)

{(Dm y,), ( D , r ) , (H,

I1 ),1

*

is a Gelfand triple of Hilbert spaces. The ONB in ( H , 11 11) dual to {en} is denoted by {e;}, i.e.

(4,en> =

L 7 i

The a-algebra on H generated by the 11 11-open subsets (equivalently the a-algebra generated by the weakly open subsets or by the half spaces: { x E H : ( x , f) S a}, f E D,, a E R) is denoted by g ( H ) and is called the Borelsystem on H . We define Borel measurable functions on H , Borel measurable maps from H into H a n d H-valued random variables (abbr. Hvariables) on (Q, 9, P) with respect to 9?(H). ( x , y ) is Borel measurable in x E H for every f E D,. If Y(w) is an H-valued random variable, then ( Y(w),f ) is a real random variable. Let {S,} be the semigroup of linear contractions on ( E , p ) we have introduced in 5 3, and let A be its generator. When we restrict {S,} to D , we obtain a family of linear operators {Sf}. Since S,D c D and S,Af = A S , f for f E D , Sf carries D into D. Since

mf

r ( S f f 1 = P((Z 1 = P(SdI - A ) f1 SP((1- A ) f ) = r ( f ) , fED, Sf is a linear contraction on (D, r ) . Similarly we can prove the following:

Theorem 10.1. {Sf} is a strongly continuous semigroup of linear contractions on (D, r ) with generator A D := the A restricted to 9 ( A D ) : = 9 ( A 2 ) . The resolvent operator ( I - A ) - ' : E - t D gives an isomorphic map from (E,p ) to (D, r ) which transforms S, to Sf and A to AD. From now on we restrict our consideration to ( D , r ) and omit the index D in Sf and AD for notational simplicity.

K. IT6

216

(b) Stochastically r-bounded H-variables and r-bounded LRF's. Y(o) be an H-variable. Y is called stochastically r-bounded if E((Y,fY)SC:r(f)2,

Let

fEDK

Y is called centered Gaussian if ( Y,f ), f E DK form a centered Gaussian system. The family of all stochastically r-bounded H-variables is denoted by B ( H , r). B ( H , r) is a vector space with the usual linear operations. Let X( f ) be an LRF on D. X is called r-bounded if E ( X ( f I 2 )5 C$(f)Z

3

fE D.

The family of all r-bounded LRF's on D is denoted by 9 ( D , r). 9 ( D , r) is also a vector space. For Y E B ( H , r) we define

X : D,

4

L,

f-

X(f)(O)

=

(Y(o),f)

*

Since Y is stochastically r-bounded, X is a bounded linear map from (DK,r) into (L,(1 1 ) and so it is extended to a bounded linear map from (D, r) into (L, 11 11) because D, contains en,n = 1,2, . and so is dense in (D, r). The X , thus extended, belongs to 9 ( D , r ) . The map Y H X is a linear map from d ( H , r) into 9 ( D , r ) . This map, denoted by 'p, is injective. Suppose that 'pY = 0. Then ( Y , en) = 0 a s . for every n. Hence

--

P{( Y, en) = 0 n = I , 2, . . - } = 1 . Since Y(o) E H, this implies that P(Y = 0) = 1, proving the injectivity of 'p. In fact we have the following: Theorem 10.2.

'p:

g ( H , r) -+ 9 ( D , r) is bijective.

Proof. It suffices to prove that 'p is surjective, i.e. that for every X E 9 ( D , r) we can find an H-variable Y such that 'pY = X ; then Y EB ( H , r) follows automatically. Observing that

we obtain

C X(e,)z < n

Hence

00

a.s.

Ornstein-Uhlenbeck Processes

Y:=

C X(e,)eL

217

(in norm convergence in (H,

n

11 1 )

defines an H-variable. Observing that E(( Y ? f ) 2 )= E ( ( C x(efl>(eLf))')

c

4 E(C X(efl)z> (4,f ) z 5 c,"lJ12r,(fI2 n n for everyfc D,, we will check that (oY = X , i.e. (Y,f) = X ( f ) a.s.,

f~ D K .

This is obvious by the definition of Y for f = e f l and so for every f~ F: = LS (el, e,, . . .). For every f E DK we can find f f lE F such that (and so r ( f - f f l )d 0 by (10.4)) .

r K ( f - fn) --+ 0

Hence (Y,f) = 11 llL-lim(Y, fn) n

=

11

llL-limX ( f n ) = X ( f ) a.s. fl

Definition 10.1. For X E 2 ( D , r ) the H-variable Y : = q 1 X E g ( H , r ) is called a regular version of X . Theorem 10.3. Suppose that Y E 9 ( H , r ) . (i) ~(llY113< m, (ii) If Y is centered Gaussian, then (10.6)

E(llYll'")

s C(4E(IIYIlZY
s.

We can prove similarly as the above

E[IN,(x,1) - NAY, /1>lP1

s+ :

2 K’ It - sip" +

(

< K”(It - sp’z

+ luei - y - yue,~“Pdu ( x - y p + ( 1 - PI“”). [x

Then by Kolmogorov’s theorem, N,(x, 1) has a continuous extension at any ( t , x , 0). This means that ( f ( x ) , M ) , is continuously differentiable in x . Let 1 tend to 0 in (1.6), then we get

Repeating the argument inductively, we get the lemma. The following formula for changes of variables is a basic tool in our later discussions.

Theorem 1.4 (Ganeralized It6’s formula). Let F,(x), x E Rd, t E [0, T ( x ) ) be a local random field continuous in (t, x ) a s ., satisfying ( i ) For each t, F t ( . ) is a C3-mapfrom D,= { X I T ( x ) > t } into R’ a.s. (ii) For each x, F,(x), t E [0, T ( x ) ) is a continuous local semimartingale represented as Ft(x) = Fo(x)

+ 2 s’f..(x) j=1

0

dNi,

0

where N i , ’ . ., N r are continuous semimartingales, f j ( x ) , x E R d , t E [0, T ( x ) ) are local random fields continuous in (t, x ) a.s., C2 -maps from D , into R‘for each t as., and represented as fi’(x) = fl.’(x)

+

k=l

s’ 0

gJVk(x)dO:,

where Ot, . ‘ ‘, 0: are continuous semimartingales and g!sk(x),x B R d , x E [0, T ( x ) )are local random fields continuous in (s, x), C’-maps from D,into R‘. Let now M , = (M:, . -,M f ) , t E [0, T ) be a continuous local semimartingale and T’ = inf { t > 0; M , $. D,} A T. Then F,(M,), t E [0, T ’ ) is a continuous local martingale and is written as (1.7)

+ 5 [Lj(M.)

F,(M,) = Fo(Mo)

3=1

0

0

dNJ

+

Stochastic Partial Differential Equations

2.55

The proof is found in [5], Chapter I, Section 8 in case F,(x) etc. are global random fields. The extension to the present case is not difficult. The above is a differential formula for the composition of two stochastic processes. Writting m

dF,(x) = C j i j ( ~ o) d N j , j=1

the formula is written in short as

Hence it takes the same form as the classical differential formula for composite functions.

Q 2. Stochastic partial differential equation and the associated stochastic characteristic equation

A first order nonlinear stochastic partial differential equation of the parabolic type that we will consider is expressed as n

(2.1)

du, = C F(')(t,X , u,,au,) dW: 0

j=1

+ Fco)(t,X , u,,au,)dt .

Here, F(')(t,x , u,p), j = 1, ., n are continuous in (2, x , u, p ) E [0, m) x Rd X R' x Rd, continuously differentiable in t and C" +2-n-functionsof

>

( x , U,p ) fot some a 0. F(')(t, x , u, p ) is continuous in ( t , x, u, p ) and a C""."-function of (x, u , p ) . Here m is a positive integer greater than or

equal to 3. au, means the vector ( a p t , . ., adu,). W, = (W:, ., W;) is a standard n-dimensional Brownian motion. The equation may not have a global solution in general except for linear or semilinear equation. Hence we will define a local solution precisely.

Definition. Given a C1-function #(x), x E Rd, a local random field E [0, T ( x ) ) with values in R1 is called a local solution of equation (2.1) with the initial condition u,,(x, o) = $(x), if the following conditions are satisfied. ut(x); x E Rd, t

(2.2)

T = T(x,o)is an accessible, lower semicontinuous stopping time.

(2.3)

u,(x), 0

t

< T ( x ) is a local C'+-semimartingale and satisfies

H. KUNITA

256

for all (t, x ) such that t < T ( x ) a s . If there is a solution of equation (2.4) such that the terminal time T(x,o)is infinite for all x a x , we will call it a global solution of (2.4). For convenience we put always W: = t. Then the above equation isiwritten as (2.4’)

u,(x) = #(x)

+ 5J‘ F(j)(r,x , u,(x), aur(x)) dW! . o

j=o

0

In the classical theory of deterministic partial differential equation of the first order, the characteristic curves associated with the partial differential equation play an important role. Our study of the stochastic partial differential equation of the first order is also based on the stochastic characteristic curves. As usual, we will use notations &,*‘ = a,F, F, = (F,,, . . ., FZd). Sim., F P J . For two vectors p , q, p . 4 ilarly, Fp, = aFIap, and Fp = (F,,, denotes the inner product. The stochastic characteristic equation associated with (2.4) is defined by

-


257

+

where T ( x , u, p ) is the explosion time. SinGe F Y ) , FCJ)- Ff).p, FYI FLj)p,j 2 1 ( j = 0) are Cm+'~u(Cm*u)-functions of ( x , u, p ) , the solution has a modification which is a local C"Tfl-sernimartingale with any p less than a. Furthermore, for almost all w , the mapping ( f t ( . , w), v t ( ., w), C,(. ,w ) ) from the domain {(x,u,p)I T(x, u,p) > t } into R d X R'X Rd is a C"diffeomorphism for any 0 < t a s . See [ 5 ] , Chapter 11. Let $(x) be a function of C1+'+-classwith I m, which corresponds to the initial function of equation (2.4). We define local CIT#-semimartingales f,(x), q,(x), [,(x), t E [0, T ( x ) )where T ( x ) = T(x, $(x),a$(x)) as follows.

We shall study the process $,(x) in details in the remainder of this section. Processes T,(x) and [,(x) will be discussed at the next section. The map gt( -,w ) : { x I T ( x , w ) > t } into Rd is not a diffeomorphism in general, since the Jacobian matrix a,",(x) can be singular at some I less than T(x). Define

(2.8)

r(x) = inf { t

> 0; det aE,(x) = 0) A T ( x ).

It is an accessible, lower semicontinuous stopping time. Further, it holds lim,,,,,, det af,(x) = 0 if r(x) < T(x). We will show that f, defines a diffeomorphism if we restrict the map $, to the domain {r > t } . It is convenient to introduce adjoint stopping time of T ( X ) in the following manner.

> 0; Y 4 Et({r > f})}, where ,",({r> t } ) is the range of the set { x I r(x) > t } by the map F,. Lemma 2.1. ( i ) The map $, from the domain {r > t } into Rd is a C1-direomrophismfor all t a s . (ii) The inverse E;'(y), t < o(y) is a local C1-"kernimartingale and

(2.9)

a ( y ) = inf {t

satisfies

(iii) o ( y ) is an accessible, lower semicontinuous stopping time and satisfies that i f o ( y ) < 03

H.KUNlT.4

258

Proof. Consider the following SDE for K , : (2.11)

drr, =

5

aEt(K,)-’F;j)(f,$, o c,,

4,

0

K,,

j=O

g, o c,) o d W i .

For any y E Rd, there is a unique solution K ~ ( Y ) such that ~ ~ (=y y) and K,(Y) E {x I ~ ( x ) t } for any t E [0,6(y)),where 6(y) is the terminal time of

>

It holds

c,(Y)which is accessible and lower semicontinuous.

We shall prove

E, o ~ , ( y=) y n

(2.13)

dtt(x) =

-CF j=O

for t

< d(y).

Note that E,(x) satisfies

L’) (6 $,(XI, at(x), Ux))

0

d 4

where the integrands FF) are local CzJ-semimartingales. Applying ItB’s formula to Faj), it turns out that they are represented as C;,OGif)(t, x)o dW2, where GLj)(t,x) are local Ct*fl’-semimartingaleswith p’ < p. Therefore we can apply generalized ItB’s formula to c,(x) and c,. Then it holds d($, 0 K,) = dct(Kt)

+ C a2$t(~,)dx: . 0

i

Observing (2.13), the first term of the right hand side is

In view of (2.1 l), the second term is

2a t t ( K , ) a ~ t ( C ~ ) - l F k ” ( f ,ct

0

K,,

0

K,,

j=O

n

=

C Fkj)(t, E,

0

K,,

9,

0

c,,

5,

0

5,

Therefore we have d(Et o K , ) = 0, which implies Define now

0

1)

,

If T = co, this means lim, &;‘(y)

= co.

0

dWi

.

Et o c,(Y) = y if t < Y y ) .

(2.14) ~(x)= inf {t > OlC,(x) 6 {6 > t } or ldet I)X(,$(,C~

< C(x).

Kt)

c,) dW{

j=O

We shall prove x, o E,(x) = x if t

0

= oo}Ar(x).

Since $,(c,(x)) = x, we see

S!ochastic Partial DiJ-fereritialEquations

259

~~,(K,(x))~K,(x) = identity matrix. Therefore, equation (2.13) is written as

We can apply generalized ItG's formula to the following SDE d ( ~0 ,$,) = dK,(C,) -I-

ca&,(ft)

0

lit

and

ct.

Then K , 0 f , satisfies

d;;

a

=

C ~ E , ( K o, $,)-'F:)(t, $,, 7, K , f , , T}into R d . We have thus proved (i) of the lemma. ) Now substitute x = t ; ' ( y ) to K , o E,(x) = x. Then we get ~ , ( y= c;'(y) if y E $,({I: > t } ) . Therefore $'; satisfies (2.10). The stopping time b satisfies that if 6(y) < 00

In fact, a$,($;'(x))i3E;I(x)

is the identity matrix and (2,12) is satisfied for Therefore what is remaining is to prove cr = 3. It holds t3Et(f;l(x)). {b > t } Et({r> t } ) since ~ , ( y = ) t;'(y) holds for any y E t,({r > t } ) . From the definition of K , , we have ~~((6 > t } ) C {I:> t } , so that we have f t o ~'((6 > t } ) c ,$,({I: > t } ) . Since Ct 0 K ~ ( x=) x holds on {b > t } , we get {b > t } c Et({r > t } ) . Therefore we have { b > t } = E,({r > t } ) for ) any y a s . The proof is complete. any t a s . This proves d(y) = ~ ( yfor

H. KUNITA

260

Q 3. Existence and uniqueness of the solution We will now prove the existence and uniqueness of solutions of nonlinear stochastic partial differential equation of first order, using stochastic characteristic curves of the previous section. Main results are stated in the following two theorems.

Theorem 3.1. Let #(x) be a function of C1"*"-classon Rd with 2 I 5 m, a! > 0 and let t,)V L , 5, be processes defined by (2.7) and a(x) be the

stopping time defined by (2.9). Define u,(x) = q L ( & l ( x ) ) ) t E [0, u(x)). Then it is a local solution of equation (2.4). Further, the solution is a local C"-"~-semimartingalewith any B < a!. Theorem 3.2. Let u,(x), t E [0, T(x)) be a local solution of equation (2.4), where $ is a function of C1+lla-cIass.I f u,(x) is a local C2-'2"-semimartingale with 5 5 I 5 m, then it is represented as u,(x) = ?jt o E;'(x) for r E 10, T(x)Aa(x)). Since E, and 7, are uniquely determined by coefficients F ( J )j, = 0, . . .,m of equation (2.4) and the initial function #(x), the above theorem shows the uniqueness of the solution. It is conjectured that the solution u,(x), t e [0, a(x)) could not be prolonged beyond the time a@). We will

show this in case of quasi-linear equattion at the next section. In order to prove Theorem 3.1, we claim a lemma.

Lemma 3.3. It holds for i = 1, . . -,d

wit,,

(3.1)

a,q, =

(3.4

a,@, tF1) 0

=

[; 5;' . 0

Proof. For the proof of (3.1) set

We shall find a stochastic differential equation governing 8:. Observe the stochastic characteristic equation of q,. Since the integrands F") - F P( j ) are local C'*'-semimartingales, we can change the order of d and at and it holds da,q, = a,dij, by Theorem 1.2. Theorefore we have

261


Similarly we have

+

d(5, *a$,) = d i f t 0 dSt

Ct

ddtCt

0

n

=

C {Fi".d,$, + F Y ' [ , . d , f , } o d W {

j=O

-

5(d,FF)(t,E,, C,,q t ) ) . C , o d W i .

j=O

Therefore, 8: satisfies the following linear stochastic differential equation n

d@ = C FLj'8; 0 dW{ j=O

.

Since f o ( x )= x,q,(x) = #(x) and C,(x) = d#(x), we have 8; = 0. Then the above linear equation has a unique solution 0: = 0. This proves (3.1). We will next prove (3.2). It holds

a(?, E;')(x)

= dT,($~'(~))d~,($;'(x))-'

= dT,(E;'(x))Jf;'(x)

0

.

In view of (3.1), the right hand side of the above equals C,(E;l(x))d$,(~;l(x))d~,(E;'(x))* = S,(E;'(x))

*

This proves (3.2). Proof of Theorem 3.1. We can apply generalized ItB's formula to Then we get

]7,(x) and $;'(x).

d(q,

0

$;'I

=

dq,(f;')

+ 2 d,~,(f;')

0

d(f;'Y

2=1

.

The first member of the right hand side is n

C F("(t, .,qt

o

f ; ' , g, o $;)'

o

dWi

3=0

n

-

C=o F y ) ( t , .,q1

o $;I,

3

g, o $;')c,

o

$';

o

dW/

because of (2.5). The second member equals n

C FY)(t, ., 7, C;', J=u 0

5,o

$;').d77,($,')d$,($;')-'

o

dWi

because of (2.10). We know from Lemma 3.3 that C,o

$';

=

a~,($;')a;',(,E;')-'

=

a(?, t;')= du, . 0

H . KUNITA

262

Then we get du, =

5F(j)(t,.,7, t;',f, 0

0

f ; ' ) 0 dW5

3=0

=

5F'"(t,

3=0

, u,, au,) dW{ . 0

Therefore u, = 9, OE;' is a solution of equation (2.4). The fact that is a local Cz-'J-semimartingale is immediate. The proof is complete.

7,o E;'(X)

We will next consider the uniqueness of the solution. purpose, we require a lemma.

For this

Lemma 3.4. Suppose that u,(x)is a local C1-'~"-semimartingale. Then F(j)(t,x, u,(x), au,(x)) is a local C1-2'B-semimartingalewith /3 < a. Proof. By ItG's formula, it holds F'"(t, x, u , W , du,(x)) - F("(0, x, uo(x), auO(xN =

(3.3)

J: F W , x,u,(x), au,(x)> du,(x) + $ J: m3&x,u,(x), au,(x)) &us(x) 0

+ J p ( S , x, us(x),au,(x))dS. We will show that the first term of the right hand side is a local C1-',Bsemimartingale with ,B < a. For simplicity, we will write the integrand as F(s, x). Considering stopped process if necessary, we may assume that F(s, x), u,(x) and their derivatives up to ( I - 2)-times are all bounded and Lipschitz continuous. It suffices to consider the case that u,(x) is a Cz-',fl-martingale. Using Burkholder's inequality, we have

263


Therefore we have the estimate

Then by Kolmogorov's theorem, tinuous in (t. x ) .

s:

F(Y,x)du,(x) is locally Holder con-

To prove the differentiability, we set for I NAX,

I) =

l -(J I

t

E

R'\{O},

+ Ie,)du,(x + Ie*) - J: ~ ( s x, ~ , ( x ) .}

~ ( s x,

0

Then we get similarly as the above

+ Ix - YIP + II - pup>

E[IN,(x,I ) - NLY, p)I"I I Z(It -

*

Hence N8(x,I ) has a continuous extension at I = 0, proving the differentiability of

s'

F(s, x)du,(x). Repeating this argument inductively, we

0

see that the integral is a Cz-2ij-martingalewith 19 < a. We can prove similarly as Lemma 1.3 that the joint quadratic variation ( F ( . ,x ) , u(x)), is a Cz-2~fl-process of bounded variation. Therefore,

s:

the Stratonovich integral

F(s, x ) o du,(x) is a C1-2J-semimartingale.

By a similar method, one can prove that the second term of (3.3) is a Cz- 27a-semimartingale. Hence we have the lemma.

Proof of Theorem 3.2. Let u,(x) be a local Cz-'~"-semimartingale solution of equation (2.4). Then in view of the above lemma, we can apply Theorem 1.2 and we get a,u,(x) = ai$(x)

+ 5 J &(F("(s, x, u,(x), au,(x))} Crws . 0

O'J

0

Then using I t 8 s formula to F ( j ) ( t ,x , u,(x),au,(x)), it is written as

F 1:GY)(s, x )

o

dW!, where GLj)(t,x ) are local Cz-2,8-semimartingales.

Consequently we may apply generalized It& formula to u, and we get

(3.4)

4%E J 0

=

(dUt,)

+ caP'(EJ i

O

dfE *

EL.

Then

H. KUNITA

264

The first member of the right hand side is

because of (2.1).

The second member is -

C= o au,(,E,) . F(i)(f, T,,< t ,

0

dW:

J

because of (2.5).

Therefore from (3.4) and (2.5), we have

We will regard that the above two equations (3.5) and (3.6) form a system Note that u o o ~= o To = 4 of SDE’s for u, o f , - T~ and d u , o;, and duo = to= 34. Then we see that 11, o $, - ?jL E 0 and au, 0 5, f 0 are the unique solutions of (3.5) and (3.6). We have thus proved the theorem.

c,.

0

$, 4.

et

Quasi-linear stochastic partial differential equation

A quasi-linear stochastic partiol difcwtitinl equntioiz is expressed as

Here, P;(t, x, ZL) and Q3(r, s,u ) , j = 1, . . ., n, are continuous in (t, x, u), continuously differentiable in t , C”‘+’+-funtionso f x and u for some a! > 0.

stochastic Partial DifferentialEquations

265

PXt, x, u) and Qo(f,x, u) are continuous in (t, x, u) and CmYs-functionsof x and u. Hence, using the notation of Section 2, it holds

which do not contain the variable p . Therefore in the stochastic characteristic equation for the quasi-linear equation, ( f t , 7 , ) satisfies a closed system of a stochastic differential equation n

dE, = -

(4.2)

cP,(t, cc, 7 , ) d W i O

9

j=O

n

dric = C Q,(t, E t , 71) dWl' 0

9

j=O

(5, is not involved). Let ( f t ( x , u), C,(x, u)), t E [O, Tl(x, u)) be the solution starting from ( x , u) at t = 0. For a Cziu-function$, we define similarly as (2.7), = Tdx, $ ( X ) ) ?

}

ft,

0

dW-l: 3

S J } odWZ' -

By ItG's formula it hods d { ( A t ,B,)(att,i3vt),} = 0. Taking the initial condition of (A,, B,) as (I,0) where I is the dXd-unit matrix, we get (A,, B,)(i3ft,i3qc))"= I. This proves that the rank of the matrix (a:,, a?,) is d for any t a.s. Therefore we have

lim {Idet 8$,($;'(x))l

t 1O ( X )

+ [aq,(f;l(x))l}> 0

if u(x) < 00

.

,

Since lim, o(x) ldet a$;l(x)I = 00, we have lim,, (.), det a$,($;'(x)) = 0. This implies limt,rt(x) Ii3qt(E;'(x))] > 0. Consequently we have

The proof is complete. Example (Communicated by Y. Yamato).

Consider a quasi-linear

Stochastic Partial Differential Eqriations

267

equation on R'. (4.5)

The associated characteristic equation is

It has a unique conservative solution v,(x, u) = u and f,(x,u) = x - u W,. Therefore, E,(x) = x - $(x) W , and q,(x) = $(x). Then it holds aE,(x) = 1 - $'(x)Wt and ~ ( x = ) inf {t > 0; 1 - $'(x) W , = 0). E;' and ~ ( x are ) not written explicitly in general. Consider the case $(x) = x. It holds a(y) = inf{t > 0; W,= I} if y # 0, = co if y = 0. Then $;'(y) = y(1 - W J - ' if t < ~ ( y ) .The solution of equation (4.5)is u,(x) = $&'(x)) = x(l - W,)-'. Hence we have lim, , o ( z ) Iu,(x)I = co if x # 0. Consider next the case $(x) = x2. It holds a(y) = inf{t > 0; 1 - 4yW, = 0) and $;'(y) = 2y(1 dl - 4W,y)-'. Hence the solution is u,(y) = 4y2(1 dl - ~ W , Y ) - ~ It .holds lim,, ,,(y) u,(y) = 4yz and lim,, .,(v) au,(y) = -co i f y # 0.

+

+

A quasi-linear equation (4.1) is called semi-linear if coefficients Pj(t, x, u) of equation do not depend on u. Then the characteristic equation of semi-linear equation takes the form

Hence the solution e,(x) defines a flow of diffeomorphisms of Rd. Since $,(x) = e,(x), Jacobian matrix i3ft(x)is always nonsingular and the inverse F;'(X) is well defined for all t, x. Therefore, we have the following. Theorem 4.2.

Semi-linear equation has a unique global solution.

We will finally consider a linear equation. called linear if coefficients Q,(t, x, u) satisfies Q,(t,

X, U) =

Qr)(r, X)U

A semi-linear equation is

+ Qy)(t, . X)

We will assume that QY),j = 0, . ., n are continuous in (t, x), Csta-functions of x (C4vafor Qo) and their derivatives in x are bounded. The process Tt(x)= v,(x, $(x)) satisfies a linear stochastic differential equation. It has a global solution and it is represented by

Consequently, the global solution of the linear equation n

du, =

C { P j ( t ,x)&,

j=O

+ Q(l)(t,X ) U , + Q(”(t,x ) }

o

dW{

with the initial condition uo = $ exists uniquely and is expressed by

We will rewrite the above formula using the backward integral.

0

For

< s 5 t, we denote by F,,,the least a-field for which W, - W,; s 5 u,

u 5 t are measurable. Now let t be a fixed time and fr be a continuous Then the backward Strabackward semimartingale adapted to (Sr,,). tonovich integral is defined by

where A = {s = to < . . . < t, = t } is a partition of [s, t ] and [dl = max [ t,+l - t , ] . We denote by E8,,(x)the solution of (4.6) starting from x at time s. Then the inverse f ; t ( x ) is a continuous backward semimarActually it satisfies the backward stochastic tingale adapted to (S8,t). differential equation

E;K4

=x -

c J: Pj(rY E;,*t(x)) JBi O

j=O

Furthermore, the following relation is satisfied

See [ 5 ] , Chap. TI,

0 7.

Similarly we can check that


269

Therefore the solution (4.7) is represented by

QY)(s,c;;(x)) o d W!

X QP(s, E;t(x)) 0 dwt]

.

Acknowledgement. The author wishes to express his thanks to Y. Yamato for his suggestions and corrections to the first version of this article.

References [1] [2 ] [31 [4 ]

[51 [6]

171

J. M. Bismut, MBcanique alkatoire, Lecture Notes in Math., 866 (1981), Springer-Verlag. R. Courant and D. Hilbert, Methods of mathematical physics, 11, Interscience, New York, London, 1962. T. Funaki, Construction of a solution of random transport equation with boundary condition, J. Math. SOC.Japan, 31 (1979), 719-744. H. Kunita, Cauchy problem for stochastic partial differential equation arizing in non linear filtering theory, Systems and Control letters, 1 (1981), 37-41. -,Stochastic differential equations and stochastic flow of diffeomorphisms, 1’6cole d’CtC de probabilitks, Saint-Flour, 1982. S. Ogawa, A partial differential equation with the white noise as a coefficient, Z. W., 28 (1973), 53-71. -, Remarks on the B-shifts of generalized random processes, Proc. Inter. Symp. SDE Kyoto 1976, Kinokuniya, Tokyo, 1978. DEPARTMENT OF APPLIED SCIENCE KYUSHUUNIVERSITY 36 FUKUOKA 812, JAPAN


Applications of the Malliavin Calculus, Part I Shigeo KUSUOKA and Daniel STROOCK*

9 0. Introduction This is the first in a series of articles dealing with the application of Malliavin’s calculus to various problems in stochastic analysis and the theory of partial differential equations. The present article is devoted, for the most part, to rather technical aspects of the basic theory. In section l), we begin with a resum6 of the Malliavin cluculus as it is developed in [ 8 ] . We then show how to incorporate the ideas of I. Shigekawa [6]into that framework. Section 1) closes with a rather careful examination of the regularity estimates which one can get on the distribution of functionals to which Malliavin’s procedure is applicable. In the second section, we show that solutions of It6 stochastic integral equations are “smooth functions” in the sense of Malliavin’s calculus. What is new here is that we allow the coefficients in our It6 equations to look into the past (i.e. the solutions need not be Markovian). Like section l), the second section is quite technical, but we see no way of justifying our conclusions without getting involved in such technicalities. At last, in the third and final section we begin putting all the machinery constructed in sections 1) and 2) to work. The main result of section 3) is that the distribution of the solution to a general It6 equation has very much the same regularity properties as that of a classical diffusion just so long as the coefficients of the white noise are non-degenerate. Although this result comes as no surprise, we do not know any other method of deducing it and believe it is a good example with which to illustrate the power of Malliavin’s calculus. Future articles in this series will concentrate on the Markovian case, where one can say much more. In particular, we will show how to complete the program, initiated by Malliavin in [4], of recovering Hormander’s renowned hypo-ellipticity theory for second order degenerate elliptic operators. Rather than attempting to describe here what we have been

* The research of this author was sponsored in part by N.S.F. Grant MCS 80-07300.

s. KUSUOKA

272

AND

D. STROOCK

able to do in this direction, we simply announce that we have not only recovered Hormander’s basic theory but also have been able to make certain extensions and improvements which may be a particular interest to probabilists. A preliminary version of our work can be found in section 8) of [9].

5 1.

The formalism of Malliavin’s calculus

The purpose of this section is to provide a brief review of the basic facts about Mallavin’s calculus. Some of this material is contained in [8] and therefore we will restrict ourselves to simply stating those results whose proof may be found there. Throughout, 0 will denote the space of continuous maps 8: [0, a) -+ Rd satisfying 8(0) = 0. Thinking of 0 as a Polish space (with the topology of uniform convergence on compact intervals), we use 27 to denote the Borel field over 0. For t 2 0, at denotes the o-subalgebra of a Finally, we use generated by @), 0 5 s 5 t. Clearly i% = o(Ulso g‘,). W to denote the standard Wiener measure on (0,g). That is,

r

for all t E [0, m), h > 0, and E g R a(the Borel field over Rd). with dense domain On L 2 ( Y f ) we consider a certain operator 9, Dom (P),called the Ornstein-Uhlenbeck operator. The operator 2’ is uniquely determined by the following properties : is self-adjoint; i) 2’ on Dom (9) 2 ’ admits a unique extension as a closed operator 9 on L * ( W ) ii) such that Dom (9) = {@ E Dom (9) n L ’ ( Y f ) : 2%) E L’(W)}; and F E CP(RD) iii) for any D 2 1, @ = (Q1, . . .,COD) E (Dom (2’))D, having bounded second order derivatives, F O@ E Dom (9). In particular, if V, V’ E Dom (Z),then V-K’ E Dom (9) and we may define the -+L ’ ( d f ) given by bilinear map (. , .)y: (Dom (2’))’ (1.1)

, =iv) for each 1 5 k 5 d and s 2 0, B,(s) E Dom (9) for all 1 5 I 5 d and t 2 0.

B,(s)/2, and (B,(s), B , ( t ) ) , = s A t

Applications of the Malliavin Calculus, Part I

273

To see that i)-iv) determine at most one operator is quite easy. Indeed, starting from iv) and making repeated use of iii), one sees that if such an 9 exists it must be the number operator (i.e. the space iWn) of nth order homogenious chaos is an eigenspace of 9 with eigenvalue -n/2). To prove that 9 exists (i.e. that the number operator has properties i)-iv)) requires some work. The interested reader should consult the first three sections of [ 8 ] . The bilinear map (., . ) 5 ' :Dom (9) X Dom (9) + L1(W)has some important properties. In the first place,

(0, O)z 2 0 , 0 E Dom (9).

(1.3)

From (1.3), it follows that

(1.4) where

From (1.4), we get ; (1.5)

-

Finally, as a bilinear map, ( -, (@-norm. In fact,

is bounded with respect to the graph

+ (2Z@)2]1/2Ew[V2 + (9V)z]1/2.

Ey[l(O,V),1] I EW[@'

(1.6)

For reasons which will become apparent shortly, it is useful to allow 9and to act on Hilbert space valued functions. To this end, let E denote a real separable Hilbert space. Given 0 E L7(W;E ) (we use L p ( W ;E ) to denote the measurable 0: 0 + E such that I l @ l l E E Lp($f)), we say that 0 E Dom (9; E ) if (0,e), E Dom (9) for all e E E and there is a V E Lz(W,E ) such that 9(0,e), = (V, e),, e E E. If 0 E Dom (9; E), we use 9 0 to denote that associated V. Next, given 0 E Dom (9; E) and an orthonormal basis {e,};. in E, define A,, = ((0,et),, (0,e,),),. Then, from (1.4), A:, < A,,A,, and so, by (1.6), EY[(C;,=,A:j)1/2] < E Y [ C L&,I EyQ@,et% (90, e,Y1 = E"[/I@IIL Il90ll"E. Thus, we can determine an element ((0,CD))~of L1($f;E 0 E ) by setting ((0,0))5' = Cz5=1Atjet @ e,. Making the usual identification of E @ E with H.S. (E; E ) (for Hilbert spaces El and E,, H.S. (El;Ez) is the Hilbert space of Hilbert-Schmidt operators from El to EJ, we see that ((0,O))s is a symmetric, non-negative definite, trace class valued map from 0 4 H.S. ( E ;E). We will use ((Q2 to denote Trace (((0,0))p)1/2. ( a ,

+

+

S. KUSJOKAA N D D. STROOCK

274

The fact that the operation (( -,.))Iis inherently quadratic is a source of difficulties. Thus it is useful to have a linear "square root" of this operation. The possibility of finding such a "square root" was first exploited by I. Shigekawa in [6]. We will adopt a variant of the Shigekawa approach. Denote by H the space of h E C([O, 03); R d ) such that h(0) = 0, h is absolutely continuous, and Jrn[ h'(t) 1' dt 0

Hilbert space under the norm llh![H=

1 and e:, . . ., e: E El, (yl, . -,yn) -+ F(et yje:) E Cm(Rn;E,), there is a continuous F(,) from El into the continuous multilinear maps Mn(E:; E2)for which

n;

+ C:

9

and for all n 2 0

II F(Ye:)ll.Mn(E;;E , ) I Cn(1 + IIe:llE1>rn for some C , < 00 and 7, < 00. (Here, and elsewhere, F(O) = F and Mo(E;; E,) = 4.1 In connection with the spaces C'p(El;E,), we need one more construction. Namely, given F E C;(El ; E,), and (& (Z!, e":)) E El 0*(EIY, note that, for any orthonormal basis {h,},", C; P ( e ~ ) ( Z ~ ( h.?;(A,)) j), converges in E, to an element which does not depend on the particular choice of {hj}?. Thus, we can define ( F ( ' ) ) : El -+ Mz(X(E1)Z; E,) to be given by the sum of this series. Clearly, (e:, e":, Pi) -+ (F(')(e:))(Z:,2:) can be thought of as an element of C;"(El0X ( E 1 ) ' ;Ez). Moreover, when F(')(e:)E H.S. ( E i ;E,) and therefore admits a unique extension F"@)(e:) as an element of Hom (El 0El ;E,),

(1.9) Theorem. If F E C;(El; E,) and G E C";E,; E J , then G O F E CT(El; E3). Moreover, if A : El 0 . . . 0En -+ E is a continuous nzultilinear map, then A E CT(El 0 . ' . 0En;E ) . Finally, if F E C";El; E,) and @ E X ( 9 ; El), then Fo @ E X ( 9 ; E,), D(Fo 0)= (F") o @)D@,and 3 ( F o @) = ( F " ) @(D@,D@)/2 (F'" 0 @)9@. 0

+

Proof. Only the final equation needs comment. First, note that if dim El < 03, then F @ )admits a unique extention F"(') as a map of El into Hom (El 6 El; E,) and ( F @ ) o) @(DO,DO) = (F"(') o @)(((@,@))). In par-


277

0, set @* = (@* E')"'. Then 0

+ D((l/@JF) = - l/@D@Q 'P + l/@cDF

where 'P Q D@ is interpreted as an Z?(E,)-valued map by identifying P ( R ' ) with H and P ( E ) with H 0E. Using the closure property of the operation D, it is clear that this expression continues to hold after Q S is replaced by @ throughout. But then the induction hypothesis tells us that the resulting quantity on the right is in g n 0 ( 9Z?(E)). ; When no 2 (9; El. 1, a similar argument allows us to show that 9((1/@)?P)E Po-' Q.E.D. Remark. It should be pointed out that P. A. Meyer [5] has shown that the class g(9;E ) can be described without using the operation D. His idea is analogous to the well-known trick in finite dimensions which allows one to describe the Sobolev spaces entirely in terms of the Laplacian, without any mention of partial derivatives. Unfortunately, as with finite dimensional analogue, verifying that the L4(W;%"(E)) norm of

279


Dn@ can be controlled by the Lq(W; E ) norm of entails a non-trivial singular integral argument when q # 2. For this reason we have chosen to avoid relying on his results. It will be useful to have some norms on the spaces Sn(9; E ) . In order to describe these norms, it is best to introduce in little notion. Let = 0 if IJ = 8 and IY] = n if 4 = {@}UU; ((0, l})n. For Y E 9, set Y E ((0, l})%. Also set [IJ]= 8 if IJ = 8, [Y] = {k:vk = 1) if Y # 8 I[Y]I = card ([IJ]),and l l ~ ~ = l l 211~1- I[IJ]I. Given @ E 9(9; E ) , define @(") = @ if Y = 9 and @("I = DylY'-yl.. D y n L P u n @if ~ I J J= n 2 1. Note that which we will often identify with E @ fPrVl1. takes values in fll[vll(E), For n 2 0, define III@IIlg),@ E P(9;E ) , by

and, for q E [2, w), set I I I @ I ] lg)E = 11 I [ I@I I I(En)jlLq(w). It is a quite easy matter to check that S n ( 9 ;E ) is complete with respect to {[[[. IIIgL: q E [2, m)) and in this way becomes a countably normed Frtchet space. Also, 9(9; E ) becomes of countably normed Frtchet space under (111 . I]&: n 2 0 and q E [2, w)}. I n addition to the preceding norms, we will have occasion to use the defined by norms 11

and, for q E [2, m), the associated norm obvious that the following relations hold;

11@11g)E= l ~ l ~ @ ~ ~ ~ ) ~ ~ L q ( v ) It.

is

Warning. When E = R', we will drop any mention of E in our R') will be written 9(9), 1 I I @ I I It?will be written notation. Thus, 9(9; I I I @I I I(,), etc. (1.14) Lemma. For each n 2 0 there exist constants C , f o r all m 2 2 , and A E M,(E, 0 . . 0E m ;E ) ,

<w

such that

s. KUSUOKA

280

AND D. STROOCK

In particular, for @, ZP E 9(9),

lK@, We-ll(")5 ~ n l l ~ @ l l ~ ~ l l ~ ~ l l ~ ~

(1.16)

*

Also, for non-negative @ E Y ( 9 ) satisfying l/@ E Y0(9)

II l/@ll(n)I Cn(ll ~ / @ l l ~ o ~ ~ n + l ~ l l @ l l ~ n ~ ~ "

(1.17)

*

Finally, given q, ql, . . ., qn

E

[2, w ) satisfying l / q = C 1"l/qL,

n lll@Llll~;E~ m

l I l 4 @ 1 9

*

. @m)lIlEL I . 9

C,m-lllAlldim(E1O...OE,;E)

(1.18)

II

4 @ 1 9

...

I

@m)lIEL

I

C,m-l I/A IIMm(EIO...@ E r n $ )

andfor 4, ql, q2 E [2, a) satisfying l/q = l/ql (1.19)

rn

I-I II

@1

Il&

;

+ l/qz,

I (@, We-llt-"'I ~7Ill~@ll~~Hll~~ll~~H II l/@ll?) I C"(ll ~ / @ l l ~ ~ + l ~ ~ l ~ n + ' ~ l l @ l l ~ ~ ~ ~ *

Proof. Given (1.15), (1.16), (1.17), (1.18) and (1.19) follow by Holder's inequality. Moreover, (1.16) is a special case of the second inequality in (1.15). To prove (1.15), it suffices to handle the case when m = 2 (the cases m > 2 follow by an easy induction argument). But, for n 2 1, D"(A(@,W)) =

2 ( 7 ) A(DL@,D-")),

1 =o

and so the second part of (1.15) with m = 2, follows immediately. T o prove the first part, we note that for w E 9,

where Ap,u,m E M2(21[p11(El) 0X1[v11(E2); X I [ " l 1 ( E ) )and has norm not exceeding 211"11 11 A [Idil(EIOEa;E). The proof of this representation is easily accomplished by induction on I w I ; one simply has to use =%4(@, 'y)>= &9@,

+ A(@,83p)+ ( A ) ( D @ ,DZP) ,

where ( A ) E M,(X(El) 0A?@); E ) is given by ( A ) ( ; ' , 2)=

5A(D'(h,),e"*(h,)) j=1

for any orthonormal basis {h,}," in H.


281

In order to prove (1.16), we use induction on n 2 0 to prove that D"(l/@) =

~

Qrn

ml +

C ...

En(ml,

., m2)Dmi@0 . . . Q Dmn@

+ m n= n

-

where (En(ml, . ., m,)I 5 1 and we have identified X n ( R 1 )with H @ . Q.E.D. Clearly (1.16) follows easily from this. @)), and (1.20) Theorem. Let @ E g(9;RD) be given and set A = ((0, A = det A . Given a E ND\{0},there is a linear map .%?a: 9(9)+ 9(Y) such that

(1.21)

E ' [ ( ~ O @ ) ( A ~ ~ ~=~(-l)lulE~[(~~@)(~=T)] - ~ T ) ]

Hence, by Cramer's rule :

where A(") denotes the (ij)th cofactor of the matrix A . In particular, if ?€f E 9(9), then

S . KUSUOKAAND D. STROOCK

282

Using Lemma (1.14), one sees that there is for each n 2 0 a Cn(l) < 00 (not depending on @) such that (1.24)

I1 BiK ]I?) I Cn(l)(ll A I l t n g C _ l l ) ) q l ; R ~ ~ R ~ ) D - l x (11 IlglXCRD) + IISIRD>II Ilk")

+

for all q, ql,q2, q3 E [2, m) satisfying l/q = l/q, 1/q2 ticular, this proves (1.21) and (1.22) when ( a /= 1. Next let a E M Dso that

E

N o with n

=

la1 2 2 be given.

where i, = min {i: aim-.')> 0) . Given K 9(9)so that

KO= Y, Km = ABin-lYm-l + 2(n

E

+ l/q3.

In par-

Define a('),. . -,

9(2), define Yo,.

a ,

YnE

- m ) ( C A ( i n - l , j ) ( @ jA)2)€fm-l , j

for 1 5 m 5 n - 1, and Yn = Bin-lYn-l. Note that

Thus, by (1.23) :

if0

I m 5 n - 2, and

Therefore, (1.21) holds when we define BaK = Kn. Moreover, for 1 5 mI n - 1, one can combine Lemma (1.14) and (1.24) to find a C , < 00 such that for all q, ql, q2,q3E [2, m) satisfying l/q = l/ql l/q, l/q,

+

II Krn Ill."-"' I Cm(IlA l l k ~ x (I1 D@l;:;?b) for 1 5 m I n - 1 and

20-1

~

~

~

;

~

~

+ IIY@Il:rI$)

R

D

)

II v m - I l :-m+l)

+


283

From these it is easy to deduce (1.22). (1.25) Corollary. Let @, A , and A be as in (1.20). Suppose that T E 9 ( 9 )has the property that T / A n E g ( 2 )for all n 2 1. Let YW denote the measure on (O,@’) given by (TW)(dO) = W(O)W(dB)and set ,u = ( T W ) 0Q-I. Then p(dy) = f ( y ) d y w i t h f e Y ( R D ) .Infact, $11 . I((m,n), M ,n 2 1, is the norm on Y ( R D )defined by

where A denotes Fourier transform, then Ilfll(rn,n)

I ll(1 + II@ II~~)m/2/1:0)lulI

= Er[(1 +

l @ l z ) m / z e f ( E(T/AZICI-I ~ ~ ) ~ >I

+ I yr)mlzei(f,v).All the assertions made follow easily Q.E.D.

When l/d E g0(S?);one can often obtain quite good estimates on the f and its derivatives. The basis of the analysis which we have in mind is the following simple real-variables lemma.

>

hasfirst (1.26) Lemma. Suppose that q D and suppose that g E L4(RD) order distributional derivative g , ] , . . ., g,= E Lq(RD). Then g E C,(R) and D

(1.27)

/lgllC,(RD,I c(q,

.I(&llg.llL*(RD))D’s

l l g l l Ll -CD( R / qD ) *

Proof. As a tempered distribution, g satisfies the equation

284

s. KUSUOKA AND D.STROOCK

where

This equation follows from the equation g = I G , * g - G,*(dg)

after integrating the second term by parts. As is easily computed, for q > D there are positive constants A(q, D ) < w and B(q, 0) w such that

\l?3

for all p E [ 1, w) and T

IIDt(t>ll:(E)

+ l19t(t)ll?3)Lp(W)

> 0.

r

(2.2) Lemma. Let a E 9(9; H.S.(Rd;E ) ) and Q, E F ( 9 ;E ) be given. Define C(T) = r a ( t ) d S ( t )

+

,k?(t)dt,T 2 0. Then [ E F , ( 9 ;E ) . In

fact,

f o r t 2 0 and h E H, and

+ j T9,Q(t)dt

9 E ( T ) = J T ( 9 a ( t )- a(t)/2)dB(t)

+ 3P-'TP-1 Proof. The proof of this lemma is eassentially the same as the proof

289

Applicaiions of the Malliavin Calculus, Part I

of Theorem (4.6) in [8]. That is, one first proves everything for the case when a and B, are simple. One then uses the estimates derived in this case, plus standard approximation results, to handle the general case. The required estimates follow easily from the stochastic integral expressions from D( and 2( together with Lemma (2.1). The only step that may cause problems, is estimation of the term

J:

a(t)h’(t)dt in the expres-

sion for D((T)(h). However, writing this term as (B(T),h)H, where

=I

B ( T ) :O+X(H.S.(Rd; E ) ) is given by B(T)(B) the X(H.S.(R; E))-norm of this term is ( J T

*AT

a(s, 8)ds, we see that

II(Y(~)~~L.~.(~~;~)

tribution of this term to the estimate for sup

IID((t)ilJP(E)is

the second

O 0. Given a compact metric space M and a separable Hilbert space E, let C X ( M ;E ) be the class of measurable E ; M X 0 + E such that ((Z) E X ( 9 ; E ) for all Z EM , E ( . ) : M - t E, DE(.): M + 2 ( E ) and YE(.): M -+ E are continuous for W-a.e. 8, and

Then the following is an easy consequence of Theorem (1.9).

s. KUSUOKAAND D.STROOCK

290

(2.5) Lemma. Suppose that M is a compact metric space, El and E, are separable Hilbert spaces, F E CSp(E,, E,) and C S ( M ; El). Then Fo E E C X ( M ;E,).

-

Given compact metric spaces MI, . -,M N and separable Hilbert let C"pC(Ml,El) 0 . 0 C(M,,>,); E") be spaces El, . -,EN and the class of continuous F : C(Ml, El) 0 0 C(M,, EN)+ E such that for any multi-index a = (a,. a,) with [ a1 = a, aN 2 1 and at:the map any g:!) E C(M,, Et), 1 2 i 5 N , 0 < j I a

---

a

( x p , . . ., xb:', xi,), . . .) X i , ) ,

--

+- +

. . . xi",) E R'"'

there is a continous 8""):C(M,, E l ) 0 . . . 0C ( M N E y N )+ Hom (C(M",E""); I?)

( M udenotes M,"' X . . .

X M B and EQa denotes EFalQ

. . . 0E

~ N )

for which

for g C i )E C(M,, Et), i = 1, . . y N . (2.6) Lemma. Suppose that F E C;(C(M,, El)0 . Then for any multi-index a, G,: C(Ml, El) 0 *

*

- . 0C ( M N ,E N ) ;I?).

- 0C ( M N ,E N )0 C(M", EQ")+ I?

291


given by

G,

and g E C ( M " , E@'"),belongs to CT(C(Ml, El) 0 * * . 0 C ( M N ,E N )0 C(M", E@").E ) . Moreover,

if tiE C X ( M , , E6),i = 1 , . .

belongs to X ( 9 ; E"), N

(2.7)

N

p$-= C F('i) i=l

(2.8)

+2

N , then

[ Ey) 1

D$-= C F(ai) Z=1

a,

I:

L:)!

(ofi(**,)) , and

(f:?J [

W&(**i))

El(*,)

.- F ( a i f )

EN(*hJ

1

Here ai = (0, ., 0, i, 0, . . + , O ) , a i j = a, i f i j , and ( j , i ) i f i > j .

, +.

E(**))>Y

i,j=l

in E 0E uniformly on M X M. Thus, it is an easy matter to obtain (2.7) and (2.8) by passing to the Q.E.D. limit n +. 00 (cf. the last part of the proof of Theorem (1.9)). Given F: [O, m) X C([O, m), El)-+ E,, we say that F is a smooth, tempered, non-anticipating function if F is measurable: for each T 2 0 there is an P ( T ) E C3p(C([O, TI, El);Ez)such that F(T, +) = F(T)(+]LO,TJ for E C([O, w); El); and for each T > 0 and n 2 0 there exists C,(T) < 00 and T,(T) < w satisfying

+

293


+

IIF(t )Y+) I1Horn ( C KO, t,n, Ep);E d I Cn(T)(1 + II II CCLO, t l ,E l ))Tn (0 for all 0 I tI T and

+

E

C([O, t ] ,El).

Warning. When dealing with smooth, tempered, non-anticipating functions F, we will use F(")(T,+), n 2 0, T 2 0 and E C([O,m), El), interchangeably with F(T)(")(+I c o , r l ) . Also we often use I[+~I c ( c o, r l , E l ) in and, in general, we think of E C(C0, m>;El) place of Il+l~o,~,llc(co.~lE,); E ) as being in as being in C([O, TI; El), and think of f E SC(9; CX(t0, TI; E ) .

+

+

(2.9) Lemma. Let a : [0, m) X C([O, m); RD)+ RD0Rd and b : [0, m) x C([O, m); RD)-+ RD be smooth tempered non-anticipating functions satisfying Ilb(n)(t? $)lIHom(C([O,t]n;

5

(2.10)

v 11 b ( n ) ( t$1~ /I

(RD)@ln);RD@Rd)

Cn(T)(l

+

+

~ o m ( C ( [ O , t ] n ;( R D ) @ ) ; R D )

II$IIC([O,tl;RD))"(')

0 and q E [2, co), there exists a K,(T) < co, depending only on C,(T) and C,(T) in (2.10), such that

for all0

0 and M,(x, R) and M,(x, R) are defined, respectively, as the supremum of ll4t, $F)IIE.S.W;;RD) and IlNt, +)IlRDfor 0 I t I T a n d

+

{$

E

c([o,

then there is a C,(T)

(3.12)

0 and y E RD. Replacing W in (3.1) by p,(X(t, x ) - x)W and using Ilpl.(X(t, x ) - X)Whn,+l)

I C,Ilp,(X(t, 4 - ~)lli;~l)ll~lli;:l)

Y

one gets (3.12) from standard estimates on YP'-(supoI,~,~X(s, x) - X I 2 r/2) Q.E.D. (cf. Theorem (4.2.1) in [lo]). (3.13) Remark. It is gratifying to see that the upper bounds just obtained on P ( t , x, .) are, at least qualitatively, the same as those which one would predict on the basis of the heat flow semigroup. Unfortunately, we have no good idea how to obtain corresponding lower bounds, although we are confident that good lower bounds exist. The best which we can say for sure is that for each ( t , x) E (0, w) X RDand each strictly positive W E 3(9),{ y E RD:pt(t, x , y ) = 0 ) contains no interior points. This inadequate observation is an easy consequence of the fact that if P, = W o X ( . , x)-' on C([O, w); RD),then supp (P,)= {W E C([O, w); R): W(0) = X} (cf. [7]). (3.14) Examples. 1) The basic example to which our results apply are non-degenerate, time-inhomogeneous diffusions. That is, suppose that (r: [0, w ) X RD-+ RD 0Rd and b : [0, w ) X RD3 RD are measurable functions such that: a(t, .) E Cy(RD;RD X Rd) and b(t, .) E C y ( R D ;RD)for all 't 2 0; and for each T > 0 and n 2 0, there exist C,(T) < w and Y,(T) < w for which

I Cn(T)(1

+ IIYIIRD)""~)

Y

E

RD.

Assuming that Y,(T)= 0 for all T 2 0, one easily checks that for each (s, x) E [O, w) x RD,there is a unique X s ( ., x) satisfying (3.15)

X'(T, X) = x

+ lTv'a(t, X'(t, x))dO(t) a

+

rv'

b(t, X s ( t , x))dt , T 2 0 .

8

Set P(s, x; T, .) = *T0 X 8 ( T ,x)-', (T, x) E [s, w) consequence of uniqueness, we have

w(x'(T,X) E r I g t )= ~ ( txyt, , x ) ; T, r)

x RD. Then, as a (as., w )

s. KUSUOKAAND D. STROOCK

304

for all 0 5 s < t < T and x E RD. In other words, A?(., x) is a timeinhomogenious Markov process with transition function P(s, x ; T,.). Moreover, from ItB’s formula, we see that for any q5 E C,”(RD),

where

Hence, P(s, x ; -,. .) satisfies (at least formally) the Fokker-Planck equation (3.17)

a -P(s,x; t, * .) = LFP(s,x;t, * * ) , at

t 2 s,

where L f is the formal adjoint of L,. If we now add the assumption that aa* 2 E I R D for some E > 0, then Theorem (3.5) and Corollary (3.9) apply, and we see that P(s, x ; T,dy) = p(s, x ; T,y)dy where p(s, x ; ., .) E C((s, m) X RD)andp(s, x ; T, .) E Y ( R D for ) each t s. In particular, we can now say rigorously that

>

<
0, then Theorem (3.5) and Corollary (3.9) apply to the solution I(., x), of

and we conclude that P(T, x, .) = YY o X(T, x)-' has a density p ( t , x, E Y ( R Dfor ) each T > 0. It is not clear to us how one could deduce such a result by any other technique. In some special cases, it is possible to embed X ( . , x) as an infinite dimensional Markovian system which is sufficiently tractable by more standard Markovian methods (cf. the last section of [9]). However. even in such cases the reasoning is quite involved; and it seems to us unlikely that such methodology can be made to work in general. a )

(3.19) Remark. It would not be difficult to show that p"(t, x,y) is smooth as a function of (x, y ) E RD x RD. The technique which we have in mind is essentially the same as the one used in section 7) of [XI. In fact, by being a little careful, one can show that

for O < t < T .

306

S. KUSUOKA AND D. STROOCK

References D. L. Burkholder, A geometrical characterization o Banach spaces in which martingale difference sequences are unconditional, Ann. of Prob., 9 (1981 ), 997-1011. A. Friedman, Partial Differential Equations of Parabolic Type, Prentice Hall, Englewood Cliffs, N. J., 1964. S. Kusuoka, The nonlinear transformation of Gaussian measure on Banach space and its absolute continuity (I), J. Fac. Sci., U. of Tokyo, Sec. IA (Mathematics), 29 (1982), 567-596. P. Malliavin, Stochastic calculus of variations and hypoelliptic operators, Proc. International Conf. on Stoch. Diff. Equ. at Kyoto (1976), Kinokuniya-John Wiley & Sons, Tokyo and N. Y., 195-263, 1978. P. A. Meyer, Quelques resultats analytiques sur le semi-groupe d'ornsteinUhlenbeck en dimension infinite, Theory and application of random fields, Proc. IFIP-WG 7/1 Working conf. at Bangalore, ed. by G. Kallianpur, Lect. Notes in Cont. and Inform. Sci., 49 (1983), 201-214, Springer-Verlog, Berlin. I. Shigekawa, Absolute continuity of probability laws of Wiener functionals, Proc. Japan Acad., 54 (1978), A, 230-233. D. Stroock, On the growth of stochastic integrals, Z. Wahr., 18 (1971), 240244. -, The Malliavin calculus, a functional analytic approach, J. Funct. Anal., 44 (1981), 212-257. -, Some applications of stochastic calculus to partial differential equations, Bcole d'&e de Probabilitks de San Flour 1981, ed. by P. L. Hennequin, Lec. Notes in Math., 976 (1983), 268-382, Springer-Verlag, Berlin. D. Stroock and S. R. S. Varadhan, Multidimensional Diffusion Processes, Springer-Verlag, Berlin, 1979. Shigeo KUSUOKA O F MATHEMATICS DEPARTMENT OF TOKYO UNIVERSTY TOKYO 113, JAPAN Daniel STROOCK OF MATHEMATICS DEPARTMENT UNIVERSITY O F COLORADO, BOULDER CAMPUS BOX 426 BOULDER, COLORADO 80309 U.S.A.


Stochastic Flows of Diffeomorphisms Yves LE JAN and Shinzo WATANABE

0 1.

Introduction

As solutions of an ordinary differential equation (O.D.E.) on a manifold define a one-parameter subgroup of the group of diffeomorphisms called a dynamical system, solutions of a stochastic differential equation (S.D.E.) define a continuous random motion of diffeomorphisms called a stochastic dynamical system or a stochastic flow of diffeomorphisms. It is also called a Brownian motion on the group of diffeomorphisms since it is a continuous motion on the group with independent increments. The study of S.D.E. from this view point, especially that of the dependence of solutions on the initial value, was initiated by Soviet school e.g. Gihman and Skorohod [6] and Blagovescenskii and Freidlin [3] but it is rather recent that diffeomorphic property of solutions has been established by Bismut [4] and Kunita [9] (cf. also Elworthy [5], Malliavin [12] and Ikeda and Watanabe [S]). More recently, Brownian motions on the group of diffeomorphisms or homeomorphisms themselves have attracted attention and have been studied by e.g. Harris [7], Baxendale [2] and Le Jan [l I]. In these works, main problems are first to specify infinitesimal quantities which characterize Brownian motions (i.e. continuous motions with stationary independent increments) on the group of diffeomorphisms and secondly to construct Brownian motions from these infhitesimal data. It should be noted that the construction problem can be discussed by the method of S.D.E. but we need rather general S.D.E. based on Wiener processes (i.e. continuous Gaussian processes with stationary independent increments) on the space of vector fields. Also in the characterization problem, the infinitesimal data characterizing a Brownian motion can be given as data describing a Wiener process on the space of vector fields. The purpose of this paper is to discuss similar problems in more general framework of continuous random motions on the group of diffeomorphisms than Brownian motions. For example, if we consider the composite of two Brownian motions on the group of diffeomorphisms or the inverse of a Brownian motion, they are no longer Brownian motions, but are continuous processes which might naturally be called semi-martin-

Y. LE JAN

308

S. WATANABE

AND

gales on the group of diffeomorphisms. For such semi-martingales, we can associate uniquely a semi-martingale on the space of vector fields called the velocity field. Conversely, a semi-martingale on the space of vector fields generates a semi-martingale on the group of diffeomorphisms through S.D.E. so that it is the velocity field of the latter. Thus it is the main result of this paper to establish this one-to-one correspondence between a class of semi-martingales on the group of diffeomorphisms and a class of semi-martingales on the space of vector fields. A semi-martingale on the group of diffeomorphisms is a Brownian motion if and only if the corresponding semi-martingale on the space of vector fields is a Wiener process.

0 2. L.C.-system The main purpose of this section is to study a system of analytical objects (called an L.C.-system") which is necessary in characterizing infinitesimally a random motion of transformations of a state space and to describe this system in a geometric language when the state space is a smooth manifold. Let M be a topological space and C ( M ) be the space of all realvalued continuous functions on M . C ( M ) is an algebra under the usual operations and let 9 c C ( M ) be a subalgebra. We consider a system {L,( , ) x , , } of the following objects; 1) (f, g),,,, f, g E 9, x9 Y E M , which is a real bilinear form on 9 x 9 for each fixed x , y E M , 2) L : 9-+ 9, a linear operator on 9. We assume that the system {L, ( , )I,y} satisfies the following properties; a) b)

(f, g>x,, = ( s , f > y , x ?

vf, g

E

9

3

vx,Y

E

M,

(derivation property)

(LA,g>x,, = f i ( x ) ( f i >g>r,g + . A ( x ) ( L , g>x,,, VL7.h g

c)

(positive definiteness) m

*

L.C. is meant for local characteristic.

E

9, VX, Y E M ,

Stochastic Flows of Difleornorphisnzs

309

Definition 2.1. Such a system {L, ( , )I,y} is called an L.C.-system (with domain 9)on M. In this paper, we are mainly interested in the smooth case, that is, the case when M is a manifold and transformations are diffeomorphisms. So, from now on we assume that M is a smooth connected compact manifold of dimension d. We denote by C " ( M ) the algebra formed of all real C"-functions on M. Definition 2.2. An L.C.-system {L, ( , ) s , u } on M is called smooth if 9 = C m ( M ) L , : 9---f 9 is a differential operator and (x,y) + ( A g)%,, is C" in (x,y ) E M X M for everyf, g E 9. We rewrite a smooth L.C.-system in geometric language. In the following, we omit the summation sign according to the usual convention. First by b),

b)' ( A g>z,, = 4 x 7 Y " m 7 (d'),1 where A(x, y) E T,(M) 0 T,(M).(*) In a local coordinate,

a

A(x, y) = a y x , y ) -- 0 ax2

a ~

ayj

and the coefficients aij(x,y) are smooth in x and y by the above smoothness assumption. Next, L: C " ( M ) + C"(M) is a smooth second order differential operator such that L ( l ) = 0 ( 1 is the function on M identically equal to 1) as is easily seen from the properties b) and d). Then

in a local coordinate where the coefficients aii and pi are smooth in x. Now a), c), d) are equivalent to the following respectively: a)' c)' d)'

a"(x, y ) = a""(, x), aii(x,,x,)c~cj2 0, for every rn = 1,2, and cs E R, ati(x)= CZ"(X, x).

- - .,

xI,x2,-

-

0

)

Note that (a"(x)) is non-negative definite because of c') and d'). Definition 2.3. Set

*

T J M ) is the tangent space at x

6

M.

x, E M

310

Y . LE JAN AND S. WATANABE

in a local coordinate.

Lemma 2.1. For d E A , ( M ) (= the space of all diflerential l-forms on M ) , define ad. (V&(x) = u y x , x ) L ( x ) axk

(2.4)

+ ry(x)d,(x)

a

~

aYk [a%

Y)MY)l1 y = X

in a local coordinate, where R = di(x)dx'. rype (1.1).

Then V I is u tensor field of

Proof. It is not difficult to verify, by a direct calculation, that (Vd): obeys the transformation rule under a change of coordinates. More intrinsic proof may be given as follows. Let X E % ( M ) : = the space of all C"-vector fields, X(x) = X'(x)(a/ax') in a local coordinate and exp tX be the one-parameter subgroup of G = Diff(M) (=the group of all C"diffeomorphismsof M ) generated by X . Then hf(4

=

a""(, (exp tX>(x)>d,((exp(tX>(xNE T A M ) -h:(x)l d dt

and hence

E T,(M)

.

t=o

But

= X"x)(vI):(x) E T,(M) -hh:(x)) d t=o dt which proves that (VI)(x) E T ; ( M ) 0 T,(M) 2 2'(Tx(M)-+ T,(M)).

Corollary. (2.5)

1 L o ( f )= -(V(df))l 2

(contraction),

YE C"(M),

defines a second-order differential operator on M with the same main term as L. Hence (2.6)

B(f) =

af) - Jw), f Crn(M), E

is the first order differential operator on M , i.e., B E % ( M ) . When we regard B as a tensor field x -+ B(x) E T,(M), B(f)(x) is sometimes de] . we have obtained the following theorem. noted by B ( ~ ) [ ( d f ) ~Thus

Stochastic Flows of Difleomorphisms

Theorem 2.1. to giving a pair

To give a smooth L.C.-system { L , ( ,

a

(.(x, y ) = a y x , y)? ax

0-,a

B(x)

ay

311 )x,y}

is equivalent

=

with properties a'), c') such that both ( x , y ) -+ A(x, y ) E T,(M) 0 T J M ) and x + B ( x ) E T,(M) are smooth: (f, g ) x , v = 4 x 9 Y"f)x,

and

(4&l

+ W)[(df),l

L ( f ) ( x ) = Lll(f)(x)

where the second-order differential operator Lois defined by (2.5).

Thus, we may call such a pair { A ( x ,y), B ( x ) } as a smooth L.C.system. We shall study, a little more in detail, geometric aspects of the tensor field

a

a

A ( x , y ) = a"(x, y)0ax* ay5

.

If ai'(x) = a"(x, x ) is strictly positive definite for all x E M , then its inverse a,,(x) = a,,(x, x ) defines a Riemannian structure over M. In this case, T f j = -I':%z,,(x, x ) are the Christoffel symbols, i.e., coefficients of a linear connection, which is compatible with the Riemannian structure, that is the metric is preserved under the parallel translation. Indeed, the covariant derivative (VfX) of X = Xt(x)(a/axi)E B ( M ) with respect to this connection is given by

since

Y. LE JAN AND S. WATANABE

312

for X ( X ) = Xf(x)(a/axt)and Y ( x ) = Yi(x)(a/axt),

a

- X X m ( x ) Y n(x)am,n(x, x) ax5

a

X r n ( xyn(Y)am,n(X, ) Y)

=

____

=

(V!X)(x)a,,,(x,X)Y"(X)

aYt

Iy=i

a J % J ) y n ( x ) a m , n (x)l + ay" Y,

L!-x

+ v: Y ) ( x ) a m , * kx)Xrn(x>

and this proves the compatibility. It should be noted that this connection is usually not symmetric, i.e., torsion part does not vanish (cf. Example below). Finally, it is easy to verify that

where ( P f ) ( ( x )= a""(, x)(afiaxk)(x),that is, Locoincides with the LaplaceBeltrami opeator with respect to the connection defined by {i':j}. Example 2.1. Let M = Rd/2nZdbe a d-dimensional torus. A general form of smooth L.C.-systems on M which are invariant under translations is obtained in [ 141. Using the standard (Euclidean) coordinate,

a 0a ay

A(x, y ) = a y x , y)axi

-

and aij(x, y ) is the form at'(x, y )

=

CY + n EC {cos [ ( x - y)-n]c;!- sin [ ( x - y).n]dhj} Z d \ 101

where (c?) and (dkj) are real d X d-matrices for each n E Zd\{O} such that ~ ; = j c;l" = cYn, d;j = -dii = dYrL,c;jp,p, 2d;jp,R, chjR,R, 2 0 for every (p,),(It) E Rd and

+

C

n €Zd\iOl

(Ichfl + Idkj])Inlk< 03

Choose chj so that a"(x, x )

= cy

for all k

+

> 0, i , j = 1, 2, . . ., d .

+ CnEZd,iol~y = 8'. Then

Another characterization of the operator Lois given by Baxendale [2]. For the tensor field A given above, let H be the reproducing kernel Hilbert space (R.K.H.S.) of vector fields associated with A : to be precise, for

Stochastic Flows of Difleomorphisms

313

-

X = [xl,x,, . . .,x,] E M” and I = [A’, A,, . ., I”] where Rk = 2:(dxt)lXk E Tzk(M),k = 1, 2, m, for some m = 1, 2, . . .,

--

a ,

and H

=

{ X ( E , Ifor I all X and

completed by the Hilbert norm.

Then H

(2.8)

=

L,(f)(x)

-

I, m = I, 2, .

.}

f ( M ) ( * )and

trace, VfvZ

where VfgXis, for each x E M and f E C”(M), a bilinear form on H X H c_,S ( M ) X % ( M ) defined by V f * z ( X Y , ) = X ( Y ( f ) ) ( x ) . Note that the injection i : H-+ f ( M ) is Radonifying, i.e., if Y is the canonical Gaussian cylindrical measure on the Hilbert space H , then ib], which is a cylindrical measure on the FrCchet space % ( M ) can be extended to a a-additive Radon measure p on % ( M )so that (%(M),H, p ) is an abstract Wiener space. It is also known that every mean 0 Gaussian Radon measure on S(M)is obtained in this way from some A . Finally if p is the image measure of p under the map X 4 X B, then /z is a Gaussian measure on f ( M ) with mean B. Thus we can say that a smooth L.C.system ( A , B ) on M is in one-to-one correspondence with a Gaussian Radon measure /z on S ( M ) : B is the mean of p and A is the covariance of p, that is,

+

Cf. Baxendale [2] for details. In the subsequent sections, the following random L.C.-systems play an important role. Again we restrict ourselves to the smooth case. Let (Q, 9, P) be a probability space with a filtration (.Ft)tECO,w). We consider a system (AB+’(x, y), B8*@(x)) of smooth L.C.-systems which depend on (s,

0) E

to,

a)x

0.

Definition 2.4. system if

(A8+”(x, y ) , Bs-m(x)) is called a regular stochastic L.C.-

* The topology of %(M)is given by the uniform convergence of coefficients and their derivatives so that %(M) is a FrCchet space.


314

( i ) for each (s, OJ)

E

[0, co) X Q,

is a smooth L.C.-system, (ii)

for each x, y E M , (8, w ) -+ A 9 x , y ) E T J M ) 0 T,(M), (s, w ) ---f B 8 q x ) E T,(M)

are (FJ-predictable and

J SUP l D : D W s %

(iii)

0 X,V€M

for every t C"(M).

0 3.

s:>

y)[(df)x,(dg),l)lds
0, $ t o #;I is independent of .Faand #t 0 #;I is equally distributed as # t - s . In particular, it is a process with independent increments: for every to = 0 < t, t, < . < t,, n, are mutually independent. q5t, o $& k = 1,2

>

-

0, then we define

J: K,dX, to be the semi-martingale (4.4)

J: K,dX, =

s:

K,dM,

+

and call it the stochastic integral of K valued semi-martingale X,.

s:

=

K,(B".")ds

(K,) E K by the regular O ( M ) -

Remark 4.1. The space %(M)* is just the space of 1-currents on M : k E O ( M ) * is expressed formally, in each coordinate neighborhood, as k = k,(x)dx* where k,, . .,k, are Schwartz distributions on M defined in the coordinate neighbourhood which obey the transformation rule under the change of coordinates. To be precise, k(X), X E %(M),is given by k ( X ) = ( k , , Xi) if X ( x ) = X*(x)(a/ax')with support in a coordinate neighborhood and, in a general case, by using a partition of unity. Note that traceHs.&(')0 k @ )= ( W ,k(2))H,.0, k(QE %(M)*, 1 = 1, 2, is given by

C (ky)sz8k r ) ' J ,a,(x)aJ(y)a"(x,y , s, a)) I,J

where 1 = C , a,(x) is a partition of unity with Supp(a,)C V,, V, being a coordinate neighborhood and k") = kjz)2'(x)dxt in V,,

Aslw(x,y ) = aiJ(x,y , S, w)-

a a ay ax( 0__

in V , x VJ

k:'),' 0k y ) , Jis a tensor product of Schwartz distributions.

.

Y.LE JAN AND S. WATANABE

320

Finally we remark that the totality of regular %(M)-valued semi., X,,, be regular I ( M ) martingales forms a vector space. Let X,, X,, valued semi-martingales with local characteristics {A>,",x, y), B ~ " ( x )and } the semi-martingale decompositions

--

X k ( t ) = M,(t)

+

s'

.

B>@ds, k = 1, 2, . ., m,

respectively. Then if M,(t) = M:(t, x)(a/ax'),k = 1, 2, . . ., m, in a local coordinate, (MXf

4, MZ(t, Y)> =

j: a%, ,

Y , s, o)ds

with some function aX(x, y , s, o) and it is easy to see that (x,y ) --+ &(x, y , s, o) is smooth for each (s, w), (s, o)-+&(x, y , s, o) is (TJpredictable for each x , y E M and

for every multi-indices a! and /3.

Since

[ D:DCa;{(x, y , s, 0 )j2 I D;D,"a%(x, z, s,

Iz== x

0)

D,BD;aSlj(z,y , s, 0)] E = u

for almost all (s,o) (ds P(do)), we can find, for every xo,yo E M and coordinate neighborhoods Vzo,Vu,of x,, yo respectively, neighborhoods Uxo, Uu0such that xo E UzoC Vzo,yo E UuoC Vuoand

(J

(4.5)

a s . for every t

> 0.

)'"

sup {D:D;ap,(x, z, s, w)]I=x}ds

0 ZE U z o

If we set

a smooth tensor field A>,: is globally defined and, by (4.9, we can conclude that

Stochastic Flows of Diffeomorghisms

321

for every f, g E C"(M), smooth differential operators D:, Di on M and t > 0. Hence, if we set m

A"."(x,Y ) =

c

Y)

k,l=l

cB:."(x) m

Byx)=

k=l

then {A"."(x,y), B"."(x)} is a regular stochastic L.C.-system. It is now .. X , is a regular %(M)-valued easy to conclude that X = XI X , semi-martingale associated with these local characteristics. In particular, we have proved the following.

+ +

+

Proposition 4.1. The totality of regular %(M)-valued semi-martingales forms a vector space under the usual addition and scalar (=real) multiplication.

5 5.

Gvalued semi-martinglaes generated by %(&I)-valued martingales

semi-

In this section, a probability space (Q, S, P) and a filteration (9,) are given and fixed. Let X , = M ,

+

J-1

BSsads be a regular %(M)-valued

semi-martingale with the local characteristic {As+', B51."} which is, by definition, a regular stochastic L.C.-system.

+

Theorem 5.1. Let be an So-measurable G-valued random variable. Then there exists a unique regular G-valued semi-martingale $, such that

(i> $0 = +, (ii) for every x

E

M and f

E

Cm(M),

where K"(f ) E %(M)* is defined, for each x (5.2)

K " ( f )(X)= X ( f )(XI

E

X

Y

M and f E C"(M), by E

%(M)

and L:,"is defined, as above, by (5.3)

1 L;+(f ) = -Vs+(df): 2

=

1 trace,,.,Vf~" . 2

322


The local characteristic of $, coincides with {A8+',B*i*}. Proof. We assume 11. = id, i.e. +(x) = x : the general case can be obtained as #, 0 11. from the solution $, with $o = id. We embed M into a higher dimensional Euclidean space R N . Then $,(x) = (&(x), &(x), ., 4 f ( x ) ) and every X E % ( M ) is the restriction to M of a smooth vector field on R N ,that is, we may write

-

x = cF ( X ) -axta N

i=l

where x = (x', x2, . . . ,x N ) is the Euclidean coordinate. Set K;(X) = X i ( x ) and bi(x, s, w ) = (L;," B 8 , m ) ( f ) ( xwhere ) f ( x ) = xi, i = 1,2, . ., N . Then (5.1) is equivalent to the following:

+

(5.4)

&(x) = xi

-

+ [K!b(5)dM,+

1:

bt(q5,(x),s, w)ds ,

i = 1, 2, . . ., N. Note that, for any M-valued (9,)-predictable processes x , and y,, we have (5.5)

(1:

KSldM,,

J: KJ.dM,) = J: aif(x,,y , , s, o)ds

where A"*(x, y ) = aif(x,y , s, o),

a 0a ax%

ap

under the embedding M C R N . More generally, if K;,=E %(M)* is defined, for x E M , i = 1,2, . . ., N and multi-index a = (al, . ., aN),by

x = c X"X)- a N

Kt,.(X) = D ; X i ( x ) ,

i=l

ax6

E

X ( M ),

then, for any M-valued ($=,)-predictable processes x , and y,,

We shall now prove the uniqueness of the solution $,(x) = (+f(x))of (5.4) and diffeomorphic property of x 3 $,(x). For this, take any integer m > 0 and set

Stochastic Flows of Diffeomorphisms

323

(f)(x) XE .%M) 9

E

M , by

*

The local characteristic of Y , is obviously {$,*(As+),q5$(B"'")}. Debition 5.3.

This regular I(M)-valued semi-martingale Y , is

33 1

Stochastic Flows of Diffeomorphisms

denoted by

Theorem 5.6. Let {$](t)}and {Ibz(t)}be regular G-valued semi-martingales generated by regular S(M)-valued semi-martingales X l ( t ) and X z ( t ) respectivetly. Then $l(t) o q&(t) is a regular G-valued semi-martingale generated by the regular %(M)-valued semi-martingale

, G Proof. Let F f , x ( $ l$z):

x G +. R be defined, for f E Cm(M)and

x E M , by

D24Ff3”($l,$z)(X, Y) = [($;’)*(

Y ) I ( [ ( $ ~ l ) * ( ~ ) l ( ~ ) $z(x)> )($l * O

Then the assertion of the theorem can be easily deduced from Th. 5.5. Details are omitted. Corollary 1. Let X l ( t ) and X z ( t ) be regular %(M)-valued semimartingales and $‘(t) be a regular G-valued semi-martingale generated by X l ( t ) (uniquely determined up to the value $l(0)). Let

and $2(t)be a regular G-valued semi-martingale generated by $(t). Then #3(t)= $l(t)o $z(t)is a regular G-valued semi-martingale generated by the regular S(M)-valued semi-martingale Xl(t) X2(t).

+

Corollary 2. Let +(t) be a G-valued semi-martingale generated by a regular S(M)-valued semi-martingale X ( t ) . Then $ - l ( t ) is a regular G-

Y.LE JANAND

332

S. WATANABE

valued semi-martingale generated by the regular X(M)-valued semi-

1:

martingale -

$*(s)(dX(s)).

Corollary 3. The totality of all regular G-valued semi-martingales is closed under the composition and the inverse.

References P. Baxendale, Wiener processes on manifolds of maps, Proc. Royal SOL Edinburgh, 87A (1980), 127-152. -, Brownian motions in the diffeomorphism group I, Univ., Aberdeen (1982). Ju. N. Blagovescenksii and M. I. Freidlin, Some properties of diffusion processes depending on a parameter, Soviet Math., 2 (1961), 633-636. J. M. Bismut, MCcanique altatoire, Lect. Notes in Math., 860, Springer, 1981. K. D. Elworthy, Stochastic dynamical systems and their flows, Stochastic Analysis ed. by A. Friedman and M. Pinsky, 79-95, Academic Press, 1978. I. I. Gihman and A. V. Skorohod, Stochastic differential equations. Springer, 1972. T. E. Harris, Brownian motions on the homeomorphisms of the plane, Ann. Prob. 9 (1981), 232-254. N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, North-Holland/Kodansha, 1981. H. Kunita, On the decomposition of solutions of stochastic differential equations, Stochastic Integrals, Lecture Notes in Math. 851, Springer, 1981. H. H. Kuo, Gaussian measures in Banach spaces, Lecture Notes in Math. 463, Springer, 1975. Y. Le Jan, Flot de diffusions dam Rd. C. R. Acad. Sci. Paris 294 (1982), Strie I, 697-699. P. Malliavin, Stochastic calculus of variation and hypoelliptic operators, Proc. Intern. Symp. SDE. Kyoto ed. by K. ItB, 195-263, Kinokuniya, 1978. M. Metivier and J. Pellaurnail, Stochastic Integration, Academic Press, 1980. S. Watanabe, Stochastic flows of diffeomorphisms, Proc. Fourth Soviet, Japan Symp. on Prob., Lecture Notes in Math. 1021, Springer, (1983), 699-708. YVESLE JAN LABORATOIRE DE CALCUL DES PROBABILITIES UNIVERSITEPIERRE & MARIECURIE PARIS.FRANCE

SHINZO WATANABE DEPARTMENT OF MATHEMATICS KYOTOUNIVERSITY KYOTO606, JAPAN


Some Recent Results in the Optimal Control of Diffusion Processes Pierre-Louis LIONS

5 0.

Introduction

We consider here optimal stochastic control problems where the state of the system is supposed to be given by the solution of a stochastic differential equation : (1)

dX, = u(X,, ct,)dB,

+ b(X,, a , ) d t ,

X,

=

x E RN

where B, is an rn-dimensional Brownian motion, u(x, a), b(x, a) are given matrix and vector-valued functions on RNX V -where V (called the set of the values of the control) is a separable metric space. We stop the state processes X,at the first exit time r from a given domain U and we consider cost functions given by

We next define the value of the control problem (also called the optimal or the minimum cost function):

(3)

u(x) = inf J(x,a,),

Vx E B

.

at

The heuristic argument of dynamic programming - due to Bellman [4] -indicates the u “should be related to the solution” of the following partial differential equation of elliptic type - called the Hamilton-JacobiBellman equation (HJB in short) -:

(4)

sup {A,u(x) - f(x, a)} = 0

in U

uEY

and u should satisfy on 8U (or on some part of aU): u = y. Here A , is the linear, second-order, elliptic (possibly degenerate) operator given by A , = -a,,(x, a)8,,- bt(x, a)8, c(x, a) and a = 1/2 ud-here and below

+

P. L. LIONS

334

we use the convention on repeated indices -. In addition, formally, if a “smooth” solution ii of (4) is known (with appropriate boundary conditions) then not only ii 3 u but also one can build optimal controls in the so-called feedback form (or markovian controls). This is why we will be interested in what follows by the relations between u and (4). Our purpose here is to review recent results on that equation mostly obtained by the author (see Lions [31], [32], [33]). In section I we briefly describe the controls problems we want to look at and recall the classical methods for those problems. Section I1 is devoted to various “elementary” properties of the value u such as continuity properties, characterization in terms of maximum subsolution, density of particular classes of control processes. In section I11 we consider various applications of the notion of viscosity solutions of HJB equations (introduced by Crandall and the author [ll], [12] in the case of first-order Hamilton-Jacobi equations): in particular we recall the observation (due to Lions [32], [34], [35]) that the value function u is always a solution of (4) in viscosity sense. In addition it is the unique solution of (4) in that sense -with prescribed values on

ao -.

Section IV is devoted to regularity results of u which extend and are of the same type then those initiated by Krylov [22], [23], [24], [25]. Those results are taken from [33] (see also [35], [36]). In section V we consider the problem of reflected processes and indicate briefly a few regularity results Finally section VI is devoted to the fundamental inequalities due to Krylov [26], [24] (cf. also Pucci [53] and the pioneering work of Alexandrov [l]). We propose here an approach to these inequalities which-even if not totally different from Krylov original proof-sheds some light- we believe- on the nature of these inequalities and the relations with control problems. We also derive from these inequalities some new estimates which insure, in particular, the existence of regular transition probability densities for homogeneous non-degenerate diffusion processes with bounded, Bore1 measurable coefficients. Those results have to be compared and complete the counter-example due to Fabes and Kenig [ 171. SUMMARY

5 I.

Position of the problem 1.1 The time-independent problem. 1.2 The time-dependent problem. 1.3 Classical methods.

Optimal Control

8 11.

8 111.

8 IV. 8 V.

8 VI.

335

Preliminary properties of the value function 11.1 Continuity of the value function. 11.2 Maximum subsolution. 11.3 Remarks on the control processes. Viscosity solutions of HJB equations 111.1 The Dynamic Programming Principle. 111.2 Uniqueness of viscosity solutions. 111.3 Various remarks. Regularity and Uniqueness IV.l Regularity results. IV.2 Uniqueness results. Reflecting boundary conditions V.l Description of the problem. V.2 Regularity results. Application to Krylov inequalities VI.1 Description of the main result. VI.2 Proof and remarks.

0 I. Position of the problem 1.1 The time-independent problem c, f

Let 0 be a smooth open set in RN (possibly RN itself) and let be given functions on RN x V- satisfying

+

0,

b,

where = otj, b,, c, f (1 < i < N , 1 < j < m). Finally to simplify the presentation we assume that (o E W s 9 3 O ) . We define an admissible system d as the collection of i) a probability space ( O , T , 2Ft, P) with the usual properties, ii) an m-dimensional 9, adapted Brownian motion B,, iii) a progressively measurable process at taking its values in a compact set of V- (depending on cu,). Because of (3, for each admissible system, there exists a unique continuous process solution of (1) (with initial position x in R N ) . We denote by 'c the first exit time of X , from d 'c

= inf { t

2 0, X , 4 6 )

(=+m

if X , E d for all t

0)

P. L. LIONS

336

and we will assume that inf{c(x,a))xER N ,a~ V }= R

(6)

>0.

In all the results below concerning the time-independent problem (or stationary problem) we will always assume (5), (6) and we will not recall it. We finally define the cost function for each x E d and admissible system d :

The optimal cost function or the value of the stationary control problem is given by

u(x) = inf {J(x,a)I d :admissible system}

(8)

for all x

E

8.

1.2 The time-dependent problem Let T > 0 be fixed, we denote by Q = 0 x 10, T [ . We now assume that the coefficients 0,b, c , f , depend on t E [O, TI and satisfy

I]+(*, t, a)IIwa.-(BN) < 03 +(. , - ,a) is uniformly continuous on RN x [0, TI, +(., t, a) E W Z , m ( R N ) ,SUP (I

€ V ,t

€[O,TI

uniformly in a E "Y . +(x, t, .) E C ( V )

for

+ = ot,, b,, c, f

(1

for all x E RN , t

E

[0, TI

,

-

03;

T ) 3 w2q0) .

In all the results below concerning the time-dependent problem, we implicitly make the above assumptions.

t

E

We define admissible systems exactly as before and for each x E R N , [0, TI we consider the solution X,( =X,(x, t ) ) of

(10)

dX, = a(X,, r

+

S,

a,)dB,

+ b(X,, t +

S,

a,)ds,

X, =

X.

337

Optimal Control

We then define the corresponding cost function

X

(71

+ X

and the value function of the time dependent control problem is given by (89

u(x, t ) = inf {J(x,t, d)[ d :admissible system} .

Remark 1.1. It is worth remarking that the time-dependent problem may be viewed as a special case of the time-independent problem by looking at the problem: 2, = ( X s , Y,) where Y,= t s, the domain Q = 0 X 10, T [ and remarking that 7 A (T - t ) is the first exit time from 0 of 2,.

+

1.3 Classical methods We just want to recall here two facts which can be found in Fleming and Rishel [18] (see also Bensoussan and Lions [5], Krylov [24], Lions [38]): if we know a priori that u given by (8) belongs to C’(0)- that is if the value function of the stationary problem is smooth - then u solves the HJB equation sup {A,u(x) - f ( x , a)} = 0

(4)

in 0 .

a€
l(Q)then u solves the equation (49

au -at

+ sup {A,u(x, t ) - f ( x ,

t, a)} = 0

in Q

.

Conversely if ii E C’(6) (resp. Cz,l(Q)) solves (4) (resp. (4’)) and satisfies: z2 = (D on a0 (resp. ii = p on a,Q = (a0 x [0, TI) U (6 x { T } ) , then u is the value of the associated control problem. Those results are easily proved by the use of ItB’s formula and are essentially verification theorems. Unfortunately they require either the a priori knowledge of the smoothness of the value or the existence of a smooth solution of the fully nonlinear degenerate elliptic-parabolic equations (4) or (4’). Of course when a does not depend upon a and when a is

P. L. LIONS

338

uniformly positive definite, (4) and (4') are only quasilinear uniformly elliptic-parabolic equations and it is possible to solve them directly by standard p.d.e. considerations (see 1181, [5] for more details). However in the general case considered here classical results in p.d.e. theory do not apply and furthermore easy examples show that u (the value) is not, in general, of class C'.

Q II. Preliminary properties of the value function II.1 Continuity of the value function We want to describe here a few conditions which insure that u given by (8) is continuous on d and that u = 9 on a0 (or at least on some part of a@). Of course some conditions are needed since it is well-known that even in the very special case when there is no control (i.e. when u, b, do not depend upon a) u corresponds to the solution of linear degenerate Dirichlet problem and may well be discontinuous. The condition we introduce is an extension of the condition: f(x, a)>O on b x 7". Let us first discuss this special case: remark first that r as B function of x is U.S.C. (a.s.) and thus quantities like

are also U.S.C. with respect to x (as.) iff is nonnegative. Next u being an infimum of U.S.C. functions is also U.S.C. We now introduce our main condition : we assume that there exists closed subset possibly empty, of = 8 0 ; a Bore1 bounded function w on 8 satisfying

r+,

( V d , vx E

r

8 ;P [ r < 03, X , 4

r,]= 0; l,,,,,(w - y)(X,) Q 0 a s . ,

Vx E r,, lim inf w(y) > ( ~ ( x,) u8d.Y-x

I

(is a StAr-submartingalesatisfying for any bounded stopping times

4 G 82: E"0,I 2 E"d

*

Theorem II.1. We assume (10). Then we have i) For all admissible system d , J( .,d)is U.S.C. in b and thus u is U.S.C. on 8. ii) u win 8, u (D on andfor a l l d , for all X E b

>

r,;

l~r

vx E a,vy E r / .

iJ' R < R,, r E (0,1) if R

= A,, Y = 1 i f

20.

Remark 11.3. In Lions [31], it is shown that this exponent is in general optimal. The fact that, for related problems, the modulus of continuity of u depends in an essential way on the ratio A/& was first remarked in Lions [39], [40]. We give below examples of situations where is closed and (15) holds: other examples may be found in [31]. Let us immediately notice that if 0 = RN (or if (12) holds with = 80) then = = q5 and all conditions are automatically satisfied. Thus in this case u E Co,'(RN)with the above value of 7.

r'

r-

r' r+

Corollary II.2.

(16)

W e assume (1 l), (12) and

0, 3a, > 0 , p ( x ) >, a,

if

Finally we set for (x, t ) E RN x [0, TI, E

>0

V6

dist (x, 8 ) 2 6 .

+ s, a,)

&(x, t ) = inf E [ r - ' h(Xs, t .d

where h = f - g. Using Krylov's result afore mentioned, one sees that ii*(x,t ) = inf J,(x, t, dM) dN

where J,(x, t, d)is the above cost function. Clearly as E J 0+, J,(x, t, d)decreases to

X exp

(-s'

c(X,, t

+ a, aJdg

)I ds

and we may pass to the limit in the infima to find ~ ( x t, ) = inf J(x, t , st) = inf J(x, t, d,). .d

.dM

Remarking finally that in view of Itb's formula, we have for every (x, t ) E Q and for every admissible system d

P. L. LIONS

346

we deduce ii(x, t ) = u(x, t ) - w(x, t ) = inf J(x, t, dM) - w(x, t ) ; d.u

and we conclude.

9 III.

Viscosity solutions of HJB equations

111.1 The Dynamic Programming Principle Recently, Crandall and the author [Ill, [12] introduced a new notion of solution of first-order Hamilton-Jacobi equations : with this notion general uniqueness, stability and existence results were proved (cf. 1121, [lo], [39]). In [39], the author pointed out the intrinsic relations between this notion of “viscosity solutions” of HJ equations with the Dynamic Programming Principle. It is very straightforward to adapt this notion to the case of second-order equations (cf. Lions [32], [34], [41]) and to preserve very general stability results (and existence results too) but the main open question for second-order equations is the uniqueness of viscosity solutions. This uniqueness question is solved in Lions [32] for the class of HJB equations and we recall below the typical uniqueness results (section 111.2 below). We first recall briefly here one possible definition of viscosity solutions of HJB equations and we show how the heuristic derivation of HJB equation is made rigorous by the use of viscosity solutions. Combined with the uniqueness results of ectison 111.2 below this shows that the value functions of stochastic control problems are always solutions of HJB equations in viscosity sense and that this characterizes those value functions. For more details and properties of viscosity solutioons we refer the reader to the works mentioned above. Definition III.1. Let u E C(0); u is said to be a viscosity subsolution (resp. supersolution) of the HJB equation (4)

sup {A,u - f ( x , a)} = 0

(4)

in 0

+

if the following holds for all E Cz(0): let x, be a local maximum point of (u - +) (resp. minimum point) then we have SUP

[-a&,,

+

a)&,+(x3 - bdxo, cu>W(xo)

C ( X 0 , .)U(XO)

- f(X0, a11

O

+ ++

where 6 = u(xo)- +(xo). Replacing by 6, we may assume without loss of generality that 6 = 0 and using ItB's formula (recall that E C2(b))we deduce

and letting t + 0 SUP {-a&,, a

+

+, we deduce easily

Or)&,+(xo)- bdxo, 4&+(xo)

+

Remark 111.3. One sees that, because of the very definition of viscosity solutions of (4), one takes advantage of the fact that x, is a minimnm point of u - to replace u by and to carry out the usual derivation of HJB equations since 11. is C2. Doing so we obtain only an inequality instead of the equality.

+

III.2

+

Uniqueness of viscosity solutions

Our main result is the following:

P. L. LIONS

348

Theorem IU.2. i ) Let ii E C(0) be a viscosity solution of (4). Then for all 6 > 0, for all x E Ga, we have

E(x) = inf E[S;df ( X c ,at)exp .d

(-1:

c)dt

+ i7(X,,) exp

ii) Let i7 E C(Q) be a viscosity solution of (4'). T - S,] we have

> 0 for all x E Bdl,t E [a,

Then for all S,,6,

Of course the main applications of the result are the following comparison-uniqueness inequalities : Corollary III.l. then we have

Let u, u E C,(B) be two viscosity solutions of (4)

sup (u - u)' I

where 00,

< sup (u - u)' r'+

,

r: is a closed subset of r such that for x E 8, and for all d :P(r'
+(x)}, u is a viscosity solution of (4)

sup {A,u(x) aEr

- f ( x , a)} = 0

Then using Theorem 111.2, we find

in 0 .

Optimal Control

351

and by the choice of 0

where ~(6)-+ 0 as 6 +- 0 u(x)

+.

And this yields

< P(x) + ~(6)

for all 6

>0.

The reversed inequality is obtained by the use of a standard penalty argument in the theory of obstacle problems and optimal stopping problems. Let E > 0, 6 > 0; we introduce ,6‘,a c C”(R) satisfying

/?:la < 0 on R,P J t ) /.,a

= 0 if I2 0, /?:,a(t)2 0 on R

1

converges uniformly, as 6 goes to 0+, to -(t A 0) = P 6 ( t ) . E

We then set

where the supremum is taken over all progressively measurable processes 6, satisfying: 0 6, < 1 a.s. Using the results of Lions [33], we see that one can find uI,a,yE C:(RN)satisfying

0, then for E Q E ~ 6, a,,(€), Y Y,(E, 6) we still have: maxRN(u.,a,v- u) > 0. Let x, be a point of maximum of u.,d,u- u. Using the definition of viscosity solutions with u,,a,” we find

Al which is in general necessary: see the example of Genis and Krylov [19]. Let us give a very simple example showing the necessity of this assumption: take 0 = ] - 1, 1[, V = {ao}, u ( x , a , , ) = O , b ( x , a o ) = +x in [-1 , +l],c(x,ao)=~,f(x,or,)=O,~=l. Then = = { - 1, + 1) and (30) holds, R1 = 2 and it is easily checked that u(x) = jxr in [ - 1, 11. Therefore (31) holds if and only if A 2 2

+

r rz

+

Optimal Control

353

i.e. A 2 A,! However in the nondegenerate case this assumption on b is not necessary: see Theorem IV.2 below. Using the fact that we know (section 111.1) that u is a viscosity solution of the HJB equation, one deduces easily from the above result: CorolIary IV.l. function u satisfies

Under the assumptions of Theorem IV.1, the value

and the HJB equation holds:

sup {A,u(x) - f ( x , a)} = 0

(33)

a.e.

in 0

=CY

Another application of Thoerem IV.l is the following: Corollary IV.2. Under the assumptions of Theorem IV.l and if we assume that there exist an open set OJ c 0, an integer p E { 1, . . ., N } , a constant v > 0 such that

{5 e,

V X E O J , I,&Y,, ~ ~ ~ ...,cw,

(34)

i=l

= 1,

2 oiakl(x,

ai>Ekf,

i=l

Then dlfuE L"(oJ) for 1 < i, j As we said before, R Theorem N.2.

(35)

3v

> 0,

~~:38,,...,8,~]O,I[suchthat

2v

for all

t E RN .

i=l

A, is necessary except in the following case:

We assume V ( X ,cw)

E

B x

V , a(x, a) 2 vJN .

Then u E W2,m(0)n C2ir(0)(for some 'i and the HJB equation holds.

11 a \ I m )

2 ti

E

(0, 1) depending only on 0, v,

Remark IV.2. Theorem IV.l and Corollary IV.1.2 are due to Lions [33], [34], [35]. The W2,"regularity in Theorem IV.2 is due to Lions [44], [45] and Evans and Lions [16], while the higher C2,rregularity was recently obtained by Evans [13], [14]: all those results being obtained by purely p.d.e. methods. Partial results were previously obtained by different methods by Krylov [22], [23], [24], BrCzis and Evans [7]; Evans and Friedman [15]; Lions and Menaldi [49], [50]; Safonov [54], [55]. The special case 0 = RN was treated independently by Krylov [25], [27] and Lions [46], [47] by different probabilistic methods.

P. L. LIONS

354

We first explain how the assumption on I naturally comes into the proof of (31) in the special case 0 = RN:indeed assume, to simplify, the coefficients u, b, c, f are smooth, then by standard results on stochastic differential equations we have for all t 2 0

for all d , for all x E RN,and for all t E R N ;\ E l = 1, where K depends only on SUP, {Ilullwa.llbllw~.-- IICllwa.I l f Ilw..-}Therefore if I > I ,

+

+

+

x [O, TI) ;

;

Diu E L2(U,X (0, T ) ) , - C

au < A,u < C + __ at

in 9‘(Q)for all (Y E V

355

Optimal Control

for all 6

> 0.

(39)

au -~ + sup {A,u(x, t ) - f ( x , t , a)} = 0

Furthermore the HJB equation holds a.e. in Q .

at

Finally if a, b, c, f a r e Lipschitz in t uniformly for x E 8, a E V then au/at E L"(Q) and thus supuE, I I A . u I ( ~ ~< ( ~co. ) IV.2 Uniqueness results

At this point let us recall a few results proved above: we have seen that the value functions are viscosity solutions of HJB equations and that in some cases the value functions have some regularity properties and the HJB equations hold "in an a.e. sense". But in view of the example mentioned below, there may be many solutions having this regularity and satisfying the HJB equations in that sense. To solve this nonuniqueness question, we already proposed the notion of viscosity solutions;we present now a different approach taking into account conditions like (31) which are satisfied by the value function.

0, u,(x) = (1 - ,de-^l"')/Ais smooth except at 0, u, E W'."(RN)and the HJB equations hold in RN - (0): lDuaI Theorem IV.4.

(20) (21)

+ Ru, = 1 .

Let ii E C(0) fl W:$(0) satisfy

< f( ., a) Aii < g

A,ii

in B'(O),

for all a E Y

in 9'(0), where g

E

L;Y,(O).

In particular (20) implies that pa = A& - f ( ., a) is a nonpositive measure and we assume that the HJB equation holds in the sense of measures: sup pa = 0 . UEV-

Then we have for all 6

> 0 and for all x E 6,

Of course exactly as Theorem 111.2 implies Corollaries 111.1-2, in the same way the above result implies various uniqueness and comparison results. The above example shows that some assumption like (40) is nec-

P. L. LIONS

356

essary and that Lzc is the "best" space since ug E W',m(RN) satisfies (20) and D2ugE Lp(RN)for all p < N (D2u, E M N ( R N ) ) . The proof of this result involves careful estimates on the (in) equations satisfied by convenient regularizations of u" and uses heavily the probabilistic estimates of Krylov [24] -see Lions [33] for more details -. A'weaker form of this result appeared in Lions [36], [37]. We now explain how this result is adapted to the case of timedependent problems : Theorem IV.5. aii --

(20')

at

3C

Let ii

+ A,ii

> 0,

3g

E

E

C ( Q ) satisfy D,ii

L;Y(Q) and

in B'(Q), for all

0

(where Supp 'p c BR). And this implies obviously

Proof of Theorem VI.2.

i)

Let p E C+(D),we consider u(x, t ) = > 0. Using the

&((P(-Y~)I(~~~)), and u(x, t ) = t " ' ~ ( x , t ) for x E D, t Markov property we deduce u(x, t ) = E , [ r A * ( N

+ l)(t - s ) ~ u ( X ,t, - s)dr] .

And using Krylov estimate [24] (the time dependent version of (45))

3 63

Optimal Con:rol

and this yields

and (48) is proved. ii) Next let ’p be a bounded Bore1 function on RN,we consider u(x, t ) = EZ(‘p(Xt)). Assume ‘p 2 0 to simplify, then we have

+

where is the first exit time of X , from x B,. Next, using the scaling invariance of the Brownian motion we see that C,

< CRZN+l,

In addition it is easy to show that for t P,(t

if R

1.

>0

> r,) < C exp ( - a R / J t )

(use for example Lemma 2.2 in Krylov [24]), for some C, a is fixed, assume that the support of ‘p is contained in x have for R 2 R, 0

C < u(x, t ) < FR*N+lIIyIILN

> 0.

If Ro > 0

+ BRo;then we

+ c exp (-uR/JT)II’pIILm.

This implies that P(t, x ; dy) = p ( t , x, y)dy with p ( t , x,J J )E L:(RN)and we find

In what follows we denote by C, a,Q, various constants (dependent of t ) independent of x; we denote by h(y) = 1(Z+BRo)(y)p(t, x, y ) and by p(s) =

364

P. L. LIONS

meas (h 2 s) for s > 0. We apply the above inequality with 9 = and we find

or

Using Young’s inequality, this yields

Therefore for all q

< N / ( N - l), we deduce p(s) < cs-*

and we conclude easily.

References A. D. Alexandrov, Uniqueness conditions and estimates for the solution of the Dirichlet problem, Amer. Math. SOC.Transl., 68 (1968), 89-119. Majorization of solutions of second-order linear equations. Amer. Math. SOC.Transl., 68 (1968), 120-143. Majorants of solutions and uniqueness conditions for elliptic equations, Amer. Math. SOC. Transl., 68 (1968), 144161. -, Dirichlet’s problem for the equation Det llzijll = Wzl, . . ., zn, Z, XI, ..., xn), I., Vestnik Leningrad Univ. Ser. Mat. Mekh. Astr., 13 (1958), 5-24, (in Russian). I. Bakelman, Generalized solutions of the Dirichlet problem for the Ndimensional elliptic Monge-Amphre equations, preprint. R. Bellman, Dynamic Programming, Princeton Univ. Press., Princeton, N. J., 1957. A. Bensoussan and J. L. Lions, Applications des intquations variationnelles en contr6le stochastique, Dunod, Paris, 1978. -, Contr8le impulsionnel et in6quations quasi-variationnelles, Dunod, Paris, 1982. H. Brtzis and L. C. Evans, A variational inequality approach to the Bellman-Dirichlet equation for two elliptic operators, Arch. Rat. Mech. Anal., 71 (1979), 1-13. S. Y. Cheng and S. T. Yau, On the regularity of the Monge-Ampsre equation det (aau/axiaxj) = F ( r , u ) , Comm. Pure Appl. Math., 30 (1977), 4168.

J. M. Coron and P. L. Lions, to appear. M. G. Crandall, L. C. Evans and P. L. Lions, Some properties of viscosity

Optimal Control

11 11

1171

P11 1221

1271 281 1291 1301 1311

t321

1331

3 65

solutions of Hamilton-Jacobi equations, to appear in Trans. Amer. Math. SOC.(1983). M. G. Crandall and P. L. Lions, Condition d'unicitt pour les solutions gtneraliskes des Cquations de Hamilton-Jacobi du premier ordre, ComptesRendus Acad. Sci. Paris, 252 (1981), 183-186. -, Viscosity solutions of Hamilton-Jacobi equations, Trans. Arner. Math. SOC.277 (1983), 1 4 2 . L. C. Evans, Classical solutions of fully nonlinear, convex, second-order elliptic equations, Comm. Pure Appl. Math., 35 (1982), 333-364. -, Classical solutions of the Hamilton-Jacobi-Bellman equation for uniformly elliptic operators, preprint. L. C. Evans and A. Friedman, Optimal stochastic switching and the Dirichlet problem for the Bellman equation, Trans. Amer. Math. SOC., 253 (1979), 365-389. L. C. Evans and P. L. Lions, RBsolution des Cquations de Hamilton-JacobiBellman pour des opirateurs uniformiment elliptiques, Comptes-Rendus Acad. Sci. Paris, 290 (1980), 1049-1052. E. B. Fabes and C. Kenig, Examples of singular parabolic measures and singular transition probability densities, Duke Math. J., 48 (1981), 845856. W. H. Fleming and R. Rishel, Deterministic and stochastic optimal control, Springer, Berlin, 1975. I. L. Genis and N. V. Krylov, An example of a one-dimensional controlled process, Th. Proba. Appl., 21 (1976), 148-152. N. Ikeda and S. Watanabe, Stochastic differential equations and diffusion processes, North-Holland, Amsterdam, 198 1 . K. It& t o appear. N. V. Krylov, Control of a solution of a stochastic integral equation, Th. Proba. Appl., 17 (1972), 114-131. -, On control of the solution of a stochastic integral equation with degeneration, Math. USSR Izv, 6 (1972), 249-262. -, Controlled diffusion processes, Springer, Berlin, 1980. -, Control of the diffusion type processes, In Proceedings of the International Congress of Mathematicians, Helsinki, 1978, 859-863. -, Some estimates in the theory of stochastic integrals, Th. Proba. Appl., 18 (1973), 54-63. -, Some new results in the theory of controlled diffusion processes, Math. USSR Sbornik, 37 (1980), 133-149. -, On the selection of a Markov process from a system of processes and the construction of quasi-difEusion processes, Math. USSR. Izv., 7 (1973), 691-708. N. V. Krylov and M. V. Safonov, An estimate of the probability that a diffusion process hits a set of positive measure, Soviet Math. Dokl., 20 (1979), 253-255. -, A certain property of solutions of parabolic equations with measurable coefficients, Math. USSR Izv., 16 (1981), 151-164, (in Russian). P. L. Lions, Optimal control of diffusion processes and Hamilton-JacobiBellman equations, Part 1, The Dynamic Programming Principle and applications, Comm. P.D.E., 8 (19831, 1101-1174. -, Optimal control of diffusion processes and Hamilton-Jacobi-Bellman equations, Part 2, Viscosity solutions and uniqueness, Comm. P.D.E. 8 (1983), 1229-1276. -, Optimal control of diffusion processes and Hamilton-Jacobi-Bellman equations, Part 3, Regularity of the optimal cost function, In Nonlinear Partial Differential Equations and applications, Colllge de France Seminar, Vol. V, Pitman, London, 1983.

P. L. LIONS

366 1341 -,

Optimal stochastic control of diffusion type processes and HamiltonJacobi-Bellman equations, In Advances in Filtering and Optimal Stochastic Control, Ed. W. H. Fleming and L. Gorostiza, Springer Lecture Notes in Control and Information Sciences, Berlin, 1982. -, ContrBle optimal stochastique et tquations de Hamilton-Jacobi-Bellman, Comptes-Rendus Acad. Sci. Paris, 295 (1982), 567-570. -, Equations de Hamilton-Jacobi-Bellman dCgtnBrCes, Comptes-Rendus Acad. Sci. Paris, 289 (1979), 329-332. -, Equations de Hamilton-Jacobi-Bellman, In Se'minaire GoiilaouicSchwartz 1979-1980, Ecole Polytechnique, Palaiseau. -, On the Hamilton-Jacobi-Bellman equations, Acta Applicandae, 1 (1983) 17-41. -, Generalized solutions of Hamilton-Jacobi equations, Pitman, London, 1982. -, Existence results for first-order Hamilton-Jacobi equations, Ricerche Mat., 1983. -, Fully nonlinear elliptic equations and applications, Proceedings of the Function Spaces and Applications Conference in Pisek, Teubner, Leipzig, 1982. -, Sur le tquations de Monge-Amptrre, I. To appear in Manuscipta Math. 41 (1983) 1-43. -, Sur les tquations de Monge-Amtrre, 11. To appear in Arch. Rat. Mech. Anal. (1983). -, Rdsolution des probltrmes gtntraux de Bellman-Dirichlet, ComptesRendus Acad. Sci. Paris, 287 (1978), 747-750. -, Rtsolution analytique des problkmes de Bellman-Dirichlet, Acta Math., 146 (1981), 121-147. -, ContrBle de diffusions dans RN, Comptes-Rendus Acad. Sci. Paris, 288 (1979), 339-342. -, Control of diffusion processes in RN,Comm. Pure Appl. Math., 34 (1981), 121-147. P. L. Lions and J. L. Menaldi, Optimal control of stochastic integrals and Hamilton-Jacobi-Bellman equations, I, SIAM J. Control Optim., 20 (1982), 58-81. -, Optimal control of stochastic integrals and Hamilton-Jacobi-Bellman equations, 11, SIAM J. Control Optim., 20 (1982), 82-95. -, Problkmes de Bellman avec le contrde dans les coefficients de plus haut degr6, Comptes-Rendus Acad. Sci. Paris, 287 (1978), 503-506. P. L. Lions and A. S. Sznitman, Stochastic differential equations with reflecting boundary conditions, preprint. A. V. Pogorelov, On the Minkowski multidimensional problem, I. Wiley, New York, 1978. C. Pucci, Limitazioni per soluzioni di equazioni ellitiche, Ann. Mat. Pura Appl., 74 (1966), 15-30. M. V. Safonov, On the Dirichlet problem for Bellman's equation in a plane domain, Math. USSR Sbornik, 31 (1977), 231-248. -, On the Dirichlet problem for Bellman's equation in a plane domain, 11, Math. USSR Sbornik, 34 (1978), 521-526. D. W. Stroock and S. R. S. Varadhan, On degenerate elliptic-parabolic operators of second-order and their associated diffusions, Comm. Pure Appl. Math., 25 (1972), 651-714. -, Diffusion processes with boundary conditions, Comm. Pure Appl. Math., 24 (1971), 147-225. -, Multidimensional diffusion processes, Springer, Berlin, 1979.

Optimal Colltrol UNIVERSTE PARIS IX-DAUPHINE, CERE MADE PUCE DE LAITRE DE TASSIGNY 75775, PARIS, CEDEX 16 FRANCE

367


Implicit Functions in Finite Corank on the Wiener Space Paul MALLIAVIN

0 0. Introduction In R",integration relative to the Lebesgue measure may, sometimes, use the differentiable structure of the space; for instance disintegration along a family of submanifold, method of stationary phase and critical points. A definition of smooth functions on the Wiener space have been worked out from sometimes [4], [6], [7]. The advantage of this definition is that Wiener functionals coming from the integration of S.D.E. are smooth. The main application of this smoothness have been to prove regularity result for the law of some Wiener functionals. It could seem now appropriate to push forward some kind of differential calculus on the Wiener space in order to obtain some insight in the disintegration of Wiener measure by finite dimensional valued Wiener functional. One difficulty to apply standard implicit function theorem is that the Wiener functionals are not continuous for any Banach norm. Some kind of redefinition is therefore necessary. We shall introduce a scale of capacities and the notion of slim set, that is a set of capacity zero for all the scale of capacities. Then smooth function could be redefined outside a slim set, and outside this set there are continuous (with their derivatives) relative to the Banach space norm. Furthermore the projection of a slim set by a linear projection of finite corank is a slim set on the image of X . With these ingredients an implicit function theorem will be proved in the last section of this work. (Further developpements in [8], [9]) I am very grateful to the warm hospitality of my Japanese colleagues during this conference and also to the generous support of Taniguchi foundation which makes possible this meeting which has been for me an important occasion of exchange of ideas. 1.

2. 3. 4.

Contents. Smooth functions on the Wiener space. Pseudo-direct sum decomposition, Sobolev spsces and slim sets. Redefinition of smooth functions. Implicit function theorem.

P. MALLIAVIN

370

8 1.

Smooth functions on the Wiener space

We shall recall in this section known results. We denote by X the space of continuous applications of [0, I] in a finite dimensional space, vanishing for r = 0. The Brownian motion defines on X a Gaussian measure p. Finally we shall take on X the uniform topology and we shall call ( X , p) the Wiener space. We denote by H the Cameron-Martin subspace of X :

We denote

H, = { x E H ; x”(r) is a measure} . For h E H, we define ( 4 h ) = x( 1). h’( 1) -

s’

~ ( 7 h”(t)dr ) .

.

Then we shall obtain in this way a continuous linear form on X . Given an orthonormal basis e,, . . ., en, . . of H, such that e, E H,, we consider

hn(x) =

Il hni((x, ei))

>

--

where h, are the Hermite polynomials on R,n = (n,, -,n,, . . .) is a multi-index with n, 2 0 and In I = C n, < 00. We denote by Sqthe space generated in L 2 ( X )by the h, with In I = q. Then we have the Wiener’s chaos orthogonal decomposition :

+

L y X ) = 02Pq. P

We denote by B the finite linear span of the Sq, then B is a dense vector subspace of L2(X). We denote by 8,the totality of elements of 9 of degree < q. We define the Ornstein-Uhlenbeck operator 9 on X qby

9 h n = -InIh, and on B by linear combination. More generally if G is a separable Hilbert space, g,, . . .,g , , . . . an orthonormal basis of G , i f f € L2(X,G), we shall write

f=C h . Then

Implicit Functions in Finite Corank

For p

=

011

the Wiener Space

371

2 we have lIfll2L2LY,G)

=

c

Ilfill2ll(X)

.

We denote by B,(G) the space of functionsfc L2(X,G ) such that

fi E 9,

for all I .

Then i f f € P,(G), 2 Y f ~ Lp(X,G). We denote B(G) = U, B,(G) and we define aP(S?, G) as the domain of the closure in Lp(X, G) of 9 defined G) is defined. Given h E H, and f c on B(G). In the same way gP(.Yr; B(G)the derivative off along the vector h is well defined : ( V , f ) ( x ) = lim e - ' { f ( x

+ eh) - f ( x ) } .

Then +(

VhfXX)

is a linear map of H into G. We shall denote by [llVfllaBQits HilbertSchmidt norm. We define in the same way 9p(V;:G):asthe domain of the closure of V for the norm

We can consider the Hilbert space G,=G@H

with the Hilbert-Schmidt norm. Then

Of: X + G , . We can iterate the process and define VZf = V ( V J ) ,

Vrf = V ( P - l f )

and the norm

.

where H , = symmetric tensor product of H 0 0H ; we define the domain g P ( V r ;G'), as the closure of V r defined on B ( G ) under previous +

P. MALLIAVIN

372

norm. (The Hilbert-Schmidt norm defines on H 8 G an Hilbertian structure). Theorem. (Meyer's inequalities [5]). Let 1 < p

Z)dPz(4 Z

Now we integrate in y , by Fubini’s formula

< c{Ii

I~Pz~IILP(x;G)

ri7fII~~(X,~~@~)}

Using again, but in reverse way, the Meyer’s inequalities we obtain Il2ZfllL.p

0, p absolutely continuous relative to dp contradiction which proves the theorem.

References

141 [51

[61 71

V. P. Havin and V. G. Mazilia, Non-linear potential theory, Uspehi Math. Nauk, 27 (1972), No. 6, 67-138, (Russian Math. Survey 27 (1972), NO. 6,71-148.) L. I. Hedberg, Spectral Synthesis in Sobolev Spaces, Acta Mathematica, 147 (1981), 237-264. M. KrCe, PropriCtt de trace en dimension infinie, d'espaces du type Sobolev, C.R. Acad. Sc. Paris, 279 (19741, 157-164; Bull, SOC.math. France, 105 (1977), 141-163. P. Malliavin. Stochastic Calculus of Variation and Hypoelliptic Operator, Proc. Int. Symposium on S.D.E., 1976, Kyoto, ed. K. Kinokuniia, Tokyo, 1978. P. A. Meyer, Quelques results analytiques sur le semi-group d'OmsteinUhlenbeck en dimension infinie, Theory and Application of Random fields, Proc. IFIP-WG, Working Conference, Bangalore, 1982, Lect. Notes in Control and Information Sci., Springer-Verlag, Berlin, 201-214; Note sur les processus #Omstein-Uhlenbeck, SBminaire de ProbabilitCs XIV, 1980/81, Lect. Notes of Math., Springer-Verlag, Berlin, 95-133. I. Shigekawa, Derivative of Wiener functionals, J. Math. Kyoto Univ., 20 (1980), 263-289. D. Stroock, The Malliavin calculus and its application to second order parabolic equation, J. Math. System Theory, 14 (1981), 25-65. I. M. Bismut, Large deviations and the Malliavin Calculus, Birkhauser 1984. A. B. Cruzeiro, Equations dserentielles et Formules de Cameron-Martin non-IinBaires, Journal of Functional Analysis 1983, (54) p. 206-227.

fis,

10 RUE SAINTLOUIS VILE 75004, PARIS

FRANCS


Conditional Laws and Hormander’s Condition Dominique MICHEL

Introduction After the first papers of Zakai [38] and Fujisaki-Kallianpur-Kunita [l 11 on the filtering equation, a number of results on conditional laws and, especially, their regularity properties have been obtained using the filtering equation. Following the work of Pardoux (in his thesis [28]) on the stochastic partial differential equations (S.P.D.E.) under ellipticity conditions, Krylov-Rozovskii [ 161, Pardoux [29] and Rozovskii-Shimizu [31] proved the regularity of conditional laws under a uniform partial ellipticity hypothesis and assuming the existence of a density for the initial conditional law. Using a backward S.P.D.E. and a stochastic FeynmanKac formula, Pardoux [30] extended these results to prediction and smoothing. In an other direction, Davis [7] and Eliott-Kohlmann [lo] used the filtering equation and the works of Doss [9] and Sussmann [36] to get a robust version of the conditional law in some particular cases. More recently, without using the filtering equation, Michel [27] and Bismut-Michel [4] applied the Malliavin calculus to get regularity results for the conditional laws under a local hypoellipticity condition of the Hormander type. In [4],they also show that, after a transformation by a stochastic flow, the conditional law is the law of a semi-martingale whose characteristics are explicitly calculated. Chaleyat-Maurel-Michel [6] obtained the regularity result of [4] under a global hypoellipticity hypothesis using extensions of Kohn [151 technics to the probabilistic aspect of the problem and applying them to the filtering equation. Kunita [19], [20], [21] obtained results of the same type in the particular case where the coefficients of the Zakai equation do not depend on the observation. We want to present here the main ideas of Bismut-Michel [4] and Chaleyat-Maurel-Michel [6]. In the first stage, we use stochastic flows and Girsanov’s transformation to reduce the problem to a simpler one. The second step is to get some results on the simplified problem and the third step is to use these results to study the initial problem. But we shall see that, in the regularity problem, we get better results working directly with the initial problem than by first tackling the simplified one.

D. MICHEL

388

The paper is organized as follows: in the first part, we show how to perform a transformation by a stochastic flow and a Girsanov transformation to get a simplified problem. In the second part, we give some results on the structure of the conditional laws. In the third part, we obtain a first theorem of regularity using a Hormander theorem for a heat equation whose coefficients are not regular in time. In the fourth part, we expose the main ideas of Malliavin' calculus in the Bismut' formulation and apply it to the regularity problem. Finally, in the fifth part, we give a stochastic Hormander theorem whose proof uses the construction of a class of test semi-martingales and the definition of Sobolev norms adapted to S.P.D.E. The author wishes to thank Professor Ito and Professor Ikeda for their invitation and their kindness during the workshop at Katata, as well as Taniguchi Foundation for financial support.

5 1.

Stochastic flows and Girsanov transformation

1.1. Notations Let us denote by SZ (resp. 6) the space V(R+, R") (resp. %(R+,RP)). The generic point of D (resp. 6) is denoted by w (resp. I?) and P (resp. P") is the Wiener measure on 0 (resp. 6). If Xis a stochastic process defined on D x 6, 33': is the a-field generated by X until time t . In particular, = 33':~". If X,is a semi-martingale on (0x 6, 9, 9,, let us define 9, P 0P") SX,(resp. dX,) is its Ito (resp. Stratonovitch) differential (cf. [26]); S,X, (resp. SJ,) is the martingale (resp. bounded variation) part of SX,. Let us consider m 2p 1 vector fields X,,. . ., X,, XI, . . ., X,,Z,, . . Z,, on R" X RP such that

+ +

X,(x,z)

-

-

I

a ,

= X i ( x , z)-

a

ax,

-.

X,(x, z) = X:(x, z)Z,(X, z) = Z { ( Z ) - ,

a

a=,

,

a ,

ax,

0

Stochastic Analysis: Proceedings of the Taniguchi International Symposium on Stochastic Analysis, Katata and Kyoto, 1982 (North-Holland Mathematical Library)

Stochastic analysis and mathematical physics

Lectures on stochastic analysis

Stochastic Analysis: Proceedings of the Durham Symposium on Stochastic Analysis, 1990 (London Mathematical Society Lecture Note Series)