From Markov Chains to Non-Equilibrium Particle Systems, Second Edition

This page intentionally left blank NEW JERSEY LONDON SINGAPORE * S ~ A N e ~ *A HONG l KONG TAIPEI 9 CHENNAI P...

Author: Mu-Fa Chen

24 downloads 680 Views 24MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

This page intentionally left blank

NEW JERSEY

LONDON

SINGAPORE * S ~ A N e ~ *A HONG l KONG

TAIPEI

9

CHENNAI

Published by

World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA ofice: Suite 202, 1060 Main Street, River Edge, NJ 07661

UK ofice: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-PublicationData A catalogue record for this book is available from the British Library.

FROM MARKOV CHAINS TO NON-EQUILIBRIUM PARTICLE SYSTEMS (2nd Edition)

Copyright 0 2004 by World Scientific Publishing Co. Pte. Ltd AN rights reserved. This book, or parts there06 may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permissionfrom the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.

ISBN 981-238-81 1-7

Printed in Singapore.

Contents Preface to the First Edition . . . . . . . . . . . . . . . . . . ix Preface to the Second Edition . . . . . . . . . . . . . . . . . xi Chapter 0 . An Overview of the Book: Starting from Markov Chains . 0.1. Three Classical Problems for Markov Chains 0.2. Probability Metrics and Coupling Methods . 0.3. Reversible Markov Chains . . . . . . . . 0.4. Large Deviations and Spectral Gap . . . . 0.5. Equilibrium Particle Systems . . . . . . . 0.6. Non-equilibrium Particle Systems . . . . .

. . . . . . .

. . . . . . .

. . . . . . . 1 . . . . . . . 1 . . . . . . . 6 . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

13 15 17 19

Part I . General Jump Processes . . . . . . .

21

Chapter 1. Transition Function and its Laplace Transform . 23 1.1. Basic Properties of Transition Function . . . . . . . . . . 23 27 1.2. The q-Pair . . . . . . . . . . . . . . . . . . . . . . . 1.3. Differentiability . . . . . . . . . . . . . . . . . . . . . 38 1.4. Laplace Transforms . . . . . . . . . . . . . . . . . . . 51 57 1.5. Appendix . . . . . . . . . . . . . . . . . . . . . . . 1.6. Notes . . . . . . . . . . . . . . . . . . . . . . . . . 61

.

Chapter 2 Existence and Simple Constructions of Jump Processes . . . . . . . . . . . . 2.1. Minimal Nonnegative Solutions . . . . . . . . . 2.2. Kolmogorov Equations and Minimal Jump Process 2.3. Some Sufficient Conditions for Uniqueness . . . . 2.4. Kolmogorov Equations and q-Condition . . . . . 2.5. Entrance Space and Exit Space . . . . . . . . . 2.6. Construction of q-Processes with Single-Exit q-Pair 2.7. Notes . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . .

. . . . .

. . . . .

. . . . .

Chapter 3. Uniqueness Criteria . . . . . . . . . . . . . . 3.1. Uniqueness Criteria Based on Kolmogorov Equations . . . 3.2. Uniqueness Criterion and Applications . . . . . . . . . . 3.3. Some Lemmas . . . . . . . . . . . . . . . . . . . . . 3.4. Proof of Uniqueness Criterion . . . . . . . . . . . . . 3.5. Notes . . . . . . . . . . . . . . . . . . . . . . . . .

V

. . . . .

62 62 70 79 85

88 93 96

. 97 . 97 . 102 113

. 115 119

CONTENTS

vi

Chapter 4. Recurrence. Ergodicity and Invariant Measures . . . . 4.1. Weak Convergence . . . . . . . . 4.2. General Results . . . . . . . . . . . 4.3. Markov Chains: Time-discrete Case . 4.4. Markov Chains: Time-continuous Case 4.5. Single Birth Processes . . . . . . . 4.6. Invariant Measures . . . . . . . . 4.7. Notes . . . . . . . . . . . . . . .

. . . .

. . . . . . . . . . .

. . . . . . . .

.

Chapter 5 Probability Metrics and Coupling 5.1. Minimum Lp-Metric . . . . . . . . . . . 5.2. Marginality and Regularity . . . . . . . . 5.3. Successful Coupling and Ergodicity . . . . 5.4. Optimal Markovian Couplings . . . . . . 5.5. Monotonicity . . . . . . . . . . . . . . . 5.6. Examples . . . . . . . . . . . . . . . . 5.7. Notes . . . . . . . . . . . . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. 120 . 120 . . . .

Methods . . . 173 . . . . . . . . 173 . . . . . . . . 184 . . . . . . . . 195 . . . . . . . . 203 . . . . . . . 210 . . . . . . . 216 . . . . . . . 223

Part I1. Symmetrizable Jump Processes . Chapter 6 . Symmetrizable Jump Processes and Dirichlet Forms . . . . . . . . . . . . . . . . . 6.1. Reversible Markov Processes . . . . . . . . . . . . . . . 6.2. Existence . . . . . . . . . . . . . . . . . . . . . . . 6.3. Equivalence of Backward and Forward Kolmogorov Equations 6.4. General Representation of Jump Processes . . . . . . . . . 6.5. Existence of Honest Reversible Jump Processes . . . . . . . 6.6. Uniqueness Criteria . . . . . . . . . . . . . . . . . . . 6.7. Basic Dirichlet Form . . . . . . . . . . . . . . . . . . 6.8. Regularity, Extension and Uniqueness . . . . . . . . . . . 6.9. Notes . . . . . . . . . . . . . . . . . . . . . . . . . Chapter 7. Field Theory . . . . . . . . . . 7.1. Field Theory . . . . . . . . . . . . . . 7.2. Lattice Field . . . . . . . . . . . . . . 7.3. Electric Field . . . . . . . . . . . . . . 7.4. Transience of Symmetrizable Markov Chains 7.5. Random Walk on Lattice Fractals . . . . . 7.6. A Comparison Theorem . . . . . . . . . 7.7. Notes . . . . . . . . . . . . . . . . .

124 130 139 151 166 171

225

227 227 229 233 233 243 249 255 265 270

. . . . . . . 272 . . . . . . . . 272 . . . . . . . . 276 . . . . . . . . 280

. . . . . . . . 284 . . . . . . . . 298 . . . . . . . . 300 . . . . . . . . 302

CONTENTS

.

Chapter 8 Large Deviations 8.1. 8.2. 8.3. 8.4.

vii

. . . . . . . . . . . . . . . . 303

Introduction to Large Deviations . . Rate Function . . . . . . . . . . . Upper Estimates . . . . . . . . . . Notes . . . . . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . . 303 . . . 311 . . . 320 . . . 329

.

Chapter 9 Spectral Gap . . . . . . . . . . . . . . . . . . 330 General Case: an Equivalence . . . . . . . . . . . . . . 330 Coupling and Distance Method . . . . . . . . . . . . . . 340 Birth-Death Processes . . . . . . . . . . . . . . . . . . 348 . . . . . . 359 Splitting Procedure and Existence Criterion Cheeger’s Approach and Isoperimetric Constants . . . . 368 9.6. Notes . . . . . . . . . . . . . . . . . . . . . . . . . 380

9.1. 9.2. 9.3. 9.4. 9.5.

Part I11. Equilibrium Particle Systems Chapter 10. Random Fields

. . . . . 10.1. Introduction . . . . . . . . . . . . 10.2. Existence . . . . . . . . . . . . . 10.3. Uniqueness . . . . . . . . . . . . 10.4. 10.5. 10.6. 10.7. 10.8.

. . . .

. . . .

. . . .

Phase Transition: Peierls Method . . . . Ising Model on Lattice Fractals . . . . . Reflection Positivity and Phase Transitions Proof of the Chess-Board Estimates . . . Notes . . . . . . . . . . . . . . . . . .

.

Chapter 11 Reversible Spin Processes and Exclusion Processes . . . . . 11.1. Potentiality for Some Speed Functions . 11.2. Constructions of Gibbs States . . . . . 11.3. Criteria for Reversibility . . . . . . . 11.4. Notes . . . . . . . . . . . . . . . . .

. . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . 381

. . . 383 . . 383 . . 387 . . 391

. . . . . . . . 397 . . . . . . . . 399

. . . . . . . . 406 . . . . . . . . 416 . . . . . . . 421

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . .

422 422 425 432 446

Chapter 12. Yang-Mills Lattice Field . . . . . . . . . . . . 447 12.1. 12.2. 12.3. 12.4.

Background . . . . . . . . . . . . . . . . . . . . . Spin Processes from Yang-Mills Lattice Fields . . . . . Diffusion Processes from Yang-Mills Lattice Fields Notes . . . . . . . . . . . . . . . . . . . . . . . .

. 447 . . 448 . . 457 . 466

CONTENTS

viii

Part IV . Non-equilibrium Particle Systems . . . . . . . . . . . . . . . . . . Chapter 13. Constructions of the Processes . . . . . . . . 13.1. Existence Theorems for the Processes . . . . . . . . . . 13.2. Existence Theorem for Reaction-Diffusion Processes . . . 13.3. Uniqueness Theorems for the Processes . . . . . . . . . 13.4. Examples . . . . . . . . . . . . . . . . . . . . . . . 13.5. Appendix . . . . . . . . . . . . . . . . . . . . . . . 13.6. Notes . . . . . . . . . . . . . . . . . . . . . . . .

467

. 469 . 469 . 486 . 493 502 510 . 513

.

Chapter 14 Existence of Stationary Distributions and Ergodicity . . . . . . . . . . . . . . . . . . . 514 14.1. General Results . . . . . . . . . . . . . . . . . . . . 514 14.2. Ergodicity for Polynomial Model . . . . . . . . . . . . . 521 14.3. Reversible Reaction-Diffusion Processes . . . . . . . . . . 532 538 14.4. Notes . . . . . . . . . . . . . . . . . . . . . . . . .

.

Chapter 15 Phase Transitions . . . . . . . . . . . . . . . 539 539 15.1. Duality . . . . . . . . . . . . . . . . . . . . . . . . 15.2. Linear Growth Model . . . . . . . . . . . . . . . . . . 542 15.3. Reaction-Diffusion Processes with Absorbing State * . . 547 15.4. Mean Field Method . . . . . . . . . . . . . . . . . . 550 15.5. Notes . . . . . . . . . . . . . . . . . . . . . . . . . 554 Chapter . 16. Hydrodynamic Limits . . . 16.1. Introduction: Main Results . . . . . 16.2. Preliminaries . . . . . . . . . . . . 16.3. Proof of Theorem 16.1 . . . . . . . 16.4. Proof of Theorem 16.3 . . . . . . . 16.5. Notes . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

. . . .

. . . .

. . . . . . . . .

. . .

555 555 . . . 559 . . . . 564 . . . . 570 . . . 571

. . . . . . . . . . . . . . . . . . . . . .

572

Author Index . . . . . . . . . . . . . . . . . . . . . .

589

Bibliography

Subject Index . . . .

. . . . . . . . . . . . . . . . 593

Preface t o the First Edition The main purpose of the book is to introduce some progress on probability theory and its applications to physics, made by Chinese probabilists, especially by a group at Beijing Normal University in the past 15 years. Up t o now, most of the work is only available for the Chinese-speaking people. In order to make the book as self-contained as possible and suitable for a wider range of readers, a fundamental part of the subject, contributed by many mathematicians from different countries, is also included. The book starts with some new contributions to thc classical subject Markov chains, then goes t o the general jump processes and symmetrizable jump processes, equilibrium particle systems and non-equilibrium particle systems, Accordingly the kook is divided into four parts. An elementary overlook of the kook is presented in Chapter 0. Some notes on thc bibliographies and open problems arc collected in the last section of each chapter. It is hoped that the book could be useful for both experts and newcomers, not only for mathematicians but also for the researchers in related areas such as mathematical physics, chemistry and biology. The present book is based on the book “Jump Processes and Particle Systems” by the author, published five years ago by the Press of Beijing Normal University. About 1/3 of the material is newly added. Even for the materials in the Chinese edition, they are either reorganized or simplified. Some of them are removed. A part of the Chinese book was used several times €or graduate students, the materials in Chapter 0 was even used twice for undergraduate students in a course on Stochastic Processes. Moreover, the gitlley proof of the present book has bcen used for gradiintc students in their second and third semesters. The author would like to express his warmest gratitude to Professor Z. T. Hou, Professor D. W, Stroock and Professor S. 3 . Yan for their teachings and advices. Their influences are contained almost everywhere in the book. In the past 15 years, the author has been benefited from a large number of colleagues, friends and students, it is too many to list individually here. However, most of their names appear in the “Notes” sections, as well as in the Bibliography and in the Index of the book. Their contributions and cooperations are greatly appreciated. The author is indebted to Professor x. F. Liu, Y . 3. Li, B. M. Wang, X. L. Wmg, J. Wu, S. Y . Zhang and Y . H. Zhang for reading the galley proof, correcting errors and ixlproving the quality of the presentations. It is a nice chance l o acknowledge thc financial support during thr. past years by fi’ok Ying-Tung Educational Foundation, Foundation of Institution of Higher Education for Doctoral Program, Foundation of State Education Commission for Outstanding Young Teachers and the ix

X

PREFACE TO THE

FIRSTEDITION

National Natural Science Foundation of China. Thanks are also expressed to the World Scientific for their efforts on publishing the book. M. F. Chen Beijing November 18, 1991

Preface t o the Second Edition The main change of this second edition is Chapter 5 on “Probability Metrics and Coupling Methods“ and Chapter 9 on “Spectral Gap” (or equivalently, “the first non-trivial eigcnvalue”) Actually, these two cha.pters have been rewritten, within the original text. In the former chapter, the topic of “optimal Markovian couplings” is added and the “stochastic cornprtrability” for jump processes is cornplited. In tlis latter cliapt,er, t,wo general results on estimating spectral gap l y couplings and two dual variational formula for spectral gap of birth-death processes are added. Moreover, a generalized Cheeger’s approach is renewed for unbounded jiirnp processes. Next, Sectiorr 4.5 on “Single Birth Processes” and Section 14.2 on “Ergodicity of Reactiondiffusion Processes“ are updated. But the original technical Section 14.3 is removed. Besides, a large number of recent publications are included. Numerous modifications, improvements or correct’ionsare made in almost every page. It is hoped that, t,he serious effort could improve the quality of the book and bring the reader to enjoy some of the recent developments. Roughly speaking, this book deals with two subjects: Markov Jump Processes (Parts I and 1) and Interacting Particle Systems (Parts m and IV). If one is interested only in the second subject, it is not necessary to read all of t,he first niae chapters, but instead, may have a look at Chapters 4, 5, 7: 9 plus s2.3 or so. A quick way to read the book is glancing at the element,ary Chapter 0, to get some impression about what studied in the hook, to have some test of the results, arid to choose what for the further reading. Some t.irnes, 1 feel crazy to writ’e such a thick book, this is due to the wider range of topics. Even though it can be shorten easily by moving sonic details but the resulting book would be much less readable. Anyhow, I belicve that the reader can make the book thin and thin. A concrete model t.hroughout the whole book is Schlogl’s (second) rnodeI, which is introduced at the beginning (Example 0.3) to show the power of our first main result and discussed right after the last theorem (Theorem 16.3) of bhe book about its unsolved problems. This model, completely different from Ising model, is typical from non-equilibrium statistical physics. Its generalization is t.he polynomial model or more generally, the class of reaction-diffusion processes. Locally, these models are Markov chains. But even in t,his case, the uniqueness problem of the process was opened for several years, though everyone working in this field believes so. From physical point of view, the Markov chains should be ergadic and this is finally proved in Chapter 4, Thus, to study the phase tra.nsit.ions, we have to go to the infinite dimensional setting. The first hard stone is the construction of the corresponding Markov processes. For which, the matherna.tical tool .is preI

xi

xii

PREFACE T O T H E SECOND

EDITION

pared in Chapter 5 and the construction is done in Chapter 13. The model is essentially irreversible, it can be reversible (equilibrium) only in a special case. The proof of a criterion for the reversibility is prepared In Chapter 7 arid completed in Chapter 14. The topics studicd in almost, every chapter are either led by or related to Schlligl’s rnodel, even though sometimes it is not explicitJy mentioned. Actually, the last four chapters are all devoted to the reaction-diffusion processes. The Schlijgl model possesses thc main characters of the current mathematics: infinite dimensional, non-linear, complex systems and so on. It provides us a chance to re-examine the well developed finite dimcnsional mathematics, to create new mathematical tools or new research topics. It is not surprising that many ideas and results from different branches of mathematics, as well ti physics, are used in the book. However, it is surprising that the methods developed in this book turn out to have a dccp application to Rierriaxiniari geometry and spectral theory. ‘l’his is clearly a different story. Since there are so much progress made in the past ten years or more, a large part of the new materials are out of the scope of this book, the author has decided to write a separate book under the title “Eigenvalues, Inequalities and Ergodic Theory”. It is a pleasure t o recall the fruitful cooperation with my previous students and colleagues: Y. H. Mao, F. Y. Wang, Y. Z. Wang, S. Y. Zhang, Y . H. Zhang et al. Their contributions heighten remarkably the quality of the book. The author acknowledges the financial support during the past years by the Research Fund for Doctoral Program of Higher Education, the National Natural Science Foundation of China, the Qiu Shi Science and Technology Foundation and the 973 Project. Thanks are also expressed to the World Scientific for their efforts on publishing this new edition of the book.

M. F. Chen Beijing August 29, 2003

Chapter 0

An Overview of the Book: Starting from Markov Chains In this chapter, we introduce some background of the topics, as well as some results and ideas, studied in this book. We emphasize Markov chains, and discuss our problems by using the language as elementary and concrete c2s possible. Besides, in order to save the space of this section, we omit most of the references which will be pointed out in the related “Notes” sections. 0.1 Three Classical Problems for Markov Chains For a given transition rate (Le., a Q-matrix Q = ( q z j ) on a countable state: space), the uniqueness of the Q-semigroup P ( t ) = (Pt3(t)), the recurrence and the positive recurrence of the corresponding Markov chain are three fundamental and clmsical problems, treated in many textbooks. As an addition, this seclion inlroduceu some practical results motivated from the study of a type of interacting particle systems, reaction-diffusion processes.

Definition 0.1. Let E be a countable set. Suppose that ( P z J ( t )is) a subMarkov transition probability matrix having the following properties.

(1) Normal condition.

-p&) < 1, hj E E , t 2 0.

P&) 2 0 ,

3

(2) Chaprnan-Kolmogorov equation. P i 3 ( I + 6 )= C P i k ( t ) P k j ( S ) ,

i , j E E , t , s 3 0,

k

(3) Jump condition. limt-o Pij(t) = Sij for all i , j E E . It is well-known that for such a ( P z j ( t ) )we , have a Q-matrix Q = ( q i j ) deduced by (4) Q-condition. (Pij(t)- S i j ) tlim 40

/ t = qij

where

1

for all i , j E E ,

2

0 AN OVERVIEW OF

THE

BOOK

Because of the &-condition, we often call P ( t ) = (Pij(t)) a Q-process. Unless otherwise st,ated, t,hroughout this chapter, we suppose that the Q-matrix Q = ( q i j ) is totally stable and conservative. That is

C3.f z. q ”

4%< C X ] ,

i € E. (0.1) The first problem of our study is when there is only one Q-process P ( t ) = (Pij(t))for a given Q-matrix Q = ( q i j ) (Then, the matrix Q is often called regular). This problem was solved by Feller (1957) and Reuter (1957). = qi,

2J

Theorem 0.2 (Uniqueness criterion). For a given &-matrix Q = the Q-process (Pij(t))is unique if and only if (abbrev, iff) the equation

(A

+

q;)Ui

=CQijUj, 0

< ui < 1,

iEE

(qij),

(0.2)

j#l

has only the trivial solution uz = 0 for some (equivalently, for all) X

> 0.

Certainly, this criterion has rna.ny applicatjons. For instance, it gives us a cornpl& answer to the birth-death processes (cf; Corollary 0.8 below). However, it seems hard to apply the above criterion directly to the following examples. Example 0.3 (SchIiigl’s modcl). Let S be a finite set and IT = X:, where Z+ = { O , l , . . . }. The model is defined by the following Q-matrix Q -r ( q ( q y) : z,y E

E):

[

A1

(2“’)”

+

if y = z

A1

+ e,

ify=z-e,+e,

4 ( 4 = - d x >4 =

c

for other y

# z,

q(WA

Y#X

(1)

where 2 = ( ~ ( u: )u E S ) , is the usual combination, XI, ’ ! A 4 are positive constants, ( p ( u , v ): u,’u E S) is a transition probability matrix on S and eU is t h e element in E having value 1 a t u and 0 elsewhere. 1

The Schlogl model is a model of chemical reaction with diffusion in a container. Suppose that the container consists of small vessels. In each vessel u f S , there is a reaction described by a birth-death process. The birth and death rates are given, respectively, by the above first two lines in the definition of (q(x,y)). Moreover, suppose that between any two vessels u and w, there is a diffusion, with rate given by the third line of the definition. This model was introduced by F. Schlogl (1972) as a typical model of nonequilibrium systems. See Haken (1983) for related references.

0.1 THREE CLASSICAL PROBLEMS FOR MARKOV CHAINS

3

Example 0.4 (Dual chain of spin system). Let S be a countable set, and X be the set of all finite subsets of 5'. For A E X ,let IAl denote the number of elements in A . For various concrete models, their &-matrices (q(A,B) : A , B E X)usually satisfy the following condition:

for some constant C, c E R := (-m, m). A particular case is that

F : I'A(A\u)=B

uEA

where

4).

2 0,

supc(.u) < CO? U

and supuc(u) C A p ( u , A )JAl
0 for some (equivalently, for all) X 2 0. Conversely, these conditions plus p 2 0 are also necessary.

.) -'(t,i2,*)llvar

< 2*1'i2 [ T > t ] - 0

Furthermore, if the process has a stationary distribution

ast--tO. T,

then

and so the process is ergodic.

As another application of the coupling technique, we discuss the monotonicity for Markov chains.

0 . 3 REVERSIBLE MARKOVCHAINS

13

Definition 0.24. Take E = {0,1,2,...} and let ( X k ( t ) ) i a o(k = 1,2) be two copies of a Markov chain ( X ( t ) ) t a owith different starting points. If

then we say t h a t the chain is monotone. One way to prove the monotonicity is using the coupling method. For example, applying the basic coupling to a Markov chain with regular Qmatrix Q = (yij) on Z+,we find that the condition:

is sufficient for the monotonicity of the Markov chain. Of course, if we use a different coupling, we will find different sufficient condition for the rnonotonicity. From this point of view, it is believable that condition (0.10) is not necessary for the monotonicity. A complete solution for the monotonicity for general jump processes, as well as other topics discussed in this section, are treated in Chapter 5 . The above two sections are based on Chen (1989a, 1991d), respectively.

0.3 Reversible Markov Chains

Definition 0.25. Let (Xt)t>o be a Markov process defined on (a,9, IF') with countable state space E . The process is called reversible if for any n 2 1, 0 < i l < . . < it, with

and any il, . ' .

, in c E ,

[X,, = i l , . . .

,xi,= in] = P [X,, = i n , . . , Xt,, = ill.

(Q.11)

Clearly, a reversible Markov chain (Xi)should be stationary. That is, xi := P [X, = i ] is independent of t 2 0. Actually,

Due t o the Markov property, (0.11) is equivalent to (0.12)

14

0 AN OVERVIEW

OF THE

BOOK

This implies that Tiqij

= rjq..ji)

i , j E E , t 2 0.

(0.13)

Since it is easy to get. Q = ( q i j ) in practice, but not ( P i j ( t ) ) ,we should start our study from a given Q-matrix. Thus, we are now at; the position as at the beginning of Section 0.1. Fur a given Q-matrix Q (yij) which is reversible with respect; to a probability measure (ni)in the sense of (0.13), we would like to know when there is one, and when there is precise one, such Q-process (Pij(5)) so that (0.12) holds. To state our main results, let us relax the probability measure (ri)by an arbitrary but non-trivial measure (7riTi). Then, we call Q = ( q i j ) (resp. ( P i j ( t ) ) )symmetrizable with respect t o (ni)if (0.13) (resp. (0.12)) holds. Finally, in this section, the only assumption for the Q-matrix ( q i j ) is the total stability: y i < 30 €or aIl i E E . Theorem 0.26. The minimal Q-process (Piyin(t)) is reversible (resp. symmetrizable) with respect to (T*)iff so is its Q-matrix.

Theorem 0.27. With respect to a probability measure ( x i ) , the reversible Q-process is unique iff the following conditions hold. (1) ( q i j ) is reversible with respect t o ( ~ i ) ~ (2) Ti(Yi - Cj+ 4ij) < 0 , (3) Equation (0.2) has only the trivial solution For general (ni): we have the following result.

Theorem 0.28. With respect t o a measure (q), there exits precisely one symmetrizable Q-process if the following conditions hold. (1) Q = ( y i j ) is symmetrizable with respect t o (ni). (2) C(T,(qi - Cj+iq i j ) < go or infi C j P;;~*’(A) > 0. (3) The only sumrnable solution t o Equation (0.2) is zero. We guess that condition (2) is still stronger than to be necessary. Thus, a complete criterion for the uniqueness of symmetrizable Q-process remains open. Besides, even though we have known a great deal about the general Q-process (cf. Section 3 4 , we only have a partial solution to the following problem.

Open Problem 0.29. What is the uniqueness criterion for honest reversible (resp. syrnmetrizable) @process? Here, “honest” means that C jPij(t) = 1 for all i E h‘ and t 2 0.

0.4 LARGEDEVIATIONS AND SPECTRAL GAP

15

In the study of symmetrizable Q-process, a new question arises. How can we justify whether a given Q-matrix is symrnetriz.able with respect to some measure (ri)? As a nice exercise, one may try to answer the question by himself for the Schliigl model. In general, this question is answered by us& ari analogue of the classical field theory in. analysis. It is interesting that the same idea can also be used ta study t,hc recurrence for symmetrizable Markov chnhs (see Chapter 7 for details).

0,4Large Dcviations and Spectral Gap Markov chains consist of a nice class of stochastic processes, not, only for their a lot of applications hut also for t,he!j, concrete beh.avior and simplicity. Tn the regular case, the paths of a hllarkov h a i n are simply step functions almost surely. We can even see thc jump law: starting from a state .i, the chain stays in i for a while according to the exponential distribution with parameter qi. Then, t.he chain jumps to j ( # i ) according to the distribution qij/yi (provided 4%> 0). Because of this reason, a large part, of the theory of stochastic processes was begun from hdarkov chains. Cowessely, Markov chains can be used to justify the power of a general theory for stochastic processes. Let us discuss the two topics expressed by the title of this section. In the Donsker-Varadhan's large deviation theory (an introduction to the theory is presented in Section 8.1), we are interested in the entropy (=rate function) :

(0.14) and -1 upper estimate: lim

t+w

lower estimate:

o be a Markov chain with transition probability P(t>= (Pij(t)) a.nd let Pi be the probability that the chain starts from i E E . Next, let 9 ( E ) be bhe set of probabilities on E , endowed with the weak topology. Set

and Q t , i = Pi o L t ' . Considering P ( t ) as an operator on b&: the set of bounded functions with uniform norm? it induces an infinitesimal generator

16

0 -4N OVERVIEW

O F T HE

BOOK

L with domain g ( L ) . Let g + ( L )be the set of strictly positive functions in

m)* In view of Markov chains, the entropy given by (0.14) is not satisfactory

since 9 ( L ) is quite poor, even the indicator Iti)(i E E) is usually not in g ( L ) . However, we have

Thesrcm 0.30. Given a regular Q-matrix Q = ( q i j ) , (1) if p E

9(&) satisfies C ,piqi < 00,

where %? = F

then

( L ) or anyone of the following sets:

> O for some E > O } , 0 < f < w},

&+ = (f : f 2 €O = b@+

(f

:

= &8n &+,

E

b@

=b

8

n go9

(2) If (gij) is reversible with respect to some 7r E P ( E ) , then p E 9 ( E ) ,we have an explicit expression as follows.

for every

This theorem is proved in Chapter 8. Some upper estimates are also studied there. Roughly speaking, the large deviations say that the exponential I ( p ) . For convergence rate of &t,z(C)is described by the entropy - iiifPEc reversible Markov processes, we have a different way 1,o look at the exponential corivergeiicc rate: IIP(l)f - ~ ( j ) l l - ~ ( f ) l l exp[-et], where 11 11 is the norm in L2(7r) and ~ ( fis)the mcan of f with respect to 7r. Let 0 denote the largest value of E > 0. The curlstant u is the rate we are looking for. As usual, the cortvergcnce rate is related to some spectral gap. Let L denote the generator with domain g ( L ) induced by P ( t ) on L 2 ( r ) and let gap(L) denote the infimum of the spectra of -L restricted to the orthogonal complement of the constant function 1. Then, we have the following result.

< 11s

Theorem 0.31.

+

0.5 EQUILIBRIUM PARTICLE SYSTEMS

17

For finite Markov chains 1vit.hQ-matrix &, gap(Q) is nothing but the first non-trivial eigenvalue of -&. Estimating gap(Q) is a traditional hard topic in mathematics. '1'0 compute gap(&) explicitly, one has to stop when the order of Q is higher than five. Surprisingly, we do have in a particular case a complete solution to the problem eve11 for some infinite matrices. Consider the birth-death &-matrix: q i , i + l = bi > 0 ( i 2 0), qi,i-1 = ai > 0 (i 3 1) arid q,j == 0 for all other i # j . Suppose that the process is ergodic and so we have a stationary distribution

nefine

{ {w~}+o wi is strictly increasing in i and C ixiwi 3 0}, "w = { there exists k 1 < k < so that wi = w is :

=

:

{wi.}i>o :

m

wiAk,

strictly increasing in [0,k] and

&(w) =

1

c

xi ~ i w =i 0},

00

bZ/Ji(Wi+l - Wi) j .= z +, l

&Wj?

i 2 0;

6 = sup

c

1 -c p j ,

i31 j G i - l ~ j b j>i j

Note that g i s simply a modification of W . Hence, only two notations W and l(w)are essential here.

Theorem 0.32. For the ergodic birth-death process as above, the following conclusions hold. (1) Variational: forrnula for the lower bound gap(D) = sup inf l i ( w ) - I , W G W 220

( 2 ) Variational formula for the upper bound: gap(D) = inf-sup &(w)-' W E Wi&J

(3) Explicit bounds and explicit criterion: 26-' particular, gap(D) > 0 iff S < 00.

3 gap(D) 2 (46)-1.In

The study on spectral gap is the aim of Chapter 9. 0.5 Equilibrium Particle Systems

Let us start from the simplest case. Consider a Q-process (P(t)) with Q-matrix

and state space E = {–1,1}. Assume that ab = 0,

otherwise, the model is trivial. Then

18

0 AN OVERVIEW OF

THE

BOOK

As the limit of Pij(t)( t -+ oo),we obtain the stabionary distribution n-1

+b),

a/(.

T+I

T

b/(a

+b).

In other words, there is only one stationary distribution, denoted by 19) = 1. ‘l’hc above process is a rnodel with single particle having two states f l . If we consider finite number of particles, say it’ E N,N 2 2, Then the state space becomes {-1, + l } N .Thc system can be nlso described by a Q-process (its operator is given by (0.15) below replacing Zd with N ) and we still have 1 4 1 = 1. What will happen if we replace N with a countable set? For instance, consider a particle system on the regular lattice Z d . At each site u E Zd, there is a particle with two states f l . Then the whole configurations consist of our state space {-l,+l}Zd, which is no longer countable. Hence, the system can notj be described by a Q-process. Now, we use c ( u , x ) ,instead of qzg,to describe the Osarisition rate of a particle changing its state. Given x E E , let if u = ‘11 (U.)(.> = if u.# v --q,

{

and define a formal generator as follows:

Rf(4=

c

c ( u , 4 [ f ( u . - .,

f(41

(0.15)

*

UEZ*

c

A particular choice of c(u, x) is c ( u , x) = exp rl‘hcIi we obtain the famous Ising model in astatistical physics. Now, corriplete different phenoinenon happens. For d = 2,we bave

1 9 1= 1,

if ,l3

1

< -log (1 + A) =: fl:2) x 0.44 2

,Bid)

For d 3, the picture is similar for a critical point > 0. But for d = I, we have 1 . 9 1 = 1. It should be clear now that the king model exhibits phase transitions which depend on the dimension d. Actually, t,his model has attracted a lot of attention in statistical physics, even in the 2-dimensional case (see for instance, McCoy and Wu (1973)). The Ising model, as well as a fundcamental part of the theory of random fields, including the typical methods-the Peierls method and the reflection positivity method for studying the phase transitions, are presented in Part JX. Based on the field theory, we introduce some simple criteria for the reversibility of spin processes and exclusion processes. Besides, two new developments

0.6 NON-EQUILIBRIUM PARTICLE SYSTEMS

19

in the field are included. The first one is to use the lattice fractals instead the regular lattice. Then, we do obtain some interesting results. For example, the Ising model on lattice Sierpinski gasket h m no phase transitions in any dimension bul the model on lattice Sierpinski carpet does exhibit the phase transitions in any dimension. The other one is to use some groups as the spin space instead of {-1, I l}, the latter one seems mainly suitable for the mctallic phasc transitions at low temperature. Howcvm, new progress on the superconductivity has been made recently by using ccramics instead of ferromagnetics. This explains why we have to consider more general spin space instead of {-1, +1}.

0.6 Non-equilibrium Particle Systems The Ising model discussed in the last section belongs to the equilibrium statistical physics. Having the knowledge about the equilibrium systems in mind, it is natural to ask what we can do for the non-equilibrium systems. A typical example is Schlogl's model (Example 0.3), replacing the finite set S with infinite one S = Z d . Thc formal generator can be written ~LS follows:

where X I , . . . , A4 and (p(u,u)) are the same as before, e, is the unit vector in E = Z s having value 1 at u E Zd and 0 elsewhere. This model is a special reaction-diffusion process studied in the last part of the book. I t may be helpful for our readers to compare the Schlogl model with the Ising model. (1) Clearly, the state space E = (-1, +l}"dfor Ising model is compact and so is L@(E).Thus, the process has at least one stationary distribution. But for Scl-ilogl model, the state space E = Z"+" is neither cornpact nor locally compact. (2) 'l'he king model is reversible, ils local Gibbs distributions are explicit. But the Schlijgl model hns no such advantage, except in a special case. (3) The generator ol the king model is locally bounded but it is not so for the Schlogl model.

These facts show that thc non-equilibrium particlc systems are more difficult to handle than the equilibrium systems.

20

0 AN OVERVIEW

OF T H E

BOOK

To construct an infinite dimensional Schlogl model, take a sequence {A,} of finite subsets of Zd so that A, t Z d . Then, we have a sequence of Markov -) : n 2 l} chains {P,(t) : n 2 l}. The next step is to prove that {Pn(t,x, is a Cauchy sequence. Thus, we have to use a probability metric, say W , for instance: as m 2 n --+ 00. w(P,(~ x,,.), ~ , ( t ,x,.)) + 0 From this line of the construction, we see a relation between the Markov chains and the interacting particle systems. Locally, particle systems are Markov chains. At this point, it explains why the title of the book is chosen. The constructions, the uniqueness of the processes as well as 15 concrete models are presented in Chapter 13. It will be proved in Chapter 14 that the reaction-diffusion processes often have at least one stationary distribution and sometimes they are ergodic. The reversible reaction-diffusion processes are always ergodic. For some special models, we will prove that there more than one stationary distributions. That is, the processes exhibit phase transitions (Chapter 15). Finally, we turn to discuss the relation between the processes and partial differential equations. It is known that the generator of &dimensional a2/ax?. Moreover, for Brownian motion {Bt}t>o is the Laplacian suitable g, f ( t ,x) := IE,g(Bt) satisfies the linear equation:

1

-=-C-f af 1 d2 at

2

,

2=1

ax?

If(0,x) = g ( 4 -

However, if we consider the reaction-diffusion equation (non-linear):

(0.16) where V is a polynomial, there is no hope to find a Markov process valued in Rd with such a generator since for a Markov process, its generator must be linear. Nevertheless, under some hypotheses on the initial distribution of the process and on the initial function p, it will be proved in the last chapter of the book that a limit of some mean of a scaled reaction-diffusion process provides a solution to Eq. (0.16). In other words, a reaction-diffusion process describes the microscopic behavior, and Eq. (0.16) describes the macroscopic behavior of a non-equilibrium system. In the last chapter, we will also prove that some solution to Eq. (0.16) are asymptotically stable but some of them are not. This result represents the critical phenomena of the systems, which corresponds more or less to the phase transitions for the microscopic processes.

PART I GENERAL JUMP PROCESSES

This page intentionally left blank

Chapter 1

Transition Function and its Laplace Transform In this chapter, we first study some basic properties of sub-Markovian transition function of a jump process: continuity and differentiability. From which, we deduce the transition intensity, q-pair. Next, we study the one-toone correspondence between the transition function of a jump process and its Laplace transform. This enables us to use the fundamental tool, Laplace transform, instead of the transition function itself in the subsequent study. As well-known, the advantage of using the Laplace transform is reducing the integral equations to the linear algebraic ones.

1.1 Basic Properties of Transition Function Throughout the book, we use the following notations. Let ( E , 8 ) and ( X ,93) be two measurable spaces. Denote by f E €,'A3 the measurable mapping from ( E ,&) to (X, 3).However, if X = with Borel a-algebra 93 = B ( R ) ,we simply use the same notation € to denote the set of all measurable functions from ( E , € ) to (R,B(R)). Similarly, let r€ (resp. r€+, b&, b€+, 8+)denote the set of all measurable real-valued (resp. non-negative realvalued, bounded real-valued, bounded non-negative and non-negative but p+)denote the set of all may be +m) functions. Finally, let Y (resp. Y+, a-additive set functions (resp. finite measures, a-finite measures). Unless otherwise stated, the state space ( E , € ) considered in the book is a Pofash space with Borel a-algebra 8. Recall that a Polish space is a separable topological space that can be metrized by means of a complete metric.

Definition 1.1. We call P ( t , z , A )(t 2 0, z E E , A E €) a (sub-Markovian) transition function of a jump process if the following conditions hold. (1) For each (2) For each

t 2 0 and A E €, P ( t ,. , A ) E .8+. t 2 0 and z E E , P(t,z, .>E 9+and P ( t ,z, E ) < 1.

(3) Chapman-Kolmogorov equation (abbrev. CK-equation). For each t , s 2 0, z E E and A E €, P(t

+ s,

2,

A) =

J

P(t,z, d y ) P ( s ,y, A ) .

(4) For each z E E and A E 8, limt,o P ( t ,z, A ) = P(O,z,A ) = d(z, A ) , where 6(., A ) is the indicator of A , also denoted by 1,.

23

24

1 TRANSITION FUNCTION

A N D ITS

LAPLACETILANSFO

In this definition, the crucial point distinguishing to the transition function of a gcrieral Markov process is the last condition (4),which means the continuity at the origin and hence often quoted as continuous condition. However, because of this condition, the sample of the process are step functions, at least before the explosion time. Since this reason, we also call (4) the j u m p condition. In many cmcs, we do not want to distinguish different jump processes with the same transition function. Hence we olten call the transition function itself a jump process. In particiilar, we call it a Markav chain in the case of E being a countable set., denot.ed by matrix (Pij(t): i , j E E ) . A jump process P ( t , z , A ) is called honest, if for each t 2 0 arid z E f3, P ( t ,5 , h’)= 1. Otherwise, it is called non-honest. Theorem 1.2. For each 2 and g E b&+, Y(1,z,dy)f(y) i s uniformly continuous in t uniformly in f with I f 1 6 9. In particular, P ( t ,5 , A ) is uniformly continuous in

t

uniformly in A.

Proof: By conditions (3) and (2) of Definition 1.1, it follows that

In bhe last step we have used the fact that la - bl Thus, we have

< c for ail a , b E [O:c]

Now: the first assertion follows from this and Definition 1.1( 4 ) . I The next result shows a nice property of jump processes. Even though we need the result only in a few of cases, it is still included for completeness. Theorem 1.3. Let P ( t , z , A ) be a j u m p process on ( E , 8 ) . Then for each z E E and A E 8 , either P ( . , z , A )= 0 or P(a,.,A) > 0. Proof: If P ( t ,x 7A) is not honest, we may introduce a fictitious state A $! E , such that EA := E U {A} is again a Polish space and A is an isolated state. Moreover, € c &A := cr(& u {A}). Let

1.1 B A S I C PROPERTIES O F TRANSITION FUNCTION

25

Then we obt,ain an honest jump process P(t,x,A ) (t 3 0, 2 E EA, A E En). Clearly, if P(t,2,A ) possesses the properties described in the theorem, then so does P(t,x,A ) . Hence, we need only consider the honest jump processes. By CK-equation, we have

+

P(t s, x,A) 3 P( s , s , (x}) P ( t ,2, A ) .

> 0, then for all s > 1, P ( s , x , A )> 0. Froin this and the continuit.y of Y ( - , x ,A ) , it follows t b t P ( t , x , A ) > 0 for all t whenever z E A . Furthermore, for x $ A, there exists u(x,A ) E [O, 031 such that Hence, if P ( t ,.,A)

,P(t,x,A ) 2 0,

P ( t ,x,A ) > 0,

u(5,A ) .

Thus, what we need to prove is that for each either u(z, A ) = 0 or u(z,A ) = 03. Suppose that 0 < u ( 5 , A ) < 00

2

(1.2)

C E arid A E 6 , 5 .$ A,

(1.3)

for some x and A . Fix z and A. Set uo = u(x,A), u(y) = u(y,A) and Obviously, u and LJ are measurable. Since ( E , G ) is v(y) = u ( y ) A a Polish space, we can construct a Markov process X ( t ) on a probability P)with t,ransition function P ( t ,x,.) and initial state X ( 0 ) = 2 space ( Q , 9: (cf. Neveu (1965), p.83, Corollary), Let

Yo(t)= V ( X ( t ) ) . Then Yo(0)= uo:0 < Yo(t)< uo. Moreover

By the dominated convergence theorem, the right-hand side tends to 1 as h + 0. So Yo is contiiiuous in probability. On the other hand, since E can be embedded into a compact space X so that the completion (E,p) of E in X is again compact with metric p. There exists a measurable and separable version Y of Yo such that 0 Y ( t ) uo = Y(0). Now, it suffices to show that there exists A E 9such that P (A) = 0 and

A )

P(hl

y7 d z ) p(v(?/) - h7

'7

A)'

(1.6)

On the other hand, by (1.2), '(2) < '(9) - h implies that P(v(y) - h, z , A ) 0. Hence, (1.5) follows from (1.6). Next, by (1.5), we have

1

+

P [ Y ( t k ) < Y ( t )- h ] = This shows that

>

P ( t ,2,d y ) P ( h ,y, ( 2 : v ( z ) < v(y) - h}) = 0.

Y ( t + h) 3 Y ( t )- h,

P-a.s.

(1.7)

By the separability, we may choose an exceptional set so that (1.7) holds for all t , h 2 0. This proves a). In what follows, we will ignore the exceptional set. b) Prove that d

-Z(t) dt

=:

1,

a.e. t.

Because Z ( t ) is non-decreasing, the derivative Z' ( t ) exist almost everywhere. To compute Z'(t), let A(t,h) = IF [ Y ( t h ) # Y ( t ) ]We . have seen that limhtoA(t,h) = 0 for all t 2 0, so by the dominated convergence theorem,

+

1

oc,

lini

h+O

0

A(t,/ ~ ) e - ~ d=t 0.

Thus, we can choose a sequence {h,}, h,

3

0, such that

By the Fubini theorem, there is ZI set N with Lebesgue measure zero so that 00 for all t $ N . Then, by the Borel-Cantelli lemma, we have

c,A(t, h,)
/ 0 and A E 8. Applying the monotone class theorem, we obtain

1 . 3 DIFFERENTIABILITY for all f E b 6 . In particular, taking equation, we get

P ( t + t’, z,A) =

It

f

45

= P(t’, .,A) and applying the CK-

q ( z ) e d ” )( t - s )

.I

rl(s,z,dy)P(t’,y, A)ds

Starting from this and using (1.31) twice, we obtain

= P(t

+ t’,

2,A)

Hence

Thus, for each t’, there is a null set Ntt = Nt/(A),such that

b) We now start from (1.32) to construct the required version R ( t , z , A ) of r l ( t , z , A ) . First, we prove that the both sides of (1.32) are joint measurable in (8, t’). Actually, for fixed t‘, the right-hand side is measurable in t ; and for fixed t , it is continuous in t’. Hence it is joint measurable. The same conclusion holds for the left-hand side. Thus the set S = ( t ,t’) : t , t’ > 0, r1( t

{

+ t’, 5,A ) # f

r l ( t ,2,d y ) P(t’,y, A ) }

is joint measurable. By the Fubini theorem and (1.32), it follows that

11

dtdt‘ = 0.

1 TRANSITION FUNCTION AND ITS LAPLACE TRANSFORM

46

Let u = t , v = t

+ t‘, then JJM

dudv = 0, where

By using the Fubini theorem again, there is a riull set H , and then there is a null set H , for every 7~ 4 H , such that

~ $ 1 1 u, < v $ IIu.

r I (v,z, A) = ]~~(u, IC, dy) P ( v - IL, y, A ) ,

Next, we construct R ( v , x , A ) as follows. T& u‘ such that 0 < u’< v and u’$ 11, and set

R(v,z, A ) =

s

T ~ ( u5’ , d y ) P (v

w

(1.33)

> 0. Choose an arbitrary

- u’, y, A ) .

(1.34)

To justify this definition, we need to show the independence of the choice of u’.To do so, let wo > 0, 0 < u’,u” < vo, u’,u“ $! H be given. Then, for v $ HL U H;(I and v > u’ v u”,by (1.33), both of

1

rl(u’, z, d y ) P(v - u’, y, A ) and

s

rl(u“,z, dy) P(v - u”,Y,A )

equal to rl(v,z, A ) . In particular, this conclusion holds for v = zfo by the continuity of P ( . , y , A ) . Therefore, choosing u’or u”,we define the same

Rho, 2 , A ) . c) Finally, we prove that the kernel R(t,2 , A ) (t > 0, A E 8)constructed above satisfied the desired conditions. By (1.34), R ( t , z ,.)EL?+for each 1. Note that P ( ~ , xE,) is non-increasing. Thus, (2) follows from Lemma 1.18, Corollary 1.21 and (1.34). Next,, we: provc the continuity. Civcn t o > 0, choosc an arbitrary uo $ H so that 0 < uo < to. Then whenever > t o , we have

R(v,X,A)

s

T~(uO,

dy) p(u - uoj Y ~ A ) ,

which shows that R(.,2 , A ) is continuous in ( l o , m). Since to is arbitrary, R(.,z, A ) is continuous in (0,m). Now, we prove that for each A E 8,

R ( v , z , A )= r l ( v , x , A ) ,

a.e. v.

(1.35)

Actually, for the above vo and uo, by (1.33) and (1.34), we sce that (1.35) holds for all v > t o r v $! Hue. In other words, (1.35) holds for almost all

1 . 3 DIFFERENTIABILITY

47

> to. Letting to tend to zero along the rational numbers, we see that (1.35) holds for almost all w > 0. Finally, (1) follows from (1.35) and (1.31). Furthermore, in parallel to the proof of (1.32) by using (1.31), from Proposition 1.22 (l),it follows that

ZI.

+

R(t t', 2,A ) =

J

R(t,5 , d y ) P(t', y, A ) ,

t $ Np.

(1.36)

Both-hand sides are continuous in t , and so Eq.(1.36) indeed holds for all t. This proves (3). So far, we have completed the proof of the existence of $P(t, z, A ) ( t > 0).

Theorem 1.23. Let z satisfy 0 < q(x) < 00. Then for each A E 8, P ( t ,z, A ) (t > 0) has a continuous derivative P'(t, 2,A ) , which is a-additive on 8 , having total variation bounded from above by 2q(z). Moreover,

P ' ( t , z , A ) = q(z)( R ( t , z , A )- P ( t , z , A ) ) , P'(t + s , z , A ) =

J

P ' ( s , z ,dy) P ( t , y , A ) ,

Finally, if q ( z )= 0, then

s

t > 0, A E 8.

> 0, t 3 0, A

E

8.

(1.37) (1.38)

P'(t, 2,A ) = 0 for all t >, 0 and A E 8.

Proof: By Proposition 1.22 (1) and the continuity of R(t,z, A ) , it follows that P ( t ,z, A ) is differentiable and (1.37) holds. Then the properties of P'(t, z, A ) can be read out from (1.37). Finally, (1.38) follows from Proposition 1.22 (3), (1.37) and the CK-equation. The case that q ( z ) = 0 is trivial. H

Corollary 1.24. We have (1) limt,o R(t,2,{z}) = 0. (2) limt-0 P'(t,z, {x}) = -q(x).

+ t , x,{x}) 3 R(s,2,{z}) x P ( t ,x,{z}). So R(t,z, {z}) 2 P ( t ,x,{ z } ) z s - + oR(s,z, {z}).Hence

Proof: By Proposition 1.22 (3), we have R(s

lim R(t,z, {z}) 2 lim R(s,x,{z}). t 4 O

s-4

This shows that limt-0 R(t,2,{z}) exists. Setting A = {z} in Proposition 1.22 (l),we get

Then, part (1) follows by letting t --+ 0. From this and (1.37), we obtain part (2). H The next result is a complement to the previous one.

48

1 TRANSIT~ON FUNCTION

A N D ITS

LAPLACE'FRANSFORM

Corollary 1.25. Let n: $ A E 9, Then

(1) 1imt-o R ( t ,5 , A ) = q(a,A ) / q ( x ) . (2) lirnt,o P ( t ,55, A ) = q ( x , A ) .

Proof: Given

E

> 0, there is a S > 0 such that

ho r n Proposition 1.22 (3), it follows that

W I ' t , ~ , A 2>

s,"(.-. 9

1

dy ) P ( t , y , A ) 2 ( ~ - E ) R ( s , x , A ) .

NOW,the remainder of the proof is similar to the p w i 0 u . s one. Corollary 1.26. If c :=

SUP,~AQ(X)

are functions of bounded variation in

0, s

>, 0.

(1.39)

Proof: a) The proof of this part is similar t o those of Proposition 1.22. Set c := 8iIP,EA q(s). We have seen from the last proof that e r t P ( t , x , A ) is non-decreasing in t. Hence, by Theorern I .23 and Corollary 1.25 (2),we h a w

=:

U ( t ,5 , A ) ,

t 2 0.

On the other hand, by Theorem 1.23, we have IP'(t, x,A ) \ < 2q(z),and so U ( t ?x , A)

< 2q(a) + c.

(1.40)

1.3 DIFFERENTIABILITY

49

Thus, by the definition of U ( t ,2 , A ) , it follows that

Applying (1.41) once again, we obtain

Therefore

j” ecsU(s+ t’, rt

=

Z,

A)ds.

This shows that for each t’ 3 0, there is a null set Ntj such that

U ( t + t ’ , ~A,) =

.i

t $ Ntt.

P(t’,5 , d y ) U ( t ,y , A ) ,

(1.42)

In particular

U(t

+ t’,Z,A) 3

t $ Nv,

P(t’, 2 , d y ) U ( t ,Y, A ) ,

(1.43)

/En

0, and then there is a null set Ht for each t @ H , such that

+

+

/

U ( t t’,z,A)= P ( t ’ , z , d y ) U ( t ,y, A ) ,

O < t $ H , t’$Ht.

(1.45)

Now, we prove that Ht = 8 for each 0 < t $ H . Suppose that there are a to: 0 < t o $ H and tb $ H,,so that (1.45) does not hold. Then, by (1.44), we should have

Hence, whenever t’

> tb, we have

Since P(t’ - tb, z, {z}) 2 give us

=/

e-q(x)(t‘-th)

+

> 0, the above two facts and (1.44)

+

P(t’ - tb, 2 , d z ) U(t0 tb,X , A ) < U(to t’,2 , A ) .

This shows that for all t’

> tb,

1.4 LAPLACETRANSFORMS

51

It is clearly in contradiction with (1.45). So for all t: 0 < t f H , Ht = 8. In other words, we have proved that

s

U ( t + t ’ , x , A ) = P(t’,x,dy)U(t,y,A), Finally, for given t , t’ we obtain

O 0, choosing tl f N: 0 < tl < t , by (1.46) and (1.44),

U ( t + t‘, X,A ) = U(t1 + (t + t‘ - t l ) ,X ,A )

=s

P(t

+ t‘ - ti,

X,

d y ) u(1.1, y, A )

/

< W’,2,d z ) w, z,A), which is thc required converse inequality of (1.44). I 1.4 Laplace Transforms

In this section, we prove the one-to-one correspondence between a jump process and its Laplace transform. Recall that for a jump process P( t,x,A) (t >, 0, IC E E , A E. &‘), its Laplace transform P(X,x,A) (A > 0, x E E , A E 8)is defined by

P(A,2,A ) =

irn

e F x t P ( tIC, , A)&.

Lemma 1.28. Let P ( X , z , A ) be the Laplace transform of a jump process P ( t ,%,A).Then the following properties hold. (1) For each X > 0 and A E 8, P(X,., A ) E 8+.For each X > 0 and x E E , P(A,2,-) E Y+. (2) Normal condition. For each X > 0, z E E and A E 8,

0 < AP(A,z,A) < 1. (3) Resolvent equation. For each A, > 0, 2 E E and A E 6 ,

P(A,2,A ) - P(P,x,A)

+

- P)

1w,

x,d Y ) P(P,Y,A) = 0.

(4) Continuous condition or jump condition. For each x E Y, and A E 8 , limx+4MXY(A,x,A)= S(x,A). Let ( q( x) ,q( z, A)) (x E E , A E 0, x E E , A E 8) is the Laplace transform of a jump process P ( t , x ,A) (t 2 0, z E E , A E 8) iff conditions (1)-(4) of Lemma 1.28 hold. A jump process P ( t , x , A ) is honest iff for all X > 0 and z E E , XP(X,x,E)= 1. Proof: Clearly, we need only to prove that under conditions (1)-(4), P(X, z,A) is the Laplace transform of a jump process P ( t , z , A ) . Once this is done, the last assertion follows immediately. The idea of the proof goes as follows. Choose an appreciate Banach space. Construct a family of resolvent operators {PA X > 0) on the Banach space corresponding to P(X,Z,A). Then apply the Hille-Yosida theorem to determine a strongly continuous semigroup. Finally, construct a required jump process in terms of the semigroup. a) Choose a Banach space. In contrast to the usual choice b 8 with the the set of signed measures with finite total uniform norm 11 . )Iu, we use 2, variation. Define the linear operation (ClCpl+ C2P2)

(4= C l c p l ( 4 + c,(P2(A), CI,C2

E

R,

cplIcp2

E

A E 8,

2

and the norm n

n

Ip(Ai)l: {Ai}? ~8mutually disjoint and

llpll= sup{ i=l

Ai a= 1

+

1

=E, n 2 1

which is the total variation of 9:llyll = p+(E) p-(E). Then (2,) . 11) is a Banach space. b) Construct a family of linear operators {PA: X > 0} on 2. Given P(X,z,A)(X > 0, 2 E E , A E 8 ) satisfying conditions (1)-(4) of Lemma 1.28. Let

(PPd (A) =

/ cp(W

P(X,z , A ) ,

A

E

8.

The domain PA) of PA is chosen to be 3.Then PA is a linear operator from 9to itself and is bounded: IlP~ll< 1/X. Next, we use two steps to show that PA is a resolvent operator. c) PA is a one-to-one mapping. Take p E 2 and let A: be the JordanHahn decomposition of p - XpP,. Then II'P - XPPAII = (cp

-

XCPPX) (A:) -

('p - X(PPA)

(A!)

1 TRANSITION FUNCTION A N D ITS LAPLACE TRANSFO

54

where JcpJ= cpt

+ cp-.

Thus, for z E AX,we have

0 6 1 - XP(X,s,AX) = S(x,AX) - XP(X,z,AX) < 1- XP(X,x, {x}). And for z $ AX, we have

0 2 -XP(X, Z,AX) = S(Z, Ax) - XP(X, Z, A x ) 2 -XP(X,x,E \ {Z}) 2 XP(X,Z, {x}) - 1. Hence 16(~, AX) - XP(X,5, Ax)I 6 1 - XP(X,

5,

{z}).

By condition (4) and the dominated convergence theorem, it follows that

(1.47) Now, to prove that PA, is one-to-one, it suffices to show that cpPx, = 0 implies that cp = 0. But by ( 3 ) , cpP~,,= 0 implies that cpPx = 0 for all X > 0. In particular, from (1.47), we see that cp = 0. This finishes the proof of assertion c). Let &?(PA)c 9denote the range of PA. Define an operator Rx on &?(PA) as follows: XI - n A = PT'. We now prove d) R A is independent of X > 0, denoted by R; its domain g ( R ) is dense in 9. We first prove that ~ ( R x is) independent of X > 0. Given cp E 9, by (3), we have cppp = (P + (A - P ) c p w 3 . This shows that 9 ( P p )c %'(PA).Exchanging X and p, it follows that

Next, we prove that for each cp E ~ ( P x=) 9 ( P p ) ,cps2x = 'PO,. Actually, for cp1, 9 2 E 2 so that cp = cplP~= cp~P,, from ( 3 ) , it follows that

+

But Pp is one-to-one, so cp2 - cp1+ (A - p)cplP~= 0. Hence (pP;l- cpP;' cp(X - p ) = 0. This gives us cpRx = pR,. Finally, we prove that g(n) := g(fI2x)is dense in 9. Note that for each cp E 9, XcpP, E g ( R ) . On the

1.4 LAPLACETRANSFORMS

55

other hand, from c), we have wen that limx-,, IIy-XpPxII = 0. This yrovcv the denseness. So far, we have proved that {PA : X > O} satisfies the hypotheses of the Hille-Yosida theorem. Hence, there exists uniquely a strorlgly continuous, contraction semigroup {Ti : t 3 0) with resolvent apcrators {PA : X > 0) such that PA = J r e - x t 7 j d t , where the integral is in the Bochner sense. Take S, = 6(x:.) E 9and set P ( t , x , A )= 6,Tt(A). We prove that e) P ( t ,x,A ) is a jump process. By thc representation theorem (cf. Yosida (1978), p.248), we have

6,Tt(A) = lim n-02

M 1 [6,(~nb'2P,)'~](A) m. m=O

Since b . ( A )and (6.Px)(A)= P(X,.,A) E &+., we hwe [S.(tnW,)](A) E &+By induction, for each 7 n 2 1, [6.(tnRPn)m](A) E &,.. Therefore P ( t ,-,A ) = (6.Tt)(A)E &+. Obviously, P(0,x,A ) = 6 ( x ,A ) . Next, since exp(-tnJ) and exp(tn2P,) are d l bounded operators from 2+into itself, b, E LZk,so S,Tt E 9+, md hence P ( t ,5,.) 6 2+. By contractivity,

P ( t ,2,I";)

=

(6,Tt) ( E )

< 1.

By strong continuity, P ( t ,2 , A ) = (S,TL)(A)is continuous in t. Now: the remainder i s l o check the CK-equation. Note that pS. = p and so

.By semigroup property,

WE

have

1 TRANSITION FUNCTION AND ITS LAPLACE TRANSFO

56

Since the one-to-one correspondence between P ( t ,x,A ) and its Laplace transform P ( X , z , A ) ,f r o m now on, we also call P(X,.,A) a jump process (q-process). If P(X,x,A) satisfies conditions (1)-(4) of Lemma 1.28, then it determines uniquely a jump process P(t,x,A), and so one can define the correThus, Lemma 1.28 shows that spondent 9. Corollary 1.30. P(X,z, A ) is a jump process with q-pair ( q ( z ) ,q(x,A ) )(x E E 9) deduced by Theorems 1.4 and 1.5 iff conditions (1)-(5)5) of Lemma 1.28 hold.

E, A

As we have seen, this corollary is essentially not new since the q-condition lays on P ( t , x , A ) but not on P(X,x,A). The next result is much more meaningful. Corollary 1.31. Use the hypotheses and notations in Remark 1.8. Then P(X,z,A)is a q-process iff conditions (1)-(4)4) of Lemma 1.28 and the following q-condition all hold. lim X[XP(X,

x+co

2,A

n En) - S(Z, A n En)]= q(z,A n En) - q(x)6(x,A n En), A€€,

n31.

Proof: By Lemma 1.28, the q-condition here is clearly necessary. Now, suppose that conditions (1)-(4) are all satisfied. By Theorem 1.29, there exists a jump process P ( t ,x,A ) , and hence EL q-pair (Q(x),Q(x,A ) )(x E E , A E 9) Then from the q-condition here and the proof of Lemma 1.28, it follows that q(x) = Q(x),x E E. Moreover, Q(Z,

Put

A n En) = q(x,A n En),

x E E, A

E

8, n 3 1.

-

En = {x E E : n - 1 < ~ ( zXi+j,

(2.9)

f E X

a controlling equation of Eq. (2.6).

Theorern 2.6 (Comparison Theorem). Let f* be the minimal solution t o f q . (2.6). Then for any solution f t o Eq. (2.9), we have f f*.

>

Proof: Simply use induction and the first successive approximation scheme to show that f 2 4'") and then let n -+ 00. H By Theorem 2.2, for each A E d , we may define a map mA from 2 into itself as follows: m A ( d = f*. Then we have the following interesting result. Theorem 2.7. mA is a cone mapping. For { A n } c d ,A,, %, gn t g , we have A E d , g E A? and mA,,gn t m,g.

TA

and (9,)

C

Proof: By definit.ion, g E 53'. Similarly, it is easy to check that A E d.On and the other hand, by the comparison theorem, we have f: := mA,gn

f: Set

fl* = lirnTL-.oof;.

Anf;

+gn.

Then

2 f*by the minimum properly. is a solution to Eq. (2.6) and so Hence Note that by the order-preserving property, f* = m,g mA,gn = f: for all n , so f* 2 f*. Therefore f* = = limn4m mAngn.

>

Corollary 2.8. Let G be a countable set, {a, : s E G} C E+, then

Proof: When G is finite, the assertion follows from the first assertion of Theorem 2.7 and induction. Then the assertion for general case follows by using the second assertion of Theorem 2.7. I The next result, is the second successive approximation scheme for the minimal solution, its proof is similar t o the previous one and hence omitted.

65

2.1 MINIMALNONNEGATIVE SOLUTIONS

Theorem 2.9. Let {g"}?

c 3.Define

In particular, if we set

then mAg = f* = CrzIf ( n ) .

7

Tlieurcr:m 2.10, Let be a non-negative solution t o equation (2.6) so t h a t f" < p i * for some constant p 2 1. Then ,f = f". Moreover, for any initial f ( " ) 0 f(') G pf*, setting = Af"(n) g (n2 0), we have flcn) f * as

I("+')

0, .x E E , A

t 8"

R,cpeating the above proof in the opposite may, it follows that

It is easy to see that the integrands are all bounded and continuous in t . Hence (B) follows by the uniqueness theorem of Laplace transform. The proof of (P) ( F A ) is similar t o that ( B ) (Bx) and hence is omitted. To show that ( F A ) ( F ) , we need more work except the trivial case that q(x) = 0. Now, assume that q(a) > 0. At the beginning of Sectiori 1.3, we proved that

*

+

+

0,

2 1,

(2.22)

where

To see that the resolvent equation follows from (2.22), note that by the second successive approximation scheme,

Thus, by summing the both sides of (2.22) from 1 to 30, we obtain the required equation. We now prove (2.22) by using induction. When n = 1, at each point (x:A), the bath sides of (2.22) have the same value as

76

2 EXISTENCE AND SIMPLE

CONSTRUCTIONS OF

JUMP PROCESSES

Suppose that (2.22) holds for n - 1. Then, from

c n

( p - A)

jm(A)++l-k)

(A

k=l

n- 1

+

= ( p - A)P(1)(A)F(n)(p) ( p - A)

c~(X)P("(A)~("-"(( k=l

= ( p - X)[(A

+ q ) - l ] P ( q p )+ n ( A ) ( P - l ) ( A )

-P-l)(p))

we see that (2.22) holds also for n and hence we are done. b) Prove that Pmin(A, z, .) is the minimal solution to ( F A ) . We use again the notations given by (2.23). Moreover, for fixed set

5

E

E,

Once we prove that

i;Cn)(x) = P ( n ) ( ~ ) ,

A

> 0, n 2 1,

(2.24)

then the assertion follows immediately since the minimal solution to ( F A )is P(n)(A, 2 , .). To prove (2.24), we adopt induction again. When n = 1, (2.24) is trivial. When n = 2, we have

c,"==,

So (2.24) holds. Suppose that it holds for n- 1 and n, then, by the monotone class theorem, we have

P("+')(A)= P(")(A)Q[(A + q)-'1] = F(")(A)Q[(X + ~)-l1] = n(A)P(n-l)(A)Q[(X + q)-lI] = rI(A)P(n-l)(A)Q[(X = rI(A)p(n)(A) = rI(A)F(")(A) -

,(n+"(X).

+ q)-'I]

2.2 KOLMOGOROV EQUATIONS AND MINIMAL JUMPPROCESS

77

Therefore, (2.24) holds for all n. c) Prove the minimum property. Because every q-process P(X,2,A ) satisfies the backward Kolmogorov inequality (2.19), which is a controlling equation of (Bx),so the property follows by the comparison theorem. d) Prove that Pmin(X, z, A ) is the Laplace transform of the minimal solution Pmin(t, 2 , A ) to (B). First, we have

1

00

edxtP(')(t,x,A)dt= 0 = P(O)(X,z,A),

X

> 0, x E E , A E 8.

Next, suppose that

But condition (1) implies ip 3 -n(4ip +x-c x-c

whenever X have

> c. Thus, by the cornparisori theorem, whenever X > c V 0, we P"'"(X)(P

b) (2)

cp

x+y

+ uniqueness.

6'p 0, which certainly implies the uniqueness since Pmin(A) is the minimal one. For this, we use the resolvent equation:

P(X)l- P ( p )14- (A - p ) P ( X ) P ( p ) l= 0. Thus, if pP(pu)l .- 1 for some p

> 0, then it follows that

1 x P(X)1- - , i . -?(X)1 CL El

= P(X)I.

And so XP(A)l = 1 for all X > 0. Next, we prove that (1) j (3) + uniqueness. c) (1) + (3). This st,ep is not needed for the proof but included €or completeness. Let (1) hold. Then

2 . 3 SOME

SUFFICIEN'I' CONDITIONS FOR UNIQUENESS

This implies that Q y ( x )= 0 whenever c

1

81

+ q(2) = 0. We have

t

(c+q)t

or

1

- 11 >,

e(C+q)SQg&,

E

ipeCi

2

,

e-Y(L--s)Q)ecsds +

t20.

On the other hand, by Theorern 2.21 arid Theorem 2.12 (l),it follows that for each t >, 0, P"'"(t)ip is minirnal solution to the equation

Combining these two facts and using the comparison theorem, it follows that

pmin(t)y 6 yect < 00,

t 2 0,

which is just (3). d) (3) => uniqueness. Use the forward equation

(2.29)

(F):

Since the q-pair is conservative, by Theorem 1.15 and condition (3): it follows that

The null set depends on 2 , By using the forward equation again, in virtue. of condition (3) and the dominated convergence theorem, we obtain

d a.e. t. - p m y t , 2 , E ) = 0, dt However, the left-hand side is indeed continuous in t by Theorem 1.15, so this equat,ion holds for all t . Therefore

Pmin(t, 2,E ) = constant =Pmin(O, 2 , E ) = I,

x E E.

Now, the uniqueness follows from the minimum property of Pmin(t). W

2 EXISTENCE AND SIMPLE CONSTRUCTIONS OF JUMPPROCESS

82

Definition 2.23. T h e q-pair (q(x),q(x,A ) ) is called bounded if supzGEq(z)

< 03. Corollary 2.24. If a y-pair is conservative and bounded, then the q-process is unique. T h a t is P ( t )= etR = (tQ)n/n!.

c;=,

Proof: The conditions for the case (1) of Theorem 2.22 are satisfied wilh c = 0 and ‘p =: A4 = supxEEq ( s ) .

{&}? c 8 and ~1 E T€+. Suppose that (1) En t E , supzEE, q(x) < 00, limndm inf,,js, p(x) = 00,

Theorem 2.25. Let 00

where inf

0=

by the usual convention.

(2) There exists a constant c E IR such that flp

< CV.

(2.30)

Then the q-process is unique.

Proof: The idea of the proof is using an approximation by a sequence of jump processes with bounded q-pairs. For this, let

Then for each n, the q-pair (9, (x))qn (z, A ) ) is bounded and conservative. It x,A ) . Clearly, replacing determines uniquely a q-process, denoted by Pn(X, c and ( q ( x ) , q ( x , A ) with ) c+ = c V 0 and (qn(x),qn(z,A)), respectively, condition (2.30) remains true. Thus, according to the first step of the previous proof, we have

On the other hand? when

But when x $

5

En,we have

E

En,we have

2.3 S O M E

SUFFICIENT CONDITIONS FOR UNIQUENESS

83

Thus, in any cases, we have

X > O , X E E ,A C E , , n 2 1 .

Hence, by Corollary 2.24, Theorem 2.21 and the comparison theorem, we obtain X

Pmin(X, z,A ) 2 P,(X, z,A ) ,

> 0, z E El A c En, n 2 1.

Combining this with what mentioned at the beginning of the proof, we arrive at: 2,En) 2

XP"'"(X,

XP,(X,

2 , En) =

1 - XP,(X,X,

E;)

So we have XP"'*(X,

z,E ) = lim X P ~ ' ~ ( Xz, , E,)

= 1,

A>

C+.

It follows that the minimal q-process is honest and hence the q-process is unique. Next, we compare the above conditions. The purpose we introduce condition (1) of Theorem 2.22 is mainly for conditions (2) and (3) there. It is clearly more practical and provides the useful estimates (2.28) and (2.29). However, if we pay attention only to the uniqueness, then the first case of Theorem 2.22 is indeed a simple consequence of Theorem 2.25. To see this, taking En = {z E E : q ( z ) < n } and using the same cp and c given by Theorem 2.22 (l), it is easy to check that the hypotheses of Theorem 2.25 are satisfied. The next example shows that the conditions for case (1) of Theorem 2.22 are really stronger than the hypotheses of Theorem 2.25.

Example 2.26. Take E = { 1 , 2 , . . - } . Let { q l , q 2 , . . . ) be the prime numbers in the natural order. Define

Then the hypotheses of Theorem 2.25 are satisfied but not conditions for case (1) of Theorem 2.22.

84

2 EXISTENCE AND SIMPLE

CONSTRUCTIONS OF

JUMPPROCESSES

Proof: Take

Since

= 00. Because 'pi is are convergence-equivalent, it follows that lirni,,cp, increasing, it is now easy to see that the hypotheses of Theorem2.25 are satisfied with En = {1,2, - * - , n}. Next, we prove that the conditions for case (1) of Theorem2.22 do not hold. Actually, we can even prove that the conditions for case (2) of Theorem 2.22 do not hold. Note that in the present case, the solution to ( Bx ) is as follows:

PZj(A) =

(A

+

4i . . .Qj-l 4i).* * (A 4j)

+

j

> 2,

So we have 00

M

00

Since limj(2-1) j--tm

aj+l

=Xlim 3-00

j = o , 4j+l

it follows that the above series diverges for all A done. I

A>O,

> 0 and hence we are

We now introduce a criterion (its special case is Theorem 0.6) which shows that Theorem 2.25 is sharp. For further discussions about Theorem 2.25, see Section 3.2.

Theorem 2.27. The minimal q-process is not honest (equivalently, the 4processes are not unique) if the inequality Rcp 2 ccp has a solution cp E b6' with supzEEcp(z) > 0 for some (equivalently, for any) c > 0. Conversely, these conditions plus cp 2 0 are also necessary.

2.4 KOLMOGOROV EQUATIONS AND 9-CONDITION Proof: Sufficiency. Choose

A0

85

> 0 and zo so that

0 and A E 8 n G,, we have (2.32) IP(X)QIAI 6 1 + SUP(X+ d y ) ) / X . YEA

Next, by the resolvent equation,

2.4 KOLMOGOROV EQUATIONS A N D CONDITION

87

On the other hand, for fixed z E E and A E €, P ( X , s , A ) is decreasing in A. Hence for X 2 XO > 0, we have

P(X)QIA< P ( X ) [ I G , Q ] I A+ P ( X O ) [ I G ~ Q ] I A A , E gnG,. Letting X --t 00 and then n + 00, by (2.32) and the dominated convergence theorem, it follows that limx+cc P(X)QIA= 0 for all A E € n G,. Furthermore, by (2.32), we can prove that lim~.+mpLP(p)P(X)QIA = 0. Thus, we obtain

On the one hand, we have proved the continuity: limxdm XP(X) = I . So by Theorem 1.14 (4) we have

lim XP(X)QIA2 QIA 2 0 ,

x-+m

A E 8 n G,, n 2 1.

(2.34)

On the other hand, by (2.33), (2.34) and using Theorem 1.14(4) again, we have

Since P ( p ,z, {z}) > 0, this shows that

lim XP(X)QIA< QIA,

x-+m

A E € n G,, n 2 1.

(2.35)

n 2 1.

(2.36)

Combining (2.34) with (2.35), we obtain lim XP(X)QIA= QIA,

x+cc

A

E

Therefore, by (F,) and Theorem 1.14, we have

which is just the q-condition.

€nG,,

88

2 EXISTENCE AND SIMPLE CONSTRUCTIONS OF JUMP PROCESSES

2.5 Entrance Space and Exit Space

Suppose that P(X,2,A ) is a q-process satisfying the equation (Bx).Then .,A) is a non-negative bounded for fixed A > 0 and A , P(X, . , A ) - Pmin(X, solution to the homogeneous equation

f = rr(A)f,

f Eb8.

Thus, if the q-processes are not unique, then the above equation should have non-trivial solution. The solutions to the equation consist of the exit space. This explains intuitively why we introduce this exit space and how it concerns with the uniqueness problem. Similarly, if we consider the jump processes satisfying the forward equation, we will need the entrance space. The main purpose of this section is to prove that the dimensions of these spaces are independent of X > 0. For this, we need a lot of preparations. The related notations will become clear in the proof of Lemma 2.43. Introduce a kernel R(X,p)(X,p > 0), not necessarily non-negative, on ( E ,8') as follows: R(X,p ; 2,A ) = 6(z, A )

+ (A - p)Pmin(p,z,A ) ,

Equivalently R(X,p ) = I

2

E E , A E 8'.

+ (A - p)P"'"(p).

(2.37)

(2.38)

By resolvent equation, it is easy to prove the following result. Lemma 2.31. We have

ww,

w,

WP, 4= m p t ,A> = R(P,4 , P"'"(p)R(p, A) = R ( p ,X)Pmin(p)= Prni"(X>.

In particular,

w, P>R(P,4

= 1.

(2.39) (2.40)

(2.41)

The following notion is again due to the resolvent equation. Definition 2.32.

(1) A family of functions {fx E b 8 + : X > 0) is called consistent, if for all A, p > 0, we have fx = R ( p , X)f,. (2) A family of measures {px E 2'+: X > 0) is called consistent, if for all A, p > 0, we have px = cp,R(p, A).

2.5 ENTRANCE SPACE AND EXITSPACE

89

Next, define

{f E b€+ : ( X I - n)f = O}, YA= {'P E 2+: 'P(A)= 'P&((X + ~ ) - ' I A ) ,A E &}, W,l = {'p E 2+: c p ( X 1 - Q ) I A= 0, for all A E d n En and n 2 l}, X > 0. 52-A =

By using the method of the proof of Proposition 2.29, we obtain

> 0, we

Lemma 2.33. For every X Lemma 2.34. Fix X

have ?'A'

= Y,l.

> 0.

(1) If f , g E T&+ and

( X I - Q)f

z9,

(2.42)

then

f 2 P"'"(X)g. Moreover, if the sign of the equality in (2.42) holds and so does (2.43). (2) If p , U E p(E,) < 00, U ( & ) < co for all n and

(2.43)

52-~= {0},

then

2+,

p(XI

- Q ) I A2

U(A)

for all

A E E n En and n 2 1,

(2.44)

then

p 2 UP"'"( A).

(2.45)

Furthermore, if p ( E ) < 00,the sign of the equality in (2.44) holds and "yx = {0}, then so does (2.45).

Proof: Since the proofs for (1) and (2) are similar, here we prove (2) only. Note that

By (2.44), we have P[(X

+ q)II 2 PQ + U.

+ ~ ) - ' I A we , see that p is a solution to the equation cp 2 (pQ[(X + q)-l.1] + U[(A + q ) - ' I ] . (2.46)

Postmultiplying by (A

On the other hand, since P(A,2,.) is the minimal solution to ( F A ) by , Theorem 2.12 (2), it follows that UPm'"(X) is the minimal solution to (2.46), replacing the inequality by equality. Thus, the first assertion of (2) follows by applying the comparison theorem. Moreover, if the equality in (2.44) holds and p ( E ) < 00, then we have p - UP"'"(A) E YA,and so the last assertion is obvious. W

90

2 EXISTENCE AND

SIMPLE CONSTRUCTIONS O F

JUMP PROCESSES

Definition 2.35. (1) A family of functions

c b&+

{fk};,l

is called linear independent, if

*

for any { c ~ } ; = c ~ R, C;=,c k f k = 0 ck = 0, IC = 1,2, ,TI. (2) A family of measures { c p k } ; = , c A?+ is called linear independent, if for any { c k } ; = , c R, c k p k = 0 =+ ck = 0, k = 1 , 2 , - . . ,n.

xi=,

As usual, the number

c

sup{n : {fk};=l is called the dimension of %A, dim Wx.

%A

is linear independent}

denoted by dim%x.

Similarly, we can define

Lemma 2.36. (1) Given XO

>

fx,

Set f p = R(Xo,p)fxo, p is consistent. (2) Given XO > 0 and 'px, E "yx,,. Set 'pp = 'pxoR(Xo,p),p cp, E W, and {cp, : p > 0) is consistent.

f,

E

9,and

0 and

E %A,.

{f,: p > 0)

Proof: a) Prove that

f,>, 0 for all p > 0.

f, = fx, + 0 But for p >, Xo, since fx,

Actually, for p

>

0. Then

>

0. Then

< Xo, we have

0 - P)~"'"(p)fx0.

E %xo, we have

Thus, Lemma 2.34 (1) implies that

Hence we also have

f,

= fx,

+

(A0

- P)pmi"(P)fxo2

0.

b) Prove the measurability. Because fx,, R ( X o , p ) 1 ~E b&, so does f,= R(Xo,p)fx, by the monotone class theorem. c) Prove that f, E % ,, p > 0. Since Pmin(X)I~ satisfies (Bx)and fx, E %A, it follows that Qfp

= QR(XO, ~

) f ~ o

+ (Xo - p)QP"'"(p)fx,, = 0 0 + dfx, + ( A 0 - P ) ( P + Y)pmin(p)fAo - (A0 - p)fxo = ( P + Y)[fxo + (A0 - P)P"'"(P)fx,l = Qfx,

= (P

+ 4)fW

CL

> 0.

2.5 ENTRANCE SPACE

91

ANT) E X I T SPACE

d) The consistency of {fp : p > 0) follows from (2.39). Combining this with a), b) and c), we obtain assertion (1). The proof of (2) is quite similar. Clearly, q, E 2 ' ( p > 0). Next, for P 6 Xo, we certainly have pp 2 0. Conversely, for p 2 Xo, since 'pAoE WA,, it follows that

HAVEtHUS, BY lEMMA 2.34(2), WE

ENCEh

The consisTherefore we always have (pp 2 0 and furthermore y , E 2+. tency of {p,} is obvious. Now, by using (F,) and the fact that 'px, E Wj,,, we obtain

Theorem 2.37. Both dim92A and dim WA are independent of X Proof: Let

{ j r ) } L = ,c %j,

> 0.

be linear independent. Set

By Lemrrias 2.36 (1) and 2.31, we have

k= 1

k=l

k=l

{fr'}

{fik)}.

Thus, the independence of implies the independence of In other words, dim 9?Lp2 dim 92~. But X and p are symmetric, hence dim %-?,A = dim

ep.

Similarly, we can prove that dim"'= dimWp. In view of the above theorern, we can use dim% and dim?' instead of dim 7 1 ~ and dim 'Y' respectively.

92

2 EXISTENCE AND SIMPLE CONSTRUCTIONS OF ,JUMP PR.0CESSES

Definition 2.38. We call and WA the exit space and the entrancc space respectively. The elements of are called the exit solutions. If dimo& < 00, it is said t o be finite exit. In particular, corresponding t o dim kP = 0 and dim W = I , we have zero-exit and single-exit. Similarly, we can define the entrance solutions, finitc cntrance, zero-entrance and single-entrance.

For each x E E , set d ( x ) = q ( x ) - q ( x , E ) , which is called the nonconservative quantity at 2. Lemma 2.39. For each X > 0, {zA(x) := 1 - XP"'"(X,z,E) : x C E } is the maximal solution t o t h e equation u = rI(A)u id/(X

+ q),

0

< u < 1.

(2.47)

Moreover, it can be obtained by the following procedure. Let

Zip' = 1, Then z r ) for each X

1 xA as n

n 3 1.

(2.48)

particular, if the given g-pair is conservative, then > 0, zA is the maximal solution t o the equation -3

a3. In

u = n(x)u,

0 0) is consistent: z p = zA -t- (A - p)P"'"(X)z,. (2) Let {uA : X > 0) be a consistent family of functions, then uA # 0 or TL, = 0 is independent of X > 0. Moreover, u x 1 0 as X T ,30. (3) If PIX) defined by (2.50) is a jump process, then XpA(E> 1 for all

X

0.

Proof: The consistency of {xA : X > 0} is due t o the fact that P"li"(X satisfies the resolvent equation. Assertion (2) is easy t o check. Assertion (3) is obvious because of the normal condition of jump processes. Because of Lemma 2.42 (2), we may define u .-= 1imx ,O uA.Tn particular, z

L:

lirnA..+o xA - 1.

In order to guarantee P(X) defined by (2.50) being a jump process, the key point is the resolvent equation, which is solved by the next lemina.

0) be a consistent family of functions, 0 # uA 1 and yX E Y+,XyA(E) 1 for all X > 0. Then P(X) = Pmi"(X) uAp satisfies the resolvent equation iff there exists a consistent family {vx : X > 0) of measures such that pA= r n ~ q where ~ , r n A > 0 and m i 1 - Xqxu =: c 2 0 is independent of X > 0.

0 by (2.51), and so P(A) = Pmin(X). Hencc we have nothing to do in t,his trivial case. Assume that ywo# 0 for some pa, Then 'pi, # 0 for all p > 0. Let

Then { q x : X > 0) is consistent. In particular, t,nking X = p0 in (2.511, we

see that rlp(4 =

Note that

[...I

" +(P

-

POhq&IY,Lb41.

does not depend on A . When p,oup = 0, then 1

+ (p -

pO)'pp,up= 1 > 0. Othcrwise,

0 , p > 0. Thus, we may define

+

(px = rnxqx,

mx > 0, X > 0.

(2.52)

c) Substituting (2.52) into (2.51), it follows that {mx}must satisfy

sET

tHAT IS

tHEN WE HAVE

m,-l

- pL"(X,p) = mx-1

- XU(X,Pu).

On t,he other hand, by the consistency of {u,},we have 21,

+

= up ( p - X ) P m i n ( X ) u , ,

(2.53)

2.6 CONSTRUCTION OF q-PROCESSES WITH SINGLE-EXIT q-PAIR

95

and so ux = u - XPrnL"(A)u.

(2.54)

Next, Xqxu = XqpR(p,X)u = qJXu

+ (p -

= vp(Xu

+ ( p - X)XP"'n(X)u)

- .A,>

= cL77pU

- b - Q7,ux.

(2.55)

That is ( p - A)+,

p ) = ( p - A ) q A 7 L p = p/?jp7A - X q x l L .

Substituting this into (2.53) we obtain m i 1 - pq g u = m i - Xqhu. Hence, rn,' - X?jxu=: c is independent of X > 0. d) So far, we have proved the hypalheses are necessary. The suficieiicy is easier, simply substituting pAgiven by (2.52) into (2.51) and using the consistency of {qX}. I

Lemma 2.44. Given a consistent family {qx} of measures. Set 'px = q A / [ c + Xqx(E)]with c 0. Then P(A) defined by (2.50) is a jump process and it is

>

indeed a q-process satisfying

(Bx)if the

q-pair is conservative.

Proof: In general, since Pmin(X) is a jump process, the continuous condition for P(A) follows from Lemma 2.42 (2). In the conservative case, since zX E %A, P(A) satisfies ( B x ) . Hence the q-condition and so the continuous condition are automatic. Combining the above lemmas, we obtain a complete construction theorem.

Theorem 2.45. Given a conservative and single-exit q-pair, every q-process can be obtained by the following procedure. Choose an arbitrarily consistent family {qx} of measures and a constant c 2 0 and then set

where 0/0 = 0 by convention. The q-process is honest iff c = 0.

Corollary 2.46. Let the q-pair be conservative. Given 9 E p+with

,Prr"rl(X)(E) < 00, then for every c 2 0, P(A) := Prrlin(X) a q-process.

x > 0,

(2.56)

+ zxpY'T'i"(X)/[c + A9PT11i11(A)(is

Proof; We need only to check that {pPmi"(X): X this follows from (2.40). I

> 0)

is coiisistent. But

Combining Corollary 2.46 with Theorern 2,40, we obtain at last the following imiqueness criterion.

96

2 EXISTENCE AND SIMPLE CONSTKUCTIONS O F J U M P PROCESSES

Theorem 2.47. Given a conservative q-pair, the g-process is unique iff dim 9 2

0. Equivalently, the minimal q-process is honest.

2.7 ‘Notes For Markov chains, the theory of non-negative solutions to a. system of equations with non,nega.ti,vecoefficients is diie to Hou and Guo (1979), which goes back to Kantorovich and Krylov (1962). The general presentation here is talcen from Chen (1979). Theorem 2.12 is taken from Chen (19801, Chen and Zheng (1983). The ‘backward and forward Kolmogorov equations were first introdnced by Kolmogorov (1931). In a special case, Theorem 2.21 was obtained by Feller (l940), sometimes called the Feller’s construction in the literature, a generalization was obtained by Hu (1966). Here, we study the general case in details. The equation ( F A )was introduced by Chen (1980)) it is locally equivalent to the; ordinary ( F ) . The equivalent result Proposition 2.29 is taken from Chen arid Zheng (19133). Theorem 2.22 and Theorem 2.25 were first appeared in Chen (1986a,b). The later one is an analogue of Stroock and Vartzdhan (1979). Basis (1976, 1980) used a stronger version of the conditions for the cases (1) a.nd (3) of Theorem 2.22. For Theorem 2.25, the author was benefit.ed by a conversa.tion wi’Lh S. Z. Tang. The Example 2.26 is due to J. L. Zheng (oral cornmunication). It should point out that a slight modification of Theorern 2.25 is available for tirne-inhomogeneous jump processes. See Zheng and Zbeng (1986), Zheng (1993). For more related results, refer also to Chebotarev (1988), Hamza arid Klebaner (1995)) Kerstirig and Klebaner (1995). For the applications to quantum mechanics, refer t o the survey article by Konstantinov, Maslov and Chebotarev (1990), Thc materials of Section 2.4 are mainly taken from Chen and Zheng (1983). For Evlarkov chains, the part of Theorem 2.30 related to ( B n )is due to Feller (1957) and R.euter (1,957))and thc gen,erd form is due t o Hu (1966). For Markov chains, the other part of Theorem 2.30 related to (F,) is due to Hou (1982). Here is th,e natural generalization of the previous results. The author learnt thc name “consistent family” from Yang (1981). For Markov chains, corresponding to Corollary 2.46, there is a probabilistic construction, due to Doob (1945), and hence called noob’s construction. Tn the same situation, Theorem 2.47 is due t o Feller (1‘357}and Neuter (1957), the general form is due to Hu (1966). Essentially, Lemma 2,43 is due to Reiiter (1959) I

Chapter 3

Uniqueness Criteria This chapler begins with the study of uniqueness problem for the jump proccsses satisfying the backward or forward Kolmogorov equations. Then we study the uniqueness problcm for general jump processes. Besides, we introduce some applications of the uniqueness critcria. 3.1 U n i q u e n e s s Criteria Based on Kolrnogorov Equations

As we will see later, the Kolmogorov equations play a special role in the study of the uniqueness problem. The problem becomes easier if we restrict ourselves to those jump proccsscs satisfying one of the equations. This is the reason why we first deal with these cases. Definition 3 . 1 . A q-process satisfying (BA)(resp. ( F A ) )is called a Bq-process (resp. Fq-process). Theorem 3 . 2 . The Bq-process is unique iff dim GY = 0

Proof: Non-uniqueness ===+dim% # 0. This is clear as we explained at the beginning of Section 2.5. We now prove that “dim% # 0 ==+ nonuniqueness” by reducing the non-conservative case to the conservative one. For a different proof, see Proposition 6.28. a) Note that a given q-pair ( q ( x ) ,q ( x , d y ) ) induces naturally a conservative q-pair ( q a ( x ) , qn(J:, d y ) ) on the enlarged state space EA = E U {A}. &A = a ( € u {A}) as follows:

x E E , A E 85 ’ qa(J:, A ) = q(x, A \ {A}) -k I A ( A ) d ( Z ) , 4ab) = 4 ( 4 , J: E E , s a ( 4 = 0. By the last condition, for any q-process PA(X> with q-pair ( ~ ( x ~) ,( 2d y,) ) , we have Pa(&A , {A}) = l / X and so PA(&A, E ) = 0 for all X > 0. This plus Theorem 1.15(1) implics that the restrzctzon P(A) of PA(X)to ( E ,8 ) is not only a jump process but also satisfies the backward equation ( B x ) corresponding t o the q-pair (q(z),q(z,dy)), and hence is a Bq-process by Theorem 2.30. b) It is easy to check that the q-pair (qA(x),qA(x,dy)) is zero-exit iff so is the q-pair ( q ( x ) ,q ( x ,d y ) ) . Now, by the assumption and Corollary 2.46, there are infinitely many q-processes with the same q-pair ( ~ ( z ~) , ( xdy)) , but having different restrictions. H To study the problem for the Fq-processes, we need some preparation. The next result is about the decomposition of a consistent family of measures. 97

3 UNIQUENESS CRITERIA

98

Lemma 3.3. Let { q X } be a family of measures. Then it is consistent iff there exist a K. E y+and a consistent family of measures { f j ~E 'YX : X > 0) such that

Moreover, K and {qX : X uniquely by {qX : A > 0).

> O}

in the decomposition (3.2) are determined

Proof: a) The sufficiency is obvious. We now prove the necessity. Let {qx : X > 0) be a consistent family of measures. Then 7"

+ (v - X)qvPrni"(X)= rlx 2 0.

ENCEh fIX V AND LET

Define

KV(A)= vqy(A) - ~ ~ ( O I A ) ,A E 8 0 En,rt 3 1.

(3.4)

Then O 0) is a consistent family of measures. Thus, we assume that (q(x)>q ( x , A ) ) is non-conservative. Then from Lemma 2.44, it follows that (3.12) still defines a jump process. This proves the first assertion. b) Because Pmin(X) is a q-process, by Corollary 2.28, P(A)given by (3.12) is a q-process iff

+

A E 8 n E,,, n. 2 1.

lim X2zAqA(A)/[c XqA(E)J= 0, x+m c) Note that Lemma 2.42 (1) and (2) give us

AP"'"(A)l

t

1

as

x T 00.

(3.13).

haveWe

On t,he other hand, by Lemma 3.4, we have Xqx(E)

Hence

as x

+ x+oo lim

lim Xq,(E) = K ( E )

A+

+ 03.

00

X.il,(E).

+

(3.15)

(3,16)

Xqx(E) K ( E )= 0, then from (3.15) and (3.16), it follows that K = 7, = 0, so qX 0 and lience P(A) = Prnin(X) for all X > 0. Thus, this situation is trivial. In what follows, we assume that

Now, if limx,,

=3

Iim X7jA(E) ' + a 3

+ K ( E )# 0.

3.1 UNIQUENESS CRITERIA BASEDON KOLMOGOROV EQUATIONS101

d) Observe lim x - lim [I - X P ~ ' ~ ( =X0,) ~ ]

A 4 0 0 x-x-.cO

and note that zA satisfies Eq. (2.47). We obtain lim X r x ( x ) = d ( s ) ,

zE

A-00

e) Fix vo > 0. For each N(v,, E , A) so that

Thus, by (3.9))whenever

11

E

> 0 and A

E & I-

F.

En! there exists an AT

=

> vo, we have

From this and (3.9)) we see that lim ~,,QIA= 0,

v+w

A E € n En,n 2 1.

Therefore, by Lemma 3.3, (3.4) and (3.9)) we obtain

Moreover,

f ) Finally, combining d), e) with c), we get

l?rorom this and a), b) and c>, it follows that P(A>is a g-process iff one of (1)-(4) holds. Now, we can state the main result of this section.

Thcorerri 3.6. The Fq-process is unique iff either the minimal q-process is honest or dim "t/ = 0.

102


Proof: a) Sufficiency. The assertion is trivial if the minimal q-process is honest. Otherwise, for any Fq-process P(A), we have

P(A,2,.) - PrnyX,z, ,)

E

w,.

Hence dim Y' = 0 also implies the uniqueness. b) Necessity. Assume that the minimal q-process is non-honest and dim '9' # 0. Fix A0 > 0, take qA, E W', \ (0) and set QA = qAoR(Xo,X).By Lemma 2.36 (21, { f j ~E Wx : X > 0) is consistent. Since zx # 0, setting K, = 0 and c = 0 in Proposition 3.5, we obtain a Fq-process which is different to Pmin(X). This completes the proof.

3.2 Uniqueness Criterion and Applications

Lemma 3.7. For each A X > 0.

E

&, either irifXEnPmin(A: z, E ) > 0 or = 0 for all

Proof: Suppose that infXEAPmin(Xo, x,E ) > 0 for some XO. Then by the resolvent equation,

P y A j = P"'"(A0)

+ (A,

- X)Pmin(A)Pmi"(Xo),

we have

On the other hand, as we have seen in (3.13) t#hatXPmin(X, x,E ) is increasing in A, we also have

Denote by

E0 = {x E E : d ( z ) := q ( z )- q ( 2 , E ) > 0) the set of all non-conservative points. We can now state our main criterion.

Theorem 3.8 (Uniqucness Criterion). Given a q-pair (q(z),q(z,A ) ) ,the q-process is unique iff the following conditions hold simultaneously.

(1) For some X > 0 (hence for all X > 0 ) , infxEEoPmin(X,z,E) > 0. (2) dim% = 0. (3) Either the q-pair is conservative or even it is not but still dim W = 0.

3.2 UNIQUENESS CRITERION AND APPLICATIONS

103

The theorem will be proved in Section 3.4. In this section, we introduce some of its applications. Before doing so, let us make a remark about the probability meanings of the conditions. Consider the minimal Markov chain (Xt)t>o on a probability space (R, 9, P). Its successive jump times are T, is called the denoted by TO = 0 < T~ < T~ < . . . 00. Then 7 := first infinity of the Markov chain. Next, let

0, there are infinitely many isolated jumps during ( T ( w ) - E , ~ ( w ) ) } .

Remark 3.9. Using the above notations, we have (1) Condition (1) of Theorem 3.8 holds iff there exists a to > 0 such that infiEEoPi(7 > t o ) > 0. (2) Condition (2) of Theorem 3.8 is equivalent t o say that Pi(fi)=O for all i.

The proof of Remark 3.9 was presented in Hou and Guo (1978), Section 12.10. Corollary 3.10. If q ( x , E ) = 0 for all z E E , then the q-process is unique iff SUPzEEq ( 2 ) < OO. Proof: Clearly, dim 92 = dim 'Y = 0. Since P m i n ( A ) = (A+q)-lI, it follows that

To study bounded q-pair, we need the following

Lemma 3.11. Let r), E W, \ ( 0 ) . Then J r),(dz)q(z,E)= 00. In particular, if supzEEq ( z ,E)< 00,then dim 'V = 0. Proof: We prove here only the first assertion. From r), E "yx = Wi, it follows that r),(X q)I = r),Q. In particular, XqA(E) q X ( q ) = r),Q(E). Thus, if the assertion is not true, then we could have

+

+

which is a contradiction. W The next result improves Corollary 2.24. Corollary 3.12. For bounded q-pair, the q-process is unique. Proof: a) Set c = supxEEq(z) < 00. For f, E

@A,

we have

3 UNIQUENESSCRITERIA

104 Hence

and so fx = 0. This provies that dimU&= 0. b) If the q-pair is conservative, then Eo = S and so the conditions of Thee rem 3.8 me all satisfied. In particular, we have proved again Corollary 2.24. c ) By rCheorem 2.21 arid Theorem 2.12 (I). Pmi"(X)l is the minimal solution to the equation u = r I ( X ) u + (A + 4 ) - 1 .

We thcn have

This implies condition (1)of the criterion. Finally: By a), c ) and Lernma3.11! the conditions of Theorern 3.8 are dl satisfied. I

As a special case of the above theorem, we have Corollary 3.13. Let E" be finite. Then q-process is unique iff both &-processss and Fq-process are unique.

Now, we want to sh,ow that Theorem 2.25 can be deduced from our main criterion. Indeed, since the q-pair discussed there is conservative, the uniqueKXSS criterion becorries dirr&%= 0. O n t,he other hand, dim% = 0 is equivalent t)o the minimal q-process being honest, which is the point we used to arrive at the uniqueness. Here, we want to show that EL dual approach enables 11s to arrive at dim% = 0. To do so: 'we introduce a comparison lemma, which is a dual of the comparison theorem for the minimal solution.

L L W L ~3.14 L ~ (comparison Lcmrnu). Let ni be a non-negative kernel, gi E 8+satisfying nil gi 1,,i = 1, 2. Denote by f i the maximal solution t o the equation

+

0 for all k 2 i } < 00.

The corresponding Q-process is called a single birth Q-process.

3 UNIQUENESSCRITERIA

106

In the case that yi,i+l > 0 for all i 2 0, we have N = 0. When N 3 1, to be distinguished, we also call the matrix (resp. Q-process) a single birth Q-matrix (resp. Q-process) with absorbing boundary.

Theorem 3-16" Given a single birth Q-matrix Q : (qij), (1) when N = 0, the Q-process is unique iff R := CrZN rn,

and q f ) = that

c i = o q k j for

xa= 0.

=. m,

where

z < k and k 2 1, Here we use the convention

(2) When N >, 1, choose arbitrarily a positive ijol, define in (3.20) and (3.19) with N = 0, replacing

$)

and En as

and qn,n+l, respectively,

with i#) and tjn,n+l:

Then the (qij)– process is unique iff

Proof: First, we consider the general case that N 3 0. a) By Theorems 2.47 and 2.40, it suffices to show that the maximal solution (ur)to the equation

equals zero identically for some fixed X > 0. When AT 2 1, the set {0,1, . . , N - 1) consists a closed subclass of the chain and so u: = 0 for all i 6 N - 1. b) Define

(3.21)

By induction and ( N O ) , it is easy to check that

3.2

UNIQUENESS CRITERION AND

APPLICATIONS

107

c) Let (ui)be a solution to the equation ( l + q i ) u i = x q i j u j , i 2 0 with

uk=O

for all k < N - 1 and u N = l . (3.23)

j#i

We now prove that (ui)is unbounded iff z,"==,Gn = 00. From (3.23), it follows that

[c n- 1

uN

= 1, un+1- un = q;,;+i

qik)(uk+l-

uk)

+ un

k=O

and hence ui as i

Gk

1

2N

(3.24)

1'. The key of the proof is to show that

< uk+l - u k < ( u N + 1 - uN

+ukGk,

k 2 N.

(3.25)

To check (3.25), we use induction. Noting that

haveand by ()3.24 we

Suppose that (3.25) holds for all k: N < k < n – 1 and we now consider the case that k = n. Then

and

=

+ un5iin,

( u N + l - uN)pLN)

n2N+l.

d) Having (3.22) and (3.25) in mind, it is now easy to complete the proof of the conclusion mentioned in c). If u, := limn+,un < 00, then, by (3.25), we have

108

3

Conversely, let

it follows that

UNIQUENESS

CRITERIA

.k < 00. Because

nkuk+l/ukand

log (uk+l/uk), and then

converge or diverge simultaneously whenever and (3.22), we get

U ~ + ~ /--fU 1. ~

Next, by (3.25)

Combining these facts together, we obtain the main conclusion. e) We now prove (3.19). Actually, by (3.20) and (3.21) with N = 0 and induction, if 6 k = mk for all k 6 n - 1, then we have

n-1

j=O

= m,

We have thus proved the first assertion of the theorem. f ) Finally, consider the case that N 2 1. From the expression of ( f i n ) , it is clear that one can regard {0,1, . . , N - 1) as a single absorbing state 0. If necessary, relabelling N , N 1,.. . as 1,2,. . . , respectively, we get a new Q-matrix and then the quantities &) (0 6 i < n ) and gn,n+l. For which, the

+


109

state 0 is the only absorbing one. However, for simplicity, we assume N = 1 and use the original ( q i j ) instead of the new Q-matrix. Then

Note that even though qol = 0 but (FLZ) : k 2 i 2 0) is still well defined by (3.20) with N = 0. Repeating the proof e), it follows that

,

n-1

n

\

On the other hand, define co = 1 and

Comparing this with n-1

k=O

and FAo) = 1, it is clear that cn = F?) for all n 2 0. Collecting these facts together, we obtain n

k=l

Therefore, we have returned to the case (1) with qol = 1. Now, part (2) of the theorem follows, because the sequence { F F ' : n 2 k 2 0} does not depend on gol and so the condition " R = 00" is independent of qol > 0. H Alternative Proof: We can reduce the case of N 2 1 to N = 0 by using a probabilistic approach, which goes back to Pakes and Tavark (1981). Let (qij) be a local modification of ( q i j ) , up to N-1 for instance. Denote by ( X t ) and the minimal processes determined by ( q i j ) and (t&) respectively. We want to show that ( X t ) has at most finite number of jumps in every Let (2,)denote the minimal process finite time-interval iff so does determined by the Q-matrix (&): & = 0 for all i 6 N - 1 and @ij = q i j for all i 2 N . Note that for each i ,< N - 1, stays at i with the exponential law having parameter qi and then jumps to other states. This is the only way yields more jumps than Due to the conditional independence and the fact N < 00, such jumps can happen at most finite times in a finite time-interval. Thus, (57,) has at most finite number of jumps more than in every finite time-interval. The same comparison holds for ( X , ) and ( X t ) . We have thus proved the assertion. H

(xi)

(xt). (x,)

(xi)

(zt)

(zt).


110

Definition 3.17. A conservative Q-matrix Q = ( q i j ) is called a birth-death Q-matrix if qi,i+l=:

bi

> 0, i 2 0;

q i , i - 1 =: ai

> 0, i 3 1 and qij = 0 for

all li - j l

> 2.

The corresponding Q-process is called a birth-death process

Since the birth-death Q-matrix is a special type of single birth Q-mat,rix, as a consequence of the above result, we have Corollary 3.18. Given a birth-death Q-matrix, the Q-process is unique iff

where

po = 1,

pn

bob1 1

-~

bn-l

*

--

UlU2..

-

. an

,

n>l.

When bo i s replaced by 0, we obtain a birth-death Q-matrix with absorbing boundary. This necds only a little modification since the change of a finite number of thc transition rates qzJ docs riot iritcrfere the uniqueness property as mentioned above. Clearly, Theorem 3.16 is a very nice result since it is explicit and completely computable. Unfortunately, this is uncommon. Even in the twodimensional case, it is usually hard to justify whether a Q-matrix is zeroexit or not. The remainder of this section is devoted to study some sufficient conditions for the uniqueness of multidimensional Q-processes by comparing them with a single birth Q-process. The typical example in mind is Schlijgl’s model. Theorem 3.19. Let E be a countable set, Q = ( q ( z , y ) : ‘c,y c E ) be a conservative Q-matrix. Suppose that there exists a partition of E such t h a t Cr-oEk = JY and the following conditions hold.

{&}r

E3 for all k 2 0. (1) If q(x,y) > 0 and J: E Ek, then y E (2) AT := inf {i : q ( z , y ) > 0 for all z E Ek and all k 2 i} (3) CI, := sup{q(z) : z E Ek} < 00 for all k 2 0.

xYEEk+I

Define a conservative Q-matrix Q = ( y z j : i , j E

< 00.

Z+)as follows: (3.26) other cases of j

tf the (qij)-process is unique (i.e., R or then so is the (q(z,y))-process.

# i.

defined by Theorem 3.16 equals

oo),


111

Proof: By the uniqueness criterion, it suffices to show that the equation (A

+ q ( z ) ) u ( z=) 1q ( 2 ,

0 6 ).(.

Y>U(Y),

6 1,

x

E

E

(3.27)

Y#X

has only the trivial solution. Suppose that Eq. (3.27) has a non-trivial solution ( u ( x ): 2 E E ) . Let Uk

= SUp{U(X) : X E Ek},

k 2 0.

(3.28)

<
0 arid d k ) E Ek so that Fk(X c k ) < x/2, u ( x ( k ) ) (1 - E k ) U k (3.29)

cri:

+

IZ: with (3.29), we obtain

Replacing

d k )in

(3.27), using conditions (1) and (3), (3.28) and

k-1

That is

?(

xuk 2 + j=o

UGEj

p(2(*);y))(uk-Uj)

0 iff there exists a K E T+,such t h a t

&(En)< 00, n 3 1,

K ( E )= 00

(3.32)

x > 0.

(3.33)

and

0 < nPm'"(A)l

< 00,

Proof: We first prove that the property (3.33) is independent of A. Actually, if ~P"'"(p)l= 0 for some p, then by the resolvent equation, we have KP'"(p)l

= KP" '" ( A) l

+ (A - p ) K P m ' " ( p ) P m i n ( A ) l .

This implies that ~ l ' ~ ' " ( X ) l = 0, which is a contradiction. On the other hand, since ~(p,P"'"(p)l)is decreasing in p, so r;Prn'"(X)l< 00 implies that ~,P"'"(p)l< 00 for all p 6 A. As for p > A, the resolvent equation gives us KPrn'"(/L)l

< KP'"(A)l

< 00.


114

The sufficiency comes from 3;,

> KEpmin(X)l 2 C ( X ) K ( E ) .

To prove necessity, assume that c(X) = 0. Choose an infinite subset N of N := {l,2,. I . } so that CrLEN inf,CE, P"'"[A)l(z) < 00 and choose a sequence ( 5 , F En : n E N } such that 0 < z n E N P m i n ( A ) l ( x n 0, then dim@ = 0. Proof: This is an easier consequence of Theorems 2.21 and 2.11. Lemma 3.23. (1) If f E bG) and Pmin(X)f = 0 , then f = 0. (2) If p E 9 and cpPmin(A)(A) = 0 for A E 8 n En,n 2 1,then p = 0.

Proof: a) From

(BA),

it follows that

f = ( X I -- I 2 ) P " ( X ) f = 0. Hence assertion { 1) follows. b) Let cp* be the John-Hahn decomposition of q, By the hypotheses, we have

cp+P"'"(X)(A) = pP-Pmin(X)(A),

A E c" n En, n >/ 1.

Now, the monotone class theorem implies that (p+Prni"(X)f = p-Pm'"(X)f

for all f E b8. Furthermore, for any f E 8,whenever one of the two sides exists, then the other side should exist and moreover! the both sides are equal. Applying (Fn),we obtain q+ ( A )=

C yipmi"(A) ( A 1 n

-f l ) I A E , =

C q-Pmin(A) ( X I n

for all A E 8,and so cp = 0 since cp E 9.

- fl2)I~~, = cp-

(A)

3.4 PROOF OF UNIQUENESS CRITERION Lemma 3.24. Let G be a kernel on ( E ) & )0, by f * the minimal solution t o the equation

< G < 1 and g E b&+.

115 Denote

f = Gf f g . Set

D = {x

E

E : g(x) > O}. Then supxEEf*(x:)= supzEDf*(x).

Proof: As usual, let f(1)

= g,

f(lz+')

+

= G f ( n ) 9,

n

> 1.

Then

and Proposition 2.18, we have

Thus

V(A)

:= Q ( A ) ( A I

- sl) 2 0,

A

E

8 n En, n 2 1

Clearly, for ewh X > 0 and z E , U ( A ,5,.) can be extended to 8 uniquely, denoted by U(A, 5 , ,) again. Using condition (2), Tlieorerri3.6 and Lemma 2.34(2), we obtain @[A) --- U(A)Pmin(A).From this and (3.34), it follows that P(A) = P"'"(A) U ( A ) P m y A ) . (3.35)

+

Hence

6 X P ( X ) l 6 1.

XT/(X)P"1'"(X)l

Besides, by condition (1) and Lemma3.7, we have 1 Combining these two facts together, we obtain

XU(X)l

U(p). (3.38)

3.4 PROOF OF UNIQUENESS CRITERION The right-hand side is non-negative whenever X

2 p.

117

Hence

and so p U ( p ) 3 (1 - p A - l ) X [ XP"'"(X)

-I]U(p),

X 2 p.

(3.39)

X(XPTi"(X) - I ) = R. Letting Note that Prnin(X) is a Bq-process, limx,, X --+ 00 on the right-hand side of (3.39) and using (3.36) and Theorem 1.14, we obtain pU(p) 3 RU(p). Thus

(3.40)

V(p) := (p1 - R)U(p) 2 0.

On the other hand, by condition (1)and Lemma 3.22, we know that dim 92 = 0. So Lemma 2.34 (1) implies that

Substituting this into (3.35),

P(X) = P"i"(X)

+ Pmin(X)v(X)Pmin(X).

(3.42)

Substituting (3.41) into (3.37), P"'"(X)[V(X)

- V(p)

+ (A - p)v(X)Prnin(X)Prnin(p)V(p)] = 0.

By Lemma 3.23(1), we have V(X) - V(p)

+ (A - p)V(X)Prni"(X)Prni"(p)V(p)= 0.

(3.43)

Hence, the resolvent equation gives us V(X) - V(p)

+ V(X)[Prni"(p)- P"'"(X)]V(p)

= 0.

(3.44)

Before moving further, we prove two facts: v(x)Pmin(X)V(p)l< 00,

(3.45)

V(X)Prni"(p)V(p)l< 00.

(3.46)

By (3.36) and (3.40), we have V(X)1 < 00 for X

> 0. But


118

so

V(X)Prni"(p)V(p)l< p-lc(p)-lv(X)l

< 00.

(3.47)

This is just (3.46). Next, by the resolvent equation and (3.43), we have

P m i n ( X , s , A )1, V(X,x,A) J,

as X

t.

(3.48)

Thus, for X 6 p, from (3.46), it follows that

0, which completes the proof. I

3.5 NOTES

119

Proof of Theorem 3.8: Combining Theorems 3.26, 3.2 and 3.6 with Lemmas 3.22 and 3.25, we obtain the desired assertion. W 3.5 Notes In the above three chapters, we have chosen a short way to present our main uniqueness criteria for general q-processes. Some more refined results are delayed to Section 6.4, which should be considered as an essential part of the theory. Next, we mention a further topic related to this chapter. In practice, the most important jump processes are honest ones. But in physics, one pays more attention on the Fq-processes, i.e., the processes satisfying the Fokker-Planck equation. Corresponding to the equilibrium physics, there are so-called reversible (or more general, symmetrizable) jump processes, which are the main subject in Chapters 6 and 7. Thus, it is natural to study the existence and uniqueness problem for each type of jump processes. Moreover, one may even ask how many jump processes do we have in each case. To show that the answer is quite interesting, we mention that even for conservative q-pair, it can happen that the minimal jump process is nonhonest, but the honest Fq-process is still unique. Note that for such a jump process, its samples are not necessarily step functions, even have no left or right limits. Hence, to get a complete theory, it is not enough to consider only the samples which are right continuous having the left limits unless enlarging the state space. However, a quite complete picture for the above problem was obtained by Hou and Guo (1976) for Markov chains, and by Zheng (1982) for general jump processes. For Markov chains, Theorem 3.6 is due to Reuter (1962), Theorems 3.8 and 3.26 are due to Hou (1974). The proof adopted here is closed to Reuter (1976). For single birth matrix without absorbing boundary, Theorem 3.16 was first proved by Zhang (1984) by using a probabilistic approach. Then an analytic proof was presented in Yan and Chen (1986), in which an incorrect assertion was made for the Q-matrix with absorbing boundary. A correction was given by Li(1990b). A unified treatment was given in Chen(1999d). The present proof is a further simplification. The uniqueness criteria in the general context and the most of the materials discussed here are taken from Chen and Zheng (1983). The discussion about Theorem 2.25 is taken from Chen (1986b). Some more applications of the uniqueness criteria, especially for nonconservative Markov chains, were present,ed in Hou (1982), Chapter 8. A more complete theory of the uniqueness problem for Markov chains is explored in anderson (1991), Hou et a1 (1994, 2000).

Chapter 4

Recurrence, Ergodicity and Invariant Measures In this chapter, we first introduce some results about the weak convergence which are very useful but not popular yet in the literature. Next, we study the recurrence and the existence of stationary distributions for general jump processes, and then for Markov chains. Moreover, three types of ergodicity are also studied. Finally, we discuss the invariant measures for jump processes.

4.1 Weak Convergence

To begin this section, we recall iiwell-known result, its proof can be found from Billingsley (1968) or Stroock and Vsraclhan (1979). Let ( E , p ) be a metric space with Bore1 0-algebra f i . Denote by @ ( E ) the set of probability rricasiires on (I?,&'). TJet C&(E) (resp. U,(E)) dcnote the set of all bounded continuous (resp. bounded uniformly continuoils) functions on E .

Theorem 4.1. Given p , prL E , Y ( E ) (n 3 l ) , the following assertions are equivalent

(1) For each f E Cb(E),limndo;,pn(f) = p ( f ) . (2) For each f E U,(E), limndo;, p n ( f ) = ~ ( f ) . (3) For each closed subset C of E , p,(C) 6 p ( C ) . (4) For each open subset G of E , limn,,pn(G) p(G). (5) For each B E & with p ( d B ) = 0, limn-,o;,pn(B) = p(€?),where d B denotes the boundary of B.

>

Now, we return to our main setup, assume that E is a Polish space and choose a metric p so that ( E , p ) becomes a complete separable space with Bore1 u-algebra 6. Theorem 4.2. A subset A?c 9 ( k ! )is relatively compact in the weak topology iff for each E > 0, there exists a compact K, c E such t h a t

inf p(K,) 2 I

I.1E.I

- E,

where the weak topology is generated by the open sets

s

E > 0, (2 E qE), E c m . { PE *qE) : I P ( f ) - Q(S)I E } , This theorem is the set's form for the compactness. That is, using compact sets to describe the relative compactness. As the monotone class theorems having both the set's form and the functional form, the above criterion also has a functional form.

120

4.1WEAKCONVERGENCE

121

Definition 4.3, A function h E 8+ is called compact if for every d E [0,m) the set (x E E : k ( z ) 6 d } is compact. Note that a compact function is not necessarily continuous, for instance,

However, cvcry compact function should be closed I=lowcr scrni-continuom). T1ia.t is: for every d E LO, oo),the set [f d ] Is closed. Equivalently, lim, r'X) f(a,) 2 f (x ) far every sequence { x n } Y I K ~ hba,t; limn, ,.mxn - x. A function 'p E r S ; 011~d (resp. %') is compact iff limlz~n.,+mp(x) = 00 (resp. and p being closed), It, may be remarked that for the sufficiency in Theorems 4.2 and 4.4. below we med only msurne that, E is a separable metric space.

121

P E M nrl

Corresporiding to Theorem 4.1 (4):we also have a functional form.

122

4 RECURRENCE, ERGODICITY AND

INVARIANT

MEASURES

Theorem 4.5. p n converges weakly to p iff for every closed function f E &+, -n+m lim pn(f) 2 ~ ( f ) . Proof: Note that I B is lower semi-continuous for open set B. The condition is sufficient. Ta prove the necessity, we need only to consider bounded Actually, for a closed f E 8+,f A N is also closed and f a

This plus the monotone convergence theorcm gives us limn rm,un (f)2 p ( f ) . Thus, we havc rcdiiced the proof to the bounded case. Next, replacing f with f/Nif necessary, we may assume that 0 f < 1. Finally, set

0, A E S,

Q =(

E Cb(EA),

4 :~A E

S}.

Xu

pU(xzL, g u ) L / [ l + ~ u ( x yuU, ) ] , where (L is a positive summable sequence on S . Then ( E , p ) is a complete separable metric space. Given p, p n E .Yd(S),n 2 1, then pn p iff p n converges to p weakly in finite-dimensional distributions.

Remark 4.6. Define P ( Z , 3) =

=+

Proof: Clearly, if p n ===+ p , then p n converges to p weakly in finite-dimensional distributions. We now assume that pn converges to p weakly in finitedimensional distributions. Without loss of generality, assume that C , k, < 1. For each E > 0 and u E S , choose n compact PC,,, C E, such that

Set

then

Hcnce {pn : n 2 1) is relatively compact in the weak topology. Suppose that p n does not converge to p in the weak topology. Since the topology can be metrized by LBvy-Prohorov metric wf,there exist an E > 0 and a subsequence { n k } such that w(pn,,p) 2 E , k = 1,2,... . On the other hand, by the relative compactness proved above, we may choose a subsequence { p m k } & so that pmk p . This certainly implies that pr,, converges to p weakly in h e finite-dimentiional distributions. So by Kolniogorov extension theorcm, we have 0 = p . And hence, E:

. By assumption,

Applying Theorem 4.4 to the compact function hno, it follows that {pAh0 : P E A} is relatively compact and so the condition is also sufficient by a). I 4.2 General Results

For thc! remainder of this chapt.er, we often assume that the q-process is unique and study further the recurrence, the existence of stationary distributiori or invariant xlieasure and the crgodicity of the q-process.

Definition 4.8. We call a q-pair regular if it is totally stable, conservative and it determines uniquely a g-process.

Our first result reduces the recurrence of a q-process to the one of embedding process (cf. Section 4.3). Define

4.2 GENERAL RESULTS

125

Clearly, H(x, d y ) is a probabilily kernel, which is called an embedding jump process. Set

no = 1,

nnfl

= II II".

Theorem 4.9. Let (q(x),q(x,d y ) ) be conservative. Then we have

Here we use the convention O / O = 0, 0

8

oc = 00.0 = 0 and 1/0 = 00.

Proof: Since (Pmitl(A,x,A): x E E ) is the minimal solution to Eq. @A):

f = rI(A)f

+ (A + q)-%,

f E &+.

By the second successive approximation scheme, we have 00

P"'"(X,.,A)

=

CII(A)"((X+B n=O

Then, by the monotone convergence ~heorcm,we ohlain

Finally, the proof will be done once we show t,hat

This can be deduced by using induction on n. Acbually, when n = 0, both sides are the same I A / ~Assume . that the assertion holds for n - 1 and we now check the case of n ( 2 1). By inductive assumptim, we have

It suffices to show that

This is trivial if q(z) > 0. Let q(z) = 0. Then both sides are eqiial to ca if x E A . Otherwise, if x A , then both sides are 0 because of the convention 0/0 = 0 and 0 . 0 0 - 0. I Next, we study the exist,ence of stationary distribution. WE first consider the time-discrete case.

4

126

4

RECURRENCE, ERGODICITY AND

INVARIANT

MEASURES

Lemma 4.10. Let ( E , & ) be an arbitrary measurable space and P ( z , d y ) be a transition probability function on ( E ,8). Suppose that there exist h E d?+ and constants C E [O, 00), c E [0, 1) such that Ph 6 C

+ ch.

Then for each stationary distribution have

T(h)

(44

of P ( z , d y ) (i.e., TP =

T

T),

we

< C/(1 - c).

Proof: As usual, let P(O)= I , P(n+l)= P P(n).By induction, we have

C P ( " ) h 6 C ( l + c + . . . + c " - ' ) + c n h < -+cnh, 1-c

Set hN = h A N ,

N 3 1. Since T

( ~ N= )

n

> 1.

(4.2)

~ P ( " ) h r vit, follows that

Now, the assertion follows by letting N -+ 00. Denote by ZZp(E) the set of all Lipschitz continuous functions on the metric space ( E ,p ) .

Theorem 4.11. Let P ( z , d y ) be a transition probability function having the property: Pf E C6(E) for each f E ,+!?zp(E). Suppose that for some xo E E , there exist a compact h E d?+on E and a constant C E [O, 00) such that . , n 1

P(")h(xo) 6 C, n. .~ m= 1 Then, P has a stationary distribution particular, (4.1) implies (4.3).

n > 1.

7rx. having the

(4.3)

property ~ , ~ ( 6 h )C. In

Proof: Define

By (4.3) and Theorem 4.4, there is a subsequence {pnk}p=lsuch that pnk Some p as k m. Let f E 622p(E). Then p n k ( f )-+ p ( f ) , p n k P f pPf and --+

---f

4.2 GENERAL RESULTS

127

This proves the main assertion of the theorem. The other assertions are obvious. To study the time-continuous case, we need two lemmas. For the later use, we allow an operator acting on a column of functions. For instance, s1 acting on (fk : 1 k n) equals ( R f k : 1 k n).

<
i0. , , If x,'~< 1, then

This is impossible. So we have xfl = 1. Successively, we have

By the irreducibility, we indeed have x: = 1 for all i E E. b) Let xz < 0;)and ciilciliz . cin-lin > 0. If xzl = 00, then 0O

> x; 2 ciilz;l

= 00.

This is impossible. In the same way, we can prove that

< 00,.. . ,x;n < 00. Now, the irreducibility implies that x; < 0;)for all j x;z

E E.

c) By the localization theorem, if (x; : i E E ) is the solution given in (2), then (x; : i # j o ) is the minimal solution to the equation

xi

=

C cijxj + bi,

i # jo.

3#30

From this, the last assertion follows immediately.

H

132


INVARIANT

MEASURES

Lemma 4.20. Let (xik)*: i E E ) be the minimal solution t o the equation

xi = C c i j x j

+ bi

(k)

,

i E El k

= 1,2.

j

Suppose that

Clbil) < bi2) < C2b!’),

iEE

for some C1, C, E [O,m].Then

c1xj1)< *xj2)*6

c,xi (I)* ,

i

E

E.

Proof: Consider the case that C2bi” 2 bj2), i E E. By Corollary2.8, (C2xi1)*:iE E) is the minimal solution to the equation

Then, as an application of the comparison theorem, we have

czxy*2 $)*

,

iEE.

Similarly, we have

xi2)* 2 C l x i(I)* , Denote by and define

i E E. W

Pi the probabilityof the Markov chain (Xn) starting from i

In particular, M

n= I

Proposition 4.21. Let P = (Pij) be an irreducible Markov chain and H

#8

be a finite subset of E .

(1) The chain is recurrent iff f&

= 1 for all i $! H . Equivalently, for all i E E . (2) Let (9: : i E E ) be the minimal solution t o the equation

yi

PikYk

+ PiH,

E

fTH

=1

E,

Ic

where pi^ = CkEH P i k . Then the chain is recurrent iff yr = 00 for a l l i E E.

4.3 MARKOVCHAINS:TIME-DISCRETE CASE

133

Proof: a) Note that

f/A) = PiH,

f&+”

=

C

n 2 I.

Pzkfs,

WH

xr=lf!O,

Since f:H = it is easy to check, as we did in the proof of Proposition 2.14, that (f:H : i E E ) is actually the minimal solution to the equation

xi =

C Pijxj + P ~ H ,

i E E:

(4.6)

j@H

On the other hand, by Corollary 2.8, it is easy to check that

j € H n=1

Hence, by the localization theorem, (y: : i E E ) is the minimal solution to the equation

Take

C1 = ming;, JEN

Cz = maxy; j€H

By Eq. (4.6), Eq. (4.7) and Lemma 4.20, we obtain

+

Pick j , E H so that C1 = yT0. Then (1 yj*,)fjoH < y;o. Thus, if f j o H = 1, we must have y;o = 00 and hence y; = 00 for all i E E by Lemma 4.19 (1). b) Consider the special case that H = { j o } . Then, the first assertion follows from Proposition 2.14 and Lemma 4.19 (2). To prove the second assertion, note that by a) we have (1 y;o)f;o = y;” and so f;o = 1 y3t0 = 00. Conversely, let f;o < 1. Denote by f & ( r ) and y;(r), respectively, the minimal solutions to (4.6) and the equations given in the proposition when (Pij) is replaced by ( ~ P i j 0 ) ,< T < 1. It is clear that

+

>Y 3 )t

$7

The above argument gives us (1

fG0W 1 fG0

as ?- T 1.

+ ~;~(r))f;~(r)= y;”(~).

Hence

134


INVARIANT

MEASURES

This proves that fj*, < 1 + "y; < 00. c) We now return to general H . By the localization theorem, if for all i E E , then the minimal solution to the equation

f&

=1

equals 1 for all i $ H . This is equivalent t o say that f c H = 1 for all i E E . d) The second assertion for general W now follows from the expression of ya, b) and Lemma 4.19 (1). H For a given transition probability matrix P = (Pij),define a new transition probability matrix ?; = ( F i j : i , j E S) as follows:

-

,Paj =

{

&j

Pij

if i = 0 if i # 0.

Then 0 is an absorbing state. The following result is well known. Lemma 4.22. For each i E E , the limit ?ria = limndm Piu -3n) exists and

ii00

= 1.

Next, by the identitry

k-I

it follows that

Lcrrima 4.23. For each i Theorem 4.24. Let P the equation

=:

1, j z l = iiio.

( P i j )be irreducible. Then, the chain is recurrent iff

has a (finite) solution ( y i ) so that limi4m yi = 00 (i.e., (yi) is compact) for some finite H

# 0.

Proof: Trr vie'w of Eq. (4.6),(f& : i $ ,FT) is i,tideed the minimal solution to the equation xa = Papj - f ' P i f f , 1: 6 H .

c

j$H

Hence, we may regard H as a singleton {O}. a) Sufficiency. Let ( y i ) be a solution having the desired property. Then

4.3 MARKOV CHAINS:TIME-DISCRETE CASE

135

so and hence

it follows that iii~ = 1 for all i # 0. Letting n + 00, and then N + a, From this and Lemma 4.23, we see that f& = 1, i # 0. Finally, the assertion follows from

and the irreducibility of P. b) Necessity. Denote by (2,)the Markov chain with transition probability (&) and set SUCH THAT

Then f o ( n )= 0 for all n > 0 and f i ( n ) = 1 for all i 2 n. Because the originaI chain is recurrent, so we have f& = 1 for all i 2 0. Hence

Choose n k

1' such that

and define yi = c E , f i ( n k ) < possesses the desired properties.

00.

Then it is easy to check that (yi)

Theorem 4.25. Let P = (Pij) be irreducible. Then the chain is transient iff the equation JEE

has a non-constant bounded solution.

136

4 RECURRENCE, ERGODICITY AND INVARIANT MEASURES

Proof: Again, regard H as a singleton (0). Suppose that the chain is transient. Then there exists an i # 0 so that o:f = iii0 < 1. Note that i7io is the probability of Markov chain finally returning to 0 starting from i. We have

F

jEE

From this and i i o o = 1, we see that (yi = jiio, i E E ) is a desired solution. Conversely, suppose that the equation has a non-trivial bounded solution (yi). Without loss of generality, assume that yo = 1,

0 < yz

< 2,

2

E E.

Then

Furthermore

Letting n + 03, by Lemma 4.22, we get iiio 6 yi for all i E E . Because (yi) is not a constant, we have either yio < 1 or yio > 1 for some io. In the former case, we have i7iOo < 1. In the latter case, replacing (yi) with (2 - yi), we obtain the same conclusion. Now, the same argument given at the end of proof a) of Theorem 4.24 implies the required assertion. H Now, we turn to study the positive recurrence of the chain P = (Pij). We begin the study with a simple .result. Proposition 4.26. Let P = (Pij) be irreducible. If the equation

has a non-trivial solution (xi)so that Ci 1zil < 03, then the chain is positive recurrent. Conversely, only if the chain is positive recurrent, the equation

&+iPaj 6 Xj) xj 2 0, j E E ,

j EE CiEEZi < 0;)

has a non-trivial solution.

Proof: For simplicity, assume that the chain is aperiodic. Then, the assertions are easy to check by using the fact: the limit

exists and is independent of i, either

7rj 5 0

or

7rj

> 0 for all j

E

E. I

4.3 MARKOVCHAINS:TIME-DISCRETE CASE

137

Let P = (Pij) be an irreducible aperiodic Markov chain and (r2)bc a probability measure.

Definition 4.27.

(1) The chain is called ergodic (equivalently, positive recurrent) if P:?' + T j as n ---f oo for all i , j E E. (2) The chain is called geometrically ergodic if there is some j3 < 1such that for each i and j , IP)?' - 7rjl = O ( P n ) as n t 00. (3) The chain is called strongly ergodic or uniformly ergodic if sup, I P ~ I - T--3~ ~as n -+ oo.

o

Actually, these types of ergodicity have stronger properties. For instance, the geometrical ergodicity has the property: tpliere exist j3 < 3. and Ci, depending on i only, such that IPiT) ( - 7rjl Cip" for all i, j and n. Furthermore, we have

0. This is enough to guarantee the positive recurrence of the chain by the irreducihility. H

+

4.4 Markov Chains: Time-continuous Case This section deals with the recurrence and the ergodicity for Q-processes. define Given a conservative Q-matrix Q = ( q i j ) on E = Z+, nij

= l [ q i # O ~ (1- Sij) qij/qi

+ l [ q i = O ~ &j,

i , j E E.

Then @ I i j ) is a transition probability matrix and is often called embedding chain of the Q-process.

140


INV.4RTANT

MEASURES

Definition 4.32. We call the Markov chain P ( t ) = (Pi,(t)) recurrent if for each h > 0, P ( h ) is recurrent. Equivalently, S,”Pt,(t)dt = 00 for all i E E. Similarly, we call the Markov chain P(t) = (P%,(t)} positive recurrent or ergodic if so is P ( k ) for every h > 0 (cf Lemma 4.42 below). Equivalently, limt400 Pii(t)= T~ > 0 for all i E E .

By using the first; successive approximation scheme and Theorem 1.3, it is not difficult to prove the following simple fact. Lemma 4.33. For a given Q-matrix, t h e minimal q-process is irreducible iff so i s i t s Q-matrix.

As we mentioned in Section 4.2, thc recurrence of the q-process can be reduced to the one of the embedding chain. Let us copy Theorem4,9 as follows.

Theorem 4.34. Let Q = (q2,) be a conservative Q-matrix. Then

In particular, if Q is irreducible and regular, then (P,j(t))is recurrent iff so is its embedding chain.

Combining Theorem 4.34 with Theorem 4.24, we obtain

Theorem 4,35. An irrcducible conservative Q-matrix is regular with recurrent P ( t ) iff the equation 2 $ Jl CITijYj Yi,

0, (Pzj(h))is irreducible and aperiodic Markov chain. Hence t,he desired assertion follows from

4.4 MAR,KOV CHAINS:TIME-CONTINUOUS CASE

141

Theorem 4.37. Let Q = (qij)be a regular irreducible Q-matrix. Then P ( t ) is positive recurrent iff the equation

(4.10) has a summable, non-negative and non-trivial solution uniquely up t o a constant.

Proof: a) Let P ( t ) be positive recurrent. By Lemma 4.36, we have AND

Corresponding to the time-discrete case, we get rj = Ci rZPt,j(t).Next, from Theorem 4.17, it ibllows that

So (ri)is a solution to Eq. (4.10). b) Let (xi)be a summable, non-negative and non-trivial solution to Eq. (4.10). Without loss of generality, assume that

xi > 0,

c x i = 1;

2

E E.

a

Applying Theorem 4.17, we have

t 2 0, j E E .

- C.ZPij(t),

23. -

i

Letting t + 00, we obtain 0 < xj = x j . This proves not only the positive recurrcnce of P ( t ) but also the uniqueness of the solution to Eq. (4.10). The next two examples show that the positive recl.irrence, unlike the recurrence, of &-process cam not be rediiced 1,o the one of its embedding chain. Example 4.38. Take

Pa

0, i=l

n0,i = Pi, rIzj

= 0,

and qi = pa/2, 423. . = qiIIij, i not the Q-process.

221;

ITio=1,

i31;

otherwise

# j.

Then

IT

=

(Qj)

is positive recurrent but

142

4 REClJRRENCE, ERGODICITY AND INVARIANT MEASURES

Pro06 Clearly, ll = (Qj) has uniquely a stationary distribution as follows: 7rO : 1/2, T% = p , / 2 , i 2 1. On the other hand, since 0 = ( q i j ) is bounded, the Q-proccss is unique. By Theorem 4.37, if P ( t ) is positive recurrent, then xjqj = C i f jxiqij. Hence for every j 2 1, zjnj =

c

xi7rzrI, = xo7ropj.

i#j

xi

That is xj = xo. This implies that xi = 00, which is in contradiction with Theorem 4.37. Therefore P ( t ) is not positive recurrent. We have seen a,n example for which the embedding chain is positive recurrent but not the Q-process, The next example goes in the opposite way. Example 4.39. Consider a conservative birth-death Q-matrix: 0;=

hi, i 3 1,

C'glT/bi

bi + M such that recurrent but not II = {l&j).

Choose

bz

0,

2

2 0.

Then the Q-process is positive

00.

Proof: By Corollary 3.18, the Q-process is mique. The birt'h-death process is syniinetrizabie with respcct to ( p i ) (cf. Sections 6.1 and 6.2):

/GiP&) = jLj,Pji(t)> i , j where Pi =

Po = 1,

Because

cc'

bob1 bz-1 aIa2. ai + .

OC:1

c p i =l+box0

E ,F:,

i= 1

bi

,

t 2 0, 221

can not be determined completely by its embedding chain 11. So we need to study this problem more carefully, As .what we did for the tirncdiscrete case, we now study the three types of crgodicit.y. Definition 4.41.Let Q = (qi,j) be a regular irreducible Q-matrix and a probability measure.

( ~ i be )

(1) T h e chain P ( t ) is called exponentially ergod.icif there is some p > 0 such t h a t for each i a n d j , lPij(t)- ~jrjl= 0 ( e d t ) a5 t CXI. 4

( 2 ) The chain is called strongly ergodic or uniformly ergodic if supi IPij(t)-~JI4 0 as t + 00,

144

4

RECURRENCE, SRGODICITY AND INVARTANT MEASURES

For general Markov processes, Definition 4.41is ineaninghl once the pointwise convergence is replaced by the convergwr:e in total variation (cf. T h e orem 4.28 and Theorem 4.43below). The next result enables us to transfer the tirue-discrete case into the timecontinuous one. Lemma 4.42. Let ( X t ) t 2 0 be a Markov process. If for any h > 0, the skeleton (X(nh)),>o i s (resp. geometrically, strongly ) ergodic, then (X(t)) is (resp. exponentially, strongly ) ergodic.

Proof: Recall that for m y finite (signed) measure p on ( E , & ) ,its total variation norm can be represented as follows (by Hahn-Jordan decomposition):

For simplicity, in this proof, w e omit the subscript “Var”. a) The key fact in the proof is that. the lunction t - > IlvP(t)ll is nonincreasing on [0, m) for evcry bounded signed measure v on ( E ,-8).Actually,

Next, we show that for these types of ergodicity, the initial distribution p can be replaced by initial point mass. Actually, if

for all x c E , then for any p E

9((E),

0 and ( k - l)h, 6 !i < Ich (k: E Pi). By a), we have --)

and so n P ( t ) = T.Furthermore

4.4 MARKOVCHAINS: TIME-CONTINUOUS CASE

145

c) Suppose that ( X ( n h ) )is geometrically ergodic, i.e.,

/'T(dr)ll~(nhl+, -

= O(e-n,'(h)))

as n

--f

oo

for some P ( h ) > 0. Again, l e t t > 0 and (k - l ) h ,< t < k h ( k E we have

s

~ ( d z ) I l P (5t,,.)

-

TII

N), Then,

= O(e- t P ( h ) / h )*

d) Similarly, we can prove the assert,ion about the strong ergodicity.

H

Theorem 4.43. is ergodic iff[lP,,(t)= C j IPij(t) - 7tjl -+ 0 as all i E E . is exponentially ergodic iff IlPi.(t) - 7r]TIIvar = 0 ( e - P t ) as some ,L? > 0 and for all i E E . Equivalently, CinillPi.( n-l[var- O ( e - P t ) as t -+ cc for some p > 0. (3) The chain is strongly ergodic iff supi llPi . ( t ) - ~ [ = l vO(e-Pt) ~~ as t -+ m for some p > 0.

(1) The chain t --+ 00 for (2) The chain t -+ 00 for

Next, for a giver1 regular irreducible 6,)-matrix Q - (ye), let (X(t)),,, be the corresponding Markov chain defined on a probability space (12,F,IF). Its successive jumps are given by TO

= 0,

7,

= inf{t : t

> 7;2-1,X(t) # X ( T ~ - ~ ) } , n 2

Due to the regularity, we have T := finite subset of E and define aH = inf{t 2 following probabilistic criteria.

T~ T~

1.

= 00. Let H be a non-empty

:Xt E

H } . Then, we have the

Theorem 4.44. (1) The chain is ergodic iff R,a, < 00 for all 1: E H . (2) The chain is exponentially ergodic iff lE,eXo~~ < oc for all i E H , where 0 < X < qi for all i E F . (3) The chain is strongly ergodic iff supzEiaH < 00.

The analytic criteria for the t'hree types of ergodicity are the following. Theorem 4.45. Let H

# 8 be a

finite subset of E .

(1) The chain is ergodic iff the equation

(4.11)

146

4 R.ECURRENCE,ERGODICITY

AND INVARIANT

has a finite non-negative solution. (2) The chain is exponentially ergodic iff for some X i E E , the equation

MEASURES

> 0 with X < qi

for all

(4.12) has a finite non-negative solution. (3) The chain is strongly ergodic iff Eq. (4.11) has a bounded non-negative solution.

asReplacing (yi) in (4.12) with yi = yi + 1), one can rewrite ()4.12

To explain how to deduce these results, we need some preparritions. Define

n= 1

Lemma 4.46. We have f i(1) H - nZff~

=

f&+l)

cnil,f2,

n 2 1.

V H

Furthermore,

(fiH : i

E E ) is the minimal solution t o the equation

and the Markov chain is recurrent iff

f

i ..I ~

1 far all i E

Proof: Note that (1) f2H =

p,[qf= 71 1 = 1 - h

and

The r m a i n assertions are then obvious.

I

l?

4.4 MARKOV CHAINS:TIME-CONTINUOUS CASE

Lemma 4.47. We have

Proof: Obviously,

Next,

But by the strong Markov property, we have

and

By induction, this proves the required assertion. Lemma 4.48. For X E R with X

Then we have

I

< qi for all i E E , define

147

4 RECURRENCE, ERGODICITY

148

A N D INVARIANT MEASURES

In particular, ( e i H ( X ): i f E ) is the minimal solution t o the equation

Proof: The first assertion follows from Lemma 4.47 immediately. To prove the last assertion, apply Theorem 2.9 to

k$H

TZ-

1

and note that gi = fiH (i E E ) by Lemma 4.46. H Equivalence of Theorcm 4.45 and Theorem 4.44: a) Assume that the chain is recurrent. Then fiH = 1 for all i by Lemma 4.46. Hence by Lemma 4.48, ( e t H ( X ) : i E E ) is the minimal solution to the equation

On tmheother hand, since f i ~ ( t = ) lPi[uH> t]: by the Fubini theorem, we have

By the assumptions of Theorem 4.44, e i H ( X )< 00 for all i E H . From this, we show that eiH(X) < 00 for all i 4 H. Given i $ H , choose i o , i l , ,in with in = i so that io E H and -

-

a

-w. Thus, by compa,rison theorem, we

2 CP,,(X)(Cj - Xc),

i G E,

j

NOW,by the regularity, X Cj Pij(X) = 1, hence 1;2

XPaj(X)cj:

i E E,

x > 0.

j

Furthermore,

That is xzi

1

+

c

XPij(X)(Cj

- l),

i E E,

x > 0.

jEH

Since lim XP'j(X) = lim Pi,i(t)= rij = rj, x +o t4m

+

it follows that 0 2 1 C j E H nj(c, - 1).This s h w s that rj the irreducibility. I We now return to a condit'ion given in Theorem 4.14.

> 0 for all j by

150

4 RECURRENCE, ER.GODICITY AND

INVARIANT

MEASURES

Corollary 4.49. Let Q = (qi3)be a conservative irreducible Q-matrix. Suppose that there exist a compact h E T& and constants I: 0 and c > 0 such that IZh < C - ch. (4.15)

>

Then the Q-matrix is regular and the &-process is exponentially ergodic. Proof: The uniqueness of the process is a straightforward consequence of Theorem 2.25. Since h is compact, we can choose a finite H so that C ( c / 2 ) h -1 on H". Now, the asscrtion follows from Theorem 4.45. Applying Corollary 4.49 with h(x) = C , xu to Schlogl's model, it is easy to check that the model is exponentially ergodic. Actually, it, is strongly ergodic, as can be shown by Theorem 4.59.

1) is a non-negative solution to the equation

j=I

But the solution to this equation is also unique: i-1

3=1

for all i 3 1. Combining the

By induction, it is easy to check that zi = F,"' above facts, we obtain = 1 = Fo( 0 ) ,

xo

xis1 -xi

= Fi (0),

i 2 1.

This certainly implies the required assertion. d) To prove the second assertion, we use Theorem 4.45 (1). Let (ui)(uo= 0) be a non-negative solution to Eq. (4.11) with H = (0). Then k-1

j=O

j

k- 1

k-1

j=O

j =O

From this and induction, it follows that

where vk = u k S 1- u k , k 2 0. Hence L

.L

ic

, L

ic

4.5 SINGLE BIRTHPROCESSES This gives us d 6 u1 < 00. Conversely, a s u m e d

Obviously, we have C j f O q O j u= j qolul have

s=o

s=o

< 00.

155

Set

< 00. Moreover, for each k > 0, we

s=o

s=o

j=O

+

That is Cj qkjuj 1 = 0, k > 0. e) In view of Theorem 4.45 (2), the condition infi qi > 0 is indeed necessary. We need to construct a solution ( g i ) to Eq. (4.13) with H = (0) for a fixed A: 0 < X < infi qi. First, define an operator

This operator essentially comes from the study on spectral gap, and will be discussed in more details in Chapter 9. Next, define

Then p is increasing in i and cp1 = q&'. Let f = c q l 0 m for some c > 1. Then f is increasing and f l = cql0. Finally, define g = f I I ( f ) . Then g is increasing and

We now need a technical result, will be proved later, taken from Chen (2000b).

156


MEASURES

INVARIANT

Lemma 4.53. Let (mi)and (ni)be non-negative sequences, ni $ 0, satisfying

Define cpk =

Cizi nj. Then for every y E (0, l),we have

Proof: Let Mn = &2nmj. Fix N > i . Then by summation by parts formula and the assumption Mn ccp;', we get

0 is necessary by Theorems 4.44 (2) or 4.45 (2) [or Chapter 9, Lemma 9.7 with K = { k } ] . By Theorem 4.44(2), there exists a X with 0 < X < qi for all i such that EoeXoo < 00. Define eio(X) = e X t P i [ o o> t l d t , i 2 0. lim

C L o d k = lim dn F(0) n-w F p k=O k

n-mzn

I "

g1

I" +

Then IEieX"o = Xeio(X) 1. From the proof a) of Theorems 4.45 and 4.44, it follows that eio(X) < 03 for all i 2 1. Furthermore, EieXoo < 03 for all i 2 1. Note that if the starting point is not 0, then oo is equal to the first hitting time: T~ = inf{t > 0 : X ( t ) = O}. Hence lEieATo < 00 for all i

2 1. Define min)= E~T,". The Taylor's expansion (4.30)

leads us to estimate the moments mZ(").By a result due to Z. K. Wang [cf. Wang (1980, Chapter 3), or Hou and Guo (1981, Chapter 9), or Wang and Yang (1992)], we have

x c i-1

mi( l ) =

j=o

nx i-I

mjn)=

1

"

P A k=j+l

Pk,

1 " pkmf-'), j=o P A k = j + 1

C

n

2 2.

(4.31)

From these quoted books, one can also find the probabilistic interpretation while of S and R: S is the mean of the hitting time of 0 starting from "00)', R is the mean of the hitting time of "03') starting from 0. Actually, we have a general result as follows.

162

4 RECURRENCE, k h C O D I C I T Y A N D INVARIANT MEASUR

P r o p o s i t i o n 4.56. Far an regular and irreducible Q-process, (mjz := Eia;4 : i @ H ) is the minimal solution to the equation

Proof: Similar t o the proof of Lemma 4.48. W In the present case, obviously, m p ) 3 mLn) if k 3 i. By (4.31), it follows that

and

Hence, by induction, one gets

Combining this with (4.30), we obtain

which implies that

< 1.

0. Then the Q-process is

with rates b, = an always unique.

+

(1) The process is recurrent iff a < 6 or a = 6 2 p. (2) The process is ergodic (exponentially ergodic) iff a (3) The process is never strongly ergodic.

< 6.

1ti3

4.5 SINGLE BIRTH I"R0CESSES

Proof: a) The uniqueness assertion is easy since

Assertion (3) is also easy since

b) To prove the other assertions, we use

Kummer Test: Let (un) and (wn) be two sequences of positive numbers. l/v, = 03 and t,he limit K := limn+m ten exists, where Suppose that

xF

Then, the series respectively.

Cunconvmges or

diverges according to

K

> 0 or

Now, to prove recurrence, we consider the series Gnu,: un = Take v, = n. Then

[6

t&=n Q: -

So R = -too if a > 6, K

We have

K

-os if a

I I :

lf6(n+1)

< 6.

> 0 if 3 > 6, R < 0 if p < 6.

K

U(Y>

c

4 ( 4 Y)Ul 6

Q O l 9 .

PEE1

YfQ

When k 2 1, by using the argument which deduced (3.30), we obtain

j=1 \ y E E j

This shows that

uk

/

and moreover,

k-1 qkOuk

+

qkj(uk

- uj)

qk,k+l(Uk+l

- uk),

k 2 1.

j=1

Hence, we always have qkuk

6

qkjuji

>, 0.

j#O,k

Thus, by Lemma 4.51, Lemma 3.14 and Theorem 4.52, it follows that ui = 0. This contradicts with the assumption that u(x)$ 0. b) To prove positive recurrence, by Theorem 4.45, we need only to show that the equation

Y

Yf0

166

4 RECURRENCE, ERGODEITY AND

INVARIAN'I' h!fEASWRK;S

has a non-negative solukiori. For this, let ( u k )be a solution constructed in Lemma 4.54 and take u(x) = u k if z E EI,,

Now, for z # 6,there exists some k so that

k 2 0.

5 E Ek.

Hence

k-1

Y

On the other hand,

We have thus constructed a desired solution. c) The proofs for the remainder assertions are similar arid hence are omitted. I 4.6 Invariant Measures

An extension to stationary measure of a process P(d) = (Pij(t))or P ( P i j )is invariant measure, i.e., a non-trivial cr-finite measure 7r so that

n=;.rP(t)

or

=

T=TP

respectively. In this section, we study the existence and uniqueness of invariant measures. Of course, the uniqueness is in the sense of up to a constant factor. As usual, we restrict ourselves to the irreducible case. We begin our study with time-discrete case. the answers depend on recurrence or transience of the process. Let ( E ,X,, Pij)be an irreducible Markov chain with discrete times. Becausc of the irreducibility, if (n,) is an invariant measure, then 7rz > 0 for all i. For the positive recurrent case, the answer is quite sirriplc arid well-known.

4.6 INVARIANT MEASURES

167

Theorem 4.60. Let ( E ,X,,Pij) be an irreducible and positive recurrent Markov chain. Then there is precisely one invariant measure. More precisely, if the chain has period d ( 3 l), then the state E can be decomposed as a union of disjointed subclasses C O , .. Cd-1 so that Pk is an aperiodic chain on each CF Furthermore, the invariant measure is given by

Corollary 4.61. Let ( E ,X,, Pij) be an irreducible and non-positive recurrent Markov chain. Then i t s invariant measure, whenever exists, should be nonsumma ble.

Proof: See Proposition 4.26. For the recurrent case, the answer can be found in many books (cf. Chung (1967) Part 11, Theorem 9.7, for example). Here we state only the result.

Theorem 4.62. Let ( E ,X,,Pij) be an irreducible and null recurrent Markov chain. Then there is only one invariant measure, which is given by 00

n=O

where 8 is an arbitrary element in E and sP$) denotes the taboo probability: for a given B c E ,

B P23( o ) = b i j , B P23! n ) = P i [ x , = j , X I , . ' . ,Xn-l $ B ] . The transient situation is much more complicated. Let us look at some examples.

Example 4.63. Take Pi,i+l = pi and Pi,o = 1-pi for i 2 0, where pi E ( 0 , l ) such t h a t limn+m lJi=,p, > 0. Then the chain is irreducible and transient. It has no invariant measure.

Proof: Clearly, if (ri)with

7ro

=

1 is an invariant measure, then

Thus, there exists an invariant measure iff limn-,m n;=,p, = 0. On the other hand, n;=,p, describes the probability that the chain does not return pk to 0 in the first n + l steps starting from 0. Hence, condition = 0 means that the probability of the chain returns to 0 in finite steps starting from 0 equals 1. That is equivalently to the recurrence of the chain.

nLz0

168


INVARIANT MEASURES

Example 4.64. Take P,,,+l= p

ri'

1/2,

P,,%-1=

1 - p =: 4 )

Then the chain is irreducible and transient. measures.

Proof: Siippose that Vk+l

-

( T ~is )

Xk

2

= 0, 51, &2). . .

.

It has infinitely many invariant

an invariant measure of the chitin. Then

+ T L l P = 0,

k = 0, f l ,f 2 , .

2

'

.

To this equation, the solutions are as follows: 7rk = cl(p/q)k

+ c2,

k

= 0, fl: 42,.

..

Hence, for any c l , c2 0 , c1 + c2 = 1, (ni)is an invariant measure. This proves our assertion. H

Example 4.65. Take

Then the chain is irreducible and transient. It has precisely one invariant: measure.

Proof: Notice that the probability of the chain goes to infinity starting from 0 is 00 1/2rJ2i+l

- 1)/2i+1 > 0.

i=l

Hence the probability of the chain without return to 0 in finit'e steps starting from 0 is positive. So the chain is transient. Next, let (ni)be an invariant measure. Then

Set (7i-i)

1, then nL - 4. It is easy to sce that is positive, simply use induction. I

7r0 =

(T?)

is unique. To show that

Proposition 4.66. Let ( E ,X,, PiJ)be an irreducible, transient Markov chain. Without loss of generality, assume that E = 25,. Suppose t h a t the chain has an invariant measure, then there exists a simple path coming from infinity. T h a t is, there exist mutually distinct states i l , 22, such t h a t Pipil > 0, Pi3,i2 > 0, '

...

-

8

169

4.6 INVARIANT MEASURES

Proof: Define a, dual chain as follows: PzJ r 7 P J i , f T a , i , j Then

p::)

E

E.

= 7rjP:r)/7ri,i , j E E . Note that an irreducible chain is transient

iff

00

(4.34)

EP"'o0. n-0

We have

oc

M

n=O

n=U

x,,

So the dual chain ( E , Pt3)is irreducible and transicnt. Without loss of generality, we assume that the chain is aperiodic. Then, for almost all LJ, -

-

,..

PXl ( W ) , F O ( W ) , PJTZ ( W ) , T ? 1 ( w ) , are all positive. On the other hand, since the chain is transient,, for almost all w , in thc sequencc ( y o ( w ) , y l ( u ) ,. >,no clement can be appeared in infinitely many times. In other words, for almost all w , there exist infinitely many distinct XO( w ), XI ( w ), * . * so that ~ x l ( w ~ >, 0,~ o%(w),xl(w) (w) > 0 , ... a

This completes our proof. W We are now ready to present an existence criterion {br invariant measures. For simplicity, we take E = Z+ again. Moreover, we allow the chain to be sub-M arkovian : (4.35) CPij61, i € E j

As we did before, for H f f P2.7( 0 ) 6,j,

c E , define the taboo probability as follows: PiklPklkz * ''Pkn-]j, i,j E

ffp(n) z.7

E , n 2 1.

k 1 , . . ., k n - l $ ! f f

Theorem 4.67. Let ( E ,X,,Pij) be an irreducible and transient Markov chain, maybe sub-Markovian (i.e., (4.35) holds). Then, the chain has an invariant measure iff there exists an infinite subset K c E such t h a t

where

0 0 0 0

(4.36) r=j ,=I

This criterion is quite deep in the theory of hlarkov chains. Unfortunately, its proof is lengthy and so is omitted here. Refer to Harris (1957) and Veech (1963) for details. As a straightforward consequence of Theorem 4.67, we have

170

4 RECURRENCE, ERGODICITY AND INVARIANT MEASURES

Corollary 4.68. Under the hypotheses of Theorem 4.67, if for each i E E , there are only finite k E E so that Pki > 0, then the chain has an invariant measure. Now, we turn to study the time-continuous Markov chains. For the recurrent case, there exists uniquely an invariant measure, which can be seen from the time-discrete case at once. Thus, we consider only the transient case. Again, assume that the chain is irreducible. Hence, we have

0 < J, PZj(t)dt < w.

(4.37)

It is similar to the time-discrete case, in the present situation, we may have no invariant measure. But excessive measures do exist. Lemma 4.69. Let Q = ( q i j ) be an irreducible, regular Q-matrix and P ( t ) be transient. Then for each probability measure cr on E with finite support,

is a finite positive excessive measure of and t 2 0.

P ( t ) : pj 2

xi

p i P ; j ( t ) for

all j E E

Proof: By (4.37) and the assumption, the assertion is obvious. Now, it is natural to ask when an excessive measure becomes an invariant measure. More general, for a given ( p i ) , when we have pept = p

for some p

< O?

t2o

~(t),

(4.38)

If so, by Lemma 4.15, we should have (4.39) i#j

Define a dual chain as follows. &(t) = e - P t p j P j i ( t ) / p i . Clearly, Pij(t) is a Markov chain and (4.38) implies that

t

P ( t ) l = 1,

> 0.

(4.40)

F'urthermore, its Q-matrix is as follows:

4.. = p j.q j i / Pa, a3

4i = P +

Qi.

Thus, in order to having a pinvariant measure, it is necessary that pE

[

- infqi,

01. a

(4.41)

Moreover, (4.39) implies that the Q-matrix = (ijij) is indeed conservative. Next, assume (4.39) and (4.41), then the construction of the minimal process gives us Pij(t) = pein(t).Combining this with (4.40), we see that Q = (Qij) is regular. We have thus proved the following result.

4.7 NOTES

171

Theorem 4.70. Let Q = (qi,) be an irreducible and regular Q-matrix. Then ( p , ) is a p-invariant measure iff (4.39), (4.41) hold and = (qt2,)is regular. The dual chain used above is introduced in terms of an excessive measure, but we can also introduce a dual chain by means of a finite, positive excessive function, in view ol (4.37). The last tedinique even works for more general state space. Refer to Chcn and Stroock (1983).

4.7 Notcs

For a more complet,e theory of Markov chains, refer to Aldous and Fill (1994-), Anderson (1991)) Chung (1967)) Hou (19821, Hou et a1 (1994, ZOOO), Hu (1983, 1985), Wang (1980)) Wang and Yang (1992), Yang (1981). In particular, the complete proofs of the results in Sections 4.3 and 4.4 are included in Anderson’s book. The ergodicity has been studied for much more gcneral state space in the time-discrete case. Refer t o Nummelin (1984), Meyn and Tweedie (1993b) and references within for more details. For closely related results in the timc-continuous cmc for the general state space, refer to Down. Meyn and ‘rweedie (1995)) Meyn and Tweedie (1993a). Section 4.1 is mainly due to Dobrushin (1970). Remark 4.6 was pointed to the author by L. P. Huang. Lemma4.10 is due to Basis (1980). The particular case “c < 0” of Theorem 4.14 was appeared in Basis (1980) and Chen (1986b, 1989b)) based on a time-discrete analogue obtained by Dobriishin(l970). For Markov chains, the special case of Theorem 4.14 (i.e., Theorem 0.11) was obtained by Tweedie (1975, 1981). The present form of Theorem 4.14 seems to be new but quite natural. The proof given here simplifies greatly the original o i m . Theorem 4.9 was presented in Chen (1986b), for which the author was benefited from a conversation with S. W. He. The proof of Theorem 4.9 adopted here was actually contained in Feller (1957). The sufficiency of Theorem 4.24 is due to Foster (1953) and Kendall(l951). The necessity is due to Mertens, Samuel-Cahn and Zarnit (1978). Kendall (1959) introduced the term geometrically ergodic for irreducible Markov CzJ,f3G,where &, < 1. Vere-Jones (1962) chains for which P$’ - 7rJ showed that /3 can be taken to be independent of z and j. Then, Nummclin arid Tweedie (1978) showed that the coefficient C,, can be chosen independent of j . If we want to have a universal coefficient C , then thc gcomctrical ergodicity turns to be the uniform ergodicity as proved by Isamson and Luecke (1978). Theorem 4.28 (2) is due to Numrnelin and ’l’weedie (1978) and Numrnelin and Tuominen (1982). Theorem 4.28 (3) is due to Isaacson and Luecke (1978). For the special case that H = (0). Theorem 4.30 (2) goes back to Kingman (1964) and Theorem 4.30 (3) is due t o Huang and Isaacson (1976). Theorem 4.31 (l), (2) and (3) are due to Foster (1953)) Popov (1977)

I

I
+(U)= (P1-P2)-(Uc). Since

we have V(Pl,P2)3 ?jllP1 - P211Var.Conversely, for given PI and P2, we define a coupling is as follows:

where A is the diagonal in E : A = {(z,x): z E E } . Note that one may ignore lac in the above formula since (Pl-P2)+ and (PI-P2)- have different supports. Then

= (PI - P2)-(Uc)= 1 - (PI A P2)(E).

This gives us V(P1,P2)

< a IJP1- P2

as required.

I

To conclude this section, we study a dual expression of the W-metric. Note that the definition of W ( P l ,P2) is meaningful for any finite measures PI and P2, not necessarily probability measures. In what follows, unless otherwise stated, we will work in this general setup. For simplicity, rewrite Pi = (PI - P2)+ and Pi = (PI - P2)-. Let A1 be the support of Pi and A2 = A"1 which is the support of Pi.

Lemma 5.8. Using the above notations, we have

5.1 MINIMUM LP-METRIC

Proof: Let projcction:

181

P' be a coupling of Pi and Pl. Denote by 7r1 : E x E -r E Ike 7rL(z, y)

7

IC.

Define

E'(C) = P

l A Pd(7dC n D ) ) ,

where D = { (z, x ) : x E E } . We now prove that of PI and Pz. To do so, let B1 E E , then

R(B1 x E ) = Pi(&)

+

= ?;I t2 is a coupling

(Pl A P2)(B1) =

Pl(B1).

Similarly] wc have E(l3 x S,) = Pz(B2). From this, it, follows that

since the support of R' is D.

We now consider the simple case of E being finim. Writ,e E= {xl, . xn}, pij = p ( x c i , z j )and pi') = Pi({ai}), pj2) = P;({zi}).Then pl1)pj2)= 0, i = 1 , 2 , ' - . ,n. Moreover, 1

w ( PY ~),= i n f (

~ ~ , ~ p i j r "cij c i j3:

4

0, z j x : i j = p i(1), z i x ; j =yj( 2 ), 1 Q i , . j 611.).

Recause of the primal-dual relation for the linear programming problem, we have

=sup{ / f d ( r ; - P ; ) : f(Yd-f(Yz)

1

< P ( Y I , Y d , yk E A k , k = 1: 2 .

We have thus obtained the following resiilt. Lemma 5.9. For finite E , we have

wq, Pi)= sup

{Ifw; --

: f(Yl)

-

f(l/z)

< P(Y11Y2)'

Yk

1

fEAkr k = L 2

Theorem 5.10. Let zo E G , ,%(E) = { P E L @ ( E:)/ p ( q x , ) P ( d l c ) Then for P1,Pz E P o ( E ) , W(Pl,f'2) = sup = Sup

}

< ca}

"

I

fd(P1 - Pz) : f E %p(E), L,(f) < 1 ,

(5.5)

1

(5.6)

(s {1

fd(P1

I

1'2)

: f E b s p ( E ) )L(f)

61 ,

where Yip(E)denotes the set of Lipschitz continuous functions on E and L(f) denotes the Lipschitz constant of f E Y i p ( E ) ~

282

5

PROI3AI3ILITY

METKICSAND

COUPLING

METHODS

Proof: Dcxiote by L(P1,P2) the riglit-hand side of ( 5 . 5 ) . Obviously, L satisfies the trianglc incqualit,y. a) By Lemma 5.2, we can choose a coupling of PI and P2 so that W(P1,P2) is achieved at P . Then

as a subset of the separable space C ( Q ) with uniform topology and hence is separable. However, the proof does not mean the separability of ( b 2 z p ( E ) ,j j 1Iu). This is not surprising since the Lipschitx continuity is not a topological concept. We remark that the space bLFip(E) may riot be separuble with respect to the Tipschitz norm. To see this, consider E = R. Given E E R)define fc(z) =: 0 if J: 6 , 1. P,"'"(X, x:! A n En)< Pmin(A, 2,A n ET1),

5.2 MARGINALITY A N D REGULARITY Noting that

189

PFin(X, 2 , A n En) = 0 when IC @ En, we have

P,"'"(X, z,A

n E,)

t ,"lim cc

P,"'"(X, z,A n En)

< Pmin(X, x,A).

On the other hand, by using the previous proof a), replacing A with AnE,, we obtain

lim P,"'"(X, z , A n En) 3 Pmin(X, 2,A),

X

> 0,z E E , A E 8.

,-+a3

We have thus proved the first assertion and then the second one follows.

Theorem 5.19. If both of the marginals are regular j u m p processes, then so is every coupling Markov process. Conversely, if a Markovian coupling is a regular jump process, then so are i t s marginals.

-

Proof: a) Jump condition. Let Pk(t,xk,dyk) and P ( t ;zl, z2;dy,, d y 2 ) be

the marginal and coupled Markov processes respectively. By the marginality for processes, we have &;

3

2 1 ,z2; (21

1 x @2>)

&;z1,x2; ( 2 1 )

x

E2) -

F(t;x,,2,;E1 x (E2\

3 P(t;z1,22;{z,} x E2) - 1 + P(t;z1,22;E1x = z1, h)) .- 1 + P2(t,z2, {z2H.

w,

{.2)))

(4)

If both of the marginals are jump processes, then lim, ,o F(t;z,,x,; {zl)x {z,}) 3 1. This means the Markovian coupling P ( t ) must be a jump process. Conversely, since p@;~1,22;{ x~( 1 34 })

< F(t;z1,z2;{x,} x E2) = P 1 ( 4 ~ 1 , { ~ 1 } ) ,

F(t) is a jump process, then limt ,o Pl(t,zl, {zl}) 3 1 and so Pl(t) is also a jump process. Symmetrically, so is Pz(t). b) Equivalence of total stability. Assume that all the processes concerned are jump processes. Denote by ( q k ( z k ) ) q k ( z k , dyk)) the marginal q-pairs on g k ) , where

if

(a,

z2),Q(x,,x2;dyl, d y 2 ) ) a coupling q-pair on (El x Next, denote by (Q(z,, E2, g ) ,where

190

5

PROBABILITY METRICS AND COUPLING METHODS

We need to show that' @(Z) < 00 for all 53 E El x E2 iff q,(a,) V q 2 ( x 2 )< cm for all z1E El and x2 E E2. Clearly, it suffices to show that, Y l ( 4 vq z ( 4

s"(%.2)

q1h)

+Y

, W

(5.9)

Note that we can not use the conservativity nor iiniqiieness of the proccsses at this step. But Eq. (5.9) follows from a) and Theorem 1.4 immediately. c) Equivulence of corzservatzvity. From now on, we assume that all the q-pairs considered below are totally stable. The problem is that in general, we know that limt+o P ( t ,2 , A)/t = q(x,A ) only for x $ A f 9 rather than A E &. The last assertion holds once the q-pair is conservative (Theorems 1.5 and 1.13). Let the marginal q-pairs are conservative. We first prove that

Moreover, the convergence is uniform in A". To do so, by Lemma 1.7, choose {EP)}? c 9 k such t,hat E p ) T EI, as n -+ 00. We h a x seen from the proof a) that 1-F(t;z1,a2;{q}x

{.2})

< 1 - ~ l ( t , ~ l , { z l +} )1 - P 2 ( t , x z , { ~ , } ) .

Hence {Ein)x Ep))nbl c & and Ein) x E p ) t El x E2 as n + 00. Note that

=: I

+ II +m.

Besides, by the marginality for processes, we have

5.2 MARGINALITY AND REGULARITY

191

Fix n large enough so that zk E E f ) . Since n ( E p ) x E p ) ) E @, we have limt,oI = 0. Next, since the marginal q-pairs are conservative, we have limt,olI ql(zl, (E!"))') q2(z2,( E P ) ) ' ) . Collecting these facts together, we obtain

2

0,

Xk E Ek,

- 1, 2.

Ak E 8,,,k

This is again cieduced by using the comparison theorem. By assumption, the marginal q-pairs are regular and so is the coupling g-pair by Theorem 5.19. Herice

-

independent of r 2 .

Y(X;21:2 2 ; A , x Ez) = Pi (A, xl: A , )

Now thc inverse implication (5.8)*(5,7) 7(5.) follows from t h e uniqueness tbeorem of Laplace transform. H

5.3 Successful Coupling and Ergodicity Based on the results obtained in the last section, we assume for the remainder of this chapter, that all the q-pairs considered are regular. I n this section, we discuss a typical application of the W-metric and coupling methods. Let (X,', X,") ( t 2 0) be the path of a coupling jump process and set,

T = inf{t 3

o : X: = x,"}.

Definition 5.21. A coupling is called successful if I -

IFD"L*"qT< m] = 1, and

XL

f

,--. P " ~ + z [ x= , ~X; for a l l t 2 T ] = 1,

Suppose that

@'z11z:2

(5.12)

Tc2

(5.13)

x1 # x 2 .

is a successful coupling, then

[ I P ( t , X I.), - P(t,X,, . ) [IVar

< 2F-[T

> t ] ---t 0

Furthermore, if the process has a stationary distribution initial distribution p , we have

as t T,

-4

00.

then for any

5 PROBABILITY METRICSAND COUPLING METHODS

196

where P ( t ) is the transition probability function of the original process, By using this way, we prove the ergotlicity of the process. Now, it should be clear that the study of successful couplings is related to the distance of total variation. In general, the succcss of couplings is weaker than the recurrence of the process arid hence weaker than the crgodicity (See Example 5.50 below for instance). For the opposite direction, the next result gives us a reasonable solution.

Theorem 5.22, Let P be a probability kernel on a Polish space ( E , p , $ ) satisfying the following conditions. (1) P ( z , .) E 9'0for all z E l3, where gowas defined in Theorem 5.10. (2) There is a constant c E [O, 1) such that

0 and write t = [t/h]h+ ht,

b) Next, for each t > 0, by Theorem 5.22, there exists uniquely a 7rt E 9 such that 7rt = 7rtPt. The proof will be done once we show that 7rt = 7rs, since then we would have 7r = 7rPt and furthermore

c) The proof of 7rt = 7rs is based on the semigroup property plus the Noting uniqueness of the fixed point of the mapping cpt(p) = pPt, p E 90. that Pt 0 Ps(.t) = Pt+s(%) = Ps 0 'Pt(7rt) = Ps(Tt)r by the uniqueness of 7 r t , we have c p s ( 7 r t ) = 7rt and then 7rt = 7rs by the uniqueness of 7rs. 1 Before moving further, we mention that if we are interested only in whether the two marginals will meet or not, then we can ignore condition (5.13). In this case, we can even allow the two marginals to be different processes. On the other hand, if the two marginals are copies of a single process, it is often easy to modify the coupling process so that (5.13) holds. In this sense, condition (5.13) is not essential. Next, in the study of successful couplings for jump processes, we may and will fix a coupling q-pair (ij(z,,x 2 ) ,Q(z,,z 2 ;dy,, dy,)) and then justify whether the corresponding process is successful or not. Thus, our main task is to find some conditions, depending on the q-pair only, to ensure the success. In this section, we restrict ourselves t o the case that

( E l ,€1) = (E2,€2) = ( E ,8) and

01 = 0 2 .

Denote by ?(t;xl,z,; dy,, d y 2 ) the jump process determined by the coupling q-pair. Then, condition (5.13) becomes

P ( t ;z, z; A ) = 1,

t 2 0,

2

E E,

(5.14)

198

5

PRORARILITY hh3'l'RICS A N D COUPLING

METHODS

where A = { ( x , x ) : x E E } . Equivalently, $(x,x;Ac) = 0 for all z E E . Under (5.14), we have

and so condition (5.12) is now reduced to for all

Definition 5.24. A coupling q-pair is called successful if (5.14) and (5.15) hold.

s(A)

To state our criterion for success, let be the operator corresponding to lhe kcrriel @(x1,x2;dy,, &,)/(A n(xlLz,)) and let afi(A) denote the restriction of IT(x) to E' \ A. Set aii = ~ I - I ( o > .

+

Theorem 5.25. A coupling q-pair (ij(z,, z2),ij(al,x2;dy,, d y 2 ) ) is successful iff the following conditions hold.

(1) @(a, z; A') = 0 for all x E E . (2) q(xlrx2) > o for all ( x l , z2) E (3) T h e equation

has only the trivial solution

IL

E2 \ A.

= 0.

Proof: As we mentioned above: the first two conditions of the lheorern are necessary. Hence, we need only to show that condition (3) is cquivalent to (5.15) irndor the assumptions (1) and (2), Now, mxurne (1) and (2). By using the Laplxcc transform, (5.14) becomes

AF(A;x,x;A)= 1,

x E E, X > 0

(5.17)

and we can rewrite (5.15) as

Thus, we need only to show - that (3) and (5.18) arc equivalcnt. Fix X > 0. Sirice (AP(A;:cl, x2; A) : :cl, :c2 E E ) is the rriiriirrral sulution to the equatiori

5 . 3 SUCCESSFUL

by (5.17) and the localization theorem, we see that ( X s ( X ; x2) is the minimal solution to the equation

f(A)(s,Jz)

199

COUPLING AND ERGODICITY

a) :

51,~ 2 ;

= aii(x)f(X)(s,,s,)+~(zl,~,;A)/(X+~(al,22)),5 1

#

# 52.

Noting that

and using condition (2), we obtain

xF(x;~,, x 2

; ~ ) some f(xl, x2) as

x 1 0,

x1 # z2

and (f(x,, x2) : ( x l , x2) E: E2 \ A) is the minimal solution to

f(%.2)

= nfifl(z,, .2)

+

q

Y

h

3

52:

~ ) / d ( w4, 21 # zz.

Thus, ( h ( z l x2) , := 1- f(rc,. a,>: (xl, x 2 ) C E Z\ A ) is the maximal solution to Eq. (5.16). I Even though we have a general procedure to approximate the maximal solution to Eq. (5.16) (cf. Lemma 2.39). €€owever,such a procedure is somet i m e s not very practical, so we would like to propose some more ef1ective sufficieril conditions for success. Take 0 $ E2 arid set EQ = ( E 2\ A > U { H } , & = ' T { & ~ ~ E ~ \ (0)). A, Define a transition probability on (&I, &) as follnws: I"

-

V~(0,O) = I,

P0(z1,Z2;A)

[ 6 ( ~ ~ , ~\2(0)) ; - ~d-g"(z,;x2;A)IA(@)]/~(X~IZ~),

x1 # x,,

AE

&.

(5.19)

Intuitively, this transition probability is nothing but considering the set A as a single state 0.

Corollary 5.26. Assume t h a t (1) and (2) of Theorem 5.25 hold. Let h E &, h 3 0 and h ( 0 ) = 0. Suppose that there exist constants C > 0 and 0 c < 1 such that Poh C c h on -& (5.20)

C/[(l - c)(l - k ) ] such t h a t

P ~ ( J : ~ , zX ~I ; ~k ) for all (xLT1,z2) E Eo satisfying

h(x,, x2) 6 K .

(5.21)

Then the coupling q-pair is successful. In particular, if (5.21) is satisfied for all z1 # z 2 ,then the same conclusion holds w i t h o u t using condition (5.20).

5 P'KORARILITY METRICSANT) COUPLING METHODS

200

Proof: Consider the process (Z(n,) := (X'(n,),X2(n))),20 defined on a ,..

probalility space ( f 2 , 3 + ,IF) valued in (Eo,&) with transition probability Po. What st" need is to show that I E i r [ ~ ( ~ ) . + o---t] 0 as n t 00. Put

4, = .Ip(n)pe],

2 0. Since 6' is an absorbing state arid In-l = 0 ITL= 0, by (5.20), wc gct lEJ, = IE [I,-,lE(J, I Z ( n - l ) ) ] Jn

= In q q n ) ) ,

*

6 E [I,-lFoh(Z(n - I ) ) ] < IE [ In-1 (C c h ( Z ( n- 1))] = C E I n _ 1 + clEJn-l.

+

(5.22)

On the other hand, by (5.21), we have

EIn

= E [ irn-lFo(Z(n - 1);

E2\ A ) ]

6 IE [ In-lFo(Z(n - 1);E 2 \ A ) ; h ( Z ( n- 1)) < K ] + E [ In-lpo(Z(n - 1); E2 \ A); h ( Z ( n- 1))> K ]

< lclEIn!,_l + K-liEJ,-I.

(5.23)

Combining (5.22) with (5.23), we obtain

Since thc eigenvalucs of the matrix on the right-hand side are smaller than 1, we see that the left-hand side goes to zero as n + 00. 'l'he same proof, even sisnpler, will give us the lad assertion. W Corollary 5.27. Let E be endowed with metric p. Assume that conditions (I) and (2) of Theorem 5.25 hold. Suppose that q ( x ) is locally bounded. (1) If for every such that

T

> 0 there

exists a bounded function p : [O,T]

GO,

~('POP)(2,,~,)$.rl(~1,~,)

O, ~N )' }~,

-

c(F:+lQ

N

1.

n-1

Ph- 9=

- @9),

9 E 80,

k-0

by conditioris (1) and (2) of Theorem 5.25 and condition (5.24), if wc takc g = aT:= p o p , lhen

That is Let n …… and then N …… to get

This proves (5.27). Next, fix r2 > 0 and set F = f o p _ F'rom (1) and (2) of Theorem 5.25 and condition (5.26), it follows that P2F 6 F for all n 2 1. Hence, for p(x1,z2) := T E ( o : T ~ )E, ~ ~ J ~ FA (~ z0 ,( ~ ~) ) ~ ( x , , x ~ Letting ) . n -+m, by (5.2?), we see that

sr,]

(To,, STJ.

5

202

PROBABILITY

METRICSAND

COUPLING

METHODS

z2(T = 00) = 0. This shows that our coupling is successful. b) We now consider the converse case. Let x1 # x2. If @ z 1 ~ z 2 = GO] > 0, then @1,z2 [ T = m] > 0 and so there is nothing to do. Thus, we may and will assume that F ~ . " ~ [ TO,

k=O,l,*..,M.

(3)' There exists a sequence {A,}T of finite subsets of E2\ A so that

A,

1' E2\ A and a

positive function cp such that

(5.29)

ficp60

(5.30)

onE2\n.

Proof: Condition (2)' is clearly necessary. Now, assume that (1) and (2)' hold. Again, consider the subset A as a single absorbing state 8. Setting cp(8) = 0 and using the notation defined by (5.19), we have cp on Ee and

0. By (5.32) and

k = 1, 2.

(d+ W)'P'L(%)

, 0, we obtain a p-optimal measurable coupling Ft(xl, z2;dy,, dy,):

-

-t

x2) 6 P y ( x l , x,)

P

(5.38)

u

for all measurable coupling Pt of Pl(t) and Pz(t). Define

1 -6, G ( y 2 , A ) = --P tn

(?,A \ { z } ) ,

2 E E'1 x Ez,

A E 8 1 x 8 2 , 72 3 1,

By the marginality, we have

6 4 1 ( 4 + 42(Z,),

2 := ( Z 1 , 2 2 ) .

-(n)

(2,El x E2) < w for all X: E El x Ez. Next, since G c ) ( ~ k , A k=) q k ( z k , A k ) , we have G P ) ( x k , -% q k ( x k , Thus, for fixed 5, by Le Cam's Theorem, { G r ) ( x k ,-)}, is uniformly tight and so Hence supn21 G

a)

a).

-(n) is {G ( 5 ,.)}1L31 for fixed 5. Therefore, by Selection Theorem, there exists a transition rncasure y(2, -) such that for each ?, there is ( n ( 2 ) ) c { n } SO that &n(S))(Z, .) -% y(Z, .) as n ( 5 ) 4 00. Hence, we obtain a coiiservative y-pair (q(Z), G(Z, d j j ) ) which corresponds to an operator as follows.

fIf(2) =

1

q(?,dj)f(G) - q(z)f(2),

2E

x E2,

f

b(81

x

82).

From lhe construction, it is readily to see that is a coupling of 01 and 0 2 . Thus, it remains to check the p-optimality of G. To do so, we consider first bounded p. In this case, the proof is easier than Lemma 5.30. Because cp is bounded lower semi-continuous, by Theorem 4.5 and (5.38),replacing .$by Markovian coupling &t) with operator 6: we obtain flp(x1, x2)

5 PROBABILITY METRICSAND COUPLING METHODS

210

(2,,22) E

= fi(p(X1,X2),

El x &.

Finally, we consider general p. Let ( p n = cp A n. Then we have proved that there exists a (p,-optimal coupling operator of fl1 and slz such that for all coupling operator 6 of R1 and 0 2 :

a(n)

Following Lhe argument above, we know that { i j ( n ) ( Z ,d$)}n21 is uniformly tight, and so we can appIy Selection Theorem to find a transition measure Q(?, d y ) , as rz weak li,mj,t;of q(n(5))(ii,djj) as n(2)--t no. Then deGrie ari operator The cp-optimality now follows from Theorem 4.5 and the rnonotone convergence theorem. The construction oE an explicit optirrial coupling is usually not easy. Here we mention two results, taken from Chen (1994a), their proofs are omitted here.

n.

Theorem 5.37. Let p be a translation-invariant metric on Z+ and set uk:= p(k 1) - p ( k ) , k 2 0, where p(k> = p(0, k ) . Then, for birth-death process,

+

(1)

-

n, is p-optimal whenever iz

- il

=: k

7i.k

is decreasing in k. Moreover, we have for

2 1,

f L p ( i 1 , i 2 ) = (ailA M u , + , +(ail v bi&k

-

-

( b i , v a i z ) u b I .- (hi, A a t 2 ) U , - 2 ,

where u-1 = uo. (2) If uk is increasing in k , then fl, is p-optimal. Moreover, for k 2 1,

fl,p(i1,i2) = [(Ui,-%)

f

+ (bi, - bi, ,']w-,i."

- a21

>+

+(hi,

22 - i l

-biz)+]

=:

Uk-1.

Theorem 5.38. Let ( u k )be a positive sequence on Z., Define a distance p(m,n) = uj - Ejsnu j Then, every coupling mentioned in 50.2,

I Ej 0,

x u j / m j , j=O

where M = rn(r) 2 1 is a constant which will be determined later. Then, for ~ ~ ~ ( c p o p ) ( i l+q(il,i2) ,iz) < 0, by (5.55) replacing zhk by cvuk/?-nk, it suffices that

That is

Since u k

1 and m 2

1, it suffices that

Equivalently, ai

+

bi+k

Since the last term is negative for large enough a , by (5.52), this is equivalent

to or

Thus, one may first choose rn = m ( r ) large enough so that the right-hand side becomes posilive for all k T , and then the inequality holds for all k r whenever N is large enough. 1

, d tq) ( d 2 ) , A ) .

q(x(’),AC) = i=l

0

Our conclusion follows from Theorem 5.47. An alternative way, even simpler, to prove the monotonicity of the process is using the following coupling

(5.58) which gives us

Example 5.54. Let us consider a special case of the above example: d = 1, Pll = 1 and F(do is non-explosive. That is, P [ X t E El = 1 for all t 2 0. The following result explains the meaning of time-reversibility. Definition 6.1. A Markov process (Xt)t>o defined on (n,S,IP)with state space ( E , 8 ) is called reversible, if for any finite n 2 1, 0 t l < t 2 < ...

0. Equivalently, .for all f, g e be or e+ and all t > 0 Proof?: The last assertion follows from the monotone class theorem. By the , we haveassumption for all

and

Thus, if the process is reversible, then

So the condition is necesssary. Congives us versely, assume (6.4). This means that P(t) is self-adjoint on L2() , the space of all real square integrable functions with inner product: Let and Then, we habe

satisfy

and

Define

229

6.2 EXISTENCE

Now, the assertion follows by setting ft = I A ~ 1, 6 i Actually, we have also proved the following result.

< n.

I

Corollary 6.4, Every reversible measure of an honest P ( t ) is a stationary distribution.

The above theorem leads to the following Definition 6.5. Let P ( t ,2,d y ) be a sub-Markovian transition function. It is said t o be symmetrizable (resp. reversible) if there exists some a-finite (resp. probability) measure T such that (6.3) holds for all A , B E 8. Equivalently, (6.4) holds for all f , g E &. Similarly, by replacing P ( t , z , d y ) with q(z,d y ) , we can define a symmetrizable (reversible) q-pair.

It is worthy to mention that for f , g E E;, r ( f P ( t ) g )is meaningful but maybe infinite. In the L2-theory, which is the topic of the last two sections of this chapter, in order to avoid infinity, we usiinlly restrict f , g to the domain of P ( t ) . But in the first, six sections of the chapter, we do nal want to be involvcd in the L2-theor;y, so we allow (f,,q) = m. Lemma 6.6. If a jump process P ( t ) is symmetrizable with respect to a measure n,then so does its q-pair

Proof: Since

P ( t ,z,( 3 2 ) ) 2

e - q y

havewe Now, we split the proof into three steps. a)

b) By a), we obtain c0 In general, by a) and b), we have

6.2 Existence The first important result about symmetrizable q-processes is as follows:

230

6 SYMMETRIZABLE

JUMP PROCESSES

Theorem 6.7. The minimal q-process is symmetrizable with respect: to

T ifF so does its g-pair. In particular, for a given symmetrizable q-pair, there always exists a t least one symmetrizable q-process.

Proof: Recall that for f E S+,f1 denotes the kernel f ( x ) d ( s , d y ) . The necessity was proved in Lemma 6.6. 'lb prove the sufficiency, let

Then P r n i n ( A ) . -

C z ,P ( n ) ( X ) .

On the other hand, because

is symmetric in A and B , so P ( l ) ( X )is syrnmetrizable with respect to Suppose that F""j(A) is symmetrizable with respect to T T . That is

T.

By the monotone class theorem, this is equivalent to

Next, since I l ( X ) is symmetrizable with respect to

T ,we

have

Therefore, we obtain

here in the last step, we have used (2.24). Thus, P(")(X)is symmetrizable with respect to 7r for all n 1, and so is its sum Pmin(A).H Having the existence result in mind, we now discuss the uniqueness problem for symmetrizable q-processes. Clearly, we have

Corollary 6.8. If the q-pair (q(z),q(x,dy)) i s symmetrizable with respect t o x and the q-process is unique, then there is only one g-process which is symmetrizable with respect t o T .

6.2 EXISTENCE

23 1

In general, the uniqueness problem is quite hard, the remainder of this section is devoted to introduce a non-trivial sufficient condition. NoOe that

> 0,3A E €such that P(X,5 , A ) # Pmin(X, x,A ) } = {x E E : 3X > 0 such that P(A,z, E ) # Pmin(X, x,E ) ) = { x E E : VX > 0 such that P(X,2,E ) # Pmin(X, x,E ) }

{x E E

: 3X

is clearly &-measurable, hence we may introduce

Definition 6.9. We call y-processes P(X) and Pmin(X) are n-equivalent, if

~ { Ex E

:

3X

> 0, 3 A E &such

t h a t P ( A , x , A ) # Pmin(X, 2,A ) } = 0.

If P'"(X),k = 1 , 2 , are all n-equivalent t o Pmin(X), we call themselves n-equivalent.

Obviously, we have Lemma 6.10. Two n-equivalent q-processes are, or are not symmetrizable with respect t o

7r

simultaneously.

Lcmma 6.11. Let ( q ( z ) , q ( % , d y ) )be symrnetrizable with respect to n and { f x : X > 0) be a consistent family of functions. Then, the equality nfx = 0 holds or not simultaneously for a l l X > 0.

Proof: Let rfp= 0 for some p . Then, by consistency, we have

here in the second to the last step, we have used Theorem 6.7. W Lemma 6.12. Let the q-pair be symmetrizable with respect t o

7r.

( 3 ) If for some X > 0, dirn@A = 0 n-a.e., that is xu = 0 for all u E then dirri%x = 0 n-a.e. for all X > 0. In this case, we write dim@ (2) I f for some X > 0, there is u A E 9?!~ so t h a t

then, there is a consistent family of functions { u x E 0 < IIuxII1 < cc for all X > 0.

%A

:X

> 0)

@A,

0.

so t h a t

6 SYMMETRIZABLE JUMPPROCESSES

232

Proof: a) Let u x f %A for some A. Define up = R(X,p)uA for all p. Then, by Lemma 2.36, U~ E qpfor all p > 0 and { u p : p > 0} is consistent. Hence, the first assertion follows from Lemma 6.11. b) Let uA E q7dr, satisfy 0 < IIuxII1 < 00 for a fixed X > 0. Define the consistent family {uI1: p > 0 ) as. above. Then, tls we have just seen that nuc, > 0 for all p > 0. Moreover, by symmetry7 we obtain IIP1lyp)~xII1 =

(UA,

prnin(pII) 6 ru-yux, 1) = IIuxllI/F
0 such that P ( X , z , E )# Pmin(A,x,E)} c {x E E : 3X > 0 such that zA # 0}, which is a n-null set.

Proposition 6.14. Let 7rzA < 03. Then there exists n-almost honest syrnmetrizable Bg-process ifF the following conditions hold. (1) The q-pair is symmetrizable with respect t o n-. (2) The q-pair is n-almost conservative.

Proof: It will be proved later (Proposition 6.25) that for every Bq-process P(X>, x E E. d( s) = lim X [ 1 - XP(A, 5,E ) ] , A-00

This plus the n-almost honesty implies that n-d = 0. Hence the necessity of the conditions is clear. Next, when conditions (1) and (2) hold and dim% = 0, the existence is trivial. If dim % # 0 but dim % 0, then each Bq-process 7r

is x-equivalent t o ,"'"(A)

and x-almost honest. Finally, if dim%'#O, then

P(A) := P n i n ( X )

+ zApx/[Xpx(E)]

6.4 GENERALREPRESENTATION OF JUMP PROCESSES

233

provides one of the desired process, where p x = x [ z x l ] . What we are going to do for the remainder of this chapter is more or less related to the last two results. -4s we will see later, the uniqueness criterion for this special type of jump processes can not be deduced directly from the one for general jump processes. Actually, it is even not known completely. 6.3 Equivalence of Backward and Forward Kolmogorov Equations 111 this section, we prove that the two Kolrnogorov equations are almost equivalent.

Definition 6.15. W e say t h a t the backward (resp. forward) equation ( B A )(resp. (FA)) holds 7r-a.e. if there is a Ir-null set N so t h a t for all 2 $ N , a l l A E 8 and X > 0, ( B A )(resp. ( F A ) holds. Theorem 6.16. Let P(X) be a n-symmetrizable q-process. Then P(A) satisfies (BA),n-a.e. iff P(A) satisfies (FA),7r-a.e.

Proof: Let P(A) satisfy (Bx),r-a.e. Choose { E n } so that En n(E,) < 00 and supzEE, q(x) 6 n. Then, by Lemma 6.6, we have

E with

Tdking Radon-Nikodym clcrivative, we obtain

The exceptional set depends on X > 0, A and n. However, for fixed z and A, by the resolvent equation, P(A,x,A) is continuous in A, so we can choose an exceptional set depending on A and n only. Furthermore, since ( E , & ) is Polish space, 6 is countably generated, we can choose an exceptional set independent of A and n. Similarly, we can prove the other assertion. 6.4 General Representation of Jump Processes

To study further the uniqueness problem of symmetrizable q-processes, we need to know more details about the exit and entrmce hoiindaries of thc processes. In this section, we first introduce some basic results on Feller's boundary theory. Then, we present a general representation of jump processes. This section can bc regarded as an addition to Chapter 2. For convenience, set %A(C)

= { u E %A

:

u < c}.


234

Lemma 6.17. Let iiA = P"'"(X)d, X > 0 and let tix be the maximal element in %x(l) which can be obtained by the procedure: fix X > 0 and let E1

U(n+l)

n(x)u(4,

n

B 0,

then u(") 1 GA. Next, {fiA}and { f i x } are consistent families. Moreover,

Proof: The approximation scheme for {fix}is obvious. By Lemma 2.39, we know that for each fixed X > 0, zA is the maximal solution to the equation Ic = r I ( X ) X

+ d(X + q ) - l ,

0

< Ic < 1.

On t,he other hand, P"'"(X)cl is the rnininial solution to the same equation, so as their difference, { E x } must be the maximal element in Wx(1).The consistency of {GX} follows from (2.40). Now, as the difference of two consistent families, { U x } must be also consistent. I Recall that for the consistent families { f ~ and } {qA}, we have

Definition 6.18. We call f := 1imx-o fx and 71 := 1imx-o qAthe canonical images of (fx) and {qA} respectively.

In what follows, for the consistent families {fx}, { u x } , {qA}, {qx} and so on, we use f , u , q and f j to denote their canonical images, respectively. Lemma 6.19. For the canonical images, we have the following decompositions:

where LW

F(x, A )

. I

lim Pmin(X, 2,A ) =

A--0

Pmin(t, Z,A)& =

rIn(IA/q)(x). n=O

Proof: Ry consistency, we have

f x = fJL

+ ( P - X)Prnil'(X)fp,

fp

= fx

+ (A - p )

Pmin(X)fp.

Letting X 1 0 in the left equality, we obtain the first desired decomposition o f f . Let,ting p J. 0 in the right equality, we obtain the other decomposition of f. The proof for the decompositions of rl is similar. I

6.4 GENERALREPKESENTATION OF JUMP PROCESSES

235

Next, recall that Eo = {x : d(z) > 0) and set

Lemma 6.20. For the consistent families {G,}, {GA) and their canonical images ii,U we have

(6.7) (6.8)

(6.9) (6.10) (6.11) where uo is the maximal element in

having the property:

Proof: All assertions are clear except the ones about we have u, ii, = xx < 1.

14'.

By Lerrima 6.17,

+

Define

1

uo := lim (1 - a x X-+O

-

iiA) = 1 - u - fi 2 0.

Then, we have (6.11). Using Lemma 6.17, (6.8) and (6.9), we obtain (6.12). To show that uo E %, noting that by the definition of uo,we have XPrn'"(X)l

as x

J. uo

1 0.

If q(x) = 0, then we certainly have uo(x)= lfuO(s).Otherwise, the equation (Bx) also implies that u O ( z )= LIu"(x). Finally, for any solution f to the equation f = XPmin(X)f, 0 < f 6 1, we have

< xPmi"(x)l,

f = XPmin(X)f

arid so f

< lirri~-+OXP"'"(X)l

= uo.

Lemma 6.21.

(1) A family

{f,: X 3 0)

of functions is consistent iff it has the representa-

tion : f,

= prriin

(

W

I

+ A,


236

where w satisfies Pmin(X)w E b€+ for some (and hence for a l l ) A > 0 and (f, E %'A} is a consistent family. Moreover, w and hence fx are determined uniquely by = (XI -

UT

(2) A family {q, : X

n)f,, fx J, 0, > 0 ) of

-+

as X

w

T 00.

measures is consistent iff it has the representa-

tion:

77,

Xfx

2

#emin (A) + q x ,

where K E ?+ satisfies ~ E P ~ ~ E ~ 9 ~+ for ( Xsome ) (and hence for a l l ) X > 0 and { f j x E W,} is a consistent family. Moreover, K. and hence f j x are determined uniquely by

44 = % ( X I

- %4)7

rlx

-1 0, h ( 4 44 +

as

AEmE,,

T 00, n21.

Proof: Assertion (2) follows by combining Lemma 3.3 with Lemma 3.4. In a similar way, we can prove assertion (1). W Corollary 6.22. As A

t 00,

we have

-

P1x

-1 0:

6, 10,

X U , -+ 0; AfiA --t d.

> 0 } be a consistent family of measures. := AqAu0 < 03 independent of X > 0.

Lemma 6.23. Let {q, : X ' 7

Next, let

{fx

f E bd?. Then

> 0) X7,f T

:X

Then

be a consistent family of functions with canonical image as X 00. In particular, for a E Eo, if we set

."X(.)

= Pmin(X, 2,{ a } ) d ( a ) ,

then {u:) has canonical image ua = v fAl

.- Xq,ua ._

'LI"

= v;

vn,

a ) d ( a ) . Moreover,

T

+ q,({u})d(u)

0 0 7

independent of p

> 0.

Proof: Actually, it wus proved in (2.55) that

h f = PVpf +

- ChpfX7

so the second assertion follows. Similarly, by Lemma 6.20, we can prove the first one. Now? it is clear that, u: = 21;

+ ( A - p)q&,

a E Eo.

But Xu: 6 d(a), so the last assertion follows by the dominated convergence theorem. W

6.4 GENERAL REPRESENTATTON O F J U M P PROCESSES

237

Definition 6.24. Let P(X) be a q-process. We say that the backward equation at point II: holds if for all X > 0 and all A , P(X)pA(z)= P I ( X ) P ( X ) I A ( X ) f (A f q ( x ) ) - l I A ( x ) . The ncxt result improves Theorem 1.15.

Proposition 6.25. The backward equation a t z holds iff d ( z ) =: lim X [ 1 - XP(X,2 , E ) ]=: D ( z ) .

x

(6.13)

'M

In general, we have the inequality replacing "=" with " 3 " ,

Proof: a) As we did in Section 1.1, enlarging ( E ,8 ) by a fictitious state A, we obtairi ( E n ,&"a), Define

Clearly, PA(^) is a q-process on ( E n ,&A) with qa = 9 . 1 ~50 . the limit

lim X ( l

x+cQ

AP(X,x,E ) ) = lim A2P,(X, x-+m

x:(A})

exists for all x E E . On the other hand, by Lemma 6.17 and Corollary 6.22 we have

D ( z ) : = lirn X ( l - XP(X,z,E)) 6 lirn Axx(.) x-tm A400 = lim (XG, XG,) = d ( z ) . x-+m

+

This proves the last assertion. b) By a), we have q,(A) = 0 and

for all II: E E and for A in a ring and hence for all A E 8'. Hence, applying the backward Kolmogorov inequaiity to both of P(A)and Pn(X),we obtain

6 SYMMETRIZABLE JUMP PROCESSES

238

where

r

Thus, whenever D ( z ) = d ( z ) , the above every inequality should become equality. In other words, (Bx) at 2 holds. c) Conversely, if (Bx) at x holds, lhen

Lemma 6.26. Let P(X) be a q-process and set u x = 1 - XY(X)l. Then, we have u x >, l l ( X ) u x .

Proof: By using the resolvent equation with some computation, we obtain

Now, the assertion follows by letting We can now state the general representation theorem of q-processes. Theorem 6.27. Every g-process has th e following form

P(X) = Pi'1(X)

+ B(X) + X ( X ) F ( X ) ,

X

> 0,

where

X ( X ) := P"""(X)[dl], X

>0

(6.14)

and F(X) are kernels having the following properties. (1) For foxed x and A , B ( . , z , A ) and b;'(.,z,A)are continuous. (2) For fixed X and J:,

B ( X , z ,.), F(X,J:,*) E .A?+, XB(X)l(z)< 1, XF(X)l(J:) F ( X ) I ( X ) = o for J: 4 E'.

< 1,

(3) For fixed X and A , B(X,+ , A )E e ~ ( l / X )F(X, ? A ) E T&+. (4) If B(X)= 0, then F(X) satisfies the resolvent condition e ,

F ( p ) = F ( A)

+ (A -p )F ( A) pmin (T+ d F ( p ) )= F ( A) { I+ (A ,p( p)} . (,I)

-

(5) The q-condition holds:

lirri A2X(A)F(X)I~ = 0, x-+m

A E &nE,, n 2 1.

6.4 GENERAL REPRESENTATION OF J U M P PROCESSES

239

Furthermore, if P(A>satisfies the backward equation a t x,then F(X,x,*) 5 0. If P ( X ) satisfies the forward equation, then for every 2,B(A,x,.), F(X,x,.) E WA. Finally, P ( X ) is honest iff

XB(A)l = for all X

and

XP(A)l = 1

(6.15)

on E*

0.

Proof: a) Let P(X) be a g-process. Since every q-process satisfies the Kolmogosov inequality a,nd Pmin(A> satisfies ( B x ) ,wc have

Moreover, by Proposition 6.25, for the conservative point 5: d ( s ) = 0, the inequdity becomes equality for all X > 0 and all A . Hence, we may define

F ( X , s , A )=

the left-hand aide of (6.16)/d(x)

if x

(=_

Eo

if x $ E'.

(6.17)

Clca.rly, we have

and F(X,x:.) = 0 whenever P(X)satisfies the backward equation at 2. Moreover, for fixed 2 and A, E(X,5 , A) is continuous in X and so is X(A)F(A)(x,A). From this, msertkm ( I ) follows. b) Next, note that Lemma 2.34 (1) implies

P(A) - Prni"(X) >J P"'"(X)(dF(X)). So we may define B(X) = P(X) - Pmin(A) - Prni"(X)(dF(A)) 2 0.

01' course, we have

Now, P(X>can be written as follows:

P(A)= P"'"(Aj

Preniultiglying ( X I

-..

+ X ( X ) F ( X )+ B(X).

(6.18)

st), we obtain

( X I - ( 1 )P(A)= ( A 1 -fl)P*li" (A) + (A& S2)X(A) F ( A) = I+ ( X I - fl)X (A) F ( A).

6 SYMMETRIZABLE JUMPPROCESS

240

Noting that, by Eq. ( B A ) we , have (XI

-

fl)X(X)J = ( A T

-

st>Pmin(X)(df) = d f .

(6.19)

Conibining the above two facts:, we arrive at ( X I - a)P(A)l(rc)= 13. d(x)F(A)l(x),

X > 0: x E ED.

(6.20)

On the other hand, by Lemma 6.26, we have ( X I - R)P(X)l

< 1+ d / X ,

x > 0.

It follows that X F ( X ) l < 1 for all X > 0. c> It is emy to check that if B(X) = 0, then P(A) satisfies the resolvent equation iff

But

Hence, the above equality becomes

Now, applying (6.19) to this eqrrality, we obtain (4). d) Suppose that P(A) satisfies ( F A ) .Then

B(A)(M - n)IA+ X(X)F(X)(XI- n ) I A = 0,

A E 8 n E,,,

Premultiplying ( X I - a),we obtain d F ( A ) ( X I- ~ ) I = A 0,

A E 8 n En, n 2 1.

Moreover, we also have

B(A)(AI- 0 ) I A = 0,

A

E8

n E,,

2 1.

TI

31

6.4 GENERAL REPRESENTATION OF JUMP PROCESSES

241

This shows that for fixed X and z, B(X),F ( X ) E W,. e) We now prove the last assertion. The sufficiency is obvious. To prove the necessity, let P(X) be honest. By (6.20), we obtain X+Xd(z)F(X)1(2) = (XI-R)XP(X)l(z) = (AI-fl)l(x) = X+d(x), z E EO. Hence, X F ( X ) l = 1 on Eo. Substituting this into the expression of P(X),we obtain

XB(X)l+ X(X)XF(X)l

1 = X P ( X ) l = XP"'"(X)l+ = 1- z , =1

+ XB(X)l+ P"'"(X)d

+ XB(X)l- zx + f i x .

Thus, by Lemma 6.17, we have

XB(X)l = 2,

- 6,

= u,.

f ) Finally, we consider the q-condition. By Corollary 2.28, P(X) is a qprocess iff

But we have proved that B(X)l E

e~(l/X so ), by Lemma 6.17 we have

From this and Corollary 6.22, we obtain

0.

P ( X ) = P"'"(X)

Then

+ u,cpx,

X

>0

(6.21)

is a q-process iff either c p = ~ 0 or c p can ~ be obtained by the following procedure.

(1) Take

K.

E p+ so that K.P"'"(X) E 2f+.

(6.22)


242

(2) Take a consistent family { f j x E Wx} of measures so that

+

(6.23) (6.24)

q x := nPrni"(A) ?jA, 21A

r

:= X f j A i i

< 03, A

21

00.

Set

' 5 = X?jxuoC=

independent of A.

0;)

(6.25)

(3) Next, take constant c so that

ao+ n(u0 + ii)

+ 21 < c.

(6.26)

(4) Finally, take

cpx = q J [ c

+

K(U -

+

(6.27)

Ux) Xfj,U].

Moreover, the q-process P(X) constructed above must satisfy ( B x ) . It satisfies ( F A )iff n = 0. It is honest iff the q-pair is conservative and c = ' 5

+ KUO.

(6.28)

Furthermore, if d i m 9 = 1, then we have constructed all Bq-processes. If d i m 9 = 1 and the q-pair is conservative, then we have indeed constructed all q- processes.

Proof: Let (6.21) hold and c p $0. ~ By Lemma 2.43, we have rn;'

- xqxii =: c

independent of A.

Now, the normal condition gives us

XE,cp,(E)

< ii,

(6.29)

3- i i x .

E 9 ~ (and l) But iiA 00

iix = P"'"(X)d

= CIqX)"d/(X

+ 4) G zx G 1,

0

premultipIying II(A)" in (6.29) and then letting m + 03, we see that (6.29) is equivalent to Xcpx(E) 1. That is X q A ( l - U) 6 c. On the other hand,

1.

TEE,

Lemma 6.29. Let (q(x),q ( x ,dy)) be symmetric with respect t o be a consistent family. Then

7r

and { f ~ }

(1) XxfA is increasing in X. for all X > 0 and nf, < .x for some (equivalently, (2) If moreover f, E for a l l ) X > 0, then qx := 7 r I f ~ l E) WA for all X > 0 and { q x : X > 0) is consistent.

Proof: a) By Lemma 6.11 or is independent of > 0. Next, by consistency and symmetry,

.This proves the first assertion and b) Let

and hence c) Finally, by symmetry, we have

which proves the last assertion.

Then


244

Lemma 6.30. Let ( q ( z )q(z, , d y ) ) be symmetrizable with respect to

T.

Then

(6.30)

Proof: By Lemma 6.29, we need only to show that limx,m XxGA = n d . But XTGx = X ( P m i n ( X ) , d ) = (d, XP"'"(X)l)

== (d, 1 - zx).

So the assertion follows from the consistency of x , ~ Lemma6.21 , plus the monotone convergence theorem. I Lemma 6.31. Suppose that the q-condition given in Theorem 6.27 holds. Then, we have

lim X F ( X , z , A ) = 0,

x+co

J: E

Eo, A E € nE,, n 2 1 .

Proof:

Proposition 6.32. Let the given q-pair (q(x),q(s, d y ) ) be symmetrizable with respect t o 7r. Then, for the existence of an honest .rr-syrnmetrizableq-process, it is necessary that one of the following conditions holds.

(1) n d = 00. (2) limx,, XnU, = 00. (3) nd limx,, XnU,

, (JE,, uozx). Letting X + 0 and then n noting that zx 7 (1 - uo) as X L 0, we obtain (uol 1 - uo) = 0.

-+

03,


246

Lemma 6.34. Let (q(x),q ( x , d y ) ) be reversible with respect t o

T. Define

If U x # 0 and U x # 0, then U x and U x , and hence fjx and jjx are linear independent. The images of fj, and i j x are f j = n [ 4 and ij = 7r[UI],respectively. Moreover, lim Xijx(A)= ~ ( d l ~ ) , A E 8 n En, x+m

n 2 1.

Proof: Since { f i x } and {GA} are consistent, the consistency of f j x and i j x follow from Lemma 6.29. If clUx c2Ux = 0 holds for some X > 0, then c2(XI - R)G, = 0 since f i x E %A. But from Gx # 0 we know that d # 0. Hence (XI - R)U, = d # 0.

+

This implies that c2 = 0. Next, from fix # 0, it follows that el = 0. Finally, since the property whether f i x and fix are zero or not is independent of A, we have thus proved the linear independence. As for the last three equalities, simply apply Lemma 6.20 and Lemma 6.21. W

Lemma 6.35. Let fjx and i j x be the same as above. Set

Then, we have

(1) w:~

some wab as X

(2) wi2 = wil for all X

co,a, b = 1 , Z . Moreover,

> 0 and so w12 = w21.

Proof: The first assertion is a straightforward consequence of Lemma 6.34 and Lemma 6.23. By using these lemmas again, we get

and so wi2=

wil

for all X

> 0. W

6.5 EXISTENCE OF HONESTREVERSIBLE JUMP PROCESSES247

Write

W(X) = (w?

4 1 )

WZl

For a given matrix

R(X)=

:;(

rY)

>

r i2 we write

P(X) = P"'"(X)

p).

+ (fix, ux)R(X)

(6.31)

Lemma 6.36. Let (q(z),q(z,d y ) ) be reversible with respect t o 7r. Then P(X) is a reversible q-process if the following conditions hold.

(1) R(X) is non-negative and symmetric. (2) R(X)W(X)l < 1, x > 0. (3) (4) Moreover, the process is honest iff the equality in (2) holds.

Proof: Clearly, condition (1) implies that P(X) 2 0. Write

Then Xp",(E) = Xri1q,((E)

+ X.;%jx(E),

2

= 1,2.

On the other hand, by (6.11)) we have

here we have used Lemma 6.33: qxuo = (uO,u.x)

< (uO,q)6 (uO,1 - uO)= 0,

X

> 0,

(6.32)

and ljxuo = 0, X > 0. Thus, condition (2) implies the normal condition X P ( X ) l < 1 and furthermore the process is honest iff (2) becomes equality by (6.15).


248

To show that P(X) satisfies the resolvent equation, by consistency of U , and G, plus some computations, it suffices that

cpt - 9; + (A - p)cp:P(pu>= 0 ,

k = 1,2.

This can be deduced by using consistency of 5j,, ?jA and condition (3). Clearly, (4) follows from (3). Finally, since limx,oo XU, = 0 and limA+-ooXii, = d , we have lim X ( I A - X P ( X ) I A )

A-+m

= lim X ( I A - X P m i n ( X ) I ~) - (0, d ) x-+m

So the q-condition is also satisfied. 1

Lemma 6.37. The hypotheses and the notations are the same as in the previous lemma. Suppose in addition that 0 < .rrd < lim XnuA< co. A-+m

Then, we can choose a matrix R(X) so that P(X) defined by (6.31) is an honest reversible q-process.

Proof: Because nd

> 0, so Eo # 0 and GA # 0. Moreover, lim Xnii, 2 .ird > 0, A-+m

we also have ii,

# 0.

Now, take hl = lim Xnii,,

hz = x d ,

A-+W

r1 = (hl - h2)/h?,

r2

= l/h1.

Clearly, r1 2 0, r2 > 0. Next, take = [rl

+ ri(w22- w?~)]/A,

ri2 = ril = T~ [ 1 - r 2 ( d 2- w;')>I/A,

ri2 = r,"(wll - wil)/A, where

n = [ 1- r2(w12 - w,12 ) ]2 + T;(wll

- w:1)(w22

- w?) - r1(w1l

- w?)

and w $ ~and wab(a,b = 1,2) are defined by Lemma 6.35. It is easy to check that wab(a,b = 1 , 2 ) are finite and so are w:~. The computation of R(X) comes from Lemma 6.36 (3). Letting p i 00, we obtain R(X) = R ( I - (W - w ( X ) ) R ) l-.

From this, it is easy to show that R(X) constructed here satisfies all conditioris oC Lenrxna, 6.36 and moreover, the process is honest. We can now state our main criterion.

6.6 UNIQUENESS CRITERIA

249

Theorem 6.38. Let ( q ( x ) , q ( z , d y ) )be reversible with respect t o T . Then there exists a n-almost honest and .rr-reversible q-process iff one of conditions (1)-(3) of Proposition 6.32 holds. Proof: The necessity g v a proved in Proposition 6.32. To prove the sufficiency, we discuss the problem according to different cases. a) Let ( q ( z )q, ( 5 , dy)) be conservative. By Proposition 6.28,

> 0, i.e., fjA(3) > 0, then

is a q-process, it is clearly reversible. When niix it is actually honest. Ot'herwise, T [ 2E

E

:

XPrni"(Xj(;c)l < 11 6 T [ U X # 01 = 0,

and so Prnin(X) is a n-almost honest q-process. b) Let dim@ 0 and Eo 0. That is, the q-pair is n-almost zero-exit and n-almost conservative. In this case, we still have r [ z E E : XPrn'"(X,2,E)< 13 =7r[zx # O ]

c) Let Eo 0 and 0 < limx,,XnuA constructed in a) satisfies 7r

[. E 15

: AP(A, x,E )

< 7r[ux# O ]

< 00.

+ T [ i i A # 01

=o.

In this case, the q-process

< 11 6 T [ d # 01 4-7r [d = 0, A P ( X ) l < I ] -n[zA-uX #O,d-O] =

piA# 0, a = 01 = 0.

d) Let 0 < nd 6 limx+m X.rr?IA < m. In this case, an honest reversible q-process was constructed by Lemma6.37. e) Let n d = 00 or lirnAds X7riiA = 00. By Lemma 6.30, we have limxdm Xnzx = 00. Then, qX = ijx := 7r1tzx-l] satisfies Proposition 3.5 (4),so by Proposition 3.5, P ( x ) = Pmin(X) -k Z A ~ ~ A / ( A ~ ) A ( E ) ) is an honest q-process, which is clearly reversible.

I

6.6 Uniqueness Criteria

In this section, we fix a symmetrizable q-pair (q(z),q(z,dy)) with a symmetrizing measure T , which is also fixed. The uniqueness studied here is naturally in the sense of ;.r-equivalence.


250

Proposition 6.39. If %'AnL'(n) = (01, then there is only one n-syrnmetrizable Dq-process.

Proof: Suppose that the Bq-processes are not unique. Then, by Theorcrn 6.27, we should have a Q-process with the form P(X) = P"'"(X) 3. B(X) with B(X,.,A) E for all X > 0 and A E 8 and nB(Xo)l > 0 for some Xo > 0. Choose large enough n so that n ( B ( X o ) I ~ > , ) 0. But then for this n, we also have

0.

Proposition 6.40. If

(1) %A n L1(n) = {O} (2) n d

and

< oc,

hold, then there is only one n-symmetrizable q-process.

Proof: Let P(X) be an arbitrary .Ir-symmetrizable q-process. Then by Theorem 6.27, it has the representation:

+

+

P(X) = Prni"(X) B(X) X ( X ) F ( X ) . But by condition (1) and the proof of the last proposition, we see that B(X,+, l?) = 0, T-ax. On the other hand, by Lerrirria 6.31, we have lim XF(X).[A

A+cQ

= 0,

AE

&nnE,,

n 2 1.

Hence lim

X-CO

( x P ~ ~ ~ ( x ) I E~, X , F(X)I)

= lim X-+m

(I,yn, XPmin(X)dXF(X)l) (by symmetry of P"'"(X>)

= lim (1, XP"'"(X)dXF(X)I,gn) (by symmetry of P(X) and Pmi"(X X-+m

=

lim (XP"'"(X)l, ~ X I ; I ( X ) I E ~ ) (by symmetry of P"'"(X))

x-+m

= 0,

72

2 1.


251

Here in the last step, we have used condition (2) and the dominated convergence theorem. Moreover, by Theorem 6.27, F ( p ) = F(X)( I + (A - p ) P ( p ) ) . Hence

Combining these facts together, we obtain

and so

Thus, there exists a n-null set N such that

Note that the jump condition gives us

By Theorem 1.14, it follows that limA,m the other hand, by symmetry,

XPmin(X)(I~cdXF(X)l) = 0. On

(1, P"'"(X)IN) = ( I N , P"'"(X)l) = 0.

So we can choose a n-null set fi so that Pmin(X, z,N ) = 0 for all z @ Therefore, we have actually obtained

N.

From this, it follows that

Now, by Proposition 6.25, we see that P(X,z, .) satisfies the backward equaTo see this, we can either use tion. Therefore, P(X) is equivalent to Prn1"(X), condition (1) and Proposition 6.39 to claim the uniqueness of Bq-processes or use Theorem 6.27 to claim that F ( X ) l = 0, n-a.e.


252

Theorem 6.41. There exists only one 7r-symmetrizable q-process if the following three conditions all hold.

(1) ( q ( x ) q(x, , dy)) is symmetrizable with respect t o 7 r . (2) %A n L ~ ( T=) {o}. (3) Either inf{Pmin(X)l(z): z E E o } > 0 or 7rd < 00. Proof: By Proposition 6.40, we need only consider the q-process having the form P(X) = Prni"(X) X ( X ) F ( X )

+

and consider only the case that inf{Pmin(X)l(x) : x E E o } > 0. By Thecrem 6.27 (4), we have

Hence

Letting p

--f

00,

we get

XF(X)IA 2 F(X)QIA,

X

> 0, A E € n En, n 3 1.

Thus on € n En, n >, 1.

U(X) := F(X)(XI- 0) 2 0

(6.33)

Of course, for every X > 0 and z, U(X,x,.) can be extended uniquely as an element in p+.We use the same notation U(X) to denote the extension. By Lemma 2.34 (a), it follows that

Set

V(X) = F ( X ) - U(X)Pmin(X)2 0,

X

> 0.

(6.34)

Then by (6.33) and (6.34) we obtain, for each xo E E , that

V(X)(XI- Q ) I A ( z ~=) 0,

X

> 0, A E € n En,n 2 1.

That is

v ( X ) I ~ ( z o=) V(X)Q((X

+ q)-'I~)(z~),

X

> 0, A E 8

(6.35)


253

by the monotone class theorem. On the other hand, by symmetry, T ( A )= 0 implies that

2 2

1 1

E ( ~ ~ ) P " ' " z, ( X(,x } ) d ( z ) F ( Xz, , A) n(dx)Pmin(X, x,{z})d(x)V(X, z,A),

A

> 0.

This shows that for all zo out of a .rr-null subset of Eo, say E;, we have

V(X,z,;)

0.

Define

Then

and so by (6.35), we have

That is

Combining the above facts with condition

(a), we see that

Note that

(IAPmin (A) (dIq),V(X)1) < (Pmin (A) (dIE;), V(A) 1) 6 (Prni"(A)(dIq),F ( A ) l ) 6 (p"'"(X)(dIE;), =

1)/A

( I q ,dPmi"(X)l)/A = 0,

X

> 0, A

E 8.


254

We arrive at

P( A) = Prnitl (A) 3- Fin (A) [ d I ]u (A) Prrlin (A).

(6.36)

Now, we start to iise the condition

Since

12X F ( X ) l = XU(A)Pmin(X)l2 c(X)XU(X)l, we have XU(X)l

< c(X)-l

for X > 0 and so

0.

Besides, by Lemma 6.17, we have P ' " ( X ) d 6 1,

x > 0,

5

E E.

We have got everything we need; the next step is copy the proof of T h e e rem 3.26 starting from (3.42). 1 'Yhe Iast result of this section is a uniqiieness criterion for reversible qprocesses. Theorem 6.42. The reversible q-process is unique iff the following three conditions all hold.

(1) (q(x),~ ( zd ,y ) ) is reversible with respect: t o 7 ~ . (2) nd < 00. (3) @A = (0). Moreover, if (I) holds but (2) or (3) does not hold, then there are infinitely many reversible q-processes.

Proof: The sufficiency follows from Proposition 6.40. Clearly, condition (1) is necessary. We now assume (1). a) If .rrd = oc, or lirnA*m X.irii, = 30, then in the last step of the proof of Theorem 6.38, replacing P(A) with

wc obtain infinitely many reversible y-processes with parameter c 2 0. 17) If (3) does not hold and limh-oa AntiA < cx3. Then by Lemma 6.11, Lemma 6.29 arid Lemma 3.4, wc have xTT1LA> U for all X > 0 and

255

6.7 BASICDIRICHLET FORM Setting I E =0,

qx = ~ [ i i x I ] , X > 0, A E 8

in Propositiori 6.28, we obtain w = lim X(fiA,'lL) x4w

< x+w lim XnG,
0, we have actually obtained infinitely many reversible q-processes. W

6.7 Basic Dirichlet Form

In this and the next sections, we use the L2-theory, more precisely, the Dirichlet forms, to study the symmetrizable jump processes. Thus, d l processes considered in these sections are symmetrizable. Besides, we fix a a-finite measure T on ( E ,8')and denote by the space of all square integrable functions on ( E ,8, T ) with inner product and norm

respectively. Occa++mally, we also consider the P ( 7 r ) ( p E [l,031) space, in that case, the LP-norm is denoted by 11 . [ I p . The following result is our shrtirig point, which is even more than what we need at the moment.

Lemma 6,43. Let P (t,2 , dy) be a jump process with an excessive measure T (ie, T 2 n P ( t ) for all t 2 0). Then {P(t)}t>o:

can be extended uniquely t o P ( n ) ( p E [ l , ~ as ) )a strongly continuous contraction semigroup.

Proof: Note that

256


By Holder inequality and the excessive property, we have

=/7r(W(z)IP(1-

W , z ,{.I)).

Combining these facts with the jump condition and using the dominated convergence theorem, we obtain

This proves the strong continuity. As for the contraction, we have

From now on, we discuss only the case that p = 2. The above extension is again denoted by {P(t)}t20.According to the ordinary semigroup theory, { P ( t ) } t pdeduces an infinitesimal generator L as follows. If for some g E

L2(T ),

( P ( t ) f - f ) / t - g -+ o in ~ ~ ( 7 r ) as t j.0, then we define Lf = g . Such functions f consist of the domain of L, denoted by !3(L). It turns out that the generator L is self-adjoint (densely defined) on L2(7r). However, the domain g ( L ) is usually quite poor. For instance, even for countable E , the simple indicator I { i ) may not belong the domain of L. Thus, it is worthy to find a weaker version of the generator L. Actually, one such version is at hand, that is the weakly infinitesimal generator Instead of the strong convergence, we consider

E.

for all h E L2(7r). Such g E L2(7r)consist of the weak domain g ( L ) . But this version is not very useful since in the L2-context the weak and the strong topologies are quite closed each other.

6.7 BASICDIRICHLET FORM

257

Since the space I.’((n) is fixed, we also call the semigroup {P(t)}t>osymmetric on L2(7r).Recall that in thc finitc dimensional case, cvcry quadratic form can be represented as a sum of squares under an nppreciatcd orthonorma1 basis. The representation simplifies greatly the classification of the quadratic surfaces. As a gcneralization to the infinite dimensional case, a spectral family { E , : a E R} on L 2 ( r )is used instead of the orthonormal basis. More precisely, every self-adjoint non-negative definite operator -L has uniquely a spectral representation as follows:

-L = Furthermore, for every continuous

io lo adE,.

‘p :

P(-L) :=

[0,oo) + [0, oo),the operator

@I

P(.>dE*

is again non-negative definite. In particular, we have the strongly continuous contraction semigroup ( P ( t )= exp(tl)) and the resolvent (P(A) = ( X I L)-l : A > 0), respectively, as follows.

Note that in this general setup, the semigroup P ( t ) (resp. the resolvent P(A)) is not necessarily sub-Markovian: i.e., 0 6 P ( t ) f 1 for f E L2(7r with 0 6 f 6 1. But in our particular situation, P ( t ) is nothing but the semigroup induced by the transition function P ( t , x , d y ) and P(A) is its Laplace transform. Set

o.A remaining question is whether it is a q-process or not. The answer is affirmative. To show t.his, we need more work. Recall that the Eq. ( B A plays ) a crucial role in Ihe study of jump processes. In the present situation, an analogue is as follows:

Lemma 6.54. Let (q(s),q(x,dy)) be a symmetric q-pair and {P(A): X be a symmetric resolvent on L2(;7).Suppose that q(x, *) 0}

(6.42)

and Eq. (6.41) holds. Then ( B x ) holds, r-a.e.

Proof: Fix f = I A E L 2 ( r ) and let g E L2(q(z)r(dx))n L2(n). Then g E !3(D*)and

By (6.41), we have

6.8 REGULARITY, EXTENSION AND

265

UNIQUENESS

Clearly, this equality can be extended to for all g E Ly((.ir).RepIacing g with (A g)-lg, we get

+

(P(X).f,9 ) = (QP(X).f, (X+q)-lg> = ( W ~ ) ~ ( ~ ) f + ( ~ + 9Ld -g 1ffL7~ ( T Thus, for each A ,

P(X)IA2 0, XP(X)l < 1 and P(X)IA = f l ( X ) Y ( X ) I A (A q)-'JA

+

(6.43)

hold almost everywhere. The exceptional set depends on A and A. Since ( E ,8 ) is separable, we can choose an exceptional set depending on X only. On the other hand, the strongly continuous means that ( P ( X ) )determined by a dense set of X > 0. By using this arid (6.42), the exceptional set can be chosen so that it is also indeprndcnt of X > 0. Now, dmotc the r-null set by N . Then (6.43) holds for all 2 $ N, X > 0 and A E 8 , II

Theorem 6.55. Suppose that (E,S) is locally compact and (6,421 holds. Then ( D * ,9 ( D * ) )determines uniquely a symmetric q-process on L'(K). Proof: By the construction of resolvent (P*(X))from the Dirichlet form (cf. Fukusbirna (1980), Theorem 1.3.1), we have

P"L"4)

c 9(D*),

D*

D*(P*(X)f, 9) + W * ( X ) f9 ,) = ( f d , f E L2(7rr7),g E 9 ( D * ) .

So the proof of Lemma 6.54 tells us

P*( X) IA ( Z2) 0, XP*(X)l(s) < 1 and p * ( A ) I A ( z )= n ( A ) P * ( X ) I A ( x ) (A q ( T ) ) - ' I A ( x ) , 2 $2 N,X > 0 , A E 8,

+ +

(6.44)

where N is a .rr-null set independent of X and A. Letting X ---f 00 in the equation in (6.44), we see that the jump condition holds for all II: $ AT. Using the equation again, it is easy to check that q-condition also holds for all z $ N. Finally, the local compactness enables us to find a kernel P(X,Z, d y ) SO that P(X,. , A ) = P(X)IA. W 6.8 Regularity, Extension and Uniqueness

We have seen from the last section that the basic Dirichlet form corresponds to a symmetric g-process on L2(.rr). On the other hand, we have proved that there is thc minimal symmetric q-process, which then correIt is interesting sponds to a Dirichlet form, denoted by (Dmin,9(Dmin)). that the two Dirichlet forms constructed by different approaches are not the same in general.

266


Proposition 6.56. Let (g(x), g(x, d y ) ) be conservative and symmetric on L2(7r).Suppose that dim% < 1. 7T

(1) If dim% 0 or dim%#O but r z x = 00, then there is only one Dirichlet form: Dm'" = D". 7T (2) If dim %#0 and r z x < 00, then there are infinitely many Dirichlet forms having the same representation:

where c 2 0. Moreover, if c1 2 cz 2 0, then

Dmin(f, f) 2 DC1( f ,f ) 2 DCZ(f,f ) 2 D * ( f ,f ) for all f E g ( ~ c~a (~ D "'l )~ c g(D'2) ~ ) c g(O*). (3) In any cases, there is a t most one honest symmetric g-process.

Proof: The case of dim% 0 was treated in Proposition 6.13, We now as7r sume that dim W f O , equivalently, K Z > ~ 0. By Theorem 2.45, all g-processes

{7lx : X > 0} is a corisistent family of measures and c 2 0 is a constant. Now, by symmetry, we have

where

7 r ( z x I ~ ) q x (= B )K ( X ~ I B ) ~ ~ ( Afor ) all A,B E 8 In particular, we have qx I= 0 if 7rzx = 00 since qx is a finite measure and zx 6 1. Otherwise, we should have qx = c ( X ) n [ z x I ]where , c(X) is a constant depending on X only. Then, by using consistency of {vx : X > 0} and { n [ z x I ]: X > 0}, it follows that c(X) is actually independent of X > 0. Now, the Dirichlet form can be computed by using the formula

P ( f , f ) = lim X(f - XPC(X)f,f). x+cc

Finally, it is obvious that the symmetric q-process corresponding to Dc is honest iff c = 0. For birth-death processes, we have dim % 1. The Q-matrix is symmetrizable with respect to

, Then the symmetric q-process is unique iff the basic Dirichlet form is regular, i.e., 9(Dmin)= B ( D * ) . Equivalently, (6.49) and (6.50) hold. In this case, the unique Dirichlet form is just the basic one.

One may ask whether the basic Dirichlet form being the maximum in any cases, The answer is negative. Example 6.63. Take E = {0,1,2,.. . } and yiJ = 0 for all i 00. Then P$"(A) = & / ( A 4%). On the other hand,

+

PZj(X) :=

Pgin(x)+ (A + Y i Y) (i X + Y j )

x

# j , xi l/q, < (6.51)

is also a Q-process. The latter one is honest and syminetrizable with respect t o the measureri = l/qi. For this Q-matrix, we have 9(Dmin)= B ( D * ) c @ ( D )but 9(D4) # 9(D).

Proof: Obviously, we have

Since

270


we have Dmin= D* and 9 ( D m i n= ) 9 ( D * ) . On the other hand,

LeL f E g(D*), then

Thus B ( D * ) C g ( D ) . But 1 E g ( D ) ! 1 # g ( D * ) and so g ( D * ) # 9 ( D ) .

In contrast to the smallest extension; there is a Krein extension. It is known that the Krein extension is so large that it is sometimes even not sub-Markovian (cf, Fukushima (1980), Theorem 2.3.2). But we would like to point out here that the Krcin extension is so~netirriesnot large enough since t.here are some Dirichlet forms with larger domain than the Krein's. To show this, let us return to the above exitlnpk. In this c u e : it, is easy to check tha.t .N, = ( 0 ) (see Fukushima (1980), (2.3.5) for notat.ion), hence DrDin = D' = D K ( t h e Krein extension): and so 9 ( D K ) c 9 ( D ) but g ' ( D K )# 9(D) The main reason is that the symmetric q-process (Pij(X))given by (6.51) does not sat'isfy the equation ( B A ) . More precisely, the non-conservative part of the Martin boundary is not considered by the Krein extension. 6,9 Notes The study of symmetrizable Markov chains was begun by Kolmogorov (1936a). He considered the Markov chains with finite state space and discrete time-parameters. Now, there are several books on this subject, both for Markov chains and for general Markov processes. See E'ukushima (19801, Kelly (1979), Silverstein (1976) and Qian and Hou (1979), for instance. A lot of progress has been made in t.he past years, refer to Fukushima, Oshima and Takeda (1994)! Ma and Rijckner (1992).

6.9 NOTES

271

This chapter presents a general theory for symmetrizable jump processes. However, the results on this class of jump processes are still not as complete as those on the general jump processes. The uniqueness criterion for honest reversible q-processes remains open. This problem is certainly important in practice. The first three sections are taken from Chen (1980), where the reversible cases were studied, some generalization was due to Zheng (1981, 1983a). But the second sufficient condition of Proposition 6.13 was appeared in Qian (1978) . For Section 6.3, we refer to Yang (1981). Theorem 6.21 is due to Reuter (1957). The proof adopted here is taken from Zheng (1981). For Markov chains, the existence criterion for honest reversible Q-processes was obtained by Hou, Guo and Chen (1979) in the conservative case, and then in the non-conservative case by Chen and Zhang (1984). The generalization, including Lemma 6.33, was appeared in Chen (1986b). The uniqueness for reversible Q-processes in the conservative case was also appeared in Hou, Guo and Chen (1979). Some results for non-conservative case were obtained by Hou and Chen (1980). Theorem 6.42 was appeared in Chen (1986b), for which, the author benefited from Y. L. Dai, R. H. Ouyan and H. J. Zhang. Theorem 6.41 are taken from Chen (1991~). Theorem 6.50 is well known in the theory of Dirichlet forms (cf. Fukushima (1980)) under the local compactness hypothesis, which is removed by Bouleau and Hirsch (1986). For Markov chains, Proposition 6.59 was proved by Silverstein (1974), Proposition 6.56 was appeared in Hou and Chen (1980). The last result shows that even the general q-processes are not unique but the honest and symmetric one can still be unique. This indicates the difficulty to obtaining a criterion for honest and symmetric q-processes. The remainder of Section 6.7 and 6.8 are taken from Chen (1991~). One topic, not included in the book, in the study of jump processes is bo construct all q-processes for a given q-pair. For the symmetrizable case, refer to Chen (1982)) Chen (198613) and Zheng (1981, 1983a).

Chapter 7

Field Theory In the study of symmetrizahle Q-processes, a new question arises, i.e., for a given 9-matrix, how can wc justify it is symrnetrizsble or not. If so, haw can we find out a symmetrizing measure. Next, for a given regular syinnietriznkle Q-matrix, it is easy to know whether the corresponding Qprocess is positive recurrent or not, but it is riot so emy to understand the riull recurrence or transience of the process. These questions are discussed in this chapter. Our main tool is the field theory which will be also used in Chapters 11 and 14. 7.1 Field Theory Let E be an arbitrary set, T be an index set, and let a: T x E x E [-00,+00] satisfying

+ R :=

Hypotheses 7.1.

# 2 , 0 < a ( t ,x,2) < 00. (2) Co-zero property. a ( t , x ? Z )= 0 iff a(t,Z., x) = 0. (1) For each t E T and x, 2 E E , x

Definition 7.2. For given 5 , 2 E E , x # 5,5 is called reachable directly from x at time t , denoted by x 2, if a ( t , x,Z)> 0; 2 is called reachable , 2 ).,. . , dn)in E from z at time t , denoted by x A it, if there are d l ) d such t h a t Then, L ( t ) := (x,

z i1 x(1) i I.

*

J:(2)

4, . . . 4, .I4

t

f

5,

,dn),2 ) is called a path from x to i?

at time t.

Let d ( t ):. { a ( t ,x.2 ) : x,3: E E>(t E T).Denote by 2 ( t ) the collection of patlls of 0 iff so does for all t > 0. Hence 2 ( t ) , D ( t ) and E,(t) are all independent of t > 0. Definition 7.9. Let

Pij(t) = 0 u Pji(t)= 0

for all

t 2 0, i , j E E.

Then the field corresponding t o P ( t ) is called a chain field, also denoted by

P(t>.

7 FIELD THEORY

276

Proposition 7.10. A chain field P ( t ) is a potential field (equivalently, P ( t ) is symmetrizable) iff it is conservative field. Then, all potential functions are given by

v&) = -log A,(l)

+ ce(tf,

i E Ef,e

E

D,

where (Xi(1)) is defined by (7.4) and c e ( l E D ) is an arbitrary function on T .

Proof: Without loss of generality, assume that P ( t ) is irreducible. Choose a reference point A and reference path and define ?ai(t), ?ia(t), &(t) by (7.2) and (7.4) respectively. Since P ( t ) is a potential field, (&(t))defined by (7.4) is a symmetrizing function: Ai(t)Pij(t)= Xj(t)Pji(t)

for all t 2 0, i and j.

Then, by CK-equation and induction, we obtain

Xi(t)Pij(rnt)= Aj(t)Pji(mt)

for all m 3 1.

Noting that AA(t) f 1 and that P ( t ) is irreducible, we have

and so

xi(m/n>= A ~ ( I=>F A ~ ( I ) / & ( I )

for all rn, n.

Combining these two facts, it follows that

A&) Next, for any potential

= A&))

t E T.

V ( t )= { G ( t ) } of P ( t ) ,we have

7.2 Lattice Field

In the last section, the study on the symmetrizability is reduced to the potentiality of a field. Then, due to the path-independence, to justify whether a given field d ( t ) is a potential field or not, it suffices to check the work done by the field along the minimal closed paths. This idea should be clear intuitively and it is actually the key point why such an elementary tool being helpful. The main purpose of this section is to illustrate this idea, further applications will be presented in the next part (Chapters 11 and 14).

7.2 LATTICE FIELD

277

Lemma 7.11. Let S be a finite set and denote by E = { O , l } S product space of ( 0 , l ) . Define

the usual

Take

Then d = ( u ( x , 2 ) : x,2 E E ) is a potential field iff the following quadrilateral condition holds:

where

Proof: For each u S, define a transform (u): E E as follows:

x[(u) = ux, Then for every path L =

(2 =

x

E , u f s.

E

x(*),dr), a

4

, d")), we have

In this proof, we also use x c ( u l )...C(un> to denote the path L , where z denotes the starting point aiid ((u,) denotes the section di-') 3 ~ ( 2 ) . By the path-independence, we have

Using these notations, we may rewrite (7.5) as

Clearly w(z((u) .. ' ((?in>) =0

for any closed path L

= (x = do), x(I),

- . - da)= x).

(7.9)

7 FIELD THEORY

278

Since every closed path consists of an even number of edges, n = 2m for some positive integer m. We use induction on m. When m = 1, then u1 = u2 and hence (7.9) follows from (7.8). Suppose that (7.9) holds for n = 2(m - 1). Then for n = 2m, there is a k such that 2 k n,uk = u1 and ue # u1 for 2 L < k - 1. Applying (7.6) and (7.7), we obtain

<
and ( E ,P:3,x i ) which are equivalent in the sense that C-l aZJ/a&& C , then they are transient or recurrent simultaneously.

0. (7.19)

Cdi a

7.4 TRANSIENCE OF SYMMETRIZABLE MARKOVCHAINS Since

xi IdiJ
0 or = 0. i) a,i > 0. In this case, we claim that c,i = 0 and hence vi = 0. Otherwise, using k=a, j=iork=i, j = a Ekj

=

{ O7

Ckj,

otherwise

instead of (cij), we would obtain a different C E H ( F ) , anti-symmetric and having a smaller norm llCll < [[ cF 11. This is impossible.

7.4 TRANSIENCE OF SYMMETRIZABLE MARKOV CHAINS

295

ii) aai = 0. Define tiai

= hia = c

> 0,

6kj

= akj

in other cases,

and the correspondent l?, fi, g ( F ) and EF. It is easy to check that l l E ~ l l< J J c F and J J so 2;, = cF by the uniqueness of c F . Therefore, by i), we still have cai = C,i = 0 and furthermore vi = 0. e ) Let T = inf{n 2 0 : X , 4 F * } and set rO0

l

o

o

In this paragraph, we prove that (7.24) (7.25) First, we have

Next, by (7.22) and (7.23), we have

3

Hence

Furthermore,

jEF'

296

7 FIELD THEORY

This shows that (w - V ) ( X n A ~is )a Pi-martingale. But w = V = 0 on E \ F', so we have v = V. To prove (7.25)) fix k E F* and set d i j = a i j ( g i k - g j ~ ) Since .

it follows that

On the other hand, noting that

dij

+ d j i = 0, we obtain

which is just what we required. f ) Finally, we use (7.24) and (7.25) to prove (7.21). First, by (7.251, we have

Next, by (7.22) and (7.23)) we have

From this and (7.24), it follows that

which is what we required.

I

Example 7.24. The simple random walk in three-dimension is transient.

7.4 TRANSIENCE OF SYMMETRIZABLE MAR.KOV CHAINS

297

Proof: Take

and set

Ni = { j E z3: li - j l = 1). Clearly, ( u ; ~satisfies ) condition (1) of Theorem 7.23. Since for j E Ni,we have

Condition (3) holds. We now check condition (2). Let M be the unit cube centered at the origin and with length 2m of edges. Then

i€ M j € Ni

i € d M jENi\M

On the other hand, let (el, e2, e3) be the coordinate bases in R3 and set

z [ f+ ( ~+ 3

g(E) =

j(.

Eei)

- Eei) - :!f(x)],

i=l

where f is harmonic in a neighbor of z. Then, by the Taylor expansion formula, 1 g(E) = -(Af)E2 O(E4)= 0 ( E 4 ) . 2

+

Using these two estimates and the fact that A(l/lxl) = 0 (x # 0), it is easy to justify condition (2) of Theorem 7.23. Alternative proof: The proof consists of two steps. Firstly, construct a tree T3 with finite resistance. We begin the construction with drawing 3 branches from the root, denoted by t l l , t 1 2 and t 1 3 respectively. Each branch has 1 ohm. At the n-th step, each branch t n - l , k (1 6 k < 3n-1) constructed at the ( n - 1)-step splits into 3 branches and each of the new branch has 2n- 1 ohm. Suppose that there are no overlap. By the equivalence principle, it is easy to see that

n=O

Secondly, embed the tree T 3 into Z3. Put the root of the tree at the origin. At each step, let the rays go ahead along the tree coordinate directions.

3 FIELDTHEORY

298

+

Whenever a ray intersect the plane x+ y z = 2n - 1 for some n, it splits into three rays. If two rays pass though each other, we simply let them “bounce” (disconnect). Since this tree is a subgraph of Z3 with finite resistance, we have proved the transience of the simple random walk in Z3. 7.5 Random Walk on Lattice Fractals

To illustrate some applications of thc results obtained in the last two sections, we discuss in this section the simple random walk on two lattice fractals, Sierpinski gasket and Sierpinski carpet. As an example of the construction of general lattice fractals, we explain how to construct the lattice Sierpinski gasket. To do so, let us recall the usual construction of Sierpinski gasket in Rd(d 2 2). Starting from d 1 points ( 0 = q,,x l ?. - , xd} with length Ix,- xj I = 2. The d+ 1 vertices with d(d l ) / 2 (adjacent) edges make a d-dimensional polyhedron, denoted by H y ) . Next, let xzJbe the midpoint of the line jointing x2 and x3, write x,, = 2%:and let be the graph with vertices {xZ3: 0 6 i, j d + 1) and edges between x , and ~ xtk, x3e, 0 k, t! d, k # j , !# i. Clearly, H i d )consists of d 1 d-dimensional polyhedrons, with each pair sharing exactly onc vertex. , }nro. The Repeating this procedure, we obtain a decreasing sequence { H (4

+

+

+

0.

0 such that 1 for every E E [-6, oo),and so

As a function of

E,

F f has a minimum 0 at d -FFf(O) = dE

s

E

+ ~f E b&+

= 0: therefore

62fdp = 0.

This implies that p ( q f ) = p ( Q f ) for each f E b€‘+ and hence for each f E rd?+, which is then equivalent to the claim that p is a stationary distribution of ( P ( t ) }(Theorem 4.17).

8 LARGEDEVIATIONS

314

Conversely, assume that p is a stationary distribution of { P ( t ) } .F'rom the last part of the proof of Donsker and Varadhan (1975) [Lemma 2.51, it follows that

On the other hand, the proof of Donsker and Varadhan (1975) [Lemma 3.11 also works in our case, hence we have

Combining these two facts together, we obtain the required assertion. Now, we would like to give a clearer expression for I ( p ) . For this we need a hypothesis.

Hypothesis 8.10 ( H I ) . There exist a 0-finite measure X and an 6' x 8measurable function q(z,y) such that q ( z , d y ) = q(z,y)X(dy)

for all

z,y E E .

Theorem 8.11. Under ( H I ) ,for each p E 9 ( q ) satisfying p 0, X-a.e. on E+. Again, for

I/

Under

0" made in the last step, we prove that there is a probability measure a such that a 0 on E and aq < 00. For this, choose a sequence of disjoint sets {&}? C € such that 0 < A(&) < 00 and B, = E . Set

Cf"

Then f E d oand

+ f(~)-']-',

Now, if we take &(z)= [q(z)

a ( A )=

then the measure a defined by

A

E

8

< aq
O on E (i.e,, E = Ed-). (2) ( E , @ )is locally compact, ~ ( x ) is locally bounded and (3) ( H I ) and ( H z )hold. Then, for every p

0 and the assumption in (2): we have

Set

Because f is increasing, the assertion for case ( 2 ) follows from Theorem 8.28 and

c) Choose 0

< d < D < 00 and define

8.4 NOTES Then fo = d,

fi

tD

as i

329

co. Finally

8.4 Notes

For Section 8.1, refer to Stroock (1984). Theorem 8.17 is taken from Chen (1990a). The remainder of Sections 8.2 and 8.3 are taken from Chen and Lu (1990a,b) with some improvements. See Jain (1990) for recent progrcss on the lower estimates. Two recent books on large deviations are Demba and Zeitouni (1993), Deuschel and Stroock (1989).

Chapter 9

Spectral Gap In this chapter, we study the exponential L2-convergence. We prove in Section 9.1 that the exponential convergence rate for a Markov process can be described by the L2-spectral gap of its generator or Dirichlet form. In the reversible case, we prove an equivalence, or even coincidence, of this convergence and the exponentially ergodic convergence. Two main results for estimating the spectral gap by couplings and two approximating procedure for jump processes are introduced in Section 9.2. Section 9.3 is devoted to birth-death processes, for which we have a complete solution to the topic studied in this chapter. In the last two sections, another powerful tool-a —a generalized Cheeger’s method is studied. It works in a very general setup but the resulting estimates are usually less explicit. 9.1 G e n e r a l Case: a n Equivalence Let {P(t)}t20 be a positive, strongly continuous, Markovian contraction semigroup (i.e., P ( t ) l = 1 for all t 3 0) on L 2 ( n )with stationary distribution T,not necessarily symmetric. Again, denote by L and .9(L), respectively, the infinitesimal generator and its domain induced by {P(t)}t2o. We say that P ( t ) converges exponentially i n the L2(n)-norm [I 11 if there is a positive E so that for all f E L 2 ( r ) ,

On the other hand, since 1 E .9(L) and L1 = 0, the vector 1is an eigenvector of L with eigenvalue 0. One may seek for the next-to-largest eigenvalue of the self-adjoint (resp. the symmetrized part of a non-self-adjoint) generator L. That is, to seek for the infimum of the spectrum of -L restricted to the orthogonal complement space of 1: {f E L2(i.) : r ( f )= 0} n B ( L ) . This leads us to define the s p e c t r a l gap of L: gap(L) = inf {

-

( L f , f ) : f E g ( L ) , r(f)= 0 and llfll

= 1).

(9.2)

Our first step is to show that (9.1) and (9.2) are closely linked. To do so, let D ( f ) denote the limit

330

9.1 GENERAL CASE:

AN

EQUIVALENCE

331

provided the limit exists, here the equality is due to the fact that T is a stationary distribution of P ( t ) . Such functions f E L2(.rr)with D ( f ) < 00 consist of the domain a ( D ) of D. Clearly, 9 ( L ) C Q ( D ) . In the case of P ( t ) being symmetric on L'(T), D ( f ) coincides with D ( f ,f) introduced in Section 6.7. This explains why we choose the notations D(f) and , 9 ( D ) . Next, define gap(D) = inf { D ( f ): f E g ( D ) , ~ ( f=) 0 and

llfll

= l}.

By definition, it is clear that the condition f E a ( D ) in the last line becomes unnecessary in the symmetric case. Because we are working in the regular case, which is natural since the process is assumed to be ergodic. Thus, it would be much better if we could use gap(D) instead of gap(L). Finally, define u(t) = -sup { log IlP(t)fll : ~ ( f=) 0 and l l f l l = l}. Since

( I P (+~s)fll 6 e-'(t)\\P(s)fII 6

e-'(t)-u(s)

Ilf 11,

by the contractivity and semigroup properties, it follows that u(-)is superadditive and u(0) = 0. Hence, the limit

u := lim - = inf tJ0

t

t>O

4

-

(9.3)

t

is well defined.

Theorem 9.1. We have u = gap(D) = gap(L). Proof: Clearly, gap(D) ,< gap(L) since D ( f ) = (-Lf,f) on 9 ( L ) . To prove u >, gap(L), simply use the fact: d = 2 ( P ( t ) f , L P ( t ) f ) G -2gap(L)IIP(t)fl12, -IIP(t)fl12 dt

f E W )7 , 0 ) = 0 and llfll = 1, and the denseness of 9 ( L ) in L 2 ( r ) . Finally, let f E 9 ( D ) with and llfll = 1, then 1 D ( f ) = lim -(f tL0 t

-~

~ ( f=) 0

1 ( t ) ff) , 2 lim -(I - ePut) = u. tl0 t

Hence gap(D) 2 u. I At the moment, except the fact 9 ( L ) c 9(D), the knowledge of 9 ( D ) is quite limited. However, it will be clear later, whenever we have a little more information about the generator, the domain 9 ( D ) is actually manageable. The next obvious facts will be helpful for our further study.

9 SPECTRALGAP

332

Lemma 9.2.

-

(1) D ( f ) 3 0 for all f E .9(D). 9 ( D ) is dense i n L2(?r). (2) fEQ(D)*g:=cf + d ~ g ( Da)n d D ( g ) = c 2 D ( f )for all c, d c R . (3) fl 9 E .9(D) a n d f + 9 D ( f + 9 ) G W ( f+)&?>I.

.w)

Bcfore moving further, we would like to mention that the non-symmetric o given as case can be often reduced to the symmetric one. Let { P ( i ) } ~be above. Define a bilinear form D ( f ,9) on Q ( L ) by D ( f , 9) : - (Lf, 9 ) . Next, define a dual (or adjoint,) 6 of D as follows: 6(f,g) = D ( g , j ) , f , g E g ( L ) and set = (D+ 6) /2. Then -is is a syrnmctric form and so we have a norm 2 -f'j I( 1 0: Ilfli, - (f) llf1I2. Naturally, one can extend the domain of D to ~ ~ by g . Therefore, we the completion of 9 ( L ) with respect to 11. ( 1 denoted can define

+

gap(l;)) = inf

{o(f) : f E 3, ~ ( f=) 0 and l l f l l

= l}.

Noting that the last inequality D ( f ) 3 CT in the proof of Theorem 9.1 holds first for f E Q ( L ) and then for f E g , moreover, since

-

1

D ( f )=- Z(D(f)

+ m)) = Wf),

f E 9,

the proof of Theorem 9.1 shows lhat o = gap@) = gap(L). Hence we have proved the following result. Corollary 9.3.

(T

= gap(D) = gap())

= gap(L).

One often uses 9, rather than 9 ( D ) defined above, as the domain of D. This is because in general the bilinear form may not be regular. To use the semi-group corresponding to (0,Q), the theory of non-symmetric Dirichlet, forms should be helpful. Refer to Ma and Rockner (1992) (especially, Theorems 2.15 arid 2.18) for details. We will return to this problem more cnrcfully (see Theorem 9.12 below)+ mlPnext, coinparison result is useful t o compare the spectral gap of a complex process wit,h a simpler one. It shows, for hlarkov chains for instance, that a local pcrturbation does not interfere the L2-exponential convergence. The proof of thc thcorcm is straightforward and hence is omitted. I -

Theorem 9 . 4 (Comparison Theorem). Let ( D ,9 ( D ) ) a n d (6,9( be two forms with g ( D ) c defined on L2(,rr)a n d L2(?) respectively. Suppose t h a t there exist constants A , B and C such t h a t

9(D),

+ CD(f), ~ . P ( ~ ) / [ A B+ c ~ ~ P ( E ) ]

O ( f )6 A W ) , hen gap(D)

Varn(f>6 BVar;(f)

f

E .9t(D)*

9.1GENERAL CASE:A N EQUIVALENCE

333

For tensor product P ( t ) of { P k ( t ) }with generator Lk (it corresponds to a Markov process with independent components), the spectral gap takes a simple form. Denote by L the generator of P ( t ) on the product space with product measure

7r

=

nk

Tk.

Theorem 9.5 (Additive Theorem). gap(L) = infk gap(Lk). Proof: Choosing the functions to be depending only on the kth coordinate in the definition, it follows that gap(L) < gap(Lk). To prove the inverse assertion, it suffices to consider two components. Then, by induction the assertion holds for finitely many components and a limiting procedure gives it for infinitely marly components, since the functions depending on finitely many components are denso in Lz(x).~ e ft satisfy n(f)= o and l l f l l = 1. Exprcss f as f(x,y> = h ( z ,y) h,l (x) hz(y), whew J h,(.,y)dn, = 0 for a,e. y, J h,(s,.)&r2 = 0 for a.e. z, hldxr = 0 and / h2dn2 = 0. Then h, h,l and hz fire orthogonal in L 2 ( , r r ) an'd so are P ( t ) h ,Pl(t)h,l and F$(t)hz.The conclusion now follows from

+

+

IIW)fl12 = lIP(t)hl12+ IIP(tP1112 + IIP(t)h,2II2 = IlP(t)h.1l2+ l l ~ l ( ~112) + h Ilf3(t)hzIl2 and IlP(t)hll = IIPl(t)P2(t)h,ll6 e-l(gap(Ll)+gap(Lz))((hll. Now, we study the spectral gap for jump processes. staled, we consder only the regular j u m p processes.

Unless otherwise

As we have seen from Lemma 6.43, for a given q-process P ( t ,z, dy) with stationary distribution x,P ( t ) f := J P ( t ,-,dy)f(y), f E bd? can be extended to L2(T)uniquely as a non-negative, strongly continuous contraction semigroup. Thus, the above discussions are applicable to the present situation. , dy)), as we did befbre, define Next, for a given q-pair ( q ( x ) q(x, Xq(&

dy) = n(dz)q(z,dY),

For f E g ( D ) , by Fatou's lemma and Theorem 1.14 (4), it follows that 3o

> ~ ( f=)lim 1/ ' x ( c i z ) ( ~ ( t )-[ ff(z>lz)(z>~ * ( f ) . 110 2t

Therefore, we have 9 ( D ) C 9 ( D * ) . Note that here 7rq may not be symmetric. Choose and fix a sequence {En} c d? such that En 1 E and SUP,^^,^ q(x) = mpz< 00, n 2 1. Assume that mTL = n for simplicity. Define

X

=

md&={g:=cf+d:

{f

E

Lo3(x): {f # 0)

EX, c , d ~ R } .

c

some En}

9 SPECTRAL GAP

334 Lemma 9.6. XL c a ( D ) .

Proof: By the regularity of q-pair, we have

On the other hand, since

where

it follows that

Note that 7r is an invariant probability of { P ( t ) } t 2 0 ,7 r ( q f 2 ) = .rr(Qf2). Combining the above facts together, we arrive at

for f E X . Now, the conclusion follows from Lemma 9.2. I This simple result already enables us to get an upper bound for gap(D).

Lemma 9.7. We have 1 gap(D) 6 2 inf{k(K) :

o < 7r(K)< 1, I K

EX},

(9.5)

where

In particular, for Markov chains, gap(D) 6 infk q k / ( 1 - nk).

+

Proof: For 1, E A' with 0 < r ( K ) < 1, set f = CIK d. Choose c and d E R such that ~ ( f = ) 0 and llfll = 1. The first assertion follows from Lemma 9.6 by computing D * ( f ) . Then the second assertion follows by taking K = { k } plus the invariance of 7r.

9.1 GENERAL CASE: AN EQUIVALENCE

Lemma 9.8. Let

Choose and define

335

such that

Proof: It suffices to prove the first asserbion. Clearly, gn E g ( 0 ) and gn ---f g in L1(7r). Hence sup, .lr(gn) < 0s. Next, since

n ( s 3 z 7r(!121E,,) we have limn

+

,oo~ ( g , ) ~ / r ( g ; )=

..(.g2)

as 72

= 00

+ 00,

0. Therefore, by definition,

Definition 9.9. We call %? c 9 ( D * ) a core of D * , if '3 is dense in .9(D*) with respect t o the norm 11 lip: l l f l l ~ * = D * ( f ) llf1I2.

+

Lemma 9.10. If n(q) < 00, then Xj is a core of D*,

Proof: We need only to show that X is a core of D*. Let f E g ( D * ) , Replacing J with JrrL= (-m) V ( J A m) if necessary, we nay assume that f is bounded. Next, set fn = JIB,. Then

Theorem 9.11. If 3' is a core of D*,then gap(D) = inf{D*(f) : n(f)= 0 and = inf{D*(f) :

f E

IlfJl

= l}

XL, n(f)= 0 and l l f l l

= l}.

Proof; Since g ( 0 )- 9 ( D * ) ,D and D* coincide on 9 ( D ) ,by Theorem 9.1, it sufices to show that for each f E g(L)*) wilh n(1)= 0 and l l f l l = 1, there

9 SPECTRAL GAP

336

exists (fn} c Xi with 7i-(fm) = 0 and I I f n I I = 1 such that D * ( f n )--t D * ( f ) . Because X is a core of D*, by Lemma 9.2, we can find a sequence { f n } c ;U, with 7 r ( f n > = 0 and /lfnll = 1 such that fn -+ f in Ij . IIp-iiorm. Note that D * ( f n- f ) -+ 0 implies that D * ( f n )is bounded in n. On the other hand, by Schwarz inequality,

and so D * ( f r L-+ ) D * ( f ) as 71 oc. The assertion now foIlows. I We now specify the above result to Markov chains. Let Q = (qiJ : i, j E E) be an irreducible regular Q-matrix. Suppose that the Q-process ( P Z J ( t has )) a stationary distribution T . Define --j

It is easy to check t,hat, (&) is a conservative Q-matrix with stationary measure ( x i ) : so is (f&j). By Theorem 1.70, the Q-matrix ( & j ) is regular. Moreover, ( i j i j ) is reversible with respective to the same probability measure (7ri), and so is t,he corresponding minimal Q-process.

Theorem 9.12. Let Q = ( q z j : i , j E E ) be an irreducible regular Q-matrix. Suppose that the Q-process (Pij(t))has a stationary distribution x and the Q-matrix ( 4 ; j ) defined above is regular. Then

fi)'

:

f E XI,,~ ( f=) 0 and

1

=1 .

Proof: By Theorem 9.11, we need only to prove that X is a core of D*. From Corollary 6.62, we. know that (gtJ) is regular iff X is a core of D.But

We claim that the cores of 75 and D* are the same. The above result is a special case of Corollary 9.3, which says that gap(L) = gap((L z ) / 2 ) , where 2 is the adjoint operator of L. Before moving further, uFe now consider two irreversible Markov chains.

+

337

9.1 GENERAL CASE: AN EQUIVALENCE

Example 9.13. Take -1/2

1/2

0

Q=(! f f l i / 4 . Thus, in

Then gap(Q) = 1 but the eigenvalues of Q are 0, -5/4 general, Spec( (Q

+ 0)/2)

Example 9.14. Let

# Re. Spec(Q).

qk = qk,k-I

and qij = 0 for all other j

# i.

=

I, qok

Then gap(Q)

, (1 - f i ) - 2 &p}

o.

Definition 9.17. We say that g is an eigenfuction of L corresponding t o X in weak sense if g satisfies the eigen-equation Lg = -Xg pointwise. Note that the eigenfunction defined above may not belong to L2(7r).

Theorem 9.18. Let ( E , p ) be a metric space and let { X t } t > be ~ a reversible Markov process with operator L. Denote by g the eigenfunction corresponding t o XI := gap(L) in weak sense. Next, let ( X t , x ) be the coupled process, starting from ( I C , ~ ) ,with coupling operator and let y : E x E --+ [O,m) satisfy y(z,y) = 0 iff IC = y. Suppose that

z

(1) 9 E %J(L)l

(2) Y E %J(Z), (3) LY(Z,Y) 6 -QY(.,Y) for all 2 # Y, (4) g is Lipschitz with respect t o y in the sense that

Then, we have gap(L) = XI

2 a.

Proof: By conditions (2) and (3), we have

Next, by condition (1) and the definition of g , rt

rt

is a P"-martingale with respect to the natural flow of a-algebras { 9 t } t > 0 . In particular, giz) = E"[ g ( X t ) XI g ( X s ) d s ] .Because of the coupling property,

+

9 SPECTRAL GAP

342

Thus, we obtain

Define the coupling time T = inf {t>0: Xt = Yt}. Thenn

Noting that g is not a constant, we have cg,y = 0. Divifding both sides by Noting that g is not a constant, we have cg,y = 0. Divifding both sides by cg,y, we obtain

for all t. This implies that A 1 2 a as required. H Condition (3) in Theorem 9.18 is essential, for which one needs l o cliovse not. only a good coupling but; also a good distance. This leads to the study on optimal couplings discussed ia Chapter 5. The other conditions in Theorem 9.18 can often be relaxed or avoided by using a localizing procedure (cf. Theorems 9.20 or 9.22 below). The next weaker result is useful! it is actually relat,ed to the strong ergodicity of the process. Theorem 9.19. Let { X t } t 2 0 , L , X I and 9 be the same as in the last theorem. Suppose that (1) 9 E

%&)I

( 2 ) s'lP,+y

19(4 - 9(?)1/)1 < m.

-

Then for every coupling I W Y , we have gap(L) = XI 2 ( supzpy@,")-I,

Proof: Set $(x,y) = g(z) - g(y). By the martingale formulation as we did in the last proof, we have

9.2 COUPLING

AND DIS'rANCE

METHOD

343

Hence

-

sup,.,IE"~YT
=

c

2 I, we have

1

A - d o , A1 + 2[ d o ,0 ) - P ( 0 : j)l.

bk M k ,

k> 1

< 0.

Thus, it suffices to shcw that J := Ck21 b k [ p ( k , j )- p ( 0 , j ) l j = 1, J blgl f z k 2 2 b k [ p ( k ,1) - p(0, I)] = h g l -k z k 2 2 bkgk j 2 2, J =

bklbk k>l

$1

- (.Qj -

291)l

When 0. When

348

9 SPECTRALGAP

since j 2 2 and qo > b l . This example is also called star model since its graphic structure. There is a center at 0, from which there exists only one bond to each k. Thus, the geometric distance can be defined a s follows. First, we have p(0,k). Then, for i, j # 0, i # j , p ( i , j ) = p ( i , 0) p ( 0 , j). Based on this structure, a “path method” goes as follows. Replace q k o = 1/2 by a more general qkO = q k ( k 2 1). By Theorems 2.40 or 2.47, it is easy to check that the process is always unique. Clearly, the stationary distribution is ni = p i / Z , i 2 0 where PO = 1, pi = bi/qi for i 2 1 and 2 = c k 2 1 p k . For every f E L2(7r) with n(f)= 0 and l l f l l = 1, we have

+

Thus, gap(D) 2 2 - l i n f ~ l q i . When qi = 1/2, we return to the original model but the estimate of the last method is not sharp. Clearly, the problem does not come from the second inequality but the first one, which is based on the graphic distance. 9.3 Birt h-Deat h Processes

In this section, we study mainly the spectral gap for birth-death processes. This is crucial since the birth-death processes are often used as a tool to compare with some general (even infinite dimensional) process. For instance, the lower bounds obtained in this section is available for general reversible Markov chains on Z+ with qi,i+l > 0 and qi,i-1 > 0, provided the deduced birth-death Q-matrix is regular. Recall that for a positive recurrent birth-death process with birth rate bi > 0 (i 2 0) and death rate ai > 0 (i 2 l),the reversible measure (xi)is the following:

9.3 BIRTH-DEATH PROCESSES

349

Let Y be the set of all positive sequences (vi : i 3 0) and define

R&)

+ bi - aJvi-1 - bZ+lVi = Aa(i)- Ab(i)+ ai (1 + b i + l ( l - 4, = ai+1

a0

where A a ( i ) = ai+l

:=o,

v-1 := 1,

- ai,Ab(i)= bi+l

- bi.

i 3 0,

(9.12)

Next, let

W = ( { W i } i > o : wi is strictly increasing in i and ~ ( w2 )0},

-

W = { {wi}i>O : there exists k : 1 < k

< 00 so that wi = W i A k , w is

strictly increasing in [0, k] and n(w)= 0},

(9.13) Note that @is simply a modification of W . Hence, only two notations W and I ( w ) are essential here. The main results can be collected a s follows.

Theorem 9.25. Consider the ergodic birth-death process as above. We have the following conclusions.

(I) Dzflerence form of the variational formula f i r the lower bound (9.14)

(2) Summation f o r m of the variational formula for the lower bound: (9.15) (3) Summation f o r m of the variational formula for the upper bound gap(D) = inf sup I ~ ( w ) - ’ . w€?w i>o

(9.16)

Moreover, the supremum in (9.14) and (9.15), and the infimum in (9.16) can all be attained. 1 (4) Explicit bounds and explicit criterion: Define ‘po =0, ‘pi=

xjGip1

2 1, Q ~ ( Y ) = C p ~ c ~ ~ for? ~p-2 L j1,Q: = [ , i + k ] ~ ~ > ~ S(y) = supn2l Qn(r),6 = 6(1) and 6’= SUP,^^ Cyl: Q>vjn),where ~ ( is~ a probability 1 measure on {0,1,.. , k - l} with density vjk)= ( p $ ~ j > - ’ / Z ((and ~ ) Z(’”)is the normalizing constant). Then we have 6 < 6’< 26 and for2

+

In particular, gap(D) > 0 iff 6

< 00.

9 SPECTRAL GAP

350

In view of (9.12) and (9.13), one sees that the difference form (9.14) and the summation form (9.15) are quite different but there is indeed a correspondence between (iii) and (wi> (Idernma9.30). As we will see soon each of them has its own advantage. By exchanging “sup” and “inf”, we obtain (9.16) from (9.15), ignoring the difference of W and %? Thus, (9.15) and (9.16) are dual one to the other. All these formulas are completely different from the classical variational formula: gap(D) = inf {D(f) : ~ ( f=) 0 and

Ilfll

(9.17)

= l}.

Because of the uniqueness assumpt.ion of the process and Corollary 6.62, we do not need the condition ‘’f E B ( D ) ” in the last formula. Clearly, for each t,est sequence ( u i ) , from (9.14) we obtain a lower bound of gap(D). In particular, according to a classification of the test sequence, we obtain t.he following result.

Corollary 9.26. Define Au(i) = ai+l

+

(1) Let vi = ~ [ ll/(i

- ai,a0 := 0,

Ab(i) = 1

3 - bi. ~

+ c)], r 2 1 , c E 10,m]. Then

Aa.(i)- Ab(i)

+1 [ai - bi+i]

if r = 1.

Z+C

Then

(2)

A u ( i ) - Ab(i) - cI

ai

i - 1+ c2 -.” c1

2

-1- ca

In particular, w e have Examples 9.27. The exact gaps for nine examples are listed in Table 9.1 in the next page. Here, for the sixth example, we need a restriction: l / k < a/b k/(k - 1 ) 2 ( k 2 2). The test sequence used in the last example is ui = - 1)/4 for even i and ‘ui = (*+ 1)/4 for odd i .

i

gaP(D) 2 (.A - d q ) 2 / ( c ~ c 22) 1/(4C?c2). p ~ j6 clpiai and C3>i p j a j 6 c2piui for all i 2 1, then

(A

gap(D) 2 - dG)2/c1 3 1/(4cIc2). (4) If ai = bi and i Y C3.>% . l / a j < c(y) for some y 2 1 and all i 3 1, then gap(D) 3 max {(4c(l))-', b:-'c(y)-l(l - y-')}. Parts (1)-(3) of the corollary are all sharp, but not part (4)) for the constant rates b, = b and ai = a. Conversely, part (4) is sharp for ai = bi = i2(i 2 1) but parts (1)-(3) fail. The proof of the corollary is delayed for a while. The simplest example to show the power of Theorem 9.25 (4) is the following one. Example 9.29. Let ai = bi = i Y for all .i 3 1 and bo is ergodic i f f y > 1 and gap(D) > 0 i f f y 3 2.

> 0.

Then the process

Partial proof of Theorem 9.25: Step 1. First, we prove in the next lemma the equivalence of (9.14) and (9.15). Lemma 9.30. When vi = ui+l/ui for positive (ui), rewrite

Ri(v)as Ri(u).

(1) Given w E W with ~ ( w=) 0, set

Then we have &(u) = li(w)-lfor all i 2 0. (2) Given positive (ui : i 2 0) such that inf+oRi(u)

limn+m bnpnttn

>

0. Then c :=

< 00. Set

wi = aiui-1 - biui + c / ( Z- l), i >, 0, where u-1 = 1. Then we have wi+' > wi for all i 2 0, w E L ' ( n ) , ~ ( w= ) c / Z ( Z - 1) 3 0 , C i 2 1 p i w i > 0 and Ii(w)-'>, Ri(u) for all i 2 0.

9 . 3 BIRTH-DEATH PROCESSES

353

Proof: a) It follows from the definition of (ui) that we obtain Since

Since ~ ( w=)0, we have

Thus On the other hand, by (9.18), we have

We have thus proved part (1)of the lemma. b) For part (2), we first prove the existence of the limit limn-+mbnpnun. To do so, take wi = aiui-1 - biu; bouo (i 2 0 ) for a moment. Note that

+

(wi+l - w i ) / u i

( a i + l ~i bi+lUi+l - aiui-1 > 0, i > 0.

+ biui)/ui (9.19)

= Ri(U)

We have wi < wi+l for all i 2 0. On the other hand, since wo = 0, we see that w1 > 0 and so wi > 0 for all i 2 1. Thus n

n

n

kj-lpj-luj-1 0I + ~ A , A ~ ~ , A C . A ( ~ , ~ ) I ( ~ -- I (Af I) ~( Z) () y ) l . Then

362

9 SPECTRALGAP

When J i s symmetric, by assumption, we have

When J is not symmetric, we have

o. Then, by symmetry of .rr(dz)p(t,5 , d y ) (Theorem 6.7), we have

i(f2/g,

Here, we have used the fact that a + l / a 2 2 for all a > 0. Hence j.Ptj) ( f , f - P t f ) . From the spectral theory, it is standard that the righthand side increases to D ( f ) as t 0 (cf. Section 6.7). Thus, it remains to show that the left-hand side converges to ( f 2 / l j , -06) as t L O . This can be done by using the dominated convergence theorem and the following facts:

we have a quadratic form D(*): defined by (9.33). For which, Xi") and are well defincd. Next, define an isoperimetric (or Cheeger's) constants as follows

Ar)

Theorem 9.43. For t h e quadratic form given by (9.33), under (9.34) and (9.35), we have

9.5 CHEEGER'S APPROACH A N D ISOPERIMETRIC CONSTANTS

369

Proof: a) First, we express h(") by the following functional form

By setting f = I A / K ( A )one , returns to the original set form of h(*). For the reverse assertion, simply consider the set A , = {f > y} for y 2 0. The proof is also not difficult:

's

tf(Z)>f(Y))

J(%%

dY)[f(Z)

-

f(d1 + K ( " ) ( f )

(Co-area formula). Hence

The appearance of K makes the notations heavier. To avoid this, one can enlarge the state space to E* = E U (m}. Regarding K as a killing measure on E * , the form D ( f , g ) can be extended to the product space E* x E* but expressed by using a measure J* only. At the same time, one can extend f to a function f * on E*: f* = f 1 ~ More . precisely, define J*(") on E* x E* bY E 47 x 8 J'")(C),

c c =A x or +> x A, A E 8 c = {m} x {w}.

Then, we have

L.

J*(")(dz,E*)(f*(.)( =

L

J ( " ) ( d z ,E ) ( f ( z )f( K ( " ) ( ( f l )

9 SPECTRALGAP

370

b) Take f with 7 r ( f 2 ) = 1, by a), Cauchy-Schware inequality and conditions (9.34) and (9.35), we have

The right-hand side is bounded above by 1. Solving this quadratic inequality in D(')(f), one obtains D(')(f) 2 1c ) Repeating the above proof but by a more careful use of Cauchy-Schwarz inequality, we obtain

d-,

6 D ( f >[2 - D " ) ( f ) ] . From this and b), the required assertion follows. I We now turn to study XI. To do so, we need the following isoperimetric (or Cheeger's) constants.

1

/&a) = -

inf

2 dA)€(OJ)

J ( a ) ( Ax A") + $")(Ac x A ) I

(4 J(")( A x Ac)+ J(CY)(AC x A) 4 4 n

1 k(a)' == inf 2 n(A)~(0,1/2] 44 J ( O ) ( Ax A C ) J(.I(AC x A ) 1 -inf 2 n(A)~(0,1) r ( A )A x(AC)

+

The functional farm of these constants can be expressed as follows.

9.5 CHEEGER'S APPROACHANT) ISOPTSRTMETRIC CONSTANTS Lemma 9.44. For every

CL

371

3 0, we have

Proof: Because Q 3 0 is fixed, we can omit the superscript "(a)"everywhere in the proof. Denote by and ,@ the right-hand sides of the above and k' >, E l . We now prove quantities. By taking f = I A , we obtain k the reverse inequalities. a) For any f f L:(.rr) with J ~ ( d z ) ? r ( d y ) l f ( z) f ( y ) l = 1, by proof a) of Theorem 9.43, we have

This proves the first. equality of k("). Next, we show t'hat

whcrc Il.llp denotes the LP-norm. First, let n ( g ) = 0 with inf,,R IIy-,cllw Then, because r ( g ) = 0 and ~ ( -f ~ ( f ) = ) 0, we have

for all c E R. Hence, by Holder inequality, we have

for all c. This gives us

< 1.

372

9

SPECTRALGAP

On the other hand, for a given J E L1(n), set ATf = {f 2 n(f)} and A; = { J ,

K ; ~

Pf A!"

(9.38) the infimum is taken over all i.i.d. random variables X and Y with EX EX2 = 1.

r

0 and

Proof: a) First, we prove the first lower bound of (9.37). Let f 6 a ( D ) with .(I) = 0 arid r ( f 2= ) 1. Set y = f c, c E R. Similar to the proof c ) of Theorem 9.43, we have

+

for all P : 0 < /3 < Xi1) 6 2. By Lemma 9.44, we have

Thus, it remains to estimate

K ~ Set .

y = IElXl E (0, I]. Clearly,

Here in the second to the last step, we have used Jenseii's inequality,

We claim that when c = 0, I E I x 2 - Y*l >, 2 ( 1 -

IE(X1)= 2(1- 7).

(9.41)

To see this, note that IE(X2- Y21 21E(X2 - Y 2 ;Y 2< 1 < X 2 ) = 21E [ X 2 ;x 2>, 11P ( X 2 < 1) - 21E [ X 2 ;x 2 < 11P ( X 2 >, 1) = 2(P[X2 < 11 - I E [ X 2 ; x 2< 11).

9 SPECTRAL GAP

374 On the other hand,

Hence

Thus, we obtain

Letting p T X p ) , we obtain the first lower bound. b) For any B C E with n(B)> 0, define a local form as follows.

Obviously, D - ( ,a ) ( f ) = D -(a) B (91,). Moreover, xO(B>

:= inf{D(j> : f / p = 0,

llfll

= 1 ) = inf

( 5 B ( j ) : i 7 ( f 2 1 B ) = I}.

Let nB = T ( . n B)/.rr(B)and set

~ ~ s j a=)

inf

[ J ( a ) ( A(xB \ A )j + J ( " ) ( ( B\ A) x A ) ] / 2+ J(")( A x BC)

(4

ACB, rr(A)>O

- _1 inf 2 ACB,r(A)>O

$-)(A x A") + $")(A" x A )

44

Applying Theorem 9.43 to the local form on L 2 ( B ,6 n B , nB) generated by J B = T ( B ) - ~ J I and B ~K~B = d ( . ,Bc)IB,we obtain

c) Note that in

,by Theorem 9.33

9.5

375

CIIEEGER'S APPROACIi A N D ISOPERIMETRIC CONSTANTS

We obtain the second lower bound of (9.37). We now study a different way to renormalize the general quadratic forms. In contrast to the previous approach, we now keep ( J , K ) to be the same but change the L2-space. To do so, let p be a measurable function satisfying cyp

:= .Ir-essinfp > 0,

[ J ( d z ,E )

Pp := r ( p ) < 00 and

+ K(da)]/7r,(dz)< ,f&,

7rp-a.e.

where 7rp = pn/BpO,. In the non-symmetric case, assume also that (9.35) holds. hior jump processes, one may take p ( z ) = g ( s ) V r for some T 2 0. From this, one sees the main restriction of the present approach: J r ( d z ) q ( s )< 00, since we require that ~ ( p .=) 00. Except this point, the approach is not comparable with the previous one. Next, define

Theorem 9.46. Let P, ap,Pp, rp and X , i (i = 0 , l ) be given above. Then, we have xi

2 apAp,2/L&,

2

= 0, 1.

(9.43)

In particular,

(9.44) and when K

-f-

0,

(9.45) where

rp 2 1/2

is the unique solution t o the equation 2y'Lk; = 1 - 41 - 4(1 - ~ ) ~ t k ; ,

y E (0,l).

(9.46)

Proof: Here, we prove the assertions for i = 1 only since the proof €or i is similar and even simpler.

=. 0

9 SPECTRALGAP

376

To prove (9.43) for i = 1, by the assumption on lo, axid TJcmma 9.10, it is clear that L’(n) is dense in L2(7r,) in the 11 Ilo-norm. Noting that L2(n,j is just the domain of the: form D ( f ) on L‘(T,), by definition of’ A, and AP,, , i~ suffices to show that np(f2) - r,(f)’ a p [ n ( f 2-) ~ ( f ) ~ ] 3for / ~ every , f E J J M ( 7 i ) . The proof goes as follows.

-

>

To prove (9.44) and t,he second estimate of (9.45), simply apply Theorems 9.43 and 9.45 with 7~ replaced by 7rp and renormalizing constants r ( z , y ) 5 p,, s ( x ) = P,. Noting that for the modified k(”), we have k(“) = k/,’3,”, k, = k / P p and so on, where k = k(’). To prove the first estimate of (9.45),lct np(f)= 0 and 7 r p ( f 2 ) = 1. Similar to (9.39), we have

Solving t.his inequality in D ( f ) , we obtain D ( f >3 & ( I + c 2 ) -

Jm

and so &,1

+ c2)2 - P,”(l+ c2)M(x,Y,c)k;,

> &(1+ c2) - @(1

where M ( X ,Y ,c) =

+

-~.--

+

+ c)21)2

I

Thus

M ( X ,Y,c)k;

&J >

&

(y 1 c2 .-..

/-

1

+ J1 - M ( X ,Y,C ) k Z / ( l + c2)

*

Combining this with (9.40) and (9.41), we get

A1,

{

2 pp inf max 2y2k;, I YE [O?21

Jm}

= 24,(ypkp)2,

where y p is the unique solution to (9.46). To see that 7 , 2 1/2, note that JG 1 - 2/2 and solve the equation 2y2ki = 2(1 - ~ ) ~ k i H. The main advantage of the Cheeger’s approach is lhat it works in a very general setting. Here is an example.

1

Actually, it is known that for d-dimensional (d 2 2) Ising model, there ) that IF?l = 1 for 0 < Pc and 1 3 1 > 1 for ,O > &. In exists a PC E ( 0 , ~so this sense, we say that the model appears a phase transition. It is even known for this model that when d = 2, & = log(1 fi)E 0.4407. The tool in the proof is the contour method. Let Et denote the unit square parallel to the coordinate axes and centered at t E 2'. Denote by 8Et the boundary of Et. The union U t 8 E t is called the dual graph of E 2 . A closed path I' (without loop) on the dual graph with finite length is called a contour, its length is denoted by Irl. Next, let V c Z2 and set

4

+

t € V ,xr=-l

where dB denotes the boundary of B C R 2 . Note that the set a(a,) is not necessarily connected. Let V = VL be the square parallel to the coordinate axes, centered at the origin with length 2L of edges. Let P6il E 9 ( E ) be the conditional Gibbs distribution p v ( 3 , with boiindary (2t = fl,t E 1"). The key step for the proof is the following a)

Lemma 10.14 (Peierls inequality). Let l? be a fixad contour. Then P&[X

:

r c a(zv)]< exp[-2~1rI], v = v'.

Proof: A pair {s, 1 ) is called an edge, denoted by {:St), if 1s - 61 = 1. Set v == / { ( s t ): { s , d } n V # @}I. For each :c, let denole the number of the edges ( s t ) in the above set with x, = xt and as = -51 respectively. Then

x'

Because every edge of El which separates +1 and -1 belongs t o a(z,), we = 1) - Ifi(zv>l.By using these notations, the have C - = Id(zv)l. So Hsmiltonisn can be rewritten as follows:

xt

10 RANDQM FIELDS

398

where z equals 1 at the sites out of V . Thus

To estimate the quantity on the right-hand side, we make a simple transform. Let

Define a mapping Xr 3 xv

bv)t =

-+

-xt

z;

E

X F as follows:

il 1 is in the inside of I' if t is in the outside of r

Clearly, this mapping is one-to-one, it,s role is removing I' from 3(z,). have a ( x v >= ~ ( X G u) r , l a ( ~ ~=)ia(q)l i Irl.

We

+

Hence

Corollary 10.15. PG,3[xo= -I] 6

9(2e4@- 9) e4P(e4i3 - 9)2 '

1 2

p > - log3.

Proof: Let 0 E I n t r denote that the origin 0 is in the inside of zo = -1. Then there exists a contour l? so that 0 E I n t r c a(+) the boundary is 1). Thus, by the Peierls inequality, we have

I?. Let (since

10.5 ISING MODELON LATTICEFRACTALS

399

where c, = I{r : Irl = n and 0 E Intr}l. Now, we estimate c,. Fix the length n of contours. Starting from the origin, a contour with length n can at most pass through [ n / 2 ] squares along the positive direction of the first coordinate axis. In other words, there are at most [ n / 2 ]intersection points with the axis along the positive direction. Thus, if we set to(I’) = min{t : (t,O) E r},then

I{to(r): Irl = n, 0 E Int r}l < [ n / 2 ]< n / 2 . On the other hand, starting from the intersection point on the first coordinate axis, at the first step, we go along the positive direction of the second coordinate axis. Then, at each step, we have at most three directions to go. Moreover, since r is closed, at the last step, we have to go back to the starting point and so there is only one way to go. Combining these facts n . 3n-2/2. Finally, the length of a closed path together, we arrive at c, must be even, therefore

Po = 0.6587

Let L + 00, then VL T Z d . Denote by P$ a weak limit of P?P as L + 00. Then P$ E 92 and

P,”Xo = -13 < 1 / 2 ,

p 2 p,.

P,-[X, = +1] < l / 2 ,

p 3 Po.

By symmetry, This shows that F‘j!

> 1.

# P i and hence

10.5 Ising Model on Lattice Fractals In this section, we study the Ising model on irregular lattice, especially, on the lattice Sierpinski gasket G(d) and Sierpinski carpet F ( d )introduced in Section 7.5, The spin space is again {-1, +l}. The configuration x is now a mapping: G(d)(resp. F ( d ) ) + {-1, +l}. The Hamiltonian of the Ising model on G(d) is given by

c

H(x)= (st):

ZsXtr s,tEG(d)

10 RANDOM FIELDS

400

where ( s t ) denotes the bond of the nearest neighbor s, 1 E G(d) i a the usual sense. Replacing G(d)with F ( d ) ,we have a Hamiltonian H on F ( d )for the Ising model. Consider first the Ising model on G(’). Note that every point in G(2)has four neighbors. This structure is exact the same as the model on the regular lattice 2’. From this point of view, one may guess that the model on G ( 2 ) has a phase transition. However, the answer is the opposite!

Theorem 10.16. For any d

2, the king model on the lattice Sierpinski gasket (.$dl has no phase transition.

To prove this result., we need some preparations. Recall that Y(+)denotes the collection of Gibbs states with potential = ( @ A ) : 6,&)

=

cu

x,xt

if A = { s , t ) a.nd Is - tl = 1 otherwise.

By Definition 10.1 (3), for each p E Y(@)and V E S, a version of the condit.iona1probability p ( x V = ylz, = itu,,u # V ) is given by

Let pv denote the measure p v (. x 2 ) arid let g ( V )denote t,he closed convex hull of {pV,* : 5 E {-1: 1}’jV}. Proposition 10.17.

(1) Vl c VZimplies that Y(Vl> 3 Y(V2). (2) p E Y(@)iff for every u E S, a version of t h e conditional probability p{x, = yy.141x.1, =: yv, v # u} is given by

(3)

q a ) = n,qv).

(4) g(@) is non-empty, convex, and compact.

ProoE The first assertion follows from the consistericy condition since every p E Y’(Vz>can be expressed as a convex combination of the elements in

10.5 ISlNG MODELON

LA’I’TICE

FRACTALS

401

$$‘(Vl). Next, let p E Y(@).Then

-

-

,

L

exp [ / Y Z k u E A @ a ( 4+ CXP

1

-

1 + exP

[ - 2P CA:ucn

[oc,:,,, ,

U€V*

9A(Y)]

This proves the “only if” part of (2). To prove the ‘‘if” part, it suffices to show that yv,y is the only probability measure un {-1, l}v whose one point conditional probabilities are given by the above right-hand side It is equivalent to say that the potential of the corresponding field is unique up to 8 constant. This is obvious in point of view of the Geld theory, since V is fiuite and pv,, is positive everywhere in {-l,l)‘+ By definition, we have Y(Q) c g ( V )and so 9(@) c n,Y(V). Thc inverse inclusion Y(V) c $?(a)follows from the proof of (2). Finally, the last assertion follows from (l),(3) and the compactness of the state space. I Proposition 10.17 will be extended t o more general situations in the next chnpt er.

n,

Lemma 10.18.

/ bPp(f1f2 > b). By the assumptions, for every a E [O, A] and b E [I?,11, we have

P"(fl

A-u + f2 z a ) 2 1-a'

10 RANDOM FIELDS

412 b) Next, we prove that

for some e , 6 > 0. Actually, by the assumptions, A' and A2 and hence f' and f 2 are symmetric under 7 . It suffices to show that this estimate holds for at least one k , which is then implied by the following two conditions:

+

3~ > 0 so that P"f' f 2 2 a , f 1 f 2 < b) > e. ( 10.17) 36 > 0 so that {(x,y) : 0 < z < y < 1, x y 2 a , xy 6 b }

+

c {(x,Y) : Y > 1/2 + 6) holds for some a

< 1 and b > 0.

(10.18)

By (10.16),

PP(fl+ f 2 3 a ,

f1f2

< b) 3 PP(f'+ f 2 2 a ) + PP(f1f2< b) - 1 A-a b-B a-+-1. 1-a b

So (10.17) follows from

A-u 1-a

+-b -bB

+

> 1.

(10.19)

On the other hand, if we set x y = a' 2 a , xy = b' 6 b, then x and y (x y) are two roots of the equation z2 - a'z b' = 0. This gives us

l. 2

2

( 10.20)

But both (10.19) and (10.20) can be deduced by

a 1 l , which contradicts our assumption. b) If for some a t , Iluill = 0, we claim that G(a1,a2, * 7u2n)= 0. Otherwise, we may choose a sequence ( b l , - ' * , bz,) such that +

1

-

and ( b , , * . . , bzn) contains a chain (a1 T U I , * 1 having the maximal length. According to the argument in a>, i t follows that ( b l , . . . ,bzn) must be (a), r u l , . . ,u1, r u l ) . Therefore

0 # G ( b l ; * *,bzn) = G ( a l , r ~ l , . ,* u- ~ , T u=~ l) l

~ l = ( (0,~ ~

which is a contradiction. c) Finally, we prove that (10.22) implies the triangle inequality of Ilu

+ bll = IG(u + b, r ( a + b ) ,

*.

. ,a

I\ . 11.

+ b, r ( a + b ) ) 11/2n

To prove Theorem 10.28, we consider first a particular case. Corollary 10.32. Let N be even. Suppose that p E EN) is reflection T E 91/2 and is RP. Then for every family {ft E b 8 : t E ZN}

invariant for all we have

In particular, for every f E b 2 8 , we have

10 RANDOM FIELDS

420

Proof: Consider first the reflections along the first coordinate direction. Set hT= 2n and let 9 be the vector space spanned by

Define to be the common value

Applying 'I'heorem 10.31 to the function obtain

2n-1

I

n

0,j

=

nt(zl,.., ,t(d)

f(j,t(~)

we

2n- 1

Repeating the argument to the other ( d - 1) directions, we obtain (10.25). Finally, (10.26) follows by setting $0 = f arid f i = 1, t E EN \ (0) in (10.25).

Proof of Theorem 10.28: a> The proof can be reduced to the special case that d = 1 by considering the coordinate directions succcssively, as we did in the proof of Corollary 10.32. b) For d .- 1, the assertion Eollows from Lhe abst8ract8 chess-board estimate. Actually, we need orily regard each block A e as a brick, iis [mi1/2, rn 3/21 (m E Z)used in the proof of Corollary 10.32. I

+

+

10.8 NOTES

42 1

10.8 Notes In the past thirty years or more, the random field has been one of the most active subject in probability theory and in statistical physics. Several books are now available: Dobrushin, Koteckjr and Shlosman (1992), Ellis (1985), Georgii (1988), Malyshev and Minlos (1991), Riinlos (ZOOO), Preston (1976) and Sinai (1982). A plenty of references can be found from these books. Theorem 10.6 is due to Dobrushin (1970). Actually: for the existence theorem, the weak continuous condition can be weaken and the set of parameters can be more general. See Dobrushin (1970) and Georgii (1988). Another generalization, based on RP, was presented in Shlosman (1986). Theorem 10.9 is an extension to the well-known Dobrushin’s uniqueness theorem, in which the minimum L1-metric was evaluated at each single site. The present theorem, due to Dobrushin and Shlosman (1985), treats with the metric in a firiite volurne globally. See hlaes and Shlosrrian (1991) for recent progress on this topic. Nole that the measurability question in the original ststcment of Theorem 10.9 was solved by Zhang (1999), that leads to the use of Theorem 5.32. The king model on lattice fractals was studied by Gefen, Aharony and Mandelbrot (1983, 1984) and by Gefen, Ahxon?, Shapir and Mandelbrot (1984). Theorem 10.16 was proved by J. L. Zheng in 1989 but it is published here for the first time. The proof given here is an analogue of those in proving the absence of phase transition for the model having finite range. Refer to Liggett (1985, Chapter 4) for details and original references. The dual graph used in the proof of Theorem 10.20 is due to J . L. Zheng. The corrhinatorial Lemma 10.21 is due to J. Wu, Again, Theorem 10,20 appears here for the first time, A complete proof was presented in Zhcng (1993). The proof uses some results from algebraic topology since the complex of coritoiirs, It should rriention that the proofs of Theorem 10.16 and Tlieorerri 10.20 are suitable for much more general situations. For instance, the same conclusion of Theorem 10.16 should be held for any lattice nested fractals. See also Yoshida and Higuchi (1996) for a related result. Refer to Mandelbrot (1982) and Falconer (1989) for more materials on fractals. For diffusion processes on fractals, refer to Barlow and Perkins (1988), Kusuoka (1987, 1989) and Linstrbm (1990) and the references within. For the Peierls method and its development, called Pirogov-Sinai method, refer to Sinai (1982). Here, we mention Dobridiin and Zahradnik (1985), Park (1988a,b). The R.P was int,roduced by Osterwaldcr arid Srhrader (1973) and so is also called the Osterwalder-Schrader positivity. Section 10.6 is taken from Shlosman (1986). Finally, Section 10.7 is mainly taken from Frohlich, Israel, Lieb and Simon (1978). Finally, for large deviations of random fields, refer to Ellis (1985) and Olla (1988).

Chapter 11

Reversible Spin Processes and Exclusion Processes This chapter deals with the reversibility of two important classes of particle systems, spin processes and exclusion processes. There are two reasons why we study this problem. One is that comparing with an irreversible process, a reversible process has nicer property and is easier to handle, so we should justify the reversibility of a given process at the beginning of the study. This leads also to the study of potentiality. On the other hand, in the equilibrium statistical physics, we are given (local) reversible measures (i.e., conditional Gibbs states) and the processes are constructed for describing the systems. Thus, the processes should be reversible. Then, a question arises, what rates we should take so that the corresponding process actually describes an equilibrium system. This again leads to the study of the reversibility.

11.1 Potentiality for Some Speed Functions In this section, we introduce some simple criteria for the potentiality of the speed functions of spin processes and exclusion processes. As we will see in the subsequent sect#ions,the potentiality is essential for the reversibility of the processes. Besides, the idea given here will be also used in Section 14.3 to study the reversibility for some reaction-diffusion processes.

Definition 11.1. Let S be a countable set, E = (0,l)’. Suppose that c(., -) : S x E -+ R and c(., ., .) : S x S x E -+ R satisfy the positivity condition: (1) c ( u , z ) > 0, u E s,z E E ; (2) c(u,v , 2 ) > 0, u , v E u # v; z E E , x(u)# x(v).

s,

Set

4&,4

=

{

c(u, z)

if $ = , z for some u E S if 5 # z,, z for any u,

where

( U z ) ( v= )

{

-

XV

if = if u # v,

( ( u , v ) z ) ( w= )

i

(11.1)

z(w)

if w # u , v

4.)

if w = 11 if w = v.

x(u)

Then we call Qs := ( q s ( z , Z ) : x,Z E E ) and Qe := ( q e ( z , Z ) : z,Z E E ) a field of spin speed functions and a field of exclusion speed functions, respectively.

422

11.1POTENTIALITY

FOR S O M E SPEED

FUNCTIONS

423

Theorem 11.2. (1) The field Q s := ( q s ( x , 5 ) : x , Z E E ) is a potential field iff the following quadrilateral condition

4%M JU+(% ,

u v z > ) c ( v ,4)= C(V, +(%

d ) C ( V , U U ~ ) ) C ( U ux) ,

holds for any u , v E S and x E E . (2) The field Q, := (q,-(x,$) : x,Z E E ) is a potential field iff the following

triangle condition c h 'u, 44%w ,(u,lJ)W'Lv,Ul ( w , u ) 4 = 4%70, +(w

I f ?(U,W)++?

u?( V , U ) X )

holds for any u,'u,w E S and x E E .

Proof: The first assertion is actually a restatement of Lemma 7.11. As in the proof of Lemma 7.11, we define 4u,.>

=

(u,v)Z

for u # 'u and x(u)# x(v>. But here ( 4> 03 > 0.

So there exists a xV.k E E such that

For each y E & ( V ) , U , I I E V with y(u)

# y(v),

by Lemma 11.9(2)! we have

where zv,k = ( z ~ , ~ )Hence, ~ , ~ by . the positive condition of ( c ( u ?T I , x)}, we get YEGmPKYl) x E ( S \ 1 4 s \ W Z V , k ) > 0,

v>

Therefore the restriction of Qe on E k ( V ) x z V k is a potential field. From Theorem 7.6 and Theorem 11.2 (2) we have

Now the triangle condition follows from the continuous condition and the fact that UVEsU z o& ( V ) x { z ~ , is~ dense ) in E. M7e now discuss the uniqueness problem for Gibbs states of a spin process with nearest neighbor speed function.

Definition 11.16. The speed function { c ( u , ~ ) is } said t o have nearest neighbor if, for each u E S, there is a au E S such that c ( u , .) E S ( U

u au):= &o(u u au)x E ( S \ {uu au}).

From now on we make the "nearest neighbor" assumption. Take S such that = Vn-l uav,-,, 1s

v,

v,

{Vn}yc

11.3 CRITERIA FOR RF,VERSIRII,~TY

439

and c(u, .) E G(Vn)

for each u E Vn-l.

(11.40)

For each n 2 1, we take Bvn as a reference point in E(Vn). For each w E E(Vn),we choose {y(l),. - . ,~(‘1)) c E(Vn)such that ,Q

:=

p

3

$(’) + . . . + y(‘“) + y(‘++l)

=: w

and define M V , x “s\v,, 11) x “qv,) and q.s(w x x:s\v,, ,Q x XS\V,) as usual (cf. (11.19)). Because of (11.40), we may rewrite these qS’sa5 &(dv, x xav,, w x zavn)and &(w x xav,, Ovn x zaVn),respectively.

Theorem 11.17. Suppose that ( c ( u , z ) ) is positive, nearest neighbor and satisfies the quadrilateral condition. Then there is a bijective mapping between the set of all Gibbs states for { c ( u , x ) }and the set of all the equivalent classes of linear independent positive solutions to the following equation

(11.41) E(BVn-l), n 2 1, where V, = 0, avo = V, and &(y,$)/yB(&y) = 1 as a convention when y = f. In particular, there is only one Gibbs state i f f (11.41) has only one positive

z

E

solution up t o a constant.

Proof: Noting thatj without any confusion, we can use exp [ - ’p(Bv,,-, x x,y x z ) ] to denote (TR(f?vn-l x z , y x z)/q8(y x z , dv,,-l x z ) since the pathindependence, where cp(y, 2) is the work done by the field Qsfrom y to z. a) Suppose that (Zn,z} is a positive solution to (11.41). Define

hn(Y x z ) = 2;’ ~ X P [ - L ~ ( ~ Vx , X_,~Y x

~ n , t

(11.42)

for each y E E(Vn-l), z E E(dVn-1>and n 2 1, where

c

zn=

exp[-cp(bn-l x

*I

Y x z>IZn,z.

(11.43)

y€E(Vn-11 z E E( aV, - 1

Clearly, /inis a probability meamre on E(VOZ). We now prove that b) {/Jn}n>l i s a consistent family. By the path-independence, wc have cp(&n

x w, Y x

x w>

= P(Ovn x w,Qvn_lx = ‘p(Qvnx w, x

+ x w) + 9+v,-1

x w) ’p(Qv,-, x z x w,Y x z x w) x

*, Y x 2)

(11.44)

440

11 SPIN

PROCESSES AND

EXCLUSION PROCESSES

for each y E E(I&-l), z E E(aVn-l) and w E E(aVn). Hence for each n 2 1,

=

C

exp [ -

and so, by (11.44),we get

for each y E E(Vn-l), z E E(aVn-l) and n 2 1. Therefore) by the Kolmogorov consistency theorem) there is uniquely a probability measure p on 8 such that pn is the projection of p on E(Vn). Next, we prove that c) the measure p obtained in b j is a Gibbs state. By Theorem 11.1(l),it is enough to prove that p is a reversible measure. But this follows immediately from (11.42), the path-independence and the following result.

Lemma 11.18. Let p be a probability measure on (E,&'),then

/L

E .%'(St,) ifF

for each y E E(V,) and u E Vn-l, where p., is the projection of p on

E(Vn).

11.3 CRITERIA FOR REVERSIBILITY

44

Proof: If p E 9 ( f l s ) , then applying Lemma 11.8(1) to f = I { y } x ~ ( ~ \ ~ , , (y E E(V,)) gives (11.45). Conversely, it is trivial that (11.45) implies (11.46) with the above f in the cases that u # V, or u E V,-I.

If u E V,

\ Vn-l

=

aV,-,, from (11.45)) it follows that

for each z E E(V,+1

~ P X = x}

\ V,).

E(S\Vn+i1

That is

4%z)p(d4 = UYX

=Ix E(S\V,+l)

4%+(W.

Summing up z over E(V,+1 \ V,), we get (11.46), and then p E 9 ( f l s ) by Lemma 11.8(1). The lemma is now proved. 1 We now return to our main proof. d) Let p E 9 ( f l S ) and f i n be the projection of p on E(V,). Then, by (11.45), we have

Summing up w over E(dV,), we get

:= ,5n(6V,-1 x z ) : z E E(aVn-l), n 2 l} is a positive solution Hence {5& to (11.41). Next, from (11.45) and Theorem 7.6, we see that

P7dY x

4 = fin(6Vn-l

x 4 e x P [ - P(ev,-l x

2,

Yx

43

for each E E(V,-1) and z E E(aVn-l). Taking 2,,= instead of 3,,= in (11.42) and (11.43), we obtain 2, = 1 and p, = G,, for each n 2 1. So p is the same as the measure obtained in b ) . We have thus proved that the mapping defined in u ) and b) is an onto-mapping. We finally prove that it is also an one-to-one mapping. e ) Let {&$ : z E E(aV,-l),n 2 l}, i = 1,2 be two positive solutions to (11.41). If %,Z (1) = n,z> E E(aK-l), 21 (11.47)

442

11 S P I N

PROCESSES A N D

EXCLUSION PROCESSES

(~22)

for a constant cy > 0, then 22)= by (11.43)] and hence p?) = p?) by (11.42). We see that {xt;} and {x?;} determinate the same Gibbs state p by b). Conversely, if { x c ' z } and {xk:'z} determinate same p E 9 ( f l S ) ,then for each n 2 1, p?) and p?), as the projection of p on E(V,), must coincide. In particular, by (11.42), we have for each z E E(aVn-l)l

(.zc))-'zSk = p:)(evn-l

x

.)

= pLn (2) (6 vn-l x

z) = (

z~))-'

But 2;) is independent of n 2 1, so (11.47) holds. Example 11.19. Let @ : S + R satisfy CBCuEBES l@(B)1.\B\ < oa for every U E Set < cxp

[

-

It

is equivalent

C(s)ds] Q(t)*(t).

Noticing Z(0) = 0 and the absolute continuity of Z ( t ) ,we see that

Next, by definition of Z(t>and the exchangeability of Q ( s ) Q ( t ) , we have

T h u s , by definition of Y ( t )and condition (13.9), we get

This proves (13.10). Finally, if the integral form in (13.9) is an equality, then every inequality in the proof becomes equality. I

Lemma 13.7. Let ( q ( x ) ,q(s,d y ) ) be a q-pair on an arbitrary space ( E ,€'I and let T , k E &/B(R$). If s1 r' 6 h on E , then Pmin(t) T r+J: Pmin(s)hds.

0,

El Finally, if we use " A (= B" to denote ''[A- BI

G (c,( t )P. (21,.,))

t 2 0,

(4+

/

t

0

(Cn(t - 4cp.(8,z2;n, 4)( ' W ) d S ,

q , x 2 E Eo, m 2 n 3 1, where Cn(t)was defined in Theorem 13.1. Furthermore, wAn(Pn(t,

B ) E 80,

w,

P(t>p< C ( I + p ) e c t ,

t 3 0, z

E E~

(13.25)

and

However, the convergence in (13.24) is not necessarily uniform in IC E EON, which is just the point why we need a different approach to prove the semigroup property.

Lemma 13.11, Let Y c Yip(E0)satisfy sup{L(g) : g E 9?} < 00. Then for each t 2 0 and x E Eo, we have (P7,(t)- P ( t ) ) g ( z ) 0 uniformly in g E Y as n -+ 00. ---f

Proof: Set gn(x:>= g ( x C ” n ) ,5 E Eo, n 2 1. Then supn2,L(g,) ,< L ( g ) . The assertion now follows from the fact

plus (13.26) and (13.25). I One main character of P ( t ) constructed by Theorem 13.8 is as follows.

13.1 EXISTENCE TIIEORGMS

FOR TEIE PROCESSES

483

P(s)= P ( t s) on & i p ( E ~ ) .Note that P,(t)P,(s) = Pn(t + 9 ) . Given f f yip(&), we have

+

and

gn = 0. Hence, by the dominated convergence theorem,

Iim P ( t ) ( P ( s )- P,(sj)f(x) = lim P(t)g,(s) = 0. n-+m

n-03

0 and a constant c
. Thus, the reaction part of the formal generator of the process is as follows:

where e, is the element in E: having value one at u,and zero at other sites. Moreover, we use the following convention:

=o:

q&j)

2

E

;z+, j $ z.,,

TIE

s.

'The second part of the generator of the process consists of diffusions between the vessels, which arc described by a transition probability matrix P = (p(u,v) : u: 'u E S). For instancc, if there are k particlcs in u: then the rate fiinction of the diffiision from u to is c, ( k ) p ( u ?v), where

c, 2 0,

C,(O)

= 0,

uE

s,

(13.37)

Hence, the diffusion part of the formal generator becomes

Finally, the formal generator of the reaction-diffusion processes can be expressed as follows:

W(.) Choose {An}? corresponding

c S, A, On,d

= flrf(4

+ W(4.

S. Replacing S with A,, we can define the and

nn,respectively.

13.2 EXISTENCE THEOREM FOR REACTION-DIFFUSION PROCESSES

487

As explained in the last section, we need to use a smaller space Eo instead of E : Eo = { L T f E : 1121( := Cu5,ku< oo}l where {k,} is a positive sequence such that

(13.38)

From now on, we call ( p ( u ,v)) satisfying (13.38) an M-controlled matrix. Then, starting from 2 E Eo? after a linear immigration: c,(k) = k : the number of particles adds to the site v is equal to z,p(u, v>.Hence

c,

and so the process will still stay in ED. Note that for a given (p(u,v)), the required (ku) always exists. For instance, take A1 > 1 arid u bounded positive sequence (d,,L)land set

This explains the source of the sequence (k,) discussed in the last section. Besides, under the assumption

(13.39) we may choose a sumrnable (d,) so that (k,) is also summable. Now, it is the position to discuss the conditions made in the last section. qu(Ol k)k and condition (13.16) becomes In the present case, ,& :=

cF=,

(13.40) u

Next, the set EOOintroduced before Corollary 13.15 becomes

To guarantee the denseness of EOOin ED,we adopt the assumption q&, kfQ

i

+ k)lkl
o} COO,

+ Ic) - qu(j1,jl + k ) ) k ,

(13.43)

j,

> j , 2 0,

Set

Obviously, Finally, set

Remark 13.16. If for every

is a birth-death Q-matrix,

then

Proof: Deniote by c2 the quantity given by the right-hand side. Clearly, in the present situation, and hence On the other hand,

< (32 - j&,

j,

> j , 3 0. I

We arc now ready to state the main result.

13.2 EXISTENCE THEOREM FOR REACTION-DIFFUSION PROCESSES

489

Theorem 13.17. Under conditions (13.37), and ()13.40—(13.43), there exists a Markov process with state space (E,E). Moreover, for each we have and

1() (2) (3) 4() For each 5() For each (6)

is continuous in t. is continuous in t.

Proof: a) Since

and

we have

where pu(x) = xu, b(u,w) = c p ( u , w) and

To show the regularity of R,, note that

490

13 CONSTRUCTIONS

OF THE PROCESSES

by the above inequality. Hence, the required assertion follows from Theorern 2.25. We have thus checked the first condition of Theorem 13.8. b) To check the second condition of Theorem 13.8, we use the coupling of marching soldiers. For the diffusion part, the coupling is

For the reaction part, at each u E An, the coupling of marching soldiers is as follows:

When u E Am \ An, let the particles evolve independebtly. Then the whole coupling for the process is

In particular, for

we have

13.2 EXISTENCE THEOREM FOR REACTION-DIFFUSION PROCESSES 491

On the other hand, for the reaction part, we have

then

492

13 CONSTRUCTIONS OF THE PROCESSES

By symmetry, this estimate also holds for 2, 6 y.,, Combining the above two estimates, we arrive at

(13.44) Therefore, condition (13.17) holds with the choice:

We have proved that the Lipschitz condition of Theorem 13.8 is satisfied. To obtain the estimate in part (1)of the theorem, we need a little more careful. By (13.44)) we have

on,nP W ( 2 , y) 6 c; P w ( G 9)+

Hence

1- P ( W , 4

-

c

v$An

P

h4

>

-C

I

P h4

13.3 UNIQUENESS THEOREMS FOR

THE PROCESSES

493

and so

From this, assertion (1) follows easily. c) It is easy to see that the hypotheses of Corollary 13.15 are also satisfied. Then, the conclusions of the theorem follow from Theorem 13.8 and Corollary 13.15 by some computations. 13.3 Uniqueness Theorems for the Processes

In this section, we prove some uniqueness theorems for the processes constructed in the previous sections. Two different approaches are used here. The first one is the usual semigroup approach. The second one is the weak maximum principle. Theorem 13.18. Suppose that the hypotheses of Theorem 13.8 hold and additionally, condition (13.30)and the following conditions are all satisfied.

(1) Growing condition:

CuEAn

where piy)(x) = ~ ~ ( z ) ~m(> , k1 ~) ,is the minimal number so that the above control holds and K1 is a constant.

(2) Moment condition:

Then there exists uniquely a Markov process having the properties listed in Theorem 13.8. Moreover,

where Em = {x E ,230 : p ( " ) ( z ) := Finally

xupu(x)mk, < co} and

K2

is a constant.

(13.46)

494

13 CONSTRUCTIONS O F THE PROCESSES

Proof: a) Since Em C Eo, Theorem 13.8 and Corollary 13.15 are applicable to the present case. Next, by the moment Condition and Lemma 4.13, we have ~~(t)pi:)(x) 6 (1 +pi:)(z)) exp [ ~ ~ t ]t ,2 0,

2

n 2 1.

E

By using an approximating argument, we obtain

t 3 0 , x E Em.

P(t)p(")(x)6 (1 + P ( ~ ) ( Z )exp ) [Kzt],

(13.47)

This proves not only (13.45) but also that Em is a closed set of the process. b) Let f E YZp(E0). By the growing condition, pnf(z)I

Since R,f(z)

< KIL(f)(l+drn)(5)),

+ Rf(z),

Ifif(.)I

2

E Em, n

2 1.

we obtain

6 K & ( f ) ( l +P(")(Z)),

J:

E

Gn.

In particular,

and so

for t

< 1, f E Yiip(E0) and z E Em.

Thus, by a) and the dominated convergence theorem, we get

= P(t)Rf(z),

t 2 0,

2

€

Em, f E pip(&).

Combining this with Corollary 13.15, we obtain (13.46). c) Finally, let Pk(t),k = 1 , 2 be semigroups on Tip(&) perties: i) Pk(t) is Lipschitzian on

having the pro-

YZp(Eo),

L ( P l , ( t ) f ) 6 L ( f ) e x p [ c 2 t ]f ,~ B p ( E o )Ic=1,2 , for some c 2 > 0 ; ii) Em is dense in Eo with respect to p and Em is closed for both Pl(t) and PZ( t ); iii) (13.46) holds for P ( t ) = Ph(t), k = 1,2.


THE

PROCESSES

495

From these, we claim that Pl(t) = Pz(t),t 2 0. Since the denseness of Em and the Lipschitz property, it suffices to show that P~(t)f(z) = P~(t)f( for all f E b-%p(E0),z E Em and t 2 0. But Em is a closed set, we may replace Eo with Em and consider Q ( t ) as a semigroup on 9Zp(Em). Then the conclusion follows easily from the Hille-Yosida theorem. Actually, for f E b9b(Ern)t let

F ( z ) = Fx(z)= Then F E &?Zp(E,)

Jom

whenever X

> c2. On the other hand,

Hence

P ( ~ ) F ( x-)F ( z ) e X h- 1 h

z E Em.

e-"P(t)f(z)dt,

h

F ( z )-

$1

h

e-'"(t)f(z)dt.

By iii), we have

RF(2) = XF(Z) - f ( z ) ,

z E Em.

Or

( X I - S2)F(z)= f ( ~ ) , 1~ E Em, > ~ 2 . (13.48) From this, we show that F is determined uniquely by f . To do so, let FI and F2 satisfy (13.48) and set g = Fl - Fz. Then

z E Em, x > c2.

Rg(z) = Xg(z),

By iii) again, d --P(t)g(z) = RP(t)g(z)= P(t)Rg(z)= XP(t)g(z)

dt

and so by the continuity,

e-xtP(t)g(z)- g ( z ) =

1

t d

(eWxsP(s)g(z))ds = 0.

0

Therefore g ( 2 ) = e-"P(t)g(z),

This gives us SUP I g ( 4

zEEm

x E Em,

> cz.

< e-xt VEEm S U P IS(Y)I*

Hence g = 0. Finally, for given two semigroups Pk(t), we have two functions Fk(z)defined above for which (13.48) hold and hence Fl = F2 as we have just proved. Therefore, Pl(t) = &(t) on b2ip(Em)by the uniqueness theorem of Laplace transform. Now, we apply the above theorem to the reaction-diffusion processes.

496

13

CONSTRUCTIONS O F THE PROCESSES

Theorem 13.19. Suppose that the hypotheses of Theorem 13.17 hold and additionally the following conditions are all satisfied.

(1) Growing condition: For a fixed rn 3 1,

(2) Moment condition: sup

c

Q&,

i

+ k) [ ( i + k ) m - im] < Kz(1 + P ) ,

iE

z+

kfO

(3) Transition condition: supv

xup ( u , w) < C,

Without loss of generality, assume that k, clusions of Theorems 13.17 and 13.18 hold.

00.

< co (by

(3) ). Then the con-

Proof: What we need is to check that the conditions given here imply the corresponding ones given in the previous theorem. a) For rn > 1, by the growing condition, transition condition, the C,inequality and the Holder inequality, we have

, 2, c, = 1 if m 1. Note that this estimate holds even for rn = 1. Combining this with conditions (2) and (3), we have checked the second condition of Theorem 13.18.


497

THE PROCESSES

b) Similarly, one can check that the first condition of Theorem 13.18 follows from the first one here since the diffusion part is M-controlled by the distance I] . 11, which was defined before (13.38):

The idea of the above theorem is keeping the function f in L@ip(EO)but restricting EOto Em. The next result goes to the opposite direction.

Theorem 13.20. The assumptions are the same as the previous one except the last two conditions are replaced, respectively, by the foltowing ones. (2)' Moment condition:

(3)' Transition condition: There are a positive summable sequence (k,) and a constant M ( m ) such that

Then there exists uniquely a Markov process having the properties listed in Theorem 13.17. Moreover, for each f E 9ipm(Eo):

Zip,(Eo) = {f : f is Lipschitz continuous with respect t o the metric

lb - Y l l m = c,IX,

- YzLlJi~}

and t > 0,

(13.49)

Remark 13.21. Condition (3) plus MI imply (3)'.

:= sup { p ( u ,v)'-~ : ZL,

v E S and p(u, v)

> 0}
/ N'. Next, choose N2 so that [ b i - c m a i ] / i A for ) all i 2 N 2 . Put N = N 1 V N 2 . Then for each n 2 1, some A E ( 0 , ~ and we have

1, as a function on H,, g is compact. Next, let F : [O,T]x X below and set

---f

W

be a bivariate continuous function bounded

Finally, let R be a linear operator on @(&',g) with f21 = 0 and satisfy:

( M z ) Rg

X

< ag + b on X for some constant a,b 2 0.

500

13 CONSTRUCTIONS

OF T H E PROCESSES

(M3) Moreover, for each T > 0, if (P, E @(F,g) (n 3 some no), as a function on [0,TI x Hn, achieves its infimum at some point (s,, x,) E [O, TI x Hn,r:

H,,r := {Z E H , : g(x) 6 then

-

T},

Ry,(s,, .)(x,) 2 0.

Theorem 13.23 (Weak maximum principle). Let F and fl be as above and satisfy (MI)-(M3). Suppose t h a t for each x E X , F ( . , x ) is continuously

.>

differentiable and

get,).

2 R F ( 4 x)) (t, z E x. F ( 0 , x ) 3 0, Then F 2 0 on [O,T]x X .

{

E [O, T ] x

Proof: Without loss of generality, assume that a

x

> 0. Define

- t ,x) + (T - t ) +~&e-at[g(x)+ b/a]. Then f E ( t ,.) E @ ( F g) , for sufficient small E > 0. Clearly we have f E ( tx) , = F(T

on [O, TI zfE+ QfE < --E on X . f E ( T.), 2 0

(a

x X,

(13.52)

Thus, we need only to show that f E 3 0 on [O,T]x X for sufficient small & > 0. Suppose that f E < 0 at some point (s,x). Since H , X , there must be an N so that ( s , x ) E H , for all n 2 N. Take T

I

= inf{F(t,

x) : (t,x) E [o,T ] x

x>I

. eaT/&.

If T = 0 then there is nothing to do. Otherwise, from ( M I ) and (M3), it follows that there is a compact Hn,r so that

3 0 on [O,T]x (Hn\ Hn,r). Thus, (s,x) E [0,T ] x Hn,r for all n 3 N . But [0,T ] X Hn,r is compact, achieves its minimum at some (s,, x,), .fE

fE(sn,2), and

6 fc(s, x) < 0,

a -f&(Sn,%)

at

2 0,

fE

n2N

n 2 N.

Combining this with (Ads), we obtain

This contradicts with (13.52) and therefore f E < 0 is impossible. We now apply the maximum principle to the reaction-diffusion processes.


THE

PROCESSES

501

Theorem 13.24. The assumptions are the same as Theorem 13.19 except the growing condition is removed and the moment condition holds for some m > 1. Then there exists uniquely a Markov process having the properties listed in Theorem 13.17. Besides,

for some constant K2.

u,

H,. Proof: a) Take Hi, = {z E EO : z, = 0, u $ A,} and set X = Then X is countable. Endow X with the discrete metric p. Next, take g(z) = p(")(z),

2

E X.

It is easy to see that g E C ( X ,p ) and is locally compact. Actually,

H,,?.

:= {.

E H,

:

g(x) < 7=}

is a finite set. However, if we use the metric induced by p ( z ) , then g is clearly not continuous with respect to p unless m = 1. For this, we have to be careful. Finally, let Pk(t),k = 1 , 2 be two semigroups constructed by Theorem 13.8, F ( t ) = Pl(t)f- P2(t)fand define fi to be the restriction of R on @(F',g):

6cp = R F ( t , .) + q R g ,

cp E

@(F,g).

By assumptions, it is easy to check, as we did in the proof of Theorem 13.8, that the operator 6 defined on @ ( F , g )satisfies the hypotheses of the maximum principle except (A&), which is often the key point in the applications. b) Fix T > 0 and suppose that cp,(s,z)

:= F ( s , z ) + " y g ( z ) +a?) E @(F,g),

(s,z) E

achieves its infimum at some point ( s ( ~ ) , z ( ~ )E) [O,T]x x(,) ke, E H,. whenever u E A,. We have

+

and

[O,T]x H , Note that

13 CONSTRUCTIONS OF THE PROCESSES

502 Thus

:= I

--+

+ II.

0,

n

--+ 00.

(XF))~~,

Here in the last step we have used p(")(a(")) = C , 1) are bounded, we have the integration by parts formula:

Let

Sincew

we have

Therefore

We have at last obtained the estimate

These facts are enough to deduce the existence and the uniqueness of the process, as mentioned in Remark 13.14. To conclude this section, we introduce two related models. For which, the details are omitted.

13.4 EXAMPLES

509

Exarriplc 13.37 (Limiting Gaussian process). Take S = Zd9,E, = R and pu(zu,yu) = 15, - gal, 71, E S . T,et { T , ~ : u,v E S} be a ramily of constants satisfying the following conditions. (1)

T,,

= Tlu-vl

(2)

Tuu

> 0,

2 0.

CvEZ",vfurvu

< Tuu.

(3) Cv~vueYlu-"l6 C < 00 for some constants C and y > 0. (4) The matrix (ruv : u,v E A) (A c Z d ) is positive definite. For each n 3 1 and u E A,, let px(z,dy) be a probability of which onto (Eu,gu) is the Gaussian measure with variance 1/ruuand mean )&,n\{ul ; C ~ T ~ ~ / T , Finally, ,.

the family of the local characteristics of the process are

as follows: 4.12(z)d y )

Cu~h, px(z, d y ) ,

q71.(z) = q n ( x >E A n ) ,

-

??,

2 1*

Then, Theorem 13.1 is suitable with the choice k , = e- YI u l , u E Zd, 0 < ;j < y and p ( z ) = Cuzuku.Actually, the limiting process is ergodic. See Basis (1980) for details.

To see the main difference between Theorems 13.1 and 13.8, let us return to Example 13.35 and using the same coupling there. We have seen chat Theorem 13.8 is suitable for this model. As for Theorem 13.1, the natural , ) )would be choice of (cu,,) and ( c w ( n m cuw =

{

u#fu,

P(U, 4 ~ w / k u ,

+ 1,

P(U>U)

U=W

4% d = Cn 6

and there are constants I 7E

flnh,

hL(&J,

21,

E

Eu

(14.1)

and rl E ( 0 , ~ such ) that

[O,cxl)

< K - qh,

(resp.

< 0),

(14.2)

where h(s)- CUE*,, hA(su)hL. Then

(1) For each n 2 1, the process P,(t) has a t least one stationary distribution 7rn

satisfying

< K/q

(resp. 6 const.).

7rn(hn)

(14.3)

(2) The process P ( t )constructed in Theorem 13.1 has a t least one stationary distribution 7 r , which can be obtained as a weak limit of a subsequence of the s7;r and satisfies

given

xu

in (13.2) (resp. (13.17)) also satisfy

ccuw < -q

< 0,

u E S and

(14 -6)

W

~ ( c U w 6 (K W

< cx),

u f S.

(14.7)

51 8

14 EXISTENCE O F STATIONARY DISTRIBU'I'IONS

AND

ERGODICITY

Then

(1) The process P ( t ) constructed in Theorem 13.1 (resp. Theorem 13.8) ha5 exact one

stationary distribution

T

satisfying

for every f E h%jf!(E) f l z i p ( & ) ( & ) (resp. f f ~ i p ( & ~ ) ( ~ o ) ) ~ (2) For each n 2 1, if the coefficients c,,(n,n) given in (13.8) vanish, then Pn(t) has exact one stationary distribution rn satisfying

Proof: Wc prom the first assertion only since the second one can bc proved in the same way. To do so, we first justify the conditions of Theorem 5.23, The first moment, condition is covered by (13.11) (resp. (13.25)) with 9" = { p E 9(&) : &)I < ca). Ncxt, by the main estimate (13.8) (resp. (13.22)), we have

where Cn(f) = exp [tC;] Thus, if we set I

,P(t7 2 2 , 9)- W ( W ,2 3 , * ) ,P ( t ,2 4 , 4)I 6 I W ( P ( 4 2 , , * ) ,P ( k z 2 , * ) ) W ( P ( t , z , , . ) ,P ( t , a 3 , - ) ) ] + IW(P(4Z,,.), P ( 4 z 3 , 9)- W ( P ( t , Z 3 , . )P, ( t ,2 4 , G K ( t )(p(.I t 2 3 ) + P ( 2 2 , .4)).

I) .

Hence, W ( P ( t z, l , .), P(t,x2,.)) is indeed continuous in (xl, x2).

As a straightforward consequence of the above theorems, for the reactiondiffusion processes, we have

Corollary 14.5. Under the hypotheses of Theorem 13.17, if (14.2) holds, then there exists exact one stationary distribution T. If c,+cM < 0, then the process is ergodic and W ( P ( t , z -), , T ) ( p ( z ) 7r(p))e-qt for ail z E Eo and t 3 0.

+

0, C = sup{(M q ) k b ( k ) - u ( k ) : k 3 0) and take K = CC,,,k,. Then C E [ O , m ) and K E [0,m). On the other hand, for h,(z,) = p,(z,) = z,,we have

6 So

1[Mx,+ b(z,) - u(x,)]k,.

14.2 ERGODICITY FOR POLYNOMIAL MODEL

521

Assertion (1) now follows from Theorem 14.2. For the linear growth model, some restriction is needed for the existence of stationary distribution, even in one-dimensional case (cf. Example 4.57). The assertions for the ergodicity follow from Theorem 14.3. H

Remark 14.7. Sometimes, we can choose a transition probability ( p , ( u v)) , on A, for each n 3 1 to keep the M-controlled condition. (For instance, it is the case if (p(u,v)) is translation invariant in S = Zd with p(u,u)= 0. See Remark 14.9 and the proof c) of Theorem 14.10 below.) Then, the constant M used in the above examples can be replaced by M - 1. In general, it seems not natural to involve the constant M in the ergodic conditions for a real model. However, some example shows that the ergodicity does depend on the choice of M and its corresponding sequence ( k , ) . On the other hand, for a given model, the choices of M are usually not unique. So it is natural to take E m i n := {x E E : c,~,Ic,(M) 0 :there exists a positive sequence ( k , ( M ) ) so t h a t ~ , p ( u , v ) k , ( ~ 0, 6, >/ 0. From now on, we will often use the following natural hypothesis.

522

14 EXISTENCE OF STATIONARY DISTRIBUTIONS AND ERGODICI

Hypothesis 14.8 (H). The transition probability p ( u , v) is translation invariant and irreducible in S = Zd, p(u,u) 0 and mo 2 1, 00,61,Sm0+l > 0.

=

Remark 14.9. Because of the translation invariance, for each h4 > 1, the sequence (k,) defined by

c 00

k,

M-np(n)(u,o),

:=

u Es

(14.13)

T&=O

P 2

< 61

(14.14)

363 P; +4 +(2 2pZ)) 363

(14.15)

respectively. Clearly, the pure birth rate PO plays no role in these conditions. The reason is that the distance used to deduce the conditions is the ordinary (in particular, translation invariant) distance on Z+and the coupling used there is the one of marching soldiers. All these are certainIy not necessary. In view of Theorems 5.37 and 5.38, there are two ways to improve the above results. The first one is introducing a refined translation invariant distance and adopt the coupling by reflection, based on Theorem 5.37. The second one is to use the classical coupling and a non-translation invariant distance, as suggested by Theorem 5.38. Both ways are meaningful. Here we adopt the second one as an illustration. Thus, we consider the following distance

P(k4

=

1

k >e E

c u jj$,wk < 00. Set u* = 0 v - ui).Suppose that there exists an & > 0 such that bk+lUk+l-(bk+ak+l+k+l-&)ur,+(ak+k)uk-1+u+ku*

6 0 , k > 0, (14.16)

where a0 = 0 and 'LL-1= 1. Then under (H),the reaction-diffusion are exponentially ergodic, uniformly in initial points.

processes

1.4.2 ERGODEITY FOR POLYNOMIAL MODEL

523

We remark that (14.14) and (14.15) can bededuced from (14.16) by setting = 1. For instance, for the second Schtogle model, (14.16) with Uk z 1 becomes 2PZk - 61 - 363k(k - 1) < 0. This is trivial when k = 0. Hence: the condition becomes Uk

This holds iff A := (2p2 - 383)2 - 1263(61 - 202) < 0. That is (14.15). The next two corollaries are out of the range of (14.14) and (14.15).

Corollary 14.11. Under (H), the processes are exponentially ergodic, uniformly in initial points, provided (1) dmo+1 is large enough for fixed & and 6 k , k 6 mo, or (2) PO is large enough for fixed 0,and 6 k , k 2 1. Proof: Take

Uk =

( k + 1)-l, k 2 0. Then ii

-

1: u*

2:

0 and (14.16)

becomes

Since the degree of ak is higher than that of bb, this inequality holds toor large enough k . Next, for fixc?d k , in case (1) (resp. (2)), the second (resp. first,) term on the left, can be arbitrarily negative for large enough Srnoil (resp. PO). Now, the assertion follows from Theorem 14.10.

Of course, in order to get a more precise ergodic region, some restriction is necessary. T h e one-paramet.er coefficienbs used in the next corollary have a deep reason, which will be explained at the end of 316.1.

Corollary 14.12. Consider the second Schlogl model with = 2a, 02 = 6a, and S, = Q > 0. Then, the processes are exponentially ergodic, uniformly in initial points, for all o 2 0.7303.

61 = 9a

Proof: Take

E

6 lo-', uo = 1, u1 = u2 = 3/2

+

E

(trick!),

Define k.1 = inf{k 3 2 : U ~ + I 2 uk). When Q = 0.7303, a numerical computation gives us !q = 15 ( k l can be smaller if a is bigger). We now show a technical point so that the computation can be stopped in finite steps.

14 EX~STENCE OF STATIONARY DISTR.IRUTIONS AND EKGOUIC

524

First, replace ukl+l.by ukl =: IL, (14.16) still holds. Next, suppose that we have already had ux: = uc+1 = g for some k 2 2. Then, in order for uk4-2

2 g,by definition, it suffices that (%+2

+ bk+l + 1

-

E

-

Uk+l)%

- (k

bc+2 Equivalently, !L>

Au(k

+ 2)u, + k + 1 2 2.

( k + 2)u, - ( k + 1) + 1) - A b ( k + 1) +'l -

E

where A a ( k ) = a k + l - u k . Thus, by induction, once this holds, we can indeed use uk = g instead of the original uk for all k 2 k l . In other words, the computation of (uk) can be stopped at k l 1. Next, since the resulting sequence (uk) satisfies (14.16) with u = u1 and u*= ii - 1, the conclusion of the corollary follows from Theorem 14.10. The remainder of this section is devoted to the proof of Theorem 14.10. For this, bhe m x t estiniate plays a key role.

+

Lemma 14.13 (Esti.matc of Moments). Under (H), for every m 2 1, there exists a decreasing function pm: ( 0 , ~3) [0, ca) such that

E z ( X u ( t ) m L, 0,

7~

E Zd and

x E E,",

(14.17)

where (X(t)>,,, is the reaction-diffusion process and

E; = {x E Eo : 2 , - zo for all

PL E

Zd}.

Proof: a) Let En denote the expected value of the process starting from xu = n. Note that Et c Em for all m 2 1, where Em was defined in the last chapter:

E,, =

{

II: E

EO : p ' " ) ( ~ ) := C ~ r k tl := inf{t > 0 : g n ( t ) M } and so g n ( t ) < KM for all t f ( O , t z ] . Furthermore, g n ( t ) < M for all t > t 2 up to t 3 := inf{t > 0 : g n ( t ) 2 M } . However, by (14.21) again, we have gn(tA) = 111 and gk(t3) < 0. Hence, the function gn goes down again, provided t 3 < 00. Thm, g, can never exceed M for all 1 2 t z . Combining this with (14.18), we obtain

K M . Define T K M = inf{t

2 o : gn(t) < K M } .

(14.23)

14.2 ERGO 111crr Y

F'DR

P oLY N o M IAL M oLIEI,

Then by (14.20), (14.21) and the fact that gn(t> 2 K M for all t have

527

< T K Mwe ,

N.ow, applying (iv) to obtain d

-dgt n ( t )

Write

E~

=E

< -&Elgn(t)T(gn(t)),

t < TKM.

(14.24)

Note that, on the one hand, we have

E ~ .

Here we have used the fact that gn(t) E [ K M , nm] for all t < T K Mand (iii). On the other hand, by (12.24), we have

Therefore, T K M< 00 and so g, 2 K M on [0, T K Mby ] the continuity of gn. By using (14.24) again, we get

That is

Set O0

Then

du

0 .

528

14 EXISTENCE OF

STATIONARY

DISTRIBUTIONS AND ERGODICI

Because r is strictly decreasing in (0, m), so its inverse function exists but also decreases. From (14.25), it follows that loggn(t) 6 T * ( E Z t )

< 00,

t

E

I?* not only

(0, T K M ] ,

i.e., gn(t) < exp [ r * ( E z t , ] , t E (0, 5%M]. Combining this with (14.23), it follows that cp&)

:=

t >o

(exp [ r * ( ~ ~ v (t K) ]M)) ,

provides a desired function. Proof of Theorem 14.10: The proof is split into five steps. a) First, consider the finite dimensional case. Let S be a finite additive group. Suppose that (p(u,v) : u,v E S ) is a translation-invariant transition probability :

p(u

+ w, v + w) = p(u,v)

for all u,v,w E S.

By using S instead of Zd, one can define an operator as in (14.11). For the diffusion part, for each u,we adopt the coupling of marching soldiers. (2, Y)

+

+

-+

.( - eu + ew, Y - eu + ew) .( - eu ew, Y) (2, Y - eu + ew)

+

at rate (xuA Yu)Pb,4, at rate (xu - Yu)+P(u, 4, at rate (yu - x,)+p(u, v).

For the reaction part, at each u E Zd, due to Theorem 5.38, we adopt the classical coupling. If xu = yu, then the two marginal processes evolve at exact the same rates. If xu # yu, then they jump independently

(x,Y)

.( + eu, Y) -+ (x - eu,y) (2,Y eu) ( x , v - eu) -+

-+

+

+

at at at at

rate rate rate rate

b(xu),

a(xu), b(Yu), u(yu).

Clearly, this coupling is order-preserved. b) Next, we make some computations. Denote by fiC the coupling operator defined above. Fix x 6 y and u E S, write xu = i j = yu. We have

,.,

R,-p(i,j)=

~ ~

(14.26)

529

14.2 ERGODICITY FOR POLYNOMIAL MODEL

where I k 2 1 is the indicator of the set { (z,y) : y, - 2, 2 l}. The last term on the right-hand side appears since p is not translation invariant. Now, by (14.16), for the first term on the right of (14.26), we have

e=i j-1

< -.>ut

- ( j - 2)'lL -

( j - 2)iu'

t=i

< -ep(i, j ) - ( j - i)u - iu*.

(14.27)

On the other hand, we have monotonicity, x translation invariance E"?Z,(t)

= +?zo(t),

C p ( v ,u)= C p ( 0 ,u - v) = 1, 21

V

where Z ( t ) = ( X ( t ) , Y ( t ) )for , any initial x

E"

= {X E

< y + X ( t ) < Y ( t ) ,a.s., and

Z$ : X,

< y, z,y E ES:

= x0,u E

S}.

Corresponding to the last two terms on the right of (14.26), we have

Here is the main place we have to pay for the method. Because of the interaction, one can not replace u ~ ~and( ~~l ~ ), ( -~ u ) ~ , ( ~ with ) u ~ ~and( U Y V ( t )- U X u ( t ) ' respectively. Otherwise, the system would become independent one for which the process is exponentially ergodic. Combining (14.26) and (14.27) with (14.28), we obtain

Hence, we arrive at

c) Let AN = [-N + l , N l d c Zd and regard AN as the torus SN = Z d / ( 2 N Z d ) ,the factor group. On S N ,we can introduce a shifting operator

530

14 EXISTENCE OF STATIONARY DISTRIBUTIONS A N D ERGODICI

in a natural way and transition invariance is meaningful. Next, for a given translation-invariant transition probability p ( u , u) on Z d , we can introduce p N ( u ,v) on S N with the same property:

Here we have identified u E SN as an element in Zd. Clearly, this ( p N ( u ,v) : u,u E S,) possesses the M-controlling property mentioned in Remark 14.9. Applying a) and b) to the present case with an obvious change of notations, we get IE>Yp(Zo(t)) 2~"(Zo(l))e-"'"-'', t21 (14.29)

0 for all t 0, this proves that Yom(0o) Set zA n = (xuA n : u E S ) . Then

>

x0(t),< ~ " " " ( t I, F-a.s.

This proves the ergodicity of the process. Actually, the convergence is exy). ponential, uniform in initial points (z,

Remark 14.14. Because p is a metric on Z+, p(z,y) = ~ , p ( z U , y U ) k U defines a metric on EO ( (Eo,p ) is a Polish space), and so we have a minimum L'-distance W ( P ,&) = inf,- s p ( x ,y)F(dz, dy). In this notation, we have indeed proved that U

21

+o

as

t -+ 00.

To conclude this section, we consider the reaction-diffusion processes with an absorbing state 6 = (6, = 0 : u E S). Note that the above proof needs only a little modification. Since the process Xo((t)stays at 8 for all t 2 0, thus, in (14.26) for instance, we can simply set i = 0 and 2, = 0. After some suitable modifications, we can prove the following result.

532

14

EXIS'I'ENCE O F STATIONARY DISTRIBCTIONS A N D

ERGODEI

Theorem 14.15. Let (uk) be a positive sequence on Z + with uo = 1 and .li := ~ u p ~ > , < ~ u03.k Suppose t h a t there exists an E > U such t h a t

where a0 = 0 and u-1 = 1. Then under (H) but b~ = 0, the reaction-diffusion processes with absorbing state 0 are exponentially ergodic, uniformly in initial points.

Applying Theorem 14.15 to the sequence Uk = 1 again, th e sufficient condition becomes infk>I(nk - b k ) / k > 0. Thus, for t.he first; Srhlijgl model, we obtain the same ergodic condition as (14.14). For the second model, the condition becomes

which is weaker than (111.15). In principle, the original process is easier to be ergodic than the absorbing one, due to monotonicity. Clearly, we should have analogue of Corpllaries 14.11 ( I ) and 14.12.

14.3 Reversible Heaction-Diffusion Processes It is natural to ask when a reaction-diffusion process is rcversible. For simplicity, we discuss this problem only for polynomial model.

other cases of 5 #

u E Z*,

x,Z

E

E.

2,

(14.33)

Assume that P O , 61,Jmotl > 0 as in the last section, then b ( k ) > 0 €or all k 3 0 and u(k) > 0 for all k 1. In order for ( q ( q 2 ) : x,2 E E ) to be a field, we need assume that p(u,'L')> 0

-

p(v,u)> 0,

u,v E E d .

(14.34)

Under this assumption, we can introduce paths, works and so OR to make a field ( q ( x , 2 ) )as in Section 7.1. Now, we look for the conditions under which (q(x?5))becomes a potential field. It is clear that for this field there is only one type of minimal closed paths. That is the triangle

14.3 REVERSIBLE REACTION-DIFFUSION

+ +

533

PROCESSES

+

+

Here, the path from z e, to z ew is due to the diffusion (z e,) + (x e,) - e, e , = z e,. For the works done by the field along these paths to be zero, it is necessary and sufficient that the triangle condition

+

+

holds for all x E E and u,ZI E Zd,u # v. Fix u # v and let z satisfy z, = z,. From the last equality, it follows that

Furthermore, the same equality gives us

(’ u(k

l)b(Ic) =:X > 0, independent of k. Equivalently, ” =A. 1) 6k+l

+

(14.36)

Based on these discussions, it is not difficult to prove (as we did several times in Chapters 7 and 11) the following result. Lemma 14.16. Under (14.34), the above field (q(z,5): z,2 E E ) is a potential field iff (14.35) and (14.36) hold.

Having the potentiality of the field at hand, the next step is to construct all Gibbs states and then prove that the Gibbs states coincide with the reversible measures or the process. However, since the potential function of the field is exactly the same as that of the field induced by the independent product of birth-death processes, we guess that a Gibbs state (i.e., a reversible measure) may be obtained by the product of the reversible measures for the birthdeath processes. That is, the independent product of Poisson measures with parameter X since (Ic l)b(Ic) = Xu(k l), Ic >, 0. The main purpose of this section is to prove this conjecture and furthermore the uniqueness of the reversible measure.

+

+

Theorem 14.17. Consider the polynomial model for which the birth-death rates satisfy Po, 61, bmo+l > 0. Suppose that ( p ( u ,v)) is translation invariant and p(u,u) = 0. Then the process is reversible iff (q(z,iE))is a potential field, i.e., (14.34), (14.35) and (14.36) hold. Furthermore, the only reversible measure is the product of the Poisson measures with parameter A.

To prove this theorem, we need some preparations. Lemma 14.18. Under the hypotheses of Theorem 14.17, if the process is reversible with respect t o T , then

(14.37) J

J

534

14 EXISTENCE OF STATIONARY DISTRIBUTIONS AND ERGODICITY

Proof: a) Let ~ ( ~ ) (=x CUES ) x z k u , m E N. Denote by x A n the element in Eo with value xu A n at every u f S . Observe that

as n + 00. By Lemma 14.13 and the monotonicity of the process, for every stationary distribution T of P ( t ) ,we have

.(p‘”’)

= s 7 r ( d s ) l E , p ( m ) ( X ( t )= )

cJ k,

7r(dx)E,Xu(t)m

U

In particular, 7r(Em)= 1,

rn E N,

(14.38)

where Em = {x : p(”)(x) < m}. b) Given f , g E b29g.t ( E ) , by Theorem 13.19, (14.38) and the dominated convergence theorem, we have

The proof is completed.

Lemma 14.19. Condition (14.37) hol;ds iff 1() (2) hold for all

and

and for all

14.3 REVERSIBLE REACTION-DIFFUSION PROCESSES

535

Proof: a) Sufficiency. Note that

1

fR9dn - 1 9 R f d n

=:I+II+III. By (l),we have

1=

c/ c1

u(&J

[ g ( z ) f ( J-: eu) - f ( z ) s ( J-: eu>]7r(d4= -II*

U

On the other hand, by (2): we have ZuP(zL, 4

uL,'u

=

1s

9 ( z - eu

+ ev)f(47@.:)

~ v P ( Wu>9(.>f(. ,

+ eu - ev).rr(dz)

u,v

=TJ

w ( u , M z ) f ( s - eu + ev)7r(dz),

here in the last step, we have exchanged u and ZI. This completes the proof of sufficiency. b) To prove the necessity, we first consider condition (2). Take y E E*, A G Zd = S. f = I l y l x E ~ \ "and g = I { y + e , - e t ) x E S \ ~ ,

If { s , t } is not contained in A, s E A but t f A for instance. Then, using the expression

f

=

I{yXy'}XES\A'

Y'E%

fY',

=: Y'EE

where A' = A U t , we may replace A and f with A' and fY', respectively, so that {s, t } c A'. Now, assume that {s,t } C A. We have

0= =

J [fog

c u,u

-g

~ l d n

2uP(u, 4 [ 9 b - eu

+ ew)f(z) - f(.

- eu

+ eu)g(.)]

X(d4

536

14 EXISTENCE OF

STATIONARY

DISTRIBUTIONS AND ERGODICI

This gives (2). As for condition (I), the proof is similar but using g = I { g + e 8 1 x E instead ~\~ of the above g .

Proof of Theorem 14.17: a) Let the process be reversible and denote by 55'the set of all reversible measures of the process. As usual, set

Let n E .%'. Applying 1,e:Mma 14.19 (1) to the function f = b,n(s : 2, = n) = a n + l ~ (:xxu = n 1).This gives us

+

7+

: 5, = 0) = z-I,

n(2 :

2,

= n) = pn/Z.

we have

(14.39)

it ~follows , Next, applying Lemma 14.19 (1) to the function f = I [ l s , = n , Z v = m that b,.ir(z : 2 , = nl z, = rn) = u ~ + ~ T :( 2z, = n 1, x, = m). So

+

Suinrning over n 2 0 gives T(Z

: Z, = 0, Z, = m ) = T ( X : Z, = m ) / Z = T ( Z :

:I:,

= O)T(:C : :cW = m )

Substituting this into (14.40) and using (14.39) we get n(2 :

2, = 12

+ 1,2, = m ) = pn+17r(x : 5, = m ) / Z = n ( x : xu = n + 1)v(x : x2.= m).

By induction, it is now easy to check that

This proves that 1221' = 1 and the unique stationary distribution is the independent of ( p n / Z : n 2 0) which is just the stationary distribution of the marginal birth-death processes. Finally, applying Lemma 14.19 (2) to the function f=1[2u=n, r , = n a + l ~ , we obtain

+

(n I)p(u,v)n(z: zll = n -t 1, x, = m> = (,m l)p(v, u>n(z: 2, = n, 2 , = rn + 1).

+

(14.41)

14.3 REVERSIBLE REACTION-DIFFUSION PROCESSES

537

In particular, setting m = n, we get p ( u , u ) = p(u, u),and so p ( u , u)= 0 a p ( u , u ) = 0. This proves the necessity of (14.34) and (14.35). Now, assume that p ( u , u ) > 0. Then (14.41) implies that

Thus

+

( n 1)bn T ( 2 : 2 , = m) = (m+ 1)7r(2 : 2 , = m+ 1). %+I Summing over m 2 0, we obtain

c

(n + 1)bn = an+1 m=l 03

m T ( X :

x, = m ) =: A,

independent of n 2 0, which gives us (14.36). By Lemma 14.16, we have proved that ( q ( 2 , 2)) defined above is a potential field. b) Let (14.34), (14.35) and (14.36) hold. Then (14.37) holds. In the finite dimensional case, (14.37) implies the reversibility of the process with respect to the independent product of Poisson measures with mean A. To pass through from the finite dimensional case to the infinite dimensional one, let f, g E b%?yl(E). We need to show that

which follows from the proof d) in the proofs of Theorems 14.1 and 14.2 (regarding gdTn and g d r as dTn and d x respectively). A further problem concerned with the reversible reaction-diffusion processes is the ergodicity. By using the monotonicity and the estimate of the first moment, we see that the process X n ( t ) starting from 5 , = n will have a weak limit for some sequence t k ---f 00. In particular, we have a stationary distribution p as a limit of X o ( t ) starting from 0 = (0, = 0). Because of Lemma 14.1% it is even true that X W ( t ) := limn+m X n ( t ) ( t >, 1) has a weak limit ,G. Again, by monotonicity, for any stationary distribution p , we should have p < p. Thus, whenever p = p, the process must be ergodic. In the present case, since p and ji both are translation invariant, the last assertion is equivalent to say that the only translation invariant stationary distribution is the reversible measure p given in Theorem 14.17. Based on these observations plus some computations on free energy, Ding, Durrett and Liggett (1990) proved the following result. Theorem 14.20. Under the hypotheses of Theorem 14.17, the reversible reaction-diffusion processes are ergodic.

538

14. EXISTENCE OF STATIONARY DISTRIBUTIONS AND ERGODICI

Refer to the cited paper for a proof. Refer also to 515.3 for more results on this class of processes. It is a good chance to look at the spectral gap for this infinite dimensional process.

Theorem 14.21. For reversible polynomial model, we have gap(S1) 2 gap(Q) > 0, where Q is the birth-death Q-matrix appeared in the reaction part of the process.

Proof: The Dirichlet form of the process consists of the reaction part and diffusion part. Ignoring the diffusion part, we get a smaller one, which is the sum of the forms of independent birth-death processes. By Theorem 9.5, we have gap(R) 3 gap(&). To see that gap(Q) > 0, one may use Theorem 9.25 (4). An easier way to see this qualitative result goes as follows. Applying Corollary 4.49 to hi = 1 i shows that the birth-death process is exponentially ergodic and so the assertion follows by Theorem 9.15 (2).

+

14.4Notes Section 14.1 is taken from Chen (198613, 1989b), but the proofs are now simplified. Some analogies of Theorems 14.1 and 14.3 were appeared in Basis (1980). An analogue of Theorem 14.2 was obtained by Huang (1987). Under the hypotheses of Theorem 14.3, the existence of stationary distribution is due to Zhang (1999). Meanwhile, some progress about the ergodicity for reaction-diffusion processes were made by Ding, Durrett and Liggett (1990) and Neuhauser (1990). In the paper by Ding et al, Lemma 14.13 was appeared. The present proof is an extension to the original one and works also for non-polynomial case, a complete exploration is contained in Chen, Ding and Zhu (1994). The sufficient conditions (14.14) and (14.15) were proved in Chen (1986b). Then, it was proved by Neuhauser (1990) that the processes are ergodic when pis and 62s are all large enough. Part (2) of Corollary 14.11 is taken from Chen (1990). Sections 14.2 is taken from Chen (1990, 1995), some other criteria and their comparison are also appeared there. Finally, Theorem 14.17 is mainly based on Zhu (1990). The ergodicity for reaction-diffusion processes with absorbing state was studied by Li (1995) with more direct approach (without using coupling). A related result will be stated at the end of Section 15.3. Based on Chapter 8, the large deviation principle for reversible reactiondiffusion processes was proved by Chen (1996a,b) completely in the finite dimensional case and partially in the infinite dimensional case. Comparing the irreversible case with the reversible one, it seems that the ergodic theorems may be further improved.

Chapter 15

Phase Transitions This chapter is devoted to the study on phase transitions for the reactiondiffusion processes. Of course, we have to restrict ourselves to some more concrete rriodels. Our first model is the linear growth model for which we adopt the moments method. The second iiiodel is rioriliriear but having 8 = (@, = 0 : u E Ed) as an absorbing state, for which we usc thc graph rcprcscntation method. Finally, &s an approximation, we study a time-inhomogeneous Q-process, instead of the infinite dimensional reactiondiffusion processes, to exhibit the phase transition. The last approach is often called the mean field method in the statistical physics. As a preparation, in the next section, we introduce an important “dual” formula. 15.1 Duality

Unless othcrwise stated. throughout this chapter, assume that ( p ( u ,u)) is a genord random walk (Lc: a translat,ion-invariant transition probability on S = E d ) with p(u,u) 0. Moreover, the rates for the diffusion part of the reaction-diffusion processes are fixed: x:,p(u, v). That is

=

Qdf(4

= .&uP(U,

).

[f(. - eu + e l J - f ( 4 ] ,

(15.1)

u.v

Define

and the dual operator Q X Y ) = CY(.)P(./4[f(Y

- eu + e u > f ( w ) ] , -

Y

E(f)-

PL,V

Note that the rate ( p ( v , u ) )for diffusions used in the last line is the dual of the original ( p ( u ,v)). Next, define a dual function (Poisson polynomial) D : ~ ( f x) E 4 Z+ as follows

W

+

where n(’) = 1, n ( k )= ,n(n- 1) - - ( n - k l), k 2 1. The main rcason we adopt the notation n ( k )is that €or a Poisson random varia.ble E with mean A, we have E$k) = A k , lc 2 0. The next result exhibits the dual relation between !& and sZ;C, in terms of the function D. 539

15 PHASE TRANSITIONS

540

Lemma 15.1. Let S be finite. Then (fldD(Y7

9)(4= (flm, 4)(Y),

r, Y E E.

Furthermore

ED(y,X(t))= W Y ( t ) , z ) ,

z, y E El

where IE is the expectation of the independent product of the R&process starting from y and the Rd-process starting from 2.

Proof: a) By definition,

x

[ (u) (u)- 1) 2

(2

(.(V)

+1)(y(v))-

x (u) 2 (u) (Y(U))z(v) I:y(V))].

Note that

We have

Here, in the last step, we have used C, p ( 7 ~11) , z 1. b) Next, set d N ) ( y ,z) = D ( y )z A N),whcre 5 A By a), we have

N

= (xu A

N

:

E S).

54 I

15.1 DUALITY

Hence

d d.5

-ED(")(Y(t

- s ) , X ( s ) )= 0,

0 6 s 6 t.

Integrating over s , we obtain

W Now, t,he second assertion of the lemma follows by letting N 4 03. Actually, (15.2) is a consequence of the integration by part.s formula. Regard X ( t ) and Y ( t )as bivariate processes with state space E x E and denote by Pd(t) and P;(t) their semigroups, respectively. Then, by Theorem 13.40, we have

Now, (15.2) follows from

For the infinite dimensional c s e , it should pointed out that the d i d relation between f l d and 02 given by Lemma 15.1 is not meaningful since D(y, -) is not a Lipschitz function. To avoid this, one may replace D with D I N ) ,and then handle the infinite dimensional case directly. But here, we would like to continue our study by starting from the finite dimensional cases. Note that corresponding t o a:,the process on E ( f j is a Markov chain. Each E(")(n3 0) is a closed set for the chain. In particular, on E('), the Markov chain has &-matrix Q = P' - I , where P = ( p ( u ,u)). Denote by p * ( t , u!u) the corresponding Q-process. Since the particles evolve independently, the transition probability of the chain on E(f)is given by

whore 0 =

(QT8

= 0). In particular, for finite S , by (15.2), we have

On the other hand, since D ( N ) ( y , f bVg!(E) and D ( N ) ( y ' , x )< 00, by using a limiting procedure (a.pplying Theorem 13.8, Lemma 5.14 and the proof a>of Theorem 4.38 to the left- and the right-hand sides of (15.3), a )

542

15 PHASE TRANSITI

respectively), we see that (15.3) still holds for S = Zd. Again, letting N + 00, we get

We have thus proved that the zero range process (Rd-process) has a dual (Ri-process) which is also a zero range process (and so the dual is called self-dual). For the reaction-diffusion process with generator R = Rd+R,, even though the above dual formula does not hold, but the same argument leads to the following result.

Proposition 15.2. For the polynomial model or the linear growth process with diffusion part as above, we have

Y E E(f),

2 f

(15.4)

Emo+[yI,

where for the polynomial model, mo was given in Example 13.27 and for the linear growth process, mo is setting to be 0.

This formula is a starting point for the study in Chapter 16. In the next section, we will use this formula to compute the first two moments of the linear growth processes.

15.2 Linear Growth Model In this section, we study the linear growth model:

b ( k ) = Alk) u ( k ) = A&,

k 2 0,

A1,AZ

> 0;

(15.5)

with diffusion part given in the last section. As was mentioned before, since ( p ( u ,w)) is a random walk, we can choose a positive summable sequence (ku) and an M > 1 so that

Recall that the convergence in finite-dimensional distributions coincides with the weak convergence in ~ ( E owith ) respect to the induced topology (cf. ) Remark 4.6). As usual, we call P ( t ) ergodic if there is a p E ~ ( E osuch that vP(t) + p as t + 00 for all v E P ( E 0 ) . The phase transition of this model is described as follows.

15.2 LINEARGROWTHMODEL

543

Theorem 15.3. For the linear growth model, the following conclusions hold. (1) If XI

t

< A,,

then the process is ergodic. More precisely, v P ( t ) v E 9 ( E o ) ,where So is the point mass a t 8. Az, then the process is non-ergodic.

+ 60 as

-+ 00 for any

(2) If A1

>

The key to prove this theorem is based on the fact that the first moment for the model is known explicitly. Actually, by (15.4), for IyI = 1, y = eu(say!), we have

W q Y , X ( t ) )= & L ( t ) Y’

+

= CP*(t,U,u’)zu’ (A, -A,) u’

It

dsCp*(t,’

S,U,U’)lE,X,‘(S).

From this, we obtain the following result. Lemma 15.4. Let p u ( z ) = x,, z E Eo, u E S = Zd. Then

P(t)p,(x) = e(X1-Xz)t

CP*(t) u , v)p,(J;).

(15.6)

A direct way to prove this lemma goes as follows. By Theorems 13.19 or 13.20, we have $P(t)f = P(t)flf for all f E Yip(Eo)(E). Since p, E B p ( E o ) ( E )and f l p u ( z ) = ( h- h)p.ll(.)

+&

m ( P ( W

- SU,),

21

we get

This differential equation is linear, so its solution is unique. A simple computation shows that the solution is given by (15.6). Next, we consider the second moment which is the case that IyI =.2. Fix y = e, e,. Since f12,D(y,z)= 2(X1 - Xz)D(y, x) ~X~J;,S, by (15.4), we have

+

+

E A Y , X ( 4 ) = C P * ( t , Y ,Y’)D(Y’,4 Y’

544

15 PHASE TRANSITIONS

fROM THIS, WE CLAIM THAT THE SOLUTION OF THE SECOND MOMENT IS GIVEN BY (15.8) BELOW. lEMMA 15.5.

Proof: Denote by by gf)(t,y) the right-hand side of (15.8). Then

Combining this with (15.8), we see that f(t,y) is a solution to Eq. (15.7). .But the solution is unique, so we have proved the required conclusiuon

15.2 LINEARGROWTHMODEL

545

Before going further, let us point out that the upper bound of P ( t ) D ( y ,x)

(Ivl = 2), which is what we need only to prove for the main theorem, can be also obtained in a different way without using the "dual" formula. Consider first the case that I S1 < 00. Then, the estimate follows from Lemma 4.12. Since the estimate is independent of 151 ' , the assertion then follows by a truncation argument. Lemma 15.6. Let ( E , p , $ ) be a metric space. Given { p n } c 9 ( E ) and m, S > 0 , if pn + p and CJ := supn pn(pmt6) < 00, then pn(pm) p(p") as ---f

n

--f

00.

Proof: Obviously, lim:

,oop n ( p m )

p(p"). On the other hand,

Letting n + 00 and then N 00,we get Kn+oo pn(pm) < p(pm). Proof of Theorem 15.3: a) From Remark 14.9, we have seen that the process is ergodic whenever XI < X2. b) Now, let XI > X2. Denote by X1((t)the process starting from 1, which is the configuration having value 1 at every site u E Z d . Let pt be the distribution of X i (t)/lEXi ( t ) .Then pt E 9( [0,00)). Moreover, ---f

LNTPL(dT)

= EX,l(t)/IEX,l(t) = 1.

By Theorem 4.4, we can choose a sequence { t n } so that pt, 9( [ 0 ,00)). On the other hand,

* some p E

where yo = 2eo. But by Lcmma 15.4 and Lemma 15.5,we have

P(t)D(yO,1) = e2 ( x 1 - x z ) 1

(Cp*(t, 0, u))a U

15

546

PHASE 'I'RANSI TIONS

11;Iollows that sup t>O

/*

r2pt(d?-) < 00.

0

Thus, by Lemma 15.6, we can choose a constant c > 0 such that p ( ( c ,m)) 2 E > 0. Then pt,,((c,m)) 2 ~ / 2 > 0 for all n 2 no = no(&). Set A ( t ) = E X t ( t ) . Noting that either X & ( t )= 0 or X,'(t) 2 1, we obtain

P ( L ) ( p oA 1)(1)=E[xA(t,)A 11 2 lE[X,'(t,) A 1 : Xi(Ln) > cA(tn)] = P[XCx,'(tn)> cA(t,)] ((c, I.) 2 E/2. R u t p0A1 E bWyl!(E),it is impossible that 6,P(t) + 60 as t 4 co.Therefore, the process ( X ( t ) ) is non-ergodic. I In view of Theorern 15.3, the only remainder case for this model is that A, = A,. To discuss this situation, we need some notations. Let = pt,,,

9 = the set of stationary distributions of P ( t ) , 9 = the set of translation invariant measures, Ye= the set of the extreme points in 9, LPm(E)= { p E Y ( E ) : JxFp(dx) < oo}:

+

and set p(u,v ) = ( p ( u ,v ) p ( v , u ) ) / 2 . Theorem 15.7. Under the assumptions of Theorem 15.3, suppose additionally t h a t ( p ( u , v ) )is transient. Then

(1) For each p

/

> 0, there is uniquely an vP E 9 n 5@satisfying

/

xcovp(dx) = p,

W v V p ( d 2 ) = P(P

+ L) + 2x1

lm

F ( 4 Zt.7 v)&

where p ( t ) is the Q-process determined by the &-matrix 2(p- I ) (2) Let p E 9 n 9 n LP1(E),then there is a X E 9 ( ( [ 0m)) , such that

P =;J vPX(dP). (3) Let p E Yesatisfy J x o p ( d x ) = p Moreover, if p E 9n 9 2 ( E ) ,then

0.

k-*m t-+w

f

pP(t) = v p .

Zd.

15.3 REACTION-DIFFUSION PROCESSES

WITH

ABSORBINGSTATE

547

One of the key to prove this theorem is that the second moment for this model is also explicit. This is a common point in the study of such type of results (cf. Liggett and Spitzer (1981)). Since a complete proof of Theorem 15.7 is quite lengthy and this finer result does not interfere too much the picture of the phase transition, we omit the details here. Refer to Ding and Zheng (1989).

15.3 Reaction-Diffusion Processes with Absorbing State

It was proved in Section 14.2 that the reaction-diffusion processes are ergodic whenever the pure birth rate is large enough. In this section, we consider an opposite extreme case that the pure birth rate vanishes. For which, we show that the processes can be non-ergodic and hence there exist phase transitions. Theorem 15.8. Take S = Z1. Consider the reaction-diffusion process ( X ( t ) ) with birth-death rates b ( k ) = Xk and a ( k ) [a(O)= 01 being arbitrary and with diffusion coefficient x,p(u, w), where ( p ( u ,w)) is the simple random walk in Z1: p ( u , v ) = 1/2 iff Iu - wl = 1. Then A, := inf { A :

IP [ ~ ' ( ( t ) $ o for

where Xo((t)is the process starting from

all

t > 01 > 0} < 00,

xo : z: = 1 and zt = O(u # 0).

The proof of this theorem is based on a comparison of the process with an oriented percolation. Let A? = { (m,n ) E Z2 : m n is even, n 2 O}. For each z E 2, draw an oriented bond from z to z (-1,l) and to z (1,l). Suppose that each z E 2 is independently open (denoted by q ( z ) = 1) with probability p E (0,l) and closed (denoted by q(2) = 0) with probability 1 - p . Given z j E 2, j = 0, .. . , k , (zo, ' . ,z k ) is called an open path from zo t o zk, denoted by zo z k , if ( z j - 1 , z j ) is a bond and zj is open for all j . Next, let

+

+

+

+

y-t

The first problem in the study of the percolation is looking for p , := inf ( p : itD (st,)

Lemma 15.9. 1/2

> o}.

< p , < 80/81.

Proof: a) The lower bound is obtained by comparing the percolation with a branching process (0 = 1, P [El = 01 = (1 - P ) ~ ,IP[& = I] = 2p(1 p ) , IP [ 1 or E (1 6 1 respectiveIy. From this, we see that pc 2 1 / 2 . b) The upper bound is harder to estimate than the lower one. Here, we adopt the contour method. Let

For l C ~ l< co, let r be the boundary of the unbounded component of (R x (-1, co)) \ W N .Such a r is called a contour. Note that for a contour I? of length m to exist there must be at least m/4 sites outside of r which are closed (we will prove this below) and the shortest possible contour has length 2 N 4. As an analogue of the Peierls' inequality, we have

+

for p > 1 - 3-4. The right-hand side is less than 1 for sufficient large N . From this and IP[ICNI= 001 6 (N+l)IP[ICol = 001, we claim that p , 6 80/81.

(-2N,

- 1)

(0, -1)

Now, we return to prove the %/4" assertion as promised in the last paragraph. Let us orient I' in such a way so that the segment (0, -1) + (1,O) to be positive. Clearly, there are four types of oriented bonds: \ J \ / * . Label them by i = 1,.- . ,4,respectively. Set mi = the number of bonds of

15.3 REACTION-DIFFUSION PROCESSES

WITH

549

ABSORBINGSTATE

4

i-th type. Then, m = mi. Next, for a bond L of type i (z = 1,2), if we stand at the midpoint of 1 and face in the direction of the orientation of I?, then the site closest to our right hand, denoted by zg, must be closed. Note that howcvcr, two bonds .t and !’ may share the same site xg. Because 1’s and 2’s bonds decrease our z coordinate by 1 and 3’s and 4’s increase: it by 1, and on the other hand, the contour r starts at (0, -1) and ends at (-2N, - l ) , we have (ml m2) - (m3 r n 4 ) = 2N. Thus 2N+m m rnl -1 7743 = 2 2F

+

+

and hence (rnl f m2)/2 2 m/4 a s desired. For our purpose, we need to generalize the above result to a slight more general situation.

+

Definition 15.10. Let Il(rn,.)I[ = (Iml Inl)/2 and q : ( R , 2 ) + (0,l). We call 7 l-dependent if q(zl), . . . ,~ ( 2 , ) are independent for all zl,. . , z, E 3 with IIx, - zjll > 1 for i # j . +

Lemma 15.11. For l-dependent percolation q , we have p , < 1- 3-36. Proof: The previous proof works also for the l-dependent case except one estimate is changed as follows. Observe: that for each (m,n,)E 2, thcre are 9 sites in 2 with \i(m,n)II < 1, so for each I’of length rn there is a set of m/36 sites which are separated by more than 1 and which must be closed for the contour to exist. Therefore

P “Cl, for 1 - p < 3-36. 1

< w]

< corlst.(9(1

-

p)”/’y

Proof of Theorem 15.8: a) The idea is to make a comparing of the rcaction-diffusion process with a l-dependent oriented percolation. Clearly, for fixed site m E Zl,the reaction-diffusion process dominates a birth-death process (Bt)t>owith rates

An: 4n,n-1 = 44 + ?a. Let To = inf{t : Bt = 0). Take N and X so large that e-N/2 Qn.nS-1=

< 3-38

< 3-38, PI [ inf B~ < N] < 3-38. l 0 is a

constant depending on

+

+

/32,

61 and

+ 663) and ,OO E [0, c),

83.

Proof a> Let PO > 0. Then f ( 0 ) < 0 and f ( A ) > 0 for all sufficient large X > 0. Hence, there exists at least one positive root of f(A) and all positive roots belong to a finite interval (0, XI. This proves (913 1. b) P u t p ( i ) = i, ph, = p r\ N and T~ = inf{t : X ( t ) > R}+Since thr,

P,V(W

/Id) -

j

nsP,(X(s))ds

0

is a Pi-martingale, we have

(15.12)

Set C = supkao [ b ( k ) - u(C) + 2k] < w. From {15.12), it, follows (;hat,

Letting N + 03, we obtain

By Gronwall’s lemma, we have

~ p ( X (At T ~ ) )< i + Ct +

et-s(i

+ Cs)ds = -C + (i + C)et,

15.4 MEANFIELD hhTHOD

d

c - m(t).

-m(t) = lEin,p((x(t))6

dt

IIence

( 15.13) c) To prove the ergodicity, we adopt the coupling of marching soldiers. Given two stationary distribution r1 and 7r2 with the first moments m1 and m2, respectively. Denote by fit and the coupling operator and the expectation of the coupling process, respectively. Note that

We have

fit(X1(k)- X2(t)1 < cLIX1(t) - X2(t)l + IX'(t)

X2(t)I,

(15.14)

where ci is given by Remark 13.16. In virtue of (15.13) and (15.14), we get

% I X l ( t ) - X 2 ( t ) [= fiiltlXl(t) - X2((t)(< (Ci + l)EIX1(t) - X"(l)(. dt This plus the assumption c$

+ 1 < 0 gives us

d) To prove the last assertion of the corollary, consider first the case that

,& = 0 and rewrite f ( A ) as Ag(X):

+ ?(A

- k)

(A

+

U.J

k=4

Clearly, g(0) > 0, g(1)

bl) . .

.

(A *

+ bk-1)

ak

< 0 whenever 61 <s;

1 2

0) is the independent product of the Poisson measures for which P E ( 4= P ( 4 ,

zf

E Zd,

where p is a non-negative, bounded C2(Rd)-function with bounded first derivative. Denote by JE:c the expectation of the process with generator RE and initial distribution p'. The main result of this chapter is as follows. 555

16 HYDRODYNAMIC LIMITS

556

, r d )E Rd and t 2 0, the limit

Theorem 16.1. For all

T

where [TIE] = ( [ T ~ / E ] :. equation:

, [ r d / 4 E) E d , exists and satisfies the reaction-diffusion

I

=

(TI,.*

-

I f(0,4=P(4. -

Furthermore, for any n >, 1 :,rl, . . ,T , E Wd and t

> 0, we have

where

A particular ca,se is that p is a constant.. Then pE is translation invariant and hence f ( t ,r ) = f ( t , 0) satisfies the equation m+l

(16.4)

Of course, if f is a constant^ then it must be a non-negative root of the cyiiation

(16.5) j =O

j=1

Definition 16.2. A non-negative, spatially homogeneous solution fo(t) t o Eq. (16.2) is called an equilibrium if it satisfies Eq. (16.4). An equilibrium solution fo(t) t o Eq. (16.2) is called stable (resp. asymptotically stable) if for every E > 0, there exists a 6 > 0 such that for any solution f ( t , r ) t o Eq. (16.2), whenever If(0,r) - fo(0)I< 6, we have I f @ , r ) - fo(t)l < E for all t > 0 (resp. h 4 m If(tl4 - f"(t)l = 01. Theorem 16.3. Let p.A be the independent product of identical Poisson measures with parameter X > 0 and EzA be the expectation of the process with > Ah the generator flE and initial distribution p'. Denote by A1 > A2 > non-negative roots of (16.51, where X j has multiplicity mj. Then, the nonnegative equilibrium solution f ( t , r>5 X i is asymptotically stable iff mi is odd and

CjGi-l rnj is even.

16.1 INTRODUCTION: MAINRESULTS

557

This result describes the critical phenomenon for a non-equilibrium system in terms of the reaction-diffusion equation. We now compare Theorem 16.3 with the results obtained in the previous chapters. First, consider the reversible case. Because Pj = aSj+l for some Q: > 0, j = 0 , . . . , m, the equation (16.5) becomes m

m+ 1

m+l

j=O

j=1

j=1

which has only one non-negative solution 8 = a. By Theorem 16.3, the solution is asymptotically stable. This conclusion is consistent with Theorem 14.20, which says that there is no phase transition in the reversible case. Next, consider the first Schlogl model with PO = 0. Then, Eq. (16.5) has two roots: X I = (PI - &I)/& and A2 = 0. It is easy to see that X I is asymptotically stable but not X2. This conclusion is certainly reasonable since there is a phase transition whenever PI is large enough (Theorem 15.8). However, if PO > 0, then there is only one non-negative root and hence asymptotically stable. From this, one may conjecture that there would be no phase transition for the first Schlogl model and there would exist phase transition for the second Schlogl model since for which not every solution being asymptotically stable (we will come back to the second model soon). These conjectures remain unsolved. Certainly, in these two different contexts the objects are actually quite different. There is a scaling factor E - ~( E J, 0) in front of the diffusion rate z(u)p(u,v)in the study of hydrodynamics in order to obtain the Laplacian in the equation. Thus, to regard Eq. (16.2) as an approximation of the particle systems, as indicated by (16.1), the diffusion rate should be large. Alternatively, if we fix the diffusion rate to be 1, then the reaction rates u k and bk should be replaced by E ~ U ~ respectively. Thus, the above comparison makes sense for those u(k) and b ( k ) up to a sufficient small scaling factor. The above two theorems will be proved in Sections 16.3 and 16.4 respectively. Some preparations are presented in the next section. To conclude this section, we explain the reason why we choose the coefficients in Corollary 14.12. Note that for the second Schlogl model, the role played by each of the parameters ,& and & is not clear at all. It seems too hard and may not be necessary to consider the whole parameters. Based on the above observation and to keep the physical meaning (the details are given below), we fix p2 = 6 a ( a > 0), 61 = 9a and 63 = a. Then, when Po E (0,4a), there are three roots A1 > A2 > A3 2 0. By Theorem 16.3, XI and A3 are asymptotically stable but not A2. When PO = 401, we have A2 = 1 with ml = 2 and XI = 4, A1 is asymptotically stable but not X2. As for PO > 4a, there is only one non-negative root which is certainly asymptotically stable. Hence, we guess that the ergodic region of Po should be

558

16 HYDRODYNAMIC LIMITS

located in (4a,m) for sufficient small a. Of course, the assertion is true in the reversible case, for which, we have Po = 36a (Theorem 14.17). On the other hand, as mentioned in Durrett and Neuhauser (1994) that the reactiondiffusion equations are usually the end of the study of hydrodynamical limits of the reaction-diffusion processes. But we can also go to the opposite direction, i.e., using the reaction-diffusion equation to investigate the microscopic processes. The main point used in the above quoted paper to prove some kind of phase transitions for the reaction-diffusion processes with absorbing state xu = 0 is t o look for the critical value at which the speed of the traveling wave solution to (16.2) changes its sign. Let us mention, without details, that in our present situation, this critical value is Po = 2a. From this point of view, the phase transitions would be appeared when ,& f (0, 2a). Based on these considerations, we propose a typical non-trivial case, for which we have more precise picture as shown in Corollary 14.12. We now go to the details. The main point is that, to keep the essential meaning of the model, we should choose the parameters so that the equation

Po + p2x2 - S I X - 63X3 = 0 contains a non-asymptotically stable root. Of course, we can take a = 53 = 1. Let X = z ,&/3. Then, the equation is reduced to z3 p z 2 q = 0, where 1 2 1 2 4 = -Po 3@2&P = 61 -

+

+

,a,

+

zP;*

+

+

When y2/4 p 3 / 2 7 > 0, there is onIy one real root, which is necessarily positive and asymptotically stable. Hence, the only interesting case is that y2/4 + p3/27 < 0. Solving the equation y2/4 + p3/27 = 0 in variable Po, we obtain (1) -

Po

(2)

-

-/%(a&

- -P2(2&

Po -

- 961) - 2(@ - 3 6 1 ) ~ ’ ~

, 27 - 961) 2(@2”- 3 6 1 ) ~ ’ ~ 27

@g)<
0 iff PO > 4. If so, there is only one non-negative root. Next, q2/4 t-p3/27 < 0 iff ,& < 4. In that case, we have three non-negative roots given in the last formula. Finally, when PO = 4, we have XI = 2 with multiplicity 2 and a single root X I = 4. Thus, as mentioned at the second to the last paragraph, for every PO E (0,4] there is precise one non-asymptotically stable root but there is no such root for all Po E (4,oo). We have thus arrived at the desired position. In our particular situation (Po = 2)) the three roots are 2 2, 2 A.

+

a,

+

16.2 Preliminaries

Let co = suprp(r). Denote by po the independent product of identical Poisson measures with mean co. Lemma 16.4. For every

E

> 0, we

have

(1) pE )= CYl P"(t, 9 , d ) l - L E ( W9) , and (2) E&D(Y,X ( t > > = CYl P"(t, Y, Y/)P"(D(Y', + ds CYlP"(t - s, Y,Y ' ) q [ R A Y ' , where p'(t, y, y') = p ( ~ - ~y,t 3, ') .

9)

,

X(S))]

In view of part (2) of the proposition, to study the convergence of ELE,we need to estimate p " ( t , y, y'). For this, we have the following result.

Lemma 16.6. Let P = ( p ( u , u ) ) be a general random walk on Z d such that p(0,O) < 1 and let ( p ( t , u , u ) )be the Q-process with Q-matrix Q = P - I . Then there exists a constant C > 0 such that

p(t,u,v) < C / h ,

t > 0, u , v

E Zd.

Proof: Clearly, the characteristic function cpt(a)( a E Rd) of the distribution ( p ( t ,0, .)) is given by

16.2 PRELIMINARIES

56 1

As the convolution of the distribution p ( 0 , u) =: pu with itself n-times, the characterstic function of P(~)(O,u) should be as follows:

where U

Hence p t ( a ) = exp [t(cp(a)- l)].Next, by the inverse formula,

Note that

and that

{

exp t p , (cos

c,"=, uiai - I)} 6 1,

Besides, we may assume that

B

:= p(,',,.. +dl

uE

~

d

.

> 0 with nd # 0, Then

P ( t , o,u>

Here in the last step, we have changed the variables

pi = ai,

1

, 6/2 or t , t' 6. For the latter one, the proof is easy:

X1. Note that if B ( X , t o ) = A1 for some to > 0, then O(X, t ) -= XI for all t 2 t o since the uniqueness of the solution to Eq. (16.19). Thus

qx,t> x1, whenever X > XI. root and limk,, for X > A1

d -qx,

at

t 2 0,

(16.22)

On the ohher hand, since XI is the largest non-negative [C,"l-o&kj - CT2' djk3] = -00, by Eq. (16.4), we have

t ) = -6,+1(0

-X

p . .. (0- Xk)mkg(B)

< 0,

(16.23)

where 9 3 0 011 [0,ca}. This means that 8(X, - ) ( A > XI) is nan-increasing and so the limit limt,w B ( X , t ) 2 XI exists. By Eq. (16.4), we have

Letting t 7 00, since both the integral and its integrand on the right-hand side converge, we have m

m 1.1

This plus thc existence of lirnidW @(A, t ) certainly gives us thc desired assert ion.

16.5 NOTES b) Let X i

< X < Xi-1.

571

Then, as an analogue of (16.22), we have

mj is even (resp. odd), then f?(A,t) is nonMoreover, by (16.23), if increasing (resp. non-decreasing) in t. Therefore, the proof a) gives us

< <
entso n the age distribution of Markou processes, Adv. Appl. Prob. 13,681-703. Park, Y. M.(1988a), Extension of Pirogou-Sinai theory of phase transition t o infinite range interactions (I), Comm. Math. Phys. 114,187-218. Park, Y . M.(1988b), Extension of Pirogov-Sinai theory of phase transition t o infinite range interactions (n), Comm. Math. Phys. 114,219-241. Parthasarathy, K . R.( 1967), Probability Measures on Metric Spaces, Academic Press. Popov, N. S.(1977), Conditions f o r geometric ergodicity of countable Markov chains, Soviet Math. Dokl. 18,676-679. Perrut, A (ZOOO), Hydrodynamic limits for a two-species reaction-diffusion process, Ann. Appl. Prob. 10 no. 1, 163-191. Preston, C . J.(1974), Gibbs States o n Countable Sets, Cambridge Univ. Press. Preston, C. J.(1976), Random Fields, LNM., Springer-Verlag. Qian, M. and I-IOU,Z. T.(Editors)(l979), Reversible Markov Processes (in Chinese), Hunan Sci. Press, Ilunan. Qian, M. P.(1978), O n the reversibility of stationary Markov chains (in Chinese), J. Beijing Univ. no. 4, 158-184. Qian, M. P. and Qian, M.(1979), Irreversibility, detail balance and circulations, in “Reversible hlarkov Processes” (in Chinese), edited by M. Qian and Z. T. Hou, 151-193. Rachev, S. T.(1982), Minimal metrics in the random variables space, Publ. Inst. Stat. Univ. Paris 27 no. 1, 27-47. Rachev, S. T . (1991), Probability Metrics and the Stability of Stochastic Models, Wiley, New York. Ren, K. L.( 1983), Potentiality and reversibility for the N-spin-flip processes and N-generalized N-spin-flip processes, Acta Math. Sci. 3,300-320. Reuter, G.E. H.(1957), Denumerable Markov Processes, Acta Math. 97, 1-46. Reuter, G . E. II.(1959), Denumerable Markov Processes (n), J. London Math. Soc. 34, 81-91. Reuter, G. E. H.(1961), Competition processes, in Fourth Berkeley Symposium on Math. Stat. and Prob. 2 . , 421-430. Reuter, G . E. H.(1962), Denumerable Markov Processes (m),J. London Math. SOC.37, 63-73. Reuter, G . E.H.(1976), Denumerable Markov Processes ( N ) ,on C. T . HOU’Suniqueness theorem for Q-semigroups, Z. Wahrs. 33,309-315.

BIBLIOGRAPHY

585

Roberts, G. 0. and Rosenthal, J. S. (1997), Geometric ergodicity and hybrid Markov chains, Electron. Comm. Probab. 2 , 13-25. Rockner, M. and Wang, F. Y. (2001), Weak Poincare‘ inequalities and L2-convergence rates of Markov semigroups, J. Funct. Anal. 185 no. 2, 564-603. Ruschendorf, L. and Rachev, S. T.(1990), A characterization of random variables with minimum L2-distance, J. Multi. Anal. 32,48-54. Schlogl, F.(1972), Chemical reaction models f o r phase transitions, Z . Phys. 253,147-161. Schonmann, R. H . (1994), Slow drop-driven relaxation of stochastic Ising models in the vicinity of the phase coexistence region, Commun. Math. Phys. 161,1-49. Shiga, T.(1988), Stepping stone models in population, genetics and population dynamics, in ‘Stochastic Processes in Physics and Engineering”, edited by S. Albeverio et al., 345-355. Shlosman, S. B.(1986), T h e method of reflection positivity in the mathematical theory of first-order phase transitions, Russian Math. Surveys 43,83-134. Silverstein, M.L.(1974), Classification of stable symmetric Markov chains, Indiana Univ. Math. J. 24, 29-77. Silverstein, M. L.(1976), Symmetric Markov Processes, LNM., Springer-Verlag. Sinai, Ya. G.(1982), Theory of Phase Transitions: Rigorous Results, Pergamon Press. Sokal, A. D.( 1981), Existence of compatible families of proper regular conditional probabilities, Z. Wahrs. 56,537-548. Sokal, A. D. and Thomas, L. E.(1988), Absence of mass gap for a class of stochastic contour models, J . Statis. Phys. 51 no. 5/6, 907-947. Spitxer, F.(1970), Interacting Markov processes, Adv. Math. 5,246-290. Spitzer , F.(1975), Principles of Random Walks, Springer-Verlag. Spitzer, F.(1981), Infinite systems with locally interacting components, Ann. of Prob. 9 no. 3, 349-364. Spohn, H. (1992), Large Scale Dynamics of Interacting Particles, Texts and Monographs in Physics 342, Springer-Verlag, Heidelberg and Berlin. Strassen, V.(1965), T h e existence of probability measures with given marginals, Ann. Math. Statist. 36,423-439. Stoyan, D.and Daley, D. J.(1983), Comparison Methods for Queues and Other Stochastic Models, Wiley. Stroock, D. W.(1978), Lectures o n Infinite Interactzng Systems, LNM. Kyoto Univ., Kinokuniya Bookstore. Stroock, D. W.(1981), O n the spectrum of Markov semigroup and the ezistence ofinvariant measures, in “Functional Analysis in Markov processes”, Proceedings, edited by M. Fukushima. LNM. 923, 287-307. Stroock, D. W.(1984), An Introduction to the Theory of Large Deviations, Springer. Stroock, D. W. and Varadhan, S. R. S.(1979), Multidimensional Diffusion Processes, Springer-Verlag, New York. Stroock, D. W. and Zegarlinski, B. (1992), T h e logarithmic Sobolev inequality for discrete spin systems o n a lattice, Comrn. Math. Phys. 149 no. 1, 175-193. Sullivan, W. G.(1984), T h e L2 spectral gap of certain positive recurrent Markov chains and j u m p processes, Z . Wahrs. 67,387-398. Szulga, A.( 1978), O n the Wasserstein metric, Transactions of the 8th Prague Conference, Prague, 267-273. Szulga, A.(1982), O n minimal metrics in the space of random variables, Theory Prob. Appl. 27,424-430. Tang, S. Z.(1982), O n the reversibility of din-flip processes (in Chinese), Acta Math. Sin. 2 5 , 306-314.

586

BIBLIOGRAP

Tang, S. 2.(1984), Construction of invuriant measures jor stochastic continuous Feller's processes with compact m e t r i c state space (in Chinese), Chin. Ann. Math. 5 no. 1, 99 -108. 'rang, S . Z.and T h , X . F.(1979), Mzaltzvariale couplings of spin-flip processes ( i n Chinese), J. Beijing Normal IJniv. 2 5 , 6-4. Thomas, L. E.(1989), B o m d s o n the mass gap .for $finite volume stochastic Ising models at low ternperahre, Comrn. Math. Phys. 126,1-11. Thorrison, H.(ZOOO), C o ~ p l Z n gStationarity, ~ and Regeneration, Springer-Verlag. Tuominen, P. arid Tweedie, R.. L . ( 1 ~ 9 ) IJxponmtial , crgodicity i n Markovian queueing and da71b 7rlodek+,J. Appl. Prob. 16,867-880. Twecdic, R. L.(l975), Suficient coizditions f o r regularity, recurrence and ergodicity of Markav processes, Math. Proc. Camb. Phil. SOC.78, 125-136. Tweedie, R. L.(1975), Crateria .for elassijyiny general M a ~ k o uchains, hdv. Appl. Proh. a, 737-771. ?'weedie, R. L.(1981), Criteria for ergodicity, exponential ergodicit?]and strong ergodicity of Markov Processes, J. Appl. Prob. 18,322- 130. Valiender, S. S.(1973), Culculntion of the Wusserstcan divtance between probability distributions on line, Theory Prob. Appl. 18,784-786. van Doorn, E. A.( 1981), Stochastic Monotonicity and Queueing Applications of RirthDeath Processes, Lecture Notes in Statistics 4, Springer. van Doom, E. A.(1985), Conditions f o r expmsntinl ergodicity and bou.nds j o r the decay parameter of a birth-death process, h d v . Appl. Prob 17,514.-530. van Doorn, E. h.(1987), Representations and bounds f o r zeros of orthogonal polynomials and eigenwalues of sign-symmetric tri-diagonal matrices, J. Approx. Th. 51 no. 3, 254-266. van Dourn, E. A.(1991) , Quasi-stationary distributions and convergence to quasi-stationarity of birth-death processes, Adv. Appl. Prob. 23, 683-700. van Doorn, E. A. (ZOOZ), Representations for the rate of convergence of birth-death processes, Theory Probab. Math. Statist. 6 5 , 36-42. Varadhan, S . R. S.( 1984): Large Deviations a n d Applications, SIAM, Philadelphia. Varopoulos, N.(1983), Brownian motion and transient groups, Ann. Inst. Fourier 33, 241-261. Veech, W.(1963), The necessity of Harris' condition for the existence of a stationary measure: Proc. Amer. Math. SOC. 14,856-860. Vere-Jones, D.(1962), Geometric ergodicity in denumerable Markov chains, Quart. J. Math. Oxford end Ser. 13, 7-28. Villani, C.(2003), Topics in Mass Tramportation, Graduate Studies in Math., AMS, Providence, RI. Wang, F. Y, (1994a), Ergodicity for infinite dimensional difjusion processes on manifolds, Sci. Sin.(A), 24 (Chinese Ed.). Wang, F. Y. (1994b), Applimtion of coupling method to the Neumann eigenvalue problem, Prob. T h . Rel. Fields 98, 299-306. Wang, F. Y. (1996a), Estimates of the first eigenvalue and the applications t o Yang-Mills lattice fields, Chinese Ann. Math. 17A:2(1996),147-154 (Chinese Ed.); Chinese 3 . Contemporary Math. 17:2(1996), 119-126 (English Ed.). Wang, F. Y. (1996b), Estimates of the logarithmic Sobolev constant for finite volume continuous span systems, J . St'atis. Phys. 84 no. 1/2, 277-293. Wang, F. Y. (ZOOO), Functional inequalities f o r empty essential spectrum, J. Funct. Anal. 1709,219-245. 'Wang, Z. K.(1965), Theory of Stochastic P.rocesses (in Chinese), Sci. Press, Beijing.

BIBLIOGRAPHY

587

Wang, Z. K.(1980), Birth-Death Processes and Markov Chains (in Chinese), Sci. Press, Beijing. Wang, Z. K. and Yang, X. Q. (1992), Birth and Death Processes and Markov Chains, Springer-Verlag, Berlin. Wasserstein, L. N.( 1969), Markov processes o n a countable product space, describing large systems of automata (in Russian), Problem Peredachi Informastii 5,64-73. Williams, D.(1964), O n the construction problem f o r Markov chains, Z. Wahrs. 3,227246. Williams, D.(1976), T h e Q-matrix problem, LNM. 511,216-234. Williams, D.( 1979), Diffusions, Markov Processes, and Martingales I: Foundations, Wiley. Wu, L. D.(1965), Classification of Markov chains with denumerable states (in Chinese), Acta Math. Sin. 15 no. 1, 32-41. Wu, B. and Zhang, Y. H. (2003), One-dimensional Brusselator model (in Chinese), preprint. Wu, X. Y.(1990), Canonical Gibbs states and reversible random fields (in Chinese), Acta Math. Sin. 33 no. 2, 277-285. Xu, X. J.(1991), Ph.D thesis (in Chinese), Beijing Normal Univ. Yan, J . A. (1988), A perturbation theorem f o r semigroups of linear operators, SBminaire de ProbabilitBs, XXII, LNM 1321,89-91. Yan, S . J.( 1990), Introduction to Infinite Particle Systems (in Chinese), Beijing Normal Univ. Press, Beijing. Yan, S. J . and Chen, M. F.( 1984), Stability of circulation decompositions and self-organization phenomena, Acta Math. Sci. 4 no. 1, 13-26. Yan, S . J. and Chen, M. F.(1986), Multidimensional Q-processes, Chin. Ann. Math. 7(B) no. 1, 90-110. Yan, S. J., Chen, M. F. and Ding, W. D.(1982a), Potentiality and reversibility f o r general speed functions (I), Chin. Ann. Math. 3 no. 5, 572-586. Yan, S . J., Chen, M. F. and Ding, W. D.(1982b), Potentiality and reversibilityfor general speed functions (II), Chin. Ann. Math. 3 no. 6, 705-720. Yan, S . J. and Li, 2. B.(1980), Probability models of non-equilibrium systems and the Master equations (in Chinese), Acta Phys. Sin. 29, 139-152. Yan, S . J., Wang, J . X. and Liu, X. F.(1982), Foundations of Probability Theory (in Chinese), Sci. Press, Beijing. Yang, X. Q.( 1981), Constructions of Time-Homogeneous Markov Processes with Denumerable States (in Chinese), Hunan Sci. Press, Hunan. Yosida, K.(1978), Functional Analysis, Firth edition, Spring-Verlag. Yoshida, N. and Higuchi, Y. (1996), Ising model o n the lattice Sierpinski Gasket, J. Stat. Phys. 84 no. 112, 295-307. Zegarlinski, B.( 1990), Log-Sobolev inequality f o r infinite one dimensional lattice systems, Comm. Math. Phys. 133,147-162. Zeifman, A. I. (1985), Stability f o r continuous-time nonhomogeneous Markov chains, LNM. 1155,401-414. Zeng, W . Q.(1983), Reversibility of two classes of infinite particle systems (in Chinese), Chin. Ann. Math. 4(A) no. 6, 763-772. Zhang, J. K.(1984), Generalized birth-death processes (in Chinese), Acta Math. Sci. 4 no. 6, 241-259. Zhang, H. J., Chen, A. Y.(2000), Stochastic comparability and dual Q-functions, J. Math. Anal. Appl. 234, 482-499.

588

BIBLIOGRAP

Zhang, H. J., Lin, X. and Hou, Z. T. (2000), Uniformly polynomial convergence for standard transition functions, in “Birth-Death Processes” by Hou, Z. T. e t a1 (ZOOO), Hunan Sci. Press, Hunan. Zhang, Y. H.(1994), Conservativity of coupling j u m p processes (in Chinese), J. Beijing Normal Univ. 33 no. 3, 305-307. Zhang, Y. H.(1996), Construction of one-dimensional order-preserving coupling (in Chinese), Chin. J. Appl. Prob. Stat. 12 no. 4, 376-382. Zhang, Y . H.( 1998), Construction of one-dimensional order-preserving coupling (continued, in Chinese), J. Beijing Normal Univ. 34 no. 3, 292-296. Zhang, Y. H.(2000a), Suficient and necessary conditions f o r stochastic comparability of jump processes, Acta Math. Sin. Eng. Ser. 16 no. 1, 99-102. Zhang, Y. H.(2000b), Stochastic comparability of the minimal j u m p processes (in Chinese), J. Beijing Normal Univ. 36 no. 2, 156-158. Zhang, Y . H.(2001), Strong ergodicity f o r continuous-time Markov chains, J. Appl. Prob. 38,270-277. Zhang, S. Y.(1999), Existence of the optimal measurable coupling and ergodicity for Markov processes, Science in China (A) 42 no. 1, 58-67. Zhang, S. Y.(2OOO), Existence and application of optimal Markovian coupling with respect t o non-negative lower semi-continuous functions, Acta Math. Sin. Eng. Ser. 16 no. 2, 26 1-270. Zheng, J. L.(1993), Phase Pansitions of Ising Model o n Lattice Fractals, Martingale Approach f o r q-processes (in Chinese), Ph. D. Thesis, Beijing Normal Univ. Zheng, J. L. and Zheng, X. G.(1986), Martingale approach for q-processes, Unpublished. Zheng, X. G.(1981), Symmetrizable Markov processes in abstract slate space (I) (in Chinese), J. Beijing Normal Univ. no. 4, 15-32. Zheng, X. G.(1982), Qualitative theory f o r q-pi.ocesses in abstract state space, Acta Math. Sci. 2 no. 1, 1-0. Zheng, X. G.(1983a), Symmetrizable Markov processes in abstract state space (E) (in Chinese), 3. Beijing Normal Univ. no. 1, 1-10. Zheng, X. G.(1983b), Feller’s boundary in abstract state space, Chin. Ann. Math. 4(B) no. 3, 267-284. Zheng, X. G. and Ding, W. D.(1987), Existence theorems f o r linear growth processes with diflusion, Acta Math. Sin. New Ser. 7 no. 1, 25-42. Zheng, X. G. and Liu, X. F.(1984), Diflerentiability of transition functions on a topological space with countable bases (in Chinese), J. Beijing Normal Univ. no. 1, 25-35. Zhou, X. Y.(1991), O n the recurrence of simple random walk o n some lattice fractals, J. Appl. Prob. 1992, 29, 454-459. Zhou, X. Y.(2002), Collection Papers of Zhou Xianyin, Beijing Normal Univ. Press. Zhu, D. J.(1990), Master’s Thesis (in Chinese), Anhei Normal Univ. Zolotarev, V. M.(1984), Probability metrics, Theory Prob. Appl. 28 no. 2, 278-302. Zolotarev, V. M.(1986), M o d e m Theory of i.i.d. Random Variables (in Russian), Nauka, Moscow.

Author Index Deuschel, J.-D., 329, 576 Diaconis, P., 576 Ding, W. D., 302, 446, 513, 537, 538, 547, 554, 575, 576, 587, 588 Dittrich, P., 513, 571, 576 Dobrushin, R. L., 171, 223, 421, 576 Doeblin, W., 223, 576 Donsker, M. D., 314, 577 Doob, J. L., 96, 455, 577 Down, D., 171, 577 Dowson, D. C., 9, 577 Doyle, P. G., 302, 577 Dudley, R. M., 182, 223, 510, 577 Durrett, R., 302, 537, 538, 554, 558, 576, 577 Dynkin, E. B., 57, 59, 577

A Aharony, A,, 421, 578 Aizenman, M., 446, 572 Aldous, D. G., 171, 572 Anderson, W. J., 119, 171, 223, 572 Andjel, E. D., 513, 572 Arnold, B., 172, 581 Arnold, L., 513, 572 Austin, D. G., 572

B Barlow, M. T., 421, 572 Basis, V. Ya., 96, 171, 509, 513, 538, 572 Bebbington, M., 572 Billingsley, P., 120, 175, 572 Blumenthal, R. M., 572 Bobkov, S. G., 380, 572 Boldrighini, C., 554, 571, 572 Bouleau, N., 271, 572

E

C

Ellis, R. S., 421, 577

Carlen, E. A,, 572 Chebotarev, A. M., 96, 572, 581 Cheeger, J., 380, 572 Chen, A. Y . , 271, 572, 580, 587 Chen, D. Y., 572, 573 Chen, J. D., 208, 573 Chen, J. W., 172, 538, 573 Chen, M. F., 6, 61, 96, 119, 171, 172, 223, 271, 302, 329, 380, 446, 466, 513, 538, 571, 573, 574, 575, 576, 580, 587 Cheng, H. X., 61, 575 Chung, K. L., 5, 167, 171, 172, 575 Cohn, D. L., 57, 575

D Dai,-Y. L., 271, 446, 576 Daley, D. J., 211, 585 Dawson, D. A., 554, 576 DeMasi, A,, 554, 571, 572, 576 Dembo, A., 329, 576 Derman, C., 172, 576

589

F Falconer, K. J., 421, 577 Fei, Z. L., 580 Feller, W., 2, 61, 96, 171, 577, 578 Feng, J. F., 571, 572, 573, 578 Feng, S., 554, 578 Ferrari, P. A., 578 Fill, 3. A., 171, 572 Forbes, F., 578 Foster, F. G., 172, 578 Frohlich, J., 421, 578 Franqois, O., 578 Freedman, D., 578 Fritz, J., 571, 578 Fukushima, M., 270, 271, 578 Funaki, T., 571, 578

AUTHOR INDEX

590

G Gefen, Y . , 421, 578 Georgii, H. O., 421, 578 Getoor, R. K., 572 Givens, C. R., 9, 578 Gong, S., 451, 579 Gotze, F., 380, 572 Granovsky, B., 380, 579 Greven, A . , 513, 578 Griffeath, D., 223, 579 Gross, L., 380, 579 Guionnet, A., 446, 579 Guo, M. Z., 571, 579 Guo, Q. F., 96, 103, 119, 161, 172, 271, 580

K Kantorovich, L. V., 96, 581 Kelly, F. P., 172, 270, 581 Kendall, D. G., 61, 171, 172, 581 Kersting, G., 96, 581 Kingman, J. F. C., 61, 171, 581 Kipnis, C., 380, 571, 581 Klebaner, F. C., 96, 579, 581 Kolmogorov, A. N., 61, 96, 270, 581 Konstantinov, A. A., 96, 581 Koteckf, R., 421, 576 Krylov, V. I., 96, 581 Kusuoka, S., 421, 572, 576, 581 Kuznetsov, S. E., 60, 581

H 1

Haken, H., 2, 513, 579 Hamza, K., 96, 579 Han, D., 172, 530, 579 Harris, T. E., 169, 172, 579 He, S. W., 171 Herbst, I., 579 Higuchi, Y . , 421, 587 Hirsch, F., 271, 572 Holley, R., 446, 459, 513, 572, 579 Hou, Z.T., 6, 61, 96, 103, 119, 161, 171, 172, 270, 302, 579, 580, 584, 588 Hsu, P. L., 61, 580 Hu, D. H., 61, 96, 171, 554, 580 Hua, L. K., 451, 580 Huang, C. C., 171, 580 Huang, L. P., 171, 223, 538, 571, 575, 580 Hunt, G. A., 580 Hwang, C. R., 380, 581 Hwang-Ma, S. Y . , 380, 581

I Ikeda, N., 7, 179, 581 Isaacson, D., 171, 580, 581 Israel, R., 421, 578

J Jain, N. C., 329, 581

Lowe, M., 582 Lbpez, F. J., 583 Landau, B. V., 9, 577 Landim, C., 380, 571, 581, 582 Lawler, G. F., 380, 582 Levin, S., 554, 577 Li, J . P., 119, 530, 580, 582 Li, S. F., 223, 571, 575 Li, S. Q., 446, 582 Li, T. D., 582 Li, Y . , 446, 513, 554, 582 Li, Z. B., 513, 587 Li, Z. P., 554 Lieb, E. H., 421, 578 Liggett, T. M., 3, 223, 380, 421, 432, 446, 449, 450, 456, 513, 537, 538, 547, 576, 579, 582 Lin, X., 172, 588 Lindvall, T., 223, 571, 582 Linstr+m, T., 421, 582 Liu, X. I?., 57, 61, 586, 587, 588 Liu, X. J., 446, 576, 582 Liu, Z. M., 580 LoBve, M., 61, 70, 582 Lu, Y . G., 329, 575 Luecke, G. R., 171, 581 Lyons, T. J., 302, 583

591

AUTHORINDEX

M M a , Z. M., 60, 270, 583 blaes, C., 421, 583 Malyshev, V. A . , 421, 583 Mandelbrot, B. B., 421, 578, 583 Mao, Y . H., 172, 380, 583 Martinelli, F., 446, 583 Martinez, S, 583 Maslov, V. P., 96, 581 McCoy, B. M., 18, 583 Meise, C., 582, 583 Mertens, J. F., 171, 583 Meyn, S. P., 171, 455, 577, 583 Miclo, L., 380, 583 Minlos, R. A , , 421, 446, 583 Mountford, T. S., 550, 583

N Nash-Williams, C. St. 3. A . , 292, 299, 583 Nelssen, R. B., 223, 584 Neuhauser, C., 538, 554, 558, 577, 584 Neveu, J., 25, 584 Nicolis, G., 513, 584 Nummelin, E., 171, 584

0 Olkin, I., 9, 584 Olla, S., 421, 571, 581, 584 Oshima, Y . , 270, 578 Osterwalder, K., 421, 584 Ouyan, R. H., 271

P Pakes, A. G., 109, 584 Papanicolaou, G. C., 571, 579 Park, Y . M., 421, 584 Parthasarathy, K. R., 175, 584 Pellegrinotti, A . , 554, 571, 572 Percherski, E. A,, 223, 576 Perkins, E. A . , 421, 572 Perrut, A , 571, 584 Pirogov, S. A, 421 Pitt, L, 579 Pollett, P., 572 POPOV, N. S., 172, 584 .

Preston, C. J., 421, 457, 584 Presutti, E., 554, 571, 572, 576 Prigogine, I., 513, 584 Pukelsheim, R., 9, 584

Q Qian, M., 270, 584 Qian, M. P., 271, 572, 573, 584

R Rockner, M., 270, 380, 583, 585 Riischendorf, L., 9, 585 Rachev, S. T., 223, 584, 585 Redig, F., 583 Ren, K. L., 446, 584 Reuter, G. E. H., 2, 61, 96, 119, 172, 584 Roberts, G. O., 380, 585 Rogers, L. C. G., 223, 571, 582 Rosenthal, J. S., 380, 585 5 Saada, E., 583 Samuel-Cahn, E., 171, 583 Sanz, G., 583 Schlogl, F., 2, 585 Schonmann, R. H., 446, 585 Schrader, R., 421, 584 Sethuraman, S., 582 Shapir, Y., 421, 578 Sheu, S. J., 380, 581 Shiga, T., 513, 550, 585 Shlosman, S. B., 421, 457, 576, 583, 585 Shortt, R. M., 9, 578 Silverstein, M. L., 270, 585 Simon, B., 421, 578 Sinai, Ya. G., 421, 576, 585 Snell, J. L., 302, 577 Sokal, A . D., 60, 380, 582, 585 Spitzer, F., 513, 547, 563, 582, 585 Spohn, H., 571, 585 Stoyan, D., 211, 585 Strassen, V., 585 Stroock, D. W., 96, 120, 171, 305, 307, 324, 329, 446, 459, 510, 513, 572, 575, 576, 577, 578, 579, 585 Sullivan, W. G., 380, 585 Saulga, A., 182, 585

AUTHOR INDEX

592

T Takeda, M., 270, 578 Tang, S. Z., 96, 446, 456, 513, 585, 586 TavarB, S., 109, 584 Theodosopulu, M., 513, 572 Thomas, L. E., 380, 585, 586 Thorrison, €I., 223, 586 Trisch, A . , 446, 583 Tuominen, P., 171, 584, 586 Tweedie, R. L., 171, 172, 455, 577, 581, 583, 584, 586

V Vallender, S. S., 9, 586 van Doorn, E. A . , 380, 586 Varadhan, S. R. S., 96, 120, 314, 513, 571, 577, 579, 581, 582, 585, 586 Varopoulos, N . , 302, 586 Veech, W., 169, 172, 586 Vere-Jones, D., 171, 586 Villani, C., 223, 586

W Wang, F. Y . , 380, 466, 575, 585, 586 Wang, J. X., 57, 587 Wang, P. Z., 580 Wang, Z. K., 57, 61, 70, 1.61, 171, 586, 587 Wasserstein, L. N., 223, 587 Watanabe, S., 7, 179, 581 Williams, D., 587 Wu, R., 151, 172, 587 Wu, J., 421 Wu, L. D., 172, 587 Wu, T. T., 18, 583 Wu, X. Y., 446, 587

X Xiao, G. N., 580 Xu, X. J., 571, 575, 587

Y Yan, J. A., 510, 587 Yan, S. J., 57, 119, 172, 446, 513, 554, 575, 587 Yang, C. N., 582 Yang, X. Q., 96, 161, 171, 271, 587 Yoshida, N., 421, 587 Yosida, K., 35, 55, 587 Yuan, C. G., 580

Z Zahradnik, M., 421, 576 Zarnit, S., 171, 583 Zegarlinski, B., 446, 579, 585, 587 Zeifman, A. I., 380, 579, 587 Zeitouni, O., 329, 576 Zeng, W. Q., 446, 587 Zhang, H. J., 172, 271, 572, 580, 587, 588 Zhang, J. K., 119, 587 Zhang, S. Y., 538, 588 Zhang, Y . H., 151, 172, 223, 583, 587, 588 Zheng, J . L., 96, 223, 421, 588 Zheng, X. G., 6, 61, 96, 119, 271, 513, 547, 554, 572, 575, 576, 578, 582, 588 Zhou, J. Z., 580 Zhou, X. Y., 300, 302, 581, 588 Zhu, D. J . , 538, 575, 588 Zolotarev, V. M., 223, 588

Subject Index Special Symbols

r-specification, 384 R(X,P), 88 8, 122,383 a-isomorphic, 60 U,(E), 120

1-dependent, 549 11. l l z l l 184 C, 383 ( B ) , 70 ( B n ) , 86 ( B x ) , 70 Bq-process, 97 Cb(E), 120 %y.f(E), 411 ( D ,g ( D ) ) , 258, 331 d-system, 57 dim% 2 0, 231 dim%, 91 dm i ?,' 91 ~ W ( L 341 ), ( F ) , 70 (Fn), 86 ( F A ) , 70 Fq-process, 97 cp-optimal, 203, 205 cp-optimal Markovian coupling, 205 cp-optimal measurable coupling, 205 gap(D), 331 gap(L), 331 % p ( E ) , 126 2 - s y s t e m , 57 Xo, 363, 368 XO(A)I 359 X I , 359 Xi(B), 360 n-almost honest, 232 n-equivalent, 231 7r-system, 57 P ( E ) , 120 (P;in(t)), 5 Pm'"(X,x,A),74 Pmin(t,x,A),77 Q-condition, 1 q-condition, 56, 85 Q-matrix, 1 q-pair, 32 Q-process, 2, 32 r-boundary, 383

A absorbing, 32 abstract chess-board estimates, 417 additive theorem, 333 aperiodic, 130 asymptotically stable, 556 autocatalytic model, 503

B backward equation at point x, 237 backward Kolmogorov equation, 73 backward Kolmogorov inequality, 73, 74 basic coupling, 11, 13, 186 basic Dirichlet form, 264 birth-death Q-matrix, 17, 110, 142, 164, 325, 488, 538 birth-death process, 11, 110, 160, 162, 210, 217, 218, 220, 266, 346, 348, 349, 359, 377, 549, 551, 559 Boltzmann constant, 385 Brussel's model, 150, 509

C canonical Gibbs state, 429 canonical image, 234 chain field, 275 Cheeger's constant, 368, 370 chess-board estimate, 410 CK-equation (Chapman-Kolmogorov equation), l, 23 classical coupling, 10, 185, 347, 354, 528 closed function (lower semi-continuous function), 121, 122, 205, 209, 305 co-zero property, 272 coalescing process, 504 coefficient of anisotropy, 385 compact function, 5 , 121, 124, 174, 391 comparison lemma, 104

593

594

SUBJECT INDEX

comparison theorem, 64, 332 conditional Gibbs state with periodic boundary condition, 408 conditional Hamiltonian, 385 conductance, 281 cone mapping, 62 configuration, 383 conservative, 32, 33 consistency condition, 384 consistent family of functions, 88 consistent family of measures, 88 constrained system, 321 continuous condition, 24, 51 contour, 397, 403, 548 contraction, 256 controlling equation, 64 core, 267, 335, 433 coupled branching process, 504 coupled random walk process, 506 coupling, 6, 173, 324, 340, 341, 342, 347, 470, 471, 479, 490, 506, 507, 518, 519, 553, 562, 563 coupling by reflection, 11, 218 coupling of marching soldiers, 11, 186, 347, 490, 528, 553 coupling operator, 10, 185 current, 281 cyclicity, 417 cylindrical function, 411 cylindrical set, 411

E edge, 272 effective conductance, 281 effective resistance, 281 eigenfunction, 341 eigenfunction in weak sense, 341 electric potential, 281 embedding chain, 139 embedding jump process, 125 energy dissipation, 282 entrance solutions, 92 entrance space, 88, 92 entropy, 15, 304 equilibrium solution, 556 ergodic, 12, 17, 137, 140, 143, 145, 149, 150, 152, 163, 164, 165, 196, 308, 325, 338, 352, 411, 450, 455, 459, 518, 520, 522, 523, 532, 538, 542, 543, 550 estimate of moment, 524 estimate of the first moment, 472 excessive measure, 170 existence criterion, 361 exit solutions, 92 exit space, 88, 92 explicit bound, 17, 349 explicit criterion, 17, 349 exponential Lz-convergence, 331 exponential ergodicity, 143 exponentially growing, 391

F

D dimension of %A, 90 dimension of YA,90 Dirichlet eigenvalue, 359 Dirichlet form, 258 Dirichlet operator, 261 DLR-equation (Dobrushin-Lanford-Ruelle equation), 384 Donsker-Varadhan entropy, 311 Donsker-Varadhan theorem, 306 Doob’s construction, 96 dual graph, 397 duality, 539 Dynkin-class, 57

Feller’s construction, 96 field of exclusion speed functions, 422 field of spin speed functions, 422 finite dimensional generalized Potlach process, 221 finite entrance, 92 finite exit, 92 finite range, 384 finitely ramified fractal, 300 first infinity, 103 first moment condition, 470, 479 first successive approximation scheme, 63 flow, 284 Fokker-Planck equation, 70 forward Kolmogorov equation, 73

595

SUBJECT INDEX

forward Kolmogorov inequality, 74 Friedricks extension, 267

G generalized Potlatch process, 505 geometrically ergodic, 137 Gibbs distribution in V with boundary condition, 387 Gibbs distribution in V with boundary condition, 387 Gibbs random field, 387 Gibbs state, 387, 429 graph representation method, 539 Gronwall’s Lemma, 472 growing condition, 493, 496

H Hamiltonian, 384 Hamiltonian with periodic boundary condition, 408 homogeneous equation, 63 honest, 24

I I-function, 304 independent coupling, 10, 185 inner regular, 60, 205 instantaneous, 32 integration by parts formula, 484, 507, 508, 510, 541 interaction function, 385 invariant measure, 124, 128, 166 inverse temperature, 385 irreducible, 130, 134 Ising model, 18, 384 isoperimetric constant, 368, 370 1

J

L

L6vy-Prohorov metric, 7 Laplace transform, 37, 51 large deviation principle, 305 lattice field, 278 lattice Sierpinski carpet, 299, 300, 402 lattice Sierpinski gasket, 298, 299, 400 limiting Gaussian process, 509 linear growth process, 503 Lipschitz condition, 470, 479 localization theorem, 67 locally compact, 499 logarithmic Sobolev inequality, 358 Loth-Volterra model, 221 lower semi-continuous function (closed function), 121

M marginality, 6, 10, 174, 184 Markov chain, 24 maximal coupling, 208 maximal solution, 92 mean field method, 539 measurable coupling, 392 minimal non-negative solution, 63 minimal solution, 63 minimum LP-distance, 8, 173, 179, 391, 469, 531 minimum property, 63 moment condition, 493, 496, 497 moments method, 539 monotone, 211 monotone class theorem, 57

jump condition, 2, 24, 51 jump process, 23

N K Kantorovich-Rubinstein-Wassersteinmetric, 173 kernel, 40 Krein extension, 270

nearest neighbor, 438 non-conservative quantity at x, 92 non-honest , 24 normal condition, 1, 51

596

SUBJECT INDEX 0

Ohm’s law, 281 open path, 547 operator Q , 71 optimal Markovian coupling, 203 order-preserving coupling, 211, 220 Osterwalder-Schrader positivity, 406, 421

resolvent equation, 51 reversible, 13, 227, 339, 341, 344, 348, 433, 452, 455, 533, 537, 550 reversible q-pair, 229 reversible measures, 433

S P partition function, 385 partition of E , 85 path, 272 Peierls inequality, 397 periodic configuration, 408 phase transition, 18, 20, 397, 400, 402, 413, 455, 457, 542, 547, 550, 558 Pirogov-Sinai method, 421 Polish space, 23 polynomial model, 503, 520, 521, 532, 533, 538, 550, 555 positive measure, 438 positive recurrent, 137, 140 potential, 273 potential field, 273 Potlatch process, 505 Potts gauge model, 450 principal eigenvalue, 362 probability kernel, 40

Q quadratic form, 359, 368 quadrilateral condition, 277, 278, 423, 426, 429, 435, 437, 439

R random field, 384 rate function, 15, 304 reachable, 272 reachable directly, 272 realization, 404 recurrence, 124, 140 reflection positive, 406 regular, 2 regular g-pair, 124 resistance, 281

Schlogl’s first model, 503, 520, 522, 532, 557 Schlogl’s model, 2, 4, 6, 19, 79, 110, 150, 164, 221 Schlogl’s second model, 503, 520, 522, 532, 552, 557 Schwarz inequality, 417 second successive approximation scheme, 64 section, 272 self-dual, 542 shift, 407 Shlosman model, 452 single birth Q-matrix, 4, 105, 152, 324 single birth Q-process, 105, 112, 151, 160, 300 single-entrance, 92 single-exit, 92 smoothing process, 504 specification, 384 spectral gap, 16, 330, 340, 348, 359, 538 speed functions, 425 spin space, 383 stable, 32, 556 stationary distribution, 124, 126 statistical sum, 385 stochastic monotonicity, 21 1 strong continuity, 256 strongly ergodic, 137, 143 sub-Markovian, 257 successful, 12, 198, 202, 218, 220, 222 successful coupling, 195 symmetric, 257 symmetrizable, 14, 142 symmetrizable jump process, 229

SUBJECT INDEX

T taboo probability) 167 tight, 205 total energy dissipation) 282 total variation, 8 totally instantaneous, 32 totally stable) 33 transition condition) 496, 497 transition function, 23 transition measure) 40, 205 translation, 407 translation invariant, 408 triangle condition, 423, 426, 429, 435, 438, 443, 533

U uniformly ergodic, 137, 143 uniformly tight, 205 uniqueness criterion, 2, 3, 103, 115 uniqueness theorem of Laplace transform, 58 universal measurable set, 60 universal measurable space, 60

597

V Varadhan theorem) 305 variational formula, 17, 349 variational formula for Dirichlet form, 363 Volterra-Lotka model, 509

w Wasserstein metric, 8, 173, 179, 391, 469, 531 weak convergence, 7 weak convergence in finite-dimensional distributions) 123 weak domain) 341 weak maximum principle, 500 work, 273

Y Yang-Mills lattice field, 451 Z zero range, 384 zero range process, 503 zero-entrance) 92 zero-exit , 92

From Markov Chains to Non-Equilibrium Particle Systems, Second Edition

Markov chains

Markov Chains

Markov chains

Nonequilibrium Problems in Many-Particle Systems

Markov set-chains

Finite Markov chains

Denumerable Markov Chains, Second Edition (Graduate Texts in Mathematics)

Denumerable Markov chains

Interactive Markov Chains

Notes on Markov chains

Particle-Lung Interactions, Second Edition

Markov chains and stochastic stability

Markov chains and mixing times

Markov chains and mixing times

Queueing networks and Markov chains

Markov Chains and Mixing Times

Markov Chains and Stochastic Stability

Markov chains and stochastic stability

Markov Chains and Invariant Probabilities

Nonequilibrium Problems in Many-Particle Systems. Lectures 3rd Session C.I.M.E

Markov chains and mixing times

Nonlinear Systems, Second Edition

Universality In Nonequilibrium Lattice Systems

Introduction to Discrete Event Systems, Second Edition

Non-negative Matrices and Markov Chains

Markov Chains Models, Algorithms and Applications

Markov Chains Models Algorithms and Applications

Markov Chains (North-Holland Mathematical Library)

Markov chains. Models, algorithms and applications

Numerical Methods for Structured Markov Chains

From Markov Chains to Non-Equilibrium Particle Systems, Second Edition

Markov chains

Markov Chains

Markov chains

Nonequilibrium Problems in Many-Particle Systems

Markov set-chains

Finite Markov chains

Denumerable Markov Chains, Second Edition (Graduate Texts in Mathematics)

Denumerable Markov chains

Interactive Markov Chains

Notes on Markov chains

Particle-Lung Interactions, Second Edition

Markov chains and stochastic stability

Markov chains and mixing times

Markov chains and mixing times

Queueing networks and Markov chains

Markov Chains and Mixing Times

Markov Chains and Stochastic Stability

Markov chains and stochastic stability

Markov Chains and Invariant Probabilities

Nonequilibrium Problems in Many-Particle Systems. Lectures 3rd Session C.I.M.E

Markov chains and mixing times

Nonlinear Systems, Second Edition

Universality In Nonequilibrium Lattice Systems

Introduction to Discrete Event Systems, Second Edition

Non-negative Matrices and Markov Chains

Markov Chains Models, Algorithms and Applications

Markov Chains Models Algorithms and Applications

Markov Chains (North-Holland Mathematical Library)

Markov chains. Models, algorithms and applications

Numerical Methods for Structured Markov Chains

Recommend Documents