Communications in Mathematical Physics - Volume 190

Commun. Math. Phys. 190, 1 – 56 (1997) Communications in Mathematical Physics c Springer-Verlag 1997 Twisted Wess-Zu...

Author: A. Jaffe (Chief Editor)

49 downloads 559 Views 7MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

Commun. Math. Phys. 190, 1 – 56 (1997)

Communications in

Mathematical Physics c Springer-Verlag 1997

Twisted Wess-Zumino-Witten Models on Elliptic Curves Gen Kuroki1 , Takashi Takebe2 1

Mathematical Institute, Tohoku University, Sendai 980, Japan Department of Mathematical Sciences, University of Tokyo, Komaba, Tokyo 153, Japan. E-mail: [email protected]

2

Received: 21 January 1997/ Accepted: 1 April 1997

Abstract: Investigated is a variant of the Wess-Zumino-Witten model called a twisted WZW model, which is associated to a certain Lie group bundle on a family of elliptic curves. The Lie group bundle is a non-trivial bundle with flat connection and related to the classical elliptic r-matrix. (The usual (non-twisted) WZW model is associated to a trivial group bundle with trivial connection on a family of compact Riemann surfaces and a family of its principal bundles.) The twisted WZW model on a fixed elliptic curve at the critical level describes the XYZ Gaudin model. The elliptic Knizhnik-Zamolodchikov equations associated to the classical elliptic r-matrix appear as flat connections on the sheaves of conformal blocks in the twisted WZW model. Contents 0 1 1.1 1.2 1.3 1.4 2 3 3.1 3.2 3.3 4 4.1 4.2 5 5.1 5.2

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Spaces of Conformal Coinvariants and Conformal Blocks . . . . . . . . . . . Group bundles and their associated Lie algebra bundles . . . . . . . . . . . . . Definition of the spaces of conformal coinvariants and conformal blocks Correlation functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Action of the Virasoro algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Critical Level and the XYZ Gaudin Model . . . . . . . . . . . . . . . . . . . . . . . Sheaves of Conformal Coinvariants and Conformal Blocks . . . . . . . . . . Family of pointed elliptic curves and Lie algebra bundles . . . . . . . . . . . Sheaf of affine Lie algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Definition of the sheaves of conformal coinvariants and conformal blocks Sheaf of the Virasoro Algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Definition of the sheaf of the Virasoro algebras . . . . . . . . . . . . . . . . . . . Action of the sheaf of Virasoro algebras . . . . . . . . . . . . . . . . . . . . . . . . . Flat Connections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Construction of flat connections . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Elliptic Knizhnik-Zamolodchikov equations . . . . . . . . . . . . . . . . . . . . . .

2 5 5 7 10 15 16 20 20 22 23 26 26 29 35 35 38

2

G. Kuroki, T. Takebe

5.3 Modular invariance of the flat connections . . . . . . . . . . . . . . . . . . . . . . . 6 Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Appendix A. Theta functions with characteristics . . . . . . . . . . . . . . . . . . . . Appendix B. The Kodaira-Spencer map of a family of Riemann surfaces . . Appendix C. On a formulation for higher genus Riemann surfaces . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

41 47 48 50 51 54

0. Introduction In this paper, we deal with a variant of the (chiral) Wess-Zumino-Witten model (WZW model, for short) on elliptic curves, which shall be called a twisted WZW model. The usual (non-twisted) WZW model on a compact Riemann surface X gives rise to the sheaves of vector spaces (of conformal blocks and of conformal coinvariants) on any family (or the moduli stack) of principal G-bundles, where G is a semisimple complex algebraic group. Note that the notion of principal G-bundles is equivalent to that of Gnt -torsors, where Gnt denotes the trivial group bundle G × X on X. (The symbol (·)nt stands for “non-twisted”.) This suggests that there exists a model associated to a nontrivial group bundle Gtw with a flat connection on X, which gives sheaves of conformal blocks and conformal coinvariants on a family of Gtw -torsors. (The symbol (·)tw stands for “twisted”.) We call such a model a twisted WZW model associated to Gtw . The aim of this work is not to establish a general theory of the twisted WZW models but to describe certain interesting examples of the twisted WZW models related to the elliptic classical r-matrices ([BelD]). In this introduction, we explain our motivations and clarify the relationship between the twisted WZW models and various problems in mathematics and physics. One of the motivations is the viewpoint of representation theory where the WZW model is formulated as an analogue of a theory of automorphic forms due to Langlands. We list corresponding ingredients of both theories in Table 1. Notations shall be explained in the main part of this paper. There is a theory of automorphic forms for arbitrary (possibly non-split) reductive groups over a global field as well as over a local field. But so far only the WZW model associated to the trivial group bundle has been considered and the counterpart of the non-split reductive group over a global field has been absent. The twisted WZW models fills this blank. The second motivation comes from the geometric Langlands program over C and its relation with quantum integrable systems. A geometric analogue of the Langlands correspondence over C is described by using the WZW model at the critical level, where the centers of the completed enveloping algebras of affine Lie algebras are sufficiently large so that we can consider analogues of the infinitesimal characters of finite-dimensional semisimple Lie algebras ([Hay, GW]). For introduction to the original Langlands program we refer to [Bo] and [Ge]. For a general formulation of the geometric Langlands correspondence over C related to the non-twisted WZW models at the critical level, see [Bei] and [BeiD] and for the analogue of the local Langlands correspondence to affine Lie algebras at the critical level, see [FF3 and Fr1]. The twisted WZW model at the critical level shall give a geometric analogue of the Langlands correspondence for a non-split reductive group over a global field. To study this model at the critical level is important not only in this context of the geometric Langlands program but also in the theory of the quantum integrable spin chains. B. Feigin, E. Frenkel and N. Reshetikhin found in [FFR] that the non-twisted

Twisted WZW Models on Elliptic Curves

3

Table 1. Analogy between automorphic forms and conformal blocks Theory of automorphic forms

Theory of the WZW models

a global field, i.e., a number field or the function field of an algebraic curve over a finite field

the function field of a compact Riemann surface X

a local field

C((ξ)), a field of formal Laurent series

a reductive group over the global field

a semisimple group bundle on X with flat connection or the associated Lie algebra bundle gtw

a non-split reductive group over the global field

a semisimple group bundle with flat connection on X which is not locally trivial under the Zariski topology

the adèle group associated to the reductive group

the affine Lie algebra LL (g⊕L )∧ = g ⊗ C((ξi )) ⊕ Ckˆ i=1

the principal adèle subgroup of the adèle group

the subalgebra gD˙ X (g⊕L )∧

a unitary representation of the adèle group

a representation M of (g⊕L )∧ or its algebraic dual M ∗

the space of automorphic forms in the representation space, i.e., the invariant subspace of the representation space with respect to the principal adèle subgroup

the space CB(M ) of conformal blocks, i.e., the invariant subspace of M ∗ with respect to gD ˙ X

=

H 0 (X, gtw (∗D)) of

WZW model at the critical level on the Riemann sphere is closely related to a spin chain model called the Gaudin model. See also [Fr2]. Its Hamiltonian is described as an insertion of a singular vector of the vacuum representation at a point and the diagonalization problem turns out to be equivalent to a description of a certain space of conformal blocks. This “Gaudin” model is, however, merely a special case of the model introduced by M. Gaudin [Ga1, Ga2, Ga3] as a quasi-classical limit of the XYZ spin chain model. Let us call this general model the XYZ Gaudin model, following [ST1] where the diagonalization problem of this model was studied by the algebraic Bethe Ansatz. In order to extend the results of [FFR], we need the twisted WZW model on a elliptic curve at the critical level as we shall see in Sect. 2. We remark that the non-twisted WZW models at the critical level on an elliptic curve is related to quantum integrable systems on root systems. In fact those systems defined by the trigonometric (dynamical) r-matrices are described by the non-twisted WZW model on a degenerated elliptic curve with only one ordinary double point. The nontwisted WZW model at the critical level on an elliptic curve leads to a system called the Gaudin-Calogero model ([ER, N]) which was defined as Hitchin’s classical integrable system ([Hi]) on the moduli space of semistable principal bundles on an elliptic curve.1 The reason why root systems appear in the non-twisted WZW models is explained as follows. Let G be a complex semisimple group and T its maximal torus. Let a and b denote generators of the fundamental group π1 (X) of an elliptic curve X. Then, for g ∈ T , the homomorphism from π1 (X) into G sending a and b to 1 and g respectively induces a semistable principal G-bundle on X. This defines the covering by T of the moduli space of semistable G-bundles on X. Furthermore the universal covering of T is identified with its Lie algebra, on which the root system structure exists. Namely, the 1 This relation of the non-twisted WZW model and the Gaudin-Calogero model is due to B. Enriquez and A. Stoyanovsky. T.T. thanks Enriquez for communicating their unpublished result.

4


root system appears as a covering space of the moduli space of semistable principal G-bundles on X. The third motivation is a geometric interpretation of Etingof’s elliptic KZ equations. As is well-known, the Knizhnik-Zamolodchikov equation is a system of differential equations satisfied by matrix elements of products of vertex operators ([KZ, TK]) and is a flat connection over the family of pointed Riemann spheres. Similarly from the nontwisted WZW model over elliptic curves arises the elliptic Knizhnik-ZamolodchikovBernard equations (KZB equations, for short), which Bernard found in [Be1] by computing traces of products of vertex operators twisted by g ∈ G. The interpretation of the elliptic KZB equations as flat connections on sheaves of conformal blocks, which are defined without use of the traces, was found in [FW]. Using the same idea as [Be1], Etingof computed in [E] a twisted trace of a product of vertex operators and found that it obeys linear differential equations of KZ type defined by the elliptic classical r-matrices. We call these equations the elliptic KZ equations. In the present paper it is shown that the elliptic KZ equations also has an interpretation as flat connections on sheaves of conformal blocks. Let us explain now the contents of this paper. In Sect. 1, we give a definition of the conformal coinvariants and the conformal blocks of the twisted WZW model on an elliptic curve. The definition of the non-trivial group bundle Gtw (1.3) and the associated Lie algebra bundle gtw (1.4) is given in Sect. 1.1 and their fundamental properties are studied. This bundle gtw was used by I. Cherednik [C] for an algebro-geometric interpretation of classical elliptic r-matrices. An important point is that the cohomology groups of gtw vanish in all degrees. Since the 1-cohomology H 1 (X, gtw ) can be canonically identified with the tangent space of the moduli space of Gtw -torsors at the equivalence class consisting of trivial ones, the trivial Gtw -torsor can not be deformed. Thus non-trivial Gtw -torsors do not appear in the twisted WZW model associated to Gtw . Conformal coinvariants and conformal blocks of this model are defined in Sect. 1.2. We introduce correlation functions of current and the energy-momentum tensor in Sect. 1.3, following mostly [TUY]. An action of the Virasoro algebra on the conformal coinvariants and the conformal blocks is defined in Sect 1.4. This model at the critical level for G = SL2 (C) describes the XYZ Gaudin model and the case for G = SLN (C) is related to the higher rank generalizations of the XYZ Gaudin model, as shown in Sect. 2. Away from the critical level, we can define a connection on the sheaves of conformal coinvariants and blocks over the family of pointed elliptic curves. Sect. 3 and Sect. 4 are a sheaf version of Sect. 1 over a family of pointed elliptic curves. By extending the tangent sheaf of the base space of the family (4.3) and constructing its action on the sheaves of conformal coinvariants and conformal blocks, we can introduce the D-module structure on them in Sect. 5, which naturally implies the existence of flat connections. The explicit formulae in Sct. 5.2 show that our connections are identical with Etingof’s elliptic KZ equations. This connection has modular invariance which Etingof proved by his explicit expressions of the equations. We give a geometric proof of this fact in Sect. 5.3. Useful properties of theta functions are listed in Appendix A. An algebro-geometric meaning of the extension (4.3) is explained in Appendix B. Higher-genus generalization of the theory is discussed in Appendix C.


5

1. Spaces of Conformal Coinvariants and Conformal Blocks In this section we define the space of conformal coinvariants and conformal blocks associated to a twisted Group bundle. 1.1. Group bundles and their associated Lie algebra bundles. In this section we define a group bundle Gtw and an associated Lie algebra bundle gtw on an elliptic curve X = Xτ = C/(Z + τ Z), where τ belongs to the upper half plane H := {Im τ > 0}. We fix a global coordinate t on X which comes from that of C. Let G be the Lie group SLN (C) and g be its Lie algebra, slN (C) = {A ∈ MN (C) | tr A = 0}. We fix an invariant inner product of g by (A|B) := tr(AB) Define matrices α and β by 0 1 .  0 ..  α :=  .. . 1

0  ,  1 0

for A, B ∈ g. 

 β :=  

(1.1)

1

0 ε ..

0

.

  , 

(1.2)

N −1

ε

where ε = exp(2πi/N ). Then we have αN = β N = 1 and αβ = εβα. We define the group bundle Gtw and its associated Lie algebra bundle gtw by Gtw := (C × G)/∼, gtw := (C × g)/≈,

(1.3) (1.4)

where the equivalence relations ∼ and ≈ are defined by (t, g) ∼ (t + 1, αgα−1 ) ∼ (t + τ, βgβ −1 ), (t, A) ≈ (t + 1, αAα−1 ) ≈ (t + τ, βAβ −1 ).

(1.5) (1.6)

(Because of αβ = εβα, the group bundle Gtw is not a principal bundle.) The fibers of Gtw are isomorphic to G and those of gtw are isomorphic to g, but there are not canonical isomorphisms. The twisted Lie algebra bundle gtw has a natural connection, ∇d/dt = d/dt, and is decomposed into a direct sum of line bundles: M La,b , (1.7) gtw ∼ = (a,b)6=(0,0)

where the indices (a, b) runs through (Z/N Z)N r {(0, 0)}. Here the line bundle La,b on X is defined by (1.8) La,b := (C × C)/≈a,b , where ≈a,b is an equivalence relation defined by (t, x) ≈a,b (t + 1, εa x) ≈a,b (t + τ, εb x). We regard La,b as a line subbundle of gtw through the injection given by

(1.9)

6


La,b 3 (t, x) 7→ (t, xJa,b ) ∈ gtw ,

(1.10)

where Ja,b is the element of g defined by Ja,b := β a α−b .

(1.11)

We remark that { Ja,b | (a, b) ∈ (Z/N Z)N r {(0, 0)} } is a basis of g = slN (C). The space of meromorphic sections of La,b over X pulled back to C can be canonically identified with Ka,b = { f ∈ M(C) | f (t + 1) = εa f (t), f (t + τ ) = εb f (t) }.

(1.12)

Here M(C) is the space of meromorphic functions on C. (K0,0 is the space of elliptic functions and corresponds to the trivial line bundle on X.) The mapping f 7→ (t, f (t)) modulo ≈a,b

(1.13)

gives a canonical isomorphism from Ka,b onto H (X, La,b ⊗ KX ). The Liouville theorem implies that the only holomorphic function in Ka,b is zero when (a, b) 6= (0, 0). This is equivalent to H 0 (X, La,b ) = 0. Since L∗a,b ∼ = L−a,−b and the canonical line bundle of X is trivial, it follows from the Serre duality that H 1 (X, La,b ) = 0. Thus we obtain a simple vanishing result H p (X, La,b ) = 0 and therefore from the decomposition (1.7) we obtain the following result. 0

Lemma 1.1. H 0 (X, gtw ) = H 1 (X, gtw ) = 0. Example 1.2. For N = 2, matrices α and β are nothing but the Pauli matrices σ 1 and σ 3 . The Jacobian elliptic functions sn, cn, and dn are meromorphic functions in K1,0 , K1,1 , and K0,1 and can be regarded as meromorphic sections of the line bundles L1,0 , L1,1 , and L0,1 respectively. Example 1.3. For general N and each (a, b) ∈ (Z/N Z)2 r {(0, 0)}, we define the function wa,b by 0 θ[0,0] θ[a,b] (t; τ ) . (1.14) wa,b (t) = wa,b (τ ; t) := θ[a,b] θ[0,0] (t; τ ) (See (A.8) in Appendix A for the notation.) The function wa,b (t) on C is uniquely characterized by the following properties: 1. The function wa,b (t) is a meromorphic function in Ka,b and hence can be regarded as a global meromorphic section of La,b ; 2. The poles of wa,b (t) are all simple and contained in Z + Zτ ; 3. The residue of wa,b (t) at t = 0 is equal to 1. Because of these properties, it will play an important role in concrete computations in later sections. For convenience of those computations, let us list several other properties of wa,b (t): • The Laurent expansion at t = 0 is equal to ∞

wa,b (t) =

1 X 1 + wa,b,ν tν = + wa,b,0 + wa,b,1 t + · · · , t t

(1.15)

ν=0

where the coefficients are written in the following forms: wa,b,0 =

0 θ[a,b] , θ[a,b]

wa,b,1 =

00 000 θ[a,b] θ[0,0] − 0 , 2θ[a,b] 6θ[0,0]

... .

(1.16)


7

• Formulae (A.6) and (A.7) imply wa,b (t) = wa0 ,b0 (t) if a ≡ a0 and b ≡ b0 mod N . (1.17) • Lemma A.1 will be used in the following form: X wa,b,1 = 0, (1.18) w−a,−b (t) = −wa,b (−t),

(a,b)6=(0,0)

where the summation is taken over all (a, b) ∈ (Z/N Z)2 r {(0, 0)}. 1.2. Definition of the spaces of conformal coinvariants and conformal blocks. In this section we define a conformal block associated to the twisted Lie algebra bundle gtw defined in Sect. 1.1. First let us introduce notation of sheaves. As usual, the structure sheaf on X = Xτ is denoted by OX and the sheaf of meromorphic functions on X by KX . A stalk of a sheaf F on X at a point P ∈ X is denoted by FP . When F is a OX -module, we denote its fiber FP /mP FP by F|P , where mP is the maximal ideal of the local ring OX,P . Denote by FP∧ the mP -adic completion of FP . We shall use the same symbol for a vector bundle and for a locally free coherent OX -module consisting of its local holomorphic sections. For instance, the invertible sheaf associated to the line bundle La,b is also denoted by the same symbol La,b . Denote by 1X the sheaf of holomorphic 1-forms on X, which is isomorphic to OX since X is an elliptic curve. The fiberwise Lie algebra structure of the bundle gtw induces that of the associated sheaf gtw over OX . Define the invariant OX -inner product on gtw by (A|B) :=

1 tr (ad A ad B) ∈ OX 2N gtw

for A, B ∈ gtw ,

(1.19)

where the symbol ad denotes the adjoint representation of the OX -Lie algebra gtw . Then the inner product on gtw is invariant under the translations with respect to the connection ∇ : gtw → gtw ⊗OX 1X : d(A|B) = (∇A|B) + (A|∇B) ∈ 1X

for A, B ∈ gtw .

(1.20)

Under the trivialization of gtw defined by the construction (1.4), the connection ∇ and the inner product (·|·) on gtw respectively coincide with the exterior derivative by t and the inner product defined by (1.1). For any point P on X with t(P ) = z, we put gP := (gtw ⊗OX KX )∧ P, which is a topological Lie algebra non-canonically isomorphic to the loop algebra g((t− tw ∧ ∼ z)). Its subspace gP + := (g )P = g[[t − z]] is a maximal linearly compact subalgebra P of g under the (t − z)-adic linear topology. Let us fix mutually distinct points Q1 , . . . , QL on X whose coordinates are t = z1 , . . . , zL and put D := {Q1 , . . . , QL }. We shall also regard D as a divisor on X (i.e., L ˙ The Lie algebra gD := L gQi has the D = Q1 + · · · + QL ). Denote X r D by X. i=1 natural 2-cocycle defined by ca (A, B) :=

L X i=1

Res(∇Ai |Bi ), t=zi

(1.21)

8


L D where A = (Ai )L i=1 , B = (Bi )i=1 ∈ g and Rest=z is the residue at t = z. (The symbol “ca ” stands for “Cocycle defining the Affine Lie algebra”.) We denote the central extension of gD with respect to this cocycle by gˆ D :

ˆ gˆ D := gD ⊕ Ck, where kˆ is a central element. Explicitly the bracket of gˆ D is represented as ˆ [A, B] = ([Ai , Bi ]0 )L i=1 ⊕ ca (A, B)k

for A, B ∈ gD ,

(1.22)

where [Ai , Bi ]0 are the natural bracket in gQi . The Lie algebra gˆ P for a point P is noncanonically isomorphic to the affine Lie algebra gˆ of type A(1) N −1 (a central extension of the loop algebra g((t − z)) = slN C((t − z)) ). If P = Qi for i = 1, . . . , L, then Qi tw ∧ gˆ P = gˆ Qi can be regarded as a subalgebra of gˆ D . Put gP + := (g )P as above. Then g+ Q D can be also regarded as a subalgebra of gˆ i and gˆ . tw Let gD ˙ be the space of global meromorphic sections of g which are holomorphic X ˙ on X: tw gD ˙ := 0(X, g (∗D)). X D There is a natural linear map from gD ˙ into g which maps a meromorphic section of X tw g to its germ at Qi ’s. As in the non-twisted case (e.g., Sect. 2.2 of [TUY]), the residue theorem implies that this linear map is extended to a Lie algebra injection from gD ˙ into X D ˆ gˆ D , which allows us to regard gD as a subalgebra of g . ˙ X

Definition 1.4. The space of conformal coinvariants CC(M ) and that of conformal blocks CB(M ) associated to gˆ Qi -modules Mi with the same level kˆ = k are defined to NL be the space of coinvariants of M := i=1 Mi with respect to gD ˙ and its dual: X CC(M ) := M/gD ˙ M, X

∗ CB(M ) := (M/gD ˙ M) . X

(1.23)

(In [TUY], CC(M ) and CB(M ) are called the space of covacua and that of vacua respectively.) In other words, the space of conformal coinvariants CC(M ) is generated by M with relations (1.24) AX˙ v ≡ 0 for all AX˙ ∈ gD ˙ and v ∈ M , and a linear functional 8 on M belongs to the space of X conformal blocks CB(M ) if and only if it satisfies that 8(AX˙ v) = 0

for all AX˙ ∈ gD ˙ and v ∈ M . X

(1.25)

These Eqs. (1.24) and (1.25) are called the Ward identities. The most important conformal blocks for our purpose are constructed from Weyl modules (or generalized Verma modules) which are determined from the following data: – A parameter k ∈ C which is called the level of the model. – Finite-dimensional irreducible representations Vi of the fiber Lie algebra gtw |Qi isomorphic to g.


9

i i i ˆ ˆQ ˆ Qi . The Put gQ := (gtw )∧ := gQ + + + ⊕ Ck, which are subalgebras of g Qi and g Qi Qi Q i subalgebra gˆ + := g+ ⊕ Ckˆ of gˆ acts on Vi through the linear map kˆ 7→ k idVi and i tw ˆ Qi -module induced from Vi is the natural projection gQ + → g |Qi , A 7→ A(Qi ). The g called a Weyl module or a generalized Verma module:

gˆ Qi

Mk (Vi ) := Ind Vi = U (ˆgQi ) ⊗U (gˆ Qi ) Vi Qi

(1.26)

+

gˆ +

See [KL] for properties of Weyl modules. The space of conformal coinvariants and that of conformal blocks associated to the data (Q, V ) = ({Qi }, {Vi }) are defined to be the space of conformal coinvariants and that of conformal blocks associated to the gˆ D -module Mk (V ) :=

L O

Mk (Vi ),

i=1

on which the center kˆ acts as multiplication by k. Namely we define them as follows: CCk (Q, V ) = CCk ({Qi }, {Vi }) := CC(Mk (V )) = Mk (V )/gD ˙ Mk (V ), X CBk (Q, V ) = CBk ({Qi }, {Vi }) := CB(Mk (V )) = (Mk (V

)/gD ˙ Mk (V X

(1.27)

∗

)) . (1.28)

Hereafter we use the word “conformal coinvariants” and “conformal block” for this kind of conformal coinvariants and conformal blocks, namely those associated to Weyl modules, unless otherwise stated. It is easy to see that the spaces of conformal coinvariants and conformal blocks are NL determined by the finite-dimensional part V = i=1 Vi of Mk (V ) as is the case with the space of conformal coinvariants and conformal blocks on P1 (C) (e.g., Lemma 1 of [FFR]), because of the cohomology vanishing. In fact, Lemma 1.1 implies a decomposition, ˆD (1.29) gˆ D = gD ˙ ⊕g + , X L L Q i D ˆ where gˆ D ˙ -modules, + = i=1 g+ ⊕ Ck. Hence we have, as left gX gˆ D

V = U (gD Mk (V ) = Ind V = U (ˆgD ) ⊗U (gˆ D ˙ ) ⊗C V, X + )

(1.30)

gˆ D +

NL ˆ where V := action of gˆ D + is defined by the mapping k 7→ k · id and i=1 Vi and the Q L D tw the natural projection g+ → i=1 (g |Qi ). Therefore, due to the Ward identity (1.25) and the definition of the Weyl module (1.26), the space of conformal coinvariants is canonically isomorphic to the tensor product of gtw |Qi -modules by the natural inclusion NL NL NL map V = i=1 Vi ,→ Mk (V ), i=1 vi 7→ i=1 (1 ⊗ vi ). (In the following we shall identify vi ∈ Vi with 1 ⊗ vi ∈ Mk (Vi ).) Proposition 1.5. The inclusion map V ,→ Mk (V ) and the induced restriction map Mk (V )∗ V ∗ induce the following isomorphisms respectively: ∼

CCk (Q, V ) ← V =

L O i=1

Vi

and

∼

CBk (Q, V ) → V ∗ =

L O i=1

Vi∗ .

10


For any point P ∈ X = Xτ , let us denote the 1-dimensional trivial representation of gtw |P by CP = CuP . Then the proposition above readily leads to the following corollary. Corollary 1.6. Let P be a point of X distinct from Qi ’s. Then the canonical inclusion Mk (V ) ,→ Mk (CP ) ⊗ Mk (V ), v 7→ uP ⊗ v, induces an isomorphism ∼

CBk ({P, Qi }, {CP , Vi }) → CBk ({Qi }, {Vi }).

(1.31)

The property above is called propagation of vacua in [TUY]. In our case the proof is far simpler due to Proposition 1.5, as in the case of P1 (cf. Sect. 3 of [FFR].) 1.3. Correlation functions. The current and the energy-momentum correlation functions are defined as in Sect. 2 of [TUY], but we must take twisting into account and use the decomposition (1.7). First we consider the current correlation functions. Let 8 be a conformal block in CBk (Q, V ) and v a vector in Mk (V ). There exists a unique ωi ∈ (L∗a,b ⊗OX KX ⊗OX 1X )∧ Qi with the property that Rest=zi hfi , ωi i := 8(ρi (fi Ja,b )v)

for all fi ∈ (KX )∧ Qi ,

(1.32)

∗ ∧ where h·, ·i is the canonical pairing of (La,b )∧ P and (La,b )P , fi Ja,b can be regarded as an element of gQi by means of (1.10) and its action on the ith component of v (namely, the Mk (Vi )-component) is denoted by ρi (fi Ja,b ). Thus we obtain a linear functional L X i=1

ResQi h·, ωi i :

L M

(La,b ⊗OX KX )∧ Qi → C

i=1

∧ which maps (fi )L i=1 ∈ (La,b ⊗OX KX )Qi to The Ward identity (1.25) implies that L X i=1

ResQi hfQi , ωi i =

PL

L X

i=1

ResQi hfi , ωi i ∈ C.

8(ρi (fQi Ja,b )v) = 0

i=1

for any meromorphic section f ∈ H 0 (X, La,b (∗D)), where fQi is the germ of f at Qi . Since H 0 (X, La,b (∗D)) and H 0 (X, L∗a,b ⊗OX 1X (∗D)) are orthogonal complements LL LL ∗ to each other under the residue pairing of i=1 (La,b ⊗OX KX )∧ Qi and i=1 (La,b ⊗OX 1 ∧ KX ⊗ X )Qi (cf. [Tat] or Theorem 2.20 of [I]), we have a meromorphic 1-form ω with values in L∗a,b such that the germ of ω at Qi gives ωi and is holomorphic outside of {Q1 , . . . , QL }: ω ∈ H 0 (X, L∗a,b ⊗OX 1X (∗D)),

(ω)Qi = ωi .

(1.33)

In order to define the correlation functions, we need explicit expression of the action of (gtw )∧ P which we can identify with the affine Lie algebra by fixing a trivialization of gtw around P . Let P be any point of X and z(P ) a point of C whose image in X = C/(Z + τ Z) is equal to P . The description (1.4) of gtw naturally determines a local trivialization of gtw at P , once we fix the coordinate t = z(P ) of P . By means of this trivialization, we fix


11

isomorphisms gP ∼ = gˆ , gtw |P ∼ = g, and so on. The induced trivialization of La,b at P is the same as the trivialization defined by the isomorphism (1.13): ∼

∼

(La,b )P → OC,z(P ) ← OX,P

for (a, b) ∈ (Z/N Z)2 ,

(1.34)

which corresponds ∇-flat sections of La,b to constant functions on X. The decomposition (1.7) of gtw induces a decomposition of its stalk at P and is consistent with the trivializations above: M M Ja,b (La,b )P ∼ (1.35) Ja,b OX,P = g ⊗ OX,P , gtw = P = where the indices (a, b) run through (Z/N Z)2 r {(0, 0)}. Let ξ be a local coordinate at P . For A ∈ g and m ∈ Z, we denote by A[m] the ˆ P which is represented by Aξ m under the trivialization (1.35). element of (gtw )∧ P ⊂ g Since A[0] is ∇-flat for A ∈ g (i.e., ∇A[0] = 0), the bracket of gˆ P is represented as: [A[m], B[n]] = [A, B][m + n] + (A|B)mδm+n,0 kˆ

for A, B ∈ g and m, n ∈ Z,

which coincides with the usual commutation relation of the affine Lie algebra. ˜ Lemma 1.7. Under the situation above, let P be in X˙ (i.e., distinct from Qi ’s) and 8 the conformal block in CBk ({P, Qi }, {CP , Vi }) corresponding to 8 by the isomorphism (1.31). Then we have the following: (i) Take x ∈ La,b |P and let fx be an element of (La,b )∧ P with a principal part x/ξ (i.e., ˜ x Ja,b uP ⊗ v) dξ does not depend on the choice of ξ and fx = (x/ξ + regular)). Then 8(f ˜ a,b [−1]uP ⊗ v) dξ ∈ (L∗a,b ⊗OX 1X )|P by fx . Thus we can define 8(J ˜ a,b [−1]uP ⊗ v)i dξ := 8(f ˜ x Ja,b uP ⊗ v) dξ. hx, 8(J (ii) The following equation holds at any point P in X˙ = X r {Q1 , . . . , QL }: ˜ a,b [−1]uP ⊗ v) dξ ∈ (L∗a,b ⊗OX 1X )|P . ω(P ) = 8(J

(1.36)

Proof. The statement (i) can be shown by the same argument as the proof of Claim 1 of Theorem 2.4.1 of [TUY]. Using the Riemann-Roch theorem (or the function wa,b (t) defined by (1.14)), we can choose fx in (i) from H 0 (X, La,b (∗D + P )). Then we have ˜ x Ja,b uP ⊗ v) dξ ˜ a,b [−1]uP ⊗ v)i dξ = 8(f hx, 8(J =−

L X

˜ P ⊗ ρi (fx Ja,b )v) dξ 8(u

i=1

=−

L X

8(ρi (fx Ja,b )v) dξ = −

i=1

= ResP hfx , ωi dξ = ResP

L X

ResQi hfx , ωi dξ

i=1

hx, ωi dξ = hx, ω|P i. ξ

Here we have used the Ward identity (1.25) and the residue theorem. Thus we have proved Eq. (1.36).

12


˜ a,b [−1]uP ⊗ v) dξ a correlation function Definition 1.8. We call this 1-form ω = 8(J of the current Ja,b (ξ) and v under 8, or a current correlation function for short, and denote it by 8(Ja,b (P )v) dP or 8(Ja,b (ξ)v) dξ when we fix a local coordinate ξ. We now proceed to the definition of the energy-momentum correlation functions. Lemma 1.9. Let P be in X˙ and ξ a local coordinate defined on an open neighborhood U of P . Then the following expression gives a holomorphic section of 2X (∗D) = (1X )⊗2 (∗D) on sufficiently small U : X 1 lim 8(Ja,b (ξ(P )) J a,b (ξ(P 0 ))v) 8(S(P )v) (dξ(P ))2 := 2 P 0 →P (a,b) (1.37) k dim g dξ(P ) dξ(P 0 ). − (ξ(P ) − ξ(P 0 ))2 Here the indices (a, b) run through (Z/N Z)2 r {(0, 0)}, and J a,b is the dual basis of Ja,b with respect to (·|·), namely, J a,b = ε−ab J−a,−b /N . Proof. The argument in the proof of Lemma 1.7 and the Hartogs theorem of holomorphy show that the current correlation function 8(Ja,b (ξ)J a,b (ζ)v) dξ dζ defines a global section on X ×X of the sheaf F := (L∗a,b ⊗OX 1X (∗D))(L∗−a,−b ⊗OX 1X (∗D))(∗1), where 1 is the diagonal divisor of X × X. We define 8(S(P, P 0 )v) dξ(P ) dξ(P 0 ) ∈ H 0 (U × U, F ) by 8(S(P, P 0 )v) dξ(P ) dξ(P 0 )  X 8(Ja,b (ξ(P )) J a,b (ξ(P 0 ))v) − :=  a,b

 k dim g  dξ(P ) dξ(P 0 ). (ξ(P ) − ξ(P 0 ))2

In order to show Lemma 1.9, first take a local coordinate ξ on a sufficiently small neighborhood U of P with ξ(P ) = 0 and a local trivialization of La,b on U and choose a meromorphic section f ∈ H 0 (X, La,b ⊗OX KX ) whose Laurent expansion has the form f (ξ) = ξ −1 + (regular at ξ = 0). (1.38) The inclusion (1.10) and the Ward identity (1.25) imply that 8(Ja,b (ξ(P ))J a,b (ξ(P 0 ))v) is equal to a,b ˜ 8(J [−1]uP 0 ⊗ v) a,b [−1]uP ⊗ J

˜ = −8((f Ja,b )P 0 J a,b [−1]uP 0 ⊗ v) −

L X i=1

˜ a,b [−1]uP 0 ⊗ ρi ((f Ja,b )Qi )v), 8(J

(1.39) ˜ ∈ CB ({P, P 0 , Q }, {C , C 0 , V }), and 8 0 ˜ 0 where 8 ∈ CB ({P , Q }, {C , V }) cork i P P i k i P i respond to 8 through the isomorphism (1.31). The second term of (1.39) is regular as a function of P at P 0 as shown in Lemma 1.7 and the first term is rewritten as


˜ − 8((f Ja,b )P 0 J a,b [−1]uP 0 ⊗v) =

13

k 8(v)+(regular at P 0 = P ). (1.40) (ξ(P ) − ξ(P 0 ))2

(Details of the computation is the same as that of the proof of the assertion (4) of Theorem 2.4.1 of [TUY]. Note that [Ja,b , J a,b ] = 0.) Equations (1.39) and (1.40) mean that 8(S(P, P 0 )v) dξ(P ) dξ(P 0 ) is a holomorphic section of (L∗a,b ⊗OX 1X (∗D)) (L∗−a,−b ⊗OX 1X (∗D)) on U × U for the coordinate ξ that satisfies (1.38). Restricting it to the diagonal of U × U , we obtain a local holomorphic section 8(S(P )v) (dξ(P ))2 ∈ H 0 (U, 2X (∗D)). Note that since L∗a,b = L−a,−b , the factors L∗a,b and L∗−a,−b cancel out on the diagonal. Thanks to this fact, the trivialization of La,b ’s which we implicitly fixed in the argument above does not affect the result. Definition 1.10. Put κ := k + h∨ , where h∨ is the dual Coxeter number of g (i.e., h∨ = N because g = slN ). We call 8(S(ξ)v) (dξ)2 in (1.37) a correlation function of the Sugawara tensor S(ξ) and v under 8, or a Sugawara correlation function for short, and (1.41) 8(T (ξ)v) (dξ)2 := κ−1 8(S(ξ)v) (dξ)2 a correlation function of the energy-momentum tensor T (ξ) and v under 8, or a energymomentum correlation function for short when κ 6= 0. Let us calculate the coordinate transformation law of the correlation function of the Sugawara tensor. Let ζ be another coordinate on U . The differential dξ(P )dξ(P 0 )/(ξ(P )− ξ(P 0 ))2 transforms under the coordinate change ξ 7→ ζ = ζ(ξ) as dξ(P ) dξ(P 0 ) {ζ, ξ}(P 0 ) dζ(P ) dζ(P 0 ) dξ(P ) dξ(P 0 ) + O(ξ(P ) − ξ(P 0 )), = + (ζ(P ) − ζ(P 0 ))2 (ξ(P ) − ξ(P 0 ))2 6 (1.42) where {ζ, ξ} = ζ 000 /ζ 0 − 3/2(ζ 00 /ζ 0 )2 (ζ 0 = dζ/dξ) is the Schwarzian derivative of ζ = ζ(P ) with respect to ξ = ξ(P ). Hence the correlation function of the Sugawara tensor transforms with respect to a coordinate change ξ 7→ ζ = ζ(ξ) not as 2-differentials but as k dim g {ζ, ξ}8(v). (1.43) 8(S(ζ)v)dζ 2 = 8(S(ξ)v)dξ 2 + 12 This means that the family {8(S(ξ)v)dξ 2 } defines a meromorphic projective connection on X. (For the notion of projective connections on Riemann surfaces, see [Gu].) The Schwarzian derivative {ζ, ξ} vanishes identically if and only if ζ is a fractional linear transformation of ξ (i.e., ζ = (aξ + b)/(cξ + d)). From this fact it follows that {8(S(ξ)v) dξ 2 } behaves like a 2-differential under fractional linear coordinate changes. For later use, we compute the local expression of the energy-momentum correlation function around Qi . We take a holomorphic local chart (U, ζ) with Qi ∈ U and ζ(Qi ) = 0, a local trivialization of La,b by (1.34), and a trivialization of gtw Qi by (1.35). Under these trivializations, we have, due to (1.33), 8(Ja,b (P1 )J a,b (P2 )v) dP1 dP2 X L X = ζ1−m−1 ζ2−n+m−1 8(ρi ( ◦◦ J a,b [n − m]Ja,b [m] ◦◦ )v) i=1 n,m∈Z

+

k8(v) (ζ2 − ζ1 )2

dζ1 dζ2

(1.44)

14


if |ζ1 | > |ζ2 |. Here P1 , P2 ∈ U , ζ1 = ζ(P1 ), ζ2 = ζ(P2 ), and ◦◦ ◦◦ denotes the normal ordered product defined by ( A[m]B[n], if m < n, ◦ ◦ 1 (A[m]B[n] + B[n]A[m]), if m = n, A[m]B[n] = (1.45) 2 ◦ ◦ B[n]A[m], if m > n. Using (1.44), we obtain an expression of the correlation function (1.37) around the point Qi : X 8(S(ζ)v)(dξ)2 = ζ −m−2 8(ρi (S[m])v) (dζ)2 , (1.46) m∈Z

where S[m] are the Sugawara operators defined by: S[m] =

1 XX ◦ a,b [n] ◦◦ , ◦ Ja,b [m − n] J 2

(1.47)

a,b n∈Z

which satisfy the following commutation relations: [S[m], A[n]] = −κnA[m + n] for A ∈ g, (1.48) k dim g 3 (m − m)δm+n,0 id . (1.49) [S[m], S[n]] = κ (m − n)S[m + n] + 12 In particular the Sugawara operators S[m] commute with gP if κ = 0 (i.e., the level k is critical). When κ = k + h∨ 6= 0, the usual Virasoro operators are defined by normalizing S[m]: (1.50) T [m] := κ−1 S[m], which satisfy the well-known commutation relations: [T [m], A[n]] = −nA[m + n]

for A ∈ g, ck [T [m], T [n]] = (m − n)T [m + n] + (m3 − m)δm+n,0 id, 12

(1.51) (1.52)

where ck = k dim g/κ. Later we fix the local coordinate at Qi to ξi = t − zi and the one at P ∈ X to ξ = t − z with t(P ) = z, where t is the global coordinate of C (cf. (1.4)) and zi = t(Qi ). Lemma 1.11. In this coordinate 8(S(ξ)v) (dξ)2 and 8(T (ξ)v)(dξ)2 can be extended to global 2-differentials 8(S(t)v)(dt)2 and 8(T (t)v) (dt)2 . Proof. Under fractional linear coordinate changes, {8(S(ξ)v) (dξ)2 } behaves like a 2differential due to (1.43). Since the coordinate changes between two ξ’s are merely translations, if X is covered by these coordinates, then {8(S(ξ)v) (dξ)2 } gives a meromorphic 2-differential on X. Remark 1.12. Using Weierstraß’ ℘-function, we can prove the lemma above in a more explicit manner. In fact, since 1 2 + O((t1 − t2 ) ) dt1 dt2 ℘(t1 − t2 ) dt1 dt2 = (t1 − t2 )2


15

is a global meromorphic 2-form on X × X with a pole along the diagonal, we can equivalently replace definition (1.37) by X 1 2 lim 8(Ja,b (t(P ))J a,b (t(P 0 ))v) 8(S(P )v) (dt(P )) := 2 P 0 →P a,b . (1.53) 0 0 − k dim g · ℘(t(P ) − t(P )) dt(P ) dt(P ) This definition is meaningful globally on X and coincides with that of the proof above. 1.4. Action of the Virasoro algebra. When the level k is not −h∨ , the Lie algebras of formal meromorphic vector fields at Qi are projectively represented on Mk (V ) through the energy-momentum tensor. We denote by TX the tangent sheaf of X = Xτ (i.e., the sheaf of vector fields on X). Let us fix a local coordinate at Qi to ξi = t − zi , and denote by T D the direct sum of the Lie algebra of formal meromorphic vector fields at Qi for i = 1, . . . , L: T D :=

L M

T Qi ,

T Qi := (TX ⊗OX KX )∧ Qi =

i=1

L M

C((ξi ))

i=1

∂ . ∂ξi

(1.54)

The Virasoro algebra Vir D at D is defined to be the central extension of T D by Cˆc: Vir D := T D ⊕ Cˆc,

(1.55)

whose Lie algebra structure is defined by h L L i θi (ξi )∂ξi i=1 , ηi (ξi )∂ξi i=1 L L cˆ X ⊕ Res(θi000 (ξi )ηi (ξi ) dξi ), = (θi (ξi )ηi0 (ξi ) − ηi (ξi )θi0 (ξi ))∂ξi ξi =0 12 i=1

(1.56)

i=1

where θi (ξi ), ηi (ξi ) ∈ C((ξi )) and ∂ξi := ∂/∂ξi . When L = 1, this Virasoro algebra Vir D is the usual one defined as a central extension of the Lie algebra of vector fields on a circle. D on The action of θi = θi (ξi )∂ξi ∈ T Qi and that of θ = (θi (ξi )∂ξi )L i=1 ∈ T v ∈ Mk (V ) and 8 ∈ (Mk (V ))∗ are given by Ti {θi }v := −

P m∈Z

ρi (θi,m T [m])v,

T {θ}(v) :=

L X

Ti {θi }v,

(1.57)

Ti∗ {θi }8,

(1.58)

i=1

(Ti∗ {θi }8)(v) := −8(Ti {θi }v), where θi (ξi )∂ξi = Note that we have

P

T ∗ {θ}8 :=

L X i=1

m+1 ∂ ξi m∈Z θi,m ξi

is the Laurent expansion (cf. (1.47) and (1.50)).

(Ti∗ {θi }8)(v) = Resh8(T (ξi )v) (dξi )2 , θi (ξi )∂ξi i, ξi =0

where h·, ·i is the contraction of a 2-differential and a tangent vector.

(1.59)

16


L D Proposition 1.13. Let θ = (θi ∂ξi )L i=1 and η = (ηi ∂ξi )i=1 be elements of T . Then the operators T {θ}, T {η} acting on Mk (V ) and the operators T ∗ {θ}, T ∗ {η} acting on (Mk (V ))∗ satisfy

ck X Res(θi000 (ξi )ηi (ξi ) dξi ) id, ξi =0 12 L

[T {θ}, T {η}] = T {[θ, η]} +

(1.60)

i=1

[T ∗ {θ}, T ∗ {η}] = T ∗ {[θ, η]} −

ck X Res(θi000 (ξi )ηi (ξi ) dξi ) id, ξi =0 12 L

(1.61)

i=1

where ck = k dim g/κ and κ = k + h∨ . Namely, the definitions (1.57) and (1.58) define representations of Vir D on Mk (V ) with central charge ck and on (Mk (V ))∗ with central charge −ck respectively. This is a direct consequence of the definitions (1.57), (1.58) and the Virasoro commutation relations (1.52). As shown in Lemma 1.11, we have a global meromorphic 2-differential 8(T (t)v)(dt)2 for 8 ∈ CBk (Q, V ) and v ∈ Mk (V ). Therefore (1.59) and the residue theorem imply the following lemma. Lemma 1.14. Let 8 be a conformal block in CBk (Q, V ), θ(t)∂t in H 0 (X, TX (∗D)), and θi (ξi )∂ξi the Laurent expansion of θ(t)∂t in ξi for each i = 1, . . . , L. Denote (θi (ξi )∂ξi )L i=1 by θ. Then T ∗ {θ}8 = 0. 2. Critical Level and the XYZ Gaudin Model In this section we restrict ourselves to the case k = −h∨ = −N , namely the case when the level is critical. We showed in Sect. 1.2 that the conformal block is determined by its finitedimensional part. (See Proposition 1.5.) We shall see in this section that the Sugawara tensor is expressed by integrals of motion of the XYZ Gaudin model on this finitedimensional space. Indeed, it shall be shown that determining certain spaces of conformal blocks at the critical level is equivalent to solving the XYZ Gaudin model. First let us recall the definition of the XYZ Gaudin model, generalizing the definition in [Ga1, ST1] to slN case. Keeping in mind that we will show the relation of the conformal field theory and the XYZ Gaudin model, we use the same notation for slN modules, points Qi on a elliptic curve, etc. as in the previous section Sect. 1, and fix local coordinates at each point Qi to ξi = t − zi , where t is the global coordinate of C. The Hilbert space of the model is a tensor product of the finite-dimensional irreNL ducible representation spaces of slN (C): V := i=1 Vi . The generating function τˆ (u) of the integrals of motion of the model is defined as the trace of square of the quasiclassical limit T (u) of the monodromy matrix of the spin chain model associated with the Baxter-Belavin’s elliptic R-matrix: T (u) :=

L X

X

i=1 (a,b)6=(0,0)

τˆ (u) :=

1 tr(T (u))2 2

wa,b (u − zi )Ja,b ⊗ ρi (J a,b ),

(2.1)

Twisted WZW Models on Elliptic Curves L 1X = 2

X

17

w−a,−b (u − zi )wa,b (u − zj )ρi (Ja,b )ρj (J a,b ),

(2.2)

i,j=1 (a,b)6=(0,0)

where the indices of the summations over (a, b) run through a = 0, . . . , N − 1, b = 0, . . . , N − 1, (a, b) 6= (0, 0), and wa,b are functions defined by (1.14). As before, ρi is the representation of g on the ith factor Vi of V . The integrals of motion are encoded here in the following way: τˆ (u) =

L X

Ci ℘(u − zi ) +

i=1

L X

Hi ζ(u − zi ) + H0 ,

(2.3)

i=1

where ζ and ℘ are Weierstraß’ ζ and ℘ functions, Ci is the Casimir operator of g acting on Vi , i.e., 1 X ρi (Ja,b )ρi (J a,b ), (2.4) Ci = 2 (a,b)6=(0,0) PL and Hi (i = 1, . . . , L) and H0 are integrals of motion. The operators Hi satisfy i=1 Hi = 0, and hence there are L independent integrals of motion. Example 2.1. When N = 2, the Casimir operator Ci is equal to li (li + 1)idVi , where li = (dim Vi − 1)/2, and Hi are expressed as (cf. [ST1]): X X wa,b (zi − zj )ρi (Ja,b )ρj (J a,b ), Hi = j6=i (a,b)=(0,1),(1,1),(1,0) L X 1X −ea,b ρi (Ja,b )ρi (J a,b ) H0 = (2.5) 2 i=1 (a,b)=(0,1),(1,1),(1,0) ω X ωa,b a,b + −ζ ρi (Ja,b )ρj (J a,b ) , wa,b (zi − zj ) ζ zi − zj + 2 2 j6=i

where ωa,b = aτ + b and ea,b = ℘(ωa,b /2). We interpret this system as a twisted WZW model at the critical level. Let us come back to the situation in Sect. 1 and put u = t(P ). The slN -module Vi is assigned to the point Qi and regarded as the gtw |Qi module by the trivialization (1.35). Assign the vac˜ ∈ CBk ({P, Qi }, {CP , Vi }) uum module Mk (CP ) at P (k = −h∨ = −N ). As before 8 corresponds to a conformal block 8 ∈ CBk ({Qi }, {Vi }) through the isomorphism 2 ˜ (1.31). The correlation function of the Sugawara tensor 8(S(t)(u P ⊗ v)) (dt) has an expansion (1.46) at Qi and at P , X 2 ˜ P ⊗ ρi (S[m])v)(t − zi )−m−2 (dt)2 ˜ 8(u 8(S(t)(u P ⊗ v)) (dt) = m∈Z

=

X

m∈Z

−m−2 ˜ (dt)2 , 8(S[m]u P ⊗ v)(t − u)

(2.6)

P −m−2 ˜ (dt)2 , respectively. The right-hand side of (2.6) is m≤−2 8(S[m]u P ⊗ v)(t − u) since S[m]uP = 0 for all m = −1. Hence, evaluating (2.6) at t = u, we have X ˜ 8(ρi (S[m])v)(u − zi )−m−2 = 8(S[−2]u (2.7) P ⊗ v). m∈Z

18


Lemma 2.2. Let v be a vector in V =

NL i=1

S[−2]uP ⊗ v ≡ uP ⊗ τˆ (u)v

Vi . Then we have in CCk ({P, Qi }, {CP , Vi }).

Hence the right-hand side of (2.7) is equal to 8(τˆ (u)v). Proof. First note that S[−2]uP =

1 2

X

Ja,b [−1]J a,b [−1]uP .

(a,b)6=(0,0)

The key step is to exchange the operators Ja,b [−1] and J a,b [−1] with operators acting on v by using the Ward identity (1.24). Recall that the functions wa,b (t − u) (1.14) in Ka,b define meromorphic sections Ja,b,P (t) := wa,b (t − u)Ja,b ,

J a,b,P (t) := w−a,−b (t − u)J a,b ,

(2.8)

of gtw through the inclusion (1.13) and (1.10). These sections belong to π∗ gtw (P ). Since Ja,b,P (t) has a Laurent expansion Ja,b,P (t) =

Ja,b + wa,b,0 Ja,b + wa,b,1 Ja,b · (t − u) + O((t − u)2 ) t−u

(2.9)

at P (see (1.15)), and J a,b (t) has a similar expansion, we have Ja,b [−1]J a,b [−1]uP = (Ja,b,P (t))P (J a,b,P (t))P uP − kwa,b,1 uP ,

(2.10)

where k = −h∨ = −N . Summing up (2.10) for (a, b) and using (1.18), we obtain 1 X S[−2]uP = (Ja,b,P (t))P (J a,b,P (t))P uP . (2.11) 2 (a,b)6=(0,0)

Substituting (2.11) into S[−2]uP ⊗ v and swapping Ja,b,P (t) and then J a,b,P (t) by the Ward identity (1.24), we obtain S[−2]uP ⊗ v ≡ uP ⊗

L 1X 2

X

wa,b (zi − u)w−a,−b (zj − u)ρi (Ja,b )ρj (J a,b )v,

i,j=1 (a,b)6=(0,0)

which proves the lemma because of (1.17).

(2.12)

Corollary 2.3. [τˆ (u), τˆ (u0 )] = 0 for any u and u0 . In particular, Hi (i = 0, 1, . . . , L) commute with each other. Proof. Since the Sugawara operators S[m] commute with the affine Lie algebra at the critical level due to (1.48), we have A[n]S[m]uP = 0

for A ∈ g and n = 0.

Hence we can find the following formula in the similar way as the proof of Lemma 2.2: S[−2]uP 0 ⊗ S[m]uP ⊗ v ≡ uP 0 ⊗ S[m]uP ⊗ τˆ (u0 )v, where t(P ) = u, t(P 0 ) = u0 , and v ∈ V . Using this formula and Lemma 2.2, we obtain uP 0 ⊗ uP ⊗ τˆ (u)τˆ (u0 )v ≡ uP 0 ⊗ S[−2]uP ⊗ τˆ (u0 )v ≡ S[−2]uP 0 ⊗ S[−2]uP ⊗ v ≡ S[−2]uP 0 ⊗ uP ⊗ τˆ (u)v ≡ uP 0 ⊗ uP ⊗ τˆ (u0 )τˆ (u)v. This proves the corollary in view of Proposition 1.5.


19

Once the correspondence of the Hamiltonians of the XYZ Gaudin model and the correlation functions of the twisted WZW model is established, the eigenvalue problem of the XYZ Gaudin model is rewritten in terms of the conformal block of the twisted WZW model, as is the case with the (XXX) Gaudin model. (See [Fr2].) We sketch below how it goes, restricting ourselves to the sl2 case. For the general slN case, we should introduce higher order Sugawara operators, whose constructions are found in [Hay] and [GW]. Let us introduce a meromorphic (single-valued) function on X of the form q(t) =

L X

li (li + 1)℘(t − zi ) +

i=1

L X

µi ζ(t − zi ) + µ0 ,

(2.13)

i=1

where li = (dim Vi − 1)/2 (cf. Example 2.1), µi and µ0 are parameters satisfying PL P −n−2 be the Laurent expansion of n∈Z qi,n (t − zi ) i=1 µi = 0. Let qi (t − zi ) = qi q(t) at Qi . Denote by K (Vi ) the submodule of M−2 (Vi ) generated by the vectors (S[m] − qi,m )vi for vi ∈ Vi , m ∈ Z and put M qi (Vi ) := M−2 (Vi )/K qi (Vi ). Theorem 2.4. The space of conformal coinvariants and that of conformal blocks asNL qi sociated to the module M q (V ) := i=1 M (Vi ) are isomorphic to the quotient of NL µ V := i=1 Vi by the subspace J (V ) spanned by vectors of the form (Hi − µi )v for i = 0, 1, . . . , L and v ∈ V and its dual: CCk (M q (V )) ∼ = V /J µ (V ),

CBk (M q (V )) ∼ = (V /J µ (V ))∗ .

Proof. We prove the statement for the conformal blocks. The statement for the space of conformal coinvariants follows from this since it is finite-dimensional and dual to the space of conformal blocks. NL Let 8 be any linear functional on M−2 (V ) = i=1 M−2 (Vi ). A necessary and sufficient condition for 8 to be a conformal block in CBk (M q (V )) is that it vanishes on gD ˙ M−2 (V ) and on the subspaces X K q (V ) :=

L X

M−2 (V1 ) ⊗ · · · ⊗ K qi (Vi ) ⊗ · · · ⊗ M−2 (VL ).

i=1

First we show that this condition implies 8((Hi − µi )v) = 0 for i = 0, 1, . . . , L and v ∈V. The assumption is encapsulated in the following expression by a generating function, X 8(ρi (S[m] − qi,m )v)(u − zi )−m−2 = 0, (2.14) m∈Z

which means 8(τˆ (u)v) = q(u)8(v) because of (2.7) and Lemma 2.2. Thus (2.3) and (2.13) shows that 8(Hi v) = µi 8(v). We prove the converse statement next. Assume that 8 vanishes on the subspace J µ (V ). Let v be an arbitrary vector in M−2 (V ). We want to show that 8(ρi (S[m] − qi,m )v) vanishes for any m and i, but for this purpose we may assume v ∈ V without loss of generality. Indeed any v can be written in the form v = gX˙ v 0 by the decomposition 0 (1.29), where gX˙ ∈ U (gD ˙ ), v ∈ V , and therefore X ρi (S[m] − qi,m )v = gX˙ ρi (S[m] − qi,m )v 0 ,

20


b since S[m] belongs to the center of U−2 (sl(2)). The Ward identity (1.25) implies that 8(ρi (S[m] − qi,m )v) = 0 if 8(ρi (S[m] − qi,m )v 0 ) = 0. For v ∈ V , we can prove 8(ρi (S[m] − qi,m )v) = 0 by tracing back the first part of this proof. 3. Sheaves of Conformal Coinvariants and Conformal Blocks So far we have fixed the modulus τ of an elliptic curve and marked points on it. In this section we introduce sheaves of conformal coinvariants and conformal blocks on a family of pointed elliptic curves. 3.1. Family of pointed elliptic curves and Lie algebra bundles. In this subsection we construct a family of elliptic curves with marked points, a group bundle, and the associated Lie algebra bundle over this family. The fiber at a point of the base space of the family gives the group bundle Gtw and the Lie algebra bundle gtw on a pointed elliptic curve defined in Sect. 1.1. e and S by Recall that H denotes the upper half plane. We define X S := { (τ ; z) = (τ ; z1 , . . . , zL ) ∈ H × CL | zi − zj 6∈ Z + τ Z if i 6= j }, e := S × C. X e Let π˜ = πX e/S be the projection from X onto S along C and q˜i the section of π˜ given by zi : e for (τ ; z) = (τ ; z1 , . . . , zL ) ∈ S. q˜i (τ ; z) := (τ ; z; zi ) ∈ X A family of L-pointed elliptic curves π : X S is constructed as follows. Define e by the action of Z2 on X (m, n) · (τ ; z; t) := (τ ; z; t + mτ + n)

e for (m, n) ∈ Z2 , (τ ; z; t) ∈ X.

(3.1)

e by the action of Z2 : Let X be the quotient space of X e X := Z2 \X.

(3.2)

e Let πX e/X be the natural projection from X onto X and π = πX/S the projection from X onto S induced by π. ˜ We put qi := πX e/X ◦ q˜i ,

Qi := qi (S),

D :=

L [

Qi ,

˙ := X r D. X

i=1

PL Here qi is the section of π induced by q˜i and D is also regarded as a divisor i=1 Qi on X. The fiber of π at (τ, z) = (τ ; z1 , . . . , zL ) ∈ S is an elliptic curve with modulus τ and marked points z1 , . . . , zL . tw A group bundle Gtw X and a Lie algebra bundle gX on X are defined as follows. Due to e the definition of X, the Galois group of the covering πX e/X : X X is naturally identified e is given by (τ ; z; t) · (m, n) := (−m, −n) · (τ ; z; t). with Z2 . Its natural right action on X 2 e Then the covering πX e/X : X X is regarded as a principal Z -bundle on X. The actions of the Galois group Z2 on G and g are defined by


21

(m, n) · g := (β m αn )g(β m αn )−1 (m, n) · A := (β m αn )A(β m αn )−1

for g ∈ G and (m, n) ∈ Z2 , for A ∈ g and (m, n) ∈ Z2 .

(3.3) (3.4)

These actions produce the associated group bundle Gtw X and the associated Lie algebra on X: bundle gtw X e ×Z2 G, gtw := X e ×Z2 g. Gtw := X (3.5) X

X

Their fibers at a point (τ ; z) ∈ S can be identified with Gtw and gtw in Sect. 1.1. We denote the OX -Lie algebra associated to the Lie algebra bundle gtw X by the same tw , as mentioned in Sect. 1.2. The sheaf g can be written in the following symbol gtw X X form: e p ∈ Z2 }. ˜ = p · A(x) ˜ for x˜ ∈ X, gtw X = { A ∈ (πX e/X )∗ (g ⊗ OX e ) | A(p · x)

(3.6)

e which does not intersect (m, n) · U 0 for any Hence if we take an open subset U 0 of X 2 (m, n) ∈ Z r {(0, 0)} and denote by U the image of U 0 on X, then the restriction of gtw X on U can be canonically identified with g ⊗ OU : ∼ gtw X |U = (πU 0 /U )∗ (g ⊗ OU 0 ) = g ⊗ OU ,

(3.7)

∼

where πU 0 /U is the natural biholomorphic projection U 0 → U . Denote by 1X the sheaf of 1-forms on X and by 1X/S the sheaf of relative diftw ferentials on X over S. It follows from the definition of gtw X that gX possesses a nat1 tw tw ural connection ∇ : gX → gX ⊗OX X , which is induced by the trivial connection 1 id ⊗ d : g ⊗ OX e → g ⊗ X e through the identification (3.6). This means that, under the trivialization (3.7), the connection ∇ is identified with id ⊗ dU where dU is the exterior derivation on U . The relative connection ∇X/S along the fibers is defined to be the composite of the connection ∇ and the natural homomorphism 1X → 1X/S . Under the trivialization (3.7) and the coordinate (τ ; z; t), the relative connection ∇X e/S is equal to the exterior derivation by t. Define the invariant OX -inner product on gtw X by (A|B) :=

1 1 tr (ad A ad B) ∈ OX tr (ad A ad B) = 2h∨ gtwX 2N gtwX

for A, B ∈ gtw X,

(3.8)

where the symbol ad denotes the adjoint representation of the OX -Lie algebra gtw X . Under is equal to the inner product defined by the trivialization (3.7), the inner product on gtw X (1.1) and hence it is invariant under the translation along the connection ∇. Recall that ε = exp(2πi/N ). For (a, b) ∈ (Z/N Z)2 , the 1-dimensional representation (m, n) 7→ εbm+an of Z2 defines the associated flat line bundle La,b on X. We obtain the decomposition of gtw X into line bundles: M Ja,b La,b . (3.9) gtw X = (a,b)6=(0,0)

This is a sheaf version of (1.7). Lemma 3.1.

Rp π∗ gtw X = 0 for all p.

22


Proof. Since L∗a,b ⊗OX 1X/S is isomorphic to L−a,−b , it follows that R1 π∗ La,b ∼ = HomOS (π∗ L−a,−b , OS ) by the Serre duality. Therefore, because of the decomposition (3.9), it is enough to show π∗ La,b = 0 for (a, b) 6= (0, 0). Let U be any open subset of S and put V := π −1 (U ). For each s = (τ ; z) ∈ U , the restriction La,b |Xs of La,b on the fiber Xs := π −1 (s) can be identified with the line bundle La,b on Xτ defined in Sect. 1.1. Hence we obtain H 0 (Xs , La,b |Xs ) = 0 for each s ∈ S. In particular, for every f ∈ H 0 (V, La,b ) = H 0 (U, π∗ La,b ), the restriction f |Xs of f on the fiber vanishes for each s ∈ S and hence f vanishes itself. This means that H 0 (U, π∗ La,b ) = 0. We have proved the lemma. 3.2. Sheaf of affine Lie algebras. In this section we define a sheaf version of the Lie algebras gˆ D , gD ˙ , etc. on the base space S of the family. X For an OX -module F and a closed analytic subset W of X, the restriction F |W of F on W and the completion Fˆ |W = (F)∧ W of F at W are defined by n Fb|W = (F)∧ W := projlim (F/IW F ),

F|W := F /IW F,

(3.10)

n→∞

where IW is the defining ideal of W in X. Qi i D D D We define the OS -Lie algebras gQ ˙ as follows: S , gS,+ , gS , gS,+ , and gX tw ∧ i gQ S := π∗ (gX (∗Qi ))Qi , tw ∧ gD S := π∗ (gX (∗D))D =

tw ∧ i gQ S,+ := π∗ (gX )Qi , L M

i gQ S ,

tw ∧ gD S,+ := π∗ (gX )D =

i=1

L M

i gQ S,+ ,

(3.11)

i=1

tw gD ˙ := π∗ (gX (∗D)). X

The 2-cocycle of gD S is defined by ca (A, B) :=

L X i=1

ResQi (∇Ai |Bi ) =

L X

ResQi (∇X/S Ai |Bi ),

(3.12)

i=1

L D where A = (Ai )L i=1 , B = (Bi )i=1 ∈ gS and ResQi is the residue along Qi . Using the D 2-cocycle ca (·, ·), we define a central extension gˆ D S of gS : D ˆ gˆ D S := gS ⊕ OS k,

(3.13)

i where its Lie algebra structure is defined by the formula similar to (1.22). Put gˆ Q S := Qi Q ˆ which is a OS -Lie subalgebra of gˆ D . We call gˆ D (resp. gˆ i ) the sheaf of gS ⊕ OS k, S S S affine Lie algebras at D (resp. Qi ). D The diagonal embedding of gD ˙ into gS is defined to be the mapping which sends X L D A ∈ gD ˙ to (Ai )i=1 ∈ gS , where each Ai is the image of A given by the natural embedding X tw tw D D D ˆD gX (∗D) ,→ (gX (∗Qi ))∧ ˙ with its image in gS and g ˙ , Qi . We identify gX S . For A, B ∈ gX 1 we can regard (∇X/S A|B) as an element of π∗ X/S (∗D). Hence, using the residue

Qi theorem, we obtain that ca (A, B) = 0. Thus gD ˙ , as well as gS,+ , is an OS -Lie subalgebra X of gˆ D S. D ˆ Put gˆ D S,+ := gS ⊕ OS k. Then Lemma 3.1 implies the sheaf version of (1.29).


23

D ˆD Lemma 3.2. gˆ D ˙ ⊕g S = gX S,+ . D Proof. We can calculate Rp π∗ gtw ˙ ⊕ X for p = 0, 1 as the kernel and the cokernel of gX D D gS,+ → gS , which sends (aX˙ ; a+ ) to aX˙ − a+ . But then Lemma 3.1 means that both the D D kernel and the cokernel vanish and hence gD ˙ ⊕ gS,+ = gS . We have proved the lemma. X

e i := q˜i (S) which does not intersect (m, n)·U 0 Choose any open neighborhood U 0 of Q 2 for any (m, n) ∈ Z r {(0, 0)}. Then, applying the trivialization (3.7) to U 0 , we obtain a natural isomorphism i ∼ b (3.14) gQ S = g ⊗ π∗ (OX|Qi (∗Qi )), which does not depend on the choice of U 0 and is defined globally on S. Furthermore, using the coordinate (τ ; z; ξi ) with ξi = t − zi , we have the following isomorphism defined over S: i ∼ (3.15) gQ S = g ⊗ OS ((ξi )). Qi i Under this trivialization, gQ S,+ is identified with g ⊗ OS [[ξi ]] and the connections on gS induced by ∇ and ∇X/S are written in the following forms:

X ∂A ∂A ∂A dτ + dzi + dξi ∈ g ⊗ 1S ((ξi )) ⊕ g ⊗ OS ((ξi )) dξi , (3.16) ∂τ ∂zi ∂ξi L

∇A =

i=1

∂A dξi ∈ g ⊗ OS ((ξi )) dξi , ∇X/S A = ∂ξi

(3.17)

Q where A ∈ g ⊗ OS ((ξi )) ∼ = gS i . We also obtain the induced global trivialization of the sheaf of affine Lie algebras on S:

∼ gˆ D S =

L M

ˆ g ⊗ OS ((ξi )) ⊕ OS k.

(3.18)

i−1

Under this trivialization, the bracket of gˆ D S is represented in the following form:

L X L L ˆ , (B ⊗ g ) , B ] ⊗ f g + k (Ai |Bi ) Res(dfi · g), (3.19) (Ai ⊗ fi )L = [A i i i i i i i=1 i=1 i=1 i=1

ξi =0

where Ai , Bi ∈ g and fi , gi ∈ OS ((ξi )). 3.3. Definition of the sheaves of conformal coinvariants and conformal blocks. For any ˆD OS -Lie algebra a = gD S,+ , g S , etc., we denote by US (a) the universal OS -enveloping algebra of a and define the category of a-modules to be that of US (a)-modules. Definition 3.3. For any gˆ D S -module M, we define the sheaf CC(M) of conformal coinvariants and the sheaf CB(M) of conformal blocks by CC(M) := M/gD ˙ M, X CB(M) := HomOS (CC(M), OS ). Namely, the OS -module CC(M) is generated by M with relations

(3.20) (3.21)

24


AX˙ v ≡ 0

(3.22)

for all AX˙ ∈ gD ˙ , v ∈ M, and 8 ∈ CB(M) means that 8 belongs to HomOS (M, OS ) X and satisfies (3.23) 8(AX˙ v) = 0 for all AX˙ ∈ gD ˙ , v ∈ M. These equations, (3.22) and (3.23), are also called the Ward X identities. We can regard CC(·) as a covariant right exact functor from the category of gˆ D S -modules to that of OS -modules and similarly CB(·) as a contravariant left exact functor. The gˆ D S -modules of our concern are the sheaf version Mk (V ) of Mk (V ) in Sect. 1.2. We give two equivalent definitions of Mk (V ). First definition of Mk (V ). Fix an arbitrary complex number k. For each i = 1, . . . , L, ˆ let Mi be a representation with level k of the affine Lie algebra gˆ i := g ⊗ C((ξi )) ⊕ Ck. ˆ (Here gˆ i -modules are said to be of level k if the canonical central element k acts on them as k · id.) Assume the smoothness of Mi , namely, for any vi ∈ Mi , there exists m = 0 such that, for Ai1 , . . . , Aiν ∈ g, m1 , . . . , mν = 0, and ν = 0, 1, 2, . . ., (Ai1 ⊗ ξim1 C[[ξi ]]) · · · (Aiν ⊗ ξimν C[[ξi ]])vi = 0 if m1 + . . . + mν = m. (3.24) NL Put M := i=1 Mi and M := M ⊗ OS . Then M is a representation with level k of the LL affine Lie algebra (g⊕L )∧ := i=1 g ⊗ C((ξi )) ⊕ Ckˆ associated to g⊕L . We can define the gˆ D S -module structure on M by (Ai ⊗ fi (ξi ))L i=1 (v ⊗ a) :=

L X X

(ρi (Ai ⊗ ξim )v) ⊗ (fi,m a),

ˆ := kv, kv

(3.25)

i=1 m∈Z

P where Ai ∈ g, fi (ξi ) = m fi,m ξim ∈ OS ((ξi )), fi,m , a ∈ OS , v ∈ M , and ρi (Ai ⊗ξim ) denotes the action of Ai ⊗ ξim on the ith factors in v. If each Mi is the Weyl module Mk (Vi ) induced up from a finite-dimensional irreducible representation Vi of g, then we NL NL put V := i=1 Vi , M := Mk (V ) := i=1 Mk (Vi ), and M := Mk (V ) := Mk (V ) ⊗ OS and denote CC(M) and CB(M) by CCk (V ) and CBk (V ) respectively. Second definition of Mk (V ). Let Vi be a finite-dimensional irreducible representation NL of g and put V := i=1 Vi . Denote the constant sheaf associated to V by the same ˆ symbol V . Using the trivialization (3.18), we can define the action of gˆ S,+ = gD S,+ ⊕ OS k on V ⊗ OS by X ˆ := kv (Ai ⊗ fi (ξi ))L (ρi (Ai )v) ⊗ (fi (0)a), kv (3.26) i=1 (v ⊗ a) := i

where Ai ∈ g, fi (ξi ) ∈ OS [[ξi ]], a ∈ OS , v ∈ V , and ρi (Ai ) is the action of Ai on the ith factors in v. The gˆ D S -module MS,k (V ) induced from V ⊗ OS is defined by gˆ D S

MS,k (V ) := Ind (V ⊗ OS ) = US (ˆgD ) (V ⊗ OS ). S ) ⊗US (gˆ D S,+ gˆ D S,+

Using the decomposition

(3.27)


ÿ gˆ D S =

L M

25

! g⊗

ξi−1 OS [ξi−1 ]

⊕ gˆ D S,+ ,

i=1

we can show that MS,k (V ) has the following OS -free basis: ρi1 (As1 [m1 ]) · · · ρiν (Asν [mν ])vj ,

(3.28)

where ν = 0, 1, 2, . . ., in = 1, . . . , L, {As } is a basis of g, {vj } is a basis of V , and m1 5 · · · 5 mν < 0. This is also an OS -free basis of Mk (V ) and hence MS,k (V ) is isomorphic to Mk (V ) as a gˆ D S -module. In the following we identify Mk (V ) with MS,k (V ). This identification of the two definitions and Lemma 3.2 prove the sheaf version of Proposition 1.5. Proposition 3.4. Let Vi be a finite-dimensional irreducible representation of g for each NL i and put V := i=1 Vi . Then the natural inclusion V ⊗ OS ,→ Mk (V ) induces the following isomorphisms: ∼

CCk (V ) ← V ⊗ OS

and

∼

CBk (V ) → V ∗ ⊗ OS .

Proof. From the second definitions of Mk (V ) and Lemma 3.2, it follows that ∼

D Mk (V ) = US (ˆgD ˙ ) ⊗OS (V ⊗ OS ). ) (V ⊗ OS ) ← US (gX S ) ⊗US (gˆ D S,+

Namely, Mk (V ) is freely generated by V ⊗ OS over US (gD ˙ ). Hence we obtain the X formulae ∼ D D D CCk (V ) = Mk (V )/gD ˙ Mk (V ) ← US (gX ˙ )/gX ˙ US (gX ˙ ) ⊗OS (V ⊗ OS ) X ∼

← OS ⊗OS (V ⊗ OS ) = V ⊗ OS , ∼

CBk (V ) = HomOS (CCk (V ), OS ) → HomOS (V ⊗ OS , OS ) = V ∗ ⊗ OS . We have completed the proof of the proposition.

Corollary 3.5. For each i = 1, . . . , L, let Vi be a finite-dimensional irreducible representation of g and Mi a quotient module of the generalized Verma module Mk (Vi ) of the NL affine Lie algebra gˆ i . Put M := i=1 Mi and M := M ⊗ OS . Then the sheaf CC(M) of conformal coinvariants and the sheaf CB(M) of conformal blocks are OS -coherent. Proof. Since CB(M) = HomOS (CC(M), OS ), it suffices for the proof to see that CC(M) is coherent. The right exactness of the functor CC(·) and the fact that M is a quotient gˆ D S -module of Mk (V ) imply that CC(M) is a quotient OS -module of CCk (V ) = CC(Mk (V )), which is OS -coherent due to Proposition 3.4. Hence CC(M) is also OS -coherent.

26


4. Sheaf of the Virasoro Algebras This section provides the sheaf version of the Virasoro algebras and its actions on representations of the sheaf of the affine Lie algebra, which will be used in Sect. 5 to endow the sheaf of conformal coinvariants and the sheaf of conformal blocks with DS -module structures, when the level is not critical (i.e., κ = k + h∨ 6= 0). 4.1. Definition of the sheaf of the Virasoro algebras. We define the sheaf of the Virasoro algebras by L M OS ((ξi ))∂ξi ⊕ OS cˆ, (4.1) VirSD := TS ⊕ i=1

where TS is the tangent sheaf of S. The Lie algebra structure which we shall give to this OS -sheaf below reduces to the Virasoro algebra structure on Vir D (1.55), when S is replaced with a point. In order to define a Lie algebra structure on VirSD , we introduce the following notation: – For µ, ν ∈ TS , the symbol [µ, ν] denotes the natural Lie bracket in TS ; LL L – For θ = (θi )L O ((ξ ))∂ , the symbol [θ, η]0 = ([θi , ηi ]0 )L i=1 , η = (ηi )i=1 ∈ i=1 i=1 LLS i ξi denotes the natural Lie bracket in i=1 OS ((ξi ))∂ξi given by [θi (ξi )∂ξi , ηi (ξi )∂ξi ]0 = θi (ξi )ηi0 (ξi ) − ηi (ξi )θi0 (ξi ) ∂ξi . PL 000 – cV (θ, η) := i=1 Resξi =0 θi (ξi )ηi (ξi ) dξi (the symbol cV stands for “Cocycle defining the Virasoro algebra”); LL – For θ ∈ the symbols µ(θ) and µ(f ) denote the i=1 OS ((ξi ))∂ξi and f ∈ OS , L L natural actions of a vector field µ ∈ TS on i=1 OS ((ξi ))∂ξi and OS respectively. We define the Lie algebra structure on VirSD by [(µ; θ; f cˆ), (ν; η; g cˆ)] := ([µ, ν]; µ(η) − ν(θ) + [θ, η]0 ; (µ(g) − ν(f ) + cV (θ, η))ˆc),

(4.2)

where (µ; θ; f cˆ), (ν; η; g cˆ) ∈ VirSD . Note that VirSD is not an OS -Lie algebra but a CS -Lie algebra. We call VirSD the sheaf of Virasoro algebras on S. Remark 4.1. Later representations of VirSD shall be interpreted as representations of an D extension VirX ˙ of TS defined below and thus shall be given a DS -module structure. Let TX denote the tangent sheaf of the total space X and TX/S the relative tangent sheaf of the family π : X → S (i.e., the sheaf of vector fields along the fibers of π on X). Since π : X → S is smooth, we have the following short exact sequence: 0 → TX/S (∗D) → TX (∗D) → (π ∗ TS )(∗D) → 0. Note that π ∗ TS = OX ⊗π−1 OS π −1 TS does not possess a natural Lie algebra structure, but π −1 TS ⊂ π ∗ TS does. Defining TX,π (∗D) to be the inverse image of π −1 TS in TX (∗D), we obtain the following Lie algebra extension: 0 → TX/S (∗D) → TX,π (∗D) → π −1 TS → 0.


27

The direct image of this sequence by π is also exact and gives the following Lie algebra extension: D (4.3) 0 → TX˙D → VirX ˙ → TS → 0, where we put

D VirX ˙ := π∗ TX,π (∗D),

TX˙D := π∗ TX/S (∗D).

Remark 4.2. The exact sequence (4.3) is essential in the constructions of connections on the sheaf CC(M) of conformal coinvariants and the sheaf CB(M) of conformal blocks. Generally, a connection is defined to be an action of the tangent sheaf satisfying certain axioms. Using the exact sequence (4.3), we can obtain a connection if we have actions D D of VirX ˙ whose restriction on TX ˙ is trivial (cf. Lemma 4.10, Lemma 4.11, Lemma 4.12, Lemma 4.13, Lemma 4.14, and Theorem 5.1). D e is of ˜ X˙ to X Lemma 4.3. For a local section aX˙ of VirX ˙ = π∗ TX,π (∗D), its pull-back a the form: L X a˜ X˙ = µ0 (τ ; z)∂τ + µi (τ ; z)∂zi + θt (τ ; z; t)∂t , i=1

where µi = µi (τ ; z) ∈ OS and θt (τ ; z; t) is a meromorphic function globally defined along the fibers of π˜ with the following properties: 1. The poles of θt (τ ; z; t) are contained in π −1 (D); e/X X 2. The quasi-periodicity: θt (τ ; z; t + mτ + n) = θt (τ ; z; t) + mµ0 (τ ; z) for (m, n) ∈ Z2 .

(4.4)

e given by (τ ; z; t) 7→ Proof. Let (m, n) be in Z2 and fm,n denote the action of (m, n) on X (τ ; z; t + mτ + n). Then its derivative dfm,n sends ∂τ , ∂zi , and ∂t to ∂τ + m∂t , ∂zi , and ∂t respectively. Since a˜ X˙ induces the vector field aX˙ in π∗ TX,π (∗D), we obtain a formula µ + θt (t + mτ + n)∂t = dfm,n (ãX˙ ), which is equivalent to θt (t + mτ + n) = θt (t) + mµ0 , which proves the lemma.

PL The local section aX˙ is mapped to µ = µ0 ∂t + i=1 µi ∂zi ∈ TS by the projection along π in (4.3) and belongs to TX˙D = π∗ TX/S (∗D) if and only if µ = 0. Under the local coordinate ξi = t − zi , the local section aX˙ is uniquely represented in the following form: aX˙ = µ + θi (ξi )∂ξi ∈ TS ⊕ OS ((ξi ))∂ξi , where θi (ξi ) is the Laurent expansion of θt (τ ; z; t) in ξi = t − zi . Thus we obtain the LL D D embedding of VirX ˙ into TS ⊕ i=1 OS ((ξi ))∂ξi ⊂ VirS given by D VirX ˙ ,→ TS ⊕

L M

OS ((ξi ))∂ξi ⊂ VirSD ,

aX˙ 7→ (µ; θ) = (µ; (θi (ξi )∂ξi )L i=1 ). (4.5)

i=1 D D D We identify VirX ˙ with its image in VirS . For instance, (µ; θ; 0) ∈ VirX ˙ means that LL D D µ ∈ TS , θ ∈ i=1 OS ((ξi ))∂ξi , and (µ; θ; 0) belongs to the image of VirX ˙ in VirS . We D D D also identify the subsheaf TX˙ ⊂ VirX˙ with its image in VirS .

28


Remark 4.4. These formulations are essentially an application of the Beilinson-Schechtman theory in [BS] to our situation. The theory contains a natural construction of the Kodaira-Spencer map of a family of compact Riemann surfaces and its generalization to Virasoro algebras. For a brief sketch, see Appendix B. D D A natural question is whether or not the embeddings of VirX ˙ into VirS is a Lie algebra homomorphism. D D Lemma 4.5. The embedding VirX ˙ ,→ VirS is a Lie algebra homomorphism. D D Proof. Let aX˙ and bX˙ be in VirX ˙ = π∗ TX,π (∗D). Denote their images in VirS by e by a˜ ˙ and b˜ ˙ respectively. It suffices for (µ; θ; 0) and (ν; η; 0) and their pull-backs to X X X the proof to show that cV (θ, η) = 0. e the vector fields a˜ ˙ and b˜ ˙ are represented as Under the coordinate (τ ; z; t) of X, X X

b˜ X˙ = ν + η t (t)∂t ,

a˜ X˙ = µ + θt (t)∂t ,

(4.6)

where we write µ, ν ∈ TS in the following forms: µ = µ0 ∂τ +

L X

µi ∂ zi ,

ν = ν0 ∂ τ +

i=1

L X

ν i ∂ zi .

(4.7)

i=1

Here we omit the arguments (τ ; z) for simplicity: µi = µi (τ ; z), θt (t) = θt (τ ; z; t), etc. Because of Lemma 4.3, we have θt (t + mτ + n) = θt (t) + mµ0 ,

η t (t + mτ + n) = η t (t) + mν0 .

Hence we can define the relative meromorphic 1-form ω ∈ π∗ 1X/S (∗D) by ω = ω(t) dt :=

∂ 2 θt (t) ∂η t (t) dt. ∂t2 ∂t

(4.8)

Here, the well-definedness of ω as a 1-form in π∗ 1X/S (∗D) follows from the fact that the definition of ω(t) implies ω(t + mτ + n) = ω(t) for m, n ∈ Z. On the other hand, under the local coordinate (τ ; z; ξi ) of X around Qi given by ξi = t − zi , the vector fields aX˙ and bX˙ are represented in the following forms: aX˙ = µ + θi (ξi )∂ξi ,

bX˙ = ν + ηi (ξi )∂ξi ,

where µ and ν are the same as those in (4.6) and θi and ηi are given in terms of θt , η t , µi and νi in (4.6) and (4.7) by θi (ξi ) = θt (zi + ξi ) − µi ,

ηi (ξi ) = η t (zi + ξi ) − νi .

Thus by (4.8), we have ω=

∂ 2 θi (ξi ) ∂ηi (ξi ) dξi ∂ξi ∂ξi2

around Qi .

Hence the residue theorem leads to 2 L L X X ∂ θi (τ ; z; ξi ) ∂ηi (τ ; z; ξi ) Res dξ ResQi ω = 0. = − cV (θ, η) = − i ξi =0 ∂ξi ∂ξi2 i=1 i=1 This proves the lemma.


29

D D Remark 4.6. The same question about the embedding VirX ˙ ,→ VirS can be answered under a more general formulation for higher genus compact Riemann surfaces with a D D projective structure. However, in the higher genus case, the embedding VirX ˙ ,→ VirS is not always a Lie algebra homomorphism. The case of genus 1 is very special. See Appendix C for a short sketch of a formulation.

– – – – –

D ˆD In order to define the action of VirX ˙ on g S , let us introduce the following notation: L L L A = (Ai )L i=1 , B = (Bi )i=1 ∈ i=1 g ⊗ OS ((ξi )); 0 0 L ∼ LL g ⊗ OS ((ξi )) [A, B] = ([Ai , Bi ] )i=1 denotes the natural Lie bracket in gD S = i=1 given by the base extension of the Lie algebra g; ˆ (B; g k) ˆ ∈ gˆ D = gD ⊕ OS k; ˆ (A; f k), S S For µ ∈ TS , the symbol µ(A) denotes the natural actions of TS on gD S; LL L D The natural action of θ = (θi )i=1 ∈ i=1 OS ((ξi ))∂ξi on A ∈ gS is defined by

θi (Ai ⊗ fi (ξi )) := Ai ⊗ (θi (ξi )fi0 (ξi ))

θ(A) := (θi (Ai ⊗ fi ))L i=1 ,

where θi = θi (ξi )∂ξi and A = Ai ⊗ fi (ξi ) ∈ g ⊗ OS ((ξi )). D ˆD Then the action of VirX ˙ on g S is defined by D ˆ ˆD for (µ; θ; 0) ∈ VirX ˙ and (A; g k) ∈ g S , (4.9)

ˆ := [(µ; θ; 0), (A; g k)] ˆ (µ; θ; 0) · (A; g k)

D D where VirX ˙ is identified with its image in VirS by (4.5) and the bracket of the right-hand side is a Lie bracket in the semi-direct product Lie algebra VirSD n gˆ D S defined by

ˆ := (µ(A) + θ(A); µ(g)k) ˆ [(µ; θ; f cˆ), (A; g k)] ˆ ∈ gˆ D . for (µ; θ; f cˆ) ∈ Vir D and (A; g k) S

(4.10)

S

D D ˆD Lemma 4.7. The action of VirX ˙ on g ˙ . S preserves gX

Proof. Because of (3.16), under the identifications above, the restriction of the action D D tw of VirX ˙ = π∗ TX,π (∗D) on gX ˙ comes from the action of TX,π (∗D) on gX (∗D) via the tw D D connection ∇ on gX . Namely, if aX˙ ∈ VirX˙ , AX˙ ∈ gX˙ , and their images in VirSD and gˆ D S are denoted by (µ; θ; 0) and (A; 0) respectively, then [(µ; θ; 0), (A; 0)] = the image of ∇aX˙ AX˙ ∈ π∗ gtw (∗D) . D D D Thus we obtain [VirX ˙ , gX ˙ ] ⊂ gX ˙ .

4.2. Action of the sheaf of Virasoro algebras. In this subsection we define an action of ˆD the Lie algebra VirSD n gˆ D S on g S modules. Fix an arbitrary complex number k. For each i = 1, . . . , L, let Mi be a representation with level k of the affine Lie algebra gˆ i satisfying the smoothness condition (3.24). Put NL M := ⊗ O . Then M is a representation with level k of the i=1 Mi and M := M LL S affine Lie algebra (g⊕L )∧ = i=1 g ⊗ C((ξi )) ⊕ Ckˆ and M is a gˆ D S -module. The Sugawara operators S[m] acting on Mi ’s are given by the formula (1.47) and its action on the ith factor Mi in M is denoted by ρi (S[m]). Define the Sugawara tensor field by

30


S(ξ)(dξ)2 :=

X

ξ −m−2 S[m] (dξ)2 ,

m∈Z

and its action on Mi is denoted by ρi (S(ξi ))(dξ)2 . Then, by the same way as Lemma 1.11, we can prove the following lemma. Lemma 4.8. For any s ∈ S, 8 ∈ CB(M)s and v ∈ Ms , there exists a unique ω ∈ (π∗ 2X/S (∗D))s such that the expression of ω under the coordinate ξi coincides with 8(ρi (S(ξi ))v) (dξi )2 for each i = 1, . . . , L. We denote ω in Lemma 4.8 by 8(S(ξ)v) (dξ)2 or 8(S(P )v) (dP )2 , which is called a correlation function of the Sugawara tensor S(ξ) and v under 8, or a Sugawara correlation function for short. Assume that κ = k + h∨ 6= 0 and put ck := k dim g/κ. Define the Virasoro operators T [m] and the energy-momentum tensor T (ξ) by T [m] := κ−1 S[m],

T (ξ)(dξ)2 := κ−1 S(ξ)(dξ)2 ,

(4.11)

as in (1.50) and the energy-momentum correlation function 8(T (ξ)v) (dξ)2 to be κ−1 8(S(ξ)v) (dξ)2 as in (1.41). The action ρi (T [m]) of the Virasoro operators on Mi defines a representation of the Virasoro algebraPwith central charge ck = k dim g/κ (Lemma 1.13). For vi ⊗ g ∈ Mi ⊗ OS and θi = m∈Z θi,m ξim+1 ∂ξi ∈ OS ((ξi ))∂ξi , put X ρi (T {θi })(vi ⊗ g) = (ρi (−T [m])vi ) ⊗ (θi,m g). (4.12) m∈Z

For example, ρi (T {ξim+1 ∂ξi }) = ρi (−T [m]). For θ = (θi )L i=1 ∈ operator T {θ} acting on M is defined by T {θ} :=

L X

LL i=1

OS ((ξi ))∂ξi , the

ρi (T {θi }),

(4.13)

i=1

where we consider ρi (T {θi }) as an operator acting on the ith factor in M. Define the action of (µ; θ; f cˆ) ∈ VirSD on M by (µ; θ; f cˆ) · (v ⊗ g) := v ⊗ µ(g) + T {θ}(v ⊗ g) + ck v ⊗ (f g)

(4.14)

for v ⊗ g ∈ M = M ⊗ OS . The dual actions on M∗ := HomOS (M, OS ) are defined by (µ8)(v) := µ(8(v)) − 8(µ(v)), (ρ∗i (T ∗ {θi }8))(v) := −8(ρi (T {θi })v), T ∗ {θ} :=

L X

ρ∗i (T {θi }),

(4.15) (4.16) (4.17)

i=1

((µ; θ; f cˆ) · 8)(v) := µ(8(v)) − 8((µ; θ; f cˆ) · v),

(4.18)

where 8 ∈ M∗ , and v ∈ M. Since we have (µ; θ; f cˆ) · 8 = µ8 + T ∗ {θ}8 − ck f 8, ∗

VirSD

(4.19)

on M defines a representation of with central charge −ck . the action of The Virasoro operators T [m] satisfy the commutation relations (1.52). Therefore a straightforward calculation proves the following lemma. VirSD


31

D Lemma 4.9. The action of gˆ D S (3.25) and that of VirS (4.14) on M induce a represen, whose semi-direct product Lie algebra structure tation of the Lie algebra VirSD n gˆ D S is given by (4.10). D ∗ D D Define the actions of VirX ˙ on M and M through the embedding VirX ˙ ,→ VirS D and the actions of VirS . Then Lemma 4.5 and Lemma 4.9 immediately lead to the following lemma. D ∗ Lemma 4.10. These actions of VirX ˙ on M and M are representations of the Lie D algebra VirX˙ . D D Lemma 4.11. The action of VirX ˙ on M preserves gX ˙ M and hence defines a repreD sentation on CC(M) of the Lie algebra VirX˙ . D D Proof. Assume that αX˙ ∈ VirX ˙ ∈ gX ˙ , AX ˙ , and v ∈ M. Lemma 4.9 implies that

αX˙ AX˙ v = [αX˙ , AX˙ ]v + AX˙ αX˙ v, D and Lemma 4.7 means that [αX˙ , AX˙ ] ∈ gD ˙ AX ˙ v belongs to gX ˙ . Hence αX ˙ M and X D D D VirX˙ gX˙ M is included in gX˙ M. From Lemma 4.10 it follows that the induced action of D D D VirX ˙ on CC(M) = M/gX ˙ M defines a representation of the Lie algebra VirX ˙ .

As a result of Lemma 4.10, Lemma 4.11 and (4.15), we obtain the following lemma. D ∗ ∗ Lemma 4.12. The action of VirX ˙ on M preserves the subsheaf CB(M) of M and D defines a representation on CB(M) of the Lie algebra VirX˙ .

The actions of TX˙D on M and M∗ are also defined through the embedding TX˙D ,→ VirSD . Then, as in the proof of Lemma 1.14, we can show the following lemma from Lemma 4.8 thanks to the existence of the energy-momentum correlation function (4.11). Lemma 4.13. The action of TX˙D on M∗ satisfies TX˙D · CB(M) = 0. Using the exact sequence (4.3), Lemma 4.12, and Lemma 4.13, we can construct a flat connection on the sheaf CB(M) of conformal blocks in Sect. 5 (Remark 5.2). However, for the construction of a flat connection on the sheaf CC(M) of conformal coinvariants, we shall need the following lemma, as well as the exact sequence (4.3) and Lemma 4.11. Lemma 4.14. The action of TX˙D on M satisfies TX˙D M ⊂ gD ˙ M. X We remark that Lemma 4.14 implies Lemma 4.13, but the converse does not hold. The key point in the proof of Lemma 4.13 is the notion of the energy-momentum correlation function, which is not useful for the proof of Lemma 4.14. Hence we must find a direct proof of Lemma 4.14 without using the energy-momentum correlation functions. The rest of this subsection is devoted to the proof of this lemma along the course similar to that of [Ts]. ∼ LL g ⊗ OS ((ξi )) by We define the OS -inner product ( . , . ) on gD S = i=1 L X L (Ai |Bi ) Res(fi gi dξi ) (Ai ⊗ fi )L i=1 , (Bi ⊗ gi )i=1 = i=1

ξi =0

(4.20)

32


for Ai , Bi ∈ g and fi , gi ∈ OS ((ξi )). This inner product is non-degenerate and allows us to regard gD S as the topological dual of itself under the ξi -adic topologies. Putting R := { (a, b, m, i) | (a, b) ∈ (Z/N Z)2 r {(0, 0)}, m ∈ Z, i = 1, . . . , L },(4.21) m L m L em a,b,i = (ea,b,i,j (ξj ))j=1 := (δi,j Ja,b ⊗ ξj )j=1 ,

ea,b,i = m

L (ea,b,i m,j (ξj ))j=1

(4.22)

:= (δi,j J a,b ⊗ ξj−m−1 )L j=1 ,

(4.23)

we obtain the following topological dual OS -bases of gD S with respect to the inner product: F0 := { em a,b,i | (a, b, m, i) ∈ R },

F 0 := { ea,b,i | (a, b, m, i) ∈ R }. m

L D For A = (Ai )L i=1 , B = (Bi )i=1 ∈ gS , we introduce the following notation:

ρ(A) :=

L X

ρi (Ai ),

(4.24)

i=1 ◦ ◦ ◦ ρ(A)ρ(B) ◦

:=

L X

ρi ( ◦◦ Ai Bi ◦◦ ) +

i=1

X

ρi (Ai )ρj (Bj ).

(4.25)

i6=j

D Recall that, for θ = (θi (ξi )∂ξi )L i=1 ∈ TS , the Virasoro operator T {θ} acting on M is defined by (4.13), (4.12), (4.11) and (1.47). Using the dual bases above, we can represent the Virasoro operator T {θ} in the following form:

T {θ} = −

1 2κ

X

◦ a,b,i ◦ ρ(em

◦ ◦ θ)ρ(em a,b,i ) ◦ ,

(4.26)

(a,b,m,i)∈R

where we put L D ◦ θ := (ea,b,i ea,b,i m m,j (ξj )θj (ξj ))j=1 ∈ gS .

The formula (4.26) follows from the special cases with θ = (δi,j ξjn+1 ∂ξj )L j=1 for n ∈ Z, which are obtained by straightforward calculations. The bases which we really need later are however not these naively defined bases, F0 and F 0 , but “good” dual frames in the sense of [Ts]. See Lemma 4.15 and Remark 4.16 n below. In order to construct such dual bases, we define the meromorphic functions wa,b on H × C for n = 0, 1, 2, . . . by derivatives of wa,b (1.14): n n wa,b (t) = wa,b (τ ; t) :=

(−1)n ∂ n wa,b (τ ; t). n! ∂tn

(4.27)

n e can be regarded as a global section Then the meromorphic function wa,b (τ ; t − zi ) on X of the line bundle La,b (∗Qi ) on X and its Laurent expansion in ξi is written in the following form: n (ξi ) = ξi−n−1 + (−1)n wa,b,n + O(ξi ), (4.28) wa,b

where wa,b,n is the coefficient of tn in the Laurent expansion (1.15) of wa,b (t). For m ∈ OS ((ξj )) (a, b) ∈ (Z/N Z)2 r {(0, 0)}, i, j = 1, . . . , L, and m ∈ Z, we define fa,b,i,j by if m = 0, δi,j ξjm m fa,b,i,j (ξj ) := (4.29) −m−1 wa,b (zj − zi + ξj ) if m < 0,


33

and put m m m L = (Ja,b,i,j )L Ja,b,i j=1 := (Ja,b ⊗ fa,b,i,j (ξj ))j=1 , a,b,i m −m−1 a,b,i Jm = (Jm,j )j=1 := (J a,b ⊗ f−a,−b,i,j (ξj ))L j=1 .

(4.30)

Then the following topological OS -bases of gD S: m F1 := { Ja,b,i | (a, b, m, i) ∈ R },

a,b,i F 1 := { Jm | (a, b, m, i) ∈ R },

(4.31)

are dual to each other by virtue of the residue theorem. Changing the bases of expansion from F0 and F 0 to F1 and F 1 , we obtain another expression of T {θ} from (4.26): T {θ} = −

1 2κ

X

◦ a,b,i ◦ ρ(Jm

m ◦ θ)ρ(Ja,b,i ) ◦◦ ,

(4.32)

(a,b,m,i)∈R

where we put, as above, a,b,i a,b,i D ◦ θ := (Jm,j (ξj )θj (ξj ))L Jm j=1 ∈ gS .

The following lemma immediately follows from the definitions above and the decomposition (3.9) of gtw X to the direct sum of the line bundles La,b . Lemma 4.15. Under the notation above, we have the following: m m is equal to the image of Ja,b wa,b (τ ; t − zi ) ∈ gD 1. If m is negative, then Ja,b,i ˙ . X 2. If m is not negative and θ is a local section of TX˙D = π∗ TX/S (∗D) with θ = −m−1 a,b,i θt (τ ; z; t)∂t , then Jm ◦θ is equal to the image of J a,b w−a,−b (τ ; t−zi )θt (τ ; z; t) ∈ gD ˙ . X

Remark 4.16. This lemma means that the topological dual bases F1 and F 1 given by (4.31) are good dual frames of gD S in the sense of Tsuchimoto [Ts]. For fi ∈ OS ((ξi )), the regular part fi,+ and the singular part fi,− are uniquely defined by the conditions fi = fi,+ + fi,− ,

fi,+ ∈ OS [[ξi ]],

fi,− ∈ ξi−1 OS [ξi−1 ].

Note that the differentiation ∂ξi commutes the operations fi 7→ fi,± : ∂ξi (fi,± (ξi )) = (fi0 (ξi ))± , 0 which shall be denoted by fi,± (ξi ).

Lemma 4.17. For fi , gi ∈ OS ((ξi )), the normal product ◦◦ (J a,b ⊗ fi )(Ja,b ⊗ gi ) ◦◦ can be represented in the following forms: ◦ a,b ◦ (J

0 ⊗ fi )(Ja,b ⊗ gi ) ◦◦ = (J a,b ⊗ fi )(Ja,b ⊗ gi ) + kˆ Res(fi,+ (ξi )gi,− (ξi ) dξi ) ξi =0

= (Ja,b ⊗ gi )(J

a,b

0 ⊗ fi ) − kˆ Res(fi,− (ξi )gi,+ (ξi ) dξi ). ξi =0

34


Proof. From the commutativity of Ja,b and J a,b and the definition of the normal product (1.45), we can find that ◦ a,b ◦ (J

⊗ fi )(Ja,b ⊗ gi ) ◦◦ = (J a,b ⊗ fi )(Ja,b ⊗ gi,+ ) + (Ja,b ⊗ gi,− )(J a,b ⊗ fi ).

Using the definition (3.19) of the Lie algebra structure on gˆ D S , we obtain the formulae 0 (ξi )fi (ξi ) dξi [J a,b ⊗ gi,− , Ja,b ⊗ fi ] = kˆ Res gi,− ξi =0 0 (ξi ) dξi , (4.33) = kˆ Res fi,+ (ξi )gi,− ξi =0 [J a,b ⊗ fi , Ja,b ⊗ gi,+ ] = kˆ Res fi0 (ξi )gi,+ (ξi ) dξi ξi =0 0 (ξi ) dξi . (4.34) = −kˆ Res fi,− (ξi )gi,+ ξi =0

These formulae prove the lemma. For the brevity of notation, we introduce the sets R± by R+ := { (a, b, m, i) ∈ R | m = 0 },

R− := { (a, b, m, i) ∈ R | m < 0 },

where R is defined by (4.21). LL Lemma 4.18. For θ = (θi (ξi )∂ξi )L i=1 ∈ i=1 OS ((ξi ))∂ξi , we have the following expressions of the Virasoro operator T {θ}: X 1 X a,b,i m m a,b,i T {θ} = − ρ(Jm ◦ θ)ρ(Ja,b,i )+ ρ(Ja,b,i )ρ(Jm ◦ θ) , (4.35) 2κ R+

where the symbols

P

R−

denote the summations over (a, b, m, i) ∈ R± . Moreover, substi-

R±

tuting the definition (4.30) to this formula, we obtain 1 X X m ρj (J a,b ⊗ w−a,−b (zj − zi + ξj )θj (ξj )) ρi (Ja,b ⊗ ξim ) 2κ R+ j=1 (4.36) X m a,b,i ρ(Ja,b,i )ρ(Jm ◦ θ) . + L

T {θ} = −

R−

Proof. From the definitions (4.27), (4.29), and (4.30), we can derive the following formulae: m ) = ρi (Ja,b ⊗ ξim ) ρ(Ja,b,i a,b,i ρ(Jm

◦ θ) = ρi (J

a,b

⊗

for (a, b, m, i) ∈ R+ , ξi−m−1 θi (ξi ))

−m−1 m Ja,b,i,i = Ja,b ⊗ wa,b (ξi ) −m−1 −m ∂ξi (wa,b,+ (ξ)) = mwa,b,+ (ξi )

for (a, b, m, i) ∈ R− , for (a, b, m, i) ∈ R− , for (a, b, m, i) ∈ R− .

Therefore, applying Lemma 4.17 to the formula (4.32), we obtain ˆ T {θ} = (the right-hand side of (4.35)) + kR(θ),


35

where R(θ) is defined by X −m Res (ξi−m−1 θi (ξi ))− (−m)wa,b,+ (ξi ) dξi , R(θ) := R−

ξi =0

which is a finite sum and hence is a local section of OS . Hence, for the proof of (4.35), it is enough to show that R(θ) = 0 in the case of θ = (δi,j ξj−l ∂ξj )L j=1 for l = 1, 2, 3, . . . and i = 1, . . . , L. Using the formula (4.27) and the Laurent expansion (1.15) of wa,b (t), we can find that X −m Res (ξi−m−1 ξi−l )− (−m)wa,b,+ (ξi ) dξi m 2.

Hence we obtain R(θ) = 0 by (1.18). We have proved the first expression (4.35) of T {θ}. Remark 4.19. The second expression (4.36) of T {θ} is very useful for the explicit calculations of the elliptic Knizhnik-Zamolodchikov connections (Sect. 5.2). We are ready to prove Lemma 4.14. Let v be in M and θ in TX˙D . Then, applying Lemma 4.15 to the first expression (4.35) of T {θ} in Lemma 4.18 shows that T {θ}v ∈ gD ˙ M. Hence we have proved Lemma 4.14. X 5. Flat Connections We keep all the notation in the previous section. In this section, we assume κ = k+h∨ 6= 0 and shall construct DS -module structures on the sheaf CC(M) of conformal coinvariants and on the sheaf CB(M) of conformal blocks. We can show as a direct consequence that, under the assumption of Lemma 3.5, the sheaves CC(M) and CB(M) are locally free coherent OS -modules (i.e., vector bundles on S) with flat connections and their fibers at s ∈ S are canonically isomorphic to the space of conformal coinvariants and that of conformal blocks respectively. In Sect. 5.2, we shall show that the connections on CCk (V ) and CBk (V ) coincide with the elliptic Knizhnik-Zamolodchikov equations introduced by Etingof [E]. In Sect. 5.3, we shall obtain a proof of the modular property of the connections without referring to the explicit expressions of them. 5.1. Construction of flat connections. Recall that we have the Lie algebra extension D D D VirX ˙ of the tangent sheaf TS by TX ˙ in (4.3). Since the action of TX ˙ maps M into D D gX˙ M due to Lemma 4.14, the representation of the Lie algebra VirX˙ on CC(M) given by Lemma 4.11 induces the Lie algebra action of TS on CC(M), which shall be denoted by (5.1) TS × CC(M) → CC(M), (µ, φ) 7→ Dµ φ. Moreover it immediately follows from (4.14) that

36


Df µ v = f (Dµ v),

Dµ (f v) = µ(f )v + f (Dµ v)

for f ∈ OS , µ ∈ TS , and v ∈ CC(M). Thus we obtain the flat connection D on the sheaf CC(M) of conformal coinvariants. Because of CB(M) = HomOS (CC(M), OS ), we obtain the dual connection D∗ on CB(M): (Dµ∗ 8)(v) := µ(8(v)) − 8(Dµ v).

(5.2)

We can summarize the results as follows. Theorem 5.1. For each i = 1, . . . , L, let Mi be a representation with level k of the NL affine Lie algebra gˆ i satisfying the smoothness condition (3.24). Put M := i=1 Mi ⊕L ∧ and M := M ⊗ OS . Then M is a representation with level k of (g ) and M is a ∨ D gˆ D ˙ on M induces the S -module. Assume that κ = k + h 6= 0. Then the action of VirX DS -module structures on the sheaf CC(M) of conformal coinvariants and on the sheaf CB(M) of conformal blocks. Remark 5.2. We have another description of the dual connection D∗ on CB(M). D Lemma 4.13 shows that the representation of the Lie algebra VirX ˙ on CB(M) given by Lemma 4.12 induces the Lie algebra action of TS on CB(M), which coincides with the dual connection D∗ . Corollary 5.3. For each i = 1, . . . , L, let Vi be a finite-dimensional irreducible representation of g and Mi a quotient module of the Weyl module Mk (Vi ) over the affine Lie NL algebra gˆ i . Put M := i=1 Mi and M := M ⊗ OS . Assume that κ = k + h∨ 6= 0. Then the sheaf CC(M) of conformal coinvariants and the sheaf CB(M) of conformal blocks are locally free coherent OS -modules with flat connections on S and dual to each other. Proof. We have already shown the OS -coherencies of CC(M) and CB(M) in Corollary 3.5 and the existence of DS -module structures on CC(M) and on CB(M) in Theorem 5.1. It is well-known that any OS -coherent DS -module is OS -locally free. (See Theorem 6.1 in Chapter I of [Ho], Proposition 1.7 in Chapter VI of [BEGHKM], or Theorem 1.1.25 of [Bj].) Corollary 5.4. For each i = 1, . . . , L, let Mi be a representation with level k of the NL affine Lie algebra gˆ i satisfying the smoothness condition (3.24). Put M := i=1 Mi and M := M ⊗ OS . Then the fiber of CC(M) at s = (τ ; z) ∈ S is canonically isomorphic L to the space of conformal coinvariants for (X; {Qi }L i=1 ) = (Xτ ; {qi (s)}i=1 ): CC(M)|s ∼ (5.3) = CC(M ). Moreover, under the assumptions of Corollary 5.3, the fiber of CB(M) at s ∈ S is canonically isomorphic to the space of conformal blocks: (5.4) CB(M)|s ∼ = CB(M ). −1 Proof. We can identify the restriction of gtw (s) with gtw in Sect. 1. Put X on X = π Q := D ∩ X. Then we can find the following canonical isomorphisms without using Corollary 5.3: (5.5) M|s = (M ⊗ OS )|s ∼ = M ⊗ C = M, tw 0 tw ∼ (5.6) (π∗ gX (mD)) s = H (X, g (mQ)) for m = 0, Q D tw 0 tw gX˙ s = (π∗ gX (∗D)) s ∼ (5.7) = H (X, g (∗Q)) = gX˙ , ∼ Q M. (5.8) (gD ˙ M) s = gX ˙ X


37

The Riemann-Roch theorem shows that dimC H 0 (π −1 (s), gtw (mD)|π−1 (s) ) is a constant function of s ∈ S if m = 0. Therefore the existence of the isomorphism (5.6) follows from the Grauert theorem. (For the Grauert theorem, see, for example, Corollary 12.9 in Chapter III of [Har] and Theorem in Chapter 10 Sect. 5.3 (p. 209) of [GR].) The isomorphism (5.7) is obtained by the inductive limit of (5.6). The isomorphism (5.8) is obtained by using (5.5), (5.7), and applying the right exact functor (·)|s to the exact D sequence gD ˙ ⊗ OS M → g X ˙ M → 0. Similarly the isomorphism (5.3) is obtained from X (5.5) and (5.8). Under the assumption of Corollary 5.3, the sheaves CC(M) and CB(M) are locally free OS -modules of finite rank and dual to each other. Then we have CB(M)|s = HomC (CC(M)|s , C). This formula together with the isomorphism (5.3) gives the isomorphism (5.4). Let us describe the connections D and D∗ more explicitly. For any vector field µ ∈ TS , the action of Dµ on CC(M) and that of Dµ∗ on CB(M) are described as follows. D Using the short exact sequence (4.3), we can lift µ to an element (µ; θ; 0) ∈ VirX ˙ at D least locally on S, and the ambiguity in the choice of (µ; θ; 0) is equal to TX˙ . But, since the actions of TX˙D on CC(M) and CB(M) are trivial, the actions of (µ; θ; 0) on CC(M) and CB(M) do not depend on the choice of the lift and give Dµ and Dµ∗ : Dµ v = (µ; θ; 0) · v = µ(v) + T {θ}v, Dµ∗ 8 = (µ; θ; 0) · 8 = µ(8) + T ∗ {θ}8

(5.9) (5.10)

D for (µ; θ; 0) ∈ VirX ˙ and v ∈ CC(M), and 8 ∈ CB(M). The following lemma provides D us with the explicit formulae of (µ; θ; 0) ∈ VirX ˙ .

Lemma 5.5. The following expressions in the coordinate (τ ; z; t) define vector fields in D VirX ˙ = π∗ TX,π (∗D) and are lifts of ∂zi and ∂τ respectively: aX˙ (∂zi ) = ∂zi ,

aX˙ (∂τ ) = ∂τ + Z(τ ; z; t)∂t ,

(5.11)

e satisfying the where the function Z(τ ; z; t) is a global meromorphic function on X following properties: 1. The poles of Z(τ ; z; t) are contained in π −1 (D); e/X X 2. The quasi-periodicity: Z(τ ; z; t + mτ + n) = Z(τ ; z; t) + m for (m, n) ∈ Z2 .

(5.12)

e sends ∂τ , ∂z , and ∂t to ∂τ + n∂t , ∂z , and Proof. Since the action of (m, n) ∈ Z2 on X i i ∂t respectively, the expressions (5.11) are vector fields in π∗ TX,π (∗D). Example 5.6. We can use the following function as Z(τ ; z; t) for any i0 ∈ {1, . . . , L} (cf. Sect. 3.1 of [FW]): Z(τ ; z; t) = Z1,1 (τ ; t − zi0 ),

Z1,1 (τ ; t) = −

(See Appendix A for the notation of the theta functions.)

0 (t) 1 θ[0,0] . 2πi θ[0,0] (t)

(5.13)

38


Lemma 5.7. The connections D on CC(M) and D∗ on CB(M) possess the following expressions: D∂/∂zi = ∂zi − ρi (T {∂ξi }),

(5.14)

D∂/∂τ = ∂τ + T {Z(τ ; z; t)∂t } = ∂τ +

L X

ρi (T {Z(τ ; z; zi + ξi )∂ξi }),

(5.15)

i=1

∗ = ∂zi − ρ∗i (T {∂ξi }), D∂/∂z i

(5.16)

∗ = ∂τ + T ∗ {Z(τ ; z; t)∂t } = ∂τ + D∂/∂τ

L X

ρ∗i (T {Z(τ ; z; zi + ξi )∂ξi }). (5.17)

i=1

Proof. In the local coordinate (τ ; z; ξj ) with ξj = t − zj , the vector fields given by (5.11) can be represented in the following forms around Qi : aX˙ (∂zi ) = ∂zi − δi,j ∂ξj ,

aX˙ (∂τ ) = ∂τ + Z(τ ; z; zj + ξj )∂ξj .

(5.18)

Namely, their images in VirSD are of the following forms: aX˙ (∂zi ) = (∂zi ; (−δi,j ∂ξj )L j=1 ; 0),

aX˙ (∂τ ) = (∂τ ; (Z(τ ; z; zj + ξj )∂ξj )L j=1 ; 0), (5.19)

whose actions on CC(M) and CB(M) can be written in the forms (5.14), (5.15), (5.16), and (5.17). Remark 5.8. The explicit formulae (5.16) and (5.17) of the flat connection coincide with the expressions of ∇τ and ∇zi in Sect. 3.1 of [FW]. In order to prove the flatness of their connection, Felder and Wieczerkowski use the explicit expression of ∇zi , which corresponds to (5.24) in our case. However in our construction the flatness of the connection is a priori obvious. Lemma 4.5 is the key lemma for the proof of the flatness. Our proof of the flatness can be also applied to Proposition 3.4 of [FW]. 5.2. Elliptic Knizhnik-Zamolodchikov equations. In this subsection, we show that the flat connection on the sheaf of conformal blocks defined in Sect. 5.1 is nothing but the elliptic Knizhnik-Zamolodchikov equation introduced by Etingof [E], when M = Mk (V ). Let Vi be a finite-dimensional irreducible representation of g = slN (C) for each i NL and put V := i=1 Vi . In view of Proposition 3.4, the connections D on CCk (V ) and D∗ on CBk (V ) are regarded as connections on V ⊗ OS and on V ∗ ⊗ OS respectively and are dual to each other. In order to give the explicit expressions of these connections, we define the function Za,b (t) = Za,b (τ ; t) by wa,b (t) Za,b (t) := 4πi

0 0 (t) θ[a,b] θ[a,b] − θ[a,b] (t) θ[a,b]

.

(5.20)

The point t = 0 is an apparent singularity of this function and we can analytically continue Za,b (t) to t = 0 by ÿ 1 Za,b (0) := 4πi

00 θ[a,b] − θ[a,b]

0 θ[a,b] θ[a,b]

2 ! .

(5.21)


39

Theorem 5.9. As operators acting on V ⊗ OS and V ∗ ⊗ OS , the operators Dµ and Dµ∗ for µ = ∂/∂zi , ∂/∂τ possess the following expressions: D∂/∂zi =

∂ 1X − ∂zi κ

X

wa,b (zj − zi )ρj (Ja,b )ρi (J a,b ),

(5.22)

Za,b (zj − zi )ρj (Ja,b )ρi (J a,b ),

(5.23)

wa,b (zj − zi )ρ∗j (Ja,b )ρ∗i (J a,b ),

(5.24)

Za,b (zj − zi )ρ∗j (Ja,b )ρ∗i (J a,b ),

(5.25)

j6=i (a,b)6=(0,0)

D∂/∂τ =

L 1 X ∂ + ∂τ κ

X

∂ 1X + ∂zi κ

X

L 1 X ∂ − ∂τ κ

X

i,j=1 (a,b)6=(0,0)

∗ D∂/∂z = i

j6=i (a,b)6=(0,0)

∗ = D∂/∂τ

i,j=1 (a,b)6=(0,0)

where ρi (Ai ) and ρ∗i (Ai ) for Ai ∈ g act on the ith factor Vi of V and on the ith factor Vi∗ of V ∗ respectively. Here, for each (a, b) 6= (0, 0), the function wa,b (t) = wa,b (τ ; t) is defined by (1.14) and the function Za,b (t) = Za,b (τ ; t) by (5.20) and (5.21). Proof. Since the connections D and D ∗ are dual to each other, for the proof of the proposition it suffices to obtain either the formulae for Dµ (i.e., (5.22) and (5.23)) or those for Dµ∗ (i.e., (5.24) and (5.25)). The formulae (5.24) and (5.25) can be proved in the same way as Lemma 2.2 or the statements in Sect. 6 of [FFR]. But we shall give the proof of (5.22) and (5.23) using the expression (4.36) of T {θ} in Lemma 4.18. Let us fix v ∈ V ∗ ⊗ OS ⊂ M := Mk (V ). We rewrite ρi (T {∂ξi })v and T {Z(τ ; z; t)∂t }v in Lemma 5.7 modulo gD ˙ M in terms of operators acting on V , apX plying the Ward identity (3.22) in the following form: X n n (ξi ))v 0 ≡ − w−a,−b (zj − zi )ρj (J a,b )v 0 , (5.26) ρi (J a,b ⊗ w−a,−b j6=i

ρi (J a,b ⊗ ξ −n−1 )v 0 ≡ (−1)n+1 w−a,−b,n ρi (J a,b )v 0 X n − w−a,−b (zj − zi )ρj (J a,b )v 0 ,

(5.27)

j6=i

for v 0 ∈ V ⊗OS and n = 0. Here we used the Laurent expansion (4.28) and Lemma 4.15. First we prove (5.22) from (5.14). The formula (4.36) for θ = (δi,j ∂ξj )L j=1 together with Lemma 4.15 shows 1 X ρi (J a,b ⊗ w−a,−b (zi − zi0 + ξi ))ρi0 (Ja,b )v. (5.28) ρi (T {∂ξi })v ≡ − 2κ (a,b)6=(0,0) i0 =1,...,L

Applying the Ward identity (5.26) (n = 0) to the terms with i0 = i in the right-hand side of (5.28), we find that X X 1 w−a,−b (zj 0 − zi )ρj 0 (J a,b )ρi (Ja,b )v ρi (T {∂ξi })v ≡ 2κ (a,b)6=(0,0) j 0 6=i (5.29) X X 1 w−a,−b (zi − zi0 )ρi (J a,b )ρi0 (Ja,b )v. − 2κ 0 (a,b)6=(0,0) i 6=i

40


Renumbering the indices of the first sum by (a, b) → 7 (−a, −b) and applying (1.17) to the second, we conclude that 1 X X wa,b (zj − zi )ρj (Ja,b )ρi (J a,b )v. ρi (T {∂ξi })v ≡ κ (a,b)6=(0,0) j6=i

This means that the operator D∂/∂zi acting on V ⊗ OS is represented as (5.22). The expression (5.23) of D∂/∂τ can be deduced from (5.15) in the similar manner, which is however more involved. Let us use here the function Z(t) = Z(τ ; z; t) given by Example 5.6. Then Z(t) is regular at t 6= zi0 and the Laurent expansion of Z(t) in ξi0 = t − zi0 is represented as Z(zi0 + ξi0 ) = −

1 −1 (ξ + Z1 ξi0 + O(ξi30 )), 2πi i0

Z1 :=

000 θ[0,0] . 0 3θ[0,0]

(5.30)

The formula (4.36) and Lemma 4.15 imply T {Z(τ ; z; t)∂t }v ≡ −

1 2κ

X

L X

va,b,i,j ,

(5.31)

(a,b)6=(0,0) i,j=1

where the vectors va,b,i,j are defined by va,b,i,j := ρj (J a,b ⊗ w−a,−b (zj − zi + ξj )Z(zj + ξj ))ρi (Ja,b )v. Using the Laurent expansions (1.15), (5.30), and the Ward identities (5.26) (n = 0) and (5.27) (n = 0, 1), we can find the following expressions for the vectors va,b,i,j : • If j 6= i and j 6= i0 , then va,b,i,j = w−a,−b (zj − zi )Z(zj )ρj (J a,b )ρi (Ja,b )v. • If j = i and j 6= i0 , then va,b,i,i ≡ Z 0 (zi )ρi (J a,b )ρi (Ja,b )v −

X

w−a,−b (zj 0 − zi )Z(zi )ρj 0 (J a,b )ρi (Ja,b )v.

j 0 6=i

• If j 6= i and j = i0 , then 1 0 (−w−a,−b,0 w−a,−b (zi0 − zi ) + w−a,−b (zi0 − zi ))ρi0 (J a,b )ρi (Ja,b )v 2πi 1 X w−a,−b (zi0 − zi )w−a,−b (zj 0 − zi0 )ρj 0 (J a,b )ρi (Ja,b )v. + 2πi 0

va,b,i,i0 ≡ −

j 6=i0

• If j = i and j = i0 , then 1 (2w−a,−b,1 − (w−a,−b,0 )2 + Z1 )ρi0 (J a,b )ρi0 (Ja,b )v va,b,i0 ,i0 ≡ − 2πi 1 X 0 (−w−a,−b (zj 0 − zi0 ) + w−a,−b,0 w−a,−b (zj 0 − zi0 ))ρj 0 (J a,b )ρi0 (Ja,b ) v. + 2πi 0 j 6=i0

1 0 Note that w−a,−b (t) = −w−a,−b (t). Substituting these expressions into (5.31), we obtain an expression for T {Z(τ ; z; t)∂t }v like


T {Z(τ ; z; t)∂t }v = −

1 2κ

X

41

Za,b,i,j ρj (Ja,b )ρi (J a,b )v

with Za,b,j,i = Z−a,−b,i,j .

(a,b)6=(0,0) i,j=1,...,L

(5.32) Note that ρj (J a,b )ρi (Ja,b ) = ρj (J−a,−b )ρi (J −a,−b ). It is possible to compute all Za,b,i,j directly, but we take a short cut. Since the coefficients Za,b,i,j do not depend on the representations Vi and the choice of Z(t) = Z(τ ; z; t), for the determination of all Za,b,i,j , we have only to calculate Za,b,i0 ,j for j = 1, . . . , L. Picking up the terms which should be contained in Za,b,i0 ,j from the expressions for va,b,i,j , we obtain 1 (2wa,b,1 − (wa,b,0 )2 + Z1 ), 2πi = wa,b (zj − zi0 )Z(zj ) 1 0 (−wa,b,0 wa,b (zj − zi0 ) + wa,b − (zj − zi0 )), 2πi

Za,b,i0 ,i0 = − Za,b,i0 ,j

(5.33)

(5.34)

where we assume j 6= i0 . (Here we have used the formulae (1.17) and w−a,−b,ν = (−1)ν+1 wa,b,ν .) Note that we have 0 (t) wa,b

0

= wa,b (t)(log wa,b (t)) = wa,b (t)

0 0 (t) θ[0,0] θ[a,b] (t) − θ[a,b] (t) θ[0,0] (t)

.

(5.35)

Substituting (1.14), (1.16), (5.35), and the expression of Z1 in (5.30) to (5.33) and (5.34), we can find the following results: Za,b,i0 ,i0 = −2Za,b (0) = −2Za,b (zi0 − zi0 ), Za,b,i0 ,j = −2Za,b (zj − zi0 ). These formulae complete the proof of (5.23).

(5.36) (5.37)

Remark 5.10. Etingof found in [E] that certain twisted traces F of vertex operators of ∗ b N satisfy D∗ sl ∂/∂zi F = 0 and D∂/∂τ F = 0, namely, that they are flat sections of the ∗ connection D . 5.3. Modular invariance of the flat connections. In [E], Etingof proved the modular invariance of the elliptic Knizhnik-Zamolodchikov equations by explicit computation. In this subsection, we give a more geometric proof of this fact without use of the explicit formulae of the connections. A similar property was proved for the non-twisted WZW model on elliptic curves in [FW]. For the brevity of notation, we introduce the following symbols: – Put 0 := SL2 (Z); – In the following, we assume that the symbol γ always denote an arbitrary matrix ( a bc d ) ∈ 0; – For s = (τ ; z) ∈ S, t ∈ C, and i = 1, . . . , L, put z L z aτ + b i , zˇ = (zˇi )L = := , i=1 cτ + d cτ + d cτ + d i=1 t ξi tˇ := , ξˇi := , sˇ := (τˇ ; z). ˇ cτ + d cτ + d τˇ :=

42


As is well-known, γ ∈ 0 acts on S by γ · s := sˇ = (τˇ ; z) ˇ

for s = (τ ; z) ∈ S.

(5.38)

tw 2 In order to define actions of 0 on X, Gtw X and gX , we first extend the actions of Z e on X, G, and g defined by (3.1), (3.3), and (3.4) respectively to those of the semi-direct e := Z2 o 0 defined by product group 0

(m, n; γ)(m0 , n0 ; γ 0 ) := (m + m0 d − n0 c, n − m0 b + n0 a; γγ 0 ) for (m, n; γ), (m0 , n0 ; γ 0 ) ∈ Z2 × 0. Then we have (m, n; 1)(0, 0; γ) = (0, 0; γ)(ma + nc, mb + nd; 1)

(5.39)

for (m, n) ∈ Z2 and γ ∈ 0. e is a standard one. An element γ of 0 acts on X e as e on X The action of 0 γ · (τ ; z; t) := (τˇ ; z; ˇ tˇ)

e for (τ ; z; t) ∈ X.

e e on X. This action together with (3.1) induces an action of 0 e The actions of 0 on G and g are defined via the action of 0 on the Heisenberg group HN , the central extension of (Z/N Z)2 defined by HN := C× × (Z/N Z)2 . Here the group structure of HN is given by 0

(r; m, n)(r0 ; m0 , n0 ) := (rr0 εnm ; m + m0 , n + n0 ) for (r; m, n), (r0 ; m0 , n0 ) ∈ HN . The Heisenberg group HN is isomorphic to the group ˆ and rˆ for r ∈ C× with defining relations generated by the symbols α, ˆ β, ˆ αˆ N = βˆ N = 1, αˆ βˆ = εβˆ α, ˆ ˆ rˆαˆ = αˆ r, ˆ rˆβ = β r, ˆ rd 1 r2 = rˆ1 rˆ2

for r, ri ∈ C× ,

(5.40)

where the identification of the group with HN is given by (r; m, n) = rˆβˆ m αˆ n .

(5.41)

Thus the matrices α and β given by (1.2) define a representation of HN on CN by αv ˆ = αv,

ˆ = βv, βv

rv ˆ = rv

for v ∈ CN and r ∈ C× .

Note that this representation is irreducible. For γ ∈ 0 = SL2 (Z), we can define the group automorphism (·)γ of HN as follows: – If N is odd, then we put (r; 0, 0)γ := (r; 0, 0), (1; 1, 0)γ := (1; a, b), for r ∈ C× .

(1; 0, 1)γ := (1; c, d)

(5.42)


43

– If N is even, then we put (r; 0, 0)γ := (r; 0, 0), √ √ (1; 1, 0)γ := (( ε)ab ; a, b), (1; 0, 1)γ := (( ε)cd ; c, d) √ for r ∈ C× , where ε = exp(πi/N ), a primitive (2N )−th root of unity.

(5.43)

The fact that this defines an automorphism of HN follows from the presentation (5.40) and (5.41). Note that this action of 0 on HN induces that of 0 on (Z/N Z)2 given by (m, n)γ = (ma + nc, mb + nd) for (m, n) ∈ (Z/N Z)2 and γ ∈ 0 (cf. (5.39)). Twisting the representation of HN on CN by γ ∈ 0, we obtain another irreducible representation of HN on CN : HN × CN → CN ,

(h, v) 7→ hγ v.

Since the Heisenberg group HN has a unique irreducible representation, up to isomorphism, in which rˆ ∈ HN for r ∈ C× acts as multiplication by r (the theorem of von Neumann and Stone), using the Schur lemma, we can find that there is xγ ∈ GLN (C), uniquely determined up to scalar multiplications, such that hxγ v = xγ hγ v

for h ∈ HN and γ ∈ 0.

(5.44)

The mapping γ 7→ xγ induces a group homomorphism from 0 = SL2 (Z) into P GLN (C), which does not depend on the choices of xγ ’s. In the following, we take xγ from G = SLN (C), which is uniquely determined up to factor ±1 by γ. 0 1 Example 5.11. For γ = , xγ ∝ (ε−(a−1)(b−1) )N a,b=1 . −1 0 Example 5.12. When N is odd, we can choose xγ = 1 for γ ∈ 0(N ) = { γ ∈ 0 | γ ≡ 1 mod N }. e on G and g are now defined by The desired actions of 0 (m, n; γ) · g := (β m αn xγ )g(β m αn xγ )−1 (m, n; γ) · A := (β m αn xγ )A(β m αn xγ )−1

e and g ∈ G, for (m, n; γ) ∈ 0 e and A ∈ g. for (m, n; γ) ∈ 0

These are extensions of the actions of Z2 given by (3.3) and (3.4). e × G and X e × g are defined by e on X The diagonal actions of 0 γ˜ · (x; ˜ g) := (γ˜ · x; ˜ γ˜ · g) γ˜ · (x; ˜ A) := (γ˜ · x; ˜ γ˜ · A)

e × G and γ˜ ∈ 0, e for (x; ˜ g) ∈ X e × g and γ˜ ∈ 0. e for (x; ˜ A) ∈ X

Note that these actions also do not depend on the choice of xγ . e defined above, we obtain the induced actions of 0 on X, Gtw , From the actions of 0 X tw and gX defined by γ · x := πX ˜ e/X (γ · x)

for x = πX ˜ ∈ X, e/X (x)

γ · g tw := [γ · (x; ˜ g)]

for g tw = [(x; ˜ g)] ∈ Gtw X,

γ · Atw := [γ · (x; ˜ A)]

for Atw = [(x; ˜ A)] ∈ gtw X,

44


e (x; e × g are representatives of x ∈ X, where x˜ ∈ X, ˜ g) ∈ X × G, and (x; ˜ A) ∈ X tw tw e ˜ A) ∈ X × g respectively and γ ∈ 0 is identified with (0, 0; γ) ∈ 0. g ∈ GX , and (x; tw tw e (For the definitions of X, GX , and gX , see (3.2) and (3.5).) The projections X S, gtw X X, etc. are equivariant with respect to the actions of 0. Moreover we obtain the following induced equivariant actions of γ ∈ 0 on TS , gD S, and VirSD : – The biholomorphic map γ : S → S induces the Lie algebra isomorphism (·)γ : ∼ γ −1 TS → TS of vector fields: ÿ ! L L X X ˇ (cτ + d)2 ∂τ + c(cτ + d)zi ∂zi + µi (s)(cτ ˇ + d)∂zi (5.45) µγ := µ0 (s) i=1

i=1

PL

for µ = µ0 (s)∂τ + i=1 µi (s)∂zi ∈ γ −1 TS . (Formally µγ is obtained by the substitution of sˇ = (τˇ ; z) ˇ in s = (τ ; z).) ∼ D ˆ S is defined by – The Lie algebra isomorphism (·)γ : γ −1 gˆ D S →g ˆ γ := (x−1 (Ai (s; ξi ))L ˇ ξˇi )xγ )L ˇ kˆ (5.46) i=1 ; f (s)k γ Ai (s; i=1 ; f (s) for Ai (s; ξi ) ∈ γ −1 (g ⊗ OS ((ξi ))) and f (s) ∈ γ −1 OS . ∼ – The Lie algebra isomorphism (·)γ : γ −1 VirSD → VirSD is defined by γ c := µγ ; (µ0 (s)c(cτ µ; (θi )L ˇ + d)ξi ∂ξi + θi (s; ˇ ξˇi )(cτ + d)∂ξi )L ˇc i=1 ; f (s)ˆ i=1 ; f (s)ˆ (5.47) PL for µ = µ0 (s)∂τ + i=1 µi (s)∂zi ∈ γ −1 TS , θi = θi (s; ξi )∂ξi ∈ γ −1 (OS ((ξi ))∂ξi ), and f (s) ∈ γ −1 OS . The formula (5.47) reflects the fact that the vector field ˇ τˇ + µ0 (s)∂

L X

µi (s)∂ ˇ zˇi + θi (s; ˇ ξˇi )∂ξˇi

i=1

represented in the local coordinate (τˇ ; z; ˇ ξˇi ) is equal to ÿ ! L X 2 ˇ (cτ + d) ∂τ + c(cτ + d)zi ∂zi + c(cτ + d)ξi ∂ξi µ0 (s) i=1

+

L X

µi (s)(cτ ˇ + d)∂zi + θi (s; ˇ ξˇi )(cτ + d)∂ξi

i=1

represented in the local coordinate (τ ; z; ξi ). Therefore the following lemma is a direct consequence of the definitions above. Lemma 5.13. For γ ∈ 0, the isomorphisms above satisfy the following: 1. The isomorphisms induce the following Lie algebra isomorphism: ∼ D ˆD (·)γ : γ −1 VirSD n gˆ D S → VirS n g S. γ D 2. (γ −1 gD ˙ ) = gX ˙ . X


45

D γ D −1 D γ 3. (γ −1 VirX TX˙ ) = TX˙D . ˙ ) = VirX ˙ and (γ

Under these preparations, we show the modular property of the sheaf of conformal coinvariants CC(M) and the sheaf of conformal blocks CB(M) coming from quotients of the Weyl modules. For each i = 1, . . . , L, let Vi be a finite-dimensional irreducible representation of g NL and Mi a quotient of the Weyl module Mk (Vi ). Put M := i=1 Mi and M := M ⊗ OS . We denote by ci the eigenvalue of the Casimir operator Ci acting on Vi given by (2.4) and put 1i := κ−1 ci . The Virasoro operator ρi (T [0]) acting on M is diagonalizable and each of its eigenvalues is of the form 1i + m, where m is a non-negative integer. Thus, fixing branches of the holomorphic functions (cτ + d)Ci /κ on the upper half plane H, QL we obtain an operator (cτ + d)ρ(T [0]) = i=1 (cτ + d)ρi (T [0]) acting on M, where we put PL ρ(T [0]) := i=1 ρi (T [0]). ∼ For γ ∈ 0, define the isomorphism (·)γ : γ −1 M → M by ρ(T [0]) v(s) ˇ v(s)γ := x−1 γ (cτ + d)

for v(s) ∈ γ −1 M,

(5.48)

regarding x−1 γ as an automorphism of M through the natural diagonal action of G = ∼ SLN (C) on M . This isomorphism (5.48) induces the isomorphism (·)γ : γ −1 (M∗ ) → ∗ M given by ρ 8(s)γ := x−1 γ (cτ + d)

where we put ρ∗ (T [0]) :=

PL i=1

∗

(T [0])

8(s) ˇ

for 8(s) ∈ γ −1 (M∗ ),

(5.49)

ρ∗i (T [0]).

Lemma 5.14. For P ∈ VirSD n gˆ D S , v ∈ M, and γ ∈ 0, we have (P · v)γ = P γ · v γ . Proof. Since ρi (T {ξi ∂ξi }) = −ρi (T [0]), we have the following identity of operators acting on M (cf. (5.47)): µ0 (s)(cτ ˇ + d)2 ∂τ +

L X

ρi (T {µ0 (s)c(cτ ˇ + d)ξi ∂ξi })

i=1

(5.50)

ˇ + d)2 ∂τ ) · (cτ + d)−ρ(T [0]) . = (cτ + d)ρ(T [0]) · (µ0 (s)(cτ Similarly, the definition of VirSD n gˆ D S (4.10) leads to the following identities: e−rρi (T [0]) ρi (Ai ⊗ fi (ξi ))erρi (T [0]) = ρi (Ai ⊗ fi (er ξi )), e−rρi (T [0]) ρi (θi (ξi )∂ξi )erρi (T [0]) = ρi (θi (er ξi )e−r ∂ξi ), g ρi (θi (ξi )∂ξi ) = ρi (θi (ξi )∂ξi ) g

(5.51) (5.52) (5.53)

for Ai ⊗ fi (ξi ) ∈ g ⊗ OS ((ξi )), r ∈ C, θi (ξi )∂ξi ∈ O((ξi ))∂ξi , and g ∈ G. Hence we have ρ(T [0]) ˆ γ · x−1 (cτ + d)−ρ(T [0]) xγ · (Ai (s; ξi ))L i=1 ; f (s)k γ (cτ + d) (5.54) = (Ai (s; ˇ ξi ))L ˇ kˆ , i=1 ; f (s) for Ai (s; ξi ) ∈ γ −1 (g ⊗ OS ((ξi ))), f (s) ∈ γ −1 OS because of (5.46), and

46


(cτ + d)−ρ(T [0]) xγ · µ; (θi (s; ξi ))L c i=1 ; g(s)ˆ γ L = µ ; (θi (s; ˇ ξi )∂ξi )i=1 ; g(s)ˆ ˇc

γ

ρ(T [0]) · x−1 γ (cτ + d)

(5.55)

PL for µ = µ0 (s)∂τ + i=1 µi (s)∂ξi ∈ γ −1 TS , θi (s; ξi )∂ξi ∈ γ −1 (OS ((ξi ))∂ξi ), g(s) ∈ γ −1 OS because of (5.47). These formulae prove the lemma. From Lemma 5.13 and Lemma 5.14 follow the modular invariance of CC(M) and CB(M), and the modular transformations of connections D (5.1) and D∗ (5.2). The results are summarized in the following theorem. Theorem 5.15. For each i = 1, . . . , L, let Vi be a finite-dimensional irreducible repreNL sentation of g and Mi a quotient of the Weyl module Mk (Vi ). Put M := i=1 Mi and ∼ M := M ⊗ OS . Let γ be in 0 = SL2 (Z). Then the isomorphisms (·)γ : γ −1 M → M γ −1 ∗ ∼ ∗ and (·) : γ (M ) → M defined by (5.48) and (5.49) induce the isomorphisms ∼

ρ(T [0]) v(s) 7→ x−1 v(s), ˇ γ (cτ + d)

∼

ρ 8(s) 7→ x−1 γ (cτ + d)

(·)γ : γ −1 CC(M) → CC(M), (·)γ : γ −1 CB(M) → CB(M),

∗

(T [0])

8(s), ˇ

where xγ is an element of G satisfying (5.44). Furthermore these isomorphisms correspond local flat sections with respect to the connections D and D∗ to local flat sections. Namely, denoting the subsheaf of local flat sections of CC(M) and CB(M) by CC(M)D D∗ and CB(M) respectively, we obtain the following induced isomorphisms: ∼

(·)γ : γ −1 (CC(M)D ) → CC(M)D , ∗

∼

∗

(·)γ : γ −1 (CB(M)D ) → CB(M)D . The modular transformations of the connections are represented as D µγ = µγ +

L X

ρi (T {θi (s; ˇ ξˇi )∂ξˇi }) − µ0 (s) ˇ c(cτ + d)ρ(T [0])

i=1

ˇ c(cτ + d)ρ(T [0]), = Dµ s7→sˇ − µ0 (s) Dµ∗ γ = µγ +

L X

(5.56)

ρ∗i (T {θi (s; ˇ ξˇi )∂ξˇi }) − µ0 (s) ˇ c(cτ + d)ρ∗ (T [0])

i=1

ˇ c(cτ + d)ρ∗ (T [0]), = Dµ∗ s7→sˇ − µ0 (s)

(5.57)

PL where µ = µ0 (s)∂τ + i=1 µi (s)∂zi ∈ γ −1 TS , θi = θi (s; ξi )∂ξi ∈ γ −1 (OS ((ξi ))∂ξi ), −1 D (µ; (θi )L (VirX ˇ in s, namely, ˙ ), and [ · ]s7→sˇ denotes the substitution of s i=1 ; 0) ∈ γ γ ˇ for µ ∈ γ −1 TS and A(s) ∈ γ −1 EndOS (CC(M)) or A(s) ∈ [µ + A(s)]s7→sˇ = µ + A(s) γ −1 EndOS (CB(M)). Indeed (5.56) and (5.57) follow from the explicit expressions (5.9) and (5.10) and the definition (5.47). Using Proposition 3.4, we can identify CCk (V ) and CBk (V ) with V ⊗ OS and PL V ∗ ⊗ OS respectively. Put 1 := i=1 1i . Then the theorem above implies the following corollary.


47

Corollary 5.16. Then, under the identifications CCk (V ) = V ⊗OS and CBk (V ) = V ∗ ⊗ ∼ ∼ OS , the isomorphisms (·)γ : γ −1 CCk (V ) → CCk (V ) and (·)γ : γ −1 CBk (M) → CBk (V ) are of the following forms: ∼

(·)γ : γ −1 (V ⊗ OS ) → V ⊗ OS , ∼

(·)γ : γ −1 (V ∗ ⊗ OS ) → V ∗ ⊗ OS ,

1 v(s) 7→ x−1 ˇ γ (cτ + d) v(s), −1 F (s) 7→ x−1 F (s). ˇ γ (cτ + d)

The modular transformations of the connections expressed as in Theorem 5.9 are represented as ˇ + d)1, (5.58) Dµγ = Dµ s7→sˇ − µ0 (s)c(cτ ∗ ∗ Dµγ = Dµ s7→sˇ + µ0 (s)c(cτ ˇ + d)1, (5.59) PL where µ = µ0 (s)∂τ + i=1 µi (s)∂ξi ∈ γ −1 TS and [ · ]s7→sˇ denotes the substitution of sˇ in s, namely, [µ + A(s)]s7→sˇ = µγ + A(s) ˇ for µ ∈ γ −1 TS and A(s) ∈ γ −1 EndOS (V ⊗ OS ) −1 ∗ or A(s) ∈ γ EndOS (V ⊗ OS ). Example 5.17. Applying (5.59) to µ = ∂zi and µ = ∂τ , we obtain i h ∗ ∗ D(∂/∂z = D , γ ∂/∂zi i) h i s7→sˇ ∗ ∗ D(∂/∂τ + c(cτ + d)1, )γ = D∂/∂τ s7→sˇ

which were found by Etingof [E], Sect. 4.

6. Concluding Remarks We have examined a twisted WZW model on elliptic curves which gives the XYZ Gaudin model at the critical level and Etingof’s elliptic KZ equations at the off-critical level. We make several comments on the related interesting problems to be solved. Factorization. We have studied the twisted WZW model only on a family of smooth pointed elliptic curves, but as in [TUY] we can also consider the model on a family of stable pointed elliptic curves. By extending the connections acting on the sheaves of conformal blocks to those with regular singularities at the boundary of the family, we shall be able to establish the equivalence between our geometric approach and Etingof’s approach by means of twisted traces of the products of twisted vertex operators on the Riemann sphere. Furthermore we shall be able to obtain a dimension formulae for the spaces of conformal blocks. A detailed investigation shall be given in a forthcoming paper. Generalization to higher genus. Bernard generalized the KZB equations to higher genus Riemann surfaces in [Be2]. In [Fe], Felder established the geometric interpretation of the KZB equations on Riemann surfaces by the non-twisted WZW models clarifying the notion of the dynamical r-matrices in higher genus cases. Our formulation for the twisted WZW model is also valid for arbitrary Riemann surfaces. See Appendix C for details.

48


Discretization. Felder calls his interpretation of the KZB equations in [Fe] “the first step of the ‘St. Petersburg q-deformation recipe’ in higher genus cases”. We hope that our twisted WZW model on elliptic curves can also be q-deformed. The resulting “elliptic q-KZ equations”, for example, would be related to the difference equations proposed in [Tak2]. Intertwining vectors. Note that the Boltzmann weights of the A(1) N −1 face model can be expressed by the elliptic quantum R-matrix and the intertwining vectors ([Ba, JMO, DJKMO]). Therefore it can be expected that there exists a quasi-classical limit of the intertwining vectors by means of which the relation of the non-twisted and twisted WZW models on elliptic curves will be clarified. In addition, the intertwining vectors play an important role in constructing Bethe vectors of the XYZ spin chain models ([Ba, Tak1]) and the integral solution of the difference equations in [Tak2]. They are introduced as a kind of technical tool there, but our approach from the twisted WZW model, i.e., from the classical limit should reveal their algebro-geometric meaning.

Appendix A. Theta functions with characteristics Here we collect properties of theta functions of one variable used in this paper. Following [M], we use the notation: X 2 0 eπi(n+κ) τ +2πi(n+κ)(t+κ ) θκ,κ0 (t; τ ) =

(A.1)

n∈Z

for the theta functions with characteristics. Here t is a complex number, τ belongs to the upper half plane H and κ, κ0 are rational numbers. They are related with each other by shifts of t: 0

0

θκ1 +κ2 ,κ01 +κ02 (t; τ ) = eπiκ2 τ +2πiκ2 (t+κ1 +κ2 ) θκ1 ,κ01 (t + κ2 τ + κ02 ; τ ). 2

(A.2)

Since the zero set of θ00 (t; τ ) is {1/2 + τ /2} + Z + Zτ , the zero set of θκ,κ0 (t; τ ) is 1 1 − κ0 + − κ τ + Z + Zτ, (A.3) 2 2 because of (A.2). Fundamental properties of the function θκ,κ0 are the quasi-periodicity with respect to t: θκ,κ0 (t + 1; τ ) = e2πiκ θκ,κ0 (t; τ ), 0

θκ,κ0 (t + τ ; τ ) = e−πiτ −2πi(t+κ ) θκ,κ0 (t; τ ),

(A.4)

and the automorphic property: θκ,κ0 (t; τ + 1) = e−πiκ(κ+1) θκ,κ+κ0 + 1 (t; τ ), 2 1 t 1/2 2πiκκ0 πit2 /τ ;− = (−iτ ) e e θκ0 ,−κ (t; τ ). θκ,κ0 τ τ The formulae

(A.5)


49

θ−κ,−κ0 (t; τ ) = θκ,κ0 (−t; τ ), θκ+m,κ0 +n (t; τ ) = e

2πiκn

(A.6)

θκ,κ0 (t; τ ),

(A.7)

are easily deduced from the definition (A.1), where m and n are integers. We use mostly the following special characteristics. Let N = 2 be a natural number and a, b arbitrary integers. We denote θ[a,b] (t; τ ) := θ a − 1 ,− b + 1 (t; τ ). N

The standard abbreviations, 0 θ[a,b]

θ[a,b] := θ[a,b] (0; τ ),

N

2

(A.8)

2

d θ[a,b] (t; τ ) , := dt t=0

00 etc. likewise defined are also used. and θ[a,b]

Lemma A.1.

000 1 N 2 − 1 θ[0,0] − 0 6 θ[0,0] 2

X (a,b)6=(0,0)

00 θ[a,b] = 0, θ[a,b]

(A.9)

where the indices in the second term run through a = 0, . . . , N − 1, b = 0, . . . , N − 1 and (a, b) 6= (0, 0). This is a generalization of the well-known formula 000 θ1/2,1/2 0 θ1/2,1/2

=

00 θ1/2,0

θ1/2,0

+

00 00 θ0,1/2 θ0,0 + , θ0,0 θ0,1/2

(A.10)

which is the case N = 2 of Lemma A.1. Proof. It is easy to show that N

−1 N −1 N Y Y a=0

 θ[a,b] (t; τ ) = 

b=0

Y

 θ[a,b]  θ[0,0] (N t; τ ).

(A.11)

(a,b)6=(0,0)

In fact we have only to compare the periodicity and zeros of both sides by using (A.4) and (A.3). The over-all coefficient can be determined by the first term (namely the coefficient of t) in the Taylor expansion around t = 0. The coefficients of t2 of the Taylor expansion of (A.11) give X (a,b)6=(0,0)

0 θ[a,b] = 0, θ[a,b]

and using this equality, we can rewrite the terms of order t3 in (A.11) as follows: 000 1 N 2 − 1 θ[0,0] − 0 6 θ[0,0] 2

X (a,b)6=(0,0)

00 θ[a,b] =− θ[a,b]

X (a,b)6=(0,0)

0 θ[a,b] θ[a,b]

2 .

(A.12)

Therefore, in order to prove Lemma A.1, we have to show that the right-hand side of (A.12) is zero. Let us denote it by f (τ ) as a function of τ . It has the following properties:

50


– f (τ ) is a holomorphic function on the upper half plane H. ((A.3)) – f (τ ) is bounded when Im τ → +∞. ((A.1)) – f (τ + 1) = f (τ ), f (−1/τ ) = τ 2 f (τ ). ((A.5), (A.7)) Hence f (τ ) is an integral modular form of weight 2 and level 1, which is nothing but zero. (See, for example, Théorèm 4 (i), Sect. 3 Chapitre VII of [Se1], Proposition 2.26 of [Sh] or Theorem 14 in Chapter II of [Sc].) This proves the lemma.

Appendix B.

The Kodaira-Spencer map of a family of Riemann surfaces

Let π : X → S be a family of compact Riemann surfaces over a complex manifold S and qi : S → X a holomorphic section of π for each i = 1, . . . , L. Put Qi := qi (S) and SL assume that Qi ∩ Qj = ∅ if i 6= j. Then D := i=1 Qi is a divisor of X e´ tale over S. We call (π : X → S; q1 , . . . , qL ) a family of pointed compact Riemann surfaces. Denote by TX (− log D) the sheaf of vector fields tangent to D. As in Sect. 4.1, let TX,π (− log D) be the inverse image of π −1 TS ⊂ π ∗ TS in TX (− log D). Then we obtain the following short exact sequence: 0 → TX/S (−D) → TX,π (− log D) → π −1 TS → 0, which is a Lie algebra extension. The derived direct image of this sequence produces the following long exact sequence: · · · → π∗ TX,π (− log D) → TS → R1 π∗ TX/S (−D) → R1 π∗ TX,π (− log D) → · · · . The connecting homomorphism TS → R1 π∗ TX/S (−D) is called the Kodaira-Spencer map of the family (π : X → S; q1 , . . . , qL ). For an OX -module F and a closed analytic subspace Z of X, denote by FZ∧ the completion of F at Z. Consider the following exact sequences: ∧ ∗ ∧ 0 → (TX/S (−D))∧ D → (TX (− log D))D → (π TS )D → 0, ∧ ∗ ∧ 0 → (TX/S (∗D))∧ D → (TX (∗D))D → (π TS )D → 0.

∧ ∧ As above, the inverse images of π −1 TS |D ⊂ (π ∗ TS )∧ D in (TX (− log D))D and (TX (∗D))D is denoted by Tπ (− log D) and Tπ (∗D) respectively. Then we obtain the Lie algebra extensions below: −1 TS |D → 0, 0 → (TX/S (−D))∧ D → Tπ (− log D) → π −1 0 → (TX/S (∗D))∧ TS |D → 0. D → Tπ (∗D) → π

The direct images of these exact sequences to S are also exact. Then we have the following commutative diagram:


0   y 0 −→

−D π∗ TX/S   y

0   y −→

TX∗D ⊕ T −D ˙   y

51

0   y

(])   y

−D −→ T ∗D −→ R1 π∗ TX/S −→ 0,   y

− log D − log D ∗D −→ Tπ, −→ Tπ∗D 0 −→ π∗ TX,π ˙ ⊕ Tπ X       y y y

0 −→

TS   y

−→

(])

TS ⊕ (TS )L   y 0

(B.13)

−→ (TS )L   y 0

where we put −D := TX/S (−D), TX/S

− log D TX,π := TX,π (− log D),

TX∗D := π∗ TX/S (∗D), ˙

T −D := π∗ (TX/S (−D))∧ D,

T ∗D := π∗ (TX/S (∗D))∧ D,

Tπ− log D := π∗ Tπ (− log D),

Tπ∗ := π∗ Tπ (∗D).

∗D Tπ, ˙ := π∗ TX,π (∗D), X

The horizontal and vertical sequences in the diagram (B.13) are all exact and the arrow −D from TS to R1 π∗ TX/S = R1 π∗ TX/S (−D) through (]) is nothing but the Kodaira-Spencer map, which is described as follows. For µ ∈ TS , chasing the diagram above, we can choose (aX˙ , a+ ), α and [α] in order: − log D ∗D , whose image in TS ⊕ (TS )L is equal to (µ; (µ)L 1. (aX˙ , a+ ) ∈ Tπ, ˙ ⊕ Tπ i=1 ) ∈ X L TS ⊕ (TS ) ; 2. α ∈ T ∗D , whose image in Tπ∗D is equal to aX˙ − a+ ∈ Tπ∗D ; −D −D , which is the image of α in R1 π∗ TX/S . 3. [α] ∈ R1 π∗ TX/S

Then the cohomology class [α] ∈ R1 π∗ TX/S (−D) does not depend on the choice of (aX˙ , a+ ) and α. The Kodaira-Spencer map sends µ ∈ TS to [α] ∈ R1 π∗ TX/S (−D). ∗D D D and Tπ, Recall that TX∗D ˙ are denoted by TX ˙ ˙ and VirX ˙ respectively in Sect. 5. The short X exact sequence (4.3) is included in the second vertical exact sequence in (B.13) and the D lifting from TS to VirX ˙ (cf. Sect. 5.1) essentially corresponds to the operation 1 above. Hence we can see that the constructions in Sect. 5 originate in the above description of the Kodaira-Spencer map. S by OS cˆ. This is However we considered not only Tπ∗D but also its extension VirD a difference between the description of the Kodaira-Spencer map and the constructions in Sect. 5. We remark that Beilinson and Schechtman give the intrinsic (i.e., coordinatefree) description of the Virasoro algebras in [BS]. Appendix C. On a formulation for higher genus Riemann surfaces In this appendix, we shall comment on a formulation of twisted Wess-Zumino-Witten models on higher genus Riemann surfaces.

52


P Let π : X → S, qi : S → X, Qi = qi (S), and D = Qi be the same as Appendix B. Suppose that, for each i, we can take a holomorphic function ξi on an open neighborhood Ui of Qi with the property that the mapping Ui → S ×ξi (Ui ) given by x 7→ (π(x), ξi (x)) is biholomorphic and ξi (Qi ) = {0}. Then, in precisely the same way as Sect. 5, we can define TX,π (∗D), TX/S (∗D), D D VirSD , TX˙D , VirX ˙ , etc. We define the action of π∗ TX,π (∗D) on VirS by aX˙ · α := [aX˙ , α]

D D for aX˙ ∈ VirX ˙ , α ∈ VirS ,

(C.14)

D D where VirX ˙ is identified with its image in VirS and the bracket of the right-hand side D is the Lie algebra structure of VirS . D D D We remark that the embedding VirX ˙ and that of TX ˙ into VirS are not always Lie D D algebra homomorphisms and the action of VirX˙ on VirS does not always preserve TX˙D . Thus we must add a supplementary structure on the Riemann surface. Suppose that we can take an open covering {Uλ }λ∈3 of X and a family {ξλ : Uλ → C}λ∈3 of holomorphic functions satisfying the following properties:

1. For each λ ∈ 3, the mapping Uλ → S × C given by x 7→ (π(x), ξλ (x)) is a biholomorphic mapping from Uλ onto an open subset of S × C. 2. For any λ, λ0 ∈ 3 with Uλ ∩Uλ0 6= ∅, there exists a, b, c, d ∈ OS (S) with the property that ξλ0 = (aξλ + b)/(cξλ + d) on Uλ ∩ Uλ0 . We call {ξλ }λ∈3 a projective structure on the family π : X → S of Riemann surfaces. Moreover assume that {ξλ }λ∈3 ∪ {ξi }L i=1 is also a projective structure on the family. D D D Lemma C.2. Under the assumption above, the action of VirX ˙ on VirS preserves TX ˙ and in particular the embedding TX˙D ,→ VirSD is a Lie algebra homomorphism. D D Proof. It suffices to show that cV (θ, η) = 0 for (µ; θ; 0) ∈ VirX ˙ and (0; ν; 0) ∈ TX ˙ . For this purpose, as in the proof of Lemma 4.5, it is enough to show that, for 1 D D (µ; θ; 0) ∈ VirX ˙ and (0; ν; 0) ∈ TX ˙ , we can take ω ∈ π∗ X/S (∗D) with the property 000 that ω = θ (ξi )η(ξi ) dξi near Qi . To do this, we calculate the transformation property of θ000 (ξλ )η(ξλ ) dξλ under coordinate changes. Take any ξ, ζ from {ξλ }λ∈3 ∪ {ξi }L i=1 . Then there is a, b, c, d ∈ OS (S) with ζ = (aξ +b)/(cξ +d). Fix a local coordinate s = (si )M i=1 on S. Then we obtain two local coordinates (s; ξ) and (s; ζ) on X. Let aX˙ be in π∗ TX,π (∗D) and α in π∗ TX/S (∗D). Then we can represent aX˙ and α in the two local coordinates (s; ξ) and (s; ζ): ξ µ + θξ (s; ξ)∂ξ in (s; ξ), η (s; ξ)∂ξ in (s; ξ), α= aX˙ = µ + θζ (s; ζ)∂ζ in (s; ζ), η ζ (s; ζ)∂ζ in (s; ζ),

where µ =

PM i=1

µi (s)∂si . Then a straightforward calculation shows that ∂ 3 θζ (s; ζ) ζ ∂ 3 θξ (s; ξ) ξ η (s; ξ) dξ = η (s; ζ) dζ. ∂ξ 3 ∂ζ 3

Hence there is a unique ω ∈ π∗ 1X/S (∗D) such that the representation of ω under the coordinate (s; ξ) is equal to (∂ξ3 θξ (s; ξ))η ξ (s; ξ) dξ for any ξ ∈ {ξλ }λ∈3 ∪ {ξi }L i=1 . Thus we have completed the proof.


53

Example C.3. Assume that π : X → S denotes the family of elliptic curves defined in Sect. 3.1. Then the local coordinate t along the fibers gives a projective structure on the family. For each holomorphic section q : S → X of π : X S, put ξq := t − q, which is regarded as a holomorphic function on a sufficiently small open neighborhood of q(S). Then the family {ξq } is a projective structure on the family and contains {ξi }L i=1 = {ξqi }L i=1 . Example C.4. A family of compact Riemann surfaces given by the Schottky parametrization possess a natural projective structure, because each compact Riemann surface in that family is represented as the quotient space of the punctured Riemann sphere by the action of a Schottky group, which is generated by a certain finite set consisting of fractional linear transformations. The Schottky parametrization is used in [Be2]. We can generalize the statement of Lemma 4.7 for the family of Riemann surfaces with projective structures. Let gtw X denote an OX -Lie algebra which is locally OX -free of finite rank with holomorphic flat connection ∇ and suppose that the action of a vector field on gtw X via the connection is a Lie algebra derivation: ∇[A, B] = [∇A, B] + [A, ∇B]

for A, B ∈ gtw X.

Assume that the fibers of gtw X are (non-canonically) isomorphic to a simple Lie algebra g over C. The OX -inner product (.|.) is defined by (A|B) :=

1 tr (ad A ad B) 2h∨ gtwX

for A, B ∈ gtw X,

where h∨ denotes the dual Coxeter number of g. Then the inner product is non-degenerate ∼ and invariant under the translations along ∇. We can take a local trivialization gtw X = g ⊗ OX , under which the connection is represented as the exterior derivative on X (i.e., the trivial connection). We assume that we can take such a trivialization of gtw X on some neighborhood of the divisor D. Under this situation, the constructions in Sect. 3 go through in precisely the same way and then Lemma 4.7 also holds. D However, Lemma 4.10 does not always hold. The action of VirX ˙ on M is not a representation but a projective representation in general, because the embedding D D VirX ˙ ,→ VirS is not always a Lie algebra homomorphism but so is the composition of the embedding and the natural projection VirSD VirSD /Ocˆ. Nevertheless Lemma 4.11 D and Lemma 4.12 also hold in our case. Namely, the Lie algebra VirX ˙ acts on both the sheaf CC(M) of conformal coinvariants and the sheaf CB(M) of conformal blocks. Furthermore Lemma 4.13 can be proved in the same way. Therefore we conclude that CB(M) possess a projectively flat connection. For the non-twisted Wess-Zumino-Witten models, the coordinate-free version of Lemma 4.18 for θ ∈ TX˙D is used in the proof of the main theorem 4.2 in [Ts]. Since the analogue of Lemma 4.18 for θ ∈ TX˙D can be also proved in our situation, we can obtain the projectively flat connections on CC(M). We can generalize the setting above in various ways: 1. We can replace the family of pointed compact Riemann surfaces by that of stable pointed curves in the course of [TUY, U], and [Ts]. Then we shall be able to show the factorization property of conformal blocks under appropriate assumptions. 2. We can consider not only deformations of pointed Riemann surfaces but also deformations of Gtw X -torsors (or principal bundles). For example, the KnizhnikZamolodchikov-Bernard equations on Riemann surfaces (cf. [Be1, Be2, FW, Fe]) can be formulated on a family of pairs of compact Riemann surfaces and principal G-bundles on them, where G is a finite-dimensional simple algebraic group over C.

54


3. Furthermore we can also consider Borel subgroup bundles which are subbundles of the restriction of Gtw X on D and their deformations. (More generally, we can consider a family of quasi parabolic structures on Gtw -torsors.) Then we can define the notion of highest weight representations of the sheaf of affine Lie algebras with respect to the Borel subgroup bundles. Anyway a choice of a Borel subgroup is required by the definition of the category O of representations, which contains the Verma modules, their irreducible quotients, and especially the Wakimoto modules. Note that the constructions of the Wakimoto modules (cf. [FF1, FF2], and [K]) essentially depend on the choice of a triangular decomposition (equivalently that of a Borel subalgebra) of a finite-dimensional semi-simple Lie algebra over C. 4. We can replace the holomorphic flat connection on gtw X by a meromorphic flat connection with regular singularity along the divisor D. Assume that the local monodromy group of the connection around D is finite. Then we can construct a sheaf of twisted affine Lie algebras at D and can define the notion of conformal blocks for representations of the twisted affine Lie algebras. Detailed expositions shall be given in forthcoming papers. Acknowledgement. TT is supported by a Postdoctoral Fellowship for Research abroad of the Japan Society for the Promotion of Science. He expresses his gratitude to Benjamin Enriquez, Giovanni Felder, Edward Frenkel, Ian Grojnowski, Takeshi Ikeda, Feodor Malikov, Hirosi Ooguri, Nicolai Reshetikhin for comments and discussions. The authors also express their gratitude to Yoshifumi Tsuchimoto, who explained to them a detail of the proof of the main theorem in [Ts]. Parts of this work were done while T.T. was visiting the Department of Mathematics of the University of California at Berkeley, the Department of Mathematics of Kyoto University, the Erwin Schrödinger Institute for Mathematical Physics, the Landau Institute for Theoretical Physics and Centre Emile Borel - Institut Henri Poincaré - UMS 839, CNRS/UPMC. He thanks these institutes for hospitality.

References [Ba]

Baxter, R. J.: Eight-Vertex Model in Lattice Statistics and One-Dimensional Anisotropic Heisenberg Chain I, II, III. Ann. Phys. 76, 1–24, 25–47, 48–71 (1973) [Bei] Beilinson, A. A.: Affine algebras at the critical level and Langlands duality (after V.G. Drinfeld). Manuscript, 1991 [BeiD] Beilinson, A. A., Drinfeld, V. G.: Quantization of Hitchin’s fibration and Langlands program. Preprint, 1994 [BelD] Belavin, A.A., Drinfeld, V.G.: Solutions of the classical Yang-Baxter equations for simple Lie algebras. Funkts. Anal. i ego Prilozh. 16-3, 1–29 (1982) (in Russian); Funct. Anal. Appl. 16, 159–180 (1982)(English transl.) [Be1] Bernard, D.: On the Wess-Zumino-Witten models on the torus. Nucl. Phys. B303, 77–93 (1988) [Be2] Bernard, D.: On the Wess-Zumino-Witten models on Riemann surfaces. Nucl. Phys. B309, 145–174 (1988) [Bj] Björk, J.-E.: Analytic D-modules and applications. Mathematics and Its Applications 247, Dordrecht: Kluwer Academic Publishers, 1993 [BEGHKM] Borel, A., Ehlers, F., Grivel,P.-P., Haefliger, A., Kaup, B., Malgrange,B.: Algebraic D-modules. Perspectives in Mathematics 2, Boston: Academic Press, 1987 [Bo] Borel, A.: Automorphic L-functions. Proc. Symp. Pure Math. 33 part 2, 27–61 (1979) [BS] Beilinson, A.A., Schechtman, V.V.: Determinant bundles and Virasoro algebras. Commun. Math. Phys. 118, 651–701 (1988) [C] Cherednik, I.V.: Definition of τ functions for generalized affine Lie algebras. Funkts. Anal. i ego Prilozh. 17-3, 93–95 (1983) (in Russian); Funct. Anal. Appl. 17, 243–245 (1983) (English transl.)


[DJKMO]

[E] [ER] [Fe] [FF1]

[FF2] [FF3] [FFR] [Fr1] [Fr2]

[FW] [Ga1] [Ga2] [Ga3] [Ge] [GR] [Gu] [GW] [Har] [Hay] [Hi] [Ho] [I] [JMO] [K] [KL] [KZ] [M] [N] [Sc]

55

Date, E., Jimbo, M., Kuniba, A., Miwa, T., Okado, M.: Exactly Solvable SOS Models I. Nucl. Phys. B290 [FS20], 231–273 (1987); ditto II. In: Conformal field theory and solvable lattice models (Kyoto, 1986), Adv. Stud. Pure Math. 16, 17–122 (1988) Etingof, P.I.: Representations of affine Lie algebras, elliptic r-matrix systems, and special functions. Comm. Math. Phys. 159, 471–502 (1994) Enriquez, B., Rubtsov, V.N.: Hitchin systems, higher Gaudin operators and r-matrices. Math. Res. Lett. 3, 343–357 (1996) Felder, G.: The KZB equations on Riemann surfaces. hep-th/9609153, (1996) Feigin, B., Frenkel, E.: Representation of affine Kac-Moody algebras, bosonization and resolutions. In: it Physics and Mathematics of Strings, Memorial volume for Vadim Knizhnik Brink, L., Friedan, D., Polyakov, A. M. eds., Singapore, New Jersey, London, Hong Kong: World Scientific, 1990, pp. 271–316 Feigin, B., Frenkel, E.: Affine Kac-Moody algebras and semi-infinite flag manifolds. Commun. Math. Phys. 128, 161–189 (1990) Feigin, B., Frenkel, E.: Affine Kac-Moody algebras at the critical level and Gel’fand-Diki˘ı algebras. Int. J. Mod. Phys. A7, Suppl. 1A, 197–215 (1992) Feigin, B., Frenkel, E., Reshetikhin, N.: Gaudin model, Bethe Ansatz and critical level. Commun. Math. Phys. 166, 27–62 (1994) Frenkel, E.: Affine Kac-Moody algebras at the critical level and quantum Drinfeld-Sokolov reduction. Ph. D Thesis, Harvard University, (1991) Frenkel, E.: Affine Algebras, Langlands Duality and Bethe Ansatz. In: Proceedings of XIth International Congress of Mathematical Physics Unesco-Sorbonne-Paris July 18-23, 1994, D. Iagolnitzer ed., Cambridge: International Press, 1995, pp. 606–642 Felder, G., Wieczerkowski, C.: Conformal blocks on elliptic curves and the Knizhnik– Zamolodchikov–Bernard equations. Commun. Math. Phys. 176, 133–162 (1996) Gaudin, M.: Modèles exacts en mécanique statistique: La méthode de Bethe et ses généralizations. Note CEA 1559 (1), (1972) and Note CEA 1559 (2), (1973) Gaudin, M.: Diagonalisation d’une classe d’hamiltoniens de spin. J. de Physique 37, 1087–1098 (1976) Gaudin, M.: La Fonction d’Onde de Bethe. Paris: Masson, 1983 (in French); Moscow: Mir, 1987 (Russian transl.) Gelbart, S.: An elementary introduction to the Langlands program. Bull. Amer. Math. Soc. 10, 177–219 (1984) Grauert, H., Remmert, R.: it Coherent analytic sheaves. Grundlehren der mathematischen Wissenschaften 265, Berlin-New York: Springer-Verlag, 1984 Gunning, R.C.: Lectures on Riemann surfaces. Princeton: Princeton University Press, 1966 Goodman, R., Wallach, N.R.: Higher-order Sugawara operators for affine Lie algebras. Trans. A. M. S. 315-1, 1–55 (1989) Hartshorne, R.: Algebraic geometry. Graduate Texts in Mathematics 52, New York-Heidelberg: Springer-Verlag, 1977 Hayashi, T.: Sugawara operators and Kac-Kazhdan conjecture. Invent. math. 94, 13–52 (1988) Hitchin, N.J.: Stable bundles and integrable systems. Duke Math. J. 54, 91–114 (1987) Hotta, R.: Introduction to D-modules. I. M. Sc. Lecture Notes, Mathematics (1987) Iwasawa, K.: Algebraic functions. Tokyo: Iwanami, 1952 (in Japanese); AMS, Transl. math. monographs 118, (1993) Jimbo, M., Miwa, T., Okado, M.: Solvable Lattice Models whose States are Dominant Integral Weights of A(1) n−1 . Lett. Math. Phys. 14, 123–131 (1987) Kuroki, G.: Fock space representations of affine Lie algebras and integral representations in the Wess-Zumino-Witten models. Commun. Math. Phys. 142, 511–542 (1991) Kazhdan, D, Lusztig, G.: Tensor structures arising from affine Lie algebras I. J. Amer. Math. Soc. 6, 905–947 (1993) Knizhnik, V.G., Zamolodchikov, A.B.: Current algebra and Wess-Zumino model in two dimensions. Nucl. Phys. B247, 83–103 (1984) Mumford, D., Tata Lectures on Theta I. Basel-Boston: Birkhäuser, 1982 Nekrasov, N.: Holomorphic bundles and many-body systems. Commun. Math. Phys. 180, 587603 (1996) Schoeneberg, B.: Elliptic modular functions: an introduction. Grundlehren der mathematischen Wissenschaften 203, New York-Heidelberg: Springer-Verlag, 1974

56

[Se1]

[Se2]

[Sh]

[Sk] [ST1] [ST2] [Tak1]

[Tak2] [Tat] [TK]

[Ts] [TUY]

[U]


Serre,J.-P.: Cours d’arithmétique. Paris: Press Universitaires de France, 1970 (in French); A course in arithmetic. Graduate Texts in Mathematics 7, New York-Heidelberg: Springer-Verlag, 1973 (English transl.) Serre, J.-P.: Groupes algébriques et corps de classes. 2nd ed. Paris: Hermann, 1975 (in French); Algebraic groups and class fields. Graduate Texts in Mathematics 117, New York-Berlin: Springer-Verlag, 1988 (English transl.). Shimura,G.: Introduction to the arithmetic theory of automorphic functions. Publication of the Mathematical Society of Japan 11, Tokyo: Iwanami and Princeton: Princeton University Press, 1971 Sklyanin, E. K.: Separation of variables in the Gaudin model. Zap. Nauch. Sem. LOMI 164, 151–169 (1987) (in Russian); J. Sov. Math. 47, 2473–2488 (1989) (English transl.) Sklyanin,E.K., Takebe, T.: Algebraic Bethe Ansatz for XYZ Gaudin model. Phys. Lett. A 219, 217–225 (1996) Sklyanin, E. K., Takebe, T.: In preparation Takebe,T.: Generalized Bethe Ansatz with the general spin representations of the Sklyanin algebra. J. Phys. A 25, 1071–1084 (1992); Bethe ansatz for higher spin eight-vertex models. J. Phys. A 28, 6675–6706 (1995); Corrigendum. ibid. 29, 1563–1566 (1996); Bethe Ansatz for Higher Spin XYZ Models, – Low-lying Excitations –. J. Phys. A 29, 6961–6966 Takebe, T.: A system of difference equations with elliptic coefficients and Bethe vectors. q-alg/9604002, to appear in Commun. Math. Phys. ´ Norm. Sup., 4e série, 1, 149–159 Tate,J.: Residue of differentials on curves. Ann. Scient. Ec. (1968) Tsuchiya, A., Kanie, Y.: Vertex operators in conformal field theory on P1 and monodromy representations of braid group. In: Conformal field theory and solvable lattice models (Kyoto, 1986), Adv. Stud. Pure Math. 16, 297–372 (1988); Errata. In: Integrable systems in quantum field theory and statistical mechanics Adv. Stud. Pure Math., 19, 675–682 (1989) Tsuchimoto, Y.: On the coordinate-free description of the conformal blocks. J. Math. Kyoto Univ. 33-1, 29–49 (1993) Tsuchiya, A., Ueno, K., Yamada, Y.: Conformal field theory on universal family of stable curves with gauge symmetries. In: Integrable systems in quantum field theory and statistical mechanics, Adv. Stud. Pure Math. 19, 459–566 (1989) Ueno, K.: On conformal field theory. In: Vector Bundles in Algebraic Geometry, Durham 1993, ed. by N. J. Hitchin, P. E. Newstead and W. M. Oxbury, London Mathematical Society Lecture Note Series 208, 283–345 (1995)

Communicated by G. Felder

Commun. Math. Phys. 190, 57 – 111 (1997)

Communications in


Affine Orbifolds and Rational Conformal Field Theory Extensions of W1+∞ ? Victor G. Kac, Ivan T. Todorov?? International Erwin Schrödinger Institute for Mathematical Physics and Department of Mathematics, MIT, Cambridge, MA 02139, USA. E-mail: [email protected], [email protected] Received: 5 December 1996 / Accepted: 1 April 1997

Abstract: Chiral orbifold models are defined as gauge field theories with a finite gauge group 0. We start with a conformal current algebra A associated with a connected compact Lie group G and a negative definite integral invariant bilinear form on its Lie algebra. Any finite group 0 of inner automorphisms or A (in particular, any finite subgroup of G) gives rise to a gauge theory with a chiral subalgebra A0 ⊂ A of local observables invariant under 0. A set of positive energy A0 modules is constructed whose characters span, under some assumptions on 0, a finite dimensional unitary representation of SL(2, Z). We compute their asymptotic dimensions (thus singling out the nontrivial orbifold modules) and find explicit formulae for the modular transformations and hence, for the fusion rules. As an application we construct a family of rational conformal field theory (RCFT) extensions of W1+∞ that appear to provide a bridge between two approaches to the quantum Hall effect. Contents 0 1 1.1 1.2 1.3 2 2.1 2.2 3

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Chiral Algebras Associated with Connected Compact Lie Groups . . . . . Definition of a chiral algebra. Current algebras . . . . . . . . . . . . . . . . . . . . Lattice vertex algebras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Current chiral algebras associated to simple Lie algebras . . . . . . . . . . . . Twisted Modules of a Current Chiral Algebra . . . . . . . . . . . . . . . . . . . . . Positive energy irreducible A(G)-modules . . . . . . . . . . . . . . . . . . . . . . . ZN -twisted current chiral algebra modules . . . . . . . . . . . . . . . . . . . . . . . Twisted Characters and Modular Transformations . . . . . . . . . . . . . . . . .

58 60 60 62 64 66 66 66 68

? Supported by the Federal Ministry of Science and Research, Austria, NSF grants DMS-9103792 and DMS-9622870 and the Bulgarian NFSR under contract F-404. ?? On leave of absence from the Institute for Nuclear Research and Nuclear Energy, Bulgarian Academy of Sciences, Tsarigradsko Chaussée 72, BG-1784, Sofia, Bulgaria

58

Victor G. Kac, Ivan T. Todorov

3.1 Kac-Moody and lattice characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 3.2 Modular transformations of twisted characters . . . . . . . . . . . . . . . . . . . . 70 3.3 Small τ asymptotics of twisted characters of A(G) . . . . . . . . . . . . . . . . . 72 4 Affine Orbifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 4.1 Projection on a centralizer’s irreducible representation. Asymptotic dimension . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 4.2 Affine orbifold models for non-exceptional 0. Action of Z. Modular transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76 4.3 Fusion rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 5 U (l) orbifolds as RCFT extensions of W1+∞ . . . . . . . . . . . . . . . . . . . . . 85 6 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 6.1 Lattice current algebras for c = 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 6.2 SU(2) orbifolds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 6.3 A Level 1 SU(3) Orbifold. Charge Conjugation Associated with a NonAbelian Centralizer . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 Appendix A. Action of the Center of a Simply Connected Simple Lie Group on the Coroots and Fundamental Weights . . . . . . . . . . . . . . . . . . . . . . . . 105 A.1 Simply laced algebras (αi∨ = αi , a∨ i = ai ) . . . . . . . . . . . . . . . . . . . . . . . 105 A.2 Z2 action on Bl and Cl . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 Appendix B. Exceptional Elements of a Compact Lie Group . . . . . . . . . . . . 108 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 0. Introduction Given a chiral conformal field theory (CFT) – i.e., a chiral algebra A and a family of positive energy A-modules (closed under “fusion”) – there are two ways of constructing other CFT with the same stress-energy tensor T (z) and associated central charge c. First, one can, in some cases, extend A by adjoining to it local primary fields. The stress energy tensor generates an RCFT for the minimal models [BPZ] corresponding to central charge c < 1. For c ≥ 1 one needs in addition a chiral current algebra or a W -algebra to construct an RCFT (for special rational values of c). All RCFT extensions of the (c = 1)u(1)-current algebra have been classified in [BMT]; all local extensions of the su(2) current algebras have been described in [MST]. The second path goes in the opposite direction: one restricts A to a distinguished subalgebra of “observables” including T (z); we shall be concerned here with the case in which the subalgebra A0 consists of all elements of A invariant under a finite automorphism group 0. The resulting CFT is called a 0-orbifold. Examples of orbifolds (first in the context of a “Gaussian model” [G,H]) have been studied in detail in [DV3 ] where some general properties of arbitrary orbifold models have also been pointed out. Non unitary models of c = 1 have been considered in [F]. The present paper provides a systematic approach to orbifold RCFT. Our starting point is a chiral algebra A = A(G) associated with a connected compact Lie group G whose Lie algebra g is equipped with a negative definite integral invariant bilinear form. It appears as a tensor product of a lattice chiral algebra A(L) and (chiral) affine Kac-Moody algebras (corresponding to the simple components gj of g): A(G) = A(L) ⊗ (⊗sj=1 Akj (gj )),

(0.1)

where kj (∈ Z+ ) is the level of the vacuum gˆ j - module. The lattice L consists of all vectors ω in the direct sum g0 of u(1)-components of the centre of g such that e2πiω = 1.

Affine Orbifolds and Rational CFT of W1+∞

59

To each ω of length square 2 we can associate a “charge shift” operator E ω providing a non-abelian extension of the Lie algebra g0 . Let Gc be the corresponding maximal compact group extension of G. Its significance stems from the fact that each finite order inner automorphism of A(G) is given by (the adjoint action of) an element of Gc . An orbifold chiral algebra is the fixed point set A0 of a finite group of automorphisms 0 of a chiral algebra A. For any “non-exceptional” finite subgroup 0 ⊂ Gc we construct a finite family of A0 -modules V , which is complete in the sense that their characters transform among themselves under the modular group SL(2, Z). Each V is labeled by a weight 3 (characterizing an A(G)-module), a conjugacy class b¯ ⊂ 0, and an irreducible representation σ = σ b of the centralizer 0b of an element b ∈ b¯ in 0. It involves a choice of “phases” β(b) ∈ g, for non-exceptional conjugacy classes, satisfying the following two conditions: (i) b = e2πiβ(b) ,

¯ g∈0. (ii) β(gbg −1 ) = Adg β(b) for b ∈ b,

(0.2)

Condition (ii) implies that the centralizer of b should stabilize β. Two β’s satisfying (0.2) differ by a coroot m which is also stabilized by 0b . Any such m gives rise to a 1-dimensional representation σm of 0b . The change β → β + m can be compensated by a change in the representation σ: β+m β = V3, V3, ∗ . ¯ ¯ b,σ b,σ⊗σ m

(0.3)

Thus the family of A0 -modules is independent of the choice of β (allowing us to skip the superscript β on V ). Knowing the character χ3 of an A(G)-module V3 [K1] we are able to calculate of V3,b,σ the A0 - characters χ3,b,σ ¯ ¯ . Similarly, the modular transformation properties and hence the orbifold fusion rules. We point of χ3 [KP2] determine those of χ3,b,σ ¯ out that the group factors of fusion coefficients Nb¯ 1 σ1 ,b¯ 2 σ2 ,b¯ 3 σ3 (σi ∈ 0ˆ bi ) of an affine orbifold differ from those of the associated Grothendieck ring (see [Lus] as well as the discussion in Sect. 4 of [DV3 ]) due to multipliers µ(h|Σβi ) which define (for b1 b2 b3 = 1) 1-dimensional representations of the intersection 0b1 ∩ 0b2 (3 h). This difference shows up already for (finite) subgroups of SU(2). For higher rank G it may yield a change of charge conjugation, as displayed in the example of a 1080 element subgroup of SU(3) which admits a conjugacy class of involutive elements with a non-abelian centralizer. We compute (in Sect. 4.1) the asymptotic dimensions of orbifold characters singling out, in particular, the non-trivial orbifold modules. If G is a simple simply-connected Lie group then the non-trivial elements of Z = Z(G) ∩ 0 ,

(0.4)

where Z(G) is the center of G, are exceptional – they cannot be written in the form (0.2) (with β satisfying (ii)). Each element of Z (different from the group unit) is associated with a fundamental weight 3j satisfying (3j |θ) = 1 where θ is the highest root. We associate with it (in Sect. 4) a permutation of the orbifold modules which maps, in ˜ j and thus cannot be viewed as an ˜ 0 into 3 particular, the (affine) vacuum weight 3 automorphism (“gauge transformation”) of the (vacuum) chiral algebra. Knowing the action of e2πi3j on {V3bσ ¯ } we can extend our treatment to all exceptional elements of a 0 ⊂ SU(n). The treatment of Ad-exceptional elements (described in Appendix A), which are encountered in other simple Lie groups, remains however, outside the scope of the present paper.

60


Note an essential difference between coset models and orbifold models. For the construction of a modular invariant family of characters of coset modules it suffices to take characters of isotypic components of all (untwisted) modules of the chiral current algebra with respect to its chiral current subalgebra [KP0, KP2, KW, K1]. In a sharp contrast, for an orbifold model one has to take in addition decompositions into isotypic components of twisted chiral current algebra modules which become untwisted when restricted to the orbifold chiral subalgebra. As an application we construct (in Sect. 5) a family of RCFT extensions of W1+∞ -one for each value l(∈ N) of the central charge and for each finite subgroup 0 of U (l). It is designed to provide a bridge between two current attempts to understand the fractional quantum Hall effect in terms of chiral conformal algebras (see [FT] and [CTZ]).

1. Chiral Algebras Associated with Connected Compact Lie Groups We shall first recall the general notion of a chiral algebra and will then introduce a class of such algebras which appear to be of paramount importance in the study of RCFT. 1.1. Definition of a chiral algebra. Current algebras. The mathematical concept of a vertex or chiral algebra was introduced by R. Borcherds [Bor] and later developed by a number of authors (see, e.g. [FLM, Go, DGM, FZ, LZ, FKRW, KR2]). The version adopted here is a specialization of [K2] to Z-graded algebras (restricting from the outset attention to fields of a given conformal dimension). Let V be a Z+ -graded inner product space with a unique vacuum state, (n) , V = ⊕∞ n=0 V

dim V (0) = 1,

dim V (n) < ∞ ;

(1.1)

the gradation defines (and can be, conversely, defined by) a distinguished hermitian operator L0 called the (chiral) energy operator such that (L0 − n)V (n) = 0 .

(1.2)

The unique (up to a phase factor) vector |0i ∈ V (0) normalized by h0|0i = 1 is called the vacuum. A chiral field Y (s) of dimension s is a power series X Y (s) (z) = Yn z −n−s , s ∈ Z+ (1.3) n∈Z

with Yn (= Yn(s) ) ∈ EndV satisfying the commutation relations (CR), d + s Y (s) (z) , [Yn , L0 ] = nYn ⇔ [L0 , Y (s) (z)] = z dz Yn |vm i = 0 for vm ∈ V (m) ,

n>m.

(1.4) (1.5)

Equation (1.5) expresses the postulate that the vacuum is the lowest energy state in V . In physical terms V is the vacuum space of finite energy states. A chiral (vertex) algebra structure on V is a linear map, called the state-field correspondence, from V (s) to the space of fields of dimension s: V (s) 3 vs → Y (vs , z) = P −n−s , defined for all s ∈ Z+ and satisfying the following three axioms: n Yn (vs )z V1. Vacuum axioms: the vacuum vector corresponds to the identity operator in V :


61

Y (|0i, z) = 1V ;

(1.6)

the field Y (vs , z) allows to recuperate the vector vs : lim Y (vs , z)|0i = vs ,

z→0

i.e.Y−s (vs )|0i = vs and Yn−s (vs )|0i = 0 for n > 0 . (1.7)

V2. The translation operator L−1 : V → V defined by L−1 vs = Y−s−1 (vs )|0i satisfies the translation covariance condition: [L−1 , Y (vs , z)] =

d Y (vs , z) . dz

(1.8)

V3. The chiral fields are local: (z − w)n [Y (vs , z), Y (vs0 , w)] = 0 for n ≥ s + s0 .

(1.9)

Note that the inner product is not logically necessary in this generality. It is, however, present in all CFT (being indefinite for non-unitary theories) and gives rise to a distinguished (anti-involutive) star operation ([DGM]). We shall be concerned with (orbifolds of) chiral current algebras described below. Let G be a compact Lie group of the form G = G0 ×G1 ×. . .×Gs where G0 = U (1)r , and Gj , j = 1, . . . , s, are simple simply-connected groups. (Every compact Lie group can be viewed as a product of the above form factored by a finite central subgroup.) Let gj denote the Lie algebra of Gj (j = 0, . . . , s) and let L = {ω ∈ g0 | exp 2πiω = 1}. We assume that g is equipped with a symmetric integral negative definite invariant bilinear form. A bilinear form on g is called integral if the length square of any ω ∈ igj (j = 1, . . . , s) such that exp 2πiω = 1 (resp. of any ω ∈ L) is an even integer (respectively an integer). When restricted to a simple gj , the integrality property means that the bilinear form is equal to kj (v|v 0 ), where kj ∈ N will be identified with the level of the affine Kac-Moody algebra gˆ j and 1 (v|v 0 ) = ∨ trgj (adv adv0 ) 2gj (gj∨ is the dual Coxeter number of gj ; recall that with such a normalization (α|α) = 2 for long roots α). In what follows we let also k0 = 1. Remark 1.1. Admitting lattice vectors α with odd square lengths requires, as it will become clear shortly, extending the Z+ gradation of the vacuum space (1.1) to a 21 Z+ gradation. In physical terms it amounts to admitting locally anti-commuting (Fermi) fields of half-integer conformal dimensions in the chiral algebra. Such fields do not describe local observables (in the strict sense of the word) and could alternatively be incorporated in the positive energy representations of a chiral Bose algebra corresponding to an even integral lattice. A way to get rid of Fermi fields is to go to a double cover of the group G, which makes the lattice L even. Given the above data one can construct a chiral algebra A(G) = A(L) ⊗ (⊗sj=1 Akj (gj )) , called an affine (or current) chiral algebra as follows. For each gj (j = 0, 1, . . . , s) consider its affinization [K1]: gˆ j = C[t, t−1 ] ⊗R gj + CKj .

(1.10)

62


d It is a Z-graded algebra, the energy operator L0 acting on it as −t dt . j Let V0 (gj , kj ) denote the unique irreducible gˆ -module which admits a non-zero vector |0i such that (C[t] ⊗ gj )|0i = 0 and Kj |0i = kj |0i. Given an element v ∈ gj and n ∈ Z we let vn denote the operator on V0 (gj , kj ) corresponding to tn ⊗ v. Let v(z) = Σn∈Z vn z −n−1 be the current corresponding to v. Then the chiral algebra structure Akj (gj ) on the vacuum space V0 (gj , kj ) is defined for each j = 1, . . . , s by the following state-field correspondence: 1 n . . . v−i |0ij , z) =: ∂ i1 v 1 (z) . . . ∂ in v n (z) : /i1 ! . . . in ! Y (v−i 1 −1 n −1

(with appropriately defined normal products, [K2]). The vacuum space V is given by V = V (L) ⊗ ⊗sj=1 V0 (gj , kj ) .

(1.11)

In the next section we describe the first factor in (1.11) and the corresponding chiral algebra structure A(L) (cf. [K2], Sect. 5.4). 1.2. Lattice vertex algebras. Let Cε [L] be the twisted group algebra of the lattice L with basis eω (ω ∈ L) and multiplication rule 0

0

eω eω = ε(ω, ω 0 )eω+ω ,

ω, ω 0 ∈ L ,

(1.12)

where ε(ω1 , ω2 ) is a ±1-valued cocycle: ε(ω, 0) = ε(0, ω) = 1 , ε(ω, ω 0 )ε(ω + ω 0 , ω 00 ) = ε(ω, ω 0 + ω 00 )ε(ω 0 , ω 00 ) .

(1.13a) (1.13b)

Equation (1.13a) means that e0 = 1 and Eq. (1.13b) is equivalent to associativity. Let S = V0 (g0 , 1). This is the symmetric algebra over the positive energy subspace gˆ 0(+) = ⊕n 0 we associate a certain current representing the corresponding coroot α∨ : H α (z) =

X

Hnα z −n−1 , H0α = α∨ :=

n

2α , |α|2 = (α|α) . |α|2

(1.30)

We shall use the positive integer marks ai (and a∨ i ) of the Dynkin diagram of g which enter the expression for the highest root θ=

l X i=1

ai αi =

l X

∨ ∨ a∨ i αi = θ

(1.31)

i=1

(see [K1], Chap. 4, Tables). Their ratio relates the Cartan matrix aij of g to the symmetric Gram matrix of the coroots, ai (αi∨ |αj∨ ) = aij ∨ (aij = αi∨ |αj ) , aj while the sum of check marks of the extended Dynkin diagram gives the dual Coxeter number ∨ (1.32) tr adv1 adv2 = 2g ∨ (v1 |v2 ) . g ∨ = 1 + a∨ 1 + . . . + al The set of indices (j ∈)J for which the exponentials e2πi3j of the corresponding fundamental weights 3j generate the center Z(G) of the simply connected group G with Lie algebra g is given by J = {j = 1, . . . , l|

aj = 1} .

(1.33)

Let E α be a raising or a lowering operator, depending on the sign of α. Then the current CR (1.28) assume the form: [H α (z1 ), E β (z2 )] = (α∨ |β)E β (z2 )δ(z12 ) ∨

∨

(α, β roots) ,

0

[H (z1 ), H (z2 )] = −k(α |β )δ (z12 ), α

β

[E αi (z1 ), E −αj (z2 )] = 0 for i 6= j, [E α (z1 ), E −α (z2 )] = H α (z2 )δ(z12 ) −

(1.34) 2k 0 δ (z12 ) . |α|2


65

The affine chiral algebra Ak (g) contains the Sugawara stress energy tensor (see e.g. [K2] Sect. 5.7.): ( 1 X (α|α) : (E α (z)E −α (z) + E −α (z)E α (z)) : T (z) = 2h 2 α>0 (1.35) ) l X i ∨ : Hi (z)H (z) : , h = k + g . + i=1

Here H i and Hi correspond to dual bases in the Cartan subalgebra: H i = αi∨ ,

Hi = 3i ,

(αi∨ |3j ) = δij .

(1.36)

The normal product :: can be defined by either subtracting the singular in z12 part of an ordinary product Ja (z1 )J a (z2 ) or by ordering the frequency parts of the currents (inequivalent definitions of the normal product used in [FST] and [K2] yield the same expression for the stress energy tensor). Equations (1.34), (1.35) imply the Virasoro CR c 3 ∂ , (1.37) [T (z1 ), T (z2 )] = δ(z12 )∂2 T (z2 ) + 2T (z2 )∂2 δ(z12 ) + ∂2 δ(z12 ) ∂2 ≡ 12 ∂z2 where the Virasoro central charge exceeds the rank l of g. Denoting by d(g) the dimension of g, we have k (1.38) c = ck (g) = d(g) ≥ l . h The positive integer h entering (1.35) and (1.38) (the sum of the level and the dual Coxeter number) is called the height. The last inequality in (1.38) follows from the fact that T can be split into a sum of two commuting terms, the stress tensor TH of the Cartan subalgebra and a remainder TR : T = TH + TR ,

TH (z) =

l 1 X : Hi (z)H i (z) : . 2k

(1.39)

i=1

We find, as a consequence of (1.34), (1.35) and (1.36), [TH (z1 ), H i (z2 )] = ∂2 (δ(z12 )H i (z2 )) = [T (z1 ), H i (z2 )],

(1.40a)

[TR (z1 ), H i (z2 )] = 0 = [TR (z1 ), TH (z2 )].

(1.40b)

and hence For a level k > 1 simply laced (A-D-E) simple Lie algebra the RCFT with stress energy tensor TR correspond to (generalized) G/H parafermions – see [KP0 and Gep]. For a simply laced level 1 gˆ we have c1 (ˆg) = l and hence TR = 0. Note that the lattice chiral algebra A(L) could also contain a level 1 simply laced current subalgebra. In fact, each even (integral) lattice Lr has a sublattice Wr−ν ⊕ Lν ⊂ Lr of the same dimension r. Here Wr−ν is the root lattice of a direct sum of A-D-E (simple) Lie algebras, generated by vectors of length square 2, and Lν is its orthogonal complement (with no vector of length square 2), so that L/(Wr−ν ⊕ Lν ) is a finite abelian group, the glue group. The stress energy tensor T (z) of the chiral algebra A(G) is defined as the sum of the stress energy tensors of the factors of A(G).

66


2. Twisted Modules of a Current Chiral Algebra 2.1. Positive energy irreducible A(G)-modules. Let A(G) = A(L) ⊗ (⊗sj=1 Akj (gj )) be a current chiral algebra. Its positive energy irreducible modules are tensor products of such modules for each factor. Let L∗ = {µ ∈ g0 |(µ|ω) ∈ Z for all ω ∈ L} be the dual lattice. It is easy to see that the positive energy irreducible modules over A(L) are labeled by the elements of the finite abelian group L∗ /L as follows. Extend the cocycle ε(ω1 , ω2 ) to L∗ in such a way that (1.13) holds for ω, ω 0 ∈ L and ω 00 ∈ L∗ . We choose a vector µ of a coset of L∗ mod L, and let X S ⊗ eω . (2.1) Vµ (L) = ω∈µ+L

Then Eqs. (1.18), (1.19) and (1.22) define an irreducible positive energy module over A(L). As a consequence of the Sugawara formula (1.25), the ground state energy 1(µ) of the module Vµ (L) is given by 1(µ) =

(µ|µ) , if µ is a minimal length vector in µ + L . 2

(2.2)

Let g be the Lie algebra of a simple connected compact Lie group and let k be a non-negative integer. Then the integrable positive energy irreducible modules over gˆ of level k are labeled by the highest weight 3 of g in the lowest energy subspace (which is a finite-dimensional irreducible g-module). We denote these modules by V3 (g, k). Recall that 3 then satisfies the integrability condition ([K1], Chap. 12): (3|αi∨ ) ∈ Z+ for i = 1, . . . , l ,

(3|θ) ≤ k .

(2.3)

Each of the gˆ -modules V3 (g, k) extends to a Ak (g)-module and all positive energy irreducible Ak (g)-modules are obtained in this way [FZ]. As a consequence of Eq. (1.35), the ground state energy (= conformal dimension) 1(3) of the module V3 (g, k) is given by: 1(3) =

X (3 + 2ρ|3) , where h = k + g ∨ , 2ρ = α. 2h

(2.4)

α>0

2.2. ZN -twisted current chiral algebra modules. Let G0 be the connected compact Lie group whose maximal torus is U (1)r = Rr /L, i.e. L is the coroot lattice of G0 . (G0 contains the torus U (1)r but can, in general, be larger due to the presence of ω’s in L of length square 2; the semi-simple part of G0 is a product of simply laced compact simple Lie groups). Let (2.5) Gc = G0 × G1 × . . . × Gs , the corresponding decomposition of Lie algebras being g = g0 ⊕ g1 ⊕ . . . ⊕ gs .

(2.6)

Let Z j ⊂ Gj denote the center of Gj , j = 1, . . . , s, and let Z 0 = L∗ /L (Z 0 is a central subgroup of G0 ). The following finite subgroup of Gc will play an important role in the sequel: (2.7) Z(Gc ) = Z 0 × Z 1 × . . . × Z s .


67

Recall (see (1.33)) that the center of a simple connected simply connected compact Lie group consists of 1 and the elements (2.8) exp 2πi3j , where j ∈ J . P Recall that if Y (v1 , z) = n∈Z Yn (v1 )z −n−1 is a field of conformal dimension 1 of a chiral algebra, then Y0 (v1 ) is a derivation of A and exp Y0 (v1 ) converge in any positive energy A-module (see e.g. [K2], Sect. 4.9.). Since such derivations of the chiral algebra A(G) form the Lie algebra gC (the complexification of g), the group Gc acts on A(G) by automorphisms, and moreover, acts on each positive energy A(G)-module U in a consistent way (i.e. g(au) = g(a)g(u) for g ∈ Gc , a ∈ A(G), u ∈ U ) preserving the Hilbert metric. It follows from the usual properties of the Casimir operator that the stress energy tensor T (z) is a Gc -invariant observable: T (z) ∈ A(G)Gc .

(2.9)

Now we recall briefly the notion of a twisted module U over a chiral algebra A. Let b be an automorphism of order N of A; then we get a Z/N Z-grading A = ⊕m∈Z/N Z Am , where Am is the exp 2πim/N eigenspace of b. A b-twisted A-module U is a linear map a 7→ π(a) from A to the space of fields with values in End U such that the twisted Borcherds identity holds (see e.g. [KR2]), in particular all the CR are preserved and e2πiL0 π(a)e−2πiL0 = (−1)p(a) e

2πim N

π(a) for a ∈ Am .

(2.10)

If A = A0 , we get a usual (untwisted) A-module. Returning to A(G), fix β ∈ ig such that the corresponding element b = exp 2πiβ ∈ Gc has finite order N and choose a Cartan subalgebra of g containing iβ. Given a positive energy representation π of A(G) in a vector space U , we construct a b-twisted representation πβ in U as follows. First, due to the decomposition (1.10) of A(G) and the corresponding decomposition U = ⊗sj=0 U j , it suffices to construct the P bj -twisted representation πβ j in U j for each j, where β = j β j is the decomposition (2.6) and bj = exp 2πiβ j . Next, for a positive energy representation π of A(g) we let X α En+(α|β) z −n−(α|β)−1 , (2.11) πβ (E α (z)) = π(E α (z))z −(α|β) = n∈Z

and extend to the whole A(g) using the twisted Borcherds identity. In order to preserve CR we should have πβ (H α (z)) = π(H α (z)) −

k(α∨ |β) . z

(2.12)

Similarly, for a positive energy representation π of A(L) we let πβ (Y (eω , z)) = π(Y (eω , z))z −(ω|β) , πβ (ω(z)) = π(ω(z)) −

(ω|β) , z

(2.13) (2.14)

68


and extend to A(L) using the twisted Borcherds identity. The constructed b-twisted A(G)-module will be denoted by U (β) . Going to the stress tensor, which is a sum of a torus part, TL (1.25), and a contribution of type (1.35), (1.39) for each simple factor in G, we shall see that only the Cartan part 1X : vi (z)2 : , 2 r

T h = TL + T H ,

TL =

TH =

i=1

l 1 X : Hj H j : (z) 2k

(2.15)

j=1

changes following (2.12), (2.14) while the remainder TR in (1.39) is left unchanged. Proposition 2.1. If we set 1 1 T˜h = Th − β(z) + 2 (β|β)k , z 2z

T˜R = TR

(2.16a)

implying for the Laurent modes of T˜ 1 L˜ n = Ln − βn + (β|β)k δn0 , 2

(2.16b)

where (β|β)k = k|β|2 for each simple component of (2.6), then T˜ and J˜ satisfy the same CR as T and J (J standing for any of the g-currents, H α , E α , v i ) e.g. d ˜ ˜ . L˜ n , J(z) z n+1 J(z) = dz

(2.17)

Proof. It is straightforward to verify that the commutator of L˜ n with E˜ α ≡ πβ (Eα ) (2.11) reproduces (2.17). One further uses the fact that πβ defines a Lie algebra homomorphism on the currents, preserving their CR. The constant term in L˜ 0 is obtained by computing [L˜ 1 , L˜ −1 ]. 3. Twisted Characters and Modular Transformations The complete character of a positive energy A(G)-module V is defined on the product of the upper half plane τ and the group G as follows: ck (3.1a) χV (τ, z, u) = e2πi(k,u) trV q L0 − 24 e2πiz . Here q = e2πiτ (|q| < 1), z ∈ ig, (k, u) = u0 +

s X

kj uj ,

(3.1b)

j=1

uj are auxiliary (complex) parameters, L0 is the chiral energy operator (1.2), (1.4) (the zero mode of the stress energy tensor (1.24)), ck is the Virasoro central charge (cf. (1.38)): ck = r +

s X

ckj (gj ) .

(3.1c)

j=1

If V is irreducible then χV splits into a product of Kac-Moody and lattice characters; we reproduce their expressions and transformation properties separately. This will allow us to write down the general orbifold characters.


69

3.1. Kac-Moody and lattice characters. Let now G be a connected simply connected compact Lie group with a simple Lie algebra g. We shall use the following notation: M ∗ is the weight lattice dual to the coroot lattice M ; the set of level k dominant weights is [K1] (3.2) P+k = {3 ∈ M ∗ |(3|αi∨ ) ≥ 0, i = 1, . . . , l; (3|θ∨ ) ≤ k} ; Q ⊂ M ∗ is the root lattice; the quotient M ∗ /kM is a finite abelian group of order |M ∗ /kM | = k l |M ∗ /M |. The values of |M ∗ /M | may be found e.g. in [KW] (in the simply laced case |M ∗ /M | is the determinant of the Cartan matrix). The Kac-Moody character χ3 (τ, z, u) ≡ χV3 (g,k) (τ, z, u) can be expressed in terms of classical Θ functions of weight l/2 and certain almost holomorphic modular forms c3 λ , the string functions, of opposite weight ([K1], Eq. (12.7.12)): X M c3 (3.3a) χ3 (τ, z, u) = λ (τ )Θλk (τ, z, u) , λ∈M ∗ /kM 3−λ∈Q

M Θλk (τ, z, u) = e2πiku

X

k

q 2 (γ|γ) e2πik(γ|z) .

(3.3b)

γ∈M + λ k

We assume here that iz is an element of g and choose a Cartan subalgebra containing iz. The modular transformation law for Θ is given by (see Theorem 13.5 of [K1]): (z|z) 1 z 0 −1 M M S= : Θλk (τ, z, u) → Θλk − , , u − 1 0 τ τ 2τ (3.4) X (λ|λ0 ) e−2πi k ΘλM0 k (τ, z, u) , = (−iτ )l/2 |M ∗ /kM |−1/2 λ0 ∈M ∗ /kM

T =

1 0

(λ|λ) 1 M M M (τ, z, u) → Θλk (τ + 1, z, u) = eiπ k Θλk (τ, z, u) . : Θλk 1

(3.5)

The characters χ3 span a finite dimensional representation of SL(2, Z) as well (see [KP] or Theorem 13.8 of [K1]): X (z|z) 1 z = S330 χ30 (τ, z, u) ; (3.6) χ3 − , , u − τ τ 2τ k 0 3 ∈P+

here the S330 are given by the Kac-Peterson formula: X (3 + ρ|w(30 + ρ)) |1 | ∗ −1/2 + S330 = i |M /kM | , ε(w) exp −2πi h

(3.7)

w∈W (g)

where |1+ | is the number of positive roots, W (g) is the Weyl group of g, ε(w) = ± according to the parity of w, 2ρ and h are defined in (2.4), χ3 (τ + 1, z, u) = e2πim3 χ3 (τ, z, u) ,

(3.8)

ck (g) , (3.9) 24 where 1(3) is the conformal dimension (2.4), ck (g) is the Virasoro central charge (1.38). In the special case of g = su(2) we have m3 = 1(3) −

70


Sλλ0 =

p (λ + 1)(λ0 + 1) , 2/h sin π h

h=k+2,

(3.10a)

c λ(λ + 2) 3k − , c = ck (su(2)) = . (3.10b) 4h 24 h Note that for a simply laced affine algebra at level 1 (so that c = l) there is only one non-zero string function, which is a negative power of the Dedekind η-function: −l c3 3 (τ )|k=1 = (η(τ )) . Recall the transformation properties of the η-function: mλ =

1 η(− ) = (−iτ )1/2 η(τ ) , τ

η(τ + 1) = eπi/12 η(τ ) .

(3.11)

The matrix S simplifies in this case as it coincides with that for the lattice characters (see (3.14) below). It is clear from the construction that the lattice character χµ of the module Vµ (L) is given by L (τ, z, u) . (3.12) χµ (τ, z, u) = (η(τ ))−r Θµ1 Here, as before, z is an element of g0 and we choose a Cartan subalgebra containing z. (The expression (3.12) has, of course, the same form as the level 1 simply laced Kac-Moody character; it coincides with (3.3), (3.11) for L = M, r = l.) The modular transformation law for χµ can be read off (3.4), (3.5) and (3.11) (the expression for S in the counterpart of (3.6) being simpler than (3.7)): X 1 2 1 z |z| = Sµµ0 χµ (τ, z, u), (3.13) χµ − , , u − τ τ 2τ 0 ∗ µ ∈L /L

where

0

Sµµ0 = |L∗ /L|−1/2 e−2πi(µ|µ ) ;

(3.14)

1 r , 1(µ) = |µ|2 . (3.15) 24 2 As mentioned above, an irreducible positive energy A(G)-module V is the tensor product of the A(L)-module Vµ (L) and A(gj )-modules V3j (gj ). Hence positive energy irreducible A(G)-modules are parameterized by the set χµ (τ + 1, z, u) = e2πimµ χµ (τ, z, u) ,

mµ = 1(µ) −

P+k = (L∗ /L) × P+k1 × . . . × P+ks . Ps We let µ = 30 , call 3 = j=0 3j the highest weight of V , and write V = V3 . The character of V3 , 3 ∈ P+k , is the product χ3 (τ, z, u) =

s Y

χ3j (τ, z j , uj ) .

(3.16)

j=0

3.2. Modular transformations of twisted characters. Recall that the affine chiral algebra A(G) is defined by the data consisting of a compact group G and an invariant bilinear form on its Lie algebra g. This invariant bilinear form looks as follows: (x|y)k ≡

s X j=0

kj (xj |y j ) .

(3.17a)


71

We will also use the normalized invariant bilinear form (x|y) =

s X

(xj |y j ) .

(3.17b)

j=0

Let now β ∈ ig be such that b = exp 2πiβ ∈ G has finite order and choose a Cartan subalgebra of g containing iβ. It follows from (2.16b) and (3.1) that the value of the character of the b-twisted A(G)-module V3(β) at e2πiα ∈ G is given by the following formula: c L0 −β+ 21 (β|β)k − 24k 2πiα e χα,β 3 (τ ) ≡ trV3 q (3.18) 1 = eπi(α|β)k χ3 τ, α − τ β, − (α − τ β|β) . 2 Each factor in (3.18) can be written in a similar form for the Kac-Moody and the lattice case (assuming that α and β lie in the same Cartan subalgebra): X

χα,β 3 (τ ) =

α,β c3 λ (τ )Θλk (τ )

(3.19)

λ∈M ∗ /kM

3−λ∈Q

α,β = [η(τ )]−r Θµ1 (τ )

χα,β µ (τ )

(3.20)

where in both cases

α,β (τ ) Θλk

=e

iπk(α|β)

1 τ, α − βτ, (βτ − α|β) 2

M Θλk

=

X

k

q 2 |γ−β| e2πik(γ|α) 2

γ∈M + λ k

(3.21) (We can read off the lattice Θ-function from (3.21) setting M = Q = L, λ = µ, k = 1.) The modular transformation law for twisted characters is deduced from the known transformation properties of Kac-Moody and lattice characters (3.6–3.9) and (3.13–3.15) using the following lemma (cf. [KP2] and [K1]). Lemma 3.1. Let the finite set of functions {Fi (τ, z, u), i ∈ I} be closed under modular transformations: Fi for

a c

z c(z|z) aτ + b , ,u − cτ + d cτ + d 2(cτ + d)

=

X

Aij ∈ C ,

Aij Fj (τ, z, u) ,

(3.22)

j∈I

b ∈ SL(2, Z). Define d Fiα,β (τ )

Then

Fiα,β

= Fi

aτ + b cτ + d

1 τ, α − τ β, − (α − τ β|β) 2

=

X j∈I

.

(3.23)

Aij Fjdα−bβ,aβ−cα (τ ) .

(3.24)

72


Proof. If we set α−β then

aτ + b z˜ = , with z˜ = dα − bβ − (aβ − cα)τ, cτ + d cτ + d

Fiα,β

aτ + b cτ + d

= Fi

z˜ c(z| ˜ z) ˜ aτ + b , , u˜ − cτ + d cτ + d 2(cτ + d)

where u˜ = 21 (z|cα − aβ). The law (3.24) then follows from (3.22).

,

It is now straightforward to apply Lemma 3.1 to (3.18) to find the transformation formula of twisted A(G)-characters χα,β 3 using the transformation formula for complete characters from the previous section. Introduce the following notation: S3,30 = m3 =

s Y j=0 s X

S3j ,30j ,

(3.25a)

m3j ,

(3.25b)

j=0

where the S3j ,30j are given by (3.7) and (3.14) and the m3j are given by (3.9) and (3.15). Then we have X 1 = e2πi(α|β)k S330 χβ,−α (τ ) , − χα,β 3 30 τ 0

(3.26)

3

1

2πi(m3 + 2 (β|β)k ) α−β,β χα,β χ3 (τ ) . 3 (τ + 1) = e

(3.27)

3.3. Small τ asymptotics of twisted characters of A(G). The small τ asymptotics will be used in the sequel for singling out non-trivial orbifold modules. Since the parameter β = 2π i τ (which has a positive real part) can be interpreted as inverse temperature, the small τ asymptotics can be interpreted as the high temperature behaviour. α,β ck /24 α,β (τ ), q ck /24 c3 χ3 (τ ) involve Lemma 3.2. (a) The q-expansions of Θλk λ (τ ) and q only non-negative powers of q. α,β (τ ) has a non-zero constant term iff λ − kβ ∈ kM . This (b) The q-expansion of Θλk constant term equals e2πi(α|β)k . (c) The q-expansion of q ck /24 c3 λ (τ ) has a non-zero constant term iff 3 = k3j with j ∈ J (see (1.33)) or 3 = 0, and λ − 3 ∈ kM . This constant term equals 1. (Recall that 3j are fundamental weights.)

(d) The q-expansion of q ck /24 χα,β (τ ) has a non-zero constant term iff 3 = k3j with j ∈ J or 3 = 0, and 3 − kβ ∈ kM . This constant term equals e2πi(α|β)k . Proof. (a) and (b) are clear. (c) is proved in [KW]. (d) follows from (b) and (c) by making use of (3.19).


73

The modular inversion S relates low temperature to high temperature behaviour and is a key to computing small τ asymptotics. πick 1 By Lemma 3.2(a) and (d) each term in the expansion of e− 12τ χα,β 3 (− τ ) vanishes exponentially for τ ↓ 0 unless 3 = k3j with j ∈ J or 3 = 0, and 3 − kβ ∈ kM , hence, by Lemma 3.2(d): πick 1 lim e− 12τ χα,β − 3 τ ↓0 τ (3.28) 2πi(α|β)k for 3 = k3j , j ∈ J, or 3 = 0, and 3 − kβ ∈ kM , = e 0 otherwise. Similarly, we have: lim e− τ ↓0

πick 12τ

2πi(α|β) 1 = e − χα,β µ τ 0

for β − µ ∈ L, otherwise.

(3.29)

Substituting τ by − τ1 in (3.26): χα,β 3 (τ )

=e

2πi(α|β)k

X 30

S330 χβ,−α 30

1 − τ

πick

,

α.β − ∗ −1/2 2πi(3 |α) we eP Qs find, using (3.28) and (3.29), that e 12τ χ3 (τ j) ∼ |L /L| s j S , as τ ↓ 0, where γ = k 3 with i ∈ J or γ = 0, if α + j j i j 3 ,γj j=1 j=1 γj s j ∈ L ⊕ (⊕j=1 M ), and tends to 0 otherwise. Recalling that [KW]

S3,k3j = S3,0 e−2πi(3|3j ) if j ∈ J ,

0

(3.30)

we arrive at the following result. Proposition 3.1. The high temperature asymptotics of twisted A(G) characters is given by πic 2πi(3|β)k − 12τk α,β if exp 2πiα ∈ Z(G) χ3 (τ ) = S3,0 e lim e τ ↓0 0 otherwise. Here Z(Gc ) is the finite central subgroup of Gc defined by (2.7) and we use (2.8).

4. Affine Orbifolds 4.1. Projection on a centralizer’s irreducible representation. Asymptotic dimension. Let as before β ∈ ig be such that b = exp 2πiβ ∈ Gc has finite order. Given a positive energy A(G)-module U , we have the b-twisted module U (β) constructed in Sect. 2.2. Consider the chiral subalgebra A(G)b of fixed elements of A(G) with respect to Adb . When restricted to A(G)b , U (β) becomes an untwisted A(G)b -module. This simple, but important observation allows one to construct in many cases all untwisted modules of a chiral algebra (see e.g. [KR2]). We shall use in the sequel the following orthogonality relations of irreducible characters of a finite group 0 :

74


σ(g) 1 X ∗ δσ,σ0 , σ, σ 0 ∈ 0ˆ , σ (h)σ 0 (hg) = |0| σ(1) h∈0 1 X ∗ σ (g)σ(h) = δg, g, h ∈ 0 . ¯ , ¯ h |0g |

(4.1) (4.2)

σ∈0ˆ

Here and further 0ˆ denotes the set of all irreducible characters (= representations) of 0, σ ∗ stands for the complex conjugate character, 0g stands for the centralizer of g ∈ 0. We shall also denote by g¯ the conjugacy class of g in 0. Recall that |0| = |0g ||g|. ¯ Let 0 be a finite subgroup of the compact group Gc . We shall consider 0 as the gauge group of our CFT and define the chiral subalgebra A0 of gauge invariant observables as the set of Ad0 -invariant elements of A(G). This is called an orbifold chiral algebra. One can ensure that A0 only contains local Bose fields (even when A(L) involves fermionic vertex operators) replacing L by Leven (the maximal even sublattice of L) and 0 by its extension by L/Leven . It will be the objective of this section to construct a set of positive energy representations of A0 which again give rise to an RCFT. That will be demonstrated in the next section by displaying the SL(2, Z) properties of their characters. (This is, in general, not the case if the subgroup 0 of G is infinite.) The A0 -modules in question are obtained by splitting the twisted A(G)-modules into A0 -invariant parts. Remark 4.1. It is clear that A0 = A0Z(Gc ) , where 0Z(Gc ) is the finite subgroup of Gc generated by 0 and Z(Gc ). Hence the orbifold model does not change if we enlarge 0 by the central group Z(Gc ) and in principle we may assume that 0 contains Z(Gc ) (but we shall not do that). Pick b ∈ 0 and write it in the form b = exp 2πiβ, where iβ ∈ g. Let 0β be the stabilizer of β in 0 with respect to the adjoint action of 0 on g. Then the twisted A(G)module U (β) becomes untwisted with respect to the chiral subalgebra A(G)0β of fixed elements with respect to 0β . It follows from the construction that the group 0β acts on U (β) . Let σ be an irreducible character of the group 0β . It follows from (4.1) that the projector on the σ-isotypic component of a representation of 0β is given by Pσ =

σ(1) X ∗ σ (h)h . |0β |

(4.3)

h∈0β

The subspace Pσ U (β) is irreducible with respect to the pair (0β , A(G)0β ). This can be proved in the same way as Theorem 1.1 from [KR2]. It follows that the A(G)0β -module Pσ U (β) is isomorphic to the sum of σ(1) copies of an irreducible module which we denote by Uσ(β) . Since the affine orbifold A(G)0 is contained in A(G)0β , we obtain a A(G)0 -module Uσ(β) by restriction. Take now U = V3 . It follows from (3.18) and (4.3) that the character ck 1 (β) χβ3,σ = trq L0 −β+ 2 (β|β)k − 24 of the A(G)0 -module V3,σ is given by χβ3,σ (τ ) =

1 |0β |

X

σ ∗ (h)χα,β 3 (τ ) .

h∈0β h=e2πiα ,[α,β]=0

Applying the orthogonality relation (4.2), we can invert (4.4):

(4.4)


χα,β 3 (τ ) =

X

75

σ(h)χβ3,σ (τ ) for h = e2πiα .

(4.5)

σ∈0ˆ β

Let Z = 0 ∩ Z(Gc ) denote the small center of the subgroup 0 of Gc . Theorem 4.1. The orbifold character χβ3,σ (τ ) is nontrivial iff 3 and σ agree on Z: 3|Z = σ|Z .

(4.6)

lim e− 12τ χβ3,σ (τ ) = S3,0 σ(1)|Z|/|0β | .

(4.7)

Provided that (4.6) holds, one has: πic

τ ↓0

(β) = 0 if (4.6) fails. Furthermore, by Proof. It is clear from the construction that V3,σ Proposition 3.1 and (4.4) we have: πic

lim e− 12τ χβ3,σ (τ ) = τ ↓0

S3,0 X ∗ σ (h)e2πi(3|α)k . |0β | h∈Z h=e2πiα

It follows from the orthogonality (4.1) of characters of the group Z that this is zero unless (4.6) holds, in which case it is given by the right-hand side of (4.7). The latter is positive since S3,0 is a positive real number (see the discussion below). An important characteristic of a chiral algebra module V is its asymptotic dimension [KP2, KW] and Sect. 13.13 of [K1]. It is defined as the coefficient a(V ) of the leading term of the small τ (or high temperature) expansion of the specialized character χV : c

πic

χV (τ ) = trV q (L0 − 24 ) ≈ a(V )e 12τ .

(4.8)

For example Theorem 4.1 states that the asymptotic dimension of the orbifold module (β) is given by the right hand side of (4.7) provided that condition (4.6) holds. The V3,σ positive reals a(V ) have multifold interpretations. If A(V1 ) ⊂ A(V2 ) are two chiral algebras (with V1 ⊂ V2 ) then a(V2 )/a(V1 ) gives the index of embedding of the associated von Neumann algebras (see [R, LR] and [RST] and references therein). If V3k is an affine algebra module and V0k the corresponding vacuum module of height h then ah (3)/ah (0) is the “quantum dimension” of V3 [V]. In the case at hand the knowledge of a(V ) appears as an efficient tool for singling out non-trivial orbifold modules, and, as we shall see, for handling the splitting of reducible modules into irreducible components. An A(G)-module V3 appears as an outer product of representations of the chiral algebras A(L) and Akj (gj ). (We use the term outer tensor product to be distinguished from the tensor product of representations of a group G that is again regarded as a representation of G rather than as a representation of the direct product G × G.) The asymptotic dimension of an outer product of representations obviously equals the product of asymptotic dimensions of factors. Hence the asymptotic dimension a(V3 ) of a A(G)-module V3 is equal to the product of a(Vµ (L)) and a(V3j (gj )), j = 1, . . . , s. The asymptotic dimension of lattice modules is independent of µ: a(Vµ (L)) = Sµ,0 = |L∗ /L|− 2 . 1

(4.9)

The asymptotic dimension of Kac-Moody modules is given by (see [KP,KW] or [K1] (13.8.10)):

76


a(V3 (g)) = S3,0 = |M ∗ /hM |− 2

1

Y

2 sin

α>0

π(3 + ρ|α) . h

(4.10)

This number is positive since (3|α) ≤ k and (ρ|α) ≤ (ρ|θ) = g ∨ −1, so that (3+ρ|α) < h = k + g∨ . 4.2. Affine orbifold models for non-exceptional 0. Action of Z. Modular transformations. In order to construct a modular invariant family of 0-orbifold modules we need to impose some restrictions on the subgroup 0 of Gc . Let Z be the small center of 0. Definition 4.1. An element b ∈ 0 is called non-exceptional if there exists β(b) ∈ ig such that b = exp 2πiβ(b) and 0b = 0β . The subgroup 0 of the compact group Gc is called a non-exceptional subgroup if for any g ∈ 0 there exists ζ ∈ Z such that ζg is a non-exceptional element of 0. An element g ∈ Gc is called an Ad-exceptional element of Gc if it cannot be written in the form g = bζ, where b is a non-exceptional element of Gc and ζ ∈ Z(G). Obviously, a subgroup 0 of Gc containing Z(Gc ) (recall that, due to Remark 4.1, we may assume that 0 ⊃ Z(Gc )) which does not contain Ad-exceptional elements of Gc is a non-exceptional subgroup of Gc . We shall describe Ad-exceptional elements of a compact group G in Appendix B. Here we only note that U (n) contains no exceptional elements and SU(n) contains no Ad-exceptional elements. Any connected simple compact Lie group other than SU(n) does contain Ad-exceptional elements. From now on let 0 be a non-exceptional finite subgroup of the compact Lie group Gc . It follows from the definition that for each g ∈ 0 there exists a ζ ∈ Z such that b = ζ −1 g is non-exceptional. Moreover for each g of a conjugacy class g¯ we can choose the same ζ ∈ Z and a map β : b¯ → ig satisfying b = e2πiβ(b) ,

β(hbh−1 ) = Adh β(b)

¯ h∈0. for all b ∈ b,

(4.11)

Note that a choice of β(b) such that 0b = 0β(b) , determines uniquely the map β satisfying (4.11). A quadruple (3, b, β, σ), where 3 ∈ P+k , b is a non-exceptional element of 0, β is a map satisfying (4.11) and σ ∈ 0ˆ β is called an admissible quadruple if the compatibility (β(b)) is nontrivial for any condition (4.6) holds. Due to Theorem 4.1 the A0 -module V3,σ β admissible quadruple (3, b, β, σ); we shall denote it by V3,b,σ . We have for any g ∈ 0 the identity Adg β β (4.12a) V3,gbg −1 ,σ g = V3,b,σ , where σ g ∈ 0ˆ gbg−1 is defined by σ g (h) = σ(g −1 hg) .

(4.12b)

We thus obtain the first equivalence of admissible quadruples: (3, b, β, σ) ∼ (3, gbg −1 , Adg β, σ g ) .

(4.13)

Recalling that (4.11) defines a map β : b¯ → ig and dropping the superscript g on σ we may denote the character of the module (4.12a) by χβ3,b,σ ¯ . Furthermore, if β(b) is replaced by β(b) + m, where


e2πim = 1 ,

[β(b), m] = 0 ,

77

0β(b)+m = 0b ,

(4.14)

then β+m β = V3,b,σ , V3,b,σ⊗σ m

(4.15a)

where σm is a 1-dimensional representation of 0b defined by σm (h) = e2πi(m|α)k for h = e2πiα ∈ 0b .

(4.15b)

Here and further we are using the following simple fact. Lemma 4.1. Let G be a connected compact Lie group with Lie algebra g and let λ ∈ ig be a weight, i.e. (4.16a) e2πi(λ|m) = 1 if e2πim = 1 and [λ, m] = 0 . Then λ defines a 1-dimensional representation σλ of its stabilizer Gλ by the formula σλ (g) = e2πi(λ|γ) for g = e2πiγ ∈ Gλ , γ ∈ igλ .

(4.16b)

Proof. Since the group Gλ is connected, it is generated by elements g of the form (4.16b). The map σλ is independent of the choice of γ representing g due to (4.16a). If gj = e2πiγj ∈ Gλ where γj ∈ igλ , j = 1, 2, then the Cambell-Hausdorff formula implies σλ (g1 g2 ) = exp{2πi[(λ|γ1 +γ2 )+(λ|γ)]} where γ is a linear combination of commutators [γ1 , γ2 ], . . . , [[γi1 , γi2 ], . . . , ], for i1 , i2 , . . . ∈ {1, 2}. But (λ|[γ1 , γ2 ]) = ([λ, γ1 ]|γ2 ) = 0 and the same holds for multifold commutators of γj . Thus (4.16b) does indeed define a 1-dimensional representation of Gλ . The isomorphism (4.15) gives a second equivalence relation for admissible quadruples: (4.17) (3, b, β(b), σ) ∼ (3, b, β(b) + m, σ ⊗ σm ) provided that m ∈ ig satisfies (4.14). In deriving the equality of the corresponding characters we use the identity e−2πi(m|α)k χα,β+m (τ ) = χα,β 3 3 (τ ) .

(4.18)

The least obvious equivalence relation appears when two non-exceptional elements of 0 are obtained from each other by multiplication with an element ζ ∈ Z. Every element of Z can be written in the form , . . . , ζj(s) ) ∈ Z 0 × · · · × Zs , ζ = (ζj(0) 0 s

(ν)

ζj(ν) = e2πi3j or 1 .

∗ Here {3(0) j } generate the finite abelian group L /L; for each simple component g the fundamental weight 3j belongs to the set J of indices with aj = 1, see (1.33). If both b and ζj b are non-exceptional we can write

kβ(ζj b) = kβ(b) + k3j + m, Ad0b (k3j + m) = k3j + m ,

e

2πim

(4.19a) =1.

(4.19b)

We proceed to define the action ζj on σ and 3. According to Lemma 4.1 the phase factor 0 0 (4.20) σj (b0 ) = e2πi(k3j +m|β ) for b0 = e2πiβ , Ad0b β 0 = β 0

78


gives rise to a 1-dimensional representation σj of 0b . The transformation 3 → ζj (3) of a lattice weight 3 ∈ L∗ is given by ζj (3) = (3 + 3j ) mod L. If g is a simple rank ` Lie algebra and 3 ∈ P+k , then ζj (3) is defined by ζj (3) = k3j + wj 3,

(4.21a)

where wj is the unique element of the Weyl group W of g that permutes the set {−θ, α1 , . . . , α` } and satisfies (4.21b) −wj θ = αj . Theorem 4.2. The pair of non-exceptional quadruples ! ÿ s X ν ¯ β, σ) 3 and x = (3, b, 3= ÿ ζ(x) =

ν=0

X

(wjν 3 + kν 3jν ), ζb, β + ν

ν

X

3j ν

ν

mν + kν

!

, σ ⊗ ⊗ ν σjν

(4.22) gives rise to the same orbifold module leaving the corresponding character invariant. The action of the center on non-exceptional quadruples for which b and ζb belong to the same conjugacy class b¯ has no fixed points for level k = 1 in the simply laced case, but may have a fixed point for higher levels. For G = SU(2) this happens for even k and 3 = 1 2 k. An example of this type is provided in Sect. 6 (see Example 6.4). The corresponding twisted orbifold module turns out to be reducible in this case. Understanding its splitting into irreducible components requires more work and will be postponed to a subsequent publication. Here we shall restrict our attention to the case when Z acts on the admissible quadruples without fixed points (thus including all level 1 orbifolds, all SU(p) orbifolds (with p prime) for levels not divisible by p, as well as all 0 ⊂ G orbifolds with a trivial small center). We denote by X the set of equivalence classes of all admissible quadruples with equivalence relations (4.13), (4.17) and (4.22). One may use the following description of X . Consider the action of Z × 0 on 0 for which Z acts by multiplication and 0 by conjugation. Choose a subset B ⊂ 0 consisting of non-exceptional representatives of orbits of this action, and for each b ∈ B choose β(b) ∈ ig satisfying (4.11). We call such B an admissible subset of 0. Then X may be identified with the set of admissible quadruples (3, b, β(b), σ), where 3 ∈ P+k , b ∈ B, σ ∈ 0ˆ b , with the equivalence relation that occurs only if ζb = gbg −1 for some ζ ∈ Z and g ∈ 0 . Then we let (cf. (4.22)): ÿ X ∗ (wjν 3ν + k3jν ), b, β(b), σ ⊗ σP (3, b, β(b), σ) ∼ ν

(4.23a) !

ν

mν

⊗ σ(1−Adg−1 )β(b)

.

(4.23b) We can state now our main result. Theorem 4.3.


79

(a) Under the modular inversion S the characters χx (x ∈ X ) transform among themselves: X X X X 1 β = − S330 Sbσ, χβ3,b,σ ¯ ¯ b¯ 0 σ 0 χ30 ,b¯ 0 ,σ 0 (τ ), (4.24a) τ 0 0 g=ζ ¯ 0 b0 ⊂0

b=e2πiβ ∈b¯

σ ∈0ˆ b 3

0 b0 =e2πiβ ∈b0 [β,β 0 ]=0

where S330 is the affine Kac-Moody S-matrix (3.25a), and the “group theoretic” factor looks as follows: β Sbσ, ¯ b¯ 0 σ 0 =

1 |0|

X

0

σ 0 (b)σ(b0 )e−2πi(β(b)|β(b ))k .

(4.25)

¯ 0 ∈b0 b∈b,b bb0 =b0 b

For levels and groups 0 ⊂ G for which the small center Z acts without fixed points each equivalence class of quadruples in X is encountered |Z| times and we can write X 1 β β β0 = − |Z|S330 Sbσ, (4.24b) χ3,b,σ ¯ ¯ b¯ 0 σ 0 χ30 ,b¯ 0 ,σ 0 (τ ) . τ 0 0 0 0 (3 ,b ,β ,σ )∈X

(b) If the lattice L is even then the characters χx are eigenfunctions of the modular translation T : ∗ 1 σ (b) β β 0 β(b)|β(b ) k χ ¯ (τ ) . (4.26) χ3,b,σ ¯ (τ + 1) = exp 2πi m3 + 2 σ(1) 3,b,σ They are eigenfunctions of T 2 also for odd lattices. (c) The inverse matrix S −1 is complex conjugate to S. The matrix S in (4.24b) is manifestly symmetric and hence also unitary. (d) The matrix elements of S and T remain unchanged under the equivalence relations (4.13), (4.17), (4.22), (4.23). (e) The charge conjugation operator C = S 2 gives rise to the following involutive permutation of the set X : C : (3, b, β(b), σ) 7−→ (3c , b−1 , β(b−1 ), σ c ),

(4.27a)

where 3c = −3 in the lattice case, 3c is the highest weight of the contragredient to 3 representation of g in the affine case, and σ c (h) = σ ∗ (h)e2πi(β(b)+β(b

−1

)|α)k

for h = e2πiα ∈ 0b .

(4.27b)

Proof of Theorem 4.2. We shall content ourselves with verifying the equality of characters for admissible quadruples (4.22). The crux of the argument is the proof of the relation α,β+3 +m

χk3j +wjj 3 (τ ) = e2πi(3j +m|α)k χα,β 3 (τ )

(4.28)

(for an appropriate choice of m ∈ M ) in the case of a (rank `) simple Lie algebra g. To prove it we use the Weyl-Kac formula for the affine characters ([K1] Chap. 10). We first extend the coroot and weight spaces of g by introducing the central element

80


K=

` X

∨ ∨ ∨ a∨ ν αν ↔ α0 = K − θ

(4.29)

ν=0

and the gradation operator d(↔ −L0 ) (see Chap. 7 of [K1]). The bilinear form (.|.) is extended to the resulting ` + 2 dimensional space by (K|K) = (d|d) = 0 = (K|αi ) = (d|αi ) ,

i = 1, . . . , ` ;

(K|d) = 1 .

(4.30)

The Weyl-Kac formula then gives:

P χα,β kd+3 (τ ) =

w˜

ε(w)e ˜

P w˜

2πi τ

ε(w)e ˜

|β|2 2

2πi τ

K−β−d +α|w(kd+3+ ˜ ρ) ˜ |β|2 2

,

(4.31)

K−β−d +α|w˜ ρ˜

where the sum is over the affine Weyl group W (ˆg), ρ˜ is defined by ρ˜ = g ∨ d + ρ ,

ρ=

` X

3i ,

(4.32)

i=1

and ε(w) ˜ = ±1 according to the parity of w. ˜ We define the element w˜ j of the extended ˆ as follows (cf. Sect. 1 of [FKW] and Appendix B below): affine Weyl group W w˜ j = tj wj , wj d = d ,

tj d = d + 3j −

|3j |2 K, 2

tj v = v − (v|3j )K(v ∈ h) ,

(4.33)

w˜ j K = K ,

(where wj ∈ W (g) is defined on h as above). We shall use the following three properties of w˜ j : (i) it preserves the extended Killing form; (ii) it leaves ρ˜ invariant; (iii) it normalizes W (ˆg). They allow us to write down the exponent in the numerator of (4.31) as 2 |β| K − β − d + α |w {w˜ j (kd + 3) + ρ} ˜ = w˜ j τ 2 k τ |wj β + 3j |2 − k wj α|3j ) + (wj α − τ d + 3j + wj β |w {ρ˜ + w˜ j (kd + 3)} . 2 It follows that

w α,w β+3j

j χw˜ jj (kd+3)

(τ ) = e2πik(wj α|3j ) χα,β kd+3 (τ ) .

(4.34)

Observing on the other hand the invariance relation w−1 α,wj−1 β

j χkd+3

(τ ) = χα,β kd+3 (τ )

and the fact that w˜ j (kd + 3) can be substituted by ζj (kd + 3) in the expression (3.18) for the character, we complete the proof of (4.28). It remains to insert the result into (4.4) in order to conclude that


81

β+3j +m (τ ) ¯ ¯ j 3+3j ,ζj b,σ⊗σj

χw

= χβ3,b,σ ¯ (τ ) ,

(4.35)

thus proving Theorem 4.2.

Proof of Theorem 4.3. We use the assumption that 0 is a non-exceptional subgroup of G in order to express h in the formula (4.4) for the orbifold character by a non-exceptional element b0−1 : 0 (4.36a) h = ζb0−1 = e2πi(αζ +β− ) , where

ζ = e2πiαζ ∈ Z ,

0

b0−1 = e2πiβ− .

[αζ , β(b)] = 0 ,

This allows to rewrite (4.4) in the form 1 X χβ3,b,σ ¯ (τ ) = |0| ¯ b∈b

X

β 0 ,β

σ(b0 )χ3− (τ ),

(4.36b)

(4.37)

h=ζb0−1 ∈0b [β(b),β 0 ]=0 −

where we have used the relation 0 αζ +β− ,β

χ3

β 0 ,β

(τ ) = e2πi(3|αζ )k χ3− (τ )

(4.38a)

for e2πi(m|αζ ) = 1 whenever m ∈ M , [αζ , m] = 0, implying 0 αζ +β− ,β

σ ∗ (h)χ3

∗

β 0 ,β

(τ ) = σ(b0 )χ3− (τ )

0−1

(4.38b)

0

for σ|Z = 3|Z (we have also used σ (b ) = σ(b )). Inserting the modular inversion law (3.26) into (4.37) we find X X 0 1 X 1 β,−β 0 χβ3,b,σ = σ(b0 )e2πi(β|β− )k S330 χ30 − (τ ) , (4.39a) − ¯ τ |0| ¯ 0 0−1 b∈b

3

h=ζb b0 ∈0 bβ,−β 0 −

where, in view of (4.5), we can write 0 β,−β−

χ3

(τ ) =

X

0

σ −β− (b)χ

−β 0 σ − ∈0ˆ b

0 −β−

30 ,b¯ 0 ,σ

−β 0 −

.

(4.39b)

Finally, we would like to substitute the upper index of χ by the phase β 0 of b0 which 0 by a coroot: differs from −β− 0

b0 = e2πiβ ⇒ e−2πi(β Applying (4.15) we obtain 1 β χ3,b,σ − ¯ τ X 1 X = |0| g⊂ ¯ 0

b=e2πiβ ∈b

0 g=ζ ¯ 0 b0 b0 =e2πiβ ∈b0 bb0 =b0 b

0

X X σ 0 ∈0ˆ b 30

0 +β− )

=1

0 ([β 0 + β− , β] = 0) .

σ(b0 )σ 0 (b)e−2πi(β|β

0

)k

0

(4.40)

S330 χβ30 ,b¯ 0 ,σ0 (τ ), (4.41)

82


where

0

σ 0 (b) = σ −β− (b)e2πi(β

0

0 +β− |β)k

.

(4.42)

If the small center Z acts on admissible quadruples for which ζb ∈ b¯ without fixed points, then each term in the sum is encountered exactly |Z| times and we end up with (4.24b), (4.25). The T -transformation law (4.26) follows from Eq. (3.27): X 2πi{m3 + 21 (β|β)k } σ ∗ (h)χα−β,β (τ ) χβ3,b,σ ¯ (τ + 1) = e 3 h∈0b h=e2πiα

(4.43)

[α,β]=0

= e2πi{m3 + 2 (β|β)k −(σ|β)} χβ3,b,σ ¯ (τ ) . 1

Here we have used the fact that b is in the center of 0b and σ(∈ 0ˆ b ) is irreducible, so that σ ∗ (b) , (4.44a) σ ∗ (h) = σ ∗ (hb−1 ) σ(1) where the last factor is a complex number of absolute value 1 which can be written as σ ∗ (b) =: e−2πi(σ|β) . σ(1)

(4.44b)

(Equation (4.44b) thus defines a linear functional (σ|β) in β whose exponential agrees with the value of 3 on Z.) 0 1 Using once more Lemma 3.1 for the inverse transformation to (3.4) we −1 0 −1 ∗ derive S = S , where ∗ stands for complex conjugate. The symmetry of S is manifest β from the expressions for S330 and Sbσ, ¯ b¯ 0 σ 0 . To prove the invariance of S-matrix elements with non-exceptional entries under the equivalence relation (4.23) we use an extension of (3.30): 0

Sζj (3),30 = e−2πi(3j |3 ) S330 ,

(4.45)

(cf. [KW]) and the fact that σ 0 and 30 coincide on the central element 3j . To verify that T is also invariant under ζj one uses (σ|3j + m) = (3|3j + m) and (wj 3|3j ) = (3|wj−1 3j ) = (3|3j + m0 ) (m0 ∈ M ) to prove that the phase φ(kd + 3, β, σ) =

k 1 (kd + 3 + 2ρ|kd ˜ + 3) − (σ|β) + |β|2 2k 2

changes by an integer: k (|3j |2 − |3j + m|2 ) + (3|m0 − m) + (3 − σ|3j + m) 2 1 = (3|m0 − m) − k{(3j |m) + |m|2 } ∈ Z(⇒ e2πi1φ = 1) . 2

1φ =

We finally proceed to prove (4.27). To this end we compute C = S 2 by applying Lemma 3.1 to the central element of SL(2, Z).


−1 0

0 1

83

2 =

−1 0

0 −1

.

This gives 1 X ∗ 0 −β(b0 ),−β(b) σ (b )χ30 (τ ) |0b | 0 b ∈0b 30 X X β β0 β0 = C330 Cbσ, ¯ b¯ 0 σ 0 χ30 ,b¯ 0 ,σ 0 (τ ) ,

(Cχ)β3,b,σ ¯ (τ ) =

X

30

C330

b¯ 0 σ 0

where C330 = δ3c 30 is known from the modular properties of affine Kac-Moody characters ([K1] Chap. 13), while the second factor is computed to be 0

β β Cbσ, ¯ b¯ 0 σ 0 = δb−1 ,b0 δσ c ,σ 0 δβ,−β 0 .

(4.46)

We note that the equivalence class v of the vacuum admissible quadruple, i.e. that corresponding to the vacuum A(G)0 -module, is selfconjugate: v := class of (0, 1, 0, 1) = Cv. Note also the following formula for any x = (3, b, β(b), σ) ∈ X : Sx,v = S3,0

¯ |b| σ(1) . |0|

Remark 4.2. It follows from Lemma 3.2d that the eigenvalues of L0 are strictly positive in all A(G)-modules Vx , x ∈ X , except for the vacuum module Vv . The 0th eigenspace of L0 in Vv is C|0i. Remark 4.3. The A(G)0 -modules Vx and VCx (x ∈ X ) are contragredient. 4.3. Fusion rules. We can summarize the most important features of the outcome of the previous section as follows. Starting with a compact Lie group G = (Rr /L) × G0 , where G0 is simply connected, and a negative definite integral invariant bilinear form on its Lie algebra which is even on the lattice L, we have constructed for every non-exceptional finite subgroup 0 of G a collection of A(G)0 -modules parametrized by a finite set X . This set is equipped with an involutive permutation C (corresponding to taking a contragredient module) and a distinguished element v (corresponding to the vacuum module) such that Cv = v. We have also matrices S = (Sxy )x,y∈X and T = (Txy )x,y∈X satisfying the following three properties, provided that the small center Z acts on X without fixed points: (a) S is symmetric and T is diagonal, 0 −1 1 1 −1 (b) the map ⇒ S, ⇒ T, 1 0 0 1 0 sentation of the group SL2 (Z).

0 −1

⇒ C gives a unitary repre-

84


(c) Sxv > 0 for all x ∈ X . Following Verlinde [V], introduce the fusion algebra A(X ) = ⊕x∈X Cx by the formula: X Nxyz Cz , (4.47a) xy = z∈X

where the fusion coefficients Nxyz are defined by X Sax Say Saz /Sav . Nxyz =

(4.47b)

a∈X

It follows from the above properties of S that the fusion algebra A(X ) is a finitedimensional commutative associative semisimple algebra with identity element v and involutive automorphism C. All homomorphisms of the algebra A(X ) to C are labeled by elements y ∈ X and given by chy (x) = Sxy /Svy (x ∈ X ) .

(4.48)

The positive real number chv (x) is the relative (= quantum) dimension. The basic observation of [V] is that the fusion algebras arising in a RCFT have the following fundamental property: (d) Nxyz ∈ Z+ . Denote by X af the set P+k labeling all positive energy irreducible representations of the chiral algebra A(G) with vacuum element v = 0, conjugation C3 = 3c , S-matrix S af = (S330 ) and T -matrix T af = e2πim3 δ330 . It follows from [KP2] that the properties (a)-(c) hold, and it is a very difficult theorem established by the efforts of many people that (d) holds as well. Denote by N330 300 (∈ Z+ ) the fusion coefficients. ¯ σ), where g¯ is a conjugacy class of Similarly, let X gr denote the set of all pairs (g, 0 and σ is an irreducible character of 0g . Let v = (1, 1) be the vacuum element and let gr be the matrix defined C(g, ¯ σ) = (g −1 , σ c ), where σ c is defined by (4.27b). Let Sbσ,g 0 σ0 ¯ by the right-hand side of (4.25) and let (cf. (4.26)): gr = e2πi(β|β)k Tgσ,g 0 σ0 ¯

σ ∗ (b) . σ(1)

(4.49)

It follows from the remarks of the previous section that the properties (a), (b) and (c) hold. It can be demonstrated by an appropriate example of an SU(2) subgroup of level 1 (see Example 6.5), that property (d) does not hold in general. Lusztig [Lus] studied the “limiting” case of our X gr when in (4.26), (4.27b) and (4.49) one sets all β(b) equal zero and b = g. In this case (d) holds due to his interpretation of the fusion algebra as the Grothendieck ring of the category of 0-equivariant vector bundles. Whenever the center of G is trivial like in the case of E8 the fusion rules factorize: Nxx0 x00 = N330 300 Ng00 σ,g0 σ0 ,g00 σ00 . In particular, for a level 1 orbifold like A1 (E8 )0 they coincide with the group theoretic fusion rules which we proceed to compute. The following cubic sum rule tells us that the fusion coefficient Ng¯ 1 σ1 ,g¯ 2 σ2 ,g¯ 3 σ3 =

X Sg¯ σ ,hσ ¯ Sg¯ 1

¯ hσ

1

¯

2 σ2 ,hσ

Sg¯ 3 σ3 ,hσ ¯

S11,hσ ¯

vanishes unless there are triples gj ∈ g¯ j , j = 1, 2, 3 such that g1 g2 g3 = 1.

(4.50)


85

Proposition 4.1 ([Gor] Theorem 2.12). Let g¯ i , i = 1, 2, 3, be three conjugacy classes in a finite group 0. The number n123 of triples gi ∈ g¯ i such that g1 g2 g3 = 1 is given by n123 =

|g¯ 1 ||g¯ 2 ||g¯ 3 | X 1 σ(g1 )σ(g2 )σ(g3 ) . |0| σ(1) σ∈0ˆ

In deriving the fusion rules we follow [DV3 ], but compute explicitly the phase factors. Theorem 4.4. The fusion rules (4.50) can be expressed in either of the two forms: Nb¯ 1 σ1 ,b¯ 2 σ2 ,b¯ 3 σ3 =

1 X |0|

h∈0

Nb¯ 1 σ1 ,b¯ 2 σ2 ,b¯ 3 σ3 =

X O12

X

X

1 |0b1 ,b2 |

Here the multiplier µ is given by µ(h|Σβi ) = e2πi(α|

σ1 (h)σ2 (h)σ3 (h)µ(h|Σβi ),

(4.51a)

bi ∈bi ∩0h b1 b2 b3 =1

P

σ1 (h)σ2 (h)σ3 (h)µ(h|Σβi ) .

(4.51b)

h∈0b1 ,b2

β i )k

,

βi = β(bi ) ,

h = e2πiα .

(4.52)

The outer sum in (4.51b) is over different orbits O12 of pairs (b1 , b2 ) under the adjoint action of 0; the number |O12 | of such orbits is determined from the relation |O12 ||012 | = |0| . The proof uses the form Sg¯ j σj ,hσ ¯ =

1 |0h |

X

σj (h)σ(bj )e−2πi(α|βj )k

(4.53)

bj ∈b¯ j ∩0h

of (4.25) for the three factors in the numerator of (4.47) and reduces to a straightforward application of Proposition 4.4 (noting the conjugation invariance of µ). (For x3 = v (the ¯ σ3 = 1 (β3 = 0) we reproduce as a special case the charge vacuum module) b¯ 3 = 1, conjugation matrix (4.46): Cg¯ 1 σ1 ,g¯ 2 σ2 = Ng¯ 1 σ1 ,g¯ 2 σ2 ,11 ¯ .) The multiplier (4.52) does not depend on the choice of the phase α of h provided it belongs to the stabilizer gb1 ,b2 of the pair (b1 , b2 ) in g; µ thus defines a representation of 0b1 ,b2 according to Lemma 4.1 applied to G = Gb1 , λ = β2 . 5. U (l) orbifolds as RCFT extensions of W1+∞ What is now called W1+∞ first appeared as the (unique nontrivial) central extension Dˆ of the Lie algebra D of differential operators on the circle [KP1]. Its representation theory (including the classification of quasi-finite positive energy representations) was developed in [KR2] and [FKRW]. It has also attracted the attention of physicists, in particular, the most degenerate ‘minimal series’ of unitary representations of W1+∞ of [FKRW] are being applied in the study of quantum Hall fluids [CTZ]. (More reference to both physical applications and related mathematical developments are cited in the above ˆ papers and in the bibliography to [AFMO].) The vacuum D-module (corresponding for unitary representation to a positive integer central charge c = l) was shown [FKRW] to

86


(l) carry a canonical chiral (vertex) algebra structure. The resulting chiral algebra W1+∞ was described in [BGT] in terms of a series of quasi primary fields of dimension ν +1, ν = 0, 1, . . .: X Vnν z −n−ν−1 , [Lm , V ν (z)] V ν (z) = (5.1a) d m =z + (m + 1)(ν + 1) V ν (z) , m = 0 , ±1, z dz

satisfying local CR such that µ+ν−1 + ... + c [Vmµ , Vnν ] = (νm − µn)Vm+n

(ν!)4 (2ν)!

m+ν δ , δ m − ν − 1 m,−n µν

c=l.

(5.1b) (l) The (quasi finite) irreducible positive energy modules V~r of W1+∞ are characterized by l exponents (see [KR1,2]) ~r = (r1 , . . . , rl ) that take real values for unitary representations. Each V~r has a cyclic minimal energy vector |~ri such that Vnν |~ri = 0 for n = 1, 2, . . . , {V0ν − vν (~r)}|~ri = 0,

(5.2)

where v0 (~r) =

l X i=1

ri , ν−1

(ν − 1)!ν! X vν (~r) = (2ν)! j=0

ν j

ν j+1

X l

(5.3) (ri − j) . . . (ri + ν − j − 1)ri .

i=1

In particular V 1 (z) = T (z) so that the ground state energy eigenvalue is v1 (~r) = 21 ~r2 = Pl 2 i=1 ri :

l 1 1X 3 1X 4 1 2 L0 − ~r2 |~ri = 0 (L0 = V01 ) , v2 (~r) = ri , v3 (~r) = (r + r ) . (5.4) 2 3 4 i i 5 i i=1

ˆ The vacuum D-module contains for c = l ∈ N a unique singular vector of degree l + 1 such that the quotient by the submodule generated from this singular vector is irreducible [KR1]. This irreducible quotient (together with its chiral algebra structure) is isomorphic to a (level l) W (ul ) vacuum module – see [FKRW], Sect. 5. As a result, any irreducible (l) representation of W1+∞ has a canonical structure of an irreducible representation of W (ul ) of level l, and all irreducible representations of W (ul ) with central charge l arise in this way. Any V~r splits into a tensor product of a W (su(l)) module of central charge l − 1 (1) -module. To see this we rescale the u(1) current and split the stress energy and a W1+∞ tensor as in (1.39): 1 J(z) = √ V 0 (z) , T (z) = TJ (z) + Tsu (z) , l 1 1 2 0 2 TJ = : J : = : (V ) : , 2 2l

(5.5a)


87

[Tsu (z), J(w)] = 0 .

(5.5b)

The minimal eigenvalue of the energy of the second term, Lsu 0 is then given by the difference ÿ r !2 1 X 1 X 1 2 ~r − ri = (ri − rj )2 =: ωl (~r). (5.6) 2 2l 2l i<j i=1

(l) -module V~r is degenerate if some of the differences ri − rj are integers. It A W1+∞ is maximally degenerate if all ri − rj are integers (such representations are termed minimal [CTZ]); the representation of the second (su)-factor is indeed then a limit of the Zamolodchikov-Fateev-Lukyanov Wl (p)-models of central charge c = (l − 1) o n l(l+1) 1 − p(p+1) as observed in [CTZ]. Since every V~r can be viewed as a tensor product of maximally degenerate (including c = 1) modules we shall turn our attention to the case of integer ri − rj . Assume that ri − rj ∈ Z, we then arrange the ri ’s in a decreasing order and denote the set of such ~r’s as P + :

P + = {~r ∈ Rl |r1 ≥ r2 ≥ . . . ≥ rl , ri − rj ∈ Z} .

(5.7)

If we interpret the ordered set 3 = (λ1 , . . . , λl−1 ) of differences λi = ri − ri+1 ,

i = 1, 2, . . . , l − 1

(5.8)

as defining a highest weight of SU(l), then for the fundamental weights 31 = (1, 0, . . . , 0), . . . , 3l−1 = (0, . . . , 0, 1) the ground state energy eigenvalues (5.6) coincide with the level 1 eigenvalues of the su ˆ l current algebra A1 (su(l)): ωl (~r(i)) =

(3i + 2ρ|3i ) (i) for rj(i) − rj+1 = δij 2(l + 1)

(5.9)

(which can be verified by a direct computation). It is natural to expect that the W (sul ) representations of such weights obey fusion rules given by the tensor product expansion formulae for SU(l) (see Conjecture 6.1 of [FKRW]). It follows that a CFT with chiral algebra W (su(l)) and a highest weight module V~r with ri −rj non-zero integers has an infinite number of sectors and hence is not a rational CFT. (We are using here the basic property of any quantum field theory to be closed under fusion.) This ‘irrationality’ can also be seen from an analysis of the characters of these representations (computed in [FKRW]). The orbifold construction of the previous sections allows to define a large class of RCFT extensions of W1+∞ with the same stress energy tensor. (l) into the Fock space Fl of l In fact the embedding of the vacuum module of W1+∞ free complex fermion fields, used from the outset in [FKRW] and [KR2], does provide one such (chiral superalgebra) extension. So does its even (bosonic) part which coincides with the level 1 current algebra of the rank l (even) orthogonal group A1 (so(2l)). (Indeed, if we separate the real and imaginary part of the free fermions writing them as 1 ψj = √ (ϕ2j−1 − iϕ2j ) , j = 1, . . . , l , then Jjk (z) = iϕj (z)ϕk (z) (j < k) (5.10) 2 satisfy the commutation relations of level 1 so(2l) currents. The complex structure selects a Cartan subalgebra that includes V 0 :

88


j

H (z) =:

ψj∗ (z)ψj (z)

:= J2j−1,2j (z) ,

0

V (z) =

l X

H j (z) .

(5.11)

j=1 (l) as the U (l)-invariant subalgebra of A1 (so(2l)) (u(l)) and Then we can define W1+∞ (l) so(2l) sharing the same Cartan subalgebra.) A more general RCFT extension of W1+∞ is provided by the chiral algebra associated with the compact group U (l), equipped with a lattice structure Q (see Sect. 1). Here Q is an l-dimensional even integral lattice whose sublattice of vectors of length square 2 includes the (rank l − 1) su(l) lattice. (The root lattices of rank l semi-simple Lie algebras- so(2l), su(l + 1), su(l) ⊕ su(2)) – appear then as special cases. Note that the su(l) Cartan currents are orthogonal to V 0 (5.11) (or J (5.5)); they are

H αi (z) = H i (z) − H i+1 (z) ,

i = 1, . . . , l − 1 ,

(5.12)

(l) , α1 , . . . , αl−1 being the simple roots of su(l). Any of the extensions A(Q) of W1+∞ where Q is a (rank l) lattice with the above properties admits a finite set of positive energy CFT representations whose characters span a (finite dimensional) representation of SL(2, Z). All these extensions involve, in particular, l commuting u(1) currents and can be thus related to the approach of Fröhlich, Thiran et al. to the quantum Hall effect (see [FT] and references therein). A large family of intermediate observable algebras is provided by 0 orbifolds of A(Q) where 0 is any finite subgroup of U (l). If 0 is not contained in any proper Lie subgroup of SU(l), then A0 only involves a single u(1) (l) current – the one belonging to W1+∞ . Such A0 could be viewed as RCFT extensions of (l) minimal W1+∞ models (exploited in [CTZ]). We proceed to state the precise results for the Fock space Fl of l free (complex) fermions and its orbifolds.

Theorem 5.1. [FRKW] The fermion Fock space Fl viewed as a representation of the l ) splits into an infinite direct sum of tensor products pair (U (l), W1+∞ Fl = ⊕~r∈P+ F (~r) ⊗ L(~r) ,

(5.13)

where P+ = {~r = (r1 , . . . , rl ) ∈ Zl |r1 ≥ · · · ≥ rl }, F (~r) is the finite dimensional (l) irreducible U (l)-module of highest weight ~r, L(~r) is a unitary W1+∞ positive energy module with exponents ~r and specialized character Y l 1 2 (1 − q ri −rj +j−i ) . (5.14) χ~r (τ ) = trL(~r) q L0 − 24 = q 2 ~r η −l (τ ) 1≤i<j≤l

The following result is a specialization of Eq. (4.4) and Theorem 4.3 applied to the chiral algebra A(Zl )0 , where Zl is the integral lattice with the standard bilinear form, and 0 is a finite subgroup of U (l). Recall that A(Zl ) has a unique irreducible representation, hence we may skip the index 3. Theorem 5.2. Let 0 be a finite subgroup of U (l). Write each b ∈ 0 in the form b = exp 2πiβ, where iβ ∈ u(l) is fixed by Ad0b . Let {βi (~r)} denote the set of eigenvalues of β in F (~r). Given an irreducible character σ of 0b , let X 1 m~r,σ,βi (~r) q −βi (~r) , m~r,σ,β (q) = q 2 (β|β) i


89

where m~r,σ,βi (~r) is the multiplicity of σ in the βi (~r)-eigenspace of β in F (~r). Then the A(Zl )0 -characters can be written in the following form: X χβb,σ m~r,σ,β (q)χ~r (τ ) . (5.15) ¯ (τ ) = ~ r ∈P +

All these characters are modular functions and their C-span is invariant under the transformation τ 7→ − τ1 . ¯ we have β = 0 and all m~r,σβ (q) ∈ Z+ , and we find the In particular, for b¯ = 1, characters of untwisted orbifold modules, which, unlike χ~r are modular functions of τ . This special case of Theorem 5.2 provides a family of solutions to the following problem: find non negative integers n(~r) such that X n(~r)χ~r (τ ) ~ r ∈P +

is a modular function of τ . Each pair 0 ⊂ U (n) (0 finite subgroup), σ ∈ 0ˆ gives a solution to this problem with n(~r) = nσ0 (~r) being the multiplicity of σ in F (~r) viewed as a 0-module. Proof of Theorem 5.2. In view of (4.4) and (3.18) we can write X 1 1 σ ∗ (a)χ(τ, α − βτ, (β|βτ )) , χβb,σ ¯ (τ ) = |0b | 2 a ∈ 0b a = exp 2πiα [α, β] = 0 where, due to (5.13), χ(τ, z, u) = e2πiu

X

χ~r (τ )trF (~r) e2πiz .

(5.16)

(5.17)

~ r ∈P +

Hence we have: χβb,σ ¯ (τ ) =

X

1

χ~r (τ )q 2 (β|β)

~ r ∈P +

X

σ ∗ (a)trF (~r) (aq −β ) .

(5.18)

a∈0b

Since 0b fixes β, each eigenspace of β in F (~r) is 0b -invariant. The contribution of the βi (~r)-eigenspace to the inner sum of (5.18) is clearly equal m~r,σ,βi (~r) q −βi (~r) . This proves (5.15). Remark 5.1. Theorem 5.2 can be generalized to any simply laced simple Lie algebra g of rank l and 3 ∈ P+1 . Namely, formula (5.15) holds for any non-exceptional element b, where the sum is taken over λ ∈ (3 + Q) ∩ P+1 , and (see [K1], Exercise 12.17): Y 1 1 − q (λ+ρ|α) . χλ (τ ) = q 2 (λ|λ) η −l (τ ) α>0

We have:

χβ3,b,σ ¯ (τ ) =

X

mλ,σ,β (q)χλ (τ ) .

(5.19)

λ∈(3+Q)∩P+

The character χβ3,σ,b (τ ) is a modular function and their C-span is SL2 (Z)-invariant provided that 0 is a non-exceptional finite subgroup of our simple Lie group.

90


Remark 5.2. Taking 0 = {1} in Remark 5.1 we arrive at the following curious identity by comparing two expressions for 0-orbifold characters for each weight 3 and real number m: X Y (λ + ρ|α) . |{λ ∈ 3 + Q|(λ|λ) = m}| = (ρ|α) λ∈3+Q α>0

(λ|λ)=m

6. Examples 6.1. Lattice current algebras for c = 1. The simplest (c = 1) case of a lattice current algebra is worth singling out for at least two reasons: (1) the basic Θ-functions encountered here also appear in the SU(2) affine orbifold model; (2) the lattice part of a U (l) orbifold encountered in a W1+∞ theory is of this (U (1)−)type. A 1-dimensional lattice L = Zω is characterized by a single natural number m = |ω|2 ; we shall denote A(L; |ω|2 = m) by A(m). Note that m is twice the dimension of the basic 1 charged fields Y (e±ω , z), while v(z) = m− 2 ω(z) is the corresponding u(1) current (see 1 . The factor Sect. 1.1). The dual lattice is L∗ = Zω ∗ , where (ω ∗ |ω) = 1 ⇒ |ω ∗ |2 = m group L∗ /L is the cyclic group of order m; there are, correspondingly, m untwisted modules whose weights will be labeled by minimal length representatives µω ∗ ∈ L∗ /L ,

m m−1 ≤µ≤ , 2 2

µ∈Z.

(6.1)

The specialized character of the positive energy A(m)-module Vµ (of ground state |µω ∗ i) is given by (see [DFSZ, PT]) Kµ (τ, m) =

1 1 X m (n+ µ )2 L m Θµ1 (τ, 0, 0) = q2 . η(τ ) η(τ )

(6.2)

n∈Z

This set spans a representation of SL(2, Z) in the case of a bosonic algebra (m even) and requires supplementing it with Ramond sector (Z2 twisted) modules corresponding to half-odd integer µ’s in the interval (6.1) for m odd and splitting each integer µ character into two (corresponding to summing over even and odd n’s in (6.2)). For m = 2s even the modular transformation law for Kµ is given, according to (3.15), by µ2

Kµ (τ + 1, 2s) = eiπ( 2s − 12 ) Kµ (τ, 2s), s 1 X −iπ µν 1 s K (τ, 2s). Kµ − , 2s = √ e ν τ 2s 1

(6.3) (6.4)

ν=1−s

Example 6.1. A ZN -orbifold of A(m) is given by the chiral algebra A(N 2 m) (and its positive energy modules). If indeed we introduce the inner automorphism A(m) 3 A → U AU −1 , U eω U −1 = e2πi/N eω

∗

U = e2πiω0 /N ,

(U J(z)U −1 = J(z))

(t−1 √ωm , z),

ω0∗

√1 J0 , m

(6.5a) (6.5b)

(J being the u(1) current J(z) = Y = cf. Sect. 1.1), then the ±N ω vertex operators Y (e , z) generate the gauge invariant subalgebra


91

A(m)ZN = A(N 2 m) .

(6.6)

The Z2 -orbifold of A(m) with m odd has an even gauge invariant subalgebra A(4m). The representation theory of A(m), m = 2ρ + 1, ρ ∈ Z+ , can be deduced from this remark. Example 6.2. Modular properties of characters of A(m = 2ρ + 1) derived from those for A(4m). The characters Kµ (τ, m), m odd, µ = 21 Z mod m are expressed in terms of Kν (τ, 4m) as follows: Kµ (τ, m) = K2µ (τ, 4m) + K2µ+2m (τ, 4m) .

(6.7)

The periodicity relation Kν+m (τ, m) = Kν (τ, m)

(6.8)

allows to replace (if necessary) the indices in the right hand side of (6.7) by equivalent ones in the canonical interval (6.1). The resulting SL2 (Z) transformation properties of Kµ (τ, m) then read 2 1 iπ µ m − 12 (6.9a) Kµ (τ + 1, m) = e K2µ (τ, 4m) + (−1)2µ+m K2µ+2m (τ, 4m) m 1 X −2πiµν/m 1 e K2ν (τ, 4m) Kµ − , m = √ τ m ν=1−m (ν∈Z) (6.9b) 1 X − 2πiµν e m Kν (τ, m) . =√ m νmod m

Thus, for m odd, only the entire set of 4m characters Kν (τ, 4m) is closed under SL2 (Z). The original set {Kµ (τ, m), µ ∈ Z/mZ}, corresponding to the Neveu-Schwarz sector of the supersymmetric theory, is however invariant under the subgroup of the modular group generated by T 2 (τ → τ + 2) and S. It is remarkable that the diagonal partition function (in which we restore the dependence on the u(1) variable z), X χµm (τ, z)χ¯ µm (τ, z), (6.10) Z(τ, z) = µ mod m

where χµm (τ, z) =

X

1

2

q 2m (mn+µ) e2πiz

mn+µ m

(6.11)

n

is related to the Laughlin plateaus of the quantum Hall effect (corresponding to filling µ µ2 1 , charge m and fractional spin J = 2m , µ ∈ Z – see [CZ]). (The characters factor ν = m 2

π (Imz) used in [CZ] differ from (6.11) by a non-analytic factor, exp{− m Imτ } corresponding to a modified Hamiltonian and ensuring invariance under z → z + τ .)

Example 6.3. Charge conjugation orbifolds. The involutive lattice conjugation CL : eω → e−ω ,

J → −J

(6.12)

provides, for m 6= 2, an example of an outer automorphism of the chiral algebra A(m). Our construction of orbifold modules does not apply, strictly speaking, to this case. Nevertheless, it is easy to construct a modular invariant set of CL -orbifold characters. We shall write them down for the bosonic (m = 2s, s ∈ N) case.

92


The CL -orbifold chiral algebra A(2s)CL is generated by a single primary field φ = φ(z, ω) with respect to its A(S ⊗ 1)CL subalgebra, the real part of the vertex operator Y (eω , z): 1 φ(z, ω) = √ Y (eω , z) + Y (e−ω , z) . 2 Here A(S ⊗ 1) is the u(1) chiral current subalgebra corresponding to the subspace S ⊗ 1 (1.16). The operator product expansion of two φ’s involves the stress energy tensor T and the Virasoro primary field : J 4 (z) : that generates A(S ⊗ 1)CL . The chiral algebra splits into a CL -even and a CL -odd parts. The vacuum module character splits, accordingly, into two pieces: (6.13a) K0 (τ, 2s) = K0+ (τ, 2s) + K0− (τ, 2s), where K0± (τ, 2s) =

1 {K0 (τ, 2s) ± (K0 (τ, 8) − K4 (τ, 8))} . 2

(6.13b)

The difference of Z2 twisted level 1 A(1) 1 characters (that appears in parentheses) can be written in the form 2 1 X (−q)n . (6.13c) K0 (τ, 8) − K4 (τ, 8) = η(τ ) n 1 ) for 1 ≤ µ ≤ s − 1 Each pair of representations of weights ±µω ∗ of A(2s)(|ω ∗ |2 = 2s gives rise to a single representation of the gauge invariant subalgebra A(2s)CL . The characters K0± (6.13), being expressed in terms of Kµ , have known modular transformation properties; in particular, 1 n + 1 K0 (τ, 2s) + K0− (τ, 2s) + Ks (τ, 2s) K0± (− , 2s) = √ τ 2 2s s (6.14) o X 1 Kµ (τ, 2s) ± √ (K1 (τ, 8) + K3 (τ, 8)) . +2 2 µ=1

Analyzing this relation together with the unitarity requirement for the S-matrix one concludes that there are altogether s+7 inequivalent representations of A(2s)CL (see [DV3 ]) corresponding to s+3 untwisted and 4 twisted orbifold modules. The µ = s A(2s)-module splits, in particular, into two A(2s)CL -modules with the same specialized character ∞ 1 X s(n+ 1 )2 1 2 . Ks (τ, 2s) = q 2 η

(6.15)

n=0

Similarly, there are two pairs of twisted representations with characters Ki (τ, 8), i = 1, 3, 1 ) in (6.14). each Ki appearing twice (with a coefficient ± 2√ 2 For s = 1 the model reduces to a Z2 affine orbifold. For s = 2, 3, 4 and 6 it has been identified with known models in [DV 3 ]. We conjecture that these CL -orbifolds can be shown to exist for all values of s using the vertex operator construction of Sect. 1.1. 6.2. SU(2) orbifolds. The finite subgroups of SU(2) being thoroughly studied,1 the Ak (su(2)) orbifold characters and their modular properties can be worked out quite 1

For a modern treatment based on the McKay correspondence – see [Kos].


93

explicitly. Noting that the Cartan subalgebra of su(2) is 1-dimensional we can express its elements α, β, γ, λ by (rational) numbers identifying each of them with the coefficient 1 to 3∨ 1 = 2 σ3 (σj are the Pauli matrices – see (6.23)); then λ 1 (2n + − β)2 , n ∈ Z , 2 k λ (γ|α) = n + α , α, β ∈ Q . 2k

|γ − β|2 =

λ = 1 − k, . . . , 0, 1, . . . , k , (6.16)

The character (4.36), (3.18), (3.3) can be written in the form χβ3,b,σ ¯ (τ ) = where β (τ ) = Θλ,k,σ

k X

β c3 λ (τ )Θλ,k,σ (τ ) ,

(6.17)

λ=1−k 3−λ∈2Z

X

k

λ

q 4 (2n+ k −β) σ2kn+λ , 2

(6.18)

n∈Z

σ2kn+λ =

1 |0b |

X

σ ∗ (h)eiπ(2kn+λ)α .

(6.19)

h∈0b tr h=2 cos πα

For b 6= 1 and non-exceptional, 0b is abelian and h can be assumed diagonal. We have treated in Sects. 2, 3 and 6.1 the case of a ZN orbifold (as an automorphism group of A(SU(2)), ZN appears as a subgroup of SO(3); 0 in this case should be identified with its double cover Z2N ⊂ SU(2)). Each ZN automorphism group leaves a u(1) (Cartan) current invariant. The remaining non-abelian subgroups of SO(3) can be described as groups on two generators, s and t, obeying three relations: sn1 = tn2 = (st)n3 = 1 ,

1 1 1 2 >1 + + =1+ n 1 n2 n 3 |Ad 0|

(6.20)

(n1 , n2 , n3 are natural numbers and we denote the group unit by 1). The double cover 0(⊂ SU(2)) of Ad 0 is again generated by two elements s and t but the group unit in the first relation (6.20) is replaced by the non-trivial central element ε of SU(2): s, t ∈ 0 ⇒ sn1 = tn2 = (st)n3 = ε ,

ε2 = 1(|0| = 2|Ad 0|) .

(6.21)

Example 6.4. The H8 ⊂ SU(2) orbifold. The abstract group of quaternion units has 8 elements, {1, ε, qi , εqi , i = 1, 2, 3}; they obey multiplication rules qi2 = ε, q1 q2 = q3 which fit (6.21) with n1 = n2 = n3 = 2. It corresponds (according to McKay) to the affine Dynkin diagram D4(1) (see [K1] Chap. 4, Table Aff 1). The dimensions of its nontrivial representations coincide with the coefficients aj in the expansion of the highest root θ of D4 in terms of simple roots: θ = α1 + 2α2 + α3 + α4 .

(6.22)

We shall denote the (equivalence classes of) irreducible representations (IR) of IH8 by the simple roots αν of D4(1) (α0 corresponding to the trivial representation). Then α2 maps IH8 into a subgroup of SU(2):

94

Victor G. Kac, Ivan T. Todorov Table 1. 0 = IH8 : characters and centralizers

CC

IR α0 α1 α2 α3 α4

0g

α2 (qj ) =

1

ε

{q1 , εq1 }

{q2 , εq2 }

{q3 , εq3 }

1 1 2 1 1

1 1 −2 1 1

1 1 0 −1 −1 Z4

1 −1 0 1 −1 Z4

1 −1 0 −1 1 Z4

0

0

1 0 σj , j = 1, 2, 3 σ1 = 1 i

1 0

, σ3 =

1 0

0 −1

, σ2 = iσ1 σ3 .

(6.23) We reproduce in Table 1, for the reader’s convenience, the character table for 0 = IH8 also indicating the centralizer 0g of an element in each conjugacy class (CC). Using Table 1 and symmetrizing with respect to 2kn + λ we compute the sum (6.19) for the untwisted characters (i.e., for 0g = 0, β = 0): 1 [1 + (−1)λ ][1 + 3(−1)kn iλ ] , 8 1 = [1 + (−1)λ ][1 − (−1)kn iλ ] , j = 1, 3, 4 , 8 1 = [1 − (−1)λ ] . 4

(α0 )2kn+λ = (αj )2kn+λ (α2 )2kn+λ

(6.24)

Inserting these expressions in (6.17), (6.18) we recover for k = 1 the characters (6.13) of the CL -orbifold for s = 4: 2 1 X [1 + 3(−1)n ]q n = K0+ (τ, 8) , 4η(τ ) n 1 X (2n+1)2 1 q = K4 (τ, 8) , j = 1, 3, 4 , m = 0, 1 , χ0,1,αj (τ ) = 2η(τ ) n 2 m X 1 (−1) 1 (2n+1)2 4 χ1,1,α2 (τ ) = q = K2 (τ, 8) = K1 (τ, 2) , 2η(τ ) n 2 (6.25a)

k = 1 : χ0,1,α0 (τ ) =

where 1 K0+ (τ, 8) = K0 (τ, 8) − K4 (τ, 8) . 2

(6.25b)

The characters of the Z2 -twisted orbifolds are also computed from (6.17), (6.18) for β = 21 and σ(qjµ ) = iσµ (qjµ , µ ∈ Z/4Z is the general form of an element of the centralizer Z4 of qj ). Equation (6.19) then gives σ2kn+λ =

1 X 2kn+λ−σ)µ 1 + (−1)λ−σ [1 + (−1)kn iλ−σ ] , i = 4 4 µmod 4

reproducing, for k = 1 the CL -twisted characters of A(8):

(6.26)


χ0,q¯j ,0 (τ ) =

95

1 X 1 (4n− 1 )2 2 q4 = K1 (τ, 8) = χ1,q¯j ,1 (τ ) , η(τ ) n

j = 1, 2, 3 , (6.27)

χ0,q¯j ,2 (τ ) = K3 (τ, 8) = χ1,q¯j ,−1 . (We label throughout the irreducible representations of Z4 – and their characters – by the exponents σ = 0, ±1, 2.) The number of inequivalent orbifold modules of a level 1 current algebra (for a simple g) is 1 X ˆ |0g | . (6.28) N (0 ⊂ G; k = 1) = |Z| g⊂0 ¯ In the case at hand it is 21 (5 + 5 + 3 × 4) = 11 thus coinciding with the number s + 7 of CL -orbifold modules for s = 4. Equations (6.24) and (6.26) also allow to compute orbifold characters for higher levels; in particular, for k = 2, g = 1, we obtain (expressing the string functions c3 λ in 3 l 3 terms of the branching coefficients bλ = η cλ , for a rank l g – see [K1] Sect. 12.12): ( ) X 2 1 X 1 1 3 2n 3 (2n+1)2 2 χ3,1,α0 (τ ) = b0 (τ ) q − b2 (τ ) q η(τ ) 2 n n (6.29a) 1 3 3 = b0 (τ )K0 (τ, 4) − b2 (τ )K2 (τ, 4) , 3 = 0, 2 , 2 χ3,1,αj (τ ) =

1 3 b (τ )K2 (τ, 4) , 2 2

j = 1, 3, 4 ,

3 = 0, 2 ,

3 χ1,1,α2 (τ ) = b11 (τ )K1 (τ, 4) (since b3 λ = b−λ ) .

(6.29b) (6.29c)

Similarly, using (6.26), we can evaluate the twisted characters. For those permuted by the action of the center we find (see Example 6.6 below) χ0,q¯j ,0 (τ ) = b00 (τ )K1 (τ, 4) = χ2,q¯j ,2 (τ ) , χ2,q¯j ,0 (τ ) = b20 (τ )K1 (τ, 4) = χ0,q¯j ,2 (τ ) ,

j = 1, 2, 3 .

(6.30a)

The remaining twisted characters are split by the action of the center, and we only obtain their sums: 1 χ+1,q¯j ,1 (τ ) + χ− 1,q¯j ,1 (τ ) = b1 (τ )K0 (τ, 4) , (6.30b) 1 χ+1,q¯j ,−1 (τ ) + χ− 1,q¯j ,−1 (τ ) = b1 (τ )K2 (τ, 4) . Here the branching coefficients can be expressed in terms of the Virasoro characters 1 1 , 2 ): χ1 (τ, c) of the Ising model (corresponding to c = 21 , 1 = 0, 16 1 , b00 (τ ) = b22 (τ ) = χ0 τ, 2 1 (6.30c) b20 (τ ) = b02 (τ ) = χ 1 (τ, ) , 2 2 1 b11 (τ ) = b1−1 (τ ) = χ 1 τ, . 16 2 It follows from (6.29) and (6.30) that there are 2 × 4 + 1 = 9 untwisted and 3 × 6 = 18 twisted level 2 orbifold modules or altogether 27 A2 (su(2))IH8 -representations.

96


Example 6.5. Group theoretic S-matrix and fusion rules for IH8 ⊂ SU(2) and for IH8 ⊂ SU(2) ⊂ E8 . The simply connected compact group E8 is singled out (among the Lie groups with simple simply laced Lie algebras) for having a trivial center. The corresponding current algebra has a single level 1 representation, the vacuum A1 (E8 ) module; the modular S-matrix is then the identity operator (multiplication by 1). Hence, if 0 is a (non-exceptional) finite subgroup of E8 then the 0 ⊂ E8 group theoretic Smatrix coincides with the A1 (E8 )0 orbifold S-matrix. The possibility to embed the pair IH8 ⊂ SU(2) in E8 thus provides an additional justification for the study of the group theoretic S-matrix per-se. We observe that the S-matrix elements depend on both the Lie group G containing the pair IH8 ⊂ SU(2) and on the level of embedding of SU(2) in G which is defined as follows. Let the bases in su(2) and g be chosen in such a way that the Cartan generator H of su(2) is expressed as a linear combination of the Cartan generators H i with nonPl negative integer coefficients mi : H = i=1 mi H i . Then the integers mi satisfy the quadratic relation l l X 1X mi aij mj = mj =: N , 2 i,j=1

j=1

where, for a simply laced g, (aij ) is its Cartan matrix. The positive integer N is the level of embedding of su(2) in g. For a level 1 embedding the S-matrix elements involving at least one non-exceptional entry are independent of G. In the case of H8 the phase factor in (4.25) for a nonexceptional b and an arbitrary g is only non-trivial if both b and g belong to the same conjugacy class qj . We shall then set (−1)m−1 σ3 ⇒ exp{−2πik(β(εm qj )|β(εn qj ))} 4 m+n kπ = exp (−1) . 4i

β(εm q3 ) =

(6.31)

Omitting the upper index β on S (for this fixed choice) we obtain 4Sεm αµ ,q¯j σ = (−1)mσ αµ (qj ),

(6.32a)

0 o 0 kπ iσ+σ n −i kπ e 4 + (−1)σ+σ ei 4 2 k π = cos σ + σ0 − . 2 2

2Sq¯j σ,q¯j σ0 =

(6.32b)

(In computing the sum in the 2 elements b = ±qj of the conjugacy class q¯j in the expression (4.25) for S it is important to change at the same time σ according to (4.12). This yields (6.32b).) The only G dependence appears if the central element ε of 0 is present in both entries: (−1)nδµ2 +mδν2 2δµ2 +δν2 , 8Sεm αµ ,εn αν = pmnk ε

pε := e−2πi|β(ε)| , 2

(6.33a)

where β(ε) = 0 if G = SU(2), or, more generally, if it is an exceptional element of 0 ⊂ G, while (6.33b) pε = −1 if 0β(ε) = 0ε


97

(in a level 1 embedding). It turns out that the fusion rules involving a pair of qj and an ε are integer iff ε is a regular element of 0 ⊂ G (i.e., if (6.33b) takes place). Indeed we have Nq¯j σ1 ,q¯j σ2 ,1αµ =

σ1 − σ2 1 + (−1)σ1 +σ2 +δµ2 αµ (qj ) αµ (1) + cos π , 4 2 2

which is a k independent non-negative integer, but Nq¯j σ1 ,q¯j σ2 ,εαµ

1 + pkε (−1)σ1 +σ2 +δµ2 αµ (qj ) αµ (1) + cos = 4 2

k − σ1 − σ2 π , 2

which is integer only for odd k if pε = −1. Remark 6.1. Equation (6.33b) always takes place for a level 1 embedding SU(2) ⊂ E8 . In spite of the fact that ε is an involution (ε2 = 1) and every involution in E8 is exceptional (as a consequence of the description of finite order automorphisms of a simple Lie algebra presented in Appendix B) ε is not exceptional in 0 ⊂ SU(2) ⊂ E8 whenever SU(2) is generated by a pair of opposite roots of E8 – which is always the case (up to conjugation) for a level 1 embedding. In other words (E8 )β(ε) is strictly smaller than (E8 )ε but SU(2) ∩ (E8 )β(ε) = SU(2) ∩ (E8 )ε . By contrast, for the maximal embedding SU(2) ⊂ E8 given by

E=

8 X i=1

E αi ,

H = 2ρ =

8 X

bi αi∨ ,

F =

i=1

8 X

bi F α i ,

i=1

where bi are positive integers,2 ε is exceptional in 0 ⊂ E8 . However by the strange formula, the level of this embedding, N = 2(ρ|ρ) = g ∨ dim E8 /6 = 1240 (=

X

bi ) ,

is divisible by 4, hence the group theoretic fusion rules (with β(ε) = 0 = 1− pε ) coincide with those of the Grothendieck ring proven to be non-negative integers in [Lus1]. We have an exceptional subgroup 0 ⊂ SO(3) ⊂ E8 in this case. The image of any 4th ˜ ⊂ SU(2) is an involution whose centralizer in SO(3) is disconnected order element of 0 (see the discussion at the end of Appendix B). It is likely that at least in the case when orders of all elements of 0 divide N the corresponding twisted orbifold modules do exist and the resulting modular S-matrix coincides with the one for the Grothendieck ring. To compute the fusion rules for the Ak (SU(2))0 orbifold we shall use the (non-factorizable) |X | × |X | S-matrix of the full theory. ¯ σ) = (0, 1, αν ), ν = 0, 1, 3, 4, For the level 1 orbifold ordering the states as (3, b, ¯ α2 ), (0, q¯j , 0)(' (1, q¯j , 1)), (1, q¯j , −1)(' (0, q¯j , 2)), j = 1, 2, 3, we can write the (1, 1, 11 × 11 S-matrix as 2 Note that for the simple roots labeling of Table 2 below we have: b = 2m = 58, b = 6m = 114, 1 1 2 2 b3 = 24m3 = 168, b4 = 20m4 = 220, b5 = 270m5 = 270, b6 = 14m6 = 182, b7 = 4m7 = 92, b8 = 8m8 = 136, where m1 , . . . , m8 are the Coxeter exponents of E8 .

98


1 2 1 2 1 2 1 2

      1 √  2 2S =  1  1  1  1 1 1

1 2 1 2 1 2 1 2

1 2 1 2 1 2 1 2

1 2 1 2 1 2 1 2

1 1 1 1 −1 −1 1 −1 −1 −1 1 −1 −1 1 −1 −1 −1 1 −1 −1 1

1 1 1 1 −2 0 0 0 0 0 0

1 1 −1 −1 √0 √2 − 2 0 0 0 0

1 1 −1 −1 0 √ −√ 2 2 0 0 0 0

1 −1 1 −1 0 0 √0 √2 − 2 0 0

1 −1 1 −1 0 0 0 √ −√ 2 2 0 0

1 −1 −1 1 0 0 0 0 √0 √2 − 2

 1 −1   −1   1   0   . 0    0  0   0  √ −√ 2  2

The resulting fusion rules differ, in general, from the group theoretic ones even for admissible entries. We have, for instance, N0q¯j 0,3q¯j −3,11α2 = 1 for 3 = 0, 1 , while Nq¯j 0,q¯j −3,1α ¯ 2 = N0q¯j 0,1q¯j −1,01α ¯ µ =

1 + (−1)1−3 , 2

1 − αµ (qj ) for µ 6= 2 , while Nq¯j 0,q¯j −1,1αµ = 0 for µ 6= 2 . 2

Example 6.6. The A2 (su(2))H8 orbifold and its Clifford algebra extension. The study of level 2 SU(2)-orbifolds is simplified by the observation that A2 ≡ A2 (su(2)) is the even part of the Clifford algebra Cl3 of 3 anticommuting Majorana-Weyl spinor fields ψj (z), j = 1, 2, 3. Indeed, the 3 = 2 A2 -module is generated by an “isotopic triplet” 1 of primary fields of dimension 13 = 4h 3(3 + 2) (for 3 = k = 2, h = k + 2 = 4), k the Virasoro central charge being c = 3 h = 23 . The fields ψj (z) are single-valued in the vacuum (Neveu-Schwarz) sector and satisfy the canonical anticommutation relations (and hermiticity) [ψi (z), ψj (w)]+ = δij δ(z − w) ,

ψj∗ = ψj ,

i, j = 1, 2, 3 .

The Z2 -graded algebra Cl3 (with odd generators ψj (z)) provides a superconformal extension of A2 whose SU(2) invariant subalgebra is generated by the 1 = 23 partner G(z) = iψ1 (z)ψ2 (z)ψ3 (z)(= G∗ (z))

(6.34a)

of the stress energy tensor T (z) = T1 (z) + T2 (z) + T3 (z) ,

Tj (z) =

1 : [∂ψj (z), ψj (z)]: 4

(6.34b)

which can be viewed as a composite of two G-fields. The generator G(z) of the superVirasoro algebra is a primary field with respect to T but not with respect to A2 ; its commutator with a Cartan current is [J(z), G(w)] = δ 0 (z − w)ψ3 (w) for J(z) = −iψ1 (z)ψ2 (z) . It intertwines the 3 = 0 and 3 = 2 Neveu-Schwarz modules mapping the 3 = 1 Ramond sector into itself.


99

Each subgroup 0 of SU(2) acts on Cl3 by automorphisms which form the adjoint group Ad0 = 0/Z2 ⊂ SO(3) ; for 0 = H8 , Ad0 = Z2 × Z2 .

(6.35)

In the (orthonormal SO(3)) basis {ψj } the non-trivial elements Ej = α2 (qj ) of Z2 × Z2 act as diagonal matrices: ÿ E1 =

1 0 0

0 −1 0

0 0 −1

ÿ

! ,

E2 =

−1 0 0 1 0 0

0 0 −1

! ,

E3 = E1 E2 .

(6.36)

The Ad0 (= Z2 × Z2 ) invariant subalgebra Cl30 (0 = H8 ) of the Cl3 superalgebra is generated by G and by the individual stress-tensors Tj of the 3 “Ising models” (associated with each ψj ) – see (6.34b). The 3 commuting (1 = 2) field operators Tj (z) give rise to the even part A02 of this superalgebra. Its positive energy representations are tensor products of irreducible representations of the 3 (minimal) Ising models. There are, as expected, 33 = 27 such A02 orbifold modules. In particular, the characters of the fixed point modules split into a sum of two irreducible characters: χ1,q¯j ,1 (τ ) = b11 (τ )K0 (τ, 4) = b11 (τ ) [b00 (τ )]2 + [b20 (τ )]2 , χ1,q¯j ,−1 (τ ) = b11 (τ )K2 (τ, 4) = 2b11 (τ )b20 (τ )b00 (τ ) .

(6.37)

2 The asymptotic dimensions of b11 (b3 0 ) for 3 = 0, 2 indeed coincide, (the quantum 1 1 dimension of the (c = 2 , 1 = 2 ) module being 1). Here we have used the expression (6.30) of the Ising model characters in terms of the branching coefficients. The remaining 1 1 , 2 ) of three orbifold modules are identified in the tensor product (11 , 12 , 13 )(1i = 0, 16 Ising modules as follows: 1 1 1 ¯ ll(0, 1, α0 ) = (0, 0, 0), , , , (2, 1, α0 ) = 2 2 2 1 ¯ α1 ) = 0, 1 , 1 , (0, 1, , 0, 0 , (2, 1, α1 ) = 2 2 2 1 ¯ α3 ) = 1 , 0, 1 , (0, 1, , 0 , ) = 0, (2, 1, α 3 2 2 2 1 ¯ α4 ) = 1 , 1 , 0 , (0, 1, , (2, 1, α ) = 0, 0, 4 2 2 2 (6.38a) ¯ α2 ) = 1 , 1 , 1 , (1, 1, 16 16 16 1 1 1 1 1 , , , , 16 , (2, q¯1 , 0) = (0, q¯1 , 0) = 0, 16 2 16 16 1 1 1 1 1 (0, q¯2 , 0) = 16 , , , , 0, 16 , (2, q¯2 , 0) = 16 2 16 1 1 1 1 1 (0, q¯3 , 0) = 16 , , , 16 , 0 , ; (2, q¯3 , 0) = 16 16 2

the reducible (fixed point) modules with characters (6.37) split according to the law

100


1 1 1 1 , 0, 0 + , , , (1, q¯1 , 1) = 16 16 2 2 1 1 1 1 (1, q¯1 , −1) = , ,0 + , 0, , etc. 16 2 16 2

(6.38b)

8 The AH 2 S-matrix is the tensor product of 3 Ising model S-matrices of the form √ ÿ ! 2 1 √ 1 √1 2 0 − 2 . SIsing = (6.39) √ 2 1 1 − 2

We note that while S1q¯1 σ,1q¯2 σ0 = 0 according to (4.25) (since the conjugacy classes q¯1 and q¯2 do not contain commuting elements) the corresponding split S-matrix elements do not vanish:   1 −1 −1 1 1 1 −1   −1 1 S 1 12 13 ,10 1 10 =  , 16 1 16 3 −1 1 1 −1  4 (6.40) 1 −1 −1 1 1 1 1 1 , , ,0 0, . (1i , 1j ) = (0, 0), 2 2 2 2 Note that the sum of A02 -modules in each line of Eq. (6.38) is irreducible with respect to the conformal superalgebra Cl30 . The characters of the subset of Neveu-Schwarz modules spanned by the direct sum of 3 = 0 and 3 = 2 representations give rise to a 7-dimensional representation of the subgroup 00 (2) of SL2 (Z) generated by T 2 and S. In particular, the Neveu-Schwarz S-matrix is 1 1 1 1 2 2 2  SN S

1  1 1 1 =  4 2  2 2

1 1 1 2 −2 −2

1 1 1 −2 2 −2

1 1 1 −2 −2 2

2 −2 −2 0 0 0

−2 2 −2 0 0 0

−2  −2   2   . 0   0 0

(6.41)

The importance of this example stems from the fact that it has a bearing on other SU(2) orbifold models. The three conjugacy classes of imaginary quaternion units {±qj , j = 1, 2, 3} of H8 combine in a single 6-element conjugacy class in the binary tetrahedral group A˜ 4 which in turn is a part of a 12-element conjugacy class of the binary octahedral group S˜ 4 and of a 30 element class of the binary icosahedral group A˜ 5 . Here Sn is the permutation group of n letters, An is its alternating invariant subgroup, G˜ ⊂ SU(2) denotes, in general, the double cover of a subgroup G of SO(3). In all three cases the centralizer 0qj of an element qj of this conjugacy class is Z4 . Hence, the reducible character χ1q¯j σ is the same for all three orbifold modules and splits in the same way – according to (6.37) – for all three binary polyhedral groups. There are no other conjugacy ¯ Furthermore, for all finite classes b¯ in either A˜ 4 or A˜ 5 such that both b and εb belong to b. SU(2) subgroups 0 the Neveu-Schwarz module of Cl30 contain no fixed points and give rise to a 00 (2)-invariant subset of characters. Furthermore, a similar argument extends to a level n representation of SU(n) which also involves fixed points of the action of the center. Indeed, there is a conformal embedding


101

1 An ≡ An (su(n)) ⊂ A1 (spin(n − 1)) c = (n2 − 1) 2 2

allowing to extend an A0n orbifold to a Cln02 −1 -orbifold. The k = 1 tetrahedral (A˜ 4 ⊂ SU(2)) orbifold and its fusion rules are displayed in [DV3 ]. The octahedral (S˜ 4 ⊂ SU(2)) and the icosahedral (A˜ 5 ⊂ SU(2)) orbifolds can be studied with equal ease. We shall reproduce in Table 2 for a later reference the character table for the 120 element binary icosahedral group A˜ 5 (associated with E8(1) under the McKay correspondence). Table 2. Characters of A5 = A˜ 5 /Z2 and of its double cover 0 = A˜ 5 .

CC

IR

α0 α2 α4 α6 α8 (A 5 )g CC

IR α1 α3 α5 α7

1

{p, p4 }

{p2 , p3 }

{t, t2 }

E = α2 (q)

1 3 5 4 3 A5

1 x+ 0 −1 x− Z5

1 x− 0 −1 x+ Z5

1 0 −1 1 0 Z3

1 −1 1 0 −1 Z 2 × Z2

ε

1

−2 −4 −6 −2 0 = A˜ 5

2 4 6 2

0g

p

p4

x+ 1 −1 x−

−x+ −1 1 −x−

√ 1± 5 , 2

Z10

p2 −x− −1 1 −x+ Z10

A5 = α2 (A˜ 5 ), x± = 0 = α1 (A˜ 5 ) θE8 = 2(α1 + α7 ) + 3(α2 + α8 ) + 4(α3 + α6 ) + 5α4 + 6α5 .

p3

t

t2

q

x− 1 −1 x+

1 −1 0 1

−1 1 0 −1

0 0 0 0 Z4

'

Z6 A˜ 5 , p5

=

t3

=

q2

=

ε,

Equation (6.28) implies: N (A˜ 5 ⊂ SU(2); k = 1) = 21 (9×2+10×4+6×2+4) = 37. It ˜ is a straightforward exercise to write down, using Table 2, the characters of A1 (su(2))A5 . 6.3. A Level 1 SU(3) Orbifold. Charge Conjugation Associated with a Non-Abelian Centralizer. We shall consider the subgroup 0 of SU(3) of order |0| = 1080 which is a non-trivial central extension of the simple alternating group A6 : 1 → Z3 → 0 → A6 → 1. It is generated by the (60 element) icosahedral group A5 ⊂ SO(3) and by one more element of order 2. In a basis in which a selected Z2 × Z2 subgroup of A5 (see Table 2) is generated by any two of the matrices Ei (= α2 (qi ), i = 1, 2, 3) given by (6.36) while the generators of its Z3 and Z5 subgroups are chosen as ÿ ÿ ! ! 1 −x+ 1 −x− 1 x− 1 −x+ 1 x+ −x− , p = −1 x+ −x− , (6.42) t= 2 x 2 x −1 x −x 1 −

+

+

−

where t = p = (tp) = 1, tp = E2 , and the additional involutive generator E4 of 0 is given by (ω 2 + ω + 1 = 0): ÿ ! 0 ω 0 (6.43) E4 = − ω¯ 0 0 , E42 = 1 = (E3 E4 )2 . 0 0 1 3

5

2

102


It is the 360 element factor group A6 = 0/Z3 that acts by non-trivial automorphisms on the su(3) current algebra. There are 17 conjugacy classes of 0 versus 7 of A6 . Both are listed in the combined character table below (see Table 3). We observe that to each of the first 5 conjugacy classes in A6 correspond 3 such classes (of the same size) in 0 while the last two are mapped into classes of triple size: |t¯0 | = 3|t¯A6 | = 3 × 40(= |t¯00 |). The essential difference between A6 = 0/Z3 and ¯ with a non-abelian the subgroups 0/Z2 of SO(3) is the presence of elements E(∈ E) centralizer 08 . Table 4 is its character table (E5 = E3 E4 , q = E4 E2 , q 3 = E2 E4 ). a normal subgroup of 08 . We note that the centralizer Z4 of q in A6 isP |0ˆ g | = 17 + 3.15 + 12 + 3 + 3 = 80 There are (according to (6.28)) altogether 13 g⊂0 ¯ level 1 0 ⊂ SU(3) orbifold modules. Although it is not practical to write down the 80 × 80 S-matrix, one can Pextract the relevant information about E-twisted orbifolds. The multipliers µ(h| βi ) give rise to a new notion of conjugation whenever the class E¯ of involutions labels a sector. To display this fact we first observe that the set of (45)2 pairs (E, E 0 ) splits into 9 different orbits displayed in Table 5. Table 3. Aˆ 6 ⊂0ˆ : Zero versus non-zero triality representations Table 3a. Aˆ 6

cc

IR

1

E(E 2 = 1)

q(q 2 ∈ E)

p(p5 = 1)

p2

t(t3 = 1 = t03 )

t0

1

1

1

1

1

1

1

1

5

5

1

−1

0

0

2

−1

50

50

1

−1

0

0

−1

2

8

8

0

0

x+

x−

−1

−1

80 9 10

80 9 10

0 1 −2

0 1 0

x− −1 0

x+ −1 0

−1 0 1

−1 0 1

(A6 )g

A6

08

Z4

Z5

Z5

Z23

Z23

The stabilizer 0E,E 0 E of the pair E, E 0 E in 0 is the direct product of the central subgroup Z3 with the above 0(0) E,E 0 E ⊂ A6 . To verify the data of Table 5 one needs to construct a representative pair in each orbit. The number of elements of such an orbit is |A6 | = 360 divided by |0(0) E,E 0 E |. For instance, the orbit Op¯ is obtained by conjugation of the pair (Ep , E1 ), where ÿ 1 Ep = p−1 E3 p = 2 E1 Ep = E2 p−1 E2 ∈ p¯

! x+ −x− , −1 √ ! 1± 5 . x± = 2

−x− 1 x+ ÿ

1 −x+ −x−

We shall now prove that the oppositely ordered pairs (E2 , E4 ) and (E4 , E2 ) belong to different orbits Oq¯ although they belong to the same SU(3) orbit. To this end we construct the most general u ∈ SU(3) such that uE2 u∗ = E4 ,

uE4 u∗ = E2 ;

(6.44a)

Affine Orbifolds and Rational CFT of W1+∞ Table 3b. 0ˆ

IR

103

cc 1

ω

E ωE ω 2 E

q

3ω 3ω 2 −1 −ω −ω 2

1

ω

3ω −1 −ω 2 −ω

1

ω2

3ω

3

3∗ω

3 3ω 2

30ω 30∗ ω

3

3ω

3

3ω 2

ω2

3ω 2

3ω −1

6ω

6

6ω

6ω 2

6∗ω

6 6ω 2

6ω

9ω

9ω 2

9

9ω

9

9ω 2

9ω

15ω 15 15ω

15ω 2

9∗ω

−1 −ω

−ω 2

ωq ω 2 q

1

ω

1

ω2

p

ω 2 x+

ω2

x−

−ω

2ω

2ω 2

0

0

0

1

2 2ω 2

2ω

0

0

0

1

1

ω

ω2

1

ω

ω2

1

ω2

ω

1

ω2

−1 −ω

−ω 2

−1 −ω

t

t0

ωx+ ω 2 x+ x− ωx− ω 2 x−

0

0

0

0

+

0

0

+

ωx+

0

0

ω x+ ω 2 x+

−ω 2

2

ωp2 ω 2 p2

ωp

ω x−

ω 2 p p2

ωx+ x− ω 2 x− ωx−

ωx− ω 2 x− ω2 x

−

x+

ωx− x+

ωx+ ω2 x

ω2 x

ω

ω2

1

ω

ω2

0

0

ω2

ω

1

ω2

ω

0

0

−1

−ω

−ω 2

−ω

−ω 2

0

0

ω −1

−ω 2

−ω

−ω 2

0

0

−1

−ω −1

−ω 2

0

0

0

0

0

0

0

0

15∗ω 15 15ω 2 15ω −1 −ω 2 −ω −1 −ω 2 −ω

0

0

0

0

0

0

0

0

Z23

Z23

0g

0

Z3 × 08

Z12

Z15

Z15

|g|

1

45

90

72

72

120 120

it is given by a 2(real) parameter family, ÿ u=

u1 −ζ u¯ 2 0

u2 ζ u¯ 1 0

0 0 ζ¯

! with |ζ|2 = 2|u1 |2 = 2|u2 |2 = 1 ,

¯ 1 u2 = −ω . (6.44b) 2ζu

It remains to prove that this family of 3 × 3 matrices does not intersect our group 0. √ Comparing the relation | tr(u + uE1 )| = |2u1 | = 2 (implied by (6.44b) with the first row of Table 3B, we deduce that if u ∈ 0 then (tr uE1 )3 = 1, (tr u)6 = 1 + 4(tr u)3 . The resulting set of equations for the parameters u1 , u2 and ζ in Eq. (6.44b) has no solutions with |ζ| = 1. It turns out that the same 2-parameter family of u’s is the most general subset of SU(3) elements that transforms the two OE¯ orbits among themselves: uE3 u∗ = E3 ⇒ uE1 u∗ = uE2 E3 u∗ = E4 E3 = E5 .

(6.45)

This completes the proof that each of the two pairs of representatives in the last column of Table 5 belongs to a different 0-orbit. We finally note that the sum of all |Og¯ |(4.360 + 2.180 + 2.90 + 45) adds up, as it should, to (45)2 = 2025. Proposition 6.1. The charge conjugation matrix (4.27) for the A1 (su(3))0 orbifold involves a non-trivial involution σ → σ c for b ∈ E, σ ∈ 0ˆ E : C30 E3 σ,30 E3 σ0 = δσ0 σc ,

σc = σ ∗ ⊗ σ E ,

σ E = e2πi(2β3 |β(h)) ,

where 0E = Z3 × 08 , σ ∗ = σ (i.e., σ(ωh) = σ(h) for h ∈ 0E ),

(6.46)

104


ÿ 1 1 0 βi = β(Ei ) : β3 = 2 0 ÿ ! 1 −2 0 0 0 1 0 , β1 = 2 0 0 1 ÿ −1 ±3ω 1 ±3ω¯ −1 β 4 == 5 4 0 0

0 1 0

0 0 2

0 0 −2

! ⇒ ÿ

1 β2 = 2 !

1 0 0

0 −2 0

0 0 1

! ,

(6.47)

.

Proof. The statement is a straightforward consequence of (4.27) (Theorem 4.3e) and of the observation that β3 = β(E3 ) = β(E3−1 ). The representation σ is trivial on Z3 (and hence, selfconjugate; see Table 4), since it has to agree with the representation 30 = 0 of SU(3) on the small center. Table 4. Characters of 08 =0E3 ⊂A6

IR 10 11 12 13 2

cc

1 1 1 1 1 2

E3 1 1 1 1 −2

0E3 ,g 08 08

Table 5. Orbits O(i)0

E E

CC of E 0 E

E1 , E 2 1 1 −1 −1 0 Z 2 × Z2

0(0) E,E 0 E

⊂ A6

08

E 0 E = EE 0 ∈ E¯

Z 2 × Z2

E0E

p¯n

∈ , n = 1, 2 E 0 E ∈ t¯ or t¯0

q, q 3 1 −1 −1 1 0 Z4

of pairs (E, E 0 ) ⊂ E¯ and their stabilizers (i = 1, 2)

E 0 E = 1(E = E 0 )

E 0 E ∈ q¯

E4 , E 5 1 −1 1 −1 0 Z 2 × Z2

Z2 {1} {1}

O(i) E0 E

Representative pairs

45

(i) O ¯ = 90, E

i = 1, 2

(1) OE ¯ = O(E1 , E2 ) , (2) OE ¯ = O(E5 , E4 )

(i) Oq¯ = 180 ,

Oq(1) ¯ = O(E2 , E4 ) ,

i = 1, 2

Oq(2) ¯ = O(E4 , E2 )

360 360

Remark 6.2. The appearance of a non-trivial conjugation depends on the choice of a representative in a class of equivalent quadruples. Had we chosen instead of the involution ¯ 3 ∈ ω 2 E¯ for which element E3 ∈ E¯ a representative of a minimal phase like ωE ÿ ! 1 1 −1 0 0 1 ˜ 0 −1 0 so that |β˜3 |2 = β3 := β(ωE = |β3 |2 , ¯ 3) = (6.48) 6 6 9 0 0 2 then we would have dealt with complex representations since


105

˜

β3 χβ33 ,E,σ (τ ) with σ2 (h) = σ(h)e2πi(32 |α) , ¯ ¯ (τ ) = χ3 ,ω 2 E,σ 0

2

2

(6.49a)

where 32 is the fundamental weight of the “antiquark” representation 3∗ , ÿ ! 2 −1 0 0 0 −1 0 = β˜3 −β3 , h = e2πiα , [α, 32 ] = 0(= [α, β3 ]) . (6.49b) 32 = 3 0 0 2 The charge conjugation matrix in these new labels would assume its usual form with non-zero entry C32 bσ2 ,3∗2 b−1 σ2∗ = 1 ,

b ∈ ω 2 E¯ ,

b−1 ∈ ω E¯ .

(6.50)

Appendix A. Action of the Center of a Simply Connected Simple Lie Group on the Coroots and Fundamental Weights We shall display the action of wj for the classical Lie algebras as well as for E6 and E7 (the simply connected groups with Lie algebras G2 , F4 and E8 have a trivial center). We let J˜ = J ∪ {0}, a0 = a∨ 0 = 1. A.1. Simply laced algebras (αi∨ = αi , a∨ i = ai ). The center Zl+1 of SU(l + 1) acts on both the (co) roots and weights of A(1) via cyclic permutations: l wj = w1j ,

w1 (α0 , α1 , . . . , αl ) = (α1 , α2 , . . . , αl , α0 ) , ˜ 0, 3 ˜ 1, . . . , 3 ˜ l ) = (3 ˜ 1, 3 ˜ 2, . . . , 3 ˜ l, 3 ˜ 0) , w˜ 1 (3

w1l+1 = 1 .

(A.1)

˜ ν are the extended fundamental weights Here 3 ˜ ν = d + 3ν + κν K 3

(A.2)

chosen to have equal norm squares: ˜ ν |2 = 2κν + |3

ν(l + 1 − ν) = 2κ0 . l+1

(A.3)

The set J˜ consists of all indices 0, 1, . . . , l. The element w1 is a Coxeter element of the finite Weyl group W (Al ) = Sl+1 . In terms of the elementary Weyl reflections si it is written as: (A.4) w1 = s1 . . . sl ⇒ w1 3j = 3j − α1 − . . . − αj . The center of the simply connected group Spin (2l) with Lie algebra Dl is Z2 ×Z2 for l even and Z4 for l odd. To exhibit its action on roots and weights of Dl(1) it is convenient to use an orthonormal basis {ei } in the l dimensional root space of Dl setting αi = ei − ei+1 , 3i =

i X s=1

i = 1, . . . , l − 1 ,

αl = el−1 + el ,

α 0 = K − e1 − e 2 ,

(A.5)

1X ei . 2

(A.6)

l

es

for i ≤ l − 2,

3l−1 = 3l − el ,

3l =

i=1

The set J˜ of indices µ for which aµ = 1 consists of 4 elements: 0, 1, l − 1, l. Writing again

106


˜ ν = aν d + 3ν + κν K, 3 ˜ coincide: ˜ µ (µ ∈ J) we restrict κν demanding that the norm squares of 3 ˜ 0 |2 = 2κ0 = |3 ˜ j |2 = 1 + 2κ1 = |3

(A.7a)

l l + 2κl−1 = + 2κl . 4 4

(A.7b)

We shall first determine the finite part wl of w˜ l defined by wl α0 = wl (−θ) = αl , and ˜ wl α1 = αl−1 . As a consequence of invariance hence (being a permutation of αµ , µ ∈ J), of inner products we further deduce wl αi = αl−i , i = 1, . . . , l − 2; hence, in view of (A.5), (A.8a) wl ei = −el+1−i , i = 1, . . . , l − 1 ; wl el is then determined from the condition that an element of W (Dl ) should involve an even number of reflections: (A.8b) wl el = −(−1)l e1 . As a result, we have wl2 = w1 for l odd, wl2 = 1 for l even; in both cases w12 = 1; w1 (e1 , e2 , . . . , el−1 , el ) = (−e1 , e2 , . . . , el−1 , −el ).

(A.9)

The corresponding permutations of fundamental weights are ˜ for l even 3 ˜ ˜ ˜ ˜ ˜ ν = 0, 1 ; w˜ l 30 = 3l , w˜ l−1 31 = 3l−1 , w˜ l 3l−ν = ˜ ν 31−ν for l odd ˜ 1 , w˜ 1 3 ˜ l−1 , w˜ 12 = 1 , w˜ l−1 = w˜ 1 w˜ l . ˜0 =3 ˜l =3 w˜ 1 3 (A.10) The center of the group E6 is Z3 . Choosing a basis of simple roots of E6 in such a way that the highest root is θ = α2 +α4 +2(α1 +α3 +α5 )+3α6 , we have J˜ = {0, 2, 4}. The center acts on an arbitrary weight 3 according to the law w˜ j 3 = k3j + wj 3, j = 2, 4, where w2 (−θ, α1 , α2 , α3 , α4 , α5 , α6 ) = (α2 , α3 , α4 , α5 , −θ, α1 , α6 ) ,

w22 = w4 ,

(A.11a)

1 ˜2 =3 ˜ 4 , w23 = 1 . (A.11b) (α5 − α3 + 2α4 − 2α2 ) ⇒ w˜ 2 3 3 Here we have used the expressions for the fundamental weights in terms of simple roots: w2 32 = 34 − 32 =

1 32 = α1 + (4α2 + 5α3 + 2α4 + 4α5 + 6α6 ) , 3 1 34 = α1 + (2α2 + 4α3 + 4α4 + 5α5 + 6α6 ) , 3

|32 |2 = |34 |2 =

4 3

˜ ν = d + 3ν + κν K with as well as the relations 3 ˜ 2 |2 = | 3 ˜ 4 |2 ⇒ 2κ0 = ˜ 0 |2 = |3 |3

4 4 2 + 2κ2 = + 2κ4 or κ2 = κ4 = κ0 − . 3 3 3

The center of E7 is Z2 . Choosing a basis of simple roots of E7 such that the highest root is θ = α6 + 2(α1 + α5 + α7 ) + 3(α2 + α4 ) + 4α3 , we have J˜ = {0, 6}. The non-trivial element of the center is w˜ 6 = t6 w6 where w6 (−θ, α1 , α2 , α3 , α4 , α5 , α6 , α7 ) = (α6 , α5 , α4 , α3 , α2 , α1 , −θ, α7 ),

(A.12a)


107

˜6 =3 ˜ 0, w6 36 = −36 ⇒ w˜ 6 3 1 36 = α1 + 2α2 + 3α3 + 2α5 + (5α4 + 3α6 + 3α7 ) . 2 ˜ ν = d + 3ν + κν K, where |3 ˜ 6 |2 = 2κ6 + 3 = 2κ0 . Here again 3

(A.12b)

2

A.2. Z2 action on Bl and Cl . The simple roots, the highest root and the fundamental weights of Bl can be written in an orthonormal basis {ei } as αi = ei − ei+1 , i = 1, . . . , l − 1 , αl = el , θ = α1 + 2(α2 + . . . + αl ) = e1 + e2 , 3i =

i X

(A.13) es ,

i = 1, . . . , l .

s=1

The center Z2 of the simply connected group Spin (2l + 1) acts on (α0 = K − θ, αi ) and on (30 , 3i ) as w˜ 1 = t1 w1 , where w1 (e1 , e2 , . . . , el ) = (−e1 , e2 , . . . , el ), t1 α∨ = α∨ − (α∨ |31 )K for α∨ ∈ M ,

(A.14a)

˜ ν + (3 ˜ ν |K)3 ˜ 1 for 3 ˜ ν ∈ M∗ ; ˜ν =3 t1 3 (A.14b)

thus t1 w1 α0 = t1 (K + α1 ) = α1 , w1 αi = αi = t1 w1 αi for i = 2, . . . , l , t1 w1 α1 = t1 (−θ) = −θ + K = α0 ; ˜0+3 ˜ 0 = t1 3 ˜0 =3 ˜1 , t1 w1 3 ˜0+3 ˜ 1 ) = t1 (3 ˜0−3 ˜ 1 ) = 30 − 31 + 31 = 30 . t1 w 1 ( 3 The simple roots, the highest root and the fundamental weights for Cl are expressed as

1 αi = √ (ei − ei+1 ) , i = 1, . . . l − 1 , 2 l−1 X √ √ αi + αl = 2e1 , αl = 2el , θ = 2

(A.15)

i=1 i √ X es , 3i = 2

1 X 3l = √ es . 2 s=1 l

i = 1, . . . , l − 1 ,

s=1

The non-trivial element w˜ l = tl wl of the center Z2 of Sp(2l) acts on these orthonormal basis ei as wl (e1 , e2 , . . . , el−1 , el ) = (−el , −el−1 , . . . , −e2 , −e1 ) ; (A.16) hence wl (−θ, α1 , . . . , αl ) = (αl , αl−1 , . . . , α1 , −θ) , ˜l =3 ˜0 wl 3l = −3l ⇒ w˜ l 3 ˜ l |2 = 2κl + l . ˜ 0 |2 = 2κ0 = |3 |3 2

(A.17a)

˜0 =3 ˜ l (= d + 3l + κl K) , w˜ l 3

(A.17b)

108


Appendix B. Exceptional Elements of a Compact Lie Group Let G be a connected compact Lie group with a simple Lie algebra g of rank l, and let AdG denote the adjoint group. An element g ∈ G is called ad-exceptional if it cannot be written in the form g = exp 2πiβ, where β ∈ ig is such that Adg x = x iff [β, x] = 0 for all x ∈ g. Note that an element g ∈ G is Ad-exceptional iff it is ad-exceptional or its centralizer in G is not connected. (Recall that in a simply connected G the centralizer of any element is connected.) In this Appendix we classify ad-exceptional elements of finite order of the group AdG . The finite order inner automorphisms of the simple Lie algebra g belong to AdG and can be described as follows (see Theorem 8.6 and Proposition 8.6b of [K1]). Proposition B.1. Each order N inner automorphism of g is conjugate to Adb(s) ,

b(s) = exp 2πiβ(s) ,

where β(s) =

l 1 X sj 3∨ j N

(B.1)

(B.2)

j=1

and s0 , sj , j = 1, . . . , l are relatively prime non-negative integers such that: s0 +

l X

aj sj = N .

(B.3)

j=1

Here 3∨ j are the fundamental coweights: ∨ (αi |3∨ j ) = (αi |3j ) = δij ,

i, j = 1, . . . , l .

(B.4)

Proposition B.2. The centralizer of Adb(s) in g is generated by the E ±αν , ν = 0, 1, . . . , l, for which sν = 0 and by the Cartan subalgebra. According to Definition 4.1 an element b ∈ 0 is exceptional if there is no β ∈ g such that (B.5) b = e2πiβ and 0b = 0β . As noted, G = U (l) has no exceptional elements. By contrast, for each partition of the positive integer n ≥ 2 of the type n = k1 + . . . + kρ ,

kmin = min(k1 , . . . , kρ ) = 2,

(B.6)

there are exceptional elements of SU(n) conjugate to diagonal matrices with kj eigenν conditions: (i) values exp(2πi Nj ), j = 1, . . . , ρ, where the νj are subject to the P (ν1 , . . . , νρ , N ) = 1 (i.e. these ρ+1 integers have no common factor) and j kj νj = kN with 1 ≤ k < kmin . For n = 2, 3 all such elements belong to the center Zn of SU(n). More generally, for any n, one can find an element ζ ∈ Zn such that g = bζ is nonexceptional. (In the above example it suffices to choose ζ = exp(−2πi nk ).) This proves the statement (of Sect. 4.2) that SU(n) contains no exceptional subgroups. Recall that an element b ∈ G is Ad-exceptional if bζ is exceptional for any choice of ζ ∈ Z(G). The following theorem describes all finite order ad-exceptional elements of AdG (for a simple g), and hence all finite order Ad-exceptional elements of a simply connected G.


109

Proposition B.3. The finite order automorphism Adb(s) is ad-exceptional iff the marks aν with sν > 0 have a non-trivial common factor. Proof. It follows from Proposition B.2 that it suffices to study the commutator of β(s) with E αν for those ν(= 0, . . . , l) for which sν = 0. This commutator is trivial for j = 1, . . . , l such that sj = 0 since Eqs. (B. 1-4) imply [β(s), E αj ] = (αj |β(s))E αj ,

(αj |β(s)) = sj = 0 .

(B.7)

Thus Adb can only be ad-exceptional if s0 = 0; in this case s 0 − 1 E α0 = −E α0 . [β(s), E α0 ] = [β(s), E −θ ] = N

(B.8)

This is still not sufficient to assert that Adb is ad-exceptional since β(s) is not unique: Pl we can add to it i=1 mi 3∨ i for mi ∈ Z without changing the automorphism. That would give   [β(s) +

X i

X   −θ mi 3∨ ] = −1 − ai mi  E −θ , i ,E i si 6=0

which can be made zero iff the ai in the sum have no common factor.

Proposition B.3 shows that SU(l) has no Ad-exceptional elements, while all other simple simply connected compact groups do. Examples of Ad-exceptional b are provided by the special elements with β(s) = a1j 3∨ j for aj > 1, corresponding to sν = δνj . Such is, for instance, the diagonal symplectic matrix   −1 0 0 0  0 1 0 0  b1 = e2πi31 =  ∈ Sp(4) = {g ∈ SU(4)|t gCg = C} , 0 0 1 0  0 0 0 −1     1 0 0 0 1 (B.9) 1 1 0 0 1 0  0  =  C= , 31 = 3∨   1 0 0 −1 0 0 2 2 −1 −1 0 0 0 1 (= α1 + α2 ) . 2 (31 is only stabilized by U (2) while the centralizer of b1 in Sp(4) is SU(2) × SU(2)). If 01 ⊂ SU2 is the binary icosahedral group, then 0 =< b1 , −1 > ×01 ×01 ⊂ Sp(4) is clearly an exceptional subgroup containing the center of Sp(4). The simplest example of a non-special Ad-exceptional element is provided by the simply laced Lie algebra D5 (corresponding to the simply connected group Spin (10)). If we label the nodes of the affine diagram D5(1) so that a2 = a3 = 2 (while a0 = a1 = a4 = a5 = 1) then the non-special Ad-exceptional element of Spin (10) correspond to ∨ β = 41 (3∨ 2 + 33 ). An example of an element of SO(3) = AdSU(2) with a disconnected centralizer is provided by either of the diagonal matrices Ei , i = 1, 2, 3 of Eq. (6.36). Indeed, there is no Cartan subalgebra of SO(3) containing the infinitesimal generators of both E1 and E2 . Note that the preimages of Ei in the simply connected double cover SU(2) of SO(3)

110


do not commute (in fact, they anticommute). This example extends to the n3 element Heisenberg subgroup Hn of SU(n) generated by the n × n matrices a and b satisfying an = bn = 1 ,

ab = e2πi/n ba .

(B.10)

Clearly, Ada and Adb commute but their infinitesimal generators do not. This happens since AdSU(n) (unlike SU(n)) is not simply connected and the centralizer of either Ada or Adb is disconnected. Acknowledgement. I.T. acknowledges the support of a Fulbright grant 19684 and the hospitality of the Department of Mathematics at M.I.T. during the course of this work. Both authors acknowledge the hospitality of the Erwin Schrödinger International Institute for Mathematical Physics where this paper was completed. The authors thank Bojko Bakalov who took part in the computations of the S matrix and the associated fusion rules presented in Sect. 6.2.

References [AFMO] Awata, J.H., Fukuma, M., Matsuo, Y., Odake,S.: Representation theory of the W1+∞ algebra. Prog. Theor. Phys., Proc. Suppl.118, 343–373 (1995) [BGT] B.N. Bakalov, L.S. Georgiev, I.T. Todorov: A QFT approach to W1+∞ . New Trends in Quantum Field Theory In: Proc. of the 1995 Razlog (Bulgaria) Workshop, A. Ganchev et al. (eds.), Sofia: Heron Press, 1996 pp. 147–158 [BPZ] Belavin, A.A., Polyakov, A.M., Zamolodchikov, A.B.: Infinite conformal symmetry in twodimensional quantum field theory. Nucl. Phys. B241, 333–381 (1984) [Bor] Borcherds, R.: Vertex algebras, Kac-Moody algebras and the Monster. Proc. Natl. Acad. Sci. USA 83, 3068–3071 (1986); Monstrous moonshine and monstrous Lie Superalgebras. Invent. Math. 109, 405–444 (1992) [BMP] Bouwknegt, P., McCarthy, J., Pilch, K.: Semi-infinite cohomology and w-gravity. J. Geom. Phys. 11, 225–249 (1993) [BMT] Buchholz, D., Mack, G., Todorov, I.T.: The current algebra on the circle as a germ of local field theory. Nucl. Phys. B (Proc. Suppl.) 5B, 20–56 (1988) [CTZ] Cappelli, A., Trugenberger, C.A., Zemba, G.R.: Stable hierarchical quantum Hall fluids on W1+∞ minimal models. Nucl. Phys. B448, [FS] 470–504 (1995); W1+∞ dynamics of edge excitations in the quantum Hall effect, Ann. Phys. (NY) 246, 86–120 (1996) [CZ] Cappelli, A., Zemba, G.: Modular invariant partition functions in the quantum Hall effect. hepth/9605127 [DFSZ] Di Francesco, P., Saleur, H., Zuber, J.-B.: Modular invariance in non-minimal two-dimensional conformal theories. Nucl. Phys. B285, [FS19] 454–480 (1987) Dijkgraaf, R., Vafa, C., Verlinde, E., Verlinde, H.: The operator algebra of orbifold models. Commun. [DV3 ] Math. Phys. 123, 485–526 (1989) [DGM] Dolan, L., Goddard, P., Montague, P.: Conformal field theory of twisted vertex operators. Nucl. Phys. B338, 529–601 (1990) [FFK] Fairbairn, W.M., Fulton, T., Klink, W.H.: Finite and disconnected subgroups of SU3 and their application to the elementary particle spectrum. J. Math. Phys. 5, 1038–1051 (1964) [F] Flohr, M.: W -algebras, new rational models and completeness of the c = 1 classification. Commun. Math. Phys. 157, 179–212 (1993) [FK] Frenkel, I.B., Kac, V.G.: Basic representations of affine Lie algebras and dual resonance models. Invent. Math. 62, 23–66 (1980) [FKRW] Frenkel, E., Kac, V., Radul, A., Wang, W.: W1+∞ and W (glN ) with central charge N . Commun. Math. Phys. 170, 337–357 (1995) [FKW] Frenkel, E., Kac, V., Wakimoto, M.: Characters and fusion rules for W -algebra via quantized Drinfield-Sokolov reduction. Commun. Math. Phys. 147, 295–328 (1992) [FLM] Frenkel, I.B., Lepowsky, J., Meurman, A.: Vertex Operator Algebras and the Monster. New York: Academic Press, 1988

Affine Orbifolds and Rational CFT of W1+∞ [FZ] [FT] [FST] [Gep] [G] [Go] [Gor] [H] [K1] [K2] [KP0] [KP1] [KP2] [KR1] [KR2] [KW] [Kos]

[LZ] [LR] [Lus1] [Lus2]

[MST]

[PT] [R] [RST] [V]

111

Frenkel, I.B., Zhu, Y.: Vertex operator algebras associated to representations of affine and Virasoro algebra. Duke Math. J. 66, 123–168 (1992) Fröhlich, J., Thiran, E.: Integral quadratic forms, Kac-Moody algebras, and fractional quantum Hall effect. An ADE-O classification. J. Stat. Phys. 76, 209–283 (1994) Furlan, P., Sotkov, G.M., Todorov, I.T.: Two-dimensional conformal quantum field theory. Riv. Nuovo Gim 12:6, 1–202 (1989) Gepner, D.: New conformal field theories associated with Lie algebras and their partition functions. Nucl. Phys. B285, [FS20] 10–24 (1987) Ginsparg, P.: Curiosities at c = 1. Nucl. Phys. B295, [FS21] 153–170 (1988) Goddard, P.: Meromorphic conformal fields theory. In: Infinite-dimensional Lie algebras and groups. Adv. Ser. Math. Phys. 7 ed. V. Kac, Singapore: World Sci., 1989 pp. 556–587 Gorenstein, D.: it Finite Groups. New York: Harper & Row, 1968 Harris, G: SU(2) current algebra orbifolds of the Gaussian model. Nucl. Phys. B300, (FS22) 588– 610 (1988) Kac, V.: Infinite Dimensional Lie Algebras. Third edition, Cambridge: Cambridge Univ. Press, 1990 Kac, V.: Vertex Algebras. In: New Trends in Quantum Field Theory, Proceedings of the 1995 Razlog (Bulgaria)-Workshop, A. Ganchev et al. editors, Sofia: Heron Press, 1996 pp. 261–358 Kac, V.G., Peterson, D.H.: Affine Lie algebras and Hecke modular forms. Bull. Amer. Math. Soc. 3, 1057–1061 (1980) Kac, V.G., Peterson, D.H.: Spin and wedge representations of infinite dimensional Lie algebras and groups. Proc. Nat. Acad. Sci. USA 78, 3308–3312 (1981) Kac, V.G., Peterson, D.H.: Infinite dimensional Lie algebras, theta-functions and modular forms. Adv. in Math. 53, 125–264 (1984) Kac, V.G., Radul, A.: Quasi-finite highest weight modules over the Lie algebra of differential operators on the circle. Commun. Math. Phys. 157, 429–457 (1993) Kac, V.G., Radul, A.: it Representation theory of the vertex algebra W1+∞ . Transformation groups 1, 41–70 (1996) Kac, V.G., Wakimoto, M.: Modular and conformal invariance constraints in representation theory of affine algebras. Advances in Math. 70, 156–236 (1988) Kostant, B.: The McKay correspondence, the Coxeter element and representation theory. In: The Mathematical Heritage of Elie Cartan. Soc. Math. de France, Asterisque, hors series, 1985 pp. 209–255 Lian, B.H., Zuckerman, G.: Commutative quantum operator algebras. J. Pure Appl. Alg. 100, 117– 139 (1995) Longo, R., Rehren, K.-H.: Nets of subfactors. Rev. Math. Phys. 7, 567–597 (1995) Lusztig, G.: Unipotent representations of finite Chevalley groups of type E8 . Quart. J. Math. Oxford (2) 30, 315–338 (1979) Lusztig, G.: Leading coefficients of character values of Hecke alebras. In: Arcata Conference of Representations of Finite Groups In: Proceedings of Symposium in Pure Math (P. Fong, ed.)47, (1987) Michel, L., Stanev, Ya.S., Todorov, I.T.: D − E-classification of the local extensions of the su2 current algebras. Teor Mat. Fiz. 92, 507–521 (1992); (American edition: Theor. Math. Phys. 92, 1063 (1993)) Paunov, R.R., Todorov, I.T.: Modular invariant QFT models of u(1) conformal current algebra. Phys. Lett. B196, 519–526 (1987) Rehren, K.-H.: A new view of the Virasoro algebra. Lett. Math. Phys. 30, 125–130 (1994) Rehren, K.-H., Stanev, Ya.S., Todorov, I.T.: Characterizing invariants for local extensions of current algebras. hep-th/9409165 Commun. Math. Phys. 174, 605–633 (1996) Verlinde, E.: Fusion rules and modular transformations in 2D conformal field theory. Nucl. Phys. B300, [FS22] 360–376 (1988)

Communicated by G. Felder

Commun. Math. Phys. 190, 113 – 132 (1997)

Communications in


Huygens’ Principle in Minkowski Spaces and Soliton Solutions of the Korteweg–de Vries Equation Yuri Yu. Berest1 , Igor M. Loutsenko2 1 Department of Mathematics, University of California, Berkeley, CA 94720, USA. E-mail: beresty@math. berkeley.edu 2 Universit´ e de Montréal, Centre de Recherches Mathématiques, C.P. 6128, succ. Centre-Ville, Montreal (Quebec), H3C 3J7, Canada. E-mail: [email protected], [email protected]

Received: 3 April 1996 / Accepted: 1 April 1997

Abstract: A new class of linear second order hyperbolic partial differential operators satisfying Huygens’ principle in Minkowski spaces is presented. The construction reveals a direct connection between Huygens’ principle and the theory of solitary wave solutions of the Korteweg–de Vries equation.

I. Introduction The present paper deals with the problem of describing all linear second order partial differential operators for which Huygens’ principle is valid in the sense of “Hadamard’s minor premise”. Originally posed by J.Hadamard in his Yale lectures on hyperbolic equations [26], this problem is still far from being completely solved1 . The simplest examples of Huygens’ operators are the ordinary wave operators 2 2 2 ∂ ∂ ∂ − − . . . − (1) n+1 = ∂x0 ∂x1 ∂xn in an odd number n ≥ 3 of space dimensions and those ones reduced to (1) by means of elementary transformations, i.e. by local nondegenerate changes of coordinates x 7→ f (x) ; gauge and conformal transformations of a given operator L 7→ θ(x) ◦ L ◦ θ(x)−1 , L 7→ µ(x)L with some locally smooth nonzero functions θ(x) and µ(x). These operators are usually called trivial Huygens’ operators, and the famous “Hadamard’s conjecture” claims that all Huygens’ operators are trivial. Such a strong assertion turns out to be valid only for (real) Huygens’ operators with a constant principal symbol in n = 3 [33]. Stellmacher [40] found the first 1 Hadamard’s problem, or the problem of diffusion of waves, has received a good deal of attention and the literature is extensive (see, e.g., [8, 12, 15, 21, 22, 24, 27, 28, 35], and references therein). For a historical account we refer the reader to the articles [19, 25].

114

Y. Y. Berest, I. M. Loutsenko

non-trivial examples of hyperbolic wave-type operators satisfying Huygens’ principle, and thereby disproved Hadamard’s conjecture in higher dimensional Minkowski spaces. Later Lagnese & Stellmacher [31] extended these examples and even solved [32] Hadamard’s problem for a restricted class of hyperbolic operators, namely L = n+1 + u(x0 ) ,

0

(2)

where u x is an analytic function (in its domain of definition) depending on a single variable only. It turns out that the potentials u(z) entering into (2) are rational functions which can be expressed explicitly in terms of some polynomials2 Pk (z): 2 d log Pk (z) , k = 0, 1, 2, . . . , (3) u(z) = 2 dz the latter being defined via the following differential-recurrence relation: 0 0 Pk−1 − Pk−1 Pk+1 = (2k + 1)Pk2 , Pk+1

P 0 = 1 , P1 = z .

(4)

Since the works of Moser et al. [2, 3] the potentials (3) are known as rational solutions of the Korteweg–de Vries equation decreasing at infinity3 . A wide class of Huygens’ operators in Minkowski spaces has been discovered recently by Veselov and one of the authors [9, 10] (see also the review article [8]). These operators can also be presented in a self-adjoint form L = n+1 + u(x)

(5)

with a locally analytic potential u (x) depending on several variables. More precisely, u (x) belongs to the class of so-called Calogero-Moser potentials associated with finite reflection groups (Coxeter groups): u(x) =

X mα (mα + 1)(α, α) . (α, x)2

(6)

α∈ , (24) hRλ (x, ξ), g(x)i = Hn+1 (λ) 2 J+ (ξ)

where dx = dx0 ∧ dx1 ∧ . . . ∧ dxn is a volume form in Mn+1 , g(x) ∈ D(Mn+1 ), and Hn+1 (λ) is a constant given by n−1 (25) Hn+1 (λ) = 2π 2 4λ−1 0(λ)0 λ − (n − 1)/2 . The following properties of this family of distributions are deduced directly from their definition. For all λ ∈ C and ξ ∈ Mn+1 we have supp Rλ (x, ξ) ⊆ J+ (ξ)

(26)

n+1 Rλ = Rλ−1 ,

(27)

Rλ ∗ Rµ = Rλ+µ ,

µ∈C,

(x − ξ, ∂x )Rλ = (2λ − n + 1)Rλ , γ Rλ = 4 (λ)ν λ − (n − 1)/2 ν Rλ+ν , ν ∈ Z≥0 , ν

ν

(28) (29) (30)

where (κ)ν := 0(κ + ν)/0(κ) is Pochhammer’s symbol, and γ = γ(x, ξ) is a square of the geodesic distance between x and ξ in Mn+1 . In addition, when n is odd, one can prove that Rλ (x, ξ) =

1 2π

n−1 2

( n−1 −λ)

δ+ 2 (γ) 4λ−1 (λ − 1)!

for

λ = 1, 2, . . . , (n − 1)/2 ,

(31)

where δ+(m) (γ) stands for the mth derivative of Dirac’s delta-measure concentrated on the surface of the future-directed characteristic half-cone C+ (ξ). Another important property of Riesz distributions is that R0 (x, ξ) = δ(x − ξ) .

(32)

Formulas (26), (27), (32) show that Rλ (x, ξ) is a Riesz kernel for the ordinary wave operator n+1 . The property (31) means precisely that in even-dimensional Minkowski

Huygens’ Principle in Minkowski Spaces and Soliton Solutions KdV Equation

119

spaces Mn+1 (n is odd) Huygens’ principle holds for sufficiently low powers of the wave operator d , d ≤ (n − 1)/2. Now we are able to construct the Hadamard–Riesz expansion for the Riesz kernel of a general self-adjoint wave-type operator (18) on Mn+1 . First, we have to find a sequence of two-point smooth functions Uν := Uν (x, ξ) ∈ C ∞ ( × ) , ν = 0, 1, 2 . . ., as a solution of the following transport equations: 1 (x − ξ, ∂x ) Uν (x, ξ) + νUν (x, ξ) = − L Uν−1 (x, ξ) , 4

ν ≥1.

(33)

It is well-known (essentially due to [26]) that the differential-recurrence system (33) has a unique solution provided each Uν is required to be bounded in the vicinity of the vertex of the characteristic cone and U0 (x, ξ) is fixed for a normalization, i.e. U0 (x, ξ) ≡ 1 ,

Uν (ξ, ξ) ∼ O(1) ,

∀ ν = 1, 2, 3, . . . .

These functions Uν are called Hadamard’s coefficients of the operator L. In terms of Uν the required asymptotic expansion can be presented as follows: 8 λ (x, ξ) ∼

∞ X

4ν (λ)ν Uν (x, ξ) Rλ+ν (x, ξ) .

(34)

ν=0

One can prove that for a hyperbolic differential operator L with locally analytic coefficients the Hadamard–Riesz expansion is locally uniformly convergent. From now on we will restrict our consideration to this case. For λ = 1 formula (34) provides an expansion of the fundamental solution of the operator L in a neighborhood of the vertex x = ξ of the characteristic cone: ∞ X

8+ (x, ξ) =

4ν ν! Uν (x, ξ) Rν+1 (x, ξ) .

(35)

ν=0

When n is even, we have supp Rν+1 (x, ξ) = J+ (ξ) for all ν = 0, 1, 2, . . ., and therefore Huygens’ principle never occurs in odd-dimensional Minkowski spaces M2l+1 . On the other hand, in the case of an odd number of space dimensions n ≥ 3, we know due to (31) that for ν = 0, 1, 2, . . . , (n − 3)/2, supp Rν+1 (x, ξ) = C+ (ξ) . Hence, using (30), we can rewrite the series (35) in following form: 8+ (x, ξ) =

1 V (x, ξ) δ+(p−1) (γ) + W (x, ξ) η+ (γ) , p 2π

(36)

where p := (n − 1)/2 , η+ (γ) is a regular distribution characteristic for the region J+ (ξ): Z hη+ (γ), g(x)i = g(x) dx , g(x) ∈ D(Mn+1 ) , J+ (ξ)

and V (x, ξ) , W (x, ξ) are analytic functions in a neighborhood of the vertex x = ξ which admit the following expansions therein: V (x, ξ) =

p−1 X ν=0

1 Uν (x, ξ) γ ν , (1 − p) . . . (ν − p)

(37)

120


W (x, ξ) =

∞ X ν=p

1 Uν (x, ξ) γ ν−p , (ν − p)!

p=

n−1 . 2

(38)

The function W (x, ξ) is usually called a logarithmic term of the fundamental solution7 . It follows directly from the representation formula (36) that operator L satisfies Huygens’ principle in a neighborhood of the point ξ, if and only if, the logarithmic term W (x, ξ) of its fundamental solution vanishes in this neighborhood identically in x: W (x, ξ) ≡ 0 . The function W (x, ξ) is known to be a regular solution of the characteristic Goursat problem for the operator L : L [W (x, ξ)] = 0 (39) with a boundary value given on the cone surface C+ (ξ) . Such a boundary problem has a unique solution, and hence, the necessary and sufficient condition for L to be Huygens’ operator becomes W (x, ξ) , 0 , (40) where the symbol , implies that the equation in hand is satisfied only on C+ (ξ) . By definition (38), the latter condition is equivalent to the following one Up (x, ξ) , 0 ,

p=

n−1 . 2

(41)

In this way, we arrive at the important criterion for the validity of Huygens’ principle in terms of coefficients of the Hadamard–Riesz expansion (34). Equation (41) is essentially due to Hadamard [26]. It will play a central role in the proof of our main theorem.

III. Proof of the Main Theorem We start with some remarks concerning the properties of the one-dimensional Schrödinger operator 2 ∂ + vk (ϕ) (42) L(k) := − ∂ϕ with a general periodic soliton potential vk (ϕ) := −2

∂ ∂ϕ

2 log W [91 , 92 , . . . , 9N ] .

(43)

Here, as already discussed in the Introduction, W [91 , 92 , . . . , 9N ] stands for a Wronskian of the set of periodic functions on R1 : 9i (ϕ) := cos(ki ϕ + ϕi ) ,

ϕi ∈ R ,

(44)

associated to an arbitrary strictly monotonic sequence of real positive numbers ("soliton amplitudes"): 0 ≤ k1 < . . . < kN −1 < kN . 7 Such a terminology goes back to Hadamard’s book [26], where the function W (x, ξ) is introduced as a coefficient under the logarithmic singularity of an elementary solution (see for details [15], pp. 740–743).


121

It is well-known (see, e.g., [34]) that any such operator L(k) (as well as its proper solitonic counterpart (16)) can be constructed by a successive application of DarbouxCrum factorization transformations ([17, 16]) to the Schrödinger operator with the identically zero potential: 2 ∂ . (45) L0 := − ∂ϕ To be precise, let L be a second order ordinary differential operator with a sufficiently smooth potential: 2 ∂ + v(ϕ) . (46) L := − ∂ϕ We ask for formal factorizations of the operator L − λ I = A∗ ◦ A ,

(47)

where I is an identity operator, λ is a (real) constant, and A , A∗ are the first order operators adjoint to each other in a formal sense. According to Frobenius’ theorem (see, e.g., [29]), the most general factorization (47) is obtained if we take χ(ϕ) as a generic element in Ker(L − λ I) \ {0} and set ∂ ∂ ◦ χ−1 , A∗ := −χ−1 ◦ ◦χ. (48) A := χ ◦ ∂ϕ ∂ϕ Indeed, A∗ ◦ A is obviously a self-adjoint second order operator with the principal part −∂ 2 /∂ϕ2 . Hence, it is of the form (46). Moreover, since A[χ] = 0 , we have χ ∈ Ker A∗ ◦ A , so that (47) becomes evident. Note that for every λ ∈ R we actually get a one-parameter family of factorizations of L − λ I . This follows from the fact that dim Ker(L − λ I) = 2, whereas χ(ϕ) and C χ(ϕ) give rise to the same factorization pair (A, A∗ ) . By definition, the Darboux-Crum transformation maps an operator L = λ I +A∗ ◦A into the operator (49) L˜ := λ I + A ◦ A∗ , ∗ in which A and A are interchanged. The operator L˜ is also a (formally) self-adjoint second-order differential operator 2 ∂ ˜ + v(ϕ) ˜ , (50) L := − ∂ϕ where v(ϕ) ˜ is given explicitly by v(ϕ) ˜ = v(ϕ) − 2

∂ ∂ϕ

2 log χ(ϕ) .

(51)

The initial operator L and its Darboux-Crum transform L˜ are obviously related to each other via the following intertwining indentities: L˜ ◦ A = A ◦ L

,

L ◦ A∗ = A∗ ◦ L˜ .

(52)

The Darboux-Crum transformation has a lot of important applications in the spectral theory of Sturm-Liouville operators and related problems of quantum mechanics [30]. In particular, it is used to insert or remove one eigenvalue without changing the rest of the

122


spectrum of a Schrödinger operator (for details see the monograph [34] and references therein). The explicit construction of the family of operators (42) with periodic soliton potentials (43) is based on the following Crum’s lemma: Lemma ([16]). Let L be a given second order Sturm-Liouville operator (46) with a sufficiently smooth potential, and let {91 , 92 , . . . , 9N } be its eigenfunctions corresponding to arbitrarily fixed pairwise different eigenvalues {λ1 , λ2 , . . . , λN } , i.e. 9i ∈ Ker(L − λi I) , i = 1, 2, . . . , N . Then, for arbitrary 9 ∈ Ker(L − λ I) , λ ∈ R , the function W [91 , 92 , . . . , 9N , 9] χN (ϕ) := (53) W [91 , 92 , . . . , 9N ] satisfies the differential equation # " 2 ∂ + vN (ϕ) χN (ϕ) = λ χN (ϕ) − ∂ϕ

(54)

with the potential vN (ϕ) := v(ϕ) − 2

∂ ∂ϕ

2 log W [91 , 92 , . . . , 9N ] .

(55)

Given a sequence of real positive numbers (ki )N i=1 : 0 ≤ k1 < k2 < . . . < kN , the Darboux-Crum factorization scheme: 2 2 I → Li+1 := Ai ◦ A∗i + ki+1 I, Li := Ai−1 ◦ A∗i−1 + ki2 I = A∗i ◦ Ai + ki+1

(56)

starting from the Schrödinger operator (45) with a zero potential 2 ∂ L0 ≡ − = A∗0 ◦ A0 + k12 I , ∂ϕ produces the required operator L(k) ≡ LN with the general periodic potential (43). Now we proceed to the proof of our main theorem formulated in the Introduction. When N = 0 , the statement of the theorem is evident, since the operator L0 is just the ordinary wave operator in an odd number n of spatial variables. Using the Darboux-Crum scheme as outlined above we will carry out the proof by induction in N . Suppose that the statement of the theorem is valid for all m = 0, 1, 2, . . . , N . Consider an arbitrary integer monotonic partition (ki ) of height N : 0 < k1 < k2 < . . . < kN , ki ∈ Z. By our assumption, the wave-type operator LN := L(k) = n+1 + uk (x) ,

(57)

associated to this partition, satisfies Huygens’ principle in the (n + 1)-dimensional Minkowski space Mn+1 with n odd, and n ≥ 2 kN + 3 . We fix the minimal admissible number of space variables, i.e. n = 2 kN + 3 , and denote p :=

n−1 = kN + 1 . 2

(58)


123

By construction, the operator LN can be written explicitly in terms of suitably chosen cylindrical coordinates in Mn+1 : ÿ " !# 2 2 1 1 ∂ ∂ ∂ − − + + vN (ϕ) , (59) LN = n−1 − ∂r r ∂r r2 ∂ϕ where (r, ϕ) are the polar coordinates in some Euclidean 2-plane E orthogonal to the time direction in Mn+1 , i.e. E ∈ Gr⊥ (n + 1, 2) ; n−1 is a wave operator in the orthogonal complement E ⊥ ∼ = Mn−1 of E in Mn+1 ; and vN (ϕ) is a 2π-periodic potential given by (43). Let k := kN +1 be an arbitrary positive integer such that k > kN .

(60)

We apply the Darboux-Crum transformation (56) with the spectral parameter k to the angular part of the Laplacian in E. For this we rewrite LN in the form " # 2 1 1 ∂ ∂ ∗ 2 − + AN ◦ AN + k , (61) LN = n−1 − ∂r r ∂r r2 "

and set LN +1 := n−1 −

∂ ∂r

2

# 1 1 ∂ ∗ 2 − + A N ◦ AN + k , r ∂r r2

(62)

where AN := AN (ϕ) and A∗N := A∗N (ϕ) are the first order ordinary differential operators of the form (48). According to (52), we have LN +1 ◦ AN = AN ◦ LN

,

LN ◦ A∗N = A∗N ◦ LN +1 .

(63)

N +1 Let 8N λ (x, ξ) and 8λ (x, ξ) be the Riesz kernels of hyperbolic operators LN and LN +1 respectively. Then, by virtue of (63) we must have the relation +1 A∗N (ϕ) 8N − AN (φ) 8N (64) λ λ = 0 for all λ ∈ C ,

where AN (φ) is the differential operator AN written in terms of the variable φ conjugated to ϕ . Indeed, if identity (64) were not valid, one could define a holomorphic Ñ ˜ N : C → D0 , λ 7→ 8 mapping 8 λ (x, ξ) , such that N +1 N ∗ Ñ 8 − AN (φ) 8N . (65) λ (x, ξ) := 8λ (x, ξ) + a AN (ϕ) 8λ λ N

˜ λ (x, ξ) , depending on an arbitrary complex parameter a ∈ C, would The distribution 8 also satisfy all the axioms (21) in the definition of a Riesz kernel for the operator LN . In this way, we would arrive at the contradiction with the uniqueness of such a kernel. In particular, when λ = 1, the identity (64) gives the relation between the fundamental N N +1 +1 (x, ξ) ≡ 8N (x, ξ) of operators LN and solutions 8N + (x, ξ) ≡ 81 (x, ξ) and 8+ 1 LN +1 . In accordance with (36), we have 8N + (x, ξ) = and

1 VN (x, ξ) δ+(p−1) (γ) + WN (x, ξ) η+ (γ) p 2π

(66)

124


1 (67) VN +1 (x, ξ) δ+(p−1) (γ) + WN +1 (x, ξ) η+ (γ) , p 2π where γ is a square of the geodesic distance between the points x and ξ in Mn+1 . Substituting (66), (67) into (64), we get the relation between the logarithmic terms WN (x, ξ) and WN +1 (x, ξ) of operators LN and LN +1 +1 8N (x, ξ) = +

A∗N (ϕ) [WN +1 (x, ξ)] − AN (φ) [WN (x, ξ)] = 0 .

(68)

By our assumption, LN is a Huygens’ operator in Mn+1 , so that WN (x, ξ) ≡ 0. Hence, Eq. (68) implies A∗N (ϕ) [WN +1 (x, ξ)] = 0 . On the other hand, as discussed in Sect. II, the logarithmic term WN +1 (x, ξ) is a regular solution of the characteristic Goursat problem for LN +1 , i.e. in particular, LN +1 [WN +1 (x, ξ)] = 0 .

(69)

Taking into account definition (62) of the operator LN +1 , we arrive at the following equation for WN +1 (x, ξ) : ! ÿ 2 k2 1 ∂ ∂ n−1 WN +1 (x, ξ) = − 2 WN +1 (x, ξ) . + (70) ∂r r ∂r r According to (38), the logarithmic term WN +1 admits the following expansion: WN +1 (x, ξ) =

∞ X

Uν (x, ξ)

ν=p

γ ν−p , (ν − p)!

p=

n−1 , 2

(71)

where Uν (x, ξ) are the Hadamard coefficients of the operator LN +1 . Since the potential of the wave-type operator LN +1 depends only on the variables r, ϕ , its Hadamard coefficients Uν must depend on the same variables r, ϕ and their conjugates ρ, φ only: for all ν = 0, 1, 2, . . . . (72) Uν = Uν (r, ϕ, ρ, φ) This follows immediately from the uniqueness of the solution of Hadamard’s transport equations (33). On the other hand, since γ = s2 − r2 − ρ2 + 2r ρ cos(ϕ − φ) ,

(73)

where s is a geodesic distance in the space E ⊥ ∼ = Mn−1 orthogonally complementary to the 2-plane E, we conclude that WN +1 is actually a function of five variables: WN +1 = WN +1 (s, r, ρ, ϕ, φ) . On the space of such functions the wave operator n−1 in E ⊥ acts in the same way as its “radial part”, i.e. ! ÿ 2 n−2 ∂ ∂ WN +1 . + n−1 WN +1 = ∂s s ∂s Hence, Eq. (70) becomes ÿ ! 2 2 1 ∂ k2 n−2 ∂ ∂ ∂ + − 2 WN +1 = 0 . − − ∂r ∂s s ∂s r ∂r r Now we substitute the expansion (71)

(74)


WN +1 =

∞ X

Uν (r, ϕ, ρ, φ)

ν=p

γ ν−p , (ν − p)!

p=

125

n−1 , 2

(75)

into the left-hand side of the latter equation and develop the result into the similar power series in γ , taking into account formula (73). After simple calculations we obtain ∞ X 1 k2 0 − (76) Uν00 + Uν0 − 2 Uν − 4 (r − ρ cos(ϕ − φ)) Uν+1 r r ν=p ν−p ρ γ 2 2 =0, − 2 2 (ν + 1) − cos(ϕ − φ) Uν+1 − 4 ρ sin (ϕ − φ) Uν+2 r (ν − p)!

where the prime means differentiation with respect to r . Since the functions Uν do not depend explicitly on γ , Eq. (76) can be satisfied only if each coefficient under the powers of γ vanishes separately. In this way we arrive at the following differential-recurrence relation for the Hadamard coefficients of the operator LN +1 : 1 k2 4 ρ2 sin2 (ϕ − φ) Uν+2 = Uν00 + Uν0 − 2 Uν + r r 2ρ 0 0 + cos(ϕ − φ) 2 r Uν+1 + Uν+1 − 4 r Uν+1 + (ν + 1) Uν+1 , (77) r where ν runs from p : ν = p, p + 1, p + 2, . . .. To get a further simplification of Eq. (77) we notice that all the Hadamard coefficients of the operators under consideration (11), (10) are homogeneous functions of appropriate degrees. More precisely, they have the following specific form Uν (r, ϕ, ρ, φ) =

1 σν (ϕ, φ) , (r ρ)ν

ν = 0, 1, 2, . . . ,

(78)

where σν (ϕ, φ) = σν (φ, ϕ) are symmetric 2π-periodic functions depending on the angular variables only. In order to prove Ansatz (78) we have to go back to the relation (64) between the Riesz kernels of operators LN and LN +1 : N +1 λ∈C, (79) A∗N (ϕ) 8N λ (x, ξ) − AN (φ) 8λ (x, ξ) = 0 , If we substitute the Hadamard–Riesz expansions (34) of the kernels 8N λ (x, ξ) and +1 8N (x, ξ) into (79) directly and take into account that A and its adjoint A∗N are the N λ first order ordinary differential operators of the following form (cf. (48)): AN (ϕ) =

∂ − fN (ϕ) , ∂ϕ

A∗N (ϕ) = −

∂ − fN (ϕ) , ∂ϕ

(80)

where fN (ϕ) = (∂/∂ϕ) log χN (ϕ) , we obtain ∞ X N +1 N 4ν (λ)ν 2rρ sin(ϕ − φ) Uν+1 − Uν+1 − −

ν=0

∂ + fN (ϕ) ∂ϕ

UνN +1

−

∂ − fN (φ) ∂φ

UνN

Rλ+ν = 0 ,

(81)

126


where UνN (r, ϕ, ρ, φ) and UνN +1 (r, ϕ, ρ, φ) are the Hadamard coefficients of operators LN and LN +1 respectively; Rλ := Rλ (x, ξ) is the family of Riesz distributions in Mn+1 . The same argument as above (see the remark before formula (77)) shows that all the coefficients of the series (81) under the Riesz distributions of different weights must vanish separately. So we arrive at the recurrence relation between the sequences of Hadamard’s coefficients of operators LN and LN +1 : ∂ 1 ∂ N +1 N N +1 N Uν+1 = Uν+1 + + fN (ϕ) Uν + − fN (φ) Uν , 2rρ sin(ϕ − φ) ∂ϕ ∂φ (82) where U0N +1 = U0N ≡ 1 and ν = 0, 1, 2, . . . Now it is easy to conclude from (82) by induction in N that the Ansatz (78) really holds for Hadamard’s coefficients of all wave-type operators (11) with potentials (10). Returning to Eq. (77) and substituting (78) therein, we obtain the following threeterm recurrence relation for the angular functions σν (ϕ, φ) : 4 sin2 (ϕ − φ) σν+2 = (ν 2 − k 2 ) σν − 2(2ν + 1) cos(ϕ − φ) σν+1 ,

(83)

where ν = p, p + 1, p + 2, . . . . In order to analyze Eq. (83) it is convenient to introduce a formal generating function for the quantities { σν } : F (t) :=

∞ X

σν (ϕ, φ)

ν=p

tν−p . (ν − p)!

(84)

The recurrence relation (83) turns out to be equivalent to the classical hypergeometric differential equation for the function F (t), 4 (1 − ω 2 ) + 4ωt − t2

d2 F dF + (k 2 − p2 ) F = 0 , + (2p + 1) (2ω − t) dt2 dt

(85)

where ω := cos(ϕ − φ) . The general solution to (85) is given in terms of Gauss’ hypergeometric series: F (t) = C 2 F1 (p − k; p + k; p + 1/2 | z) + C1 z −p+1/2 2 F1 (1/2 − k; 1/2 + k; 3/2 − p | z) , (86) where z := (t − 2ω + 2)/4 and 2 F1 is defined by 2

F1 (a; b; c | z) :=

∞ X (a)µ (b)µ z µ . (c)µ µ!

(87)

µ=0

As discussed in Sect.II, the Hadamard coefficients Uν (x, ξ) must be regular in a neighborhood of the vertex of the characteristic cone x = ξ . When x → ξ , we have ω → 1 and Up (ξ, ξ) ∝ σp (φ, φ) = F (0)|ω=1 is not bounded unless C1 = 0 . In this way, setting C1 = 0 in (86), we obtain ∞ X ν=p

σν (ϕ, φ)

tν−p = C 2 F1 p − k; p + k; p + 1/2 | (t − 2ω + 2)/4 . (ν − p)!

(88)

Now it remains to recall that by our assumption (60) k ∈ Z and k > kN . Since p = (n − 1)/2 = kN + 1 , we have k ≥ p . So the hypergeometric series in the right-hand


127

side of Eq. (88) is truncated. In fact, the generating function (84) is expressed in terms (p−1/2,p+1/2) of the classical Jacobi polynomial Pk−p (ω − t/2) of degree k − p . Hence, th σk+1 (ϕ, φ) ≡ 0 , and the (k + 1) Hadamard coefficient of the operator LN +1 vanishes identically: Uk+1 (x, ξ) ≡ 0 .

(89)

According to Hadamard’s criterion (41), it means that the operator LN +1 satisfies Huygens’ principle in Minkowski space Mn+1 , if n is odd and n ≥ 2k +3 , Thus, the proof of the theorem is completed.

IV. Concluding Remarks and Examples In the present paper we have constructed a new hierarchy of Huygens’ operators in higher dimensional Minkowski spaces Mn+1 , n > 3. However, the problem of complete description of the whole class of such operators for arbitrary n still remains open. As mentioned in the Introduction, the famous Hadamard’s conjecture claiming that any Huygens’ operator L can be reduced to the ordinary d’Alembertian n+1 with the help of trivial transformations is valid only in M3+1 . Recently, in the work [4] one of the authors put forward the relevant modification of Hadamard’s conjecture for Minkowski spaces of arbitrary dimensions. Here we recall and discuss briefly this statement. Let be an open set in Minkowski space Mn+1 ∼ = Rn+1 , and let F () be a ring of partial differential operators defined over the function space C ∞ () . For a fixed pair of operators L0 , L ∈ F () we introduce the map adL,L0 : F() → F () ,

A 7→ adL,L0 [A] ,

(90)

such that adL,L0 [A] := L ◦ A − A ◦ L0 .

(91)

Then, given M ∈ Z>0 , the iterated adL,L0 -map is determined by adM L,L0 [A]

:= adL,L0 adL,L0 . . . adL,L0 [A] . . . =

M X

k

(−1)

k=0

M LM −k ◦ A ◦ Lk0 . k (92)

Definition. The operator L ∈ F() is called M-gauge related to the operator L0 ∈ F () , if there exists a smooth function θ(x) ∈ C ∞ () non-vanishing in , and an integer positive number M ∈ Z>0 , such that adM L,L0 [θ(x)] ≡ 0

identically in F () .

(93)

In particular, when M = 1 , the operators L and L0 are connected just by the trivial gauge transformation L = θ(x) ◦ L0 ◦ θ(x)−1 .

128


The modified Hadamard’s conjecture claims: Any Huygens’ operator L of the general form L = n+1 + (a(x), ∂) + u(x) ,

(94)

in a Minkowski space Mn+1 (n is odd, n ≥ 3) is M -gauge related to the ordinary wave operator n+1 in Mn+1 . For Huygens’ operators associated to the rational solutions of the KdV-equation (2), (3) and to Coxeter groups (5), (6) this conjecture has been proved in [4] and [7]. In these cases the required identities (93) are the following: Mk +1 [Pk (x0 )] = 0 , adL k ,L0

Mk :=

k(k + 1) , 2

(95)

where Lk is given by (2) with the potential (3) for k = 0, 1, 2, . . . and Mm +1 adL [πm (x)] = 0 , m ,L0

Mm :=

X

mα ,

(96)

α∈ 0 and a constant γ > 0 such that the covariance function obeys Z dµ(y) C(x − y) ≥ γ C(0)χΓ (x) (4) Rd

for all x ∈ R . Then for every energy E ∈ R there is a constant 0 < W (E) < ∞, independent of Λ and X, such that the finite-volume integrated density of states (3) obeys Z (ω) (ω) dP (ω) NΛ,X (E1 ) − NΛ,X (E2 ) ≤ |Λ| |E1 − E2 | W (E) (5) d

Ω

for all E1 , E2 ≤ E and all bounded open cubes Λ ⊂ Rd with |Λ| ≥ |Γ |. The proof of Theorem 1 will be deferred to the next section. Remark 3. (i) In what follows it will be assumed, without loss of generality, that the measure µ is normalized according to Z Z dµ(x) dµ(y) C(x − y) = C(0) . (6) Rd

Rd

(ii) If C(x) ≥ 0 for all x ∈ Rd , one may simply choose the signed measure µ to be Dirac’s point measure at the origin. Due to the continuity of C and since C(0) > 0, Condition (4) is then fulfilled with some sufficiently small cube Γ containing the origin and γ = inf x∈Γ C(x)/C(0). With other choices of µ it is also possible to satisfy (4) for certain covariance functions which R take on negative values. An example for the case d = 1 is given by C(x) := R dy w(x + y)w(y) with w := χ[−3,3] − 45 χ[−1,1] . A suitable choice is dµ(x) = m χ[−6,6] (x)dx , Γ =] − 1, 1[ and γ = 8m, where m > 0 is determined by the normalization (6).

136

W. Fischer, T. Hupfer, H. Leschke, P. Müller

(iii) The Lipschitz continuity (5) of the averaged finite-volume integrated density of states implies by the Chebyshev-Markov inequality that the probability of finding the spectrum of HΛ,X (V ) near a given energy E0 is controlled by the inequality o n ≤ 2|Λ| ε W (E) . (7) P ω : dist spec HΛ,X (V (ω) ) , E0 < ε It is valid for all bounded open cubes Λ ⊂ Rd with |Λ| ≥ |Γ | and for all energies E0 ∈ R and ε > 0 such that E0 +ε ≤ E. Actually it is (7) – or some weakened form of (7), where, for example, |Λ| is replaced by |Λ|α with α > 1 – that plays a key rôle in proofs of Anderson localisation for multi-dimensional random Schrödinger operators. (iv) Theorem 1 can be generalised to situations where the random potential is Gaussian but not Rd -homogeneous, the extension being essentially a matter of notation. This allows in particular to treat correlated Gaussian alloy-type potentials. Now we assume that the random field V has the ergodicity property (E) in addiR tion to the Gaussian property (G). Due to this and since Ω dP (ω) exp{−tV (ω) (0)} = exp{t2 C(0)/2} < ∞ for all t > 0, standard techniques show the existence of a nonrandom left-continuous distribution function N on R , called the integrated density of states (per volume) in the macroscopic limit, such that N (E) = lim

(ω) NΛ,X (E)

|Λ|

Λ↑Rd

.

(8)

More precisely, there is a set Ω0 ∈ A of full probability, P (Ω0 ) = 1, such that (8) holds for all ω ∈ Ω0 , for both boundary conditions X and for all E ∈ R except for the at most countably many discontinuity points of N , see [KM2, Ki1] or [Ki2, Sect. 7.3]. The non-randomness of N and Fatou’s lemma imply Z (ω) (ω) (E1 ) − NΛ,X (E2 ) (9) |N (E1 ) − N (E2 )| ≤ lim inf |Λ|−1 dP (ω) NΛ,X Λ↑Rd

Ω

for all those E1 , E2 ∈ R for which (8) is true. Accordingly, Theorem 1 and the leftcontinuity of N yield the Lipschitz continuity of N , namely (10) |N (E1 ) − N (E2 )| ≤ |E1 − E2 | W max{E1 , E2 } for all E1 , E2 ∈ R . These arguments are summarized in Corollary 1. Under the assumptions of Theorem 1 and supposing property (E), the integrated density of states in the macroscopic limit is absolutely continuous on any bounded interval and its derivative, the density of states, is locally bounded in the sense that dN (E) ≤ W (E) (11) 0≤ dE for Lebesgue-almost all E ∈ R . Remark 4. We infer from the proof of Theorem 1, which is presented in the next section, that the Wegner constant may be taken as W (E) =

d exp{βE + β 2 CE /2} −1 √ 2È + (2πβ)−1/2 , 2πC(0) γ

(12)

Density of States for Gaussian Random Potentials

137

where β > 0 is arbitrary and may be considered as a variational parameter. Furthermore, 2 − b2E ) and the constants È , BE and bE are defined below by (16), CE := C(0)(1 + BE (21) and (22), respectively. To get (12) we have made use p of the normalization (6). For the simple (but not optimal) choice β = (2CE )−1 − E + E 2 + 2CE /π one obtains the leading asymptotic low- and high-energy behaviour 1 ln W (E) , =− 2 E→−∞ E 2C(0) 3d e1/2π W (E) = √ , lim d/2 E→∞ E 2πC(0) γ lim

(13) (14)

which coincides with the known asymptotics [PF, Thms. 5.29, 9.3] of N except for the value of the constant on the right-hand side of (14).

3. A Wegner Estimate for Continuum Schrödinger Operators In this section we prove Theorem 1 by tracing it back to a Wegner estimate that holds for a wider class of random fields than the ones considered so far. The price one has to pay for the increased generality is a couple of more technical notions. From now on the random field V will only be required to fulfil the measurability property (M) and the form-boundedness property (F). We follow the line of reasoning laid down in [CH]. The strategy there is based on a one-parameter decomposition of the random field from which we abstract Definition 2. A random field V : Ω × Rd → R admits a (U, λ, u, %)-decomposition if there exists a random field U : Ω × Rd → R , a random variable λ : Ω → R and a Borel-measurable function u : Rd → R such that (i) V (ω) = U (ω) + λ(ω) u for P -almost all ω, (ii) the conditional probability measure of λ relative to the sub-sigma-algebra generated by {U (x)}x∈Rd has a jointly measurable density % : Ω × R → R+ with respect to the Lebesgue measure on R . Here R+ is the set of the non-negative reals. As a variant of the Wegner estimate in [CH] we state int S J Λ be the interior of the closure of a finite Theorem 2. Let the cube Λ = j j=1 union of pairwise disjoint open cubes Λj . For a random field V , which has the properties (M) and (F), define HΛ,X (V ) as in (2). Assume that for all 1 ≤ j ≤ J the random field V admits a (Uj , λj , uj , %j )-decomposition subject to the following three conditions: there exist five strictly positive and finite constants ν, v, β, R, Z such that for all 1 ≤ j ≤ J: (i) (ii)

νχΛj (x) ≤ uj (x) and uj (x)χΛj (x) ≤ v for all x ∈ Rd , −βνξ −βvξ (ξ) max{e , e } ≤ R for P -almost all ω, ess sup %(ω) j Z

ξ∈R

(iii) Ω

n o dP (ω) Tr χΛj exp − β HΛj ,N (Uj(ω) ) χΛj ≤ |Λj | Z.

138


Then the finite-volume integrated density of states (3) obeys Z RZ β max{E1 ,E2 } (ω) (ω) e dP (ω) NΛ,X (E1 ) − NΛ,X (E2 ) ≤ |Λ| |E1 − E2 | ν Ω

(15)

for both boundary conditions X and all E1 , E2 ∈ R . In Condition (iii) of Theorem 2 the indicator function χΛj is understood as a multiplication operator and the random Schrödinger operator HΛj ,N (Uj ) is well-defined due to the boundedness of uj on Λj which in its turn follows from Condition (i) of Theorem 2. Before we prove Theorem 2 we show how it can be exploited to deduce Theorem 1. Proof (of Theorem 1). First we recall that the properties (M) and (F) are implied by the Gaussian property (G), cf. Remark 1. Then, given E ≥ max{E1 , E2 } we partition the cube Λ into J < ∞ disjoint cubes Λj with edges of length obeying È /2 ≤ |Λj |1/d ≤ È , where we have introduced the energy-dependent length È := min {|E|−1/2 , |Γ |1/d } .

(16)

It is only because È becomes small with increasing |E| that one obtains the low-energy behaviour (13) of the Wegner constant W (E). Next we pick a cube ΓE ⊆ Γ with volume |ΓE | = `dE . In what follows, we assume without loss of generality that the edges of ΓE are parallel to those of the Λj ’s and that the measure µ obeys (6). The main point is that V admits a (Uj , λj , uj , %j )-decomposition for all j as can be inferred from the definitions Z −1/2 λ(ω) := (C(0)) dµj (y) V (ω) (y) , (17) j d R Z dµj (y) C(x − y) , (18) uj (x) := (C(0))−1/2 Uj(ω) (x) %(ω) j (ξ)

:= V

(ω)

(x) −

Rd (ω) λj uj (x) ,

−1/2 −ξ 2 /2

:= (2π)

e

.

(19) (20)

The signed measures µj are defined by suitably translating √ µ such √ the signed measure that Condition (i) of Theorem 2 is fulfilled with v = C(0) BE and ν = C(0) bE , where Z −1 sup dµ(y) C(x − y) ≤ |µ|(Rd ) , (21) BE := (C(0)) x∈ΓE Rd Z dµ(y) C(x − y) ≥ γ . (22) bE := (C(0))−1 inf x∈ΓE

Rd

Note that the integral in (17) is well-defined for P -almost all ω because of C(0) < ∞. A decomposition similar to (17) – (20) is used for other purposes in [U, p. 185]. The random field Uj is non-homogeneous and Gaussian with zero mean and covariance function (23) Dj (x, y) := C(x − y) − uj (x)uj (y) , which obeys |Dj (x, y)| ≤ C(0)(1 − b2E ) for all x, y ∈ Λj . Moreover, Uj is stochastically independent of the standardized Gaussian random variable λj as is consistently taken into account by (20). Hence, Condition (ii) of Theorem 2 is obviously satisfied with R =


139

2 (2π)−1/2 exp{β 2 C(0)BE /2} for any choice of β > 0. It remains to check Condition (iii). To do so we claim that Z n o dP (ω) Tr χΛj exp −β HΛj ,N (Uj(ω) ) χΛj Ω Z (ω) dP (ω) e−β Uj (x) . (24) ≤ Tr χΛj exp −β HΛj ,N (0) χΛj sup x∈Λj

Ω

If Uj is P -almost surely continuous, and hence bounded on Λj , this inequality follows from the Golden-Thompson inequality (see e.g. [RS, p. 320]) and Fubini’s theorem. To show (24) under our assumptions on the covariance function, which allow for noncontinuous realisations Uj(ω) with non-zero probability, we adapt the approximation result [Si1, Thm. B.10.1] to the present finite-volume situation with Neumann boundary conditions by using the appropriate Feynman-Kac formula [BR, Thm. 6.3.12] and the fact that P -almost surely Uj ∈ Lp (Λj ) , exp{−βUj } ∈ L1 (Λj ) for arbitrary finite p ≥ 1 and β > 0. Explicit computations show that the supremum in (24) does not exceed exp{β 2 C(0) −1/2 d < ∞ so that the (1 − b2E )/2} and that there is a constant z ≤ 2`−1 E + (2πβ) trace on the right-hand side of (24), the free Neumann partition function, is bounded by |Λj | z. Putting Z := z exp{β 2 C(0)(1 − b2E )/2} gives Condition (iii) of Theorem 2. Hence the proof is complete, because Theorem 1 now follows from Theorem 2 and the Wegner constant may be taken as in (12). The rest of the paper is devoted to a proof of Theorem 2. Proof (of Theorem 2). Let E2 ≤ E1 and I := [E2 , E1 [. By definition of NΛ,X and by the spectral theorem one has Z Z (ω) (ω) (ω) (ω) βE1 dP (ω) NΛ,X (E1 ) − NΛ,X (E2 ) ≤ e dP (ω) Tr FΛ,X (I) e−β HΛ,X (V ) . Ω

Ω

(25)

To bound the trace in (25) from above by the sum J X

n o (ω) (ω) (ω) (26) Tr max{e−βνλj , e−βvλj } FΛ,X (I) χΛj exp − β HΛj ,N (Uj(ω) ) χΛj

j=1

we employ the operator inequality HΛ,X (V

(ω)

)≥

J X

(ω) χΛj HΛj ,N (Uj(ω) ) + min{νλ(ω) j , vλj } χΛj

(27)

j=1

LJ 2 on L2 (Λ) = j=1 L (Λj ). It is valid for P -almost all ω and follows from DirichletNeumann bracketing, the fact that introducing “Neumann surfaces” lowers eigenvalues (see e.g. [RS, Ch. XIII.15, Prop. 4]), the (Uj , λj , uj , %j )-decomposition of V and Condition (i) of Theorem 2. Of course, in the case X = N Dirichlet-Neumann bracketing is not needed. Evaluating the trace in (25) in an orthonormal eigenbasis of HΛ,X (V (ω) ), using (27) and the Jensen-Peierls inequality [B] one obtains the bound (26). Let us now estimate the conditional expectation relative to the sub-sigma-algebra generated by {Uj (x)}x∈Rd of the j-th term in the sum (26). Since dominated convergence allows to interchange the trace with the emerging ξ-integration, this conditional expectation may be written as

140


Z dξ

Tr R

(ω) %˜(ω) j (ξ) Fj,ξ,X (I) χΛj

n exp

−β

o

HΛj ,N (Uj(ω) )

χ Λj

.

(28)

(ω) −βνξ −βvξ Here we have introduced the abbreviation %˜(ω) ,e } and j (ξ) := %j (ξ) max{e (ω) (ω) the spectral family Fj,ξ,X of the operator HΛ,X (Uj + ξuj ). Cyclic invariance of the trace and the Hölder inequality with the usual operator norm k . k and the trace norm bound (28) from above by

Z n o

dξ %˜(ω) (ξ) χ F (ω) (I) χ Tr χ exp − β HΛj ,N (U (ω) ) χ . (29) j j Λj j,ξ,X Λj Λj Λj

R

Thanks to Conditions (i) and (ii), one may now estimate the operator norm in (29) with the help of the spectral-averaging technique. To do so we apply Corollary 4.2 in [CH] since its validity extends from nonand remark that one may choose there g = %˜(ω) j negative bounded functions with compact support to non-negative bounded functions by a monotone-convergence argument. In this way we arrive at the ω- and j-independent upper bound |I| R/ν for the operator norm in (29). Hence, according to Condition (iii) the expectation of (28) with respect to dP is bounded by |Λj | |I|RZ/ν. By virtue of (25) and (26) the proof is complete. Acknowledgement. We are grateful to P. D. Hislop for stimulating discussions.

Note added in proof After completion of this work we have been able to extend our results to random Schrödinger operators with rather general magnetic fields (including constant ones). The details shall be published in an Addendum to the present paper.

References [AM] [BCH] [B] [BEE+]

[BR] [CL] [CH] [CHM] [vDK] [F]

Aizenman, M., Molchanov, S.: Localization at large disorder and at extreme energies: An elementary derivation. Commun. Math. Phys. 157, 245–278 (1993) Barbaroux, J.-M., Combes, J.M., Hislop, P.D.: Landau Hamiltonians with unbounded random potentials. Lett. Math. Phys. 40, 355–369 (1997) Berezin, F.A.: Convex operator functions. Math. USSR. Sbornik 17, 269–277 (1972). Russian original: Mat. Sbornik 88, 268–276 (1972) Bonch-Bruevich, V.L., Enderlein, R., Esser, B., Keiper, R., Mironov, A.G., Zvyagin, I.P.: Elektronentheorie ungeordneter Halbleiter. Berlin: VEB Deutscher Verlag der Wissenschaften, 1984. Russian original: Moscow: Nauka, 1981 Bratteli, O., Robinson, D.W.: Operator algebras and quantum statistical mechanics 2. 2nd ed. Berlin: Springer, 1997 Carmona, R., Lacroix, J.: Spectral theory of random Schrödinger operators. Boston: Birkhäuser, 1990 Combes, J.-M., Hislop, P.D.: Localization for some continuous, random Hamiltonians in ddimensions. J. Funct. Anal. 124, 149–180 (1994) Combes, J.M., Hislop, P.D., Mourre, E.: Spectral averaging, perturbation of singular spectrum, and localization. Trans. Am. Math. Soc. 348, 4883–4894 (1996) von Dreifus, H., Klein, A.: A new proof of localization in the Anderson tight binding model. Commun. Math. Phys. 124, 285–299 (1989) Fernique, X.M.: Regularité des trajectoires des fonctions aléatoires Gaussiennes. In: Hennequin, P.-L. (ed.) Ecole d’Eté de Probabilités de Saint-Flour IV - 1974. Lecture Notes in Mathematics vol. 480, Berlin: Springer, 1975, pp. 1–96


141

[FHLM] Fischer, W., Hupfer, T., Leschke, H., Müller, P.: Rigorous results on Schrödinger operators with certain Gaussian random potentials in multi-dimensional continuous space. In: Demuth, M., Schulze, B.-W. (eds.) Differential equations, asymptotic analysis, and mathematical physics, Berlin: Akademie Verlag, 1997, pp. 105–112 [FLM1] Fischer, W., Leschke, H., Müller, P.: Towards localisation by Gaussian random potentials in multidimensional continuous space. Lett. Math. Phys. 38, 343–348 (1996) [FLM2] Fischer, W., Leschke, H., Müller, P.: In preparation, to be submitted to J. Stat. Phys. [FS] Fröhlich, J., Spencer, T.: Absence of diffusion in the Anderson tight binding model for large disorder or low energy. Commun. Math. Phys. 88, 151–184 (1983) [Ki1] Kirsch, W.: Random Schrödinger operators and the density of states. In: Albeverio, S., Combe, Ph., Sirugue-Collin, M. (eds.) Stochastic aspects of classical and quantum systems. Lecture Notes in Mathematics vol. 1109, Berlin: Springer, 1985, pp. 68–102 [Ki2] Kirsch, W.: Random Schrödinger operators, a course. In: Holden, H., Jensen, A. (eds.) Schrödinger operators. Lecture Notes in Physics vol. 345, Berlin: Springer, 1989, pp. 264–370 [Ki3] Kirsch, W.: Wegner estimates and Anderson localization for alloy-type potentials. Math. Z. 221, 507–512 (1996) [KM1] Kirsch, W., Martinelli, F.: On the ergodic properties of the spectrum of general random operators. J. Reine Angew. Math. 334, 141–156 (1982) [KM2] Kirsch, W., Martinelli, F.: On the density of states of Schrödinger operators with a random potential. J. Phys. A 15, 2139–2156 (1982) [Kl] Klopp, F.: Localization for some continuous random Schrödinger operators. Commun. Math. Phys. 167, 553–569 (1995) [Ko1] Kotani, S.: Lyapunov indices determine absolutely continuous spectra of stationary random onedimensional Schrödinger operators. In: Itô, K. (ed.) Stochastic analysis, Amsterdam: NorthHolland, 1984, pp. 225–247 [Ko2] Kotani, S.: Lyapunov exponents and spectra for one-dimensional random Schrödinger operators. In: Cohen, J.E., Kesten, H., Newman, C.M. (eds.) Random matrices and their applications. Contemporary Mathematics vol. 50, Providence, RI: American Mathematical Society, 1986, pp. 277–286 [KS] Kotani, S., Simon, B.: Localization in general one-dimensional random systems II. Continuum Schrödinger operators. Commun. Math. Phys. 112, 103–119 (1987) [LGP] Lifshits, I.M., Gredeskul, S.A., Pastur, L.A.: Introduction to the theory of disordered systems. New York: Wiley, 1988. Russian original: Moscow: Nauka, 1982 [MH] Martinelli, F., Holden, H.: On absence of diffusion near the bottom of the spectrum for a random Schrödinger operator on L2 (Rν ). Commun. Math. Phys. 93, 197–217 (1984) [PF] Pastur, L., Figotin, A.: Spectra of random and almost-periodic operators. Berlin: Springer, 1992 [RS] Reed, M., Simon, B.: Methods of modern mathematical physics IV: Analysis of operators. New York: Academic, 1978 [SE] Shklovskii, B.I., Efros, A.L.: Electronic properties of doped semiconductors. Berlin: Springer, 1984. Russian original: Moscow: Nauka, 1979 [Si1] Simon, B.: Schrödinger semigroups. Bull. Am. Math. Soc. (N.S.) 3, 447–526 (1982). Erratum: ibid. 11, 426 (1984) [Si2] Simon, B.: Spectral averaging and the Krein spectral shift. Preprint mp arc 96–492, appear in Proc. Am. Math. Soc. [Sp] Spencer, T.: Localization for random and quasi-periodic potentials. J. Stat. Phys. 51, 1009–1019 (1988) [U] Ueki, N.: On spectra of random Schrödinger operators with magnetic fields. Osaka J. Math. 31, 177–187 (1994) [W] Wegner, F.: Bounds on the density of states in disordered systems. Z. Phys. B 44, 9–15 (1981) Communicated by B. Simon

This article was processed by the author using the LaTEX style file pljour1 from Springer-Verlag.

Commun. Math. Phys. 190, 143 – 172 (1997)

Communications in


Le Groupe Quantique Compact Libre U(n) Teodor Banica? Algèbres d’opérateurs et représentations - URA 747 du CNRS, Université de Paris Jussieu, 4 place Jussieu, 75005 Paris, France Received: 1 March 1996 / Accepted: 4 April 1997

Abstract: The free analogues of U(n) in Woronowicz’ theory [Wo2] are the compact matrix quantum groups {Au (F ) | F ∈ GL(n, C)} introduced by Wang and Van Daele. We classify here their irreducible representations. Their fusion rules turn to be related to the combinatorics of Voiculescu’s circular variable. If F F ∈ RIn we find an embedding Au (F )red ,→ C(T) ∗red Ao (F ), where Ao (F ) is the deformation of SU(2) studied in [B2]. We use the representation theory and Powers’ method for showing that the reduced algebras Au (F )red are simple, with at most one trace. Introduction L’une des constructions de base de l’analyse harmonique est la dualité de Pontryagin: elle associe a` un groupe abélien le groupe abélien de ses caractères et permet d’étudier cette correspondance auto-duale. Cette dualité a e´ té e´ tendue aux groupes non-commutatifs, mais l’objet dual (l’algèbre de convolution du groupe) n’est plus de même nature. Afin d’obtenir un cadre généralisant a` la fois les groupes et leur objets duaux, on est amené a` définir de nouveaux objets dans la catégorie des algèbres de Hopf qu’on appelle des “groupes quantiques”. Un certain nombre de familles d’exemples ont e´ té e´ tudiées au niveau des algèbres d’opérateurs. Ainsi, Woronowicz [Wo2] a défini en 1987 la classe des “groupes quantiques compacts matriciels”: un groupe quantique compact matriciel est une paire (G, u) formée d’une C∗ -algèbre unifère G et d’une matrice u ∈ Mn (G) telle que: (a) les coefficients {uij } de u engendrent une ∗-algèbre Gs dense dans PG. uik ⊗ ukj . (b) il existe un C∗ -morphisme δ : G → G ⊗min G qui envoie uij 7→ (c) il existe une application linéaire antimultiplicative κ : Gs → Gs telle que κ(κ(a∗ )∗ ) = a pour tout a ∈ Gs et telle que (Id ⊗ κ)(u) = u−1 . ? Present adress: Institut de Math´ ematiques de Luminy, case 930, F-13288 Marseille Cedex 9, France. E-mail: [email protected]

144

T. Banica

Cette définition recouvre e´ galement le cas “quantique compact” (obtenu par des limites projectives) et le cas “quantique discret” (par dualité). Le cas “quantique localement compact” a e´ té traité dans un cadre général par Baaj et Skandalis [BS]. Pour tout n ∈ N, la C∗ -algèbre universelle Au (In ) engendrée par les coefficients d’une matrice n × n unitaire, telle que sa transposée soit aussi unitaire, est un groupe quantique compact matriciel [W1, W2, VDW]. Au (In ) est un analogue de U(n) dans la théorie de Woronowicz. Cette algèbre, ainsi que ses versions “déformées” {Au (F ) | F ∈ GL(n, C)} constitue l’objet d’étude de ce papier. Je tiens a` exprimer ma profonde reconnaissance a` mon directeur de thèse, G. Skandalis. Je voudrais aussi remercier E. Blanchard pour de nombreuses discussions sur les C∗ -algèbres de Hopf, ainsi que S.Z. Wang pour plusieurs commentaires utiles sur ce papier. ´ 1. Définitions et Enonc´ es des Résultats Dans cette section on définit les groupes quantiques compacts matriciels Au (F ) (d’une manière légérement différente que dans l’article de Wang et Van Daele [VDW]) et on e´ nonce les resultats principaux. La fin de cette section contient le plan de l’article, ainsi que des rappels et notations. 1) Il existe plusieures définitions pour les morphismes entre les groupes quantiques compacts matriciels, auxquelles correspondent des différentes notions d’isomorphisme. Sans rentrer dans les détails (dans ce papier on dira que (G, u) = (H, v) si G = H en tant que C∗ -algèbres et si u = v), rappelons la définition [Wo2] de la similarité: Deux groupes quantiques compacts matriciels (G, u) et (H, v) avec u ∈ Mn (G), v ∈ Mm (H) sont dits similaires (on e´ crira G ∼sim H) si n = m et s’il existe une matrice Q ∈ GL(n, C) et un C∗ -isomorphisme f : G → H tel que (Id ⊗ f )(u) = QvQ−1 . 2) Soit (G, u) un groupe quantique compact matriciel. On appelle représentation de (G, u) toute matrice inversible r ∈ Mk (G) telle que X (Id ⊗ δ)(r) = r12 r13 := eij ⊗ rik ⊗ rkj . La théorie de “Peter-Weyl” de Woronowicz [Wo2] montre que toute représentation est e´ quivalente a` une représentation unitaire. En particulier, v = Q−1 uQ est unitaire pour une certaine matrice Q ∈ GL(n, C). Quitte a` remplacer (G, u) par un groupe quantique compact matriciel similaire, on peut supposer que u est unitaire. 3) Soit (G, u) un groupe quantique compact matriciel avec u ∈ Mn (G) unitaire. Alors la représentation u := (u∗ij ) est e´ quivalente a` une représentation unitaire, donc il existe une matrice F ∈ GL(n, C) telle que F uF −1 soit unitaire. Il en résulte que G est un quotient de la C∗ -algèbre Au (F ), où: Définition 1. Pour tout n ∈ N et toute matrice F ∈ GL(n, C) on définit la C∗ -algèbre Au (F ) avec générateurs {uij }1≤i,j≤n et les relations qui rendent unitaires les matrices u = (uij ) et F uF −1 . Remarquons que Au (F ) est bien définie: si J est l’idéal bilatère engendré dans l’algèbre libre sur 2n2 variables L := C < uij , u∗ij > par les relations qui rendent unitaires les matrices u := (uij ) et F uF −1 := F (u∗ij )F −1 , alors les images des générateurs uij , u∗ij dans le quotient L/J sont de norme ≤ 1 pour toute C∗ -norme sur L/J. Donc L/J admet une C∗ -algèbre enveloppante, qu’on peut noter Au (F ).

Free Unitary Compact Quantum Group

145

(Au (F ), u) est un groupe quantique compact matriciel. En effet, on a v unitaire =⇒ v12 v13 unitaire, ce qui appliqué a` v = u et a` v = F uF −1 (avec la remarque que F u12 u13 F −1 = (F uF −1 )12 (F uF −1 )13 ) permet de définir δ par propriété universelle. Enfin, par [Wo4] l’existence de l’antipode κ est e´ quivalente au fait que u, u soient inversibles, ce qui est e´ vident dans le cas de Au (F ). Remarquons que pour tout groupe compact G ⊂ U(n), C(G) est un quotient de C(U(n)). Par ce qui précède, l’analogue de U(n) parmi les groupes quantiques compacts est la famille {Au (F ) | F ∈ GL(n, C)}. Remarque. Les relations qui définissent Au (F ) sont: uu∗ = u∗ u = (F ∗ F )u(F ∗ F )−1 ut = ut (F ∗ F )u(F ∗ F )−1 = I. On en déduit des e´ galités entre les Au (F ): √ Au (F ) = Au ( F ∗ F ) = Au (λF ), ∀ F ∈ GL(n, C), λ ∈ C∗ . Il existent aussi d’autres similarités entre les Au (F ) - si V, W ∈ U(n) et F ∈ GL(n, C) 6). On pourrait donc utiliser d’autres alors Au (F ) ∼sim Au (V F W ) (voir la Proposition√ paramètres √ pour les Au (F ) - par exemple F ∗ F , ou F ∗ F , ou encore la liste des valeurs propres de F ∗ F etc., voir [W2, VDW]. Bien-sûr, le choix du paramètre n’est pas un problème sérieux: on obtient toujours les mêmes objets, au moins modulo la similarité. Le quotient de Au (F ) par les relations u = F uF −1 pourrait eˆ tre considéré comme e´ tant une “version orthogonale de Au (F )”. Remarquons que la condition u = F uF −1 −1 implique u = F uF , donc u = (F F )u(F F )−1 . Il en résulte que si F F n’est pas un multiple scalaire de l’identité de Mn (C), alors u est réductible dans ce quotient. Remarquons e´ galement que F F = cIn avec c ∈ C implique F F = cIn , donc c = c. Définition 2. Pour tout n ∈ N et pour toute matrice F ∈ GL(n, C) telle que F F = cIn avec c ∈ R on note Ao (F ) le quotient de Au (F ) par les relations u = F uF −1 . Les représentations irréductibles de Ao (F ) sont indéxées par N, et leur formules de fusion sont exactement les formules connues pour les représentations de SU(2) ([B2], voir le Théorème 4 ci-dessous). Notations. N ∗ N est le coproduit dans la catégorie des mono¨ıdes de deux copies de N ayant α, β comme générateurs ; e est l’élément neutre de N ∗ N ; − est l’involution antimultiplicative de N ∗ N définie par e = e, α = β et β = α. Le résultat principal de ce papier est le suivant: Théorème 1. Soit n ∈ N et F ∈ GL(n, C). Alors: (i) Les représentations irréductibles de (Au (F ), u) sont indexées par N ∗ N, avec re = 1, rα = u, rβ = u. Pour tous les x, y ∈ N ∗ N on a les formules rx = rx et: X rab . r x ⊗ ry = {a,b,g∈N ∗N |x=ag,y=gb}

(ii) La sous-algèbre de Au (F ) engendrée par les caractères de toutes les représentations est la ∗-algèbre libre sur le caractère χ(u) de la représentation fondamentale. (iii) χ(u)/2 est une variable circulaire dans Au (F ), munie de la mesure de Haar. (iv) Si F F ∈ RIn alors Au (F )red se plonge dans C(T) ∗red Ao (F ) par uij 7→ zvij (où v est la représentation fondamentale de Ao (F ) et z est le générateur canonique de C(T)).

146

T. Banica

Le point (i) montre que la famille F = {Au (F ) | n ∈ N, F ∈ GL(n, C)} a la propriété remarquable suivante: Si G, H ∈ F alors il existe une bijection ψ entre les classes d’équivalence de représentations de G et celles de H qui préserve les sommes et les produits tensoriels et qui envoie l’ensemble des représentations irréductibles de G sur l’ensemble des représentations irréductibles de H, ainsi que la représentation fondamentale de G sur celle de H. Un résultat important de ce type, pour la famille (à un paramètre réel positif) de groupes quantiques compacts matriciels associés a` une algèbre de Lie classique, a e´ té démontré par Rosso [R1, R2]. Un autre résultat dans cette direction, mais cette fois-ci de “rigidité”, est celui de [B2] - la famille {Ao (F ) | n ∈ N, F ∈ GL(n, C), F F ∈ RIn } vérifie la propriété ci-dessus, mais de plus est maximale. Le résultat suivant est du même type: Théorème 2. Si les représentations irréductibles d’un groupe quantique compact matriciel (G, u) sont indexées par N ∗ N, avec re = 1, rα = u, rβ = u et rx ⊗ ry = P x=ag,y=gb rab , alors il existe un n ∈ N et une matrice F ∈ GL(n, C) tels que Gp ∼sim Au (F ). La théorie des représentations de Au (F ) fait l’objet de la première partie (Sects. 2, 3, 4) de ce papier. Dans la deuxième partie (Sects. 6, 7, 8) on utilise la théorie des représentations pour résoudre certaines questions topologiques liées aux C∗ -algèbres Au (F ) et Au (F )red . Rappelons que pour un groupe quantique compact matriciel (G, u) la mesure de Haar h n’est pas forcément une trace, mais elle vérifie la formule h(xy) = h(y(f1 ∗ x ∗ f1 )), ∀ x, y ∈ Gs où ∗ est la convolution au dessus de Gs et {fz }z∈C est une famille canonique de caractères de Gs (voir le Théorème 5.6 de [Wo2]). Théorème 3. Soit n ∈ N et F ∈ GL(n, C). Alors Au (F )red est simple. Soient s, t ∈ R et soit ψ un e´ tat de Au (F )red tel que ∀ x, y ∈ Au (F )s on ait ψ(xy) = ψ(y(fs ∗ x ∗ ft )). Alors ψ est la mesure de Haar de Au (F )red . En particulier, si F est un multiple scalaire d’une matrice unitaire, alors Au (F )red est simple a` trace unique ; sinon, Au (F )red est simple sans trace. Parmi les autres résultats sur Ao (F ) et Au (F ), citons: - un résultat de commutation dans Au (I2 ). - l’égalité de facteurs Au (I2 )"red = W ∗ (F2 ). - la non-moyennabilité de Ao (F ) et Au (F ). - la non-nucléarité de Ao (In )red et Au (In )red . - des remarques sur les mesures de Haar de Ao (F ) et Au (F ). Une partie de ces résultats sont des cas particuliers d’énoncés plus généraux sur les groupes quantiques compacts. Citons ici le résultat de simplicité (la Proposition 9), dont la démonstration pour G = C∗red (Fn ) contient une simplification par rapport aux démonstrations classiques [P, H, HS] de la simplicité de C∗red (Fn ).


147

L’organisation de ce travail est la suivante: 2eme section: on rappelle les résultats de [B2] sur Ao (F ) et on donne une description (en termes de certaines partitions non-croisées) de l’espace des vecteurs fixes de la représentation u⊗k de Ao (F ). 3eme section: on construit l’algèbre abstraite PA engendrée par des {rx | x ∈ N ∗ N} qui se multiplient par les formules rx ry = x=ag,y=gb rab et on montre que A ' C < X, X ∗ >. En utilisant cette algèbre, ainsi que la même methode que dans le cas “orthogonal” [B2], on voit que la démonstration du Théorème 1 est e´ quivalente au calcul des dimensions des commutants des représentations de la forme u⊗m1 ⊗ u⊗n1 ⊗ u⊗m2 ⊗ u⊗n2 ⊗ ...

(?).

Ces dimensions sont des ∗-moments du caractère χ(u) de la représentation fondamentale de Au (F ) par rapport a` la mesure de Haar, et en fait on voit que χ(u)/2 doit eˆ tre une variable circulaire. 4eme section: si F F ∈ RIn on combine les résultats sur Ao (F ) avec un résultat de probabilités non commutatives pour demontrer le Théorème 1. On utilise ensuite des résultats de reconstruction de [Wo3] pour trouver un système de générateurs des espaces des vecteurs fixes des représentations de la forme (?). Les dimensions de ces espaces sont exactement les ∗-moments de χ(u), et en utilisant cette remarque on passe du cas F F ∈ RIn au cas général F ∈ GL(n, C). 5eme section: on decrit les Ao (F ) et Au (F ) pour F ∈ GL(2, C). Le point (iv) du Théorème 1 permet d’identifier Au (I2 )red comme une sous-C∗ -algèbre de C(T) ∗red C(SU(2)), et on en déduit deux plongements (de C∗ -algèbres de Hopf) de C(SO(3)) dans Au (I2 ), ainsi que l’égalité de facteurs Au (I2 )"red = W ∗ (F2 ). 6eme section: on utilise les caractères {fz } de [Wo2] pour “perturber” la représentation adjointe d’un groupe quantique compact matriciel. 7eme section: on généralise aux groupes quantiques compacts la “Propriété de Powers” de de la Harpe [H], ainsi que la démonstration de simplicité de [HS]. L’idée est de remplacer les automorphismes intérieurs x 7→ ug xu∗g du cas discret par les applications b complètement positives de la forme x 7→ ad(r)(x), avec r ∈ A. 8eme section: Au (F )red n’a pas la propriété de Powers, mais en utilisant les calculs de la 6eme et 7eme section on arrive a` démontrer le Théorème 3. Rappels et Notations: A) matrices: on note {e1 , ..., en } la base canonique de Cn , et eij le système d’unités matricielles de Mn (C), qui vérifie Si A est une ∗-algèP bre et u ∈ Mn (A), P eij : e∗j 7→tei . P P eij ⊗ uij , u = eij ⊗ uji , u∗ = eij ⊗ u∗ji . u = eij ⊗ uij , on note u = B) représentations: pour tout groupe quantique compact G on note Rep(G) l’enb ⊂ Rep(G) semble des classes d’équivalence de représentations unitaires de G et G l’ensembleP des classes de représentations Punitaires irréductibles. Si u = eij ⊗ uij ∈ Mn (G) etP v = eij ⊗ vij ∈ Mm (G) sont des représentations, on note u ⊗ v la matrice u13 v23 := eij ⊗ ekl ⊗ uij vkl , et u + v la matrice diag(u, v). Alors u ⊗ v et u + v sont des représentations. L’application (u, v) 7→ u ⊗ v induit une structure de mono¨ıde sur Rep(G).PDe même pour l’application (u, v) 7→ u + v Le caractère de u est χ(u) := uii ∈ G. On a χ(u + v) = χ(u) + χ(v) et χ(u ⊗ v) = χ(u)χ(v). (voir [Wo2, Wo3]). C) théorie de “Peter-Weyl” de Woronowicz: on note Gcentral l’espace linéaire (donc ∗-algèbre) engendré dans la ∗-algèbre “des coefficients” Gs par les caractères de toutes les représentations. On utilisera souvent, sans référence, le résultat fondamental suivant (Th. 5.8 de [Wo2]):

148

T. Banica

b est une La mesure de Haar est une trace sur Gcentral . L’ensemble {χ(u) | u ∈ G} base de Gcentral , orthonormée par rapport au produit scalaire associé a` la mesure de Haar. D) version pleine et réduite: la version réduite d’un groupe quantique compact matriciel (G, u) est Gred = G/{x | h(xx∗ ) = 0} (h e´ tant la mesure de Haar de G). La version pleine est Gp = C ∗ (Gs ) (la C∗ -algèbre enveloppante de Gs ). Alors Gp et Gred sont des groupes quantiques compacts matriciels (cf. [Wo2, BS]). G est dit moyennable si la projection Gp → Gred est un isomorphisme. Il est dit plein (resp. réduit) si la projection Gp → G (resp. G → Gred ) est un isomorphisme. On a Gs = Hs ⇐⇒ Gred = Hred ⇐⇒ Gp = Hp . Notons aussi que Ao (F ) et Au (F ) sont pleins. E) liberté: si (A, φ) est une ∗-algèbre unifère munie d’une forme linéaire unitale, une famille de sous-algèbres 1 ∈ Ai ⊂ A (i ∈ I) est dite libre si aj ∈ Aj ∩ ker(φ) avec ij 6= ij+1 , 1 ≤ j ≤ n − 1 implique a1 a2 ...an ∈ ker(φ) (voir [VDN]). Deux e´ léments a, b ∈ A sont dits ∗-libres si les deux ∗-algèbres unifères qu’ils engendrent dans A sont libres. Exemple fondamental: soient (A, φ) et (B, ψ) deux C∗ -algèbres unifères munies d’états et A ∗ B le produit libre (= coproduit dans la catégorie des C∗ -algèbres unifères) de A et B. Si on note φ ∗ ψ le produit libre de φ et ψ et πφ∗ψ la représentation GNS associée, alors πφ∗ψ (A) et πφ∗ψ (B) sont libres dans πφ∗ψ (A ∗ B) (voir [A, VDN]). F) produits libres: si (G, u) et (H, v) sont deux groupes quantiques compacts alors (G ∗ H, diag(u, v)) est un groupe quantique compact matriciel plein, et sa mesure de Haar est le produit libre h ∗ k des mesures de Haar h de G et k de H (voir [W2]). Le produit libre réduit πh∗k (G ∗ H) sera noté G ∗red H ; c’est un groupe quantique compact matriciel réduit. Notons que h, k e´ tant fidèles sur Gred , Hred respectivement, on a des plongements canoniques de Gred et Hred dans G ∗red H. G) ∗-distribution: pour tout e´ lément a ∈ (M, φ) d’une ∗-algèbre munie d’une forme linéaire, sa ∗-distribution est la fonctionnelle sur C < X, X ∗ > donnée par P 7→ φ(P (a, a∗ )), i.e. la composée: X7→a

φ

C < X, X ∗ > −→ M −→ C. Les ∗-moments de a sont les valeurs de µa sur les monomes non-commutatifs en X, X ∗ , i.e. sur le mono¨ıde engendré dans (C < X, X ∗ >, ·) par X et X ∗ . Si (M, φ) est une C∗ -algèbre munie d’un e´ tat fidèle et a = a∗ , alors la ∗-distribution µa peut eˆ tre vue (par restriction a` C[X], ensuite en complètant) comme une mesure de probabilité sur le spectre de a. H) √ variables circulaires: la loi semicirculaire (centrée) est la mesure γ0,1 = 2/π 1 − t2 dt sur [−1, 1]. Tout hermitien ayant cette distribution est dit semicirculaire.√Un quart-circulaire est un e´ lément positif ayant comme distribution la mesure 4/π 1 − t2 dt sur [0, 1]. Un Haar-unitaire est un unitaire u tel que µu (X k ) = 0 pour tout k 6= 0. Une variable g est dite circulaire si 2−1/2 (g + g ∗ ) et −i2−1/2 (g − g ∗ ) sont semicirculaires et libres (voir [VDN]). 2. Compléments sur Ao (F ) On voit facilement a` partir de la définition de Ao (F ) que Ao même plus, qu’on a une e´ galité (modulo la similarité)

0 1 −1 0

= C(SU(2)), et

{Ao (F ) | F ∈ GL(2, C), F F ∈ RI2 } = {Sµ U(2) | µ ∈ [−1, 1] − {0} }


149

où Sµ U(2) sont les déformations de S1 U(2) := C(SU(2)) définies par Woronowicz dans [Wo1, Wo2] (voir 5eme section). Si F ∈ GL(n, C) avec n arbitraire on a le résultat suivant. Théorème 4. [B2] Soit n ∈ N et F ∈ GL(n, C) avec F F ∈ RIn . Alors les représentations irréductibles de Ao (F ) sont auto-adjointes et indexées par N, avec r0 = 1, r1 = u et rk ⊗ rs = r|k−s| + r|k−s|+2 + ... + rk+s−2 + rk+s (i.e. les mêmes formules que pour les représentations de SU(2)). Rappelons bri` Pevement la démonstration: la condition F F ∈ RIn montre que la projection sur C Fji ei ⊗ ej , qui entrelace u⊗2 , définit pour tout k une représentation de l’algèbre de Jones Aβ,k dans M or(u⊗k , u⊗k ). En utilisant les résultats de [Wo3] on voit que cette représentation est surjective, et l’inégalité dim(M or(u⊗k , u⊗k )) ≤ dim(Aβ,k ) ≤ Ck ainsi obtenue permet de construire (par récurrence sur k) des représentations irréductibles rk de Ao (F ) qui vérifient les mêmes formules de multiplication que celles de SU(2). Un corollaire de la démonstration (voir Remarque (ii) de [B2]) est l’égalité dim(M or(u⊗k , u⊗k )) = dim(Aβ,k ) = Ck =

(2k)! . k!(k + 1)!

Notons h la mesure de Haar de Ao (F ). On a (cf. Rappel C): dim(M or(u⊗k , u⊗k )) = h(χ(u)⊗2k ) = dim(M or(1, u⊗2k )). Les nombres de Catalan Ck ont une autre propriété remarquable - ce sont les moments de la loi semicirculaire de Wigner et Voiculescu. En effet, on peut calculer les moments de γ0,1 a l’aide de 3.3 et 3.4. de [VDN], de la formule des résidus et celle du binôme: Z (2k)! 2k −1 −1 γ0,1 (X ) = (2k + 1) (2πi) . (z −1 + z/4)2k+1 = 4−k k!(k + 1)! T En combinant toutes ces e´ galités, on en déduit que: Proposition 1. χ(u)/2 ∈ (Ao (F ), h) est une variable semicirculaire. 0 1 Remarque. Si F = , alors la caractère de la représentation fondamentale −1 0 a b de Ao (F ) = C(SU(2)) est χ(u) = 2Re(a). Le fait que Re(a) soit u = −b a semicirculaire par rapport a` la mesure de Haar de SU(2) peut eˆ tre vu géometriquement, en identifiant SU(2) avec la sphère S3 , et sa mesure de Haar avec la mesure uniforme sur cette sphère. Corollaire 1. (G. Skandalis) Si F ∈ GL(2, C) alors Ao (F ) est moyennable. Si F ∈ GL(n, C) et n > 2 alors Ao (F ) n’est pas moyennable. Démonstration. Le support de la loi semicirculaire e´ tant [−1, 1] et h e´ tant fidèle sur Ao (F )red , on obtient que Sp(χ(u)/2) = [−1, 1] dans Ao (F )red (voir Rappels). Donc si n ≥ 3, alors n − χ(u) est inversible dans Ao (F )red . Mais la coünité de Ao (F ) est un ∗-morphisme unital qui envoie n − χ(u) sur 0, donc Ao (F ) 6= Ao (F )red . Enfin, si F ∈ GL(2, C), alors Ao (F ) est similaire a` un certain Sµ U(2) (voir 5eme section), qui est moyennable, cf. [N, Bl].

150

T. Banica

Remarque. Une partie des résultats classiques sur la moyennabilité a e´ té e´ tendue aux groupes quantiques localements compacts dans [BS, Bl] (voir aussi la Proposition 10 ci-dessous). La démonstration ci-dessus de la non-moyennabilité de Ao (F ) peut eˆ tre e´ tendue a` des groupes quantiques compacts matriciels quelconques - on démontre par la même méthode que (G, u) avec u ∈ Mn (G) est moyennable si et seulement si le support de la loi de Re(χ(u)) par rapport a` la mesure de Haar contient n. Rappelons que pour toute représentation r ∈ B(Hr )⊗G d’un groupe quantique compact G, les vecteurs fixes de r sont les x ∈ Hr tels que r(x ⊗ 1) = (x ⊗ 1). Ces vecteurs forment un sous-espace vectoriel de Hr qui s’identifie naturellement avec M or(1, r). On va donner maintenant une description des vecteurs fixes de la représentation u⊗k de Ao (F ). Les résultats qui suivent seront utilisés dans la 4eme section, pour démontrer le Théorème 1 pour les matrices F qui ne vérifient pas (!) la condition F F ∈ RIn . Ainsi, le lecteur intéressé uniquement par les algèbres Au (F ) avec F F ∈ RIn (e.g. par Au (In )) pourra passer directement a` la section suivante. Lemme 1 ([B2]). Soit n ∈ N et F ∈ GL(n, C) avec F F ∈ RIn . Soit H = Cn , avec la base orthonormale {ei }. P (i) L’opérateur E ∈ B(C, H ⊗2 ), x 7→ x ei ⊗ F ei est dans M or(1, u⊗2 ). (ii) On a (E ∗ ⊗ IdH )(IdH ⊗ E) ∈ CIdH . (iiii) Pour r, s ∈ N, on définit les ensembles M or(r, s) ⊂ B(H ⊗r , H ⊗s ) des combinaisons linéaires de produits (composables) d’applications de la forme IdH ⊗k ou IdH ⊗k ⊗ E ⊗ IdH ⊗p ou IdH ⊗k ⊗ E ∗ ⊗ IdH ⊗p . Alors la W ∗ -catégorie concrète mono¨ıdale des représentations de Ao (F ) est la complétion (dans le sens de [Wo3]) de la W ∗ -catégorie concrète mono¨ıdale W (F ) := {N, +, {H ⊗r }r∈N , {M or(r, s)}r,s∈N }

En gardant toutes les notations, on a: Lemme 2. (i) On note I(p) = IdH ⊗p et V (p, q) = I(p) ⊗ E ⊗ I(q). Alors tout morphisme de W (F ) est une combinaison linéaire d’applications de la forme I(.) ou de la forme V (., .) ◦ ... ◦ V (., .) ◦ V (., .)∗ ◦ ... ◦ V (., .)∗ . (ii) Pour tout k ≥ 0, les applications de la forme M ⊗I(1)⊗N avec M ∈ M or(0, 2x), N ∈ M or(0, 2y) et x + y = k engendrent M or(1, 2k + 1). (i) Pour tout k ≥ 0, les applications de la forme (I(1) ⊗ M ⊗ I(1) ⊗ N ) ◦ E avec M ∈ M or(0, 2x), N ∈ M or(0, 2y) et x + y = k engendrent M or(0, 2k + 2). Démonstration. Le point (i), i.e. le fait qu’on “peut passer les ∗ a` droite”, résulte du point (ii) du Lemme 1. On démontre (ii) par récurrence sur k. Pour k = 0 le point (i) montre que M or(1, 1) = {CI(1)}. Soit donc k ≥ 1 et A ∈ M or(1, 2k + 1). Par le point (i), A est une combinaison linéaire d’applications de la forme V (k1 , s1 ) ◦ ... ◦ V (km , sm ), avec k1 + s1 = 2k − 1 et T := V (k2 , s2 ) ◦ ... ◦ V (km , sm ) dans M or(1, 2k − 1). Par l’hypothèse de récurrence T est une combinaison linéaire d’applications de la forme B = (M ⊗ I(1) ⊗ N ) pour certains M ∈ M or(0, 2x) et N ∈ M or(0, 2y), avec x + y = k − 1, donc: – soit k1 ≥ 2x+1, et alors V (k1 , s1 )◦B = M ⊗I(1)⊗((I(k1 −2x−1)⊗E ⊗I(s1 ))◦N ). – soit k1 ≤ 2x, et alors V (k1 , s1 ) ◦ B = ((I(k1 ) ⊗ E ⊗ I(2x − k1 )) ◦ M ) ⊗ I(1) ⊗ N .


151

On démontre maintenant (iii): soit A ∈ M or(0, 2k + 2). Par le point (i), A est une combinaison linéaire d’applications de la forme B = V (k1 , s1 ) ◦ ... ◦ V (km , sm ). Remarquons que V (km , sm ) = V (0, 0), donc on peut considérer le plus petit p tel que kp = 0. Alors B = (I(1) ⊗ G) ◦ (E ⊗ I(sp )) ◦ K, avec G = V (k1 − 1, s1 ) ◦ ... ◦ V (kp−1 − 1, sp−1 ) et K = V (kp+1 , sp+1 ) ◦ ... ◦ V (km , sm ). Il en résulte que: B = (I(1) ⊗ G) ◦ (I(2) ⊗ K) ◦ E = (I(1) ⊗ (G ◦ (I(1) ⊗ K))) ◦ E. Mais G◦(I(1)⊗K) ∈ M or(1, 2k +1) est, par le point (ii), de la forme M ⊗I(1)⊗N , pour certains M ∈ M or(0, 2x) et N ∈ M or(0, 2y), et (iii) en résulte. Proposition 2. On définit pour tout k ∈ N la partie W2k (F ) ⊂ M or(0, 2k) par W0 (F ) = 1 et (par récurrence) par: W2k+2 (F ) = ∪k=x+y {(I(1) ⊗ M ⊗ I(1) ⊗ N ) ◦ E | M ∈ W2x (F ), N ∈ W2y (F )} Alors W2k (F ) est une base de M or(0, 2k), ∀ k ≥ 0. Démonstration. Les nombres Dk := Card(W2k (F )) vérifient D0 = D1 = 1 et X Dx Dy . Dk+1 = k=x+y

Ce P sontk donc les nombres de Catalan (classique, considérer le carré de la série Dk z ...). Il en résulte que Card(W2k (F )) = dim(M or(0, 2k)). D’autre part, le point (iii) du Lemme 2 montre que W2k (F ) engendre M or(0, 2k). ` ` Remarque. Soit P = P1 ... Pk une partition non-croisée en parties a` deux e´ léments de {1, ..., 2k}, i.e. une partition telle que si on note Pm = {im , jm } avec im < jm pour chaque 1 ≤ m ≤ k, alors: ∀ m 6= n, im < in < jm =⇒ jn < jm . On associe a` P le vecteur suivant de (Cn )⊗2k : X Fsj1 si1 ...Fsjk sik es1 ⊗ ... ⊗ es2k . v(P ) = 1≤s1 ...s2k ≤k

On peut montrer par récurrence sur k, en utilisant la Proposition 2, que l’ensemble de ces v(P ) coinc¨ıde avec l’ensemble {X(1) | X ∈ W2k (F )}, donc est une base de l’espace des vecteurs fixes de la représentation u⊗2k de Ao (F ). Ceci permet en principe de calculer la mesure de Haar de Ao (F ) - pour toute representation r d’un groupe quantique compact matriciel on a (Id ⊗ h)(r) = projecteur sur l’espace des vecteurs fixes de r (voir [Wo2]). 3. Reconstruction de Au (F )central On construit et on e´ tudie dans cette section l’algèP bre A engendrée par des {rx | x ∈ N ∗ N} qui se multiplient par les formules rx ry = x=ag,y=gb rab . Notations. N ∗ N est le produit libre (i.e. coproduit dans la catégorie des mono¨ıdes) de deux copies de N, notées multiplicativement {e, α, α2 , ...} et {e, β, β 2 , ...}. On considère l’ensemble A de fonctions N ∗ N → C avec support fini. On va identifier N ∗ N ⊂ A,

152

T. Banica

comme masses de Dirac. Avec l’addition et la multiplication des fonctions A est l’algèbre des polynômes non commutatifs en deux variables, c’est a` dire on a un isomorphisme: (C < X, X ∗ >, +, ·) ' (A, +, ·) par X 7→ α , X ∗ 7→ β. On définit sur N ∗ N une involution antimultiplicative par e = e, α = β et β = α. Cette involution s’étend par antilinéarité en une involution de A, notée encore −. On note En ⊂ A l’espace linéaire engendré par les e´ léments de N ∗ N de longeur ≤ n. Soit l2 (N ∗ N) la complétion de A pour la norme 2. Notons que les e´ léments de N ∗ N (vus comme e´ léments de A, donc de l2 (N ∗ N), voir les identifications ci-dessus) forment une base orthonormée de l2 (N ∗ N). Soit τ0 : x 7→< x(e), e > l’état canonique sur B(l2 (N ∗ N)). On définit S, T ∈ 2 B(l (N ∗ N)) par linéarité et S(x) = αx, T (x) = βx pour tout x ∈ N ∗ N. Rappel. A tout espace de Hilbert H on peut associer [VDN] l’espace de Fock plein F (H): si {hi }i∈I une base orthonormale de H, alors {hi1 ⊗ hi2 ⊗ ... ⊗ hik | k ≥ 0} est une base orthonormale de F (H) (on fait la convention que pour k = 0, le vecteur correspondant est celui du vide). Notons {fi }i∈I les générateurs du mono¨ıde libre N∗I . Alors la base orthonormale canonique de l2 (N∗I ) est la famille {δm }, avec m ∈ N∗I , donc de la forme m = fi1 ...fik . On a donc une isometrie: l2 (N∗I ) ' F (H) par δfi1 ...fik 7→ hi1 ⊗ hi2 ⊗ ... ⊗ hik . L’opérateur de création l(hi ) correspond ainsi a` λN ∗I (fi ), où λN ∗I est la représentation régulière gauche (par isometries !) du mono¨ıde N∗I . Pour I = {1, 2} on a S = λN ∗N (α) et T = λN ∗N (β), donc en identifiant l2 (N ∗ N) = F (H), avec H de base orthonormale {h1 , h2 }, on a: S = l(h1 ), T = l(h2 ). Lemme 3. On définit l’application : N ∗ N × N ∗ N → A par X ab. x y = x=ag,y=gb

(i) s’étend par linéarité en une multiplication associative sur A. (ii) Si P : (A, +, ·) → (B(l2 (N ∗ N)), +, ◦) est le ∗-morphisme défini par α 7→ S + T ∗ et J : A → A est l’application f 7→ P (f )e, alors (J − Id)En ⊂ En−1 pour tout n. (iii) J est un isomorphisme de ∗-algèbres (A, +, ·) ' (A, +, ). Démonstration. (i) Notons que est bien définie, car la somme est finie. Montrons qu’elle est associative. Soient x, y, z ∈ N ∗ N. Alors (x y) z = X {g,a,b∈N ∗N |x=ag,y=gb}

ab z =

X {g,h,a,b,c,d∈N ∗N |x=ag,y=gb,ab=ch,z=hd}

cd.


153

Remarquons que pour a, b, c, h ∈ N ∗ N l’égalité ab = ch est e´ quivalente a` une décomposition de la forme b = uh, c = au avec u ∈ N ∗ N, ou de la forme a = cv, h = vb pour un certain v ∈ N ∗ N. Donc (x y) z =

X

X

aud +

{g,h,a,d,u∈N ∗N,x=ag,y=guh,z=hd}

cd

{g,b,c,d,v∈N ∗N,x=cvg,y=gb,z=bvd}

Un calcul similaire montre que x (y z) est donné par la même formule, donc est associative. (ii) Soit f ∈ A. On peut vérifier facilement que P (α)f = (S + T ∗ )f = α f . Donc J(αg) = P (α)J(g) = α J(g) = J(α) J(g) pour toute g ∈ A, et par le même argument on obtient J(βg) = J(β) J(g), pour toute g ∈ A. (A, +, ·) e´ tant engendrée par α et β, il en résulte que J est un morphisme d’algèbres: J : (A, +, ·) → (A, +, ). On démontre par récurrence sur n ≥ 1 que (J − Id)En ⊂ En−1 . Pour n = 1 on a J(α) = α, J(β) = β et J(e) = e, et comme E1 est engendrée par e, α, β on a J = Id sur E1 . Supposons que c’est vrai pour n et soit k ∈ En+1 . On ecrit k = αf + βg + h avec f, g, h ∈ En (a noter que cette décomposition n’est pas unique). Alors: (J − Id)k = J(αf + βg + h) − (αf + βg + h) = = [(S + T ∗ )J(f ) + (S ∗ + T )J(g) + J(h)] − [Sf + T g + h] = = S(J(f ) − f ) + T (J(g) − g) + T ∗ J(f ) + S ∗ J(g) + (J(h) − h). En appliquant l’hypothèse de récurrence a` f, g, h on trouve que En contient tous les termes de la somme, donc contient (J − Id)k et on a fini. Enfin, pour démontrer (iii) il nous reste a` voir que J préserve l’involution ∗ et qu’il est une bijectif. On a J∗ = ∗J sur les générateurs {e, α, β} de A, donc J préserve l’involution. Aussi par (ii), la restriction de J −Id a` En est un endomorphisme nilpotent, donc J est bijectif. Lemme 4. Soit (G, u) un groupe quantique compact matriciel et soit 9G : (A, +, ) → G l’unique morphisme défini par α 7→ χ(u), β 7→ χ(u) (cf. point (iii) du Lemme 3). Soit n ≥ 1 et supposons que 9G (x) est le caractère d’une représentation irréductible rx de G, pour tout x ∈ N ∗ N de longeur ≤ n. Alors 9G (x) est le caractère d’une représentation (non nulle) de G, pour tout x ∈ N ∗ N de longeur n + 1. Démonstration. Pour n = 1 c’est clair. Supposons n ≥ 2 et soit x ∈ N ∗ N de longeur n + 1. Si x contient une puissance ≥ 2 de α ou de β, par exemple si x = zα2 y, alors on pose rx := rzα ⊗ rαy et on a fini. Supposons donc que x est un produit des α alternant avec des β. On peut supposer que x commence avec α. Alors x = αβαy, avec y ∈ N ∗ N de longeur n − 2. Notons que l’égalité 9G (z) = 9G (z)∗ est vraie sur les générateurs {e, α, β} de N∗N, donc elle est vraie pour tout z ∈ N ∗ N. Si est le produit scalaire sur G associé a` la mesure de Haar, alors (voir Rappels) < χ(rα ⊗ rβαy ), χ(rαy ) >=< χ(rβαy ), χ(rβ ⊗ rαy ) >= < χ(rβαy ), 9G (β αy) >=< χ(rβαy ), 9G (βαy) + 9G (y) >= < χ(rβαy ), χ(rβαy ) + χ(ry ) > ≥ 1.

154

T. Banica

Comme rαy est irréductible, il en résulte qu’elle est une sous-représentation de rα ⊗ rβαy . Donc χ(rα ⊗ rβαy ) − χ(rαy ) = 9G (α βαy − αy) = 9G (x) est le caractére d’une représentation de G. Soit (G, u) un groupe quantique compact matriciel et notons fx = 9G (x) pour tout x ∈ N ∗ N (notations du LemmeP4). Alors la famille {fx | x ∈ N ∗ N} vérifie fe = 1, fα = χ(u), fβ = χ(u) et fx fy = x=ag,y=gb fab . On veut montrer: – le Théorème 1 (i): pour G = Au (F ), les fx sont exactement les caractères des représentations irréductibles de Au (F ). – le Théorème 2: si les fx sont les caractères des représentations irréductibles de G, alors Gp ∼sim Au (F ) pour une certaine matrice F . Il est commode de considérer, pour tout n ∈ N et F ∈ GL(n, C) et pour tout groupe quantique compact matriciel (G, u) avec u, F uF −1 unitaires, le diagramme suivant: N∗N

N∗N

∩

∩

% 9u

(A, +, )

G −→

τ↓

(?)

C

C −→

C < X, X ∗ >= (A, +, ·)

J

−→

P ↓ B(l2 (N ∗ N))

τ

0 −→

Au (F )

9

Id

↓8 Gp ↓h C

où: – τ0 , J, P ont déjà e´ té définies et τ (f ) = f (e) = coefficient de e dans f . La commutation du carré est e´ vidente. – Gp est la version pleine de G (voir Rappels) et 8 est la surjection canonique, définie par la propriété universelle de Au (F ). – 9G (resp. 9u ) est l’unique ∗-morphisme (voir Lemme 3 (iii)) qui envoie α sur le caractère de la représentation fondamentale de Gp (resp. de Au (F )). La commutation du triangle est e´ vidente. – h est la mesure de Haar de G. Notons que les inclusions (voir Notations du début) N ∗ N ⊂ A ne commutent pas avec J. Proposition 3. Soit (G, u) un groupe quantique compact matriciel avec u et F uF −1 unitaires. Alors les assertions suivantes sont e´ quivalentes: 1) Les représentations irr´P eductibles de G sont indexées par N ∗ N, avec re = 1, rα = u, rβ = u et rx ⊗ ry = x=ag,y=gb rab . 2) Le diagramme (?) commute. 3) χ(u)/2 est une variable circulaire dans (G, h). 4) Tous les ∗-moments de χ(u)/2 ∈ (G, h) sont plus petits que les ∗-moments d’une variable circulaire, i.e. µχ(u)/2 (M ) ≤ µc (M ) pour tout monome non commutatif M (où µc est la ∗-distribution de la variable circulaire). De plus, si ces conditions sont vérifiées, alors:


155

5) 8 : Au (F ) → Gp est un isomorphisme. Note. On verra dans la section suivante que la condition 5) est en fait equivalente a` 1)-4). Démonstration. (1 ⇒ 2) Il est clair que N ∗ N est un système orthonormal dans ((A, +, ), τ ). Si 1) est vraie, alors 9G (N ∗ N) = {χ(rx ) | x ∈ N ∗ N} est un système orthonormal dans (Gs , h), d’où la commutativité de (?). (2 ⇒ 3) Remarquons que (?) commute ⇐⇒ h9G J = τ0 P . En identifiant (C < X, X ∗ >, +, ·) = (A, +, ·) on a (voir Rappel G): – la ∗-distribution de χ(u) ∈ (G, h) est la fonctionnelle h9G J. – la ∗-distribution de S + T ∗ ∈ (B(l2 (N ∗ N)), τ0 ) est la fonctionnelle τ0 P . D’autre part, les identifications du début de cette section montrent que (S + T ∗ )/2 a la même ∗-distribution que la variable (l(h1 ) + l(h2 )∗ )/2 sur l’espace de Fock plein, qui est l’exemple standard de variable circulaire (voir 1ere section de [V]). (3 ⇒ 4) est trivial. Montrons (4 ⇒ 1). Toujours en identifiant (C < X, X ∗ > , +, ·) = (A, +, ·), les monômes non-commutatifs en X, X ∗ correspondent aux e´ léments de N∗N ⊂ A. Donc l’hypothèse sur les ∗-moments de χ(u)/2 se traduit tout simplement par: (i) h9G J ≤ τ J sur N ∗ N. On démontre par récurrence sur n ≥ 0 que pour tout z ∈ N ∗ N de longeur n, 9G (z) est le caractère d’une représentation irréductible rz de G. Pour n = 0 on a 9G (e) = 1, qui est le caractère de la représentation triviale. Supposons donc c’est vrai pour n ≥ 0 et soit x ∈ N ∗ N de longeur n + 1. Le point (ii) du Lemme 3 implique J(x) = x + z avec z ∈ En . Notons AN ⊂ A l’ensemble des fonctions f telles que f (x) ∈ N pour tout x ∈ N ∗ N. Alors J(α), J(β) ∈ AN , donc par multiplicativité, J(N ∗ N) ⊂ AN . En particulier, J(x) ∈ AN . Il en résulte qu’il existe des entiers positifs m(z) tels que: X J(x) = x + m(z)z. z∈N ∗N,l(z)≤n

Calculons a` l’aide de cette formule h9G J(xx) et τ J(xx): a) Il est clair que si a, b ∈ N ∗ N, alors τ (a b) = δa,b . On obtient donc: X X X m(z)z) (x + m(z)z)) = 1 + m(z)2 . τ J(xx) = τ ((x + b) Par l’hypothèse de récurrence et par le Lemme 4, 9G (x) est P le caractère d’une représentation rx de G. Donc 9G J(x) est le caractère de rx + z∈N ∗N,l(z)≤n m(z)rz . Par les formules d’orthogonalité des caractères on a: X h9G J(xx) ≥ h(χ(rx )χ(rx )∗ ) + m(z)2 . En utilisant (i), a) et b) on conclut que rx est irréductible, ce qui termine la récurrence. Le fait que les rx ainsi construites soient distinctes résulte de (i). En effet, N ∗ N e´ tant une base orthonormée de ((A, +, ), τ ), on obtient que pour tous les x, y ∈ N ∗ N, x 6= y on a τ (x y) = 0, donc que: h(χ(rx ⊗ ry )) = h9G J(xy) ≤ τ J(xy) = τ (x y) = 0.

156

T. Banica

(1 ⇒ 5) On montre par récurrence sur n ≥ 0 que pour tout x ∈ N ∗ N de longeur n, 9u (x) est le caractère d’une représentation irréductible px de Au (F ). Pour n = 0 c’est trivial - 9u (e) est le caractère de la représentation triviale. Supposons-le pour n et soit x ∈ N ∗ N de longeur n + 1. Par le Lemme 4, 9u (x) est le caractère d’une représentation px . Comme 8 envoie px 7→ rx , qui est irréductible, px est aussi irréductible, donc on a fini. La surjection 8 envoie donc les (classes de) représentations irréductibles de Au (F ) sur les (classes de) représentations irréductibles de Gp . On conclut en utilisant un argument standard (Th. 5.7 de [Wo2]): soit {ci } une base de Au (F )s formée des coefficients des représentations irréductibles ; alors {8(ci )} est une base de Gs formée des coefficients des représentations irréductibles. Il en résulte que 8 : Au (F ) → Gp est bijective. 4. Représentations de Au (F ) Cette section est consacrée a` la démonstration des Théorèmes 1 et 2. On verra que la Proposition 3 implique facilement le Théorème 2, ainsi que (modulo un résultat de probabilités libres) le Théorème 1 pour des matrices vérifiant F F ∈ RIn . Le Théorème 1 sera ensuite démontré pour des matrices F ∈ GL(n, C) arbitraires, en utilisant les résultats obtenus pour F = In . Démonstration du Théorème 2. On se donne un groupe quantique compact matriciel (G, u) avec u ∈ Mn (G) tel que ses représentations irréductibles sont indexées par N ∗ N, avec re = 1, rα = u, rβ = u et telles que ∀ x, y, X rab . r x ⊗ ry = x=ag,y=gb

On peut supposer (modulo la similarité) que u est unitaire. Il existe donc F ∈ GL(n, C) telle que F uF −1 soit unitaire, et le théorème résulte alors de l’implication (1 ⇒ 5) de la Proposition 3. Démonstration du Théorème 1 dans le cas F F ∈ RIn . Soit n ∈ N et F ∈ GL(n) telle que F F ∈ RIn . On note u la représentation fondamentale de Au (F ), v la représentation fondamentale de Ao (F ) et z ∈ C(T) la fonction x 7→ x. Soit G la sous-C∗ -algèbre de C(T) ∗red Ao (F ) engendrée par les coefficients de la matrice zv = (zvij )ij . Alors: • χ(v)/2 est semicirculaire par rapport a` la mesure de Haar de Ao (F ) (cf. Prop. 1). • z est un Haar-unitaire dans C(T) muni de sa mesure de Haar (évident). • χ(v)/2 et z sont ∗-libres dans C(T) ∗red Ao (F ) par rapport a sa mesure de Haar (cf. Rappel F ). Ces trois conditions impliquent que le produit zχ(v)/2 est circulaire dans C(T) ∗red Ao (F ) (ceci est une version connue du théorème de Voiculescu [V] de décomposition polaire des variables circulaires, voir par exemple [B1] ou [NS]). Mais zχ(v) = χ(zv) est le caractère de la représentation fondamentale de (G, zv), donc on peut appliquer la Proposition 3: (3 ⇒ 5) implique Au (F ) = Gp , donc que Au (F )red = G, d’où le point (iv) du Théorème 1. (3 ⇒ 1) classifie les représentations de Gp = Au (F ), d’où (i,ii,iii).


157

Remarque. On aurait pu démontrer le Théorème 1 dans le cas F F ∈ RIn de la manière suivante. On considère le groupe quantique compact G ⊂ C(T)∗red Ao (F ) engendré par les coefficients de zv, v e´ tant la représentation fondamentale de Ao (F ). En combinant la théorie des représentations de Ao (F ) de [B2] avec la théorie des représentations des produits libres de [W2], on peut classifier les représentations de G. On applique ensuite le Théorème 2, pour voir que Au (F )red = G. Notons que cette démonstration ne fournit aucun outil pour aborder le cas général. Démonstration du Théorème 1 dans le cas général. Soit n ∈ N et F ∈ GL(n, C) quelconque. Notons u la représentation fondamentale de Au (F ). On doit estimer les ∗-moments du caractère χ(u), i.e. les nombres: h(χ(u)a1 χ(u)∗b1 χ(u)a2 ...) = dim(M or(1, u⊗a1 ⊗ u⊗b1 ⊗ u⊗a2 ⊗ ...)). En utilisant le fait que N ∗ N est un mono¨ıde libre: – on associe a` tout espace de Hilbert H une famille d’espaces de Hilbert {Hx }x∈N ∗N de la manière suivante: He = C, Hα = H, Hβ = H (l’espace conjugué de H), et Hab = Ha ⊗ Hb , ∀ a, b ∈ N ∗ N. – on définit une famille {ux }x∈N ∗N de représentations unitaires de Au (F ) de la manière n suivante: ue = 1, uα = u, uβ = F uF −1 (agissant sur C ) et uab = ua ⊗ ub , ∀ a, b ∈ N ∗ N. Notons que ux ∈ B(Cnx ) ⊗ Au (F ) pour tout x. Les ∗-moments de χ(u) sont ainsi les nombres {dim(M or(1, uk )) | k ∈ N ∗ N}. On va les estimer en appliquant le Th. 1.3. de [Wo3]: Lemme 5. Soit n ∈ N et F ∈ GL(n, C). Soit H = Cn , avec la base orthonormale {ei }. P On considère les applications linéaires E1 : He → Hαβ définie par 1 7→ F (ei ) ⊗ ei P −1 et E2 : He → Hβα définie par 1 7→ ei ⊗ F (ei ). (i) E1 ∈ M or(1, uαβ ) et E2 ∈ M or(1, uβα ). (ii) (E2∗ ⊗ IdHβ )(IdHβ ⊗ E1 ) ∈ CIdHβ et (E1∗ ⊗ IdHα )(IdHα ⊗ E2 ) ∈ CIdHα . (iii) Pour r, s ∈ N ∗ N, on définit les ensembles M or(r, s) ⊂ B(Hr , Hs ) des combinaisons linéaires de produits (composables) d’applications de la forme IdHk ou IdHk ⊗E1 ⊗IdHp ou IdHk ⊗E2 ⊗IdHp ou IdHk ⊗E1∗ ⊗IdHp ou IdHk ⊗E2∗ ⊗IdHp . Alors la W ∗ -catégorie concrète mono¨ıdale des représentations de Au (F ) est la complétion (dans le sens de [Wo3]) de la W ∗ -catégorie concrète mono¨ıdale, Z(F ) := {N ∗ N, ·, {Hr }r∈N ∗N , {M or(r, s)}r,s∈N ∗N }. Démonstration. (i) Si matrice unitaire a` coefficients dans une Pw ∈ Mn (B)n est une ∗-algèbre B et si ζ = ei ⊗ ei ∈ C ⊗ Cn alors X X ∗ (w13 w23 )(ζ ⊗ 1) = ei ⊗ ek ⊗ wia wka = ei ⊗ ek ⊗ δik 1 = (ζ ⊗ 1). En particulier: – pour B := Au (F ) et w := u cela montre que (1 ⊗ F )ζ = fixe de (1 ⊗ F )(u ⊗ u)(1 ⊗ F )−1 = u ⊗ (F uF −1 ).

P

ei ⊗ F ei est un vecteur

158

T. Banica −1

– pour B := Au (F ) et w := F uF −1 cela montre que (1 ⊗ F )ζ = un vecteur fixe de (1 ⊗ F )−1 (w ⊗ w)(1 ⊗ F ) = (F uF −1 ) ⊗ u.

P

ei ⊗ F

−1

ei est

Par définition des ux on a uαβ = (idCn ⊗ φ ⊗ idAu (F ) )(u ⊗ (F uF −1 )) et uβα = n (φ ⊗ idCn ⊗ idAu (F ) )((F uF −1 ) ⊗ u), où φ : B(Cn ) → B(C ) est l’isomorphisme canonique, et (i) en résulte. (ii) est un calcul facile. Pour (iii), notons que Z(F ) est par construction une W ∗ -catégorie mono¨ıdale concrète. Soit j : Hα → Hβ l’application antilinéaire définie par ei → F (ei ). Alors (avec les notations de [Wo3], p. 39) on a tj = E1 et (tj )∗ = tj −1 = E2 , donc tj ∈ M or(e, αβ) et tj ∈ M or(1, βα), donc α = β dans Z(F ). Par [Wo3], la paire universelle Z(F )-admissible est un groupe quantique compact (G, v). Le point (i) montre que (Au (F ), u) est une paire Z(F )-admissible, donc qu’on a un C∗ -morphisme f : G → Au (F ) tel que (id ⊗ f )(v) = u. D’autre part la propriété universelle de Au (F ) permet de construire un C∗ -morphisme g : Au (F ) → G tel que (id ⊗ g)(u) = v. Il en résulte que (G, v) = (Au (F ), u). La démonstration du Lemme suivant est similaire a` celle du Lemme 2. Lemme 6. (i) On note I(p) = IdHp et Vi (p, q) = I(p) ⊗ Ei ⊗ I(q) pour i = 1, 2. Alors tout morphisme de Z(F ) est une combinaison linéaire d’applications de la forme I(.) ou de la forme V· (., .) ◦ ... ◦ V· (., .) ◦ V· (., .)∗ ◦ ... ◦ V· (., .)∗ . (ii) L’ensemble des applications de la forme M ⊗ I(α) ⊗ N avec M ∈ M or(e, x), N ∈ M or(e, y) et xαy = k engendre M or(α, k). L’ensemble des applications de la forme M ⊗ I(β) ⊗ N avec M ∈ M or(e, x), N ∈ M or(e, y) et xβy = k engendre M or(β, k). (iii) L’ensemble des applications de la forme (I(α) ⊗ M ⊗ I(β) ⊗ N ) ◦ E1 avec M ∈ M or(e, x), N ∈ M or(e, y) et αxβy = k, ou des applications de la forme (I(β) ⊗ M ⊗ I(α) ⊗ N ) ◦ E2 avec M ∈ M or(e, x), N ∈ M or(e, y) et βxαy = k engendre M or(e, k). Proposition 4. On définit pour tout k ∈ N ∗ N la partie Zk (F ) ⊂ M or(e, k) par Ze (F ) = 1, Zα (F ) = Zβ (F ) = ∅ et (par récurrence) par Zk (F ) = ∪k=αxβy {(I(α) ⊗ M ⊗ I(β) ⊗ N ) ◦ E1 | M ∈ Zx (F ), N ∈ Zy (F )} si k commence par α et Zk (F ) = ∪k=βxαy {(I(β) ⊗ M ⊗ I(α) ⊗ N ) ◦ E2 | M ∈ Zx (F ), N ∈ Zy (F )} si k commence par β. Alors: (i) Zk (F ) engendre M or(e, k). (ii) Zk (In ) est une base de M or(e, k) (pour F = In ). Démonstration. (i) résulte du point (iii) du Lemme 6. Rappelons que Hk est un certain produit tensoriel entre H et son conjugué. Soit ψ : H → H l’isometrie donnée par ei 7→ ei . En identifiant H avec H a` l’aide de ψ, on obtient une isometrie ψk : Hk → H ⊗l(k) , où l(k) est la longeur du mot k. En regardant les définitions, il est clair que ψk envoie l’ensemble {X(1) | X ∈ Zk (In )} sur une partie de l’ensemble {X(1) | X ∈ Wl(k) (In )}. En utilisant la Proposition 2, l’ensemble {X(1) | X ∈ Wl(k) (In )} est formé de vecteurs linéairement indépendents de H ⊗l(k) . Ceci implique que {X(1) | X ∈ Zk (In )}


159

est formé de vecteurs linéairement indépendents, donc que Zk (In ) est une base de M or(e, k). Fin de la démonstration du Théorème 1. Rappelons que n ∈ N et F ∈ GL(n, C) Ck , où les nomsont arbitraires. Par construction de Zk (F ) on a Card(Zk (F )) = P bres {Cx }x∈N ∗N sont définis par Ce = 1, Cα = Cβ = 0 et Ck = k=αxβy Cx Cy + P k=βxαy Cx Cy pour k ∈ N ∗ N. Notons u(F ) la représentation fondamentale de Au (F ). En utilisant les points (iii) du Lemme 5 et (i) de la Proposition 4 on a pour tout k ∈ N ∗ N: dim(M or(1, u(F )k )) = dim(M or(e, k)) ≤ Card(Zk (F )) = Ck . La Proposition 4 (ii) dit que pour F = In on a e´ galité. Donc: dim(M or(1, u(F )k )) ≤ dim(M or(1, u(In )k )). Mais les nombres dim(M or(1, u(F )k )) sont les ∗-moments du caractère de u(F ) et les nombres dim(M or(1, u(In )k )) sont les ∗-moments du caractère de u(In ), donc les ∗-moments d’une variable circulaire (par le point (iii) du Théorème 1 appliqué a F = In , cas déjà résolu). On conclut a` l’aide de la Proposition 3. Remarque. Il est clair maintenant que pour toute matrice F , {X(1) | X ∈ Zk (F )} est une base de l’ensemble des vecteurs fixes de la représentation u(F )k de Au (F ). Si F F ∈ RIn , l’application ψk qu’on a construit dans la démonstration de la Proposition 4 permet d’identifier ces vecteurs comme une partie de la base des vecteurs fixes de la représentation u⊗l(k) de Ao (F ) de la fin de la Sect. 2 (en fait, cette identification est celle qui correspond a` la flèche canonique Au (F ) → Ao (F )). Il est facile a` détérminer les partitions non-croisées en parties a` deux e´ léments de {1...l(k)} qui correspondent aux vecteurs fixes de la représentation uk de Au (F ). En fait, on peut obtenir une base de l’ensemble des vecteurs fixes de la représentation uk de Au (F ) en ecrivant `k= `x1 ...xl(k) avec xi ∈ {α, β} et en associant a` toute partition noncroisée P = P1 ... Pl(k)/2 de {1, ..., k} avec des parties de la forme Ps = {is , js } telles que (xis , xjs ) soit e´ gale a` (α, β) ou a` (β, α) pour tout s le vecteur suivant: X

v=

1≤s1 ...sl(k) ≤l(k)

Fsj1 si1 ...Fsjl(k)/2 sil(k)/2 es1 ⊗ ... ⊗ esl(k) .

5. Le cas n = 2 Rappelons que pour µ ∈ [−1, 1] − {0} le groupe quantique compact matriciel Sµ U(2) est défini avec générateurs α, γ et relations α∗ α + γ ∗ γ = 1, αα∗ + µ2 γγ ∗ = 1, γγ ∗ = γ ∗ γ, µγα = αγ, µγ ∗ α = αγ ∗ . On a S1 U(2) = C(SU(2)) (voir [Wo1] et les formules (1.33) de [Wo2]). Proposition 5. Ao

0 1 −µ−1 0

= Sµ U(2).

160

T. Banica

u11 u12 0 1 la représentation fondamentale de Ao . u21 u22 −µ−1 0 0 1 sont uu∗ = u∗ u = 1 et: Les relations qui définissent Ao −µ−1 0 ∗ u11 u12 0 1 0 −µ u11 u∗12 = . −1 ∗ ∗ u21 u22 0 u21 u22 −µ 1 0 −µu∗21 u∗22 u11 u12 = . Si α := u11 et γ := u21 , alors En multipliant, −1 ∗ ∗ −µ u12 u11 u21 u22 0 1 α −µγ ∗ et les relations qui définissent Ao sont celles données u= ∗ −1 γ α 0 −µ ∗ ∗ par uu = u u = 1, i.e.: ∗ ∗ γ∗ γ∗ α −µγ ∗ α 1 0 α α −µγ ∗ = = . ∗ ∗ γ α −µγ α −µγ α γ α 0 1 Démonstration. Notons

En calculant on obtient les relations qui définissent Sµ U(2).

Proposition 6. Pour tous les n ∈ N, F ∈ GL(n, C), λ ∈ C∗ et V, W ∈ U(n) on a les similarités suivantes: (i) Ao (F ) ∼sim Ao (λV F V t ) (si F vérifie F F ∈ RIn ). (ii) Au (F ) ∼sim Au (λV F W ). Démonstration. (i) Notons u (resp. v) la représentation fondamentale de Ao (F ) (resp. Ao (λV F V t )). Alors v = (λV F V t )v(λV F V t )−1 est unitaire =⇒ v = V F V t vV F −1 V ∗ est unitaire . Il en résulte que V ∗ vV = F V ∗ vV F −1 est unitaire, donc on peut définir f : Ao (F ) → Ao (λV F V t ) par la propriété universelle et par (Id ⊗ f )(u) = V ∗ vV . Par les mêmes arguments, il existe g : Ao (λV F V t ) → Ao (F ) avec (Id ⊗ g)(v) = V uV ∗ . Donc f et g sont des bijections inverses, donc f est un isomorphisme, donc une similarité. (ii) Notons u (resp. v) la représentation fondamentale de Au (F ) (resp. Au (λV F W )). Alors v et (λV F W )v(λV F W )−1 sont unitaires, donc W vW t et F W vW ∗ F −1 sont unitaires, donc W vW t et F W vW t F −1 sont unitaires. On peut donc définir f : Au (F ) → Au (λV F W ) par la propriété universelle et par (Id ⊗ f )(u) = W vW t . Par les mêmes arguments, il existe g : Au (λV F W ) → Au (F ) avec (Id ⊗ g)(v) = W t uW . Alors f et g sont des bijections inverses. Proposition 7. Pour tout µ ∈ [−1, 1] − {0} soit Gµ la C∗ -algèbre engendrée dans C(T) ∗red Sµ U(2) par les coefficients de la matrice zuµ , uµ e´ tant la représentation fondamentale de Sµ U(2). Alors on a (modulo la similarité) les e´ galités suivantes: (i) {Ao (F ) | F ∈ GL(2, C) } = {Sµ U(2) | µ ∈ [−1, 1] − {0} }, (ii) {Au (F )red | F ∈ GL(2, C) } = {Gµ | µ ∈ [−1, 1] − {0} }. Démonstration. (i) Il suffit de montrer que pour chaque F ∈ GL(2, C) telle que F F ∈ R, t il existe V ∈ GL(2, C) multiple scalaire d’une matrice unitaire tel queλV F V = 0 1 x y pour un certain µ (cf. Propositions 5 et 6). Soit donc F = . Si −µ−1 0 z t α 1 , qui est un multiple x 6= 0, soit α une solution de α2 x + α(y + z) + t = 0 et V = −1 α t er 2 1 coefficient α x + α(y 0, scalaire d’une matrice unitaire ; alors V F V a le + z) + t = yt yz 0 y . donc on peut supposer x = 0. Dans le cas F = on a F F = tz zy + tt z t


161

0 1 Comme F F ∈ R, t = 0 et yz = zy, donc F = y avec k = z/y ∈ R∗ . Si | k |≥ 1 k 0 0 −1 on a fini. Sinon, on pose V = . 1 0 (ii) Il suffit de montrer que pour chaque F ∈ GL(2, C) il existent V, W ∈ U(2) telles (par que V F W V F W ∈ R (cf. point (i) et Proposition 6). En effet, on peut supposer x 0 décomposition polaire) que F > 0; en la diagonalisant, on peut supposer F = , 0 y 0 −1 avec x, y > 0, et dans ce cas on pose V = et W = 1. 1 0 a b , (z) et u les représentations fondamentales de C(SU(2)), Lemme 7. Notons −b a C(T) et Au (I2 ) respectivement. Alors il existe un plongement: za zb Au (I2 )red ,→ C(T) ∗red C(SU(2)) , u 7→ . −zb za 0 1 Démonstration. On a Ao = S1 U(2) = C(SU(2)) (Prop. 5), Au (I2 ) = −1 0 √ 0 1 Au (car Au (F ) = Au ( F ∗ F ) pour toute F ), et par le Théorème 1 on a un −1 0 0 1 0 1 plongement Au ,→ C(T) ∗red Ao . −1 0 −1 0 Théorème 5. Les coefficients de la représentation rβα = u⊗u−1 de Au (I2 ) commutent entre eux et engendrent une C∗ -algèbre e´ gale a` C(SO(3)). De même pour les coefficients de rαβ = u ⊗ u − 1. Démonstration. En utilisant le Lemme 7 on voit que la représentation u⊗u de Au (I2 )red est la representation

za zb a b za zb a b ⊗ = ⊗ . −zb za −b a −zb za −b a

de C(T) ∗red C(SU(2)). Il en résulte que la représentation u ⊗ u − 1 de Au (I2 )red correspond a` la représentation de dimension 3 de SU(2), i.e. a` la représentation fondamentale de SO(3). Donc les coefficients de la représentation rβα = u ⊗ u − 1 de Au (I2 )red commutent entre eux et engendrent une C∗ -algèbre commutative e´ gale a` C(SO(3)). On conclut en remarquant que C(SO(3)) est la C∗ -algèbre enveloppante de C(SO(3))s . On va montrer maintenant que l’algèbre de von Neumann Au (I2 )"red (le bicommutant de l’image de Au (I2 ) par sa représentation régulière gauche sur l2 (Au (I2 ), h), h e´ tant la mesure de Haar) est isomorphe au facteur W ∗ (F2 ) associé au groupe libre a deux générateurs F2 . Rappelons que si (M, φ) est une ∗-algèbre munie d’une forme linéaire unitale et si A ⊂ M est une ∗-algèbre unifère, alors un e´ lément x ∈ M est dit ∗-libre par rapport a` A si la ∗-algèbre engendrée par x dans M est libre par rapport a` A. On va utiliser le lemme technique suivant. Lemme 8. Soit (M, φ) une ∗-algèbre munie d’une trace, 1 ∈ A ⊂ M une sous-∗algèbre, d ∈ A un unitaire tel que φ(d) = φ(d∗ ) = 0 et u ∈ M un Haar-unitaire ∗-libre par rapport a` A. Alors ud est un Haar-unitaire ∗-libre par rapport a` A.

162

T. Banica

Démonstration. ud est clairement un Haar-unitaire. Pour montrer que ud est ∗-libre par rapport a` A il suffit de vérifier que si ai ∈ Z∗ et fi ∈ A ∩ ker(φ) (1 ≤ i ≤ n), alors P := (ud)a1 f1 (ud)a2 f2 ... est dans ker(φ). P est un produit de termes de la forme u ou u∗ alternant avec des termes de la forme ∗ d, d , fi , dfi , fi d∗ ou dfi d∗ . Remarquons que φ(d) = φ(d∗ ) = φ(fi ) = φ(dfi d∗ ) = 0 et que les termes de la forme dfi ou fi d∗ apparaissent dans P entre u et u ou entre u∗ et u∗ . On ecrit chaque dfi sous la forme [φ(dfi )1] + [dfi − φ(dfi )1] et chaque fi d∗ sous la forme [φ(fi d∗ )1] + [fi d∗ − φ(fi d∗ )]. En developpant P , on obtient une combinaison linéaire de termes, chaqun e´ tant un produit d’éléments de la forme uk avec k ∈ Z∗ alternant avec des e´ léments de A ∩ ker(φ). Il en résulte que φ(P ) = 0. Théorème 6. Au (I2 )"red = W ∗ (F2 ). Démonstration. Par le Lemme 7 on a un plongement: Au (I2 )"red ,→ L∞ (T) ∗ L∞ (SU(2)) , u 7→

za zb . −zb za

Notons d = sgn ◦ (a + a), i.e. la composée de a + a : SU(2) → R avec la fonction signe sgn : R → {−1, 0, 1}. Alors d ∈ L∞ (SU(2)) est un unitaire tel que d2 = 1. La partie polaire de za + za est zd, donc zd ∈ Au (I2 )"red . Il en résulte que dz ∗ ∈ Au (I2 )"red , et en multipliant a gauche par dz ∗ les générateurs za, zb, za, zb de Au (I2 )"red , on obtient que Au (I2 )"red est engendrée par zd, da, db, da, db. En utilisant le Lemme 8 on obtient que W ∗ (da, db, da, db) et W ∗ (zd) sont des sous-algèbres abéliennes diffuses libres qui engendrent Au (I2 )"red , ce qui implique Au (I2 )"red = W ∗ (F2 ) (voir Th. 2.6.2 de [VDN]). Remarque. Soit Unnc la C∗ -algèbre universelle engendrée par les coefficients d’une nc matrice de taille n unitaire u. En combinant la formule Un,red ⊗ Mn = Mn ∗red C(T) de McClanahan [MC1] avec la formule Mn ∗ W ∗ (Fs ) = W ∗ (Fn2 s ) ⊗ Mn de Dykema nc," [D] on obtient U2,red = W ∗ (F4 ). 6. Remarques sur la représentation adjointe La représentation adjointe d’un groupe discret 0 est un outil important pour traiter plusieurs problèmes liés aux C∗ -algèbres C∗ (0) et C∗red (0): moyennabilité de 0, nucléarité de C∗red (0), simplicité de C∗red (0) etc. On va se poser les mêmes questions pour les C∗ -algèbres Au (F ) et Au (F )red , qui, du point de vue de la dualité de Pontryagin, sont les “C∗ (Fn ) et C∗red (Fn ) quantiques”. Si G est un groupe quantique compact matriciel tel que sa mesure de Haar soit une trace, on peut définir sur Gp les représentations régulières gauche λ et droite ρ par λ(x)(y) = xy et ρ(x)(y) = yκ(x), et la représentation adjointe comme e´ tant la composée: δ

λ⊗ρ

Gp −→Gp ⊗max Gp −→B(l2 (Gred )). Le cas général est plus subtil, et on va utiliser le théorème suivant de Woronowicz (voir aussi les Rappels 5.1 de [BS]). Théorème 7. ([Wo2]) A tout groupe quantique compact matriciel G on peut associer une (unique) famille de caractères (fz )z∈C de Gs qui vérifient les formules suivantes:


(i) (i) (i) (i)

163

h(ab) = h(b(f1 ∗ a ∗ f1 )), pour tous les a ∈ Gs et b ∈ G. κ2 (a) = f−1 ∗ a ∗ f1 , pour tout a ∈ Gs . f0 = e (la coünité de Gs ) et fz+t = fz ∗ ft , pour tous les z, t ∈ C. fz ∗ κ(a) = κ(a ∗ f−z ) et κ(a) ∗ fz = κ(f−z ∗ a) pour tous les a ∈ Gs et z ∈ C.

b et si F est l’unique De plus, les fz sont définis de la manière suivante: si u ∈ G ∗ 2 matrice positive qui entrelace u et (I ⊗ κ)(u ) = (I ⊗ κ )(u), normalisée telle que T r(F ) = T r(F −1 ) (cf. Th. 5.4 de [Wo2]), alors: (Id ⊗ fz )(u) = F z . Notations. L(Gred ) est l’algèbre des opérateurs bornés Gred → Gred . L’application b → Gs sera notée χ (voir Rappels B et C). caractère G Corollaire 2. Soit G un groupe quantique compact matriciel et a, b, c, d ∈ C. L’application x 7→ fa ∗ x ∗ fb est un automorphisme de Gs . L’application λa,b : Gs → L(Gred ) donnée par λa,b (x)(y) = (fa ∗ x ∗ fb )y est un morphisme de C-algèbres unifères. (ii) L’application x 7→ fc ∗ κ(x) ∗ fd est un antiautomorphisme de Gs . L’application ρc,d : Gs → L(Gred ) donnée par ρc,d (x)(y) = y(fc ∗ κ(x) ∗ fd ) est un morphisme de C-algèbres unifères.

(i)

δ

λa,b ⊗ρc,d

(iii) L’application Gs −→Gs ⊗ Gs −→ L(Gred ) est un morphisme de C-algèbres unifères. L’application définie au point (iii) permet d’associer a` tous les a, b, c, d ∈ C une b → L(Gred ). L’intérêt du Lemme suivant apparaˆıtra dans les application ad : G démonstrations de simplicité, quand on va utiliser plusieures applications ad de ce type. Lemme 9. Soit G un groupe quantique compact matriciel et a, b, c, d ∈ R. Soit: b → L(Gred ). ad = (λa,b ⊗ ρc,d ) ◦ δ ◦ χ : G b et soit F la matrice définie dans le Théorème 7. Alors: Soit u ∈ G P (i) ad(u)(z) = i,k (F b uF a )ik z(F −c u∗ F −d )ki

P (ii) Si a + c = b + d = 0, alors ad(u) est une application de la forme z 7→ ak za∗k avec ak ∈ Gs (somme finie). (iii) Si a = c, alors ad(u)(1) = K · 1 avec K ∈ R+∗ . (iv) Soient s, t ∈ R et supposons que t + b − d = 0 ou que s + a − c + 1 = 0. Alors il existe M ∈ R+∗ tel que ad(u)/M préserve tout e´ tat φ de Gred tel que φ(xy) = φ(y(fs ∗ x ∗ ft )), ∀ x, y ∈ Gs . Démonstration. (i) En utilisant le Théorème 7, on a X fa ∗ uij ∗ fb = (fb ⊗ Id ⊗ fa )( uis ⊗ usk ⊗ ukj ) = (F b uF a )ij . Par Th. 7 (iv) et en utilisant (I ⊗ κ)(u) = u∗ on obtient fc ∗ κ(uij ) ∗ fd = κ(f−d ∗ uij ∗ f−c ) = (F −c u∗ F −d )ij .

164

T. Banica

On a donc (λa,b ⊗ ρc,d )(uik ⊗ ulj )(z) = (F b uF a )ik z(F −c u∗ F −d )lj , d’où: X (F b uF a )ik z(F −c u∗ F −d )kj . (λa,b ⊗ ρc,d )δ(uij )(z) = k

(ii) On a (F −c u∗ F −d )ki = [(F −c u∗ F −d )∗ ]∗ik = (F −d uF −c )∗ik (rappelons que F > 0), donc si a + c = b + d = 0, alors: X (F b uF a )ik z(F b uF a )∗ik . ad(u) : z 7→ i,k

P (iii) Si a = c alors ad(u)(1) = i,k (F b uF a )ik 1(F −a u∗ F −d )ki = T r(F b−d )1. P (iv) On a φ(ad(u)(z)) = φ[ (fa ∗ uik ∗ fb )z(fc ∗ κ(uki ) ∗ fd )] = φ(zM ), où X (fc ∗ κ(uki ) ∗ fd )(fs+a ∗ uik ∗ ft+b ). M := i,k

Si t + b − d = 0 alors en utilisant les formules de la démonstration de (i) on a X (F −c u∗ F −d )ki (F t+b uF s+a )ik = T r(F s+a−c ) > 0. M= i,k

Supposons maintenant que s + a − c + 1 = 0. En utilisant le point (ii) du Théorème 7 on a uik = f1 ∗ κ2 (uik ) ∗ f−1 . En utilisant cette formule, ainsi que le point (iv) du Théorème 7 et les formules de la démonstration de (i) on obtient P M = i,k (fc ∗ κ(uki ) ∗ fd )(fs+a+1 ∗ κ2 (uik ) ∗ ft+b−1 ) = P = κ[ i,k (f−t−b+1 ∗ κ(uik ) ∗ f−s−a−1 )(f−d ∗ uki ∗ f−c )] = = κ[(I ⊗ T r)(F t+b−1 u∗ F s+a+1−c uF −d )] = T r(F t+b−1−d ) > 0.

7. La Propriété de Powers Soit (A, τ ) une C∗ -algèbre unifère munie d’une trace fidèle. Haagerup et Zsido ont montré [HZ] que A est simple a` trace unique si et seulement si elle a la propriété de Dixmier: Pour tout a ∈ A, l’enveloppe convexe fermée de {uau∗ | u unitaire } contient un multiple scalaire de 1A . La simplicité de C∗red (Fn ) a e´ té démontrée en [P], et la méthode de Powers a e´ té e´ tendue par de nombreux auteurs aux produits libres [A, MC2] ou aux C∗ -algèbres de groupes discrets [H, HS, BCH]. Ces démonstrations de simplicité utilisent des estimations techniques dans B(l2 (A, τ )), qui “bougent” vers 0 tout e´ lément de trace 0, en utilisant des sommes d’automorphismes intérieurs, i.e. qui prouvent la propriété de Dixmier. Dans le cas des C∗ -algèbres réduites de groupes discrets 0, l’estimation dans l2 (0) est obtenue en utilisant des propriétés combinatoires, géométriques etc. de 0. Une d’entre elles est la propriété de Powers, définie dans [H]: Pour tout ensemble fini F ⊂ 0 − {1}, il existe des e´ léments g1 , g2 , g3 ∈ 0 et une ` partition 0 = D E telles que F · D ∩ D = ∅ et gs · E ∩ gk · E = ∅, ∀ s 6= k.


165

On voit facilement que les groupes libres Fn ont la propriété de Powers. Cette propriété apparait dans beaucoup d’autres contextes - voir [H] - par exemple toute action fortement hyperbolique, minimale et fortement fidèle de 0 sur un espace de Hausdorff ` fournit une partition 0 = D E et des (en fait, une infinité de) e´ léments gi comme en haut. La preuve de la simplicité de C∗red (0) de [HS] a deux e´ tapes: P I. Si x ∈ l2 (F ) est hermitien de trace 0, alors || 1/3 ugs xu∗gs ||≤ 0.98 || x ||. II: C∗red (0) a la propriété de Dixmier, donc est simple a` trace unique. On va e´ tendre cette démonstration aux groupes quantiques compacts “de Powers”: b des Définition 3. Soit G un groupe quantique compact. On munit l’ensemble P (G) b avec l’involution A = {a | a ∈ A} et la multiplication ◦ définie par: parties de G b | ∃ a ∈ A, ∃ b ∈ B avec r ⊂ a ⊗ b}. A ◦ B = {r ∈ G b − {1}, il existe On dira que G a la propriété de Powers si pour toute partie finie F ⊂ G ` b et une partition G b = D E telles que F ◦ D ∩ D = ∅ et des e´ léments r1 , r2 , r3 ∈ G rs ◦ E ∩ rk ◦ E = ∅, ∀ s 6= k. Le but de cette dernière partie du papier est de montrer que les algèbres Au (F )red sont simples (et avec au plus une trace). En principe on doit résoudre trois questions: a) Etendre la démonstration de simplicité de [HS] aux groupes quantiques compacts de Powers ayant une mesure de Haar traciale. b) Etendre a) aux groupes quantiques compacts de Powers quelconques. c) Etendre b) a` Au (F ), qui n’a pas la Propriété de Powers - le Théorème 1 montre que \ la partie F = {rαβ , rβα } a la propriété rx ∈ F ◦ {rx }, pour toute 1 6= rx ∈ A u (F ). En fait, toutes les démonstrations de simplicité et de non-existence d’états KMS qu’on va donner seront basées sur la même estimation (Prop. 8). Remarquons que: – si la mesure de Haar de G est une trace, l’énoncé de la Prop. 8 se simplifie considèrablement. De même pour sa démonstration - on ne doit pas utiliser la 6eme section. ∗ ∗ (0) (e.g. Cred (F2 )), alors la Proposition 8 est le lemme technique – si de plus G = Cred de [HS], mais l’estimation qu’on obtient ici est plus forte √ 1 X 2 2 || || x || . ugs xu∗gs ||≤ 3 3 s=1,2,3

Notons que les méthodes qu’on developpe ici n’ont aucune chance de s’appliquer a` Ao (F ), car Ao (F )central est commutative. b l’ensemble des représentations Notation. Pour tout x ∈ Gs on note supp(x) ⊂ P (G) irréductibles qui ont des coefficients qui apparaissent dans x (rappelons que l’espace des coefficients de r ∈ B(Hr ) ⊗ Gs est G(r) := {(φ ⊗ Id)(r) | φ ∈ B(Hr )∗ } ; par [Wo2] on a Gs = ⊕r∈G bG(r)). Proposition 8. Soit (G, u) un groupe quantique compact matriciel réduit et soient s, t ∈ b telles que rl ◦ E ∩ rk ◦ E = ∅, ∀ l 6= k. b = D ` E une partition et r1 , r2 , r3 ∈ G R. Soit G Alors il existe une application linéaire unitale T : G → G telle que:

166

T. Banica

P a) il existe une famille finie {ai } d’éléments de Gs tels que T : z 7→ ai za∗i . b) T préserve les e´ tats φ ∈ G∗red vérifiant φ(xy) = φ(y(fs ∗ x ∗ ft )), ∀ x, y ∈ Gs . c) pour tout z = z ∗ ∈ Gs tel que supp(z) ◦ D ∩ D = ∅, on a || T (z) ||≤ 0.95 || z || et supp(T (z)) ⊂ ∪i ri ◦ supp(z) ◦ ri . Démonstration. Le Lemme 9 appliqué avec a = c = 0 et d = −b = t/2 fournit une b → L(G) (rappelons que Gred = G). Notons que le choix de certaine application ad : G a, b, c, d implique a + c = b + d = 0, a = c et t + b − d = 0, donc on peut appliquer les points (ii-iv) du Lemme 9 avec u := ri , pour i = 1, 2, 3. On obtient trois familles finies {ai,k }k d’éléments de Gs et six réels positifs non nuls Ki , Mi (i = 1, 2, 3) tels que pour tout i: P (i) ad(ri )(z) = k ai,k za∗i,k . (ii) ad(ri )(1) = Ki · 1. (iii) ad(ri )/Mi préserve φ. Comme φ(ad(ri )(1)) = Mi (par (iii)) et φ(ad(ri )(1)) = Ki (par (ii)), on a Ki = Mi pour tout i. Posons 1 ad(r1 ) ad(r2 ) ad(r3 ) + + . T := 3 M1 M2 M3 Il nous reste a` vérifier la condition (c). Notons h la mesure de Haar de G (qui est fidèle par hypothèse) et (H, π) la construction GNS associée a` (G, h). Pour i = 1, 2, 3 notons: X π(ai,k )P π(a∗i,k ). Ti0 : B(H) → B(H), P 7→ Mi−1 k

Soit T 0 = (T10 + T20 + T30 )/3. C’est une application complètement positive unitale. Soit p (resp. q) la projection dans H sur la fermeture de l’espace linéaire engendré par les b = D ` E et supp(z)◦D∩D = ∅, coefficients des représentations de D (resp. E). On a G d’où: p + q = 1 , pπ(z)p = 0. Si t 6= s ∈ {1, 2, 3}, alors rt ◦ rs ◦ E ∩ E = ∅: en effet, si r, p ∈ E sont telles que r ⊂ rt ⊗ rs ⊗ p, alors on a h(χ(rt ⊗ r ⊗ rs ⊗ p)) ≥ 1, donc rt ⊗ r et rs ⊗ p ont une composante irréductible commune, qui doit eˆ tre dans rs ◦ E ∩ rt ◦ E = ∅, contradiction (cf. Rappel C). Il en résulte que si a (resp. b) sont des coefficients arbitraires de rt (resp. rs ), alors qπ(a∗ b)q = 0. Les ai,k e´ tant des coefficients de ri pour tout i (cf. Lemme 9 (i)), on a: X Tt0 (q) Ts0 (q) = (Mt Ms )−1 π(at,k )qπ(a∗t,k as,h )qπ(a∗s,h ) = 0. k,h 0

Il en résulte que la norme de T (q) est || T 0 (q) ||= lim || T 0 (q)n || n = 1

X X 1 1 1 1 lim || ( Ti0 (q))n || n = lim || (Ti0 (q))n || n 3 3 i i

donc plus petite que 13 (car les (Ti0 )n sont des applications complètement positives unitales). L’assertion de (c) sur supp(T (z)) est e´ vidente. L’inégalité || T (z) ||≤ 0.95 || z || résulte du lemme suivant (avec f = T 0 , x = π(z) et δ = 1/3) et du fait que la représentation GNS π est isometrique:


167

Lemme 10. Soit H un espace de Hilbert, x = x∗ ∈ B(H), p + q = 1 projections dans H, et f : B(H) → B(H) une application √ complètement positive unitale. Si pxp = 0 et || f (q) ||≤ δ < 1/2, alors || f (x) ||≤ 2 δ − δ 2 || x ||. Démonstration (G. Skandalis). Soit ζ ∈ H arbitraire de norme un. On veut montrer que √ | < f (x)ζ, ζ >| ≤ 2 δ − δ 2 || x ||. Par le théorème de Stinespring on peut supposer que f (z) = ω ∗ zω avec ω ∗ ω = 1. En posant ξ = ωζ, il suffit de démontrer l’énoncé suivant: Si H est un espace de Hilbert, x = x∗ ∈ B(H), p + q = 1 sont des projections dans H avec pxp = 0,√et ξ ∈ H est de norme 1 et tel que < qξ, ξ > ≤ δ < 1/2, alors | < xξ, ξ > | ≤ 2 δ − δ 2 || x ||. Notons E ∈ B(H) la projection sur Cpξ ⊕ Cqξ. Alors on peut remplacer dans l’énoncé ci-dessus H, p, q, x, ξ par E(H), EpE, EqE, ExE, ξ. En effet, on a < qξ, ξ > =< EqEξ, ξ >, < xξ, ξ >=< ExEξ, ξ > et || ExE ||≤|| x ||. On peut aussi supposer que || x ||= 1. 0 0 m 1 0 a b et ξ = Soient donc H = C2 , p = ,q = avec ,x= b 0 0 1 n 0 0 a ∈ R et b, m, n ∈ C. On a: m am + bn , >= a | m |2 +2Re(b n m). < xξ, ξ >=< bm n On a | m |2 =< qξ, ξ > ≤ δ et | m |2 + | n |2 =|| ξ ||2 = 1, donc: p | < xξ, ξ > |≤ δ | a | +2 δ(1 − δ) | b | . 2 2 On p peut supposer que a ≥ 0. Les racines de det(x − zI) = z − az− | b | sont 2 2 (a±p a + 4 | b | )/2. Mais || x ||= 1, donc ces racines sont dans [−1, 1], ce qui implique que a2 + 4 | b |2 ≤ 2 − a, d’où a ≤ 1− | b |2 . On a donc: p √ √ | < xξ, ξ > |≤ δ(1− | b |2 ) + 2 δ(1 − δ) | b |= 1 − ( 1 − δ − δ | b |)2 . √ √ On a δ < 1/2, donc la fonction b 7→ 1 √ − ( 1 − δ − δ | b |)2 atteint son maximum sur [−1, 1] en b = ±1. Ce maximum est 2 δ − δ 2 .

Enfin, on utilisera le lemme suivant au lieu de la propriété de Dixmier: Lemme 11. Soit (A, φ) une C∗ -algèbre munie d’un e´ tat fidèle, soit ψ ∈ A∗ un e´ tat, soit As ⊂ A une ∗-algèbre dense et soit 0 < δ < 1. Supposons que pour tout hermitien x ∈ ker(φ) ∩ As il existe une famille finie d’éléments ai ∈ A telle que l’application P z 7→ ai za∗i soit unitale, préserve φ et ψ et envoie x sur un e´ lément de norme ≤ δ || x ||. Alors A est simple et ψ = φ. Démonstration. On peut supposer As = A. En appliquant plusieures fois l’hypothèse, on peut supposer que δ > 0 est aussi petit que l’on veut. Soit J ⊂ A un idéal bilatère. Soit y ∈ J et z = yy ∗ /φ(yy ∗ ). Alors on peut trouver des ai avec || Σai (1 − z)a∗i ||< 1, i.e. avec Σai za∗i inversible. Mais Σai za∗i ∈ J, donc J = A. Soit x = x∗ ∈ ker(φ) quelconque et > 0 petit. On peut trouver un y = Σai xa∗i de norme plus petite que , donc | ψ(x) |=| ψ(y) |≤ . On obtient ψ(x) = 0, donc que ψ = φ sur les e´ léments hermitiens. Tout opérateur e´ tant une combinaison linéaire finie de 1 et de hermitiens de ker(φ), on a ψ = φ.

168

T. Banica

Proposition 9. Si G a` la propriété de Powers alors Gred est simple. Supposons de plus qu’on se donne un e´ tat ψ de Gred tel que ∀ x, y ∈ Gs on ait ψ(xy) = ψ(y(f1 ∗ x ∗ f1 )). Alors ψ = h (la mesure de Haar de Gred ). Donc si h est une trace, alors elle est la trace unique de Gred . Démonstration. Soit x ∈ ker(h) ∩ Gs un hermitien. Remarquons que 1 n’est pas dans F := supp(x). G ayant la propriété de Powers, on peut appliquer la Proposition 8 avec s = t = 1. On obtient donc une application unitale P f de la forme z 7→ ai za∗i qui laisse invariantes h et ψ (par le point (b) de la Prop. 8), telle que || f (x) ||≤ 0.95 || x || (normes de Gred ). On conclut a` l’aide du Lemme 11 (avec A = Gred , As = Gs et φ = h). 8. Simplicité de Au (F )red Au (F )red n’a pas la propriété de Powers (prendre F = {rαβ , rβα }), mais on va montrer qu’elle est simple en utilisant la Proposition 8. On identifie les objets définis dans la section précédente pour G = Au (F ). En utilisant la déscription des représentations de \ Au (F ), on peut identifier A ee u (F ) = N ∗ N. La multiplication ◦ sur P (N ∗ N) est donn´ (pour x, y ∈ N ∗ N) par la formule suivante: {x} ◦ {y} = {ab | ∃ g ∈ N ∗ N avec x = ag, y = gb}. Notation. Pour w ∈ N ∗ N on note {w...} (resp. {...w}) l’ensemble des mots de N ∗ N qui commencent (resp. finissent) avec w. Pour w, y ∈ N ∗ N on note {w...y} = {w...} ∩ {...y}. On note (βα)N le mot βαβα...βα (N fois). Lemme 12. On considère les ensembles D = {α...}, E = {β...} ∪ {e}, F ` = {β...α} et les e´ léments r1 = βαβ, r2 = βα2 β et r3 = βα3 β. Alors N ∗ N = D E est une partition, F ◦ D ∩ D = ∅ et rs ◦ E ∩ rk ◦ E = ∅, ∀ s 6= k. Notation. On fixe n ∈ N, F ∈ GL(n, C) et on note G = Au (F )red et h sa mesure de Haar. Corollaire 3. Soient s, t ∈ R et > 0. Soit ψ un e´ tat de G tel que ψ(xy) = ψ(y(fs ∗ x ∗ ft )), ∀ x, y ∈ Gs . P Alors il existe une application linéaire unitale V : G → G de la forme z 7→ ai za∗i ∗ (somme finie, avec ai ∈ Gs ) qui préserve ψ telle que pour tout x = x ∈ Au (F )s avec supp(x) ⊂ {β...α} on a || V (x) ||≤ || x || et supp(V (x)) ⊂ {β...α}. Démonstration. On applique la Proposition 8 aux parties définies dans P le Lemme 12. On obtient ainsi une certaine application T : G → G de la forme z 7→ bi zb∗i (somme finie, avec bi ∈ Gs ) qui préserve ψ. En posant z = x dans le point (c) de la Prop. 8 on obtient || T (x) ||≤ 0.95 || x ||, supp(T (x)) ⊂ ∪i ri ◦ supp(x) ◦ ri ⊂ {β...β} ◦ {β...α} ◦ {α...α} ⊂ {β...α}. La condition (a) de la Prop. 8 implique que T (x) = T (x)∗ ∈ Gs , donc on peut appliquer le point (c) de la Prop. 8 avec z = T (x), puis avec z = T 2 (x) etc. On choisit m ∈ N tel que 0.95m ≤ et on pose V = T m . Lemme 13. ∀ F ⊂ N ∗ N finie, (βα)N ◦ F ◦ (βα)N ⊂ {β...α} ∪ {e} pour N grand.


169

Démonstration. Si Y ⊂ N ∗ N est l’ensemble des mots alternés (i.e. qui ne contiennent ni α2 ni β 2 ), on voit facilement que (a) Y ◦ {...α} ∩ {...β} = ∅ ; (b) Y ◦ {...β} ∩ {...α} = ∅ (c) {α...} ◦ Y ∩ {β...} = ∅ ; (d) {β...} ◦ Y ∩ {α...} = ∅. Il suffit démontrer le lemme pour les parties de cardinal 1. Soit F = {z} une telle partie. • Supposons z ∈ Y . Par (d), (βα)N ◦ z est e´ gal a` e (et dans ce cas on a fini), ou commence par β. En utilisant encore une fois (d) on voit que (βα)N ◦ z ◦ (βα)N est e´ gal a` e ou commence par β. De même, en appliquant deux fois (a), on voit que (βα)N ◦ z ◦ (βα)N est e´ gal a` e ou finit par α. Donc (βα)N ◦ z ◦ (βα)N est dans {β...α} ∪ {e}. • Supposons z ∈ N ∗ N − Y , par exemple que z = xα2 y. Alors (βα)N ◦ xα ⊂ {...α} ∪ {e} par (a). Pour N ≥ l(x), il est clair que (βα)N ◦ xα ⊂ {β...α}. Par les mêmes arguments, αy ◦ (βα)N ⊂ {α...α} pour N ≥ l(y). Donc pour N grand: (βα)N ◦ (xα2 y) ◦ (βα)N = [(βα)N ◦ xα] ◦ [αy ◦ (βα)N ] ⊂ {β...α}.

Corollaire 4. Soit x = x∗ ∈ Gs tel que h(x) = 0.

P (i) Il existe une application linéaire unitale W : G → G de la forme z 7→ bi zb∗i (somme finie, avec bi ∈ Gs ), qui préserve h et telle que supp(W (x)) ⊂ {β...α}. (ii) Soient v, w ∈ R. Alors L ∈ R+∗ et une application linéaire U : G → G P il existe de la forme z 7→ ci zc∗i (somme finie, avec ci ∈ Gs ), qui préserve h, telle que supp(U (x)) ⊂ {β...α} et telle que U/L préserve tout e´ tat ψ de G vérifiant ψ(pq) = ψ(q(fv ∗ p ∗ fw )), ∀ p, q ∈ Gs . Démonstration. Fixons K ∈ N tel que (βα)K ◦ supp(x) ◦ (βα)K ⊂ {β...α} ∪ {e} (cf. Lemme 13). Notons r = r(βα)K . (i) Le Lemme 9 appliqué avec a = c = 0 et d = −b = 1/2 fournit une certaine b → L(G). Remarquons que le choix de a, b, c, d permet d’appliquer application ad : G (avec u := r) les points (ii) et (iii) du Lemme 9, ainsi que le point (iv) avec s = t = 1 et φ = h. On obtient deux réels positifs non-nuls K, M , qui sont e´ videmment e´ gaux. En posant W = ad(r)/M , il nous reste a` vérifier que supp(W (x)) ⊂ {β...α}. En utilisant r ◦ supp(x) ◦ r ⊂ {β...α} ∪ {e} et la formule de ad (Lemme 9 (i)), ainsi que l’égalité r = r on obtient supp(W (x)) ⊂ {β...α} ∪ {e}. Mais h(W (x)) = h(x) = 0, donc e n’est pas dans supp(W (x)). (ii) Le Lemme 9 appliqué avec c = −a = (v + 1)/2 et d = −b = 1/2 fournit une b → L(G). On peut appliquer (avec u := r) le point (ii) du certaine application ad : G Lemme 9, ainsi que le point (iv) avec s = t = 1 et φ = h. On obtient ainsi un M ∈ R+∗ P tel que si on note U = ad(r)/M , alors U est de la forme z 7→ ci zc∗i et préserve h. Appliquons de nouveau le point (iv) du Lemme 9, pour les mêmes a, b, c, d, mais avec s = v, t = w et φ = ψ cette fois-ci. On obtient un M1 ∈ R+∗ tel que ad(r)/M1 préserve ψ. On pose alors L = M1 /M . Enfin, l’assertion sur supp(U (x)) se démontre comme au point (i). Démonstration du Théorème 3. Rappelons qu’on a fixé n ∈ N et F ∈ GL(n, C) et on a noté G = Au (F )red . Soit x = x∗ ∈ Gs arbitraire tel que h(x) = 0 et soit 1 > 0 arbitraire.

170

T. Banica

I.) En appliquant le Corollaire 4 (i) avec le x ci-dessus on obtient une certaine application W : G → G. En appliquant le Corollaire 3 avec s = β = 1, ψ := h et := 1 on obtient une certaine application V : G → G. Remarquons que l’application V W a les propriétés suivantes: P • V W est unitale de la forme z 7→ s as za∗s (somme finie). • V W préserve h. • || (V W )(x) ||≤ 1 || x ||. Le Lemme 11 (avec A = G, As = Gs et ψ = φ = h) montre alors que G est simple. II.) Soient s, t ∈ R et φ un e´ tat de G vérifiant φ(pq) = φ(q(fs ∗ p ∗ ft )), ∀ p, q ∈ Gs . En appliquant le Corollaire 4 (ii) avec v = s, w = t et ψ = φ on obtient une application U : G → G et un L ∈ R+∗ . En appliquant le Corollaire 3 avec ψ = φ et avec un > 0 tel que || U (x) ||< L1 on obtient une certaine application V : G → G. Remarquons que l’application V U/L a les propriétés suivantes: (a) V U/L préserve φ. (b) || (V U/L)(x) ||≤ L−1 || U (x) ||≤ 1 . En utilisant (a) et en faisant 1 → 0 dans (b) on obtient φ(x) = 0. Donc φ = h sur les e´ léments hermitiens, d’où φ = h. III.) Enfin, par le Théorème 7, la mesure de Haar de Au (F ) est une trace si et seulement si F F ∗ ∈ C1. Proposition 10. Soit (G, u) un groupe quantique compact tel que sa mesure de Haar soit une trace. Alors (G, u) est moyennable si et seulement si Gred est nucléaire. Démonstration. Notons J le noyau de la projection π : Gp → Gred . On analyse les extensions a` Gp et Gred des applications λ ⊗ ρ, δ, e définies sur Gs : • La représentation gauche-droite λ ⊗ ρ : Gs ⊗ Gs → B(l2 (Gred )) est un ∗morphisme, qui s’étend donc en une application (λ ⊗ ρ)p : Gp ⊗max Gp → B(l2 (Gred )) (voir 6eme section). Le noyau de la projection π ⊗ I : Gp ⊗max Gp → Gred ⊗max Gp e´ tant J ⊗max Gp (voir [Wa]), on voit que (λ⊗ρ)p se factorise par π⊗I en une application (λ ⊗ ρ)r : Gred ⊗max Gp → B(l2 (Gred )). • La comultiplication δ : Gs → Gs ⊗Gs est un ∗-morphisme qui s’étend a` Gp en une application δp : Gp → Gp ⊗max Gp . En composant avec la projection Gp ⊗max Gp → Gred ⊗max Gp on obtient une application δ1 : Gp → Gred ⊗max Gp . • La comultiplication δr : Gred → Gred ⊗min Gred se relève en une application δ2 : Gred → Gred ⊗min Gp (voir Corollaire A.6 de [BS]). • La coünité e : Gs → C s’étend en une application ep : Gp → C. En notant τ : T 7→< T 1, 1 > l’état canonique de B(l2 (Gred )), on a le diagramme commutatif suivant : Gred δ2 ↓

π

←−

Gp

ep

−→

δ1 ↓

C ↑τ

(λ⊗ρ)r

−→ B(l2 (Gred )) Gred ⊗min Gp ←− Gred ⊗max Gp (où la commutation du carré de gauche résulte de la construction de δ1 , δ2 , et celle du carré de droite se vérifie sur les générateurs uij ). Il en résulte que si Gred est nucléaire, alors ker(π) ⊂ ker(ep ), donc G est moyennable (cf. Prop. 5.5 de [Bl]). Pour l’autre implication, voir les Remarques A.13 de [BS].


Corollaire 5. Au (In )red n’est pas nucléaire. De même pour Ao (In )red si n ≥ 3.

171

Remarque. La Proposition 6 montre qu’on a Au (F ) ∼sim Au (F 0 ), avec F 0 diagonale. La propriété universelle de Au (F 0 ) implique l’existence d’une surjection Au (F 0 ) → C∗ (Fn ), et en composant avec l’application de similarité on obtient une surjection Au (F ) → C∗ (Fn ). Le noyau de la surjection Au (F ) → C∗ (Fn ) e´ tant un idéal non-trivial de Au (F ), on a Au (F ) 6= Au (F )red . References [A] [BS] [B1] [B2] [BCH] [Bl] [D] [HZ] [H] [HS] [J] [MC1] [MC2] [N] [NS] [P] [R1] [R2] [VDW] [V] [VDN] [W1] [W2] [Wa] [Wo1] [Wo2]

Avitzour, D.: Free products of C ∗ -algebras. Trans. Am. Math. Soc. 271, 423–465 (1982) Baaj, S., Skandalis, G.: Unitaires multiplicatifs et dualité pour les produits croisés de C ∗ -algèbres. Ann. Sci. Ec. Norm. Sup., 4eme serie, t.26, 425–488 (1993) Banica, T.: On the polar decomposition of circular variables. Int. Eq. and Op. Th. 24, 372–377 (1996) Banica, T.: Théorie des représentations du groupe quantique compact libre O(n). C. R. Acad. Sci. Paris 322, 241–244 (1996) Bekka, M., Cowling, M., de la Harpe, P.: Some groups whose reduced C ∗ -algebra is simple. Publ. Math. IHES 80, 117–134 (1995) Blanchard, E.: Déformations de C ∗ -algèbres de Hopf. Bull. Soc. Math. Fr. 124, 141-215 (1996) Dykema, K.: On certain free product factors via an extended matrix model. J. Funct. Anal. 112, 31–60 (1993) Haagerup, U., Zsido, L.: Sur la propriété de Dixmier pour les C ∗ -algèbres. C. R. Acad. Sci. Paris 298, 173–176 (1984) de la Harpe, P.: Reduced C ∗ -algebras of discrete groups which are simple with unique trace. Lect. Notes Math. 1132, Berlin–Heidelberg–New York: Springer, 1985, pp. 230–253 de la Harpe, P., Skandalis, G.: Powers’ property and simple C ∗ -algebras. Math. Ann. 273, 241–250 (1986) Jones, V.F.R.: Index for subfactors. Invent. Math. 72, 1–25 (1983) McClanahan, K.: C ∗ -algebras generated by elements of a unitary matrix. J. Funct. Anal. 107, 439–457 (1992) McClanahan, K.: Simplicity of reduced amalgamated free products of C ∗ -algebras. Canad. J. Math. 46, 793–807 (1994) Nagy, G.: On the Haar measure of the quantum SU (N ) group. Commun. Math. Phys. 153, 217–228 (1993) Nica, A., Speicher, R.: R-diagonal pairs - a common approach to Haar unitaries and circular elements. Preprint (1995) Powers, R.: Simplicity of the C ∗ -algebra associated with the free group on two generators. Duke Math. J. 42, 151–156 (1975) Rosso, M.: Finite dimensional representations of the quantum analog of the enveloping algebra of a complex semisimple Lie algebra. Commun. Math. Phys. 117, 581–593 (1988) Rosso, M.: Algèbres enveloppantes quantifiées, groupes quantiques compacts de matrices et calcul differentiel non-commutatif. Duke Math. J. 61, 11–40 (1990) Van Daele, A., Wang, S.Z.: Universal quantum groups. International J. of Math. Vol. 7, No. 2, 255–264 (1996) Voiculescu, D.: Circular and semicircular systems and free product factors. Progress in Math. 92, Boston: Birkhäuser, 1990 pp. 45–60 Voiculescu, D., Dykema, K., Nica, A.: Free random variables. CRM Monograph Series n◦ 1, AMS (1993) Wang, S.Z.: General constructions of compact quantum groups. Ph. D. Thesis, Berkeley (1993) Wang, S.Z.: Free products of compact quantum groups. Commun. Math. Phys. 167, 671-692 (1995) Wassermann, S.: Exact C ∗ -algebras and related topics. Lecture Notes Series no 19, Seoul National Univ. (1994) Woronowicz, S.L.: Twisted SU (2) group. An example of a non-commutative differential calculus. Publ. RIMS Kyoto 23, 117–181 (1987) Woronowicz, S.L.: Compact matrix pseudogroups. Commun. Math. Phys. 111, 613–665 (1987)

172

T. Banica

[Wo3] Woronowicz, S.L.: Tannaka-Krein duality for compact matrix pseudogroups. Twisted SU (n) groups. Invent. Math. 93, 35–76 (1988) [Wo4] Woronowicz, S.L.: A remark on compact matrix quantum groups. Lett. Math. Phys. 21, 35–39 (1991) Communicated by A. Connes

Commun. Math. Phys. 190, 173 – 211 (1997)

Communications in


Geometric Stability Analysis for Periodic Solutions of the Swift-Hohenberg Equation Jean-Pierre Eckmann1,2 , C. Eugene Wayne3 , Peter Wittwer3 1 2 3

Dépt. de Physique Théorique, Université de Genève, CH-1211 Genève 4, Switzerland Section de Mathématiques, Université de Genève, CH-1211 Genève 4, Switzerland Dept. of Mathematics, The Pennsylvania State University, University Park, PA 16802, USA

Received: 30 January 1997 / Accepted: 6 April 1997

Abstract: In this paper we describe invariant geometrical structures in the phase space of the Swift-Hohenberg equation in a neighborhood of its periodic stationary states. We show that in spite of the fact that these states are only marginally stable (i.e., the linearized problem about these states has continuous spectrum extending all the way up to zero), there exist finite dimensional invariant manifolds in the phase space of this equation which determine the long-time behavior of solutions near these stationary solutions. In particular, using this point of view, we obtain a new demonstration of Schneider’s recent proof that these states are nonlinearly stable.

1. Introduction In this paper, we study the non-linear stability of space-periodic, time-independent solutions of the Swift-Hohenberg equation (1.1) ∂t u = ε2 − (1 + ∂x2 )2 u − u3 . Here, u(x, t) is defined on R × R+ and takes real values and ε ≥ 0 is a small parameter. Equation(1.1) has stationary solutions u(x, t) = uε,ω (x) which are of the form uε,ω (x) =

X

uε,ω,n eiωnx .

(1.2)

n∈Z

The non-linear stability problem addresses the question of the time evolution of initial data which are close to uε,ω , and stability in this context means that the solution converges to uε,ω as t → ∞. The range of possible values of ω is given by ε2 > (1 − ω 2 )2 when ω is close to 1. To simplify the exposition we shall concentrate on the case ω = 1, and omit henceforth the index ω.

174

J.-P. Eckmann, C.E. Wayne, P. Wittwer

In a very interesting paper, G. Schneider [Sch] has solved this problem, and the present work relies heavily on his ideas. Our aim is to simplify somewhat the exposition of [Sch] and to extend the result by giving a more precise asymptotic analysis, using the description of the asymptotic behavior in terms of a continuous renormalization group and invariant manifolds as introduced in [W], see below. The existence of solutions of the form Eq. (1.2) is a well-established fact, (see e.g. [CE]) and we repeat here only those points of the discussion which are needed in the sequel. The equation for the stationary solution is F (u, ε) = 0, where (1.3) F (u, ε) ≡ ε2 − (1 + ∂x2 )2 u − u3 . The equation F = 0 has the trivial solution u = 0, ε = 0. Linearizing around this solution, we see that DF equals DF = −(1 + ∂x2 )2 ⊕ 0 , acting on some weighted subspace of L2 (R) ⊕ R. The null space of DF is spanned by {cos x, sin x} ⊕ 0

and

0⊕1,

(1.4)

and thus, bifurcation theory suggests the existence of solutions of the form of Eq. (1.2), when ε 6= 0. This is indeed what happens (cf. [CR, CE]), and the higher frequency terms in Eq. (1.2) are generated from the basis Eq. (1.4) by the non-linearity u3 . The method clearly extends to similar polynomial non-linearities. An explicit calculation shows that F (uε , ε) = 0 for 2 (1.5) uε (x) = ε √ cos(x) + ε2 hε (x) , 3 and hε (x) = hε (x + 2π). Thus, the function uε equals uε,1 of Eq. (1.2). We have broken the translation invariance of the problem by the choice of cos in Eq. (1.5), instead of, say, sin. We next pass to the linear stability analysis of the solution uε . This is again a classical subject, initiated by Eckhaus [E], which we summarize for convenience, see also [CE]. Linearizing Eq. (1.1) around the solution uε we are led to study the operator Lε = ε2 − (1 + ∂x2 )2 − 3u2ε , that is, Lε v (x) = ε2 − 3u2ε (x) v(x) − (1 + ∂x2 )2 v(x) . Because uε is a 2π periodic function, it is most convenient to work in Floquet coordinates (i.e., with Bloch waves). To fix the notation, we give some details: Begin by introducing the following representation for f ∈ L2 (R): Z X Z 1/2 −ikx ˆ f (k) = d` e−imx e−i`x fˆ(m + `) f (x) = dke Z

m∈Z 1/2

= −1/2

where

−1/2

d` e−i`x f˜` (x) ,

f˜` (x) =

X

e−imx fˆ(m + `) .

(1.6)

m∈Z

Properties of f˜. Note first that f˜` is 2π periodic. Furthermore, the definition of f˜` (x) can be extended to all ` ∈ R by the definition

Geometric Stability Analysis for Solutions of Swift-Hohenberg Equation

175

f˜`+1 (x) = e−ix f˜` (x) . We next observe that if f has a smooth, rapidly decaying Fourier transform, then f˜` (x) will also be a smooth function of ` and x. If f , g are in L2 (R), then it follows from the definition of f˜` that Z 1/2 ∼ dk f˜`−k (x) g˜ k (x) . (1.7) (f g)` (x) = −1/2

We finally note that if s is a 2π periodic function, then s˜` (x) = δ(`)s(x) .

(1.8)

It is now easy to see that ∼ Lε v ` (x) = ε2 − (1 + (i` + ∂x )2 )2 v˜ ` (x) − 3(u2ε v)∼ ` (x) . In the language of condensed matter physics, ` is the quasi-momentum in the “Brillouin zone” [− 21 , 21 ] and Lε leaves the subspace F` of functions with quasi-momentum ` invariant. Using the properties just described, we get ∼ Lε v ` (x) = ε2 − (1 + (i` + ∂x )2 )2 v˜ ` (x) − 3u2ε (x) · v˜ ` (x) ≡ Lε,` v` (x) . (1.9) To fix the notation, we repeat the calculation done by Eckhaus, cf. also [CE, M]. We denote c(x) = cos(x), s(x) = sin(x). The method of Eckhaus consists in projecting the eigenvalue problem for Lε,` onto the subspace spanned by the “bifurcating directions” c and s. Observe that, modulo higher frequency terms, we have c3 = 43 c, c2 s = 41 s, and therefore the projection of Lε,` onto this subspace is described by the matrix −4`2 − `4 − 2ε2 + O(ε4 ) −4i`3 O(`2 ) O(`) 4 + O(ε ) . 4i`3 −4`2 − `4 O(`) O(`2 ) The eigenvalues of this matrix are λ0`,0 = − 4 + O(ε2 ) `2 + O(`3 ) , λ0`,1 = −2 ε2 + O(ε4 ) − 4 + O(ε2 ) `2 + O(`3 ) + O(`4 + ε4 ) . Thus, the restriction of Lε,` on the subspace spanned by c and s has its spectrum in the left half-plane. Note that the corresponding eigenvectors are s + O(|`| + ε) and c + O(|`| + ε). Extending this calculation to the full space, one shows in the same way [E, CE, M] that Theorem 1.1. For sufficiently small ε > 0 the operators Lε,` , with ` ∈ [− 21 , 21 ] are selfadjoint on the Sobolev space H 4 , have compact resolvent and a spectrum satisfying λ`,0 (ε2 ) = − 4 + O(ε2 ) `2 + O(`3 ) ≡ −c0 (ε2 )`2 + O(`3 ) , (1.10) λ`,1 (ε2 ) = −2 ε2 + O(ε4 ) − 4 + O(ε2 ) `2 + O(`3 ) , λ`,j ≤ −(1 − j 2 )2 + O(ε2 ) ,

j = 2, 3, . . . .

Notation. Since we mostly concentrate on the branch 0, we shall abbreviate λ` = λ`,0 (ε2 ). The eigenfunction corresponding to λ` is (1.11) ϕε,` (x) = const. u0ε (x) + i`gε (x) + hε,` (x)`2 ,

176


where uε is the stationary solution, and both gε and h`,ε are 2π periodic. If we choose the constant to normalize the L2 norm of ϕε,` to 1, then ϕε,` = π −1/2 sin(x) + O(ε + |`|).We can now formulate the main question of this paper: Having seen that the solution uε is linearly (marginally) stable, is it true that this solution is stable under the non-linear evolution? The answer will be affirmative. As pointed out by Schneider [Sch], the result is not obvious, since the leading non-linear term does not have a sign. Indeed, the nonlinear evolution equation for a (small) perturbation of uε is ∂t v = −(1 + ∂x2 )2 v + ε2 v − 3u2ε v − 3uε v 2 − v 3 , where we recall that uε is of order ε, and approximately equal to O(ε) cos(x). Reducing again to quasi-momentum `, and using Eq. (1.8), we get the equation 3 ∼ ∂t v˜ ` = Lε,` v˜ ` − 3uε (v 2 )∼ ` − (v )` ,

(1.12)

and it is the term 3uε (v 2 )∼ ` which does not have a sign. The saving grace will be the diffusive behavior suggested by the spectrum (in particular the branch λ` ). At first sight, the non-linearities seem to be too singular for diffusion to dominate a potential divergence. Indeed, it is well known that, e.g., the equation ∂t u = ∂x2 u + u3 , has solutions which blow up in finite time [L], and the quadratic term makes things even worse. The beautiful observation of Schneider [Sch] is, however, that the problem Eq. (1.12) is rather of a form reminiscent of ∂t v = ∂x2 v − ∂x2 v 2 − ∂x v 3 ,

(1.13)

which is good enough for convergence [CEE, BK, BKL]. In later sections we examine in detail the form of the non-linear terms in Eq. (1.12), but here we explain briefly why these terms are similar to the non-linear terms in Eq. (1.13). The derivatives in the non-linearity have their origin in the symmetries of the problem, and they are easier to understand in momentum space. In fact, Eq. (1.13) is a good approximation to Eq. (1.12) only in the low-momentum (small `) regime, but this is sufficient since for ` outside a neighborhood of ` = 0, the stationary solutions are linearly stable, (and not only marginally stable) and the form of the non-linearity is unimportant. To understand the low-momentum behavior of Eq. (1.1), note first that the SwiftHohenberg equation Eq. (1.1) – and, incidentally, other equations with coordinate independent right-hand side – has a circle of fixed points generated by translations. If we now study Eq. (1.12) at ` = 0, this corresponds to studying the Swift-Hohenberg equation in the space of functions of period 2π. In this space, say L2 ([0, 2π]), the linear operator in Eq. (1.12) has pure point spectrum with a simple eigenvalue at 0 and all other eigenvalues real and strictly negative. In this case, as Schneider notes, the center manifold theorem can be applied, and there exists a 1-dimensional center manifold. We also see immediately that the eigenvector corresponding to the 0 eigenvalue is ∂x uε , i.e., it is tangent to the circle of fixed points generated by translations. In fact, since any fixed point sufficiently close to the origin must lie in the center manifold, we see that the center manifold coincides with the 1-dimensional circle of fixed points. Thus the non-linearity in the equation, when restricted to the center manifold, must vanish. This shows that the effective non-linearity in Eq. (1.12), when evaluated at ` = 0, must vanish


177

and this accounts for one derivative in Eq. (1.13). More precisely, we see that the effective non-linearity in Eq. (1.12) is bounded by O(`), as is the non-linearity in Eq. (1.13). The second derivative of the non-linearity in Eq. (1.13) arises because of “momentum conservation.” Since ϕε,` is a smooth function of `, the linear term in Eq. (1.11) must of the form i`gε , with gε independent of `. Since the interaction is local in x, one sees upon working out the integrals that all terms proportional to ` in the non-linearity cancel exactly, see Eq.(A.3). Thus, the low momentum behavior of Eq. (1.12) is as if the non-linearity was differentiated twice – i.e., exactly as in Eq. (1.13). Our main result is that this intuitive argument correctly predicts that the leading order asymptotics are diffusive, and that furthermore, the higher order asymptotics are controlled by a sequence of finite dimensional invariant manifolds. Thus, our approach provides some insight into how finite dimensional geometrical structures can arise from a problem with continuous spectrum. Stability Theorem 1.2. Fix n ≥ 1 and δ > 0. There exists a Hilbert space, H(n), such that there is an n + 1 dimensional, invariant manifold for (1.12) in the extended phase space P (n) = R+ × H(n) of this equation. Any “sufficiently small” solution of (1.12) will either lie on this manifold, or approach it at a rate O(t−(n+1−δ)/2 ). In particular, if n = 1, small solutions of (1.12) have the asymptotic form: 1 A − x2 v(x, t) = √ e 4c0 (ε2 )t + O( 3/4−δ ) , t t where c0 (ε2 ) = 4 + O(ε2 ). Remark. In Sects. 2 and 3, we will make clear precisely what the Hilbert spaces H(n) are and what we mean by “sufficiently small.” The remainder of the paper is devoted to a proof of the Stability Theorem 1.2. 2. Formulating the Stability Theorem 1.2 in Terms of Scaling Variables In this section, we transform the problem to a rescaled dynamical system. In the next section, we will cast the dynamical system thus obtained into an invariant manifold problem. The idea of the proof is to focus on the “central branch” of the spectrum, λ` = λ`,0 (ε2 ), which is only marginally stable. The relevant part of the spectrum for the long-time asymptotics is only the part in a small neighborhood of ` = 0, a fact we exhibit by an appropriate rescaling of the dependent and independent variables. This rescaling has the disadvantage that it introduces a singular perturbation in the variables corresponding to the “stable branches” of the spectrum, λ`,n (ε2 ), n ≥ 1, because the corresponding modes decay extremely fast, when rescaled (at least on a linear level). However, invariant manifold theory has long been used to treat singular perturbation problems, and we are able to use it for that purpose here as well. In addition, these invariant manifolds will provide us with a geometric description of the long-time asymptotics of solutions near the stationary states. Our method generalizes to other problems of similar spectral nature, see the example of a cylindrical domain given in [W2]. Henceforth, we fix ε > 0, and omit it from most subscripts. Since L` = Lε,` is selfadjoint, we can define the (orthogonal) spectral projections P` and P`⊥ , which project onto the central branch and its complement.

178


Remark. We know that for |`| sufficiently small, say |`| < `0 /2, one has spec(P` L` P` ) = −c0 (ε2 )`2 + O(`3 ) , and that this is the eigenvalue closest to 0 in spec(L` ). We continue this projection smoothly to larger ` even if it cannot be guaranteed to be a projection onto the highest eigenvalue. But note that for those values of ` the spectrum of L` can be shown to be strictly bounded away from 0, see, e.g., [CE, p. 102]. To study the non-linearity, and to show the mechanism leading to the result which is analogous to Eq. (1.13), we write Eq. (1.12) in more detail:

∂t v˜ ` (x) = Lε,` v˜ ` (x) − 3uε (x) Z −

Z

1/2

dk v˜ `−k (x)v˜ k (x)

−1/2

1/2

dk1 dk2 v˜ `−k1 −k2 (x)v˜ k1 (x)v˜ k2 (x)

(2.1)

−1/2

˜ ` (x) − F3 (v) ˜ ` (x) . ≡ L` v˜ ` (x) − F2 (v) We now decompose Eq. (2.1) by projecting onto P` and P`⊥ . If f ∈ L2 , we let f˜`c = P` f˜` , ⊥ ⊥ and f˜`⊥ = P`⊥ f˜` . Similarly, Lc` = P` L` P` and L⊥ ` = P` L` P` . Then we get ∂t v˜ `c (x) = Lc` v˜ `c (x) − P` F2 (v) ˜ ` (x) − P` F3 (v) ˜ ` (x) , (2.2) and a similar equation for v˜ `⊥ :

∂t v˜ `⊥ (x) = L⊥ ˜ `⊥ (x) − P`⊥ F2 (v) ˜ ` (x) − P`⊥ F3 (v) ˜ ` (x) . ` v

(2.3)

We next split the first equation into a piece corresponding to small |`|, i.e., |`| < `0 and another corresponding to large `. Since we want to construct invariant manifolds, we need some smoothness in this construction and we choose a smooth cutoff χ satisfying 1, if |`| ≤ `0 , χ(`) = 0, if |`| > 2`0 , and of course `0 < 21 . In fact, we shall choose `0 > 0 so small that P` is the projection onto the central eigenspace for all ` ∈ [−`0 , `0 ]. Let ϕ` denote the normalized eigenvector which spans the range of P` (for |`| < `0 , and smoothly continued for ` beyond that value). Then v˜ `c can be written as v˜ `c = V (`)ϕ` , where it is understood that V is really a function of v. We also let Π` denote the operation Π` f` = hϕ` |f` i, where h·i is the scalar product in F` . This operation extracts the coefficient V and therefore Eq. (2.2) can be written as ˜ ` − Π` P` F3 (v) ˜ `. ∂t V (`) = λ` V (`) − Π` P` F2 (v)

(2.4)

Defining V < (`) = χ(`)V (`), and V > (`) = (1 − χ(`))V (`), Eq. (2.4) can be rewritten as ∂t V < (`) = λ` V < (`) − f c (V < , V > , v˜ ⊥ ) (`) , (2.5) ∂t V > (`) = λ` V > (`) − f s (V < , V > , v˜ ⊥ ) (`) , where


179

f (V , V , v˜ ) (`) = χ(`) Π` P` F2 v˜ ` + Π` P` F3 v˜ ` , s < > ⊥ f (V , V , v˜ ) (`) = 1 − χ(`) Π` P` F2 v˜ ` + Π` P` F3 v˜ ` , c

⊥

v˜ ` (x) = (V < (`) + V > (`)) · ϕ` (x) + v˜ `⊥ (x) .

Note that since V > is supported outside [−`0 , `0 ], both it and v˜ ⊥ decay exponentially (at least at the linear level) and hence will be irrelevant for the asymptotics of V < , as we shall show. With this in mind, we introduce a new coordinate, V s , which combines the “irrelevant” pieces, V s = (V > , v˜ ⊥ ). Then the Eq. (2.5) combined with Eq. (2.3) takes the more suggestive form ∂t V < (`) = λ` V < (`) − f (V < , V s ) (`) , (2.6) s < s ∂t V s = L(0) b V + g(V , V ) , s and we know that the spectrum of the linear operator L(0) b is contained in (−∞, −σ ), for some σ s > 0. In order to proceed further, we analyze the non-linear terms in Eq. (2.6) in more detail. In particular, we concentrate on the most critical terms, namely those in f of Eq. (2.6) which depend only on V < . We decompose f (V < , V s ) = f2(0) (V < )+f3(0) (V < )+ f4(0) (V < , V s ), where f2(0) collects the terms which are homogeneous of degree 2 in V < and f3(0) those of degree 3. One gets

f2(0) (V < ) (`) = 3χ(`)

Z

Z dx ϕ` (x)uε (x)

Z ≡ 3χ(`) f3(0) (V < ) (`)

Z = χ(`)

1/2

dk ϕk (x)ϕ`−k (x)V < (k)V < (` − k)

−1/2

1/2

dk K2 (`, k)V < (k)V < (` − k) ,

−1/2

dx ϕ` (x)

Z

1/2

dk1 dk2 ϕk1 (x)ϕk2 (x)ϕ`−k1 −k2 (x) −1/2 < (k2 )V < (` − k1 − k2 )

× V < (k1 )V Z 1/2 ≡ χ(`) dk1 dk2 K3 (`, k1 , k2 )V < (k1 )V < (k2 )V < (` − k1 − k2 ) . −1/2

(2.7) At this point, we make use of the diffusive nature of the problem for V < , by introducing scaling variables as in [W]. This will give us a more precise description of the convergence process than the one obtained in [Sch]. We rescale the variables in Eq. (2.6) as follows: We first fix, once and for all, a (large) constant t0 > 0. Then we define p V < (`, t) = wc sign(`) |3` |(t + t0 ) , log(t + t0 ) , (2.8) p V s (`, t) = ws sign(`) |3` |(t + t0 ) , log(t + t0 ) /(t + t0 )1/2 , where 3` = λ` for |`| < `0 /2 and is monotonically extended beyond that region in such a way that it is parabolic for large |`|. (This artifact is needed because we have no guarantee that λ` itself is monotone.) Note that if λ` were equal to −const. `2 , this scaling would

180


amount to the usual “diffusive” rescaling. Our choice takes into accountphigher order corrections produced by higher order terms in λ` . If we let now p = sign(`) |3` |(t + t0 ), and τ = log(t + t0 ), then Eq. (2.6) implies that wc and ws obey the following equations: ∂τ wc = (−p2 − 21 p∂p )wc + eτ f2 (wc , e−τ /2 ) + f3 (wc , e−τ /2 ) + f4 (wc , ws e−τ /2 , e−τ /2 ) , e−τ ∂τ ws = Mexp(−τ /2) ws + 21 e−τ ws − 21 e−τ p∂p ws + eτ /2 g(wc , ws e−τ /2 , e−τ /2 ) , (2.9) where f2 , f3 , f4 and M in Eq. (2.9) are defined below. If p pe−τ /2 = p(t + t0 )−1/2 = sign(`) |3` | , and if we denote the inverse transformation by ` = 8(pe−τ /2 ) , p where 8 is the inverse function of x 7→ sign(x) |3x |, then, given a function w = w(`, t), we define the nonlinearity f2 (w, e−τ /2 ) (p) = f2(0) (w(·, eτ )) (8(pe−τ /2 )) = f2(0) (w(·, t + t0 )) (8(p(t + t0 )−1/2 )) . (Note that 8(x) = x 1 + O(x)).) Analogous definitions apply to f3 and f4 . The operator M will be described in detail in Eq.(2.13). Remark. The non-linearities f2 ,. . . depend on the choice of t0 . If we consider the initial value problem for the Swift-Hohenberg equation, the “smallness” assumption on the perturbation of the periodic state is to be understood with respect to a choice of a (sufficiently large) t0 . As we will see, however, the nonlinear terms can be bounded, independent of t0 , for all t0 ≥ T > 0. To this change of variables will correspond the following (non-exhaustive) list of substitutions in the integrals in Eq. (2.7): Let a, b ∈ [− 21 , 21 ]. Then Z χ(`)

b

dk → χ 8(pe−τ /2 ) e−τ /2

Z

a

eτ /2 8−1 (b) eτ /2 8−1 (a)

dq 80 (qe−τ /2 ) ,

ϕ` → ϕ8(pe−τ /2 ) , ϕk−` → ϕ0(p,q,τ ) , V (k, t) → w(p, τ ) , V (` − k, t) → w(1(p, q, τ )) .

(2.10)

Here, we define 0(p, q, τ ) = 8(pe−τ /2 ) − 8(qe−τ /2 ) ,

1(p, q, τ ) = eτ /2 8−1 8(pe−τ /2 ) − 8(qe−τ /2 ) . It follows at once from the definition of 8 that

(2.11)


0(p, q, τ ) = e−τ /2 (p − q) · 1 + γ(p, q, τ ) , 1(p, q, τ ) = (p − q) · 1 + κ(p, q, τ ) ,

181

(2.12)

where κ and γ are bounded and smooth. We next discuss in detail the spectrum of Mexp(−τ /2) , which is just the rescaled linear operator for the “stable” part of w, cf. Eq. (2.6). Recall first that V s = (V > , v˜ ⊥ ). This introduces a natural decomposition of ws = (w1s , w2s ), as well as of Mexp(−τ /2) = Mexp(−τ /2),1 ⊕ Mexp(−τ /2),2 . From the definition of the first component, we get 2 Mexp(−τ /2),1 f1s (p, τ ) = ε2 − 1 + (i + i8(pe−τ /2 ))2 − K 8(pe−τ /2 ) f1s (p, τ ) , where K(`) is a kernel given by K(`) = 3

(2.13) Z dx ϕ` (x)u2ε (x)ϕ` (x) .

(Recall that ϕ` really depends on ε as well and should be written ϕε,` .) Since V s has s support bounded paway from ` = 0, say |`| > `0 /2, we see that w1 (p, τ ) will have support −τ /2 > |3`0 /3 |, and the spectrum of Mexp(−τ /2),1 is seen to be contained in in |p|e {σ|Re σ ≤ σ0 < 0}, for some σ0 and for all τ > 0. A very similar argument detailed in Appendix B shows that the spectrum of Mexp(−τ /2),2 is also contained in such a set. Thus, the linear evolution generated by Mexp(−τ /2) contracts exponentially. See Lemma B.6 below for details. We next consider the operator L = (−p2 − 21 p∂p ), which appears in the first component of Eq. (2.9). The detailed study of the semi-group generated by L will be given in Appendix B. Here, we discuss its properties on an informal level. The Fourier transform of L is ∂x2 + 21 x∂x + 21 , which is conjugate to the harmonic oscillator H0 = ∂x2 − x2 /16 + 1/4 by the (unbounded!) transformation T , of multiplication by exp(x2 /8). In formulas: L = T −1 H0 T . Therefore, H0 has (say, on L2 ), discrete spectrum µj = −j/2, j = 0, 1, . . . . It is this spectrum which leads to a nice interpretation of the convergence properties of the Swift-Hohenberg equation. The eigenvalues of L are unchanged by the transformation T , (and the eigenfunctions are multiplied by a Gaussian), so to each eigenvalue µ of L there corresponds a decay rate eτ µ in the linear problem. Because of the transformation of variables from t to τ , this decay rate becomes (t+t0 )µ in the original problem Eq. (2.6). In other words: Neglecting the non-linearities in Eq. (2.9) and setting ws = 0, (and ignoring potential problems related to the unbounded operator T ) we have a solution wc (p, τ ) =

∞ X

wm e−τ m/2 Hm (2p) ,

(2.14)

m=0

where Hm is the mth eigenfunction of L. In the original variables, this means that V < (`, t) =

∞ X

wm (t + t0 )−m/2 Hm 2`(t + t0 )1/2 (1 + O(|`|1/2 )) .

(2.15)

m=0

Thus, to each m there corresponds a specific rate (µm = −m/2) of decay for a part of the function V < . Note that a change of t0 just corresponds to a rearrangement of the series.

182


(This is not contradictory, since a change of t0 also changes the initial condition, and hence the solution whose asymptotics we are computing.) In particular, the slowest rate of decay is associated with H0 , which is Gaussian, and thus, at least at the linear level, a “generic” perturbation of the stationary state will decay like exp(−c`2 t), for some c > 0. In terms of the original independent variables (x, t), it decays like t−1/2 exp(−x2 /(4tc)), as t → ∞. This means that at this level, the periodic stationary states are stable, and that perturbations of them decay like solutions of the linear heat equation. The invariant manifold theory guarantees that this behavior persists in the non-linear problem, and in fact it tells us more. We will see that in suitable spaces we can construct a sequence of manifolds Mj of dimension j = 1, 2, . . ., such that any solution of Eq. (2.9) approaches a solution on Mj at a rate eτ µj−1 , or again reverting to the original (x, t) variables, at a rate O((t + t0 )µj−1 ). In the case at hand, this is O((t + t0 )−j/2 ). Thus, in principle, we can analyze finer and finer details of the asymptotics of perturbations of the stationary state by considering the behavior of the solution on these finite dimensional manifolds. 3. Casting the Stability Theorem 1.2 into an Invariant Manifold Theorem At the end of the preceding section, we have seen that the spectrum of the linear part of Eq. (2.9) has the following nature: The component wc satisfies a differential equation whose linear part has eigenvalues µj = −j/2, j = 0, 1, . . . , N , provided we work on a space of sufficiently smooth and rapidly decaying functions. The evolution of ws is governed by an equation with an even more stable spectrum. The invariant manifold theorem will show in which sense the built-in scalings of Eq. (2.14) survive the addition of non-linearities. While this presents no conceptual problems at all – and this is the beauty of the present approach – some care is of course needed in the application of the invariant manifold theorem. Another point which might be overlooked is the following: The invariant manifold theorem does not say that the representation of the full non-linear problem is the same as in Eq. (2.15), but with slowly varying wj . Rather, we will show that on the complement of a dimension j − 1 surface in the function space, the solutions decay at least like t−j/2 , (for every j ≥ 1), provided the initial data are sufficiently small and smooth. In order to apply the invariant manifold method to the problem, we need bounds on the non-linearities and bounds on the semi-group generated by L. While the factor of t = exp(τ ) in front of f2 in Eq. (2.9) might look like a disaster, we will see that by working in appropriate function spaces, and taking advantage of the nature of the nonlinear term, this factor will disappear. Its presence is in part due to the fact that we chose to work in “momentum” space, rather than “position” space, because the linear problem is most naturally studied in Floquet variables. If we rewrote these terms in position space (i.e., in the original (x, t) variables), they would look much less singular. We will work in Sobolev spaces, and we define Hq,r = {v | (1 − ∂p2 )r/2 (1 + p2 )q/2 v ∈ L2 } ,

(3.1)

equipped with the corresponding norm k · kq,r . The function wc will be an element of Hq,r . The function ws has two components. The first component comes from the central branch of the spectrum of the linear operator (1.9), and will also be in Hq,r . The second component comes from the stable branches of the spectrum, and it depends on both p, and x. It will be an element of the space:


183

Hq,r,ν = {w = w(p; x) | w(p; x) = w(p; x + 2π), (1 − ∂x2 )ν/2 (1 − ∂p2 )r/2 (1 + p2 )q/2 w ∈ L2 (R × [−π, π])} . By a slight abuse of notation, we will denote by kws kHq,r,ν the sum of the Hq,r norm of the first component of ws and the Hq,r,ν norm of the second component, and by kws kq,r,ν , we will mean the Hq,r,ν norm of just the second component. We will also use Hq,r,ν to denote the space of all functions with finite Hq,r,ν norm. The non-linearities satisfy the following bounds: Proposition 3.1. For every q ≥ 2 and every r ≥ 0 there is a constant C for which keτ f2 (w, e−τ /2 )kq−1,r ≤ Ckwk2q,r ,

(3.2)

keτ f3 (w, e−τ /2 )kq,r ≤ Ckwk3q,r , for all τ > 0.

Proposition 3.2. For every q ≥ 2 and every r ≥ 0 there is a constant C for which keτ f4 (wc , ws e−τ /2 , e−τ /2 )kq,r ≤ Ceτ /2 kws kHq,r,ν e−τ /2 kwc kq,r + e−τ kws kHq,r,ν

× 1 + e−τ /2 kwc kq,r + e−τ kws kHq,r,ν , (3.3)

keτ /2 g(wc , ws e−τ /2 , e−τ /2 )kHq,r,ν ≤ Ceτ e−τ /2 kwc kq,r + e−τ kws kHq,r,ν

2

× 1 + e−τ /2 kwc kq,r + e−τ kws kHq,r,ν ,

(3.4)

for all τ > 0. Remark. Note that every factor of kwc kq,r is multiplied by e−τ /2 and every factor of kws kHq,r,ν is multiplied by e−τ . Remark. As we pointed out above, the nonlinear terms depend on the constant t0 . However, the bounds in the two preceding propositions are independent of t0 . More precisely, for any T > 0, the constants C in both propositions can be chosen so that the estimates in (3.2)–(3.4) hold for all t0 ≥ T . The proofs will be given in Appendix A. Note that one loses a power of p in the first estimate of Eq. (3.2), but of course, one “gains” the square of the function. We will regain the “lost” power of p by examining in detail the semi-group generated by L. We denote by PN the projection onto the space spanned by the N eigenvalues {µj = −j/2}j=0,...,N −1 of L. We define QN = 1 − PN . (We verify in Appendix B that these projections are defined.) On the space corresponding to QN , we expect the norm of the semi-group generated by L to decay like exp(τ µN ). This is indeed the case. Theorem 3.3. For every δ > 0, there are a constant N0 and a function r(N, q) such that for every N ≥ N0 , every q ≥ 1 and every r ≥ r(N, q), there is a C = C(q, r, N ) < ∞ such that

τL C(q, r, N ) −τ (|µN |−δ)

e QN v ≤ √ kvkq−1,r , (3.5) e q,r a(τ ) where a(τ ) = 1 − e−τ and L = −p2 − 21 p∂p The proof will be given in Appendix B.

184


We also need an estimate on the linear evolution generated by Mexp(−τ /2) . Let Uτ be the solution of e−τ ∂τ Uτ = Mexp(−τ /2) Uτ , with initial condition U0 = 1. (Compare with the linear part of (2.9).) Then, in Appendix B, we prove Theorem 3.4. If w0 ∈ Hq,r,ν , then there exists c0 > 0, such that for all τ ≥ 0, kUτ w0 kHq,r,ν ≤ exp(−ec0 τ /2 )kw0 kHq,r,ν .

With the help of the bounds Proposition 3.1–Theorem 3.4, we can now reformulate the problem in terms of invariant manifolds. Equation(2.9) can be written as an autonomous system by defining η = (t + t0 )−1/2 = e−τ /2 : ∂τ wc = Lwc + η −2 f2 (wc , η) + f3 (wc , η) + f4 (wc , ws η, η) , η 2 ∂τ ws = Mη ws + η −1 g(wc , ws η, η) , ∂τ η =

− 21 η

(3.6)

.

We will construct an invariant manifold tangent at the origin to the eigenspace corresponding to the N largest eigenvalues of L, and the η direction. We subdivide the center variable wc according to the projection QN defined earlier, where N is fixed once and for all. Define (3.7) x1 = (1 − QN )wc , x2 = QN wc , x3 = ws . Note that the variable x1 is in a finite dimensional space, while x2 and x3 are in infinite dimensional Hilbert spaces. The system of equations Eq. (3.6) now takes the form ∂τ x1 = A1 x1 + N1 (x1 , η, x2 , x3 ) , ∂τ η = − 21 η , ∂τ x2 = A2 x2 + N2 (x1 , η, x2 , x3 ) ,

(3.8)

η 2 ∂τ x3 = A3,η x3 + N3 (x1 , η, x2 , x3 ) . Here A1 = (1 − QN )L, A2 = QN L, and A3,η = Mη . Remark. In view of later developments, we consider x1 and η to be the “interesting” variables and x2 and x3 the “slaved” variables, hence the new order of the variables. Remark. Equation(3.8) is a very singular perturbation problem, because of the factor of η 2 in front of the derivative of x3 . What is more, since η(τ ) = e−τ /2 , it becomes steadily more singular in precisely the limiting regime in which we are interested. Nonetheless, we will see that the invariant manifold theorem provides just the tool we need to understand this limit. Singular perturbation problems of this type do not seem to have been studied much, but they do arise naturally in other contexts, such as the study of parabolic equations in cylindrical domains ([W2]). We shall call Eq. (3.8) the full system. To simplify the notation, we shall omit the dependence on η in A3,η . Consider the spectra of A1 , A2 , A3 . From what we have seen earlier, we find that


185

spec(A1 ) = {0, −1/2, −1, . . . , −(N − 1)/2} , spec(A2 ) ⊆ [−∞, −N/2] , spec(η

−2

(3.9)

A3 ) = [−∞, −c/η ] , 2

where c is some positive constant. Thus, we expect to apply a pseudo center manifold theorem to “slave” the variables x2 , x3 to the variables x1 and η. While there are certain technical difficulties associated with the very singular perturbation, in Appendix C, we demonstrate the following proposition: Proposition 3.5. Fix N > 0. There exist r > 0, q ≥ 1, and ν > 1/2, such that the system of equations (3.8) has an invariant, N + 1-dimensional manifold, given in a neighborhood of the origin by the graph of a pair of functions h∗2 : RN × R → Hq,r , h∗3 : RN × R → Hq,r,ν . We next turn to the task of showing that the invariant manifold we found for Eq. (3.6) actually attracts solutions at an exponential rate. Notation. It is useful to introduce the notationξ = (x1 , η) for the two relevant variables. Consider a solution of the form wc (τ ), ws (τ ) of Eq. (3.6), with wc (τ ) = x1 (τ ), x2 (τ ) as in Eq. (3.7), and ws (τ ) = x3 (τ ). We wish to show that ξ(τ ) , x2 (τ ) , x3 (τ ) −→ ξ(τ ) , h∗2 (ξ(τ )) , h∗3 (ξ(τ )) , as τ → ∞, and furthermore, that it does so at an exponential rate, given essentially by the least negative eigenvalue, µN , of the operator A2 . Proposition 3.6. Fix N > 0. For every positive δ there is a ρ0 > 0 such that if the solution of Eq. (3.6) remains in a neighborhood of the origin of size ρ0 one has the following bound: There is a C ∗ < ∞ for which kx2 (τ ) − h∗2 (ξ(τ ))kq,r + kx3 (τ ) − h∗3 (ξ(τ ))kHq,r,ν ≤ C ∗ e−(|µN |−δ)τ , as τ → ∞. Proof. This proof is relatively standard, see e.g., Carr [C]. Let x2 (τ ) − h∗2 (ξ(τ )) z2 (τ ) ≡ . z(τ ) = z3 (τ ) x3 (τ ) − h∗3 (ξ(τ ) ÿ

Then we have z˙ =

A2 z2 + Nˆ 2 (ξ, z2 , z3 ) −2 η A3 z + η −2 Nˆ 3 (ξ, z2 , z3 )

! ,

(3.10)

where, with the notation of Eq. (3.8), Nˆ j (ξ, z2 , z3 ) = Nj (ξ, z2 + h∗2 (ξ), z3 + h∗3 (ξ)) − Nj (ξ, h∗2 (ξ), h∗3 (ξ)) , for j = 2, 3. The only novelty in Eq. (3.10) w.r.t. [C] is the factor of η −2 in the “3”component which is the reason for our repeating his arguments. But we can integrate Eq. (3.10) explicitly and get

186


Z z2 (τ ) = e

τ A2

τ

z2 (0) +

dσ e(τ −σ)A2 Nˆ 2 ξ(σ), z2 (σ), z3 (σ) ,

0 −2

z3 (τ ) = e(η(τ ) Z τ dσ + 0

−η(0)−2 )A3

z3 (0)

1 (η(τ )−2 −η(σ)−2 )A3 ˆ N3 ξ(σ), z2 (σ), z3 (σ) . e η(σ)2 −1/2

We assume η(0) > 0, since we are interested in the case η(0) = t0 , and we have chosen the scaling factor t0 to be a positive, finite constant. Note also that ξ remains in a neighborhood of the origin, as τ → ∞. From the bounds on the non-linear terms we see that if the solution satisfies kx2 (τ )kq,r + kx3 (τ )kHq,r,ν ≤ ρ , for all τ ≥ 0, then, with νN = N/2, the modulus of the N th eigenvalue µN of L, we have kz2 (τ )kq,r ≤ e−τ νN kz2 (0)kq,r Z τ + Cε dσe−(τ −σ)νN kz2 (σ)kq,r + kz3 (σ)kHq,r,ν , 0 −2

kz3 (τ )kHq,r,ν ≤ e(η(τ ) Z + Cε

−η(0)−2 )νN

kz3 (0)kHq,r,ν

1 −(η(τ )−1 −η(σ)−1 )νN e kz2 (σ)kq,r + kz3 (σ)kHq,r,ν . 2 η(σ) 0 (3.11) In deriving these inequalities, we used the inequalities τ

dσ

keτ A2 Nˆ 2 (ξ, z2 , z3 )kq,r ≤ e−τ νN kNˆ 2 (ξ, z2 , z3 )kq−1,r , keρA3 Nˆ 3 (ξ, z2 , z3 )kHq,r,ν ≤ e−ρνN kNˆ 3 (ξ, z2 , z3 )kHq,r,ν , which follow from the bounds of Appendix B. If we now fix δ > 0 and define C2 (τ ) = C3 (τ ) =

sup eτ

0

(νN −δ)

kz2 (τ 0 )kq,r ,

0

(νN −δ)

kz3 (τ 0 )kHq,r,ν ,

0≤τ 0 ≤τ

sup eτ

0≤τ 0 ≤τ

then Eq. (3.11) leads to the inequality C2 (τ ) ≤ K1 + K2 ε C2 (τ ) + C3 (τ ) C3 (τ ) ≤ K3 + K4 ε C2 (τ ) + C3 (τ )

Z Z

τ

dσ e−(τ −σ)δ ,

0 τ

dσ 0

1 (η(τ )−2 −η(σ)−2 )νN (τ −σ)(νN −δ) e e . η(σ)2

If we insert into these integrals the definitions η(σ) = exp(−σ/2)η(0) ,

η(τ ) = exp(−τ /2)η(0) ,

we find that both integrals are uniformly bounded in τ ≥ 0 if η(0) is in a compact subinterval of (0, 1). The proof of Proposition 3.6 is complete. Thus, all solutions near the invariant manifold approach it exponentially fast in τ .


187

One can now show without difficulty that every solution approaches exponentially quickly a particular solution on the (approximate) invariant manifold x1 (τ ), η = 0, h∗2 (x1 (τ ), 0), h∗3 (x1 (τ ), 0) . This consists simply in translating pp.21–24 of [C] into the present setting and thus there is no need to repeat this argument here. If we combine these results with Proposition 3.5, we arrive finally at a description of the invariant manifolds which exist close to the origin for (3.8). Theorem 3.7. Fix N > 0 and δ > 0. There exist r > 0, q ≥ 1, and ν > 1/2, such that the system of equations (3.8) has an invariant, N + 1-dimensional manifold, given in a neighborhood of the origin by the graph of a pair of functions h∗2 : RN × R → Hq,r , and h∗3 : RN × R → Hq,r,ν . Any solution of (3.8) which remains in a neighborhood of the origin for all τ ≥ 0 approaches a solution of the N + 1-dimensional system of ordinary differential equations ∂τ x1 = A1 x1 + N1 (x1 , η, h∗2 (x1 , η), h∗3 (x1 , η)) , ∂τ η = − 21 η ,

(3.12)

which results from restricting (3.8) to this invariant manifold. Furthermore, the rate of approach to this manifold is O(exp(−τ (N/2 − δ))). Remark. This theorem almost suffices to prove Stability Theorem 1.2 . In particular, it emphasizes that in a neighborhood of the periodic solutions of (1.1) there exists a family of invariant manifolds, M2 , M3 , . . ., described in that theorem. The one remaining piece of the puzzle is to describe the behavior of solutions restricted to the invariant manifold, and that we do in the next section.

4. The Projection of the Non-Linearity onto Zero Momentum We have already shown that there exists a (smooth) invariant manifold, parameterized by (ξ, h∗2 (ξ), h∗3 (ξ)), where ξ = (x1 , η). This manifold satisfies Eq. (3.8), which, in the case of N = 1, i.e., in the case of a two-dimensional invariant manifold amounts to ∂τ x1 = N1 x1 , η, h∗2 (ξ), h∗3 (ξ) , ∂τ η = − 21 η , ∂τ h∗2 (ξ) = A2 h∗2 (ξ) + N2 x1 , η, h∗2 (ξ), h∗3 (ξ) , η 2 ∂τ h∗3 (ξ)) = A3 h∗3 (ξ) + N3 x1 , η, h∗2 (ξ), h∗3 (ξ) .

(4.1)

Note that because N = 1 the operator A1 equals zero (which is the highest eigenvalue of L). To understand the dynamics inside this invariant manifold, we now state and prove the following proposition, which is based on Schneider’s beautiful observation: Let N˜ 1 (x1 , η) be the r.h.s. of the first equation in (4.1), i.e., ∂τ x1 = N˜ 1 (x1 , η). Proposition 4.1. There is an x1,0 > 0 such that N˜ 1 (x1 , 0) = 0, for all |x1 | < x1,0 .Thus, the non-linearity vanishes identically at “infinite time,” which corresponds to η = 0. Before proving Proposition 4.1, we show that it implies the following important

188


Theorem 4.2. If x1 (0) is sufficiently close to 0, then there are a constant C < ∞ and an x∗1 such that (4.2) |x1 (τ ) − x∗1 | < Ce−τ /2 . Proof. Using the fact that η(τ ) = e−τ /2 , we can rewrite the equation for x1 as ∂τ x1 = N˜ 1 (x1 , e−τ /2 ) .

(4.3)

Since N˜ 1 is a smooth (at least C 1+α ) function with N˜ 1 (x1 , 0) = 0 in some neighborhood of the origin, there exists a constant CN > 0, such that |N˜ 1 (x1 , e−τ /2 )| ≤ CN exp(−τ /2), for |x1 | sufficiently small. Integrating (4.3) and applying this estimate yields: Z τf dσ N˜ 1 (x(σ), e−σ/2 ) |x1 (τf ) − x1 (τi )| = τi Z τf dσ e−σ/2 = 2CN e−τi /2 (1 − e(τi −τf )/2 ) . ≤ CN τi

This estimate immediately implies the behavior claimed in Theorem 4.2. Proof of Proposition 4.1. The basic idea is to relate N˜ 1 (x1 , 0) to the non-linear term of another problem, which is known to be 0. This other problem is the center manifold equation for the perturbations of a stationary solution of Eq. (1.1) restricted to a space of 2π-periodic functions. In this case, the equation analogous to Eq. (1.12) is ∂t v = Lper v + F (v) , where F (v) collects the non-linear terms in v. The spectrum of Lper is pure point, with a simple zero eigenvalue, and all others negative, and bounded away from 0. The eigenvector with 0 eigenvalue is u0ε , where uε is given by Eq. (1.5). If we call x1,per the coordinate in the u0ε direction, then there exists a one-dimensional center manifold, tangent to this direction and given as the graph of a function H(x1,per ). A very nice observation by Schneider is that this center manifold must coincide with the translates of the stationary state uε , which is formed of fixed points of the Swift-Hohenberg Eq. (1.1). Hence, on this center manifold we must have x˙ 1,per = 0. Using this information, the equations for this center manifold take a particularly simple form. Let Pper denote the projection onto u0ε and let Qper = 1 − Pper . Then the preceding discussion implies that the flow ψt,per is the identity on x1,per , and hence the equations for the invariant manifold read: x˙ 1,per = Pper F (x1,per , H(x1,per )) = 0 , Z 0 dτ e−Qper Lper Qper F (x1,per , H(x1,per )) H(x1,per ) = −∞

= − Qper Lper

−1

Qper F (x1,per , H(x1,per )) .

(4.4)

(4.5)

We now wish to use this information to prove Proposition 4.1. The rough idea is to show that (4.6) N˜ 1 (x1 , 0) = Pper F (x1,per , H(x1,per )) , and this quantity vanishes by Eq. (4.4). More precisely, we shall show:


189

Proposition 4.3. The cubic term in x1 of N˜ 1 (x1 , η) coincides in the limit η → 0 with the cubic term in x1 of Pper F (x1,per , H(x1,per )). All other terms in N˜ 1 go to 0 as η → 0. Remark. Since Pper F x1,per , H(x1,per ) = 0, this proves Eq. (4.6) and thus Proposition 4.1. Proof. The proof of Proposition 4.3 will be given in Appendix D. 5. Completion of the Proof of Stability Theorem 1.2 We now consider exactly how the results of the previous two sections about the behavior of solutions in, and near, the invariant manifold translate back into statements about solutions in terms of the original variables. We will focus specifically on the case considered in the previous section in which the invariant manifold is two-dimensional, with coordinates (x1 , η), but the results can be immediately extended to the case of a manifold of arbitrary dimension. Suppose we have a solution wτ = wτc + wτs , of the system (3.6), which remains in a neighborhood of the origin for all τ ≥ 0. This will be the case if its initial condition is sufficiently small in Hq,r ⊕ Hq,r,ν . We measure the size of w in the norm ||| · |||, which is the sum of the Hq,r norm of wc , and the Hq,r,ν norm of ws . By the results of Theorem 3.7, we know that there exists a solution, wτinv , on the invariant manifold such that (5.1) |||wτ − wτinv ||| ≤ Ce−τ (1/2−δ) , with δ > 0. In addition, from Theorem 4.2, we know that there exists some w∗ , which lies in the invariant manifold for which |||wτinv − w∗ ||| ≤ Ce−τ /2 .

(5.2)

Here, w∗ is the function whose coordinates in the invariant manifold representation is just the limiting point x∗1 in Theorem 4.2, i.e., w∗ = x∗1 , 0, h∗2 (x∗1 , 0), h∗3 (x∗1 , 0) . Combining (5.1) and (5.2), we see that for solutions that remain near the origin, there exists a function w∗ , for which |||wτ − w∗ ||| ≤ Ce−τ (1/2−δ) .

(5.3)

Our final task is now to untangle the various changes of variables which we made in the original equation. If we first “undo” the rescaling in (2.8), we see that the solution v(`, t), corresponding to w(·, τ ) = wτ is p v(`, t) = wc (sign(`) |3` |(t + t0 ), log(t + t0 )) p 1 s (4.4) w (sign(`) |3` |(t + t0 ), log(t + t0 )) + (t + t0 )1/2 ≡ v c (`, t) + v s (`, t) . One can make a corresponding decomposition of v ∗ , the solution corresponding to w∗ . First consider v c . From (5.3), one has Z c ∗,c 2 dp |(1−∂p2 )r/2 (1+p2 )q/2 (wc (p, τ )−w∗,c (p, τ ))|2 ≤ Ce−τ (1−2δ) . kwτ −wτ kq,r = (4.5)

190


According to (4.4), wc (`, τ ) = v c (8−1 (pe−τ /2 ), t), so substituting this expression – and the analog for w∗,c – into (4.5) one finds that the left hand side of that inequality is equal to: Z dp |(1 − ∂p2 )r/2 (1 + p2 )q/2 v c (8−1 (pe−τ /2 ), t) − v ∗,c (8−1 (pe−τ /2 ), t) |2 Z (4.6) ≥ dp |(1 + p2 )q/2 v c (8−1 (pe−τ /2 ), t) − v ∗,c (8−1 (pe−τ /2 ), t) |2 Z ≥ d` (t + t0 )1/2 80 (`) |(1 + (t + t0 )(8(`))2 )q/2 v c (`, t) − v ∗,c (`, t) |2 , where in the last integral we changed the integration variable to ` = 8−1 (pe−τ /2 ) = 8−1 (p(t + t0 )−1/2 ). Remark. We dropped the derivatives with respect to p in the second line of (4.6) for simplicity – one could retain them at the expense of complicating the following expressions. Since 8(x) ≈ x, for x small, and is equal to a constant times x for |x| large (due to the definition of 3` ), we see that combining (4.5) and (4.6) and recalling that t0 > 0, one finds: Z (4.7) d` |(1 + `2 )q/2 (v c (`, t) − v ∗,c (`, t))|2 ≤ Ct−3/2(1−2δ) . Analogous estimates hold for the “stable” part of the solution. Proceeding as above, one can show that Z X 2 ν d` |(1 + `2 )q/2 (v s (`, t) − v ∗,s (`, t))|2 ≤ Ct−5/2(1−2δ) . (1 + n ) (4.8) n

Thus, the “stable” part of a solution near the origin approaches the solution v ∗ on the invariant manifold faster than the “center” part of the solution. (An effect that is entirely in accord with one’s intuition.) We next take a closer look at the solution w∗ (or v ∗ ) on the invariant manifold. From the computation in the previous section, we know that since the eigenfunction in the x1 direction is exp(−p2 ), cf. Eq(2.14), we have w∗ (p) = c∗ exp(−p2 ) + h∗3 (c∗ exp(−p2 )). If we now rewrite this in terms of the v(`, t) variables, we find v ∗ (`, t) = c∗ e−3` t + t−1/2 h∗3 (c∗ e−3` t ) .

(4.9)

Thus, if v(`, t) is a solution of (1.12) (in the unscaled variables), we see from (4.7)–(4.9) that in the L2 ((1 + `2 )q/2 d`) norm, v(`, t) = c∗ e−3` t + O(t−1/2(1−2δ) ) .

(4.10)

But we know from Sect. 2 that 3` = c0 (ε2 )`2 + O(`3 ) for ` small, and 3` = c`2 , for |`| large, so one finds by an easy and explicit estimate that Z 2 2 (4.11) d` |(1 + `2 )q/2 (e−3` t − e−c0 (ε )` t )|2 ≤ Ct−1/2 . Combining (4.10) and (4.11) one has Proposition 4.4. If v is a solution of (1.12) with sufficiently small initial condition (in Hq,r ⊕ Hq,r,ν ), then


Z (

|(1 + `2 )q/2 (v(`, t) − c∗ e−c0 (ε

2

191

)| d`)1/2 ≤ Ct−1/4(1−2δ) .

)`2 t 2

Note that if we transform back to the (x, t) variables, this implies the asymptotic estimate in Stability Theorem 1.2 , and hence the proof of that theorem is complete. A. Bounds on the Non-Linearities In this section, we prove Proposition 3.1 and Proposition 3.2. We begin by studying the kernels K2 (`, k), and K3 (`, k) introduced in (2.7). Lemma A.1. There is a constant C such that

|K2 (`, k)| ≤ Cε min (|k|2 + |`|2 ), 1 .

Proof. By the definition of Eq. (2.7), we have Z K2 (`, k) = dx ϕ` (x)uε (x)ϕk (x)ϕ`−k (x) .

(A.1)

Since uε and ϕk are both uniformly bounded, we have immediately that |K2 (k, `)| ≤ Cε. The crucial observation of Schneider[Sch] is that because of Eq. (1.11), repeated here for convenience (A.2) ϕε,` (x) = u0ε (x) + i`gε (x) + hε,` (x)`2 , (with real gε ), K2 has an expansion Z 2 dx uε (x)(u0ε (x))3 + uε (x) u0ε (x) gε (x) −i` + ik + i(` − k) + εO(`2 + k 2 ) . (A.3) Note that the first term vanishes because u is a symmetric function and hence u(u0 )3 is odd, and the term which is linear in k and ` vanishes as well, because of momentum conservation, so the proof of Lemma A.1 is complete. Remark. Note that a similar calculation immediately shows that the kernel K3 satisfies: |K3 (`, k1 , k2 )| ≤ C . We now need the following auxiliary result: Lemma A.2. If ρ2 and ρ3 are in Hq,r , and if ρ1 = ρ1 (p, p0 ) is a C r function, then Z Ξ(p) = dp0 ρ1 (p, p0 )ρ2 (1(p, p0 , τ ))ρ3 (p0 ) is in Hq,r and

kΞkq,r ≤ Ckρ1 kC r kρ2 kq,r kρ3 kq,r .

Proof. Recall from Eq. (2.12) that 1(p, p0 , τ ) ≈ p − p0 , so we are really estimating a slightly distorted convolution. If 1(p, p0 , τ ) = (p − p0 ), the proof is easy using the

192


definition of the norms. In the present case, where 1(p, p0 , τ ) is not trivial, the result follows in a similar way by “undoing” part of the variable transformation which led from the variables `, k to the variables p, p0 . To simplify matters, we consider only the somewhat easier problem of bounding Z (A.4) dp0 80 (pe−τ /2 )ρ2 (1(p, p0 , τ ))ρ3 (p0 ) . Using the definition of 1(p, p0 , τ ) this is equal to Z dp0 80 (pe−τ /2 )ρ2 eτ /2 8−1 8(pe−τ /2 )−8(p0 e−τ /2 ) ρ3 eτ /2 8−1 8(p0 e−τ /2 ) . Changing variables to k = 8(e−τ /2 p) and ` = 8(e−τ /2 p0 ), we get Z d` eτ /2 ρ2 eτ /2 8−1 (k − `) ρ3 eτ /2 8−1 (`) .

(A.5)

(A.6)

We now define a function 9τ by 9τ (eτ /2 x) = eτ /2 8−1 (x) , and note that from 8(x) = x · 1 + O(x) it follows that 9τ (y) = y · 1 + O(e−τ /2 y) . We can rewrite Eq. (A.6) as Z d` eτ /2 ρ2 9τ (eτ /2 (k − `)) ρ3 9τ (eτ /2 `) . (A.7) We define next ρˆj (k) = ρj ◦ 9τ , and we see that Eq. (A.7) is equal to Z d` ρˆ2 k − ` ρˆ3 ` .

(A.8)

Thus, we can bound the Hq,r norm of Eq. (A.4) by kρˆ2 kq,r kρˆ3 kq,r , and, since 9τ is uniformly close to the identity for all τ , this is in turn bounded by const. kρ2 kq,r kρ3 kq,r . This proves Lemma A.2 in this special case. The extension to the general case is easy and is left to the reader. We now have the necessary tools to attack the proofs of Proposition 3.1 and Proposition 3.2. Proof of Proposition 3.1. If we write out the transformation leading to f2 , i.e., from Eq. (2.7) to Eq. (2.9), we get, using Eq. (2.10), Z P (τ ) 0 −τ /2 0 0 −τ /2 τ τ −τ /2 −τ /2 ) (p) = e 3χ 8(pe ) dp e 8 (p e ) e · f2 (w, e (A.9) −P (τ ) −τ /2 0 −τ /2 0 0 ×K2 8(pe ), 8(p e ) w(1(p, p , τ ))w(p ) , where

P (τ ) = 8−1 ( 21 )eτ /2 ≈ 21 eτ /2 . We bound |K2 8(pe−τ /2 ), 8(p0 e−τ /2 ) | by Cε|8(pe−τ /2 )2 + 8(p0 e−τ /2 )2 | using Lemma A.1. Since the expressions 8(pe−τ /2 ), and 8(p0 e−τ /2 ), in Eq. (A.9) are bounded, and 8(x) = x(1 + O(x)), we can extract another factor of e−τ /2 and get a bound on eτ f2 of the form


Z

193

dp0 |8(pe−τ /2 )| + |8(p0 e−τ /2 )| · |w(1(p, p0 , τ ))w(p0 )| −P (τ ) Z ∞ 0 dp |p + p0 ||w(1(p, p0 , τ ))w(p0 )| . ≤ const. χ 8(pe−τ /2 )

const. eτ /2 χ 8(pe−τ /2 )

P (τ )

−∞

(A.10) If w is in Hq,r , then with the aid of Lemma A.2, we can estimate the Hq−1,r norm of Eq. (A.9) by Ckwk2q,r . Note further, that from the above discussion it is also clear that eτ f2 (wc , e−τ /2 )(p) is also a smooth function of e−τ /2 . Remark. The factors |p|, |p0 | are responsible for the loss of one power in the norm estimate of Proposition 3.1. It is only in the study of the flow within the invariant manifold that we will need the second order bound of Lemma A.1. Remark. Note that the nonlinear terms depend (implicitly) on the constant t0 which entered the definition of the new temporal variable τ . However, all the estimates above (as well as those which follow in the proof of Proposition 3.2) are independent of this constant. The bound on f3 is similar, but no additional regularization is needed, since there are two integrations, each of which contributes a factor e−τ /2 . We leave this to the reader. The proof of the asserted bounds of Eq. (3.2) is complete. We now turn to the estimates of the nonlinear terms f4 and g. Because these terms involve the ws , we begin with a discussion of the appropriate function space for these components. These were defined in Sect. 3, but we repeat them here for convenience. Recall that wc ∈ Hq,r , while ws ∈ Hq,r ⊕ Hq,r,ν , where Hq,r,ν = {w = w(p; x) | w(p; x) = w(p; x + 2π), (1 − ∂x2 )ν/2 (1 − ∂p2 )r/2 (1 + p2 )q/2 w ∈ L2 (R × [−π, π])} . The fact that ws is an element of the direct sum of two spaces reflects the fact (see the paragraph preceding (2.6), and then (2.8) ) that it has two components, the first of which comes from the central branch of the spectrum of L` , but with ` localized away from zero, and the second component coming from the stable branches of the spectrum of L` . In a slight abuse of notation we will denote by kws kHq,r,ν the sum of the Hq,r norm of the first component of ws and the Hq,r,ν norm of the second component, and by kws kq,r,ν , we will mean the Hq,r,ν norm of just the second component. Remark. An easy fact which will be useful later is that if we expand w(p; x) ∈ Hq,r,ν in a Fourier series with respect to x, w(p; x) =

∞ X

einx wˆ n (p) ,

n=−∞

then the Hq,r,ν norm of w is equivalent to the norm kwk2Hq,r,ν =

∞ X

(1 + n2 )ν kwˆ n k2q,r .

(A.11)

n=−∞

Thus we will use the two norms interchangeably. Now consider eτ f4 (wc , ws e−τ /2 , e−τ /2 ) .

(A.12)

194


We shall concentrate on the most “dangerous” piece which is the quadratic term with one factor of wc and one of ws . Other terms are “less dangerous” in the sense that they contain either more factors of ws each of which contributes a small factor of e−τ /2 , or more convolutions which again contribute a factor of e−τ /2 . The quadratic piece of (A.12) has the form Z dx ϕ¯ 8(pe−τ /2 ) (x) uε (x) eτ 3χ 8(pe−τ /2 ) Z P (τ ) (A.13) dp0 e−τ /2 80 (p0 e−τ /2 )wc 1(p, p0 , τ ) × −P (τ )

× ϕ0(p,p0 ,τ ) (x) e−τ /2 ws (p0 ; x) . As we mentioned above, ws has two components – one in Hq,r , and one in Hq,r,ν . The contribution from the component in Hq,r is bounded by the same techniques used to control f3 – note that it is not necessary to extract any additional factors of e−τ /2 , since we get one from the integration, and one from the fact that each factor of ws is multiplied by e−τ /2 . Thus, we restrict our attention to the component of ws in Hq,r,ν , which is where the new ingredients are necessary. Interchanging the order of the x and p0 integrals, we use Lemma A.2, with ρ1 (p, p0 ) = sup |3χ 8(pe−τ /2 ) 80 (p0 e−τ /2 ) ϕ8(pe−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x)uε (x)| , x

ρ2 (r) = |wc (r)| , Z ρ3 (p0 ) = | dx ws (p0 ; x)| . Since ϕ` (x) and uε (x) are smooth, 2π-periodic functions of x, and kρ1 kC r is bounded, Lemma A.2 implies that the Hq,r norm of (A.13) is bounded by Z c Ckw kq,r k dx ws (·; x)kq,r . (A.14) The Hq,r norm of the integral can be bounded by sup kws (·; x)kq,r ≤ Ckws kHq,r,ν ,

(A.15)

x

provided ν > 1/2, where we used Sobolev’s inequality to estimate the supremum over x. Inserting (A.15) into (A.14) yields the bound claimed in (3.3). The remaining terms in f4 can be bounded in a similar fashion, but as noted above, they will tend to 0 as τ → ∞. In fact, they will be bounded by Cεe−τ /2 . Proof of Eq. (3.4) of Proposition 3.2. We finally bound the non-linear term eτ /2 g(wc , ws e−τ /2 , e−τ /2 ) .

(A.16)

In bounding eτ /2 g(wc , ws e−τ /2 ; e−τ /2 ), recall that just as ws did, this expression will have two components – one in Hq,r , and one in Hq,r,ν . The component in Hq,r is bounded using exactly the same techniques used to control the term f4 above, so we concentrate here on explaining the new ingredients necessary to bound the component in Hq,r,ν .


195

As in the bound on f4 , the potentially largest terms are those of minimal order, because each additional order provides a factor of e−τ /2 . So we look at the terms which are quadratic and which are of order wc wc , wc ws , and ws ws , respectively. The first term leads us to study Z 1/2 τ /2 ⊥ 0 c 0 c 0 0 0 dp V (p − p )V (p )ϕp (x) ϕp−p (x) . (A.17) e Pp uε (x) −1/2

Rescaling as in (2.8), we see we must bound Z P (τ ) ⊥ dp0 80 (p0 e−τ /2 ) P8(pe−τ /2 ) uε (x) −P (τ )

0

0

(A.18)

× ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) w (p ) w (1(p, p , τ )) . c

c

Note that the prefactor of eτ /2 has disappeared due to the factor of e−τ /2 which we gain as usual from the change of variables. Since the projection P`⊥ has bounded norm and is a smooth function of `, we can discard this factor at the price of introducing an overall constant in the estimate. Note next that the square of the Hq,r,ν norm of the remaining expression is equal to:

Z P (τ )

dp0 80 (p0 e−τ /2 )wc (p0 )wc (1(p, p0 , τ ))

−P (τ ) (A.19)

2

2 0 , × kuε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p ,τ ) (x) kH ν (dx) Hq,r (dp)

where the H ν norm is the H ν -Sobolev norm of the quantity uε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) , considered as a function of x, and the Hq,r norm is the norm of the resulting function of p. Since uε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) is a smooth function of x, p0 , and p, there exists a smooth, bounded function ψ(p, p0 ), such that ψ(p, p0 ) = kuε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) kH ν (dx) .

(A.20)

But now, by Lemma A.2, we can conclude that (A.19) is bounded by Z P (τ ) dp0 80 (p0 e−τ /2 )wc (p0 )wc (1(p, p0 , τ ))ψ(p, p0 )k2Hq,r (dp) ≤ Ck80 ψk2C r kwc k4q,r . k −P (τ )

(A.21) We next consider the quadratic term in g which contains one factor of wc and one factor of ws . In this case, the analog of (A.18) is Z P (τ ) ⊥ u (x) dp0 80 (p0 e−τ /2 ) e−τ /2 P8(pe −τ /2 ) ε −P (τ ) (A.22) × ϕ0(p,p0 ,τ ) (x) wc (1(p, p0 , τ ))ws (p0 ; x) . Note that in this case, we pick up an extra factor of e−τ /2 , in comparison with (A.18), since each factor of ws is multiplied by this exponential.

196


Once again, we must contend with the fact that ws has two components. However, the component in Hq,r behaves exactly as in the estimates leading to (3.2), so we concentrate on the component in Hq,r,ν . As above, the projection operator can be dropped at the cost of an overall constant, and we are left with the task of bounding the Hq,r,ν norm of the remainder. The square of this norm is equal to

Z P (τ )

dp0 80 (p0 e−τ /2 )wc (1(p, p0 , τ ))

−P (τ )

2

× kuε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) ws (p0 ; x)k2H ν (dx)

Hq,r (dp)

0

kC r kwc k2Hq,r

≤ Ck8

2

× kuε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) ws (p0 ; x)k2H ν (dx)

Hq,r (dp0 )

,

(A.23) by Lemma A.2. Note that the pair of norms on the last factor is equivalent to computing the square of the Hq,r,ν norm of uε (x) ϕ8(p0 e−τ /2 ) (x) ϕ0(p,p0 ,τ ) (x) ws (p0 ; x) .

(A.24)

Since uε , 8, ϕ0 , and 1 are all smooth, bounded functions, we see just by writing out the definition of the norm that this is bounded by Ckws k2Hq,r,ν .

(A.25)

If we estimate the term quadratic in ws in a similar fashion, and combine this estimate with that in (A.21) we see that the quadratic terms in e−τ /2 g(wc , ws e−τ /2 ; e−τ /2 ) are bounded in Hq,r,ν , by C(kwc kHq,r + e−τ /2 kws kHq,r,ν )2 .

(A.26)

Analogous estimates of the cubic terms lead to a bound Ce−τ /2 (kwc kHq,r + e−τ /2 kws kHq,r,ν )3 ,

(A.27)

where the additional factor of e−τ /2 comes from the additional convolution. Combining (A.26) and (A.27) leads to the estimate in (3.4) and completes the proof of Proposition 3.2. B. Bounds on the Linear Operators In this Appendix, we give bounds on the semi-group generated by L and on the linear evolution defined by Mexp(−τ /2) . B.1. Bound on the semi-group generated by L. We consider the semi-group whose generator is L = ∂x2 + 21 x∂x + 21 . Note that in this section, for ease of use, we define L in the Fourier transformed variables, compared to Sect. 2. Fourier transformation is an isomorphism from Hq,r (in the p-variables) to Hr,q (in the x-variables), so establishing estimates on the semigroup associated to ∂x2 + 21 x∂x + 21 in the space Hr,q (dx) will


197

immediately imply estimates on the representation of L in the p-variables in the space Hq,r (dp). In order to avoid confusion, in what follows we will denote by | · |q,r the norm on Hr,q (dx). With this notation, the norms k · kq,r and | · |q,r resp. the spaces Hq,r (dp) and Hr,q (dx) are equivalent. The integral kernel of the semigroup generated by L is given by [GJ] Z 2 1 τL τ /2 dz e−z /(4a(τ )) v(eτ /2 (x + z)) , e e v)(x) = √ 4πa(τ ) where a(τ ) = 1 − e−τ . If we denote by T the operator of multiplication by exp(x2 /8) and by H0 the harmonic oscillator Hamiltonian H0 = ∂x2 − x2 /16 + 1/4, (note the unconventional sign!), then L = T −1 H0 T . Thus, the two operators L and H0 are “the same,” but they act on two quite different spaces. If the {ϕj }j≥0 are the eigenfunctions of H0 , then the ψj = T −1 Pϕj are the eigenfunctions of L, with the same eigenvalues µj = −j/2. We let Pn f = j≤n ψj (ψj , f )q , where (·, ·)q is the scalar product (f, g)q = (T f, T (1 − L0 )q g) = (T f, (1 − H0 )q T g) . We next show that for n < q − 2, the operator Pn is bounded in Hr,q (dx). First of 2 all, the eigenfunctions ϕj are bounded by O(1)|x|j e−x /8 at large x. Therefore, we also have ψj = T −1 ϕj ∈ Hr,q (dx), since it decays exponentially. Finally, (ψj , f )q = (T ψj , (1 − H0 )q T f ) = |1 − µj |q (ϕj , T f ) , and the last scalar product is bounded if f ∈ Hr,q (dx) when r > j + 2, since, with a weight function W (x) = (1 + x2 )1/2 , |(ϕj , T f )| ≤ C|(W j , f )| ≤ C|(W −1 , W j+1 f )| ≤ CkW j+1 f k2 ≤ C|f |0,r . Thus Pn is defined. We let Qn = 1 − Pn (in Hr,q (dx)). Theorem B.1. For every δ > 0, there exists an m0 and a function r(m, q) such that for every m ≥ m0 , every q ≥ 1 and every r ≥ r(m, q), there is a C = C(q, r, m) < ∞ such that C(q, r, m) −τ (|µm |−δ) |v|q−1,r . (B.1) e |eτ L Qm v|q,r ≤ √ a(τ ) Remark. The function r(m, q) is of order O(m + q). Proof. To explain the strategy of the proof, we need some notation. Let Pn(0) denote the (0) projection in H0,q (dx) onto the subspace spanned by {ϕj }j≤n and let Q(0) n = 1 − Pn . (0) −1 (0) Then, formally, T Qn = Qn T , and LQn = T H0 Qn T . This suggests that L restricted to Qn has no spectrum in the half-plane {z | Re z > −|µn+1 |}, and thus one can understand the decay in Eq. (B.1). The square-root singularity at τ = 0 is related to our gain in smoothness. The problem is that T Qn = Q(0) n T is ill-defined. However, it will be well defined if we localize near x = 0. In that region, the heuristic argument will be seen to be valid, whereas in the complement of such a region, when |x| > R, decay will be shown by direct methods, using the explicit form of the integral kernel.

198


We study first the quantity χR eτ L , where χR is a smooth characteristic function which vanishes for |x| < R and is equal to 1 for |x| > 4R/3. Thus we study a region far from the origin. Our bound is Proposition B.2. For every q ≥ 1 and every r ≥ 0 there exists a C(q, r) < ∞ such that for all v ∈ Hr,q (dx) one has 2 C(q, r) τ q/2 −τ r/2 |χR eτ L v|q,r ≤ √ + e−3R /16 |v|q−1,r , e e a(τ ) 2 τL |χR e v|q,r ≤ C(q, r)eτ q/2 e−τ r/2 + e−3R /16 |v|q,r .

(B.2) (B.3)

Corollary B.3. For every q ≥ 1 and every r ≥ 0 there exists a C(q, r) < ∞ such that for all v ∈ Hr,q (dx) one has C(q, r) τ q/2 |eτ L v|q,r ≤ √ |v|q−1,r , e a(τ )

(B.4)

|eτ L v|q,r ≤ C(q, r)eτ q/2 |v|q,r .

(B.5)

Remarks. The improvement over [W] is that we “gain” a derivative in x. The corollary follows easily by repeating the proof of Proposition B.2 with R = 0. Proof. We let D = ∂x and denote, as before, by W the operator of multiplication by (1 + x2 )1/2 . Then X 0 |χR eτ L w|2q,r and kW r Dq χR eτ L wk22 q 0 ≤q

are equivalent. We shall only consider the term with the highest derivative, because only there is the issue of regularization important. Thus we are led to bound X 2 = kW r Dq χR eτ L wk22 . Since L = ∂x2 + 21 x∂x + 21 , a quick calculation shows that Dq eτ L = eτ q/2 eτ L Dq . The diverging factor exp(τ q/2) will appear in the final bound. Note now that Z 2 1 τL q τ /2 dz e−z /(4a(τ )) Dq v (eτ /2 (x + z)) , e D v (x) = √ e 4πa(τ ) which upon integrating by parts becomes Z 2 z 1 √ dz e−z /(4a(τ )) Dq−1 v (eτ /2 (x + z)) . 2a(τ ) 4πa(τ ) Use now the Schwarz inequality in the form (for positive f and g),

(B.6)


Z kf ∗ gk22 =

199

Z dx

Z

dz1 dz2 f (z1 )f (z2 )g(x − z1 )g(x − z2 )

dz1 dz2 f (z1 )f (z2 )kg(· − z1 )k2 kg(· − z2 )k2

≤ Z

2 dz f (z)kg(· − z)k2

=

.

This leads to a bound Z eτ q/2 |z| −z2 /(4a(τ )) X ≤ √ e dz kW r χR Dq−1 w (eτ /2 (· + z))k2 4πa(τ ) R1 ∪R2 2a(τ ) ≡ X1 + X2 ,

(B.7)

where we let R1 = {x : |x| < 7R/8} and R2 = R\R1 . To be more precise, we define χR by the scaling of a fixed function: χR (x) = χ(x/R). If R → ∞, then ∂x χR (x) = O(R−1 ) and therefore it is uniformly bounded. Lemma A.2 of [W] B.4. One has the bounds −rτ 2 Ce |v|0,r , kW r χR (·)v(eτ /2 (· + z))k22 ≤ C(1 + z 2 )r |v|20,r ,

if |z| ≤ 7R/8, if |z| > 7R/8.

(B.8)

Proof of Lemma B.4. Consider first the case |z| ≤ 7R/8. Since |x| > R on the support of χR , we have |x + z| ≥ |x|/8 and hence (1 + x2 )/ 1 + (eτ /2 |x + z|)2 ≤ const. e−τ . Using this, we bound Z dx (1 + x2 )r |χR (x)v(eτ /2 (x + z))|2 R1

Z dx

= R1

r (1 + x2 )r r · 1 + (eτ /2 |x + z|)2 |v(eτ /2 (x + z))|2 τ /2 2 1 + (e |x + z|)

≤ const. e−τ r e−τ /2 |v|20,r ≤ const. e−τ r |v|20,r . In the second case, we get Z dx (1 + x2 )r |χR (x)v(eτ /2 (x + z))|2 R2

= e

−τ /2

Z

1 + (e−τ /2 y − z)2 dy (1 + y 2 )r

r (1 + y 2 )r |v(y)|2

≤ const. e−τ /2 (1 + z 2 )r |v|20,r ≤ const. (1 + z 2 )r |v|20,r . The proof of Lemma B.4 is complete. Continuing the proof of Proposition B.2, we first bound the integral over R1 in Eq. (B.7). We get from the first alternative of Lemma B.4,

200


Z 2 1 |z| τ q/2 dz √ e e−z /(4a(τ )) kW r χR Dq−1 w (e−τ /2 (· + z))k2 4πa(τ ) 2a(τ ) R1 Z 2 1 |z| τ q/2 ≤ const. √ dz √ e e−z /(4a(τ )) e−τ r/2 |w|q−1,r 4πa(τ ) 2a(τ ) R1 1 τ (q/2−r/2) ≤ const. √ |w|q−1,r . e 4πa(τ )

X1 = √

Similarly, using the second alternative in Eq. (B.8), we get Z 2 1 |z| dz √ X2 = √ eτ q/2 e−z /(4a(τ )) kW r χR Dq−1 w (eτ /2 (· + z))k2 4πa(τ ) 2a(τ ) R2 Z 2 1 |z| ≤ const. √ dz (1 + z 2 )r/2 √ eτ q/2 e−z /(4a(τ )) |w|q−1,r . 4πa(τ ) 2a(τ ) R2 1 τ q/2 −3R2 /16 e |w|q−1,r , ≤ const. √ e 4πa(τ ) since 3/16 < (7/8)2 /4. Note that the constants above depend on r and q, but can be chosen uniformly for all R ≥ 1. The proof of Eq. (B.2) is complete. Omitting the integration by parts in Eq. (B.6), the assertion Eq. (B.3) follows in the same way. The proof of Proposition B.2 is complete. We next study eτ L Qn (1 − χR )w. We have the following bound Proposition B.5. For every δ > 0, q ≥ 1, and every r ≥ 0 there is a C(δ, q, r) < ∞ such that |eτ L Qn (1 − χR )w|q,r ≤

C(δ, q, r) −(|µn+1 |−δ)τ R2 /6 √ e |w|q−1,r . e a(τ )

(B.9)

Proof. Recall that T = ex /8 and that L = T −1 H0 T . The operator T (1 − χR ) is bounded 2 and kT (1 − χR )k ≤ const. eR /6 . Therefore we have 2

Qn T (1 − χR ) = (1 − Pn )T (1 − χR ) = T (1 − χR ) − T Pn(0) (1 − χR ) = T (1 − Pn(0) )(1 − χR ) = T Q(0) n (1 − χR ) , where Q(0) n is the orthogonal projection onto the complement of the subspace spanned by the first n eigenvalues of H0 in Hq,0 . It is easy to see that on Hr,q (dx), the operator (1 + x2 )1/2 (1 − H0 )−1/2 is bounded. Thus, we get, using the spectral properties of H0 (on Q(0) n ), |eτ H0 T Qn (1 − χR )w|q,r = τ −1/2 × |(1 − H0 )−1/2 eτ H0 (τ (1 − H0 ))1/2 Q(0) n T (1 − χR )w|q,r ≤ const. τ −1/2 |eτ H0 (τ (1 − H0 ))1/2 Q(0) n T (1 − χR )w|q−1,r ≤ const. τ −1/2 e−τ (|µn+1 |−δ) |T (1 − χR )w|q−1,r ≤ const. τ −1/2 e−τ (|µn+1 |−δ) eR

2

/6

|w|q−1,r . (B.10)

The proof of Proposition B.5 is complete.


201

End of proof of Theorem B.1. We first rewrite eτ L Qn as eτ L Qn = eτ L/2 Qn eτ L/2 = eτ L/2 Qn χR eτ L/2 + eτ L/2 Qn (1 − χR )eτ L/2 . The second term can be bounded by Proposition B.5 and Eq. (B.5) as 2 C eR /6−τ |µn+1 |/4 |eτ L/2 w|q−1,r a(τ ) 2 C ≤ √ eR /6−τ |µn+1 |/4 eτ q/4 |w|q−1,r . a(τ )

|eτ L/2 Qn (1 − χR )eτ L/2 w|q,r ≤ √

This quantity is bounded by √

C e−τ n/8 |w|q−1,r , a(τ )

(B.11)

provided n is much larger than q and R2 /6 < τ n/16. The first term can be bounded by Eq. (B.5) and Eq. (B.2) as C eτ q/4 |χR eτ L/2 w|q−1,r a(τ ) 2 C ≤ √ eτ q/2 e−τ r/2 + e−3R /16 |w|q−1,r a(τ ) C ≤ √ e−τ n/8 |w|q−1,r , a(τ )

|eτ L/2 Qn χR eτ L/2 w|q,r ≤ √

(B.12)

provided r ≥ n/4 + q and 3R2 /16 ≥ τ (n/8 + q/2). Note that the conditions on R from the first and second term are compatible Combining Eqs.(B.11)–(B.12), we get |eτ L Qn w|q,r ≤ √

C e−τ n/8 |w|q−1,r . a(τ )

(B.13)

It remains to improve the decay rate from n/8 to |µm+1 |. The idea is to just take n = 8(m + 1). Then we find eτ L Qm = eτ L Qn Qm + eτ L Pm Qm + eτ L (Pn − Pm )Qm .

(B.14)

The first term is bounded by Eq. (B.13), and m/8 > −|µn+1 |. The second term vanishes and the third is diagonalized explicitly: (0) )T Qm . eτ L (Pn − Pm )Qm = T −1 e−τ H0 T (Pn − Pm )Qm = T −1 e−τ H0 (Pn(0) − Pm

We are operating here on the finite dimensional subspace spanned by the eigenvectors ϕm+1 , . . . , ϕn , and there the technique of Eq. (B.10) yields a bound √

C p τ |µm+1 |e−|µm+1 |τ . a(τ )

Combining this with the bound on the first term in Eq. (B.14), we complete the proof of Theorem B.1.

202


B.2. The linear evolution generated by Mη,2 . In this section, we deal with the problem of giving bounds on the linear evolution generated by the operator Mη,2 , which is defined by ∞

Mη,2 = Mη,2,0 ⊕ ⊕ Mη,2,n , n=2

where

Mη,2,n =

2 2

ε − (1 + (in + i8(pη)) 2

− K 8(pη)

− η 2 21 p∂p .

We want to bound the solution Un,τ of the equation e−τ ∂τ Un,τ = Mexp(−τ /2),2,n Un,τ ,

(B.15)

with Un,0 = 1. Recall the definition of L = −p2 − 21 p∂p , and rewrite Mexp(−τ /2),2,n as Mexp(−τ /2),2,n 2 = ε2 − 1 + (in + i8(pe−τ /2 ))2 − K(8(pe−τ /2 )) − e−τ 21 p∂p 2 = ε2 − 1 + (in + i8(pe−τ /2 ))2 − K(8(pe−τ /2 )) + e−τ p2 + e−τ L = Xn (pe−τ /2 ) + e−τ L , where Xn (ξ) = ε2 − 1 + (in + i8(ξ))2

2

− K(8(ξ)) + ξ 2 . We want to solve Eq. (B.15):

e−τ ∂τ Un,τ = (e−τ L + Xn (pe−τ /2 ))Un,τ , with initial condition Un,0 = 1. Observe now that Xn is an operator of multiplication m by a function of pη. Since the commutator [pm , −p2 − 21 p∂p ] is equal to m 2 p , we find 1 0 [h(p), L] = 2 ph (p), and, furthermore, eh(p) L = (L + 21 ph0 (p))eh(p) . It follows that the solution of Eq. (B.15) is τ

Un,τ = e(e

−1)Xn (pe−τ /2 ) τ

e L,

as one can check by explicit computation. From the explicit form of Xn , (in particular, the factor of −n4 ), and the estimates derived in Theorem B.1, we see that for any xn ∈ Hq,r , we have kUn,τ xn kq,r ≤ C exp(−c0 (eτ − 1)n4 )eτ q/2 kxn kq,r . Combining this with the Remark of (A.11), we immediately obtain Lemma B.6. If Uτ satisfies e−τ ∂τ Uτ = Mexp(−τ /2),2 Uτ , with U0 = 1, then there exist a C(r, q, ν) > 0, and a c0 > 0 such that for any w ∈ Hq,r,ν , kUτ wkq,r,ν ≤ C exp(−ec0 τ /2 )kwkq,r,ν .

(B.16)


203

To complete the proof of Theorem 3.4, we also need an estimate of the semigroup generated by Mη,1 . This is simply obtained, however, because from (2.13) we see that Mη,1 = Mexp(−τ /2),2,1 , restricted to functions whose Fourier transform is supported away from the origin. Using this fact, and the explicit formula given above for Mexp(−τ /2),2,n , we see immediately that for any w1 ∈ Hq,r , there exists a constant c1 > 0, such that if Uτ,1 is the semigroup generated by Mη,1 one has kUτ,1 w1 kq,r ≤ Ce−c1 τ kw1 kq,r .

C. The Pseudo Center Manifold Theorem for the Singular System Eq. (3.8) In this section, we prove Theorem 3.7. Before we start with the proof, we wish to point out in which sense we are here confronted with a new problem, which does not allow for a straightforward application of results from the literature. If we write the system Eq. (3.8) in the form ∂τ x1 = A1 x1 + N1 (x1 , η, x2 , x3 ) , ∂τ η = − 21 η , ∂τ x2 = A2 x2 + N2 (x1 , η, x2 , x3 ) ,

(C.1)

∂τ x3 = η −2 A3,η x3 + η −2 N3 (x1 , η, x2 , x3 ) , then, in view of the spectral properties of Eq. (3.9), there is a “gap” between the “central” part (corresponding to x1 and η) and the “stable” part (corresponding to x2 , x3 ). The problem is that we are really dealing with a singular perturbation because the nonlinearity in the equation for x3 also diverges as η ↓ 0. This problem would be more easily overcome if A2 were bounded. In that case, for sufficiently small η, the spectra of A2 and η −2 A3 would not overlap, and we could define first an invariant manifold by “eliminating” x3 , and then the true invariant manifold by eliminating x2 from the equations obtained after elimination of x3 . However, since the spectra overlap for all values of η, we resort to a strategy which consists of a converging sequence of alternate eliminations of x2 and x3 . To define these successive eliminations, we consider two equivalent representations of Eq. (3.8), one being Eq. (C.1) above and the other being ∂t x1 = η 2 A1 x1 + N1 (x1 , η, x2 , x3 ) , ∂t η = − 21 η 3 ,

∂t x2 = η 2 A2 x2 + N2 (x1 , η, x2 , x3 ) ,

(C.2)

∂t x3 = A3,η x3 + N3 (x1 , η, x2 , x3 ) . We shall again omit the index η from A3 . We obtain Eq. (C.2) from Eq. (C.1) by rescaling the evolution parameter of the autonomous system as t + t0 = exp(τ ). (Note that time is really given by 1/η 2 − t0 , while we view t and τ as the evolution parameters of the vector fields.) We will call 8center the flow corresponding to Eq. (C.1) and 8stable the τ t flow corresponding to Eq. (C.2). A simple inspection of the definition of these flows yields the useful identity:

204

J.-P. Eckmann, C.E. Wayne, P. Wittwer stable 8center τ =log (y+t0 ) (ξ, x) = 8t=y (ξ, x) ,

(C.3)

where ξ = (x1 , η),

x = (x2 , x3 ) .

(C.4)

We shall use the relations (C.4) throughout. The identity (C.3) holds for all x1 , x2 , x3 and for η ≥ 0. Note that the initial conditions are given for the parameter t = 0 and the −1/2 parameter τ = log(t0 ), and that η(0) = t0 . Thus, η(0) is small if the parameter t0 has been chosen sufficiently large. (The bounds on the nonlinearities are uniform in t0 ≥ t∗0 as follows from the calculations.) Let h0 be a function of ξ. This function will always be an approximate invariant manifold for one of two problems. To define these problems, we first introduce two effective non-linearities Fj (h0 ; ξ, x2 ) = Nj x1 , η, x2 , h0 (ξ) , for j = 1, 2 , Gj (h0 ; ξ, x3 ) = Nj x1 , η, h0 (ξ), x3 , for j = 1, 3 . We then define two equations (corresponding to the two different time scales Eq. (C.1) and Eq. (C.2) of the same problem Eq. (3.8)): The first equation will be called the center system: ∂τ x1 = A1 x1 + F1 (h0 ; ξ, x2 ) , ∂τ η = − 21 η ,

(C.5)

∂τ x2 = A2 x2 + F2 (h0 ; ξ, x2 ) . Similarly, we define the stable system ∂t x1 = η 2 A1 x1 + η 2 G1 (h0 ; ξ, x3 ) , ∂t η = − 21 η 3 ,

(C.6)

∂t x3 = A3 x3 + G3 (h0 ; ξ, x3 ) . Assume now that h2 and h3 are two given functions of x1 and η. We define a map 0 h2 h2 7→ , F : h3 h03 through the following construction: We let h02 (ξ) be the function whose graph is the invariant manifold for the center system Eq. (C.5) with non-linearity Fj (h3 ; ξ, x2 ), and similarly we let h03 (ξ) be the function whose graph is the invariant manifold for the stable system Eq. (C.6) with non-linearity Gj (h2 ; ξ, x3 ). Our main result here is Proposition C.1. The map F has a fixed point (h∗2 , h∗3 ). This fixed point provides an invariant manifold for the system Eq. (3.8). Remark. We shall in fact show that F is a contraction in a suitable function space. In particular, we show that F n (0, 0), the n-fold iterate of F , converges to the limit (h∗2 , h∗3 ). The intuitive approach behind this construction is that the F n (0, 0) provide a sequence of successive approximations to invariant manifolds for Eqs.(C.6) and (C.5), in which the non-linearities at the nth step are given by the approximate solutions for the invariant ; ξ, x2 ) manifold problem of the other equation: The non-linearities are then Fj (h(n−1) 3 ; ξ, x ) (in Eq. (C.6)). (in Eq. (C.5)) and Gj (h(n−1) 3 2


205

Proof. That the systems of equations (C.5) and (C.6) have invariant manifolds follows from our estimates (given in Appendix B) on the semi-group generated by the linear operators A2 and A3 , and our estimates on the non-linear terms. (For expositions of this theory that are particularly relevant in the present context, see e.g., [H, M, G].) The functions h∗2 and h∗3 whose graphs define the invariant manifolds satisfy well known integral equations, see below. Fix h = (h2 , h3 ) and consider Eq. (C.5). We want to find the function h02 (h; ξ) which eliminates x2 . To construct h02 , we first consider the equation ∂τ x1 = A1 x1 + F1 h3 ; ξ, h2 (ξ) , (C.7) ∂τ η = − 21 η . This is a differential equation on a finite dimensional space and we let 92τ (ξ; h) denote the corresponding flow. (Of course, the η-component of this problem can be explicitly integrated.) We can then formulate the problem of finding the invariant manifold which eliminates x2 from Eq. (C.6) by looking at the map defined by h 7→ F2 (h) where Z 0 dτ e−A2 τ F2 h3 ; 92τ (ξ; h), h2 (92τ (ξ; h)) . (C.8) F2 (h) = −∞

(A particularly clear derivation of these equations can be found in [G].) In a similar way, we define the flow 93τ (ξ; h) for the equation ∂t x1 = η 2 A1 x1 + η 2 G1 h2 ; x1 , h3 (ξ) , (C.9) ∂t η = − 21 η 3 , and the map Z F3 (h) =

0 −∞

dt e−A3 t G3 h2 ; 93t (ξ; h), h3 (93t (ξ; h)) .

(C.10)

We now specify the function spaces in which we work. Recall that x1 ∈ RN , η ∈ R and that ξ ∈ RN +1 . We let E c = RN ⊕ R with the usual Euclidean norm. We also assume that E 2 and E 3 are the Banach spaces in which the x2 and x3 live. In our problem, these Banach spaces are the Hilbert spaces Hq,r and Hq,r,ν , but since we believe the present theory of singular vector fields may have further applications, we consider the more general case for the moment (see, for example, [W2]). These Banach spaces should have the C k extension property [BF]. The functions h2 and h3 will be Lipshitz functions from a ball of radius r in E 2 and E 3 , respectively. They satisfy hj (0) = 0 and are tangent at the origin to E j , for j = 2, 3. Thus, we define the metric spaces, for j = 2, 3: ˜ E j ≤ σkξ − ξk} ˜ . Hj,σ = {hj : E c → E j | hj (0) = 0, khj (ξ) − hj (ξ)k We also define a distance khj (ξ) − h˜ j (ξ)kEj , ρHj,σ (hj , h˜ j ) = sup kξk ξ6=0 and introduce the notation ˜ = ρH2,σ (h2 , h˜ 2 ) + ρH3,σ (h3 , h˜ 3 ) . ρHσ (h, h)

(C.11)

206


Standard results about the existence and uniqueness of solutions of systems of differential equations now imply that ˜ h)k ≤ Ceβ2 |τ | kξ − ξk ˜ , k92τ (ξ; h) − 92τ (ξ; while

(C.12)

˜ ≤ Ceβ2 |τ | ρHσ (h, h) ˜ , k92τ (ξ; h) − 92τ (ξ; h)k

(C.13)

for any β2 > (N − 1)/2. Analogous estimates hold for the flow 9 , though in that case one can choose any exponential growth rate β3 > 0, provided |η| is sufficiently small. This is due to the presence of the factor of η 2 A1 in the first equation of Eq. (C.6). With this in mind we define two more metric spaces (for j = 2, 3): 3

Kj,βj ,Dj = {9τ : R+ × E c × H2,σ × H3,σ → E c | 90 (ξ; h) = ξ, 9τ (0; h) = 0, 9τ is C 1 in τ, ˜ h)k ˜ k9τ (ξ, h) − 9τ (ξ,

˜ ˜ + ρHσ (h, h)kξk ≤ Dj eβj |τ | kξ − ξk },

with a corresponding Lipshitz metric ˜ = k9 − 9k ˜ Kj , dj (9, 9) where k9kKj = sup sup t≥0

ξ∈E c ξ6=0

(C.14)

eβj t k9t (ξ)k . kξk

These spaces are modeled on those used in [EW]. Remark. Since we are interested in local invariant manifolds, we will assume that the non-linear terms have been cut off outside a ball of radius r in each of their arguments. Since in the applications of this paper all our functions are elements of Hilbert spaces, we can assume that there exist smooth cut-off functions which are equal to 1 inside a ball of radius r/2 and are equal to zero outside a ball of radius r, and we multiply each of the non-linear terms in Eq. (3.8) by such a cutoff. For example, in Eq. (C.6), we certainly need to cutoff the function η 2 by η 2 χ(η) (where χ is the cutoff function) to avoid blowup problems. Given this setup, we show that the map F is a contraction of H2,σ × H3,σ . In terms of the notation given above F is now defined as F (h) = F2 (h), F3 (h) . One must first show that F maps this space to itself. This step is however an easy variant of the argument which shows that F is a contraction, and we leave it as an exercise to the reader. To show that F is a contraction, we use the maps (C.8) and (C.10). Then we see that the “j” component, j = 2, 3, of F(h2 , h3 )(ξ) − F (h˜ 2 , h˜ 3 )(ξ) is given by Z 0 −Aj τ ˜ (C.15) Uj (h; ξ, τ ) − Uj (h; ξ, τ ) , dτ e 1j = −∞

where

U2 (h; ξ, τ ) = F2 h3 ; 92τ (ξ; h), h2 92τ (ξ; h) = N2 92τ (ξ; h), h2 92τ (ξ; h) , h3 92τ (ξ; h) , U3 (h; ξ, τ ) = G3 h2 ; 93τ (ξ; h), h3 93τ (ξ; h) = N3 93τ (ξ; h), h2 93τ (ξ; h) , h3 93τ (ξ; h) ,


207

cf. Eqs.(C.5), (C.6). Consider now 12 . From the estimates on the non-linear term N2 in Eq. (3.8), we see that F2 is a multi-linear function of its arguments. Thus, we can estimate the difference in the integrand of 12 by the sum of the differences in the arguments of F2 , multiplied by the Lipshitz constant of F2 . Because we have cutoff F2 outside a ball of radius r, this Lipshitz constant can be made arbitrarily small by making r sufficiently small. Thus, calling this Lipshitz constant `2 (r), we see from the estimates on eA2 t which follow from the results of Appendix B and from Eqs.(C.11)–(C.14) that Z ∞ C ˜ dτ √ e−N τ /2 `2 (r) ρHσ (h, h) k12 kE 2 ≤ τ 0 ˜ + kh2 92τ (ξ, h) − h˜ 2 92τ (ξ, h) ˜ kE 2 + k92τ (ξ, h) − 92τ (ξ, h)k Z ∞ C ˜ ≤ dτ √ e−N τ /2 `2 (r) ρHσ (h, h) τ 0 β2 τ β2 τ ˜ ˜ ˜ . + ρHσ (h, h)Ce ≤ const. `2 (r)ρHσ (h, h) + ρHσ (h, h)Ce Thus, we have shown that F is a contraction. We next consider the manifold M given by (ξ, h∗2 (ξ), h∗3 (ξ)) – where x1 is in a small neighborhood of 0 and η is in a small positive interval 0 ≤ η ≤ η0 . We want to show that M is indeed an invariant manifold for the full system Eq. (3.8). From this it follows, since the flows 8stable and 8center are equivalent, up to rescaling of time, that M is also an invariant manifold for Eqs.(C.1) and (C.2). If we set x2 = h∗2 (ξ) and x3 = h∗3 (ξ), then the third equation of Eq. (C.2) is satisfied because the third equation, when restricted to the manifold x2 = h∗2 is just the second equation of the stable system Eq. (C.6). with non-linearity G3 (h∗2 ; . . .). To see that the remaining equations are satisfied just note that the first, second and fourth equations in the full system Eq. (3.8) become, after rescaling of time, x˙ 1 = A1 x1 + N1 (x1 , η, x2 , x3 ) , η˙ = − 21 η 3 , x˙ 2 = A2 x2 + N2 (x1 , η, x2 , x3 ) , h∗2

and if we set x2 = and x3 = h∗3 , we see that we are just on the invariant manifold for the center system Eq. (C.5). Hence, we have found the invariant manifold for the full system Eq. (3.8).

D. The Vanishing of the Non-Linearity at Zero Momentum In this Appendix, we prove Proposition 4.3. This proof is essentially a scaling argument. We shall study the nonlinearity N1 (x1 , η, x2 , x3 ) and we restrict it to the invariant manifold, i.e., we replace it by N˜ 1 (x1 , η) and let η go to 0. In particular, we shall show that only one term survives, namely the one which is cubic in x31 , and all others go to 0 as η → 0. To prove this, we will analyze the nonlinearities Nj term by term, using their definitions as given in Eqs.(3.6) and (3.8). Recall again that A1 = 0 since we are considering here the projection onto the first eigenvalue of L. In Eq. (3.6), the nonlinearities are given by the terms f2 , f3 , f4 , and g, and these have been bounded in Proposition 3.1 and

208


Proposition 3.2. Recall finally that every factor of wc contributes a factor of e−τ /2 = η and every factor of ws contributes a factor of e−τ = η 2 to these bounds. Begin by considering the contribution from f2 . According to Eq. (A.9), we can extract another factor of η from Eq. (A.10), by using the quadratic nature of K2 , cf. Lemma A.1. Using Proposition 3.1 and Proposition 3.2, we see that the only contributions from f3 , f4 , and g which do not vanish as η → 0 are those of the type (wc )3 in f3 , of the type ws (wc )2 in f4 , and of the type (wc )2 in g. We start by analyzing f3 . If we write it out, we find Z dx ϕ¯ 8(pη) (x) η −2 f3 (wc ) (p) = η −2 χ 8(pη) Z ×η

2

η −1 8(1/2)

dp1 dp2 η −1 8(−1/2)

80 (p1 η)80 (p2 η)

× ϕ8(p1 η) (x) ϕ8(p2 η) (x) ϕ8(pη)−8(p1 η)−8(p2 η) (x)

× wc (p1 ) wc (p2 ) wc η −1 8−1 (8(pη) − 8(p1 η) − 8(p2 η)) ,

cf. Eq. (2.11). Upon taking η → 0, this converges to Z Z χ(0) dx ϕ¯ 0 (x)ϕ30 (x) dp1 dp2 wc (p1 )wc (p2 )wc (p − p1 − p2 ) .

(D.1)

Analogous arguments can be used to discuss the “surviving” terms of f4 and g. We just summarize the steps analogous to the calculation of f3 . One gets, as η → 0, Z Z −2 c s f4 (w , w η, η) (p) → 6χ(0) dx ϕ¯ 0 (x)uε (x)ϕ0 (x) dp0 wc (p0 )ws (p − p0 ; x) , η (D.2) and η −1 g(wc , ws η, η) (p) → −3uε (x)ϕ20 (x)

Z

dp0 wc (p − p0 )wc (p0 ) .

(D.3)

We next study these limiting expressions in the basis {ψn (p)}∞ n=0 of eigenfunctions of L = −p2 − 21 p∂p . Then we can write wc (p) as x1 ψ0 (p) +

∞ X

x(n) 2 ψn (p) .

(D.4)

n=1 ∗,(n) The crucial remark is now that on the invariant manifold, x(n) , 2 will be replaced by h2 s ∗ ∗ and similarly w will be equal to h3 . We now compute the limiting forms of h2 and h∗3 , and then we substitute these values in Eqs.(D.1)–(D.3). Consider the equation for h∗3 . Then from Eq. (C.9), we have

∂t x1 = η 2 G1 (h∗2 ; x1 , h∗3 (ξ)) , ∂t η = − 21 η 3 , because we are considering the case N = 1 where the linear part vanishes. We also have from Eq. (C.10), Z 0 dt e−A3,η t G3 h∗2 ; 93t (x1 , η; h∗ ), h∗3 (93t (x1 , η; h∗ )) . (D.5) h∗3 (x1 , η) = −∞


209

Now, when η = 0, we have 93t (ξ; h) = 93t (x1 , 0; h) = x1 , and this reduces to h∗3 (x1 , 0)

Z

0

dt e−A3,0 t G3 h∗2 ; x1 , 0, h∗3 (x1 , 0) −∞ ∗ ∗ = −A−1 3,0 G3 h2 ; x1 , 0, h3 (x1 , 0) . =

(D.6)

Note next that for η = 0 we have A3,0 = M0 , cf. Eq. (3.8), and this means A3,0 = Qper Lper . We denote by ξn (x) the eigenfunctions and by σn the eigenvalues of Qper Lper . Using Eq. (1.9) and Theorem 1.1, we see that σn = λ`=0,n−1 and therefore they are given by σ1 = −O(ε2 ) and σn ≈ −(1 − (n − 1)2 )2 , when n 6= 1. Then the nth component (in this basis) of h∗3 (at η = 0) is given by Z Z dp0 wc (p − p0 )wc (p0 ) , (D.7) h3∗,(n) (p) = −σn−1 · −3 dx ξ¯n (x)uε (x)ϕ20 (x) since all other terms vanish in the limit η → 0. We next substitute the value Eq. (D.4) for wc and set x2 = h∗2 in Eq. (D.7), and get Z ∗,(n) 2 −1 h3 (p) = −x1 σn · −3 dx ξ¯n (x)uε (x)ϕ20 (x) Z × dp0 ψ0 (p0 ) ψ0 (p − p0 ) + O(x1 h∗2 + (h∗2 )2 ) . Next, we replace ws in Eq. (D.2) with h∗3 , and in that same equation make the substitution for wc that we used above, and we find: 18x31

∞ X

σn−1

n=0

Z

Z

dx ξ¯n (x)uε (x)ϕ20 (x)

dx0 ϕ¯ 0 (x0 )uε (x0 )ξn (x0 )

Z ×

dp1 dp2 ψ0 (p1 )ψ0 (p2 )ψ0 (p − p1 − p2 ) +

O(x1 h∗2

+

(h∗2 )2 )

(D.8) .

Thus we see that the only terms which survive in N1 and N2 in the limit η → 0 result from adding together Eqs.(D.8) and (D.1). We obtain Z X = x31 Z × Z ×

dx ϕ¯ 0 (x)ϕ30 (x) + 18

∞ X n=0

σn−1 (

Z

dx00 ϕ¯ 0 (x00 ) uε (x00 ) ξn (x00 )

dx0 ξ¯n (x0 )uε (x0 )ϕ20 (x0 )

(D.9)

dp1 dp2 ψ0 (p1 ) ψ0 (p2 ) ψ0 (p − p1 − p2 ) .

This coefficient will turn out to be exactly the same as that which appears below as the coefficient of the cubic terms in the center manifold in the periodic case, and since we know that in the periodic case this coefficient (and indeed, the entire nonlinear term) is zero, it must vanish in the present case as well. The only remaining point in the proof of Proposition 4.3 is the computation of the coefficient of the cubic term in the equation in the center manifold in the periodic case, and we do that in the following subsection.

210


Remark. The above argument might seem incomplete since it ignores the O(x1 h∗2 +(h∗2 )2 ) error terms in (D.8). In fact, those terms vanish for x1 small. To see why, note that our computations of the η → 0 limit of f2 , f3 , f4 and g apply also to the nonlinear term N2 (x1 , η, h∗2 (ξ), h∗3 (ξ)) in the equation for h∗2 in (4.1). Thus, in the η → 0 limit h∗2 satisfies: ∂x1 h∗2 (x1 , 0)N˜ 1 (x1 , 0) = A2 h∗2 (x1 , 0) + N2 (x1 , 0, h∗2 (x1 , 0), h∗3 (x1 , 0)) . Using the estimates on h∗2 and h∗3 derived above, we see that this equation implies h∗2 (x1 , 0) = 0 for all x1 sufficiently small, and hence the error terms in (D.8) vanish. D.1. The non-linearity in the periodic case. In this subsection we compute the explicit form of the non-linearity (which we know to be 0 because the invariant manifold is made up of fixed points in this case). But this explicit form will allow us to compare it with the expression obtained in Eq. (D.9) so that the proof of Proposition 4.3 will be complete. We start from the equation ∂τ v = Lper v − 3uε v 2 − v 3 .

(D.10)

Let y0 be the component of v in the direction of the highest eigenvalue, σ0 = 0, of Lper , and yn , the projection onto the directions ξn , defined after Eq. (D.6), associated to the eigenvalues σn . Then the invariant manifold can be written in the form yn = Yn (y0 ) ,

n = 1, 2, . . . .

(D.11)

Using the fact that the eigenfunction with eigenvalue 0 is u0ε , we can decompose v as: v(x) = y0 u0ε (x) +

∞ X

ξn (x)Yn (y0 ) ,

(D.12)

n=1

the projection of Eq. (D.10) onto the invariant manifold leads to Z ∂τ y0 = − dx u0ε (x) 3uε (x)v(x)2 + v(x)3 .

(D.13)

Note that there is no linear term because σ0 = 0. We are interested in the exact form of the cubic term in y0 on the r.h.s. of Eq. (D.13). There are two contributions, one from v 3 , leading to Z 3 −y0 dx u0ε (x)4 , (D.14) and one from the quadratic non-linearity: Z ∞ X Yn(2) (y0 ) dx u0ε (x) uε (x) u0ε (x) ξn (x) . Y = −6y0

(D.15)

n=1

Here, Yn(2) (y0 ) is the quadratic term in y0 of Yn . Substituting Eq. (D.13) into the equation for Yn , we find the perturbative result: Z (2) 2 −1 Yn (y0 ) = y0 · 3σn dx ξ¯n (x) uε (x) u0ε (x)2 .


Inserting into Eq. (D.15), it is seen to become Z Z ∞ X σn−1 dx u0ε (x)2 uε (x) ξn (x) dx0 ξ¯n (x0 ) uε (x0 ) u0ε (x0 ) . Y = −y03 18

211

(D.16)

n=1

Combining Eqs.(D.14) and (D.16), we get the desired result, namely that the cubic nonlinearity in the periodic case coincides with the quantity X of Eq. (D.9), provided we recall that ϕ0 = u0ε . This completes the proof of Proposition 4.3. Acknowledgement. This work was begun while J-P E was a visitor at the Pennsylvania State University. It was completed during a visit of CEW to the University of Geneva. The support of the Shapiro Visitors Fund at Penn State and the hospitality of the Department of Theoretical Physics at the University of Geneva are gratefully acknowledged. In addition, the authors’ research is supported in part by the Fonds National Suisse and the National Science Foundation Grant DMS-9501225.

References [BF]

Bonic, R. and Frampton, J.: Smooth functions on Banach manifolds. J. Math. Mech. 15, 877–898 (1966) [BK] Bricmont, J. and Kupiainen, A.: Stable Non-Gaussian Diffusive Profiles. Nonlinear Analysis 26, 583– 593 (1995) [BKL] Bricmont, J., Kupiainen, A., and Lin G.: Renormalization Group and Asymptotics of Solutions. Comm. Pure Appl. Math. 47, 893–922 (1994) [C] Carr, J.: The Centre Manifold Theorem and its Applications. New York: Springer-Verlag, 1983 [CE] Collet, P. and Eckmann, J.-P.: Instabilities and Fronts in Extended Systems. Princeton, NJ: Princeton University Press 1990 [CEE] Collet, P., Eckmann, J.-P., and Epstein, H.: Diffusive repair for the Ginzburg-Landau equation. Helv. Phys. Acta 65, 56–92 (1992) [CR] Crandall, M. and Rabinowitz, P.: Bifurcation from simple eigenvalues. J. Funct. Analysis 8, 321–340 (1971) [E] Eckhaus, W.: Studies in non-linear stability theory. Springer tracts in Nat. Phil. 6, Berlin, Heidelberg, New York: Springer. (1965) [EW] Eckmann, J.-P. and Wayne, C.E.: Propagating Fronts and the Center Manifold Theorem. Commun. Math. Phys. 136, 285–307 (1991) [G] Gallay, Th.: A center-stable manifold theorem for differential equations in Banach spaces. Commun. Math. Phys. 152, 249–268 (1993) [GJ] Glimm, J. and Jaffe, A.: Quantum Physics, A Functional Integral Point of View. New York: SpringerVerlag 1981 [H] Henry, D.: Geometric Theory of Semilinear Parabolic Equations. Lecture Notes in Mathematics 840, New York: Springer-Verlag 1981 [L] Levine, H.A.: The role of critical exponents in blow up theorems. SIAM Review 32, 262–288 (1990) [M] Mielke, A.: A new approach to sideband-instabilities using the principle. In: Nonlinear Dynamics and Pattern Formation in the Natural. Doelman, A., van Harten, A., eds. UK: Longman 1995, pp. 206–222 [Sch] Schneider, G.: Diffusive stability of spatial periodic solutions of the Swift-Hohenberg Equation. Commun. Math. Phys. 178, 679–702 (1996) [W] Wayne, C.E.: Invariant manifolds for parabolic partial differential. Arch. Rat. Mech., To appear [W2] Wayne, C.E.: Invariant manifolds and the asymptotics of parabolic. Proceedings of the China/US conference on differential equations and applications, Hangzhou, PRC, July 1996. To appear Communicated by A. Kupiainen


Commun. Math. Phys. 190, 213 – 245 (1997)

Communications in


Melnikov Potential for Exact Symplectic Maps Amadeu Delshams, Rafael Ram´ırez-Ros Departament de Matemàtica Aplicada I, Universitat Politècnica de Catalunya, Diagonal 647, 08028 Barcelona, Spain. E-mail: [email protected]; [email protected] Received: 6 June 1996 / Accepted: 16 April 1997

Abstract: The splitting of separatrices of hyperbolic fixed points for exact symplectic maps of n degrees of freedom is considered. The non-degenerate critical points of a real-valued function (called the Melnikov potential) are associated to transverse homoclinic orbits and an asymptotic expression for the symplectic area between homoclinic orbits is given. Moreover, if the unperturbed invariant manifolds are completely doubled, it is shown that there exist, in general, at least 4 primary homoclinic orbits (4n in antisymmetric maps). Both lower bounds are optimal. Two examples are presented: a 2n-dimensional central standard-like map and the Hamiltonian map associated to a magnetized spherical pendulum. Several topics are studied about these examples: existence of splitting, explicit computations of Melnikov potentials, transverse homoclinic orbits, exponentially small splitting, etc. 1. Introduction In a previous work [DR96], the authors were able to develop a general theory for perturbations of an integrable planar map with a separatrix to a hyperbolic fixed point. The splitting of the perturbed invariant curves was measured, in first order with respect to the parameter of perturbation, by means of a periodic Melnikov function M defined on the unperturbed separatrix. In case of area preserving perturbations, M has zero mean and therefore there exists a periodic function L (called the Melnikov potential) such that M = L0 . Consequently, if L is not identically constant (respectively, has nondegenerate critical points), the separatrix splits (respectively, the perturbed curves cross transversely). Moreover, under some hypothesis of meromorphicity, the Melnikov potential is elliptic and there exists a Summation Formula (see the Appendix) to compute it explicitly. The aim of this paper is to develop a similar theory for more dimensions. The natural frame is to consider exact symplectic perturbations of a 2n-dimensional exact map with a n-dimensional separatrix associated to a hyperbolic fixed point.

214

A. Delshams, R. Ram´ırez-Ros

Exact symplectic maps F : P → P are defined on exact manifolds, i.e., 2n-dimensional manifolds P endowed with a symplectic form ω which is exact: ω = − dφ; and they are characterized by the equation F ∗ φ − φ = dS for some function S : P → R, called the generating function of F . The typical example of an exact symplectic manifold is provided by a cotangent bundle T ∗ M, together with the canonical forms φ0 , ω0 , which in cotangent coordinates (x, y) read as φ0 = y dx, ω0 = dx ∧ dy. Typical exact symplectic maps are the socalled twist maps, which satisfy F ∗ (y dx) − y dx = Y dX − y dx = dL(x, X), where (X, Y ) = F (x, y). The fact that the generating function S can be written in terms of old and new coordinates: S(x, y) = L(x, X), is the twist condition that gives the name to these maps. The function L is called a twist generating function. As in [Eas91], we will not restrict ourselves to this typical case, since the results to be presented in this paper are valid on arbitrary exact symplectic manifolds and the twist condition is not needed. The exact symplectic structure plays a fundamental role in our construction, since it allows us to work neatly with geometric objects. For example, it is used to introduce two homoclinic invariants: the action of a homoclinic orbit and the symplectic area between two homoclinic orbits, called simply the homoclinic area. Namely, let p∞ ∈ P be a hyperbolic fixed point of F , which lies in the intersection of the n-dimensional invariant manifolds W u,s . Given a homoclinic orbit O = (pk )k∈Z of F , i.e., O ⊂ (W u ∩ W s ) \ {p∞ } and F (pk ) = pk+1 , we define the homoclinic action of the orbit O as X W [O] := S(pk ), k∈Z

where, in order to get an absolutely convergent series, the generating function S has been determined by imposing S(p∞ ) = 0. Given another homoclinic orbit O0 of F , the homoclinic area between the two homoclinic orbits O, O0 is defined as the difference of homoclinic actions 1W [O, O0 ] := W [O] − W [O0 ]. These two objects are symplectic invariants, i.e., they neither depend on the symplectic coordinates used, nor on the choice of the one-form φ. It is worth noting that in the planar case, the homoclinic area is the standard (algebraic) area of the lobes between the invariant curves [MMP84, Mat86, Eas91] and also measures the flux along the homoclinic tangle, which is related to the study of transport [MMP84, RW88, Mei92]. The unperturbed role will be played by an exact symplectic diffeomorphism F0 : P → P, defined on a 2n-dimensional exact manifold P, which possesses a hyperbolic fixed point p∞ and a n-dimensional separatrix 3 ⊂ W0u ∩ W0s , where W0u,s denote the invariant manifolds associated to p∞ . Consider now a family of exact symplectic diffeomorphisms {Fε }, as a general perturbation of the situation above, and let Sε = S0 + εS1 + O(ε2 ) be the generating function of Fε . The main analytical results of this paper are stated and proved in Sect. 2. There, the Melnikov potential is introduced as the real-valued smooth function L : 3 → R given by X pk = F0k (p), Sb1 (pk ), L(p) := k∈Z

where Sb1 : P → R is defined as Sb1 (p) = S1 (p) − φ(F0 (p))[F1 (p)], and F1 is the first order variation in ε of the family {Fε }, that is, F1 (p) = [∂Fε (p)/∂ε]|ε=0 . Obviously, S1 is determined by imposing Sb1 (p∞ ) = 0, in order to get an absolutely convergent series. In Theorem 2.1 it is established that

Melnikov Potential for Exact Symplectic Maps

215

(i) the Melnikov potential L is F0 -invariant: L ◦ F0 = L, (ii) if L ≡ 6 constant, the perturbed invariant manifolds Wεu,s split for 0 < |ε| 1, (iii) the non-degenerate critical points of L are associated to transverse intersections of the perturbed invariant manifolds, (iv) the above-mentioned homoclinic invariants are given in first order by L. As a matter of fact, the perturbed homoclinic orbits detected by the Melnikov potential are all of them primary homoclinic orbits Oε of Fε , i.e., they are smooth in ε for |ε| small enough. The Melnikov potential admits several reformulations. For example, if Fε is a twist map on a cotangent bundle T ∗ M, with twist generating function Lε = L0 + εL1 + O(ε2 ), Sb1 has the simple form Sb1 (p) = L1 (π(p), π(F0 (p))), where π : T ∗ M → M is the natural projection. Consequently, the Melnikov potential reads as [DRS97] X L1 (xk , xk+1 ), xk = π(pk ), L(p) = k∈Z

where L1 is determined by imposing L1 (x∞ , x∞ ) = 0, and x∞ = π(p∞ ). Another interesting situation, that allows us to compare the continuous and discrete frames, is to consider Hamiltonian maps. Let Hε : P × R → R be a time-periodic Hamiltonian of period T , and Fε = 9Tε , where 9tε (p) is the solution of the associated Hamiltonian equations with initial condition p at t = 0. If Hε = H0 + εH1 + O(ε2 ), then RT Sb1 (p) = − 0 H1 (9t0 (p), t) dt, so the Melnikov potential takes the form (already known to Poincaré) Z L(p) = − H1 (9t0 (p), t) dt, R

where H1 is determined by imposing H1 (9t0 (p∞ ), t) ≡ 0, or simply H1 (p∞ , t) ≡ 0, if H0 is autonomous. An essential ingredient for the proof of Theorem 2.1 is the fact that the invariant manifolds Wεu,s are exact Lagrangian immersed submanifolds of P and therefore can be expressed in terms of generating functions Lu,s ε . The Lagrangian property of the invariant manifolds was already noticed by Poincaré [Poi99] for flows, although we learned it for maps from E. Tabacman [Tab95], as well as the expression for the invariant manifolds given in Proposition 2.1, in the twist frame. The relationship between Lu,s 1 and S1 , the first order variations in ε of the generating functions Lu,s ε and Sε , gives then the formula for the Melnikov potential. The tools utilized are very similar to those of D. Treschev [Tre94]. However, D. Treschev considers autonomous Hamiltonian flows, and the conservation of energy makes easier the deduction of the continuous version of Eq. (2.5). In that frame (Hamiltonian-Lagrangian flows), it is worth noting that a variational approach to the Melnikov method was carried out by S. Angenent [Ang93] for Hamiltonian systems with 1 21 degrees of freedom, and that a mechanism for finding homoclinic orbits in positively definite symplectic diffeomorphisms is due to S. Bolotin [Bol95], based on interpolating them by Hamiltonian flows. Section 2 contains also some remarks on the non-symplectic case: a vector-valued Melnikov function M is then defined, whose non-degenerated zeros are associated to transverse homoclinic orbits. The last part of Sect. 2 is devoted to gain information on the number of primary homoclinic orbits after perturbation. Since the Melnikov potential L is F0 -invariant, it can be defined on the reduced separatrix 3∗ := 3/F0 , which is the quotient of the

216


separatrix by the unperturbed map. The reduced separatrix is a compact manifold without boundary, provided that the unperturbed invariant manifolds are completely doubled, i.e., W0u = W0s and W0u,s \ {p∞ } is a submanifold of P and not only an immersed submanifold of P. This is equivalent to require that the separatrix is 3 = W0u,s \ {p∞ }. Several dynamical consequences of this fact can be pointed out using topological tools. In particular, Morse theory gives lower bounds on the number of primary transverse homoclinic orbits, under conditions of generic position: in Theorem 2.2 it is stated that the number of primary homoclinic orbits is at least 4. Moreover, if the maps Fε have a common symmetry I : P → P (Fε ◦I = I ◦Fε , and Fε (p∞ ) = I(p∞ ) = p∞ ) such that the one-form φ is preserved by I: I ∗ φ = φ, then the Melnikov potential is I-invariant (see Lemma 2.6). Consequently, it can be considered as a function over the quotient manifold 3∗I := 3/{F0 , I}. If, in addition, I is an involution (I 2 = Id) such that DI(p∞ ) = −Id, the family {Fε } will be called antisymmetric. In this case, in Theorem 2.2 it is stated that the number of primary homoclinic orbits is at least 4n and that they appear coupled in (anti)symmetric pairs: Oε is a primary homoclinic orbit if and only if I(Oε ) also is. It is worth mentioning that any family of odd maps Fε : R2n → R2n (with the standard symplectic structure) is antisymmetric. To prove Theorem 2.2, it is enough to check that the sum of the Z2 -Betti numbers of 3∗ and 3∗I are 4 and 2n, respectively. This is accomplished by computing the Z2 homology of 3∗ and 3∗I . Both lower bounds are optimal, as it is shown in several perturbations of maps with a central symmetry, so that the unperturbed invariant manifolds are completely doubled. It is important to notice that the invariant manifolds of a product of uncoupled planar maps with double loops are not completely doubled, see Remark 2.3, and hence, the topological results do not hold in this case. Indeed, the number of primary homoclinic orbits may be rather different under perturbation; for instance, it is possible to construct explicitly perturbations with an infinite number of primary homoclinic orbits, all of them being transverse. The study of this kind of phenomena is currently being researched. In Sect. 3, as a first example, we consider the family of twist maps on R2n : ÿ Fε (x, y) =

y, −x +

!

2µy 1 + |y|

2

+ ε∇V (y) ,

µ > 1, ε ∈ R,

with V : Rn → R determined by imposing V (0) = 0. The map above is a perturbation of the McLachlan map [McL94], which is a multi-dimensional generalization of the McMillan map [McM71], which in its turn is a particular case of the standard-like Suris integrable maps [Sur89]. The McLachlan map has a central symmetry that makes the dynamics over the separatrix essentially one-dimensional. This is the key fact that allow us to perform a complete analysis, since the natural parametrizations (3.2) can be introduced. If the potential V is entire and not identically zero, in Theorem 3.1 it is proved that the manifolds Wεu,s of the map Fε split, for 0 < |ε| 1. This result is obtained simply by checking that the Melnikov potential is not constant. Moreover, if V is a polynomial, the Melnikov potential can be computed explicitly. In particular, if V is a quadratic form: V (y) = y > By for some symmetric n×n matrix B, in Proposition 3.1 it is stated that under generic conditions on B (det(B) 6= 0 and B does not have multiple eigenvalues), the perturbed invariant manifolds are transverse along exactly 4n primary homoclinic orbits.


217

If V is linear: V (y) = b> y for some vector b ∈ Rn \ {0}, in Proposition 3.2 it is stated that the perturbed invariant manifolds are transverse along exactly 4 primary homoclinic orbits. The difference between both kinds of perturbations is that quadratic potentials V give rise to odd maps, whereas linear ones do not. Moreover, propositions 3.1 and 3.2 give the unperturbed homoclinic orbits that survive and the first order (in ε) of the homoclinic areas between the different primary homoclinic orbits. The weakly hyperbolic case 0 < h 1, cosh(h) := µ, is also studied for the case of a quadratic potential V , and asymptotic expressions for the homoclinic areas are given at the end of Sect. 3. It turns out that, for some distinguished pairs, interlaced in the same way as in the case of 1 degree of freedom, the homoclinic area predicted by the Melnikov potential is exponentially small with respect to the hyperbolicity parameter h. Of course, this does not prove that the splitting size is exponentially small in singular cases, i.e., when ε and h tend simultaneously to zero. The last section is devoted to the study of the Hamiltonian maps arising from timeperiodic perturbations of an (undamped) magnetized spherical pendulum. This model was introduced by J. Gruendler [Gru85] as a first example of application of the Melnikov method for high-dimensional (continuous) systems. The Hamiltonians considered have the form [Gru85] Hε : R2n × R → R,

Hε (x, y, t) = v 2 /2 + (r4 − r2 )/2 + εV (x, t/h),

h > 0, ε ∈ R,

where v = |y|, r = |x|, and V = V (x, ϕ) is 1-periodic in ϕ. We determine V by imposing V (0, ϕ) ≡ 0. Note that small values of h correspond to a quick forcing. General perturbations, and not only symplectic ones, are considered in [Gru85]. As a consequence, the homoclinic orbits are given in the general case by non-degenerate zeros of a vector-valued Melnikov function, instead of non-degenerate critical points of the real-valued Melnikov potential. We have computed the Melnikov potential for the Hamiltonian perturbations studied in [Gru85], and have verified that his Melnikov function is the gradient of our Melnikov potential. Most of the results stated above for the McLachlan map also hold for this Hamiltonian map. There is, however, a significant difference. One cannot deduce a priori that the Melnikov potential is not identically constant without computing it. This has to do with the fact that the Melnikov potential is simply periodic and regular for the polynomial perturbations considered, in contrast with the complex period and singularities that the Melnikov potential has for the entire perturbations of the McLachlan map. To finish the account of results, let us point out that a similar Melnikov analysis for perturbed ellipsoidal billiards has not been included for the sake of brevity and will appear elsewhere. Such billiards are a high-dimensional version of perturbed elliptic billiard tables, which have already been studied in several papers [LT93, Tab94, DR96, Lom96a]. After this research was complete, we became aware of some recent papers [Lom97, Lom96b] of H. Lomel´ı for twist maps on the annulus An = T ∗ Tn = Tn × Rn that resemble our method. However, they do not contain explicit computations (i.e., in terms of known functions) of the Melnikov potential, since complex variable methods are not used. Besides, in those papers it is assumed that the separatrix is globally horizontal, a condition that does not hold for homoclinics in R2n , since the separatrix must fold to go back to the fixed point. Other related papers are [Sun96, BGK95], but their approach is rather different, since they deal, like [Gru85], with the general case, with no symplectic structure, and therefore a vector-valued Melnikov function is needed. This makes an important difference not

218


only from a computational point of view (there are not explicit (analytic) computations in these works), but also from a theoretical point of view, since Morse theory cannot be applied in the general situation. We also want to mention the work [BF96], where perturbations of n-dimensional maps having homo-heteroclinic connections to compact normally hyperbolic invariant manifolds are considered.

2. Main Results For the sake of simplicity, we will assume that the objects here considered are smooth. For a general background on symplectic geometry we refer to [Arn76, GS77, AM78]. The basic properties of immersed submanifolds can be found in [GG73, pages 6–11]. 2.1. Exact objects. A 2n-dimensional manifold P together with an exact non-degenerate two-form ω over it, is called an exact symplectic manifold. Then, ω = − dφ for some one-form φ, usually called Liouville form, symplectic potential or action H form.H A map F : P → P is called exact symplectic (or simply, exact) if γ φ = F γ φ for all closed path γ ⊂ P or, equivalently, if F ∗ φ − φ = dS for some function S : P → R, called generating function of F . A n-dimensional submanifold 3H ⊂ P is called an exact Lagrangian submanifold (or simply, an exact submanifold) if γ φ = 0 for all closed path γ ⊂ 3 or, equivalently, if ı∗3 φ = dL for some function L : 3 → R, called generating function of 3. Here ı3 : 3 ,→ P stands for the inclusion map. Unfortunately, the invariant manifolds that we will deal with are not submanifolds, but just immersed submanifolds. Thus, the introduction of some technicalities seems unavoidable in order to give a rigorous exposition of the subject, and more precisely, to introduce the notion of separatrix, where the distance between the perturbed invariant manifolds will be measured. Given a manifold N , we recall that a map g : N → P is called an immersion when its differential dg(z) has maximal rank at any point z ∈ N . If g is one-to-one onto its image W = g(N ), there is a natural way to make W a smooth manifold: the topology on W is the one which makes g a homeomorphism and the charts on W are the pull-backs via g −1 of the charts on N . The manifold W constructed in this way is called an immersed submanifold of P and its dimension is equal to the dimension of N . It is important to notice that the topology of the immersed manifold need not be the same as the induced one via the inclusion W ⊂ P or, in other words, that W need not be a submanifold of P in the usual sense. Fig. 1 shows an example of a double loop W = g(R) to p∞ = limz→±∞ g(z) for an immersion g : R → R2 . At p∞ , the induced topology on W via the inclusion W ⊂ R2 is not the same as the induced one via g. Both g(B), for all open bounded intervals B ⊂ R, and W \ {p∞ } are submanifolds, but not W. This situation is a particular case of the following elementary result [GG73, p. 11]. Lemma 2.1. Let g : N → P be a one-to-one immersion and set W = g(N ). (i) Let B be an open subset of N with compact closure. Then, g|B : B → P is an embedding, that is, a homeomorphism onto its image g(B). Thus, g(B) is a submanifold of P, which will be called an embedded disk in W. (ii) Let Σ ⊂ W be the set of points where the two topologies on W (the one induced by the inclusion W ⊂ P and the one that makes g a homeomorphism) differ. Then,


219

R

W R 2

g

1 = g(0)

p

Fig. 1. g = (g1 , g2 ) : R → R2 , where g1 (z) = 23 z/(1 + z 2 ), g2 (z) = g1 (2z)

3 = W \ Σ is a submanifold of P. Indeed, W is not a submanifold of P just at the points of Σ. For the sake of clearness, submanifolds and immersed submanifolds will be denoted by different letters, namely 3 and W, respectively. For immersed submanifolds W, the map ıW : W → P stands for the inclusion map, as before. It should be noted that ıW is smooth, even when W is not a submanifold of P, because of the differential structure given to W. Moreover, if γ ⊂ P is a (closed) path, we will say that γ is a (closed) path in the immersed submanifold W if and only if γ is contained in W and it is continuous in the topology of W. For example, if γ is one loop of Fig. 1, it is a closed path in R2 but not in W. With these notations and definitions, we are naturally led to define exact immersed submanifolds in the same way as exact H submanifolds. A n-dimensional immersed submanifold W ⊂ P is called exact if γ φ = 0 for all closed path γ in W or, equivalently, if ı∗W φ = dL for some function L : W → R, called a generating function of W. The symplectic potential φ is determined except for the addition of a closed zeroform, and the generating functions of maps or (immersed) submanifolds are determined Rq except for an additive constant. Henceforth, the symbol W p φ, with p, q ∈ W, will denote the integral of φ along an arbitrary path from p to q in W. It only makes sense for an exact immersed submanifold W, since then the integral does not depend on the path. The difference of values of L can be expressed as an integral of this kind: Z q Z q dL = φ, ∀p, q ∈ W. (2.1) L(q) − L(p) = p

W p

Lemma 2.2. Let W be a connected exact immersed submanifold of P, invariant under 1 an exact map F . Let L and S be their respective generating functions. Then, S(p) + constant = L(F (p)) − L(p),

∀p ∈ W.

(2.2)

220


Moreover, if p∞ ∈ W is a fixed point of F , the constant is −S(p∞ ). Proof. From dS = F ∗ φ − φ and dL = ı∗W φ we get ∗ d S|W = ı∗W dS = F|W dL − dL = d L ◦ F|W − L , where S|W = S ◦ ıW and F|W = (ıW )−1 ◦ F ◦ ıW are the restrictions of S and F to W. Thus, S − L ◦ F + L is constant over W by connectedness and (2.2) is proved. To end the proof we only need to evaluate Eq. (2.2) at p = p∞ .

Fig. 2. The invariant manifolds W u and W s are different as smooth manifolds, and are not submanifolds of R2 . There exist no paths γ u,s in W u,s from p to p0 such that γ u = γ s

Let p∞ ∈ P be a hyperbolic fixed point of F . The point p∞ lies in the intersection of the n-dimensional unstable and stable invariant manifolds of the map F associated to p∞ : W s := p ∈ P : lim F k (p) = p∞ . W u := p ∈ P : lim F k (p) = p∞ , k→−∞

k→+∞

The manifolds W u,s need not be submanifolds of P, but just connected immersed submanifolds, see Fig. 2. In fact, W u,s = g u,s (Rn ) for some one-to-one immersions g u,s : Rn → P, such that g u,s (0) = p∞ and dg u,s (0)[Rn ] is the tangent space to W u,s at p∞ [PM82, II §6]. Since F is exact, they submanifolds: if γ H H are exact immersed H is a closed path in W u (W s ), then γ φ = F k γ φ −→ p∞ φ = 0, when k → −∞ (k → +∞). It should be noted that if γ ⊂ P is closed and contained in W u (resp. W s ), but it is not a path in W u (resp. W s ), the above argument fails. (For instance, if γ is one loop of Fig. 2.) We denote by Lu,s the generating functions of W u,s and we determine the generating functions S, Lu,s by imposing S(p∞ ) = Lu,s (p∞ ) = 0. The next proposition gives a nice interpretation of the generating functions of the stable and unstable invariant manifolds in terms of the generating function of the map. k u,s Proposition 2.1. Given pu,s ∈ W u,s , let us denote pu,s k = F (p ), for k ∈ Z. Then,

Lu (pu ) =

X k 0, we may choose σ, τ ∈ Sq , σ(q) = τ (q) = q such that [τ ◦ σ −1 ] = m. Comparing the coefficients of h(σ, τ ) from above we have: X fq (m00 )a(m00 ) = fq−1 (m − ξ1 ), (a) N fq (m) + m00

where a(m00 ) is the number of (k1 q)τ (k1 6= q) such that [(k1 q)τ ◦ σ −1 ] = m00 . The explicit form of a(m00 ) is not important to us. It is important, however, that a(m00 ) is a finite number which is independent of N . Recall that every element in Sq can be written uniquely as a product of disjoint cycles. Since τ σ −1 = (q)(k1 ...x)..., where the dots indicate the cycles which do not contain any elements that have already appeared, we have (k1 q) ◦ τ σ −1 = (qk1 ...x)... . If (k1 ...x) is a k − 1, (k > 1)-cycle, then k1 appears in a k-cycle in (k1 q) ◦ τ ◦ σ −1 . Meanwhile a 1-cycle (q) and a (k − 1)-cycle (k1 ...x) disappear in (k1 q) ◦ τ σ −1 . So m00 = m − ξ1 − ξk−1 + ξk . We define P fq (m) = N −2q+ i mi f¯q (m). Equation (a) can now be written as X f¯q (m) + N −1 a(m00 )f¯q (m − ξ1 − ξk−1 + ξk ) = f¯q−1 (m − ξ1 ).

(a0 )

2≤k≤q

If m1 = ... = ml−1 = 0, ml > 0, l > 1, choose σ, τ ∈ Sq , τ (q) = q such that [τ ◦ σ −1 ] = m, and the cycle in τ ◦ σ −1 which contains q is a l-cycle. By comparing the coefficients of h(σ, τ ) above we have: X b(m00 )fq (m00 ) = 0, (b) N fq (m) + m00

where b(m00 ) is some finite number independent of N . We would like to express m00 in terms of m. There are two cases to consider. If k1 is in a k-cycle different from the l-cycle containing q in τ ◦ σ −1 , then (k1 q) ◦ τ ◦ σ −1 = (k1 q) ◦ (k1 ...x)(q...y)... = (k1 ...xq...y). It is clear that

m00 = m − ξl − ξk + ξk+l .

If k1 , q are in the same l-cycle of τ ◦ σ −1 , it follows from (k1 q)(k1 ...q) = (q)(k1 ...),(k1 q)(q...k1 ) = (k1 )(q...) (k1 q)(k1 ...xq...y) = (k1 ...x)(q...y) that m00 = m − ξl + ξk + ξl−k for some 1 ≤ k ≤ l − 1. We can now write (b) in terms of f¯ as follows: X X bk f¯q (m−ξl −ξk +ξk+l )+ bk f¯q (m−ξl +ξk +ξl−k ) = 0. (b0 ) f¯q (m)+N −1 l≤k≤q−l

1≤k≤l−1

Equations (a’) and (b’) can be written as linear equations:

296

F. Xu

X

Amp f¯q (p) = Bm ,

p∈Cq

where Bm is a linear combination of f¯q−1 ’s with known coefficients and (Amp ) is a |Cq | × |Cq | matrix with coefficients which are polynomials in N −1 . We claim det(Amp ) is nonzero if we set N −1 = 0. In fact this statement is equivalent to saying that for any given f¯q−1 (m − ξ1 ), the Eqs. (a’) and (b’) have a unique solution if we set N −1 = 0. When N −1 = 0, (a’) and (b’) become: f¯q (m) = f¯q−1 (m − ξ1 ), X f¯q (m) = − bk f¯q (m − ξl + ξk + ξl−k ),

(c) (d)

1≤k≤l−1

where (d) is valid for m1 = ...ml−1 = 0, ml > 0, l > 1. By (c) we can determine f¯q (m) uniquely for any m1 > 0. By (d) we can determine f¯q (m) uniquely when m1 = ...ml−1 = 0, ml > 0, l > 1 from fq (m − ξl + ξk + ξl−k ), 1 ≤ k ≤ l − 1. It follows that (c) and (d) uniquely determine f¯q (m) for any given f¯q−1 (m − ξ1 ). We conclude that det(Amp ) is nonzero if N is sufficiently large , and we can inductively determine f¯q (m) by using (a’) and (b’) starting from f¯1 (1) = 1. It is clear that limN →+∞ f¯q (m) exists for all q, m. Slightly abusing our notation, we shall also use Lq (m) to denote the following: X δi1 i0σ(1) · · · δiq i0σ(q) · δj1 jτ0 (1) · · · δjq jτ0 (q) , Lq (m) = σ,τ ∈Sq [σ −1 ◦τ ]=m

where the δ’s are Kronecker deltas and we have suppressed the dependence on the indices i, i0 , j, j 0 ’s. We have therefore proved the following lemma: Lemma 2.2. Let N be sufficiently large, then X −2q+Pq m i ¯ i=1 fq (m)Lq (m). Iq,q = N m

Here f¯q (m) is independent of i1 , ...iq , i01 , ...i0q , j1 , ...jq and j10 , ...jq0 . Moreover limN →+∞ f¯q (m) exists for all q, m. Remark 1. It is mentioned in [2] that analogous results can be proved for SP (N ) by essentially the same arguments used for SO(N ). R R R We note that dU N1 T r(U m ) = dU N1 T r((eiθ U )m ) = eimθ · dU N1 T r(U m ) for R any eiθ ∈ S 1 . Hence dU N1 T r(U m ) = δm,0 . We are now ready to prove the following: P Lemma 2.3. Let p1 , · · · , pt and q1 , · · · , qs be positive integers such that 1≤i≤t pi = P 1≤j≤s qj = q. Then: Z 1 1 1 1 1 dU T r (U p1 ) · T r (U p2 ) · · · T r (U pt ) · T r U −q1 · · · T r U −qs N N N N N = N −2 g(p1 , · · · pt ; q1 , · · · qs ), where limN →∞ g(p1 , · · · pt ; q1 , · · · qs ) exists and is bounded.

Random Matrix Model from 2D Yang-Mills Theory

297

Proof. We can expand the LHS of the above lemma using the definition of trace. Each term in the expansion consists of q U 0 s and q U ∗ ’s. By using Lemma 2.2, each such term can be further expanded as sum of the following terms: Pq N −2q+ i=1 mi f¯q (m)Lq (m), where Lq (m) is defined as in Sect. 2.2. The index structure of the Kronecker deltas in Lq (m) can be conveniently represented by a diagram. Each such diagram g consists of t white vertices of valence p1 , · · · pt and s black vertices of valence q1 , · · · qs , and the edges from each white vertex is connected to a black vertex. The edges of each vertex are labeled by matrix indices consistent with the structure of the vertex as in the proof of Lemma 2.1, and there is an index (the index takes value from 1 to N ) running through every circle in the diagram. The way that edges are connected is specified by the permutations σ and τ in Lq (m). The following is an example of such a diagram in the case t = 1, s = 1, p1 = q1 = 2, σ = (12), τ = identity with one white vertex and one black vertex.

In the above diagram the conditions j1 = i2 , j2 = i1 , j10 = i02 , j10 = i02 are required by the consistency of the vertex structure. We note that such a diagram is in general not a fat graph. The number of edges in the diagram is 2q. We denote by F the number of circles in the diagram. Since each edge connects two different vertices, it follows that each circle in the diagram contains at least two edges. Therefore F ≤ q since different circles pass through different edges. Notice each vertex contribute a factor N −1 , and each circle contribute a factor of N . By Lemma 2.2, the highest power of N in such a term is N −q−s−t+F . Since F ≤ q, it follows that the highest power of N on the expansion of the left hand side of Lemma 2.3 is −2. Therefore the LHS of the above lemma is equal to N −2 g(p1 , . . . , ps ; q1 , . . . , qt ). That limN →+∞ g(p1 , . . . , ps ; q1 , . . . , qt ) exists and is bounded follows from the fact that g(p1 , . . . , ps ; q1 , . . . , qt ) is a finite polynomial in N −1 with fixed coefficients which depend on (p1 , . . . , ps ; q1 , . . . , qt ). Remark 2. By using analogous results Lemma 2.2 for SO(N ), SP (N ) cases (see Remark 1), one can prove the factorization hypothesis for SO(N ), SP (N ) cases. Remark 3. For much more stronger results than Lemma 2.3, see [11]. However, lemma 2.3 will be sufficient to prove asymptotic freeness in the Haar measure case in Sect. 3 (see Remark 8 in Sect. 3). The next lemma will play an important role in Subsect. 3.2. We begin by introducing some additional notations. Let us suppose that we are given σ ∈ Sq with [σ] = (σ1 , . . . , σq ) the conjugacy classes of σ (see Subsect. 2.0). We may represent σ as a unique product of different cycles:

298

F. Xu

σ=

q Y σk Y

aj,k

k=1 j=1

with each aj,k a k-cycle. For a set of N ×N matrices M1 , ...Mq and a k-cycle a = (i1 ...ik ) with 1 ≤ i1 , i2 , ...ik ≤ q, we define hM ia = hMi1 ...Mik i and hM iσ =

q Y σk Y

hM iaj,k .

k=1 j=1

For an example, if q = 3 and σ = (1)(23), then hM iσ = hM1 ihM2 M3 i. Now we are ready to prove the following lemma: Lemma 2.4. Let A1 , A2 , ...Aq and B1 , B2 , ...Bq be N × N matrices, and let dU be the normalized Haar measure on U (N ) as in Subsect. 2.2. Assume N is large enough, then Z dU hU B1 U ∗ A1 U B2 U ∗ A2 · · · U Bq U ∗ Aq i X X f¯q (m) N −2g(Gσ,τ ) · hBiτ −1 · hAiσ◦Z , = σ,τ ∈Sq ,[σ −1 ◦τ ]=m

m

where g(Gσ,τ ) is as in Lemma 2.0 and f¯q (m) is as in Lemma 2.2. Proof. By definition, hU B1 U ∗ A1 U B2 U ∗ A2 · · · U Bq U ∗ Aq i 1 X Ui1 j1 (B1 )j1 j10 Uj∗0 i0 (A1 )i01 i2 Ui2 j2 (B2 )j2 j 0 362 Uj∗0 i0 (A2 )i02 i3 · · · (Aq )i0q i1 , = 1 1 2 2 N 0 0 i,j,i ,j

P where we have used i,j,i0 ,j 0 to denote a summation over all i1 , ...iq , j1 , ...jq , i01 , ...i0q , j10 , ...jq0 . After integration and using Lemma 2.2, we have: Z dU hU B1 U ∗ A1 U B2 U ∗ A2 · · · U Bq U ∗ Aq i X −2q−1+P m X i ¯ i fq (m) = N Lq (m)(B1 )j1 j10 · · · (Aq )i0q i1 m

=

X m

N −2q−1+

i,j,i0 ,j 0

P i

mi

f¯q (m)·

X

i,j,i0 ,j 0 ,σ,τ ∈Sq ,[σ −1 ◦τ ]=m

Let us compute

δi1 ,i0σ(1) · · · δjq jτ0 (q) (B1 )j1 j10 · · · (Aq )i0q i1 .


N −2q−1+

P i

X

mi

i,j,i0 ,j 0

299

δi1 i0σ(1) · · · δjq jτ0 (q) (B1 )j1 j10 · · · (Aq )i0q i1 −

By using the notation introduced in this section, we have: P X δi1 i0σ(1) · · · δjq jτ0 (q) (B1 )j1 j10 · · · (Aq )i0q i1 N −2q−1+ i mi =N

−2q−1+

i,j,i0 ,j 0

P

P

i

mi +

i

P

τi +

i

σi

hBiτ −1 hAiσ◦Z .

By using Lemma 2.0 , the proof of the lemma immediately follows.

c2 (α)A P 2.4. Heat kernel case. Let k(U ; A) = dα χα (U )e− 2N be the heat kernel on U (N ), where dα is the dimension of the irreducible representation α of U (N ) and C2 (α) is the the measure on U (N ). We will use casmir of α, A > 0. Denote by d2 A = k(U ; A)dU R hW i to denote N1 T rW and hhW ii to denote d2 AhW i.

Lemma 2.5. Let k0 , k1 , . . . , kn be positive integers . Then (1) hhU k1 ihU k2 i · · · hU kn ii = hhU k1 ii · hhU k2 ii · · · hhU kn ii + f (k1 , . . . , kn ; N, A), where f (k1 , . . . kn ; N, A) converges uniformly to 0 with respect to A as N goes to +∞. (2) hhU k0 ii converges uniformly to Pk0 (A) with respect to A as N goes to +∞, where Pk0 (A) is uniquely determined by the following recursion relation: kX 0 −2 k0 d Pk (A) = Pk (A) + (k0 − ` − 1)P`+1 (A)Pk0 −`−1 (A) − dA 0 2 0 `=0

P0 (A) = 1, Pk0 (0) = 1 for all k0 ≥ 0, P1 (A) = e

−A/2

(4)

.

Proof. Let λα be a basis of N × N Hermitian matrices such that T rλα λβ = 21 δ αβ , it follows X δad δbc . (5) (λα )ab (λα )cd = 2 α For an arbitrary function f (U ) on U (N ), the Laplacian operator 1 is defined by    X X ∂2 β    x λ U exp i (1f )(U ) = β . ∂x2 x=0 α

(6)

β

d 1 d By using the heat equation − dA k(U ; A) = 2N 1k(U ; A), one has − dA hhU k1 i · · · 1 hU ks ii = 2N h1(hU k1 i · · · hU ks i)i. Let us first calculate 1hU k1 i. By using (5) and (6), one has

1hU k1 i = k1 N hU k1 i +

kX 1 −2 `=0

Hence

(k1 − ` − 1)N hU `+1 ihU k1 −`−1 i.

(7)

300

F. Xu s X 1 hU k1 i · · · hU ks i = hU k1 i · · · 1hU ki i · · · hU ks i i=1

X

+

hU i · · · hU ki λα i · · · hU kj λα i · · · hU ks i. k1

1≤i6=j≤s,α

We note that

X

hU ki λα i · hU kj λα i

α

= =

1 N2

X

(U ki )i1 i2 (λα )i2 i1 · (U kj )i3 i4 (λα )i4 i3

(i1 ,i2 ,i3 ,i4 ,α)

1 X ki (U )i1 i2 (U kj )i2 i1 2N 2 i ,i 1

(8)

2

1 hU ki +kj i. = 2N Therefore s 1 X d hhU k1 i · · · hU ks ii = hhU k1 i · · · (1hU ki i) · · · hU ks ii dA 2N i=1 X 1 k1 + hhU i · · · hU ki +kj i · · · hU ks ii 2N 1≤i6=j≤s ÿ * ! kX s i −2 X ki k i k1 `+1 ki −`−1 hU i + hU i · · · (ki − ` − 1)hU i · hU i ··· = 2 i=1 `=0 X 1 hhU k1 i · · · hU ki +kj i · · · hU ks ii. hU ks i + 2N

−

Ps

(9)

1≤i6=j≤s

For fixed i=1 ki = m, (9) givesPa system of first order ordinary differential equations s in variables (hhU k1 i · · · hU ks ii; i=1 ki = m is fixed) which we denote by vector µ(A). Assume µ(A) has `(m) components. Then (9) can be written as dµ = 0µ, dA where 0 is a `(m) × `(m) matrix whose entries are finite polynomials of N −1 . It follows limn→+∞ 0 = 01 . We notice that µ(0) = (1, 1, . . . , 1), and µ(A) = e0A µ(0). Therefore µ(A) converges uniformly with respect to A to µ1 (A) = e01 A µ(0). Let us denote the components of µ1 (A) by hhU k1 i · · · hU ks ii∞ . It follows from (9) that: d hhU k1 i · · · hU ks ii∞ dA * ÿ ! kX s i −2 X k i hU ki i + hU k1 i · · · = (ki − ` − 1)hU `+1 ihU ki −`−1 i 2 i=1 `=0 ks · · · hU i .

−

(10)


301

Let µ2 (A) be a vector with components Pk1 (A) · · · Pks (A). It follows from (4) that d Pk (A) · · · Pks (A) dA 1 ÿ ! kX s i −2 X ki Pk (A) + Pk1 (A) · · · (ki − ` − 1)P`+1 (A) · Pki −`−1 (A) . = 2 i

−

i=1

`=0

i µ1 (A), µ2 (A) satisfies the same equation dµ dA = 01 µi . Notice µ2 (0) = (1, . . . , 1) = µ1 (0). Hence µ1 (A) = µ2 (A). We have thus proved that for any non-negative integers k1 , . . . , ks hhU k1 i · · · hU ks ii converges uniformly (with respect to A) to BPk1 (A) · · · Pks (A). It is easy to see that this is equivalent to (1) and (2) of the lemma.

Remark 4. The differential equation in part (2) of the lemma above appeared in [4]. One can prove similar statements (with different Pk (A)’s) for SO(N ), SP (2N ) cases by the same method.

3. Basic concepts We introduce some basic concepts about free random variables. The reader is encouraged to consult the excellent exposition [1]. Definition. Let (Gi )i∈I be a family of sets. An alternating word W in (Gi )i∈I is a monomial g1 g2 ...gn with gj ∈ Gij and i1 6= i2 6= · · · 6= in . The gi ’s are called the components of W . Definition. Let (G, φ) be a noncommutative probability space. A family, (Gi )i∈I of unital subalgebras of G is called free if φ(W ) = 0 for any alternating word g1 g2 ...gn in (Gi )i∈I with φ(gi ) = 0 for all 1 ≤ i ≤ n. We will also need the definition of limit distribution and asymptotic freeness. Denote by ChXi | i ∈ Ii the set of non-commutative polynomials. Definition. For each n ∈ N , let (Ti(n) )i∈I be a family of noncommutative random variables in a non-commutative probability space (Gn , φn ). Then the sequence of joint distributions µ(Ti(n) )i∈I converges as n → +∞, if there exists a distribution µ such that µ(Ti(n) )i∈I (p) → µ(p) as n → +∞ for every non-commutative polynomial p in (Ti(n) )i∈I . We call µ the limit distribution. Definition. Let I = ∪j∈J Ij be a partition of I. A sequence of families ({Ti(n) | i ∈ Ij })j∈J of sets of noncommutative random variables is said to be asymptotically free as n → +∞ if it converges and if ({Xi | i ∈ Ij })j∈J is a free family of sets of random variables in (ChXi | i ∈ Ii, µ). We will be interested in the following S-transform. Let σ ∗ = {µ : C[x] → C | µ linear, µ(1) = 1, µ(x) 6= 0}. For µ ∈ σ ∗ , consider the formal power series ψµ , χµ , and Sµ such that

302

F. Xu

ψµ (z) =

∞ X

µ(xk )z k ,

χµ (ψµ (z)) = ψµ (χµ (z)) = z,

k=1

Sµ (z) = χµ (z)z −1 (1 + z). Sµ is called the S-transform of µ. S-transform has the property that for free multiplicative convolution µ ν (for definition, see Chapter 3 of [1]), Sµν = Sµ Sν . A It is proved in [10] that Sµa (z) = e 2 (1+2z) for A > 0 is the S-transform of an infinite divisible measure µA on the unit circle. Such a measure is an analogue of Gaussian measure on the circle. Notice µA (X −n ) = µA (X n ) for n ≥ 0. Let us calculate the A A z e 2 (1+2z) . Since ψµ (z) = moments of µA . (See [3].) From SµA (z) = e 2 (1+2z) , χA (z) = 1+z P∞ k k k=1 µ(x )z , it follows: Z 1 k ψµ (e−iθ )einθ dθ. µA (X ) = 2π We have, with χA (z) = e−iθ , I dz z[χA (z)]−(n+1) χ0A (z) µA (X k ) = 2πi I 1 dz [χA (z)]−n = n 2πi n I A 1 1 dz 1+ = e−n· 2 (1+2z) . n 2πi z In particular, µ0 (X k ) = 1. P∞ To find a recursion relation between µA (X k ),let us define R(eiθ , A) = n=1 µA (X n ) ∂F e−i(n+1)θ . Then F (θ, A) = i eiθ R(eiθ , A) − 21 and F (θ, A) satisfies ∂F ∂A + F ∂θ = 0 (see p. 44 of [3]). Expanding this equation in powers of eiθ , we obtain the following recursion relations: X n dµA (X n ) =− mµA (X m )µA (X n−m ) + µA (X n ). dA 2 n−1

m=1

n

We notice that µ0 (X ) = 1. Compared to the equations and initial conditions for Pn (A) in Lemma 2.4, we have Pn (A) = µA (X n ) for n ≥ 0. But we also have hhU −n ii = hhU n ii. It follows from Lemma 2.4 that the limit distribution of {(U )n } is µA . We record this result in the following corollary. Corollary 3.1. For A > 0, hh(U )n ii converges to µA (X n ) for all integer n, where µA A is the infinite divisible measure on the unit circle with S-transform SµA (z) = e 2 (1+2z) . 3.1. Relative angular integrals. Let us first introduce some notation. We let I = {0, 1, 2, ..., n} with n a non-negative integer. Let us suppose that we are given two sets of N × N matrices M1 , M2 , ...Ms and P1 , P2 , ...Pn and a set of N × N unitary matrices U1 , U2 , ...Un . We define Gi to be the linear span of the following set: {Ui (Pi )j Ui∗ , j ≥ 0, j ∈ Z}, where i = 1, 2, ...n. We let G0 to be the set of all noncommutative polynomials in M1 , M2 , ...Ms . By slight abuse of notation, we enumerate the elements of Gi by the


303

same letter g(i, j) with 0 ≤ i ≤ n, 0 ≤ j in the following, even though the g(i, j) may have different properties. It should be noted that they always belong to Gi , and the appropriate interpretation should be evident from the context. Recall from Sect. 2 that hg(i, j)i = N1 T r(g(i, j)) and they are independent of Ui when i > 0. Lemma 3.2. (a) Let W be an alternating word in (Gi )i∈I . Then Z dU1 dU2 ...dUn hW i is a finite polynomial Q(W ) in hg(i, j)i, i.e., Q(W ) =

X

Qa,b

a,b

bi n Y Y

hg(i, j)i,

i=0 j=ai

where only a finite number of Qa,b ’s are nonzero (we have suppressed the subscript of a, b). Moreover, Qa,b have bounded limits as N → +∞. (b) Assume hXi i has a bounded limit as N → +∞ for all Xi ∈ Gi . If each component g of the alternating word W has the property that limN →+∞ hgi = 0, then lim Q(W ) = 0.

N →+∞

Proof. Let us prove both parts (a) and (b) by induction on n. If n = 0, Z dU1 ...dUn hg(0, j)i = hg(0, j)i, the lemma is trivial. Assume the lemma is true for all n < k (k ≥ 1). Let us prove it for the case of n = k. By using the cyclicity of the trace, we may, without loss of generality, assume hW i takes either one of the following forms:

or

hU1 g(1, 1)U1∗ A1 U1 g(1, 2)U1∗ A2 · · · U1 g(1, q)U1∗ Aq i

(12)

hU1 g(1, 1)U1∗ A1 U1 g(1, 2)U1∗ A2 · · · U1 g(1, q)U1∗ i,

(13)

where A1 , ...Aq are alternating words in Gi with i = 0, 2, 3, ...k. Let us consider the case that hW i is of the form (12). By Lemma 2.4 ( with Bj replaced by g(1, j)), we have: Z dU1 hU1 g(1, 1)U1∗ A1 U1 g(1, 2)U1∗ A2 · · · U1 g(1, q)U1∗ Aq i X X (14) f¯q (m) N −2g(Gσ,τ ) · hgiτ · hAiσ◦Z . = m

σ,τ ∈Sq ,[σ −1 ◦τ ]=m

Since g(Gσ,τ ) ≥ 0 and limN →+∞ f¯q (m) has bounded limit by Lemma 2.2, it follows that lim N −2g(Gσ,τ ) f¯q (m) N →+∞

has a bounded limit. After integration with respect to the remaining U2 , ...Uk and use of the induction hypothesis , we conclude that for n = k and hW i is of the form (12), part

304

F. Xu

(a) of the lemma is true. The case when hW i takes the form (13) follows from the case of (12) if we set Aq = identity. Hence by induction, part (a) is proved. To prove part (b) in the case when hW i is of the form (12), assume the condition of part (b) is satisfied. As N → +∞, it is clear from (14) that the only possible nonzero limit comes from Gσ,τ such that g(Gσ,τ ) = 0, namely the fat graph which is planar. Let G be any such planar graph and let a, b be two points on G such that a, b are connected by an edge outside the unit circle and clockwise on the circle the number of vertices between a and b is minimal. If a = b, then a is a vertex which is connected to itself. If a 6= b, since a, b are not adjacent by the construction of G and G is planar, we conclude that between a and b there is always one vertex on the unit circle which is not connected to any other point except to itself by edges outside the circle. Hence we conclude that there is always a vertex on G which is only connected to itself by an edge outside the unit circle. If this vertex is g(1, j), the corresponding expression hgiτ −1 contains a term hg(1, j)i which goes to 0 as N → +∞ by assumption. If this vertex is Aj , then hAiσ◦Z contains a term hAj i which after integration with respect to U2 , ...Uk goes to 0 as N → +∞ by the induction hypothesis. Thus we have proved part (b) when hW i takes the form (12). To prove (b) in the case (13), by using the cyclicity of trace and the fact part (b) is true when hW i takes the form (12), we have Z dU1 ...dUk hU1 g(1, 1)U1∗ · · · Aq−1 U1 g(1, q)U1∗ i lim N →+∞ Z dU1 ...dUk hU1 g(1, q)g(1, 1)U1∗ A2 · · · Aq−1 i = lim N →+∞ Z = lim hg(1, q)g(1, 1)i dU1 ...dUk hA2 U1 g(1, 2)U1∗ · · · Aq−1 i N →+∞ Z = lim hg(1, q)g(1, 1)i dU1 ...dUk hU1 g(1, 2)U1∗ · · · Aq−1 A2 i N →+∞ Z = lim hg(1, q)g(1, 1)ihAq−1 A2 i dU1 ...dUk hU1 g(1, 2)U1∗ · · · N →+∞

U1 g(1, q)U1∗ i = · · · = constant ×

Z lim

N →+∞

dU1 ...dUk hM i,

where M is U1 g(1, j)U1∗ for q odd or Ai for q even. It follows by assumption and the induction hypothesis that Z dU1 ...dUk hW i = 0. lim N →+∞

By induction, part (b) is proved.

Remark 5. One can prove the same results for SO(N ), SP (N ) group cases (see Remark 1) by the same method above. 3.2. Asymptotic freeness. We are now ready to prove the asymptotic freeness results for Gaussian, Haar and heat kernel cases. We will only give the proof for the heat kernel case (see Subsect. 2.3). The other two cases are proved in exactly the same way after one makes the appropriate changes of notation and uses Lemma 2.1 and Lemma 2.2 instead of Lemma 2.5.


305

We will use the notation introduced in Subsect. 3.1 and Subsect. 3.2. For clarity, let us define Z hhW ii = dU1 · · · dUn K(U1 ; A1 ) · · · K(Un ; An )hW i Z

and hhg(i, j)ii =

dUi K(Ui ; Ai )hg(i, j)i.

When i = 0, we understand hhg(0, j)ii as hg(0, j)i. Slightly abusing our notation, we let Gi (i > 0) denote the set of all polynomials in Ui and again we use the symbols g(i, j) to enumerate the elements of Gi . Corollary 3.3. (a) Let us assume that W is an alternating word in G0 , G1 , ...Gn and that for any elements M ∈ G0 , limN →+∞ hM i exists. Then lim hhW ii

N →+∞

exists. (b) Suppose that each component g of W has the property that lim hhgii = 0.

N →+∞

Then

lim hhW ii = 0.

N →+∞

Proof. Since K(U ; A)dU = K(T U T + ; A)d(T U T + ) for any fixed T ∈ U (N ), by [7] we can write N Y dαk 12 (α) · K(αk ; A)dT K(U ; A)dU = CN · k=1

12 (α) =

α i − αj , sin2 2 i<j Y

where CN is a constant and dT is the normalized Haar measure on U (N ). Hence hhW ii

Z

Z

Y

= (CN )P ·

dαi,k 12 (αi,k )K(αi,k ; Ai )

dT1 · · · dTn · hW i.

1≤i≤n,1≤k≤N

By Lemma 3.2, we have Z dT1 · · · dTn · hW i =

X

Qa,b

a,b

Hence

bi n Y Y i=0 j=ai

hhW ii =

X a,b

Qa,b

bi n Y Y h( hg(i, j)i)i. i=0

j=ai

As N → +∞, by (1) of Lemma 2.5 we have:

hg(i, j)i.

306

F. Xu

lim h(

N →+∞

bi Y

hg(i, j)i)i = lim

N →+∞

j=ai

bi Y

hhg(i, j)ii.

j=ai

It follows from (2) of Lemma 2.5 that lim hhg(i, j)ii

N →+∞

exists for i > 0. When i = 0, the above limits exist by our assumption in the lemma. It follows that lim hhW ii N →+∞

exists. The proof of the second part of the corollary follows exactly the same as the proof of part (b) in Lemma 3.2: one simply replaces hg(i, j)i there by hhg(i, j)ii Remark 6. The same method, combined with Remarks 1, 2, 3, 4, 5, can be used to prove the same results as Corollary 3.2 for SO(N ), SP (N ) cases. Remark 7. If µN is any U (N ) invariant measure on N × N Hermitian matrices which satisfies the factorization hypothesis (1) in Sect. 2 and has a limit distribution as N → +∞, then it is easy to see from the proof presented above that Corollary 3.3 is true for such a measure. In particular, Corollary 3.3 is true for Gaussian case and Haar case by Lemma 2.1 and Lemma 2.3. In the Haar case this gives a direct proof of Theorem 4.3.2. in [1]. 3.3. A random matrix model. Following p. 52 of [1], we will consider now families (V (s, n))s∈S in Mn of random matrices such that each V (s, n) is unitary. Such a family defines a map 8n : → (U (n))S given by φn (x) = (V (s, n)(x))s∈S , where is a space with measure µ. A (As )s∈S measure on U (n)S is defined to be the product of measures K(Us ; As )dUs for s ∈ S. The classical joint distribution of the family (V (s, n))s∈S is the push forward of the measure µ by 8n . Theorem 3.4. For each n, let (V (s, n))s∈S be a family of unitary random matrices such that its classical joint distribution is the (As )s∈S measure on U (n)S , and let W (n; t)t∈T be a family of constant n × n matrices such that W (n; t)t∈T has a limit distribution as N → ∞. Then the sequence of families of sets of random variables (W (n; t)t∈T , (V (s, n))s∈S ) is asymptotically ∗-free as n → +∞. Moreover, for every s ∈ S, the limit distribution of {V (s, n), V (s, n)∗ } as n → +∞ is µAs measure on the As circle whose S-transform is SµAs (z) = e 2 (1+2z) . Proof. The theorem follows immediately from Corollary 3.3 and 3.1.

Remark 8. Theorem 3.4 implies the following statement about two dimensional YangMills theory with gauge group U (N ) on the plane when N → +∞, namely, ”the nonoverlapping families of Wilson loop operators” form a multiplicative free family (see [3]). Even though there has been much progress in constructing functional integral measure in two dimensional Yang Mills theory, we take the approach of using heat kernel measure which is more convenient for our purposes.


307

4. Conclusions and Questions In this paper we have given a unified treatment of asymptotic freeness by using fat graphs. With this we have demonstrated Theorem 3.4 which may be viewed as a “Fourier transform” of the Gaussian random matrix models in [1]. It remains to see if one can produce more random matrix models from our methods and if these models can be used to solve some questions in operator algebras. Acknowledgement. I’d like to thank the referee for his very useful suggestions. I’d also like to thank Professor E. Effros and Mr. David Kan for proof reading this paper. This work is partially supported by NSF grant DMS-9500882.

Note added in proof After this work was completed and submitted for publication, the author is informed by a Referee about a preprint of Ph.Biane (to appear in Fields Institute Communications) where some of the results of this paper are proved by different methods. References 1. Voiculescu, D., Dykema, K. and Nica, A.: Free random variables. CRM Monograph series 1, Providence, RI: AMS, 1992 2. Weingarten, D.: Asymptotic behavior of groups integrals in the limit of infinite rank. J. Math. Phys. 19, 5, (1978) 3. Gopakumar, R. and Gross, D.: Mastering the master field. hep-th/9503126, to appear in Nucl. Phys. B 4. Kazakov, V. and Kostov, I.: Nucl. Phys. B. 176, 199 (1980) 5. Witten, E.: In recent developments in Gauge theories eds. G.’t Hooft et al., New York and London: Plenum Press, 1980 6. Voiculescu, D.: Limit laws for random matrices and free products. Invent. Math. 104, 201–220 (1991) 7. Itzykson, C. and Zuber, J.: The planar approximation: II. J. Math. Phys. 21, 411 (1980) 8. Witten, E.: Two-dimensional gravity and intersection theory on moduli space. Surveys in Diff. Geom. 1, 243 (1993) 9. Penner, R.: Perturbative series and moduli space of Riemann surfaces. J. Diff. Geom. 27, 35–53 (1988) 10. Bercovici, H. and Voiculescu, D.: Lévy-Hinˇcin type theorems for multiplicative and additive free convolution. Pacific J. Math. 153, no. 2, 217–248 (1992) 11. Diaconis, P. and Shahshahani, M. : On the eigenvalues of random matrices. J. Appl. Prob. Special Vol. 31A, 49–62 (1994) 12. Singer, I.: On the master field in two dimensions. In: Functional analysis on the eve of the 21st century in honor of the 80th birthday of I.M. Gelfand. Progress in Mathematics Vol. 131 13. Weyl, H.: The Classical Groups. Princeton, NJ: Princeton University Press, 1946 Communicated by H. Araki

Commun. Math. Phys. 190, 309 – 330 (1997)

Communications in


Stability of Ultraviolet-Cutoff Quantum Electrodynamics with Non-Relativistic Matter ¨ Fröhlich2 , Gian Michele Graf2 Charles Fefferman1 , Jurg 1 2

Department of Mathematics, Fine Hall, Princeton University, Princeton, New Jersey 08544, USA Theoretical Physics, ETH-Hönggerberg, CH–8093 Zürich, Switzerland

Received: 4 September 1996/ Accepted: 9 April 1997

Abstract: We prove that the quantum-mechanical ground state energy of a system consisting of an arbitrary number, M , of static nuclei of atomic number ≤ Z and of an arbitrary number, N , of Pauli electrons interacting with the quantized, ultraviolet-cutoff radiation field is bounded below by −K · M , where K is a finite constant depending on Z, on the finestructure constant α and on the ultraviolet cutoff 3, with K ≤ K 0 3, as 3 → ∞, and K 0 independent of 3.

1. Introduction and Survey of Results In this paper we prove that the quantum electrodynamics (QED) of non-relativistic quantum-mechanical matter interacting with the quantized radiation field is stable (more precisely, H-stable), provided an ultraviolet cutoff is imposed on the quantized electromagnetic vector potential. A typical physical system described by this theory consists of an arbitrary number, N , of non-relativistic electrons with electric charge −e, bare mass m > 0, spin 1/2 and a bare gyro-magnetic factor g = 2, an arbitrary number, M , of nuclei of nuclear charge ≤ Ze, for some positive integer Z < ∞ (e.g. Z < 150), and an arbitrary, variable number of photons; (see refs. [1–6]). H-stability is the statement that in the ground state of the system, the energy per charged particle (electron or nucleus) remains finite, as N and/or M tend to ∞; see [7]. Since the masses of nuclei are much larger than the electron mass m, we shall treat the nuclei as static. This would yield a lower bound on the ground state energy of the system if the gyro-magnetic factors of nuclei were less than or equal to 2, [1]. However, there are plenty of nuclei (including the proton) with a gyro-magnetic factor > 2. For systems containing such nuclei, H-stability will not hold, unless the interactions between the nuclear magnetic moments and the quantized radiation field are neglected or regularized by a hard- or soft-core form factor of nuclei. However, for a size of nuclei much smaller than the Bohr radius of an atom, the contribution of the nuclear Zeeman energies to

310

C. Fefferman, J. Fröhlich, G.-M. Graf

the ground state energy of the system is much smaller than typical atomic energies, provided an ultraviolet cutoff at an energy at least as large as the rest energy of an electron is imposed on the quantized radiation field. It is then justified to treat nuclei as static point particles in estimates of the ground state energies of atomic and molecular systems. This approximation is made throughout our paper. Much of atomic, molecular and condensed matter physics is the study of physical properties of the systems just described. Throughout this paper, we impose the Coulomb (or radiation) gauge condition on the electromagnetic potentials. In this gauge, electrons and nuclei interact through instantaneous Coulomb two-body potentials, and the electrons are coupled to the transverse degrees of freedom of the radiation field by minimal substitution (i.e., by replacing ordinary derivatives by covariant ones). Treating the nuclei as static implies that they do not couple to the transverse degrees of freedom of the radiation field. The field quanta of the transverse degrees of freedom of the electromagnetic field are the photons. A typical electron energy in an atom is of order −mc2 (Zα)2 , where c is the velocity of light, α = e2 /4π~c ≈ 1/137 is the dimensionless finestructure constant, and ~ is Planck’s constant. Interactions between photons with energy large compared to typical atomic electron energies and the electrons are turned off by means of an ultraviolet cutoff imposed on the electromagnetic vector potential. The Hilbert space of pure state vectors of a system of N -electrons and arbitrarily many photons is given by (1.1) H = (H1 )3N ⊗ F , where H1 = L2 (E3 , d3 x) ⊗ C2 is the one-electron Hilbert space, 3 denotes an antisymmetric tensor product, and F is the bosonic Fock space over the one-photon Hilbert space L2 (R3 , d3 k) ⊗ C2 , i.e., F is the symmetric tensor algebra over L2 (R3 , d3 k) ⊗ C2 . The factors C2 describe the spin states of an electron and the helicities of photons, respectively. The choice of an antisymmetric tensor product in (H1 )3N , on the r.s. of (1.1), corresponds to the Pauli principle, i.e., to the Fermi statistics of electrons. Photons are bosons, and therefore symmetric tensor products are used in the definition of F. Next, we describe the Hamiltonian that generates the dynamics of a system of N electrons, M nuclei and arbitrarily many photons. We work in units, where ~ = c = 1, and we impose the Coulomb gauge condition on the electromagnetic potentials, as announced. Then the Hamiltonian of the system is given by e ph , e = H e el + 1I ⊗ H H where e el = H

N X j=1

1 (j) ˜ (3) xj 2 + VeC , σ · −i∇j + eA 2m

(1.2)

(1.3)

and VeC =

X 1≤i<j≤N

+

M N X X α α Zl − |xi − xj | |xi − yl |

X

1≤k 0, such that U (µ) U (µ0 ) = U (µ · µ0 ) and aλ (κ)# = U (µ) cλ (κ)# U (µ)−1 , and hence the commutation relations (1.7) and (2.3) are equivalent. One now checks that XZ e ph = µ d3 κ aλ (κ)∗ |κ| aλ (κ) H λ=±

=: µ Hph ,

(2.4)

e ph U (µ)−1 . where Hph = U (µ) H Furthermore Z X 1 d3 k ˜ (3) p 3(k) (x) = cλ (k) ελ (k) eik·x A − 3/2 (2π) 2|k| λ=± Z 3 X µ d κ p 3(µκ) = aλ (κ) ελ (µκ) eiκκ ·(µx) 3/2 (2π) 2|κ| λ=± (3 )

=: µ A− µ (µx),

(2.5)

where 3µ (κ) = 3(µκ), with 3 as in Eq. (1.10). Note that, for our choice of polarization vectors ε± (·), we have that ε± (µκ) = ε± (κ). We set ∗ (3 ) (3 ) A+ µ (x) = A− µ (x) ,

314


and

(3µ )

A(3µ ) (x) = A+

(3 )

(x) + A− µ (x).

e is given by We conclude that, in dimensionless variables ξ j , η l , κ, the Hamiltonian H N X

e = H

j=1

2 1 (j) σ · −i∇j + βµ eA(3µ ) βµξ j 2mβ 2

α + VC ⊗ 1I + µ 1I ⊗ Hph , β

(2.6)

where X

VC =

1≤i<j≤N

M N X X 1 Zl − |ξ i − ξ j | |ξ i − η l |

X

+

1≤l 0 and Z < ∞, there are finite, positive constants c(0, Z), C(0, Z) and C 0 (0, Z) such that, for E(ψ, A) as in (4.5), k ψ k= 1, and L ≤ c(0, Z), E(ψ, A) ≥ − C 0 (0, Z) L−1 M ,

(4.7)

where M is the number of nuclei. If 0 ≥ Z 2 there are finite, positive constants c and C such that, for c (0, Z) = c Z −1 , 0

C(0, Z) = C Z 2 ,

(4.8)

C (0, Z) ≤ C Z . 2

Remarks. (1) In [12], explicit expressions for c (0, Z), C(0, Z) and C 0 (0, Z) are derived in the three regimes: (i) 0 ≥ Z 2 , (ii) Z 2 ≥ 0 ≥ 1, (iii) 0 ≤ 1. 1 Below we shall choose 0 = 16π α2 , where α is the finestructure constant. Since α ∼ = 1/137, regime (i) is the one most relevant for atomic physics. We therefore do not present explicit expressions for c (0, Z) and C 0 (0, Z) in regimes (ii) and (iii), but refer the reader to [12]. (2) We recall that the results proven in [5, 6] imply that, for 0 = (8π α2 )−1 , α ≤ 1/132 and Z ≤ 6, we have that  + * N  X 2 + VC ψ σ (j) · −i ∇j + A (xj ) ψ,   j=1 Z 2 (4.9) + 0 | (∇ ⊗ A)(x) | d3 x ≥ − CZ 2 (N + M ),

for a finite constant C. Here = x ∈ E3 | |x − yl | ≤ (Z + 1)−1 , for some l = 1, · · · , M . As shown in [6], this implies part (2) of Theorem 1, (for Z ≤ 6).

Stability of Ultraviolet-Cutoff QED with Non-Relativistic Matter

319

Our next task is to prove a lower bound on the operator HII , and here we follow arguments described in Sect. 3 of [6]. Recall that ( ) Z X 1 d3 k (3) i k·x p 3(k) aλ (k) ελ (k) e , A− (x) = (2π)3/2 2|k| λ=± ∗ (3) A(3) , A(3) (x) = A(3) (4.10) A(3) + (x) = + (x) + A− (x), − (x) and 3(k) is a (smooth) function, with |3(k)| ≤ 1

and

supp 3 ⊆

k | |k| ≤ (2m α2 )−1 3

;

see Eq. (2.12). We set B± (x) := curl A(3) ± (x), Clearly Bβ+ (x) =

∗

Bβ− (x)

We define F ± (x) =

Π ± (x) = ∇ ⊗ B± (x) .

+ Πβδ (x) =

,

∗

− Πβδ (x)

.

Bβ± (x), or ± (x), Πβδ

(4.11)

(4.12)

(4.13)

β, δ = 1, 2, 3, and F (x) = F + (x) + F − (x). Let f (x) be a positive function on E3 . Then f (x) F (x)2 = f (x) F + (x) F − (x) + F − (x) F + (x) + F + (x)2 + F − (x)2 ≤ 2f (x) F + (x) F − (x) + F − (x) F + (x) (4.14) 1 = 4f (x) F + (x) F − (x) + k F + (x) k2 , 2 where is the vacuum vector in F. To prove (4.14), we use the general inequality | hψ, T 2 ψi | ≤ k T ∗ ψ k k T ψ k 1 ≤ hψ, (T T ∗ + T ∗ T ) ψi, 2 for an arbitrary operator T and ψ in the intersection of the domain of T and the domain of T ∗ . We apply this inequality to the special choice of T = F ± (x). Next, we note that F − (x) F + (x) = F + (x) F − (x) + F − (x), F + (x) = F + (x) F − (x) + h, F − (x) F + (x)i = F + (x) F − (x) + k F + (x) k2 , which is a simple consequence of Eqs. (4.10)-(4.13); (recall the commutation relations (2.3) and use that aλ (k) = 0, for λ = ± and all k ∈ R3 , where is the vacuum vector in F). It is obvious that k F + (x) k2 is independent of x. Thus inequality (4.14) yields the bound

320


Z

Z f (x) F (x)2 d3 x ≤ 4 k f k∞

F + (x) F − (x) d3 x + 2 k f k1 k F + (0) k2 .

(4.15) We apply inequality (4.15) to the examples in (4.13) and sum over β, δ = 1, 2, 3. This yields Z 2 f (x) | curl A(3) (x) | d3 x ( ) Z X 2 a∗λ (k)|k|aλ (k) (4.16) ≤ 2 k f k∞ d3 k | 3(k) | λ=±

+ 2 k f k1 C3 , where

3 X

C3 =

k Bβ+ (0) k2 ,

(4.17)

β=1

and

Z

2 f (x) | ∇ ⊗ curl A(3) (x) | d3 x ( Z 2

d3 k | 3(k) |

≤ 2 k f k∞

X

) a∗λ (k)|k|3 aλ (k)

(4.18)

λ=±

+ 2 k f k1 D3 , where D3

=

3 X

k Πβ+ δ (0) k2 .

(4.19)

β,δ=1

Comparing these expressions with Eq. (2.4) and using the definition of 3(k), Eq. (2.12), we find that Z 2 (4.20) f (x) | curl A(3) (x) | d3 x ≤ 2 k f k∞ Hph + 2 k f k1 C3 , and

Z

2 f (x) | ∇ ⊗ curl A(3) (x) | d3 x ≤ 2 k f k∞ (2m α2 )−2 32 Hph + 2 k f k1 D3 .

(4.21)

We are now prepared to prove a lower bound on the operator HII defined in (4.3). We choose (4.22) f (x) = exp −L−1 D(x/α) , so that

k f k∞ = 1,

k f k1 ≤ 8π α3 L3 M .

We choose 0 and L so small that

8π α 0 + 8π C(0, Z) L 2

2

3 2m

(4.23)

2 ≤ 1.

(4.24)


Then

321

HII ≥ − 64π 2 α5 L3 M 0 C3 + α2 C(0, Z) L2 D3 .

Next, we calculate the constants C3 and D3 . The result is that 4 Z 1 π 3 3 C3 ≤ d k|k| = , 2 2 2m α2

(4.25)

(4.26)

B3

and 1 ≤ 2

D3

Z

π d k|k| = 3 3

B3

3

3 2m α2

6 ,

(4.27)

where B3 is the ball of radius (2m α2 )−1 3 centered at the origin of R3 ; (refer to Eq. (2.12)). Plugging (4.26) and (4.27) into the r.s. of (4.25) and using (4.24), we obtain the bound 4 3 M. (4.28) HII ≥ − 4π 2 α−5 L3 2m Choosing



ÿ

L ≤ min c (0, Z), 16π C(0, Z)

3 2m

2 !−1/2

  ,

(4.29)

see Theorem 2 and (4.24), and combining inequalities (4.7) and (4.28), we find that ! ÿ 4 C 0 (0, Z) 3 −5 3 M. (4.30) + H ≥ − 4π α L 2m L This bound enables us to complete the proof of part (1) of Theorem 1: By (4.24) and (4.29), we can choose 0 =

16π α2

−1

00

, L = C (α, Z)

2m , 3

(4.31)

00

for some finite constant C (α, Z). Then inequality (4.30) shows that there is a finite constant ε(α, Z) depending on α and Z, but independent of 3, such that H ≥ − ε(α, Z)

3 · M. 2m

(4.32)

e = 2m α2 H, part (1) of Theorem 1 is proven. Hence, since H In order to prove part (2) of Theorem 1, we set 0 = (16π α2 )−1 and assume that 0 ≥ Z 2 , i.e., 16π(αZ)2 ≤ 1. Furthermore we make the following choices: L = c0 Z −3/2 , for some c0 ≤ c,

(4.33)

3 = 2m Z α

(4.34)

2

For these choices

5/4

.

L ≤ c (0, Z) = c Z −1 , for Z ≥ 1 ,

so that Theorem 2 applies, and

322


8π C(0, Z) L2

3 2m

2

2 1 = C 000 Z −1 Z 2 α5/4 < 2

(4.35)

for Z 3 α5/2 small enough, so that inequality (4.24) holds. Then inequality (4.30) implies that (4.36) H ≥ − ε Z 7/2 · M , e = 2m α2 H, part (2) of Theorem for some constant ε independent of α and Z. Since H 1 follows from (4.36). It remains to prove Theorem 2. This is the task of Sect. 5, (see also [12]).

5. Outline of Proof of Theorem 2 Before beginning with the proof of Theorem 2 we briefly recall its statement. Let A(x) denote a vector potential on E3 , and let B(x) = curl A(x). We fix a configuration of nuclei at positions y1 , · · · , yM in E3 and with atomic numbers Z1 , · · · , ZM . We assume that max Zl ≤ Z, for an arbitrary, but fixed Z < ∞. 1≤l≤M

Let HPauli =

N X

2

σ (j) · i ∇j − A(xj )

+ VC ,

j=1

with VC as in (2.7). For a smooth, bounded vector potential A, HPauli is a densely defined, selfadjoint operator on (H1 )3N , with H = L2 (E3 , d3 x) ⊗ C2 (see Sect. 1) whose spectrum is bounded below. In Sect. 4, the following result, Theorem 2, has been stated. Theorem. Given positive constants 0 and Z, there are finite, positive constants c (0, Z), C(0, Z) and C 0 (0, Z) ≤ C(0, Z) such that, for L ≤ c (0, Z), Z hψ, HPauli ψi + 0 |B(x)|2 e−D(x)/L d3 x Z C 0 (0, Z) M, + C(0, Z) L2 |(∇ ⊗ B)(x)|2 e−D(x)/L d3 x ≥ − L for any ψ ∈ (H1 )3N of norm 1. Remark. The constants c (0, Z), C(0, Z) and C 0 (0, Z) only depend on 0 and Z. If we are not interested in optimal estimates we can set C(0, Z) = C 0 (0, Z). In [12] it is shown that, for 0 ≥ Z 2 , there are finite, positive constants c and C such that c (0, Z) = c Z −1 , C(0, Z) = C Z 2 , C 0 (0, Z) ≤ C Z 2 . We sketch the proof of the above theorem. Our goal is to convey the spirit of the proof, sacrificing technical accuracy for the sake of simplicity. A complete proof of the theorem appears in [12]. The proof combines tricks from [13] and [14] with a new lower bound for the Pauli kinetic energy. We explain first the old tricks, then the new kinetic energy bound, and finally how to mix the ingredients and prove the theorem. Without sacrificing any ideas, we just treat the case Z1 = · · · = ZM = 1, but see [12].


323

To explain the old tricks, we first note that one can associate local kinetic and potential energies to a cube Q ⊂ E3 . If A is a vector potential and ψ ∈ (H1 )3N , then we write N Z X 2 | σ (k) · i ∇xk − A(xk ) ψ | χQ (xk ) d3 x1 · · · d3 xN TPauli (ψ, A, Q) = k=1

and T (ψ, A, Q) =

E3N

N Z X k=1

(5.1)

2 | i ∇xk − A(xk ) ψ | χQ (xk ) d3 x1 · · · d3 xN ,

(5.2)

E3N

where χQ denotes the characteristic function of Q. To localize the potential energy to Q, we use the following identity from [13]. ZZ 1 d3 z dR z R , where (5.3) VC = 2π R5 E3 ×R+

z R = Nz R (Nz R − 1) + Mz R (Mz R − 1) − 2 Nz R Mz R , with Nz R = number of electrons in the ball B(z, R), and Mz R = number of nuclei in B(z, R) .

(5.4) (5.5) (5.6)

Here, B(z, R) denotes the ball in E3 with center at z and radius R. Identity (5.3) lets us define the localized potential energy as follows. If Q ⊂ E3 is a cube of side δ, then we set ZZ X 1 zR d3 z dR ψ, ψ . (5.7) P E(ψ, Q) = 2π MzR R5 yk ∈ Q B(z, R) 3 yk 1 R < 10 δ Since zR ≥ −CMzR always, and, in particular, zR ≥ 0 when MzR = 0, the following remarks are trivial consequences of the above definitions. (i) If E3 is partitioned into cubes {Qν } of sides {δν }, then X N X 2 (k) TPauli (ψ, A, Q), and (5.8) σ · i ∇xk − A(xk ) ψ, ψ = k=1

hVC ψ, ψi ≥

X

P E(ψ, Qν ) − C

ν

X ν

ν

MQν δν−1 .

(5.9)

In (5.9) and throughout our discussion, NQ denotes the number of electrons and MQ the number of nuclei in Q. Similarly: (ii) If Q is partitioned into subcubes {Qν } with sides {δν }, then X T (ψ, A, Q) = T (ψ, A, Qν ) , (5.10) ν

TPauli (ψ, A, Q) = P E(ψ, Q) ≥

X ν

X

TPauli (ψ, A, Qν ) , and

ν

P E(ψ, Qν ) − C

X ν

δν−1 MQν .

(5.11) (5.12)

324


R R R The standard integration by parts, E3 |σ·(i∇−A)ϕ|2 d3 x = E3 |(i∇−A)ϕ|2 d3 x± E3 (σ · B)ϕ · ϕ¯ d3 x trivially implies the following; (we write Q∗ for the double of Q). KE Comparison Lemma. Let Q be a cube of side δ. Suppose the magnetic field B is constant on Q∗ . Then TPauli (ψ, A, Q∗ ) ≥ c T (ψ, A, Q) − C {|B| + δ −2 }hNQ∗ ψ, ψi .

(5.13)

One of the main technical results in [14], proven also in [12], is as follows. Local Stability of Matter. Let Q be a cube of side δ. Suppose the magnetic field B satisfies |B| ≤ C δ −2 on Q∗ . Then δ · T (ψ, A, Q∗ ) + P E(ψ, Q) ≥ − C δ −1 hNQ∗ ψ, ψi .

(5.14)

Here, we have oversimplified slightly the statement of local stability. We will also oversimplify below by neglecting the distinction between Q and Q∗ , and by supposing where necessary that NQ and MQ are of the same order of magnitude. (If not, then Q is far from electrically neutral, so that P E(ψ, Q) will be large positive, and whatever lower bounds we claim for the energy will therefore be trivial.) The new ingredient in the proof of our theorem is the following estimate. KE Lower Bound. Let Q be the unit cube, and suppose the magnetic field B is constant on Q∗ . If K > 1, then TPauli (ψ, A, Q∗ ) ≥ c KhNQ ψ, ψi − C K 5/2 {|B| + 1} − ChNQ∗ ψ, ψi .

(5.15)

We sketch the proof of the KE Lower Bound, then return to the proof of Theorem 2. The main point in establishing (5.15) is to prove an estimate for one-electron wave functions ϕ ∈ H1 , from which (5.15) will follow by separation of variables. Without significant loss of generality, we may suppose that the magnetic field B points parallel to the x3 -axis. In a suitable gauge we then have A(x) = (A1 (x), A2 (x), 0), with Aν (x) independent of x3 , and |∇Aν | ≤ C|B|. Integration by parts yields the identities Z Z Z 2 3 2 3 | σ · (i∇ − A)ϕ | d x = | (i∇ − A)ϕ | d x± (σ · B)ϕ · ϕd ¯ 3 x, (5.16) E3

Z

E3

E3

2 Z 3 ∂ 2 | σ · (i∇ − A)ϕ | d3 x = ϕ ∂x3 d x

E3

E3

2 Z X ∂ σν i − Aν ϕ d3 x. + ∂xν E3

(5.17)

ν=1,2

Applying (5.16) and (5.17) with ϕ replaced by θϕ, for a suitable cutoff θ, we derive the estimate Z 2 | σ · (i∇ − A)ϕ | d3 x ≥ (5.18) Q∗

  2  Z Z  X ∂ ∂ϕ 2 −1 3 2 3 + (|B| + 1) c i ∂xν − Aν ϕ  d x − C |ϕ| d x.  ∂x3 Q

ν=1,2

Q∗


325

In deriving (5.18), we had to multiply (5.16) by (|B| + 1)−1 to reduce the effect of R ¯ 3 x. Next, we cut the unit cube Q into thin rectangular the error term E3 (σ · B)ϕ · ϕd tubes Tα = Iα1 × Iα2 × Iα3 , where |Iα3 | = 1 and Iα1 × Iα2 is a small square of size ∼ (|B| + 1)−1/2 . Thus the Tα are parallel to the magnetic field. The number of distinct Tα ’s is ∼ (|B| + 1). Since |Iα1 |, |Iα2 | ∼ (|B| + 1)−1/2 , while |∇Aν | ≤ C|B| and Aν (x) −1/2 , for x ∈ Tα , where is independent of x3 , it follows that |Aν (x) − Aα ν | ≤ C(|B| + 1) α Aν is the value of Aν at the center of Tα . Hence, for x ∈ Tα , we have 2 2 i ∂ − Aν (x) ϕ ≥ 1 i ∂ − Aα ϕ −C(|B| + 1)|ϕ|2 , ν ∂xν 2 ∂xν and therefore (5.18) implies that ( Z X Z ∂ϕ 2 2 3 |σ · (i∇ − A)ϕ| d x ≥ c ∂x3 Q∗ α Tα  2  Z X 3 i ∂ − Aα +(|B| + 1)−1 x − C |ϕ|2 d3 x . (5.19) ϕ d ν ∂xν  ∗ Q ν=1,2

This is the basic one-electron estimate in the proof of (5.15). By rescaling and making a gauge transformation, we may regard the r.s. of (5.19) as the kinetic energy of a free particle in ∼ (|B| + 1) copies of the unit cube. Thus, the KE Bound (5.15) follows easily from (5.19) by separation of variables (reflecting the fact that there are ∼ K 3/2 eigenstates of a free particle in the unit cube having energy less than K). In using (5.15), we will drop the last term hNQ∗ ψ, ψi. Roughly speaking, if we neglect the distinction between Q∗ and Q, then the error term hNQ∗ ψ, ψi may be absorbed in the main term c KhNQ ψ, ψi if K is large. This is of course only a heuristic justification for dropping the last term in (5.15). In the full proof we keep the last term which complicates the analysis. We can start to combine the above ingredients to prove our theorem. The first main point is to prove the following local stability bound. Local Pauli Stability (Constant Fields). Let K, B0 be large constants, to be picked later, and let Q be a cube of side δ. Suppose the magnetic field B = curl A is constant on Q, and satisfies there |B| ≤ CB0 δ −2 . Then 1/2

3/2

CB0 δ · TPauli (ψ, A, Q) + P E(ψ, Q) ≥ −C K 5/2 B0

δ −1 + c

1/2

B0 K hNQ ψ, ψi. δ (5.20)

An essential point here is that B0 appears on the right side in (5.20), raised to a power −1/2 less than 2. To prove (5.20), we subdivide Q into subcubes {Qˆ α } of side ∼ B0 δ. Applying local stability of matter (5.14) to the Qˆ α , and summing over α using (5.10) and (5.12), we find that −1/2

C B0

δ · T (ψ, A, Q) + P E(ψ, A, Q)

≥ −C B0 δ −1 hNQ ψ, ψi − C B0 δ −1 MQ . 1/2

1/2

(5.21)

We drop the MQ term in (5.21) for expository purposes. The rough justification for this is that we may suppose (as explained earlier) that MQ and NQ are comparable, so

326


that the two terms on the right side in (5.21) are also comparable. In the complete proof of our theorem, we do not drop the MQ term from (5.21), and matters become more complicated. The KE Comparison Lemma yields (roughly speaking) −1/2

C 0 B0

−1/2

δ · TPauli (ψ, A, Q) ≥ C B0

δ · T (ψ, A, Q) − C 0 B0 δ −1 hNQ , ψ, ψi. (5.22) 1/2

Putting (5.22) into (5.21), we find that −1/2

C B0

δ · TPauli (ψ, A, Q) + P E(ψ, Q) ≥ −C 0 B0 δ −1 hNQ ψ, ψi. 1/2

(5.23)

Also, the KE Lower Bound (5.15) shows, roughly speaking, that C B0 δ · TPauli (ψ, A, Q) ≥ c K B0 δ −1 hNQ ψ, ψi − C K 5/2 B0 δ −1 . 1/2

1/2

3/2

(5.24)

(Here, we have rescaled (5.15) from the unit cube to the cube Q.) Adding (5.23) and (5.24), we obtain 1/2

C B0 δ · TPauli (ψ, A, Q) + P E(ψ, Q) ≥ (c K − C 0 ) B0 δ −1 hNQ ψ, ψi − C K 5/2 B0 δ −1 . 1/2

3/2

(5.25)

If we take K to exceed a large enough universal constant, then (5.25) implies (5.20). The proof of local Pauli Stability is complete, modulo polite fictions. So far, we have worked with a constant magnetic field B. If we suppose instead that B varies only slightly on Q, then we can write the vector potential A in the form A = Ac + A# on Q, where Ac has constant magnetic field, and A# is small. We may then apply local Pauli stability (5.20) to the vector potential Ac , and regard A# as a small perturbation. In this spirit, we can extend (5.20) as follows. Local Pauli Stability 1 (Varying Magnetic Fields). Let K, B0 be large constants to be picked later, and let Q be a cube with side δ. Suppose the magnetic field B = curl A, satisfies Z

|B(x)|2 d3 x ≤ B02 δ −1 and

(5.26)

|B(x) − meanQ B|2 d3 x ≤ B0−1 δ −1 .

(5.27)

Q

Z

Q

Then we have that 1/2

C B0 δ · TPauli (ψ, A, Q) + P E(ψ, Q) ≥ −C K 5/2 B0 δ −1 + c 3/2

1/2

K B0 hNQ ψ, ψi. δ

(5.28)


that

327

We omit all discussion of the proof, but just point out that (5.26) and (5.27) suggest Z Z |B(x) − meanQ B|2 d3 x |B(x)|2 d3 x, Q

Q

so that B varies only slightly on Q. The proof of our theorem proceeds by making a Calderón-Zygmund decomposition of E3 into cubes {Qν } that satisfy (5.26) and (5.27). We apply Local Pauli Stability (5.28) to each Qν , and sum the result over ν. To construct the Qν , we begin by partitioning E3 into cubes {Q0α } of side L, where L is as in the statement of our theorem. We retain all the Q0α that satisfy (5.26), (5.27), and we cut all the other Q0α into subcubes of side L/2. Thus, we obtain a collection {Q1α } of cubes of side L/2. We retain all the Q1α that satisfy (5.26), (5.27), and we cut all the other Q1α into subcubes of side L/4. Thus, we obtain a collection {Q2α } of cubes of side L/4. We continue in this way, and let {Qν } denote the collection of all the cubes Qjα (j, α arbitrary) retained during the above construction. Thus E3 is partitioned into cubes {Qν } that satisfy (5.26) and (5.27). Let δν = side (Qν ), and note that δν ≤ L. Let Qν be one of the Calderón-Zygmund cubes with side δν strictly less than L. Then Qν arose by cutting a cube Q+ν with side 2δν . We know that Q+ν cannot satisfy (5.26) and (5.27), since we cut Q+ν to arrive at Qν . If (5.26) fails for Q+ν , then, roughly speaking, we have Z |B(x)|2 d3 x > c B02 δν−1 . (5.29) Qν

If (5.27) fails for Q+ν , then, roughly speaking, we have that Z |B(x) − meanQν B|2 d3 x > c B0−1 δν−1 , Qν

so that

Z

δν2 | (∇ ⊗ B)(x) | d3 x > c B0−1 δν−1 , 2

(5.30)

Qν

by the Poincaré inequality. Hence, for every Qν , we have ν ∈ S1 ∪ S2 ∪ S3 , where S1 = {ν : (5.29) holds},

S2 = {ν : (5.30) holds},

S3 = {ν : δν = L} . (5.31) Next, we bring in the function D(x) = min {|x − yk | | 1 ≤ k ≤ M } from the statement of our theorem. We set θν = maxx ∈ Qν e− D(x)/L .

(5.32)

Since Qν has side δν ≤ L, definition (5.32) shows at once that (for a constant c) c θν ≤ e− D(x)/L ≤ θν for all x ∈ Qν .

(5.33)

The basic estimate for our Calderón-Zygmund cubes, apart from (5.26) and (5.27), is the following bound.

328


X ν

θν δν−1 ≤ C B0−2 Z

Z

|B(x)|2 e−D(x)/L d3 x +

E3

L2 | (∇ ⊗ B) (x) | e−D(x)/L d3 x + 2

C B0

C M. L

(5.34)

E3

To prove (5.34), we argue as follows. If ν ∈ S1 , then (5.29) and (5.33) imply that Z 2 | B(x) | e−D(x)/L d3 x . θν δν−1 ≤ C B0−2 Qν

Summing over ν ∈ S1 , we obtain the bound Z X 2 −2 −1 θν δ ν ≤ C B0 | B(x) | e−D(x)/L d3 x . ν ∈ S1

(5.35)

E3

Similarly, if ν ∈ S2 , then (5.30) and (5.33) imply that Z 2 θν δν−1 ≤ C B0 L2 | (∇ ⊗ B) (x) | e−D(x)/L d3 x,

since δν ≤ L .

Qν

Summing over ν ∈ S2 , we get X

θν δν−1 ≤ C B0

ν ∈ S2

Z

L2 | (∇ ⊗ B) (x) | e−D(x)/L d3 x . 2

(5.36)

E3

For ν ∈ S3 , we have that θν δν−1

= θν L

−1

≤ CL

−4

Z e

−D(x)/L

d x ≤ CL 3

−4

M Z X

e−|x−yk |/L d3 x,

k=1 Q ν

Qν

by (5.33) and the definition of D(x). Summing over ν ∈ S3 , we find that X

θν δν−1 ≤ C L−4

M Z X

ν ∈ S3

k=1

e−|x−yk |/L d3 x =

C0 M. L

(5.37)

E3

Combining estimates (5.35), (5.36) and (5.37), we obtain the desired result (5.34). At last we are ready to prove our theorem. For each ν, we have C B0 δν · TPauli (ψ, A, Qν ) + P E(ψ, Qν ) ≥ − C B0 δν−1 θν K 5/2 + 1/2

3/2

1/2

c B0

θν δν−1 KhNQν ψ, ψi .

(5.38)

On one hand, (5.38) reduces to local Pauli stability (inequality (5.28)) if Qν contains at least one nucleus, since then we have θν = 1 by (5.32). On the other hand, if Qν contains no nuclei, then a glance at definition (5.7) gives P E(ψ, Qν ) = 0, so that (5.38) is an immediate consequence of the KE Lower Bound (5.15). (Recall, we are ignoring the last term on the right side in (5.15).) Thus, (5.38) holds in all cases. Summing (5.38) over all ν, and invoking (5.8) and (5.9), we obtain the estimate


* C

1/2 B0 L

N X

2

σ (k) · i∇xk − A(xk )

k=1 3/2

+ hVC ψ, ψi ≥ − C K 5/2 B0 ( +

* cK

1/2 B0

X ν

X ν

θν δν−1

329

+ ψ, ψ θν δν−1

+

NQν ψ, ψ

− C

X ν

) θν δν−1

M Qν

, (5.39)

since δν ≤ L and θν = 1 when MQν 6= 0. We have agreed to suppose that NQν and MQν are comparable. Therefore, the term in curly brackets in (5.39) is positive, since K and B0 are large constants. Let us restrict the parameter L, by demanding that 1/2

C B0 Then (5.34), (5.39) and (5.40) imply −1/2

Z

(5.40)

| B(x) | e−D(x)/L d3 x + 2

hHPauli ψ, ψi + CK B0

E3

Z 5/2 C K B0

L ≤ 1.

2

L | (∇ ⊗ B)(x) | e 2

3/2

C K B0 d x ≥ − L

−D(x)/L 3

M,

(5.41)

E3

where CK depends on K. Our theorem follows easily from (5.41). We first take K to be a large enough universal −1/2 ≤ 0. constant. Then, with 0 as in our theorem, we pick B0 so large that CK B0 −1 1/2 Finally we take c (0, Z) = C B0 , with C as in (5.40). Thus, for L < c (0, Z), (5.40) holds; hence, (5.41) holds and tells us that Z 2 hHPauli ψ, ψi + 0 | B(x) | e−D(x)/L d3 x Z

E3

L2 | (∇ ⊗ B)(x) | e−D(x)/L d3 x ≥ − C (0, Z) 2

+ C (0, Z)

M . (5.42) L

E3

This is the conclusion of our theorem. Of course we have repeatedly oversimplified the argument. For full details, see [12]. References 1. Fröhlich, J., Lieb, E.H., Loss, M.: Stability of Coulomb systems with magnetic fields I: The one-electron atom. Commun. Math. Phys. 104, 251–270 (1986) 2. Lieb, M., Loss, M.: Stability of Coulomb systems with magnetic fields II: The many-electron atom and the one-electron molecule. Commun. Math. Phys. 104, 271–282 (1986) 3. Loss, M., Yau, H.T.: Stability of Coulomb systems with magnetic fields III: Zero energy bound states of the Pauli operator. Commun. Math. Phys. 104, 283–290 (1986) 4. Fefferman, C.: Stability of Coulomb systems in a magnetic field. Proc. Natl. Acad. Sci. 92, 5006–5007 (1995)

330


5. Lieb, E.H., Loss, M., Solovej, J.P.: Stability of matter in magnetic fields. Phys. Rev. Lett. 75, 985–989 (1995) 6. Bugliaro, L., Fröhlich, J., Graf, G.M.: Stability of quantum electrodynamics with non-relativistic matter. Phys. Rev. Lett.77, 3494–3497 (1996) 7. Lieb, E.H.: The stability of matter: From atoms to stars: Selecta of Elliott H. Lieb. W. Thirring ed., Berlin–Heidelberg–New York: Springer, 1991 8. Glimm, J., Jaffe, A.: Quantum physics: A functional integral point of view. 2nd ed. , Berlin–Heidelberg– New York: Springer, 1987 9. Glimm, J., Jaffe, A.: Collected papers, Boston: Birkhäuser 1985 10. Bach, V., Fröhlich, J., Sigal, I.M.: Mathematical theory of non-relativistic matter and radiation. Lett. Math. Phys. 34, 183–201 (1995) 11. Fröhlich, J.: Application of commutator theorems to the integration of representations of Lie algebras and commutation relations. Commun. Math. Phys. 54, 135–150 (1977) 12. Fefferman, C.: On electrons and nuclei in a magnetic field. Adv. Math. 124, 100–153 (1996) 13. Fefferman, C., de la Llave, R.: Relativistic stability of matter-I. Revista Matematica Iberoamericana 2, 119–161 (1986) 14. Minicozzi, W.: Notes to Lectures by C. Fefferman at Stanford University (1994) Communicated by B. Simon

Commun. Math. Phys. 190, 331 – 373 (1997)

Communications in


Bispectral Algebras of Commuting Ordinary Differential Operators B. Bakalov? , E. Horozov, M. Yakimov?? Department of Mathematics and Informatics, Sofia University, 5 J. Bourchier Blvd., Sofia 1126, Bulgaria. E-mail: [email protected] Received: 8 April 1996 / Accepted: 9 April 1997

Abstract: We develop a systematic way for constructing bispectral algebras of commuting ordinary differential operators of any rank N . It combines and unifies the ideas of Duistermaat–Grünbaum and Wilson. Our construction is completely algorithmic and enables us to obtain all previously known classes or individual examples of bispectral operators. The method also provides new broad families of bispectral algebras which may help to penetrate deeper into the problem. Contents 0 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332 1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 336 1.1 Sato’s Grassmannian and KP–hierarchy . . . . . . . . . . . . . . . . . . . . . . . . . 336 1.2 Darboux transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338 1.3 Bessel operators, Bessel planes and related objects . . . . . . . . . . . . . . . . . 339 1.4 Involutions in Sato’s Grassmannian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340 2 Polynomial Darboux Transformations of Bessel Wave Functions . . . . . 342 3 Bispectrality of Polynomial Darboux Transformations . . . . . . . . . . . . . . 350 4 Polynomial Darboux Transformations of Airy Planes . . . . . . . . . . . . . . 355 5 Explicit Formulae and Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 360 5.1 Monomial Darboux transformations of Bessel planes . . . . . . . . . . . . . . . 361 5.2 Polynomial Darboux transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . 367 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 371 ? Present address: Department of Mathematics, MIT, Cambridge, MA 02139. E-mail: [email protected] ?? Present address: Department of Mathematics, University of California, Berkeley, CA 94720. E-mail: [email protected]

332

B. Bakalov, E. Horozov, M. Yakimov

0. Introduction In this paper we reconsider the bispectral problem. As stated in [DG], it asks for which ordinary differential operators L(x, ∂x ) there exists a family of eigenfunctions 9(x, z) that are also eigenfunctions for another differential operator 3(z, ∂z ), but this time in the “spectral parameter” z, to wit L(x, ∂x )9(x, z) = f (z)9(x, z), 3(z, ∂z )9(x, z) = Θ(x)9(x, z)

(0.1) (0.2)

for some functions f (z), Θ(x). Both operators L and 3 are called bispectral. This problem first appeared in [G1] in connection with “limited angle tomography” (see also [G2, G3, DG]). Later it turned out to be related to several, seemingly far from it, topics and in particular, to soliton mathematics. To be more specific, we have to mention the deep connection with some very actively developing areas of research in mathematics and theoretical physics like the Calogero–Moser particle system [W2, K] (see also [R]), additional symmetries of KdV and KP hierarchies [MZ, BHY4], representation of the theory W1+∞ –algebra [BHY4], etc. These studies not only revealed the rich mathematical structure behind the bispectral problem, but also (if we use a remark by G. Wilson [W2]) “deepened the mystery” around it. Thus, not only applications, but also purely mathematical questions motivated the great activity in the past few years in the bispectral problem. In the present paper we construct new families of bispectral operators. In order to explain better our contribution, we need to review some of the achievements in the subject. The first general result in the direction of classifying bispectral operators belongs to J. J. Duistermaat and F. A. Grünbaum [DG]. They determined all second order operators L admitting an operator 3 such that the pair (L, 3) solves the bispectral problem (0.1, 0.2). Their answer is as follows. If we write the operator L in the standard Schrödinger form d2 L = 2 + u(x), dx the bispectral potentials u(x) are given (up to translations and rescalings of x and z) by the following list: u(x) = x (Airy); −2 u(x) = cx , c ∈ C (Bessel); u(x), which can be obtained by finitely many rational Darboux transformations from u(x) = 0; u(x), which can be obtained by finitely many rational Darboux 1 transformations from u(x) = − 2 . 4x

(0.3) (0.4) (0.5) (0.6)

The family (0.5) has previously appeared in [AMM, AM] and is known as “rational solutions of KdV ”. They can be obtained also by applying “higher KdV flows” to potentials vk (x) = k(k + 1)x−2 , k ∈ N. The second family (0.6) was interpreted by F. Magri and J. Zubelli [MZ] as potentials invariant under the flows of the “master symmetries” or Virasoro flows. Besides the classification of the bispectral operators by their order, another scheme has been suggested in [DG] and used in [W1]. Below we explain it in a general context as

Bispectral Algebras of Commuting Ordinary Differential Operators

333

it will be used throughout this paper. One may consider an operator L(x, ∂x ) as an element of a maximal algebra A of commuting ordinary differential operators [BC]. Following G. Wilson [W1], we call such an algebra bispectral if there exists a joint eigenfunction 9(x, z) for the operators L in A that satisfies also Eq. (0.2). The dimension of the space of eigenfunctions 9(x, z) is called rank of the commutative algebra A (see e.g. [KrN]). This number coincides with the greatest common divisor of the orders of the operators in A. For example, the operators with potentials (0.5) belong to rank 1 algebras and those with potentials (0.3, 0.4, 0.6) to rank 2 algebras [DG]. All rank 1 maximal bispectral algebras were recently found by G. Wilson [W1]. These algebras do not necessarily contain an operator of order two. The methods of the above mentioned papers [DG] and [W1] may seem quite different. Indeed, while in [DG] the “rational” Darboux transformations play a decisive role, G. Wilson [W1] uses planes in Sato’s Grassmannian obtained from the standard H+ = span{z k }k≥0 by imposing a number of conditions on it. One of our main observations is that both methods, appropriately modified, can be looked upon as the two sides of one general theory. From this new point of view in the present paper we construct nontrivial maximal bispectral algebras of any rank N , thus extending the results from [DG, W1]. For example, for any positive integer k we obtain bispectral algebras of rank N with the lowest order of the operators equal to kN . Our method allows us to obtain all classes and single examples of bispectral operators known to us by a unique method. At the same time we suggest an effective procedure for constructing bispectral operators, despite the fact that the theory involves highly transcendental functions like Airy or Bessel ones. The point is that the latter are used in the proofs while the algorithm given at the end of Sect. 3 performs arithmetic operations and differentiations only on explicit rational functions. In the rest of the introduction we describe in more detail the main results of the paper together with some of the ideas behind them. The framework of our construction is Sato’s theory of KP–hierarchy [S, DJKM, SW, vM]. In particular, our eigenfunctions are Baker or wave functions 9V (x, z) corresponding to planes V in Sato’s Grassmannian Gr and our algebras of commuting differential operators are the spectral algebras AV . We obtain our bispectral algebras by applying a version of Darboux transformations, introduced in our previous paper [BHY3], on specific wave functions which we call Bessel (and Airy) wave functions (see Sects. 1 and 4). As both notions are fundamental for the present paper we focus the attention of the reader on them. Bessel wave functions are the simplest functions which solve the bispectral problem (see [Z] where they were introduced and [F]). They can be defined as follows. For β = (β1 , . . . , βN ) ∈ CN , 9β (x, z) is the unique wave function satisfying x∂x 9β (x, z) = z∂z 9β (x, z) (i.e. 9β (x, z) depends only on xz) and Lβ (x, ∂x )9β (x, z) = z N 9β (x, z), where Lβ (x, ∂x ) = x−N (x∂x − β1 ) · · · (x∂x − βN ) is the Bessel operator. Obviously, the above equations lead to Lβ (z, ∂z )9β (x, z) = xN 9β (x, z). Similarly, for α = (α0 , α2 , . . . , αN −1 ) ∈ CN −1 consider the (generalized) Airy wave function (see [KS, Dij]) satisfying:

334


ÿ ∂xN +

N −1 X

! αi ∂xN −i − α0 x 9α (x, z) = z N 9α (x, z).

i=2

It depends only on α0 x + z N and again gives a simple solution to the bispectral problem. The Airy case is in many respects similar to the Bessel one. As we find the latter case richer in properties, we pay more attention to it, contenting ourselves only with a sketch of the former. Classically, a Darboux transformation [BC, Da] of a differential operator L, presented as a product L = QP , is defined by exchanging the places of the factors, i.e. L = P Q. Obviously, if 9(x, λ) is an eigenfunction of L, i.e. L(x, ∂x )9(x, λ) = λ9(x, λ), then P 9(x, λ) is an eigenfunction of L. Here we introduce Darboux transformations not only on individual operators but also on the entire spectral algebra corresponding to a Bessel (or Airy) plane. In other words, we apply them on operators L which are polynomials h(Lβ ) of Bessel (or Airy) operators. These transformations may be considered as Bäcklund–Darboux transformations on the corresponding wave functions [AvM]. Such Darboux transformation is completely determined by a choice of a ZN -invariant operator P (x, ∂x ) with rational coefficients normalized appropriately by a factor g −1 (z) to ensure that 9W (x, z) =

1 P (x, ∂x )9β (x, z) g(z)

is a wave function. We call 9W (x, z) (respectively W ) a polynomial Darboux transformation of 9β (respectively Vβ ). The definition of polynomial Darboux transformations of Airy planes is similar to that in the Bessel case with only minor modifications: P is not necessarily ZN -invariant and g(z) has to belong to C[z N ]. Thus we come to our main result. Theorem 0.1. If the wave function 9W (x, z) is a polynomial Darboux transformation of a Bessel or Airy wave function 9β (x, z), then it is a solution to the bispectral problem, i.e. there exist differential operators L(x, ∂x ), 3(z, ∂z ) and functions f (z), Θ(x) such that (0.1) and (0.2) hold. Note the difference between the classical definition and the definition introduced here. In contrast to [DG] where the authors make a finite number of “rational” Darboux transformations, we perform only one polynomial Darboux transformation to achieve the same result. Our definition of polynomial Darboux transformation is constructive as P (x, ∂x ) is determined by the finite dimensional space KerP . For this reason one can explicitly present at least one operator L ∈ AW ; it can be given by P h(Lβ )P −1 . Usually it is of high order. But as it is only one element of the whole bispectral algebra AW there can be eventually operators of a lower order. For example, the bispectral operators of [DG] are of order two. There is a simple procedure (see [BHY3]) to produce the entire bispectral algebra AW of commuting differential operators. In addition, one can show that the spectral curve SpecAW (see e.g. [AMcD] for definition) is rational, unicursal and ZN –invariant. In the course of our work we have widely used important ideas introduced by G. Wilson [W1]. Among them we mention first the idea of explicitly writing conditions on vectors of a plane V ∈ Gr which define the new plane obtained by a Darboux transformation. Second is the notion of involutions on Sato’s Grassmannian. In particular, we extend the bispectral involution b introduced in [W1] to the manifolds of polynomial


335

Darboux transformations. More precisely, we prove the following theorem, from which Theorem 0.1 is an obvious consequence. Theorem 0.2. (i) The bispectral involution is defined for planes W which are polynomial Darboux transformations of Bessel or Airy planes. (ii) The image bW of such a plane W is again a polynomial Darboux transformation of the corresponding Bessel (respectively) Airy plane. Our main concern in the present paper is to prove Theorem 0.2. Our second goal is to provide explicit formulae and examples (see Sect. 5), which are not only an illustration of our method but also show the existence of new families of bispectral operators with particular properties. Some of them generalize directly the well known ones like Duistermaat–Grünbaum’s “even case” (0.6) [DG]. Other families exhibit quite different properties from the well known examples. In this respect Sect. 5 has also the role to supply diverse experimental material for new insights into the theory of bispectral algebras. We draw the attention of the reader also to the explicit formulae for the action of the bispectral involution on an important class of Darboux transformations (which we call monomial) of Bessel operators. As a particular case, we obtain such formulae for all second order bispectral operators found in [DG]. The class of monomial Darboux transformations has also other remarkable properties, e.g. they are connected to representation theory of W1+∞ –algebra. We do not touch this matter here for lack of space. The interested reader can learn about it in [BHY4]. A natural question is if the operators found in this paper form the entire class of bispectral operators. The answer is negative as recently shown in [BHY5]. Finally, for the reader’s convenience we now give a brief description of the organization of the paper. Section 1 is intended only for reference. It reviews results connected with Sato’s theory, which we need for the treatment of the bispectral problem. Besides the general notions (see e.g. [S, DJKM, SW]) we recall the involutions, introduced by G. Wilson [W1] and in particular, the bispectral involution. In Sect. 2 we introduce our (N ) of polynomial Darboux transformations of Bessel planes. We give two manifolds GrB equivalent definitions (Definition 2.5 and the one provided by the statement of Theorem 2.7). Section 3 contains our main results – Theorems 0.1 and 0.2. for the Bessel case. Section 4 deals with the analogs of Sect. 2 and 3 for the Airy case (although in different order). The last Sect. 5 is devoted to explicit examples of bispectral operators, which have been studied in other papers [DG, W2], as well as new families (which we have not seen elsewhere). The emphasis in Sect. 5 is rather on the simple algorithmic way of constructing bispectral operators (wave functions, etc.) than on the novelty of the examples. For readers who wish to see the main results as soon as possible we propose another plan of reading the paper. They can start with Sect. 2 and read it up to the statement of Theorem 2.7, returning to Sect. 1 for reference when needed. Then skipping the (technical) proof of Theorem 2.7, they can go Sect. 3. After that, taking for granted the proof of Theorem 3.2, they can look at the examples of bispectral operators, originating from Bessel ones in Sect. 5. Thus they will have a complete picture of the results in the Bessel case, and having this experience, they can easily go through the Airy case. More detailed information about the material included in each section can be found in its beginning. The present paper is a part of our project on the bispectral problem [BHY2]–[BHY5]. The main results contained here were announced at the conference of Geometry and Mathematical Physics, Zlatograd 95 (see [BHY1]).

336


After this paper was written, we got a paper [KR] where some of the results about the Airy case were obtained independently.

1. Preliminaries In this section we have collected results about Sato’s theory, relevant to the bispectral problem. For the reader’s convenience we have divided the section into 4 subsections, whose titles, hopefully, give an idea of their content. The reader, who is acquainted with Sato’s theory may even skip this section and return to it for reference when needed. More detailed account of the material of the subsections can be found in their beginnings. 1.1. Sato’s Grassmannian and KP–hierarchy. We shall recall some facts and notation from Sato’s theory of KP-hierarchy needed in the paper. The survey below cannot be used as a systematic study. There are several complete texts on Sato’s theory, starting with the original papers of M. Sato and his collaborators [S, DJKM] (see also [SW, vM]). Consider the space of formal series o nX ak vk ak = 0 for k 0 . V= k∈Z

Sato’s Grassmannian Gr [S, DJKM, SW] consists of all subspaces (planes) W ⊂ V which have an admissible basis X wik vi , k = 0, 1, 2, . . . . wk = vk + i N this implies that β 0 ⊂ α and therefore there exists a Bessel operator Lα0 such that Lα = Lα0 Lβ 0 and Lα0 Lβ 0 = Lβ 0 Lα0 . Repeating the same argument with α0 , we obtain that there exists Lα satisfying (2.13) with M < N . But then (2.13) is equivalent to Vβ 0 = Vα . By Proposition 1.5 r = rankAβ = rankAβ 0 = rankAα divides M and N . If Vα = Vβ 0 = C[z r ] this finishes the proof. Otherwise we can repeat the above argument with Vα instead of Vβ 0 . Now we come to the main purpose of this section: the definition of manifolds of Darboux transformations, which will give solutions to the bispectral problem. To get some insight we shall consider, following Wilson [W1], the geometrical meaning of Darboux transformations, provided by the so-called conditions C. Proposition 2.4 implies that for generic β ∈ CN (1.27, 1.28) hold with V = Vβ and KerP is a subspace of Kerh(Lβ ). Each element f of KerP corresponds to a condition c (a linear functional on Vβ ), such that f (x) = hc, 9β (x, z)i,

(2.14)

c acts on the variable z. These linear functionals form an n-dimensional linear space C (space of conditions) where n = ordP . In this terminology the definition of Darboux transformation can be reformulated as o 1 n v ∈ Vβ hc, vi = 0 for all c ∈ C W = g(z) (see [W1, BHY3]). Following Wilson [W1], we call the condition c supported at λ iff it is of the form (cf. Lemma 2.1 (iv)) X ak ∂zk |z=λ (2.15) c= k

(the sum is over k ∈ Z≥0 and only a finite number of ak 6= 0). For Bessel wave functions this definition does not make sense when λ = 0 (since 9β (x, z) has a singularity at z = 0 for N > 1). In this case we say that c is supported at z = 0 iff it is of the form (cf. Lemma 2.1 (ii, iii)) XX bαj xα (ln x)j . hc, 9β (x, z)i = α

SN

j

The sums are over α ∈ i=1 {βi + N Z≥0 } and 0 ≤ j ≤ mult(α) − 1, where mult(α) is the multiplicity of α in the above union (only a finite number of bαj 6= 0). The space of conditions C is called homogeneous iff it has a basis of homogeneous conditions c (i.e. the support of c is a point).

346


It is easy to see that if C is homogeneous then the spectral curve SpecAW is rational and unicursal [W1] (i.e. its singularities can be only cusps) – the condition c supported at λ “makes” a cusp at λ. For rank one algebras rationality and unicursality of SpecAW are necessary and sufficient for bispectrality [W1]. For rank N > 1 another necessary condition is that SpecAW be ZN -invariant, i.e. AW ⊂ C[z N ].

(2.16)

When W is a Darboux transformation of a Bessel plane Vβ , with generic β ∈ CN , this condition is satisfied because of Propositions 2.4, 1.5. It is natural to demand that the space of conditions C (or equivalently KerP ) also be ZN -invariant. The ZN -invariance of KerP simply means that f (x) ∈ KerP ⇒ f (εx) ∈ KerP,

ε = e2πi/N .

(2.17)

It is easy to see that C is homogeneous and ZN -invariant iff KerP has a basis which is a union of: (i) Several groups of elements supported at 0 of the form: ∂yl

k0 mult(βX i +kN )−1 X k=0

bkj xβi +kN y j

j=0

, y=ln x

0 ≤ l ≤ j0 ,

(2.18)

where j0 = max{j|bkj 6= 0 for some k}; (ii) Several groups of elements supported at the points εi λ (0 ≤ i ≤ N − 1, λ 6= 0) of the form: k0 X ak εki ∂zk 9β (x, z)|z=εi λ , 0 ≤ i ≤ N − 1. (2.19) k=0

Instead of (2.19) we can also take k0 X

ak Dzk 9β (x, z)|z=εi λ ,

0 ≤ i ≤ N − 1.

(2.20)

k=0

Denote by n0 the number of conditions c supported at 0 (i.e. the number of elements of the form (2.18) in the above basis of KerP ). For 1 ≤ j ≤ r denote by nj the number of conditions c supported at each of the points εi λj , 0 ≤ i ≤ N − 1 (i.e. the number of groups of elements of the form (2.19) with λ = λj ). We have at last arrived at our fundamental definition. Definition 2.5. We say that the wave function 9W (x, z) is a polynomial Darboux transformation of the Bessel wave function 9β (x, z), β ∈ CN , iff (1.21) holds (for V = Vβ ) with P (x, ∂x ) and g(z) satisfying: (i) The corresponding space of conditions C is homogeneous and ZN -invariant, or equivalently KerP has a basis of the form (2.18, 2.19). (ii) The polynomial g(z) is given by n1 nr · · · z N − λN , (2.21) g(z) = z n0 z N − λN 1 r where nj are the numbers defined above. (N ) = set of planes W satisfying (i), (ii) by GrB (β) and put GrB S We denote the N Gr (β), β ∈ C -generic. B β


347

We point out that the form (2.21) of g(z) was introduced for N = 1 by Wilson [W1]. Qr QN −1 (Note that g(z) = z n0 j=1 i=0 (z−εi λj )nj .) We make this normalization in order that 9bW (x, z) = 9W (z, x) be a wave function; for the bispectral problem it is inessential. Definition 2.6. We say that the polynomial Darboux transformation 9W (x, z) of 9β (x, z) is monomial iff g(z) = z n0 (i.e. iff all conditions c are supported at 0). Denote the set of the corresponding planes S (N ) N W by GrM B (β) and put GrM B = β GrM B (β), β ∈ C -generic. The next theorem provides another equivalent definition of GrB (β) and is used essentially in the proof of the bispectrality in the next section. Theorem 2.7. The wave function 9W (x, z) is a polynomial Darboux transformation of the Bessel wave function 9β (x, z), β ∈ CN , iff (1.21, 1.22, 1.27, 1.28) hold (for V = Vβ ) and (i) The operator P has the form P (x, ∂x ) = x−n

n X

pk (xN )(x∂x )k ,

(2.22)

k=0

where pk are rational functions, pn ≡ 1. (ii) There exists the formal limit lim e−xz 9W (x, z) = 1.

(2.23)

x→∞

The proof will be split into three lemmas. Before giving it we shall make a few comments. The rationality of P is always necessary for bispectrality [DG, W1], (2.22) also imposes the ZN -invariance. The condition (2.23) is necessary in order that 9bW (x, z) = 9W (z, x) be a wave function. The limit in (2.23) is formal in the sense that it is taken in the coefficient at any power of z in the formal expansion (1.1) separately, i.e. lim aj (x) = 0

x→∞

for all j ≥ 1.

(2.24)

Our first lemma is similar to Proposition 5.1 ((i) ⇒ (ii)) from [W1]. Lemma 2.8. If P has rational coefficients and is ZN -invariant (see (2.22)) then the conditions C are homogeneous and ZN -invariant (see (2.18, 2.19)) Proof. If KerP = span{f0 , . . . , fn−1 }, the second coefficient of P is −∂x log Wr(f0 , . . . , fn−1 ) and is rational. Lemma 2.1 implies that Wr(f0 , . . . , fn−1 ) is of the form xα eλx × (Laurent series in x−1 ). In particular, each element of KerP is a sum of terms of the form eλx × (Laurent series in x−1 )

or

xα (ln x)k .

We order the (finite) set of all such eλx and xα (ln x)k occurring in KerP . The highest term in Wr(fi ) is just the Wronskian of the highest terms of the fi . If it vanishes then the

348


highest terms of the fi are linearly dependent, so by a linear combination we can obtain a new basis with lower highest terms. So we can suppose that the highest term of Wr(fi ) is non-zero. Repeating the same argument with the lowest term, we shall finally obtain a basis whose elements consist of only one term, i.e. are homogeneous (cf. [W1]). Because the coefficients of P are rational, (1.13) implies that it does not matter which branch of the functions xα (ln x)k in KerP we take for x ∈ C. Let j0 X

fj (x)(ln x)j ∈ KerP

j=0

P

P with fj (x) = α bαj xα . Then fj (x)(ln x + 2lπi)j ∈ KerP for arbitrary l ∈ Z and also for l ∈ C since it is polynomial in l. Taking the derivative with respect to l we obtain that j0 X fj (x)j(ln x)j−1 j=0

also belongs to KerP . P On the other hand the ZN -invariance of P (see (2.17)) implies fj (εx)(ln x + 2πi/N )j ∈ KerP and j0 X fj (εx)(ln x)j ∈ KerP j=0

for ε = e2πi/N . Now it is obvious that KerP has a basis of the form (2.18, 2.19).

Lemma 2.9. If KerP has a basis of the form (2.18, 2.19) then P has rational coefficients and is ZN -invariant (see (2.22)). Proof. Consider first the case when the basis of KerP is X ak ∂zk 9β (εi x, z)|z=λ , 0 ≤ i ≤ N − 1, λ 6= 0. fi (x) = k

n We shall show that det ∂x j fi (x) 0≤i,j≤N −1 is a rational function of x for arbitrary nj ∈ Z≥0 . Using (1.37, 1.38) we can express all derivatives of 9β (x, z) (both with respect to z and x) only by ∂xk 9β (x, z), 0 ≤ k ≤ N − 1, to obtain ∂xnj fi (x) =

N −1 X

αkj (x, λ)∂xk 9β (εi x, λ)

(2.25)

k=0

with rational coefficients αkj . Therefore det ∂xnj fi (x) = det αkj (x, λ) det ∂xk 9β (εi x, λ) . But det ∂xk 9β (εi x, λ) = const because the second coefficient of Lβ −λ is 0. If the basis f0 , . . . , fmN −1 of KerP contains m groups of the type considered above (i.e. (2.19)) we can represent the matrix ∂xnj fi (x) 0≤i,j≤mN −1 , nj ∈ Z≥0 ,


in the block-diagonal form



W1



0

  0 

349

..  .  

W2 ...

Wm where each block Ws has the form already considered above. This can be achieved by columns and rows operations, using the representation (2.25). If in addition there are some groups of elements of the form (2.18), we kill the logarithms by columns operations and then cancel the powers xβi from the numerator and the denominator of (1.13). Lemma 2.10. If C is homogeneous and h, g are as in (2.1), (2.21), then (2.23) is satisfied. Conversely, (2.23) implies (2.21). Proof. The second part of the lemma is an obvious consequence of the first one. For a basis {8i (x)}0≤i≤dN −1 of Kerh(Lβ ) (d = deg h) we consider the basis of KerP , dN −1 X fk (x) = aki 8i (x), 0 ≤ k ≤ n − 1. (2.26) i=0

Formulae (1.21, 1.13) imply Wr f0 (x), . . . , fn−1 (x), 9β (x, z) 9W (x, z) = g(z)Wr f0 (x), . . . , fn−1 (x) P det AI Wr 8I (x) 9I (x, z) P = . det AI Wr 8I (x)

(2.27) (2.28)

The sum is taken over all n-element subsets I = {i0 < i1 < . . . < in−1 } ⊂ {0, 1, . . . , dN − 1}, and here and further we use the following notation: A is the matrix from (2.26) and A I = (ak,il )0≤k, l≤n−1 is the corresponding minor of A, 8I (x) = 8i0 (x), . . . , 8in−1 (x) is the corresponding subset of the basis {8i (x)} of Kerh(Lβ ) and Wr 8I (x), 9β (x, z) 9I (x, z) = (2.29) g(z)Wr 8I (x) is a Darboux transformation of 9β (x, z) with a basis of KerP fk = 8ik . Using (2.28) it is sufficient to prove (2.23) for 9I (x, z), hence we can take KerP consisting of functions fi (x) = ∂zki 9β (x, z)|z=λi , αi

li

fi (x) = x (ln x) ,

0 ≤ i ≤ p − 1,

p ≤ i ≤ n − 1.

(2.30) (2.31)

We shall consider the case when λi 6= λj for i 6= j. The general case can be reduced to this by taking a limit. In the formula (2.27) we expand the determinants in the last n − p columns (using the Laplace rule):

350


Wr(f, 9β ) = Wr(f ) =

X X

± det ∂xjs fi (x), ∂xjs 9β (x, z)

± det ∂xjs fi (x)

0≤s,i≤p−1

0≤s≤p 0≤i≤p−1

· det ∂xjs fi (x)

. det ∂xjs fi (x)

p≤s,i≤n−1

p+1≤s≤n p≤i≤n−1

,

;

(2.32) (2.33)

where the sums are over the permutations (j0 , . . . , jn ) (resp. (j0 , . . . , jn−1 )) of (0, . . . , n) (resp. (0, . . . , n−1)) such that j0 < . . . < jp and jp+1 < . . . < jn (resp. j0 < . . . < jp−1 and jp < . . . < jn−1 ). We extract the terms with the highest power of x in the numerator and in the denominator of (2.27). Obviously, Pn−1 P α − js s RJ (ln x) (2.34) det ∂xjs fi (x) p≤i≤n−1 = const · x i=p i for some polynomials RJ (ln x) (J is the permutation (js )). On the other hand for 0 ≤ i≤p−1 (2.35) ∂xjs fi (x) = ∂xjs ∂zki 9β (x, z)|z=λi = xki eλi x λji s + O(x−1 ) and

∂xjs 9β (x, z) = exz z js + O(x−1 ) .

(2.36)

Now it is easy to see that the leading terms are obtained for the permutations (n − p, n − p + 1, . . . , n, 0, 1, . . . , n − p − 1), respectively

(n − p, n − p + 1, . . . , n − 1, 0, 1, . . . , n − p − 1).

Substituting (2.34, 2.35, 2.36) in (2.32, 2.33) and canceling the determinant (2.34) for J = (0, 1, . . . , n − p − 1), we derive that lim e−xz P (x, ∂x )9β (x, z)

x→∞

is a fraction of two van der Monde determinants and therefore is equal to g(z).

3. Bispectrality of Polynomial Darboux Transformations In this section we prove the main result of the paper, Theorem 3.3, claiming that polynomial Darboux transformations (see Definition 2.5), performed on Bessel operators, produce bispectral operators. On one hand Theorem 3.3 is an almost obvious consequence of Theorem 3.2 in which we prove that the bispectral involution is well-defined on the submanifolds GrB (β) and maps them into themselves. The importance of Theorem 3.2 is not only to provide a proof of our main result (Theorem 3.3) but also to enlighten the bispectral involution. Its proof uses only the definition of polynomial Darboux transformation from Theorem 2.7 (i.e. it does not use Definition 2.5). On the other hand, the proof is completely constructive and together with Definition 2.5 it provides an algorithmic procedure to compute bispectral wave functions and the corresponding bispectral operators. This procedure is described at the end of the section. Many examples computed by making use of it are presented in Sect. 5. Let Vβ be a Bessel plane for a generic β ∈ CN (i.e. Vβ is not a Darboux transforma0 tion of Vβ 0 with β 0 ∈ CN , N 0 < N ). In this section W will be a polynomial Darboux transformation of Vβ , i.e.


351

W ∈ GrB (β). We use the notation from (1.21, 1.22) with V = Vβ . In the next proposition we show that the manifold of polynomial Darboux transformations is preserved by the involutions a and s (introduced in Subsect. 1.4). Proposition 3.1. If W ∈ GrB (β), then (i) sW ∈ GrB (β); (ii) aW ∈ GrB (a(β)), where a(β) = (N − 1)δ − β, δ = (1, 1, . . . , 1). Proof. First recall that (Proposition 1.8) sVβ = Vβ and aVβ = Va(β) . We shall study the action of the involutions on 9W (x, z) and check that the conditions of Theorem 2.7 are satisfied. (i) is trivial because 9sW (x, z) = 9W (−x, −z) =

1 P (−x, −∂x )9β (x, z). g(−z)

To prove (ii) we note that the ZN -homogeneity of P (see (2.22)) is equivalent to P (εx, ε−1 ∂x ) = ε−n P (x, ∂x ),

(3.1)

for n = ordP , ε = e2πi/N . It follows from (1.28) that the operator Q (from (1.22)) has the same property and also that Q = h(Lβ )P −1 has rational coefficients. Proposition 1.7 implies that 9aW is a Darboux transformation of 9a(β) with 9aW (x, z) =

1 Q∗ (x, ∂x )9a(β) (x, z). g(z) ˇ

Obviously, Q∗ also satisfies (3.1). To check (2.23), we set KW = 1 +

∞ X

aj (x)∂x−j

j=1

(see (1.1, 1.2)). Recalling that ∗ =1+ KW

∞ X

(−∂x )−j aj (x)

j=1

and KaW = 1 +

∞ X

∗ −1 bj (x)∂x−j = (KW ) ,

j=1

we compute the coefficients bj (x) inductively and find that all of them are polynomials in aj (x) and their derivatives. But by Theorem 2.7 all aj (x) are rational functions of x and limx→∞ aj (x) = 0, which leads to limx→∞ bj (x) = 0 for all j ≥ 1. This proves (2.23) for aW (cf. (2.24)). (N ) . The central Proposition 3.1 shows that the involutions a and s preserve GrB result of the present paper is that the bispectral involution b has the same property. It (N ) give solutions to the immediately implies that wave functions 9W with W ∈ GrB bispectral problem. Our next theorem addresses this issue.

352


Theorem 3.2. If W ∈ GrB (β) then bW exists and bW ∈ GrB (β). Proof. Before proving the existence of bW , we shall find an analog of (1.21) for 9bW (x, z) = 9W (z, x), i.e. we shall show the existence of an operator Pb (x, ∂x ) and a polynomial gb (z) such that 9bW (x, z) =

1 Pb (x, ∂x )9β (x, z). gb (z)

(3.2)

From (2.22) it follows that the operator P can be written as X 1 P (x, ∂x ) = n pk (xN )(x∂x )k , x pn (xN ) n

(3.3)

k=0

where now pk (xN ) are polynomials. Use (1.37–1.39) to obtain X 1 pk (xN )(x∂x )k 9β (x, z) xn pn (xN )g(z) X 1 = n (z∂z )k pk Lβ (z, ∂z ) 9β (x, z). N x pn (x )g(z)

9W (x, z) =

This implies (3.2) with 1 X (x∂x )k pk Lβ (x, ∂x ) , g(x) n

Pb (x, ∂x ) =

(3.4)

k=0

gb (z) = z n pn (z N ).

(3.5)

Now we can prove the existence of bW , i.e. that 9bW (x, z) is a wave function (see (1.1)). Indeed, using (3.2) we can differentiate the formal expansion (1.34) of 9β (x, z) = 9β (xz); expanding gb−1 (z) at z = ∞ we obtain X bk (x)z −k 9bW (x, z) = exz k≥k0

for some finite k0 . Note that the coefficients bk (x) are rational. On the other hand X aj (z)x−j 9bW (x, z) = 9W (z, x) = exz j≥0

with rational aj (z) such that (see (2.24)) lim aj (z) = 0, j ≥ 1;

z→∞

a0 (z) ≡ 1.

These two (formal) expansions of 9bW (x, z) are connected by X bkj z −k , aj (z) = k≥k0

where bk (x) =

X j

bkj x−j ,

bkj = 0 for j < 0.

(3.6)


Now (3.6) implies This shows that

353

bkj = 0 for k < 0 , j ≥ 1. X 9bW (x, z) = exz 1 + bk (x)z −k k≥1

is a wave function. It is clear that it satisfies (2.23) as well. To show an analog of (1.22), i.e. that 1 Qb (x, ∂x )9bW (x, z) fb (z)

9β (x, z) =

(3.7)

with an operator Qb and a polynomial fb , we shall use the above proven identity (3.2) with asW instead of W . It follows from Proposition 1.7 that 9asW (x, z) =

1 Q∗ (−x, −∂x )9a(β) (x, z). f (z)

(3.8)

Proposition 3.1 and Theorem 2.7 (i) allow us to present Q∗ (−x, −∂x ) in the form Q∗ (−x, −∂x ) =

1

m X

xm q m (xN )

s=0

q s (xN )(x∂x )s

(3.9)

with polynomials q s (xN ). Then X 1 (x∂x )s q s La(β) (x, ∂x ) 9a(β) (x, z). m N f (x)z q m (z ) m

9basW (x, z) =

(3.10)

s=0

The identity ab = bas [W1] and Proposition 1.7 now lead to (3.7) with ÿ !∗ m 1 X s (x∂x ) q s La(β) (x, ∂x ) Qb (x, ∂x ) = f (x) s=0

=

m X s=0

and

1 q s (−1)N Lβ (x, ∂x ) (−x∂x − 1)s f (x)

fb (z) = (−z)m q m (−z)N .

(3.11)

(3.12)

From (2.21) and (3.4) it is obvious that Pb is ZN -homogeneous. This completes the proof of Theorem 3.2. An immediate corollary is the following result, which we state as a theorem because of its fundamental character. (N ) then the wave function 9W (x, z) solves the bispectral Theorem 3.3. If W ∈ GrB problem, i.e. there exist operators L(x, ∂x ) and 3(z, ∂z ) such that

L(x, ∂x )9W (x, z) = h(z N )9W (x, z), 3(z, ∂z )9W (x, z) = Θ(xN )9W (x, z),

(3.13) (3.14)

rankAW = rankAbW = N.

(3.15)

Moreover,

354


Proof. Equations (3.13, 3.14) follow from (1.21, 1.22, 3.2, 3.7) if we set L(x, ∂x ) = P (x, ∂x )Q(x, ∂x ), h(z N ) = f (z)g(z); 3(z, ∂z ) = Pb (z, ∂z )Qb (z, ∂z ), Θ(xN ) = fb (x)gb (x).

(3.16) (3.17)

Equation (3.15) follows from Propositions 1.5 (i) and 2.4.

Example 3.4. All bispectral algebras of rank 1 are polynomial Darboux transformations of the plane H+ = {z k }k≥0 (see [W1]). This corresponds to the N = 1 Bessel with β = (0),

L(0) = ∂x ,

V(0) = H+ = {z k }k≥0 ,

ψ(0) (x, z) = exz .

Every linear functional on H+ is a linear combination of e(k, λ) = ∂zk |z=λ and h L(0) = h(∂x ) is an operator with constant coefficients. The “adelic Grassman(1) ). In our nian” Grad , introduced by Wilson [W1], coincides with GrB ((0)) (= GrB terminology the result of [W1] can be reformulated as follows. All bispectral operators belonging to rank one bispectral algebras are polynomial Darboux transformations of operators with constant coefficients. Remark 3.5. The eigenfunction 9W (x, z) from Eq. (1.21) is a formal series. Let 8β (x, z) = 8β (xz), where 8β (z) is the Meijer’s G-function (1.35) (or any convergent solution of (1.33) in arbitrary domain) and set 8W (x, z) =

1 P (x, ∂x )8β (x, z). g(z)

(3.18)

8W (x, z) =

1 Pb (z, ∂z )8β (x, z) gb (x)

(3.19)

Then

because of (1.33) and x∂x 8β (x, z) = z∂z 8β (x, z). The equations QP = h(Lβ ) and Qb Pb = Θ(Lβ ) imply 1 Q(x, ∂x )8β (x, z), f (z) 1 8β (x, z) = Qb (z, ∂z )8W (x, z). fb (x) 8β (x, z) =

(3.20) (3.21)

So, we proved that 8W (x, z) is a convergent bispectral eigenfunction of the same operators L(x, ∂x ) and 3(z, ∂z ) as 9W (x, z). The involutions a, s and b can be defined on the manifold of “convergent” polynomial Darboux transformations (3.18) by Eqs. (1.43, 1.44, 1.49) in which 9 is replaced by 8 and they preserve it (Proposition 1.7 (i) now becomes a definition). The validity of the equation ab = bas in the “convergent” case is a consequence of that in the “formal” one (see the proof of Theorem 3.2). The rationality of the coefficients of the operator P (x, ∂x ) implies that its kernel has one and the same form (see Eqs. (2.18, 2.19)) in 9- and in 8-bases.


355

It is not difficult to provide an explicit algorithm for producing bispectral pairs L(x, ∂x ), 3(z, ∂z ). Although obvious we have collected the steps of this algorithm as they are scattered in the present and the previous sections. Step 1. Choose an arbitrary set of conditions based in some points λ0 = 0, λ1 , . . . , λr of the form (2.18, 2.19), i.e. a basis of KerP . The proof of Lemma 2.9 provides an explicit computation of the coefficients of P in terms of the coefficients ak , bkj in KerP . The polynomial g(z) is given by Definition 2.5 (ii). d j Qr with high enough powers d0 , . . . , dr Step 2. Take h(z N ) = z d0 N j=1 z N − λN j such that KerP ⊂ Kerh(Lβ ) (cf. Lemma 2.1). The minimal such dj ’s can be computed as follows. (i) For a condition, supported at 0, of the form (2.18) set j(k) = max{j|bkj 6= 0}, 0 ≤ k ≤ k0 . Let βi + kN = βis + ps N for 0 ≤ s ≤ mult(βi + kN ) − 1 with 0 ≤ p0 ≤ . . . ≤ pmult(βi +kN )−1 and is 6= it for s 6= t. Then set d0 = 1 + max pj(k) , the maximum is over all k and all conditions of the form (2.18). (ii) For a condition, supported at λj 6= 0, of the form (2.19) let k0 = max{k|ak 6= 0}. Then set dj = 1 + max k0 , the maximum is over all conditions of the form (2.19) supported at λj . Then put f (z) = h(z N )/g(z). Step 3. Find the coefficients of the operator Q(x, ∂x ) recursively out of the equation Q(x, ∂x )P (x, ∂x ) = h(Lβ (x, ∂x )). Then L(x, ∂x ) = P (x, ∂x )Q(x, ∂x ). A lower order operator L can be constructed using Proposition 1.5, i.e. find u(Lβ ) such that KerP is invariant under u(Lβ ) and then L out of the equation LP = P u(Lβ ). Step 4. Compute by (3.4) Pb (x, ∂x ) and by (3.5) gb (z). Also (3.11) and (3.12) give Qb (x, ∂x ) and fb (z). All expressions are explicit in terms of the coefficients of the operators P and Q. Then 3(z, ∂z ) = Pb (z, ∂z )Qb (z, ∂z ) and Θ(x) = fb (x)gb (x). 4. Polynomial Darboux Transformations of Airy Planes This section contains analogs of the results from Sects. 2 and 3 but here the building blocks are (generalized) Airy operators (see [KS, Dij]) instead of Bessel ones. There is a minor difference in the organization of the present section compared to that of Sects. 2 and 3. Here we give the definition of polynomial Darboux transformations on Airy wave functions (see Definitions 4.2, 4.3) in the spirit of the one provided by Theorem 2.7. Then we prove our main result Theorem 4.5 (which is an analog of Theorem 3.2). As in Sect. 2, it automatically implies bispectrality of the polynomial Darboux transformations. At the end, in Proposition 4.9 we show that Definition 4.3 is equivalent to a second one (analog of Definition 2.5) in terms of conditions on Airy planes. This is again important for algorithmic computations, some of which are presented in the next section. First we recall the definition of (generalized higher) Airy functions. For α = (α0 , α2 , α3 , . . . , αN −1 ) ∈ CN −1 , α0 6= 0, consider the Airy operator

356


Lα (x, ∂x ) =

∂xN

− α0 x +

N −1 X

αi ∂xN −i ≡ Pα0 (∂x ) − α0 x,

(4.1)

i=2

where α0 = (α2 , α3 , . . . , αN −1 ). The Airy equation is Lα (x, ∂x )8(x) = 0, i.e.

Pα0 (∂x )8(x) = α0 x8(x).

(4.2)

Example 4.1. When α0 = 1, α0 = 0, Eq. (4.2) becomes the classical higher Airy equation (cf. [KS]) (4.3) ∂xN 8(x) = x8(x). In every sector S with a center at x = ∞ and an angle less than N π/(N + 1), it has a solution with an asymptotics of the form (see e.g. [Wa]) 8(x) ∼ x−

N −1 2N

N

e N +1 x

N +1 N

1+

∞ X

ai x−i/N ,

|x| → ∞, x ∈ S.

(4.4)

i=1

Similarly, in each sector S as in Example 4.1 Eq. (4.2) has a solution with an asymptotics of the form ∞ X 1/N (4.5) ai x−i/N , |x| → ∞, x ∈ S 8(x) ∼ 9α (x) := xd/N eQ(x ) 1 + i=1

for some d ∈ C and a polynomial Q(x) of degree N + 1 with leading coefficient µ0 NN+1 xN +1 , where α0 = µN 0 . The solution 8 is by no means unique, but d, Q and all ai are uniquely determined and do not depend on S. In the sequel we shall deal only with 9α , which is a formal solution of Eq. (4.2). Definition 4.2. For each α ∈ CN −1 we call an Airy wave function the following function −1

ψα (x, z) := µd0 z −d e−Q(µ0 where

z)

9α (x, z),

(4.6)

9α (x, z) := 9α (α0−1 z N + x).

It is easy to see that ψα is indeed a wave function if we expand 9α (α0−1 z N + x) at x = 0: X −i/N −1 N −i/N −i−kN k (α0 z + x) = x (4.7) (µ−1 0 z) k k≥0

(we shall always use µ0 as an N th root of α0 ). The plane in Sato’s Grassmannian corresponding to ψα (x, z) will be called an Airy plane and will be denoted by Vα . Obviously, 9α (x, z) solves the bispectral problem Lα (x, ∂x )9α (x, z) = z N 9α (x, z), Lα (α0−1 z N , ∂α−1 zN )9α (x, z) = α0 x9α (x, z),

(4.8) (4.9)

∂x 9α (x, z) = ∂α−1 zN 9α (x, z).

(4.10)

0

because

0


357

It is clear that ψα satisfies (4.8) and analogs of (4.9, 4.10) obtained by conjugating −1 by z −d e−Q(µ0 z) . (Up to this conjugation 9α and ψα give one and the same solution to the bispectral problem.) We shall define polynomial Darboux transformations of Airy planes as in the Bessel case (see Definition 2.5 and Theorem 2.7). Before that we shall define a bispectral involution b1 on them. Note that the involution b from [W1] (see Subsect. 1.4) is not well defined on Vα (i.e. ψα (z, x) is not a wave function). The properties of b we would like b1 to have, are: 1) it has to interchange the roles of x and z; 2) it has to preserve Airy planes. Therefore we define b1 9α (x, z) := 9α (x, z) = 9α (α0−1 z N , µ0 x1/N ),

(4.11)

or equivalently, b1 ψα (x, z) := ψα (x, z) = µd0 xd/N z −d eQ(µ0 x

1/N

)−Q(µ−1 z) 0

ψα (α0−1 z N , µ0 x1/N ). (4.12)

For a Darboux transformation W of Vα we define ψb1 W and 9b1 W in a similar way. (We still do not know whether b1 W ∈ Gr, the notation ψb1 W is still formal.) Definition 4.3. A Darboux transformation W of an Airy plane Vα is called polynomial iff (in the notation of Definition 1.4) (i) the operator P has rational coefficients; (ii) g(z) = g1 (z N ), g1 ∈ C[z]; (iii) lim e−xz ψb1 W (x, z) = 1. z→∞

(The limit is formal and has the same meaning as in (2.23).) S (N ) Denote the set of all such W ∈ Gr by GrA (α) and put GrA = α∈CN −1 GrA (α). Remark 4.4. The parts (i) and (ii) of the above definition remain the same if we substitute ψα and ψW by 9α and 9W , where −1

ψW (x, z) := µd0 z −d e−Q(µ0

z)

9W (x, z).

(4.13)

The main result of this section is that GrA (α) is preserved by the involution b1 . Theorem 4.5. (i) If W ∈ GrA (α), then ψb1 W (x, z) is a wave function corresponding to a plane b1 W ∈ GrA (α). (ii) For α ∈ CN −1 the spectral algebra AVα is C[Lα ]. An immediate corollary is that the planes W ∈ GrA (α) give solutions to the bispectral problem of rankN : rankAW = rankAb1 W = N. The proof of Theorem 4.5 is completely parallel to that of Theorem 3.2. We shall be very brief, indicating only the major differences and the most important steps. We start with a lemma illuminating the purpose of the constraints (i) and (ii) in Definition 4.3 (cf. (3.2)).

358


Lemma 4.6. If W ∈ GrA (α), then 9b1 W (x, z) =

1 Pb (x, ∂x )9α (x, z), gb (z)

(4.14)

Pb is with rational coefficients and gb is polynomial in z N . Proof. We compute 9b1 W (x, z) = 9W (α0−1 z N , µ0 x1/N ) =

P (α0−1 z N , ∂α−1 zN )9α (α0−1 z N , µ0 x1/N ) 0

g(µ0

where if

x1/N )

1 X pk (x)∂xk , pn (x)

=

1 Pb (x, ∂x )9α (x, z), gb (z)

n

P (x, ∂x ) =

g(z) = g1 (z N )

(4.15)

k=0

with polynomials pk and g1 , then (using (4.8, 4.10)) X 1 ∂xk pk (α0−1 Lα (x, ∂x )), Pb (x, ∂x ) = g1 (α0 x)

(4.16)

gb (z) = pn (α0−1 z N ).

(4.17)

n

k=0

The proof that ψb1 W (x, z) is a wave function is the same as in the Bessel case, using the above lemma and the condition (iii) of Definition 4.3. Now the identity ab = bas [W1] is modified in the following way. Introduce the maps p and p−1 as follows 9pW (x, z) := 9W (α0−1 xN , µ0 z 1/N ), 9p−1 W (x, z) := 9W (µ0 x1/N , α0−1 z N ). The notation pW , p−1 W is formal – these are not planes in Gr. But 9b1 W (x, z) = 9bpW (x, z) = 9p−1 bW (x, z) corresponds to the wave function ψb1 W (x, z) and to b1 W ∈ Gr. Multiplying the identity ab = bas on the right by p, we obtain ab1 = b1 a1 ,

where a1 = p−1 asp.

(4.18)

Note that for W ∈ GrA (α) aW, b1 W and hence a1 W are planes in Gr. The next lemma gives the action of the involutions on the Airy planes (the proof is the same as that of Proposition 1.8). Lemma 4.7. (i) sVα = Vs(α) , where s(α) = ((−1)N +1 α0 , α2 ,−α3 , . . . , (−1)N −1 αN −1 ); (ii) aVα = a1 Vα = Va(α) , where a(α) = ((−1)N α0 , α2 , −α3 , . . . , (−1)N −1 αN −1 ). We also need an analog of Proposition 3.1. Lemma 4.8. If W ∈ GrA (α), then aW and a1 W belong to GrA (a(α)).


359

For the proof we need an analog of Proposition 1.7 for a1 . A simple computation shows that if 1 Q(x, ∂x )9W (x, z) 9V (x, z) = f (z) for V, W ∈ GrA (α) and X Q(x, ∂x ) = qk (x)∂xk , then 9a1 W (x, z) = with

1 Q∗1 (x, ∂x )9a1 V (x, z) f (z)

X

1 − N −1 k x (−1)(N −1)k qk ((−1)N x). N The rest of the proof is left to the reader. The proof of part (i) of Theorem 4.5 is completed exactly as in the Bessel case. For part (ii), we note that while the Bessel wave functions are “multiplication invariant”, the Airy ones are “translation invariant”. More precisely, for arbitrary c ∈ C, Q ∗1 =

∂x +

9α (x + c, (z N − α0 c)1/N ) = 9α (α0−1 (z N − α0 c) + x + c) = 9α (α0−1 z N + x) = 9α (x, z) 1−kN P (−α0 c)k ). Let u(z) ∈ Aα , L(x, ∂x ) ∈ Aα (expand (z N − α0 c)1/N = k≥0 1/N k z and L(x, ∂x )9α (x, z) = u(z)9α (x, z) (this is equivalent to Lψα (x, z) = uψα (x, z)). Then L(x + c, ∂x )9α (x, z) = u((z N − α0 c)1/N )9α (x, z) and L(x + c, ∂x ) ∈ Aα , u((z N − α0 c)1/N ) ∈ Aα . But Aα ⊂ C[z], therefore u((z N − α0 c)1/N ) ∈ C[z] for all c and u(z) ∈ C[z N ]. This completes the proof of Theorem 4.5. At the end of this section we note that an equivalent definition of GrA (α) can be given in terms of conditions C (cf. Sect. 2). Using the translation invariance of 9α we can suppose that none of the conditions C is supported at 0. Then we have an analog of Theorem 2.7. Proposition 4.9. The Darboux transformation W of Vα is polynomial iff (i) The space of conditions C is homogeneous and ZN -invariant. Equivalently, KerP has a basis of the form X aki ∂zk ψα (x, εj z)|z=λi fij (x) = k

=

X

aki ε−jk ∂zk ψα (x, z)

k

z=εj λi

,

(4.19)

0 ≤ j ≤ N − 1, 1 ≤ i ≤ r (for some r), λi 6= 0. (ii) The polynomial g(z) has the form (2.21), i.e. n1 N N nr g(z) = (z N − λN 1 ) · · · (z − λr ) ,

where ni is the number of conditions C supported at each of the points εj λi , 0 ≤ j ≤ N − 1.

360


The proof of the “if” part is the same as in the Bessel case and will be omitted. (In fact, most of the proofs in Sect. 2 are valid in a more general situation.) The “only if” part is also similar to the corresponding result in the Bessel case but some more explanation is needed. For fixed λ 6= 0 we shall use representations of the kernel of the operator Lα − λN in three different linear spaces of formal power series. First set y = α0−1 λN + x, (4.20) ϕα (x, λ) = µd0 λ−d e−Q(λ) 9α (y), considered as a formal power series in y −1/N , (where 9α is from (4.5)). The Airy wave function ψα (x, λ) (see (4.6)) is given by the same formula after expanding y −1/N at x = 0 as in (4.7). The other possibility is to expand y −1/N at x = ∞: X −i/N x−i/N −k (α0−1 λN )k . (4.21) (x + α0−1 λN )−i/N = k k≥0

(j) Inserting (4.21) in (4.20), we obtain another formal series χα (x, λ). Denote by ϕ(j) α , ψα , the images of ϕ , ψ , χ under the transformations χ(j) α α α α

y 1/N 7→ εj y 1/N ,

λ 7→ εj λ,

x1/N 7→ εj x1/N ,

(j) respectively (ε = e2πi/N ). Then ψα(j) and χ(j) α are obtained by expanding ϕα and in the N d corresponding spaces of formal series Ker(Lα − λ ) has bases k (j) {∂λk ψα(j) }, {∂λk ϕ(j) α }, {∂λ χα },

0 ≤ k ≤ d − 1, 0 ≤ j ≤ N − 1.

Our observation is that if KerP has a basis X fi (x) = aikj ∂zk ψα(j) (x, z)|z=λ , k,j

then the same formula gives a basis of KerP when ψ’s are substituted by ϕ’s or χ’s and vice versa. Indeed, this follows from (1.13) and the fact that P has rational coefficients. We complete the proof of Proposition 4.9 noting that while P depends rationally on x, 1/N and the same argument as in the Bessel case gives that χ(j) α are formal series in x KerP has a χ-basis of the form (4.19). 5. Explicit Formulae and Examples In this section we have collected several classes of examples. We wanted at least to include all previously known examples (unless by ignorance we miss some of them) – see [DG, W1, Z, G3, LP]. We hope that we have elucidated and unified them. For monomial transformations we derive formulae expressing the operators L and 3, solving the bispectral problem, only in terms of the matrix A and the vector γ (see Proposition 5.1 below). This explicit expression for 3 (though possibly of high order) to the best of our knowledge is new even for N = 2 (see [DG]). In other examples we illustrate the properties of the operator of minimal order from a bispectral algebra: when does its order coincide with the rank of the algebra and when this operator is a Darboux transformation of a power of a Bessel operator. We also point out that the classical Bessel potentials u(x) = cx−2 [DG] can produce new solutions of the bispectral problem for any c.


361

We describe in detail the polynomial Darboux transformations from (Lα − λN )2 , where Lα is an arbitrary Airy or Bessel operator of order N . We do not want simply to show that our procedure of constructing bispectral operators works but to point out that the involutions a and b (b1 in the Airy case) possess some very interesting properties which deserve further study. 5.1. Monomial Darboux transformations of Bessel planes. Let β ∈ CN and W ∈ GrM B (β). We use the notation from (1.21, 1.22) (with V = Vβ ) and from (3.2, 3.7). When the Darboux transformation is monomial g(z) = z n ,

h(z) = z d

(5.1)

for some n, d. We shall consider only the case when there are no logarithms in the basis (2.18) of KerP . The general case can be reduced to this one by taking a limit in all formulae (see Example 5.2 below). Now KerP has a basis of the form

fk (x) =

dN X

0 ≤ k ≤ n − 1,

(5.2)

if aki akj 6= 0, i 6= j,

(5.3)

aki xγi ,

i=1

such that γi − γj ∈ N Z \ 0

where γ = β d is from (2.4). Let A be the matrix (aki ). We shall use multi-index notation for subsets I = {i0 < . . . < in−1 } of {1, . . . , dN } and δI from (2.7). We also put γI = {γi }i∈I , AI = (ak,il )0≤k, l≤n−1 and 1I =

Y

(γir − γis ).

r<s

Let Imin be the subset of {1, . . . , dN } with n elements such that det AImin 6= 0 and P i∈Imin γi be the minimum of all such sums, and set pI =

X i∈I

γi −

X

γi .

i∈Imin

Equation (5.3) implies that these numbers are divisible by N . Finally, for a subset I of {1, . . . , dN } denote by I 0 its complement. In the following proposition we express everything entering (1.21, 1.22, 3.2, 3.7) only in terms of the matrix A and the vector γ. Therefore for each A and β ∈ CN satisfying (5.3) (with γ = β d ) we give an explicit solution to the bispectral problem (cf. (3.16, 3.17)). Proposition 5.1. In the above notation the operators and the polynomials from (3.16, 3.17) are given by the following formulae:

362


(a) g(z) = z n , X −1 X P = det AI 1I xpI det AI 1I xpI LγI . (b) f (z) = z dN −n , X X −1 Q= det AI 1I LγI 0 −nδI 0 xpI det AI 1I xpI . P (c) gb (z) = z n det AI 1I z pI , X Pb = det AI 1I LγI (Lβ )pI /N . P (d) fb (z) = z dN −n det AI 1I z pI , X Qb = det AI 1I (Lβ )pI /N LγI 0 −nδI 0 . Proof. Note that (c) and (d) follow from (a) and (b) (see the proof of Theorem 3.2). To prove (a) we note that Wr f0 (x), . . . , fn−1 (x), 9β (x, z) 9W (x, z) = z n Wr f0 (x), . . . , fn−1 (x) P det AI Wr xγI 9I (x, z) P = . (5.4) det AI Wr xγI The sum is taken over all n-element subsets I = {i0 < i1 < . . . < in−1 } ⊂ {0, 1, . . . , dN − 1}, xγI = {xγi }i∈I and 9I (x, z) are the Bessel wave functions (2.6). Using (2.9) and the simple fact P γ − n(n−1) 2 Wr(xγI ) = 1I x i∈I i , (5.5) we obtain (a). To prove (b) we shall apply the involution a directly on the tau-function τW of the plane W . Recall that [S, SW] P∞ −1 ] tk z k τ t − [z , (5.6) 9W (t, z) = e k=1 τ (t) where [z −1 ] is the vector z −1 , z −2 /2, . . . . The action of a is given by [W1] τaW (t1 , t2 , . . . , tk , . . .) = τW (t1 , −t2 , . . . , (−1)k−1 tk , . . .). We shall need the formulae [BHY3]

P det AI 1I τI (t) τW (t) = P , det AI 1I

and

(5.7)

1 Wr (xγI ) τγ (x), (5.8) 1I where τI (t) is the tau-function corresponding to the wave function 9I (x, z) and τ (x) = τ (x, 0, 0, . . .). Applying a to both sides of (5.7) and using (5.6) and (2.6) we obtain τI (x) =


P 9aW (x, z) = We compute

363

det AI 1I τa(γ+dN δI −nδ) (x)9a(γ+dN δI −nδ) (x, z) P . det AI 1I τa(γ+dN δI −nδ) (x)

a(γ + dN δI − nδ) = a(γ) + dN δI 0 − (dN − n)δ,

(5.9)

(5.10)

which is a Darboux transformation of a(γ). Equations (5.8) and (5.5) imply τI (x) xpI = . τJ (x) xpJ

(5.11)

To apply (5.11) in (5.9) we have to compute pI 0 but for a(γ) instead of γ. It is a simple exercise to see that pI 0 (a(γ)) = pI (γ) ≡ pI . Using this we obtain P 9aW (x, z) = z

−dN +n

det AI 1I xpI L(a(γ))I 0 P 9a(γ) (x, z). det AI 1I xpI

Now Proposition 1.7 gives (b) because (Lβ )∗ = (−1)N La(β) and

for β ∈ CN

a (a(γ))I 0 = γI 0 − nδI 0 .

In the following example we consider the case when there are logarithms in the basis (2.18) of KerP . Example 5.2. Let d = 2, β = (1, 1, 1), γ = β 2 = (1, 1, 1, 4, 4, 4) and KerP has a basis f0 (x) = x4 , f1 (x) = a1 x + 2a2 x4 ln x, f2 (x) = a0 x + a1 x ln x + a2 x4 ln2 x. Using that lnk x = ∂k x |=0 we approximate the above functions with f0 (x, ) = x4 , f1 (x, ) = a1 x1+ + 2a2 −1 (x4+ − x4 ), f2 (x, ) = a0 x1+2 + a1 −1 (x1+2 − x1+ ) + a2 −2 (x4+2 − 2x4+ + x4 ). Consider the Darboux transformation W () of Vβ() , where β() = (1, 1 + , 1 + 2), with a basis of the operator P () consisting of the functions fk (x, ). After changing the basis this corresponds to a matrix (cf. (5.2)) ! ÿ 0 0 0 1 0 0 0 0 . A() = 0 0 a1 2a2 0 (a + a ) a 0 0 0 0

1

2

We apply (5.7) for τW () . To make the limit → 0 we note that the numerator and the denominator depend polynomially on and that (in the notation of (5.7)) both τ{2,3,6}

364


and τ{2,4,5} tend to one and the same Bessel tau-function. So after canceling 3 and setting = 0 we obtain that τW is a linear combination of 3 Bessel tau-functions: τW =

9a21 τ(−2,1,1,4,4,7) + 18a2 (a0 − a1 )τ(−2,−2,1,4,7,7) + 4a22 τ(−2,−2,−2,7,7,7) . 9a21 + 18a2 (a0 − a1 ) + 4a22

(5.12)

As in the proof of Proposition 5.1 from this formula one can compute the operators P , Q, Pb and Qb . It is clear that they also can be obtained by taking the limit → 0 directly in the corresponding expressions for W (). From here to the end of the subsection we shall restrict ourselves to the case when β d = γ has different coordinates. We choose the following basis of KerLdβ (cf. [MZ]) 8(k−1)d+j (x) := µkj xβk +(j−1)N , where µk,1 := 1,

µkj := µk,j−1 ·

N Y

1 ≤ k ≤ N, 1 ≤ j ≤ d,

(5.13)

(βi − βk − (j − 1)N )−1 .

i=1

In this basis the action of Lβ is quite simple: 8(k−1)d+j−1 , Lβ 8(k−1)d+j = 0,

for 2 ≤ j ≤ d for j = 1.

(5.14)

Let a basis of KerP be fk (x) =

dN X

aki 8i (x),

k = 0, . . . , n − 1.

(5.15)

i=1

Example 5.3. Let n = d, βi − βj ∈ N Z for all i, form:  (1) ) . . . t(N t0 0 )  t(1) t(1) . . . t(N  1 0 1  (1) ) A= t(1) t(1) . . . t(N 1 0 2  t2  . .. .. ..  .. . . . (1) (1) (N ) t(1) t . . . t . . . t n−1 n−2 0 n−1

j and the matrix A = (aki ) has the     .   

) t(N 0 ) t(N 1 .. . ) t(N n−2

) t(N 0 .. .

...

(5.16)

) t(N 0

The type of the matrix is tantamount to the identities Lβ f0 = 0, Lβ fk+1 = fk , k = 1, . . . , n − 1. Then KerP is invariant under the action of Lβ and by Proposition 1.5 the operator L = P Lβ P −1 is differential of order N and solves the bispectral problem. For a generic β ∈ CN the spectral algebra has rank N (i.e. it is C[L]). This family can be considered as the most direct generalization of the “even case” of J. J. Duistermaat and F. A. Grünbaum [DG] (see also [MZ]). When N = 2 our example coincides with it but for N > 2 here we present a completely new class of bispectral operators. In connection with the above example we prove the following proposition. Proposition 5.4. Let W ∈ GrB (β) (β ∈ CN –generic) be such that AW contains an operator of order N . Then W is a monomial Darboux transformation of Vβ , i.e. W ∈ GrM B (β) ∩ Gr(N ) .


365

Proof. Proposition 1.5 implies that W ∈ GrB (β) belongs to Gr(N ) iff Lβ (KerP ) ⊂ KerP.

(5.17)

If we suppose that W 6∈ GrM B (β) then KerP would contain some elements of the form (2.20). The action of Lβ on them is easily computed:

Lβ Dλk 9β (x, εi λ) = Dλk Lβ 9β (x, εi λ) = Dλk λN 9β (x, εi λ) = λN (Dλ + N )k 9β (x, εi λ).

Thus the linear space span Dλk 9β (x, εi λ) 0≤k≤m can be identified with the space of polynomials in D of degree ≤ m, with the action of Lβ corresponding to P (D) 7→ λN P (D + N ). It is clear that all Lβ -invariant subspaces are of the form span Dλk 9β (x, εi λ) 0≤k≤k0 for some k0 . The corresponding polynomial Darboux transformation is trivial in the sense that it leads again to the same plane Vβ (the operator P = (Lβ − λ)k0 commutes with Lβ ). Therefore W ∈ GrM B (β). In the same manner as in Example 5.3, one can build for arbitrary k rank N bispectral algebras with the lowest order of the operators equal to kN . It is clear that when the matrix A is not of the form (5.16) (or a direct sum of such matrices) then KerP (given by (5.15)) is not invariant under the action of Lβ . Proposition 1.5 implies that in this case the spectral algebra does not contain operators of order N . The following example is one of the simplest of this type. Example 5.5. Let N = 2, β = (β1 , β2 ), β1 + β2 = 1, d = n = 2. We take KerP with a basis (5.15) where 1 a 0 0 A= 0 0 1 b for some a, b ∈ C, i.e. a xβ1 +2 , 2(β1 − β2 + 2) b xβ2 +2 . + 2(β2 − β1 + 2)

f0 (x) = 81 (x) + a82 (x) = xβ1 + f1 (x) = 83 (x) + b84 (x) = xβ2 Then Lβ f0 (x) = axβ1 ,

Lβ f2 (x) = bxβ2

and KerP is not invariant under Lβ when ab 6= 0. The spectral algebra AW = P L2β C[Lβ ]P −1 consists of operators of orders 4, 6, 8, 10, . . .. This example is also interesting for the fact that it does not require β1 − β2 ∈ 2Z. The generalization for arbitrary N is obvious. Another example illustrating Proposition 1.5 is the following one.

366


Example 5.6. Let N = 2, β = (β1 , β2 ) ∈ C2 , β1 + β2 = 1, β1 − β2 ∈ 2Z, d = 4, n = 2. We take KerP with a basis (5.15) where λ 0 0 0 λa + b λb 0 0 A= 0 0 1 0 0 0 a b for some a, b, λ ∈ C. Then it is easy to see that KerP is invariant under the operator L3β + λL2β but it is not invariant under any polynomial of Lβ of degree ≤ 2. On the other hand KerP ⊂ KerL4β obviously implies L4+k β KerP ⊂ KerP for k ≥ 0. Therefore the spectral algebra AW is the linear span of the operators −1 , k ≥ 0. P L3β + λL2β P −1 , P L4+k β P This example is interesting for the fact that (for λ 6= 0) the operator of minimal order in the spectral algebra is not a Darboux transformation of a power of Lβ , although the Darboux transformation is monomial. In the last example of this subsection we show that for d = n = 1 our results agree with those of [Z]. Example 5.7. Let d = n = 1, KerP = Cf0 , f0 (x) =

N X

P = ∂x −

ai xβi ,

i=1

f00 (x) , f0 (x)

and βi − βj ∈ N Z if ai aj 6= 0. Then

−1 X X βi L = P Lβ P −1 = P Q = × a i xp i ∂ x − ai xp i x X −1 Pβ (Dx + N ) pi −N +1 X × ai ai xpi x , D x + N − βi X βi 3 = Pb Qb = (Lβ )pi /N × ai ∂ z − z X Pβ (Dz + 1) × ai (Lβ )pi /N z −N +1 , Dz + 1 − βi

where pi = βi − βmin , βmin = min βi , ai 6=0

Pβ (D) =

N Y

(D − βi ),

Dx = x∂x .

i=1

We have Θ(x) = xN

X

a i xpi

2 ,

deg Θ = N + 2(βmax − βmin ),

where βmax = max βi . When f0 (x) = txβ1 + xβ2 , β2 − β1 = N α, α ∈ Z≥0 , ai 6=0

Θ(x) = xN (t + xN α )2 , and we obtain the operator 3 from [Z].


367

5.2. Polynomial Darboux transformations. In this subsection we shall consider the simplest case of polynomial Darboux transformation of an operator of order N , namely when the polynomial h(z) from (1.27) is equal to (z − λN )2 for some λ ∈ C \ 0. Using the kernels of the operators P , Q∗ , Pb and Q∗b from (1.21, 1.22, 3.2, 3.7), we describe the action of the involutions a and b (b1 in the Airy case). Propositions 5.10, 5.12 below raise some interesting questions and conjectures. The Bessel and Airy cases are very similar. We shall consider first the Airy one since it is simpler. Let W ∈ GrA (α), α ∈ CN −1 . Set h(z) = (z − λN )2 ,

g(z) = f (z) = z N − λN .

Then Kerh(Lα ) has a basis of the form n o ∂xk 9α (x, εj λ)

(5.18)

(5.19)

0≤j≤N −1, k=0,1

and KerP has a basis fj (x) = 9α (x, εj λ) + a∂x 9α (x, εj λ),

0≤j ≤N −1

(5.20)

for some a ∈ C. We shall start with the case N = 2. The following example is due to [G3, LP]. We shall obtain it as the simplest special case of Theorem 4.5. Example 5.8. Let N = 2 and α = (α0 ) ∈ C1 . For fixed α0 , a ∈ C \ 0 we take the basis (5.20) of KerP : (5.21) fk (x) = ψk (x) + a∂x ψk (x), k = 0, 1, where ψk (x) = 9α (x, (−1)k λ). Using that ∂x fk = a(α0 x + λ2 )ψk + ∂x ψk , ∂x2 fk = (aα0 + α0 x + λ2 )ψk + a(α0 x + λ2 )∂x ψk , we compute P from Wr(f0 , f1 , ϕ) Wr(f0 , f1 ) 1 a 1 a(α0 x + λ2 ) aα + α x + λ2 a(α x + λ2 ) 0 0 0 = 1 a 2 a(α0 x + λ ) 1

Pϕ =

ϕ ∂x ϕ 2 ∂x ϕ

.

The result is P = ∂x2 +

a2 α0 a2 (α0 x + λ2 )2 − (α0 x + λ2 ) − aα0 ∂x + . 2 1− 1 − a2 (α0 x + λ2 ) 0x + λ ) a2 (α

This expression coincides with that given in [G3] if we set α0 =

2 2 = , 2 + 3t s

a=

s , 2y

λ = 0.

We compute the operators P , Q and Q∗ as follows. If we write

(5.22)

368


P = ∂x2 + p1 (x)∂x + p0 (x), Q = ∂x2 + q1 (x)∂x + q0 (x),

Q∗ = ∂x2 + qe1 (x)∂x + qe0 (x),

then the identity QP = h(Lα ) imply q1 + p1 = 0, 2p01 + q1 p1 + p0 + q0 = −2(α0 x + λ2 ) and

qe1 = −q1 ,

qe0 = −q10 + q0 .

Our observation is that because P ∗ Q∗ = h(La(α) ) and 9aW = f −1 Q∗ 9a(α) , the operator Q∗ has a basis of the form (5.21) with some b ∈ C instead of a and a(α) instead of α. Comparing the above expressions for Q∗ with (5.22) we obtain that b = −a. By Theorem 4.5 the operator Pb also has a basis (5.19) with some c instead of a and µ instead of λ. On the other hand we can compute it directly using Eqs. (4.15, 4.16). Then gb (z) = 1 − a2 (z 2 + λ2 ) which on the other hand is up to a constant z 2 − µ2 . This gives µ2 =

1 − a 2 λ2 . a2

(5.23)

The other coefficients give a surprising result: c = a. In conclusion, if we denote the operator P from (5.22) with P (a, λ) then P = P (a, λ),

Q = P ∗ (−a, λ),

Pb = P (a, µ),

Qb = P ∗ (−a, µ)

(5.24)

where µ and λ are connected by (5.23).

The next example is completely analogous to the above one but to the best of our knowledge it is new. Example 5.9. For N = 3 the Airy operator is Lα = ∂x3 + α2 ∂x − α0 x, α = (α0 , α2 ) ∈ C2 , α0 6= 0. We take P with a basis (5.20) (N = 3). Then using Eq. (4.8) we compute a3 α 0 ∂2 a3 (α0 x + λ3 ) + (1 + a2 α2 ) x a3 α2 (α0 x + λ3 ) + (1 + a2 α2 )α2 + a2 α0 ∂x + a3 (α0 x + λ3 ) + (1 + a2 α2 ) a3 (α0 x + λ3 )2 + aα0 (1 + a2 α2 )(1 + a2 α2 )(α0 x + λ3 ) . − a3 (α0 x + λ3 ) + (1 + a2 α2 )

P = ∂x3 −

A direct computation using Proposition 1.7, Theorem 4.5 and QP = h(Lα ) leads to P = P (a, λ),

Q = −P ∗ (−a, −λ),

Pb = P (a, µ),

Qb = −P ∗ (−a, −µ) (5.25)

1 + a2 α 2 . a3

(5.26)

with µ given by µ3 + λ 3 = −


369

The above examples can be generalized for arbitrary N as follows. Proposition 5.10. Denote by P = P (a, λ) the operator P with a basis (5.20). Then in the above notation we have Q = (−1)N P ∗ (−a, −λ),

Pb = P (a, µ),

Qb = (−1)N P ∗ (−a, −µ)

(5.27)

with λ and µ connected by λN + µN = Pα0 (−1/a),

(5.28)

where Pα0 is the polynomial from (4.1). The spectral algebras 2 AW = P Lα − λN C[Lα ]P −1 , 2 Ab1 W = Pb Lα − µN C[Lα ]Pb−1

(5.29) (5.30)

consist of operators of orders 2N, 3N, 4N, . . .. Proof. Because (−1)N P ∗ (−1)N Q∗ = (La(α) − (−λ)N )2 , Qb Pb = (Lα − µN )2 ,

9bW (x, z) =

9aW (x, z) =

Pb 9α (x, z) , z N − µN

(−1)N Q∗ 9a(α) (x, z) , z N − (−λ)N

we see that Q = (−1)N P ∗ (b, −λ) and Pb = P (c, µ) for some b, c, µ. Using the equation Lα (x, ∂x )9α (x, εj λ) = λN 9α (x, εj λ) we compute pN (x) = Wr(f0 , f1 , . . . , fN −1 ) as in the proof of Lemma 2.9. We obtain pN (x) = −(−a)N (α0 x + λN − Pα0 (−1/a)). Equation (4.17) leads to (5.28) because gb (z) = const · (z N − µN ). Applying (5.28) for Pb instead of P , we obtain Pα0 (−1/a) = Pα0 (−1/c). Note that the map a 7→ c is an automorphism of CP1 since it is an involution. The only solution of the above equation with this property is c = a. To compute (−1)N Q∗ , we note that its second coefficient is equal to that of P which is equal to −p0N (x)/pN (x). This, Proposition 1.7 and Lemma 4.7 imply a(α)0 α0 = , α0 x + λN − Pα0 (−1/a) a(α)0 x + (−λ)N − Pa(α)0 (−1/b) which leads to a polynomial equation for b in terms of a and α. Because a 7→ b is an automorphism of CP1 we obtain that b = −a. Equations (5.29, 5.30) follow from Proposition 5.4. We shall find the analog of Proposition 5.10 in the Bessel case. We use the notation from the beginning of the subsection with β ∈ CN instead of α and Eq. (5.20) modified as follows (cf. (2.20)) (5.31) fj (x) = 9β (x, εj λ) + aDx 9β (x, εj λ) (j = 0, . . . , N − 1, Dx = x∂x ). In the next example we shall study the simplest case N = 2.

370


Example 5.11. For N = 2, β = (1 − ν, ν) the corresponding Bessel operator is Lβ = x−2 (Dx − (1 − ν))(Dx − ν) = ∂x2 +

ν(1 − ν) , x2

Dx = x∂x .

Using (1.38) we compute the operator P from f0 f1 ϕ D x f0 D x f1 D x ϕ 1 Dx2 f0 Dx2 f1 Dx2 ϕ . Pϕ = 2 f1 f0 x D x f0 D x f1 The answer is the following. If we set µ2 = then P =

a + 1 + a2 ν(1 − ν) , a 2 λ2

n o 1 2 2 2 2 p (x )D + p (x )D + p (x ) 2 1 x 0 x x2 p2 (x2 )

with p2 (x2 ) = x2 − µ2 , p1 (x2 ) = µ2 − 3x2 and p0 (x2 ) = −λ2 x4 + (2λ2 µ2 + (a + 1)(2a − 1)a−2 )x2 + ((a + 1)a−2 − λ2 µ2 )µ2 . The operator Pb is (cf. (3.4)) o 1 n 2 Pb = Dx p2 (Lβ ) + Dx p1 (Lβ ) + p0 (Lβ ) g(x) and gb (z) = z 2 (z 2 − µ2 ). A straightforward computation shows that if we set P = P (a, λ, µ) then Q = P ∗ (−a/(a+1), λ, µ), Pb = P (a, µ, λ)Lβ , Qb = Lβ P ∗ (−a/(a+1), µ, λ). (5.32) Therefore we can take Pb = P (a, µ, λ),

Qb = P ∗ (−a/(a + 1), µ, λ),

(5.33)

i.e. the involution b acts simply by exchanging λ with µ and vice versa, while the involution a acts as a 7→ −a/(a + 1). The action of the involutions for arbitrary N is given in the next proposition. Proposition 5.12. Denote by P = P (a, λ) the operator P with a basis (5.31). Then we can take Pb and Qb such that Q = (−1)N P ∗ (b, −λ),

Pb = P (a, µ),

Qb = (−1)N P ∗ (b, −µ)

(5.34)

with λ, µ and a, b connected by λN µN = Pβ (−1/a),

1 1 + + N − 1 = 0, a b

where Pβ is the polynomial from (1.32). The spectral algebras 2 AW = P Lβ − λN C[Lβ ]P −1 , 2 AbW = Pb Lβ − µN C[Lβ ]Pb−1 , consist of operators of orders 2N , 3N , 4N, . . ..

(5.35)

(5.36) (5.37)


371

Proof. We have gb (z) = const · z N (z N − µN ) for some µ ∈ C. Using (3.5) we compute gb (z) = z N det(Dzi fj (z))i,j=0,...,N −1 = (−a)N z N (z N λN − Pβ (−1/a)) which gives the value of µ. We have to prove that Pb given by (3.4) (which is of order 2N ) is divisible by Lβ from the right. Indeed, it is easy to see that Pb (x, ∂x )xβi = P (x, ∂x )xβi |λ=0 and the proof of Lemma 2.9 implies P |λ=0 = Lβ . Thus we can take Pb = P (c, µ) for some c ∈ C. Now (5.35) implies Pβ (−1/a) = Pβ (−1/c) leading to c = a. Finally, as in the Airy case, if Q = (−1)N P ∗ (b, −λ) for some b ∈ C then Pβ (−1/a) = (−1)N Pa(β) (−1/b) = Pβ (1/b + N − 1), showing that a−1 + b−1 + N − 1 = 0. Equations (5.36, 5.37) follow from Proposition 5.4.

In conclusion we want to make some comments. In the case N = 1 the adjoint involution a has a simple and beautiful geometric interpretation (see [W1]): in terms of Krichever’s construction it preserves the spectral curve and maps the “sheaf of eigenfunctions” into some kind of a dual sheaf. In [W1] G. Wilson also posed the problem of describing the action of the bispectral involution on Grad . We think that in the general case the study of the action of the involutions a and b on the bispectral manifolds of polynomial Darboux transformations of Bessel and Airy planes is an equally interesting and difficult task. The above examples lead us to the conjecture that the involutions a and b (b1 in the Airy case) possess some universality property. Any polynomial Darboux transformation W of a Bessel plane Vβ (respectively an Airy plane Vα ) is determined by the points λ1 , . . . , λN (6= 0) at which the conditions C are supported (see (2.1)), by the matrix A defined by (2.20) (resp. (2.19)), and of course by the vector β (resp. α). Then the corresponding matrices for aW and bW (resp. b1 W ) depend only on the matrix A. The point is that they do not depend on the points λ1 , . . . , λN at which the conditions C are supported nor on the vector β (resp. α). Acknowledgement. We are grateful to F. A. Grünbaum and G. Wilson for their interest in the paper and for suggestions which led to improving the presentation of our results. We also thank the referee who proposed important changes towards making the text more “reader friendly”. This work was partially supported by Grant MM–523/95 of Bulgarian Ministry of Education, Science and Technologies.

References [AM]

Adler, M., Moser, J.: On a class of polynomials connected with the Korteweg–de Vries equation. Commun. Math. Phys. 61, 1–30 (1978) [AMM] Airault, H., McKean, H.P., Moser, J.: Rational and elliptic solutions of the Korteweg–de Vries equation and a related many-body problem. Comm. Pure Appl. Math. 30, 95–148 (1977) [AMcD] Atiyah, M.F., Macdonald, I.G.: Introduction to commutative algebra. Reading, MA: Addison– Wesley, 1969 [AvM] Adler, M., van Moerbeke, P.: Birkhoff strata, Bäcklund transformations, and regularization of isospectral operators. Adv. Math. 108, 140–204 (1994)

372


[BHY1] Bakalov, B., Horozov, E., Yakimov, M.: Highest weight modules of W1+∞ , Darboux transformations and the bispectral problem. To appear in Proc. Conf. Geom. and Math. Phys., Zlatograd 95, Bulgaria, in a special volume of Serdica Math. J., q-alg/9601017 [BHY2] Bakalov, B., Horozov, E., Yakimov, M.: Tau-functions as highest weight vectors for W1+∞ algebra. J. Phys. A: Math. Gen. 29, 5565–5573 (1996), hep-th/9510211 [BHY3] Bakalov, B., Horozov, E., Yakimov, M.: Bäcklund–Darboux transformations in Sato’s Grassmannian. Serdica Math. J. 22, no. 4, 571–588 (1996), q-alg/9602010 [BHY4] Bakalov, B., Horozov, E., Yakimov, M.: Highest weight modules over W1+∞ algebra and the bispectral problem. To appear in Duke Math. J., q-alg/9602012 [BHY5] Bakalov, B., Horozov, E., Yakimov, M.: General methods for constructing bispectral operators. Phys. Lett. A222, 59–66 (1996), q-alg/9605011 [BE] Bateman, H., Erdélyi, A.: Higher transcendental functions. New York: McGraw-Hill, 1953 [BC] Burchnall, J.L., Chaundy, T.W.: Commutative ordinary differential operators. Proc. Lond. Math. Soc. 21, 420–440 (1923); Proc. Royal Soc. London (A) 118, 557–583 (1928); Proc. Royal Soc. London (A) 134, 471–485 (1932) [Da] Darboux, G.: Leçons sur la théorie générale des surfaces. 2ème partie, Paris: Gauthiers–Villars, 1889 [DJKM] Date, E., Jimbo, M., Kashiwara, M., Miwa, T.: Transformation groups for soliton equations. In:Proc. RIMS Symp. Nonlinear integrable systems – Classical and Quantum theory. (Kyoto 1981), M. Jimbo, T. Miwa (eds.), 39–111, Singapore: World Scientific, 1983 [Dij] Dijkgraaf, R.: Intersection theory, integrable hierarchies and topological field theory. Lecture Notes at Cargese Summer School (1991), hep-th/9201003 [DG] Duistermaat, J.J., Grünbaum, F.A.: Differential equations in the spectral parameter. Commun. Math. Phys. 103, 177–240 (1986) [F] Fastré, J.: Bäcklund–Darboux transformations and W -algebras. Doctoral Dissertation, Univ. of Louvain, 1993 [GD] Gelfand, I.M., Dickey, L.A.: Fractional powers of operators and Hamiltonian systems. Funct. Anal. Appl. 10, 13–39 (1976) [G1] Grünbaum, F.A.: The limited angle reconstruction problem in computer tomography. Proc. Symp. Appl. Math. 27, AMS, L. Shepp (ed.), 43–61 (1982) [G2] Grünbaum, F.A.: The Kadomtsev–Petviashvilii equation: an alternative approach to the “rank two” solutions of Krichever and Novikov. Phys. Lett. A 139, 146–150 (1989) [G3] Grünbaum, F.A.: Time-band limiting and the bispectral problem. Comm. Pure Appl. Math. 47, 307–328 (1994) [I] Ince, E.L.: Ordinary Differential Equations. New York: Dover, 1944 [KS] Kac, V.G., Schwarz, A.: Geometric interpretation of the partition function of 2D gravity. Phys. Lett. B257, 329–334 (1991) [KV] Kac, V.G., van de Leur, J.W.: The n–component KP hierarchy and representation theory. In: Important developments in soliton theory. A. Fokas, V. Zakharov (eds.), Springer series in nonlinear dynamics, New York: Springer 1993, pp. 302–343 [K] Kasman, A.: Bispectral KP solutions and linearization of Calogero–Moser particle systems. Commun. Math. Phys. 172, 427–448 (1995) [KR] Kasman, A., Rothstein, M.: Bispectral Darboux transformations: The generalized Airy case. To appear in Physica D, q-alg/9606018 [KrN] Krichever, I., Novikov, S.: Holomorphic bundles over algebraic curves and nonlinear equations. Russian Math. Surveys 35, 53–79 (1980) [LP] Latham, G., Previato, E.: Higher rank Darboux transformations. In: NATO ARW Lyon 91 Singular limits of dispersive waves, N. Ercolani, D. Levermore (eds.), New York: Plenum, 1994, pp. 117–134 [MZ] Magri, F., Zubelli, J.: Differential equations in the spectral parameter, Darboux transformations and a hierarchy of master equations for KdV. Commun. Math. Phys. 141, 329–351 (1991) [R] Rothstein, M.: Calogero-Moser pairs and the Airy and Bessel bispectral involutions. Preprint (1996), q-alg/9611027 [S] Sato, M.: Soliton equations as dynamical systems on infinite dimensional Grassmann manifolds. RIMS Kokyuroku 439, 30–40 (1981) [SW] Segal, G., Wilson, G.: Loop Groups and equations of KdV type. Publ. Math. IHES 61, 5–65 (1985) [vM] van Moerbeke, P.: Integrable foundations of string theory. CIMPA–Summer school at Sophia– Antipolis (1991), In: Lectures on integrable systems. O. Babelon et al. (eds.), Singapore: World Scientific, 1994 pp.163–267


[Wa] [W1] [W2] [Z]

373

Wasow, W.: Asymptotic expansions for ordinary differential equations. New York, London, Sydney: Interscience, 1965 Wilson, G.: Bispectral commutative ordinary differential operators. J. Reine Angew. Math. 442, 177–204 (1993) Wilson, G.: Collisions of Calogero-Moser particles and an adelic Grassmannian. Preprint, Imperial College, London (1996) Zubelli, J.: Differential equations in the spectral parameter for matrix differential operators. Physica D 43, 269–287 (1990)

Communicated by T. Miwa

Commun. Math. Phys. 190, 375–394 (1997)

Communications in


Towards a Kneading Theory for Lozi Mappings. II: Monotonicity of the Topological Entropy and Hausdorff Dimension of Attractors Yutaka Ishii? Laboratoire de Topologie et Dynamique, Département de Mathématiques, Université de Paris–Sud, Bâtiment 425, 91405 Orsay, France Received: 1 September 1996 / Accepted: 16 April 1997

Dédié au Professeur A. Douady pour son 60ème anniversaire Abstract: We construct a kneading theory a` la Milnor–Thurston for Lozi mappings (piecewise affine homeomorphisms of the plane). In the first article a two-dimensional analogue of the kneading sequence called the pruning pair is defined, and a topological model of a Lozi mapping is constructed in terms of the pruning pair only. As an application of this result, in the current paper we show the partial monotonicity of the topological entropy and of bifurcations for the Lozi family near horseshoes. Upper and lower bounds for the Hausdorff dimension of the Lozi attractor are also given in terms of parameters. Contents 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 375 2 The Pruning Pair: Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 379 3 Monotonicity when b is Close to Zero . . . . . . . . . . . . . . . . . . . . . . . . . . . 382 4 Monotonicity near Horseshoes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 387 5 Hausdorff Dimension of the Attractors . . . . . . . . . . . . . . . . . . . . . . . . . . 388 5.1 Upper bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 389 5.2 Lower bound . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 392 1. Introduction In his paper [Loz] R. Lozi studied the interesting dynamical behavior of the following two parameter family of piecewise affine homeomorphisms of the plane which is now called the Lozi family: x 1 − a|x| + by 7−→ a, b ∈ R, b 6= 0, L = La,b : y x ? Current address: Department of Mathematical Sciences, University of Tokyo, Komaba, Tokyo 153, Japan. E-mail: [email protected]

376

Y. Ishii

similar to the Hénon family. He observed numerically that this simple mapping may give rise to very complicated dynamics, i.e. he found a strange attractor for the parameter values a = 1.7 and b = 0.5 (see the following Fig. 1.1). Later, this kind of observation was mathematically justified by Misiurewicz for suitable choices of parameters [Mis]. Actually he proved that under some conditions on the parameters, there exists a nonempty compact invariant set F = FL (let us call it the Lozi attractor of L) which satisfies the following three conditions: There exists a neighborhood G of F such that the distance between F and Lna,b (X) tends to zero when n goes to +∞ for every X ∈ G. (ii) The unstable manifold of the hyperbolic fixed point in the first quadrant is dense in F . (iii) La,b on F is topologically mixing, i.e. for all non-empty open sets U and V in F there exists an integer N such that, for any n ≥ N , (i)

Lna,b (U ) ∩ V 6= ∅. Ergodic studies of the dynamics on the attractor can be found in papers of Collet– Levy, Young, etc. [CL, Yo2], where they showed the existence of an SRB measure for some Lozi mappings.

Fig. 1.1. Lozi attractor for a = 1.7, b = 0.5

In the first paper [Ish], inspired by an article of P. Cvitanović et al. [CGP], we constructed symbolic dynamics for Lozi mappings similar to Milnor and Thurston’s

Kneading Theory for Lozi Mappings II

377

kneading theory for unimodal maps [MT]. More precisely, we defined a two-dimensional analogue of the kneading sequence called the pruning front PL and the primary pruned region DL , gave a solution of the pruning front conjecture (a characterization of the set of all admissible sequences), and constructed a topological model of the dynamics of L in terms of the pruning pair (PL , DL ) only. As a consequence, we gave a solution to the so-called first tangency problem, and proved that the boundary of the set of all horseshoes: H ≡ {(a, b) | La,b on Ka,b is equivalent to the full shift and a > 1 + |b|} is described as a graph of an algebraic curve a = g(b), where Ka,b denotes the set of points whose forward and backward orbits by La,b are bounded. We remark that Lozi mappings are homeomorphisms, so do not have critical points in the usual sense. See [CGP, Cvit] for more on the general discussion about the pruning front. The purpose of the current article is to consider some applications of the method developed in the previous paper. The first main result establishes the partial monotonicity of the topological entropy of the Lozi family with fixed Jacobian. To consider the topological entropy of a Lozi mapping, we take the one-point compactification of R2 and extend the map continuously by putting L(∞) = ∞. Theorem 1.1 (Partial Monotonicity of the Topological Entropy). For every b 6= 0 there exists a∗ = a∗ (b) strictly smaller than g(b) such that the topological entropy of La,b is a monotone increasing and non-constant function of a on [a∗ , +∞). Moreover, we have that h(La,b ) < log 2 when a∗ ≤ a < g(b). Besides, the difference g(b) − a∗ (b) is uniformly bounded from below by a positive constant if b runs over a bounded region. When b is sufficiently close to zero, a∗ can be taken approximately 1.97. Here, “monotone increasing and non-constant” in the statement of Theorem 1.1 (and other statements) does not mean “strictly increasing” (remark that, when a is sufficiently large, then the dynamics of La,b on Ka,b is equivalent to the full shift on the two symbols, and thus the topological entropy is always log 2 there). See the remark prior to the proof of Theorem 1.1. This result comes from the monotonicity of the pair (P, D) with respect to the parameter a ≥ a∗ . Roughly speaking, what the monotonicity of the pruning pair (P, D) means is that, if we increase the parameter a a little bit, then the dynamics corresponding to the smaller parameter is realized as a quotient of sub-dynamics of the one corresponding to the larger parameter. See Sect. 3 for the complete definition and statement. From this fact, we also prove that: Theorem 1.2 (Partial Monotonicity of the Bifurcations). Under the same conditions as in Theorem 1.1, (i)

The bifurcations of the periodic orbits of such Lozi family are monotone increasing, i.e. there are only orbit creations but no annihilation when a increases. (ii) A periodic point of minimal period d whose orbit intersects with the x-axis at r points (d ≥ r ≥ 1) at some parameter value a = a0 bifurcates precisely to 2nr periodic points of period nd (not necessarily minimal) for all n ∈ N when a increases. Before that bifurcation the corresponding periodic orbit does not exist, and it never bifurcates after that bifurcation. (iii) All new periodic points are created by this procedure in this monotone region.

378

Y. Ishii

So there are only saddle-node type bifurcations in this monotone region and no period-doubling bifurcation. We can observe bifurcation phenomenon even for a nonperiodic orbit (Corollary 3.4) and determine precisely when it bifurcates (Corollary 3.5). Moreover, one can prove these monotonicity results not only for the a-direction with b fixed but also for every family along a sufficiently short C 1 -curve in the parameter plane which is transverse to ∂H from the outside to the inside of H (see Sect. 4). This monotonicity property may be interpreted as a counterpart of a surprising result by Kanet et al. [KKY] for diffeomorphisms of the plane. There, they showed that one can find both infinitely many orbit-creation and orbit-annihilation parameter values in an arbitrary neighborhood of a non-degenerate homoclinic tangency of a one-parameter family of dissipative C 3 -diffeomorphisms of the plane. Moreover, they conjectured that the Hénon family with a fixed Jacobian −0.3 has many parameter values of such tangency. Our observations on the monotonicity seem to reveal a difference between the Lozi family and the Hénon family or, in other words, between the piecewise hyperbolic category and the smooth non-uniform one (compare the difference of bifurcations between the tent map and the quadratic map in one-dimension; the quadratic family has period doubling bifurcations but tent maps have only saddle-node bifurcations). Actually in the Lozi case, when (a, b) ∈ ∂H and b < 0, the Lozi mapping La,b has a homoclinic tangency (see Theorem 1.3 of [Ish]) but there is no concurrence of orbit-creation and orbit-annihilation in any one-parameter family stated in the previous theorem. Our method gives another application; an upper bound for the Hausdorff dimension of the Lozi attractors. As one sees in Fig. 1.1, the attractor seems to have a fractal-like structure; locally it looks like (segment) × (Cantor set). But if the mappings defining a fractal set have both expanding and contracting directions, then (unlike the conformal case) it becomes a hard task to estimate the Hausdorff dimension of the fractal set even if they are affine (see the book of Falconer [Fal]). Our method, however, gives an upper bound for the Hausdorff dimension of the set K = Ka,b of all points which have bounded orbit both in forward and backward time when a > 1 + |b| and 2a > 1 + 4|b| (see Theorem 5.5). Combining this with Young’s equality [Yo1] for the lower bound, we obtain the following explicit estimates of the Hausdorff dimension of the attractors when an SRB measure exists. √ Corollary 1.3 (Hausdorff Dimension of the Attractors). Assume that 2a > b + 2, b < 4 − 2a and b > 0 small. Then, we have √ log 2 log(a + a2 − 4b) − log 2 √ √ ≤ dimH F ≤ 1 + . 1+ log(a + a2 − 4b) − log 2b log(a + a2 − 4b) − log 2b This estimate becomes sharper when b > 0 goes to 0. Actually the ratio: (the upper bound) − (the lower bound) (the upper bound) − 1 tends to 1 − log a/ log 2 which is close to zero. For example, if we choose a = 1.7 and b = 0.1, then the estimate gives 1.176669 · · · ≤ dimH F ≤ 1.247848 · · · , and for the choice a = 1.7 and b = 0.01 we get 1.102712 · · · ≤ dimH F ≤ 1.135055 · · · .


379

Note. In recent work with D. Sands [IS], the author has proved the monotonicity in a of the Lozi family for all a > 1 and |b| sufficiently small, extending a part of Theorems 1.1 and 1.2.

2. The Pruning Pair: Review In this section we summarize some results established in the first paper [Ish] in a more convenient form for our purpose. In the following, we always assume that a > 1 + |b|. Let K = KL be the set of all points whose forward and backward orbits remain bounded. It is not difficult to see that K is completely invariant under the application of L, non-empty and compact. We develop the symbolic dynamics attached to the Lozi mapping L ≡ La,b on this set K. For a point X ∈ K, we put πL (X) ≡ · · · ε−2 ε−1 · ε0 ε1 ε2 · · · , where

  +1 εi ≡ ∗  −1

Li (X)x > 0, Li (X)x = 0, Li (X)x < 0.

Here ∗ plays a role of “joker”, i.e. we substitute both +1 and −1 for ∗; Yx refers to the x-component of Y . We call an element of π(X) an itinerary of X. So, if an orbit of X lands on the y-axis n times, π(X) consists of 2n itineraries. We sometimes use the notion of an itinerary with joker in which ∗ is not replaced by +1 or −1. Remark that this multi-valued map π conjugates the Lozi mapping on K and the shift map on a subset of {+1, −1}Z . In the following, we write [ π(X) π(B) ≡ X∈B

for any subset B of K. The partial order on the symbol space is given as follows. For every element ε = · · · ε−2 ε−1 · ε0 ε1 · · · in {+1, −1}Z , we call εu ≡ · · · ε−2 ε−1 · the tail of ε, and εs ≡ ·ε0 ε1 · · · the head of ε. Let C u (resp. C s ) be the set of all tails (resp. heads) of the elements of {+1, −1}Z equipped the standard topology. Definition 2.1. Let −1 < +1 be the order on the letters. (i) Take two distinct sequences εs and δ s in C s . Then, we can find the smallest number i ≥ 0 such that εi 6= δi . We say εs <s δ s if one of the following conditions is satisfied: • the number of +1’s in ε0 · · · εi−1 is even and εi < δi , • the number of +1’s in ε0 · · · εi−1 is odd and εi > δi . (ii) Take two distinct sequences εu and δ u in C u . Then, we can find the largest number i < 0 such that εi 6= δi . When b > 0 (resp. b < 0), we say εu δi .

380

Y. Ishii

Fig. 2.1. Partial orders in the symbol space

It is quite convenient to “visualize” the symbol space {+1, −1}Z as a product of two middle third Cantor sets which preserves the orders defined above (see Fig. 2.1). Then, the shift map on the symbol space becomes a Smale horseshoe. Now let us define our two-dimensional analogue of the Milnor–Thurston kneading sequence. The idea is, as in [Cvit, CGP], to regard a Lozi mapping as an “incomplete horseshoe” and measure its incompleteness compared with the full shift. To do this, we first consider the following continued fraction: sn = s(· · · εn−2 εn−1 εn ·) ≡

and

1

,

(2.1)

p(· · · ε−2 ε−1 ·) ≡ 1 − bs−2 + b2 s−2 s−3 − b3 s−2 s−3 s−4 + · · · .

(2.2)

−aεn +

b −aεn−1 +

b −aεn−2 + .

..

Also we consider q(·ε0 ε1 · · ·) ≡ b−1 r0 − b−2 r0 r1 + b−3 r0 r1 r2 − · · · ,

(2.3)

where rn is defined as rn = r(·εn εn+1 εn+2 · · ·) ≡

b

.

b

aεn + aεn+1 +

(2.4)

b aεn+2 + .

..

It was shown in the previous paper that p, sn , q and rn (and their partial derivatives as well) are holomorphic functions of (a, b) ∈ C2 , and continuous with respect to the three variables (a, b, ε). See Lemma 6.1 in [Ish].


381

Now our two-dimensional analogue of the kneading sequence for a Lozi mapping is: Definition 2.2. We call PL ≡ {ε ∈ {+1, −1}Z | p(εu ) − q(εs ) = 0} the pruning front of L and DL ≡ {ε ∈ {+1, −1}Z | p(εu ) − q(εs ) < 0} the primary pruned region of L. The pair (PL , DL ) is called the pruning pair of L. The dynamical interpretation of p and q is the following. In [Ish] we constructed the map: x0,−1 : {+1, −1}Z −→ R2 , which gave the inverse of the itinerary map π on π(K). What we showed there was that each sequence ε (even for a sequence outside π(K)) with a fixed tail · · · ε−2 ε−1 · (resp. head ·ε0 ε1 · · ·) is mapped into a line: x − p(εu ) = y/s−1 (resp. x − q(εs ) = r0 y) which is denoted by Luε (resp. Lsε ). The dynamical interpretation of these lines is that, if ε is in π(K) and if the orbit of X ≡ x0,−1 (ε) never lands on the y-axis, then Luε (resp. Lsε ) expresses the expanding (resp. contracting) direction for the iteration of L at X (see Fig. 2.2).

s Fig. 2.2. Dynamical interpretation of Lu ε and Lε

One of the main results of the previous paper was to give a solution of the “pruning front conjecture” [Cvit, CGP] in this setting. We say that ε is admissible if it is realized as an itinerary of a point X ∈ KL . Let us denote the set of all admissible sequences of L by AL , i.e. π(KL ) = AL . Let σ be the shift map on {+1, −1}Z : σ(· · · ε−2 ε−1 · ε0 ε1 ε2 · · ·) ≡ · · · ε−2 ε−1 ε0 · ε1 ε2 · · · . Then, Theorem 1.2 of [Ish] (the pruning front conjecture) tells us that the set of all admissible symbol sequences is completely characterized by the primary pruned region. More precisely, we have shown that

382

Y. Ishii

AL = {+1, −1}Z \

[

σ n DL .

(2.5)

n∈Z

Next let us define an equivalence relation in AL to construct a topological model of the dynamics of L on KL . Definition 2.3. Let L be a Lozi mapping and P be its pruning front. For an admissible sequence ε (resp. ε0 ), let M (resp. M0 ) be the set of all integers m such that σ m ε ∈ P (resp. σ m ε0 ∈ P). We write ε ∼P ε0 if M = M0 and εn−1 = ε0n−1 for all n ∈ Z \ M. Let ι be the natural projection with respect to this relation. We set π/∼P ≡ ι ◦ π. Let σ/∼P be the factor of σ by the projection, i.e. σ/∼P ≡ ι ◦ σ ◦ ι−1 . Theorem 2.4 (Combinatorics, [Ish] Theorem 5.5). π/∼P gives a topological conjugacy between L on KL and σ/∼P on AL ∼ , i.e. π/∼P is a homeomorphism onto P AL ∼ such that the following diagram commutes: P

KL   π/∼P y AL ∼

L

−−−−→

P

KL  π/ y ∼P σ/∼ −−−−P→ AL ∼ . P

We remark that σ/∼P on AL ∼ is determined by the pruning pair (PL , DL ) of P L only. Thus, we can analyze any topological properties of the dynamics of L on KL through the investigation of its pruning pair. Using this theorem, we have solved the so-called first tangency problem, and as a consequence, one observed that the boundary of H is algebraic. Corollary 2.5 (Boundary of Horseshoes, [Ish] Corollary 1.4). The boundary of H forms an algebraic curve. Moreover, the boundary becomes a graph of a function a = g(b). When (a, b) ∈ ∂H, the primary pruned region is empty, and the pruning front consists of two points · · · + + + − + · + − − − · · · and · · · + + + − − · + − − − · · · when b > 0 (· · · − − − + + · + − − − · · · and · · · − − − + − · + − − − · · · when b < 0). 3. Monotonicity when b is Close to Zero In this section we study dependence of the Lozi family with respect to the parameter a for fixed Jacobian −b sufficiently close to zero. Let L and L0 be two Lozi mappings. Define a partial order between their pruning pairs as follows. Definition 3.1. We write (PL , DL ) < (PL0 , DL0 ) if DL ⊃ DL0 and DL ⊃ PL0 . This definition means that the dynamics corresponding to the smaller pair (PL , DL ) is realized as a quotient of sub-dynamics of the one corresponding to the pair (PL0 , DL0 ). The next proposition plays a central role in this section which establishes the partial monotonicity of the pruning pair.


383

Proposition 3.2. There exist b∗ > 0 and a∗ ≈ 1.97 such that, for any fixed b with 0 < |b| < b∗ , the pair (PL , DL ) is a monotone increasing function of a ≥ a∗ . Moreover, this function is non-constant in the sense that DL contains a cylinder set when a∗ ≤ a < 2. Remark that, when a is large, both PL and DL are empty, i.e. the pruning pair is maximal. First, we show that a Lozi mapping is “close” to a horseshoe if a is not small. Let us put Cl ≡ {ε ∈ {+1, −1}Z | ε0 = 1, ε1 = −1, · · · , εl = −1}. Recall that the biggest head is · + − − − · · ·. See Fig. 3.1.

Fig. 3.1. P and D close to a horseshoe

Lemma 3.3. Let (Pa,b , Da,b ) denote the pair corresponding to La,b . (i) For a fixed aˆ > 1, let n be a number which satisfies 2/ân+1 > (2 − aˆ )/(â − 1). Then, there exists bˆ > 0 such that, for every (a, b) satisfying a > aˆ and bˆ > |b| > 0, we have Pa,b ∪ Da,b ⊂ Cn . (ii) For a fixed a˜ < 2, let m be a number which satisfies 2/ãm+1 < 2 − a˜ . Then, there exists b˜ > 0 such that, for any b satisfying b˜ > |b| > 0, we have Cm ⊂ Da,b ˜ . Proof. The following specific calculation was suggested by H. H. Rugh. Suppose that b = 0. Then, by (2.2) and (2.3), (p − q) equals 1−

1 1 1 + − 3 + ···. aε0 a2 ε0 ε1 a ε0 ε 1 ε 2

Given n as above, one easily sees that (p − q) is positive outside Cn for b = 0 and a ≥ aˆ . We know that (p − q) is a continuous function of (ε, a, b). So, in a neighborhood of b = 0 and aˆ ≤ a ≤ 3, (p − q) is positive outside Cn . Given m as above, one can see that (p − q) is negative in Cm for b = 0, a = a˜ . The rest of the proof of (ii) is now similar.

384

Y. Ishii

Proof of Proposition 3.2. Take first b = 0. We remark that, in this case, p ≡ 1. Then, an easy calculation shows that the a-derivative of (p − q) at the head · + − − − · · · becomes 0 −1/(a − 1) = 1/(a − 1)2 , which is always positive. The continuity of ∂q/∂a with respect to the head asserts that the derivative is also positive on the cylinder set CN for sufficiently large N when b = 0. By the same argument as in the previous lemma, we conclude that (p − q) is a monotone increasing function of a on CN for b 6= 0 sufficiently close to 0. Take a∗ ≈ 1.97 so that the cylinder set CN contains D and P as in the previous lemma (actually, one can take N = 5). This proves the monotonicity of the pruning pair (P, D). Here we will give the proofs of Theorems 1.1 and 1.2 when b is sufficiently close to zero. The complete proofs will be given in the next section. From this proposition we can first observe the bifurcations of the Lozi family. To do this, for a fixed symbol sequence ε in {+1, −1}Z we consider the parameter dependence of X ≡ x0,−1 (ε). Under the assumption of Proposition 3.2 (i.e. for a fixed b sufficiently close to 0 and a ≥ a∗ ), we say such a point X appears at a = a0 if ε is not admissible for all a < a0 sufficiently close to a0 and is admissible for a = a0 . This definition is well-defined, i.e. it does not depend on the choice of ε such that X = x0,−1 (ε). In the same way, we say such a point X = x0,−1 (ε) disappears at a = a0 if ε is not admissible for a > a0 sufficiently close to a0 and is admissible for a = a0 . One gets the next corollary on the bifurcations of non-periodic orbits. Corollary 3.4. Let b∗ > 0 and a∗ ≈ 1.97 be as in Proposition 3.2 and fix b with 0 < |b| < b∗ . Then, every point in K moves continuously, and does not disappear when a ≥ a∗ increases. Moreover, a non-periodic point in K whose orbit intersects with the x-axis at r points (+∞ ≥ r ≥ 0) bifurcates to precisely 2r points as a ≥ a∗ increases, and it never bifurcates after that bifurcation. Proof. Take a0 and a1 such that a∗ ≤ a0 < a1 . Then, by Proposition 3.2, any itinerary ε in π(X) of a point X ∈ K for a = a0 is admissible for a = a1 . This proves the first half of the corollary. Now suppose that the orbit of a point X intersects with the x-axis at r points when a = a0 . This means that π(X) contains r jokers. Thus, the map x0,−1 is 2r to one on π(X). When one takes a = a1 , then no element of π(X) is contained in the shifted images of the pruning front, so now the map x0,−1 is one to one on each element of π(X). Thus, the second half of the corollary follows from Theorem 2.4. Proof of the second part of Theorem 1.2. Suppose that we have a periodic point X of minimal period d at a = a0 whose orbit intersects with the x-axis at r points (d ≥ r ≥ 1; if r = 0, then there is no more bifurcation by Proposition 3.2). Then, its itinerary with joker ∗ is a symbol sequence of period d which contains r jokers in every successive d letters. For any n ∈ N, there are precisely 2nr ways to make a symbol sequence of period nd (not necessarily minimal) by replacing each joker by +1 or −1. When one takes a > a0 , the map x0,−1 sends such periodic sequences to distinct periodic points in the dynamical space of period nd (not necessarily minimal). This proves the second part of Theorem 1.2. One can sharpen these results on the occurrence of bifurcations in the following way: Corollary 3.5. X ∈ K appears at a = a0 if and only if {Ln (X)}n∈Z ∩ {x-axis} is non-empty.


385

Proof. Let X ≡ x0,−1 (ε) and put un (a) = un (ε, a) ≡ (p − q)(σ n ε) at a. Then, X appears at a = a0 if and only if un (a0 ) ≥ 0 for all n ∈ Z, and there exists a sequence ai < a0 (i > 0) converging to a0 and a sequence of integers n(i) such that un(i) (ai ) < 0. So zeros of un(i) (a) (which will be denoted by zn(i) ) accumulate to a0 . The a-derivative of un (a) is uniformly bounded with respect to n near a0 so un(i) tends to zero, which shows the accumulation of Ln (X) to the x-axis. Conversely if Ln (X) accumulates to the x-axis, then there exists a sequence n(i) such that un(i) (a0 ) tends to 0. This means that, when i is big, σ n(i) (ε) is in the cylinder set CN as in (ii) of Lemma 3.3. Here the a-derivative of un(i) (a) is uniformly away from 0 with respect to n(i) near a0 so this implies that zn(i) accumulates to a0 , which proves the converse. Proof of the third part of Theorem 1.2. This proceeds almost in the same way as the previous corollary. Suppose that there exists a periodic point X of period d at a = a1 and assume that its itineraries π(X) are not admissible at a = a2 < a1 . This inadmissibility is independent of the choice of a sequence in π(X). In fact, if π(X) has more than one sequence (this means that each element of π(X) is in the (pre)image of the pruning front), then all sequences in π(X) are inadmissible when a = a2 < a1 by Proposition 3.2. Because X is periodic of period d, there exists a periodic symbol sequence ε ∈ π(X) of period d. As in the argument of the previous corollary, we can find a3 between a1 and a2 and some number 0 ≤ m ≤ d − 1 such that (p − q)(a3 , b, σ m ε) = 0 and ε is admissible at a = a3 by the intermediate value theorem. This means that Lm a3 ,b (X) lands on the x-axis. Hence we have proven Theorem 1.2. Remark. This proof shows that the (pre)images of “tangencies” of the stable line Lsε and the unstable line Luε on the x-axis create all new periodic points in the monotonicity region. Here, a “tangency” means a contact of an unstable manifold with a stable manifold (which are broken lines) at its corner. See Fig. 3.2.

Fig. 3.2. “Tangency” on the x-axis

The following Fig. 3.3 shows the typical bifurcation diagram in which we summarize the statements in Theorem 1.2. Another consequence of Proposition 3.2 is the dependence of the topological entropy with respect to the parameters. It is natural to consider the topological entropy for a

386

Y. Ishii

Fig. 3.3. Monotonicity of the bifurcation diagram

continuous map on a compact space. So we take the one-point compactification of R2 and extend the Lozi maps continuously by putting L∗ (∞) = ∞. Because the topological entropy on the whole compact space is equal to that on the non-wandering set Ω, one easily obtains hL∗ (R2 ∪ {∞}) = hL∗ (Ω∗ ) ≤ hL∗ (K ∪ {∞}) = hL (K), where Ω∗ = Ω(L∗ ). But hL (K) ≤ hL∗ (R2 ∪ {∞}), so we have equality everywhere. Thus we may calculate the topological entropy on the K-set, i.e. on the topological model in the symbol space due to Theorem 2.4. Remark. For any a˜ < 2, we may pick m as in Lemma 3.3 (ii). This implies that the corresponding symbol space lacks at least sequences which contain a certain word of finite length m. If we regard each word of length m as a new symbol and σ m as a new shift map, then we have a new full shift with 2m symbols. When a = a˜ , there are at most 2m − 1 admissible symbols, so the topological entropy of this new shift map at a = a˜ is at most log(2m − 1). Hence the topological entropy at a = a˜ is at most 1 log(2m − 1) m due to the fact that the entropy of σ m is equal to the entropy of σ times m. Thus, we conclude that h(La,b ) < log 2 for any a < 2 (see also Corollary 1.5 of [Ish]). Proof of Theorem 1.1. Take b with 0 < |b| < b∗ and a∗ ≤ a < a0 . Put L ≡ La,b and L0 ≡ La0 ,b , and let P and P 0 be their pruning fronts respectively. From (2.5) and Proposition 3.2, one sees that AL ⊂ AL0 . Again by the proposition, there is a natural projection (this is just ι induced by P): ι : AL ∼ 0 −→ AL ∼ , P

P

which conjugates the dynamics. So we get hσ/∼P AL ∼ ≤ hσ/∼ 0 AL ∼ 0 ≤ hσ/∼ P

P

P

P0

A L0

∼P 0

.


387

By Theorem 2.4, this implies that hL (KL ) ≤ hL0 (KL0 ). This completes the proof of Theorem 1.1 when b is close to zero.

4. Monotonicity near Horseshoes In this section, we consider the monotonicity properties of the Lozi family along several directions near horseshoes. To state this, let nX ∈ TX R2 be an inner normal vector of ∂H at X ∈ ∂H \ {(2, 0)}, and let us put M ≡ {(X, v) ∈ T R2 | X ∈ ∂H \ {(2, 0)}, v ∈ TX R2 such that (nX , v) > 0}. Proposition 4.1. Let

f (t) = f : (−1, 1) −→ R2

be any C 1 -curve from the open interval to the parameter space such that (f (0), Dt f |t=0 ) ∈ M (see Fig. 4.1). Then, there exists δ > 0 such that the mapping: t 7−→ (Pf (t) , Df (t) ) is a monotone increasing function on [−δ, δ]. Moreover, this function is non-constant in the sense that Df (t) contains a cylinder set when −δ ≤ t < 0.

Fig. 4.1. C 1 -curves transverse to ∂H

Proof. The idea is first to show that the a-derivative of (p − q) is positive at (a, b, ε), where (a, b) ∈ ∂H and ε satisfying (p − q)(a, b, ε) = 0. For simplicity, we consider only the case b > 0. Then, when (a, b) ∈ ∂H, the primary pruned region is empty and the pruning front consists of two points · · ·+++−+·+−−− · · · and · · ·+++−−·+−−− · · · by Corollary 2.5. At these special points, we see p(· · · + + + − ± ·) − q(· + − − − · · ·) = 1 −

b b − , (a + x)(1 + x) (a + x)(b + x)

388

Y. Ishii

where x = (a −

√ a2 + 4b)/2. So, one has

n ∂x (2x + a + 1) + (x + 1) ∂ (p − q) = b ∂a + ∂a (a + x)2 (1 + x)2

+ a + 1) + (x + b) o . (a + x)2 (b + x)2

∂x ∂a (2x

It is easy to see that all of ∂x ∂a , (2x + a + 1), (x + 1) and (x + b) are positive. Because the boundary of H is described as a graph of a function a = g(b) which is monotone increasing when b > 0, the derivative of (p − q) in the direction Dt f |t=0 is also positive at f (0) ∈ ∂H and at the two special points as above. Due to the continuity of the derivative of (p−q) in the direction Dt f |t=0 , the derivative is still positive in a neighborhood of the special two points and near t = 0. Thus, Df (t) contains a cylinder set when t is negative and close to zero. Take a smaller neighborhood of t = 0, if necessary, so that all the corresponding pruning fronts Pf (t) are in the neighborhood of the two points. This finishes the proof. Remark. The length of the C 1 -curve can be chosen uniformly for every choice of compact subset of M . However, it may shrink to zero when b tends to zero. In particular, we know nothing about the monotonicity of b 7−→ (P2,b , D2,b ). The proofs of the following two corollaries are same as in Sect. 3, so we omit them. Corollary 4.2. Under the conditions of Proposition 4.1, there exists δ > 0 such that the mapping: t 7−→ h(Lf (t) ) is a monotone increasing function on [−δ, δ]. Moreover, we have h(Lf (t) ) < log 2 when −δ ≤ t < 0. Corollary 4.3. Under the conditions of Proposition 4.1, the bifurcations of Lf (t) are monotone as in the statements of Theorem 1.2. Combining these corollaries with the results in the previous section, we have the complete proofs of Theorems 1.1 and 1.2. Finally we summarize the results above to get a monotone picture in the parameter space near ∂H as Fig. 4.2. 5. Hausdorff Dimension of the Attractors The Lozi attractor is a typical example of a fractal. As mentioned in the introduction, locally it looks like (Cantor set)×(segment). If the mappings defining a fractal set are contractions, then one can get some estimates of the Hausdorff dimension of the fractal using the contraction constants. But if they have both expanding and contracting direction, then the estimation becomes very difficult even if they are affine maps. This is because the Hausdorff dimension depends largely not only on the contraction (or expanding) ratios but also on the relative position of the miniatures of the fractal (see [Fal], p. 126). In the Lozi case, we can estimate the Hausdorff dimension of the set K from above using our method. This upper bound is given when a > 1 + |b| and 2a > 1 + 4|b|. The lower bound is given when an SBR measure exists due to Young’s equality between entropy, Lyapunov exponents and Hausdorff dimension of an invariant ergodic measure.


389

Here, −→ means the direction of increase of the topological entropy and the bifurcations Fig. 4.2. Global picture of the monotonicity near ∂H

5.1. Upper bound. As we have mentioned in the introduction, the Lozi attractor F densely contains an unstable manifold which is a broken line. So, there is little hope that the upper bound of the Hausdorff dimension of F in the “unstable direction” would be better than 1, and thus, we should analyze the Hausdorff dimension of F in the “Cantor direction”. More precisely, due to the inclusion: [ [ Luε , Lsε ∩ Luε ⊂ K⊂ ε∈{+1,−1}Z

ε∈{+1,−1}Z

we want to calculate the Hausdorff dimension of the distribution of p(εu ) in the x-axis: Λa,b ≡ {p(εu ) ∈ R | εu ∈ C u }. Recall first a basic fact on Hausdorff dimension (see the book [Fal], for example, for the definition of Hausdorff dimension and the following lemma). Lemma 5.1. Let X and Y be two metric spaces and f be a Lipschitz map from X to Y . Then we have dimH f (X) ≤ dimH X, where dimH D denotes the Hausdorff dimension of D. Here, we prepare a standard model of the Cantor set as; X κn · · · ε−3 ε−2 ∈ {+1, −1}N Γ (κ) ≡ ε−2 · ε−3 · · · ε−n n≥2

for every κ < 1/2. It is easy to see that the Hausdorff dimension of Γ (κ) is − log 2/ log κ. The central claim in this subsection is: Proposition 5.2. The set Λa,b in the x-axis has Hausdorff dimension not greater than log 2 p . log 2 − log(a − a2 − 4|b|)

390

Y. Ishii

We remark that Λa,b is not a dynamically defined Cantor set in the sense of Palis– Takens. Proof of Proposition 5.2. We put p p 0 −(a − a2 − 4|b|) a − a2 − 4|b| 1 , x∈ κ1 ≡ max −aε + bx 2|b| 2|b| =

4|b| p , (a + a2 − 4|b|)2

and u

u

u

κ2 ≡ max{|bs(ε )| | ε ∈ C } =

a−

p a2 − 4|b| . 2

Recall that the dynamics: x 7−→

1 −aε + bx

(ε = ±1)

(5.1)

is associated to the definition of sn . It is easy to check that κ1 < κ2 . By the previous lemma, it is enough to construct a Lipschitz map from Γ (κ) onto Λa,b for every κ > κ2 . For εu = · · · ε−2 ε−1 · ∈ C u and n ≥ 0, we introduce the notation: (σ n εu )u ≡ · · · ε−n−2 ε−n−1 · . Then, one can rewrite u

p(ε ) = 1 +

∞ X

(−b)n s((σεu )u ) · · · s((σ n εu )u ).

n=1

Lemma 5.3. We have the following two recursion equations: s((σ n−1 εu )u ) = and

1 −aε−n + bs((σ n εu )u )

p((σ n−1 εu )u ) = 1 − bs((σ n εu )u )p((σ n εu )u ).

(5.2)

(5.3)

Now, fix an integer N > 0 and consider two elements in C u : εu = · · · ε−N −3 ε−N −2 ε−N −1 · · · ε−1 · and

δ u = · · · δ−N −3 δ−N −2 ε−N −1 · · · ε−1 · .

By (5.1), the first recursion Eq. (5.2) and the definition of κ1 , one gets −n |bs((σ n εu )u ) − bs((σ n δ u )u )| ≤ C · κN 1

(5.4)

for all n > N . If 2a > 1 + 4|b|, then |bs(εu )| < 1/2, which implies that each value of p(εu ) is in the open interval (0, 2). So, we start with |p(· · · ε−N −2 ε−N −1 ·) − p(· · · δ−N −2 ε−N −1 ·)| = |p((σ N εu )u ) − p((σ N δ u )u )| ≤ 2.


391

From the second recursion Eqs. (5.3), (5.4) and the previous estimate, we see |p((σ N −1 εu )u )−p((σ N −1 δ u )u )| = |bs((σ N εu )u )p((σ N εu )u ) − bs((σ N δ u )u )p((σ N δ u )u )| ≤ 2Cκ1 + 2κ2 ≤ (2C)2κ2 . Again, in the same way, one obtains (see Fig. 5.1) |p((σ N −2 εu )u ) − p((σ N −2 δ u )u )| = |bs((σ N −1 εu )u )p((σ N −1 εu )u ) − bs((σ N −1 δ u )u )p((σ N −1 δ u )u )| ≤ ((2C)2κ2 )κ2 + 2Cκ21 ≤ (2C)3κ22 .

Fig. 5.1. Recursion Eq. (5.3)

Inductively, we can get |p(εu ) − p(δ u )| ≤ (2C)(N + 1)κN 2 . Thus, we have shown that the mapping: v=

X n≥2

κn 7−→ p = p(· · · ε−2 ε−1 ) ε−2 · ε−3 · · · ε−n

from Γ (κ) onto Λa,b is Lipschitz for any κ > κ2 . This finishes the proof of Proposition 5.2. Next we work for the unstable direction. Let pt (εu ) be the x-coordinate of the intersection of Luε and {y = t}.

392

Y. Ishii

Lemma 5.4. The mapping εu 7−→ pt (εu ) from Γ (κ) to the line {y = t} is Lipschitz for all κ > κ2 . Proof. By the definition, one sees pt (εu ) = p(εu ) − t/s−1 . So, we get t u u |pt (εu ) − pt (δ u )| ≤ |p(εu ) − p(δ u )| + u u |s(ε ) − s(δ )| . s(ε )s(δ ) The first term in the right hand side is Lipschitz. |s(εu )| is bounded from below inde pendent of εu . So, we obtain the result from (5.4). Thus, we have shown that the surjective map: [ Γ (κ) × R −→

Luε

ε∈{+1,−1}Z

defined by (εu , t) 7−→ (pt (εu ), t) is Lipschitz for every κ > κ2 as promised. This implies that Theorem 5.5 (Hausdorff Dimension of the K-sets). Suppose that a > 1 + |b| and 2a > 1 + 4|b|. Then we have dimH K ≤ 1 +

log 2 p . a2 − 4|b|)

log 2 − log(a −

Remark. When K is a horseshoe, we can repeat a similar argument for an estimate of the Hausdorff dimension in the “unstable direction”, and get a better bound for dimH K, i.e. the number “1” in the upper bound of Theorem 5.5 can be replaced by a better bound. 5.2. Lower bound. The lower bound for the Hausdorff dimension is essentially given by Young’s celebrated equality [Yo1]. To explain this, we first recall some definitions. Let f be a C 2 -diffeomorphism on a compact surface (or a Lozi mapping) and ν be an invariant ergodic Borel probability measure. Then, the Hausdorff dimension of ν is defined by dimH ν ≡ inf {dimH B | ν(B) = 1} . By the ergodic theorem, the numbers: λ± f ≡ lim

n→±∞

1 log kDX f n k n

exist and constant for ν-almost everywhere X. We call them the ν-Lyapunov exponents of f . Now suppose that the Lozi mapping has a strange attractor F as in the introduction, and let b > 0 be small. Then it is shown by Collet–Levy and Young [CL, Yo2] that there exists an SRB measure µ for La,b . The following is an extended version of the Young’s equality [Yo1] for Lozi mappings established in [CL]. Proposition 5.6. For an SRB measure µ for La,b , we have 1 1 , − dimH µ = hµ (L) λ+L λ− L where hµ (L) is the µ-entropy of L.


that

393

Using Pesin’s entropy formula hµ (L) = λ+L for maps with singularity, we conclude dimH F ≥ 1 −

λ+L . λ− L

(5.5)

Lemma 5.7. We have the following estimates of the exponents of La,b : √ a + a2 − 4b + λL ≥ log > 0, 2 and λ− L

≥ − log

a+

√ a2 − 4b < 0. 2b

Proof. It is proven by Misiurewicz [Mis] that, if La,b satisfies a > 1 + |b|, then there is an invariant splitting: u s ⊕ EX TX R2 = EX u s of the tangent space TX R2 into one dimensional subspaces EX and EX at every point X ∈ R2 , where Lm is differentiable for all m ∈ Z, so that !n ÿ √ a + a2 − 4b n u kvk for all v ∈ EX kDX L vk ≥ 2

ÿ

and n

kDX L vk ≤

a−

!n √ a2 − 4b kvk 2

s for all v ∈ EX

for n ≥ 0. From these inequalities, we obtain the estimates for λ± L.

Proof of Corollary 1.5. Equation (5.5) combined with the previous lemma gives the desired estimates. Acknowledgement. The author would like to express his gratitude to P. Cvitanović, H. H. Rugh, D. Sands and M. Shishikura for interesting discussions and encouragements. This paper was finished during the author’s stay at Laboratoire de Topologie et Dynamique, Université de Paris–Sud. He is very grateful to the hospitality of this institute. This work is partially supported by the Research Fellowships of the Japan Society for the Promotion of Science for Young Scientists no. 3079.

References [CL]

Collet, P., Levy, Y.: Ergodic properties of the Lozi mappings. Commun. Math. Phys. 93, 461–481 (1984) [Cvit] Cvitanović, P.: Periodic orbits as the skeleton of classical and quantum chaos. Physica D 51, 138–151 (1991) [CGP] Cvitanović, P., Gunaratne, G.H., Procaccia, I.: Topological and metric properties of Hénon-type strange attractors. Phys. Rev. A 38, 1503–1520 (1988) [Fal] Falconer, K.: Fractal Geometry; Mathematical Foundations and Applications. New York: John Wiley and Sons, 1990 [Ish] Ishii, Y.: Towards a kneading theory for Lozi mappings I: A solution of the pruning front conjecture and the first tangency problem. To appear in Nonlinearity (1997) [IS] Ishii, Y., Sands, D.: The Lozi family is monotone near the tent-maps. Preprint, Paris (1996)

394

Y. Ishii

[KKY] Kan, I., Koçak, H., Yorke, J.: Antimonotonicity: Concurrent creation and annihilation of periodic orbits. Ann. Math. 136, 219–252 (1992) [Loz] Lozi, R.: Un attracteur e´ trange(?) du type attracteur de Hénon. J. Phys. (Paris) 39 (Coll. C5), 69–77 (1978) [MT] Milnor, J., Thurston, W.: On iterated maps of the interval. Preprint Princeton University (1977) and published in: Dynamical Systems, J.C. Alexander (ed.). Lecture Notes in Math., Vol. 1342. Berlin– Heidelberg–New York: Springer, 1988, pp. 465–563 [Mis] Misiurewicz, M.: Strange attractors for the Lozi mappings. In: Nonlinear Dynamics, R.G. Helleman (ed.). New York: The New York Academy of Sciences, 1980, pp. 348–358 [Yo1] Young, L.-S.: Dimension, entropy, and Lyapunov exponents. Erg. Th. Dyn. Syst. 2, 109–124 (1982) [Yo2] Young, L.-S.: Bowen–Ruelle measures for certain piecewise hyperbolic maps. Trans. Amer. Math. Soc. 287, 41–48 (1985) Communicated by Ya.G. Sinai

Commun. Math. Phys. 190, 395 – 410 (1997)

Communications in


N = 2 KP and KdV Hierarchies in Extended Superspace F. Delduc, L. Gallot Laboratoire de Physique Théorique ENSLAPP, URA 14-36 du CNRS, associée a` l’ENS de Lyon et au LAPP, Groupe de Lyon: ENS Lyon, 46 Allée d’Italie, 69364 Lyon, France Received: 6 March 1997 / Accepted: 18 April 1997

Abstract: We give the formulation in extended superspace of an N = 2 supersymmetric KP hierarchy using chirality preserving pseudo-differential operators. We obtain two quadratic hamiltonian structures, which lead to different reductions of the KP hierarchy. In particular we find two different hierarchies with the N = 2 classical super-Wn algebra as a hamiltonian structure. The relation with the formulation in N = 1 superspace and the bosonic limit are carried out. Introduction There has been recently an important activity in the study of N = 2 supersymmetric hierarchies (KP [1, 2, 3, 4, 6], generalizations of KdV [5, 7], Two Bosons [8], NLS [9, 10, 11], etc..). The most usual tools in this field are the algebra of N = 1 pseudodifferential operators and Gelfand-Dickey type Poisson brackets [12]. Although these systems have N = 2 supersymmetry, only for very few of them with very low number of fields is a formulation in extended superspace known. It is the purpose of this paper to partially fill this gap. The formalism which we shall present here partly originates from the article [13]. It turns out that in order to construct the Lax operators of N = 2 supersymmetric hierarchies, one should not use the whole algebra of N = 2 pseudo-differential operators, but rather the subalgebra of pseudo-differential operators preserving chirality. These operators were first considered in [14]. They will be defined in Sect. 1, where we also study the KP Lax equations and the two associated Hamiltonian structures. It turns out that the first (linear) bracket is associated with a non-antisymmetric r matrix [15]. Because of that, the second (quadratic) bracket is not of pure Gelfand-Dickey type. The main result of this paper is that we find two possibilities for this quadratic bracket. In fact, we show that there is an invertible map in the KP phase space which sends one of the quadratic Poisson structures into the other. However, this map does not preserve the Hamiltonians.

396

F. Delduc, L. Gallot

In Sect. 2, we study the possible reductions of the KP hierarchy by looking for Poisson subspaces in the phase space. These are different depending on the quadratic bracket which is used. Among these reductions, there are two different hierarchies with the N = 2 classical super-Wn algebra [16] as a hamiltonian structure. In particular, two of the three known N = 2 supersymmetric extensions of the KdV hierarchy [17] are found. They correspond to a = −2 and a = 4 in the classification of Mathieu. These and some other examples are described in Sect. 3. Notice that from the known cases with a low number of fields [17, 18, 19, 20, 21, 22], one expects for any n three hierarchies with super-Wn as a hamiltonian structure. So our construction does not exhaust the possible cases. We also found two hierarchies whose Poisson structure is the classical “small" N = 4 superconformal algebra. In one case the evolution equations are N = 4 supersymmetric, while in the other they are only N = 2 supersymmetric. In Sect. 4 we give the relation of our formulation with the usual formulation of the N = 2 supersymmetric KP Lax equations in N = 1 superspace [23, 3, 6]. Finally, in Sect. 5 we study the bosonic limit of the N = 2 hierarchies. It will turn out that the bosonic limit of the two N = 2 Poisson brackets contains the Gelfand-Dickey bracket, together with two non-standard brackets, found by Oevel in [24], and associated with non-standard KP. We can also check that for the hierarchies studied in this article, the conjecture formulated in [5] about the bosonic limit of N = 2 KdV-type hierarchies applies. 1. N = 2 KP Hierarchy N = 2 supersymmetry. We shall consider an N = 2 superspace with space coordinate ¯ We shall use the notation x for the triple of x and two Grassmann coordinates θ, θ. ¯ The supersymmetric covariant derivatives are defined by coordinates (x, θ, θ). ∂≡

∂ 1¯ ∂ ¯ = ∂ + 1 θ∂, D2 = D ¯ 2 = 0, {D, D} ¯ =∂ , D= + θ∂, D ∂x ∂θ 2 ∂ θ¯ 2

(1.1)

Besides ordinary superfields H(x) depending arbitrarily on Grassmann coordinates, one ¯ can also define chiral superfields ϕ(x) satisfying Dϕ = 0 and antichiral superfields ϕ(x) ¯ ϕ¯ = 0. We define the integration over the N = 2 superspace to be satisfying D Z Z 3 ¯ ¯ θ=θ=0 ¯ θ, θ)| (1.2) d x H(x, θ, θ) = dxDDH(x, ¯ . The elements of the associative algebra of N = 2 pseudo-differential operators (9DOs) are the operators X ¯ + αi D + βi D)∂ i , (ai + bi [D, D] (1.3) P = i<M

where ai , bi and αi , βi are respectively even and odd N = 2 superfields. However, this algebra is not very manageable. In particular, the set of strictly pseudo-differential operators (M = 0 in(1.3)) is not a proper subalgebra, but only a Lie subalgebra. Also, there are too many fields in these operators. We expect the phase space of the N = 2 KdV hierarchies to consist of the supercurrents of the N = 2 Wn algebras. In extended superspace, these supercurrents are bosonic superfields, and there is one such superfield for a given integer dimension. But in (1.3), each power of ∂ corresponds to four superfields, two even ones of integer dimension and two odd ones of half-integer dimension.

N = 2 KP and KdV Hierarchies in Extended Superspace

397

It is thus clear that one has to restrict suitably the form of the N = 2 operators. It turns out that a possible restriction is to define the set Cˇ of pseudo-differential operators Lˇ preserving chirality of the form1 X ¯ Lˇ = DLD, L= ui ∂ i (1.4) i<M

The coefficient functions ui are bosonic N = 2 superfields. These operators satisfy ¯ = 0. The product of two chiral operators is again a chiral operator. The DLˇ = Lˇ D explicit product rule is easily worked out ¯ 0 ) D, ¯ (1.5) Lˇ Lˇ 0 = D L∂L0 + (D.L)(D.L where we have used the notation (D.L) =

X

(Dui )∂ i .

(1.6)

i<M

¯ is the unit of the algebra C. ˇ We could have used as well the Notice that I = D∂ −1 D ¯ ¯ ˇ ˇ algebra C of 9DOs satisfying DL = LD = 0. Notice that the product of an element in Cˇ ¯ tD ∈ by an element in C¯ vanishes. In fact Cˇ and C¯ are related by transposition, Lˇ t = −DL ¯ ˇ ¯ C. Although the transposition leads from C to C, there exists an anti-involution which ˇ It is given by acts inside C. ˇ = DLˇ t ∂ −1 D, ¯ τ (Lˇ 1 Lˇ 2 ) = τ (Lˇ 2 )τ (Lˇ 1 ). τ (L)

(1.7)

Notice that it does not make sense in the algebra Cˇ to multiply a 9DO by a function. However, it is possible to multiply on the left by a chiral function φ, Dφ = 0, ¯ ¯ = λ(φ)L, ˇ λ(φ) ≡ Dφ∂ −1 D, φLˇ = DφLD

(1.8)

¯ D ¯ φ¯ = 0, and on the right by an antichiral function φ, ¯ φ), ¯ λ( ¯ φ) ¯ ≡ D∂ −1 φ¯ D. ¯ = Lˇ λ( ¯ Lˇ φ¯ = DLφ¯ D

(1.9)

We define the residue of the pseudo-differential operator Lˇ by resLˇ = u−1 [17]. The ¯ The trace of Lˇ is ˇ Lˇ 0 ] = Dω¯ + Dω. residue of a commutator is a total derivative, res[L, the integral of the residue Z ˇ ˇ Tr[L, ˇ Lˇ 0 ] = 0. TrL = d3 x resL, (1.10) Cˇ can be divided into two proper subalgebras Cˇ = Cˇ+ ⊕ Cˇ− , where Lˇ is in Cˇ+ if L is a differential operator and Lˇ is in Cˇ− if L is a strictly pseudo-differential operator (M = 0 in (1.4)). We shall note ¯ ∈ Cˇ+ , Lˇ − = DL− D ¯ ∈ Cˇ− . Lˇ = Lˇ + + Lˇ − , Lˇ + = DL+ D

(1.11)

Here an important difference with the usual bosonic and N =R 1 cases occurs. For ˇ res(Lˇ 0 ) 6= 0. any two 9DOs Lˇ and Lˇ 0 in Cˇ one has Tr(Lˇ − Lˇ 0− ) = d3 x res(L) While Cˇ+ is an isotropic subalgebra, Cˇ− is not. One important consequence of this fact 1

Operators of this type were first considered in [14]

398


ˇ = 1 (Lˇ + − Lˇ − ), then R is a is that if one defines the endomorphism R of Cˇ by R(L) 2 non-antisymmetric classical r matrix, Z 0 0 ˇ ˇ ˇ ˇ (1.12) Tr(R(L)L + LR(L )) = − d3 x resLˇ resLˇ 0 , ˇ Yˇ ]. ˇ Yˇ ] + [X, ˇ R(Yˇ )] = [R(X), ˇ R(Yˇ )] + 1 [X, R [R(X), 4

(1.13)

Notice that a non-antisymmetric r matrix in the context of bosonic KP Lax equations first appeared in [25].

KP equations. Let us now write the evolution equations of the N = 2 supersymmetric ¯ in Cˇ of the form KP hierarchy. We consider operators Lˇ = DLD L = ∂ n−1 +

∞ X

Vi ∂ n−i−1 .

(1.14)

¯ Wi ∂ −i )D,

(1.15)

i=1

Lˇ has a unique nth root in Cˇ of the form 1 Lˇ n = D(1 +

∞ X i=1

and we are led to consider the commuting flows k ∂ ˇ ˇ = [R(Lˇ nk ), L]. ˇ L = [(Lˇ n )+ , L] ∂tk

(1.16)

There are symmetries of these equations which may be described as follows. Let us first introduce a chiral, Grassmann even superfield ϕ which satisfies k ∂ ϕ = (Lˇ n )+ .ϕ, ∂tk

(1.17)

where the right-hand side is the chiral field obtained by acting with the differential k operator (Lˇ n )+ on the field ϕ. Then the transformed operator ˇ ˇ = λ(ϕ−1 )Lλ(ϕ) s(L)

(1.18)

ˇ satisfies an evolution equation of the same form (1.16) as that of L. We may also consider an antichiral, Grassmann odd superfield χ¯ which satisfies k ∂ χ¯ = −(Lˇ n )t+ .χ. ¯ ∂tk

(1.19)

ˇ ˇ = (−1)n λ((Dχ) ¯ −1 )τ (L)λ(D χ) ¯ σ(L)

(1.20)

Then the transformed operator

ˇ with the direction satisfies an evolution equation of the same form (1.16) as that of L, of time reversed.


399

Poisson brackets. The Lax Eq. (1.16) are bi-hamiltonian with respect to two compatible Poisson brackets which we now exhibit. Let Xˇ be some 9DO in Cˇ with coefficients ˇ = independent of the phase space fields {Vi }, then define the linear functional lXˇ (L) ˇ ˇ Tr(LX). The generalization of the first Gelfand-Dickey bracket is obvious and reads ˇ = Tr L[ ˇ Xˇ + , Yˇ+ ] − L[ ˇ Xˇ − , Yˇ− ] . (1.21) {lXˇ , lYˇ }(1) (L) This is nothing but the linear bracket associated with the matrix R. Now we turn to the construction of the second bracket. It will turn out more complicated than the standard Gelfand-Dickey bracket because of the non-antisymmetry of the r matrix. An analogous situation in the bosonic case is studied in [26, 24]. We finally found two different possibilities. In order to write them down, we need to be able to separate the residue of a 9DO in Cˇ into a chiral and an antichiral part. For an arbitrary superfield H(x), we define ¯ 8[H] ¯ ¯ H = 8[H] + 8[H], D8[H] = 0, D = 0.

(1.22)

ˇ An explicit form may be chosen as This is not a local operation in C. Z Z ¯ ¯ ¯ = DD d3 x0 1(x − x0 )H(x0 ), (1.23) 8[H] = DD d3 x0 1(x − x0 )H(x0 ), 8[H] where 1 is the distribution 1(x − x0 ) = (θ − θ0 )(θ¯ − θ¯0 )(x − x0 ), ∂(x − x0 ) = δ(x − x0 ), (x − x0 ) = −(x0 − x).

(1.24)

ˇ X]] ˇ In the following, we shall use the short-hand notations 8[ res[L, = 8Xˇ , ˇ ˇ ¯ ¯ 8[ res[L, X]] = 8Xˇ . In general, 8Xˇ will not satisfy the same boundary conditions as the phase space fields do. However, we noted earlier that in the case of a commutator, ˇ X] ˇ = Dω¯ + Dω. ¯ Here ω and ω¯ are differential the residue is a total derivative, res[L, ¯ − α, ¯ ˇ = Dω polynomials in the fields. Then one easily shows that 8Xˇ = Dω¯ + α, 8 X ¯ Up to this where α is a constant reflecting the arbitrariness in the definition of 8, 8. constant, 8Xˇ will respect the boundary conditions. We are now in a position to write the two possibilities for the second bracket as ˇ = Tr Lˇ X( ˇ Lˇ Yˇ )+ − Xˇ L( ˇ Yˇ L) ˇ + + 8 ˇ Lˇ Xˇ + Xˇ Lˇ 8 ¯ ˇ , (1.25) {lXˇ , lYˇ }a(2) (L) Y Y and ˇ = Tr Lˇ X( ˇ Lˇ Yˇ )+ − Xˇ L( ˇ Yˇ L) ˇ + + 8 ˇ Xˇ Lˇ + Lˇ Xˇ 8 ¯ ˇ . {lXˇ , lYˇ }b(2) (L) Y Y

(1.26)

These expressions do not depend on the arbitrary constant α. Checking the antisymmetry of the Poisson brackets and the Jacobi identity can be done with a little effort. As usual, the first bracket is a linearization of the two quadratic ones, that is to say −1 ¯ ˇ ˇ ˇ D) = {lXˇ , lYˇ }a,b {lXˇ , lYˇ }a,b ˇ , lYˇ }(1) (L), (2) (L + zD∂ (2) (L) + z{lX

(1.27)

and the linear bracket is compatible with each of the two quadratic brackets. k Introducing the hamiltonians Hk = nk Tr(Lˇ n ), the KP evolution Eq. (1.16) may be written as ˇ = {l ˇ , Hk+n }(1) (L) ˇ = {l ˇ , Hk }a,b (L). ˇ (1.28) ∂tk lXˇ (L) (2) X X

400


The Poisson brackets (1.25,1.26) may be put in the general abcd form introduced in [27] ˇ = Tr Lˇ Xa( ˇ Lˇ Yˇ ) + Xˇ Lb( ˇ Lˇ Yˇ ) − Lˇ Xc( ˇ Yˇ L) ˇ − Xˇ Ld( ˇ Yˇ L) ˇ , (1.29) {lXˇ , lYˇ }a(2) (L) ˇ = Tr Lˇ Xd( ˇ Lˇ Yˇ ) + Xˇ Lc( ˇ Lˇ Yˇ ) − Lˇ Xb( ˇ Yˇ L) ˇ − Xˇ La( ˇ Yˇ L) ˇ . (1.30) {l ˇ , l ˇ }b(2) (L) X

Y

ˇ Indeed However, the price to pay is that a, b, c, d are non-local endomorphisms of C. their expressions are ¯ 8[ ˇ = λ( ˇ ¯ resX]), a = R + c, b(X) ˇ = λ(8[ resX]), ˇ c(X) d = R + b.

(1.31) (1.32)

Using (1.12,1.13), one easily checks that a, d are non-local antisymmetric r matrices and that the following two compatibility relations hold: ˇ c(Yˇ )] − a([X, ˇ c(Yˇ )]) + c([b(X), ˇ Yˇ ]) = 0, [a(X), ˇ ˇ ˇ ˇ ˇ Yˇ ]) = 0. [d(X), b(Y )] − d([X, b(Y )]) + b([c(X),

(1.33)

These are sufficient conditions for {, }a(2) and {, }b(2) to be Poisson brackets. Poisson maps. Before turning to the study of the reductions of the KP hierarchies, let us exhibit some relations between the two quadratic brackets. Due to the abcd structure ˇ = Lˇ −1 is an anti-Poisson map of the brackets, the inversion I(L) {lXˇ ◦ I, lYˇ ◦ I}a(2) = −{lXˇ , lYˇ }b(2) ◦ I.

(1.34)

ˇ Let us now consider the invertible map in C, ¯ ˇ = ∂ −1 τ (L) ˇ = D∂ −1 Lˇ t ∂ −1 D. p(L)

(1.35)

A straightforward calculation leads to {lXˇ ◦ p, lYˇ ◦ p}a(2) = −{lXˇ , lYˇ }b(2) ◦ p,

(1.36)

which gives another equivalence relation between (1.25) and (1.26). However there is k k ˇ n−1 no relation between the hamiltonians Tr(Lˇ n ) and Tr(p(L) ). There is another relation between the two brackets, which involves the chiral superfield R ϕ satisfying the evolution Eq. (1.17). Let us introduce the linear functional lt = d3 x(tϕ), where t(x) is a Grassmann even superfield. We consider an enlarged phase space including ϕ, and extend the Poisson bracket (1.25) to this phase space by Z ˇ ϕ) = d3 xt((Lˇ Yˇ )+ .ϕ + 8 ˇ ϕ), {lt , lt0 }a(2) = 0. {lt , lYˇ }a(2) (L, (1.37) Y Then one finds

{lXˇ ◦ s, lYˇ ◦ s}a(2) = {lXˇ , lYˇ }b(2) ◦ s,

(1.38)

where the transformation s has been defined in (1.18). Notice that the hamiltonians are k ˇ nk ). invariant functions for the transformation s, Tr(Lˇ n ) = Tr(s(L) A last relation uses the antichiral R superfield χ¯ satisfying the evolution (1.19). Let us ¯ where t¯(x) is a Grassmann odd superfield. introduce the linear functional lt¯ = d3 x(t¯χ),


401

We consider an enlarged phase space including χ, ¯ and extend the Poisson bracket (1.25) to this phase space by R ˇ t+ .χ¯ + 8 ˇ χ) ¯ ˇ χ), ¯ = d3 xt¯(−(Yˇ L) (1.39) {lt¯, lYˇ }a(2) (L, Y ¯ R a 3 ¯ ¯ {lt¯1 , lt¯2 }(2) = −2 d xt1 χ¯ 8[t¯2 χ], ¯ (1.40) ¯ are defined in Eqs. (1.23,1.24). Notice that this is a non-local Poisson where 8, 8 bracket. One finds (1.41) {lXˇ ◦ σ, lYˇ ◦ σ}a(2) = −{lXˇ , lYˇ }b(2) ◦ σ, where the transformation σ has been defined in (1.20).

2. Reductions of the KP Hierarchy In order to obtain consistent reductions of the KP hierarchy, we need to find Poisson submanifolds of the KP phase space. Considering first the quadratic bracket (1.25), we rewrite it as ˇ = TrXξ ˇ a , {lXˇ , lYˇ }a(2) (L) lYˇ ˇ Yˇ L) ˇ + + 8 ˇ Lˇ + Lˇ 8 ¯ ˇ. ξlaYˇ = (Lˇ Yˇ )+ Lˇ − L( Y Y

(2.1)

ξlaYˇ is the hamiltonian vector field associated with the function lYˇ . One easily checks P ¯ It is that if Lˇ has the form (1.14), then for any Yˇ , ξlaYˇ has the form D( i0 . That is to say that the image of an N = 2 differential operator is a strictly differential N = 1 operator, without the non-derivative term. Notice also the useful relations ˇ θ2 =0 , res(L) = (D. res(L))| R 2 R 2 R ˇ = Tr(L) ≡ d x res(L), d x ≡ dxdθ1 , Tr(L)

(4.6) (4.7)

where the residue of the operator L is the coefficient of D1−1 ≡ D1 ∂ −1 . From now on, all expressions will be written in N = 1 superspace, and we drop the index of D1 and θ1 . The KP hierarchy described in Sect. 1 may be described in N = 1 superspace as follows. We consider an operator L of the form L = D2n +

∞ X

wp D2n−p−1

(4.8)

p=1

and consider evolution equations k ∂ n L = [L>0 , L]. ∂tk

(4.9)

This is nothing but the non-standard supersymmetric KP hierarchy described in [4, 3]. p The evolution Eqs. (4.9) admit the conserved quantities Hp = Tr(L n ), and they are bi-hamiltonian. The first Poisson bracket is easily deduced from its N = 2 counterpart (1.21). With lX = Tr(L X), we have {lX , lY }1 (L) = TrL([X >0 , Y >0 ] − [X ≤0 , Y ≤0 ]).

(4.10)

406


As in the N = 2 formalism, this is a standard bracket associated with a non-antisymmetric r matrix. As a consequence, the two quadratic brackets deduced from (1.25) and (1.26) are quite complicated. They involve the quantity ψX defined up to a constant by DψX = res[L , X]. The first one is R {lX , lY }a2 (L) = Tr(L X(L Y )+ − X L(Y L)+ ) + d2 x(−ψY res[L, X] + res[L , Y ] res(X L D−1 ) − res[L , X] res(Y L D−1 )).

(4.11)

The second one becomes {lX , lY }b2 (L) = Tr(L X(L Y )+ − X L(Y L)+ ) + + res[L , Y ] res(L X D

−1

R

d2 x(ψY res[L, X]

) − res[L , X] res(L Y D−1 )),

(4.12)

and already appeared in [6]. It is not a difficult task to obtain the N = 1 restrictions which correspond to the N = 2 conditions (2.2,2.3,2.10,2.16). Some of the lax operators obtained in this way are already known, in particular those satisfying (2.2) from [23] and the lowest order operator coming from (2.3) with odd ϕ and ϕ, ¯ which is the super-NLS Lax operator obtained in [3]. 5. Bosonic Limit of N = 2 KP Hierarchies In this section we wish to study the bosonic limit of the N=2 KP hierarchies described before. A conjecture on this limit for N=2 KdV hierarchies has been given in [5] and we shall verify that it holds in our case. Before starting, let us recall some basic facts about the Poisson bracket structures of KP and non-standard KP hierarchies. We denote by C the algebra of (bosonic) pseudo-differential operators. The trace operation in C will be R denoted by tr(L) = dx res(L). In the study of the KP hierarchy one considers operators of the form ∞ X vi ∂ n−i . (5.1) L = ∂n + i=1

The commuting flows are defined with the help of the antisymmetric r-matrix R defined by R(L) = 21 (L+ − L− ). These flows are hamiltonian with respect to the one parameter family of quadratic Poisson structures {lX , lY }λGD (L) = tr (LX(LY )+ − XL(Y L)+ ) R +λ dx(D−1 res[L, X]) res[L, Y ],

(5.2)

where ∂(D−1 f ) = f . These brackets are local since D−1 is applied to the residue of a commutator, which is a total derivative. They are of the general abcd form with non-local operators aλ = R + λr, bλ = −λr, cλ = λr, dλ = R − λr, where r(L) = D−1 res(L). The Gelfand-Dickey bracket corresponds to λ = 0, and the interchange abcd → dcba simply reverses the sign of λ. For the particular value λ = n1 , and when L has the form (5.1), the field v1 is central in the Poisson algebra and may be set to zero. There is a Poisson map which leads from a generic value of λ to the special value λ = n1 . It reads g(L) = e8 Le−8 , 8 = n1 D−1 v1 . Indeed one finds 1

n ◦ g. {lX ◦ g, lY ◦ g}λGD = {lX , lY }GD

(5.3)


407

For any value of λ, the restriction L = L+ defines a Poisson submanifold. The hierarchy thus obtained is the nth KdV hierarchy. We now turn to the non-standard KP hierarchy. It is constructed with the help of the non-antisymmetric r-matrix R − P0 , where P0 is the projector on the non-differential term in L: P0 (L) ≡ L0 = vn . The equations of the non-standard KP hierarchy are hamiltonian with respect to two quadratic Poisson brackets given by Oevel in [24], {lX , lY }aO (L) = tr(LXa(LY ) + XLb(LY ) − LXc(Y L) − XLd(Y L)),

(5.4)

{lX , lY }bO (L) = tr(LXd(LY ) + XLc(LY ) − LXb(Y L) − XLa(Y L)),

(5.5)

with a = R + r, b = −∂ −1 r∂, c = P0 + r, d = R − P0 − ∂ −1 r∂. The two brackets admit different Poisson subspaces. In the case of bracket (5.4), one may take L = L> and in the case of bracket (5.5), L = L+ + ∂ −1 h. There is a relation between the three brackets (5.2), (5.4), (5.5), which will come out of the bosonic limit of the N = 2 case that we now study. ¯ in Cˇ satisfying the conditions From now on, we restrict to operators Lˇ = DLD ¯ DL|0 = DL|0 = 0, where the limit |0 means that θ and θ¯ are set to zero. This defines a subspace CˇB of Cˇ which is closed under the product. To an operator Lˇ in CˇB we can associate two ordinary operators in C by ˇ = L|0 ∂, µ1 (L) ˇ = L|0 ∂ + (DDL)| ¯ µ2 (L) 0.

(5.6) (5.7)

ˇ = L1 , From now on, whenever possible we shall use the short-hand notations µ1 (L) ˇ = L2 . It is easily checked that µ1 and µ2 are morphisms from CˇB to C and have µ2 (L) the properties µ1 (Lˇ + ) = (L1 )> , µ1 (Lˇ − ) = (L1 )≤ , µ2 (Lˇ + ) = (L2 )+ − (L1 )0 , µ2 (Lˇ − ) = (L2 )− + (L1 )0 TrLˇ = trL2 − trL1 .

(5.8) (5.9) (5.10)

Using this, one easily sees that the N=2 KP Eqs. (1.16) imply k ∂ L1 = [(L1n )> , L1 ], ∂tk k k ∂ L2 = [(L2n )+ − (L1n )0 , L2 ]. ∂tk

(5.11) (5.12)

While one recognizes the non-standard KP flows in (5.11), the interpretation of (5.12) requires more care. From the definitions (5.6,5.7), one finds that L1 and L2 have the same leading and next-to-leading order terms ∂ n + v∂ n−1 + . . .. The flow equation of v, deduced from Eq. (5.11) or equivalently from (5.12), is k ∂ v = n∂(L1 )0n . ∂tk

(5.13)

We may then transform L2 according to g(L2 ) = e8 L2 e−8 , 8 =

1 −1 D v. n

(5.14)

408


Then Eq. (5.12) implies for the image the standard KP equations k ∂ g(L2 ) = [(g(L2 ))+n , g(L2 )]. ∂tk

(5.15)

We shall now see how the quadratic Poisson structures of Eqs. (5.2) and (5.4,5.5) emerge from the quadratic structures (1.25,1.26) of the N=2 KP hierarchy. Using (5.10) we ˇ on CˇB as decompose the linear function lXˇ (L) ˇ = TrXˇ Lˇ = l2 (L2 ) − l1 (L1 ), lXˇ (L) X2 X1 1 lX (L1 )

= trXL1 ,

2 lX (L2 )

(5.16)

= trXL2 .

(5.17)

Then one finds for the first bracket 1 1 , lY1 }a(2) (L1 , L2 ) = −{lX , lY1 }aO (L1 ), {lX 2 2 {lX , lY2 }a(2) (L1 , L2 ) = {lX , lY2 }+1 GD (L2 ),

(5.18)

2 , lY1 }a(2) (L1 , L2 ) = tr (Y L1 )0 + D−1 res[Y, L1 ] [X, L2 ], {lX

(5.19) (5.20)

and for the second bracket 1 1 , lY1 }b(2) (L1 , L2 ) = −{lX , lY1 }bO (L1 ), {lX 2 {lX , lY2 2 , lY1 {lX

2 = {lX , lY2 }−1 GD (L2 ), −1 L1 )0 + D res[Y, L1 ] [X, L2 ].

}b(2) (L1 , L2 )

}b(2) (L1 , L2 )

= tr (Y

(5.21) (5.22) (5.23)

It is not hard to check that these brackets generate the flows (5.11,5.12) by using the k

k

hamiltonian Hk = nk (trL1n − trL2n ). One can extract the Poisson brackets of v with 1 2 functionals lX and lX , 2 a }(2) (L1 , L2 ) = (n − 1) res[X, L2 ], {v, lX 1 a {v, lX }(2) (L1 , L2 )

= n (∂(XL1 )0 + res[X, L1 ]) .

(5.24) (5.25)

The form of the Poisson brackets simplifies by using the map g defined in (5.14). A direct calculation gives 1 1 1 a,b , lY1 }a,b {lX (2) (L1 , L2 ) = −{lX , lY }O (L1 ), 1 n

2 2 2 {lX ◦ g, lY2 ◦ g}a,b (2) (L1 , L2 ) = {lX , lY }GD (g(L2 )), 2 ◦ {lX

g, lY1 }a,b (2) (L1 , L2 )

= 0.

(5.26) (5.27) (5.28)

These are the quadratic Poisson structures generating the non-standard (5.11) and standard (5.15) KP hierarchies, which are now completely decoupled. Analogous considerations yield the corresponding result for the linear Poisson bracket. In Sect. 1, we have shown that there exists an anti-Poisson map p, eq.(1.35), between the brackets a and b. It is natural to investigate the consequences of this property in the bosonic limit. Let us define two invertible maps u and v of C by u(L) = ∂ −1 Lt and v(L) = Lt ∂. Notice that u−1 = −v. Then the following properties hold µ2 ◦ p = u ◦ µ1 , µ1 ◦ p = u ◦ µ2 .

(5.29)


409

It is just a matter of calculations to show, using (5.20) and (5.23), that they lead to the following Poisson properties between the bosonic brackets2 : b {lX ◦ u, lY ◦ u}+1 GD = {lX , lY }O ◦ u,

{lX ◦ v, lY ◦

v}−1 GD

=

{lX , lY }aO

◦ v.

(5.30) (5.31)

Let us now examine the consequences of these results. They clearly imply that the bosonic limit of the N=2 KP hierarchy is composed of two decoupled bosonic hierarchies, namely the standard and non-standard KP hierarchies. The bosonic limit of the N=2 nth KdV corresponding to the reduction (2.2) is composed of the usual nth KdV, g(L2 ) = ∂ n +

n−2 X

un−k ∂ k ,

(5.32)

k=0

the quadratic Poisson structure of which is the Wn algebra, and of a reduction of the non-standard KP hierarchy L1 = ∂ n +

n−1 X

vn−k ∂ k ,

(5.33)

k=1

the quadratic Poisson structure of which is the Wn−1 ⊕ U (1) algebra. The bosonic limit of the N=2 nth KdV corresponding to the reduction (2.10) is composed again of the nth KdV, and of a reduction of the non-standard KP hierarchy L1 = ∂ n +

n−1 X

vn−k ∂ k + ∂ −1 vn+1 ,

(5.34)

k=0

the quadratic Poisson structure of which is Wn+1 ⊕ U (1). This is exactly the conjecture of [5] for the cases considered here. 6. Conclusion An easy generalization of the hierarchies presented in this article would be to consider multi-components KP hierarchies, that is to say replace the fields ϕ and ϕ¯ in (2.3) and (2.16) by a set of n + m fields ϕi and ϕ¯ i , n of them being Grassmann even and the other m being Grassmann odd. For the lowest order case of Eq. (2.3), such a generalization has been considered in [5]. The Lax representation that we propose for such hierarchies has the advantage that one does not need to modify the definition of the residue. For the next to lowest order case of Eq. (2.3), and the lowest order case of Eq. (2.16), it should be possible to obtain in this way hierarchies based on W-superalgebras with an arbitrary number of supersymmetry charges. Little is known about the matrix Lax formulation of the hierarchies presented here. In the case of operators satisfying condition (2.2), such a matrix Lax formulation was constructed in N = 1 superspace by Inami and Kanno [23, 32]. It involves the loop 2 Using these properties and the Poisson property of the transposition T (T (L) = Lt ), {l ◦T, l ◦T }λ X Y GD = −1 Lt ∂, satisfies ˜ ˜ {lX , lY }λ GD ◦ T , it is easily shown that T , the anti-involution of G defined as T (L) = ∂ a,b ˜ {lX ◦ T˜ , lY ◦ T˜ }a,b O = {lX , lY }O ◦ T . Thus Poisson structures are obtained for Kuperschmidt reductions [25].

410


superalgebra based on sl(n|n). What we know about the matrix Lax formulation in N = 2 superspace for hierarchies based on Lax operators satisfying conditions (2.2) or (2.3) will be reported elsewhere. Notice that we obtained the form (2.2) of the scalar Lax operators from a matrix Lax representation, and only later became aware of reference [14] where these operators first appeared. References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32.

Popowicz, Z.: J. Phys. A29, 1281 (1996) Aratyn, H. and Rasinariu, C.: Preprint UICHEP-TH/96-15 and hep-th/9608107 Brunelli,J.C. and Das, A.: Rev. Math. Phys. 7, 1181 (1995) Ghosh, S. and Paul, S.: Phys. Lett. B 341, 293 (1995) Bonora, L., Krivonos, S. and Sorin, A.: Preprint SISSA-56-96-EP and hep-th/9604165 Das, A. and Panda, S.: Mod. Phys. Lett. A 11, 723 (1996) Ivanov, E., Krivonos, S. and Malik, R.P.: Int. J. Mod. Phys. A 10, 253 (1995) Das, A. and Brunelli, J.C.: Phys. Lett. B 337, 303 (1994); Phys. Lett. B 354, 307 (1995); Int. J. Mod. Phys. bf A 10, 4563 (1995); Preprint hep-th/9506096 Krivonos, S. and Sorin, A.: Phys. Lett. B 357, 94 (1995) Krivonos, S., Sorin, A. and Toppan, F.: Phys. Lett. A 206, 146 (1995) Das, A. and Brunelli, J.C.: Mod. Phys. Lett. A 10, 2019 (1995); J. Math. Phys. 36, 268 (1995) Gelfand, I.M. and Dikii, L.A.: Funct. Anal. Appl. 10, 259 (1976) Delduc, F. and Magro, M.: J. Phys. A : Math. Gen. 29, 4987 (1996) Popowicz, Z.: Phys. Lett. B 319, 478 (1993) Semenov-Tian-Shansky, M.A.: Funct. Anal. Appl. 17, 259 (1983) Lu, H., Pope, C.N., Romans, L.J., Shen, X. and Wang, X.J.: Phys. Lett. B 264, 91 (1991) Laberge, C.A., Mathieu, P.: Phys. Lett. B 215, 718 (1988) Labelle, P. and Mathieu, P.: J. Math Phys. 32, 923 (1991) Popowicz, Z.: Phys. Lett. A 174, 411 (1993) Yung, C.M.: Phys. Lett. B 309, 175 (1993) Bellucci, S., Ivanov, E., Krivonos, S. and Pichugin, A.: Phys. Lett. B 312, 463 (1993) Yung, C.M. and Warner, R.C.. J. Math. Phys. 34, 4050 (1993) Inami, T. and Kanno, H.: Int. J. Mod. Phys. A 7,, Suppl. 1A, 419 (1992) Oevel, W.: Phys. Lett. A 186, 79 (1994) Kupershmidt, B.A.: Commun. Math. Phy. 99, 51 (1985) Oevel, W. and Strampp, W.: Commun. Math. Phys. 157, 51 (1993) Freidel, L., Maillet, J.M.: Phys. Lett. B 262, 278 (1991) Roelofs, G. and Kersten, P.: J. Math Phys. 33, 2185 (1992) Delduc, F., Ivanov, E. and Gallot, L.: In preparation Delduc F. and Ivanov, E.: Phys. Lett. B 309, 312 (1993) Delduc, F., Ivanov, E. and Krivonos, S.: J. Math. Phys. 37, 1356 (1996) Inami, T. and Kanno, H.: J. Phys. A 25, 3729 (1992)

Communicated by T. Miwa

Commun. Math. Phys. 190, 411 – 457 (1997)

Communications in


Quantization of Coset Space σ -Models Coupled to Two-Dimensional Gravity D. Korotkin? , H. Samtleben?? II. Institut für Theoretische Physik, Universität Hamburg, Luruper Chaussee 149, D-22761 Hamburg, Germany Received: 25 July 1996 / Accepted: 23 April 1997

Abstract: The mathematical framework for an exact quantization of the two-dimensional coset space σ-models coupled to dilaton gravity, that arise from dimensional reduction of gravity and supergravity theories, is presented. Extending previous results [49] the two-time Hamiltonian formulation is obtained, which describes the complete phase space of the model in the isomonodromic sector. The Dirac brackets arising from the coset constraints are calculated. Their quantization allows to relate exact solutions of the corresponding Wheeler–DeWitt equations to solutions of a modified (Coset-)KnizhnikZamolodchikov system. On the classical level, a set of observables is identified, that is complete for essential sectors of the theory. Quantum counterparts of these observables and their algebraic structure are investigated. Their status in alternative quantization procedures is discussed, employing the link with Hamiltonian Chern–Simons theory.

1. Introduction It is an important class of physical theories, that admit the formulation as a gravity coupled coset space σ-model after dimensional reduction to two dimensions. Including pure gravity and Kaluza-Klein theories as well as extended supergravity theories, in 3+1 dimensions they are described by a set of scalar and vector fields coupled to gravity, where the scalar fields already form a non-linear σ-model. Further reduction is achieved by imposing additional symmetries – manifest by assuming two additional commuting Killing vector fields, for example corresponding to the study of axisymmetric stationary models. ?

On leave of absence from Steklov Mathematical Institute, Fontanka, 27, St.Petersburg 191011, Russia Present address: Max-Planck-Institut für Gravitationsphysik, Albert-Einstein-Institut, Schlaatzweg 1, D14473 Potsdam, Germany ??

412

D. Korotkin, H. Samtleben

This reduction to effectively two dimensions leads to a non-linear σ-model in an enlarged coset space, coupled to two-dimensional gravity and a dilaton field. The arising additional scalar fields that contribute to parametrizing the coset space are a remnant of the original vector fields and of components of the former higher-dimensional metric. For general reason, related to boundedness of the energy, it is the maximal compact subgroup H of G that is divided out in the coset. The first reduction of this type, discovered for pure gravity [33], leads to the simplest coset space SL(2, R)/SO(2). It was generalized up to the case of maximally extended N = 8 supergravity, where the E8(+8) /SO(16) arises [40, 41]. The general proceeding was analyzed in [13, 55]. In [47–49] a program was started to perform an exact quantization of these dimensionally reduced gravity models. Progress has been achieved using methods and techniques similar to those developed in the theory of flat space integrable systems [24, 26, 46]. Despite the fact that dimensional reduction via additional symmetries represents an essential truncation of the theory, these so-called midi-superspace models under investigation are sufficiently complicated to justify the hope that their exact quantization might provide insights into fundamental features of a still outstanding quantized theory of gravitation. In particular and in contrast to previously exactly quantized minisuperspace models, they exhibit an infinite number of degrees of freedom, which is broadly accepted to be a sine qua non for any significant model of quantum gravity (compare [52, 5] for a discussion of this point in the context of related models). One of the final purposes of this approach is the identification of exact quantum states, whose classical limit corresponds to the known classical solutions. For pure gravity this includes the quantum analogue of the Kerr solution describing the rotating black hole; for extended supergravities recently discovered corresponding solutions have been of particular interest exhibiting fundamental duality symmetries [17, 16], such that their exact quantum counterparts should shed further light onto the role of these symmetries in a quantized theory. The main ideas of the new framework are the following: Exploiting the integrability of the model, new fundamental variables have been identified (certain components of the flat connection of the auxiliary linear system continued into the plane of the spectral parameter), in terms of which the “right” and “left” moving sectors have been completely decoupled [47]. The quantization is further performed in the framework of a generalized “two-time” Hamiltonian formalism, i.e. these sectors are quantized independently. The whole procedure has been established in that sector of the theory, where the new fundamental connection exhibits simple poles at fixed singularities. In the present paper we achieve the consistent general formulation of the desired coset-models in this approach. So far the formalism was mainly elaborated in the technically simplified principal model, where the coset G/H had been replaced by the group G itself. For the coset model the phase space spanned by the new variables is too large and must be restricted by proper constraints. Their canonical treatment requires a Dirac procedure, which effectively reduces the degrees of freedom. It leads to a consistent analogous Hamiltonian formulation of the coset model allowing canonical quantization. Exact quantum states are shown to be in correspondence to solutions of a modified (Coset-)Knizhnik-Zamolodchikov system. Moreover, the formalism is kept general as long as possible, without restricting to the simple pole sector. In particular, we completely extend it to the case of connections with poles of arbitrary high order at fixed singularities, which span the isomonodromic sector of the theory. Generalization of the scheme to the full phase space is sketched in Appendix A. The other main result of this paper is the identification of classical and quantum observables. For the above mentioned simple pole sector, these sets are complete. Natural

Quantization of Coset Space σ-Models Coupled to Two-Dimensional Gravity

413

candidates for classical observables are the monodromies of the fundamental connection in the plane of the variable spectral parameter. We determine their (quadratic) Poisson structure. After quantization of the connection quantum counterparts of these monodromy matrices are identified as monodromies of certain higher-dimensional KZ systems. Following Drinfeld [22] their algebraic structure may be determined to build some quasi-associative braided bialgebra. The classical limit of this structure coincides with the Poisson algebra of the classical monodromies found above. In this sense, complete consistency of the picture is established. The weakened coassociativity leads to a quantum algebra of observables with operator-valued structure constants. This might have been avoided by directly quantizing the regularized classical algebra of monodromies, as is common in Chern–Simons theory [2, 3], instead of recovering quantum monodromies in the picture of the quantized connection. We discuss this link and its consequences. The treatment of observables is performed in great detail for the simplified principal model mentioned above. This is for the sake of clarity of the presentation, since the arising difficulties in the coset case deserve an extra study in the sequel. However, the main tools and strategies that will finally be required can already and more clearly be developed and used in this context. The modifications required for the coset model are clarified afterwards. The paper is organized as follows. In Chap.2 we start by introducing the known linear system associated to the model and describe the related on-shell conformal symmetry. A short summary and generalization of the results from [47, 49] about the classical treatment of the principal model is given without restricting to the simple pole sector. The link to Hamiltonian Chern–Simons theory is discussed, where the same holomorphic Poisson structure is obtained by symplectic reduction of the complexified phase space in a holomorphic gauge fixing. This link in particular enables us to relate the status of observables in both theories. Observables in terms of monodromy matrices are identified; their Poisson structure is calculated and discussed. The technical part of the calculation is shifted into Appendix B. Chapter 3 treats the quantization of the principal model. We first briefly repeat the quantization of the simple pole sector of this model [48, 49]. Quantum analogues of the monodromy matrices are defined. Their algebraic structure and its classical limit are determined and shown to be consistent with the classical results. The alternative treatment in Chern–Simons theory and the identification of quantum observables in these approaches are discussed. In Chap.4 we finally present the generalization of the formalism to the coset models. A Hamiltonian formulation in terms of modified fundamental variables is provided. The coset constraints are explicitly solved by a Dirac procedure. Furthermore, we quantize the simple pole sector of the coset model, showing that solutions of a modified Knizhnik-Zamolodchikov system identify physical quantum states, i.e. exact solutions of the Wheeler–DeWitt equations. We close with a sketch of how to employ the whole machinery to the simplest case of pure four-dimensional axisymmetric stationary gravity. In particular, the existence of normalizable quantum states is shown. Chapter 5 briefly summarizes the open problems for future work. 2. Principal σ-Model Coupled to Two-Dimensional Dilaton Gravity The model to be studied in this paper is described by the two-dimensional Lagrangian (2.1) L = eρ R + hµν tr[∂µ gg −1 ∂ν gg −1 ] .

414


p Here, hµν is the 2D (“worldsheet”) metric, e = | det h|, R is the Gaussian curvature of hµν , ρ ∈ R is the dilaton field and g takes values in some real coset space G/H, where H is the maximal compact subgroup of G. The currents ∂µ gg −1 therefore live in a fixed faithful representation of the algebra g on some auxiliary d0 -dimensional space V0 . It is well known that this type of model arises from the dimensional reduction of higher dimensional gravities [13, 55], e.g. from 4D gravity in the presence of two commuting Killing vectors [12]. In the latter case which describes axisymmetric stationary gravity, the relevant symmetric space is G/H = SL(2, R)/SO(2). Let us first briefly describe further reduction of the Lagrangian (2.1) by means of gauge fixing and state the resulting equations of motion. The residual freedom of coordinate transformations can be used to achieve conformal gauge of the 2D metric hµν : ¯ z¯ , hµν dxµ dxν = h(z, z)dzd with world-sheet coordinates z, z, ¯ which reduces the Lagrangian to L = ρ hR + tr[gz g −1 gz¯ g −1 ] .

(2.2)

In this gauge the Gaussian curvature takes the form R = (log h)zz¯ /h. The equation of motion for ρ derived from (2.2) (2.3) ρzz¯ = 0 is solved by ρ(z, z) ¯ = Im ξ(z), where ξ(z) is a (locally) holomorphic function. Then the equations of motion for g coming from (2.2) read ¯ z¯ g −1 = 0 . ¯ z g −1 + (ξ − ξ)g (2.4) (ξ − ξ)g z¯ z We can further specialize the gauge by identifying ξ, ξ¯ with the worldsheet coordinates. Then (2.4) turns into ¯ ξ¯ g −1 = 0 . ¯ ξ g −1 ¯ + (ξ − ξ)g (2.5) (ξ − ξ)g ξ ξ The equations of motion for the conformal factor are derived from the original Lagrangian (2.1): ξ − ξ¯ tr(gξ g −1 )2 and c.c. (2.6) (log h)ξ = 4 Throughout this whole chapter we will for above mentioned reasons of clarity investigate the simplified model, where the symmetric space G/H is replaced by the group G itself. We will refer to this plainer model as the principal model. 2.1. Linear system and on-shell conformal symmetry of the model. The starting point of our treatment is the following well-known linear system associated to Eqs. (2.5) [10, 54]: d9 gξ g −1 = 9, dξ 1−γ

d9 gξ¯ g −1 9, = 1+γ dξ¯

where γ is the spacetime-coordinates dependent “variable spectral parameter” q ξ + ξ¯ 2 ¯ ± (w − ξ)(w − ξ) , w− γ= 2 ξ − ξ¯

(2.7)

(2.8)


415

or alternatively w ∈ C may be interpreted as a hidden “constant spectral parameter”; ¯ is a GC -valued function. The variable spectral parameter γ lives on the twofold 9(w, ξ, ξ) covering of the complex w-plane, the transition between the sheets being performed by γ 7→ γ1 . It satisfies ∂γ γ 1+γ = , ∂ξ ξ − ξ¯ 1 − γ

∂γ γ 1−γ , = ¯ ¯ ∂ξ ξ − ξ 1 + γ

(2.9)

such that in (2.7) it is ∂ γ 1+γ ∂ d = + , dξ ∂ξ ξ − ξ¯ 1 − γ ∂γ

d ∂ γ 1−γ ∂ . = ¯+ ¯ ¯ dξ ∂ ξ ξ − ξ 1 + γ ∂γ

(2.10)

The linear system (2.7) exists due to the following on-shell Möbius symmetry of equations of motion.1 Theorem 2.1. Let g(z, z), ¯ ρ(z, z) ¯ = Imξ(z) and h(z, z) ¯ be some solution of (2.3), (2.4), (2.6) and 9 be the related solution of the linear system (2.7). Then wξ(z) 1 9(γ) , σ w [ξ] ≡ , σ w [h] ≡ h , (2.11) σ w [g] ≡ 9−1 γ w − ξ(z) also solve (2.4), (2.6). Proof. We have s

w − ξ¯ −1 1 1 −1 9 gξ g 9 , σ [gξ g ] = w−ξ γ γ s w − ξ −1 1 1 w −1 −1 σ [gξ¯ g ] = gξ¯ g 9 . 9 ¯ γ γ w−ξ w

−1

Now fulfillment of (2.4), (2.6) may be checked by straightforward calculation.

The transformations σ w form a one-parametric abelian subgroup of the group SL(2, R) of conformal transformations. We have σ w1 σ w 2 = σ w3 ,

1 1 1 + = . w1 w2 w3

The full Möbius group may be obtained combining transformations σ w with the (essentially trivial) transformations ξ(z) 7→ aξ(z) + b ,

g(z) 7→ g(z) ,

which obviously leave the equations of motion invariant. As a result the action of an arbitrary SL(2, R) Möbius transformation σ on a solution of the equations of motion is wξ(z) 1 +b, g(z, z) ¯ 7→ σ[g] ≡ 9−1 9(γ) , (2.12) ξ(z) 7→ σ[ξ] ≡ a w − ξ(z) γ 1

A similar symmetry exists in the theory of Bianchi surfaces [11].

416


leaving h invariant. In addition to the Möbius symmetry (2.12) the model possesses the symmetry corresponding to an arbitrary holomorphic change of the worldsheet coordinate z (this symmetry disappears if we identify z with ξ). Combining this symmetry with (2.12) reveals the following Möbius symmetry of Eq. (2.5) ¯ ¯ 7→ σ[g] w(ξ − b) , w(ξ − b) , (2.13) g(ξ, ξ) aw + ξ − b aw + ξ¯ − b w(ξ¯ − b) w(ξ − b) ¯ , h(ξ, ξ) 7→ h . (2.14) aw + ξ − b aw + ξ¯ − b Infinitesimally, the symmetry (2.13) is a subalgebra of the Virasoro symmetry of (2.5) [42]. Note 2.1. It is known that the Ernst equation (2.4) for SL(2, R)/SO(2) may be rewritten as a fourth order differential equation in terms of the conformal factor h. The transformation (2.14) shows that this equation is, in contrast to the Ernst equation itself, Möbius ¯ invariant in the ξ, ξ-plane. 2.2. Two-time Hamiltonian formulation of the principal model. Here we present a generalized version of the “two-time” Hamiltonian formalism of the principal σ-model proposed in [47, 48]. It is the strategy to define a new set of fundamental variables by means of exploiting the corresponding linear system. These variables may be equipped with a Poisson structure such that a two-time Hamiltonian formulation of the model is achieved. 2.2.1. New fundamental variables and the isomonodromic sector. The main objects we are going to consider as fundamental variables in the sequel are certain components of the following one-form: ¯ be a solution of the linear system (2.7). Then the g-valued Definition 2.1. Let 9(γ, ξ, ξ) one-form A is defined as (2.15) A := d99−1 . In particular, we are interested in the components ¯ ¯ A = Aγ dγ + Aξ dξ + Aξ dξ¯ = Aw dw + A˜ ξ dξ + A˜ ξ dξ¯ ,

(2.16)

¯ and (w, ξ, ξ) ¯ respectively are considered to be independent variables. In where (γ, ξ, ξ) the sequel we shall use the shortened notation A ≡ Aγ . Moreover, we will restrict our study to that sector of the theory, where A is a singlevalued meromorphic function of γ, i.e. that also A is single-valued and meromorphic in γ. A solution 9 of (2.7) with this property is called isomonodromic, as its monodromies in the γ-plane then have no w-dependence due to (2.15). Further on, we immediately get the following relations: Lemma 2.1. The relation of the original field g to A is given by 2 2 −1 ¯ ¯ , g g = , A(γ, ξ, ξ) A(γ, ξ, ξ) gξ g −1 = ¯ ξ ξ − ξ¯ ξ − ξ¯ γ=1 γ=−1

(2.17)

as a corollary of (2.7) and (2.10). Moreover, the linear system (2.7) and definition (2.16) imply


417

∂γ A, ∂w 2A(1) 2A(−1) ¯ A˜ ξ = A˜ ξ = ¯ − γ) , ¯ + γ) , (ξ − ξ)(1 (ξ − ξ)(1 2A(−1) + γ(1 − γ)A(γ) 2A(1) − γ(1 + γ)A(γ) ¯ Aξ = , Aξ = . ¯ − γ) ¯ + γ) (ξ − ξ)(1 (ξ − ξ)(1

Aw =

(2.18)

Note 2.2. In the sequel A(γ) will be exploited as the basic fundamental variable. At this point we should stress the difference between the real group G (with algebra g) entering the physical models and the related complexified group GC (with algebra gC ). Namely, it is A(γ ∈ C) ∈ gC , whereas we will additionally impose the “imaginary cut” iA(γ ∈ iR) ∈ g. Since A(γ) is a (locally) holomorphic function, this implies A(γ) ¯ = −A∗ (−γ) ,

(2.19)

where ∗ denotes the anti-linear conjugation on gC defined by the real form g. Together with (2.17) this ensures g ∈ G. Note 2.3. The linear system (2.7) admits the normalization 9(γ = ∞) = I ,

(2.20)

which implies regularity of A at infinity: A∞ := lim γA(γ) = 0 .

(2.21)

γ→∞

Furthermore, (2.7) implies an additional relation between the original field g and the 9-function: (2.22) 9(γ = 0) = gC0 , where C0 is a constant matrix in the isomonodromic sector. The definition of A as pure gauge (2.15) implies integrability conditions on its components, which in particular give rise to the following closed system for A(γ): ∂Aξ ∂A = [Aξ , A] + , ∂ξ ∂γ

¯

∂A ∂Aξ ξ¯ . , A] + = [A ∂γ ∂ ξ¯

(2.23)

The main advantage of the system (2.23) in comparison with the original equations of motion in terms of g (2.5) is that the dependence on ξ and ξ¯ is now completely decoupled. Once the system (2.23) is solved, it is easy to check that Eqs. (2.17) are compatible and the field g restored by means of them satisfies (2.5). The remaining set of equations of the principal model (2.6), which concern the conformal factor h, may be rewritten taking into account (2.17) as the following constraints: C ξ := −(log h)ξ +

1 trA2 (1) = 0 , ξ − ξ¯

1 ¯ C ξ := −(log h)ξ¯ + ¯ trA2 (−1) = 0 . (2.24) ξ−ξ

2.2.2. Poisson structure and Hamiltonians. The described decoupling of ξ and ξ¯ dependence allows to treat the system (2.23), (2.24) in the framework of a manifestly covariant

418


two-time Hamiltonian formalism, where the field A(γ), the “times” ξ, ξ¯ and the fields (log h)ξ , (log h)ξ¯ are considered as new basic variables. The spirit of the generalized “several-times” Hamiltonian formalism is described for example in [44, 18]. ¯ Poisson structure: For this purpose we equip A(γ) with the following (equal ξ, ξ) Definition 2.2. Define the Poisson bracket on A(γ) ≡ Aa (γ)ta as: Ac (γ) − Ac (µ) , Aa (γ) , Ab (µ) = −f abc γ−µ

(2.25)

f abc being the structure constants of g.2 The relations

ξ 1 2 (1) = A (γ) , A(γ) , trA ξ − ξ¯ h i 1 ¯ A(γ) , ¯ trA2 (−1) = Aξ (γ) , A(γ) , ξ−ξ A(γ) ,

(2.26)

compared with the equations of motion (2.23) give rise to ¯ Definition 2.3. We call the (ξ, ξ)-dynamics that is generated by H ξ :=

1 trA2 (1) , ξ − ξ¯

1 ¯ H ξ := ¯ trA2 (−1) , ξ−ξ

(2.27)

¯ the implicit time dependence of the fields. The remaining (ξ, ξ)-dynamics is referred to as explicit time dependence. In fact, the motivation for this definition arises from [47, 48], where it has been shown that in essential sectors of the theory (simple pole singularities in the connection A), it is possible to identify a complete set of explicitly time-independent variables. They may be ¯ treated as canonical variables then, such that H ξ and H ξ serve as complete Hamiltonians. This will be illustrated and generalized in the next subsections for the isomonodromic sector of the theory, where A(γ) is assumed to be a meromorphic function of γ. The extension of this framework to the whole phase space of arbitrary connections A, that is strongly inspired from the treatment of the simple pole case, is sketched in Appendix A. The variables A(γ) themselves are explicitly time-dependent in general according to (2.23) and (2.26). Note 2.4. The quantities B(w) = Aw (γ) + Aw

∂γ 1 1 1 ≡ A(γ) − 2 A γ ∂w γ γ

(2.28)

build a rather simple set of explicitly time-independent variables, carrying half of the degrees of freedom of the full phase space. This may be checked by straightforward calculation. Moreover, (2.25) implies {B a (w), B b (v)} = −f abc

B c (w) − B c (v) . w−v

(2.29)

2 Assuming g to be semisimple, the existence of the symmetric Killing-form enables us to arbitrarily pull up and down the algebra indices.


419

Note 2.5. From the mathematical point of view, (2.25) is a rather natural structure [26], even though it is not canonically derived from the Lagrangian (2.1). It may however be obtained from an alternative Chern–Simons Lagrangian formulation of the model, as is sketched in the following section. Comparison to the conventional Poisson structure of (2.1) should be worked out on the space of observables, where due to spacetimediffeomorphism invariance no principal difference between one- and two-time structures appears. ¯ In order to gain a Hamiltonian description for the total (ξ, ξ)-dependence of the fields, we employ a full covariant treatment by additionally introducing conjugate momenta ¯ for the canonical “time” variables ξ and ξ. ¯ Poisson bracket Definition 2.4. Define the (equal ξ, ξ) o n o n ¯ −(log h)ξ¯ = 1 , ξ, −(log h)ξ = ξ,

(2.30)

where in the sense of a covariant theory only the explicit appearance of ξ, ξ¯ (compare Def. 2.3) is covered by treating these previous “times” as additional canonical variables, which obey the bracket (2.30). This identification of the conjugate momenta for the explicitly appearing times with the logarithmic derivatives of the conformal factor is motivated from the Lagrangian (2.2) [56]. It implies that the dynamics in ξ and ξ¯ directions is completely given by the ¯ Hamiltonian constraints C ξ and C ξ defined in (2.24), i.e. for any functional F we have dF = {F, C ξ } , dξ

dF ¯ = {F, C ξ } . dξ¯

(2.31)

The remaining equations of motion (2.24) mean weak vanishing of the Hamiltonians. This phenomena always arises in the framework of covariant Hamiltonian formalism when time is treated as canonical variable in its own right canonically conjugated to the Hamiltonian [35]; it is a standard way to take into account possible reparametrization of the time variable. 2.2.3. First order poles. In this simplest case considered in [47, 49] we assume that A(γ) has only simple poles, i.e. N X ¯ Aj (ξ, ξ) , (2.32) A(γ) = γ − γj j=1

¯ wj ∈ C. Then where according to (2.7) all γj should satisfy (2.9), i.e. γj = γ(wj , ξ, ξ), the equations of motion (2.23) yield 2 X ∂Aj ∂Aj 2 X [Ak , Aj ] [Ak , Aj ] = , , = ¯ ¯ ¯ ∂ξ (1 − γk )(1 − γj ) (1 + γk )(1 + γj ) ξ−ξ ξ−ξ ∂ξ k6=j

k6=j

(2.33) and the Poisson brackets (2.25) and (2.30) reduce to {Aai , Abj } = δij f abc Aj ,

(2.34)

{Aj , (log h)ξ } = {Aj , (log h)ξ¯ } = 0 , {γj , (log h)ξ } = −∂ξ γj , {γj , (log h)ξ¯ } = −∂ξ¯ γj ,

(2.35)

420


i.e. in this case, the residues Aj together with the set of (hidden constant) positions of the singularities {wj } give the full set of explicitly time-independent variables. 2.2.4. Higher order poles. We can also generalize the described formulation to the case, where A(γ) has higher order poles in the γ-plane: A(γ) =

rj N X ¯ X Akj (ξ, ξ) . (γ − γj )k

(2.36)

j=1 k=1

The Poisson structure (2.25) in terms of Akj has the following form: )c for k + l − 1 ≤ rj δij f abc (Ak+l−1 j , {(Aki )a , (Alj )b } = 0 for k + l − 1 > rj

(2.37)

building a set of mutually commuting truncated half affine algebras. However, it turns out that for rj > 1 the variables Akj for k = 1, . . . rj − 1 have non-trivial Poisson brackets with (log h)ξ and (log h)ξ¯ , and, therefore, are not explicitly time-independent. The problem of identification of explicitly time-independent variables can be solved in the following way. Consider Aw (γ) =

∂γ A(γ) , ∂w

which as a function of w is meromorphic on the twofold covering of the w-plane. Parametrize the local expansion of Aw around one of its singularities γj as w

A (γ) =

rj X k=1

A(w)k j + O((w − wj )0 ) (w − wj )k

for

γ ∼ γj .

(2.38)

We can now formulate Theorem 2.2. The coefficients A(w)k of the local expansion of Aw have no explicit time j dependence, i.e. = {A(w)k , Hξ} , ∂ξ A(w)k j j

¯

∂ξ¯ A(w)k = {A(w)k , Hξ} . j j

They satisfy the same Poisson structure as the Akj (2.37): n o δij f abc (Aj(w)k+l−1 )c for k + l − 1 ≤ rj (w)l b a (A(w)k ) , (A ) . = i j 0 for k + l − 1 > rj

(2.39)

(2.40)

Proof. Let us first prove (2.39). From (2.25) and the definition of H ξ it follows that 2trA2 (1) {Aw (γ), H ξ } = ∂w γA(γ) , ¯ (ξ − ξ) ∂w γ 2A(1) , A(γ) = [A˜ ξ (γ), Aw (γ)] , = ¯ 1−γ (ξ − ξ) whereas from (2.15) the ξ-dynamics of Aw is determined to be 2A(1) . ∂ξ Aw = [A˜ ξ (γ), Aw (γ)] + ∂w A˜ ξ (γ) = [A˜ ξ (γ), Aw (γ)] + ∂w γ (1 − γ)2


421

As the last term is regular in γ = γj , comparison of the two previous lines shows that the ξ-dependence of the coefficients in the w-expansion around these points is completely generated by H ξ , which proves (2.39). To show the Poisson structure (2.40), one has to consider the corresponding coefficients of singularities in (2.25). For i 6= j, the result follows directly from (2.37), as A(w)k j is a function of Alj , l = 1, . . . , rj only, such that locality remains. For i = j, one may first extract from (2.25) the behavior of {Aw (γ), Aw (µ)} around γ ∼ γj : {(Aw )a (γ), (Aw )b (µ)} = −∂w γ∂v µf abc

Ac (γ) − Ac (µ) (Aw )c (γ) ∼ f abc ∂v µ , γ−µ µ−γ

to then further study the asymptotical behavior µ ∼ γ: {(Aw )a (γ), (Aw )b (µ)} ∼ f abc

(Aw )c (γ) , v−w

such that (2.40) for i = j follows in the same way, as does (2.37) from (2.25).

Thus, also in this case we have succeeded in identifying a complete set of canonical explicitly time-independent variables. are related to the Alj by Note 2.6. Comparing (2.36) with (2.38) shows that the A(w)k j means of explicit recurrent relations that may be derived, expanding (2.36) in (w−wj ). Then A(w)k is a function of Alj with k ≤ l ≤ rj . In particular, the residues of highest j order are related by r −1 ∂γj j (w)r r Aj j = Aj j , ∂wj which explains for example, why this difference was not relevant in the case of simple poles in the last subsection. 2.3. The link to Hamiltonian Chern–Simons theory. The treatment of the principal model of dimensionally reduced gravity in the previous section was inspired by the fact that the equations of motion were obtained as compatibility conditions (2.23) of special linear systems. The interpretation of these equations as zero curvature conditions suggests a link with Chern–Simons theory whose equations of motion also state the vanishing of some curvature. The Chern–Simons gauge connection then lives on a space locally parametrized simultaneously by the spectral parameter γ and one of the true space time coordinates playing the role of time. The relevant Chern–Simons action reads Z 2 k tr[AdA − A3 ] , (2.41) S= 4π M 3 where A is a connection on a trivial G principal bundle over the 3-dimensional manifold M . In the case of interest here, the manifold M is the direct product of the Riemann surface Σ, on which the spectral parameter γ lives, and the real axis, which is interpreted as time. For this configuration, Chern–Simons theory is known to have a Hamiltonian formulation. Choosing proper boundary conditions on the connection, the action may be rewritten in the form

422


S=−

k 4π

Z tr[A∂t A]dt + M

k 2π

Z tr[A0 (dA − A2 )]dt .

(2.42)

M

The connection has been split A = A + A0 dt into spatial and time components, where A0 now plays the role of a Lagrangian multiplier for the constraint F = dA − A2 = 0 .

(2.43)

Usually, A0 is gauged to zero which leads to static components A. In particular, any singularities of the connection are time-independent in this case and treated by inserting static Wilson lines in the action (2.42) [61, 23]. A nontrivial and somewhat singular gauge for A0 must be chosen, to derive the equations of motion of the described principal model of dimensionally reduced gravity. The further required holomorphic reduction of Chern–Simons theory can still be described for arbitrary gauge fixing of A0 , as the results will be valid in any gauge. 2.3.1. Holomorphic reduction and Poisson bracket of the connection. For the following we first complexify the phase space and thereby also the gauge group. This enlarged gauge freedom may be used for a holomorphic gauge fixing then. Denoting the spatial coordinates which locally parametrize Σ by γ = x+iy, γ¯ = x−iy, k κ dxdy ≡ −2iκ ¯ and splitting the remaining defining the measure as 4π 4π dxdy = 4π dγdγ γ γ¯ dynamical parts of A into A = A dγ + A dγ, ¯ the action (2.42) implies the Poisson structure iπ ¯ ¯ Aγ,b (µ, µ)} ¯ = − δ ab δ (2) (γ − µ) , (2.44) {Aγ,a (γ, γ), κ where here and in the following the δ-function is understood two-dimensional R as a real ¯ (2) (γ) = 1. δ-function: δ (2) (x + iy) ≡ 2i δ(x)δ(y), normalized such that dγdγδ This Poisson structure corresponds to the Atiyah-Bott symplectic form on the space of smooth connections on the Riemann surface Σ [6]: Z k tr δA ∧ δA . = 4π Σ The flatness constraints (2.43) are of the first class with respect to this bracket: {F a (γ, γ), ¯ F b (µ, µ)} ¯ =

iπ abc c f F (γ)δ (2) (γ − µ) , κ

where f abc are the total antisymmetric structure constants of gC . These constraints generate the canonical gauge transformations A 7→ gAg −1 + dgg −1 ,

(2.45)

which leave the symplectic structure invariant. The phase space of the original theory is therefore reduced to the space of flat connections A(γ, γ) ¯ modulo the action of the complex gauge group (2.45). If the singularities of the connection A are restricted to simple poles, this phase space is for instance completely described by the monodromies of the connection. As a first step to explicitly reduce the number of degrees of freedom, we will fix the gauge freedom (2.45) in A, by demanding (2.46) Aγ¯ = 0 , which makes flatness of A(γ, γ) ¯ turn into holomorphy of the surviving component Aγ (γ).


423

Note 2.7. The existence of corresponding gauge transformations is a nontrivial problem. In general, when Aγ¯ is gauged away, Aγ dγ becomes a connection on a nontrivial bundle over Σ. On Riemann surfaces of higher genus, this form of gauge generically leads to multivalued holomorphic quantities exhibiting certain twist properties [50]. On the Riemann sphere the gauge transformations preserving single-valuedness of Aγ dγ at least exist on a dense subspace of connections [6, 31]. For the purpose here, strictly speaking we a priori restrict the phase space to the class of functions on the punctured sphere that allow this gauge fixing. This includes e.g. all the connections with the curvature exhibiting δ-function singularities treated in [23] (gauge fixed to holomorphic connections with simple poles) as well as connections with higher order derivatives of δ-functions in the curvature. This gauge fixing of first-class constraints changes the Poisson structure according to Dirac [19], leading to Theorem 2.3. Let the Poisson structure (2.44) for the connection ¯ ¯ a dγ + Aγ,a (γ, γ)t ¯ a dγ¯ A(γ, γ) ¯ ≡ Aγ,a (γ, γ)t

be restricted by the constraints (2.43) and (2.46). Then the Dirac bracket for the surviving holomorphic components Aa (γ) ≡ Aγ,a (γ) is given by {Aa (γ), Ab (µ)}∗ =

1 abc Ac (γ) − Ac (µ) f . 2κ γ−µ

(2.47)

In this context, the holomorphic structure (2.47) has first been proposed by Fock and Rosly [28]. Proof. The bracket between the constraints and the gauge-fixing condition is of the form ¯ (µ)} = {F a (γ), Aγ,b

iπ ab iπ ¯ δ ∂γ¯ δ (2) (γ − µ) + f abc Aγ,c (γ)δ (2) (γ − µ) . κ κ

(2.48)

On the constraint surface (2.46) this matrix can be inverted using ∂γ¯ γ1 = −2πiδ (2) (γ), which follows from the inhomogeneous Cauchy theorem. The Dirac bracket for the remaining holomorphic variables Aγ (γ) then is {Aγ,a (γ), Aγ,b (µ)}∗Z X dxdxdyd ¯ y¯ =− m,n

¯ (y)} {Aγ,a (γ), F m (x)} {F m (x), Aγ,n

−1

¯ {Aγ,n (y), Aγ,b (µ)} −1 m ¯ ¯ + {Aγ,a (γ), Aγ,n (y)} {Aγ,n (y), F m (x)} {F (x), Aγ,b (µ)} Z iπ X dxdxdyd ¯ y¯ =− κ m δ mb δ (2) (y − µ) δ am ∂x δ (2) (x − γ) + f mac Aγ,c (x)δ (2) (x − γ) 2πi(x − y) am (2) δ δ (γ − y) bm (2) mbc γ,c (2) − δ ∂x δ (x − µ) + f A (x)δ (x − µ) 2πi(x − y) 1 abc Aγ,c (γ) − Aγ,c (µ) = f . 2κ γ−µ

424


Note 2.8. For convenience in concrete calculations we still give this result in tensor notation, as is explicitly explained in [26], where the relation of (2.47) to the corresponding current algebra is discussed. This structure may be put into the form {A(γ) ⊗, A(µ)} = [r(γ − µ), A(γ) ⊗ I + I ⊗ A(µ)] ,

(2.49)

1 a 2 2 with the classical r-matrix r(γ) = − 2κ γ , where = t ⊗ ta is represented as d0 ×d0 matrix here. For the simplest but important case g=sl(2), it is = 21 I ⊗I + Π, with Π being the 4 × 4 permutation operator. The matrix r(γ) satisfies the classical Yang-Baxter equation with spectral parameter

[r 12 (γ − µ), r13 (γ) + r23 (µ)] + [r13 (γ), r23 (µ)] = 0 .

(2.50)

In shortened notation, (2.49) reads ¯

¯

{A(γ)0 , A(µ)0 } = [r(γ − µ), A(γ)0 + A(µ)0 ] ,

(2.51)

¯

with A(γ)0 := A(γ) ⊗ I , A(µ)0 := I ⊗ A(µ) . Note 2.9. In the framework of canonical and geometric quantization of Chern–Simons theory [61, 7, 23, 31], the variables Aγ and Aγ¯ are – according to (2.44) – considered and treated as canonically conjugated coordinate and momentum, respectively. After the holomorphic gauge fixing the surviving variable A(γ) = Aγ (γ) resembles – according to (2.47) – a combination of angular momenta. Note 2.10. The flatness constraints (2.43) have not been totally fixed by the choice of gauge (2.46). Apparently this gauge still admits holomorphic gauge transformations, which on the sphere reduce to constant gauge transformations. This freedom may also be seen from the appearance of ∂γ¯ in the matrix of constraint brackets (2.48), which actually prevents its strict invertibility. This implies the surviving of the (global) firstclass part of the flatness constraint F , which for meromorphic A in the parametrization (2.36) is Z Z X (A1i )a = −2πiAa∞ , (2.52) F a (γ)dγdγ¯ = ∂γ¯ Aa (γ)dγdγ¯ = −2πi i

where A∞ = Aa∞ ta , compare (2.21). Obviously, Aa∞ is a generator of constant gauge transformations in the bracket (2.47). 2.3.2. Embedding the principal model. In this holomorphic structure of Chern–Simons theory the link to the principal model can be established. As a first fact, note that the Dirac bracket (2.47) for κ = − 21 equals the Poisson structure (2.25) that was used for the Hamiltonian formulation of the principal model. The equations of motion from Chern–Simons action (2.41) read ∂t Aγ = ∂γ A0 + [Aγ , A0 ] ,

(2.53)

leading to trivial dynamics in the gauge A0 = 0, whereas for t being replaced by ξ and the special (singular) choice of gauge


A0 (γ) := Aξ (γ) =

425

2Aγ (1) − γ(1 + γ)Aγ (γ) , ¯ − γ) (ξ − ξ)(1

one exactly recovers the equations of motion (2.23). Finally the surviving first-class constraints (2.52) that are due to former flatness on the sphere gain a definite physical meaning in the principal model of dimensionally reduced gravity. Arising there equivalently as regularity conditions in γ ∼ ∞ (2.21), they are directly related to the asymptotical flatness of the corresponding solution g of Einstein’s equations (2.5). As first-class constraints in different pictures [12], they generate respectively the Matzner-Misner or the Ehlers symmetry transformations of the model. Their actual role as a physical gauge transformation related to the local Lorentz transformations becomes manifest in the proper treatment of the coset model below, see Subsect.4. 2.4. The algebra of observables. A consistent treatment of the theory and in particular the ability to extract classical and quantum predictions from the theoretical framework requires the identification of a complete set of observables. In the model as presented so far, observables can be defined in the sense of Dirac as objects that have vanishing Poisson bracket with all the constraints including the Hamiltonian constraints (2.24), which even play the most important role here. In the two-time formalism this condition shows the ¯ This is a general feature of a covariant observables to have no total dependence on ξ and ξ. theory, where time dynamics is nothing but unfolding of a gauge transformation, and observables are the gauge invariant objects. Regarding the connection A(γ) as fundamental variables of the theory, the natural objects to build observables from are the monodromies of the linear system (2.15). They may be equivalently characterized as 9(γ) 7→ 9(γ)Ml ,

for γ running along the closed path l ,

or

I Ml = P exp

(2.54)

A(γ)dγ

.

l

¯ These objects naturally have no total (ξ, ξ)-dependence; in the isomonodromic sector we treat, the w-dependence is also absent. For simple poles let us denote by Mi ≡ Mli the monodromies corresponding to the closed paths li which respectively encircle the singularities γi and touch in one common basepoint. From the local behavior of 9(γ) around γ = γi , 9(γ) = Gi I + O(γ − γi ) (γ − γi )Ti Ci , one also extracts the relations Ai = Gi Ti G−1 i ,

Mi = Ci−1 e2πiTi Ci .

(2.55)

The remaining constraint of the theory which should have vanishing Poisson bracket with the observables is the generator of the constant gauge transformations (2.52), under which the monodromies transform by a common constant conjugation. This justifies

426


Definition 2.5. In the case, where the connection A(γ) exhibits only simple poles at fixed singularities wj and with fixed eigenvalues of Aj , we call the set of Wilson loops ( ) Y tr Mik k, (i1 , . . . , ik ) (2.56) k

the set of observables. Note 2.11. For these connections A(γ), the corresponding monodromies together with the position of the singularities and the eigenvalues of Aj generically already carry the complete information. (It is necessary to add the set of eigenvalues of Aj – i.e. the matrices Tj or the Casimir operators of the algebra respectively – to the set of monodromies, since from the monodromies only the exponentials of these eigenvalues can be extracted.) In the presence of higher order poles in the connection, additional scattering data – so-called Stokes multipliers – are required to uniquely specify the connection [39]. The generic case, in which the whole information is contained in the above data, is precisely defined by the fact that no eigenvalues of the monodromy matrices coincide [38, 39]. In particular, this excludes the case of multisolitons, where the monodromies equal ±I. The algebraic structure of the observables (2.56) is inherited from the Poisson structure on the corresponding connection A(γ). Before we explicitly describe this structure, let us briefly comment on the relation to Chern–Simons theory, where quite similarly the Poisson bracket (2.44) provides a Poisson structure on gauge invariant objects. 2.4.1. Observables in Chern–Simons theory. In Chern–Simons theory on the punctured sphere, the set of observables is also built from the monodromy matrices. Note that since in the usual gauge A0 = 0 the Hamiltonian constraint is absent, observables are identified as gauge invariant objects, where this is invariance under local (γ-dependent) gauge transformations. Fixing this gauge freedom by holomorphic gauge as described above, the Dirac bracket (2.47) is now a structure on the reduced phase space of holomorphic connections A(z) modulo the action of constant gauge transformations. It has been explained in [2] that the canonical bracket (2.44) does not define a unique structure on monodromy matrices due to arising ambiguities from the singularities of this bracket (see also [59]). However, on gauge invariant objects, built from traces of arbitrary products of monodromy matrices, these ambiguities vanish [28, 1]. Hence the strategy there is to postulate some structure on the monodromy matrices which reduces to the proper one [34] on gauge invariant objects. The holomorphic Dirac bracket (2.47) allows the calculation also for the monodromies themselves, as we shall show in the following. To relate this result to [28, 2], note that in general the original Poisson bracket and reduced Dirac bracket coincide on quantities of first class in Dirac terminology, i.e. here on gauge invariant objects. In this sense the holomorphic reduction finally leads to the same result on the space of observables. 2.4.2. Poisson structure of monodromy matrices. The holomorphic Poisson structure (2.47) defines a Poisson structure on the monodromy matrices Mj . The result is summarized in the following


427

Theorem 2.4. Let A(γ) be a connection on the punctured plane γ \ {γ1 , . . . , γN }, equipped with the Poisson structure n

o ¯ A(γ)0 , A(µ)0 =

i 1 h ¯ , A(γ)0 + A(µ)0 . γ−µ

(2.57)

Let further 9 be defined as a solution of the linear system ∂γ 9(γ) = A(γ)9(γ) ,

(2.58)

normalized at a fixed basepoint s0 , 9(s0 ) = I ,

(2.59)

and denote by M1 , . . . , MN the monodromy matrices of 9 corresponding to a set of paths with endpoint s0 , which encircle γ1 , . . . , γN , respectively. Ensure holomorphy of 9 at ∞ by the first-class constraint A∞ = lim γA(γ) = 0 .

(2.60)

γ→∞

Then, in the limit s0 → ∞, the Poisson structure of the monodromy matrices is given by o n ¯ ¯ ¯ (2.61) Mi0 , Mi0 = iπ Mi0 Mi0 − Mi0 Mi0 , n o ¯ ¯ ¯ ¯ ¯ Mi0 , Mj0 = iπ Mi0 Mj0 + Mj0 Mi0 − Mi0 Mj0 − Mi0 Mj0 , for i < j ,

(2.62)

where the paths defining the monodromy matrices Mj are ordered with increasing j with respect to the distinguished path [s0 → ∞]. At this point several comments on the result of this theorem are in order, whereas the proof is postponed to Appendix B. Note 2.12. The first-class constraint (2.60) generates constant gauge transformations of the connection A in the Poisson structure (2.57). For the connections of the type (2.36) this reduces to the constraint (2.52). In terms of the monodromy matrices, holomorphy of 9 at ∞ is reflected by Y Mi = I , (2.63) M∞ ≡ which in turn is a first-class constraint and generates the action of constant gauge transformations on the monodromy matrices in the structure (2.61) and (2.62). The ordering of this product is fixed to coincide with the ordering that defines (2.62). The gauge transformation behavior of the fields explicitly reads o h i n ¯ ¯ (2.64) A0∞ , A0j = , A0j , n o ¯ ¯ ¯ ¯ ¯ 0 0 0 0 0 M∞ , Mj0 = iπ M∞ Mj0 − Mj0 M∞ − M∞ Mj0 + M∞ Mj0 . This transformation law is further inherited by arbitrary products M = monodromies, where on the constraint surface M∞ = I it takes the form

Q k

Mjk of

428


n

¯

0 , M0 M∞

o

h i ¯ = −2πi , M 0 ,

resembling (2.64). The generators of gauge transformations build the algebra o h i n ¯ ¯ A0∞ , A0∞ = , A0∞ , or

n

¯

0 0 , M∞ M∞

o

0 0 0¯ 0¯ M∞ − M∞ M∞ = iπ M∞ ,

(2.65)

(2.66) (2.67)

in terms of A∞ and M∞ respectively. In fact, the algebras (2.66) and (2.67) turn out to be isomorphic: the quadratic bracket (2.67) linearizes if the Casimirs are split out. As mentioned, we will further be interested in gauge invariant objects, which are now identified by their vanishing Poisson bracket with (2.63) and which are therefore invariant under a global common conjugation of all monodromies. Note that this includes invariance under gauge transformations with gauge parameters (conjugation matrices) that have nonvanishing Poisson bracket with the monodromies themselves. In accordance with Definition 2.5, the structure (2.61), (2.62) implies {M∞ , trM } = 0

(2.68)

for an arbitrary product of monodromies M . Note 2.13. The evident asymmetry of (2.62) with respect to the interchange of i and j is due to the fact that the monodromy matrices are defined by the homotopy class of the path, which connects the encircling path with the basepoint in the punctured plane. This gives rise to a cyclic ordering of the monodromies. The distinguished path [s0 → ∞] breaks and thereby fixes this ordering, as is explicitly illustrated in Fig.3 in Appendix B below. It is remnant of the so-called eyelash that enters the definition of the analogous Poisson structure in the combinatorial approach [28, 1, 2], being attached to every vertex and representing some freedom in this definition. However, the choice of another path [s0 → ∞] simply corresponds to a global conjugation by some product of monodromy matrices: a shift of this eyelash by j steps corresponds to the transformation Mk → (M1 . . . Mj )−1 Mk (M1 . . . Mj ) . Therefore the restricted Poisson structure on gauge invariant objects is independent of this path. Note 2.14. A seeming obstacle of the structure (2.61), (2.62) is the violation of Jacobi identities. Actually, this results from heavily exploiting the constraint (2.60) in the calculation of the Poisson brackets. As therefore these brackets are valid only on the first-class constraint surface (2.63), Jacobi identities can not be expected to hold in general. However, the same reasoning shows [58], that the structure (2.61), (2.62) restricts to a Poisson structure fulfilling Jacobi identities on the space of gauge invariant objects. On this space, the structure reduces to the original Goldman bracket [34] and coincides with the restrictions of previously found and studied structures on the monodromy matrices [28]: o n ¯ ¯ ¯ ¯ ¯ (2.69) Mi0 , Mi0 = Mi0 r+ Mi0 + Mi0 r− Mi0 − r− Mi0 Mi0 − Mi0 Mi0 r+ , n o ¯ ¯ ¯ ¯ ¯ Mi0 , Mj0 = Mi0 r+ Mj0 + Mj0 r+ Mi0 − r+ Mi0 Mj0 − Mi0 Mj0 r+ , for i < j ,


429

where r+ and r− := −Πr+ Π are arbitrary solutions of the classical Yang-Baxter equation [r12 , r23 ] + [r12 , r13 ] + [r13 , r23 ] = 0 ,

(2.70)

and the symmetric part of r+ is required to be iπ. Setting r+ ≡ iπ, (2.69) reduces to (2.61), (2.62) such that our structure is in some sense the skeleton, which may be dressed with additional freedom that vanishes on gauge invariant objects. On the space of monodromy matrices themselves, introduction of r-matrices may be considered as some regularization to restore associativity, whereas the fact that itself does not satisfy the classical Yang-Baxter equation is equivalent to (2.61), (2.62) not obeying Jacobi identities. Q In the Poisson structure (2.69), the generator of gauge transformations M∞ ≡ i Mi has the following Poisson brackets with any monodromy Mk : n

¯

0 , Mk0 M∞

o

¯

¯

¯

¯

0 0 0 0 − Mk0 M∞ r− − r+ M∞ = Mk0 r+ M∞ Mk0 + M∞ r− Mk0 ,

(2.71)

whichQentails the same Poisson bracket of M∞ with an arbitrary product of monodromies M ≡ k Mjk . On the constraint surface M∞ = I, taking into account r+−r− = 2iπ, this again implies (2.65), such that M∞ again generates the constant gauge transformations. Note 2.15. The subset of observables {tr[(Mi )m ]|i, m} ∪ {wi |i}

(2.72)

commutes with the whole set of observables. For the positions of the singularities this follows just trivially from the Poisson structure (2.25), whereas the eigenvalues of the monodromy matrices are related to the eigenvalues of the corresponding residues Ai (2.55), which in turn provide the Casimir operators of the mutually commuting algebras (2.34). This subset of commuting variables thus parametrizes the symplectic leaves of (2.61), (2.62). Note 2.16. For our treatment of the coset model below, the following additional structure will be of importance. There is an involution η˜ on the set of observables, defined by the cyclic shift Mi 7→ Mi±n , where N = 2n is the total number of monodromies. The crucial observation is now that this involution is an automorphism of the Poisson structure on the algebra of observables: ˜ 2 )} = η({X ˜ {η(X ˜ 1 ), η(X 1 , X2 }) ,

(2.73)

for X1 , X2 being traces of arbitrary products of monodromy matrices. This is a corollary of Note 2.13, as it follows from the invariance of the Poisson structure on gauge invariant objects with respect to a shift of the eyelash that defines the ordering of monodromy matrices. Like every involution, η˜ defines a grading of the algebra into its eigenspaces of eigenvalue ±1. In particular, the even part forms a closed subalgebra.

430


3. Quantization of the Principal Model 3.1. Quantization in terms of the connection. The quantization of the model looks especially natural in the isomonodromic sector with only simple poles. This has been performed in [48, 49], as we shall briefly summarize. In this case straightforward quantization of the linear Poisson brackets (2.34) leads to the following commutation relations: [Aai , Abj ] = i~δij f abc Aj , ¯ (log h)ξ¯ ] = −i~ , [ξ, (log h)ξ ] = [ξ, ¯ (log h)ξ ] = [ξ, (log h)ξ¯ ] = 0. [ξ,

(3.1) (3.2)

According to (3.2), representing ξ and ξ¯ by multiplication operators, one can choose (log h)ξ = i~

∂ , ∂ξ

∂ (log h)ξ¯ = i~ ¯ . ∂ξ

(3.3)

From (3.1), the residues Aj can be represented according to Aaj = i~taj ,

(3.4)

which acts on a representation Vj of the algebra gC . ¯ in a sector with given singularities should depend on Thus the quantum state ψ(ξ, ξ) ¯ and live in the tensor-product V (N ) := V1 ⊗ . . . ⊗ VN of N representation spaces. (ξ, ξ) Q Denote the dimension of Vj by dj , such that d := dimV (N ) = dj . 3.1.1. Wheeler–DeWitt equations and Knizhnik-Zamolodchikov system. The whole “dynamics” of the theory is now encoded in the constraints (2.24), which accordingly play the role of the Wheeler–DeWitt equations here: ¯

Cξψ = Cξψ = 0 , which can be written out in explicit form using (2.24), (2.27), (3.3) and (3.4): i~ X ∂ψ jk = ψ, ∂ξ (1 − γj )(1 − γk ) ξ − ξ¯

(3.5)

(3.6)

k6=j

i~ X jk ∂ψ ψ, = ∂ ξ¯ ξ¯ − ξ k6=j (1 + γj )(1 + γk ) where jk := taj ⊗ tak is the symmetric 2-tensor of g, acting nontrivially only on Vj and Vk . The other constraint that restricts the physical states arrives from (2.52); its meaning was sketched in Subsect. 2.3.2. In the quantized sector it is reflected by   X ¯ =0.  taj  ψ(ξ, ξ) (3.7) j

The general solution of the system (3.6) is not known. However, these equations turn out to be intimately related to the Knizhnik-Zamolodchikov (KZ) system [45]: X jk ∂ϕKZ = i~ ϕKZ , (3.8) ∂γj γj − γk k6=j

with an V

(N )

-valued function ϕKZ (γj ):


431

Theorem 3.1. If ϕKZ is a solution of (3.8) obeying the constraint (3.7), and the γj depend ¯ according to (2.8), then on (ξ, ξ) ψ=

1 i~ N Y ∂γj 2 jj j=1

∂wj

ϕKZ

(3.9)

solves the constraint (Wheeler–DeWitt) equations (3.6). The Casimir operator jj defined above is assumed to act diagonal on the states; for g=sl(2) for example, this is simply jj = 21 sj (sj − 2), classifying the representation. Theorem 3.1 and the proof were obtained in [48]. The task of solving (3.6) reduces to the solution of (3.8). Note 3.1. The γj dependence of the quantum states, introduced in Theorem 3.1, can be ¯ understood as just a formal dependence, which covers the (ξ, ξ)-dependence of these states. However, one may also split up this dynamics into several commuting flows generated by the corresponding operators from (3.8). The full set of solutions of (3.8) then may be interpreted as a “γj -evolution operator,” describing this dynamics. In some sense [49] this quantum operator resembles the classical τ -function introduced in [38]. Note 3.2. We have described how the solution of the Wheeler–DeWitt equations is related to the solution of the KZ system (3.8) in the sector of the theory, where the connection has only simple poles. It is therefore natural to suppose that the quantization of the higher pole sectors that were classically presented in Subsect. 2.2.4 is achievable in a similar way and will moreover reveal a link to the higher order KZ systems, which were introduced in [57] in the quantization of isomonodromic deformations with exactly the Poisson structure (2.37) on the residues. Note 3.3. For definiteness it is convenient to assume pure imaginary singularities γj ∈ iR (i.e. wj ∈ R). Then classically Aj ∈ g and quantized they carry representations of g itself, not of gC . 3.2. Quantum algebra of monodromy matrices. 3.2.1. Quantum monodromies. Having quantized the connection A(γ) as described in the previous section, it is a priori not clear how to identify quantum operators corresponding to the classical monodromy matrices in this picture. As they are classically highly nonlinear functions of the Aj , arbitrarily complicated normal-ordering ambiguities may arise in the quantum case. The first problem is the definition of the quantum analogue of the classical 9– function. Its d0×d0 matrix entries are now operators on the d-dimensional representation space V (N ) . We choose here a simple convention, replacing the classical linear system ∂γ 9(γ) = A(γ)9(γ)

(3.10)

by formally the same one, where all the arising matrix entries are operators now, i.e. (3.10) remains valid for higher dimensional matrices A and 9. We have thereby fixed the operator ordering on the right-hand side in what seems to be the most natural way. In the same way, we define the quantum monodromy matrices:

432


Definition 3.1. The quantum monodromy matrix Mj is defined to be the r.h.s. monodromy matrix of the (higher dimensional) quantum linear system (3.10): 9(γ) 7→ 9(γ)Mj

for γ encircling γj ,

where the quantum 9-function is normalized as 1 γ −A∞ 9(γ) = I + O γ

around γ ∼ ∞ .

(3.11)

(3.12)

Note 3.4. The normalization condition (3.12) generalizes the one we chose in the classical case (2.59) where the basepoint s0 was sent to infinity. This generalization is necessary, because the constraint (2.60) is not fulfilled as an operator identity in the quantum case, which means that the quantum 9-function as an operator is definitely singular at γ = ∞ with the behavior (3.12). Only its action on physical states, which are by definition annihilated by the constraint (2.52) may be put equal to the identity for γ = ∞. For proceeding further we now make use of an interesting observation of [57], relating the KZ systems with N and N + 1 insertions by means of the quantum linear system (3.10). We state this as Theorem 3.2. Let ϕ(γ1 , . . . , γN ) be the evolution operator of the KZ system ∂j ϕ = i~

X k6=j

jk ϕ, γj − γk

and 8(γ0 , . . . , γN ) be the corresponding evolution operator of the KZ system with an additional insertion at N = 0. Then 9(γ0 , . . . , γN ) := (I ⊗ ϕ−1 )8 satisfies the following system of equations: ∂0 9 = i~

N X ta0 ⊗ (ϕtaj ϕ−1 ) j=1

∂j 9 = −i~

γ0 − γj

9,

(3.13)

ta0 ⊗ (ϕtaj ϕ−1 ) 9. γ0 − γj

The proof is obtained by a simple calculation.

Consider the relations (3.13). Together with the remarks of Note 3.1, it follows that this 9 just obeys the proper quantum linear system (3.10) in a Heisenberg picture: the ¯ (ξ, ξ)-dependence of the operators Aj is generated by conjugation with the evolutionoperator ϕ. For the definition of the quantum 9-function it is the Heisenberg picture which provides the most natural framework, as only in this picture implicit and explicit ¯ (ξ, ξ)-dependence of operators are treated more or less on the same footing. Thus one may identify Aj = i~ta0 ⊗ (ϕtaj ϕ−1 ) . The operators ta0 play the role of the classical representation ta acting on the auxiliary space V0 , which is already required for the formulation of the classical linear system. In this sense, the KZ system with N +1 insertions combines the classical linear system with the quantum equations of motion that are described by the KZ system with N


433

insertions. The additional insertion γ0 then plays the role of γ. We shall use this link to gain information about the algebraic structure of the quantum monodromy matrices. 3.2.2. Quantum group structure. We now start from the representation of the quantum 9-function due to Theorem 3.2: (3.14) 9(γ, γ1 , . . . , γN ) = I ⊗ ϕ−1 (γ1 , . . . , γN ) 8(γ, γ1 , . . . , γN ) . This shows in particular that the quantum monodromy matrices of the principal model defined in (3.11) equal the corresponding monodromies of the KZ system with N + 1 insertions. To obtain their algebraic structure, we employ a deep result of Drinfeld about the relation between the monodromies of the KZ connection and the braid group representations induced by certain quasi-bialgebras [21, 22]. Before we state these relations, we have to briefly describe the induced braid group representations. The KZ system that is of interest here, is X jk 8, ∂j 8 = i~ γj − γk k6=j

with j = 0, . . . , N , which, as explained, in a formal sense combines the classical and the quantum degrees of freedom, the function 8 living in V (N +1) := V0 ⊗ V (N ) . This system naturally induces a representation of monodromy matrices, which may canonically be lifted to a braid group representation [43]. However, for our purpose, it is sufficient to remain on the level of the monodromy representation, which we denote by ρKZ . We further have to briefly mention two algebraic structures, which are standard examples for braided quasi-bialgebras, where for details and exact definitions we refer to [22, 43]. Let us denote by U~ the so-called Drinfeld-Jimbo quantum enveloping algebra associated with g [20, 37]. This is a braided bialgebra, which includes the existence of a comultiplication 1, a counit and a universal R-matrix RU ∈ U~ ⊗U~ , obeying several conditions of which the most important here is the (quantum) Yang-Baxter equation RU12 RU13 RU23 = RU23 RU13 RU12 .

(3.15)

The matrix RU can in principle be explicitly given, but is of a highly complicated form. It is Drinfeld’s achievement to relate this structure to a braided quasi-bialgebra A~ , where the nontriviality of the R-matrix is essentially shifted into an additional element φA ∈ A~ ⊗A~ ⊗A~ , the so-called associator, which weakens the coassociativity. The R-matrix of A~ is simply RA = e−π~ , where := ta ⊗ta is the symmetric 2-tensor of g. This R-matrix satisfies a weaker form of (3.15), the quasi-Yang-Baxter equation 12 312 13 −1 132 23 123 23 −1 231 13 213 12 RA φA RA (φA ) RA φA = φ321 R A φ A RA . A RA (φA )

(3.16)

The algebras U~ and A~ are isomorphic as braided quasi-bialgebras [22]. There is a standard way, in which braided quasi-bialgebras induce representations of the braid group. Each simple braid σi is represented as i,i+1 i,i+1 R φi , ρ(σi ) := φ−1 i Π

(3.17)

where Π is the permutation operator and φi is defined as φi := 1(i+1) (φ) ⊗ I ⊗(N −i−2) with 1(1) := 1, 1(2) := Id and 1(i+1) := (1 ⊗ Id⊗i )1(i) . We will denote the restrictions of these representations of the algebras U~ and A~ on the monodromies, which are built from products of simple braids, by ρU and ρA respectively. Now we have collected all the ingredients to state the result of Drinfeld as:

434


Theorem 3.3. The monodromy representation of the KZ system equals the described monodromy representation of the braided quasi-bialgebra A~ , which in turn is equivalent to the monodromy representation of the braided bialgebra Uh . This means, that there is an automorphism u on V (N +1) , such that ρKZ = ρA = uρU u−1 .

(3.18)

For the proof we refer to the original literature [22] or to the textbook of Kassel [43]. We should stress that in this construction the deformation parameter of the quantum group structure coincides with the true Planck constant ~. 3.2.3. Quantum algebra and classical limit. It was our aim to describe the algebraic structure of the quantum monodromy matrices defined in (3.11). By Theorem 3.2 these monodromy matrices have been identified among the monodromies of the KZ system with N +1 insertions as the monodromies of the additional point γ0 encircling the other insertions. Exploiting the consequences of Theorem 3.3 now, the quantum algebra of the monodromy matrices M1 , . . . , MN is given by: Theorem 3.4. The matrices Mj from (3.11) satisfy −1 Mi0 = Mi0 R+ Mi0 R+−1 , R− Mi0 R− ¯

¯ R+ Mi0 R+−1 Mj0

¯

=

¯ Mj0 R+ Mi0 R+−1

,

(3.19) for i < j ,

where these relations are understood in a fixed representationNof the d0 ×d0 matrix entries of the monodromy matrices on the tensor-product V (N ) = j Vj . The R-matrices R± are −1 R− := u0¯ RU−1 u−1 R+ := ΠR− Π, (3.20) 0 , where RU is the universal R-matrix of U~ mentioned above, u0 is some automorphism on V0 ⊗ V (N ) and u0¯ is the corresponding one on V0¯ ⊗ V (N ) . The classical limit of these R-matrices is given by R± = I ⊗I ± (i~)(iπ) + O± (~2 ) .

(3.21)

Note 3.5. The relations (3.19) are to be understood as follows. The notation requires two copies 0 and 0¯ of the classical auxiliary space V0 . While the standard R-matrices RU and RA live on these classical spaces only, R− and R+ also act nontrivially on the quantum representation space V (N ) , due to conjugation with the automorphisms u0 , u0¯ . Proof of Theorem 3.4. Consider the monodromy representation (3.17) corresponding to the coassociative bialgebra U. The monodromy Mj for γ = γ0 encircling γj is thereby represented as ρU (Mj ) = (RU−1 )01 (RU−1 )02 . . . RUj0 RU0j RU0,j−1 . . . RU01 ,

(3.22)

such that it is just a matter of sufficiently often exploiting the Yang-Baxter equation (3.15) to explicitly show that the relations (3.19) hold for ρU (Mj ) with R− := RU−1 , −1 R+ := ΠR− Π. Theorem 3.3 further implies the conjugation of the R-matrices with the automorphism u in order to extend the result to the representation ρKZ , in which the monodromies from (3.11) were recovered. To further prove the asymptotic behavior (3.21), it is not enough to know the classical limit of RU – which is a classical r-matrix simply – since the semiclassical expansion of


435

the automorphisms u0 , u0¯ must be taken into account. For this reason, we additionally have to use the other part of Theorem 3.3, which relates the representations ρKZ and −1 −1 , R+ := ΠR− Π in a ρA . The relations (3.19) for the ρA (Mj ) hold with R− := RA generalized form, modified by certain conjugations with the nontrivial associator φA . The semiclassical expansion of the associator is given by [43]: φA = I ⊗I ⊗I + O(~2 ) ,

(3.23)

which implies that the term of order ~ in the semiclassical expansion of (3.19) is determined by the corresponding one in RA = e−π~ , which yields (3.21). The last point to be ensured is that the normalization of the quantum monodromies (3.12) around γ ∼ ∞ coincides with the normalization chosen in the definition of the KZ monodromies [21] in certain asymptotic regions of the space of (γ, γ1 . . . , γN ), up to the order ~. The proof of this fact goes along the same line as the proof of (3.23). We have now established the quantum algebra of the quantum monodromy matrices by identifying the corresponding operators inside the picture of the quantized holomorphic connection A(γ). The classical limit of this algebra equals exactly the classical algebra of monodromy matrices (2.61), (2.62). Hence, we have shown the “commutativity” of the (classical and quantum) links between the connection and the monodromies with the corresponding quantization procedures. Let us sketch this in the following diagram: Atiyah-Bott symplectic structure ¯ (µ)} ∼ δ ab δ (2) (γ − µ) {Aγ,a (γ), Aγ,b

holomorphic gauge

Regularized algebra of monodromies ¯

? Holomorphic connection P PP b abc Ac {Aa q P i , Aj } = δij f i

Classical algebra of monodromies ¯ ¯ {Mi0 , Mj0 } = iπ (Mi0 Mj0 + . . .)

¯

{Mi0 , Mj0 } = (Mi0 r+ Mj0 + . . .) quantization

?

quantization and quasi-associative generalization

b abc Ac [Aa i , Aj ] = i~δij f i

quantum monodromies via KZ system

? ^ Quantum algebra of monodromies

quantization of the

nonassociative algebra

R+ Mi0 R+−1 Mj0 = Mj0 R+ Mi0 R+−1 ¯

¯

Note 3.6. The dotted lines in this diagram depict the link to the usual way quantum monodromies have been treated. This was done by directly quantizing their classical algebra, which is derived from the original symplectic structure of the connection up to certain degrees of gauge freedom: for later restriction on gauge invariant objects, this

436


algebra may be described with an arbitrary classical r-matrix, as was sketched in Note 2.14. A direct quantization of this structure is provided by a structure of the form (3.19), where the quantum R-matrices live in the classical spaces only and admit the classical expansion R± = I + i~r± + O± (~2 ) [1, 2]. Note 3.7. In contrast to this quantum algebra which underlies (2.69), in (3.19) the Rmatrices – due to the automorphisms u0 , u0¯ – also act nontrivially on the quantum representation space. Their classical matrix entries may be considered as operator-valued, meaning that the quantum algebra can be treated alternatively as nonassociative or as “soft.” This is in some sense the quantum reason for the fact, that the classical algebra (2.61), (2.62) fails to satisfy Jacobi identities. However, note that (3.19) only describes the R-matrix in any fixed representation of the monodromies; for a description of the abstract algebra, compare the quasi-associative generalization in [2, 3], which provides the link between the quantum structure described in the previous note and (3.19). 3.2.4. Quantum observables. Let us discuss now the quantum observables, i.e. operators commuting with all the constraints. In analogy with the classical case it is clear that all monodromies of the quantum linear system (3.11) commute with the Hamiltonian constraints. Therefore, it remains to get rid of the gauge freedom (2.63), i.e. to identify functions of monodromies commuting with quantum generators of the gauge transformations. In the classical case the gauge transformations were generated by matrix entries of the matrix A∞ or, equivalently, of the matrix M∞−I. The straightforward quantization of the classical algebra of gauge transformations generated by A∞ (2.66) is (3.24) [Aa∞ , Ab∞ ] = fcab Ac∞ , i.e. coincides with g. In terms of M∞ , the algebra of the same gauge transformations according to (3.19) reads −1 0 0 0 0 R− M∞ = M∞ R+ M∞ R+−1 . R− M∞ ¯

¯

(3.25)

The set of quantum observables is characterized as the set of operator-valued functions F of components of monodromies Mj which commute with all components of A∞ : [F ({Mj }), Aa∞ ] = 0 . (3.26) Recall that in the classical case observables were just traces of arbitrary products of monodromies Mj . At the moment the quantum analog of this representation is not clear. One should suppose that there is a similar situation to the case we would have arrived at by directly quantizing the algebra of monodromies, mentioned in Note 3.6. In this case, which has been studied in the combinatorial quantization of Chern– Simons theory [2, 3], the R-matrices live in the classical spaces only and the transformation behavior of arbitrary products of monodromies M under gauge transformations generated by M∞ reads −1 0 0 R − M 0 R− M∞ = M∞ R+ M 0 R+−1 . ¯

¯

Introducing the quantum trace tr q M with characteristic relations tr 0q R00 M 0 (R00 )−1 = tr q M 0 , ¯

¯

we see that the operators tr q M commute with the components of M∞ :

(3.27)

Quantization of Coset Space σ-Models Coupled to Two-Dimensional Gravity 0 [tr q M, M∞ ]=0.

437

(3.28)

Therefore, the quantum group generated by M∞ : −1 0 0 0 0 R− M∞ R− M∞ = M∞ R+ M∞ R+−1 ¯

¯

(3.29)

in this approach plays the role of algebra of gauge transformations. It appears a difference of this approach with the approach which we mainly follow in this paper: instead of the Lie group G generated by the algebra (3.24), the role of the gauge group is played by its quantum deformation (3.29). A question therefore remains: what is the proper quantum gauge group of a consistent quantum theory, the group G itself or its quantum deformation Gq ? Note 3.8. With the notation of the quantum trace at hand, the quantum analogue of Note 2.15 can be formulated. From the abstract algebraic point of view – beyond the presented concrete representation of the quantum monodromies – the quantum trace of powers of the Mj build the center of the free algebra defined by (3.19) and may thus be fixed according to the classical values. 4. Coset Model In this final chapter we will explain, how to modify the previously presented scheme in order to treat the coset models, which actually arise from physical theories. The field g is required to take values in a certain representation system of the coset space G/H, where H is the maximal compact subgroup of G. This subgroup may be characterized by an involution η of G as the subgroup, which is invariant under η. The involution can further be lifted to the algebra g, e.g. η(X) = −X t for X ∈ g = sl(N ). The algebra g is thereby split into its eigenspaces with eigenvalues ±1, which are denoted by g = h ⊕ k, the subgroup H underlying h. In terms of the involution, the field g is restricted to satisfy: gη(g) = I ,

(4.1)

which defines the special choice of a representation system of the coset space. 4.1. Classical treatment. Classically speaking, the Poisson structure for the G/H-valued model may be obtained from the previously described Poisson structure for the principal G-valued model by implementing additional constraints. These constraints were discussed in detail in [49] and may be equivalently formulated in terms of the function 9 or of the connection A: −1 1 g −1 9(γ) = C0 , (4.2) η 9 γ 1 1 A(γ) + 2 gη A g −1 = 0 . (4.3) γ γ The first line is a consequence of (4.1) with C0 = C0 (w) from (2.22) also satisfying C0 η(C0 ) = I now. Studying the monodromies of 9 shows that in the isomonodromic sector, C0 must be gauged to a constant matrix, using the freedom of the right-hand side multiplication of the solution of (2.7). This can be seen from Eq. (4.36) below. Derivation of (4.2) with respect to γ then yields (4.3).

438


An unpleasant feature of these constraints is that they explicitly contain the field g, which in this framework is not among the fundamental variables. To avoid this difficulty, it is convenient to slightly modify the Hamiltonian formalism of the principal model. Namely, let us relax the normalization condition 9(γ = ∞) = I, which was ˆ related to 9 by a G-valued gauge imposed in (2.20) before and consider the function 9 transformation V instead: ¯ . ˆ := V(ξ, ξ)9 9 (4.4) −1 ˆ = 0), such that the coset constraint (4.1) ˆ = ∞) = V and gC0 = V 9(γ Then it is 9(γ may be rewritten as: (4.5) g = V −1 η(V) . ˆ now satisfies the linear system The modified function 9 ˆ ˆ 1+γ d9 d9 1−γ ˆ ˆ = − P+ + Q+ 9, P− + Q− 9, (4.6) = − dξ 1−γ 1+γ dξ¯ ¯ with (ξ, ξ)-dependent matrices P± ∈ k and Q± ∈ h which can be reconstructed from V on the coset constraint surface (4.5): Vξ V −1 = P+ + Q+ ,

Vξ¯ V −1 = P− + Q− .

Note 4.1. In the coset model the Möbius symmetry (2.11) appears in especially simple form [8]: s s ¯ w − ξ w−ξ ˆ P+ , P− 7→ V 7→ 9(γ) , P+ 7→ P− , h 7→ h . w−ξ w − ξ¯ In complete analogy to the principal model, we further introduce Definition 4.1. Define the connection Aˆ by ˆ ˆ 9 ˆ −1 (γ) . A(γ) := ∂γ 9(γ)

(4.7)

The constraint of regularity at infinity then reads ˆ Aˆ ∞ := lim γ A(γ) =0. γ→∞

(4.8)

The relations (2.17) between the original fields and the connection Aˆ take the following form: 1 ˆ 1 ˆ ¯ ¯ A(γ, ξ, ξ) A(γ, ξ, ξ) = −P+ , = −P− . (4.9) ¯ ¯ ξ−ξ ξ−ξ γ=1 γ=−1 Hence, the coset constraints (4.5) are equivalent to ˆ ˆ A(±1) = −η A(±1) ,

(4.10)

which is implied by (4.3). Let us stress again that the originally equivalent coset constraints (4.1), (4.5) or (4.10) are lifted to (4.3) due to the special choice of C0 = const in the isomonodromic sector. ˆ The constraints (4.2) and (4.3) take simpler forms in terms of the new variables 9 ˆ and A, since the field g is absorbed now:


−1 ˆ 1 ˆ η 9 9(γ) = C0 , γ 1 1 ˆ A(γ) + 2 η Aˆ =0. γ γ

439

(4.11) (4.12)

The first of these equations is a sign of the invariance of the linear system (4.6) on the coset constraint surface under the extended involution η ∞ , introduced in [12]: ∞ ˆ ˆ 1 , (4.13) η (9(γ)) := η 9 γ but is difficult to handle due to the unknown matrix C0 . The latter form (4.12) of the constraint admits a complete treatment as will be described below. Note that the constraint of regularity at infinity (4.8) is already contained in (4.12) and is thereby naturally embedded in the coset constraints. The set of constraints (4.12) is complete and consistent in the following sense: ¯ Lemma 4.1. The coset constraints (4.12) are invariant under (ξ, ξ)-translation on the constraint surface. Proof. The total ξ-dependence of Aˆ can be extracted from (2.23) to be ∂Aξ (γ) −1 d ˆ ˆ A(γ) = V[Aξ (γ), A(γ)]V −1 + [Vξ V −1 , A(γ)] V +V dξ ∂γ −2P+ ˆ ˆ , A(γ) + (P+ + Q+ ), A(γ) = 1−γ 2P+ γ(1 + γ) γ 2 − 2γ − 1 ˆ ˆ A(γ) − + 2 2 ¯ ¯ − γ) ∂γ A(γ) . (1 − γ) (ξ − ξ)(1 − γ) (ξ − ξ)(1 d d 1 Together with dξ f γ1 = − dξ for any function f (γ), which follows f γ from the structure of γξ , a short calculation reveals that on the constraint surface (4.12) it is 1 1 d 1 1 d ˆ ˆ ˆ A(γ) + 2 η A ≈ −γξ A(γ) + 2 η Aˆ ≈0. dξ γ γ dγ γ γ −

In a Hamiltonian formulation these constraints therefore have weakly vanishing Poisson bracket with the full Hamiltonian, which is required for a consistent treatment. Let us now briefly present the Hamiltonian formulation of the coset model in terms of the new variables. 4.1.1. Poisson structure and Hamiltonian formulation. The definition of the connection Aˆ already implies the relation ˆ A(γ) = VA(γ)V −1 ,

(4.14)

such that from (2.23) one extracts the equations of motion for these new variables:

440


∂Aξ −1 ∂ Aˆ ˆ = V[Aξ , A]V −1 + V V + [Vξ V −1 , A], ∂ξ ∂γ ¯ ∂Aξ −1 ∂ Aˆ ξ¯ −1 ˆ . V + [Vξ¯ V −1 , A] , A]V + V = V[A ∂γ ∂ ξ¯

(4.15)

In analogy with the principal model, this motivates ˆ Definition 4.2. Define on A(γ) the following Poisson structure:

Aˆ a (γ), Aˆ b (µ)

V

= −f abc

Aˆ c (γ) − Aˆ c (µ) , γ−µ

(4.16)

¯ and denote by implicit time-dependence the (ξ, ξ)-dynamics, that is generated by 1 tr Aˆ 2 (1) − tr[Aˆ ∞ (∂ξ VV −1 )] , ξ − ξ¯ 1 ¯ Hˆ ξ := ¯ tr Aˆ 2 (−1) − tr[Aˆ ∞ (∂ξ¯ VV −1 )] , ξ−ξ Hˆ ξ :=

(4.17)

on the constraint surface (4.8). The remaining explicit time-dependence is then defined to be generated in analogy to (2.30). Note 4.2. The Poisson structures (4.16) are certainly different for different V and, therefore, are different from (2.25), that was introduced in the principal model. However, this previous treatment may be embedded in the following way. The structures (4.16) and (2.25) are certainly equivalent if we restrict them to the functionals of Aˆ that are invariant with respect to the choice of V, i.e. invariant with respect to the transformations ˆ , Aˆ 7→ θ−1 Aθ

(4.18)

with arbitrary θ ∈ G. These were the gauge transformations in the principal model, generated by (2.21). Hence, on the set of observables of the principal model, the different Poisson structures coincide. Correspondingly, the action of H ξ and Hˆ ξ from (2.27) and (4.17) respectively differs only by the unfolding of such a gauge transformation. For the coset model it is important to note that the gauge freedom (4.18) is restricted to H-valued matrices θ, since only that part of the constraint (4.8) remains first-class here and generates gauge transformations. This is part of the result of Theorem 4.1 below. 4.1.2. Solution of the constraints. Given a set of constraints (4.12) and a Poisson structure (4.16), the canonical procedure is due to Dirac [19]. The constraints are separated into first and second class constraints, of which the latter are explicitly solved – which changes the Poisson bracket into the Dirac bracket – whereas the former survive in the final theory. In the case at hand, the essential part of the constraints is of the second class, such that the Poisson structure has to be modified and only a small part of the constraints survives as first-class constraints. We state the final result as Theorem 4.1. The Dirac procedure for treating the constraints (4.12) in the Poisson ˆ structure (4.16) yields the following Dirac bracket for the connection A:


∗

Aˆ a (γ), Aˆ b (µ)

V

1 Aˆ c (γ) − Aˆ c (µ) = − f abc 2 γ−µ 1 aη(b)c Aˆ c (γ) 1 η(a)bc Aˆ c (µ) + f , + f 2 µ − γ1 2 γ − µ1

441

(4.19)

where the notation of indices means a choice of basis with tη(a) ≡ η(ta ). The bracket for the logarithmic derivatives of the conformal factor remains unchanged: o∗ n o∗ n ¯ −(log h)ξ¯ = ξ, =1. (4.20) ξ, −(log h)ξ V

V

The structure is compatible with the (now strong) identity 1 1 1 1 ˆ A(γ) + 2 η Aˆ = Aˆ ∞ = η(Aˆ ∞ ) , γ γ γ γ

(4.21)

such that compared with (4.12) it remains the first-class constraint Aˆ ∞ + η(Aˆ ∞ ) = 0 .

(4.22)

ˆ Proof. The main idea of the proof is the separation of the variables A(γ) into weakly commuting halves: 1 1 1 ˆ − Aˆ ∞ , 81 (γ) := A(γ) + 2 η Aˆ γ γ γ 1 1 1 ˆ 82 (γ) := A(γ) − Aˆ ∞ , − 2 η Aˆ γ γ γ with

8a1 (γ), 8b2 (µ)

V

≈ 0

(4.23)

on the constraint surface (4.12), as follows from (4.16) by direct calculation, using the fact that η is an automorphism: f abc = f η(a)η(b)η(c) . The whole constraint surface is spanned by 81 = 0 and Aˆ ∞ = 0, whereas 82 covers the remaining degrees of freedom. Since 81 and 82 contain respectively Aˆ ∞ ∓ η(Aˆ ∞ ), the relations (4.23) show that Aˆ ∞ + η(Aˆ ∞ ) is a first-class constraint of the theory. If we further explicitly solve the second-class constraints 81 = 0, the commutativity (4.23) implies that the Poisson bracket of 82 remains unchanged by the Dirac procedure: a ∗ 82 (γ), 8b2 (µ) V = 8a2 (γ), 8b2 (µ) V . Moreover, the Dirac bracket is by construction compatible with the vanishing of 81 : ∗

{8a1 (γ), . }V = 0 . ˆ These facts may be used to easily calculate the Dirac bracket of the original variables A(γ) without explicitly inverting any matrix of constraint brackets. With the decomposition 1 1 ˆ 1 ˆ 1 ˆ (A∞ + η(Aˆ ∞ )) + (A∞ − η(Aˆ ∞ )) , A(γ) = 81 (γ) + 82 (γ) + 2 2 2γ 2γ the result is obtained. The bracket (4.20) follows from the calculations performed in Lemma 4.1, which imply the vanishing Poisson bracket between (log h)ξ and the constraints.

442


4.1.3. Final formulation and symmetries of the theory. Let us summarize the final status ˆ of the theory and the relation of the new fundamental variables A(γ) to the original fields V and g respectively. We further discuss how the local and global symmetries of the original fields become manifest in this formulation. ˆ The formulation in terms of the new variables A(γ) is completely described in Theorem 4.1, where their modified Poisson structure is given. The solved constraints (4.21) may be considered to be valid strongly. The remaining first-class constraint (4.22) generates the transformation ˆ , Aˆ 7→ χ−1 Aχ

(4.24)

with χ ∈ H. According to (4.9), the field V transforms as V 7→ χV .

(4.25)

The relation (4.5) on the coset constraint surface shows that the field g does not feel this transformation. The gauge transformations generated by (4.22) are the manifestation of a really physical gauge freedom in the decomposition of the metric into some vielbein; they are remnant of the gauge freedom of local Lorentz transformations in general relativity. This freedom may be fixed to choose some special gauge for the vielbein field V. Note 4.3. It is important to notice that the second term in the modified Hamiltonians ¯ ¯ Hˆ ξ , Hˆ ξ from (4.17), that makes them differ from H ξ , H ξ from (2.27) becomes a pure gauge generator after the presented solution of the constraints. This is due to the fact that Aˆ ∞ ∈ h according to (4.21). Since h and k are orthogonal with respect to the CartanKilling form, the action of H ξ and Hˆ ξ just differs by h-conjugation and thus by a gauge transformation of the coset model. The field Aˆ now does not contain the complete information about the original field V, ˆ by means of but only the currents Vξ V −1 , Vξ¯ V −1 , which may be extracted from A(±1) (4.9). At first sight, one might get the impression that in contrast to (2.17), the relations (4.9) do not even contain the full information about these currents. However, if the gauge freedom (4.25) in V is fixed, the currents may be uniquely recovered from (4.9). For g = sl(N ) for example, usually a triangular gauge of V is chosen, such that Vξ V −1 is recovered from its symmetric part 2P+ = (Vξ V −1 )+(Vξ V −1 )t . The field V moreover is determined only up to right multiplication V 7→ Vθ from the currents Vξ V −1 , Vξ¯ V −1 . This is a (global) symmetry of the theory, under which the field g according to (4.5) transforms as g 7→ θ−1 gη(θ) .

(4.26)

For axisymmetric stationary 4D gravity these are the so-called Ehlers transformations. They are obviously a symmetry of the original equations of motion (2.5). ˆ The new variables A(γ) are invariant under these global transformations, which ˆ become only manifest in the transition to the original fields. The related 9-function transforms due to its normalization at ∞ as ˆ 7→ 9θ ˆ , 9 ˆ = 0): as well as the auxiliary matrix C0 , which is related to 9(γ

(4.27)


C0 7→ η(θ)−1 C0 θ .

443

(4.28)

Thereby, we have made explicit the global and local symmetries of the original fields in the new framework. 4.1.4. First order poles. Let us evolve the previous result for the case of simple poles ˆ ˆ of A(γ). We again parametrize A(γ) by its singularities and residues: ˆ A(γ) =

N X j=1

Thus

Aˆ j . γ − γj

(4.29)

Aˆ j = VAj V −1 .

(4.30)

Their equations of motion read 2 X ∂ Aˆ j [Aˆ k , Aˆ j ] = + [Vξ V −1 , Aˆ j ] , ¯ ∂ξ ξ − ξ k6=j (1 − γk )(1 − γj )

(4.31)

2 X [Aˆ k , Aˆ j ] ∂ Aˆ j + [Vξ¯ V −1 , Aˆ j ] , = ξ¯ − ξ k6=j (1 + γk )(1 + γj ) ∂ ξ¯ ¯ and are completely generated by the Hamiltonians Hˆ ξ and Hˆ ξ from (4.17). Theorem 4.1 now implies

Corollary 4.1. Let Aˆ be parametrized as in (4.29). After the Dirac procedure, the following identities hold strongly: 1 , (4.32) γj = γj+n Aˆ j = η(Aˆ j+n ) ,

(4.33)

where N = 2n. They may be explicitly checked to also commute with the full Hamiltonian ¯ constraints C ξ , C ξ . The remaining degrees of freedom are therefore covered by the γj and Aˆ j for 1 ≤ j ≤ n, which are equipped with the Dirac bracket: a b ∗ 1 Aˆ i , Aˆ j V = δij f abc Aˆ cj . 2

(4.34)

The remaining first-class constraint is

1 ˆ A∞ + η(Aˆ ∞ ) = 2

n X j=1

 Aˆ j + η 

n X

 Aˆ j  = 0 .

(4.35)

j=1

This solution of the constraints in the case of first order poles may alternatively be carried out in terms of the monodromies Mj . As was mentioned above, in the presence of only simple poles, the variables Aj are generically (see Note 2.11) completely defined by the monodromies Mj . Assuming that (4.32) is fulfilled, the coset constraints in the form (4.11) are equivalent to

444


Mj+n − C0−1 η(Mj )C0 = 0 . (4.36) There are two important points that this form of the constraints exhibits. First, it shows the necessity to choose the matrix C0 to be constant in the isomonodromic sector. Moreover, it uniquely relates the ordering of the monodromy matrices fixed for calculation of its Poisson brackets in Theorem 2.4 to the ordering defined by (4.32). This results from choosing the corresponding paths pairwise symmetric under γ 7→ γ1 . The goal is now to calculate the Dirac bracket between monodromies Mj with respect to (4.33), or, equivalently, with respect to (4.36). One way is clearly to repeat the calculation of Theorem 2.4 using the Dirac bracket (4.19) instead of the Poisson bracket (2.25). However, we can alternatively determine the Dirac bracket from simple symmetry arguments avoiding direct calculation at least for objects that are invariant under G-valued gauge transformations (i.e. traces of arbitrary products of Mj ). The involution η ∞ introduced by (4.13) acts on Mj according to (4.11) as follows: η ∞ (Mj ) = C0 η(Mj+n )C0−1 .

(4.37)

Therefore, the set of all G-invariant functionals of Mj may be represented as MS ⊕ MAS ,

(4.38)

where the set MS contains functionals which are invariant with respect to η ∞ and MAS contains functionals changing the sign under the action of η ∞ . Since η is an automorphism of the structure (2.61), (2.62), the definition of η ∞ in (4.37) implies, taking into account Note 2.16: {MS , MS } ⊆ MS ,

{MS , MAS } ⊆ MAS ,

{MAS , MAS } ⊆ MS .

(4.39)

The constraints (4.36) are equivalent to vanishing of all functionals from MAS ; therefore the part of G-invariant variables surviving after the Dirac procedure is contained in MS . The former Poisson bracket on MS coincides with the Dirac bracket. Note 4.4. The treatment of coset constraints in terms of the monodromies presented ˆ are. Therefore, above is invariant with respect to change of V since the monodromies of 9 this treatment also works in the former Poisson structure (2.25). 4.2. Quantum coset model. The quantization of the coset model goes along the same line as the quantization of the principal model described above. We again restrict to the first order pole sector of the theory, although generalization to the whole isomonodromic sector should be achievable according to Note 3.2. Having solved the constraints, the remaining degrees of freedom are the singularities γj , the residues Aˆ j for j = 1, . . . , n and the logarithmic derivatives of the conformal factor h. They may be represented as in (3.3) and (3.4) again. The quantum representation space is V (n) := V1 ⊗ . . . ⊗ Vn . The Wheeler–DeWitt equations (3.5) take the form:    X i~ X ∂ψ 1 + γ j γk γj + γk ˜ jk ψ , = − jk  ∂ξ (1−γj )(1−γk ) (1−γj )(1−γk ) ξ − ξ¯  j,k

j,k

   X ∂ψ i~ X 1 + γ j γk γj + γk ˜ jk ψ , + = jk  (1 + γj )(1 + γk ) ∂ ξ¯ ξ¯ − ξ  j,k (1 + γj )(1 + γk ) j,k

(4.40)


445

with ˜ jk := tη(a) ⊗ tak . j

jk = taj ⊗ tak

Additionally, the physical states have to be annihilated by the first-class constraint (4.22):   X η(a) X ¯ =0.  taj + tj  ψ(ξ, ξ) (4.41) j

j

The result of Theorem 3.1 is modified to establish a link to solutions of what we will refer to as the Coset-KZ system:   X 1 + γ /γ  X γk + 1/γj ∂ϕCKZ k j ˜ jk ϕCKZ . = i~ jk + (4.42)   ∂γj γj − γk γj γk − 1 k6=j

k

The relation between solutions of the Wheeler–DeWitt equations and solutions of the Coset-KZ system is now explicitly given by Theorem 4.2. If ϕCKZ is a solution of (4.42) obeying the constraint (4.41), and the γj ¯ according to (2.8), then depend on (ξ, ξ) ψ=

n Y j=1

γj−1

∂γj ∂wj

i~jj ϕCKZ

(4.43)

solves the constraint (Wheeler–DeWitt) Eqs. (4.40). This may directly be calculated in analogy to (3.9).

The procedure of identifying observables may be outlined just as in the case of the principal model, where this was described in great detail. Again the monodromies of the quantum linear system are the natural candidates for building observables and contain a complete set for the simple pole sector. In analogy to Theorem 3.2 they should be identified with the monodromies of a certain higher-dimensional Coset-KZ system with an additional insertion playing the role of the classical γ. The actual observables are generated from combinations of matrix entries of these monodromies that commute with the constraint (4.41). From general reasoning according to the classical procedure, relevant objects turn out to be the combinations of G-invariant objects, that are also invariant under the involution η∞ . 4.3. Application to dimensionally reduced Einstein gravity. Let us finally sketch how the previous formalism and results work for the case of axisymmetric stationary 4D gravity. In this case, the Lagrangian of general relativity is known to reduce to (2.1) with the field g taking values in SL(2, R) as a symmetric 2 × 2 matrix; its symmetry corresponds to the coset constraint (4.1). Most of the physically reasonable solutions of the classical theory – among them in particular the Kerr solution – lie in the isomonodromic sector and are described by first order poles at purely imaginary singularities in the connection. The quantization of this sector may be performed within the framework of this paper. According to (3.4) and Note 3.3 the residues Aˆ j are represented as

446


Aˆ j ≡ i~

1

2 hj

fj

ej − 21 hj

,

(4.44)

where hj , ej and fj are the Chevalley generators of sl(2, R). Due to its non-compactness, sl(2, R) admits no finite dimensional unitary representations, but several series of infinite dimensional representations. The study of the classical limit singles out the principal series, as was discussed in [49]. The representation space consists of complex functions f (ζ) on the real line with the ordinary L2 (R) scalar product Z f1 (ζ)f2 (ζ)dζ , (4.45) hf1 , f2 i := R

and the anti-hermitian operators act as hj ≡ 2ζj ∂j + sj ,

ej ≡ ζj2 ∂j + sj ζj ,

fj ≡ −∂j .

(4.46)

The spin sj takes values sj = 1+iqj with a continuous parameter qj ∈ R. The surviving first-class constraint (4.41) now takes a simple form: Lemma 4.2. A solution f (ζ1 , . . . , ζn ) of the constraint (4.41) is of the form Y 1 (ζj2 + 1)− 2 sj F (ζ˜1 , . . . , ζñ ) , f (ζ1 , . . . , ζn ) =

(4.47)

j

with ζ˜j :=

ζj +i ζj −i

and

 X ∂ F = 0 .  ∂ ζ˜j 

(4.48)

j

This follows by direct calculation.

The prefactor in (4.47) is exactly sufficient for convergence of the integral, such that for finiteness of the norm, it is sufficient to demand boundedness of F which is a function on the product of (n − 1) circles S 1 . In contrast to the analogous sl(2, R) representation of the principal model, where solutions of finite norm are absent due to several redundant integration variables, a convergency factor here comes out for free. This interestingly resembles the fact that the general reason for dividing out the maximal compact subgroup in the physical coset models corresponds to avoiding unboundedness of the energy in the theory. It remains to solve the Coset-KZ system in this representation. Although the general solution for sl(2, R) is not known, one might be able to obtain explicit results for a small number of insertions. The Kerr solution for instance, which is of major interest, requires only two classical insertions γ1 , γ2 ∈ iR. In this case, we may exploit Theorem 4.2 and Lemma 4.2 to explicitly reduce the WDW equation to a second order differential equation in two variables. Let V1 and V2 be two representations from the principal series ¯ ∈ V1 ⊗ V2 of sl(2, R) fixed by s1 and s2 and parametrize the quantum state ψ(ξ, ξ) according to: 11 12 γ1 γ2 ¯ ζ1 , ζ2 ) = (ζ12 + 1)− 21 s1 (ζ22 + 1)− 21 s2 F (γ, ζ) , (4.49) ψ(ξ, ξ, γ12 − 1 γ22 − 1 with


i ~s1 (s1 − 2) , 2 γ 1 + 1 γ2 − 1 ∈ S1 , γ≡ γ1 − 1 γ2 + 1 11 ≡

447

i ~s2 (s2 − 2) , 2 ζ 1 + i ζ2 − i ∈ S1 . ζ≡ ζ1 − i ζ2 + i

12 ≡

After some calculation the WDW equation then becomes ∂γ F (γ, ζ) = i~Ds1 ,s2 (γ) F (γ, ζ) , with Ds1 ,s2 (γ) =

(4.50)

ζ 2 +1 1 2ζ(ζ −1)2 ∂ζ2 + 2(ζ −1)2 + (s1 +s2 )(ζ 2 −1) ∂ζ + s1 s2 γ −1 2ζ 1 ζ 2 +1 − 2ζ(ζ + 1)2 ∂ζ2 + 2(ζ + 1)2 + (s1 +s2 )(ζ 2 −1) ∂ζ + s1 s2 γ+1 2ζ 4 + (ζ 2 ∂ζ2 + ζ∂ζ ) . (4.51) γ

This form e.g. suggests expansion into a Laurent series in ζ on S 1 leading to recurrent differential equations in γ for the coefficients. Further study of this equation should be a subject of future work. Note 4.5. Equation (4.50) reduces to a Painlevé equation when the principal series representation of sl(2, R) is formally replaced by the fundamental representation of g = su(2). In the study of four-point correlation-functions in Liouville theory a similar generalization of the hypergeometric differential equation appeared [62]. 5. Outlook We have completed the classical two-time Hamiltonian formulation of the coset model for the isomonodromic sector and sketched a continuous extension in Appendix A. For the quantum theory it remains the problem of consistent quantization of the total phase space including a proper understanding of the structures (A.8). The most important physical problem in the investigated model is the description of states corresponding to quantum black holes. One may certainly hope to extract first insights from a closer study of the exact isomonodromic quantum states of the coset model identified in the last chapter, in particular from the study of Eq. (4.50). An open problem is the link of the employed two-time Hamiltonian formalism with the conventional one. To rigorously relate the different Poisson structures, the canonical approach should be compared to our model after a Wick rotation into the Lorentzian case. This corresponds to a dimensional reduction of spatial dimensions only, such that the model would describe colliding plane or cylindrical waves rather than stationary black holes. It is further reasonable to suspect that proper comparison of the different Poisson structures can only be made on the set of observables, see also Note 2.5. Recent progress in the canonical approach has been stated in [51], where in particular the canonical algebraic structures of the observables have been revealed. However, so far the canonical and the isomonodromic approaches appear to favor different characteristic observables, which still remain to be related. As another possibility to compare our treatment with canonical approaches, the relation to further restricted and already studied models should be investigated. Of major

448


interest in this context would be for instance the relation to the Einstein-Rosen solutions, investigated and quantized in [52, 5], where imposing of additional hypersurface orthogonality of the Killing vector fields reduces the phase space to “one polarization,” yet maintaining an infinite number of degrees of freedom. An additional interesting field of future research descends from the link to broadly studied two-dimensional dilaton gravity (see e.g. [14, 32, 9, 27]), further allowing to extract information about the black hole thermodynamics. Further relevance of the investigated model appeared in certain sectors of string theory [30, 53]. Acknowledgement. It is a pleasure to thank H. Nicolai, V. Schomerus and J. Teschner for enlightening discussions. D. K. acknowledges support of Deutsche Forschungsgemeinschaft under contract No. Ni 290/5-1. H. S. thanks Studienstiftung des Deutschen Volkes for financial support.

A. Extension Beyond the Isomonodromic Sector The treatment of the isomonodromic sector presented in this paper allows a rather natural extension to the full phase space. This general scheme recalls a continuous version of the simple pole sector treated in Subsect.2, which in turn may be understood as a discrete embedding into the former. We will again first describe the scheme for the principal model and then discuss the modifications required for the coset model, see also [56]. ¯ A.1. Principal model. We start from a simply-connected domain in the ξ, ξ-plane, ¯ ¯ is symmetric with respect to conjugation ξ 7→ ξ, where the classical solution g(ξ, ξ) assumed to be non-singular. This regularity is reflected by corresponding properties of the related 9-function in the w-plane. It is holomorphicpand invertible in a (ring-like) ¯ bounded by domain D of the Riemann surface L of the function (w − ξ)(w − ξ) contours l and lσ , where σ is the involution γ 7→ 1/γ interchanging the w-sheets of L. To simplify the following formulas we further assume the spectral parameter current A(γ) to be holomorphic on the whole second sheet of L, such that it may be represented inside of l (we denote this simply-connected domain by D0 ) by a Cauchy integral over l: I ¯ A(w, ξ, ξ)dw , (A.1) A(µ) = γ(w) − µ l which is the continuous analog of the simple pole ansatz (2.32) in the isomonodromic sector; A(w), w ∈ l is a density corresponding to the residues Aj from (2.32). From (A.1), A(w) is not uniquely defined by the values of A(γ), γ ∈ D0 , in particular, it may not coincide with the boundary values of A(γ) on l. To fix A(w), we postulate the following deformation equations which are a continuous version of the discrete deformation Eqs. (2.33): I 2 [A(v), A(w)] ∂A(w) = dv , (A.2) ¯ ∂ξ ξ − ξ l (1 − γ(v))(1 − γ(w)) I 2 [A(v), A(w)] ∂A(w) dv , w∈l. = ¯ ¯ (1 + γ(v))(1 + γ(w)) ξ−ξ l ∂ξ It is easy to check that (A.2) together with (A.1) imply the deformation Eqs. (2.23) for A(γ). The Poisson structure on A(w) is also a direct continuous analog of (2.34):


{Aa (w), Ab (v)} = −f abc Ac (w)δ(w − v) ,

449

w, v ∈ l,

(A.3)

where δ(w) is a one-dimensional δ-function living on the contour l (and should, strictly ds δ(s) with an arbitrary affine parameter s along l). This structure speaking, be defined as dw in turn induces the proper holomorphic bracket (2.25) for A(γ): I Ac (w0 )dw0 a b abc {A (γ(w)), A (γ(v))} = −f 0 0 l (γ(w ) − γ(w))(γ(w ) − γ(v)) c c A γ((w)) − A (γ(v)) = −f abc . γ(w) − γ(v) The nice feature of A(w) in contrast to A(γ) is that A(w) (as its discrete analog ¯ independent, i.e. the whole dependence of A(w) on ξ and ξ¯ is Aj ) is explicitly (ξ, ξ) generated by the Hamiltonians (2.27) (note that the points γ = ±1 lie inside of D0 ): 1 H = tr ξ − ξ¯

I

ξ

l

A(w)dw 1 − γ(w)

2

1 tr H = ¯ ξ−ξ ξ¯

,

I l

A(w)dw 1 + γ(w)

2 .

(A.4)

We may now also identify a continuous family of observables, generalizing the construction of Sect.2. Define A(γ) inside and outside of D0 by the Cauchy formula (A.1) and construct the related functions 9in (γ ∈ D0 ) and 9out (γ 6∈ D0 ) according to 9γ 9−1 = A(γ). Then the continuous monodromy matrix M (w) ≡ 9out (w)9−1 in (w) ,

w∈l

(A.5)

¯ is (ξ, ξ)-independent, since both 9in and 9out satisfy the linear system (2.7). Calculations similar to those in Appendix B yield the following Poisson brackets for M (w): ¯ ¯ ¯ (A.6) {M 0 (v), M 0 (w)} = iπ − M 0 (v) M 0 (w) + M 0 (w) M 0 (v) ¯ ¯ + M 0 (v)M 0 (w) − M 0 (v)M 0 (w) , for

v≤w,

v, w ∈ l ,

where the points of contour l are ordered with respect to a fixed point w0 , playing the role of the eyelash in the discrete case. The brackets (A.6), are again valid up to the first-class constraint generated by I (A.7) A∞ = A(w)dw , l

and therefore satisfy Jacobi identities only being restricted to the gauge-invariant objects. Again there appear two fundamental ways of quantization. In terms of A, (A.3) would be replaced by a possibly centrally extended affine algebra. Alternatively, the Poisson algebra of observables (A.6) may be quantized directly after regularization analogously to (2.69): ¯

¯

¯

{M 0 (v), M 0 (w)} = −M 0 (v) r+ M 0 (w) + M 0 (w) r− M 0 (v) ¯

¯

+ r− M 0 (v)M 0 (w) − M 0 (v)M 0 (w) r+ leading to:

v ≤ w,

v, w ∈ l ,

450

D. Korotkin, H. Samtleben −1 R− M 0 (w)R− M 0 (v) = M 0 (v)R+ M 0 (w)R+−1 , ¯

¯

v≤w.

(A.8)

Embedding of the isomonodromic sector into the presented extension looks especially simple if all the singularities γ1 , . . . , γN are assumed to belong to the contour l. The density A(w) is then parametrized as n X

A(w) = −

Aj δ(w − wj ) ,

(A.9)

j=N

where the residues Aj are the same as in (4.29). The Poisson structure (A.3) is the directly inherited from (2.34) and (A.9): {Aa (w), Ab (v)} =

N X

f abc Aj δ(w − wj )δ(v − wj )

j=1

= −f abc Ac (v)δ(v − w) . The monodromy M (w) here is a step function on l with jumps at w = wj . Fixing the eyelash between γN and γ1 it is M (w) = M1 . . . Mj ,

for

w ∈]γj , γj+1 [ .

Note A.1. Isomonodromic solutions with higher order poles are embedded into the general scheme by inserting higher order derivatives of δ-functions into (A.9). The definition ∂γ A, in (A.1) already shows that the proper object in this case is the connection Aw = ∂w accordance with the results from Subsect.2. Note A.2. The representation (A.1) gains a well known meaning when the model is truncated to a real scalar field g, where A(w) becomes independent of ξ, ξ¯ and the equation of motion (2.5) reduces to the Euler-Darboux equation ∂ξ φ − ∂ξ¯ φ ¯ =0, 2(ξ − ξ)

∂ξ ∂ξ¯ φ −

(A.10)

for φ = log g. Solutions of this equation may be represented as [15] I

f (w)dw

p

φ= l

¯ (w − ξ)(w − ξ)

,

(A.11)

with 2πif (w) ≡ φ(ξ = ξ¯ = w) defined on the axis ξ = ξ¯ and continued analytically. After differentiating in ξ and integrating by parts in w, this representation takes the form ∂ξ φ =

2 ξ − ξ¯

I l

f 0 (w)dw p , ¯ (w − ξ)(w − ξ)

and thus equals (2.17) with A(±1) defined by (A.1) after identification of f 0 (w) and A(w).


451

A.2. Coset model. In analogy to the discrete case, the coset model is most conveniently described in terms of modified variables Aˆ = η(V)Aη(V −1 ) . ˆ Due to the symmetry (4.12) between the values of A(γ) on different sheets of L, we ˆ can no longer assume A(γ) to be holomorphic in D0 , but have to replace the l by l ∪ lσ ˆ enclosing D in the formulas of the last section. The coset constraints in terms of A(w) take the form ˆ ˆ σ) , A(w) = η A(w w∈l, (A.12) and allow rather simple solution via a Dirac procedure, such that the phase space is ˆ reduced to the values of A(w) on l only, equipped with the Dirac bracket 1 {Aˆ a (w), Aˆ b (v)}∗V = − f abc Aˆ c (w)δ(w − v) , 2

v, w ∈ l .

(A.13)

Via the Cauchy representation (A.1) on the contour l ∪ lσ , this bracket further gives ˆ the previously derived Dirac bracket (4.19) on A(γ). It remains the h-valued first class constraint I ˆ ˆ A(w) + η(A(w)) dw = 0 , l

generalizing (4.22). The Hamiltonians finally also take the form (A.4) with l being replaced by l ∪ lσ . In terms of the observables M (w), restriction to the coset leads to M (wσ ) = C0−1 η M (w) C0 , w∈l, with some constant matrix C0 playing the same role as in (4.36). B. Poisson Structure of Monodromy Matrices This appendix is devoted to the proof of Theorem 2.4, which was obtained in collaboration with H. Nicolai.3 For simplicity of the presentation, we give the calculation for the case, where the Casimir element differs from the permutation operator Π by some scalar multiple of the identity only, which is the case for g = sl(N, R) for example. The procedure may easily be extended (concerning the notation mainly) to the general case. Here, the Poisson-structure of the connection is given by {A(γ) ⊗, A(µ)} =

1 [Π, A(γ) ⊗ I + I ⊗ A(µ)] , γ−µ

and the statement to be proven reads: (B.1) {Mi ⊗, Mi } = iπ [ Π, Mi Mi ⊗ I ] , {Mi ⊗, Mj } = iπΠ Mj Mi ⊗ I + I ⊗ Mi Mj − Mi ⊗Mj − Mj ⊗Mi , (B.2) for i < j . We first calculate the Poisson structure of matrix entries of the function 9 at different points s1 and s2 . These points are defined on the Riemann surface given by 9 by paths, 3

After completion we learned about related results in [4, 36].

452


connecting them to a common base-point s0 , at which 9 is taken to be normalized according to (2.59). The limit s0 → ∞ will be treated later on. For the calculation, we make use of the standard formula Z s1 Z s2 dµ1 dµ2 × {9(s1 ) ⊗, 9(s2 )} = 9(s1 ) ⊗ 9(s2 ) s0 s0 n o i h , 9−1 (µ1 ) ⊗ 9−1 (µ2 ) A(µ1 ) ⊗, A(µ2 ) 9(µ1 ) ⊗ 9(µ2 ) where the integrand may be rewritten as Π ∂µ1 + ∂µ2 9−1 (µ2 )9(µ1 ) ⊗ 9−1 (µ1 )9(µ2 ) . µ2 − µ1 This expression is completely regular, even for µ1 = µ2 . However, if the appearance of the derivation operators is exploited by partial integration, the integrals will split up into parts that exhibit singularities in coinciding points µ1 = µ2 . Thus, we restrict to distinguished endpoints s1 and s2 , choosing the defining paths [s0 → s1 ] and [s0 → s2 ] nonintersecting in the punctured plane from the very beginning. Singularities remain in the common endpoints of the paths at s0 . As a regularization, one of these coinciding endpoints is shifted by a small (complex) amount that is put to zero afterwards. Then, partial integration can be carried out properly, leaving only boundary terms, that lead to surviving simple line integrals, whereas the remaining double integrals cancel exactly. The arising singularities in = 0 regularize each other such that the sum is independent of the way, tends to zero. In a comprehensive form, the result may be stated as Theorem B.1. Let s1 and s2 be different points on the punctured plane, defined as points on the covering by nonintersecting paths [s0 → s1 ] and [s0 → s2 ] with common basepoint s0 at which 9 is normalized. Then, the Poisson bracket between matrix entries of 9(s1 ) and 9(s2 ) is given by {9(s1 ) ⊗, 9(s2 )} = 9(s1 ) ⊗ 9(s2 ) × (B.3) Z s2 Π 9−1 (µ)9(s1 ) ⊗ 9−1 (s1 )9(µ) dµ µ − s1 s0 Z s1 Π dµ 9−1 (s2 )9(µ) ⊗ 9−1 (µ)9(s2 ) − µ − s2 s Z s02 i h 1 dµ Π , 9(µ) ⊗ 9−1 (µ) + µ − s0 s0 Z s0 − Z s1 Π −1 + dµ 9(µ) ⊗ 9 (µ) . + lim →0 µ − s0 s0 + s2 This expression is regular and independent of the limit procedure.

Note B.1. The result of the regularization is the complete fixing of the relative directions of the paths [s0 → s1 ] and [s0 → s2 ] approaching the basepoint s0 , that is determined by the form in which arises in the last term in (B.3). In other words, the path [s1 → s0 → s2 ] must pass through the basepoint s0 straightforwardly, as is indicated in Fig.1. The result of Theorem B.1 may be further simplified in the limit s0 → ∞, where the third term of (B.3) vanishes:


453

s2

s0

s1

Fig. 1. Choice of paths

Lemma B.1. For a fixed point s on the punctured plane and 9(γ) holomorphic at γ = ∞, it is Z

s

lim

s0 →∞

s0

i h 1 −1 dµ = 0. Π, 9(µ) ⊗ 9 (µ) µ − s0

(B.4)

The proof is obtained by estimating the integrand as a holomorphic function of γ and s0 . To proceed in calculating the Poisson bracket between monodromy matrices, we choose points s1 , s2 , s3 and s4 , pairwise coinciding on the punctured plane as s1 ∼ s2 and s3 ∼ s4 , but distinguished on the covering and defining the monodromy matrices Mi and Mj : 9(s2 ) = 9(s1 )Mi ,

9(s4 ) = 9(s3 )Mj .

(B.5)

Then, (B.3) leads to: Z Π dµ {Mi ⊗, Mj } = (Mi ⊗ Mj ) µ − s0 s →s →s Z 4 0 2 Π dµ + µ − s0 s →s →s Z 3 0 1 Π dµ − (I ⊗ Mj ) µ − s0 s →s →s Z 4 0 1 Π dµ − (Mi ⊗ I) µ − s0 s3→s0→s2

9(µ) ⊗ 9−1 (µ)

−1

(B.6)

(Mi ⊗ Mj ) 9(µ) ⊗ 9−1 (µ) (Mi ⊗ I) −1 9(µ) ⊗ 9 (µ) (I ⊗ Mj ) , 9(µ) ⊗ 9

(µ)

which is understood in the limit → 0 and s0 → ∞ and for paths [sj → s0 → si ] , i = 1, 2; j = 3, 4, chosen fixed and in accordance with the conditions of Theorem B.1 and Note B.1. Proof of (B.1). Consider first the case i = j. Then a proper choice of paths is illustrated in Fig.2. The expression (B.6) allows to put s1 = s3 and s2 = s4 and to split the integration paths into paths encircling s0 and γi , respectively: {Mi ⊗, Mi } = (Mi ⊗Mi )X − X(Mi ⊗Mi ) − (Mi ⊗I)X(I ⊗Mi ) + (I ⊗Mi )X(Mi ⊗I) + (I ⊗Mi )Y (Mi ⊗I) − (Mi ⊗I)Y (I ⊗Mi ) , with

454


s 1 ∼ s2 s 3 ∼ s4 γi

s0 ∞ Fig. 2. Choice of paths for {Mi ⊗ , Mi }

I

Π 9(µ) ⊗ 9−1 (µ) , µ − s0 s0 s2 Π 9(µ) ⊗ 9−1 (µ) . dµ Y = µ − s0 s1

X=

1 2 Z

dµ

The path of the integral Y neither passes through s0 nor intersects the path [s0 → ∞]; such that this integral vanishes in the limit s0 → ∞. This choice of path uniquely determines the orientation of the remaining paths in X, which encircle s0 . The corresponding integrals can be easily evaluated due to Cauchy’s theorem and single-valuedness of the integrands. This proves formula (B.1). Proof of (B.2). This case is treated in complete analogy. A suitable form of the paths is shown in Fig.3, which in particular illustrates the asymmetric position of the paths defining respectively Mi and Mj , with respect to the marked path [s0 → ∞].

s 1 ∼ s2

s 3 ∼ s4

γi

γj

s0 ∞ Fig. 3. Paths for {Mi ⊗ , Mj }


455

Similar reasoning as above yields {Mi ⊗, Mj } = −(Mi ⊗Mj )X − X(Mi ⊗Mj ) + (Mi ⊗I)X(I ⊗Mj ) + (I ⊗Mj )X(Mi ⊗I) ,

(B.7)

where again several integrals have already vanished in the limit s0 → ∞. Evaluating the remaining terms proves formula (B.2).

References 1. Alekseev, A. Y.: Integrability in the Hamiltonian Chern-Simons theory. Algebra Anal. 6, 53–66 (1994) 2. Alekseev, A. Y., Grosse, H., and Schomerus, V.: Combinatorial quantization of the Hamiltonian ChernSimons theory. Commun. Math. Phys. 172, 317–358 (1995) 3. Alekseev, A. Y., Grosse, H., and Schomerus, V.: Combinatorial quantization of the Hamiltonian ChernSimons theory 2. Commun. Math. Phys. 174, 561–604 (1995) 4. Alekseev, A. Y., and Malkin, A. Z.: The hyperbolic moduli space of flat connections and the isomorphism of symplectic multiplicity spaces. Preprint dg-ga/9603017 (1996) 5. Ashtekar, A., and Pierri, M.: Probing quantum gravity through exactly soluble midi-superspaces I. J. Math. Phys. 37, 6250–6270 (1996) 6. Atiyah, M., and Bott, R.: The Yang-Mills equations over Riemann surfaces. Phil. Trans. R. Soc. Lond. A308, 523–615 (1982) 7. Axelrod, S., Witten, E., and Pietra, S. D.: Geometric quantization of Chern-Simons gauge theory. J. Diff. Geom. 33, 787–902 (1991) 8. Bais, F.A., and Sasaki, R.: On the algebraic structure of selfdual gauge fields and sigma models. Nucl. Phys. B227, 75–120 (1983) 9. Barvinsky, A., and Kunstatter, G.: Exact physical black hole states in generic 2-d dilaton gravity. Phys. Lett. B389, 231–237 (1996) 10. Belinskii, V., and Zakharov, V.: Integration of the Einstein equations by means of the inverse scattering problem technique and construction of exact soliton solutions. Sov. Phys. JETP 48, 985–994 (1978) 11. Bobenko, A.I.: Surfaces in terms of 2 by 2 matrices. Old and new integrable cases. In: A. Fordy and J.C. Wood (eds.), Harmonic Maps and Integrable Systems. Vieweg, 1993 12. Breitenlohner, P., and Maison, D.: On the Geroch group. Ann. Inst. H. Poincaré. Phys. Théor. 46, 215–246 (1987) 13. Breitenlohner, P., Maison, D., and Gibbons, G.: Four-dimensional black holes from Kaluza-Klein theories. Commun. Math. Phys. 120, 295–333 (1988) 14. Callan, C., Giddings, S., Harvey, J., and Strominger, A.: Evanescent black holes. Phys. Rev. D45, R 1005–1009 (1992) 15. Courant, R., and Hilbert, D.: Methoden der mathematischen Physik. Berlin: Springer Verlag, 1931 16. Cvetiˇc, M., and Hull, C. M.: Black holes and U duality. Nucl. Phys. B480, 296–316 (1996) 17. Cvetiˇc, M., and Youm, D.: BPS saturated dyonic black holes of N=8 supergravity vacua. In: E. Gava, K. Narain, and C. Vafa (eds.), S Duality and Mirror Symmetry, Nucl. Phys. B (Proc. Supl.) vol. 46, Amsterdam: North-Holland, 1995 18. Dickey, L.A.: Field theoretical (multitime) Lagrange-Hamiltonian formalism and integrable equations. In: O. Babelon, P. Cartier and Y. Kosmann-Schwarzbach (eds.), Lectures on Integrable Systems. Singapore: World Scientific, 1994 19. Dirac, P.A.M.: Lectures on Quantum Mechanics. New York: Academic Press, 1967 20. Drinfeld, V.: Hopf algebras and the quantum Yang-Baxter equation. Soviet Math. Dokl. 32, 254–258 (1985) 21. Drinfeld, V.: Quasi-Hopf algebras and Knizhnik-Zamolodchikov equations. In: Problems of modern quantum field theory (Alushta 1989), pp. 1–13. Berlin: Springer-Verlag, 1989 22. Drinfeld, V.: Quasi-Hopf algebras. Leningrad Math. J. 1, 1419–1457 (1990) 23. Elitzur, S., Moore, G., Schwimmer, A., and Seiberg, N.: Remarks on the canonical quantization of the Chern-Simons-Witten theory. Nucl. Phys. B326, 108–134 (1989)

456


24. Faddeev, L.: Integrable models in (1+1) dimensional quantum field theory. In: J.-B. Zuber and R. Stora (eds.), Les Houches, Session XXXIX, Recent advances in field theory and statistical mechanics, Amsterdam: North-Holland, 1984, pp. 561–608 25. Faddeev, L.: From integrable models to conformal field theories via quantum groups. In: L. Ibert and M. Rodriguez (eds.), Proceedings of NATO ASI and XXII GIFT International Seminar on Recent Problems in Mathematical Physics, Salamanca, Spain 1992, Dordrecht: Kluwer Academic Publishers, 1992, pp. 1–24 26. Faddeev, L., and Takhtajan, L.: Hamiltonian Methods in the Theory of Solitons. Berlin: Springer-Verlag, 1987 27. Filippov, A.: Exact solutions of (1+1)-dimensional dilaton gravity coupled to matter. Mod. Phys. Lett. A11, 1691–1704 (1996) 28. Fock, V.V., and Rosly, A.A.: Poisson structures on moduli of flat connections on Riemann surfaces and r-matrices. Preprint ITEP 72–92, Moscow (1992) 29. Frenkel, I., and Reshetikhin, N.: Quantum affine algebras and holonomic difference equations. Commun. Math. Phys. 146, 1–60 (1992) 30. Gal’tsov, D.V.: Integrable systems in stringy gravity. Phys. Rev. Lett. 74, 2863–2866 (1995) 31. Gawe¸dzki, K., and Kupiainen, A.: SU(2) Chern-Simons theory at genus zero. Commun. Math. Phys. 135, 531–546 (1991) 32. Gegenberg, J., Kunstatter, G., and Louis-Martinez, D.: Observables for two-dimensional black holes. Phys. Rev. D51, 1781–1786 (1995) 33. Geroch, R.: A method for generating solutions of Einstein’s equations. J. Math. Phys. 12, 918–924 (1971) 34. Goldman, W.: Invariant functions on Lie groups and Hamiltonian flows of surfaces group representations. Invent. Math. 85, 263–302 (1986) 35. Hennaux, M., and Teitelboim, C.: Quantization of Gauge Systems. Princeton: Princeton University Press, 1992 36. Hitchin, N.: Frobenius manifolds. to appear 37. Jimbo, M.: A q-difference analogue of u(g) and the Yang-Baxter equation. Lett. Math. Phys. 10, 63–69 (1985) 38. Jimbo, M., Miwa, T., Môri, Y., and Sato, M.: Density matrix of an impenetrable Bose gas and the fifth Painlevé transcendent. Physica 1D, 80–158 (1980) 39. Jimbo, M., Miwa, T., and Ueno, K.: Monodromy preserving deformation of linear ordinary differential equations with rational coefficients. Physica 2D, 306–352 (1981) 40. Julia, B.: Group disintegrations. In: S. Hawking and M. Rocek (eds.), Superspace and Supergravity, Cambridge: Cambridge University Press, 1980, pp. 331–350 41. Julia, B.: Application of supergravity to gravitation theories. In: V. D. Sabbata and E. Schmutzer (eds.), Unified Field Theories in more than 4 Dimensions, Singapore: World Scientific, 1983, pp. 215–236 42. Julia, B., and Nicolai, H.: Conformal internal symmetry of 2-D sigma models coupled to gravity and a dilaton. Nucl. Phys. B482, 431–465 (1996) 43. Kassel, C.: Quantum Groups. No. 155 in Graduate texts in mathematics. Berlin: Springer-Verlag, 1995 44. Kastrup, H.: Canonical theories of lagrangian dynamical systems in physics. Phys. Rep. 101, 1–167 (1983) 45. Knizhnik, V., and Zamolodchikov, A. B.: Current algebra and Wess-Zumino model in two-dimensions. Nucl. Phys. B247, 83–103 (1984) 46. Korepin, V. E., Bogoliubov, N. M., and Izergin, A. G.: Quantum inverse scattering method and correlation functions. Cambridge: Cambridge University Press, 1993 47. Korotkin, D., and Nicolai, H.: Separation of variables and Hamiltonian formulation for the Ernst equation. Phys. Rev. Lett. 74, 1272–1275 (1995) 48. Korotkin, D., and Nicolai, H.: An integrable model of quantum gravity. Phys. Lett. B356, 211–216 (1995) 49. Korotkin, D., and Nicolai, H.: Isomonodromic quantization of dimensionally reduced gravity. Nucl. Phys. B475, 397–439 (1996) 50. Korotkin, D., and Samtleben, H.: On the quantization of isomonodromic deformations on the torus. Int. J. Mod. Phys. A12, 2013–2029 (1997) 51. Korotkin, D., and Samtleben, H.: Poisson realization and quantization of the Geroch group. Preprint DESY-96-245, gr-qc/9611061 (1996), Class. Quantum Grav. to appear 52. Kuchaˇr, K.: Canonical Quantization of Cylindrical Gravitational Waves. Phys. Rev. D4, 955–986 (1971)


457

53. Maharana, J.: Symmetries of the dimensionally reduced string effective action. Mod. Phys. Lett. A11, 9–17 (1996) 54. Maison, D.: Are the stationary, axially symmetric Einstein equations completely integrable? Phys. Rev. Lett. 41, 521–522 (1978) 55. Nicolai, H.: Two-dimensional gravities and supergravities as integrable systems. In: H. Mitter and H. Gausterer (eds.), Recent Aspects of Quantum Fields. Berlin: Springer-Verlag, 1991 56. Nicolai, H., Korotkin, D., and Samtleben, H.: Integrable classical and quantum gravity. To appear in: G. Mack, G. t’Hooft, A. Jaffe, H. Mitter and R. Stora (eds.), Quantum Fields and Quantum Space Time, Proceedings NATO-ASI, Cargèse 1996. New York: Plenum Press, 1997 57. Reshetikhin, N.: The Knizhnik-Zamolodchikov system as a deformation of the isomonodromy problem. Lett. Math. Phys. 26, 167–177 (1992) 58. Schemmel, M.: Diploma thesis, Hamburg (1997) 59. Semenov-Tian-Shansky, M.A.: Monodromy map and classical r-matrices. J. Math. Sci. 77, 3236–3242 (1995); translation from Zap. Nauchn. Semi. POMI 200, 156–166 (1992) 60. Smirnov, F. A.: Dynamical symmetries of massive integrable models, 1. Form-factor bootstrap equations as a special case of deformed Knizhnik-Zamolodchikov equations. Int. J. Mod. Phys. A7, suppl. 1B, 813–838 (1991) 61. Witten, E.: Quantum field theory and the Jones polynomial. Commun. Math. Phys. 121, 351–399 (1989) 62. Zamolodchikov, A.B., and Fateev, V.A.: Operator algebras and correlation functions in the twodimensional Wess-Zumino SU (2) × SU (2) chiral model. Yad. Fiz. 43, 1031–1044 (1986) Communicated by T. Miwa


Commun. Math. Phys. 190, 459 – 489 (1997)

Communications in


Local BRST Cohomology and Covariance Friedemann Brandt? Instituut voor Theoretische Fysica, Katholieke Universiteit Leuven, Celestijnenlaan 200 D, B–3001 Leuven, Belgium Received: 25 July 1996 / Accepted: 23 April 1997

Abstract: The paper provides a framework for a systematic analysis of the local BRST cohomology in a large class of gauge theories. The approach is based on the cohomology of s + d in the jet space of fields and antifields, s and d being the BRST operator and exterior derivative respectively. It relates the BRST cohomology to an underlying gauge covariant algebra and reduces its computation to a compactly formulated problem involving only suitably defined generalized connections and tensor fields. The latter are shown to provide the building blocks of physically relevant quantities such as gauge invariant actions, Noether currents and gauge anomalies, as well as of the equations of motion. 1. Introduction 1.1. Motivation. Gauge invariance underlies as a basic principle our present models of fundamental interactions and is widely used when one looks for extensions of these models. The BRST-BV formalism provides a general framework to deal with many aspects of gauge symmetry, both in classical and quantum field theory. It was first established by Becchi, Rouet and Stora [1] in the context of renormalization of abelian Higgs–Kibble and Yang–Mills gauge theories, later extended by Kallosh to supergravity with open gauge algebra [2] (see also [3]) and by de Wit and van Holten to general gauge theories [4], resulting finally in the universal field-antifield formalism of Batalin and Vilkovisky [5] which allows to treat all kinds of gauge theories within an elegant unified framework. The usefulness of this formalism is mainly based on the fact that it encodes the gauge symmetry and all its properties in a single antiderivation which is strictly nilpotent on all the fields and antifields. Throughout this paper, this antiderivation is called the BRST operator and denoted by s. ?

Junior fellow of the research council (DOC) of the K.U. Leuven.

460

F. Brandt

The nilpotency of s establishes in particular the local BRST cohomology, i.e. the cohomology of s in the space of local functionals (= integrated local volume forms) of the fields and antifields. This cohomology has many physically relevant applications. It determines for instance gauge invariant actions and their consistent deformations [6], the dynamical local conservation laws [7] and the possible gauge anomalies (see e.g. [8, 1, 9, 10, 11, 12]) of a gauge theory and is a useful tool in the renormalization of quantum field theories even when a theory is not renormalizable in the usual sense [13]. Since the BRST cohomology can be defined for any gauge theory and since the correspondence of its cohomology classes to the mentioned physical quantities is universal too, it is worthwhile to look for a suitable general framework within which this cohomology can be computed efficiently and which has a large range of applicability. The purpose of this paper is to propose such a framework. It applies to a large class of gauge theories and relates the BRST cohomology to an underlying gauge covariant algebra. This includes a definition of tensor fields on which this algebra is realized and of generalized connections associated with it, and reduces the computation of the BRST cohomology locally to a problem involving only these quantities. The reduced problem is formulated very compactly in terms of identities analogous to the “Russian formula" in Yang–Mills theory [9, 14]1 , F = (s + d)(C + A) + (C + A)2 .

(1.1)

Here C, A and F are the familiar Lie algebra valued Yang–Mills ghost fields, connection and curvature forms respectively, s is the Yang–Mills BRST operator, and d is the spacetime exterior derivative. The usefulness of (1.1) is based, among others, on its remarkable property to compress the familiar BRST transformations of the Yang–Mills ghost and gauge fields, as well as the construction of the field strength in terms of the gauge field into a single identity. The combination C +A occurring in (1.1) is an example of what will be called a generalized connection here. 1.2. Relations and differences to other approaches. The proposed approach generalizes a concept outlined in [15] (see also [16]) for the study of the “restricted" (= antifield independent) BRST cohomology in a special class of gauge theories characterized among others by (a) the presence of (spacetime) diffeomorphisms among the gauge symmetries, (b) the closure and irreducibility of the gauge algebra, (c) the presence of “enough” independent gauge fields ensuring that all the derivatives of the ghost fields can be eliminated from the BRST cohomology. In such theories, the extension of the concept of [15] to the full cohomological problem, including the antifields, is (more or less) straightforward and was used already in [17, 18] within a complete computation of the BRST cohomology in Einstein gravity and Einstein–Yang–Mills theories. Here these ideas are extended to general gauge theories. In particular none of the conditions (a)–(c) is needed as a prerequisite for the methods outlined in this paper. This is possible thanks to suitable generalizations of the concept [15] which at the same time modify and unite various techniques that have been developed over the last 20 years, thereby revealing relations between them which are less apparent in other approaches. Such techniques, to be described later in detail, are the so-called descent equation technique, the use of contracting homotopies in jet spaces, compact formulations of the BRST algebra analogous to the “Russian formula” (1.1), and spectral sequence techniques along the lines of homological perturbation theory [19, 20, 21]. Let me now briefly comment on the use of these techniques in this paper, as compared to other approaches. 1 Originally the term “Russian formula" was introduced by Stora in the second ref. in [9] for a different but related identity. Here it is used as in the last ref. in [9].

Local BRST Cohomology and Covariance

461

Descent equations and the “Russian formula” were first used within the celebrated differential geometric construction of (representatives of) chiral anomalies in D = 2n dimensions from characteristic classes in D + 2 dimensions [9], and also within the classification of such anomalies in [22]. Later it became clear that the descent equations are useful not only in connection with chiral anomalies, but to analyse the complete BRST cohomology, cf. e.g. [23, 24, 15, 11, 16]. The reason is that they allow to deal efficiently with the total derivatives into which the integrands of BRST invariant functionals transform in general. In this paper we will compress the descent equations into a compact form. To this end the BRST operator s and the exterior derivative d will be united to the single operator s˜ = s + d

(1.2)

defined on local “total forms" (see Sect. 2). This idea is not new; in fact it is familiar from the construction and classification of chiral anomalies mentioned above. However, somewhat surprisingly, it was not utilized systematically in a general approach to the BRST cohomology on local functionals later. The systematic use of s˜ is fundamental to the method proposed here and has several advantages. In particular it allows us to extend the concept of [15] to theories which do not satisfy the assumptions (a)–(c) mentioned above, such as Yang–Mills theory whose BRST cohomology has been calculated by different means in [25, 24, 26, 27]. The use of s˜ is particularly well adapted to the analysis of the BRST cohomology on local functionals because the latter is in fact isomorphic to the cohomology of s˜ on local total forms, at least locally, cf. [16] and Sect. 32 . Contracting homotopies similar to the ones used here were constructed and applied to BRST cohomological problems e.g. already in [25, 23, 24]. However, these contracting homotopies were designed for the cohomology of s [24] and its linearized version [25, 23] respectively. The method proposed here extends them to the s-cohomology. ˜ This has the important consequence that it leads directly to the mentioned compact formulation of the cohomological problem in terms of identities analogous to the “Russian formula” (1.1). For instance, when applied to Yang–Mills theory, the contracting homotopy for s˜ singles out the special combination (generalized connection) A + C occurring in (1.1). As a result, (1.1) itself arises naturally in this approach, cf. Sect. 7. In contrast, the corresponding contracting homotopy [24] for s gives instead of A + C just C and makes no contact with the “Russian formula” (it does however provide the same tensor fields). The proposed approach also extends the methods developed in [21] to use and deal with the antifields along the lines of homological perturbation theory [19, 20]. This extension is straightforward and, again, related to the use of s˜ instead of s. Among others it will allow us to trace the BRST cohomology at all ghost numbers (including negative ones) back to a weak (= on-shell) cohomological problem involving the tensor fields and generalized connections only. This has been utilized recently in [28] in order to compute the BRST cohomology in four dimensional N = 1 supergravity. Finally, the approach provides a “cohomological” perspective on tensor fields and connections. The latter are usually characterized through specific transformation properties under the respective symmetries. However, in a general gauge theory it is not always clear from the outset which transformation laws should be imposed for this purpose. An advantage of the approach proposed here is that such transformation laws need not be 2 The isomorphism applies only to the BRST cohomology on local functionals, i.e. to the relative cohomology H(s|d) on local volume forms. It does not extend to H(s|d) at lower form degrees in general.

462

F. Brandt

specified from the start. Rather, they emerge from the approach itself. Such a characterization of tensor fields, connections and the corresponding transformation laws has two major advantages: (i) it is purely algebraic and does not invoke any concepts in addition to the BRST cohomology itself; (ii) it is physically meaningful because the resulting tensor fields and generalized connections turn out to provide among others the building blocks of gauge invariant actions, Noether currents, anomalies and of the equations of motions. 1.3. Outline of the paper. The paper has been organized as follows. Section 2 sketches the basic algebraic approach to the BRST cohomology used in this paper and introduces some terminology and notation. Sections 3 and 4 relate the local BRST cohomology to the cohomology of s˜ and its weak (= “on-shell") counterpart. Section 5 introduces the concept of contracting homotopies for s˜ in jet spaces, and Sect. 6 shows that this concept is intimately related to the existence of a gauge covariant algebra and a compact formulation of the BRST algebra on tensor fields and generalized connections. Section 7 illustrates the method for various examples which do not satisfy the aforementioned assumptions (a)–(c) of [15] (the examples are Yang–Mills theory, Einstein gravity in the metric formulation, supergravity with open gauge algebra and two-dimensional Weyl invariant sigma models). Sections 8–10 spell out implications for the structure of gauge invariant actions, Noether currents, gauge anomalies, etc., as well as for the classical equations of motion. In Sect. 11 a special aspect of the cohomological problem is discussed, concerning the explicit dependence of the solutions on the coordinates of the base manifold which will be called “spacetime” henceforth, for no reason at all. The paper is ended by some concluding remarks in Sect. 12 and two appendices containing details concerning the algebraic approach and conventions used in the paper.

2. Algebraic Setting, Definitions and Notation In order to define the local BRST cohomology in a particular theory one has to specify the BRST operator s and the space in which its cohomology is to be computed. The BRST operator is defined on a set of fields 8A and corresponding antifields 8∗A according to standard rules of the field-antifield formalism summarized in Appendix B. In particular these rules include that the BRST operator is nilpotent and commutes with the spacetime derivatives ∂µ , (2.1) s2 = s∂µ − ∂µ s = ∂µ ∂ν − ∂ν ∂µ = 0. The basic concept underlying these fundamental relations and the whole paper is the jet bundle approach [29] sketched in Appendix A. Essentially this means simply that the fields, antifields and all their derivatives are understood as local coordinates of an infinite jet space. For this set of jet coordinates the collective notation [8, 8∗ ] is used. The local jet coordinates are completed by the spacetime coordinates xµ and the differentials dxµ . The differentials are counted among the jet coordinates by pure convention and convenience. The derivatives ∂µ are defined as total derivative operators in the jet space, cf. Eq. (A.6), and become usual partial derivatives on the local sections of the jet bundle. The concrete BRST transformations of the fields and antifields depend on the particular theory and its gauge symmetry, whereas the spacetime coordinates xµ and differentials dxµ are always BRST invariant in accordance with the second relation (2.1), s xµ = 0,

s dxµ = 0.

(2.2)


463

The use of the differentials is in principle not necessary but turns out to be very useful in order to analyse the local BRST cohomology. In particular it allows to define d = dxµ ∂µ and s˜ = s + d in the jet space. The relations (2.1) are equivalent to the nilpotency of s, ˜ s˜2 = 0

⇔

s2 = sd + ds = d2 = 0.

(2.3)

The usefulness of s˜ in the context of the local BRST cohomology stems from the fact that it allows to write and analyse the descent equations in a compact form (cf. Sect. 3). The descent equations involve local p-forms ωp =

1 dxµ1 . . . dxµp ωµ1 ...µp (x, [8, 8∗ ]). p!

(2.4)

These forms are required to be local in the sense that they are formal series’ in the antifields, ghosts and their derivatives such that each piece with definite antighost number (cf. [21] and section 4) depends polynomially on the derivatives of all the fields and antifields. From the outset no additional requirements are imposed on local forms here. In particular they are not restricted by power counting, it is not assumed that the indices µi of the functions ωµ1 ...µp occurring in (2.4) indicate their actual transformation properties under Lorentz or general coordinate transformations, and local forms are not required to be globally well-defined in whatever sense. R A local functional is by definition an integrated local volume form ωD (throughout this paper D denotes the spacetime dimension). It is called BRST invariant if sωD is d-exact in the space of local forms, i.e. if sωD + dωD−1 = 0 holds for some local form ωD−1 . Translated to the local sections of the jet bundle, in general this requires local functionals to be BRST invariant only up to surface integrals. Analogously a local R functional ωD is called BRST-exact (or trivial) if ωD = sηD + dηD−1 holds for some local forms ηD and ηD−1 . The BRST cohomology on local functionals considered here is thus actually the relative cohomology H(s|d) of s and d on local volume forms. This cohomology is well-defined due to (2.3) and represented by solutions ωD of sωD + dωD−1 = 0,

ωD 6= sηD + dηD−1 .

(2.5)

In the next section H(s|d) will be related to the cohomology of s˜ on local total forms ω. ˜ The latter are by definition formal sums of local forms with different form degrees, X ωp . (2.6) ω˜ = p

The s-cohomology ˜ on local total forms is then defined through the condition s˜ω˜ = 0 modulo trivial solutions of the form s˜η˜ + constant, where η˜ is a local total form and the constant is included for convenience. The representatives of this cohomology are thus local total forms ω˜ solving s˜ω˜ = 0,

ω˜ 6= s˜η˜ + constant.

(2.7)

The natural degree in the space of local total forms is the sum of the ghost number (gh) and the form degree (formdeg), called the total degree (totdeg), totdeg = gh + formdeg .

(2.8)

A local total form with definite total degree G is thus a sum of local p-forms with ghost number g = G − p (p = 0, . . . , D). s˜ has total degree 1, i.e. it maps a local total form with total degree G to another one with total degree G + 1.

464

F. Brandt

3. Descent Equations It is easy to see that the BRST cohomology on local functionals is locally isomorphic to the cohomology of s˜ on local total forms3 . To show this, one only needs (2.3) and a theorem on the cohomology of d on local forms, sometimes called the algebraic Poincaré lemma. The latter states that locally any d-closed local p-form is d-exact for 0 < p < D and constant for p = 0, while local volume forms (p = D) are locally d-exact if and only if they have vanishing Euler–Lagrange derivative with respect to all the fields and antifields [30, 25, 16]. The local isomorphism of the cohomological problems associated with (2.5) and (2.7) can be derived by standard arguments which are therefore only sketched. Suppose that ωD solves sωD +dωD−1 = 0. Applying s to this equation results in d(sωD−1 ) = 0 due to (2.3). Hence, sωD−1 is d-closed. Since it is not a volume form, it is thus also d-exact in the space of local forms according to the algebraic Poincaré lemma. Hence, there is a (possibly vanishing) local (D − 2)-form ωD−2 satisfying sωD−1 + dωD−2 = 0. Iterating the arguments one concludes the existence of a set of local forms ωp , p = p0 , . . . , D satisfying (3.1) sωp + dωp−1 = 0 for D ≥ p > p0 ; sωp0 = 0 for some p0 . These equations are called the descent equations4 . They can be compactly written in the form D X ωp . s˜ ω˜ = 0, ω˜ = p=p0

˜ local total form Hence, any solution of sωD + dωD−1 = 0 corresponds to an s-closed and the reverse is evidently also true. Using again the algebraic Poincaré lemma and (2.3), it is easy to see that ωD is a trivial solution of the form sηD + dηD−1 if and only if ω˜ is trivial too, i.e. if and only if ω˜ = s˜η˜ + constant. Since ω˜ has total degree (g + D) if ωD has ghost number g we conclude Lemma 3.1. The BRST-cohomology on local functionals with ghost number g and the s-cohomology ˜ on local total forms of total degree G = g + D are locally isomorphic. That is to say, locally the solutions of (2.5) with ghost number g correspond one-to-one (modulo trivial solutions) to the solutions of (2.7) with total degree G = g + D.

4. Equivalence to the Weak Cohomology of γ˜ = γ + d A simple and useful concept in the study of the BRST cohomology is a suitable expansion of local functionals and forms in powers of the antifields. Following the lines of [21] it will now be used to show that the s-cohomology ˜ on local total forms of the fields and antifields reduces to a weak (= on-shell) cohomology on antifield independent local total forms. 3 Here and in the following local equalities or isomorphisms refer to sufficiently small patches of the jet space. Global properties of the jet bundle are not taken into account. 4 For p = 0 the algebraic Poincar´ e lemma alone actually implies only sω0 = const.; however, in 0 meaningful gauge theories a BRST-exact constant vanishes necessarily, as one easily verifies (note that a constant can occur only if ω0 has ghost number −1). Notice that this might not hold anymore if one extends the BRST–BV formalism by including constant ghosts corresponding, e.g. to global symmetries [15, 16]. Such an extension is always possible [31] but not considered here.


465

The most useful expansion in the antifields takes their respective ghost numbers into account. This is achieved through the so-called antighost number (antigh) defined according to antigh(8∗A ) = −gh(8∗A ),

antigh(8A ) = antigh(dxµ ) = antigh(xµ ) = 0.

(4.1)

In particular the BRST operator can be decomposed into pieces with definite antighost number (one says a piece has antighost number k if it raises the antighost number by k units). The decomposition of s starts always with a piece of antighost number −1, X sk , antigh(δ) = −1, antigh(γ) = 0, antigh(sk ) = k. (4.2) s=δ+γ+ k≥1

The most important pieces in this decomposition are δ and γ; the other pieces have positive antighost number and play only a secondary role in the cohomological analysis. δ is the so-called Koszul–Tate differential and is nonvanishing only on the antifields, δ8A = 0,

δφ∗i =

∂ˆ R Lcl , ˆ i ∂φ

... .

(4.3)

ˆ i denotes the Euler–Lagrange right-derivative of the classical Lawhere ∂ˆ R Lcl /∂φ grangian Lcl w.r.t. φi . In particular δ thus implements the classical equations of motion in the cohomology. γ encodes the gauge transformations because γφi equals a gauge transformation of φi with parameters replaced by ghosts, i γφi = Rα Cα ,

(4.4)

where the notation of Appendix B is used. Equation (4.2) extends straightforwardly to the analogous decomposition of s˜ = s+d into pieces with definite antighost numbers. Since d has vanishing antighost number, one simply gets X sk (4.5) s˜ = δ + γ˜ + k≥1

with γ˜ = γ + d.

(4.6)

2

Note that s˜ = 0 decomposes into δ 2 = 0,

δ γ˜ + γδ ˜ = 0,

γ˜ 2 = −(δs1 + s1 δ),

... .

(4.7)

The usefulness of the decomposition (4.5) is due to the acyclicity of the Koszul–Tate differential δ on local functions at positive antighost number [20, 21, 32]. This means that the cohomology of δ on local total forms is trivial at positive antighost number,5 δ ω˜ k = 0,

antigh(ω˜ k ) = k > 0

⇒

ω˜ k = δ η˜k+1 .

(4.8)

Using standard arguments of spectral sequence techniques which are not repeated here, one concludes from (4.8) immediately that a nontrivial solution of s˜ω˜ = 0 contains necessarily an antifield independent part ω˜ 0 solving 5 An analogous statement does not hold for the relative cohomology of δ and d. Indeed there are in general solutions of (2.5) which contain no antifield independent part. Such solutions correspond to local conservation laws [7].

466

F. Brandt

γ˜ ω˜ 0 ≈ 0,

ω˜ 0 6≈ γ˜ η˜0 + constant,

antigh(ω˜ 0 ) = 0,

(4.9)

(antigh(Ak ) = k).

(4.10)

where ≈ denotes weak equality defined through A0 ≈ 0

:⇔

∃A1 :

A0 = δA1

Note that the weak equality is an “on-shell equality" since, due to (4.3), A0 ≈ 0 implies that A0 vanishes for solutions of the classical equations of motion. Furthermore (4.7) and (4.8) imply that each solution ω˜ 0 of (4.9) can be completed to a nontrival solution ω˜ = ω˜ 0 + . . . of (2.7) and that two different completions with the same antifield independent part are equivalent in the cohomology of s˜ (the latter follows immediately from the fact that the difference of two such completions has no antifield independent part). This establishes the following result: Lemma 4.1. The cohomology of s˜ on local total forms is isomorphic to the weak cohomology of γ˜ on antifield independent local total forms. That is to say, any solution ω˜ of (2.7) contains an antifield independent part ω˜ 0 solving (4.9), and any solution ω˜ 0 of ˜ (4.9) can be completed to a solution of (2.7) which (for fixed ω˜ 0 ) is unique up to s-exact contributions. Remark. The weak cohomology of γ˜ on antifield independent local total forms is welldefined since γ˜ is weakly nilpotent on these forms, antigh(A0 ) = 0

⇒

γ˜ 2 A0 ≈ 0.

(4.11)

This follows immediately from the third identity (4.7) due to δA0 = 0.

5. Elimination of Trivial Pairs A well-known technique in the study of cohomologies is the use of contracting homotopies. I will now describe how one can apply it within the computations of the s-cohomology ˜ and of the weak γ-cohomology ˜ introduced in the previous sections. The idea is to construct contracting homotopy operators which allow to eliminate certain local jet coordinates, called trivial pairs, from the cohomological analysis. This reduces the cohomological problem to an analogous one involving only the remaining jet coordinates. For that purpose one needs to construct suitable sets of jet coordinates replacing the fields, antifields and their derivatives and satisfying appropriate requirements. In this section I will specify such requirements and show that they allow to eliminate trivial pairs. In Sect. 7 various explicit examples will be discussed to illustrate how one constructs these special jet coordinates in practice. The contracting homotopies and the trivial pairs for the s˜ and the weak γ˜ cohomology are usually closely related. Nevertheless, in practical computations the use of one or the other may be more convenient. Moreover it is often advantageous to combine them. For instance one may first use a contracting homotopy for the s-cohomology ˜ that eliminates some fields or antifields completely, such as the antighosts and the corresponding Nakanishi–Lautrup auxiliary fields used for gauge fixing, and then analyse the remaining problem by investigating the weak γ-cohomology. ˜ The arguments will be worked out in detail only for the weak γ-cohomology ˜ which is more subtle due to the occurrence of weak instead of strict equalities. In contrast, the s-cohomology ˜ can be treated using standard arguments which imply:


467

Lemma 5.1. Suppose there is a set of local jet coordinates B = {U ` , V ` , W i } such that the change of local jet coordinates from {[8A , 8∗A ], xµ , dxµ } to B is local and locally invertible6 and s˜ U ` = V ` ∀ ` , s˜ W i = Ri (W) ∀ i .

(5.1) (5.2)

Then locally the U ’s and V’s can be eliminated from the s-cohomology, ˜ i.e. the latter reduces locally to the s-cohomology ˜ on local total forms depending only on the W’s. The (U ` , V ` ) are called trivial pairs. As already mentioned, Lemma 5.1 can be used in particular to eliminate the antighosts, Nakanishi–Lautrup fields and their antifields completely from the cohomological analysis because they (and all their derivatives) form trivial pairs, cf. e.g. [25] and [7], Sect. 14. In the following these fields will be therefore neglected without loss of generality. Let me now turn to the derivation of an analogous result for the weak γ-cohomology ˜ on antifield independent local total forms. Let us assume that there is a local and locally invertible change of jet coordinates from the antifield independent set {[8A ], xµ , dxµ } to {U ` , V ` , W i } such that7 γU ˜ ` = V ` ∀` , γW ˜ i = Ri (W ) ∀ i .

(5.3) (5.4)

Furthermore one can assume (without loss of generality) that each of the U ’s, V ’s and W ’s has a definite total degree. Note that all these degrees are nonnegative because the U ’s, V ’s and W ’s do not involve antifields and because it is assumed that antighosts and Nakanishi–Lautrup fields have been eliminated already. Again, the (U ` , V ` ) are called trivial pairs. In order to deal with weak equalities the following lemma will be useful later on: Lemma 5.2. Any weakly vanishing local total form f (U, V, W ) is a combination of weakly vanishing functions LK (W ) in the sense that f (U, V, W ) ≈ 0

⇔

f (U, V, W ) = aK (U, V, W )LK (W ),

LK (W ) ≈ 0

(5.5)

for some local total forms aK . Proof. Since the classical equations of motion have vanishing total degree and do not involve antifields, they are expressible solely in terms of the U ’s and W ’s because the V ’s have positive total degrees as a direct consequence of (5.3) (in fact only those U ’s and W ’s with vanishing total degrees can occur in the equations of motion). To prove (5.5) it is therefore sufficient to consider functions depending only on the U ’s and W ’s. Now, if a function f (U, W ) vanishes weakly then the same holds for its γ-transformation ˜ due to the second identity in (4.7), for the latter implies f = δg ⇒ γf ˜ = −δ(γg) ˜ ≈ 0. Using (5.3) and (5.4) one concludes f (U, W ) ≈ 0

⇒

γf ˜ (U, W ) = V ` ∗

∂f (U, W ) ∂f (U, W ) + Ri (W ) ≈ 0. ` ∂U ∂W i

(5.6)

6 I.e. locally any local total form f ([8, 8 ], dx, x) can be uniquely expressed as a local total form g(U , V, W) and vice versa. 7 One may replace the equalities in (5.3) and (5.4) by weak equalities without essential changes in the following arguments.

468

F. Brandt

Since the U ’s, V ’s and W ’s are by assumption independent local jet coordinates, and since the V ’s do not occur in the equations of motion, one concludes from (5.6) (for instance by differentiating γf ˜ (U, W ) w.r.t. to V ` ) that f (U, W ) ≈ 0 implies ` ∂f (U, W )/∂U ≈ 0. Iteration of the argument yields f (U, W ) ≈ 0

⇒

∂ k f (U, W ) ≈0 ∂U `1 . . . ∂U `k

∀k .

(5.7)

Thus a weakly vanishing function f (U, W ) must be a combination of weakly vanishing functions of the W ’s which proves (5.5). I remark that Lemma 5.2 implies in particular that the equations of motion themselves are equivalent to a set of equations involving only those W ’s with vanishing total degree. This result will be interpreted in Sect. 10 as the covariance of the equations of motion. We are now prepared to prove that the U ’s and V ’s can be eliminated from the weak γ-cohomology: ˜ Lemma 5.3. Suppose there is a local and locally invertible change of jet coordinates replacing {[8A ], xµ , dxµ } by a set {U ` , V ` , W i } satisfying (5.3) and (5.4). Then locally the U ’s and V ’s can be eliminated from the weak γ-cohomology ˜ on antifield independent local total forms, γ˜ ω˜ 0 (U, V, W ) ≈ 0

⇒

ω˜ 0 (U, V, W ) ≈ f (W ) + γ˜ η˜0 (U, V, W ),

(5.8)

i.e. locally this cohomology is represented by solutions of γf ˜ (W ) ≈ 0,

f (W ) 6≈ γg(W ˜ ) + constant.

(5.9)

Proof. By assumption, locally any antifield independent local total form can be written in terms of the U ’s, V ’s and W ’s. To construct a contracting homotopy a parameter t is introduced scaling the U ’s and V ’s according to Ut` := tU ` ,

Vt` := tV ` .

(5.10)

On total forms ω˜ 0 (Ut , Vt , W ) one then defines an operator b through b = U`

1 ∂ ∂ = U` . ` ` t ∂V ∂Vt

(5.11)

˜ t` are defined by replacing in γU ˜ ` and γV ˜ ` all U ’s and V ’s by the correγU ˜ t` and γV ` sponding Ut ’s and Vt ’s. Now, (5.3) implies γV ˜ = γ˜ 2 U ` ≈ 0. Using Lemma 5.2 one thus concludes γV ˜ ` = a`,K (U, V, W )LK (W ), LK (W ) ≈ 0 for some a`,K and LK . Hence one defines γU ˜ t` = Vt` ,

γV ˜ t` = a`,K (Ut , Vt , W )LK (W ).

This shows in particular γV ˜ t` ≈ 0 and one now easily verifies (γb ˜ + bγ) ˜ ω˜ 0 (Ut , Vt , W ) ≈ which implies

∂ ω˜ 0 (Ut , Vt , W ) ∂t

(5.12)


469

Z ω˜ 0 (U, V, W ) − ω˜ 0 (0, 0, W ) ≈

1

dt (γb ˜ + bγ) ˜ ω˜ 0 (Ut , Vt , W ).

(5.13)

0

Applying again Lemma 5.2 one concludes that γ˜ ω˜ 0 (U, V, W ) ≈ 0 implies γ˜ ω˜ 0 (Ut , Vt , W ) ≈ 0. Using this in (5.13) we finally get Z 1 dt b ω˜ 0 (Ut , Vt , W ), γ˜ ω˜ 0 (U, V, W ) ≈ 0 ⇒ ω˜ 0 (U, V, W ) ≈ ω˜ 0 (0, 0, W ) + γ˜ 0

(5.14) ˜ ω˜ 0 (. . .) ≈ γ˜ 0 dt bω˜ 0 (. . .) (the latter holds since γ˜ does not where we used 0 dt γb change the t-dependence up to weakly vanishing terms). This proves the lemma. R1

R1

Remarks. a) It is very important to realize that both (5.1) and (5.2) must hold in order to eliminate U’s and V’s from the cohomology, and that the existence of a pair of jet coordinates satisfying (5.1) does in general not guarantee the existence of complementary W’s fulfilling (5.2). A simple and important counterexample is given by xµ and dxµ which always satisfy sx ˜ µ = dxµ but usually do not form a trivial pair except in diffeomorphism invariant theories, cf. Sect. 11. Analogous remarks apply of course to (5.3) and (5.4). The reader may check that the contracting homotopies for s used in [24, 15, 16] are in fact also based on the construction of variables satisfying requirements analogous to (5.3) and (5.4). b) Clearly the aim is the construction of a set of local jet coordinates containing as many trivial pairs as possible. The difficulty of this construction is in general not the finding of pairs of local jet coordinates satisfying (5.1) resp. (5.3) but the construction of complementary W’s resp. W ’s satisfying (5.2) resp. (5.4). c) Typically the U ’s are components of gauge fields and their derivatives and the V ’s contain the corresponding derivatives of the ghosts, cf. Sect. 7. The W ’s will be interpreted as tensor fields and generalized connections, cf. Sect. 6. d) Lemmas 5.1 and 5.3 are not always devoid of global subtleties, i.e. they can fail to be globally valid. E.g. if the manifold of the U ’s has a nontrivial de Rham cohomology, one cannot always eliminate all the U ’s and V ’s globally (important counterexamples are the vielbein fields in gravitational theories, cf. [18], section 5). In such cases the proof of Lemma 5.3 breaks down globally because some of the functions of the U ’s, V ’s and W ’s occurring in the proof have no globally well-defined extensions. This problem can be dealt with along the lines of [18]. 6. Gauge Covariant Algebra, Tensor Fields and Generalized Connections It will now be shown that the existence of a set of local jet coordinates {U ` , V ` , W i } (with nonempty subset {U ` , V ` }) satisfying (5.3) and (5.4) has a deep origin. Namely it is intimately related to an algebraic structure encoded in (5.4) which will be interpreted as a gauge covariant algebra and leads to the identification of tensor fields and generalized connections mentioned in the introduction. Recall that each local jet coordinate W i has a definite nonnegative total degree since it neither involves antifields nor antighosts. Those W ’s with vanishing total degree are called tensor fields and are denoted by T ı ; the other W ’s are called generalized connections for reasons which will become clear soon. Those generalized connections with total degree 1 are denoted by C˜ N ; the other generalized connections are denoted by Q˜ NG , where G indicates their total degree,

470

F. Brandt

{T ı } = {W i : totdeg(W i ) = 0}, {C˜ N } = {W i : totdeg(W i ) = 1}, {Q˜ NG } = {W i : totdeg(W i ) = G ≥ 2}.

(6.1)

Note that the tensor fields have necessarily vanishing ghost number and form degree, whereas a generalized connection decomposes in general into a sum of local forms with different ghost numbers and corresponding form degrees, C˜ N = Cˆ N + AN , gh(Cˆ N ) = 1, gh(AN ) = 0, G X G G Q˜ NG = Qˆ N gh(Qˆ N p , p ) = G − p.

(6.2) (6.3)

p=0 G The Cˆ N are called covariant ghosts, the AN connection 1-forms and the Qˆ N G connection G-forms. Since γ˜ raises the total degree by one unit, (5.4) and (6.1) imply in particular

γ˜ T ı = C˜ N RN ı (T ), γ˜ C˜ N = 21 (−)εL +1 C˜ L C˜ K FKL N (T ) + Q˜ M2 ZM2 N (T ), γ˜ Q˜ N2 = 21 (−)εL +1 C˜ K C˜ L C˜ M ZM LK N2 (T ) +Q˜ M3 ZM3 N2 (T ) + Q˜ M2 C˜ K ZKM2 N2 (T ),

(6.4) (6.5) (6.6)

.. . for some functions R, F and Z of the tensor fields. Here (εM +1) denotes the Grassmann parity of C˜ M , (6.7) ε(C˜ M ) = εM + 1 . From γ˜ 2 T ı ≈ 0 one concludes, using (6.4) and (6.5), ∂RN ı ∂RM ı − (−)εM εN RN  ≈ −FM N K RK ı ,  ∂T ∂T  ZM2 N RN ı ≈ 0. RM 

(6.8) (6.9)

Equation (6.8) can be written in the compact form [1M , 1N ] ≈ −FM N K (T )1K

(6.10)

where [·, ·] denotes the graded commutator, [1M , 1N ] = 1M 1N − (−)εM εN 1N 1M , and 1N is the operator 1N = RN ı (T )

∂ ∂T

ı

.

Analogously γ˜ 2 C˜ N ≈ 0 implies in particular X ◦ 1M FN P K + FM N R FRP K + ZM N P M2 ZM2 K ≈ 0, MNP

where the graded cyclic sum was used defined by

(6.11)

(6.12)

(6.13)


471

X ◦ XM N P = (−)εM εP XM N P + (−)εN εM XN P M + (−)εP εN XP M N .

(6.14)

MNP

Equation (6.13) are nothing but the Jacobi identities for the algebra (6.10) in presence of possible reducibility relations (6.9). Note that the Grassmann parities of γ˜ and of the ˜ imply the following Grassmann parities and symmetries of the 1’s and F ’s C’s ε(1N ) = εN , ε(FM N K ) = εM + εN + εK FM N K = −(−)εM εN FN M K .

(mod 2), (6.15)

In order to reveal the geometric content of this algebra it is useful to decompose (6.4) and (6.5) into parts with definite ghost numbers. Note that (6.4) reads γ˜ T ı = C˜ N 1N T ı ,

(6.16)

and thus decomposes due to γ˜ = γ + d and (6.2) into γT dT

ı ı

= Cˆ N 1N T ı , = AN 1N T ı .

(6.17) (6.18)

Equation (6.17) can be interpreted as a characterization of tensor fields as gauge covariant quantities. Indeed, recall that tensor fields are constructed solely out of the “classical fields” φ, their derivatives and the spacetime coordinates due to (6.1). Therefore γT equals just a gauge transformation of T with parameters replaced by ghosts. Equation (6.17) requires thus that the gauge transformation of a tensor field involves only special combinations of the parameters and their derivatives (which may involve the classical ˆ Hence, (6.17) characterizes tensor fields too), corresponding to the covariant ghosts C. fields indeed through a specific transformation law. Now, the derivatives ∂µ T of a tensor field are in general not tensor fields since γ(∂µ T ) contains ∂µ Cˆ N . The question arises how to relate ∂µ T to gauge covariant quantities. The answer is encoded in (6.18). Indeed, recall that the AN are 1-forms, AN = dxµ Aµ N .

(6.19)

Equation (6.18) is therefore equivalent to ∂µ T ı = Aµ N 1N T ı .

(6.20)

By assumption, (6.20) holds identically in the fields and their derivatives, with the same set {Aµ N } for all ı. In general this requires that {Aµ N } contains a locally invertible subset {vµ m }. Then (6.20) just defines those 1’s corresponding to {vµ m } in terms of the ∂µ and the other 1’s, and can be regarded as a definition of covariant derivatives. To put this in concrete terms I introduce the notation {Aµ N } = {vµ m , Aµ rˆ },

{1N } = {Dm , 1rˆ },

m = 1, . . . , D,

(6.21)

where the matrix (vµ m ) is assumed to be invertible. The Dm are called covariant derivatives and according to (6.20) they are given by Dm = Vm µ (∂µ − Aµ rˆ 1rˆ ), where Vm µ denotes the inverse of vµ m ,

(6.22)

472

F. Brandt

vµ m Vm ν = δµν ,

vµ m Vn µ = δnm .

(6.23)

I note that neither the vµ m nor the Aµ rˆ are necessarily elementary fields. In particular, some of them may be constant or even zero. Let me finally discuss (6.5) which generalizes the Russian formula (1.1). Its decomposition into pieces with definite ghost number (resp. form degree) reads γ Cˆ N = 21 (−)εL +1 Cˆ L Cˆ K FKL N (T ) + Qˆ M2 ZM2 N (T ), γAµ N = ∂µ Cˆ N − Cˆ L Aµ K FKL N (T ) − Cˆ µ M2 ZM2 N (T ), ∂ µ Aν

N

− ∂ ν Aµ

N

= −Aµ Aν FKL (T ) + Bµν L

K

N

M2

(6.24) (6.25)

ZM2 (T ), N

(6.26)

where the following notation was used: Q˜ N2 = 21 dxµ dxν Bµν N2 + dxµ Cˆ µ N2 + Qˆ N2 .

(6.27)

Equation (6.24) and (6.25) give the γ-transformations of the covariant ghosts and of the Aµ N respectively. Equation (6.26) determines the curvatures (field strengths) corresponding to the “gauge fields" Aµ N . They are given by N (T ) Fmn N = Vm µ Vn ν 2∂[µ Aν] N + 2v[µ k Aν] rˆ Frk ˆ rˆ sˆ N M2 N +Aµ Aν Fsˆ rˆ (T ) − Bµν ZM2 (T ) ,

(6.28)

where the invertibility of the vµ m was used again in order to solve (6.26) for the Fmn N . That the latter should indeed be identified with curvatures follows from the fact that they occur in the commutator of the covariant derivatives, [Dm , Dn ] ≈ −Fmn N 1N .

(6.29)

Note however that some (or all) of these curvatures may be constant or even zero. The Bianchi identities arising from (6.29) are a subset of the identities (6.13), D[m Fnk] N − F[mn M Fk]M N + Zmnk M2 ZM2 N ≈ 0.

(6.30)

Remarks. a) (6.10) can be regarded as a covariant version of the gauge algebra. However it is important to realize that the number of 1’s exceeds in general the number of gauge symmetries, cf. Sect. 7. ˜ occur only in reducible gauge theories because otherwise there are no local jet b) Q’s variables which can correspond to them. c) Considerations similar to those performed here for the W ’s can be of course also applied to the W’s satisfying (5.2). That leads in particular to an extension of the concept to antifield dependent tensor fields. Examples can be found in [17, 18, 33].


473

7. Examples The concept outlined in the previous sections will now be illustrated for four examples, exhibiting different facets of the general formalism. First the concept is shown to reproduce the standard tensor calculus in the familiar cases of Yang–Mills theory and of gravity in the metric formulation. Then pure four dimensional N=1 supergravity without auxiliary fields is discussed. This illustrates the case of an open gauge algebra and is the only example where the number of 1’s and gauge symmetries coincide. Finally Weyl and diffeomorphism invariant sigma models in two spacetime dimensions are considered. In this example one gets an infinite set of generalized connections and corresponding 1-transformations, but no (nonvanishing) curvatures (6.28). I remark that the approach of [15] does not apply to any of these examples (not even to gravity in the metric formulation!) because each of them violates one of the assumptions (a)–(c) mentioned in Sect. 1. Hence, one really needs the extended concept outlined in the previous sections to perform the following analysis. As the gauge algebra is closed in the first, second and last example, the formulae of Sect. 6 are in these cases promoted to strict instead of weak equalities, with γ˜ replaced by s˜ and without making reference to a particular gauge invariant action. 7.1. Yang–Mills theories. For simplicity I consider pure Yang–Mills theories (no matter fields). The standard BRST transformations of the Yang–Mills gauge fields Aµ i and the corresponding ghosts C i read sAµ i = ∂µ C i + C k Aµ j fjk i ,

sC i = 21 C k C j fjk i ,

(7.1)

where i labels the elements of the Lie algebra of the gauge group with structure constants fij k . The trivial pairs are in this case given by {U ` } = {∂(µ1 ...µk Aµk+1 ) i : k = 0, 1, . . .},

(7.2)

˜ } = {∂µ1 ...µk+1 C + . . . : k = 0, 1, . . .}. {V } = {sU

(7.3)

`

`

i

Hence, in the new set of local jet coordinates the V ’s replace one by one all the derivatives of the ghosts. The undifferentiated ghosts themselves are replaced by the generalized connections (7.4) C˜ i = C i + Ai , Ai = dxµ Aµ i . The complete set of generalized connections contains in addition the differentials, {C˜ N } = {dxµ , C˜ i }.

(7.5)

The vµ m are thus in this case just the entries of the constant unit matrix, vµ m = δµm . Hence, indices m and µ need not be distinguished in this case. The 1-operations corresponding to (7.5) are {1N } = {Dµ , δi },

Dµ = ∂µ − Aµ i δi ,

(7.6)

where the δi are the Lie algebra elements. Equation (6.5) reproduces for N = i the “Russian formula" (1.1) in the form s˜C˜ i = 21 C˜ k C˜ j fjk i + 21 dxµ dxν Fµν i . The algebra (6.10) of the 1’s reads in this case

(7.7)

474

F. Brandt

[Dµ , Dν ] = −Fµν i δi ,

[Dµ , δi ] = 0,

[δi , δj ] = fij k δk

(7.8)

with the standard Yang–Mills field strengths arising from (6.28) and transforming under the δi according to the adjoint representation, Fµν i = ∂µ Aν i − ∂ν Aµ i + Aµ j Aν k fjk i ,

δi Fµν j = −fik j Fµν k .

(7.9)

A complete set of tensor fields is in this case given by the xµ and a choice of algebraically independent components of the field strengths and their covariant derivatives, {T ı } ⊂ {xµ , Dµ1 . . . Dµk Fνρ i : k = 0, 1, . . .}.

(7.10)

Remark. Notice that the above choice of variables is very similar to the one in [24]. In fact the tensor fields coincide in both approaches (except that here also the xµ are counted among them). The difference is that the present approach singles out the C˜ i and dxµ as generalized connections, rather than just the C i . Note that, as a direct consequence of the presence of d in s, ˜ one cannot simply choose C˜ i = C i here because that choice would not fulfill requirement (5.4). 7.2. Gravity in the metric formulation. I consider now pure gravity with the metric fields gµν = gνµ as the only classical fields and diffeomorphisms as the only gauge symmetries. The BRST transformations of the metric and the diffeomorphism ghosts ξ µ read sgµν = ξ ρ ∂ρ gµν + (∂µ ξ ρ ) gρν + (∂ν ξ ρ ) gµρ , sξ µ = ξ ν ∂ν ξ µ . (7.11) The trivial pairs can be chosen as {U ` } = {xµ , ∂(µ1 ...µk 0µk+1 µk+2 ) ν : k = 0, 1, . . .},

(7.12)

˜ } = {dx , ∂µ1 ...µk+2 ξ + . . . : k = 0, 1, . . .}, {V } = {sU

(7.13)

0µν ρ = 21 g ρσ (∂µ gνσ + ∂ν gµσ − ∂σ gµν ).

(7.14)

`

`

µ

ν

where Note that the V ’s replace all derivatives of the ghosts of order > 1. The undifferentiated ghosts and their first order derivatives give rise to the generalized connections {C˜ N } = {ξ˜µ , C˜ µ ν },

ξ˜µ = ξ µ + dxµ ,

C˜ µ ν = ∂µ ξ ν + 0µρ ν ξ˜ρ .

(7.15)

The generalized Russian formulae (6.5) read in this case s˜ξ˜µ = ξ˜ν C˜ ν µ ,

s˜C˜ µ ν = C˜ µ ρ C˜ ρ ν + 21 ξ˜ρ ξ˜σ Rρσµ ν ,

(7.16)

where Rµνρ σ is the standard Riemann tensor constructed of the 0’s. The vµ m are, as in the case of the Yang–Mills theory, just the entries of the constant unit matrix. Hence, indices µ and m are not distinguished. One gets {1N } = {Dµ , 1µ ν },

Dµ = ∂µ − 0µρ ν 1ν ρ ,

(7.17)

where the 1µ ν generate GL(D)-transformations of world indices according to 1µ ν Tρ = δρν Tµ , The algebra (6.10) reads now

1µ ν T ρ = −δµρ T ν .

(7.18)


475

[Dµ , Dν ] = −Rµνρ σ 1σ ρ , [1µ ν , Dρ ] = δρν Dµ , [1µ ρ , 1ν σ ] = δνρ 1µ σ − δµσ 1ν ρ .

(7.19)

The set of tensor fields contains the gµν , µ ≥ ν and a maximal set of algebraically independent components of Rµνρ σ and their covariant derivatives, {T ı } ⊂ {gµν , Dµ1 . . . Dµk Rλνρ σ : k = 0, 1, . . .}.

(7.20)

Remark. Recall that tensor fields are characterized by the transformation law (6.17). One might wonder whether this transformation law agrees in this case with the standard transformation law for tensor fields under diffeomorphisms which is in BRST language the Lie derivative along the diffeomorphism ghosts. The answer is affirmative because (6.17) yields in this case, e.g. for a tensor field Tµ , γ Tµ = ξ ν Dν Tµ + (∂µ ξ ν + 0µρ ν ξ ρ )Tν = ξ ν ∂ν Tµ + (∂µ ξ ν )Tν .

(7.21)

7.3. D=4, N=1 minimal supergravity. The classical field content of the D=4, N=1 minimal pure supergravity theory without auxiliary fields is given by the vielbein fields and the gravitinos, denoted by eµ a and ψµ α , ψ¯ µ α˙ respectively (α, α˙ denote indices of two-component complex Weyl spinors with conventions as in [28]). The gauge symmetries are diffeomorphism invariance, local supersymmetry and local Lorentz invariance. The corresponding ghosts are denoted by ξ µ , ξ α , ξ¯α˙ and C ab = −C ba respectively. For simplicity the analysis is restricted to the action [34] Z (7.22) Scl = d4 x 21 eR − 2µνρσ (ψµ σν ∇ρ ψ¯ σ − ψ¯ µ σ¯ ν ∇ρ ψσ ) with e = det(eµ a ), 0123 = 1 and R = Rab ba ,

Rab cd = 2E[a µ Eb] ν (∂µ ων cd + ωµ ce ωνe d ),

∇µ ψν = ∂µ ψν − ωµ σab β ψν , ˙ ∇µ ψ¯ ν α˙ = ∂µ ψ¯ ν α˙ + 21 ωµ ab σ¯ ab α˙ β˙ ψ¯ ν β , α

α

ab

1 2

α

β

(7.23) (7.24) (7.25)

where the Ea µ are the entries of the inverse vielbein and ωµ ab denotes the gravitino dependent spin connection ωµ ab = E aν E bρ (ω[µν]ρ − ω[νρ]µ + ω[ρµ]ν ), ω[µν]ρ = eρa ∂[µ eν] a − iψµ σρ ψ¯ ν + iψν σρ ψ¯ µ .

(7.26)

(Lorentz indices a, b, . . . are lowered and raised with the Minkowski metric ηab = diag(1, −1, −1, −1).) The γ-transformations read in this case γeµ a = ξ ν ∂ν eµ a + (∂µ ξ ν )eν a + Cb a eµ b + 2iσ a αα˙ (ξ α ψ¯ µ α˙ − ξ¯α˙ ψµ α ) , (7.27) γψµ α = ∇µ ξ α + ξ ν ∂ν ψµ α + (∂µ ξ ν )ψν α + 21 C ab σab β α ψµ β , γξ µ = ξ ν ∂ν ξ µ + 2iξ α σ µ αα˙ ξ¯α˙ , γξ = ξ ∂µ ξ + C σabβ ξ − 2iξ σ α

µ

α

1 2

ab

α β

γC ab = ξ µ ∂µ C ab − C ac Cc b −

β µ

¯β˙

β β˙ ξ ψµ ˙ 2iξ β σ µ β β˙ ξ¯β ωµ ab

α

,

(7.28) (7.29) (7.30) (7.31)

476

F. Brandt

¯ where (and analogous expressions for γ ψ¯ µ and γ ξ), ∇µ ξ α = ∂µ ξ α − 21 ωµ ab σab β α ξ β . The gauge algebra is open (it closes modulo the equations of motion for the gravitinos). Hence γ is nilpotent only on-shell and does not agree with s on all the fields. One can choose the U ’s in this case as {U ` } = {xµ , ∂(µ1 ...µk eµk+1 ) a , ∂(µ1 ...µk ωµk+1 ) cd , ∂(µ1 ...µk ψµk+1 ) α , ∂(µ1 ...µk ψ¯ µk+1 ) α˙ : c > d; k = 0, 1, . . .}.

(7.32)

Note that the ωµ ab = ωµ [ab] correspond one by one to the antisymmetrized first order derivatives ∂[µ eν] a of the vielbein fields due to (7.26). Hence, all the U ` are indeed algebraically independent new local jet coordinates. The corresponding V ` replace one ˜ µ a = ∂µ ξ a +. . . (ξ a = eµ a ξ µ ), by one the dxµ and all the derivatives of the ghosts due to γe ab α α ˙ ab α ¯ ˜ µ = ∂µ ξ +. . . and γ˜ ψµ = −∂µ ξ¯α˙ +. . . . The undifferentiated γω ˜ µ = ∂µ C +. . . , γψ ghosts give rise to the generalized connections {C˜ N } = {ξã , ξ˜α , ξ˜α˙ , C˜ ab : a > b}, ξã = ξ˜µ eµ a , C˜ ab = C ab + ξ˜µ ωµ ab , ξ˜α = ξ α + ξ˜µ ψµ α , ξ˜α˙ = ξ¯α˙ − ξ˜µ ψ¯ µ α˙

(7.33)

with ξ˜µ as in (7.15). The corresponding 1’s are denoted by {1N } = {Da , Dα , D¯ α˙ , lab : a > b}, Da = Ea µ (∂µ − 21 ωµ ab lab − ψµ α Dα + ψ¯ µ α˙ D¯ α˙ ),

(7.34)

where lab = −lba denote the elements of the Lorentz algebra, and Dα and D¯ α˙ are supersymmetry transformations represented on the tensor fields given below (these tensor fields are ordinary fields, not superfields; accordingly Dα and D¯ α˙ are not “superspace operators’). The Grassmann parities of the 1’s are εa = ε[ab] = 0 and εα = εα˙ = 1 (the supersymmetry ghosts commute). (7.34) indicates that in this case the vielbein fields are identified with the vµ m , i.e. the indices m coincide here with Lorentz vector indices, v µ m ≡ eµ a ,

Vm µ ≡ Ea µ .

(7.35)

Using the shorthand notation {ξÃ } = {ξã , ξ˜α , ξ˜α˙ },

{DA } = {Da , Dα , D¯ α˙ }

the algebra of the DA reads [DA , DB ] ≈ −TAB C DC − 21 FAB cd lcd ,

(7.36)

where the nonvanishing TAB C and FAB cd are a = 2iσ a αα˙ , Tαα˙ a = Tαα ˙ α µ Tab = Ea Eb ν (∇µ ψν α − ∇ν ψµ α ), cd = i (T cdα σb αα˙ − 2σ [c αα˙ T d] b α ), Fαb ˙ Fab cd = Rab cd + 2(ψ[a α Fb]α cd − ψ¯ [a α˙ Fb]α˙ cd )

(7.37) (7.38) (7.39) (7.40)


477

and analogous expressions for Tab α˙ and Fαb cd . The remaining commutators of the 1’s are (7.41) [lab , DA ] = −g[ab] A B DB , [lab , lcd ] = 2ηa[c ld]b − 2ηb[c ld]a , where d , g[ab] c d = 2ηc[a δb]

g[ab] α β = σab α β ,

˙

˙

g[ab] α˙ β = −σ¯ ab β α˙ .

(7.42)

Accordingly the generalized Russian formulae (6.5) read in this case γ˜ ξÃ = 21 C˜ ab g[ab] B A ξ˜B − 21 (−)εB ξ˜B ξ˜C TCB A , γ˜ C˜ ab = −C˜ ac C˜ c b − 21 (−)εD ξ˜D ξ˜C FCD ab .

(7.43) (7.44)

Note that these identities encode all the Eqs. (7.26)–(7.31), (7.38) and (7.40). The set of independent tensor fields consists in this case of a subset of Fab cd , Tab α , Tab α˙ and their covariant derivatives, {T ı } ⊂ {Da1 . . . Dak Fbc de , Da1 . . . Dak Tbc α , Da1 . . . Dak Tbc α˙ : k = 0, 1, . . .}.

(7.45)

Remark. Notice that the formalism provides “super-covariant” tensor fields and, in particular, “super-covariant” derivatives (7.34) containing the gravitino and the supersymmetry transformations. Note also that these tensor fields do not carry “world indices" µ, ν, . . . , in contrast to the example discussed in the previous subsection. The reason is that the undifferentiated vielbein fields count among the U ’s. Indeed, the corresponding V ’s replace all the first order derivatives of the diffeomorphism ghosts ξ µ and therefore the BRST transformation of a tensor field must not involve ∂ν ξ µ . Hence, tensor fields are indeed “world scalars” in this case. One could of course instead count the undifferentiated vielbein fields also among the tensor fields and promote the ∂ν ξ µ to generalized connections. Then tensor fields could also carry world indices and one would get additional 1’s generating GL(4) transformations of world indices, as in the metric formulation of gravity discussed in the previous subsection. However, such a choice would not correspond to a maximal set of trivial pairs and would thus complicate unnecessarily the analysis of the BRST cohomology! 7.4. Two dimensional sigma models. Consider two dimensional sigma models whose set of classical fields consists of scalar fields ϕi and the two dimensional metric fields gµν and whose gauge symmetries are given by two dimensional diffeomorphism and Weyl invariance, with corresponding ghosts ξ µ and C respectively. The BRST transformations of the fields read sgµν = ξ ρ ∂ρ gµν + (∂µ ξ ρ ) gρν + (∂ν ξ ρ ) gµρ + C gµν , sY = ξ ν ∂ν Y

for

Y ∈ {ϕi , ξ µ , C}.

(7.46)

Following closely the lines (but not the notation) of [33] I first introduce new local jet ¯ e, η, η¯ replacing the undifferentiated metric components and diffeocoordinates h, h, morphism ghosts (h, h¯ are “Beltrami variables")8 , g22 g11 √ (7.47) h= √ , h¯ = √ , e = g, g12 + g g12 + g ¯ 2 + dx2 ), η¯ = (ξ 2 + dx2 ) + h(ξ 1 + dx1 ) (7.48) η = (ξ 1 + dx1 ) + h(ξ 8

This change of jet coordinates is not globally well-defined in general.

478

F. Brandt

with g = − det(gµν ) > 0. The U ’s are ¯ ∂ p ∂¯ q e : p, q = 0, 1, . . .}, {U ` } = {xµ , ∂ p ∂¯ q h, ∂ p ∂¯ q h,

(7.49)

where ∂ ≡ ∂1 ,

∂¯ ≡ ∂2 .

(7.50)

Hence, in this case all the metric components and all their derivatives occur in trivial ¯ all their derivatives, and pairs. The corresponding V ’s replace one by one the C, ∂ η, ¯ ∂η, µ the dx . Therefore one gets in this example an infinite set of generalized connections, given by η, η¯ and their remaining derivatives, {C˜ N } = {η p , η¯ p¯ : p, p¯ = −1, 0, 1, . . .}, ¯ η¯ . η p = 1 ∂ p+1 η, η¯ p¯ = 1 ∂¯ p+1 (p+1)! ¯

(p+1)!

(7.51) (7.52)

Equations (7.46)–(7.48) imply sη ˜ = η∂η and s˜η¯ = η¯ ∂¯ η. ¯ Therefore (6.5) reads in this case p+1 1 X (p − 2r)η r η p−r (7.53) sη ˜ p= 2 r=−1

˜ is and an analogous formula for s˜η¯ . The infinite set of 1’s corresponding to the C’s denoted by (7.54) {1N } = {Lp , L¯ p¯ : p, p¯ = −1, 0, 1, . . .}. p¯

Recall that the r.h.s. of (7.53) contains the structure functions occurring in the algebra of the 1’s. In this case all of these functions are constant and the algebra of the L’s and ¯ is isomorphic to two copies of the algebra of regular vector fields (−z p+1 )∂/∂z, L’s [Lp , Lq ] = (p − q)Lp+q ,

[L¯ p¯ , L¯ q¯ ] = (p¯ − q) ¯ L¯ p+ ¯ q¯ ,

[Lp , L¯ p¯ ] = 0.

(7.55)

The set of tensor fields on which this algebra is realized is given by i {T ı } = {Tp, p¯ : p, p¯ = 0, 1, . . .},

i p ¯ p¯ i Tp, p¯ = (L−1 ) (L−1 ) ϕ .

(7.56)

i The explicit form of the Tp, p¯ in terms of the fields and their derivatives was discussed in [33] and will be rederived below for the first few T ’s. The algebraic representation of the ¯ on the tensor fields can be derived from the algebra (7.56) using Lp T i = L’s and L’s 0,0 i i i L¯ p¯ T0,0 = 0 ∀ p, p¯ ≥ 0. The latter follows from the identification sT ˜ 0,0 = C˜ N 1N T0,0 , cf. (6.16). This yields

q εγ }, and let the free path length be defined as τε (x, ω) = inf{t > 0 | x − tω ∈ Zε } . The distribution of values of τε is studied in the limit as ε → 0 for all γ ≥ 1. It is shown n is critical for this problem: in other words, the limiting behavior that the value γc = n−1 of τε depends only on whether γ is larger or smaller than γc . 1. Introduction The Lorentz gas is a model system of Statistical Mechanics consisting of a large number of like point particles moving freely in a domain of the space where spherical obstacles are disposed with some given distribution. Collisions between two (or more) particles are rare events since these particles have diameter 0. Hence, only collisions involving one particle and one obstacle are taken into account. They are described by some adequate reflection law, the exact nature of which will be of no significance in the present work; the most classical example of such reflection law is of course the case of “specular reflection”. The model considered in the present work is the case where the obstacles are periodically distributed; in other words, the centers of the obstacles form a lattice in the space Rn , which, for simplicity, is assumed to be homothetic to Zn . Finally, each particle is assumed to move with speed 1 in the interval of time between two consecutive collisions with the obstacles. It is the purpose of the present work to study some aspects of the large scale dynamics of such a system. Thus, let n ∈ N∗ denote the space dimension and let

492

J. Bourgain, F. Golse, B. Wennberg

Zε = {x ∈ Rn | dist(x, εZn ) > εγ } ,

(1.1)

for all 0 < ε < 21 and γ ≥ 1. The “free path length” (or equivalently “exit time”, since the particles move with speed 1 between two consecutive collisions with the obstacles) is defined as follows, for all x ∈ Z ε and ω ∈ S n−1 : τε (x, ω) = inf{t > 0 | x + tω ∈ ∂Zε } .

εγ

(1.2)

ε

Fig. 1. The billiard table

Clearly τε is a Borelian function for all 0 < ε < 21 and all γ ≥ 1. The present paper studies the distribution of values of τε as ε → 0, which is one of the main features of the evolution of the Lorentz gas model associated to the domain Zε as explained above. However, this problem is well posed only after a phase space equipped with a Borelian probability measure is defined. The most natural choice in this respect is the following one. Let Yε = Zε /εZn : topologically Yε is a punctured torus; let Qε = dxdω−meas (Yε × S n−1 ). Our choice of a phase space is Yε × S n−1 with the Borelian probability measure µε defined by 1 dxdω . (1.3) dµε (x, ω) = Qε Clearly τε (x + εk, ω) = τε (x, ω) for all (x, ω) ∈ Z ε × S n−1 and all k ∈ Zn so that τε defines a Borelian function on Yε × S n−1 . It is then natural to study the distribution of τε with respect to the probability measure µε . We recall its definition: Definition. The distribution φε of τε with respect to µε is the push-forward of the measure µε under τε . In other words, φε is the unique Borelian probability measure on [0, +∞[ such that, for all 0 < a < b < +∞, φε (]a, b[) = µε ({(x, ω) ∈ Yε × S n−1 | a < τε < b}) .

(1.4)

Distribution of Free Path Lengths for Periodic Lorentz Gas

493

The main results in this paper bear on the limiting behavior of φε as ε → 0 and on how it depends on the parameter ε. These results are presented without proof in the next section (Sect. 2). The proofs are relegated to the subsequent sections (Sects. 3 to 5). We shall conclude this section with a very elementary observation. In the case where particles impinging on the obstacles are specularly reflected, it is natural to consider the map which, to the position and velocity of any particle leaving the boundary of some obstacle associates its position and velocity immediately after the next collision with an obstacle. It is defined by (x, ω) 7→ (x0 = x + τε (x, ω)ω; ω 0 = ω − 2ω · n(x0 ) n(x0 )),

(1.5)

where n(x) denotes the inward unit normal at point x ∈ ∂Zε . Let then Σε+ = {(x, ω) ∈ ∂Yε × S n−1 | ω · n(x) > 0} .

(1.6)

Since any two obstacles in Zε are congruent modulo εZn , the map (1.5) defines a map B : Σε+ → Σε+ (sometimes called the billiard map: see for example [Ch1-2]). Let 0ε = ω · nx dS(x)dω − meas (Σε+ ); a Borelian probability measure νε is defined on Σε+ by 1 ω · nx dS(x)dω . (1.7) dνε (x, ω) = 0ε The probability measure νε is invariant under B, and hence a second choice of a phase space for the Lorentz gas is Σε+ equipped with the probability measure νε , the dynamics being given by the iterates of the billiard map B. This is usually the phase space and dynamics studied in most of the literature devoted to billiards (see [Ch1-2] and the references therein). The first phase space (Yε × S n−1 , µε ) is the suspension of (Σε+ , νε ) under the function τε and the Lorentz gas flow mod. εZn (i.e. on Yε × S n−1 ) is the suspension flow of the map B under the function τε . In [Ch1-2], the following quantity, called the “geometric mean free path” in [DDG2], is considered: Z τε (x, ω)dνε (x, ω) . (1.8) lε = Σε+

As explained in [Ch2] (Sect. 2), it is a natural notion of mean free path because it is the time average of free paths lengths along typical trajectories whenever the map B is ergodic. There is an explicit formula for it, (see [Ch1] Sect. 3.2 or [DDG2] for a quick proof): Qε 1 εn−γ(n−1) + O(εγ ) . = (1.9) lε = 0ε |B n−1 | This formula clearly points at the special value γc =

n n−1

(1.10)

as being critical. Indeed, as ε → 0, • if γ > γc , lε → +∞ as ε → 0, which seems to indicate a purely ballistic behavior for the Lorentz gas; • if 1 ≤ γ < γc , lε → 0 as ε → 0, corresponding to a hydrodynamic limit; • if γ = γc , lε → |B n−1 |−1 > 0 as ε → 0, corresponding to the so-called “BoltzmannGrad limit”.

494


However, it does not seem possible to extract any information about the distribution of free path lengths φε defined in (1.4), which is our main object of study here, from the explicit formula (1.9). This simply reflects the fact that the billiard under consideration in this paper does not have the “finite horizon property” (the function τε is not uniformly bounded on Σε ) and hence the first phase space (Yε × S n−1 , µε ) contains more information than the second phase space (Σε+ , νε ). Let us close this introductory section with some references. In the case γ = 1, Bunimovich, Sinai and later Chernov ([BS1-2, BSC1-2]) established the diffusion limit for the Lorentz gas with finite horizon. If the specular reflection condition is replaced by an accommodation reflection condition, a simpler proof, based on PDE methods, leads to a similar diffusion limit: see [BDG]. The Boltzmann-Grad limit (γ = γc ) has been studied by many authors, in the case where the distribution of obstacles is not periodic as considered here but random: see [Gal, Sp, BBS]. These papers prove that the limiting number density f of gas particles satisfies a linear transport equation with absorption and scattering of the form Z k(ω, ω 0 )f (t, x, ω 0 )dω 0 . (1.11) ∂t f (t, x, ω) + ω · ∇x f (t, x, ω) + σf (t, x, ω) = σ S n−1

The methods developed in these papers do not apply to the periodic case under consideration in this paper. In fact the limiting behavior of the periodic Lorentz gas in the critical scaling γ = γc is qualitatively different from the one described by (1.11): see Sect. 2, Remark 2.

2. Main Results With the definitions and notations of Sect. 1, we first state the main theorem in this paper: Theorem A. 1) If γ > γc , φε → 0 vaguely as ε → 0; 2) If 1 ≤ γ < γc , φε → δ0 weakly as ε → 0; 3) If γ = γc , any vague limit point φ of the family (φε ) is a probability measure and satisfies lim sup tφ([t, +∞[) < +∞ ; t→+∞

4) If γ = γc and n = 2, any vague limit point φ of the family (φε ) satisfies lim inf tφ([t, +∞[) > 0 . t→+∞

We recall the terminology for the various topologies on the space of Borelian probability measures on R+ (see [Bil]). The weak topology is the topology defined by the family of seminorms µ 7→ |hµ, f i| for all bounded continuous f ’s, while the vague topology is the one defined by the subfamily of these same seminorms corresponding to continuous f ’s with compact support. Point 1) in Theorem A was proved in [DDG2] (see [G, DDG1] for an alternative proof). Point 2] was essentially proved in [DDG1] (although stated in a different manner there; see [DDG2]) when n = 2. It then remains to prove point 2) for all n > 2 and points 3) and 4).


495

Remark 1. When n > 2 and γ = γc , we can prove that lim inf tn−1 φ([t, +∞[) > 0 , ε→0

but we don’t know whether this or point 3) in Theorem A is optimal: see [GW]. Remark 2. Point 4) in Theorem A or Remark 1 show the difference between the limiting dynamics of the Lorentz gas in the periodic and the random cases. In the random cases studied in [Gal, Sp and BBS], the limiting number density is proved to satisfy an equation of the type (1.11); in particular, the free path length is exponentially distributed (σ being the parameter in the exponential law). In the periodic case, the distribution of free path lengths has only algebraic decay, as shown by Theorem A 4) or Remark 1. Theorem A depends essentially on the following technical estimates. Before stating them, we need some notations. Let r ∈]0, 1/2[ and consider Z = {x ∈ Rn | dist (x, Zn ) > r} ; Q = dxdω − meas (Y × S n−1 ) ;

Y = Z/Zn ;

dµ(x, ω) =

τ (x, ω, r) = inf{t > 0 | x + tω ∈ ∂Z} ;

1 dxdω ; Q

T (ω, r) = sup τ (x, ω, r) . x∈Y

(2.1) (2.2) (2.3)

Clearly τ is Zn -periodic and can be considered either as defined for x ∈ Z or for x ∈ Y . Remark 3. T (ω, r) is the quantity referred to as the “ergodization time” in [D1]. With these definitions and notations, we can state Theorem B. For all n ∈ N∗ there exists C(n) > 0 such that dω − meas ({ω ∈ S n−1 | T (ω, r) > t}) ≤

C(n) . rn−1 t

(2.4)

This estimate is sharp in the case n = 2: indeed Theorem C. Let n = 2. There exists C 0 > 0 such that, for all t > r1 . µ({(x, ω) ∈ Y × S 1 | τ (x, ω, r) > t}) ≥

C0 . rt

(2.5)

Remark 4. Theorem B shows that τ (·, ·, r) ∈ L∞ (Y ; L1,∞ (S n−1 )). Theorem C shows that, at least if n = 2, τ (·, ·, r) ∈ / L1 (Y × S n−1 ). Hence the mean free path in the sense of the first phase space considered in Sect. 1 (that is, (Y × S n−1 , µ)), defined as Z Y ×S n−1

τ (x, ω, r)dµ(x, ω) = +∞

does not contain any information on the Lorentz gas, being infinite for all r ∈]0, 1/2[.

496


As an aside result, we improve an upper bound for T due to H.S. Dumas [D1]. We first recall the notations for diophantine vectors: for all K > 0, s ∈ R, let D(s, K) = {ω ∈ S n−1 | ∀k ∈ Zn \ {0} , |ω · k| ≥ K|k|−s } . We recall that

∀K > 0 , ∀s < n − 1 ,

D(s, K) = ∅ ,

(2.6) (2.7)

(which is a variant of a result due to Dirichlet, see [Ca] chapter I, Theorem VI), and that ∀s > n − 1 ,

dω − meas (D(s, K)c ) = O(K) .

(2.8)

Theorem D. For all n ∈ N∗ and s > n − 1, there exists C(n, s) > 0 such that, for all K > 0 and all ω ∈ D(s, K), T (ω, r) ≤

C 00 (n, s) . Krs

We refer to [D2], [ChGa] for an application of this type of estimate. 3. Proof of Theorem B Formulation of the ergodization time problem. For all x ∈ R, let kxk = inf k∈Z |x − k|. Let (3.1) F = {ω ∈ S n−1 | ∀1 ≤ i ≤ n , ωn ≥ |ωi |} ; later, we shall need the following mapping: A : F → [−1, 1]n−1 ,

ω 7→ A(ω) =

ωi ωn

.

(3.2)

1≤i≤n−1

Let ∈ [−1, 1]n−1 and R ∈]0, 1/2[; define N (, R) as the smallest positive integer N such that ∀z ∈ [0, 1]n−1 ,

min

max kzi − li k ≤ R .

l∈Z , |l|≤N 1≤i≤n−1

Clearly, if ∈ A(F ) and if N ≥ N (, R), one has

ωi ωi n−0

, min max −l xi − x n ≤ R. ∀x ∈ [0, 1] ωn ωn l∈Z , |l|≤N 1≤i≤n−1 √ If ω ∈ F , then ωn ≥ √1n . Therefore, if T ≥ n(N + 1), ∀x ∈ [0, 1]n−0 , by specializing t to be of the form t = ∀ω ∈ F ,

min max kxi − tωi k ≤ R ,

|t|≤T 1≤i≤n

(3.3)

(3.4)

(3.5)

xn +l ωn .

This argument shows that √ r T (ω, r) ≤ 2 nN A(ω), √ . n

(3.6)

on Rn−1 supported in [−1, 1]n−1 , positive Let φ be a nonnegative C ∞ function P n−1 and let φR (z) = k∈Zn−1 φ z+k on ] − 1, 1[ R for all R ∈]0, 1/2[. Let (σl )l∈Z be a


497

nonnegative doubly infinite sequence such that σl > 0 if and only if |l| < N . Then N ≥ N (, R) if and only if X σl φR (z − l) > 0 . (3.7) ∀z ∈ [0, 1]n−1 , SN (z) = l∈Z

For Sn (z) = 0 if and only if φR (z − l) = 0 for all l ∈ Z such that |l| ≤ N ; obviously φR (z − l) = 0 if and only if max1≤i≤n−1 kzi − li k > R. It is then convenient to express SN in terms of the Fourier coefficients of φR : X X σl e−i2πlhξ,i , (3.8) φbR (ξ)ei2πhξ,zi ∀z ∈ [0, 1]n−1 , SN (z) = |l|≤N

ξ∈Zn−1

In particular, if one takes σl = (1 − |l| N ) for |l| ≤ N and σl = 0 if |l| > N , the inner sum in (3.8) is a Fejer kernel, that is to say X (3.9) φbR (ξ)ei2πhξ,zi FN (hξ, i) , SN (z) = ξ∈Zn−1

with

X

FN (z) =

σl e−i2πlz =

|l|≤N

1 sin2 πN z . N sin2 πz

(3.10)

Suppose now that N ≤ N (, R); then there exists z0 ∈ [0, 1]n−1 such that SN (z0 ) = 0. Hence X φbR (ξ)ei2πhξ,z0 i FN (hξ, i) , φbR (0)FN (0) = − ξ∈Zn−1 \{0}

which implies N Rn−1 ≤

X 1 |φbR (ξ)|FN (hξ, i) . ˆ φ(0) ξ∈Zn−1 \{0}

(3.11)

Using the Fejer kernel as above is reminiscent of [M] (chapter 5, Theorem 9). The weak L1 type estimate. We now come to the main result of this section, Theorem B’ below. It is a slight generalization of Theorem B to the case where the probability measure on S n−1 is not the normalized Lebesgue measure. Let m be a probability measure on S n−1 . We assume the existence of 0 < c ≤ 1 and K > 0 such that (H)

m∗ (r) = sup m({α ∈ S n−1 | |hα, ei| ≤ r}) ≤ Krc . e∈S n−1

Obviously, the Lebesgue measure on S n−1 satisfies (H) with c = 1. Theorem B’. B’. Let m be a probability measure on S n−1 satisfying the assumption (H) above. Then there exists a constant C(m, n) > 0 (depending only on the dimension n and the measure m) such that m({ω ∈ S n−1 | T (ω, r) > t}) ≤

C(m, n) . tc rn−c

498


Proof. Let us first restrict our attention to ω ∈ F . This can be done without loss of generality: indeed, S n−1 can be covered by the images of F under finitely many elements of the orthogonal group On (R); moreover, if a probability measure satisfies (H), its pushforward under an element of the orthogonal group still satisfies (H). If ω ∈ F and N ≤ N (A(ω), R), then = A(ω) must satisfy (3.11)). Hence, applying Chebyshev’s inequality shows that √ √ m({ω ∈ F | T (ω, nR) ≥ 2 nN }) ≤     X 1 m ω ∈ F | |φbR (ξ)|FN (hξ, A(ω)i) ≥ N Rn−1  ˆ   φ(0) ξ∈Zn−1 \{0} Z X 1 b |φR (ξ)| FN (hξ, A(ω)i)dm(ω) . (3.12) ≤ ˆ N Rn−1 φ(0) F ξ∈Zn−1 \{0} This shows that ≤

1 ˆ N Rn−1 φ(0)

√ √ m({ω ∈ F | T (ω, nR) ≥ 2 nN }) Z X |φbR (ζ)| · sup FN (z)dmξ (z) , ζ∈Zn−1 \{0}

ξ∈Zn−1 \{0}

(3.13)

T1

where, for any measurable subset U of T1 , mξ (U ) = m({ω ∈ F | hA(ω), ξi ∈ U mod. Z}) .

(3.14)

In other words, mξ is the push-forward of m under the map F → T1 defined by ω 7→ hA(ω), ξi mod. Z =

n−1 1 X αi ξi mod. Z . αn i=1

We shall appeal to the next lemma to estimate the integrals appearing in the right-hand side of (3.13). Lemma 1. Let m be a probability measure on S n−1 satisfying (H), and let mξ be associated to m as in (3.14). Then there exists a positive constant C0 (m, n) depending only on the dimension n and the measure m such that Z FN (z)dmξ (z) ≤ C0 (m, n)N 1−c |ξ|1−c . (3.15) 0≤ T1

We defer the proof of Lemma 1 to after that of Theorem B’. It follows from (3.15) and the estimate (3.12) that m({ω ∈ F | T (ω,

√

√ C0 (m, n) 1 nR) ≥ 2 nN }) ≤ c n−1 N R φb(0)

X

|φbR (ξ)||ξ|1−c .

ξ∈Zn−1 \{0}

(3.16) But then, the function φ being smooth, one has, for all l > 0, the existence of Kl > 0 such that (3.17) |φbR (ξ)| ≤ Kl Rn−1 (1 + |Rξ|)−l . Hence, choosing l > n − c and observing that


Rn−1

X

|Rξ|1−c (1 + |Rξ|)−m ∼

Z Rn−1

ξ∈Zn−1

499

|x|1−c (1 + |x|)−m dx < +∞,

(3.18)

we obtain that m({ω ∈ F | T (ω,

√

√ C 0 (m, n) nR) ≥ 2 nN }) ≤ c n−c N R

which completes the proof of Theorem B’.

Proof of Lemma 1. We proceed as in [GLPS] Z Z Z C1 1−δ dmξ (z) FN (z)dmξ (z) ≤ N dmξ (z) + I= N δ z 2 (1 − z)2 T1 kzk≤δ

(3.19)

with C1 = supz∈[0,1] z 2 (1 − z)2 / sin2 πz. Then, the definition (3.14) and the assumption (H) on the measure m show that mξ ({z ∈ T1 | kzk ≤ δ}) ≤

X √ |k|≤ n|ξ|+1

X

≤

√

K

n−1 X

ωi ξi − kωn | ≤ ωn δ})

i=1

!c

δωn

p

|k|≤ n|ξ|+1

m({ω ∈ F | |

|ξ|2 + k 2

≤ K 0 δ c |ξ|1−c ,

(3.20)

for some K 0 > 0. Hence, I ≤ K20 N δ c |ξ|1−c +

2C1 N

Z

1−δ δ

1 1 + 2 z (1 − z)2

dmξ (z) .

(3.21)

Then, integrating by parts and using (H) leads to Z

1 δ

Z z Z z 1 Z 1 dmξ (z) 2 1 = 2 dmξ (t) + dmξ (t) dz ≤ C2 |ξ|1−c δ c−2 . (3.22) 3 z2 z 0 δ z 0 δ

Proceeding in the same manner with the other integral in the right hand side of (3.21) leads to (3.23) I ≤ K20 N δ c |ξ|1−c + C3 N −1 δ c−2 |ξ|1−c . Optimizing in δ leads to the choice of δ = 1/N and hence, I ≤ C0 (m, n)N 1−c |ξ|1−c as announced.

(3.24)

The following bound for averages of sufficiently small powers of the ergodization time follows from Theorem B’ by using a classical interpolation argument. Corollary B”. Under the assumptions of Theorem B’, one has, for all 0 < β < 1 Z 2C(m, n)β T (ω, r)cβ dm(ω) ≤ . (1 − β)rβ(n−c) S n−1

500


4. Proof of Theorem C In this section, only the case of n = 2 is considered. Let r ∈]0, 1/2[; the notations Z, Y , µ and τ are as in (2.1)-(2.3). A unit vector ω ∈ R2 will be called irrational if and only if ω1 /ω2 ∈ R \ Q. Definition. An open strip S of R2 of width l is a subset of R2 which can be mapped onto R×]0, l[ with l > 0 by a displacement D (i.e. a rotation composed with a translation). The middle third of S is the open strip of R2 which the same displacement D maps onto R×] 13 l, 23 l[. The boundary ∂S consists of two parallel straight lines whose direction is determined by a unit vector V of R2 ; ±V will be called the direction of the strip S. Channels. Definition. A channel in Z is an open strip included in Z of maximal width. The idea of considering channels in the context of the periodic Lorentz gas seems to be due to Bleher [Bl] (who used instead the term “corridor” ) — see also [Da]. It is well-known that, if ω ∈ S 1 is irrational, for all x ∈ R2 , the set x + Rω + Z2 is dense in R2 . Hence a channel in Z must have a rational direction. For, if C is a channel in Z with direction ω, any point x ∈ C must satisfy the condition x + Rω ⊂ Z, implying that dist (Z2 , x + Rω) > r, which is obviously not satisfied if x + Rω + Z2 is dense in R2 . However any rational unit vector is not necessarily a direction of a channel in Z as shown by the next lemma. Lemma 2. Let (p, q) ∈ Z2 \ {0} with p and q coprime, and let ω0 = √

1 (p, q). p2 +q 2

A

necessary and sufficient condition for a channel of direction ω0 to exist is that p

p2 + q 2
> 1, 0 < r < 1 and t > 1/r, dxdω-meas ({(x, ω) ∈ Z × S 1 | |x|
> 1 centered at the origin, with one side parallel to the direction ω0 , and define ! [ 0 E (ω0 ) + k ∩ (Q × S 1 ) , (4.9) E(ω0 ) = k∈Zn

(that is, the union is taken over all such translates). Let N (A, ω0 , r) be the number of channels of direction ω0 intersecting with the square Q; since p N (A, ω0 , r) ≥ 41 A p2 + q 2 , (4.10) any set E(ω0 ) corresponding to ω0 ∈ Ar satisfies dxdω − meas (E(ω0 )) ' N (A, ω0 , r) · A · 13 W (ω0 , r) · θ0 ≥ A2 m(p, q, r) with

p 1 m(p, q, r) = (1 − 2r p2 + q 2 ) 18t

(4.11)

! 1

p − 2r p2 + q 2

,

(4.12)

according to the inequalities (4.6) and (4.10). The result will now follow by summing over all ω0 ∈ Ar , at least if it can be established that the corresponding sets E 0 (ω0 ) are disjoint. To this end, consider another


rational direction ω1 = √

1 (p0 , q 0 ) p02 +q 02

503

∈ Ar . The angle between ω0 and ω1 is given by

the expression arcsin

!

|qp0 − pq 0 | p p p2 + q 2 p02 + q 02

! 2r

p

≥ arcsin

p2 + q 2

.

(4.13)

Thus, for t > 1/3r, the arc of S 1 centered at ω0 and of length θ0 cannot intersect the arc of S 1 of the same length centered at ω1 , for any rational direction ω1 ∈ Ar different from ω0 . Now, if one varies the direction ω0 in the class Ar , it follows that [

E(ω0 ) ⊂ {(x, ω) ∈ Z × S 1 | |x| 1, Σg , or when G = R2 , K = {e} and g = 1, with 01 being Z2 . Corollary 8. Let σ ∈ H 2 (0g , U (1)) be any multiplier on 0g . Then 1. K0 (Cr∗ (0g , σ)) ∼ = K0 (Cr∗ (0g )) ∼ = K 0 (Σg ) ∼ = Z2 2. K1 (C ∗ (0g , σ)) ∼ = K1 (C ∗ (0g )) ∼ = K 1 (Σg ) ∼ = Z2g . r

r

Proof. In dimension 2 the Chern character is an isomorphism over the integers and therefore we see that K 0 (Σg ) ∼ = H 0 (Σg , Z) ⊕ H 2 (Σg , Z) ∼ = Z2 , and that K 1 (Σg ) ∼ = H 1 (Σg , Z) ∼ = Z2g . By Theorem 10 we have Kj (Cr∗ (0g )) ∼ = K j (Σg ) and

for j = 0, 1,

Kj (Cr∗ (0g , σ)) ∼ = Kj (Σg , δ(Bσ )),

j = 0, 1,

where Bσ = C(Σg , Eσ ). Finally, because Eσ is a locally trivial flat bundle of C ∗ -algebras over Σg , with fibre K (= compact operators), it has a Dixmier-Douady invariant δ(Bσ ) which can be viewed as the obstruction to Bσ being Morita equivalent to C(Σg ). But δ(Bσ ) = δ(σ) ∈ H 3 (Σg , Z) = 0. Therefore Bσ is Morita equivalent to C(Σg ) and we conclude that Kj (Cr∗ (0g , σ)) ∼ = K j (Σg )

j = 0, 1.

Corollary 9. Let G be a connected Lie group and K a maximal compact subgroup such that dim(G/K) = 3. Let 0 be a uniform lattice in G and σ ∈ H 2 (0, U (1)) be any multiplier on 0. If G is K-amenable, then (∗)

Kj (Cr∗ (0, σ)) ∼ = Kj (Cr∗ (0)) ∼ = K j+1 (0\G/K),

for j = 0, 1

(mod 2).

Proof. By Theorem 10, we see that Kj (Cr∗ (0)) ∼ = K j+dim(G/K) (0\G/K),

for j = 0, 1

(mod 2).

By the Packer-Raeburn stabilization trick, Cr∗ (0, σ) is Morita equivalent to K or 0, and because G is K-amenable, K o 0 ⊗ C0 (G/K) is Morita equivalent to Bσ = C(0\G/K, Eσ ), where Eσ is as before, a locally trivial bundle of C ∗ -algebras over 0\G/K with fibre K. Finally, the Dixmier-Douady invariant δ(Bσ ) = δ(σ) ∈ H 3 (0\G/K, Z) ∼ = H 3 (0, Z). Suppose now that 0\G/K is not orientable. Then H 3 (0\G/K, Z) = {0} and therefore δ(Bσ ) = δ(σ) = 0. Hence Bσ is Morita equivalent to C(0\G/K) and we have (∗) in this case.

Quantum Hall Effect on Hyperbolic Plane

659

Suppose next that 0\G/K is orientable. The short exact sequence of coefficient groups √ i

e2π

−1

1 → Z→R −→ U (1) → 1 gives rise to a long exact sequence of cohomology groups (the change of coefficient groups sequence) (∗∗)

e2π

√

−1∗

δ

i

∗ 3 (0, R) → · · · . · · · → H 2 (0, R) −→ H 2 (0, U (1))→H 3 (0, Z)→H

Since 0\G/K is oriented, we see that H 3 (0, Z) ∼ = Z and H 3 (0, R) ∼ = R are both generated by the fundamental orientation class of 0\G/K, [0\G/K], and since i∗ [0\G/K] = [0\G/K], we see that i∗ is injective. Therefore by the exactness of (∗∗) at H 3 (0, Z), one has δ(σ) = 0 for all σ ∈ H 2 (0, U (1)), and so we see that Bσ is Morita equivalent to C(0\G/K), and again we have (∗) in this case. Corollary 10. Let M = K(0, 1) be an Eilenberg–Maclane space which is connected locally-symmetric, compact, 3-dimensional manifold. If σ ∈ H 2 (0, U (1)) is any multiplier on 0, then one has Kj (Cr∗ (0, σ)) ∼ = Kj (Cr∗ (0)) ∼ = K j+1 (M ),

j = 0, 1.

Proof. Since M is locally symmetric, it is of the form 0\G/K, where G is a connected Lie group, K is a maximal compact subgroup such that dim(G/K) = 3 and 0 ⊂ G is a uniform lattice in G. We need to verify that γG = 1. According to Thurston’s list of 3-dimensional geometries or locally homogeneous spaces, one has 1. G = R3 o SO(3), G/K = R3 , γG = 1 since R3 and SO(3) are amenable, and so is their semidirect product. 2. G = SO0 (3, 1), G/K = H3 , γG = 1 by Kasparov’s theorem. 3. G = SO0 (2, 1) o R, G/K = H2 × R, γG = 1 since it’s the semidirect product of K-amenable groups. 4. G = Heis, G/K = Heis, γG = 1, since Heis is nilpotent and hence an amenable group. 5. G = Solv, G/K = Solv, γG = 1, since Solv is a solvable group and hence an amenable group. ^ ^ ^ = 1 since SO 6. G = SO 0 (2, 1) o R, G/K = SO0 (2, 1). Firstly, γSO 0 (2, 1) is the ^ 0 (2,1) semidirect product of the K-amenable groups SO0 (2, 1) and Z. Also γG = 1, since ^ its the semidirect product of the K-amenable groups SO 0 (2, 1) and R. The other two locally homogeneous spaces in Thurston’s list are not locally symmetric. We now apply Corollary 9 to deduce Corollary 10. An interesting question is whether Corollary 10 is true without the locally symmetric assumption on M . We formulate this in terms of a conjecture. Conjecture. Let M = K(0, 1) be a connected, compact, 3-dimensional manifold which is an Eilenberg-Maclane space with fundamental group 0. Then for any multiplier σ ∈ H 2 (0, U (1)) on 0, one has Kj (Cr∗ (0, σ)) ∼ = Kj (Cr∗ (0)) ∼ = K j+1 (M ),

j = 0, 1.

660

A. L. Carey, K. C. Hannabuss, V. Mathai, P. McCann

Remarks. Selected portions of our proof of Corollary 9 go through in the situation described in the conjecture. More precisely, the proof of Corollary 9 shows that the Dixmier-Douady invariant δ(σ) = 0 for all σ ∈ H 2 (0, U (1)) for 0 as in the conjecture. 9. Range of the Trace and the Kadison Constant In this section, we will prove some structural theorems for the twisted group C ∗ -algebras that are relevant to the “Martini” problems described in the introduction. The first of these calculates the range of the canonical trace map on K0 of the twisted group C ∗ -algebras. We use in an essential way the results of the previous section as well as a twisted version of the L2 -index theorem of Atiyah [At], which is due to Gromov [Gr2]. This enables us to deduce information about projections in the twisted group C ∗ -algebras. In the case of no twisting, this follows because the Baum-Connes conjecture is known to be true while these results are also well known for the case of the irrational rotation algebras. However, our approach here is novel, and as we will show elsewhere [Ma], enables a generalization of most of the known results. 9.1. Twisted Kasparov map. Suppose that 0g is a discrete, cocompact subgroup of SO0 (2, 1). That is, 0g is the fundamental group of a Riemann surface Σg of genus g > 1. Then for any σ ∈ H 2 (0g , U (1)), the twisted Kasparov isomorphism, (∗)

µσ : K• (Σg ) → K• (Cr∗ (0g , σ))

is defined as follows. Here K0 (Σg ) denotes the K-homology group of Σg . Since Σg is spin, it is K-oriented and by Poincaré duality, the K groups K j (Σg ) are naturally isomorphic to the corresponding K-homology groups Kj (Σg ) for j = 0, 1. Explicitly, let E → Σg be a vector bundle over Σg defining an element [E] in K 0 (Σg ). Under Poincaré duality, [E] corresponds to the twisted Dirac operator /∂E+ : L2 (Σg , S + ⊗ E) → L2 (Σg , S − ⊗ E), where S ± denote the 21 spinor bundles over Σg . That is, P D : K 0 (Σg ) → K0 (Σg ) [E] → [/∂E+ ] is the Poincaré duality isomorphism. By Corollary 8 of the previous section, there is a canonical isomorphism K• (Cr∗ (0g , σ)) ∼ = K • (Σg ). Both of these maps are assembled to yield the twisted Kasparov map as in (*). We next describe this map more explicitly. Given [/∂E+ ] ∈ K0 (Σg ) as above, the lift eg , the universal cover of Σg , of this operator to H = Σ 2 2 + + − ^ ^ /f ∂ E : L (H, S ⊗ E) → L (H, S ⊗ E)

is a 0g -invariant operator. Consider now the short exact sequence of coefficient groups i

e2π

√

−1

1 → Z→R −→ U (1) → 1, which gives rise to a long exact sequence of cohomology groups (the change of coefficient groups sequence)


(∗∗)

i∗

661 e2π

√

−1

δ

· · · → H (0g , Z)→H (0g , R) −→ ∗ H 2 (0g , U (1))→0. 2

2

Therefore for any multiplier σ ∈ H 2 (0g , U (1)) of 0g , there is a 2-form ω on Σg such √ 2π −1 that e ∗ ([ω]) = σ. Of course the choice of ω is not unique, but this will not affect the results that we are concerned with. Let ω e denote the lift of ω to the universal cover H. Since the hyperbolic plane H is contractible, it follows that ω e = dη, where η is a 1-form on H which is not in general 0g invariant. Now let ∇ = d − iη denote a connection on the trivial complex line bundle on H. Note that the curvature of ∇ is ∇2 = iω. Consider now the operator 2 2 + + − ^ ^ /f ∂ E ⊗ ∇ : L (H, S ⊗ E) → L (H, S ⊗ E).

It does not commute with the 0g action, but it does commute with the projective action of 0g which is defined by the multiplier σ, and by a mild generalization of the index theorem of [CM], it has a 0g -L2 -index, + ind0g (/ ∂f E ⊗ ∇) ∈ K0 (C(0g , σ) ⊗ R),

where R denotes the algebra of smoothing operators. Then observe that the twisted Kasparov map is merely ∗ + ∂E + ]) = j∗ (ind0g (/ ∂f µσ ([/ E ⊗ ∇)) ∈ K0 (C (0g , σ)),

where j : C(0g , σ) ⊗ R → Cr∗ (0g , σ) ⊗ K is the natural inclusion map, and j∗ : K0 (C(0g , σ) ⊗ R) → K0 (Cr∗ (0g , σ)) is the induced map on K0 . The canonical trace on Cr∗ (0g , σ) induces a linear map [tr] : K0 (Cr∗ (0g , σ)) → R which is called the trace map in K-theory. Explicitly, first tr extends to matrices with entries in C ∗ (0g , σ) as (with Trace denoting matrix trace): tr(f ⊗ r) = Trace(r)tr(f ). Then the extension of tr to K0 is given by [tr]([e] − [f ]) = tr(e) − tr(f ), where e, f are idempotent matrices with entries in C ∗ (0g , σ). 9.2. The isomorphism classes of algebras C ∗ (0g , σ). Let σ ∈ Z 2 (0g , U (1)) be a multiplier on 0g . If σ 0 ∈ Z 2 (0g , U (1)) is another multiplier on 0g such that [σ] = [σ 0 ] ∈ H 2 (0g , U (1)), then it can be easily shown that C ∗ (0g , σ) ∼ = C ∗ (0g , σ 0 ). ∗ ∗ That is, the isomorphism classes of the C -algebras C (0g , σ) are naturally parametrized by H 2 (0g , U (1)). But H 2 (0g , U (1)) ∼ = H 2 (Σg , U (1)) ∼ = U (1) and the isomorphism is ˇ given explicitly by [σ] →< [σ], [Σg ] >, where [σ] is now viewed as a Cech 2-cocycle on Σg with coefficients in U (1), and [Σg ] denotes the fundamental class of the genus g Riemann surface. We summarize this below.

662


Lemma 13. The isomorphism classes of twisted group C ∗ -algebras C ∗ (0g , σ) are naturally parametrized by U (1) ∼ = R/Z ∼ = (0, 1]. The classification map is given explicitly by [σ] →< [σ], [Σg ] >, ˇ where [σ] is now viewed as a Cech 2-cocycle on Σg with coefficients in U (1), and [Σg ] denotes the fundamental class of the genus g Riemann surface. 9.3. Range of the trace map on K0 . We can now state the first major theorem of this section. Theorem 12. The range of the trace map is [tr](K0 (Cr∗ (0g , σ))) = Zθ + Z, where 2πθ =< σ, [Σg ] > ∈ (0, 1] is the result of pairing the multiplier σ on 0g with the fundamental class of Σg . Proof. We first observe that by the results of the previous section the twisted Kasparov map is an isomorphism. Therefore to compute the range of the trace map on K0 , it suffices to compute the range of the trace map on elements of the form ∂E+ 0 ] − [/∂E+ 1 ]) µσ ([/ for any element

∂E+ 1 ] ∈ K0 (Σg ). [/ ∂E+ 0 ] − [/

By the twisted analogue of the L2 index theorem of Atiyah [At] and Singer [Si] for elliptic operators on a covering space that are invariant under the projective action of the fundamental group defined by σ, and which is due to Gromov [Gr2] (see also [Ma] for a proof of a further generalization), one has 1 ˆ + hA(Σg ) ch(E)e[ω] , [Σg ]i. ∂f [tr](ind0g (/ E ⊗ ∇)) = 2π

(∗)

We next simplify the right hand side of (∗) using ˆ g ) = 1, A(Σ ch(E) = rank E + c1 (E), e[ω] = 1 + [ω]. Therefore one has + ∂f [tr](ind0g (/ E ⊗ ∇)) = rank E

h[ω], [Σg ]i hc1 (E), [Σg ]i + , 2π 2π

and we see that ∂E+ 1 ])) = (rank E 0 − rank E 1 ) [tr](µσ ([/∂E+ 0 ] − [/

h[ω], [Σg ]i hc1 (E 0 ) − c1 (E 1 ), [Σg ]i + . 2π 2π

It follows that the range of the trace map on K0 is Z h[ω], [Σg ]i − θ ∈ Z. 2π

h[ω],[Σg ]i 2π

+ Z = Zθ + Z, because


663

We will now discuss some applications of this result. We begin by studying projections in the twisted group C ∗ -algebra, which is a problem of independent interest. Proposition 4. Let σ ∈ H 2 (Σg , R/Z) be a multiplier on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1] be the result of pairing σ with the fundamental class of Σg . If θ = p/q is rational, then there are only q − 1 unitary equivalence classes of projections, other than 0 and 1, in the reduced twisted group C ∗ -algebra Cr∗ (0g , σ). Proof. By assumption, θ = p/q. Let P be a projection in Cr∗ (0g , σ). Then 1 − P is also a projection in Cr∗ (0g , σ) and one has 1 = tr(1) = tr(P ) + tr(1 − P ). Each term in the above equation is non-negative. By the previous theorem, it follows that tr(P ) ∈ {0, 1/q, 2/q, . . . 1}. By faithfulness and normality of the trace tr, it follows that there are only q − 1 unitary equivalence classes of projections, other than those of 0 and 1 in Cr∗ (0g , σ). Our second application will involve the Kadison constant of a twisted group C ∗ algebra, which we will now recall. The Kadison constant of Cr∗ (0g , σ) is defined by: Cσ (0g ) = inf{tr(P ) : P is a non-zero projection in Cr∗ (0g , σ) ⊗ K}. Recall from earlier sections the following Hamiltonians: Hη =

1 1 (d − iη)∗ (d − iη) = ∇∗ ∇, 2 2

and Hη,V = Hη + V, where V is any 0g -invariant potential on H. The operators Hη and Hη,V are invariant under the projective (0g , σ)-action. Proposition 5. Let σ ∈ H 2 (Σg , R/Z) be a multiplier on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1] be the result of pairing σ with the fundamental class of Σg . If θ = p/q is rational, then the spectrum of any associated Hamiltonian Hη,V has a band structure, in the sense that the intersection of the resolvent set with any compact interval in R has only a finite number of components. In particular, the intersection of σ(Hη,V ) with any compact interval in R is never a Cantor set. Proof. By the previous proposition, it follows that one has the estimate Cσ (0g ) ≥ 1/q > 0. Then one applies the main result in Brüning-Sunada [BrSu] to deduce the proposition. This leaves open the question of whether there are Hamiltonians with Cantor spectrum when θ is irrational. In the Euclidean case, this is usually known as the Ten Martini Problem, and is to date, not completely solved, though much progress has been made (cf. [Sh]). We pose a generalization of this problem to the hyperbolic case (which also includes the Euclidean case): Conjecture (The Ten Dry Martini Problem). Let σ ∈ H 2 (Σg , R/Z) be a multiplier on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1] be the result of pairing σ with the fundamental class of Σg . If θ is irrational, then there is an associated Hamiltonian Hη,V with a Cantor set type spectrum, in the sense that the intersection of σ(Hη,V ) with some compact interval in R is a Cantor set.

664


We will next apply the range of the trace Theorem 12 to deduce results about the discrete Hamiltonian Hτ , as in Sect. 5. Proposition 6. Let σ ∈ H 2 (Σg , R/Z) be a multiplier on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1] be the result of pairing σ with the fundamental class of Σg . If θ = p/q is rational, then the spectrum of the associated discrete Hamiltonian Hτ has a band structure, in the sense that the intersection of the resolvent set with R has only a finite number of components. In particular, the intersection of σ(Hτ ) with any compact interval in R is never a Cantor set. Proof. From the estimate Cσ (0g ) ≥ 1/q > 0, the main result in [Sun] implies the proposition. This leads us to our next conjecture. Conjecture (The Discrete Ten Dry Martini Problem). Let σ ∈ H 2 (Σg , R/Z) be a multiplier on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1] be the result of pairing σ with the fundamental class of Σg . If θ is irrational, then the associated Hamiltonian Hτ has Cantor spectrum. 9.4. On the classification of twisted group C ∗ -algebras. We will now use the range of the trace found in Theorem 12 to give a complete classification, up to isomorphism, of the twisted group C ∗ -algebras C ∗ (0, σ). A similar complete classification, up to Morita equivalence, is contained in [Ma]. Proposition 7 (Isomorphism classification of twisted group C ∗ -algebras). Let σ, σ 0 ∈ H 2 (Σg , R/Z) be multipliers on 0g , and 2πθ =< σ, [Σg ] >∈ (0, 1], 2πθ0 =< σ 0 , [Σg ] >∈ (0, 1] be the result of pairing σ, σ 0 with the fundamental class of Σg . Then C ∗ (0g , σ) ∼ = C ∗ (0g , σ 0 ) if and only if θ0 ∈ {θ, 1 − θ}. Proof. Let tr and tr 0 denote the canonical traces on C ∗ (0g , σ) and C ∗ (0g , σ 0 ) respectively. Let φ : C ∗ (0g , σ) → C ∗ (0g , σ 0 ) be an isomorphism, and let φ∗ : K0 (C ∗ (0g , σ)) → K0 (C ∗ (0g , σ 0 )) denote the induced map on K0 . By Theorem 12, the range of the trace map on K0 is [tr](K0 (C ∗ (0g , σ))) = Zθ + Z and

[tr 0 ](K0 (C ∗ (0g , σ 0 ))) = Zθ0 + Z.

So there are elements [P ] ∈ K0 (C ∗ (0g , σ)) and [P 0 ] ∈ K0 (C ∗ (0g , σ 0 )) such that [tr]([P ]) = θ and [tr 0 ]([P 0 ]) = θ0 . Clearly one has tr ◦ φ = tr 0 , which induces the identity [tr] ◦ φ∗ = [tr 0 ] in K0 (C ∗ (0g , σ 0 )). In Sect. 8, we have proved that K0 (C ∗ (0g , σ)) ∼ = Z2 ∼ = K0 (C ∗ (0g , σ 0 )). In the basis above, one has φ∗ : Z[P ] ⊕ Z ∼ = K0 (C ∗ (0g , σ)) → K0 (C ∗ (0g , σ 0 )) ∼ = Z[P 0 ] ⊕ Z. Since φ∗ [1] = [1] and φ∗ ∈ GL(2, Z), one sees that there is an integer n such that


665

φ∗ =

1 n 0 ±1

! .

Assembling these results, one has θ = [tr]([P ]) = [tr](φ∗ [P ]) = [tr 0 ](n[1] ± [P 0 ]) = n ± θ 0 . Since θ, θ0 ∈ (0, 1], one deduces that θ0 ∈ {θ, 1 − θ}. Let ψ : Σg → Σg be an orientation reversing diffeomorphism. We can assume without loss of generality that ψ has a fixed point x0 ∈ Σg . This is because there is an orientation preserving diffeomorphism η of Σg whose value at the point ψ(x0 ) is equal to x0 ; in fact η can be chosen to be isotopic to the identity (cf. exercise A2, chapter 1, [Helg]). The composition η ◦ ψ is then an orientation reversing diffeomorphism of Σg with fixed point x0 . Then ψ induces an automorphism ψ∗ : 0g → 0g of the fundamental group π1 (Σg , x0 ) ∼ = 0g . We first evaluate < ψ ∗ σ, [Σg ] >=< σ, ψ∗ [Σg ] >= < σ, [Σg ] > =< σ, ¯ [Σg ] >, since ψ is orientation reversing. By Lemma 13 we see that ψ ∗ σ = σ¯ ∈ H 2 (0g , U (1)). Therefore the automorphism ψ∗ of 0g induces an isomorphism of twisted group C ∗ -algebras ¯ C ∗ (0g , σ) ∼ = C ∗ (0g , ψ ∗ σ) ∼ = C ∗ (0g , σ). Therefore if θ0 ∈ {θ, 1 − θ}, one has C ∗ (0g , σ) ∼ = C ∗ (0g , σ 0 ), completing the proof of the proposition. 9.5. Twisted ICC group von Neumann algebras and type II1 factors. Recall that an ICC group 0 is one in which every non-trivial conjugacy class is infinite. There are many examples of ICC groups, such as free groups, fundamental groups of compact surfaces, etc. It is well known that the group von Neumann algebras of these groups are type II1 factors [Tak]. We will now prove that a similar result holds for the twisted group von Neumann algebras (this result probably exists in the literature but for completeness we reproduce a proof). We briefly recall some definitions. Let W ∗ (0, σ) denote the twisted group von Neumann algebra, where σ is a multiplier on 0, which is by definition the weak closure of C ∗ (0, σ), or equivalently, the weak closure of the algebraic group algebra C(0, σ) in the σ-regular representation on `2 (0). Let Proj(W ∗ (0, σ)) denote the set of all projections in W ∗ (0, σ). Then one has Proposition 8. Let 0 be an ICC group, and σ ∈ H 2 (0, R/Z) be a multiplier on 0. Then W ∗ (0, σ) is a II1 factor. In particular, tr(Proj(W ∗ (0, σ))) = [0, 1]. Proof. By the commutant theorem for the regular σ-representation we see that the com¯ We need to compute the centre Z(0, σ) mutant of W ∗ (0, σ) is identified with W ∗ (0, σ). ¯ Let of W ∗ (0, σ), which is equal to the intersection Z(0, σ) = W ∗ (0, σ) ∩ W ∗ (0, σ). T : 0P→ B(`2 (0)) denote the left projective (0, σ)-action. Regard x ∈ W ∗ (0, σ) ¯ as x = γ∈0 x(γ)T (γ). Since W ∗ (0, σ) is the weak closure of C(0, σ), it follows that (x(γ))γ∈0 ∈ `2 (0). Now x ∈ Z(0, σ) if and only if x commutes with T (γ 0 ), γ 0 ∈ 0. But X T (γ 0 )x = x(γ)σ(γ 0 , γ)T (γ 0 γ) γ∈0

=

X

γ∈0

and

−1

−1

x(γ 0 γ)σ(γ 0 , γ 0 γ)T (γ),

666


xT (γ 0 ) =

X

x(γ)σ(γ, γ 0 )T (γγ 0 )

γ∈0

=

X

−1

−1

x(γγ 0 )σ(γγ 0 , γ 0 )T (γ).

γ∈0 −1

−1

−1

−1

Therefore we see that x(γ 0 γ)σ(γ 0 , γ 0 γ) = x(γγ 0 )σ(γγ 0 , γ 0 ) for all γ 0 ∈ 0. So −1 |x(γ 0 γγ 0 )| = |x(γ)| for all γ 0 ∈ 0. That is, |x(·)| is constant on each conjugacy class. Now since x ∈ `2 (0), it follows that x vanishes on each infinite conjugacy class. Since 0 is an ICC group, it follows that x(γ) = 0 for all γ 6= 1. So Z(0, σ) is 1-dimensional and W ∗ (0, σ) is a II1 factor. 10. The Topological Index and the Index Theorem This section identifies the Hall conductivity τc (P, P, P ) = τ (P dP dP ) with a topological invariant, generalizing the work of [Xia]. Suppose that 0g is a discrete, cocompact subgroup of SO0 (2, 1). That is, 0g is the fundamental group of a Riemann surface Σg of genus g > 1. Then for any σ ∈ H 2 (0g , U (1)), the twisted Kasparov isomorphism, µσ : K• (Σg ) → K• (Cr∗ (0g , σ)) is defined as in the previous section. We note in the following section (using a result of [Ji]) that given any projection P in Cr∗ (0, σ) there is both a projection P˜ in the same K0 class but lying in a dense subalgebra, stable under the holomorphic functional calculus, and a Fredholm module for this dense subalgebra, which may be paired with P˜ to obtain an analytic index. On the other hand, by the results of the current section, given any such projection P there is a topological index that we can associate to it. The main result we prove here is that the (analytic index) = (topological index). The first step in the proof is to show that given an additive group cocycle c ∈ Z 2 (0g ) we may define canonical pairings with K0 (Σg ) and K0 (Cr∗ (0g , σ)) which are related by the twisted Kasparov isomorphism. We do this by generalizing some of the results of Connes and Connes-Moscovici to the twisted case. The group 2-cocycle c may be regarded as a skew-symmetrized function on 0g × 0g × 0g , so that we can modify a standard construction in [CM] to obtain a cyclic 2-cocycle τc on C(0g , σ) ⊗ R by defining: X f 0 (g0 )f 1 (g1 )f 2 (g2 )c(1, g1 , g1 g2 )σ(g1 , g2 ). τc (f 0 ⊗r0 , f 1 ⊗r1 , f 2 ⊗r2 ) = Tr(r0 r1 r2 ) g0 g1 g2 =1

Note that τc extends to C(0g , σ) ⊗ L2 , (where L2 denotes Hilbert-Schmidt operators) and by the pairing theory of [Co] one gets an additive map [τc ] : K0 (C(0g , σ) ⊗ R) → R. Explicitly, [τc ]([e] − [f ]) = τec (e, · · · , e) − τec (f, · · · , f ), where e, f are idempotent matrices with entries in (C(0g , σ) ⊗ R)∼ , the unital algebra obtained by adding the identity to C(0g , σ)⊗R and τec denotes the canonical extension of τc to (C(0g , σ)⊗R)∼ . + Let /∂f E ⊗ ∇ be the Dirac operator defined in the previous section, which is invariant under the projective action of the fundamental group defined by σ. By definition, the + ∂f (c, 0g , σ)-index of / E ⊗ ∇ is


667

+ [τc ](ind0g (/ ∂f E ⊗ ∇)) ∈ R.

It only depends on the cohomology class [c] ∈ H 2 (0g ), and it is linear with respect to [c]. We assemble this to give the following theorem. Theorem 13. Given [c] ∈ H 2 (0g ) and σ ∈ H 2 (0g , U (1)) a multiplier on 0g , there is a canonical additive map h[c], i : K0 (Σg ) → R, which is defined as + h[c], [/ ∂E+ ]i = [τc ](ind0g (/∂f E ⊗ ∇)) ∈ R.

Moreover, it is linear with respect to [c]. By a generalization of the Connes-Moscovici higher index theorem [CM] to the twisted case of elliptic operators on a covering space that are invariant under the projective action of the fundamental group defined by σ, (see [Ma] for a detailed proof), one has (∗)

1 ˆ + hA(Σg ) ch(E)e[ω] ψ ∗ (c), [Σg ]i, ∂f [τc ](ind0g (/ E ⊗ ∇)) = 2π

where ψ : Σg → Σg is the classifying map of the universal cover (which in this case is the identity map) and [c] is considered as a degree 2 cohomology class on Σg . We next ˆ g ) = 1 and that simplify the right hand side of (∗) using the fact that A(Σ ch(E) = rank E + c1 (E), ψ ∗ (c) = c, e[ω] = 1 + [ω]. We obtain

rank E + h[c], [Σg ]i. ∂f [τc ](ind0g (/ E ⊗ ∇)) = 2π

Corollary 11. Let c, [c] ∈ H 2 (0g ), be the area cocycle. Then one has h[c], [/ ∂E+ ]i = 2(g − 1) rank E ∈ Z. Proof. When c, [c] ∈ H 2 (0g ), is the area 2-cocycle, one has h[c], [Σg ]i = −2πχ(Σg ) = 4π(g − 1).

Remarks 14. These theorems have been generalized in [Ma]. They agree with Xia’s result [Xia], although our methods are different. We next describe the canonical pairing of K0 (Cr∗ (0g , σ)), given [c] ∈ H 2 (0g ). Since Σg is negatively curved, we know from [Ji] that     X |f (γ)|2 (1 + l(γ))k < ∞ for all k ≥ 0 , Aσ,g = f : 0g → C |   γ∈0g

where l : 0g → R+ denotes the length function, is a dense and spectral invariant subalgebra of Cr∗ (0g , σ). In particular it is closed under the smooth functional calculus,

668


and is known as the algebra of rapidly decreasing L2 functions on 0g . By a theorem of [Bost], the inclusion map Aσ,g ⊂ Cr∗ (0g , σ) induces an isomorphism Kj (Aσ,g ) ∼ = Kj (Cr∗ (0g , σ)),

j = 0, 1.

As Σg is a negatively curved manifold, we know (by [Mos] and [Gr]) that degree 2 cohomology classes in H 2 (0g ) have bounded representatives i.e. bounded 2-cocycles on 0g . Let c be a bounded 2-cocycle on 0g . Then it defines a cyclic 2-cocycle τc on the twisted group algebra C(0g , σ), by a slight modification of the standard formula [CM], ([Ma] for the general case) X f 0 (g0 )f 1 (g1 )f 2 (g2 )c(1, g1 , g1 g2 )σ(g1 , g2 ). τc (f 0 , f 1 , f 2 ) = g0 g1 g2 =1

Here c is assumed to be skew-symmetrized. Since the only difference with the expression obtained in [CM] is σ(g1 , g2 ), and since |σ(g1 , g2 )| = 1, we can use Lemma 6.4, part (ii) in [CM] and the assumption that c is bounded, to obtain the necessary estimates which show that in fact τc extends continuously to the bigger algebra Aσ,g . This induces an additive map in K-theory as before: [τc ] : K0 (Aσ,g ) → R [τc ]([e] − [f ]) = τec (e, · · · , e) − τec (f, · · · , f ), where e, f are idempotent matrices with entries in (Aσ,g )∼ (the unital algebra associated to Aσ,g ) and τec is the canonical extension of τc to (Aσ,g )∼ . Observe that the twisted Kasparov map is merely ∗ + µσ ([/ ∂E + ]) = j∗ (ind0g (/ ∂f E ⊗ ∇)) ∈ K0 (C (0g , σ)).

Here j : C(0g , σ) ⊗ R → C ∗ (0g , σ) ⊗ K is the natural inclusion map, and j∗ : K0 (C(0g , σ) ⊗ R) → K0 (C ∗ (0g , σ)) is the induced map in K-theory. Therefore one has the equality h[c], µ−1 σ [P ]i = h[τc ], [P ]i ∗ ∼ for any [P ] ∈ K0 (Aσ,g ) = K0 (C (0g , σ)). Using the previous corollary, one has r

Corollary 12. Let c, [c] ∈ H (0g ), be the area 2-cocycle. Then c is known to be a bounded 2-cocycle, and one has 2

h[τc ], [P ]i = 2(g − 1)(rank E 0 − rank E 1 ) ∈ Z, where [P ] ∈ K0 (Aσ,g ) ∼ = K0 (Cr∗ (0g , σ)), and where ∂E+ 0 ] − [/ ∂E+ 1 ] ∈ K0 (Σg ). µ−1 σ [P ] = [/ Remarks 15. This generalizes the main result of Xia, [Xia]. We will next prove the existence of a canonical element in KK(Cr∗ (0g , σ), C), which we call the twisted Mishchenko element. Theorem 16 (The twisted Mishchenko element). There exists a unique element [mσ ] ∈ KK(Cr∗ (0g , σ), C), called the twisted Mishchenko element, such that (∗)

[1] ⊗Cr∗ (0g ,σ) [mσ ] = 2(g − 1),

where [1] ∈ K0 (Cr∗ (0g , σ)) denotes the module generated by Cr∗ (0g , σ).


669

Proof. By the well definedness of the Kasparov intersection product [Kas2], the equation (∗) above defines the element [mσ ] uniquely. In the next section we construct a 2-summable Fredholm module (F, H), which defines an element [(F, H)] ∈ KK(Cr∗ (0g , σ), C), and whose Chern character is the cyclic area 2-cocycle [τc ], (cf. [Co2]) defined by the area 2-cocycle c on the discrete group 0. We compute that [1] ⊗Cr∗ (0g ,σ) [(F, H)] = index(F ) = τc (1, 1, 1) = 2(g − 1). By uniqueness (proved above), we see that [mσ ] = [(F, H)], which establishes existence. This completes the proof of Theorem 4 and Corollary 6 because we regard index(P F P ) as the result of pairing an element of the K-homology of Σg (defined by the twisted Mishchenko element) with an element of K0 (B 0 ) ∼ = K0 (C ∗ (0g , σ)). This enables us to demonstrate the relationship between Corollary 12 and the discrete model of the hyperbolic Hall effect. 11. A Discrete Fredholm Module and the Analytic Index We have observed following Sunada that Hτ is an operator in the twisted algebraic group algebra C(0, σ), which is a subalgebra of Aσ,g . We remark that a spectral projection into a gap in the spectrum of Hτ is given by the smooth functional calculus applied to Hτ . It follows from [Ji] that such spectral projections lie in Aσ,g . Connes constructs a Fredholm module for C0 which can be adapted to the case of C(0, σ). In his construction the Hilbert space is the `2 sections of the restriction of the spinor bundle to the orbit 0.u. This space is isomorphic to H = `2 (0) ⊕ `2 (0) under the map ι ⊕ ι. The grading is the obvious one given by the 2 × 2 matrix ε. We may define the operator F as in Sect. 7 to be multiplication by the matrix function 0 ψ∗ , ψ 0 where we restrict ψ to the orbit 0.u. Connes [Co2] shows that the module of the previous paragraph is 2-summable for C0. We show below using the same argument as in [Co2] that if λ denotes the left regular σ-representation of C ∗ (0, σ) then [F, λ(γ)] is Hilbert-Schmidt. So (H, F ) is also a 2summable module for C(0, σ). We may also exploit [Co2] to determine explicitly the character of this Fredholm module for our case. We now summarize some of the pertinent details. First, we are using the usual trace tr on the bounded operators on H. Second, our module is the `2 sections of the restriction of the spinor bundle to the orbit. From this point of view F corresponds to Clifford multiplication of a unit tangent vector to a / 0.u. We use the same geodesic connecting a given vertex of the graph to a point x0 ∈ notation ϕ(γ.u) for this unit tangent vector, regarding ϕ as a function from 0.u to T (H), the tangent space of H, as no confusion will arise. Next, note that for f ∈ H, [F, λ(γ)]f (γ 0 ) = (ϕ(γ 0 .u) − ϕ(γ −1 γ 0 .u))λ(γ)f (γ 0 ) . Connes observes that the operator on the RHS is Hilbert-Schmidt as a result of the convergence of the Poincaré series:

670


X

exp(−2d(γ.u, x0 )).

γ∈0

Thus if γ0 , γ1 , γ2 lie in 0 then 1 tr(ε[F, λ(γ0 )][F, λ(γ1 )][F, λ(γ2 )]). 2 Now λ(γ0 )[F, λ(γ1 )][F, λ(γ2 )] is the operator tr(ελ(γ0 )[F, λ(γ1 )][F, λ(γ2 )]) =

(λ(γ0 )[F, λ(γ1 )][F, λ(γ2 )]f )(γ) = ζ(γ)σ(γ1 , γ2 )σ(γ0 , γ1 γ2 )f ((γ0 γ1 γ2 )−1 γ), where ζ(γ) denotes Clifford multiplication by −1 −1 −1 (ϕ(κ−1 0 γ) − ϕ(κ1 γ))(ϕ(κ1 γ) − ϕ(κ2 γ)),

with κj = γ0 . . . γj . We can now obtain a formula for the cyclic cocycle. Following the calculation on p. 344 of [Co2] we find that for γ0 γ1 γ2 6= 1 the character of the cocycle associated to our Fredholm module is zero while for γ0 γ1 γ2 = 1 it is given by X trace(εζ(γ))σ(γ1 , γ2 ), tr(ελ(γ0 )[F, λ(γ1 )][F, λ(γ2 )]) = 2 γ∈0

where “trace” denotes the matrix trace on the Clifford algebra and we are utilising the fact that, for our choice of σ, σ(γ0 , γ1 γ2 ) = σ(γ0 , γ0−1 ) = 1. Connes proves that trace(εζ(γ)) is the Euclidean area of the triangle in the complex plane with vertices corresponding to the tangent vectors ϕ(κ−1 j γ). Then the additive group cocycle on 0 given by X trace(εζ(γ)) c(1, γ1 , γ1 γ2 ) = γ∈0

is what Connes calls the “volume” or area cocycle on 0. Thus we find that we have computed the character of our Fredholm module to be: τc (γ0 , γ1 , γ2 ) = c(1, γ1 , γ1 γ2 )σ(γ1 , γ2 ) for γ0 γ1 γ2 = 1, with τc being zero when γ0 γ1 γ2 6= 1 (the normalisation differs from [Co2] p. 295, but conforms with [CM]). This formula extends to give a non-trivial element of the cyclic cohomology of the smooth subalgebra Aσ,g via the formula X f 0 (γ0 )f 1 (γ1 )f 2 (γ2 )c(1, γ1 , γ1 γ2 )σ(γ1 , γ2 ), τc (f 0 , f 1 , f 2 ) = γ0 γ1 γ2 =1

for f 0 , f 1 , f 2 ∈ Aσ,g . Summarizing the discussion above, we have the first result of this section. Proposition 9. There is a 2-summable Fredholm module (F, H) over Aσ,g whose Chern character is given by the area cyclic 2-cocycle τc . Therefore, by the index pairing in [Co2], one has index(P (F ⊗ I)P ) = h[τc ], [P ]i, where P denotes a projection in Aσ,g ⊗ K(H1 ) and index(P (F ⊗ I)P ) denotes the index of the Fredholm operator P (F ⊗ I)P .


671

Assembling this proposition with our results from Sect. 10 we have: Theorem 17. Let P denote a projection in Aσ,g ⊗ K(H1 ) Then in the notation of Corollary 12 of the previous section, one has index(P (F ⊗ I)P ) = 2(g − 1)(rank E 0 − rank E 1 ) ∈ Z, where index(P (F ⊗ I)P ) denotes the index of the Fredholm operator P (F ⊗ I)P acting ∂E+ 0 ] − [/∂E+ 1 ] ∈ K0 (Σg ). on the Hilbert space P (H ⊗ H1 ) and µ−1 σ [P ] = [/ Corollary 13. Let P be a projection into a gap in the spectrum of the discrete Hamiltonian Hτ . Then P ∈ Aσ,g , and may be regarded as a twisted convolution operator by a function p on 0. Then in the notation of Corollary 12: X p(γ0 )p(γ1 )p(γ2 )c(1, γ1 , γ1 γ2 )σ(γ1 , γ2 ) index(P F P ) = γ0 γ1 γ2 =1

= 2(g − 1)(rank E 0 − rank E 1 ) ∈ Z. Note that this explains the integrality of the cyclic 2-cocycle, X p(γ0 )p(γ1 )p(γ2 )c(1, γ1 , γ1 γ2 )σ(γ1 , γ2 ), γ0 γ1 γ2 =1

in two different ways: firstly as the index of the Fredholm operator P F P , and secondly as the topological index 2(g − 1)(rankE 0 − rankE 1 ), which is also clearly an integer. 12. The Non-Commutative Unit Disc In [Klim+Les1,2] Klimek and Lesznewski have introduced a non-commutative unit disc and higher genus Riemann surfaces. Their disc algebra can be realised as a Toeplitz algebra obtained by compressing the commutative algebra of functions on the disc using the projection onto a holomorphic subspace of one of its representation spaces. We shall describe their construction in a slightly more general setting. The algebra Cc (G/K) acts by multiplication (f 7→ M (f )) on L2 (G/K, µ) for any quasi-invariant measure µ. The group G also has an induced σ-representation W on this space, and we shall suppose that there is an irreducible subrepresentation on a subspace which is projected out by P . (This is certainly true in the case considered in [Klim+Les1].) The algebra P M (Cc (G/K))P then gives the non-commutative analogue of Cc (G/K). Now, by definition G also acts and therefore defines automorphisms of this algebra. Since it commutes with P the covariance algebra P M (Cc (G/K))P o G is the same as P (Cc (G/K) o G)P , which is the compression of the imprimitivity algebra A = Cc (G/K) o G. For higher genus surfaces one simply takes the 0-invariant part of P Cc (G/K)P , which is consistent with our constructions above. Suppose now that the irreducible subspace is defined by a reproducing kernel. Invariance of the kernel means that it is defined by twisted convolution with a continuous σ-positive definite function ξP or, equivalently, that Z P = W (ξP ) = ξP (g)W (g) dg.

672


Now observe that ξP can be identified with an element of the imprimitivity algebra so the covariance algebra can be identified with the compression ξP ∗ A ∗ ξP of the imprimitivity algebra. In the cases of interest ξP is the σ-positive-definite function associated with a C ∞ vector, and so is smooth. This means that the natural module ξP ∗ M for ξP ∗ A ∗ ξP retains the structure of a Fredholm module.

References [At] [Av+K+P+S] [Av+S+S] [Av+S+Y ] [Bel] [Bel+E+S] [Bost] [BrSu] [CGT]

[CEY] [Comtet] [Comtet+H] [Co] [Co2] [CM] [Cu] [Elliott]

[Green] [GH] [Gr] [Gr2] [Helg]

[Iengo+Li] [Ji] [JuKas]

Atiyah, M.F.: Elliptic operators, discrete groups and Von Neumann algebras. Astérisque no.32–33, 43–72 (1976) Avron, J., Klein, M., Pnueli, A., Sedun, L.: Hall conductance and adiabatic charge transport of leaky tori. Phys. Rev. Lett. 69, 128–131 (1990) Avron, J., Seiler, R., Simon, B.: Charge deficiency, charge transport and comparisons of dimension. Commun. Math. Phys. 159, 399–422 (1994) Avron, J., Seiler, R., Yaffe, I.: Adiabatic theorems and applications to the integer quantum Hall effect. Commun. Math. Phys. 110, 33–49 (1987) Bellissard, J.: K-theory of C ∗ -algebras in solid state physics Springer Lecture Notes in Physics 257, 1986, pp. 99–156 Bellissard, J., van Elst, A., Schulz-Baldes, H.: The non-commutative geometry of the quantum Hall effect. J. Math. Phys. 35, 5373–5451 (1994) Bost, J.: Principe d’Oka, K-théorie et systémes dynamiques non commutatifs. Invent. Math. 101 no. 2, 261–333 (1990) Brüning, J., Sunada, T.: On the spectrum of gauge-periodic elliptic operators. Méthodes semi-classiques. Vol. 2 (Nantes, 1991).Astérisque 210, 65–74 (1992) Cheeger, J., Gromov, M., Taylor, M.: Finite propagation speed, kernel estimates for functions of the Laplace operator and the Geometry of complete Riemannian manifolds. Jour. Diff. Geom. 17, 15–54 (1982) Choi, M., Elliott, G., Yui, N.: Gauss polynomials and the rotation algebra. Invent. Math. 99 no. 2, 225–246 (1990) Comtet, A.: On the Landau levels on the hyperbolic plane. Ann.Phys. 173, 185–209 (1987) Comtet, A., Houston, P.: Effective action on the hyperbolic plane in a constant external field. J. Math. Phys. 26, 185–191 (1985) Connes, A.: Non commutative differential geometry. Publ. Math. I.H.E.S. 62, 257–360 (1986) Connes, A.: Noncommutative geometry. San Diego, CA: Academic Press, Inc., 1994 Connes, A., Moscovici, H.: Cyclic cohomology, the Novikov conjecture and hyperbolic groups. Topology 29, 345–388 (1990) Cuntz, J.: K-theoretic amenability for discrete groups. J. Reine Angew. Math. 344, 180–195 (1983) Elliott, G.: On the K-theory of the C ∗ -algebra generated by a projective representation of a torsion-free discrete group. In: Operator Algebras and Group Representations, London: Pitman, 1983, pp. 157–184 Green, P.: The structure of imprimitivity algebras. J. Func. Anal. 36, 88–104 (1980) Griffiths, P. and Harris, J.: Principles of algebraic geometry. New York: Wiley, 1978 Gromov, M.: Volume and bounded cohomology. Publ. Math. I.H.E.S. 56, 5–99 (1982) Gromov, M.: Kähler-hyperbolicity and L2 Hodge theory. Diff. Geom. 33, 263–292 (1991) Helgason, S.: Differential Geometry, Lie Groups and Symmetric Spaces. Pure and Applied Mathematics, 80, New York-London: Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers], 1978 Iengo, R., Li, D.: Quantum mechanics and the quantum Hall effect on Riemann surfaces. Nuclear Phys. B 413, 735–753 (1994) Ji, R.: Smooth dense subalgebras of reduced group C ∗ -algebras, Schwartz cohomology of groups and cyclic cohomology. Jour. Func. Anal. 107, 1–33 (1992) Julg, P., Kasparov, G.: Operator K-theory for the group SU(n, 1). J. Reine Angew. Math. 463, 99–152 (1995)


[Kas1] [Kas2]

[Kas3] [Klim+Les1] [Klim+Les2] [Ma] [MC] [Mos] [M+R+W] [Nak+Bel] [PR] [PR1] [PR2] [Ren1] [Ren2] [Ren3] [Rief] [Ros] [Sh] [Si] [Sun] [Tak] [Xia]

673

Kasparov, G.: Lorentz groups, K-theory of unitary representations and crossed products. Soviet Math. Dokl. 29, 256–260 (1984) Kasparov, G.: K-theory, group C ∗ -algebras and higher signatures. Conspectus, (1980), published in ‘Novikov conjectures, index theorems and rigidity’, vol. 1, editors S. Ferry, A. Ranicki, J. Rosenberg, Lond. Math. Soc. Lecture Note Series 226, Cambridge: Cambridge University Press, 1995 Kasparov, G.: Equivariant KK-theory and the Novikov conjecture. Inv. Math. 91, 147–201 (1988) Klimek, S., Lesznewski, A.: Quantum Riemann surfaces I: the unit disc. Commun. Math. Phys. 146, 105–122 (1992) Klimek, S., Lesznewski, A.: Quantum Riemann surfaces II: The discrete series. Lett. Math. Phys. 24, 125–139 (1992) Mathai, V.: In preparation McCann, P., Carey, A.: A discrete model of the integer quantum Hall effect. Publ. RIMS, Kyoto Univ. 32, 117–156 (1996) Mostow, G.: Strong rigidity of symmetric spaces. Ann. Math. Studies, 78, Princeton N J: Princeton University Press 1973 Muhly, P., Renault, J., Williams, P.: Equivalence and isomorphism for groupoid C ∗ -algebras. J. Operator Th. 17, 3–22 (1987) Nakamura, S., Bellissard, J.: Low energy bands do not contribute to the quantum Hall effect. Commun. Math. Phys. 131, 283–305 (1990) Packer, J., Raeburn, I.: Twisted cross products of C ∗ -algebras. Math. Proc. Camb. Phil. Soc. 106, 293–311 (1989) Packer, J., Raeburn, I.: On the structure of twisted group C ∗ -algebras. Trans.Am. Math. Soc. 334, 685–718 (1992) Packer, J., Raeburn, I.: Twisted cross products of C ∗ -algebras. II, Math. Ann. 287, 595–612 (1990) Renault, J.: A groupoid approach to C ∗ -algebras. Lecture Notes in Mathematics 793, Berlin: Springer, 1980 Renault, J.: Représentations des produits croisés d’algèbres de groupo¨ıdes. J.Operator Th. 18, 67–97 (1987) Renault, J.: The ideal structure of groupoid crossed product C ∗ -algebras. J. Operator Th. 25, 3–36 (1991) Rieffel, M.: C ∗ -algebras associated with irrational rotations. Pac. J. Math. 93, 415–429 (1981) Rosenberg, J.: Continuous trace algebras from the bundle theoretic point of view. Jour. Aus. Math. Soc. 47, 368–381 (1989) Shubin, M.: Discrete Magnetic Schrödinger operators. Commun. Math. Phys. 164, no.2, 259–275 (1994) Singer, I.M.: Some remarks on operator theory and index theory. Springer Lecture Notes in Math. 575, 1977, 128–137 Sunada, T.: A discrete analogue of periodic magnetic Schrödinger operators. Contemp. Math. 173, 283–299 (1994) Takesaki, M.: Theory of operator algebras I. New York: Springer-Verlag, 1979) Xia, J.: Geometric invariants of the quantum hall effect. Commun. Math. Phys. 119, 29–50 (1988)

Communicated by A. Connes

Commun. Math. Phys. 190, 675 – 695 (1998)

Communications in


Hidden Symmetries of the Principal Chiral Model Unveiled C. Devchand1,2 , Jeremy Schiff1 1 2

Department of Mathematics and Computer Science, Bar–Ilan University, Ramat Gan 52900, Israel International Centre for Theoretical Physics, 34100 Trieste, Italy

Received: 20 November 1996 / Accepted: 25 April 1997

Abstract: By relating the two-dimensional U(N) Principal Chiral Model to a simple linear system we obtain a free-field parametrisation of solutions. Obvious symmetry transformations on the free-field data give symmetries of the model. In this way all known “hidden symmetries” and Bäcklund transformations, as well as a host of new symmetries, arise.

1. Introduction The definition of complete integrability for field theories remains rather imprecise. One usually looks for structures analogous to those existing in completely integrable hamiltonian systems with finitely many degrees of freedom, such as a Lax–pair representation or conserved quantities equal in number to the number of degrees of freedom. A very transparent notion of integrability is that completely integrable nonlinear systems are actually simple linear systems in disguise. For example, the Inverse Scattering Transform for two dimensional integrable systems such as the KdV equation establishes a correspondence between the nonlinear flow for a potential and a constant–coefficient linear flow for the associated scattering data. Similarly, the twistor transform for the self-dual Yang-Mills equations converts solutions of nonlinear equations to holomorphic data in twistor space; and for the KP hierarchy Mulase has explicitly proven complete integrability by performing a transformation to a constant–coefficient linear system [11]. In all these examples, a map is constructed between solutions of a simple, automatically– consistent linear system and the nonlinear system in question. This is distinct from the Lax–pair notion of linearisation, with the nonlinear system in question arising as the consistency condition for a linear system. Just as the dynamics of completely integrable systems gets trivialised in an auxiliary space, it seems that the confusing plethora of symmetry transformations of these systems arise naturally from obvious transformations on the initial data of the associated linear

676

C. Devchand, J. Schiff

systems. This idea has been exploited recently by one of us [16] for the KdV hierarchy: A linearisation of KdV, mimicking Mulase’s for the KP hierarchy, was used to give a unified description of all known symmetries. The central feature of Mulase’s construction is a group G on which the relevant linear flow acts. The group G (or at least a dense subset thereof) is assumed to be factorisable into two subgroups G+ and G− . For the KP hierarchy G is a group of pseudo-differential operators. For KdV and for the two-dimensional Principal Chiral Model (PCM), as we shall see in this paper, G is a “loop group” of smooth maps from a contour C in the complex λ plane to some group H. This has subgroups G− (resp. G+ ) of maps analytic inside (resp. outside) C. Mulase notes that any flow on G induces flows on G± , but the flows on the factors induced by a simple linear flow on G can be complicated and nonlinear. This is the genesis of nonlinear integrable hierarchies; complete integrability is just a manifestation of the system’s linear origins. The universality of this kind of construction was noticed by Haak et al [8]. We consider on G the linear system d U = U,

(1)

where d is the exterior derivative on the base space M of the hierarchy, U is a Gvalued function on M and a 1-form on M with values in G+ . Consistency (Frobenius integrability) of this system requires d = ∧ . In fact for KP, KdV and PCM we have the stronger condition d = ∧ = 0, and (1) has the general solution U = eM U0 ;

dM = ,

U0 ∈ G.

(2)

The initial data U0 determines a solution of the linear system, and hence a solution of the associated nonlinear hierarchy. A hierarchy is specified by a choice of G with a factorisation and a choice of one-form . The purpose of this paper is to provide a description of the two-dimensional Principal Chiral Model in the general framework of Mulase’s scheme. We show that for the appropriate group G, and a choice of one-form within a certain class, solutions of Eq. (1) give rise to solutions of PCM. Thus there is a map giving, for each allowed choice of and each choice of initial data U0 , a solution of PCM. The allowed choices of are parametrised by free fields. The known hidden symmetries and Bäcklund transformations of PCM all have their origins in natural field-independent transformations of U0 . We also reveal other symmetries, corresponding to other transformations of U0 as well as to transformations of the free fields in . We were motivated to reconsider the symmetries of PCM by a recent paper of Schwarz [17], in which infinitesimal hidden symmetries were reviewed. However the mystery surrounding their origin remained. Further, Schwarz’s review did not encompass the work of Uhlenbeck [19] or previous work on finite Bäcklund transformations [9]. We wish to present all these results in a unified framework and to lift the veil obscuring the nature of these symmetries. 2. The Principal Chiral Model The defining equations for the U(N) PCM on two-dimensional Minkowski space M with (real) light-cone coordinates x+ , x− are ∂− A+ = 21 [A+ , A− ], ∂+ A− = 21 [A− , A+ ],

(3)

Hidden Symmetries of Principal Chiral Model Unveiled

677

where A± take values in the Lie algebra of U(N), i.e. they are N × N antihermitian matrices. Considering the sum and difference of the two equations in (3) yields the alternative “conserved current” form of the PCM equations ∂− A+ + ∂+ A− = 0 ,

(4)

together with the zero-curvature condition ∂− A+ − ∂+ A− + [A− , A+ ] = 0 .

(5)

The latter has pure–gauge solution A± = g −1 ∂± g ,

(6)

where g takes values in U(N). Substituting this into (4) yields the familiar harmonic map equation (7) ∂− (g −1 ∂+ g) + ∂+ (g −1 ∂− g) = 0. This is manifestly invariant under the “chiral” transformation g 7→ a g b, for a and b constant U(N) matrices. At some fixed point x0 in space-time, we may choose g(x0 ) = I, the identity matrix. The chiral symmetry then reduces to g 7→ b−1 g b.

(8)

There is a further invariance of the equations under the transformation g 7→ g −1 .

(9)

Equation (3) has obvious solutions [21] A+ = A(x+ ) ,

A− = B(x− ) ,

(10)

respectively left- and right–moving diagonal matrices, i.e. taking values in the Cartan subalgebra. (This type of solution is familiar from WZW models and for commuting matrices the Eqs. (3) indeed reduce to WZW equations.) In greater generality, the PCM equations imply that the spectrum of A+ (resp. A− ) is a function of x+ (resp. x− ) alone. Thus general solutions take the form: + − A+ = s0 (x+ , x− )A(x+ )s−1 0 (x , x ) + − A− = se0 (x+ , x− )B(x− )e s−1 0 (x , x ),

(11)

where A(x+ ) and B(x− ) are antihermitian diagonal matrices, and s0 (x+ , x− ), se0 (x+ , x− ) are unitary. For given A(x+ ), B(x− ), we have seen that there exists at least one such solution, that with s0 = se0 = I. We shall see in the next section that a solution A± of the PCM is determined by the diagonal matrices A(x+ ) and B(x− ), together with another free field; and our construction leads to solutions of precisely the form (11). Moreover, we shall prove in Sect. 6 that hidden symmetries and Bäcklund transformations act on the space of solutions with given A(x+ ) and B(x− ).

678


3. Construction of Solutions In this section we give the formulation of the PCM in the framework of Mulase’s general scheme. Let us begin by defining a one-form on two-dimensional Minkowski space M with coordinates (x+ , x− ), =−

B(x− ) − A(x+ ) + dx − dx . 1+λ 1−λ

(12)

Here A(x+ ), B(x− ) are arbitrary diagonal antihermitian matrices, depending only on x+ , x− respectively. Clearly, d = ∧ = 0 ,

(13)

dU =U

(14)

so that the linear equation

is manifestly Frobenius–integrable. The general solution is U (x+ , x− , λ) = eM (x M (x+ , x− , λ) = −

+

,x− ,λ)

1 1+λ

Z

U0 (λ) ,

x+ x+0

A(y + )dy + −

(15) 1 1−λ

Z

x− x− 0

B(y − )dy − ,

where U0 , the initial condition, is a free (unconstrained) element of the group G in which U takes values. We need to specify this group. Remarks. 1) Since A, B are anti-hermitian, hermitian–conjugation of (14) yields dU (λ)† = −U (λ)† (λ∗ ), whereas U −1 satisfies dU −1 (λ) = −U −1 (λ)(λ). We therefore obtain the condition U † (λ∗ ) = U −1 (λ).

(16)

2) has poles at λ = ±1, so it is analytic everywhere in the λ-plane including the point at ∞, except in two discs with centres at λ = ±1. We therefore introduce a contour C, the union of two small contours C± around λ = ±1 (such that λ = 0 remains outside both of them), dividing the λ-plane into two distinct regions: the “outside” {|λ − 1| > δ} ∩ {|λ + 1| > δ} and the “inside” {|λ − 1| < δ} ∪ {|λ + 1| < δ}, where δ < 1 is some small radius. I Definition. G is the group of smooth maps V = V (λ) from the contour C to GL(N, C) satisfying the condition V † (λ∗ ) = V −1 (λ).


679

We are going to pretend that there exists a Birkhoff factorisation G = G− G+ , where G− denotes the group of maps analytic inside C and G+ denotes the group of maps analytic outside C and equal to the identity at λ = ∞. The corresponding Lie algebra decomposition is G = G− ⊕ G+ . This factorisation is definitely a pretence; but the point is that sufficiently many elements of G do factor this way so that the results we will obtain using this factorisation do hold. For a more precise discussion we refer to [19, 8]. We now have the spaces in which the objects in (14),(15) take values. Clearly, is a one-form on M with values in the Cartan subalgebra of the Lie algebra G+ . The matrix U = U (x+ , x− , λ) is a map from M to G and U0 (λ) is an element of G (independent of x± ). Consider a solution U of (14). Assuming the existence of a Birkhoff factorisation for U , we can write (17) U = S −1 Y , where S −1 : M → G− and Y : M → G+ . Now, applying the exterior derivative on both sides and using (14) yields SS −1 = −dSS −1 + dY Y −1 .

(18)

SS −1 , which takes values in the Lie algebra G, decomposes into its components in the G− and G+ subalgebras. The above equation allows us to write separate equations for the projections: (SS −1 )− = −dSS −1 , (19) (SS −1 )+ = dY Y −1 . Here the suffix notation denotes the projection of an element of G into G± . We introduce a one-form Z taking values in G+ , Z = dY Y −1 = (SS −1 )+ .

(20)

Now, since S takes values in G− , it is analytic at λ = ±1 and has two power-series representations, converging in discs with centres at λ = ±1, viz. S=

∞ X

sn (x+ , x− )(1 + λ)n =

n=0

∞ X

sen (x+ , x− )(1 − λ)n ,

(21)

n=0

where the coefficients s0 (x+ , x− ), se0 (x+ , x− ) are U(N)-valued matrices. Inserting these expansions in (SS −1 ), we see that only the s0 and se0 terms survive the projection to the G+ subalgebra, yielding Z = (SS −1 )+ = − Define

s0 A(x+ )s−1 se0 B(x− )e s−1 0 0 dx+ − dx− . 1+λ 1−λ

A+ = s0 A(x+ )s−1 0 ,

A− = se0 B(x− )e s−1 0 .

(22)

(23)

These satisfy the PCM equations (3). The proof is immediate. From (20), d Z = Z ∧ Z. Inserting the form (22) in this equation yields

(24)

680


∂− A+ 1 ∂+ A− − + 1−λ 1+λ 2

1 1 − 1−λ 1+λ

[A+ , A− ] = 0 .

Since Y takes values in G+ , for consistency this equation needs to hold for all values 1 1 and 1+λ must be separately of λ away from ±1. In other words, the coefficients of 1−λ zero. This yields precisely the two equations in (3) as integrability conditions. Note that the solutions (23) have precisely the form (11). We have seen that for given diagonal matrices A(x+ ) and B(x− ), a solution of the linear field–independent system (14) determines a solution of the PCM in the spectral class of A and B. In fact the general solution of (14) takes the form (15), where the eM factor contains only spectral information (i.e. A, B). Everything else is encoded in the free element U0 (λ) ∈ G. So the freely–specifiable data {A(x+ ), B(x− ), U0 (λ)} corresponds to a solution of the PCM. Given any choice of these three fields, a solution of the PCM can be constructed in the following stages: (a) (b) (c) (d)

Construct the corresponding U (x+ , x− , λ) from (15). Perform the factorisation (17) to obtain S(x+ , x− , λ). Perform the two expansions (21) to extract the coefficients s0 (x+ , x− ) and se0 (x+ , x− ). Insert these in (23) to obtain a solution of the PCM.

Note that this procedure is purely algebraic, though the factorisation may not be very easy to perform in practice. However, it is clear that for any choice of A(x+ ), B(x− ) (which is tantamount to fixing the spectral class of A± ), every U0 (λ) ∈ G corresponds to a solution of the PCM. In fact there is a large redundancy, for a right–multiplication U0 7→ U0 k+ ;

k+ ∈ G +

(25)

corresponds to a right-multiplication U 7→ U k+ , which does nothing to alter the S −1 factor in (17). PCM solutions therefore correspond to G+ orbits in G, or equivalently, U0 (λ)’s from the Grassmannian G/G+ . This correspondence is, however, still redundant: Consider a left–multiplication by a diagonal matrix analytic inside C, U0 7→ h− U0 ;

h− ∈ G0,− , the maximal torus of G− .

(26)

Since this commutes with the diagonal eM , it corresponds to a transformation S −1 7→ h− S −1 . However, since h− is a diagonal matrix, the A± in (23) do not notice this transformation; they are invariant. The correct space of U0 ’s corresponding to solutions of (3) in each spectral class of A± is therefore the double coset G0,− \G/G+ . In particular, natural transformations of U0 (λ) preserving this double coset correspondence induce symmetry transformations on the space of PCM solutions.

4. The Extended Solution The fact that the consistency condition (24) with Z given by (22) yields the PCM equations is well known. Writing (20) in more familiar form, dY = ZY , it is precisely the PCM Lax-pair [14, 21],


681

1 A+ Y = 0, ∂+ + 1+λ 1 ∂− + A− Y = 0 . 1−λ

(27)

It is easy to check that the Y we have defined above has all the properties required of a solution of this pair of equations: 1. As a function of λ, the only singularities of Y on the entire λ-plane including the point at ∞ are at λ = ±1. 2. The solution of the system (27) is easily seen to satisfy the reality condition (16) Y † (λ∗ ) = Y −1 (λ).

(28)

3. There is an invariance of the Lax system: Y (x, λ) 7→ Y (x, λ)f (λ), which is usually fixed by setting (29) Y (x0 , λ) = I , for some fixed point x0 . This invariance corresponds to right–multiplications (25) of U0 and the condition (29) corresponds to choosing a representative point on the G+ orbit of U0 in G. 4. At λ = ∞, ∂+ Y = ∂− Y = 0, so Y (x, λ = ∞) is a constant and using (29) we obtain Y (x, λ = ∞) = I.

(30)

5. The system (27) yields the expressions A+ = (1 + λ)Y ∂+ Y −1 ,

A− = (1 − λ)Y ∂− Y −1 ,

(31)

which together with (29) and (6) imply that Y (x, λ = 0) = g −1 .

(32)

We have already seen that the A± solving (3) may be recovered from power series expansions around λ = ±1 of the S −1 factor of U using the expressions (23). We now see that solutions may equally be obtained from the Y factor using (32) and (6). We can also obtain solutions from the Y factor by expanding around λ = ∞. Denoting the leading terms consistently with (30), Y (x, λ) = I +

f (x) + ..., λ

(33)

where f (x) is antihermitian, the λ = ∞ limit of (31) yields the expressions A± = ∓∂± f ,

(34)

which identically satisfy (4) and shift the dynamical description to (5) instead, which acquires the form 1 (35) ∂− ∂+ f + [∂− f, ∂+ f ] = 0 . 2 This equation is known as the “dual formulation” of the harmonic map equation (7). A Y (x, λ) obtained from the factorisation procedure automatically yields a solution of

682


this equation on expansion around λ = ∞. We therefore see that the factorisation (17) produces a Y (x, λ) which interpolates between the dual descriptions of PCM solutions; yielding a U(N)–valued solution g −1 of Eq. (7) on evaluation at λ = 0 and a Lie-algebravalued solution f of the alternative equation (35) on development around λ = ∞. The G+ –valued Y (x, λ) thus encapsulates these dual descriptions of chiral fields and this field was aptly named the extended solution of the PCM by Uhlenbeck [19]. We shall later need information about the next-to-leading-order term in the expansion of Y around λ = 0. If we substitute Y = (I + λϕ)g −1 + O(λ2 ),

(36)

where ϕ is a Lie-algebra-valued field, into (31), and use (6), we obtain the following first-order equation for ϕ: ∂± ϕ + [A± , ϕ] = ±A± .

(37)

The consistency condition for this is just (4). Reflecting the G+ –valued extended solution Y (x, λ), there is also the G− –valued S(x, λ), which clearly also describes some extension of the PCM solution given by the expression (23). Using dSS −1 = −(SS −1 )− = −(SS −1 ) + (SS −1 )+ , we find the following flows for the components of S, which we shall need later: ∂+ sn = sn+1 A − A+ sn+1 , n X sr B − A− sr ∂ − sn = , 2n−r+1 r=0 n X

(38) (39)

ser A − A+ ser , 2n−r+1

(40)

∂− sen = sen+1 B − A− sen+1 .

(41)

∂+ sen =

r=0

Using (23) and these equations for n = 0 yields the interesting flow equations: ∂+ A+ = s0 ∂+ A s−1 + [A+ , [A+ , s1 s−1 0 0 ]], ∂− A− = se0 ∂− B se−1 + [A− , [A− , se1 se−1 0 0 ]].

(42)

5. Symmetry Transformations Unveiled Non-space-time symmetry transformations of the PCM were traditionally derived using mainly guesswork inspired by analogies with other integrable models like the sineGordon model. Their origin remained largely veiled in mystery and they were therefore called “hidden symmetries”. Previous discussions of them have recently been reviewed by Schwarz [17] and Uhlenbeck [19]. In the framework of the present paper there is nothing “hidden” about these symmetries. As we shall see, in terms of the freefield data U0 (λ), A(x+ ), B(x− ), the veil hiding these symmetries is entirely lifted: the most natural field-independent transformations of these free fields, which preserve their analyticity properties in their respective independent variables, induce the entire array of known symmetry transformations of PCM fields and more. Moreover, the algebraic structure of the symmetry transformations is completely transparent when acting on the free-field data, and there is no need to compute commutators and check closure


683

using the complicated action of the symmetries on physical fields. The physical fields automatically carry representations of all the symmetry actions on the free-field data. In this section we classify PCM symmetry transformations according to the corresponding transformations of the free fields. The formulas for the induced transformations on the extended solutions Y , on the chiral fields g and on the potentials A± will be derived in the next section. 5.1. Symmetry transformations of U0 . We first list symmetry transformations which leave A(x+ ) and B(x− ) unchanged. 5.1.1. Right dressings. Right-actions by elements of the G+ subgroup (25) have already been seen to correspond to trivial redundancies and have already been factored out. This leaves the possibility of right–multiplying U0 by an element of G− , U0 7→ U0 k− ;

k− ∈ G − .

(43)

Such transformations fall into the following classes: a) k− = b, a constant (i.e. an element of U(N)). This may easily be seen to induce the transformations Y 7→ b−1 Y b and g 7→ b−1 gb, i.e. the symmetry (8). (µ) b) If we take k− = I + N λ−µ π , having a simple pole at a single point λ = µ outside C (here N (µ) is a λ-independent matrix), the transformations induced on the chiral fields are precisely the Bäcklund transformations of [9, 13]. I PCM we could conc) We are presently considering the U(N) PCM. For the GL(N, C) sider finite transformations with k− in a triangular subgroup of G− . Such transformations induce the explicit transformations discussed by Leznov [10]. We will not go into details of this. d) General k− (λ) infinitesimally close to the identity. This is a realisation of the algebra G− on the free-field U0 (λ) and is a remarkably transparent way of expressing the action of the celebrated loop algebra of hidden symmetries [6] of the PCM. The precise structure of this algebra has not been properly identified before. e) General finite k− (λ). This finite version of the infinitesimal symmetries in d) reproduces (modulo some details) the loop group action on chiral fields g and on extended maps Y given by Uhlenbeck in Sect. 5 of [19]. 5.1.2. Left dressings. Left actions on U0 by elements of G0,− have already been pointed out to leave the associated solution of the PCM invariant (see (26)). We wish to consider only left actions on U0 that descend to the double coset G0,− \G/G+ , i.e. actions by elements that commute with G0,− . Thus we have only the transformations U0 7→ h+ U0 ;

h+ ∈ G0,+ .

(44)

This is the action of an infinite-dimensional abelian group, which has not yet appeared in the literature. The infinitesimal version of this gives an infinite set of mutually commuting flows also commuting with the PCM flow. This is the PCM hierarchy. 5.1.3. Reparametrisations of U0 (λ). These are transformations generated by λ-diffeo morphisms (45) U0 (λ) 7→ U0 (λ + (λ)).

684


General reparametrisations can move C± to curves that do not enclose ±1. The easiest way to prevent this is to restrict the diffeomorphisms to those that fix ±1. For infinitesimal diffeomorphisms this condition is not strictly necessary. It turns out however that the infinitesimal diffeomorphisms fixing ±1 are technically simpler (in terms of their action on g, Y ) and these give (modulo a detail that will be explained) the “half Virasoro” algebra described in [17]. We show how this can be extended to a full centreless Virasoro algebra. The only finite reparametrisations of the λ-plane preserving ±1 are U0 (λ) 7→ U0

aλ + b bλ + a

,

a2 + b2 = 1.

(46)

These induce the S 1 action of sect. 7 of [19]. 5.2. Symmetry transformations of A(x+ ), B(x− ). We now consider symmetries that keep U0 fixed. For symmetries acting just on A(x+ ) it is natural to consider a) Shifts A(x+ ) 7→ A(x+ ) + α(x+ ), where α(x+ ) is a diagonal antihermitian matrix. b) Rescalings A(x+ ) 7→ ρ(x+ )A(x+ ) where ρ(x+ ) is a scalar function. c) Reparametrisations A(x+ ) 7→ A(x+ + (x+ )). There are other possibilities. Similar symmetries exist for B(x− ). All these symmetries are new. 5.3. Other symmetry transformations. Two other symmetries of PCM should be mentioned. The first is a particularly significant combination of an action on U0 with an action on A, B. The second is not strictly within the class of symmetries we have been considering, as it acts on the coordinates as well as the fields. 5.3.4. Inversion. The transformation U0 (λ) 7→ U0 (λ−1 )

and

(A, B) 7→ (−A, −B)

(47)

may easily be seen to induce the inversion symmetry (9). 5.3.5. Lorentz transformations. The transformation U0 invariant,

A 7→ θ+ A,

B 7→ θ− B

−1 ± x± 7→ θ± x

(48)

induces the residual Lorentz transformations in light cone coordinates A± 7→ θ± A± ,

−1 ± x± 7→ θ± x .

We can also consider more general reparametrisations of x± .

(49)


685

6. Induced Symmetries of PCM Fields As we have already claimed, natural transformations on the free–field data, U0 (λ), A(x+ ), B(x− ) induce, through Birkhoff factorisation, rather complicated transformations on the PCM fields Y (x, λ), g(x), A± (x); and (field–independent) representations of symmetry algebras induce (field-dependent) representations on the PCM fields. In this section we prove this for the intereresting and not immediately obvious cases listed in the previous section. We also comment on the relation with previous results in the literature. 6.1. Right dressings. Consider the transformation induced by (43) on U (x, λ). U = S −1 Y 7→ Unew = S −1 Y k− .

(50)

Birkhoff factorisation of Y k− yields (in the obvious notation) −1 Unew = S −1 (Y k− )− (Y k− )+ = Snew Ynew .

(51)

In other words, we have the symmetry transformation Y 7→ (Y k− )+ ,

(52)

which is just the representation of G− described by Uhlenbeck in Sect. 6 of [19] (except that she uses a subgroup of G− ). We can equivalently write Y 7→ (Y k− Y −1 )+ Y .

(53)

Now writing k− = I + (λ) with (λ) ∈ G− an infinitesimal parameter, we obtain the infinitesimal version of this, (54) Y 7→ I + (Y (λ)Y −1 )+ Y . We note that this directly gives the generating function of [4] for these transformations, which was originally obtained by extrapolation from the leading terms in a power series expansion [6]. The G+ projection corresponds to taking the singular part at λ = ±1. This may be done using a contour integral, so that this transformation takes the form Z Y (x, λ0 )(λ0 )Y −1 (x, λ0 ) 0 1 dλ Y (x, λ) . (55) Y (x, λ) 7→ I + 2πi C λ0 − λ Here C± are oriented counter-clockwise around ±1. The transformation for g may be read off by taking the λ → 0 limit, yielding the form of the transformation given in [18, 17], Z Y (x, λ0 )(λ0 )Y −1 (x, λ0 ) 0 1 (56) dλ . g 7→ g I − 2πi C λ0 The parameter of this infinitesimal transformation, (λ) is an arbitrary infinitesimal G− element. In particular, if we introduce a basis {T a } for the Lie algebra of antihermitian matrices, we can take (λ) proportional to λr T a , r ∈ Z . This gives an infinite set of transformations, which we denote Jra , and which satisfy the commutation relations X c fcab Jr+s , (57) [Jra , Jsb ] = c

686


P where the fcab are the structure constants defined by [T a , T b ] = c fcab T c . Although the commutation relations of a centreless Kac-Moody algebra thus appear, this is not sufficient to identify the symmetry algebra G− with a centreless Kac-Moody algebra. We illustrate this in two ways: first we show that in G− there exist certain linear relations absent in a Kac-Moody algebra, and second we show that in G− the Jra are not a spanning set. The crucial point is that although we can certainly try to expand elements of G− in Laurent series, and finite sums of matrices of the form λr T a are certainly in G− , the natural way to expand an element of G− is in a Taylor series in λ + 1 (or alternatively in λ − 1). Taking (λ) in (56) proportional to (λ + 1)n T a , for n ≥ 0, we can define a set of transformations Kna satisfying the relations X b c ]= fcab Kn+m n, m ≥ 0. (58) [Kna , Km c

r

Considering the expansion of λ in powers of λ + 1 (valid in |λ + 1| < δ), we find that the Jra are expressed as linear combinations of the Kna in the following way:  P r r  (−1)n+r r≥0 Kna n=0 n a Jr = P . (59)  ∞ (−1)r n − r − 1 K a r 0 to a contour around ∞; and for r = 0 to a pair of contours around 0 and ∞. 6.2. The Bäcklund transformation. The element k− ∈ G− in (43) can clearly have all variety of singularities outside C. Trying to give k− just one simple pole at the point λ = µ outside C, suggests the natural form [21] N (µ) . (61) k− (λ, µ) = I + λ−µ For the satisfaction of the reality condition (16) for elements of G− we require that N† ¯ if π is a projector N† = N µ−µ¯ = −N . These conditions are satisfied by N = (µ − µ)π, satisfying π 2 = π = π † . Such transformations thus correspond to finite right-dressing transformation of the particular form µ − µ¯ π . (62) U0 7→ U0 I + λ−µ Note that k− in fact has a singularity at λ = µ¯ as well, since (I −π) has zero determinant. Using (50) we obtain the transformation µ − µ¯ −1 −1 Y (λ)πY (λ) Y (λ) . I+ (63) U 7→ S λ−µ In order to factorise the middle factor, we introduce a hermitian projector P = P † = P 2 , independent of λ (but not of x± ). Using this we see that µ¯ −1 Y (λ)πY (λ) Y (λ) I + µ− λ−µ µ¯ µ−µ ¯ µ−µ¯ −1 P I + P I + Y (λ)πY = I + µ− (λ) Y (λ) λ−µ λ−µ¯ λ−µ µ¯ µ−µ¯ µ−µ ¯ (I ) (I P I + P Y (λ) . = I + µ− − P Y (λ)π + − π) λ−µ λ−µ λ−µ¯ To have an acceptable factorisation, all we need now is that the right-hand factor above be regular outside C. Specifically, we require regularity at µ and µ, ¯ which yields algebraic conditions relating the projectors P and π, viz. (I − P )Yµ π = 0 ,

P Yµ (I − π) = 0,

where Yµ denotes Y (λ) evaluated at λ = µ. If we write π = v(v † v)−1 v † (see [9]), these equations are solved by the expression −1 † † P = Yµ v v † Yµ† Yµ v v Yµ .

688


Now we can read-off the induced transformation rules for Y and g. These are just the known PCM Bäcklund transformations [9, 13, 21]. 6.3. Left dressings. Here we consider in detail the left dressings (44). Matrices h+ ∈ G0,+ commute with M , so such transformations act by left multiplication on U , i.e. U 7→ h+ U = h+ S −1 Y = S −1 (Sh+ S −1 )Y . Hence the action on Y is given by Y 7→ (h+ S −1 )+ Y = (Sh+ S −1 )+ Y.

(64)

For an infinitesimal transformation h+ = I + , ∈ G0,+ and we have Y 7→ I + (SS −1 )+ Y Z 1 S(λ0 )(λ0 )S −1 (λ0 ) 0 = I+ dλ Y, 2πi C λ0 − λ implying

1 g→ 7 g I− 2πi

Z C

S(λ0 )(λ0 )S −1 (λ0 ) 0 dλ . λ0

(65)

(66)

In general has the form (λ) =

∞ X n=1

α en αn + (1 + λ)n (1 − λ)n

,

(67)

en are constant infinitesimal diagonal matrices. The integral in (66) is where the αn , α evaluated by computing the residues of the integrand at λ0 = ±1. For example, the case en zero yields the transformation rules α1 6= 0 with all other αn , α g −1 δg = −s0 α1 s−1 0 , −1 δA+ = A+ , [s1 s−1 0 , s 0 α1 s0 ] , δA− =

(68)

− 21 [A− , s0 α1 s−1 0 ].

en zero we find Similarly, if α2 6= 0 with all other αn , α

−1 −1 g −1 δg = − s0 α2 s−1 0 + [s1 s0 , s0 α2 s0 ] , −1 −1 −1 −1 , δA+ = A+ , [s2 s−1 0 , s0 α2 s0 ] − [s1 s0 , s0 α2 s0 ]s1 s0 −1 −1 −1 δA− = − A− , 41 s0 α2 s0 + 21 [s1 s0 , s0 α2 s0 ] .

(69)

The formulae for δA± are computed using the variation of the relation (6), δA± = ∂± (g −1 δg) + [A± , g −1 δg],

(70)

and Eqs. (38)-(41). The latter also allow one to check directly that the above transformations are indeed infinitesimal symmetries, i.e. that ∂− δA+ + ∂+ δA− = 0. Now considering the sector of PCM in which A = α1 , independent of x+ , we see that the ∂+ -derivations of A± given by (3) and (42) are effected by the transformations (68). So left dressing transformations with only α1 non-zero correspond to x+ translations in this sector. Similarly the transformations (69) can be seen to be related to coordinate translations in an extended system (described in the appendix) belonging to a hierarchy associated to the PCM. Whenever an infinite dimensional abelian symmetry algebra


689

(like G0,+ ) is identified in a system, it is possible to define a corresponding hierarchy. Traditionally, for each generator in the algebra a coordinate is introduced and the flow in each coordinate is defined as the infinitesimal action of the corresponding symmetry. In our formulation there is an alternative way to define a PCM hierarchy. Instead of working on a space M with coordinates (x+ , x− ), we work on a larger space M with − 2P coordinates (x+1 , . . . , x+P , x− 1 , . . . , xP ) and replace the of (12) by =−

P X An (x+ )dx+ n

(1 + λ)n

n=1

n

+

− Bn (x− n )dxn n (1 − λ)

,

(71)

where the An (x+n ), Bn (x− n ) are all antihermitian diagonal matrices, each depending on only one coordinate. The associated nonlinear equations are again the equations dZ = Z ∧ Z, where Z = (SS −1 )+ and S is a map from M to G− . For the case P = 2 we write out this system of equations in full in the appendix. Another possibility of obtaining a hierarchy within our framework is to enlarge M to a space with 2N P a− coordinates (xa+ n , xn ), 1 ≤ n ≤ P , 1 ≤ a ≤ N , and taking =−

N a a+ P X X A (x )H a dxa+ n

n=1 a=1

n

(1 + λ)n

n

+

a a− Bna (xa− n )H dxn n (1 − λ)

,

(72)

where {H a }, a = 1, . . . , N is a basis for the algebra of antihermitian, diagonal N × N matrices. In this hierarchy, left dressings on U0 correspond precisely to coordinate translations in the sector with the scalar functions Aan , Bna constant. The physical or geometric significance of these PCM hierarchies remains to be understood. An alternative approach to defining a PCM hierarchy was given in [1]. 6.4. The Virasoro symmetry. In this section we consider the symmetries of PCM associated with reparametrisations of U0 (λ). We consider the infinitesimal reparametrisations U0 (λ) → U0 (λ + m λm+1 ), where the m are infinitesimal parameters and m ∈ Z, or, equivalently, variations δU0 = m λm+1 U00 (λ). The prime denotes differentiation with respect to λ. These variations give rise to a centreless Virasoro algebra of infinitesimal symmetries of PCM. In [17] Schwarz documents the existence of “half” of this algebra. Schwarz’s symmetries are associated with reparametrisations that fix the points λ = ±1. We shall see that from a technical standpoint these are simpler to handle than the full set of symmetries. But there is also a fundamental reason to make such a restriction. If we were to consider finite reparametrisations, we would need to ensure that the contour C remains qualitatively unchanged. The simplest way to do this is to require the points λ = ±1 to be fixed. In [19] Uhlenbeck identifies an S 1 symmetry of PCM. It is a simple exercise to check that this symmetry corresponds, in our formalism, to global reparametrisations of the λ-plane fixing the points ±1, i.e. Möbius transformations of the form aλ + b , a2 + b2 = 1. (73) λ→ bλ + a At the level of infinitesimal symmetries, however, the need to fix ±1 is really superfluous, and so we find a full Virasoro algebra of symmetries. But as we have said above, the symmetries fixing ±1 are technically easier, which is why Schwarz was able to identify them, and also for the more general symmetries we can be quite certain that there exists no exponentiation.

690


With this introduction, we consider the variations δm U0 = m λm+1 U00 (λ). These manifestly realise the algebra [δm , δn ] = (n − m)δn+m . This realisation descends to the physical fields. Using U0 = e−M S −1 Y we have the chain of implications δm U0 = m λm+1 (−M 0 e−M S −1 Y − e−M S −1 S 0 S −1 Y + e−M S −1 Y 0 ), δ m U = eM δm U 0 = m λm+1 (−M 0 S −1 Y − S −1 S 0 S −1 Y + S −1 Y 0 ), δm S = −(Sδm U Y −1 )− S = −m λm+1 (−SM 0 S −1 − S 0 S −1 + Y 0 Y −1 ) − S, δm Y = (Sδm U Y −1 )+ Y = m λm+1 (−SM 0 S −1 + Y 0 Y −1 ) + Y.

(74) (75) (76) (77)

In the last equation we have used the fact that for all m, λm+1 S 0 S −1 takes values in G− . Of the remaining two terms, the first has a G+ piece originating in the double pole of M 0 at λ = ±1. To explicitly compute this is a simple exercise. For the second term, we use a contour integral formula for the projection. We thus arrive at the final result

Z m+1 0 µ Y (µ)Y −1 (µ) 1 dµ 2πi C µ−λ R R (−1)m −1 m+1 1 +(−1)m s0 A s−1 + s1 s0 , s0 A s−1 − 0 0 2 (1 + λ) 1+λ 1+λ R R 1 m+1 1 −1 −1 + se0 B se−1 + s e − s e , s e . (78) B s e 1 0 0 0 0 (1 − λ)2 1−λ 1−λ δm Y Y −1 = m

R x− R R R x+ Here A and B are shorthand for x+ A(y + )dy + and x− B(y − )dy − respectively. 0 0 The g transformations are read off by setting λ to zero. In the expression for δm g, the contour integral term is evaluated, depending on the value of m, by shrinking C to a contour around either 0 or ∞. Explicitly for the SL(2) subalgebra of the Virasoro algebra, we obtain (omitting the overall infinitesimal parameters), −1 R R −1 −1 R R −1 s0 Be e1 se0 , (e s0 Be s0 )+ s1 s0 , (s0 As−1 s0 ) , g −1 δ−1 g = φ+(s0 As−1 0 )−(e 0 ) − s −1 R −1 −1 R −1 −1 s0 Be g δ0 g = − s1 s0 , (s0 As0 ) − se1 se0 , (e s0 ) , −1 R −1 R −1 −1 R R −1 −1 s0 Be e1 se0 , (e s0 Be g δ1 g = f −(s0 As0 )+(e s0 )+ s1 s0 , (s0 As−1 s0 ) . 0 ) − s We see that in these formulae, not only do the leading coefficients s0 , s1 , se0 , se1 in the expansions of S appear, but also the fields φ and f , coefficients in the expansions of Y around 0 and ∞ respectively (see Sect. 4). The work required to check directly that these, or any of the δm ’s, are symmetries is formidable, but we again emphasize that the advantage of the present framework is that such direct checks are not necessary in order to prove that the physical fields carry a representation of the full centreless Virasoro algebra. Schwarz [17] has previously found half a Virasoro algebra. We observe that if we define transformations 1m = δm+1 − δm−1 a substantial simplification takes place, yielding the formula


1 1m g = −m g 2πi

Z C

691

µm−1 (µ2 − 1)Y 0 (µ)Y −1 (µ)dµ

R R −1 + 2(−1)m (s0 A s−1 ) − 2(e s ) . B s e 0 0 0

(79)

We will see in Sect. 6 (see Eqs. (83),(84)) that the second and third terms in the above expression are individually symmetries of PCM that mutually commute and commute with all the symmetries being considered here. Removing these terms gives exactly the “half-Virasoro” symmetries of [17] Z 1 e µm−1 (µ2 − 1)Y 0 (µ)Y −1 (µ)dµ, m ∈ Z. (80) 1m g = −m g 2πi C Thus we see the precise nature of Schwarz’s symmetries as combinations of reparametrisations preserving the points λ = ±1 with certain simple symmetries that act on the A, B fields but leave U0 invariant. Taking the appropriate combinations we see that for the e 0, simplest Schwarz symmetry 1

and using (34) and (37),

e 0 g = φ − f, g −1 1

(81)

e 0 A± = ∓2A± + [A± , f ]. 1

(82)

This is easily checked to be a symmetry. The symmetry 10 acts on the physical fields in a much more complicated way: R R −1 s0 Be s0 ), g −1 10 g = φ − f − 2(s0 As−1 0 ) + 2(e R R −1 −1 10 A+ = −4A+ + [A+ , f ] − 2 [s1 s0 , A+ ], (s0 As−1 s0 Be s0 ) , 0 ) + A+ , (e R −1 R −1 −1 s1 se0 , A− ], (e s0 Be 10 A− = 4A− + [A− , f ] + 2 [e s0 ) − A− , (s0 As0 ) . 6.5. Transformations of the free fields A(x+ ), B(x− ). Following the by now familiar reasoning, an infinitesimal transformation A(x+ ) 7→ A(x+ ) + δA(x+ ) induces the following transformations on Y, g, A+ , A− : R (s0 δA s−1 0 ) δY = − Y, 1 + λ R δg = g (s0 δA s−1 0 ), R −1 −1 δA+ = s0 δA s0 − A+ , s1 s−1 0 , (s0 δA s0 ) , R δA− = A− , (s0 δA s−1 0 ) . R R x+ Here we have written δA as shorthand for x+ δA(y + )dy + . As expected, the spectrum 0 of A− remains invariant, while that of A+ is shifted. Using the flow equations for s0 , s1 , it is easy to check that these are genuine symmetries, i.e. that ∂− δA+ + ∂+ δA− = 0. There are a variety of possibilities for δA(x+ ). If {H a }, a = 1 . . . N , is a basis of the algebra of antihermitian diagonal matrices, we can consider variations δA(x+ ) ∼ (x+ )m H a , a = 1, . . . , N , m ∈ Z. This gives a loop algebra of symmetries, corresponding to translations of A(x+ ). Taking δA(x+ ) ∼ (x+ )m A0 (x+ ), m ∈ Z, gives a centreless Virasoro algebra of symmetries, corresponding to reparametrizations of A(x+ ). Taking

692


δA(x+ ) ∼ (x+ )m A(x+ ), m ∈ Z, gives an infinite dimensional abelian symmetry algebra corresponding to x+ -dependent rescalings of A(x+ ). Clearly these symmetries are not independent: The latter two families can be written in terms of the first family, but the generators are then field dependent combinations of the generators of the first family. Analogous sets of symmetries can be obtained from infinitesimal variations of B(x− ). The simple variation δA(x+ ) = A(x+ ), where is a constant infinitesimal parameter, yields the symmetry R (83) δg = g(s0 A s−1 0 ), whereas the transformation B 7→ (1 + ζ)B, where ζ is also an infinitesimal parameter, yields R (84) δg = ζg(e s0 B se−1 0 ). These transformations were used in Sect. (6) to make contact between our Virasoro symmetries and those of [17]. 7. Concluding Remarks We have seen that formulating the nonlinear equations of motion (3) of the PCM in the form of the simple linear system (1) makes the precise nature of their integrability completely transparent. It yields a novel free-field parametrisation of the space of solutions, which we have used to classify all the symmetries of on-shell PCM fields in terms of natural transformations on the free-field data. The confusing cacophony of symmetry transformations in the literature is thereby seen to arise in the most natural fashion imaginable. We have thus demonstrated that this notion of complete integrability, previously applied to traditional soliton systems, like the KP, NLS and KdV hierarchies, encompasses the Lorentz–invariant PCM field theories. We believe that this notion of integrability is a universal one and we expect a clarification of the nature of the integrability of the self-dual Yang-Mills and self-dual gravity equations by similarly reformulating the twistor constructions for these systems. Indeed Crane [3] has already discussed a loop group of symmetries in terms of an action on free holomorphic data in twistor space. Our construction raises many questions. 1) Standard integrable soliton systems exhibit multiple hamiltonian structures and infinite numbers of conservation laws, both these phenomena being symptoms of their integrability. These phenomena ought to have a natural explanation in terms of the associated simple linear systems (free-field data). For the PCM, some work on such structures exists [5]. 2) The free-field parametrisation of solutions of PCM should play a critical role in the quantisation of the theory. What is the relation with standard quantisations? (The PCM can be quantised in different ways, using either the field f or the field g as fundamental, giving different results [12].) How are we to understand quantum integrability? 3) There is a large body of related mathematical work, mostly focusing on the enumeration and construction of solutions of the PCM in Euclidean space (for recent references see [2]). Most of our formalism goes through for the case of Euclidean space, but the reality conditions are different, and a little harder to handle. An important class of solutions are the unitons [19, 20]. These correspond, up to the need for right dressings by G+ elements, to Y ’s with finite order poles at one of the two points ±1, and regular elsewhere. We wonder: What are the corresponding U0 ’s? (The work of Crane on self-dual


693

Yang-Mills [3] may have an analog.) Is there a natural geometric understanding of our construction? Or a relation with the constructions of [20] or [2]? 4) Is there a geometric interpretation of our PCM hierarchy?

Appendix. The PCM Hierarchy In Sect. 6.3 we have described a procedure to generate a PCM hierarchy. In this appendix we illustrate this procedure by obtaining the simplest integrable extension of the PCM equation. We use the given in (71) for P = 2. Using Z = (SS −1 )+ we obtain the following form for Z: [C+ , B+ ] B+ A+ dx+1 + dx+2 + Z= − 1+λ (1 + λ)2 1+λ A− dx− [C− , B− ] B− − 1 + dx2 . + + 1−λ (1 − λ)2 1−λ

The six fields A+ , B+ , C+ , A− , B− , C− are defined in terms of the coefficients of S and − the free fields A1 (x+1 ), A2 (x+2 ), B1 (x− 1 ), B2 (x2 ). They depend on the four coordinates − − + + x1 , x2 , x1 , x2 and are constrained in virtue of their defining relations thus: A+ commutes with B+ , A− commutes with B− and the spectra of A+ , B+ , A− , B− depend only − on x+1 , x+2 , x− 1 , x2 respectively. If we nevertheless ignore these constraints and simply substitute the above form for Z into dZ = Z ∧ Z, we find: 1. [A+ , B+ ] = [A− , B− ] = 0. 2. The following system of evolution equations for A+ , B+ , A− , B− : ∂2+ A+ = − 21 [A+ , [[B+ , C+ ], C+ ]] − [B+ , ∂1+ C+ + 21 [[A+ , C+ ], C+ ]], ∂1− A+ = 21 [A+ , A− ], ∂2− A+ = 21 [A+ , 21 B− + [C− , B− ]], ∂1+ B+ = [B+ , [A+ , C+ ]], ∂1− B+ = 21 [B+ , A− ], ∂2− B+ = 21 [B+ , 21 B− + [C− , B− ]], ∂1+ A− = 21 [A− , A+ ], ∂2+ A− = 21 [A− , 21 B+ + [C+ , B+ ]], ∂2− A− = − 21 [A− , [[B− , C− ], C− ]] − [B− , ∂1− C− + 21 [[A− , C− ], C− ]], ∂1+ B− = 21 [B− , A+ ], ∂2+ B− = 21 [B− , 21 B+ + [C+ , B+ ]], ∂1− B− = [B− , [A− , C− ]]. These evidently imply that the spectra of A+ , B+ , A− , B− depend only on x1+ , x2+ , x1− , x2− respectively, as required. 3. The following evolution equations for C+ , C− :

694


∂1+ C− = − 41 A+ − 21 [A+ , C− ], ∂2+ C− = − 18 B+ + 41 ([C− , B+ ] − [C+ , B+ ]) + 21 [C− , [C+ , B+ ]], ∂1− C+ = − 41 A− − 21 [A− , C+ ], ∂2− C+ = − 18 B− + 41 ([C+ , B− ] − [C− , B− ]) + 21 [C+ , [C− , B− ]]. (In fact, from the dZ = Z ∧ Z equation, both of the C− evolutions appear commutated with B+ and both of the C+ evolutions appear commutated with B− .) This system is a 4-dimensional integrable system, but its physical or geometric interpretation is not immediately apparent. It has a variety of interesting reductions apart from the reduction to PCM by setting B− = B+ = 0. We can consistently reduce by taking A− = B− or A+ = B+ or both. Or we can take just B− = 0 (or B+ = 0) + in which case the x− 2 (or x2 ) dependence becomes trivial. For all these reductions, and the full system as well, the methods of this paper give a free-field parametrisation of solutions. Acknowledgement. We should like to thank Bernie Pinchuk and Larry Zalcman for discussions on Runge’s theorem. One of us (CD) is happy to thank the Emmy Noether Mathematics Institute of Bar–Ilan University for generous hospitality.

References 1. Bruschi, M., Levi, D., Ragnisco, O.: The chiral field hierarchy. Phys.Lett. 88A, 379–382 (1982) 2. Burstall, F.E., Guest, M.A.: Harmonic two-spheres in compact symmetric spaces, revisited. Preprint (1996) 3. Crane, L.: Action of the loop group on the self-dual Yang-Mills equation. Commun. Math. Phys. 110, 391–414 (1987) 4. Devchand, C., Fairlie, D.B.: A generating function for hidden symmetries of chiral models. Nucl.Phys. B194, 232-236 (1982) 5. Dickey, L.A.: Symplectic structure, Lagrangian, and involutiveness of first integrals of the principal chiral field equation. Commun. Math. Phys. 87, 505–513 (1983) 6. Dolan, L.: Kac-Moody algebra is hidden symmetry of chiral models. Phys. Rev. Lett. 47, 1371–1374 (1981) 7. Feller, W.: An introduction to probability theory and its applications, Volume I. New York–Chichester– Brisbane–Toronto: John Wiley & Sons, 1967 8. Haak, G., Schmidt, M., Schrader, R.: Group Theoretic Formulation of the Segal-Wilson Approach to Integrable Systems with Applications. Rev. Math. Phys. 4, 451–499 (1992) 9. Harnad, J., Saint-Aubin, Y., Shnider, S.: Superposition of solutions to Bäcklund transformations for the SU(N) principal σ-model. J. Math. Phys. 25, 368–375 (1983): Quadratic psuedopotentials for Gl(N, C) I principal sigma models. Physica 10D, 394–412 (1984); Bäcklund transformations for nonlinear sigma models with values in Riemannian symmetric spaces. Commun. Math. Phys. 92, 329–367 (1984; The soliton correlation matrix and the reduction problem for integrable systems. Commun. Math. Phys. 93, 33–56 (1984) 10. Leznov, A.N.: Bäcklund transformation for main chiral field problem with an arbitrary semisimple algebra. Preprint (1991) 11. Mulase, M.: Complete integrability of the Kadomtsev-Petviashvili equation. Adv. Math. 54, 57–66 (1984); Solvability of the super KP equation and a generalization of the Birkhoff decomposition. Inv. Math. 92, 1–46 (1988) 12. Nappi, C.R.: Some properties of an analog of the nonlinear sigma model. Phys. Rev. D21, 418–420 (1980) 13. Ogielski, A.T., Prasad, M.K., Sinha, A. Chau Wang, L-L.: Bäcklund transformations and local conservation laws for principal chiral fields. Phys. Lett. 91B, 387–391 (1980)


695

14. Pohlmeyer, K.: Integrable Hamiltonian systems and interactions through quadratic constraints. Commun. Math. Phys. 46, 207–221 (1976) 15. Rudin, W.: Real and complex analysis. New York–St.Louis–San Francisco–Toronto–London–Sydney: McGraw-Hill Book Company, 1966 16. Schiff, J.: Symmetries of KdV and loop groups. Preprint (1996) (Archive number solv-int/9606004) 17. Schwarz, J.H.: Classical symmetries of some two-dimensional models. Nucl. Phys. B447, 137–182 (1995) 18. Ueno, K., Nakamura, Y.: The hidden symmetry of chiral fields and the Riemann-Hilbert problem. Phys. Lett. 117B, 208–212 (1982) 19. Uhlenbeck, K.: Harmonic maps into Lie groups (classical solutions of the chiral model). J. Diff. Geom. 30, 1–50 (1989) 20. Ward, R.S.: Classical solutions of the chiral model, unitons, and holomorphic vector bundles. Commun. Math. Phys. 128, 319–332 (1990) 21. Zakharov, V.E., Mikhailov, A.V.: Relativistically invariant two-dimensional models of field theory which are integrable by means of the inverse scattering problem method. Zh. Exp. Teor. Fiz. 74, 1953–1973 (1978) (English translation: Sov. Phys. JETP 47, 1017–1027 (1978)) Communicated by R. H. Dijkgraaf

Commun. Math. Phys. 190, 697 – 721 (1998)

Communications in


Asymptotics of a Class of Solutions to the Cylindrical Toda Equations Craig A. Tracy1 , Harold Widom2 1 Department of Mathematics and Institute of Theoretical Dynamics, University of California, Davis, CA 95616, USA. E-mail: [email protected] 2 Department of Mathematics, University of California, Santa Cruz, CA 95064, USA. E-mail: [email protected]

Received: 23 January 1997 / Accepted: 8 May 1997

Abstract: The small t asymptotics of a class of solutions to the 2D cylindrical Toda equations is computed. The solutions, qk (t), have the representation qk (t) = log det (I − λ Kk ) − log det (I − λ Kk−1 ), where Kk are integral operators. This class includes the n-periodic cylindrical Toda equations. For n = 2 our results reduce to the previously computed asymptotics of the 2D radial sinh-Gordon equation and for n = 3 (and with an additional symmetry constraint) they reduce to earlier results for the radial Bullough-Dodd equation. Both of these special cases are examples of Painlevé III and have arisen in various applications. The asymptotics of qk (t) are derived by computing the small t asymptotics t a k det (I − λ Kk ) ∼ bk , n where explicit formulas are given for the quantities ak and bk . The method consists of showing that the resolvent operator of Kk has an approximation in terms of resolvents of certain Wiener-Hopf operators, for which there are explicit integral formulas.

1. Introduction We consider here solutions of the cylindrical Toda equations qk00 (t) + t−1 qk0 (t) = 4 (eqk (t)−qk−1 (t) − eqk+1 (t)−qk (t) ),

k ∈ Z,

(1.1)

satisfying the periodicity conditions qk+n = qk . The integer n is arbitrary but fixed. It follows from results in [9] that solutions valid for all t > 0 are given by qk (t) = log det (I − λ Kk ) − log det (I − λ Kk−1 ),

(1.2)

698

C. A. Tracy, H. Widom

where Kk is the integral operator on R+ with kernel X ω

ω k cω

e−t[(1−ω)u+(1−ω −ωu + v

−1

)u−1 ]

,

(1.3)

ω running over the nth roots of unity other than 1. In the case n = 2 we have qk+1 = −qk and (1.1) becomes, with q equal to either qk , q 00 (t) + t−1 q 0 (t) = 8 sinh 2q(t), which can be reduced to a particular case of the Painlevé III equation. The connection with Fredholm determinants was discovered by McCoy, Tracy and Wu [6], and in the same paper the asymptotics as t → 0 of these solutions q(t) were determined. (Note that all asymptotics as t → ∞ are trivial.) The asymptotics as t → 0 of det (I − λ2 K02 ) = det (I −λ K0 ) det (I +λ K0 ) were determined in [7]. (See also [2], where the asymptotics were found for a family of kernels including this one as a special case.) The asymptotics of det (I − λ K0 ) itself were stated without proof in [10]. A class of periodic cylindrical Toda equations arises in thermodynamic Bethe Ansatz considerations [3]. There the additional constraint q−k−1 = −qk is imposed. The solutions (1.2) satisfy this constraint as long as the coefficients cω satisfy cω = −ω 3 cω−1 . (This follows from the fact that det (I − Kk ) = det (I − K−k−2 ) in this case, which is proved by applying the change of variable u → u−1 .) The case n = 3 of this gives the cylindrical Bullough-Dodd equation (q = q3 now) q 00 (t) + t−1 q 0 (t) = 4 (e2q(t) − e−q(t) ), which can be reduced to another special case of Painlevé III. Asymptotics of a class of solutions to PIII including this one were announced in [5]. This paper is devoted to the determination of the asymptotics of the quantities det (I − λ Kk ) in the general case, under the condition stated below. (In the final sections we shall compare our results in the cases n = 2 and 3 with those cited above.) We write K for K0 and consider at first only the asymptotics of det (I−λ K). This is no loss of generality since Kk is obtained from K upon R ∞ replacing the coefficients cω by ω k cω . The problem reduces to the asymptotics of 0 R(u, u; λ) du as t → 0, where R(u, v; λ) is the resolvent kernel of K, the kernel of K (I − λ K)−1 . Using operator techniques, we show that R(u, u; λ) is well-approximated on [1, ∞] by the corresponding function when the exponentials in (1.3) are replaced by e−t(1−ω)u and on [0, 1] by the corresponding function when the exponentials are replaced by −1 −1 e−t(1−ω )u . (Actually the kernels have to be modified first by multiplying by factors (u/v)β with β depending on λ.) We shall show that after these replacements we obtain operators which can be transformed into Wiener-Hopf operators, whose resolvent kernels have explicit integral representations. By these means the problem becomes that of determining the asymptotics of certain integrals. This is achieved by contour-shifting, and we find in the end that as t → 0, t a (1.4) det (I − λ K) ∼ b n with a and b constants given explicitly in terms of certain zeros of the function X cω (−ω)s−1 . h(s) := sin πs − λ π ω

Asymptotics of Solutions to Cylindrical Toda Equations

699

These are the values at λ of those zeros which equal 1, · · · , n when λ = 0. To state the result precisely, we denote by αk = αk (λ) (k ∈ Z) the zeros of this function indexed so that αk (0) = k. The zeros depend analytically on λ as long as they are unequal, and when λ = 0 they are the integers. We derive the asymptotics (1.4) under the assumption that there is a path in the complex plane C running from 0 to λ such that everywhere on the path < α0 < < α1 , < α0 < 1, < α1 > 0,

(1.5)

and no zero lies in the strip < α0 < < s < < α1 . With this assumption the constants a and b are given by the formulas a=

1 X 2 (n + 1)(2n + 1) , α − n α 6 Q |j| 0, Z ∞ Z ∞ β −(1−ω) u e u f (v) dv e−xu u−β du v −ωu + v 0 0 Z ∞ Z ∞ dy e−yv v −β f (v) dv. = x + 1 − ω(y + 1) 0 0 This can be seen for < ω < 0 by using the integral reresentation Z ∞ 1 = e−y(−ωu+v) dy −ωu + v 0 in the integral on the left above and interchanging the order of integration. The identity follows for all ω ∈ since both sides are analytic functions of ω in this domain. Now suppose that λ K + f = f . Then if we integrate both sides of the identity with respect to dρ(ω) and multiply by λ, the left side becomes the Laplace transform of u−β f (u), which we denote by g(x), and the the right side becomes Z ∞Z dρ(ω) g(y) dy. λ x + 1 − ω(y + 1) 0 Using the fact y −β g(y) ∈ L2 , which we know by the previous lemma, we see that the integral is a bounded function of x. If we recall the definition (2.5) then we see that the identity becomes Z ∞ x + 1 β K0 (y + 1, x + 1) g(y) dy (x ≥ 0), g(x) = λ y+1 0 or x−β g(x − 1) = λ

Z

∞

K0 (y, x) g(y − 1) y −β dy

(x ≥ 1).

1

Now we know that x−β g(x) is in L2 (0, ∞) and that g(x) is bounded. It follows that x−β g(x − 1) belongs to L2 (1, ∞). The right side above is the operator with kernel


707

K0 (y, x) acting on this function. Thus (if f 6= 0) the operator λ P + K00 P + has 1 as an eigenvalue, where 0 denotes transpose, so I − λ P + K00 P + is not invertible. But this implies I − λ P + K0 P + is not invertible, whereas we know that it is. This contradiction establishes the lemma. Lemma 4. The operators I − λ P ± Kt± P ± are uniformly invertible for sufficiently small t. Proof. We consider I − λ P + Kt+ P + and for this it is enough to show that the I − λP + Kt+ are uniformly invertible. The kernel of P + Kt+ is χ(1,∞) (u) Kt+ (u, v) and the substitution u → t−1 u allows us to consider instead the operator with kernel χ(t,∞) (u) K + (u, v). We write this (not displaying the variables u and v) as χ(t,1) K0 + χ(1,∞) K + + χ(0,1) (K + − K0 ) + χ(0,t) (K0 − K + ). Recalling the definitions of our various projection operators we see that the first kernel corresponds to the operator Pt− K0 , and we know that the I − λ Pt− K0 are uniformly invertible for sufficiently small t. The second and third summands correspond to the operators P + K + and P + (K + − K0 ), which we know by Lemma 1 to be trace class. The last summand corresponds to the operator P − (K0 − K + ), which we know by Lemma 1 to be trace class, left-multiplied by multiplication by χ(0,t) , which converges strongly to 0. An application of Fact 2 shows that this last operator is o1 (1). The strong limit of the sum of the four operators is, of course, K + and we know by Lemma 2 that I − λ K + is invertible. Hence we can apply Fact 3 to deduce the result. We can now fill in the details of the proof of (2.4) outlined earlier. Thus we begin with the representation   I − λ P − Kt P − −λ P − Kt P + . I − λ Kt =  + − + + −λ P Kt P I − λ P Kt P Applying Lemma 1 to the nondiagonal entries we deduce   I − λ P − Kt− P − −λ P − K0 P +  + o1 (λ). I − λ Kt =  + − + + + −λ P K0 P I − λ P Kt P Lemma 3 tells us in particular that the diagonal entries of this matrix are invertible for small t so we may factor out   0 I − λ P − Kt P −   0 I − λ P + Kt P + on the left, leaving 

I

 −λ (I − λ P

+

Kt+ P + )−1 P + K0 P −

−λ (I − λ P − Kt− P − )−1 P − K0 P +

 

I

on the right. Next we combine the uniform invertibility of the I − λ P ± Kt± P ± proved in Lemma 3 with Fact 1 to deduce that the inverses of these operators converge strongly

708


to (I − λ P ± K0± P ± )−1 . Since P ± K0 P ∓ are trace class, by Lemma 1, we deduce by Fact 2 that the matrix above is M + o1 (λ), where M is the matrix obtained by replacing Kt± by K0 . Thus  I − λ Kt = 



I − λ P − Kt P −

0

0

I − λ P + Kt P +

 M + o1 (λ).

Now we have to know that M is invertible, and we see this as follows. If, instead of the operator I − λ Kt which depends on t, we had started with the operator I − λK0 then we would have obtained the exact representation  I − λ K0 = 



I − λ P − K0 P −

0

0

I − λ P + K0 P +

 M.

Since both I − λ K0 and the matrix on the left are invertible, by our assumption, we deduce that M is invertible. Next we go through a similar process starting with the operator I − λ Kt+ rather than I − λ K. Using the fact that P − Kt+ = P − K0 + o1 (1), which we know by Lemma 1, we obtain in this case   I − λ P − K0 P − 0  M + o1 (λ). I − λ Kt+ =  + + 0 I − λ P Kt P From these matrix representations and the facts that M and I −λ P − K0 P − are invertible and I − P ± Kt± P ± uniformly invertible we deduce, using Fact 3 with o1 (1) replaced by o1 (λ),  (I − λ Kt )−1 = M−1 

(I − λ P − Kt P − )−1 0

 (I − λ Kt+ )−1 = M−1 



0 + −1

(I − λ P Kt P ) +

(I − λ P − K0 P − )−1 0



0 + −1

(I − λ P Kt P ) +

 + o1 (λ),

 + o1 (λ).

Comparing lower-right entries of the matrices gives P + (I − K)−1 P + = P + (I − Kt+ )−1 P + + o1 (λ), which is half of (2.4). The other half is obtained similarly. Remark. To apply (2.3) to (2.1) we need something extra, e.g., that (2.3) holds uniformly for these λ. With a little care our argument gives this also, but we spare the reader the details.


709

3. The Resolvents of K ± We are going to find R ∞integral representations for the integrals on the right side of (2.3), and we consider 1 Rt+ (u, u; λ) du first. The substitution u → u/t shows that this R∞ equals t R+ (u, u; λ) du, where R+ (u, v; λ) is the resolvent kernel of the operator K + . For this we require only that be a compact subset of {ω ∈ C : < ω < 1, ω 6∈ R+ }, −1

(3.1) +

since the term 1−ω does not appear in the exponent in the kernel of K . The derivation will involve an initial step which is valid only when is contained in the left half-plane so we assume this to begin with. We shall also assume that λ is so small that h(s) 6= 0 for < s = 21 , so that with the notation of the last section we may take sλ = 21 , β = 0. Eventually these two assumptions will be removed by an analytic continuation argument. Because β = 0 the kernel of K + is Z −t(1−ω)u e + K (u, v) = dρ(ω). (3.2) −ωu + v Z

If we set

e−(1−ω)u eωux dρ(ω),

A(u, x) :=

B(x, u) := e−ux ,

Z

then

∞

K + (u, v) =

A(u, x) B(x, v) dx. 0

Lemma 2 of the preceding section tells us that B(u, x) is the kernel of a bounded operator from L2 (R+ ) to L2 (R+ ) and, with our assumption on , that the same is true of A(x, u). The above shows that K + = AB, and the operator BA has kernel Z ∞ Z Z dρ(ω) dρ(ω) = . B(x, u) A(u, y) du = x − ωy + 1 − ω (x + 1) − ω(y + 1) 0 We use the general fact AB(I − λAB)−1 = A(I − λBA)−1 B to deduce that R+ (u, u) is given by an inner product, (3.3) R+ (u, u) = (I − λBA)−1 B( · , u), A(u, · ) . We begin by computing

f := (I − λBA)−1 B( · , u).

Thus we want to solve Z ∞Z f (x) − λ 0

Z

or f (x − 1) − λ

1

dρ(ω) f (y) dy = e−ux (x + 1) − ω(y + 1)

∞

Z

dρ(ω) f (y − 1) dy = e−u(x−1) x − ωy

(x ≥ 0),

(x ≥ 1).

The substitution x → ex brings this to the form of a Wiener-Hopf equation, so we can use the factorization method to find the solution. We begin by decreeing that the last identity holds for all x ≥ 0, in other words we define f on (−1, 0) by the identity. Then we define

710


Z F− (s) :=

∞

Z xs−1 f (x − 1) dx,

1

xs−1 f (x − 1) dx.

F+ (s) :=

1

0

These belong to the Hardy spaces H2 (< s < 21 ), H2 (< s > 21 ), respectively. We take Mellin transforms of both sides of the equation, and find that for < s = 21 , Z λπ (−ω)s−1 dρ(ω) F− (s) = eu u−s 0(s). F− (s) + F+ (s) − sin πs (The exponential in the integral is made definite by taking | arg(−ω)| < π.) We write this as (3.4) H(s) F− (s) + F+ (s) = eu u−s 0(s), where

λπ h(s) =1− H(s) := sin πs sin πs

Z (−ω)s−1 dρ(ω).

(3.5)

This function is bounded and analytic in each vertical strip of the complex s-plane, away from the zeros of sin πs, H(s) − 1 → 0 exponentially as =s → ±∞ and 21 +i∞ = 0. arg H(s) 1 2 −i∞

Thus there is be a representation H(s) =

H− (s) , H+ (s)

where H− (s)±1 are bounded and analytic in < s ≤ 21 + δ for some δ > 0 and H+ (s)±1 are bounded and analytic in < s ≥ 21 − δ. We multiply (3.4) by H+ (s) and use the decomposition F = F− + F+ of an arbitrary function in L2 (< s = 21 ) into boundary functions of functions in H2 (< s < 21 ) and H2 (< s > 21 ) to write the result as = −H+ (s) F+ (s) + eu u−s 0(s) H+ (s) . H− (s) F− (s) − eu u−s 0(s) H+ (s) −

+

The two sides are boundary functions of functions in H2 (< s < 21 ) and H2 (< s > 21 ), respectively, so they both vanish. This gives the representation eu −s F− (s) = u 0(s) H+ (s) . (3.6) H− (s) − Now (see (3.3)) we have to multiply f (x) by A(u, x) and integrate with respect to x over (0, ∞). This is Z Z ∞ Z ∞ Z dρ(ω) f (x) e−(1−ω)u eωux dx = e−u dρ(ω) f (x−1) χ(1,∞) (x) eωux dx.

0

0

The Mellin transform of f (x − 1) χ(1,∞) (x) equals F− (s) and the Mellin transform of eωux equals (−ωu)−s 0(s), so Parseval’s formula for Mellin transforms shows that the above equals Z Z ds −u , (−ωu)s−1 dρ(ω) 0(1 − s) F− (s) e 2πi


711

the outer integration taken over < s = 21 . (All vertical integrals are taken in the direction from −i∞ to i∞.) Next we recall (3.6) and use the integral representation of the operator G → G− to write the above as Z Z Z −s0 0(1 − s) ds u 0(s0 )H+ (s0 ) ds0 , (−ωu)s−1 dρ(ω) H− (s) 2πi s0 − s 2πi the inner integral taken over < s0 = Z Z (−ω)s−1 dρ(ω)

1 2

+ δ. Alternatively, this may be written

0(1 − s) ds H− (s) 2πi

Z

0

u−s −1 0(s0 + s)H+ (s0 + s) ds0 , s0 2πi

where now the inner integral is taken over < s0 = δ. The integrands of these integrals vanish exponentially at infinity, and u occurs to the power −s0 − 1, which has real part −δ − 1. Thus we may integrate with respect to u from t to ∞ under the integral signs and deduce that Z ∞ R+ (u, u; λ) du t

Z Z (−ω)s−1 dρ(ω)

=

0(1 − s) ds H− (s) 2πi

Z

0

t−s 0(s0 + s)H+ (s0 + s) ds0 . s02 2πi

It follows from (3.5), and the gamma function representation of the last factor there, that Z (−ω)s−1 dρ(ω) 0(1 − s) = λ−1 (1 − H(s)) 0(s)−1 .

Thus we have shown (reverting to the resolvent Rt+ ) that Z ∞ Rt+ (u, u; λ) du λ 1

Z

1 2 +i∞

= 1 2 −i∞

−1

(H− (s)

−1

− H+ (s)

−1

) 0(s)

ds 2πi

Z

δ+i∞ −s0

t

δ−i∞

0(s0 + s)H+ (s0 + s) ds0 . s02 2πi

This was proved if λ is sufficiently small and if lies in the left half-plane. Let us remove the latter condition first. For any η define the measure ρη by ρη (E) = ρ(E − η). This has support + η. For all η in a neighborhood in C of [−1, 0] the set + η is contained in the region (3.1). For η near −1 the set will also lie in the left half-plane. If λ is small enough the condition on the zeros will be satisfied for the measures ρη for all η in a neighborhood of [−1, 0]. For such λ we know that the above formula holds for η in a neighborhood of −1. But both sides are analytic functions of η in our neighborhood. Thus the formula must hold for η = 0 also, which is what we wanted to show. To remove the condition that λ be small we must modify the formula to read Z ∞ Rt+ (u, u; λ) du λ 1

Z

sλ +i∞

= sλ −i∞

(H− (s)−1 − H+ (s)−1 ) 0(s)−1

ds 2πi

Z

δ+i∞ −s0

t

δ−i∞

0(s0 + s)H+ (s0 + s) ds0 , s02 2πi (3.7)

712


where sλ ∈ (0, 1) is as in the previous section, a continuously varying function of λ such that 1 − H(s) is nonzero on the line < s = sλ . Both sides of (3.7) are analytic functions of λ for λ in a neighborhood of our path, they agree near λ = 0, so they agree everywhere on the path. As for R− (u, u; λ) on (0, 1) the change of variable u → u−1 transforms the kernel K − (u, v) into Z Z −t(1−ω−1 )u e e−t(1−ω)u + e dρ(ω) = (−ω) dρ(ω −1 ). K (u, v) := −ωv + u −1 −ωu + v Therefore

e+ (u−1 , u−1 ; λ), R− (u, u; λ) = u−2 R

e+ is the resolvent kernel for K e + , and so where R Z ∞ Z 1 e+ (u, u; λ) du. R− (u, u; λ) du = R 0

1

e + replaces H(s) by H(1 − s) and so the integral It is easy to see that replacing K + by K is equal to (3.7) with H(s) replaced by H(1 − s) and sλ replaced by 1 − sλ . Now that we have these explicit representations it is obvious what we do: in the inner integral in (3.7) we move the line of s0 -integration from < s0 = δ to < s0 = −δ. We can do this if δ is small enough. The residue at the double pole at s0 = 0 contributes Z Z ds 1 ds − (H(s)−1 −1) log t+ (H− (s)−1 −H+ (s)−1 ) (0(s) H+ (s)) 0 , (3.8) 2πi 0(s) 2πi the integrations taken over < s = sλ , and the error term is O(tδ ). We do the same with R ∞H(s) replaced by H(1 − s) and add, and so we have obtained the asymptotics of λ 0 R(u, u; λ) du. 4. Asymptotics of det (I − K) —The Periodic case The formula for H(s) is now H(s) = 1 −

λπ X cω (−ω)s−1 , sin πs

ω running over the nth roots of unity other than 1. If in our sums we set −ω = πi e n (2j−n) (j = 1, · · · , n − 1), then | arg (−ω)| = | nπ (2j − n)| < π as required. If we also πi set z = e n s , then the above may be written as sin πs H(s) = sin πs + λπ

n−1 X

ω −1 cω z 2j−n ,

j=1

or

n−1 h i X ω −1 cω z 2j . 2i sin πs H(s) = z −n z 2n − 1 + 2iλπ j=1


713

Recall that sin πs H(s) = h(s). The expression in brackets above is a polynomial of 2πi degree n in z 2 and its zeros are the quantities by e n α , where α runs through the zeros αk = αk (λ) (k = 1, · · · , n). With this notation the right side above is equal to z −n

Y

(z 2 − e

2πi n α

α

)=

Y

πi

πi

(z e− n α − z −1 e n α )

α

Y

πi

e n α.

(4.1)

α

Here and below the index α runs over the set {α1 , · · · , αn }. The last product, a square root of the product of the roots of the polynomial, equals ±1 or ±i. This product equals πi πi e n (1+···+n) = in+1 for λ = 0 and so for all λ. Recalling that z = e n s we see that we have obtained the representation H(s) =

π (−1)n 2n−1 Y sin (s − α). sin πs n α

We now evaluate the integrals in (3.8). For λ sufficiently small again, the αk will all lie in the strip 21 < < s < n+ 21 . We may assume this since the usual analytic continuation will give the general case. To evaluate the first integral in (3.8) we consider Z ds (H(s)−1 − 1) s 2πi taken over the infinite rectangle which is the contour running from n+ 21 −i∞ to n+ 21 +i∞ and then from 21 + i∞ to 21 − i∞. On the one hand this equals n times the first integral in (3.8), and on the other hand it equals the sum of the residues at the poles between the two lines. Thus we have shown that Z 1 X ds = α H 0 (α)−1 . (4.2) (H(s)−1 − 1) 2πi n α For the second integral in (3.8) we have to write down the explicit expression for the factors H± (s). These are given by Q H+ (s) =

α

0( s−α (−1)n 2n−1 0(1 − s) s n + 1) s Q n , H− (s) = n . s−α 0(s) α 0(− n )

It is readily verified that H(s) = H− (s)/H+ (s) and that H− (s)±1 and H+ (s)±1 are are bounded and analytic in < s ≤ 21 + δ and < s ≥ 21 − δ, respectively, for small λ. Thus, they are the correct factors. The second integral in (3.8) may be written Z (0(s) H+ (s))0 ds , (H(s)−1 − 1) 0(s) H+ (s) 2πi and by the above expression for H+ (s) this equals Z

−1

(H(s)

i ds h 1 X 00 ( s−α0 + 1) n + log n , − 1) 0 n 0 0( s−α 2πi n + 1) α

714


where in the sum α0 also runs over the set {α1 , · · · , αn }. The contribution of the term log n is exactly log n times (4.2). To evaluate the rest of this integral we use the characteristic property of the Barnes G-function, G(z + 1) = 0(z) G(z). Putting z equal to s−α0 n + 1 and taking logarithmic derivatives gives 0

0

0

G0 ( s−α 00 ( s−α G0 ( s−α n + 2) n + 1) n + 1) − = . 0 0 s−α s−α s−α0 G( n + 2) G( n + 1) 0( n + 1) We integrate (H(s)−1 − 1)

X G0 ( s−α0 + 1) n s−α0 G( n + 1) α0

over the same infinite rectangle as before. (This is justified by the fact that H(s)−1 − 1 vanishes exponentially at ∞ in vertical strips while G0 (z)/G(z) grows like z log z.) By the above relation the result is exactly the integral we want, and so computing residues gives the formula 0 Z X 00 ( s−α0 + 1) ds + 1) 0 −1 1 X G0 ( α−α 1 −1 n n H (α) . (H(s) − 1) = s−α0 α−α0 n 2πi n 0( n + 1) G( n + 1) α0 α,α0 Thus, we have shown that Z ∞ t + b+ (λ) + O(tδ ), λ Rt+ (u, u, λ)du = a+ (λ) log n 1 where 0 + 1) 0 −1 1 X 1 X G0 ( α−α 0 −1 + n H (α) . α H (α) , b (λ) = a (λ) = − 0 α−α n α n G( n + 1) α,α0

+

(4.3)

R1 We must add to this λ 0 Rt− (u, u, λ) du which, as was mentioned earlier, is obtained by replacing H(s) by H(1 − s). The zeros of this function which lie near 1, · · · , n for small λ are n − α + 1. Hence (4.3) is replaced by a− (λ) =

1 X (n − α + 1) H 0 (α)−1 , n α

0 0 + 1) 0 0 −1 1 X G0 ( α n−α + 1) 0 −1 1 X G0 ( α−α n H (α) = − H (α ) . b (λ) = − α0 −α α−α0 n n G( n + 1) G( n + 1) α,α0 α,α0

−

d H(1 − s) = −H 0 (1 − s). Adding Here we have used the periodicity of H and the fact ds and using (2.3), we see that Z ∞ t λ + b(λ) + O(tδ ) R(u, u; λ) du = a(λ) log (4.4) n 0

where a(λ) = a+ (λ) + a− (λ), b(λ) = b+ (λ) + b− (λ). Now to obtain the asymptotics of log det (I − K) we must replace λ by µ, multiply the above by −dµ/µ and integrate from 0 to λ. (Notice the factor λ on the left side of (4.4) and recall the minus sign in (2.1).) We obtain from (3.5) that for a zero α(λ) of H


715

we have λ dα/dλ = H 0 (α)−1 . Thus the coefficient a(λ) in (4.4) may be written (after replacing λ by µ and thinking of α as α(µ)) −

1X (2α − n − 1) µ dα/dµ. n α

Multiplying by −dµ/µ and integrating gives µ=λ 1 X µ=λ 1X 2 (α − (n + 1) α) = α2 n α n α µ=0 µ=0 since, as we have already seen,

P

α is independent of λ. Similarly b(µ) may be written

0 + 1) 1 X G0 ( α−α n µ d(α − α0 )/dµ, 0 α−α n G( + 1) 0 n α,α

and multiplying by −dµ/µ and integrating gives −

X α,α0

log G(

µ=λ α − α0 + 1) . n µ=0

If we recall that when µ = 0 the zeros are 1, · · · , n we see that the formulas for the constants in (1.4) are the ones stated in the introduction. To obtain the asymptotics of the qk (t) we must consider det (I − λ Kk ) instead of det (I − λ K). This amounts to replacing the coefficients cω by ω k cω , and this in turn amounts to replacing H(s) by H(s + k). The zeros of this function modulo n are α1 (λ) − k, · · · , αn (λ) − k. But these are not the zeros which are to replace the α, α0 in our formulas for a and b since they do not arise from the zeros whose values are 1, · · · , n when λ = 0. Rather, the replacements must be αk+1 (λ) − k, · · · , αn (λ) − k, α1 (λ) + n − k, · · · , αk (λ) + n − k, which are the zeros with this property. Thus, for the asymptotics of qk (t) we make these replacements in our formulas and the corresponding replacements with k − 1 instead of k, subtract, and take logarithms. The result is found, after some computation and the use of the functional equation for the G-function, to be the asymptotics stated in the introduction. 5. Asymptotics of det (I − K) – The Nonperiodic Case The coefficients a(λ) = a+ (λ) + a− (λ), b(λ) = b+ (λ) + b− (λ) of the last section were in general given by integral formulas. They were Z ds , (5.1) a+ (λ) = − (H(s)−1 − 1) 2πi Z 1 ds + + (0(s) H+ (s)) 0 , (5.2) b (λ) − a (λ) log n = (H− (s)−1 − H+ (s)−1 ) 0(s) 2πi

716


with the formulas for a− (λ), and b− (λ) obtained by replacing H(s) by H(1 − s). The integrations may be taken over < s = 21 if λ is small enough and, as usual, this is no loss of generality. To find a and b we integrate a(µ) and b(µ), respectively, with respect to −dµ/µ over a path from 0 to λ. Write Z π (−ω)s−1 dρ(ω), ϕ(s) := sin πs so that H(s) = 1 − λ ϕ(s). Making this replacement in the integrand in (5.1) and integrating gives Z λ Z λ dµ ϕ(s) 1 −1 =− dµ = log (1 − λϕ(s)), − 1 − µϕ(s) µ 1 − µϕ(s) 0 0 so the contribution to the coefficient of log t is Z ds − log (1 − λϕ(s)) 2πi over < s = 21 . Replacing H(s) by H(1 − s) gives the same contribution since we may make the substitution s → 1 − s. Therefore Z ds . (5.3) a = −2 log (1 − λϕ(s)) 2πi For general λ the integration is to be on < s = sλ . Now we go to (5.2), which may be written Z ds Z H 0 (s) H 0 (s) ds 00 (s) 1 + −1 + − + . (5.4) b+ (λ) − a+ (λ) log n = 0(s) H(s) 2πi H− (s) H+ (s) 2πi By a computation like the earlier one we see that the first integral becomes after the µ-integration Z ds 00 (s) log (1 − λϕ(s)) . 0(s) 2πi Then we replace s by 1 − s, make the substitution s → 1 − s, and add. We see that the contribution of the first integral in (5.4) equals Z 0 ds 0 (s) 00 (1 − s) + log (1 − λϕ(s)) . (5.5) 0(s) 0(1 − s) 2πi Finally, we look at the second integral in (5.4), which equals Z Z Z ds H+0 (s) ds H+0 (s) −1 ds = H(s) = (log H+ )0 (s) H(s)−1 . H− (s) 2πi H+ (s) 2πi 2πi Replacing H(s) by H(1 − s) replaces H+ (s) by 1/H− (1 − s), so after making the substitution s → 1 − s and adding we get Z ds . (5.6) (log H+ H− )0 (s) H(s)−1 2πi Recall that for < s = 21 ,


717

1 log H± (s) = ∓ log H(s) + 2

Z

log H(s0 ) ds0 , s0 − s 2πi

where the integral is the Hilbert transform, a principal value integral over < s0 = 21 . So Z log H(s0 ) ds0 . log H+ (s) H− (s) = s0 − s πi Since the Hilbert transform commutes with differentiation we get Z 1 H 0 (s0 ) ds0 , (log H+ H− )0 (s) = πi H(s0 ) s0 − s and so (5.6) equals −

1 2π 2

Z Z

0 H 0 (s0 ) −1 ds H(s) ds. H(s0 ) s0 − s

The µ-integration gives Z λ Z λ −µ ϕ0 (s0 ) 1 1 dµ 0 0 − = ϕ dµ (s ) 1 − µϕ(s0 ) 1 − µϕ(s) µ (1 − µϕ(s0 )) (1 − µϕ(s)) 0 0 = ϕ0 (s0 )

1 − λϕ(s) 1 log , ϕ(s0 ) − ϕ(s) 1 − λϕ(s0 )

and so the contribution of the second integral in (5.4) is Z Z 1 − λ ϕ(s) ds0 1 ϕ0 (s0 ) log ds. − 2 2π ϕ(s0 ) − ϕ(s) 1 − λ ϕ(s0 ) s0 − s

(5.7)

Thus b − a log n equals the sum of (5.5) and (5.7). As usual, for general λ the integrals are taken over < s, s0 = sλ . Remark. The double integral (5.7) is exactly the constant in the known asymptotics for the determinants of the truncated Wiener-Hopf operators associated with ϕ (specifically, ϕ( 21 + iξ) is the Fourier transform of the convolving kernel), and (5.3) is (minus twice) the leading coefficient in the asymptotics. One can see by the argument of Section 2 how both these things arise and conclude also that (5.7) equals det M−1 . The extra ingredient here is therefore the integral (5.5). 6. The Case n = 2 In this case the only root is ω = −1 and we may take c−1 equal to 1 since it occurs only in the product λc−1 . Thus the kernel of K0 is e−2t(u+u u+v

−1

)

,

and the equation (for either qk ) is q 00 (t) + t−1 q 0 (t) = 8 sinh 2q(t).

(6.1)

718


We have in this case h(s) = sin πs − πλ, the zeros are given by α0 =

p 1 1 arcsin πλ = log(πiλ + 1 − π 2 λ2 ), α1 = 1 − α0 , π πi

and αk+2 = αk + 2. The square root is that branch which is positive for λ = 0 and the logarithm that branch which is 0 there. From this it is easy to see that the set (1.6) consists of the rays (−∞, −1/π] and [1/π, ∞) and 3, the proposed region of validity of our√formulas, is the complex plane cut along these rays. If we note that the function πiλ + 1 − π 2 λ2 maps 3 onto the right half-plane, we see that |< αk (λ) − k| < 21 for all λ ∈ 3 and so the “extra” condition on the α is satisfied. The range of validity is therefore all of 3. Using the formulas stated in the introduction we find that det (I − λ K0 ) ∼ b (t/2)a with a = α02 + α0 , b =

G( 21 ) G( 23 ) 0( 21 ) G( 21 )2 = . G( 21 + α0 ) G( 23 − α0 ) 0( 21 − α0 ) G( 21 + α0 ) G( 21 − α0 )

For det (I + λ K0 ) we replace λ by −λ, which amounts to replacing α0 by −α0 . If we multiply the two results together we recover the asymptotics for det (I − λ2 K02 ) determined in [7] and [2]. For q0 we have the asymptotics A log(t/2) + log B + o(1), where A = 2 α0 , B =

0( 21 − α0 ) , 0( 21 + α0 )

in agreement with [6]. This is the solution of (6.1) which is asymptotic to −2λK0 (4t) as t → ∞, where this K0 is the Bessel function. For λ 6∈ 3 the asymptotics are different. For λ > 1/π, eq0 (t) has an infinite sequence of zeros as t → 0 and for λ < −1/π, it has an infinite sequence of poles; this follows from the fact that as t → 0 the spectrum of K0 fills up the interval [0, π]. A heuristic derivation of the asymptotics for λ on the cut is given in [6]. In the next section we present a similar derivation for some cases of n = 3. 7. n = 3 and Cylindrical Bullough-Dodd The cylindrical Bullough-Dodd equation q 00 (t) + t−1 q 0 (t) = 4e2q − 4e−q ,

(7.1)

arises in the special case of n = 3, where cω = −ω 3 cω−1 . Then q1 = 0, q2 = −q3 and (7.1) is satisfied by q = q3 . If we set ζ := e2πi/3 , then cζ may be chosen arbitrarily. If we choose it to be ζ(1 − ζ) then cζ 2 = ζ 2 (1 − ζ 2 ), c−1 = 0 and √ h(s) = sin πs + 2π 3 λ sin(π(s + 2)/3). Again λ is the one free parameter. The zeros are given by α0 =

1 3 λ 1 − arcsin + , α1 = 1, α2 = 2 − α0 , 4 2π 2 2λc


719

√ where λc = 1/(2 3π). Now 3 is the complement of the union of cuts (−∞, −3λc ] ∪ [λc , ∞). For λ ∈ 3 the zeros satisfy 1 5 < α0 ∈ (− , 1), < α1 = 1, < α2 ∈ (1, ), 2 2 and so the extra condition is again automatically satisfied and our formulas hold for all λ ∈ 3. If we write 1 (7.2) q(t) = A log( ) − log B + o(1) t then the connection formulas give in this case α+2 2α+1 −A 0 3 0 3 A = −2α, B = 3 , 0 1−α 0 2−2α 3 3 where we wrote α for α0 . For large t,

√ q(t) ∼ 6λK0 2 3t .

Asymptotics at the critical value λc . This section and the following ones are heuristic. Using the differential equation (7.1) one can determine the correction terms to (7.2): B2 4 B4 1 2−2A 2+A t − t + t4−4A + . . . . q(t) = A log( ) + log B + t (1 − A)2 B(2 + A)2 2(1 − A)4 (7.3) This is valid for λ ∈ 3. To understand the higher order terms in more detail it is convenient to define w(t) = exp (−q(t)) . where w satisfies the equation w00 =

1 02 1 0 4 (w ) − w + 4w2 − . w t w

(7.4)

The asymptotics we proved become the statement w(t) = BtA (1 + o(1)) . Using (7.4) to calculate the higher order terms in the small t expansion for w we find w(t)=BtA 1 −

4B 12B 2 4+2A t2−2A 2+A + t + t + ··· B 2 (1 − A)2 (2 + A)2 (2 + A)4

(j + 1) 2j B j 2j+jA t + ··· (2 + A)2j 2 4 24(A − 2A − 2) 4−A 6−3A t + t + · · · . + (2 + A)2 (1 − A)2 (4 − A)2 B (1 − A)4 (4 − A)2 B 3

+

(7.5)

In contrast to (7.3) the terms t2m−2mA only appear for m = 1 in the above expansion. As λ varies from 0 to λc α varies from 0 to − 21 , so A varies from 0 to 1 and B from 1 to ∞. Observe that the first two terms in (7.5) are of the same order in t as t → 0 (and

720


A → 1) whereas the others are of lower order. This suggests that when λ = λc we have w(t) ∼ t1 as t → 0, where 4 t2−2A (7.6) 1 := lim B 1 − 2 = 2 log(1/t) − log 2 + 2 log 3 − 2γ. 2 1 B (1 − A) 3 α→− 2 We now use the differential equation (7.4) to find the higher order terms, which are polynomial in t and 1 . (The only property of 1 used in the formal expansion is d1 /dt = −2/t.) The expansion is 4 7 4 4 4 8 2 w(t) = t1 + t 1 + 1 + + t 8131 + 21621 + 2401 + 80 +O(t10 41 ). 9 3 9 2187 (7.7) Thus (as for the n = 2 analogue [6]) if one were to alter the constant appearing in (7.6) then the solution of (7.4) whose asymptotics is (7.7) would not match onto the solution that approaches 1 as t → ∞. These asymptotics at λ = λc were checked by numerically solving (7.4) in both a forward and backward integration. There was agreement to nine decimal places at t = 1/4. Asymptotics at the critical value −3λc . We proceed as above and examine all terms that would be of the same order of magnitude as λ → −3λc , when α → 1 and A → −2. These are the terms of the geometric series, those involving the powers t2j+jA . Summing the series we see that we must compute lim

α→1

B tA 1−

2Bt2+A /(2

Defining

+

2 A)2

=

1 . 2t2 (log t − log 3 + γ)2

2 = log t − log 3 + γ

we thus see that at λ = −3λc , w(t) ∼

1 2t2 22

.

(7.8)

To compute higher order terms it is convenient to look at v(t) = 1/w(t). Using the differential equation and only the property d2 /dt = 1/t of 2 we find t8 32 76 40 40 20 v(t) = 2t2 22 + 862 − 52 + 42 − 32 + 22 − 2 + O(t14 10 2 ). 9 3 9 9 27 81 (7.9) Asymptotics for λ > λc . Think of λ as being on the lower part of the cut [λc , ∞). Then 1 3 α = − − iµ, 2 2 where µ :=

1 λ 1 arccosh + , (µ > 0). π 2 2λc

Thus A = 1 + 3iµ. Here again the first two terms in (7.5) are of the same order as t → 0 whereas the others are of lower order, and we obtain


w(t) = BtA −

721

t1−2A + O(t4 ). B(1 − A)2

Substituting in the values of A and B in terms of µ we find 2t 3iµ log(t/3) 0(1/2 − iµ/2)0(−iµ) = e + O(t4 ). w(t) = 3µ 0(1/2 + iµ/2)0(iµ) If we had taken λ to be on the upper part of the cut then we would have replaced µ by −µ. The result would have been precisely the same. Remark. In [5] a method was described to find connection formulas for solutions of a class of equations including (7.1). Away from the critical values the short-range asymptotics stated there correspond to the first two terms in (7.5). As for the asymptotics at the critical values, our formulas agree with [5] at λ = −3λc but at λ = λc we differ by a factor of 2. Acknowledgement. This work was supported in part by National Science Foundation Grants DMS-9303413 (first author) and DMS-9424292 (second author). The authors also thank the Volkswagen- Stiftung for their support of the Research in Pairs program at Oberwolfach; the first results of the paper were obtained during the authors’ visit under this program.

References 1. Barnes, E. W.: The theory of the G-function. Quart. J. Pure and Appl. Math. 31, 264–314 (1900) 2. Basor, E. L., Tracy, C. A.: Asymptotics of a tau function and Toeplitz determinants with singular generating functions. Int. J. Mod. Phys. A 7, 83–107 (1992) 3. Cecotti, S., Fendley, P., Intriligator, K., Vafa, C.: A new supersymmetric index. Nucl. Phys. B386, 405–452 (1992) 4. Gohberg, I. C., Feldman, I. A.: Convolution equations and projection methods for their solution. Transl. Math. Monogr. 41, Providence, RI: Am. Math. Soc., 1974 5. Kitaev, A. V.: Method of isometric deformation for “degenerate” third Painlevé equation. J. Soviet Math. 46, 2077–2082 (1989) 6. McCoy, B. M., Tracy, C. A., Wu, T. T.: Painlevé functions of the third kind. J. Math. Phys. 18, 1058–1092 (1977) 7. Tracy, C. A.: Asymptotics of a τ -function arising in the two-dimensional Ising model. Commun. Math. Phys. 142, 297–311 (1991) 8. Widom, H.: Asymptotic behavior of block Toeplitz matrices and determinants II. Adv. in Math. 21, 1–29 (1976) 9. Widom, H.: Some classes of solutions to the Toda lattice hierarchy. Commun. Math. Phys. 184, 653–661 (1997) 10. Zamolodchikov, Al. B.: Painlevé III and 2D polymers. Nucl. Phys. B432 [FS], 427–456 (1994) Communicated by T. Miwa

Communications in Mathematical Physics - Volume 190

Communications in Mathematical Physics - Volume 221

Communications in Mathematical Physics - Volume 220

Communications in Mathematical Physics - Volume 235

Communications in Mathematical Physics - Volume 223

Communications In Mathematical Physics - Volume 283

Communications In Mathematical Physics - Volume 270

Communications in Mathematical Physics - Volume 208

Communications in Mathematical Physics - Volume 186

Communications In Mathematical Physics - Volume 294

Communications in Mathematical Physics - Volume 217

Communications In Mathematical Physics - Volume 274

Communications in Mathematical Physics - Volume 239

Communications in Mathematical Physics - Volume 306

Communications in Mathematical Physics - Volume 264

Communications in Mathematical Physics - Volume 227

Communications in Mathematical Physics - Volume 184

Communications in Mathematical Physics - Volume 261

Communications in Mathematical Physics - Volume 225

Communications In Mathematical Physics - Volume 263

Communications in Mathematical Physics - Volume 211

Communications In Mathematical Physics - Volume 293

Communications in Mathematical Physics - Volume 246

Communications In Mathematical Physics - Volume 298

Communications in Mathematical Physics - Volume 234

Communications In Mathematical Physics - Volume 288

Communications in Mathematical Physics - Volume 304

Communications In Mathematical Physics - Volume 292

Communications in Mathematical Physics - Volume 233

Communications in Mathematical Physics - Volume 253

Communications in Mathematical Physics - Volume 222

Communications in Mathematical Physics - Volume 190

Communications in Mathematical Physics - Volume 221

Communications in Mathematical Physics - Volume 220

Communications in Mathematical Physics - Volume 235

Communications in Mathematical Physics - Volume 223

Communications In Mathematical Physics - Volume 283

Communications In Mathematical Physics - Volume 270

Communications in Mathematical Physics - Volume 208

Communications in Mathematical Physics - Volume 186

Communications In Mathematical Physics - Volume 294

Communications in Mathematical Physics - Volume 217

Communications In Mathematical Physics - Volume 274

Communications in Mathematical Physics - Volume 239

Communications in Mathematical Physics - Volume 306

Communications in Mathematical Physics - Volume 264

Communications in Mathematical Physics - Volume 227

Communications in Mathematical Physics - Volume 184

Communications in Mathematical Physics - Volume 261

Communications in Mathematical Physics - Volume 225

Communications In Mathematical Physics - Volume 263

Communications in Mathematical Physics - Volume 211

Communications In Mathematical Physics - Volume 293

Communications in Mathematical Physics - Volume 246

Communications In Mathematical Physics - Volume 298

Communications in Mathematical Physics - Volume 234

Communications In Mathematical Physics - Volume 288

Communications in Mathematical Physics - Volume 304

Communications In Mathematical Physics - Volume 292

Communications in Mathematical Physics - Volume 233

Communications in Mathematical Physics - Volume 253

Communications in Mathematical Physics - Volume 222

Recommend Documents