Aspects of Infinite Groups: A Festschrift in Honor of Anthony Gaglione (Algebra and Discrete Mathematics)


Author: Benjamin Fine | Gerhard Rosenberger | Dennis Spellman


Algebra and Discrete Mathematics

A Festschrift in Honor of Anthony Gaglione


ISSN: 1793-5873

Managing Editor: Rüdiger Göbel (University Duisburg-Essen, Germany)
Editorial Board: Elisabeth Bouscaren, Manfred Droste, Katsuya Eda, Emmanuel Dror Farjoun, Angus MacIntyre, H. Dugald Macpherson, José Antonio de la Peña, Luigi Salce, Mark Sapir, Lutz Strüngmann, Simon Thomas

The series ADM focuses on recent developments in all branches of algebra and topics closely connected. In particular, it emphasizes combinatorics, set theoretical methods, model theory and interplay between various fields, and their influence on algebra and more general discrete structures. The publications of this series are of special interest to researchers, post-doctorals and graduate students. It is the intention of the editors to support fascinating new activities of research and to spread the new developments to the entire mathematical community.

Vol. 1:

Aspects of Infinite Groups: A Festschrift in Honor of Anthony Gaglione eds. Benjamin Fine, Gerhard Rosenberger & Dennis Spellman


ASPECTS OF INFINITE GROUPS

A Festschrift in Honor of Anthony Gaglione

editors

Benjamin Fine Fairfield University, USA

Gerhard Rosenberger Universität Dortmund, Germany

Dennis Spellman Temple University, USA

World Scientific NEW JERSEY • LONDON • SINGAPORE • BEIJING • SHANGHAI • HONG KONG • TAIPEI • CHENNAI

Published by

World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE

British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library.

Algebra and Discrete Mathematics - Vol. 1 ASPECTS OF INFINITE GROUPS A Festschrift in Honor of Anthony Gaglione Copyright © 2008 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher.

For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher.

ISBN-13 978-981-279-340-9 ISBN-10 981-279-340-2

Printed in Singapore by World Scientific Printers

ASPECTS OF INFINITE GROUPS

Editors: Benjamin Fine, Gerhard Rosenberger, Dennis Spellman

PREFACE

This volume consists of contributions by participants and speakers at the conference entitled Aspects of Infinite Groups, held at Fairfield University in March 2007 in honor of Prof. Anthony Gaglione's sixtieth birthday. Prof. Gaglione of the United States Naval Academy at Annapolis has made important and varied contributions to several different areas of Infinite Group Theory and Combinatorial Group Theory. Along with Herman Waldinger he developed and studied many important topics within the commutator calculus. Along with Dennis Spellman he proved a magnificent result tying together the concepts of residual freeness and universal freeness. This result was also obtained independently by Vladimir Remeslennikov and can be considered one of the seminal steps in the final proof of the celebrated Tarski problems. The final proof of the Tarski problems was accomplished by O. Kharlampovich and A. Myasnikov and independently by Z. Sela. Finally Prof. Gaglione, along with G. Baumslag, B. Fine, A. Myasnikov, V. Remeslennikov and D. Spellman, completely developed the theory of discriminating and squarelike groups. This important class of groups was introduced by G. Baumslag, A. Myasnikov and V. Remeslennikov as an outgrowth of the development of algebraic geometry over groups.

The papers in this volume provide an interesting mix of results on modern infinite discrete group theory, ranging from classical combinatorial group theory to algebraic geometry over groups to noncommutative algebraic cryptography. The main speakers were primarily people who worked closely with Prof. Gaglione. They were

Prof. Michael Anshel, City University of New York, New York City, NY
Prof. Gilbert Baumslag, City University of New York, New York City, NY
Prof. Benjamin Fine, Fairfield University, Fairfield, CT
Prof. Alexei Myasnikov, McGill University, Montreal, Canada
Prof. Gerhard Rosenberger, TU Dortmund, Dortmund, Germany
Prof. Dennis Spellman, Temple University, Philadelphia, Pennsylvania

Prof. Gaglione received his Ph.D. in Mathematics from Polytechnic University in Brooklyn, New York in 1972. He has been a professor at the United States Naval Academy since 1977. He is the author or coauthor of more than 70 journal articles representing the areas of combinatorial group theory described earlier. He is also the author or coauthor of four books.

Benjamin Fine
Gerhard Rosenberger
Dennis Spellman

CONTENTS

Preface

Actions, Commutator Identities, and the Algebraic Eraser™
I. Anshel, M. Anshel and D. Goldfeld

Virtually Free-By-Cyclic One-Relator Groups: I
G. Baumslag and D. Troeger

Some Cryptoprimitives for Noncommutative Algebraic Cryptography
G. Baumslag, Y. Bryukhov, B. Fine and G. Rosenberger

On the Derived Subgroups of the Free Nilpotent Groups of Finite Rank
R. D. Blyth, P. Moravec and R. F. Morse

A Recurrence Relation for the Number of Free Subgroups in Free Products of Cyclic Groups
T. Camps, M. Dörfer and G. Rosenberger

The Baumslag-Solitar Groups: A Solution for the Isomorphism Problem
A. E. Clement

Unification Theorems in Algebraic Geometry
E. Daniyarova, A. Myasnikov and V. Remeslennikov

Reflections on Commutative Transitivity
B. Fine and G. Rosenberger

Groups Universally Equivalent to Free Burnside Groups of Prime Exponent and a Question of Philip Hall
A. Gaglione, S. Lipschutz and D. Spellman

Changing Generators in Nonfree Groups
R. Goldstein

Matrix Completions Over Principal Ideal Rings
W. H. Gustafson, D. W. Robinson, R. B. Richter and W. P. Wardlaw

A Primer on Computational Group Homology and Cohomology
D. Joyner

Doubles of Residually Solvable Groups
D. Kahrobaei

An Application of a Group of Ol'shanskii to a Question of Fine et al.
S. Lipschutz and D. Spellman

Quotient Isomorphism Invariants of a Finitely Generated Coxeter Group
M. Mihalik, J. Ratcliffe and S. Tschantz

Localization and IA-automorphisms of Finitely Generated, Metabelian, and Torsion-Free Nilpotent Groups
M. Zyman

ACTIONS, COMMUTATOR IDENTITIES, AND THE ALGEBRAIC ERASER™

Iris Anshel
31 Peter Lynas Ct., Tenafly, NJ 07670, USA

Michael Anshel
City College of New York, New York, NY 10031, USA

Dorian Goldfeld*
Columbia University, Department of Mathematics, New York, NY 10027

Dedicated to Tony Gaglione on his 60th birthday.

Abstract: An algebraic structure arising in the formulation of a lightweight key agreement protocol yields a new concept of commutator whose identities are quite traditional.

1. Introduction

In [AAGL] the authors introduced a key agreement protocol for public-key cryptography suitable for implementation on lightweight platforms, that is, those subject to severe cost and resource constraints. Careful examination reveals a hidden notion of commutator possessing identities analogous to those formulated by P. Hall, W. Magnus, and E. Witt. The question as to whether or not these eminent mathematicians of the twentieth century developed the theory of commutators for cryptographic purposes was raised in [AG]. Our method is to formulate the necessary mathematical primitives as monoid (group) actions and then identify the objects of interest. We conclude by inviting the reader to explore a certain action in the context of nonhopfian groups. This paper is dedicated to Professor Anthony Gaglione on his 60th birthday.

*The authors would like to thank SecureRF for its support of this research.

2. The Algebraic Eraser™ and its Key Agreement Protocol

Let M, N denote monoids (or groups), and let

\[ \Pi : M \to N \]

be a homomorphism. In addition, let $S$ denote a monoid (or group) which acts on $M$ on the left as a monoid of endomorphisms (or a group of automorphisms), i.e., $S \to \mathrm{End}(M)$, where we view $\mathrm{End}(M)$ as a monoid. The action of $s \in S$ on $m \in M$ is denoted by ${}^{s}m$, and the semidirect product of $M$ and $S$, denoted $M \rtimes S$, is the monoid (or group) constructed in the classical manner:

\[ (m_1, s_1)(m_2, s_2) = (m_1\,{}^{s_1}m_2,\ s_1 s_2). \]

The Algebraic Eraser™ is, in essence, a right action of $M \rtimes S$ on the direct product $N \times S$ (viewed as a set), together with some additional apparatus which allows for cryptographic applications. The action $E$ is specified as follows:

\[ E : (N \times S) \times (M \rtimes S) \to N \times S \]

is given by

\[ E\big((n, s), (m_1, s_1)\big) = \big(n\,\Pi({}^{s}m_1),\ s s_1\big). \]

In practice we often use the more compact notation

\[ (n, s) \star (m_1, s_1) = E\big((n, s), (m_1, s_1)\big). \]

That $E$ is a right action follows from the elementary computation with the operation $\star$: given $(n, s) \in N \times S$ and $(m_1, s_1), (m_2, s_2) \in M \rtimes S$, then

\[ \big((n, s) \star (m_1, s_1)\big) \star (m_2, s_2) = \big(n\,\Pi({}^{s}m_1),\ s s_1\big) \star (m_2, s_2) = \big(n\,\Pi({}^{s}(m_1\,{}^{s_1}m_2)),\ s s_1 s_2\big) = (n, s) \star \big((m_1, s_1)(m_2, s_2)\big). \]

From the point of view of effective computation, the identity above allows us to compute $\star$ iteratively, which will be useful for applications. In particular, observing that $(1, 1) \star (m, s) = (\Pi(m), s)$ for all $(m, s) \in M \rtimes S$, if one


expressed an element (m, s) as a product in M [...] > 1, the kernel H of

\[ \phi : \langle a, b;\ [a,b]^n \rangle \to \langle a, b;\ a^2,\ b^{2n},\ abab \rangle \]

has a one-relator presentation on $4n - 2$ generators with relator $R$, parametrized by $n$. The relator $R$ is a surface word.

We prove this theorem with a series of lemmas and computations. Our main tools are Tietze transformations and the Reidemeister-Schreier procedure. The definition of $R$ is given by Equation (7) in Section 5.3. To start, we observe

Lemma 5.1. $\{b^i \mid 0 \le i < 2n\} \cup \{ab^i \mid 0 \le i < 2n\}$ is a right transversal of $H$ closed under initial segments.

5.1. The relators $\rho(b^{-\ell}[a,b]^n b^{\ell})$

We need to consider $0 \le \ell \le 2n - 1$. In this section we will show that $\rho(b^{-\ell}[a,b]^n b^{\ell})$ is conjugate to $\rho([a,b]^n)$ if $\ell$ is even, and to $\rho(b^{-1}[a,b]^n b)$ if $\ell$ is odd. In detail, we will prove the following:

Lemma 5.2. Let $0 \le \ell \le 2n - 1$. If $\ell$ is even, $\rho(b^{-\ell}[a,b]^n b^{\ell})$ is conjugate to $v_0 \cdots v_{n-1}$ with

\[ v_k = \sigma(ab^{2n-2k}, a)^{-1}\,\sigma(ab^{2n-(2k+1)}, b)^{-1}\,\sigma(ab^{2n-(2k+1)}, a)\,\sigma(b^{2k+1}, b). \tag{1} \]

If $\ell$ is odd, $\rho(b^{-\ell}[a,b]^n b^{\ell})$ is conjugate to $w_0 \cdots w_{n-1}$ where

\[ w_k = \sigma(ab^{2n-(2k-1)}, a)^{-1}\,\sigma(ab^{2n-2k}, b)^{-1}\,\sigma(ab^{2n-2k}, a)\,\sigma(b^{2k}, b). \tag{2} \]

For both the $v_k$ and the $w_k$, $0 \le k \le n - 1$, and all exponents are computed modulo $2n$.

As will be seen, each word $v_k$ is the rewriting of the $(k+1)$st occurrence of $[a,b]$ in $[a,b]^n$, and each word $w_k$ is the rewriting of the $(k+1)$st occurrence of $[a,b]$ in $b^{-1}[a,b]^n b$. Before starting the proof, we enumerate some Schreier representatives; the computations depend on the dihedral relations of $D_n$.

Lemma 5.3. For all $m$, we have

(1) $\overline{a^m} = a^{m \bmod 2}$
(2) $\overline{b^m} = b^{m \bmod 2n}$
(3) $\overline{b^m a} = ab^{-m \bmod 2n}$

We will most often use the third clause in the form $\overline{b^m a} = ab^{2n - (m \bmod 2n)}$.
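The three clauses of Lemma 5.3 amount to a normal-form computation in the quotient with relations $a^2 = b^{2n} = abab = 1$. The following is a minimal sketch of that computation, not the authors' program: the function name `rep` and the encoding of a word as a list of (generator, exponent) pairs are assumptions made here for illustration.

```python
def rep(word, n):
    """Return (e, m) such that a^e b^m is the Schreier representative of `word`.

    `word` is a list of (generator, exponent) pairs with generator "a" or "b".
    The quotient satisfies a^2 = b^(2n) = abab = 1, hence b^m a = a b^(-m).
    """
    e, m = 0, 0
    for gen, exp in word:
        if gen == "a":
            if exp % 2:  # an odd power of a acts like a single a
                e, m = (e + 1) % 2, (-m) % (2 * n)
        else:
            m = (m + exp) % (2 * n)
    return e, m


if __name__ == "__main__":
    n = 3
    print(rep([("a", 7)], n))            # clause (1): a^7 reduces to a
    print(rep([("b", 11)], n))           # clause (2): b^11 reduces to b^5
    print(rep([("b", 5), ("a", 1)], n))  # clause (3): b^5 a reduces to a b
```

The pair (e, m) stands for the transversal element $a^e b^m$ of Lemma 5.1, so every word lands on exactly one of the $4n$ representatives.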

5.1.1. The case when $\ell$ is even

We begin by computing $\rho([a,b]^n)$. For $v_k$ corresponding to the $(k+1)$st occurrence of $[a,b]$, as above, one checks

\[ v_0 = \sigma(a, a)^{-1}\,\sigma(ab^{2n-1}, b)^{-1}\,\sigma(ab^{2n-1}, a)\,\sigma(b, b) \]

and further by induction that for $1 \le k \le n - 1$

\[ v_k = \sigma(ab^{2n-2k}, a)^{-1}\,\sigma(ab^{2n-(2k+1)}, b)^{-1}\,\sigma(ab^{2n-(2k+1)}, a)\,\sigma(b^{2k+1}, b). \tag{3} \]

Equation (3) is valid for $0 \le k \le n - 1$ if we agree to reduce exponents modulo $2n$.

Next we consider $\ell > 0$. The last $\sigma$-term for the rewritten $b^{-\ell}$ prefix is $\sigma(b^{2n-\ell}, b)^{-1}$. Using $v'_k$ to denote the subword of $\rho(b^{-\ell}[a,b]^n b^{\ell})$ corresponding to the $(k+1)$st occurrence of $[a,b]$, we have

\[ v'_0 = \sigma(ab^{\ell}, a)^{-1}\,\sigma(ab^{\ell-1}, b)^{-1}\,\sigma(ab^{\ell-1}, a)\,\sigma(b^{2n-(\ell-1)}, b) \]

and, so long as $\ell \ge 2k + 1$,

\[ v'_k = \sigma(ab^{\ell-2k}, a)^{-1}\,\sigma(ab^{\ell-(2k+1)}, b)^{-1}\,\sigma(ab^{\ell-(2k+1)}, a)\,\sigma(b^{2n-(\ell-(2k+1))}, b). \tag{4} \]

The next term depends on the parity of $\ell$. If $\ell = 2p$, the exponent of $b$ in the first term of $v'_p$ is zero, while if $\ell = 2p + 1$, it is not. Letting $\ell = 2p$, we have

\[ v'_p = \sigma(a, a)^{-1}\,\sigma(ab^{2n-1}, b)^{-1}\,\sigma(ab^{2n-1}, a)\,\sigma(b, b) \]

and therefore for $k > p$

\[ v'_k = \sigma(ab^{2n-2(k-p)}, a)^{-1}\,\sigma(ab^{2n-2(k-p)-1}, b)^{-1}\,\sigma(ab^{2n-2(k-p)-1}, a)\,\sigma(b^{2(k-p)+1}, b). \]

The last term of $v'_{n-1}$ is then $\sigma(b^{2n-\ell-1}, b)$, and it is now evident that the suffix resulting from $b^{\ell}$ is the inverse of the prefix resulting from $b^{-\ell}$. Thus for $\ell = 2p$, $\rho(b^{-\ell}[a,b]^n b^{\ell})$ may be cyclically reduced to yield $v'_0 \cdots v'_{n-1}$. For $0 \le k < p$, the first term of $v'_k$ is $\sigma(ab^{\ell-2k}, a)^{-1}$ and, noting that the expression for $v'_k$ when $k > p$ reduces to that for $v'_p$ when $k = p$, the first term of $v'_k$ for $p \le k \le n - 1$ is $\sigma(ab^{2n-2(k-p)}, a)^{-1}$. Thus the exponents of $b$ in these terms, in the order in which they occur, are

\[ \ell,\ \ell - 2,\ \ldots,\ 2,\ 0,\ 2n - 2,\ \ldots,\ \ell + 2. \]

On the other hand, the exponents of $b$ in the first terms of the $v_i$ (not primed), also in the order in which they occur, may be read off from Equation (3) as

\[ 0,\ 2n - 2,\ \ldots,\ \ell + 2,\ \ell,\ \ell - 2,\ \ldots,\ 2. \]

As $2 \le \ell \le 2n - 2$, and as the remaining three terms of $v_i$ and $v'_i$ are determined for each $i$ by the first term of each, it follows that the word $v'_0 \cdots v'_{n-1}$ which survives cyclic reduction is a rotation of $v_0 \cdots v_{n-1}$. Thus for $\ell$ even, the words $\rho(b^{-\ell}[a,b]^n b^{\ell})$ are all conjugate to $\rho([a,b]^n)$.
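The rewriting in this argument is mechanical enough to check by machine for small $n$. The sketch below is an illustration under stated assumptions, not the authors' Scheme program: it encodes a term $\sigma(a^e b^m, x)^{\pm 1}$ as a tuple `((e, m), x, ±1)`, and all function names are hypothetical. It rewrites $b^{-\ell}[a,b]^n b^{\ell}$ letter by letter, cyclically reduces, and compares the result with the rewriting of $[a,b]^n$ (for even $\ell$) or of $b^{-1}[a,b]^n b$ (for odd $\ell$).

```python
COMMUTATOR = [("a", -1), ("b", -1), ("a", 1), ("b", 1)]  # [a,b] = a^-1 b^-1 a b


def move(coset, gen, exp, n):
    """Right-multiply the coset a^e b^m, stored as (e, m), by gen^exp (exp = +1 or -1)."""
    e, m = coset
    if gen == "a":  # a^-1 = a in the quotient, and b^m a = a b^-m
        return ((e + 1) % 2, (-m) % (2 * n))
    return (e, (m + exp) % (2 * n))


def rewrite(word, n):
    """Reidemeister-Schreier rewriting: one (coset, gen, sign) term per letter."""
    coset, terms = (0, 0), []
    for gen, exp in word:
        nxt = move(coset, gen, exp, n)
        # A positive letter contributes sigma(coset, gen); a negative letter
        # contributes sigma(coset * gen^-1, gen)^-1.
        terms.append((coset, gen, 1) if exp == 1 else (nxt, gen, -1))
        coset = nxt
    return terms


def cyclically_reduce(terms):
    """Strip mutually inverse first/last terms until none remain."""
    terms = list(terms)
    while (len(terms) >= 2 and terms[0][:2] == terms[-1][:2]
           and terms[0][2] == -terms[-1][2]):
        terms = terms[1:-1]
    return terms


def is_rotation(u, v):
    return len(u) == len(v) and any(u[i:] + u[:i] == v for i in range(len(u)))


def conjugated_relator(l, n):
    """The word b^-l [a,b]^n b^l as a list of letters."""
    return [("b", -1)] * l + COMMUTATOR * n + [("b", 1)] * l


def v_block(k, n):
    """The four terms of v_k as given in Equation (3), exponents mod 2n."""
    N = 2 * n
    return [((1, (N - 2 * k) % N), "a", -1),
            ((1, (N - 2 * k - 1) % N), "b", -1),
            ((1, (N - 2 * k - 1) % N), "a", 1),
            ((0, (2 * k + 1) % N), "b", 1)]


if __name__ == "__main__":
    n = 3
    even_base = rewrite(COMMUTATOR * n, n)
    odd_base = cyclically_reduce(rewrite(conjugated_relator(1, n), n))
    for l in range(2 * n):
        reduced = cyclically_reduce(rewrite(conjugated_relator(l, n), n))
        print(l, is_rotation(reduced, even_base if l % 2 == 0 else odd_base))
```

Trivial generators are deliberately kept in the output here, matching the convention of Lemma 5.2; they are only dropped later, in Section 5.3.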

5.1.2. The case when $\ell$ is odd

Next consider $\ell = 2p + 1$. The sequence $v'_0 \cdots v'_{n-1}$ begins as for $\ell$ even, with $v'_k$ given by (4) so long as $2k + 1 < \ell$. What happens for $k = p$? Now

\[ v'_p = \sigma(ab, a)^{-1}\,\sigma(a, b)^{-1}\,\sigma(a, a)\,\sigma(1, b) \]

and for $k > p$ (in fact, for $k \ge p$)

\[ v'_k = \sigma(ab^{2n-(2(k-p)-1)}, a)^{-1}\,\sigma(ab^{2n-2(k-p)}, b)^{-1}\,\sigma(ab^{2n-2(k-p)}, a)\,\sigma(b^{2(k-p)}, b) \]

as may be confirmed by induction. Thus the last term prior to those originating with the $b^{\ell}$ suffix is

\[ \sigma(b^{2((n-1)-p)}, b) \quad\text{or}\quad \sigma(b^{2n-\ell-1}, b). \]

So the suffix originating from $b^{\ell}$ is exactly as before, and again $\rho(b^{-\ell}[a,b]^n b^{\ell})$ reduces cyclically to yield $v'_0 \cdots v'_{p-1} v'_p \cdots v'_{n-1}$. Working from the expressions we have given, the exponents of $b$ in the first terms of the respective $v'_k$ include every odd integer between 1 and $2n - 1$, in this order:

\[ \ell,\ \ell - 2,\ \ldots,\ 3,\ 1,\ 2n - 1,\ \ldots,\ \ell + 2. \]

On the other hand, $\rho(b^{-1}[a,b]^n b)$, cyclically reduced, is $w_0 \cdots w_{n-1}$, where

\[ w_k = \sigma(ab^{2n-(2k-1)}, a)^{-1}\,\sigma(ab^{2n-2k}, b)^{-1}\,\sigma(ab^{2n-2k}, a)\,\sigma(b^{2k}, b). \]

The exponents of $b$ in the first terms of the $w_k$ are, respectively, $1, 2n - 1, \ldots, 3$, including every odd integer between 1 and $2n - 1$. The order is a cyclic permutation of that resulting from $\rho(b^{-\ell}[a,b]^n b^{\ell})$.

5.2. The relators $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$

Next we prove

Lemma 5.4. Let $0 \le \ell \le 2n - 1$. If $\ell$ is even, $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$ is conjugate to $x_0 \cdots x_{n-1}$ with

\[ x_k = \sigma(b^{2n-2k}, a)^{-1}\,\sigma(b^{2n-(2k+1)}, b)^{-1}\,\sigma(b^{2n-(2k+1)}, a)\,\sigma(ab^{2k+1}, b). \tag{5} \]

If $\ell$ is odd, $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$ is conjugate to $y_0 \cdots y_{n-1}$ where

\[ y_k = \sigma(b^{2n-(2k+1)}, a)^{-1}\,\sigma(b^{2n-(2k+2)}, b)^{-1}\,\sigma(b^{2n-(2k+2)}, a)\,\sigma(ab^{2k+2}, b). \tag{6} \]

For both the $x_k$ and the $y_k$, $0 \le k \le n - 1$, and all exponents are taken modulo $2n$.

The first $\ell + 1$ terms of $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$ are

\[ \sigma(b^{2n-1}, b)^{-1} \cdots \sigma(b^{2n-\ell}, b)^{-1}\,\sigma(ab^{\ell}, a)^{-1}. \]

The next $n$ terms are $x'_0 x'_1 \cdots x'_{n-1}$, with $x'_k$ resulting from the $(k+1)$st occurrence of $[a,b]$. For $k$ such that $2n - (\ell + 2k + 1) \ge 0$, $x'_k$ is

\[ \sigma(b^{2n-(\ell+2k)}, a)^{-1}\,\sigma(b^{2n-(\ell+2k+1)}, b)^{-1}\,\sigma(b^{2n-(\ell+2k+1)}, a)\,\sigma(ab^{\ell+2k+1}, b). \]

If $\ell = 0$, these are all of the terms arising from $[a,b]^n$. If $\ell > 0$ is even, there exists $p$ such that $\ell + 2p = 2n$. Noting that $p - 1$ then satisfies $2n - (\ell + 2(p-1) + 1) \ge 0$, we can compute the last term of $x'_{p-1}$ as $\sigma(ab^{2n-1}, b)$ and hence

\[ x'_p = \sigma(1, a)^{-1}\,\sigma(b^{2n-1}, b)^{-1}\,\sigma(b^{2n-1}, a)\,\sigma(ab, b). \]

Continuing, for $p \le k \le n - 1$, $x'_k$ is thus readily seen to be

\[ \sigma(b^{2n-2(k-p)}, a)^{-1}\,\sigma(b^{2n-2(k-p)-1}, b)^{-1}\,\sigma(b^{2n-2(k-p)-1}, a)\,\sigma(ab^{2(k-p)+1}, b). \]

Taking $k = n - 1$, we obtain

\[ \sigma(ab^{\ell-1}, b) \]

as the last term preceding the suffix originating from the trailing $ab^{\ell}$. Hence the suffix of $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$ is

\[ \sigma(ab^{\ell}, a)\,\sigma(b^{2n-\ell}, b) \cdots \sigma(b^{2n-1}, b) \]

and again a cyclic reduction of the relator is possible. As in the proof of Lemma 5.2, $x'_0 \cdots x'_{n-1}$ is a cyclic permutation of the cyclic reduction of $\rho(a^{-1}[a,b]^n a)$, readily computed as $x_0 \cdots x_{n-1}$, where

\[ x_k = \sigma(b^{2n-2k}, a)^{-1}\,\sigma(b^{2n-2k-1}, b)^{-1}\,\sigma(b^{2n-2k-1}, a)\,\sigma(ab^{2k+1}, b). \]

If $\ell$ is odd, let $p$ satisfy $\ell + 2p + 1 = 2n$, so that

\[ x'_p = \sigma(b, a)^{-1}\,\sigma(1, b)^{-1}\,\sigma(1, a)\,\sigma(a, b). \]

Subsequent $x'_k$ ($p \le k \le n - 1$) are therefore given by

\[ \sigma(b^{2n-(2(k-p)-1)}, a)^{-1}\,\sigma(b^{2n-2(k-p)}, b)^{-1}\,\sigma(b^{2n-2(k-p)}, a)\,\sigma(ab^{2(k-p)}, b). \]

When $k = n - 1$, the last term of $x'_k$ is

\[ \sigma(ab^{\ell-1}, b) \]

and again the suffix is

\[ \sigma(ab^{\ell}, a)\,\sigma(b^{2n-\ell}, b) \cdots \sigma(b^{2n-1}, b). \]

Reducing cyclically leaves $x'_0 \cdots x'_{n-1}$, and we may again compute the ordered list of exponents of $b$ in the leading terms of each $x'_k$ to see that, when $\ell$ is odd, the words $\rho(b^{-\ell}a^{-1}[a,b]^n a b^{\ell})$ are all rotations of the cyclic reduction of $\rho(b^{-1}a^{-1}[a,b]^n ab)$ given by $y_0 \cdots y_{n-1}$, where

\[ y_k = \sigma(b^{2n-(2k+1)}, a)^{-1}\,\sigma(b^{2n-(2k+2)}, b)^{-1}\,\sigma(b^{2n-(2k+2)}, a)\,\sigma(ab^{2k+2}, b). \]

This completes the proof of Lemma 5.4.

5.3. Parametric form for the kernel of $\phi$

It follows from Lemmas 5.2 and 5.4 that a complete set of relators for the kernel $H$ of $\phi$ is $R = \{r_1, r_2, r_3, r_4\}$ where

\[ r_1 = \rho([a,b]^n), \quad r_2 = \rho(b^{-1}[a,b]^n b), \quad r_3 = \rho(a^{-1}[a,b]^n a), \quad r_4 = \rho(b^{-1}a^{-1}[a,b]^n ab). \]

In this section we combine these relators via Tietze transformations into a single relator with parameter $n$. To do so, we look more closely at the $r_i$, recalling that each is a product of 4-term words. For each $0 \le k \le n - 1$, we abbreviate the four terms of the subword $v_k$ defined in Lemma 5.2 as follows, from left to right:

\[ v_{k,0} = \sigma(ab^{2n-2k}, a)^{-1}, \quad v_{k,1} = \sigma(ab^{2n-(2k+1)}, b)^{-1}, \quad v_{k,2} = \sigma(ab^{2n-(2k+1)}, a), \quad v_{k,3} = \sigma(b^{2k+1}, b). \]

The abbreviations $w_{k,i}$, $x_{k,i}$ and $y_{k,i}$ for $0 \le k \le n - 1$ and $0 \le i \le 3$ are defined analogously, using the expansions given in Lemma 5.2 and Lemma 5.4.

Our first task is to identify those terms occurring in the $r_i$ which are freely equal to 1. While including these terms to this point has simplified the derivation and statement of Lemmas 5.2 and 5.4, our subsequent analysis is simplified if we elide them. From Lemma 3.2 we have

Lemma 5.5. $\sigma(ab^m, a)$ is trivial for no value of $m$, $\sigma(b^m, a)$ is trivial if and only if $m = 0$, $\sigma(ab^m, b) = 1$ if and only if $0 \le m < 2n - 1$, $\sigma(b^m, b) = 1$ if and only if $0 \le m < 2n - 1$, and both $\sigma(1, a)$ and $\sigma(1, b)$ are trivial.
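Lemma 5.5 can be checked directly for small $n$: a Schreier generator $\sigma(K, x) = Kx\,\overline{Kx}^{-1}$ is freely trivial exactly when the word $Kx$ is already its own representative. The following sketch assumes the transversal of Lemma 5.1, with representatives written as the positive words $a^e b^m$; the function name `sigma_is_trivial` is hypothetical.

```python
def sigma_is_trivial(e, m, gen, n):
    """True if sigma(a^e b^m, gen) is freely equal to 1.

    Both K*gen and its representative are positive words, so
    K * gen * rep(K * gen)^-1 freely reduces to 1 iff the two words coincide.
    """
    N = 2 * n
    word = "a" * e + "b" * m + gen
    if gen == "a":  # b^m a = a b^(-m) in the quotient
        e2, m2 = (e + 1) % 2, (-m) % N
    else:
        e2, m2 = e, (m + 1) % N
    return word == "a" * e2 + "b" * m2


if __name__ == "__main__":
    n = 3
    nontrivial = [(e, m, g) for e in (0, 1) for m in range(2 * n) for g in "ab"
                  if not sigma_is_trivial(e, m, g, n)]
    print(len(nontrivial))  # number of free generators before imposing relators
```

For index $4n$ in a free group of rank 2, the Schreier index formula gives rank $4n(2 - 1) + 1 = 4n + 1$, which is the count of nontrivial generators this sketch should report.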

It follows readily that an equivalent set $R' = \{r'_1, r'_2, r'_3, r'_4\}$ of relators for $H$ in which no trivial terms appear is given by

\[ r'_1 = v_{0,0}\,v_{0,1}\,v_{0,2}\ v_{1,0}\,v_{1,2} \cdots v_{n-2,0}\,v_{n-2,2}\ v_{n-1,0}\,v_{n-1,2}\,v_{n-1,3} \]
\[ r'_2 = w_{0,0}\,w_{0,2} \cdots w_{n-1,0}\,w_{n-1,2} \]
\[ r'_3 = x_{0,1}\,x_{0,2}\ x_{1,0}\,x_{1,2} \cdots x_{n-2,0}\,x_{n-2,2}\ x_{n-1,0}\,x_{n-1,2}\,x_{n-1,3} \]
\[ r'_4 = y_{0,0}\,y_{0,2} \cdots y_{n-2,0}\,y_{n-2,2}\ y_{n-1,0} \]

We observe next that for $0 \le i < n - 1$, $v_{i,0} = w_{i,2}^{-1}$ and $v_{i,2} = w_{i+1,0}^{-1}$, and further that $v_{n-1,0} = w_{n-1,2}^{-1}$ and $v_{n-1,2} = w_{0,0}^{-1}$. Similarly, for $0 < i \le n - 1$, $x_{i,0} = y_{i-1,2}^{-1}$ and $x_{i,2} = y_{i,0}^{-1}$. Further, $x_{0,2} = y_{0,0}^{-1}$. In addition, we may use Lemma 3.3 to demonstrate that no generator occurs more than once in any of the $r'_i$, and hence that each (viewed as the equation $r'_i = 1$) may be solved for any one of its terms.

Accordingly, we may replace each term of $r'_1$ except $v_{0,1}$ and $v_{n-1,3}$ by the inverse of the corresponding term of $r'_2$, solve the result for $w_{0,0}$, and then substitute this expression for $w_{0,0}$ in $r'_2$ to give

\[ R_1 = v_{n-1,3}\,\big(w_{0,2}^{-1}\,v_{0,1}\,w_{1,0}^{-1}\big)\big(w_{1,2}^{-1}\,w_{2,0}^{-1}\big) \cdots \big(w_{n-2,2}^{-1}\,w_{n-1,0}^{-1}\big)\,w_{n-1,2}^{-1}\ \big(w_{0,2} \cdots w_{n-1,0}\,w_{n-1,2}\big) \]

in place of $r'_1$ and $r'_2$. Observing that there is no interaction between $w_{0,2}$ and either of $v_{n-1,3}$ or $v_{0,1}$, we see that $R_1$ is freely reduced as written. Similarly, $r'_3$ and $r'_4$ may be replaced by $R_2$, obtained via simple Tietze transformations eliminating $y_{0,0}$ after replacing terms of $r'_3$ by inverses of the corresponding terms of $r'_4$. In detail,

\[ R_2 = \Big(\big(y_{0,2}^{-1}\,y_{1,0}^{-1}\big) \cdots \big(y_{n-3,2}^{-1}\,y_{n-2,0}^{-1}\big)\big(y_{n-2,2}^{-1}\,y_{n-1,0}^{-1}\big)\,x_{n-1,3}\,x_{0,1}\Big)\ \big(y_{0,2} \cdots y_{n-2,0}\,y_{n-2,2}\ y_{n-1,0}\big) \]

again freely reduced as written. (In both equations, parentheses have been inserted for readability.)

Finally, we observe (i) that the unmatched term $x_{0,1}$ in $R_2$ is in fact the inverse of $v_{n-1,3}$, unmatched in $R_1$, (ii) similarly that $x_{n-1,3}$ is the inverse of $v_{0,1}$, (iii) that the sets of generators occurring in $R_1$ and $R_2$ are otherwise disjoint, and (iv) that, except for the generators of the unmatched terms, every generator which occurs in $R_1$ occurs precisely twice in $R_1$, once with exponent $-1$ and once with exponent 1, and similarly for $R_2$. So we may solve $R_1$ for $v_{n-1,3}$ and substitute the resulting expression for $x_{0,1}$ in $R_2$ to obtain $R$ with the property that each generator occurs either not at all, or once with exponent $-1$ and once with exponent 1:

\[ R = y_{0,2}^{-1}\,P_1\,y_{n-1,0}^{-1}\ v_{0,1}^{-1}\,w_{0,2}^{-1}\,v_{0,1}\,Q_1\,w_{0,2}\,Q_2\ y_{0,2}\,P_2\,y_{n-1,0} \tag{7} \]

where

\[ P_1 = \prod_{1 \le k \le n-2} \big(y_{k,0}^{-1}\,y_{k,2}^{-1}\big), \qquad P_2 = \prod_{1 \le k \le n-2} \big(y_{k,0}\,y_{k,2}\big), \]
\[ Q_1 = \prod_{1 \le k \le n-1} \big(w_{k,0}^{-1}\,w_{k,2}^{-1}\big), \qquad Q_2 = \prod_{1 \le k \le n-1} \big(w_{k,0}\,w_{k,2}\big). \]
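The generator count and the surface property claimed for the relator $R$ of Equation (7) can be checked for small $n$ by assembling $R$ from the definitions of $v_{0,1}$, $w_{k,i}$ and $y_{k,i}$. This is an illustrative sketch only: the tuple encoding `((e, m), x, sign)` for $\sigma(a^e b^m, x)^{\pm 1}$ and the name `surface_relator` are assumptions introduced here.

```python
from collections import Counter


def surface_relator(n):
    """Assemble the relator R of Equation (7) as a list of (coset, gen, sign) terms."""
    N = 2 * n

    def inv(term):
        return (term[0], term[1], -term[2])

    v01 = ((1, N - 1), "b", -1)  # v_{0,1} = sigma(a b^(2n-1), b)^-1

    def w(k, i):  # w_{k,0} and w_{k,2} from Lemma 5.2
        if i == 0:
            return ((1, (N - 2 * k + 1) % N), "a", -1)
        return ((1, (N - 2 * k) % N), "a", 1)

    def y(k, i):  # y_{k,0} and y_{k,2} from Lemma 5.4
        if i == 0:
            return ((0, (N - 2 * k - 1) % N), "a", -1)
        return ((0, (N - 2 * k - 2) % N), "a", 1)

    P1 = [inv(y(k, i)) for k in range(1, n - 1) for i in (0, 2)]
    P2 = [y(k, i) for k in range(1, n - 1) for i in (0, 2)]
    Q1 = [inv(w(k, i)) for k in range(1, n) for i in (0, 2)]
    Q2 = [w(k, i) for k in range(1, n) for i in (0, 2)]

    return ([inv(y(0, 2))] + P1 + [inv(y(n - 1, 0))]
            + [inv(v01), inv(w(0, 2)), v01] + Q1
            + [w(0, 2)] + Q2
            + [y(0, 2)] + P2 + [y(n - 1, 0)])


if __name__ == "__main__":
    for n in range(2, 6):
        R = surface_relator(n)
        generators = {(coset, gen) for coset, gen, _ in R}
        signs = Counter(R)
        print(n, len(generators), all(c == 1 for c in signs.values()))
```

For each $n$ the sketch should report $4n - 2$ distinct generators, each occurring once with each sign, in line with the statement of Theorem 5.1.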

This completes the proof of Theorem 5.1.

6. The kernel $K$ of $\psi : H \to \langle z;\ \rangle$ is free

Consider a one-relator group $H = \langle X; r \rangle$ where $r$ is a surface word. Let $C_\infty = \langle z;\ \rangle$ be the infinite cyclic group. Choose some $x \in X$ which occurs in $r$. If $\psi : H \to C_\infty$ is any map sending $x$ to $z$ and all other generators to 1, then, because $r$ has the surface property, $\psi$ extends to a homomorphism. Further,

Theorem 6.1. The kernel $K$ of this homomorphism $\psi : H \to C_\infty$ is free.

This theorem coupled with Theorem 5.1 proves that the groups $G_n$ are virtually free by cyclic. To prove Theorem 6.1 we apply the Reidemeister-Schreier process to compute a presentation for the kernel $K$ of $\psi$. Observe that a Schreier transversal for $K$ is $\{x^k \mid k \in \mathbb{Z}\}$, and that the Schreier representative for a word $w$ is $x^k$, where $k$ is the exponent sum of $x$ in $w$. Assuming first that $r$ has the form

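\[ r = u_1^{\delta_1} \cdots u_m^{\delta_m}\ x\ v_1^{\epsilon_1} \cdots v_p^{\epsilon_p}\ x^{-1}\ w_1^{\zeta_1} \cdots w_q^{\zeta_q} \]

where $\delta_i$, $\epsilon_i$ and $\zeta_i$ are all either 1 or $-1$, where by the surface property none of $u_1, \ldots, u_m$, $v_1, \ldots, v_p$, or $w_1, \ldots, w_q$ are $x$, and where $x$ precedes $x^{-1}$, we examine $\rho(x^{-k} r x^{k})$. Abbreviating $u_1^{\delta_1} \cdots u_m^{\delta_m}$ by $u$, $v_1^{\epsilon_1} \cdots v_p^{\epsilon_p}$ by $v$, and $w_1^{\zeta_1} \cdots w_q^{\zeta_q}$ by $w$, one computes

• $\sigma(x^{-j}, x) = 1$ for all $1 \le j \le k$
• $\sigma(x^{-k} u_1^{\delta_1} \cdots u_{i-1}^{\delta_{i-1}} u_i^{-1}, u_i)^{-1} = (x^{-k} u_i x^{k})^{-1}$ and $\sigma(x^{-k} u_1^{\delta_1} \cdots u_{i-1}^{\delta_{i-1}}, u_i) = x^{-k} u_i x^{k}$
• $\sigma(x^{-k} u, x) = 1$
• $\sigma(x^{-k} u x v_1^{\epsilon_1} \cdots v_i^{-1}, v_i)^{-1} = (x^{-k+1} v_i x^{k-1})^{-1}$ and $\sigma(x^{-k} u x v_1^{\epsilon_1} \cdots v_{i-1}^{\epsilon_{i-1}}, v_i) = x^{-k+1} v_i x^{k-1}$
• $\sigma(x^{-k} u x v x^{-1}, x)^{-1} = 1$
• $\sigma(x^{-k} u x v x^{-1} w_1^{\zeta_1} \cdots w_i^{-1}, w_i)^{-1} = (x^{-k} w_i x^{k})^{-1}$ and $\sigma(x^{-k} u x v x^{-1} w_1^{\zeta_1} \cdots w_{i-1}^{\zeta_{i-1}}, w_i) = x^{-k} w_i x^{k}$
• $\sigma(x^{-k} u x v x^{-1} w x^{j}, x) = 1$ for all $1 \le j \le k$

Thus each term in the subgroup relator expands to a conjugate. Abbreviating $x^{-k} y_i x^{k}$ as $y_{i,k}$, we have

[...]

We may assume that $x$ was chosen so that, in $r$, none of $v_1^{\epsilon_1}, \ldots, v_p^{\epsilon_p}$ has its mate in $v$. Noting that $v$ is not empty ($r$ is freely reduced), it follows that there is some letter $v_j$ of $v$ whose mate lies in one of the segments $u$ or $w$. From the detailed computation one sees then that for all $k$, $\rho(x^{-k} r x^{k})$ includes single occurrences of terms $v_{j,k}$ and $v_{j,k+1}$. That is, the $k$th relator, for $k$ an arbitrary integer, has one of the forms

[...]

or

[...]

where $a_k$, $b_k$ and $c_k$ are words containing neither $v_{j,k}$ nor $v_{j,k+1}$. Here $\epsilon \in \{1, -1\}$.

As a similar statement holds if $x^{-1}$ precedes $x$ in $r$, it follows readily that $K$ is free.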

8. Virtually locally free by cyclic groups

Having now given our explicit proof that the $G_n$ are virtually free-by-cyclic, we want to present an example which may have some bearing on the conjecture that one-relator groups with torsion are virtually free-by-cyclic.

Example 6.1. The one-relator group

\[ G = \langle a, b;\ (b^{-1} a^{2} b a^{-3})^{2} = 1 \rangle \]

has a subgroup $K$ of index 2 with the following properties:

• $K$ contains a normal subgroup $L$ such that $K/L$ is infinite cyclic;
• $L$ is locally free;
• $L$ is not free.

It seems possible that the group $G$ in this example is not virtually free by cyclic. Its structure suggests that it may not be.

In order to see that $G$ has the properties claimed above, consider the homomorphism $\phi$ of $G$ onto the group of order 2 generated by $t$ which sends $b$ to 1 and $a$ to $t$. Then, using the method of Reidemeister and Schreier, it follows that the kernel $K$ of $\phi$ is of index 2 in $G$, is generated by $x = a^{2}$, $y = b$ and $z = a^{-1}ba$, and is defined in terms of these generators by the single relation $y^{-1} x y x^{-1} z^{-1} x z x^{-2} = 1$. Now put

\[ x_i = y^{-i} x y^{i}, \qquad z_i = y^{-i} z y^{i} \qquad (i \in \mathbb{Z}). \]

Then, it follows from the method of W. Magnus [10] used to solve the word problem for one-relator groups, or directly from the method of Reidemeister and Schreier, that the normal closure $L$ of $x$ and $z$ in $K$ is defined by the relations

\[ x_{i+1} = x_i^{2}\,z_i^{-1}\,x_i^{-1}\,z_i\,x_i \qquad (i \in \mathbb{Z}). \]

Now let $L_0$ be the subgroup of $L$ generated by all of the $z_i$ ($i \in \mathbb{Z}$) together with $x_0$. Notice that $x_1 \in L_0$ and hence $x_2 \in L_0$ and similarly $x_i \in L_0$ for every $i \ge 0$. Notice, by the same token, that if we define $L_{-1}$ to be the subgroup of $L$ generated by all of the $z_j$ and $x_{-1}$, then $L_0 \le L_{-1}$ with $x_0 = x_{-1}^{2}\,z_{-1}^{-1}\,x_{-1}^{-1}\,z_{-1}\,x_{-1}$. More generally if we now define $L_{-i}$ to be the subgroup of $L$ generated by $x_{-i}$ and all of the $z_j$, for $i = 0, 1, \ldots$, we find that

\[ L_0 \le L_{-1} \le L_{-2} \le \cdots \]

and

\[ L = \bigcup_{i=0}^{\infty} L_{-i}. \]

We claim that each of the $L_{-i}$ is free on the generators $x_{-i}$ together with all of the $z_j$. Indeed we can describe this union by generators and relations by taking $L_0$ to be free on the $z_j$ together with $x_0$. Then $L_{-1}$ can be taken to be free on the $z_j$ together with $x_{-1}$, and $L_0$ is identified with a subgroup of $L_{-1}$ by identifying the $z_j$ and identifying $x_0$ with $x_{-1}^{2}\,z_{-1}^{-1}\,x_{-1}^{-1}\,z_{-1}\,x_{-1}$. Since $x_{-1}^{2}\,z_{-1}^{-1}\,x_{-1}^{-1}\,z_{-1}\,x_{-1}$ together with all of the $z_j$ freely generate a free subgroup of the free group $L_{-1}$, this identification does define an embedding of $L_0$ in $L_{-1}$. The resulting system of generators and relations therefore defines an ascending union of free groups which is nothing more than the description of $L$ in terms of the generators and relations that we obtained before. Consequently $L$ is indeed a union of free groups and is therefore locally free.

It remains to prove that it is not free. In order to see that this is the case, we abelianize $L$. This abelianization can be described as an abelian group with generators $x_0, x_{-1}, x_{-2}, \ldots$ and the $z_j$ subject to the relations

\[ x_{-i} = x_{-i-1}^{2} \qquad (i \ge 0). \]

Thus $L$ abelianized is the direct product of a free abelian group on the $z_j$ and a copy of the dyadic fractions, i.e., a multiplicative copy of the additive group of rational numbers of the form $\ell / 2^{m}$. But the abelianization of a free group is free abelian. Hence $L$ is not free. Notice now that $K/L$ is infinite cyclic, so we have proved that $G$ is virtually locally free by cyclic. Since a subgroup of a virtually free by cyclic group is again virtually free by cyclic, G is not virtually free by cyclic.
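The defining relation of $K$ quoted in Example 6.1 can be confirmed by substituting $x = a^2$, $y = b$, $z = a^{-1}ba$ and freely reducing. The sketch below is a hedged illustration: it encodes $a^{-1}$ and $b^{-1}$ as the uppercase letters `A` and `B`, a convention chosen here rather than taken from the text, and the helper names are hypothetical.

```python
def inverse(word):
    """Inverse of a word; uppercase letters stand for inverse generators."""
    return word[::-1].swapcase()


def free_reduce(word):
    """Cancel adjacent inverse pairs such as 'aA' or 'Bb' using a stack."""
    stack = []
    for letter in word:
        if stack and stack[-1] == letter.swapcase():
            stack.pop()
        else:
            stack.append(letter)
    return "".join(stack)


# Images of the generators of K in the free group on a, b.
x, y, z = "aa", "b", "Aba"

# The relation y^-1 x y x^-1 z^-1 x z x^-2 written out in a, b.
relation = inverse(y) + x + y + inverse(x) + inverse(z) + x + z + inverse(x) * 2

# The relator of G: (b^-1 a^2 b a^-3)^2.
relator_of_G = "BaabAAA" * 2

if __name__ == "__main__":
    print(free_reduce(relation) == relator_of_G)
```

Because the element $b^{-1}a^{2}ba^{-3}$ has odd exponent sum in $a$, it lies outside $K$, and the rewriting of its square is a single word in $x$, $y$, $z$; the check above only confirms that this word maps back to the relator of $G$.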

7. Computational support

As indicated in the Introduction, a computer implementation of Reidemeister-Schreier will be most useful for the application we have in mind here if it can work with the infinitely generated and infinitely related groups which arise routinely when one seeks suitable normal subgroups $K$ of $H$. Similarly, the usual Tietze transformation package provided for cleaning up the presentations resulting from Reidemeister-Schreier needs to be extended to provide assistance dealing with possibly infinite relator sets. A third requirement is for machine assistance in searching for patterns of the kind exposed in Section 6. Our program, exploiting streams and amb-based backtracking, has these capabilities. It is written in Scheme (see the PLT Scheme pages, http://www.plt-scheme.org, as well as Abelson and Sussman [1]), and may be found on our website (http://www.caissny.org).

References

1. Harold Abelson and Gerald J. Sussman. Structure and Interpretation of Computer Programs. MIT Press, Cambridge, MA, USA, 1996.
2. Gilbert Baumslag. Residually finite one-relator groups. Bull. Amer. Math. Soc., 73:618-620, 1967.
3. Gilbert Baumslag. Finitely generated cyclic extensions of free groups are residually finite. Bull. Austral. Math. Soc., 5:87-94, 1971.
4. Gilbert Baumslag. Some problems on one-relator groups. Pages 75-81. Lecture Notes in Math., Vol. 372, 1974.
5. Gilbert Baumslag, Charles F. Miller III, and Douglas Troeger. Virtually free-by-cyclic one-relator groups. II. Article in preparation.
6. V. Egorov. The residual finiteness of certain one-relator groups. Pages 100-121, 1981.
7. M. Feighn and M. Handel. Mapping tori of free group automorphisms are coherent. Preprint, 1997.
8. A. Howard M. Hoare, Abraham Karrass, and Donald Solitar. Subgroups of finite index of Fuchsian groups. Mathematische Zeitschrift, 120:289-298, 1971.
9. A. Karrass, W. Magnus, and D. Solitar. Elements of finite order in groups with a single defining relation. Comm. Pure Appl. Math., 13:57-66, 1960.
10. W. Magnus. Über diskontinuierliche Gruppen mit einer definierenden Relation (Der Freiheitssatz). J. Reine Angew. Math., 163:141-165, 1930.
11. Wilhelm Magnus, Abraham Karrass, and Donald Solitar. Combinatorial group theory: Presentations of groups in terms of generators and relations. Interscience Publishers [John Wiley & Sons, Inc.], New York-London-Sydney, 1966.
12. J. P. McCammond and D. T. Wise. Coherence, local quasiconvexity, and the perimeter of 2-complexes. Geom. Funct. Anal., 15(4):859-927, 2005.
13. Daniel T. Wise. The residual finiteness of positive one-relator groups. Comment. Math. Helv., 76(2):314-338, 2001.

SOME CRYPTOPRIMITIVES IN NONCOMMUTATIVE ALGEBRAIC CRYPTOGRAPHY Gilbert Baumslag

Department of Computer Science, City College of New York, New York, N. Y. 10031 Yegor Bryukhov

Department of Computer Science, City College of New York, New York, N. Y. 10031 Benjamin Fine

Department of Mathematics, Fairfield University, Fairfield, Connecticut 06430, United States Gerhard Rosenberger

Fachbereich Mathematik, Universität Dortmund, 44227 Dortmund, Federal Republic of Germany

Dedicated to A. M. Gaglione on the occasion of his 60th birthday.

Abstract: Recently there has been an active line of research on noncommutative algebraic cryptography. This involves the use of noncommutative algebraic objects as the platforms for encryption systems. Most of this work, such as the Anshel-Anshel-Goldfeld scheme, the Ko-Lee scheme and the Baumslag-Fine-Xu modular group scheme, uses nonabelian groups as the basic algebraic object. Some of these encryption methods have been successful and some have been broken. It has been suggested that at this point further pure group theoretic research, with an eye towards cryptographic applications, is necessary. In the present study, the start of a large project, we discuss various methods for using noncommutative algebraic objects to develop cryptographic schemes and cryptoprimitives. The project has several parts: the development of general algebraic schemes for cryptoprimitives, the implementation of appropriate platforms for these schemes, and finally an analysis of the potential security of the schemes.

1. Introduction

Most common public key cryptosystems and public key exchange protocols presently in use, such as the RSA algorithm, Diffie-Hellman, and elliptic curve methods are number theory based and hence depend on the structure of abelian groups. The strength of computing machinery has made these techniques theoretically susceptible to attack and hence recently there has been an active line of research to develop cryptosystems and key exchange protocols using noncommutative cryptographic platforms. This line of investigation has been given the broad title of noncommutative algebraic cryptography. Up to this point the main sources for noncommutative cryptographic platforms have been nonabelian groups. In cryptosystems based on these objects, algebraic properties of the platforms are used prominently in both devising cryptosystems and in cryptanalysis. In particular the nonsolvability of certain algorithmic problems in finitely presented groups, such as the conjugator search problem, has been crucial in encryption and decryption. The important sources of nonabelian groups that can be used in cryptosystems are combinatorial group theory and linear group theory. Braid group cryptography, where encryption is done within the classical braid groups, is one prominent example. The one way functions in braid group systems are based on the difficulty of solving group theoretic decision problems such as the conjugacy problem and the conjugator search problem (see [AAG] and [KoL]. Although braid group cryptography had initial spectacular success, various potential attacks have been identified. Borovik, Myasnikov, Shpilrain [BMS] and others have studied the statistical aspects of these attacks and have identitifed what are termed black holes in the platform groups. These are subsets of the platform groups having the property that choosing cryptographic keys from the group outside of these subsets presents cryptographic problems. 
In [BFX 1] and [X] potential cryptosystems using a combination of combinatorial group theory and linear groups were suggested and a general schema for these types of cryptosystems was given. In [BFX 2] a public key version of this schema using the classical modular group as a platform was presented. A cryptosystem using the extended modular group $SL_2(\mathbb{Z})$ was developed by Yamamura ([Y]) but was subsequently shown to have loopholes ([BG],[S],[HGS]). In [BFX 2] the loopholes exploited by these attacks were closed. It has been suggested that at this point further pure group theoretic research, and algebraic research in general, with an eye towards cryptographic applications, is necessary. In particular, although the present braid group cryptosystems may be attackable, the basic group theoretic ideas are important. What is then necessary is to look at other (nonabelian) group theoretical methods as well as additional potential platform groups.

Along these lines in [BCFRX] an approach was followed based on a nonabelian group having either a large abelian subgroup or two large subgroups which elementwise commute. Using this idea a general key transport protocol modeled on the classical Diffie-Hellman technique but using a nonabelian group was developed. Several potential groups that could be used as platforms were described there, in particular the automorphism group of a free group.

The purpose of this paper is to introduce noncommutative algebraic cryptography to a wider audience and to discuss three general schemes for developing cryptoprimitives in noncommutative cryptographic protocols. Here at first our cryptographic goals are modest. The model we use is that of sending an encrypted message and/or encryption key over public airwaves. Our adversary can view the encrypted message and in addition knows the general technique that we are employing. Our hidden secrets are the actual algebraic objects used and certain parameters of the algebraic objects. This is just a first step and in later work we will expand both the model and the capability of the adversary. In the present paper we consider two parts: the general scheme itself and potential algebraic platforms where the schemes can be implemented. This is an early part of a general project and a detailed study of cryptanalysis for these methods will follow.

All three schemes involve a combination of combinatorial group theory, representation theory and free group rewriting methods. The first two schemes have appeared in various papers so we discuss them only briefly and concentrate on the third. The first method is a group theoretic version of the classical Diffie-Hellman method and is a generalization of the Anshel-Anshel-Goldfeld scheme.
The second method uses a random choice of a subgroup in a linear group followed by a noise factor as the hard problem in group theory needed for encryption. It is a free group polyalphabetic cipher. The final scheme uses the difficulty of factoring in a noncommutative ring. In this scheme we employ a type of Shamir 3-pass (see [CJ]). In particular, relative to this scheme, we explore several different methods to use a formal power series ring $R\langle\langle x_1, \ldots, x_n \rangle\rangle$ in noncommuting variables $x_1, \ldots, x_n$ as a base to develop cryptosystems. Although $R$ can be any ring we have in mind formal power series rings over the rationals $\mathbb{Q}$. Further we utilize a result of Magnus that a finitely generated free group $F$ has a faithful representation in a quotient of the formal power series ring in noncommuting variables.

In the next section we describe the basic ideas of free group cryptography. In section 3 we describe the general group theoretic Diffie-Hellman method. We briefly mention some potential platforms. In section 4 we describe the general method of Baumslag, Fine and Xu and the potential platforms for this method. In section 5 we describe a general method for using the difficulty of factoring in a noncommutative ring to build a cryptosystem. As a potential platform for this method we suggest the formal power series ring mentioned above. In section 6 we look at some necessary mathematical results in this ring that are needed for cryptography.

2. The Basics of Free Group Cryptography

The basic idea in using combinatorial group theory for cryptography is that elements of groups can be expressed as words in some alphabet. If there is an easy method to rewrite group elements in terms of these words and further the technique used in this rewriting process can be supplied by a secret key then a cryptosystem can be created. The simplest example is perhaps a free group cryptosystem. This can be described in the following manner. We will use the books by Magnus, Karrass and Solitar [MKS] or Baumslag [B] as standard references for material on combinatorial group theory. All of the proposed schemes in this paper are based in a general sense on free group cryptography.

Consider a free group $F$ on free generators $x_1, \ldots, x_r$. Then each element $g$ in $F$ has a unique expression as a word $W(x_1, \ldots, x_r)$. Let $W_1, \ldots, W_k$ with $W_i = W_i(x_1, \ldots, x_r)$ be a set of words in the generators $x_1, \ldots, x_r$ of the free group $F$. At the most basic level, to construct a cryptosystem, suppose that we have a plaintext alphabet $A$. For example suppose $A = \{a, b, \ldots\}$ are the symbols needed to construct meaningful messages in English. To encrypt, use a substitution ciphertext

$$A \to \{W_1, \ldots, W_k\}.$$

That is

$$a \mapsto W_1, \quad b \mapsto W_2, \quad \ldots$$
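As a minimal sketch of such a substitution, assume a toy alphabet {a, b, c} and the hypothetical choice of words $W_i = y^i x y^{-i}$ in the free group on x and y. These particular words, the string encoding (uppercase letters stand for inverses) and all function names are illustrative assumptions, not taken from the cited sources. The words $y^i x y^{-i}$ freely generate a subgroup, so a freely reduced product of them can be read back from left to right by tracking the current power of y.

```python
ALPHABET = "abc"  # toy plaintext alphabet (assumption for illustration)


def free_reduce(word):
    """Cancel adjacent inverse pairs such as 'xX' or 'Yy' until none remain."""
    stack = []
    for ch in word:
        if stack and stack[-1] == ch.swapcase():
            stack.pop()          # ch cancels against the previous letter
        else:
            stack.append(ch)
    return "".join(stack)


def generator(i):
    """Hypothetical word W_i = y^i x y^-i for the i-th plaintext letter."""
    return "y" * i + "x" + "Y" * i


KEY = {letter: generator(i) for i, letter in enumerate(ALPHABET)}


def encrypt(plaintext):
    """Substitute each letter by its word and freely reduce the product."""
    return free_reduce("".join(KEY[ch] for ch in plaintext))


def decrypt(ciphertext):
    """Read left to right: each 'x' met at y-height h encodes letter number h."""
    height, letters = 0, []
    for ch in ciphertext:
        if ch == "y":
            height += 1
        elif ch == "Y":
            height -= 1
        elif ch == "x":
            letters.append(ALPHABET[height])
        else:
            raise ValueError("unexpected inverse generator in ciphertext")
    return "".join(letters)


cipher = encrypt("bc")
message = decrypt(cipher)
```

Here `encrypt("bc")` multiplies `yxY` by `yyxYY`, and free reduction cancels the inner `Yy` pair before the word is sent. A bare substitution of this kind only makes the encoding concrete; it offers no protection by itself.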

Then given a word $W(a, b, \ldots)$ in the plaintext alphabet form the free group word $W(W_1, W_2, \ldots)$. This represents an element $g$ in $F$. Send out $g$ as the secret message. In order to implement this scheme we need a concrete representation of $g$ and then for decryption a way to rewrite $g$ back in terms of $W_1, \ldots, W_k$. This concrete representation is the idea behind homomorphic cryptosystems (see the article of Grigoriev and Ponomarenko [GP]).

The decryption algorithm then depends on a very important idea that we will need later and which is known as the Reidemeister-Schreier rewriting process (see [MKS] for full details). Assume $W_1, \ldots, W_k$ are free generators for some subgroup $H$ of $F$. A Schreier transversal for $H$ is a set $\{h_1, \ldots, h_t, \ldots\}$ of (left) coset representatives for $H$ in $F$ of a special form (again see [MKS] for particular details). Any subgroup of a free group has a Schreier transversal. The Reidemeister-Schreier process allows one to construct a set of generators $W_1, \ldots, W_k$ for $H$ by using a Schreier transversal. Further, given the Schreier transversal from which the set of generators for $H$ was constructed, the Reidemeister-Schreier rewriting process allows us to algorithmically rewrite an element of $H$. Given such an element expressed as a word $W = W(x_1, \ldots, x_r)$ in the generators of $F$ this algorithm rewrites $W$ as a word $W^*(W_1, \ldots, W_k)$ in the generators of $H$. The actual algorithm is described in detail in [MKS] and [B]. An important feature of the Reidemeister-Schreier rewriting process is that to rewrite a word

$$W = x_{i_1}^{\epsilon_{i_1}} x_{i_2}^{\epsilon_{i_2}} \cdots x_{i_n}^{\epsilon_{i_n}}$$

where $\epsilon_{i_j} \in \{-1, 1\}$ and $i_j \in \{1, 2, \ldots, r\}$, the algorithm rewrites letter by letter from left to right (see [MKS] for complete details). This was an important feature of the Baumslag-Fine-Xu modular group scheme (see [BFX 1,2]). One of the earliest descriptions of a free group cryptosystem as well as a homomorphic version of it was in a paper by W. Magnus in the early 1970's [M]. Pure free group cryptosystems are subject to various attacks, especially length-based attacks, and can be broken easily (see [GP] and [St]). Therefore enhancements must be made to free group cryptosystems to provide some security.

3. A General Schema for Nonabelian Group Diffie-Hellman

The groundwork for the present noncommutative algebraic cryptography was laid in two seminal papers by Anshel-Anshel-Goldfeld [AAG] in 1999 and Ko-Lee [KoL] in 2000. The resulting proposed cryptographic schemes can be considered as group theoretic analogs of the number theory based Diffie-Hellman method. Both methods are entirely general. However Anshel-Anshel-Goldfeld proposed using the classical braid groups and certain fixed parameters as a particular platform. Cryptanalysis has been done on both methods but only relative to the platforms and parameters in these platforms that the developers suggested.

In [BCFRX] the following generalization of the AAG scheme was described. Suppose that $G$ is a finitely presented group that can be represented in a nice way, either as a matrix group or as words relative to a nice presentation. Further suppose that $G$ has two large subgroups $A_1, A_2$ that commute elementwise. Alternatively we could use one large abelian subgroup $A$ of $G$. The meaning of large is of course hazy but relative to the encryption scheme it means that within $G$ it is difficult to determine when an arbitrary element is in $A_1$ or $A_2$ (or $A$) and further that $A_1$ and $A_2$ (or $A$) are large enough so that random choices can be made from them. For our purposes large can mean that $A_1$ and $A_2$ both contain nonabelian free subgroups.

Now suppose that Bob wants to communicate with Alice via an open airway. The message (or the secret key telling them which encryption system to use) is encoded within the finitely generated group $G$ with the properties given above. The two subgroups $A_1, A_2$ which commute elementwise are kept secret by Bob and Alice. $A_1$ is the subgroup for Bob and $A_2$ the subgroup for Alice. Bob wants to send the key $W \in G$ to Alice. He chooses two random elements $B_1, B_2 \in A_1$ and sends Alice the message (in encrypted form) $B_1 W B_2$. Alice now chooses two random elements $C_1, C_2 \in A_2$ and sends $C_1 B_1 W B_2 C_2$ back to Bob. These messages appear in the representation of $G$, for example as matrices or as reduced words in the generators, so they do not appear solely as a concatenation of letters. Since $A_1$ commutes elementwise with $A_2$ we have

$$C_1 B_1 W B_2 C_2 = B_1 C_1 W C_2 B_2.$$

Further since Bob knows his chosen elements $B_1$ and $B_2$ he can multiply by their inverses to obtain $C_1 W C_2$ which he then sends back to Alice. Since Alice knows her chosen elements $C_1, C_2$ she can multiply by their inverses to obtain the key $W$. It is assumed that for each message Bob and Alice would choose different pairs of random elements from either $A_1$ or $A_2$.

This method is a variation of a suggestion of Shamir now known as a Shamir 3-Pass or Three Pass Protocol (see [CJ]). We will apply a similar technique in a ring theoretic setting in section 5. In [BCFRX] several potential platform groups are suggested. These include the full automorphism group of a finitely generated free group, the matrix group $SL(4, \mathbb{Z})$ and the surface braid groups. Shpilrain and Ushakov [SU] used this method employing Thompson's group $F$ as a platform. A length based attack on their system was attempted by Tsaban [T]. Further work on this method in the surface braid groups is being done by Camps [C].

4. Polyalphabetic Free Group Cryptosystems

In [BFX 1] the following general encryption scheme using free group cryptography was described. A further enhancement was discussed in [BFX 2]. We start with a finitely presented group

$$G = \langle X \mid R \rangle$$

where $X = \{x_1, \ldots, x_n\}$ and a faithful representation

$$\rho : G \to \overline{G}.$$

$\overline{G}$ can be any one of several different kinds of objects: linear group, permutation group, power series ring etc. We assume that there is an algorithm to re-express an element of $\rho(G)$ in $\overline{G}$ in terms of the generators of $G$. That is, if $g = W(x_1, \ldots, x_n) \in G$ where $W$ is a word in these generators and we are given $\rho(g) \in \overline{G}$ we can algorithmically find $g$ and its expression as the word $W(x_1, \ldots, x_n)$.

Once we have $G$ we assume that we have two free subgroups $K, H$ with $H \subset K \subset G$. We assume that we have fixed Schreier transversals for $K$ in $G$ and for $H$ in $K$, both of which are held in secret by the communicating parties Bob and Alice (see [B] for a description of Reidemeister-Schreier). Now based on the fixed Schreier transversals we have sets of Schreier generators constructed from the Reidemeister-Schreier process for $K$ and for $H$:

$$k_1, \ldots, k_m, \ldots \quad \text{for } K$$

and

$$h_1, \ldots, h_t, \ldots \quad \text{for } H.$$

Notice that the generators for $K$ will be given as words in $x_1, \ldots, x_n$, the generators of $G$, while the generators for $H$ will be given as words in the generators $k_1, k_2, \ldots$ for $K$. We note further that $H$ and $K$ may coincide and that $H$ and $K$ need not in general be free but only have a unique set of normal forms so that the representation of an element in terms of the given Schreier generators is unique.

We will encode within $H$, or more precisely within $\rho(H)$. We assume that the number of generators for $H$ is larger than the number of characters within our plaintext alphabet. Let $A = \{a, b, c, \ldots\}$ be our plaintext alphabet. At the simplest level we choose a starting point $i$ within the generators of $H$, and encode

$$a \mapsto h_i, \quad b \mapsto h_{i+1}, \quad \ldots \text{ etc.}$$

Suppose that Bob wants to communicate the message $W(a, b, c, \ldots)$ to Alice where $W$ is a word in the plaintext alphabet. Recall that both Bob and Alice know the various Schreier transversals which are kept secret between them. Bob then encodes $W(h_i, h_{i+1}, \ldots)$ and computes in $\overline{G}$ the element $W(\rho(h_i), \rho(h_{i+1}), \ldots)$ which he sends to Alice. This is sent as a matrix if $\overline{G}$ is a linear group or as a permutation if $\overline{G}$ is a permutation group and so on.

Alice uses the algorithm for $\overline{G}$ relative to $G$ to rewrite $W(\rho(h_i), \rho(h_{i+1}), \ldots)$ as a word $W^*(x_1, \ldots, x_n)$ in the generators of $G$. She then uses the Schreier transversal for $K$ in $G$ to rewrite, using the Reidemeister-Schreier process, $W^*$ as a word $W^{**}(k_1, \ldots, k_s, \ldots)$ in the generators of $K$. Since $K$ is free or has unique normal forms this expression for the element of $K$ is unique. Once Alice has the word written in the generators of $K$ she uses the transversal for $H$ in $K$ to rewrite again, using the Reidemeister-Schreier process, in terms of the generators for $H$. She then has a word $W^{***}(h_i, h_{i+1}, \ldots)$ and using $h_i \mapsto a$, $h_{i+1} \mapsto b$, ... decodes the message.

In [BFX 1,2] an implementation of this process was presented that used for the base group $G$ the classical modular group $M = PSL(2, \mathbb{Z})$. Further it was a polyalphabetic cipher which was secure. The system in the modular group $M$ was presented as follows. A list of finitely generated free subgroups $H_1, \ldots, H_m$ of $M$ is public and presented by their systems of generators (presented as matrices). In a full practical implementation it is assumed that $m$ is large. For each $H_i$ we have a Schreier transversal

$$h_{1,i}, \ldots, h_{t(i),i}$$

and a corresponding ordered set of generators

$$W_{1,i}, \ldots, W_{m(i),i}$$

constructed from the Schreier transversal by the Reidemeister-Schreier process. It is assumed that each $m(i) \gg l$ where $l$ is the size of the plaintext alphabet, that is, each subgroup has many more generators than the size of the plaintext alphabet. Although Bob and Alice know these subgroups in terms of free group generators, what is made public are generating systems given in terms of matrices.

The subgroups on this list and their corresponding Schreier transversals can be chosen in a variety of ways. For example the commutator subgroup of the Modular group is free of rank 2 and some of the subgroups $H_i$ can be determined from homomorphisms of this subgroup onto a set of finite groups.

Suppose that Bob wants to send a message to Alice. Bob first chooses three integers $(m, q, t)$ where

m = choice of the subgroup $H_m$,
q = starting point among the generators of $H_m$ for the substitution of the plaintext alphabet,
t = size of the message unit.

We clarify the meanings of $q$ and $t$. Once Bob chooses $m$, he makes the substitution

$$a \mapsto W_{q,m}, \quad b \mapsto W_{q+1,m}, \quad \ldots$$

Again the assumption is that $m(i) \gg l$ so that starting almost anywhere in the sequence of generators of $H_m$ will allow this substitution. The message unit size $t$ is the number of coded letters that Bob will place into each coded integral matrix. Once Bob has made the choices $(m, q, t)$ he takes his plaintext message $W(a, b, \ldots)$ and groups it into blocks of $t$ letters. He then makes the given substitution above to form the corresponding matrices in the Modular group:

$$T_1, \ldots, T_s.$$
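The roles of $q$ and $t$ can be sketched in code. The sketch below is only an illustration under stated assumptions: it works with a single hypothetical subgroup (so the choice $m$ is not modeled), generated by the matrices $a^j b a^{-j}$ where $a$ and $b$ are the two standard matrices with off-diagonal entry 2, which are known to generate a free group of rank 2. This choice of generators and all function names are assumptions, not the subgroups used in [BFX 1,2].

```python
def matmul(A, B):
    """Product of two 2x2 integer matrices given as nested tuples."""
    return (
        (A[0][0] * B[0][0] + A[0][1] * B[1][0], A[0][0] * B[0][1] + A[0][1] * B[1][1]),
        (A[1][0] * B[0][0] + A[1][1] * B[1][0], A[1][0] * B[0][1] + A[1][1] * B[1][1]),
    )


def generator(j):
    """Hypothetical j-th generator a^j b a^-j, with a = [[1,2],[0,1]], b = [[1,0],[2,1]]."""
    return ((1 + 4 * j, -8 * j * j), (2, 1 - 4 * j))


def det(T):
    return T[0][0] * T[1][1] - T[0][1] * T[1][0]


def encode(plaintext, q, t, alphabet):
    """Map the letter in alphabet position p to generator q + p, then multiply blocks of t."""
    indices = [q + alphabet.index(ch) for ch in plaintext]
    blocks = [indices[i:i + t] for i in range(0, len(indices), t)]
    matrices = []
    for block in blocks:
        T = ((1, 0), (0, 1))          # identity matrix
        for j in block:
            T = matmul(T, generator(j))
        matrices.append(T)
    return blocks, matrices


blocks, matrices = encode("abca", q=3, t=2, alphabet="abc")
```

With `q=3` and `t=2` the four letters of `abca` become two blocks of generator indices, and each block is multiplied out to a single integral matrix of determinant 1, so the number of matrices sent is the message length divided by `t`, rounded up.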

We now introduce a random noise factor. After forming $T_1, \ldots, T_s$ Bob then multiplies each $T_i$ on the right by a random matrix in $M$, say $R_{T_i}$ (different for each $T_i$). The only restriction on this random matrix $R_{T_i}$ is that there is no free cancellation in forming the product $T_i R_{T_i}$. This can be easily checked and ensures that the freely reduced form for $T_i R_{T_i}$ is just the concatenation of the expressions for $T_i$ and $R_{T_i}$. Next he sends Alice the integral key $(m, q, t)$ by some public key method (RSA, Anshel-Goldfeld etc.). He then sends the message as $s$ random projective matrices

$$T_1 R_{T_1}, \ldots, T_s R_{T_s}.$$


Hence what is actually being sent out are not elements of the chosen subgroup $H_m$ but rather elements of random right cosets of $H_m$. The purpose of sending coset elements is two-fold. The first is to hinder any geometric attack by masking the subgroup. The second is that it makes the resulting words in the Modular group generators longer, effectively hindering a brute force attack.

To decode the message Alice first uses public key decryption to obtain the integral keys $(m, q, t)$. She then knows the subgroup $H_m$, the ciphertext substitution from the generators of $H_m$ and how many letters $t$ each matrix encodes. She next uses the algorithms described in section 2 to express each $T_i R_{T_i}$ in terms of the free group generators of $F$, say $W_{T_i}(y_1, \ldots, y_n)$. She has knowledge of the Schreier transversal, which is held secretly by Bob and Alice, so she now uses the Reidemeister-Schreier rewriting process to start expressing this freely reduced word in terms of the generators of $H_m$. Recall that Reidemeister-Schreier rewriting is done letter by letter from left to right. Hence when she reaches $t$ of the free generators she stops. Notice that the string that she is rewriting is longer than what she needs to rewrite in order to decode, as a result of the random matrix $R_{T_i}$. This is due to the fact that she is actually rewriting not an element of the subgroup but an element in a right coset.

This presents a further difficulty to an attacker. Since these are random right cosets it is difficult to pick up statistical patterns in the generators even if more than one message is intercepted. In practice the subgroups should be changed with each message. The initial key $(m, q, t)$ is changed frequently. Hence as mentioned above this method becomes a type of polyalphabetic cipher. Polyalphabetic ciphers have historically been very difficult to decode [H].

5. Factoring in Noncommutative Rings

The following method is a further extension of the nonabelian group theoretic Diffie-Hellman system to the noncommutative ring setting. It uses the difficulty of factoring within some noncommutative rings together with the presence of a large and computable unit group. We suppose that we have a ring $R$ with a large unit group $U(R)$. By large we mean that $U(R)$ contains a nonabelian free subgroup so that random choices can be made from $U(R)$. We suppose further that there is no factoring algorithm in $R$. We also suppose that $U(R)$ is computable and that if one knows that $r \in U(R)$ then $r^{-1}$ can be found.

Suppose that Bob wants to send a message to Alice. Encoding is done within $R$ so that elements of $R$ represent messages. Bob wants to send the message $r \in R$ to Alice. He randomly chooses an $e \in U(R)$ and sends Alice $re$. Alice randomly chooses $f \in U(R)$ and sends back $fre$. Bob knows (but presumably an attacker cannot figure out) $e^{-1}$ so he forms $free^{-1} = fr$ and sends this back to Alice. Alice applies $f^{-1}$ to get the message $r$.

As a platform for this encryption method we propose the use of the ring of formal power series

$$R\langle\langle x_1, \ldots, x_n \rangle\rangle$$

over a ring $R$ in noncommuting variables $x_1, \ldots, x_n$. Although this can be done in an even more general context, for this study we will concentrate on rational formal power series, that is we consider the ring $R$ to be the field of rational numbers $\mathbb{Q}$. Throughout the rest of this paper we let

$$H = \mathbb{Q}\langle\langle x_1, \ldots, x_n \rangle\rangle$$

be the formal power series ring in noncommuting variables $x_1, \ldots, x_n$ over $\mathbb{Q}$. One of our primary tools for developing encryption methods will be based upon a faithful representation of a finitely generated free group within a quotient of $H$. This representation was developed by W. Magnus [M] and is now known as the Magnus representation. If $n \geq 2$ this then provides free subgroups of all countable ranks within this quotient of $H$. Further by imposing additional relations we can obtain representations of free nilpotent groups.

This method can be applied using the quotient ring $\overline{H}$ defined below as the platform. The power $d$ defining $\overline{H}$ is a shared secret. Encryption is done in a polynomial in the noncommuting variables. This encryption can be done in a variety of ways. The simplest is perhaps the following. The coefficients of our polynomials are rational numbers. Code letters by rational numbers and the message is read off from the coefficients.

Let $d > 1$ be an integer and impose the relations

$$x_1^d = x_2^d = \cdots = x_n^d = 0$$

on $H$. We call the resulting quotient $\overline{H}$. Notice that the elements of $\overline{H}$ are polynomials of degree $< d$ in the noncommuting variables $x_1, \ldots, x_n$. The faithful representation of a free group is given in terms of the polynomials

$$\alpha_1 = 1 + x_1, \quad \alpha_2 = 1 + x_2, \quad \ldots, \quad \alpha_n = 1 + x_n.$$

Notice that in the formal power series ring $H$ we have the well known expansion

$$\frac{1}{1 + x_i} = 1 - x_i + x_i^2 - x_i^3 + \cdots$$

Therefore each $\alpha_i$ is invertible within $H$ and hence invertible in $\overline{H}$. Within $\overline{H}$ however the inverse is a polynomial of degree $< d$ and so within $\overline{H}$

$$\frac{1}{1 + x_i} = 1 - x_i + x_i^2 - x_i^3 + \cdots + (-1)^{d-1} x_i^{d-1}.$$

Therefore each $\alpha_i$ is in the unit group $U(\overline{H})$ of $\overline{H}$ and therefore the set $\{\alpha_1, \ldots, \alpha_n\}$ generates a multiplicative subgroup of $U(\overline{H})$. Note also that if $d$, the defining power, is kept secret, then inverses are unknown to anyone who does not know $d$. Further we can show that the unit group $U(\overline{H})$ consists of those polynomials with non-zero constant term.

Suppose that Bob wants to send Alice the message $T \in \mathbb{Q}[[x_1, \ldots, x_n]]$ where $x_i^d = 0$ for all $i$. Let $R = T + S$ where $S$ is an arbitrary polynomial with only powers higher than $d$. Bob chooses a random element $W$ of the unit group. He sends Alice $RW$. Bob knows the inverse of $W$. Alice chooses another random element $V$ of the unit group and sends Bob back $VRW$. Bob multiplies by $W^{-1}$ and sends Alice $VR$, from which Alice recovers $R$. Since she knows $d$ she cancels all powers higher than $d$ to obtain the message $T$. An attacker would need to factor $RW$ and know the defining power $d$ to attack the message.

We mention that this scheme would not work if we restricted $W$ to be in the Magnus free group within $\overline{H}$. We will show in the next section that if Bob sends $RW$ and Alice sends back $VRW$, with $V$ and $W$ in the Magnus free group, then $V$ can be peeled off and subsequently $W$ can be peeled off, so the attacker can get the message $R$.
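The three passes can be sketched in code. The sketch below is a simplification under stated assumptions: polynomials are dictionaries from monomials (tuples of variable indices) to rational coefficients, and every monomial of total degree at least `D` is discarded, which is a coarser truncation than the relations $x_i^d = 0$ but keeps every element finite. The value of `D`, the sample polynomials and the function names are all hypothetical. In this truncated ring a polynomial with non-zero constant term $c$ can be written as $c(1 - n)$ with $n$ nilpotent, so its inverse is a finite geometric sum.

```python
from fractions import Fraction

D = 4  # hypothetical truncation: monomials of total degree >= D are set to zero


def poly(terms):
    """Build a polynomial {monomial tuple: Fraction} from integer or rational data."""
    return {m: Fraction(c) for m, c in terms.items() if c != 0}


def add(p, q):
    out = dict(p)
    for m, c in q.items():
        out[m] = out.get(m, 0) + c
    return {m: c for m, c in out.items() if c != 0}


def mul(p, q):
    """Noncommutative product: monomials are concatenated, high degrees dropped."""
    out = {}
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            if len(m1) + len(m2) < D:
                out[m1 + m2] = out.get(m1 + m2, 0) + c1 * c2
    return {m: c for m, c in out.items() if c != 0}


def inverse(u):
    """Invert a unit u = c * (1 - n) as (1/c) * (1 + n + n^2 + ... + n^(D-1))."""
    c = u.get((), 0)
    if c == 0:
        raise ValueError("zero constant term: not a unit")
    n = {m: -coef / c for m, coef in u.items() if m != ()}
    total = {(): Fraction(1)}
    power = {(): Fraction(1)}
    for _ in range(D - 1):
        power = mul(power, n)
        total = add(total, power)
    return {m: coef / c for m, coef in total.items()}


r = poly({(): 3, (0,): 1, (0, 1): 5, (1,): 2})   # message
e = poly({(): 2, (0,): 1, (1, 0): -1})           # Bob's secret unit
f = poly({(): -1, (1,): 3, (0, 0): 1})           # Alice's secret unit

pass1 = mul(r, e)                    # Bob -> Alice: r e
pass2 = mul(f, pass1)                # Alice -> Bob: f r e
pass3 = mul(pass2, inverse(e))       # Bob -> Alice: f r
recovered = mul(inverse(f), pass3)   # Alice recovers r
```

Exact rational arithmetic via `Fraction` is used so that the inverses cancel exactly and `recovered` equals `r`; floating point coefficients would only recover the message approximately.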

6. Formal Power Series Rings and the Magnus Representation

We now describe some necessary algorithmic material needed to use the rings $H$ and $\overline{H}$ as cryptographic platforms. Recall that

$$H = \mathbb{Q}\langle\langle x_1, \ldots, x_n \rangle\rangle$$

is the formal power series ring in noncommuting variables $x_1, \ldots, x_n$ over $\mathbb{Q}$ and $\overline{H}$ is the resulting quotient ring found by letting $x_i^d = 0$ for all $i$. This $d$ is kept secret in cryptography. One of our primary tools for developing encryption methods will be based upon the Magnus representation of a finitely generated free group within a quotient of $H$. If $n \geq 2$ this representation provides free subgroups of all countable ranks within this quotient of $H$. Further by imposing additional relations we can obtain representations of free nilpotent groups.


We first describe the Magnus representation and give a proof. The proof will lead us to two algorithms for describing when certain polynomials lie in the image of this representation. Further we describe the unit group of this quotient. First let $d > 1$ be an integer and impose the relations

$$x_1^d = x_2^d = \cdots = x_n^d = 0$$

on $H$. We call the resulting quotient $\overline{H}$. Notice that the elements of $\overline{H}$ are polynomials of degree $< d$ in the noncommuting variables $x_1, \ldots, x_n$. The faithful representation of a free group is given in terms of the polynomials

$$\alpha_1 = 1 + x_1, \quad \alpha_2 = 1 + x_2, \quad \ldots, \quad \alpha_n = 1 + x_n.$$

Notice that in the formal power series ring $H$ we have the well known expansion

$$\frac{1}{1 + x_i} = 1 - x_i + x_i^2 - x_i^3 + \cdots$$

Therefore each $\alpha_i$ is invertible within $H$ and hence invertible in $\overline{H}$. Within $\overline{H}$ however the inverse is a polynomial of degree $< d$ and so within $\overline{H}$

$$\frac{1}{1 + x_i} = 1 - x_i + x_i^2 - x_i^3 + \cdots + (-1)^{d-1} x_i^{d-1}.$$

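Therefore each $\alpha_i$ is in the unit group $U(\overline{H})$ of $\overline{H}$ and therefore the set $\{\alpha_1, \ldots, \alpha_n\}$ generates a multiplicative subgroup of $U(\overline{H})$. Note also that if $d$, the defining power, is kept secret, then inverses are unknown to anyone who does not know $d$. Magnus' result is the following.

Theorem 6.1. The elements

$$\alpha_1 = 1 + x_1, \quad \ldots, \quad \alpha_n = 1 + x_n$$

freely generate a subgroup of $U(\overline{H})$. Therefore the map given by

$$y_i \mapsto \alpha_i, \quad i = 1, \ldots, n$$

provides a faithful representation of the free group on $y_1, \ldots, y_n$ into $\overline{H}$.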
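As a small worked instance, take $n = 2$ and write $\alpha_i = 1 + x_i$. Expanding the image of the commutator $y_1 y_2 y_1^{-1} y_2^{-1}$ in the power series ring and keeping only terms of degree at most 2 (which is unaffected by the relations as long as $d \geq 3$) gives

$$\alpha_1 \alpha_2 \alpha_1^{-1} \alpha_2^{-1} = (1 + x_1)(1 + x_2)(1 - x_1 + x_1^2 - \cdots)(1 - x_2 + x_2^2 - \cdots) = 1 + x_1 x_2 - x_2 x_1 + (\text{terms of degree} \geq 3).$$

Since the variables do not commute, $x_1 x_2 - x_2 x_1 \neq 0$, so this nontrivial word is not sent to the identity, in line with the statement of the theorem.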

We present a proof, since as mentioned, the proof will lead us to an algorithm necessary for our encryption methods.

Proof. Notice from the comment above that each $\alpha_i$ is invertible within $\overline{H}$. Therefore each $\alpha_i$ is in the unit group $U(\overline{H})$ of $\overline{H}$ and therefore the set $\{\alpha_1, \ldots, \alpha_n\}$ generates a multiplicative subgroup of $U(\overline{H})$. We show that no nontrivial freely reduced word in the $\alpha_i$ can be the identity and hence the group they generate must be a free group. From the binomial expansion we have for any non-zero integer $n$, positive or negative,

$$(1 + x_i)^n = 1 + n x_i + \text{terms in higher powers.}$$

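Now let

$$W(\alpha_1, \ldots, \alpha_n) = \alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}$$

be a freely reduced word in the $\alpha_i$ with each $|n_j| \geq 1$ and $\alpha_{i_j} \neq \alpha_{i_{j+1}}$ for $j = 1, \ldots, k - 1$. For later reference we call $k$ the block length. In the ring $H$ we then have

$$\alpha_{i_j}^{n_j} = (1 + x_{i_j})^{n_j} = 1 + n_j x_{i_j} + \text{higher powers in } x_{i_j}$$

and hence

$$W(\alpha_1, \ldots, \alpha_n) = (1 + n_1 x_{i_1} + \text{higher powers in } x_{i_1}) \cdots (1 + n_k x_{i_k} + \text{higher powers in } x_{i_k}).$$

The variables are noncommuting, so that in analyzing this product we see that there is a unique monomial term of maximal block length $k$ where each $x_{i_j}$ appears to the power 1. That is there is a unique monomial term

$$n_1 n_2 \cdots n_k \, x_{i_1} x_{i_2} \cdots x_{i_k}.$$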

We stress here that this is of maximal block length since this will be important in the subsequent algorithm. Since each $n_j \neq 0$ this term must appear and therefore $W(\alpha_1, \ldots, \alpha_n) \neq 1$. It follows that the group generated by $\alpha_1, \ldots, \alpha_n$ is freely generated by them. □

The proof of the faithfulness of the Magnus representation leads us to several algorithms for dealing with the image in the power series ring. We will employ these algorithms in our cryptosystems. For the remainder of this section we will let $F$ denote the free subgroup of $\overline{H}$ generated by the $\alpha_i$. The first algorithm provides a method, given a polynomial in $\overline{H}$ which is written in polynomial form and which we know to be in $F$, to write its unique free group decomposition. That is, given a polynomial $f$ in the noncommuting variables $x_1, \ldots, x_n$ that we know to be in $F$, to rewrite $f$ as

$$f = \alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}.$$

In general there is no factoring algorithm in $\overline{H}$. For any monomial $x_{i_1} \cdots x_{i_k}$ in $\overline{H}$ we call $k$ the block length of the monomial in analogy with that of a free group word.

Theorem 6.2. (Algorithm to Recover the Free Group Decomposition of Elements in F). Suppose $f = f(x_1, \ldots, x_n) \in \overline{H}$ and it is known that $f \in F$. There is an algorithm that rewrites $f$ in terms of the free generators $\alpha_1, \ldots, \alpha_n$, that is the algorithm uniquely expresses $f$ as a free group word

$$f = \alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}.$$

The algorithm works as follows:

Step 1: In $f$ locate the monomial $n x_{i_1} \cdots x_{i_k}$ of maximal block length where $n \in \mathbb{Z} \setminus \{0\}$, each variable that appears in $f$ appears in this monomial, and each variable is to the power 1. This $k$ gives the block length for the corresponding free group word. Further the free group word must have the form

$$\alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}$$

with each $n_i$ a divisor of $n$.

Step 2: For each divisor $n_i$ of $n$, both positive and negative, sequentially form $(1 + x_{i_1})^{-n_i} f$. In exactly one such product the maximal block length will be $k - 1$ and there will be a unique monomial of block length $k - 1$ containing each variable in $f$ except perhaps $x_{i_1}$ and each to the power 1. We then have

$$f = (1 + x_{i_1})^{n_1} f_1$$

where $f_1$ is also in $F$.

Step 3: Continue in this manner until we reach the identity. The free group decomposition of $f$ is then

$$f = (1 + x_{i_1})^{n_1} \cdots (1 + x_{i_k})^{n_k} = \alpha_{i_1}^{n_1} \cdots \alpha_{i_k}^{n_k}.$$
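The three steps can be sketched in code. The sketch below is only an illustration under stated assumptions: polynomials are dictionaries from monomials (tuples of variable indices) to integer coefficients, and all monomials of total degree at least `D` are discarded, which is a simplification of the quotient used in the text and requires `D` to exceed the block length of the word. The names `image`, `decompose` and the value of `D` are hypothetical.

```python
D = 8  # hypothetical truncation degree; must exceed the block length of the word


def mul(p, q):
    """Noncommutative product of {monomial tuple: int}, dropping degree >= D."""
    out = {}
    for m1, c1 in p.items():
        for m2, c2 in q.items():
            if len(m1) + len(m2) < D:
                out[m1 + m2] = out.get(m1 + m2, 0) + c1 * c2
    return {m: c for m, c in out.items() if c != 0}


def alpha_power(i, n):
    """Truncated binomial series for (1 + x_i)^n, for any integer n."""
    out, coeff = {}, 1
    for k in range(D):
        if coeff != 0:
            out[(i,) * k] = coeff
        coeff = coeff * (n - k) // (k + 1)   # next binomial coefficient, exact
    return out


def image(word):
    """Image of the word [(i_1, n_1), ..., (i_k, n_k)] as a truncated polynomial."""
    f = {(): 1}
    for i, n in word:
        f = mul(f, alpha_power(i, n))
    return f


def blocks(m):
    """Collapse runs of equal letters; the length of the result is the block length."""
    out = []
    for v in m:
        if not out or out[-1] != v:
            out.append(v)
    return tuple(out)


def max_block_length(f):
    return max(len(blocks(m)) for m in f)


def divisors(n):
    ds = [d for d in range(1, abs(n) + 1) if n % d == 0]
    return ds + [-d for d in ds]


def decompose(f):
    """Peel off one factor (1 + x_i)^c at a time, as in Steps 1 to 3."""
    word = []
    while f != {(): 1}:
        k = max_block_length(f)
        tops = [m for m in f if len(m) == k and blocks(m) == m]   # Step 1
        if k == 0 or len(tops) != 1:
            raise ValueError("no unique top monomial: f is not in F")
        i, n = tops[0][0], f[tops[0]]
        for c in divisors(n):                                      # Step 2
            g = mul(alpha_power(i, -c), f)
            if max_block_length(g) == k - 1:
                word.append((i, c))
                f = g
                break
        else:
            raise ValueError("no divisor shortens f: f is not in F")
    return word                                                    # Step 3


sample = [(0, 2), (1, -3), (0, 1), (1, 2)]
recovered = decompose(image(sample))
```

In this run the top monomial of `image(sample)` has coefficient 2 · (−3) · 1 · 2 = −12, and among the divisors of 12 only the exponent 2 shortens the block length from 4 to 3, so the first factor peeled off is $\alpha_0^2$ (variables are indexed from 0 in the code).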

Proof. Since we know that $f \in F$ we know that there is a unique free group decomposition

$$f = \alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}.$$

Hence, as in the proof that the representation is faithful, there is a unique monomial

$$n x_{i_1} \cdots x_{i_k}, \quad n = n_1 n_2 \cdots n_k,$$

of maximal block length where $n \in \mathbb{Z} \setminus \{0\}$, each variable that appears in $f$ appears in this monomial, and each variable is to the power 1. Again as in the proof of Theorem 6.1, $k$ gives the block length for the corresponding free group word. Now, since the free group representation is unique we have for each divisor $n_i$ of $n$

$$(1 + x_{i_1})^{-n_i} f = (1 + x_{i_1})^{n_1 - n_i} \cdots (1 + x_{i_k})^{n_k}.$$

Hence only for $n_i = n_1$ will this term cancel. Hence there is exactly one such divisor such that $(1 + x_{i_1})^{-n_i} f$ will now have maximal block length $k - 1$ and have a unique monomial of the prescribed type. It follows then that one and only one such product will reduce $f$ to a word of shorter block length. □

A modification of the above algorithm can be used to determine if a general element of $\overline{H}$ is actually in $F$.

Theorem 6.3. (Algorithm to Determine if $f \in \overline{H}$ is in F). Suppose

$$f = f(x_1, \ldots, x_n) \in \overline{H}.$$

There is an algorithm that determines whether or not $f \in F$ and if it is, rewrites $f$ in terms of the free generators $\alpha_1, \ldots, \alpha_n$. The algorithm works as follows:

Step 1: If the constant term of $f$ is not 1 then $f \notin F$. Further if $f$ has any nonintegral coefficients then $f \notin F$.

Step 2: Assume $f$ passes Step 1. If $f$ does not contain a unique monomial $n x_{i_1} \cdots x_{i_k}$ of maximal block length in $f$ where $n \in \mathbb{Z} \setminus \{0\}$, each variable that appears in $f$ appears in this monomial, and each variable is to the power 1, then $f \notin F$.

Step 3: Suppose $f$ passes Steps 1 and 2. Then in $f$ locate the monomial with the characteristics described in Step 2. If $f \in F$ then $k$ gives the block length for the corresponding free group word. Further the free group word must have the form

$$\alpha_{i_1}^{n_1} \alpha_{i_2}^{n_2} \cdots \alpha_{i_k}^{n_k}$$

with each $n_i$ a divisor of $n$.

Step 4: For each divisor $n_i$ of $n$, both positive and negative, sequentially form $(1 + x_{i_1})^{-n_i} f$. If in no such product the maximal block length becomes $k - 1$, or if it does but there is no new monomial having the characteristics described above, then $f \notin F$. Otherwise continue.

Step 5: If eventually we arrive at the identity then $f \in F$ and the procedure yields the free group decomposition of $f$.

Proof. The proof follows in exactly the same manner as the proof of Theorem 6.2. □

For certain cryptographic applications we need the full unit group $U(\overline{H})$ of $\overline{H}$. Over $\mathbb{Q}$ it can be described as those polynomials with nonzero constant term.

Theorem 6.4. The unit group $U(\overline{H})$ over $\mathbb{Q}$ consists precisely of those polynomials with nonzero constant term.

Proof. There are two ways to look at the proof of this. Algebraically suppose the defining power is $d > 1$ and $P(x) \in \overline{H}$ with nonzero constant term. Then $P(x)$ is relatively prime to the polynomials $x_i^d$ and so is invertible in the factor ring in the standard way. Analytically if $P(x) \in \overline{H}$ with nonzero constant term let $P^*(x)$ be the corresponding polynomial in $H$. Then $P^*(x)$ can be made into part of a convergent power series $P^{**}(x)$ in

$$\mathbb{C}\langle\langle x_1, \ldots, x_n \rangle\rangle.$$

Since P**(O) =I- 0 this power series is analytic at 0 and so its inverse is analytic at 0 and so has a convergent power series around 0 say Q**(x). The image of Q**(x) in H would then be the inverse of P(x). Conversely if P(x) E H is invertible it must have nonzero constant ten)[J Before we continue we mention one final item concerning multiplication within H. In general there is no factoring algorithm. However if f E H is known and g = fe with e E F is known then we can find e. We say that e can be peeled off f e. The algorithm to do this is essentially the same as the above two algorithms. We briefly explain. Suppose we are given f and fe. Then in fe there is a unique monomial extending the monomials in f exactly as in the proof of Theorem 6.2. By identifying this monomial we can find the free group decomposition of e and hence find e. The rings H and its quotient H, together with the Magnus representation, provide a very flexible platform for doing cryptography. First of all the

43

unit group U(R) contains a free group via the Magnus representation. Further there is no factoring algorithm within H. Keeping the defining power d secret makes determining inverses open only to those who know d. Hence this ring provides an ideal algebraic platform for the ring theoretic DiffieHellman method described in the last section. The actual method goes as was described there which we briefly repeat. Suppose that Bob wants to send Alice the message T E Q[[Xl, '" xnll where x~ = 0 for all i. Let R = T + S where S is an arbitrary polynomial with only powers higher than d. Bob chooses a random element of the unit group W. He sends Alice RW. Bob knows the inverse of W. Alice chooses another random V of the unit group and sends Bob back V RW. Bob multiplies by W-l and sends Alice VR from which Alice recovers R. Since she knows d she cancels all powers higher than d to obtain the message T. An attacker would need to factor RW and know the defining power d to attack the message. Further the unit group U(R) of H contains free subgroups. Via the algorithms described in this section we can move back and forth between the free group F and the representing polynomials. Hence this can be used as a platform for the free group BFX polyalphabetic cipher. 7. Relations on the Variables It was shown in [GB 2] that by imposing further relations on the variables, free nilpotent groups of all possible class size can also be embedded in quotients of H. This was used in [GB 2] to prove certain results concerning equations in free groups. This can be used further for encryption purposes. By imposing nilpotency relations on some of the variables in the power series, but not all, and keeping the relations secret a further level of security is imposed. This procedure is under development ([BBFGR]). 8. References [AAG] LAnshel,M.Anshel,D.Goldfeld, An Algebraic Method for Public Key Cryptography, Math.Res. Lett, 6, 1999, 287-291 Springer Verlag [GB 1] G.Baumslag, ory,Birkhauser 1993

Topics

in

Combinatorial

Group

The-

[GB 2] G. Baumslag, Residual Nilpotence and Relations in Free Groups J. Algebra, 2, 1965, 271-285 [BFX 1] G. Baumslag, B.Fine, and X.Xu, Cryptosystems Using Linear

44

Groups Appl. Alg. in Engineering,Communication and Computing, 17, 2006, 205-217 [BFX 2J G. Baumslag, B.Fine, and X.Xu, A Proposed Public Key Cryptosystem Using the Modular Group Cont. Math, 421, 2007, 35-44 [BCFRXJ G. Baumslag,T.Camps, B.Fine,G.Rosenberger and X.Xu, Designing Key Transport Protocols Using Combinatorial Group Theory, Cont. Math., 418, 2006, 35-43 [CJJ J.Clark and J.Jacob, A Survey of Authentication Protocol Literature; Version 1, No. 1997 , see www.lsv.ens-cachan.fr/spore/shamir.pdf [FJ B.Fine, The Algebraic Theory of the Bianchi Groups, Marcel Dekker, 1990 [GPJ D.Grigoriev and 1. Ponomarenko, Homomorphic Public-Key Cryptosystems Over Groups and Rings Quaderni di Matematica, 2005 [HJ P.Hoffman, Archimedes Revenge Fawcett-Crest, 1988 [HGSJ C.Hall, 1. Goldberg, B. Schneier, Reaction attaacks Against Several Public Key Cryptosystems Proceedings of Information and Communications Security ICICS 99, Springer-Verlag, 1999, 2-12 [KoLJ K.H. Ko,J.Lee,J.H. Cheon,J.W. Han, J.Kang, C.Park, New PublicKey Cryptosystem Using Braid Groups, inAdvan Advances in Cryptology - CRYPTO 2000 Santa Barbara CA, - Lecture Notes in Computer Science, Springer 1880, 2000 166-183 [KoJ N.Koblitz, Algebraic Methods of Cryptography, Springer, 1998 [MJ W. Magnus, Rational Representations of Fuchsian Groups and NonParabolic Subgroups of the Modular Group, Nachrichten der Akad Gottingen, 1973, 179-189 [MKS] W. Magnus, A. Karass and D. Solitar Combinatorial Group Theory, Wiley Interscience,New York, 1968 [StJ R. Steinwandt, Loopholes in two public key cryptosystems using the modular groups preprint Univ. of Karlsruhle, 2000 [X] Xiaowei Xu, Cryptography and Infinite Group Theory, Ph.D. Thesis, CUNY,2006 [YJ A.Yamamura, Public Key cryptosystems using the modular group, Lecture Notes in Comput. Sci., 1431, 1998, 203-216

On the derived subgroups of the free nilpotent groups of finite rank

Russell D. Blyth
Department of Mathematics and Computer Science, Saint Louis University, St. Louis, MO 63103, USA
Email address: [email protected]

Primož Moravec
Fakulteta za matematiko in fiziko, Univerza v Ljubljani, Jadranska 19, 1000 Ljubljana, Slovenia
Email address: [email protected]

Robert Fitzgerald Morse
Department of Electrical Engineering and Computer Science, University of Evansville, Evansville IN 47722 USA
Email address: [email protected]
URL: faculty.evansville.edu/rm43

Dedicated to Tony Gaglione on his Sixtieth Birthday

We provide a detailed structure description of the derived subgroups of the free nilpotent groups of finite rank. This description is then applied to computing the nonabelian tensor squares of the free nilpotent groups of finite rank.

Keywords: Free nilpotent group, Derived subgroup, Nonabelian tensor square

2000 Mathematics Subject Classification: 20F18, 20J99

1. Introduction

A systematic structure description of the subgroups of the free nilpotent groups is given in S. Moran's paper "A subgroup theorem for free nilpotent groups"¹, which is based on the work of Gol'dina.² Moran's result is necessarily general and does not provide a detailed structure description for any specific subgroup of a free nilpotent group, such as its derived subgroup. The purpose of this paper is to provide a detailed structure description of the derived subgroups of the free nilpotent groups of finite rank. The motivation for this investigation is that the derived subgroup of a free nilpotent group of class $c + 1$ and rank $n$ is isomorphic to the nonabelian exterior square of the free nilpotent group of class $c$ and rank $n$. Moreover, the results presented in this paper give complete structure descriptions of the nonabelian tensor squares of free nilpotent groups of finite rank using a result from Blyth et al.³

We fix our notation. Let $F_n$ be the free group of rank $n$ with generators $f_1, \ldots, f_n$ and denote the free abelian group of rank $n$ by $F_n^{ab}$. Let $C_n$ be a fixed Hall basic sequence of commutators in the free generators of $F_n$. The weight of the commutator $c_i \in C_n$ is denoted by $w_i$. We denote the subsequence of commutators of $C_n$ whose weight is at most $w$ by $C_{n,w}$. The number of commutators in $C_{n,w} \setminus C_{n,w-1}$ is denoted by $M(n, w)$. The subset of simple left normed commutators in $C_n$ of weight at most $w$ is denoted by $S_{n,w}$. Let $\mathcal{N}_{n,c} = F_n/\gamma_{c+1}(F_n)$ be the free nilpotent group of rank $n$ and class $c$ generated by $g_1, \ldots, g_n$. Denote by $\mathcal{D}_{n,c}$ the derived subgroup of $\mathcal{N}_{n,c}$. The elements of $C_{n,c}$ map to $\mathcal{N}_{n,c}$ via the natural homomorphism $F_n \to F_n/\gamma_{c+1}(F_n) = \mathcal{N}_{n,c}$. With slight abuse of notation we identify the elements of $C_{n,c}$ with their images in $\mathcal{N}_{n,c}$.

The group $N_{m,w}$, which will be used several times in this paper, is defined as follows. We first fix a free nilpotent group $\mathcal{K}$. Given $m \geq 2$ and $w \geq 3$, let

$$\mathcal{K} = \begin{cases} \mathcal{N}_{s,\lfloor w/2 \rfloor - 1} & \text{if } m = 2, \\ \mathcal{N}_{s,\lfloor w/2 \rfloor} & \text{if } m \geq 3, \end{cases}$$

where $s = |S_{m,w-2} \setminus S_{m,1}|$. Let $k_1, \ldots, k_s$ be the free generating set for $\mathcal{K}$. Fix a bijection $\beta$ from the set $\{1, \ldots, s\}$ to $S_{m,w-2} \setminus S_{m,1}$. Associate a weight $w_{k_i}$ to each generator $k_i$ of $\mathcal{K}$ via $\beta$ by setting $w_{k_i}$ to be the weight of the simple left normed commutator $\beta(i)$. Then for $m \geq 2$ and $w \geq 3$, define

$$N_{m,w} = \mathcal{K}/R,$$

where $R = \langle [k_i, k_j] \mid w_{k_i} + w_{k_j} > w \rangle$. The group $N_{m,w}$ has a minimal cardinality generating set with $s = |S_{m,w-2} \setminus S_{m,1}|$ generators.

Our main result is the following description of the derived subgroup of a free nilpotent group of finite rank.

Theorem 1.1. Let $\mathcal{N}_{n,c}$ be the free nilpotent group of class $c \geq 1$ and rank $n \geq 1$. If $n = 1$ or $c = 1$ then $\mathcal{N}_{n,c}$ is abelian and $\mathcal{D}_{n,c}$ is trivial. If $n > 1$ and $c = 2$ then $\mathcal{D}_{n,c}$ is free abelian of rank $M(n, 2) = \binom{n}{2}$. If $n > 1$ and $c > 2$ then

$$\mathcal{D}_{n,c} \cong N_{n,c} \times F_f^{ab},$$

where $f = |S_{n,c} \setminus S_{n,c-2}|$.

In Section 2 we prove Theorem 1.1 and provide formulas for $|S_{n,c} \setminus S_{n,c-2}|$ and $|S_{n,c} \setminus S_{n,1}|$. These formulas allow us to restate Theorem 1.1, giving an explicit rank for the free abelian factor of $\mathcal{D}_{n,c}$ and an explicit minimal number of generators for $N_{n,c}$ (Theorem 2.3).

The nonabelian tensor square $G \otimes G$ of a group $G$ was introduced by Brown and Loday⁴ following the ideas of Dennis⁵ and Miller⁶ and is of topological significance. In Section 3 we give a brief exposition of the nonabelian tensor square of a group and use the formulas found in Section 2 to give a full description of $\mathcal{N}_{n,c} \otimes \mathcal{N}_{n,c}$. We apply Theorem 1.1 to obtain a general structure description for this nonabelian tensor square.

Corollary 1.1. Let $G = \mathcal{N}_{n,c}$ be the free nilpotent group of class $c$ and rank $n > 2$. Then

$$G \otimes G \cong N_{n,c+1} \times F_g^{ab},$$

where $g = |S_{n,c+1} \setminus S_{n,c-1}| + \binom{n+1}{2}$.

2. The Derived Subgroup of a Free Nilpotent Group

In this section we first prove Theorem 1.1 and then use a result of Gaglione and Spellman⁷ to find a formula for the number $|S_{m,w} \setminus S_{m,1}|$ of simple left normed commutators in $C_{m,w}$ of weight at least 2. From this formula we derive explicit expressions for $s$, the minimal number of generators for $N_{n,w}$, and for $|S_{m,w} \setminus S_{m,w-1}|$. The proof of Theorem 1.1 uses the following two theorems of Moran.¹

Theorem 2.1 (Theorem 1.5, Ref. 1). Every abelian subgroup of a free nilpotent group is free abelian.

Theorem 2.2 (Theorem 3.1, Ref. 1). Let $B$ be a subgroup of a free nilpotent group $\mathcal{N}_{n,c}$ of class $c \geq 1$ and rank $n \geq 1$. Then $B$ is generated by a set of $c$ subgroups

$$B_1, B_2, \ldots, B_c,$$

where

(i) for $k = 1, 3, \ldots, c$, the subgroup $B_k$ is a free nilpotent group of class $\lfloor c/k \rfloor$;
(ii) for $n = 2$, that is when $\mathcal{N}_{n,c}$ is 2-generated, the subgroup $B_2$ is infinite cyclic; otherwise the subgroup $B_2$ is free nilpotent of class $\lfloor c/2 \rfloor$;
(iii) for $i + j \leq c$, the subgroup $[B_i, B_j]$ is contained in the subgroup $\langle B_{i+j}, \ldots, B_c \rangle$;
(iv) for $i + j > c$, the subgroup $[B_i, B_j]$ is trivial; and
(v) for $k = 1, 2, \ldots, c - 1$, the quotient group

$$\langle B_k, B_{k+1}, \ldots, B_c \rangle / \langle B_{k+1}, \ldots, B_c \rangle$$

is a free abelian group freely generated by the images of the free generators of $B_k$ in the quotient group.

For a subgroup $B$ of $\mathcal{N}_{n,c}$, one or more of the $B_i$ in Theorem 2.2 may have rank 0. In such cases we treat these subgroups as trivial.

Applying Theorem 2.2 to the subgroup $\mathcal{D}_{n,c}$ of $\mathcal{N}_{n,c}$, the subgroups $B_1, \ldots, B_c$ are constructed as follows. Set $B_1$ to be a group of rank 0, as there are no basic commutators of weight 1 in $\mathcal{D}_{n,c}$, and set

$$B_k = \langle C_{n,k} \setminus C_{n,k-1} \rangle$$

for $k = 2, \ldots, c$. It follows that $\mathcal{D}_{n,c}$ is generated by $C_{n,c} \setminus C_{n,1}$. This fact can also be obtained from the Hall Basis Theorem. We now prove Theorem 1.1.

Proof of Theorem 1.1. Let $\mathcal{N}_{n,c}$ be a free nilpotent group of class $c \geq 1$ and rank $n \geq 1$. If $n = 1$ or $c = 1$ then $\mathcal{N}_{n,c}$ is abelian and its derived subgroup $\mathcal{D}_{n,c}$ is trivial. If $c = 2$ then $\mathcal{D}_{n,c}$ is abelian and hence free abelian by Theorem 2.1. The $M(n, 2)$ commutators of $C_n$ of weight 2 are independent and generate $\mathcal{D}_{n,c}$. Hence $\mathcal{D}_{n,c}$ is free abelian of rank $M(n, 2) = \binom{n}{2}$.

Suppose $c > 2$ and $n > 1$. Let $B_1, \ldots, B_c$ be the subgroups of $\mathcal{D}_{n,c}$ constructed above. The abelian subgroups $B_c$ and $B_{c-1}$ are free abelian by Theorem 2.1 and both are contained in the center of $\mathcal{D}_{n,c}$. The Hall Basis Theorem states that $C_{n,c} \setminus C_{n,c-1}$ is an independent generating set for $B_c$. By property (v) of Theorem 2.2 the quotient $\langle B_{c-1}, B_c \rangle / \langle B_c \rangle$ is freely generated by the images of $C_{n,c-1} \setminus C_{n,c-2}$. However, since $B_{c-1}$ is also central, $\langle B_{c-1}, B_c \rangle = B_{c-1} \times B_c$ with rank $|C_{n,c} \setminus C_{n,c-2}|$. Let $A$ be the subgroup of $\mathcal{D}_{n,c}$ generated by the basis elements $S_{n,c} \setminus S_{n,c-2}$ of $\langle B_{c-1}, B_c \rangle$.

Let $N$ be the subgroup of $\mathcal{D}_{n,c}$ generated by the set $(C_{n,c} \setminus C_{n,1}) \setminus (S_{n,c} \setminus S_{n,c-2})$. By Theorem 2.2 this subgroup is generated by subgroups $B_1, \ldots, B'_{c-1}, B'_c$ that satisfy properties (i)-(v). We define this sequence of subgroups as before, replacing the subgroups $B_c$ and $B_{c-1}$ with

$$B'_c = \langle (C_{n,c} \setminus C_{n,c-1}) \setminus (S_{n,c} \setminus S_{n,c-1}) \rangle \quad \text{and} \quad B'_{c-1} = \langle (C_{n,c-1} \setminus C_{n,c-2}) \setminus (S_{n,c-1} \setminus S_{n,c-2}) \rangle.$$

It follows from $A$ being central in $\mathcal{D}_{n,c}$ and from the Hall Basis Theorem that any element of $\mathcal{D}_{n,c}$ can be written as $xy$, where $x \in N$ and $y \in A$. Hence $\mathcal{D}_{n,c} = NA$. The subgroup $N$ is normal in $\mathcal{D}_{n,c}$ since $A$ is central and $\mathcal{D}_{n,c} = NA$. Any element $g \in N \cap A$ would have to be written both as a product of powers of the generators of $N$ and as a product of powers of the simple commutator generators of $A$. This is only possible if $g = 1$. Hence $N \cap A = 1$. Since both $A$ and $N$ are normal in $\mathcal{D}_{n,c}$, it follows that $\mathcal{D}_{n,c} = N \times A$.

To show that $N$ is isomorphic to $N_{n,c}$ we define a new sequence of subgroups of $N$. For $i = 2, \ldots, c - 2$ we define

$$\mathcal{S}_i = \langle S_{n,i} \setminus S_{n,i-1} \rangle.$$

Set $N^* = \langle \mathcal{S}_2, \ldots, \mathcal{S}_{c-2} \rangle$. The generators of the subgroups $\mathcal{S}_i$ for $i = 2, \ldots, c - 2$ are independent and cannot be expressed as products of powers of the other generators. This holds since the generators are simple left normed commutators and no generator of $N^*$ has weight 1. Using the bijection $\beta$, the generators of $\mathcal{K}$ are mapped to the generators of $N^*$. The groups $\mathcal{K}$ and $N^*$ both have the same nilpotency class, either $\lfloor c/2 \rfloor$ if $n > 2$ or $\lfloor c/2 \rfloor - 1$ if $n = 2$. Hence the mapping of generators induces a homomorphism $\phi : \mathcal{K} \to N^*$. Since the $\mathcal{S}_i$ are subgroups of the $B_i$, the subgroup $[\mathcal{S}_i, \mathcal{S}_j]$ is trivial if $i + j > c$. Therefore the commutator $[k_i, k_j]$ is in the kernel of $\phi$ whenever the sum of the weights of the commutators $\beta(i)$ and $\beta(j)$ is larger than $c$. Hence $R \subseteq \ker(\phi)$. On the other hand, $N^*$ by construction does not introduce relations other than those found in Theorem 2.2. Hence $R = \ker(\phi)$, and $N^* \cong \mathcal{K}/R = N_{n,c}$.

We complete the proof by showing that $N^* = N$. If $c_i$ is a generator of $N$ that is not a simple left normed commutator then $c_i = [c_q, c_p]$, where $w_q > 1$ and $w_p > 1$. If $c_q$ and $c_p$ are simple commutators then $c_i$ is a product of the generators of $N^*$. If either $c_q$ or $c_p$ is not a simple commutator then we repeat the process and determine that all generators of $N$ are products of simple commutators that generate $N^*$. □
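The ranks that appear in this proof are counts of basic commutators of a fixed weight. As a small numerical aid, the following Python sketch computes $M(n, w)$ using Witt's formula $M(n,w) = \frac{1}{w}\sum_{d \mid w} \mu(d)\, n^{w/d}$. Witt's formula is a standard fact that is not stated in the text above, and the function names `mobius` and `witt` are our own, so this should be read as an illustration under those assumptions rather than as part of the argument.

```python
from math import comb


def mobius(d: int) -> int:
    """Moebius function mu(d), computed by trial division."""
    result = 1
    p = 2
    while p * p <= d:
        if d % p == 0:
            d //= p
            if d % p == 0:  # p^2 divides the original argument
                return 0
            result = -result
        p += 1
    if d > 1:  # one prime factor is left over
        result = -result
    return result


def witt(n: int, w: int) -> int:
    """M(n, w): number of basic commutators of weight w on n generators.

    Uses Witt's formula, an assumption imported from the literature.
    """
    total = sum(mobius(d) * n ** (w // d) for d in range(1, w + 1) if w % d == 0)
    return total // w


# In class 2 the rank of the derived subgroup is M(n, 2) = C(n, 2).
assert all(witt(n, 2) == comb(n, 2) for n in range(1, 8))

# Number of basic commutators of weights 2 and 3 on 3 generators.
print(witt(3, 2) + witt(3, 3))
```

The assertion checks the identity $M(n,2) = \binom{n}{2}$ used in the case $c = 2$ of the proof.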


The following result from Gaglione and Spellman⁷ enables us to provide precise values for the rank $s$ of $\mathcal{K}$ and for $f$ in Theorem 1.1 in terms of $n$ and $c$.

Proposition 2.1. Let $m$ and $w$ be positive integers larger than 1. Then the value of $|S_{m,w} \setminus S_{m,w-1}|$, the number of simple left normed commutators of $C_{m,w}$ of weight exactly $w$, is

$$(w - 1)\binom{m+w-2}{w}.$$

We derive an immediate consequence.

Corollary 2.1. Let $m$ and $w$ be positive integers greater than 1. Then the value of $|S_{m,w} \setminus S_{m,w-2}|$ is

$$\binom{m+w-2}{w}\left((w-1) + \frac{w(w-2)}{m+w-2}\right).$$

Proof. By Proposition 2.1,

$$\begin{aligned}
|S_{m,w} \setminus S_{m,w-2}| &= |S_{m,w-1} \setminus S_{m,w-2}| + |S_{m,w} \setminus S_{m,w-1}| \\
&= (w-2)\binom{m+w-3}{w-1} + (w-1)\binom{m+w-2}{w} \\
&= (w-2)\frac{(m+w-3)!}{(w-1)!\,(m-2)!} + (w-1)\binom{m+w-2}{w} \\
&= \binom{m+w-2}{w}\left((w-1) + \frac{w(w-2)}{m+w-2}\right).
\end{aligned}$$

□
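The two counting formulas above can be checked numerically. The sketch below is only a sanity check under one assumption that does not come from the text: it treats the simple left normed basic commutators $[x_{i_1}, x_{i_2}, \ldots, x_{i_w}]$ as the index tuples with $i_1 > i_2 \leq i_3 \leq \cdots \leq i_w$, which is the usual description in the literature. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product
from math import comb


def simple_of_weight(m: int, w: int) -> int:
    """Proposition 2.1: (w - 1) * C(m + w - 2, w) for m, w > 1."""
    return (w - 1) * comb(m + w - 2, w)


def brute_force(m: int, w: int) -> int:
    """Count index tuples with i1 > i2 <= i3 <= ... <= iw.

    Assumption: this describes the simple left normed basic commutators
    of weight w on m generators; it is not taken from the text.
    """
    return sum(
        1
        for idx in product(range(1, m + 1), repeat=w)
        if idx[0] > idx[1] and all(idx[j] <= idx[j + 1] for j in range(1, w - 1))
    )


def top_two_weights(m: int, w: int) -> int:
    """Closed form of Corollary 2.1 (weights w - 1 and w together)."""
    value = comb(m + w - 2, w) * ((w - 1) + Fraction(w * (w - 2), m + w - 2))
    assert value.denominator == 1
    return int(value)


for m in range(2, 5):
    for w in range(3, 6):
        assert simple_of_weight(m, w) == brute_force(m, w)
        assert top_two_weights(m, w) == simple_of_weight(m, w - 1) + simple_of_weight(m, w)
```

The loop starts at $w = 3$ so that both weights $w - 1$ and $w$ fall in the range covered by Proposition 2.1.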

Using Proposition 2.1 we may also determine a formula for the number $|S_{n,w} \setminus S_{n,1}|$ of simple left normed commutators of $C_{n,w}$ of weight at least 2.

Proposition 2.2. Let $w$ be a positive integer greater than 1. Let $S_w^* = S_{n,w} \setminus S_{n,1}$ be the set of simple left normed commutators of $C_n$ of weights $2, \ldots, w$. Then

$$|S_w^*| = \sum_{j=2}^{w} |S_{n,j} \setminus S_{n,j-1}| = \frac{(n+w-1)!\,(wn - w - n) + w!\,n!}{w!\,n!}.$$

Proof. The proof is by induction on $w$ for each fixed value of $n$. For $w = 2$, the right side gives

$$\frac{(n+1)!\,(n-2) + 2\,n!}{2\,n!} = \frac{(n+1)(n-2) + 2}{2} = \frac{n(n-1)}{2},$$

which is equal to $|S_{n,2} \setminus S_{n,1}| = \binom{n}{2}$. Suppose that the formula holds for $w = k \geq 2$, that is, that

$$|S_k^*| = \sum_{j=2}^{k} |S_{n,j} \setminus S_{n,j-1}| = \frac{(n+k-1)!\,(kn - k - n) + k!\,n!}{k!\,n!}.$$

Then, by the inductive hypothesis and Proposition 2.1,

$$\begin{aligned}
|S_{k+1}^*| &= |S_k^*| + |S_{n,k+1} \setminus S_{n,k}| \\
&= \frac{(n+k-1)!\,(kn - k - n) + k!\,n!}{k!\,n!} + k\binom{n+k-1}{k+1} \\
&= \frac{(n+k-1)!\,(kn - k - n) + k!\,n!}{k!\,n!} + k\,\frac{(n+k-1)!}{(k+1)!\,(n-2)!} \\
&= \frac{(k+1)\bigl((n+k-1)!\,(kn - k - n) + k!\,n!\bigr)}{(k+1)!\,n!} + \frac{kn(n-1)\,(n+k-1)!}{(k+1)!\,n!} \\
&= \frac{(n+k-1)!\,(n+k)(nk - k - 1) + (k+1)!\,n!}{(k+1)!\,n!} \\
&= \frac{(n+k)!\,\bigl(n(k+1) - (k+1) - n\bigr) + (k+1)!\,n!}{(k+1)!\,n!},
\end{aligned}$$

which shows that the formula holds for $w = k + 1$, and thus the formula holds for all integers $w \geq 2$. □
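As a cross-check of the induction, the closed form can be compared with the direct sum of the weight-by-weight counts from Proposition 2.1. The following sketch does this for a small range of $n$ and $w$; the function names are our own and the range is arbitrary.

```python
from math import comb, factorial


def weights_two_to_w(n: int, w: int) -> int:
    """Sum of the counts of Proposition 2.1 over the weights j = 2, ..., w."""
    return sum((j - 1) * comb(n + j - 2, j) for j in range(2, w + 1))


def closed_form(n: int, w: int) -> int:
    """Right-hand side of Proposition 2.2."""
    numerator = factorial(n + w - 1) * (w * n - w - n) + factorial(w) * factorial(n)
    denominator = factorial(w) * factorial(n)
    assert numerator % denominator == 0  # the quotient must be an integer
    return numerator // denominator


for n in range(2, 7):
    for w in range(2, 9):
        assert closed_form(n, w) == weights_two_to_w(n, w)
```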

In particular, when $w = c - 2$,

$$|S_{c-2}^*| = \frac{(n+c-3)!\,\bigl((c-2)n - (c-2) - n\bigr) + (c-2)!\,n!}{(c-2)!\,n!} = \frac{(n+c-3)!\,(cn - 3n - c + 2) + (c-2)!\,n!}{(c-2)!\,n!},$$

which is the value of the rank $s$ of $\mathcal{K}$. We thus obtain the following refinement of the statement of our main result.

Theorem 2.3. Let $\mathcal{N}_{n,c}$ be the free nilpotent group of class $c \geq 1$ and rank $n > 2$. Then

$$\mathcal{D}_{n,c} \cong N_{n,c} \times F_f^{ab},$$

where

$$f = \binom{n+c-2}{c}\left((c-1) + \frac{c(c-2)}{n+c-2}\right)$$

and $N_{n,c}$ has a minimal cardinality generating set with

$$s = \frac{(n+c-3)!\,(cn - 3n - c + 2) + (c-2)!\,n!}{(c-2)!\,n!}$$

generators.
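To get a feeling for the sizes involved, the two expressions of Theorem 2.3 can be evaluated for small parameters. The sketch below is a plain evaluation of the stated formulas for $c \geq 3$; the function names `free_abelian_rank` and `generator_count` are hypothetical labels for $f$ and $s$.

```python
from fractions import Fraction
from math import comb, factorial


def free_abelian_rank(n: int, c: int) -> int:
    """The rank f of the free abelian factor in Theorem 2.3."""
    value = comb(n + c - 2, c) * ((c - 1) + Fraction(c * (c - 2), n + c - 2))
    assert value.denominator == 1
    return int(value)


def generator_count(n: int, c: int) -> int:
    """The minimal number s of generators in Theorem 2.3 (needs c >= 3)."""
    numerator = factorial(n + c - 3) * (c * n - 3 * n - c + 2)
    numerator += factorial(c - 2) * factorial(n)
    denominator = factorial(c - 2) * factorial(n)
    assert numerator % denominator == 0
    return numerator // denominator


for n, c in [(3, 3), (3, 4), (4, 5)]:
    print(n, c, free_abelian_rank(n, c), generator_count(n, c))
```

For $n = 3$ and $c = 3$ the generator count is 0, which matches the fact that no simple commutators of weight between 2 and $c - 2 = 1$ exist.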


3. Application

In this section we apply Theorem 1.1 to describe the structure of the nonabelian tensor squares of the free nilpotent groups of finite rank. Let $G$ be any group. Then the group $G \otimes G$ generated by the symbols $g \otimes h$, where $g, h \in G$, subject to the relations

$$gh \otimes k = ({}^g h \otimes {}^g k)(g \otimes k) \quad \text{and} \quad g \otimes hk = (g \otimes h)({}^h g \otimes {}^h k)$$

for all $g$, $h$, and $k$ in $G$, where ${}^x y = xyx^{-1}$ for $x, y \in G$, is called the nonabelian tensor square of $G$. Let $\nabla(G)$ be the subgroup of $G \otimes G$ generated by the set $\{g \otimes g \mid g \in G\}$. The group $\nabla(G)$ is a central subgroup of $G \otimes G$ (Brown and Loday⁴). The factor group $G \otimes G / \nabla(G)$ is called the nonabelian exterior square of $G$, denoted by $G \wedge G$. For elements $g$ and $h$ in $G$, the coset $(g \otimes h)\nabla(G)$ is denoted $g \wedge h$. Hence $G \otimes G$ is a central extension of $G \wedge G$ by $\nabla(G)$ and we have the short exact sequence

$$1 \longrightarrow \nabla(G) \longrightarrow G \otimes G \longrightarrow G \wedge G \longrightarrow 1.$$

The following theorem from Blyth et al.³ provides the basic structure for the nonabelian tensor square of a free nilpotent group of finite rank.

Theorem 3.1. Let $G = \mathcal{N}_{n,c}$ be the free nilpotent group of class $c$ and rank $n > 1$. Then

$$G \otimes G \cong \Gamma(G/G') \times G \wedge G,$$

where $\Gamma(G/G')$ is the Whitehead quadratic functor defined by Whitehead.⁸

Since the abelianization of $\mathcal{N}_{n,c}$ is a free abelian group of rank $n$, the group $\Gamma(\mathcal{N}_{n,c}^{ab})$ is isomorphic to $F^{ab}_{\binom{n+1}{2}}$ (Whitehead⁸). It follows from Corollary 2 of Brown et al.⁹ that $\mathcal{D}_{n,c+1}$ is isomorphic to $\mathcal{N}_{n,c} \wedge \mathcal{N}_{n,c}$. Putting these facts together we obtain Corollary 1.7 of Blyth et al.,³ which we now state.

Corollary 3.1. Let $G = \mathcal{N}_{n,c}$ be the free nilpotent group of class $c$ and rank $n > 1$. Then

$$G \otimes G \cong \mathcal{D}_{n,c+1} \times F^{ab}_{\binom{n+1}{2}}.$$

The following theorem combines our detailed description of $\mathcal{D}_{n,c+1}$ from Theorem 2.3 with Corollary 3.1.

Theorem 3.2. Let $G = \mathcal{N}_{n,c}$ be the free nilpotent group of class $c$ and rank $n > 2$. Then

$$G \otimes G \cong N_{n,c+1} \times F_g^{ab},$$

where

$$g = \binom{n+c-1}{c+1}\left(c + \frac{(c+1)(c-1)}{n+c-1}\right) + \binom{n+1}{2}$$

and $N_{n,c+1}$ has a minimal cardinality generating set with

$$\frac{(n+c-2)!\,\bigl((c+1)n - 3n - c + 1\bigr) + (c-1)!\,n!}{(c-1)!\,n!}$$

generators.

Acknowledgements

The authors thank the Institute for Global Enterprise in Indiana for its financial support of this research. The first and second authors thank the University of Evansville for its hospitality while visiting there. The second author thanks the Ministry of Science of Slovenia for supporting his postdoctoral leave to visit the University of Evansville.

References

1. S. Moran, A subgroup theorem for free nilpotent groups, Trans. Amer. Math. Soc. 103, pp. 495-515 (1962).
2. N. P. Gol'dina, Free nilpotent groups, Dokl. Akad. Nauk SSSR (N.S.) 111, pp. 528-530 (1956).
3. R. D. Blyth, P. Moravec and R. F. Morse, On the nonabelian tensor squares of free nilpotent groups of finite rank, in Computational Group Theory and the Theory of Groups, eds. L.-C. Kappe, A. Magidin and R. F. Morse (American Mathematical Society, Providence, RI, 2008).
4. R. Brown and J.-L. Loday, Van Kampen theorems for diagrams of spaces, Topology 26, pp. 311-335 (1987), with an appendix by M. Zisman.
5. R. K. Dennis, In Search of New "Homology" Functors having a Close Relationship to K-theory, unpublished preprint.
6. C. Miller, The second homology group of a group; relations among commutators, Proc. Amer. Math. Soc. 3, pp. 588-595 (1952).
7. A. M. Gaglione and D. Spellman, Extending Witt's formula to free abelian by nilpotent groups, J. Algebra 126, pp. 170-175 (1989).
8. J. H. C. Whitehead, A certain exact sequence, Ann. of Math. (2) 52, pp. 51-110 (1950).
9. R. Brown, D. L. Johnson and E. F. Robertson, Some computations of nonabelian tensor products of groups, J. Algebra 111, pp. 177-202 (1987).

A RECURRENCE RELATION FOR THE NUMBER OF FREE SUBGROUPS IN FREE PRODUCTS OF CYCLIC GROUPS

T. CAMPS
Fachbereich Mathematik, Universität Dortmund, Vogelpothsweg 87, 44137 Dortmund, Germany
E-mail: [email protected]
www.mathematik.uni-dortmund.de

M. DORFER
Am Beutenbach 24, 71254 Ditzingen, Germany
E-mail: [email protected]

G. ROSENBERGER
Fachbereich Mathematik, Universität Dortmund, Vogelpothsweg 87, 44137 Dortmund, Germany
E-mail: [email protected]
www.mathematik.uni-dortmund.de

Dedicated to A. M. Gaglione on the occasion of his 60th birthday.

In this note we consider a free product $G$ of finitely many cyclic groups of finite or infinite order and develop an explicit and straightforward recurrence formula for the number of free subgroups of $G$. The formula involves only the given group theoretic data, namely the number of free product factors and the orders of the given cyclic groups.

Keywords: Free products, free subgroups, recurrence formulas for free subgroups.

Mathematical classification: 20E06, 20E07, 20F05, 20H10.

1. Introduction

We consider free products of finitely many finite or infinite cyclic groups of the following form:

$$G = C_{r_1} * C_{r_2} * \cdots * C_{r_d} * \underbrace{C_\infty * \cdots * C_\infty}_{u \text{ factors}} \tag{1}$$

where $d$ and $u$ are nonnegative integers with $d + u \geq 2$, $r_i \in \mathbb{N}$, $r_i \geq 2$ ($i \in \{1, 2, \ldots, d\}$), $r_1 \geq r_2 \geq \cdots \geq r_d$. Moreover we define $t := 1$ if $d = 0$ and $t := \operatorname{lcm}(r_1, r_2, \ldots, r_d)$ if $d \geq 1$. In this case let $t = r_i s_i$ for all $i \in \{1, 2, \ldots, d\}$. For these groups, Stothers [10] gave a recurrence relation for $F_G(n)$, the number of free subgroups of $G$ of index $n$, namely $F_G(n) = 0$ if $n \not\equiv 0 \pmod{t}$ and, for $k \in \mathbb{N}$,

$$F_G(tk) = \frac{1}{(tk-1)!}\, h(G, S_{tk}) - \sum_{j=1}^{k-1} \frac{1}{(tk - tj)!}\, h(G, S_{tk-tj})\, F_G(tj) \tag{2}$$

where

$$h(G, S_{tk}) = \left(\prod_{i=1}^{d} \frac{(tk)!}{(s_i k)!\; r_i^{s_i k}}\right) \bigl((tk)!\bigr)^u \tag{3}$$

is the number of homomorphisms from $G$ to the symmetric group on a set $M$ of $tk$ elements satisfying the condition that the preimages of the stabilizers of the elements of $M$ are free or trivial.

The disadvantage of this recurrence relation is that one has to determine the number of homomorphisms from $G$ into symmetric groups. The advantage of this recurrence relation is that it is easy to extend it analogously to a general recurrence relation for arbitrary free products $H = H_1 * \cdots * H_n$, $n \geq 2$, which is difficult to calculate explicitly but from which one gets a nice asymptotic behaviour for the number of subgroups in $H$ of a given index (see for instance [8] and [9]).

An earlier general version for the number of subgroups of finite index in $H$ is given by Dey [1]. This version was used to count the subgroups of finite index for the modular group $C_2 * C_3$ and more generally for the Hecke groups, which are free products $C_2 * C_q$ of two cyclic groups, one of order 2 and one of order $q \geq 3$ (see for instance [2], [3], [4], [5] and [7]).

Apart from formula (2), Stothers [10] also studied a second recurrence relation for the number of free subgroups of $G$ of index $n$ and gave it explicitly in the simplest cases $C_\infty * C_\infty$, $C_2 * C_\infty$, $C_3 * C_3$, $C_2 * C_3$, $C_2 * C_4$, $C_2 * C_2 * C_2$ and $C_2 * C_2$. In this paper we present the formula for all free products of type (1), thus generalizing the results of Stothers, and furthermore of Wohlfahrt [11] for the modular group $C_2 * C_3$ and of Kern-Isberner [6] for some Hecke groups $C_2 * C_q$, $q \geq 3$. Before we can state the main result, we need a couple of preliminary observations.

2. Preliminaries

For a group $G$ of type (1) let

$$\sigma_0 := 1, \qquad \sigma_n := \frac{h(G, S_{tn})}{(tn)!} \quad (n \in \mathbb{N}), \qquad f := \sum_{n=0}^{\infty} \sigma_n z^n,$$

$$a_n := F_G(tn) \quad (n \in \mathbb{N}), \qquad g := \sum_{n=0}^{\infty} a_{n+1} z^n$$

($f$, $g$ formal power series). With these definitions, (2) is equivalent to the differential equation

$$g = t\,\frac{f'}{f} \tag{4}$$

which was used by Wohlfahrt in [11] to derive the recurrence relation for the number of subgroups in the modular group.

Lemma 2.1. Let $n \in \mathbb{N}_0$ be arbitrary but fixed. Then we have

which was used by Wohlfahrt in [11] to derive the recurrence relation for the number of subgroups in the modular group. Lemma 2.1. Let n E No be arbitrary but fixed. Then we have

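$$n^k - n(n-1)\cdots(n-k+1) = \sum_{l=1}^{k-1} n^l\, d_{k,l} \qquad (k \in \mathbb{N}) \tag{5}$$

with

$$d_{k,l} = (-1)^{k-l-1}\,(k-1)! \sum_{\substack{(\varphi_1, \ldots, \varphi_{l-1}) \in \{1, 2, \ldots, k-1\}^{l-1} \\ 1 \leq \varphi_1 < \cdots < \varphi_{l-1} \leq k-1}} \frac{1}{\varphi_1 \cdots \varphi_{l-1}}. \tag{6}$$

[...]

[...] $\geq 2$ and $k \in \{1, 2, \ldots, m\}$ by

$$S_k^m\bigl(g, g', \ldots, g^{(m-k)}\bigr) := \Bigl(S_k^{m-1}\bigl(g, g', \ldots, g^{(m-1-k)}\bigr)\Bigr)' + g\, S_{k-1}^{m-1}\bigl(g, g', \ldots, g^{(m-k)}\bigr)$$

with $S_1^1(g) := g$ and

$$S_0^{m-1}\bigl(g, g', \ldots, g^{(m-1)}\bigr) := 0 =: S_m^{m-1}\bigl(g, g', \ldots, g^{(m-1-m)}\bigr)$$

for all $m \geq 2$.

Proof of step 1: Induction on $m$ using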

Step 2: Every summand in S'k (g, g', ... , g(m-k)) {1,2, ... ,m}) is of type

(m

N, k

where a E N, li E {O, 1, ... , m - k} (i E {l, 2, ... , k}) and m-k.

h +l2 + ... +lk

E

Proof of step 2: Induction on m. We only look at the induction step from m - 1 to m (m ;::: 2). The cases k = 1 and k = m are trivial because ' ... ,g (m-I)) = S Im ( g,g,

sm-I ( ' I g,g,

(then use the induction hypothesis) and S: (g) For k E {2,3, ... ,m -I}, m

I

Sk (g,g, ... ,g

(m-k)

m-I

) = (Sk

I

... ,g (m-2)) I

= gm.

(g,g, ... ,g

m-I +g Sk-I

( ' g,g,

(m-I-k)

)) '

... ,g (m-k)) ,

E

=


so that every summand of Sf (g, g', ... , g(m-k)) is either one of (s km-l ( g, 9' , ... , 9 (m-l-k)))' or 0 f 9 Sm-l k-l ( g, 9' , ... , 9 (m-k)) an d th ose are known by the induction hypothesis. Step 3: Let (h, ... , lk) E N~ with II ::::: l2 ::::: ... ::::: lk = m - k where mEN and k E {I,2, ... ,m}. Then Sf (g,g', ... ,g(m-k)) contains a summand which is equal to the product of g(lI) g(l2) ... g(lk) and a suitable positive integer. Proof of step 3: Induction on m. The case m = 1 is trivial. The step from m - 1 to m (m ::::: 2): Let k E {I,2,oo.,m}, (l1,oo.,lk) E N~ with h + l2 + ... + lk = m - k. a) Let lk = O. Thus k ::::: 2. Belonging to (ll, ... , lk) we have the product

h:::::

l2

>

where (h, ... ,lk-l) is a (k - I)-tuple with II ::::: l2 ::::: ... ::::: lk-l and h + l2 + ... + lk-l = (m - 1) - (k - 1), so that there is (by induction hypothesis) a summand with respect to (l1, ... , lk-d in S~11 (g,g', ... ,g(m-k)) with the desired property. Then use the refor sm ( 9' , ... , 9 (m-k)) . · currence re Ia t IOn kg, b) Let lk =I- O. Thus k :s: m - l. Belonging to (h, ... , lk) we have the product

where (h, ... , lk-l, lk - 1) is a k-tuple with II +l2 + .. .+lk-l +lk -1 = (m - l)-k and II ::::: l2 ::::: ... ::::: lk-l ::::: lk-I ::::: 0, so that there is (by the induction hypothesis) a summand with respect to (h, ... , h-l, lk - 1) in S;;-1 (g,g', ... ,g(m-l-k)) with the desired property. Then use the recurrence relation for Sf (g, g', ... , g(m-k)) Step 2 and 3 together yield sm ' ... ,g (m-k)) = k ( g,g, (11 , ... ,lk)EN~

h +l2+ ... +lk=m-k2:l12:l22: ... 2:lk


with coefficients eZ:, ... ,lk E N (m E N, k E {I, 2, ... , m}) still to be determined, and we finally see

(m E N) by step 1. It remains to prove the recurrence relation for the coefficients. Obviously = 1 because 9 = Sf (g) = e6g(0). Now let m ;::: 2 and k E {I, 2, ... , m}, moreover define j1, j2, ... , jc-1 for a positive integer c to be those indices in {I, 2, ... , k - I} for which l ji > l ji +1 (i E {1,2, ... ,c-1}), and finally let jc:= k. (i) follows immediately from the fact, that eZ:, ... h g (lll ... g(lk) is a summand of Sf (g,g', ... ,g(m-k)), and the recurrence relation from step 1 provides us with the key information to realize (i). (iii) will be proved by induction on m. We will refer to the induction hypothesis by "IH". The case 6 = k being clear, we can assume (h, ... ,lk) E N~ with h ;::: l2 ;::: ... ;::: l8 > 0, lH1 = lH2 = ... = lk = 0 and h + l2 + ... + lk = m - k for a 6 E {O, 1, ... , k - I}. The case m = 2 is trivial. ... = lk = 0, so The step from m - 1 to m (m ;::: 3): If 6 = 0 then h

e6

= m and eml1"", 1k = 1 = (k':O) . 1. If 6 E {1,2, ... ,k -I} (=} c;::: 2) then k

Zc-1 =

jc-1

= 6 and

if I { k - 6 + 1 if

l8

> 1,

l8 =

1.

It follows

IH

=

- 1 (k m-16) e

m-1-(k-1)H L" ... ,lo

+

t; c-2

( Zi

k- 6 e . . .

m -1 )

m-1-kH

l" ... ,IJi-

m-1 +Zc-1 eL, , ... ,10-1 ,10 -1,0, ... ,0'

Moreover we have (in case of

l8

"

IJi- 1 ,11i+ 1 , ... '/0

(10)

> 1)

m-1 IH Zc-1 e L" ... ,10_1,10-1,0, ... ,0

=

(m -61) k _

m-1-k+8 eL" ... ,10_1,10-1


and (in case of lo = 1) m-1 !!i zc- 1e l l , ... ,10_1,l0-I,O, ... ,o -

(k

5 + 1) ( m - 1 ) k - 5+1

-

m-1-k+o-1 ell, ... ,lo_l

- 1) (m5-- 1(5--k 1)+ 5) (m k- 5 IH

(m -51) k -

m-1-k+o-o+o-l ell, ... ,lo_l

m-l-k+o ell, ... ,lo_l,O·

So (10) together with a further application of (i) gives em h, ... ,lk -

m- 1)

m-k+o k - 1 - 5 e h, ... ,lo

(

+

(mk -- 51) e

m-k+o 11, ... ,1 0 -

(ii): This is the lengthiest part with numerous calculations. The case m = 2 is clear. The step from m - 1 to m (m ~ 3): We first consider the case lk = 0, that is it ~ ... ~ lo > 0 and lHI = ... = lk = 0 for a 5 E {O, 1, ... , k - I}. Then of course k ~ 2 and (because the formula is correct for 5 = 0) we can assume 5 ~ 1, thus c ~ 2. By applying (iii), the induction hypothesis and once more (iii) it follows

c-1

L

i=1

m m - k

c-1 (

L

+5

m-1 ( ) m-l-1·. l· eh, ...

,I/~l,lji+l, ... ,lo,O, ... ,o

j,

m -1 ) l..

i=1

m-l-1'

eh,···,ljiJ~l,lji+l, ... ,lo,O, ... ,o

j,

c-l + m k- -k 5+ 5 L i=l

(

1)

m -

l·

e

m-l-1 J,

h, ... ,lji-l,lji+l,"',lo,o, ... ,o

j,

Because of jc = k and lk = 0 we get (

m lje

1)

m (iii) ( eh, ... ,lk_l

=

1)

m k - 1- 5

m-k+o ell, ... ,lo

•

(11)


=

~

((

i=l

==

km - k

m- 1) (m - k1.+ 6-1)

k-1-O'

].

6+ 6 L (m1·- 1) c-1

i=l

e

m-1-1 .. Jt

11, ... ,lji- 1,lji+ 1 , ... ,18,0, ... ,0 ,

J'/.

so the asserted formula can be derived from (11). Let now lk =1= O. Omitting the trivial case k = 1, we suppose k If c = 1 then h = l2 = ... = lk and Zl = 1, so

~

In the following, we examine the case c ~ 2. We introduce here the Notation:

e~,~l'~klll~i

:=

We consider several cases of values for i and ji. i = 1 andji ~ 2 or i E {2,3, ... ,c-1}: Case 1: lj.+1 < lji - l. a) lji = lji- 1 . Then

e~,~~,~:j~1,lji+1, ... ,lk·

2.


b)

Iji

< Iji- 1 . Then

Case 2: Iji+1 = Iji - 1. a) Iji = lji-l' Then: m-l

eLI ,... ,lji -1 ,lji -1,lji+1 , ... ,lk

i = 1 and]1 = 1: Case 1: h - 1 = Z2. Then:

Case 2: II - 1 > Z2. Then:

er';=t,12, ... ,lk

~ ~ (mz: 2)e;,;~;,~:~.~.'lklljr + (~= De~,~.I,lkh.

i = c:

Case 1: Zk-l

= Ik. Then:


Case 2: lk-I > lk. Then:

e

m-I

It, ... ,lk-l,h-I

IH c-I ( m -) 2 e m-2-1·Jr =

L

r=1

l·

11, ... ,lk_l,lk-II1jr

m -) 2 m-I-1k +( l _ 1 e It, ... ,h-l . k

)r

Now define integers lo and lk+1 such that lo - 1 > hand lHI < lk - 1. Then the four investigated special cases for i = 1, ji = 1 and i = c (in the above order) may uniquely be assigned to the formerly discussed cases 2.b), 1.b), La), 1.b), and by

M I := {i 11:::; i:::; C,lji+1 < lji -1,lji = lji-d,

iji+l

L

+

i=l

l j i - 1 =lji

L

+

i=l iji -1>iji+1

+

L r=l ljr- 1 =ljr

c

+

L

(12)

r=l ijr -l>ijr+l

On the other hand

Let r 2': 2. Case 1: ljr-l

-

1=

ljr,

that is

ljr_l

=

ljr-l.


em-l-ljr

it ,... ,ljr-1 ,ljr+l '" .,lk

Case 2: a) ljr_l

ljr-1 -

1

=

=

ljr'

ljr'

Then:

Let r = 1. a) jl = 1. Then:

b) jl 2': 2. Then:

which can uniquely be assigned to the cases 1.b) and 2.b) using the convention for la and moreover jo := O. So (13) is equal to


c

1'=2 ljr_l-1=ljr

c

+

L

1'=1

Ijr -1>ljr+1

c-1

+

L

1'=1 Ijr-1=ljr+1

which is equal to (12). This concludes the proof of Lemma 2.3.

o

3. The Main Results Theorem 3.1. For a group of type (1) let an

= Fc (tn),

then

(14) and

(15)


(n ~ 1) where hand Lm (m E {1, 2, ... , h}) are given thus: d

h := t (u

+d-

LSi + 1,

1) -

i=1

In the last formula

with T:= ( 1, ... ,1,2, ... ,2 , ... ,t-1, ... ,t-1,t, ... ,t) =:(Tl,T2, ... ,Th), ~

~

' - v - ' "-v--"

u+d-l-w1 u+d-l-w2

u+d-l-Wt_1

u times

wI" is the number of sets Vrl! ... , Vrd of positive multiples of '1'1, which contain fL (fL E {1, 2, ... t - 1}),

1 T ... T

L

bT,w:=

J1

( Ti1,···,Tjw )

... , 'I'd

(wE{1,2, ... ,h})

Jw

l:Sh < ... <jw:S h

Moreover, the d i ",i,,+1 and el,', ... ,lk are defined as in (6) on page 56 and (9) on page 58 respectively.

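Proof. By (3) (see page 55), we have

$$\alpha_n = \frac{h(G, S_{tn})}{(tn)!},$$

so

$$\alpha_{n+1} = \frac{\prod_{l=1}^{t} (tn+l)^{u+d-1}}{\prod_{i=1}^{d} \prod_{j=1}^{s_i} (tn + j r_i)}\; \alpha_n$$

or equivalently

$$t\,(n+1)\,\alpha_{n+1} = \frac{(tn+t)^{u} \prod_{l=1}^{t-1} (tn+l)^{u+d-1}}{\prod_{i=1}^{d} \prod_{j=1}^{s_i-1} (tn + j r_i)}\; \alpha_n \qquad (\text{if } d \ge 1) \tag{16}$$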


and

for all nonnegative integers $n$. In (16), the right side can be written as a polynomial in $n$. For $d \ge 1$ and $i \in \{1, 2, \dots, d\}$ let $V_{r_i}$ be the set of all positive multiples of $r_i$ and let $w_l$ be the number of $V_{r_i}$ containing $l$ ($l \in \{1, 2, \dots, t-1\}$). Then (16) becomes

$$t\,(n+1)\,\alpha_{n+1} = (tn+t)^{u} \left( \prod_{l=1}^{t-1} (tn+l)^{u+d-1-w_l} \right) \alpha_n \qquad (n \in \mathbb{N}_0). \tag{17}$$

(Using $t = 1$ if $d = 0$, the formula for the case $d = 0$ is contained therein.) The product

$$(tn+t)^{u} \prod_{l=1}^{t-1} (tn+l)^{u+d-1-w_l}$$

in (17) consists of

$$t\,(u + d - 1) - \sum_{i=1}^{d} s_i + 1 =: h \quad (\ge 1)$$

factors, so we can expand the right side of (17) as a polynomial in $n$ of degree $\ge 1$. For this purpose we define

$$T := (\underbrace{1,\dots,1}_{u+d-1-w_1},\ \underbrace{2,\dots,2}_{u+d-1-w_2},\ \dots,\ \underbrace{t-1,\dots,t-1}_{u+d-1-w_{t-1}},\ \underbrace{t,\dots,t}_{u\ \text{times}}) =: (T_1, T_2, \dots, T_h),$$

and we get

where

$$b_{T,m} := \sum_{1 \le j_1 < \dots < j_m \le h} \frac{1}{T_{j_1} \cdots T_{j_m}} \qquad (m \in \{1, 2, \dots, h\})$$

and $b_{T,0} := 1$. By virtue of (7) (see page 57) and

$$K_m = t^m\, \frac{((t-1)!)^{u+d-1}\, t^{u}\, b_{T,m}}{\prod_{i=1}^{d} r_i^{\,s_i-1} (s_i - 1)!} \qquad (m \in \{0, 1, \dots, h\})$$


we get

t (n + 1) an+! h

=Koa n +

L

n (n-1) ... (n-m+1)a n

m=l

for all n E No which is equivalent to the differential equation h

tf'

=

Ko!

+L

zm !(m)

m=l

or after division by

!

to

In terms of the formal power series g, the last equation reads

by (4) (page 56) and (9) (page 58). Observing that

comparison of the coefficients yields the desired result.

□


4. Examples

Formula (15) contains $(h-1)$-fold sums, so easy expressions can only be expected if $h \in \{1, 2\}$. The case $h = 1$ is possible only for $d \ge 2$, namely for the group $G = C_2 * C_2$, and (14), (15) give

$$a_1 = 1, \qquad a_{n+1} = L_1 a_n = a_n = 1 \qquad (n \ge 1),$$

as was shown by Stothers in [10].

The case $h = 2$: For $d = 0$ we have $h = 2$ if and only if $u = 2$, that is $G = C_\infty * C_\infty$. For $d = 1$ we have $h = 2$ if and only if $t = 2$ and $u = 1$, that is $G = C_2 * C_\infty$. For $d \ge 2$ we have $h = 2$ in exactly the following cases:

$G = C_3 * C_3$   ($d = 2$, $s_1 = s_2$),

$G = C_2 * C_3$, $G = C_2 * C_4$   ($d = 2$, $s_1 = s_2 + 1$),

$G = C_2 * C_2 * C_2$   ($d = 3$).
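The case distinction above can be checked numerically from the formula $h = t(u+d-1) - \sum s_i + 1$. The following is only a sketch under assumptions that are not spelled out in this section: $u$ is taken to be the number of infinite cyclic factors, $d$ the number of finite cyclic factors of orders $r_1, \dots, r_d$, $t$ their least common multiple (with $t = 1$ if $d = 0$, as the proof of Theorem 3.1 states), and $s_i = t / r_i$. The helper names `lcm_all` and `h_value` are hypothetical.

```python
from functools import reduce
from math import gcd


def lcm_all(values):
    """Least common multiple of positive integers (1 for an empty list)."""
    return reduce(lambda a, b: a * b // gcd(a, b), values, 1)


def h_value(finite_orders, u):
    """h = t(u + d - 1) - sum(s_i) + 1.

    Assumptions (not taken from the text): t = lcm of the finite orders,
    s_i = t / r_i, u = number of infinite cyclic free factors.
    """
    d = len(finite_orders)
    t = lcm_all(finite_orders)          # t = 1 when d = 0
    s = [t // r for r in finite_orders]
    return t * (u + d - 1) - sum(s) + 1


# (orders of the finite cyclic factors, number of infinite cyclic factors)
CASES = {
    "C2*C2": ([2, 2], 0),
    "Coo*Coo": ([], 2),
    "C2*Coo": ([2], 1),
    "C3*C3": ([3, 3], 0),
    "C2*C3": ([2, 3], 0),
    "C2*C4": ([2, 4], 0),
    "C2*C2*C2": ([2, 2, 2], 0),
}

H_VALUES = {name: h_value(orders, u) for name, (orders, u) in CASES.items()}

if __name__ == "__main__":
    for name, h in H_VALUES.items():
        print(f"{name}: h = {h}")
```

Under these assumptions the sketch returns $h = 1$ for $C_2 * C_2$ and $h = 2$ for every group in the list, in agreement with the text.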

For all of these groups, formula (15) is (because of eb = 1, 2I = 1, of the form an+l = Llan

+ L2 (n -

L2

1) an

+t

n-l

L ak+1 an-k-1

e6,o =

(n ~ 1)

k=O

where

with $K_1 = t K_0 b_{T,1}$, $K_2 = t^2 K_0 b_{T,2} = t^2$.

We calculate the following data table:

G                  K_0   T       b_{T,1}   b_{T,2}   K_1   K_2   L_1        L_2
C_∞ * C_∞          1     {1,1}   2         1         2     1     3          1
C_2 * C_∞          2     {1,2}   3/2       1/2       6     4     5          2
C_3 * C_3          2     {1,2}   3/2       1/2       9     9     6 = 2t     3 = t
C_2 * C_3          5     {1,5}   6/5       1/5       36    36    12 = 2t    6 = t
C_2 * C_4          3     {1,3}   4/3       1/3       16    16    8 = 2t     4 = t
C_2 * C_2 * C_2    1     {1,1}   2         1         4     4     4 = 2t     2 = t

so that we finally have for $G = C_\infty * C_\infty$:

$$a_{n+1} = (n+2)\,a_n + \sum_{k=1}^{n-1} a_k a_{n-k} \qquad (n \ge 1)$$

and for $G = C_2 * C_\infty$:

$$a_{n+1} = (2n+3)\,a_n + \sum_{k=1}^{n-1} a_k a_{n-k} \qquad (n \ge 1).$$

In all other cases:

$$a_{n+1} = t\,(n+1)\,a_n + \sum_{k=1}^{n-1} a_k a_{n-k} \qquad (n \ge 1).$$
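As a sanity check, the recurrence for $G = C_\infty * C_\infty$ can be iterated and compared with M. Hall's classical recursion $N_n = n \cdot n! - \sum_{k=1}^{n-1} k!\, N_{n-k}$ for the number of subgroups of index $n$ in the free group of rank 2 (every such subgroup is free, and $t = 1$ here). This is a minimal sketch: the starting value $a_1 = 1$ is an assumption, since formula (14) is not legible in this section, and the function names are hypothetical.

```python
from math import factorial


def free_subgroups_rank2(n_max):
    """Iterate a_{n+1} = (n + 2) a_n + sum_{k=1}^{n-1} a_k a_{n-k}.

    Assumption: a_1 = 1 (a free group has exactly one subgroup of index 1).
    """
    a = [0, 1]  # a[0] is an unused placeholder so that a[n] means a_n
    for n in range(1, n_max):
        convolution = sum(a[k] * a[n - k] for k in range(1, n))
        a.append((n + 2) * a[n] + convolution)
    return a[1:]


def hall_rank2(n_max):
    """Hall's recursion for the number of index-n subgroups of F_2."""
    counts = [0]  # counts[0] is an unused placeholder
    for n in range(1, n_max + 1):
        correction = sum(factorial(k) * counts[n - k] for k in range(1, n))
        counts.append(n * factorial(n) - correction)
    return counts[1:]


if __name__ == "__main__":
    print(free_subgroups_rank2(5))
    print(hall_rank2(5))
```

Both functions produce the same first terms, which supports the "+" sign in front of the sum.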

These are the special cases considered by Stothers in [10]. (The "-" in front of the sum has to be corrected there.) The formula for $G = C_2 * C_3$ may also be found in [11]. If we take the group $G = C_2 * C_6$ and use (14) and (15), we deduce the result from [6].

References

1. I. M. S. Dey, Schreier Systems in Free Products; Proc. Glasgow Math. 7 (1965), 61-79.
2. B. Fine, D. Spellman, Counting Subgroups of the Hecke Groups; Int. J. of Algebra and Computation 3 (1993), 43-49.
3. M. Grady, Counting the Subgroups of an Infinite Group; JCMCC 20 (1996), 89-96.
4. M. Grady, M. Newman, Counting Subgroups of Given Index in Hecke Groups; Contemporary Math. 143 (1993), 431-436.
5. W. Imrich, On the Number of Subgroups of a Given Index in $SL_2(\mathbb{Z})$; Archiv d. Math. 31 (1978), 224-231.
6. G. Kern-Isberner, Rekursionsformeln für die Anzahl von Normalteilern in freien Produkten zyklischer Gruppen; Dissertation, Dortmund, 1985.
7. Mong-Lung Lang, Chong-Hai Lim, Ser-Peow Tan, Subgroups of the Hecke Groups with Small Index; Lin. and Multil. Algebra 35 (1993), 75-77.
8. T. Müller, Counting Free Subgroups of Finite Index; Archiv d. Math. 59 (1992), 525-533.
9. T. Müller, Subgroup Growth of Free Products; Invent. Math. 126 (1996), 111-131.
10. W. W. Stothers, Free Subgroups of the Free Product of Cyclic Groups; Math. Comp. 32 (1978), 1274-1280.
11. K. Wohlfahrt, Über einen Satz von Dey und die Modulgruppe; Arch. Math. 29 (1977), 455-457.

The Baumslag-Solitar Groups: A Solution for the Isomorphism Problem

Anthony E. Clement

Department of Mathematics, Brooklyn College, The City University of New York, 2900 Bedford Avenue, Brooklyn, NY 11210, USA

Abstract: The object of this note is to give a concise proof of the following theorem: Let $G = B(m,n) = \langle a, b;\ a^{-1} b^{m} a = b^{n} \mid m \ne 0,\ n \ne 0,\ m, n \in \mathbb{Z} \rangle$ and $H = B(m',n') = \langle x, y;\ x^{-1} y^{m'} x = y^{n'} \mid m' \ne 0,\ n' \ne 0,\ m', n' \in \mathbb{Z} \rangle$. Then $G \cong H$ if and only if $m = m'$ and $n = n'$.

Keywords: Baumslag-Solitar groups, isomorphism problem, semi-direct products

1. Introduction

The theorem quoted in the abstract above solves, in principle, the isomorphism problem for the class $G = B(m,n) = \langle a, b;\ a^{-1} b^{m} a = b^{n} \mid m \ne 0,\ n \ne 0,\ m, n \in \mathbb{Z} \rangle$ of the Baumslag-Solitar groups. D. I. Moldavanskii's proof [4] was rather different. The approach we take here is more direct and makes use of the fact that certain quotients of these groups can be expressed as semi-direct products of a very special kind. We capitalize on the inherent semi-direct product nature of $B(m,n)$ and analyze inherited actions of the infinite cyclic group on some specific subgroups of $B(m,n)$ and their quotients. We will prove the following:

Theorem 1.1. Let $G = B(m,n) = \langle a, b;\ a^{-1} b^{m} a = b^{n} \mid m \ne 0,\ n \ne 0,\ m, n \in \mathbb{Z} \rangle$ and let $H = B(m',n') = \langle x, y;\ x^{-1} y^{m'} x = y^{n'} \mid m' \ne 0,\ n' \ne 0,\ m', n' \in \mathbb{Z} \rangle$. Then $G \cong H$ if and only if $m = m'$ and $n = n'$.

2. Notation

The following notation is used throughout:

$G^{(1)} = [G, G]$ for the first derived group of $G$

G(2)= [G(l), G(1)l for the second derived group of G G = S ) 0, k E 7L} = gPa(b); (2) S = (... , bi , ... (i E 7L); ... , bi = bf-l' ... (i E 7L)), where bi = a-iba i , (i E 7L) ; (3) G/S ~ 7L. Now we turn to the proof of Theorem 1.1 in the forward direction. We first take care of the case n - m = O. Observe that (B(m,m) = gp(bm ) and B(m,m)/gp(bm ) ~ 7L * 7L m . If G ~ H, then n' = m' by Lemma 3.1. So since B (m, m) / gp( bm ) ~ 7L * 7L m and B (m' , m') / gp(bm') ~ 7L * 7L m" m = m' and n = n' follows. So we can assume from now on that n - m > O. Let T = {h E Hlhk E H(l) for some k > 0, k E Z}. Then, as in Lemma 3.2, we find T = gPH(Y) = (.. ·,Yi, ... (i E 7L); ... ,Yi' = Y~l, ... (i E Z)) and H/T ~ 7L. Now we are in position to formulate the following lemma whose proof relies on ideas similar to those for Lemma 3.1.


Lemma 3.3. Let a = B(m,n) with n - m > 0, H = B(m',n') with n'm' > 0, and
0, l' E Z, b' E B"}. Moreover, the relation a"-lb,ma" = b'n holds in L. To see this, observe that (the subscripted version) all-lb~mall = b~n holds in Land b' has the form b' = b~~1 ... b~:l, where ti = ±l. Since the b~'s commute, a"-lb,ma" = all-l(b~~1m ... b~~lm)all = 1 m II l-lb'E2ma" a"-1b'E1m II = b'E1nb'E2n b~Eln = (b'E1 b'El)n = a1I- b'E1 21 a a 22 ... 21 a 21 22 ... 2£ 21 ... 21 '"-.".-''"-.".-'

'"-.".-'

b'n. Similarly, analyzing actions inherited in analogous quotients of

H

=

T)(J

(x),

we see that T IT(l) = ( ... , Yi, ... (i E Z); ... , Yi' = Yi~l' ... (i E Z), ... , [Yi, Yj] 1, ... (i,j E Z)), whence

which implies that HIT(l) /QIT(l) = TIT(l) /QIT(l)

)(J

((xT(l»)QIT(l»)

=


and so

H/Q Letting M = follows, where 1, ... (i,j E Z)) {x"ky'l k E Z,

~

T/Q

) 0 'lj;) = V(¢» U V('lj;), where 0 E {V, i\, ----;}, and V(Vx¢» = V(3x¢» = V(¢» ,,{x}. We write ¢>(XI, ... ,Xn ) in the case when V(¢» (XI, ... ,Xn ) E .c(X) and ml, ... ,mn E M then one can define, following the conditions F1)-F3), the relation "¢> is true in M under the interpretation Xl ----; ml,.'" Xn ----; m n" (symbolically M F ¢>(ml, ... , m n )). It is convenient sometimes to view this relation as an n-ary predicate ¢>M on M. If h : X ----; M is an interpretation of variables then we denote ¢>h = ¢>M (h(XI)"'" h(xn)). A set of formulas h for every ¢> E . In this case one says that is realized in M. The following result is due to Malcev, it plays a crucial role in model theory. Theorem [Compactness Theorem] Let K be a class of C-structures and , 'lj; E .c(X) are called equivalent if ¢>h = 'lj;h for any interpretation h: X ----; M and any C-structure M. One of the principle results in mathematical logic states that any formula ¢> E .c( X) is equivalent to a formula 'lj; in the following form: (1)

where $Q_i \in \{\forall, \exists\}$ and $\psi_{ij}$ is an atomic formula or its negation. One of the standard ways to characterize the complexity of formulas is according to their quantifier prefix $Q_1 x_1 \dots Q_m x_m$ in (1). If in (1) all the quantifiers $Q_i$ are universal then the formula $\psi$ is called universal or a $\forall$-formula, and if all of them are existential then $\psi$ is existential or an $\exists$-formula. In this fashion $\psi$ is a $\forall\exists$-formula if the prefix has only one alternation of quantifiers (from $\forall$ to $\exists$). Similarly, one can define $\exists\forall$-formulas. Observe that $\forall$- and $\exists$-formulas are dual relative to negation, i.e., the negation of a $\forall$-formula is equivalent to an $\exists$-formula, and the negation of an $\exists$-formula is equivalent to a $\forall$-formula. A similar result holds for $\forall\exists$- and $\exists\forall$-formulas. One may consider formulas with more alternations of quantifiers, but we have no use for them in this paper. A formula in the form (1) is positive if it does not contain negations (i.e., all $\psi_{ij}$ are atomic). A formula is quantifier-free if it does not contain quantifiers. We denote the set of all quantifier-free formulas from $\Phi_{\mathcal{L}}(X)$ by $\Phi_{qf,\mathcal{L}}(X)$, and the set of all atomic formulas by $\mathrm{At}_{\mathcal{L}}(X)$.

Recall that a theory in the language $\mathcal{L}$ is an arbitrary consistent set of sentences in $\mathcal{L}$. A theory $T$ is complete if for every sentence $\phi$ either $\phi$ or $\neg\phi$ lies in $T$. By $\mathrm{Mod}(T)$ we denote the (non-empty) class of all $\mathcal{L}$-structures $M$ which satisfy all the sentences from $T$. Structures from $\mathrm{Mod}(T)$ are termed models of $T$, and $T$ is a set of axioms for the class $\mathrm{Mod}(T)$. Conversely, if $K$ is a class of $\mathcal{L}$-structures then the set $\mathrm{Th}(K)$ of sentences which are true in all structures from $K$ is called the elementary theory of $K$. Similarly, the set $\mathrm{Th}_\forall(K)$ ($\mathrm{Th}_\exists(K)$) of all $\forall$-sentences ($\exists$-sentences) from $\mathrm{Th}(K)$ is called the universal (existential) theory of $K$.

The following notions play an important part in this paper. Two $\mathcal{L}$-structures $M$ and $N$ are elementarily equivalent if $\mathrm{Th}(M) = \mathrm{Th}(N)$, and they are universally (existentially) equivalent if $\mathrm{Th}_\forall(M) = \mathrm{Th}_\forall(N)$ ($\mathrm{Th}_\exists(M) = \mathrm{Th}_\exists(N)$). In this event we write, correspondingly, $M \equiv N$, $M \equiv_\forall N$ or $M \equiv_\exists N$. Notice that, due to the duality mentioned above, $M \equiv_\forall N \iff M \equiv_\exists N$ for arbitrary $\mathcal{L}$-structures $M$ and $N$.

A class of $\mathcal{L}$-structures $K$ is axiomatizable if $K = \mathrm{Mod}(T)$ for some theory $T$ in $\mathcal{L}$. In particular, $K$ is $\forall$- ($\exists$-, or $\forall\exists$-) axiomatizable if the theory $T$ is a $\forall$- ($\exists$-, or $\forall\exists$-) theory.

3. Algebras

There are several types of classes of $\mathcal{L}$-structures that play a part in general algebraic geometry: prevarieties, quasivarieties, universal closures, and A-algebras. We refer to [34] for a detailed discussion of this and related matters. Here we present only a few properties and characterizations of these classes that will be used in the sequel. Most of them are known and can be found in the classical books on universal algebra, for example, in [30]. On the algebraic theory of quasivarieties, the main subject of this section, we refer to [15].

3.1. Congruences

In this section we recall some notions and introduce notation on the presentation of algebras via generators and relations. Let $M$ be an arbitrary fixed $\mathcal{L}$-structure. An equivalence relation $\theta$ on $M$ is a congruence on $M$ if for every operation $F \in \mathcal{F}$ and any elements $m_1, \dots, m_{n_F}, m_1', \dots, m_{n_F}' \in M$ such that $m_i \sim_\theta m_i'$, $i = 1, \dots, n_F$, one has $F^M(m_1, \dots, m_{n_F}) \sim_\theta F^M(m_1', \dots, m_{n_F}')$. For a congruence $\theta$ the operations $F^M$, $F \in \mathcal{F}$, naturally induce well-defined operations on the factor-set $M/\theta$. Namely, if we denote by $m/\theta$ the equivalence class of $m \in M$ then $F^{M/\theta}$ is defined by

$$F^{M/\theta}(m_1/\theta, \dots, m_{n_F}/\theta) = F^M(m_1, \dots, m_{n_F})/\theta$$

for any $m_1, \dots, m_{n_F} \in M$. Similarly, $c^{M/\theta}$ is defined for $c \in \mathcal{C}$ as the class $c^M/\theta$. This turns the factor-set $M/\theta$ into an $\mathcal{L}$-structure. It follows immediately from the construction that the map $h : M \to M/\theta$ such that $h(m) = m/\theta$ is an $\mathcal{L}$-epimorphism, called the canonical epimorphism.

The set $\mathrm{Con}(M)$ of all congruences on $M$ forms a lattice relative to the inclusion $\theta_1 \subseteq \theta_2$, i.e., every two congruences in $\mathrm{Con}(M)$ have a least upper and a greatest lower bound in the ordered set $(\mathrm{Con}(M), \subseteq)$. To see this, observe first that the intersection of an arbitrary set $\Theta = \{\theta_i, i \in I\}$ of congruences on $M$ is again a congruence on $M$, hence the greatest lower bound for $\Theta$. Now, the intersection of the non-empty set $\{\theta \in \mathrm{Con}(M) \mid \theta_i \subseteq \theta \ \forall \theta_i \in \Theta\}$ is the least upper bound for $\Theta$. The following result is easy.

Lemma 3.1. Let $M$ be an $\mathcal{L}$-algebra, $\{\theta_i \mid i \in I\} \subseteq \mathrm{Con}(M)$ and $\theta = \bigcap_{i \in I} \theta_i$. Then $M/\theta$ embeds into the direct product $\prod_{i \in I} M/\theta_i$ via the diagonal monomorphism $m/\theta \to \prod_{i \in I} m/\theta_i$.

A homomorphism $h : M \to N$ of two $\mathcal{L}$-structures determines the kernel congruence $\ker h$ on $M$, which is defined by

$$m_1 \sim_{\ker h} m_2 \iff h(m_1) = h(m_2), \qquad m_1, m_2 \in M.$$
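The definition can be illustrated on a small finite algebra. The sketch below is an illustration chosen here, not an example from the text: it takes $M = \mathbb{Z}_6$ with addition as its only operation and the homomorphism $h(x) = x \bmod 3$ onto $\mathbb{Z}_3$, computes $\ker h$ as a set of pairs, and checks the congruence conditions directly. All function names are hypothetical.

```python
from itertools import product

Z6 = range(6)  # universe of the algebra M = Z_6


def add(x, y):
    """The single binary operation of M: addition modulo 6."""
    return (x + y) % 6


def h(x):
    """A homomorphism Z_6 -> Z_3 (reduction modulo 3)."""
    return x % 3


def kernel_congruence(universe, hom):
    """ker h as a set of pairs (m1, m2) with hom(m1) == hom(m2)."""
    return {(x, y) for x, y in product(universe, repeat=2) if hom(x) == hom(y)}


def is_congruence(universe, op, theta):
    """Check that theta is an equivalence relation compatible with op."""
    reflexive = all((x, x) in theta for x in universe)
    symmetric = all((y, x) in theta for (x, y) in theta)
    transitive = all((x, w) in theta
                     for (x, y) in theta for (z, w) in theta if y == z)
    compatible = all((op(x1, x2), op(y1, y2)) in theta
                     for (x1, y1) in theta for (x2, y2) in theta)
    return reflexive and symmetric and transitive and compatible


def classes(universe, theta):
    """The factor-set M/theta as a set of equivalence classes."""
    return {frozenset(y for y in universe if (x, y) in theta) for x in universe}


THETA = kernel_congruence(Z6, h)

# An equivalence relation that is NOT a congruence: blocks {0,1}, {2,3}, {4,5}.
BLOCKS = {(x, y) for x, y in product(Z6, repeat=2) if x // 2 == y // 2}

if __name__ == "__main__":
    print(is_congruence(Z6, add, THETA), sorted(map(sorted, classes(Z6, THETA))))
    print(is_congruence(Z6, add, BLOCKS))
```

The factor-set has three classes, one for each element of the image of $h$, while the second relation fails compatibility with addition.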

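Observe that if $\theta \in \mathrm{Con}(M)$ and $\theta \subseteq \ker h$ then the map $\bar h : M/\theta \to N$ defined by $\bar h(m/\theta) = h(m)$ for $m \in M$ is a homomorphism of $\mathcal{L}$-structures.

Definition 3.1. A set of atomic formulas $\Delta \subseteq \mathrm{At}_{\mathcal{L}}(X)$ is called congruent if the binary relation $\theta_\Delta$ on the set of terms $T_{\mathcal{L}}(X)$ defined by

$$t_1 \sim_{\theta_\Delta} t_2 \iff (t_1 = t_2) \in \Delta$$

(where $t_1, t_2 \in T_{\mathcal{L}}(X)$) is a congruence on the free $\mathcal{L}$-algebra $T_{\mathcal{L}}(X)$.

The following lemma characterizes congruent sets of formulas.

Lemma 3.2. A set of atomic formulas $\Delta \subseteq \mathrm{At}_{\mathcal{L}}(X)$ is congruent if and only if it satisfies the following conditions:

(1) $(t = t) \in \Delta$ for any term $t \in T_{\mathcal{L}}(X)$;
(2) if $(t_1 = t_2) \in \Delta$ then $(t_2 = t_1) \in \Delta$ for any terms $t_1, t_2 \in T_{\mathcal{L}}(X)$;
(3) if $(t_1 = t_2) \in \Delta$ and $(t_2 = t_3) \in \Delta$ then $(t_1 = t_3) \in \Delta$ for any terms $t_1, t_2, t_3 \in T_{\mathcal{L}}(X)$;
(4) if $(t_1 = s_1), \dots, (t_{n_F} = s_{n_F}) \in \Delta$ then $(F(t_1, \dots, t_{n_F}) = F(s_1, \dots, s_{n_F})) \in \Delta$ for any terms $t_i, s_i \in T_{\mathcal{L}}(X)$, $i = 1, \dots, n_F$, and any functional symbol $F \in \mathcal{L}$.

Proof. Straightforward. □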

Since the intersection of an arbitrary set of congruent sets of atomic formulas is again congruent, it follows that for a set $\Delta \subseteq \mathrm{At}_{\mathcal{L}}(X)$ there is a least congruent subset $[\Delta] \subseteq \mathrm{At}_{\mathcal{L}}(X)$ containing $\Delta$. Therefore, $\Delta$ uniquely determines the congruence $\theta_\Delta = \theta_{[\Delta]}$.

For an $\mathcal{L}$-algebra $M$ generated by a set $M' \subseteq M$ put $X = \{x_m \mid m \in M'\}$ and consider the set $\Delta_{M'}$ of all atomic formulas $(t_1 = t_2) \in \mathrm{At}_{\mathcal{L}}(X)$ such that $M \models (t_1 = t_2)$ under the interpretation $x_m \to m$, $m \in M'$. Obviously, $\Delta_{M'}$ is a congruent set in $\mathrm{At}_{\mathcal{L}}(X)$ (the set of all relations in $M$ relative to $M'$). A subset $S \subseteq \Delta_{M'}$ is called a set of defining relations of $M$ relative to $M'$ if $[S] = \Delta_{M'}$. In this event the pair $\langle X \mid S \rangle$ is termed a presentation of $M$ by generators $X$ and relations $S$.

Lemma 3.3. If $\langle X \mid S \rangle$ is a presentation of $M$ then $M \cong T_{\mathcal{L}}(X)/\theta_S$.

Proof. The map $h' : X \to M'$ defined by $h'(x_m) = m$, $m \in M'$, extends to a homomorphism $h : T_{\mathcal{L}}(X) \to M$. Clearly, $t_1 \sim_{\ker h} t_2$ if and only if $(t_1 = t_2) \in [S]$ for terms $t_1, t_2 \in T_{\mathcal{L}}(X)$. Therefore, $T_{\mathcal{L}}(X)/\theta_S \cong T_{\mathcal{L}}(X)/\ker h$. Now the result follows from the isomorphism $T_{\mathcal{L}}(X)/\ker h \cong M$. □

3.2. Quasivarieties

In this section we discuss quasivarieties and related objects. The main focus is on how to generate the least quasivariety containing a given class of structures $K$. A model example here is the celebrated Birkhoff theorem, which describes $\mathrm{Var}(K)$, the smallest variety containing $K$, as the class $HSP(K)$ obtained from $K$ by taking direct products (the operator $P$), then substructures (the operator $S$), and then homomorphic images (the operator $H$). Along the way we introduce some other relevant operators. On the algebraic theory of quasivarieties we refer to [15] and [30].

We fix, as before, a functional language $\mathcal{L}$ and a class of $\mathcal{L}$-algebras $K$. We always assume that $K$ is an abstract class, i.e., with any algebra $M \in K$ the class $K$ contains all isomorphic copies of $M$. Recall that an identity in $\mathcal{L}$ is a formula of the type

$$\forall x_1 \dots \forall x_n\, (t(\bar x) = s(\bar x)),$$

where $t, s$ are terms in $\mathcal{L}$. Meanwhile, a quasi-identity is a formula of the type

$$\forall x_1 \dots \forall x_n \left( \bigwedge_{i=1}^{m} t_i(\bar x) = s_i(\bar x) \ \to\ t(\bar x) = s(\bar x) \right),$$

where $t(\bar x), s(\bar x), t_i(\bar x), s_i(\bar x)$ are terms in $\mathcal{L}$ in variables $\bar x = (x_1, \dots, x_n)$. A class of $\mathcal{L}$-structures is called a quasivariety (variety) if it can be axiomatized by a set of quasi-identities (identities). Given a class of $\mathcal{L}$-structures $K$ one can define the quasivariety $\mathrm{Qvar}(K)$ generated by $K$ as the quasivariety axiomatized by the set $\mathrm{Th}_{qi}(K)$ of all quasi-identities which are true in all structures from $K$, i.e., $\mathrm{Qvar}(K) = \mathrm{Mod}(\mathrm{Th}_{qi}(K))$. Notice that $\mathrm{Qvar}(K)$ is the least quasivariety containing $K$. Similarly, one defines the variety $\mathrm{Var}(K)$ generated by $K$. Observe that an identity $\forall \bar x\, (t(\bar x) = s(\bar x))$ is equivalent to the quasi-identity $\forall \bar x\, (x = x \to t(\bar x) = s(\bar x))$; therefore, $\mathrm{Qvar}(K) \subseteq \mathrm{Var}(K)$.

Before we proceed with quasivarieties, we introduce one more class of structures. Namely, $K$ is termed a prevariety if $K = SP(K)$. By $\mathrm{Pvar}(K)$ we denote the least prevariety containing $K$. The prevariety $\mathrm{Pvar}(K)$ captures the residual properties of the structures from $K$. An $\mathcal{L}$-structure $M$ is separated by $K$ if for any pair of non-equal elements $m_1, m_2 \in M$ there is a structure $N \in K$ and a homomorphism $h : M \to N$ such that $h(m_1) \ne h(m_2)$. By $\mathrm{Res}(K)$ we denote the class of $\mathcal{L}$-structures separated by $K$. In the following lemma we collect some known facts on prevarieties.

Lemma 3.4. For any class of $\mathcal{L}$-structures $K$ the following holds:

1) $\mathrm{Pvar}(K) = SP(K) \subseteq \mathrm{Qvar}(K)$;
2) $\mathrm{Pvar}(K) = \mathrm{Res}(K)$;
3) $\mathrm{Pvar}(K)$ is axiomatizable if and only if $\mathrm{Pvar}(K) = \mathrm{Qvar}(K)$.

Proof. Equality 1) follows directly from the definitions. 2) was proven for groups in [34]; here we give a general argument. It is easy to see that $\mathrm{Res}(K)$ is a prevariety, so $\mathrm{Pvar}(K) \subseteq \mathrm{Res}(K)$. To show the converse, take a structure $M \in \mathrm{Res}(K)$ and consider the set $I$ of all pairs $(m_1, m_2)$, $m_1, m_2 \in M$, such that $m_1 \ne m_2$. Then for every $i = (m_1, m_2) \in I$ there exists a structure $N_i \in K$ and a homomorphism $h_i : M \to N_i$ with $h_i(m_1) \ne h_i(m_2)$. The homomorphisms $h_i$, $i \in I$, give rise to the "diagonal" homomorphism $h : M \to \prod_{i \in I} N_i$, which is injective by construction. Hence $M \in SP(K)$, as required. 3) is due to Malcev [31]. □

Prevarieties play an important role in combinatorial algebra; they can be characterized as classes of structures admitting presentations by generators and relators. Namely, let $X$ be a set and $\Delta$ a set of atomic formulas from $\Phi_{\mathcal{L}}(X)$. Following Malcev [30], we say that a presentation $\langle X \mid \Delta \rangle$ defines a structure $M$ in a class $K$ if there is a map $h : X \to M$ such that

D1) $h(X)$ generates $M$ and all the formulas from $\Delta$ are realized in $M$ under the interpretation $h$;
D2) for any structure $N \in K$ and any map $f : X \to N$, if all the formulas from $\Delta$ are realized in $N$ under $f$ then there exists a unique homomorphism $g : M \to N$ such that $g(h(x)) = f(x)$ for every $x \in X$.

If $\langle X \mid \Delta \rangle$ defines a structure in $K$ then this structure is unique up to isomorphism; we denote it by $F_K(X, \Delta)$.

Theorem [30] A class $K$, containing the trivial system $\mathcal{E}$, is a prevariety if and only if any presentation $\langle X \mid \Delta \rangle$ defines a structure in $K$.

To present similar characterizations for quasivarieties we need to introduce the following operators. As was mentioned above, $P(K)$ is the class of direct products of structures from $K$. Recall that the direct product of $\mathcal{L}$-structures $M_i$, $i \in I$, is an $\mathcal{L}$-structure $M = \prod_{i \in I} M_i$ with the universe $M = \prod_{i \in I} M_i$ where the functions and constants from $\mathcal{L}$ are interpreted coordinate-wise. If all the structures $M_i$ are isomorphic to some structure $N$ then we refer to $\prod_{i \in I} M_i$ as a direct power of $N$ and denote it by $N^I$. By $P_\omega(K)$ we denote the class of all finite direct products of structures from $K$. Recall that a substructure $N$ of a direct product $\prod_{i \in I} M_i$ is a subdirect product of the structures $M_i$, $i \in I$, if $p_j(N) = M_j$ for the canonical


projections Pj : I1iEI Mi -- Mj , j E I. By P,.(K) we denote the class of all subdirect products of structures from K. Let I be a set, D a filter over I (i.e., a collection D of subsets of I closed under finite intersections and such that if a E D then bED for any b -> and epimorphic direct limits of structures from K. The following result gives a characterization of quasivarieties in terms of direct limits. Lemma 3.6. For any class of .c-structures K the following holds:

Proof. See [15] ([Corollary 2.3.4]).

o

3.3. Universal closures In this section we study the universal closure Ucl(K) = Mod(Thv(K)) of a given class of .c-structures K. Structures from Ucl(K) are determined by local properties of structures from K. To explain precisely we need to introduce two more operators. Recall [5,34]' that a structure M is discriminated by K if for any finite W of elements from M there is a structure N E K and a homomorphism set


h : M ---., N whose restriction onto W is injective. Let Dis(K) be the class of .c-structures discriminated by K. Clearly, Dis(K) ~ Res(K). To introduce the second operator we need to describe local submodels of a structure M. First, we replace the language .c by a new relational language .cre!, where every operational and constant symbols F E :F and c E C are replaced, correspondingly, by a new predicate symbol RF of arity nF + 1 and a new unary predicate symbol Re. Secondly, the structure M l l . MTel MTel turns into a .c re -structure Mre , where the predIcates Re and RF are defined by R1) for m

E

M the predicate R.;tTel (m) is true in

Mrel

if and only if

eM =m;

R2) for mo, ml,"" m nF E M the predicate R-j;:lTel (mo, ml,"" m nF ) is true in Mrel if and only if FM(ml,'" ,mnF ) = mo.

Third, if .co is a finite reduct (sublanguage) of .c then by Meo we denote the reduct of Mrel, where only predicates corresponding to constants and operations from .co are survived, so Meo is an .cae/-structure. Now, following [30], by a local submodel of M we understand a finite substructure of Me o for some finite reduct .co of .c. Finally, a structure M is locally embeddable into K if every local submodel of M is isomorphic to some local submodel of a structure from K (in the language .coel ). By L(K) we denote the class of .c-structures locally embeddable into K. It is convenient for us to rephrase the notion of a local submodel in terms of formulas. Let .c' be a finite reduct of.c and X a finite set of variables. A quantifierfree formula cp in .c' is called a diagram-formula if cp is a conjunction of atomic formulas or their negations that satisfies the following conditions: 1) every formula -,(x = y), for each pair (x,y) E X 2 with x in cp;

i=

y, occurs

2) for each functional symbol F E .c' and each tuple of variables (XO,Xl, ... ,X nF ) E xnF+I either formula F(XI,".,X nF ) = Xo or its negation occurs in cp; 3) for each constant symbol c E .c' and each x E X either x = c or its negation -,(x = c) occurs in cpo We say that cp is a diagram-formula in .c if it is a diagram-formula for some finite reduct .c' of.c and a finite set X. The name of diagram-formulas comes from the diagrams of algebraic structures (see Section 3.4).


The following lemma is easy. Lemma 3.7. For any local submodel N of M there is a diagram-formula 'PN(X) in a finite set of variables X of cardinality [N[ such that M F 'PN(h(X)) for some bijection h : X ---4 N. And conversely, if M F 'P(h(X)) for some diagram-formula 'P(X) in C and an interpretation h : X ---4 M then there is a local submodel N of M with the universe heX) such that 'P = 'PN (up to a permutation of conjuncts).

Corollary 3.1. An .c-structure M is locally embeddable into a class K if and only if every diagram-formula realizable in M is realizable also in some structure from K. Lemma 3.8. For any class of .c-structures K the following holds:

8 Ucl(K) = L(K); 9 U cl(K) = SPu (K); 10 Dis(K) c(X), i.e, a subset p ~ cI>c(X) that can be realized in a structure from Mod(T). A type p is complete if it is a maximal type in cI> c(X) with respect to inclusion. It is easy to see that if p is a maximal type in X then for every formula i.p E cI> c( X) either i.p E P or --, i.p E p. Definition 4.1. A set p of atomic or negations of atomic formulas from cI> c(X) is called an atomic type in X relative to a theory T if pUT is consistent. A maximal atomic type in cI> c( X) with respect to inclusion termed a complete atomic type of T.

It is not hard to see that if $p$ is a complete atomic type then for every atomic formula $\varphi \in \mathrm{At}_{\mathcal{L}}(X)$ either $\varphi \in p$ or $\neg\varphi \in p$.

Example 4.1. Let $M$ be an $\mathcal{L}$-structure and $\bar m = (m_1, \dots, m_n) \in M^n$. Then the set $\mathrm{atp}^M(\bar m)$ of atomic or negations of atomic formulas from $\Phi_{\mathcal{L}}(X)$ that are true in $M$ under the interpretation $x_i \mapsto m_i$, $i = 1, \dots, n$, is a complete atomic type relative to any theory $T$ such that $M \in \mathrm{Mod}(T)$.

We say that a complete atomic type $p$ in variables $X$ is realized in $M$ if $p = \mathrm{atp}^M(\bar m)$ for some $\bar m \in M^n$. Every type $p$ in $T$ can be realized in some model of $T$ (i.e., a structure from $\mathrm{Mod}(T)$). If $p$ cannot be realized in a structure $M$ then we say that $M$ omits $p$. There are deep results in model theory on how to construct models of $T$ omitting a given type or a set of types.

For an atomic type $p \subseteq \Phi_{\mathcal{L}}(X)$ by $p^+$ and $p^-$ we denote, correspondingly, the set of all atomic and negations of atomic formulas in $p$. If $S$ is a set of atomic formulas from $\Phi_{\mathcal{L}}(X)$ and $M$ is an $\mathcal{L}$-structure then by $V_M(S)$ we denote the set $\{(m_1, \dots, m_n) \in M^n \mid M \models S(m_1, \dots, m_n)\}$ of all tuples in $M^n$ that satisfy all the formulas from $S$. The set $V_M(S)$ is called the algebraic set defined by $S$ in $M$. We refer to $S$ as a system of equations in $\mathcal{L}$, and to elements of $S$ as equations in $\mathcal{L}$. Sometimes, to emphasize that formulas are from $\mathcal{L}$, we call such equations (and systems of equations) coefficient-free equations; meanwhile, in the case when $\mathcal{L} = \mathcal{L}_A$, we refer to such equations as equations with coefficients in the algebra $A$.

Following [4] we define the Zariski topology on $M^n$, $n \ge 1$, where algebraic sets form a prebasis of closed sets, i.e., closed sets in this topology are obtained from the algebraic sets by finite unions and (arbitrary) intersections. If $p$ is an atomic type in $\mathcal{L}$ in variables $X = \{x_1, \dots, x_n\}$ then $V_M(p^+)$ is an algebraic set in $M^n$. More generally, for an arbitrary type $p$ in $X$ by $p^+$ we denote the set of all positive formulas in $p$, i.e., all formulas in the prenex form that do not have the negation symbol. If $p$ is a quantifier-free type, i.e., a type consisting of quantifier-free formulas, then formulas in $p^+$ are conjunctions and disjunctions of atomic formulas.
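To make the notion of an algebraic set concrete, here is a small sketch in a finite structure. The choice of $M = \mathbb{Z}_6$ with addition and the constant $0$ and the two one-variable systems are illustrations chosen here, not examples from the text; each equation is modelled as a pair of Python functions (left and right side), and `algebraic_set` is a hypothetical helper name.

```python
from itertools import product

Z6 = range(6)  # universe of M = Z_6 in the language {+, 0}


def algebraic_set(universe, n, equations):
    """V_M(S): all n-tuples over the universe satisfying every equation in S.

    Each equation is a pair (lhs, rhs) of functions of n arguments.
    """
    return {point for point in product(universe, repeat=n)
            if all(lhs(*point) == rhs(*point) for lhs, rhs in equations)}


# S1 = {x + x = 0}, S2 = {x + x + x = 0}, both coefficient-free.
S1 = [(lambda x: (x + x) % 6, lambda x: 0)]
S2 = [(lambda x: (x + x + x) % 6, lambda x: 0)]

V1 = algebraic_set(Z6, 1, S1)
V2 = algebraic_set(Z6, 1, S2)
V12 = algebraic_set(Z6, 1, S1 + S2)  # the union of systems cuts out V1 ∩ V2

if __name__ == "__main__":
    print(sorted(V1), sorted(V2), sorted(V12))
```

Joining two systems intersects their algebraic sets, which is why intersections of algebraic sets are again algebraic, whereas finite unions have to be added separately to obtain the closed sets of the Zariski topology.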

Lemma 4.1. Let $M$ be an $\mathcal{L}$-structure and $n \in \mathbb{N}$. Then for a subset $V \subseteq M^n$ the following conditions are equivalent:

• $V$ is closed in the Zariski topology on $M^n$;
• $V = V_M(p^+)$ for some quantifier-free type $p$ in variables $\{x_1, \dots, x_n\}$.

Proof. Straightforward. □

4.2. Coordinate algebras and complete types

Let $M$ be an $\mathcal{L}$-algebra. For a set $S$ of atomic formulas from $\Phi_{\mathcal{L}}(X)$ denote by $\mathrm{Rad}_M(S)$ the set of all atomic formulas from $\Phi_{\mathcal{L}}(X)$ that hold on every tuple from $V_M(S)$. In particular, if $V_M(S) = \emptyset$ then $\mathrm{Rad}_M(S) = \mathrm{At}_{\mathcal{L}}(X)$. It is not hard to see that $\mathrm{Rad}_M(S)$ is a congruent set of formulas, hence it defines a congruence that we denote by $\theta_{\mathrm{Rad}(S)}$. The $\mathcal{L}$-structure $T_{\mathcal{L}}(X)/\theta_{\mathrm{Rad}(S)}$ is called the coordinate algebra of the algebraic set $V_M(S)$. If $Y = V_M(S)$ then the coordinate algebra $T_{\mathcal{L}}(X)/\theta_{\mathrm{Rad}(S)}$ is denoted by $\Gamma(Y)$ and $\mathrm{Rad}(S)$ by $\mathrm{Rad}(Y)$. The following result gives a characterization of the coordinate algebras over an algebra $M$.

Proposition 4.1. A finitely generated $\mathcal{L}$-algebra $C$ is the coordinate algebra of some non-empty algebraic set over an $\mathcal{L}$-algebra $M$ if and only if $C$ is separated by $M$.

Proof. Let $Y$ be an algebraic set in $M^n$. With a point $p = (m_1, \dots, m_n) \in M^n$ we associate a homomorphism $h_p : T_{\mathcal{L}}(X) \to M$ defined by $h_p(t) = t^M(m_1, \dots, m_n)$. Clearly,

$$\theta_{\mathrm{Rad}(Y)} = \bigcap_{p \in Y} \ker h_p.$$

Therefore, the diagonal homomorphism $\prod_{p \in Y} h_p : T_{\mathcal{L}}(X) \to \prod_{p \in Y} M$ induces a monomorphism

$$\Gamma(Y) = T_{\mathcal{L}}(X)/\theta_{\mathrm{Rad}(Y)} \to M^{|Y|}.$$

It follows that $\Gamma(Y) \in SP(M)$. Now, by Lemma 3.4, $SP(M) = \mathrm{Res}(M)$, so $\Gamma(Y) \in \mathrm{Res}(M)$.

Suppose now that $C$ is a finitely generated $\mathcal{L}$-algebra from $\mathrm{Res}(M)$ with a finite generating set $X = \{x_1, \dots, x_n\}$. Let $C = \langle X \mid S \rangle$ be a presentation of $C$ by the generators $X$ and relations $S \subseteq \mathrm{At}_{\mathcal{L}}(X)$. In this case $C$ is isomorphic to $T_{\mathcal{L}}(X)/\theta_S$. To prove that $C$ is the coordinate algebra of some algebraic set over $M$ it suffices to show that $\mathrm{Rad}_M(S) = [S]$. If $(t_1 = t_2) \notin [S]$ then there exists a homomorphism $h : C \to M$ with $t_1^M(h(x_1), \dots, h(x_n)) \ne t_2^M(h(x_1), \dots, h(x_n))$. Obviously, $(h(x_1), \dots, h(x_n)) \in V_M(S)$, so $(t_1 = t_2) \notin \mathrm{Rad}_M(S)$. This shows that $\mathrm{Rad}_M(S) = [S]$. □

Lemma 4.2. Let $p$ be a complete atomic type in variables $X$. Then:

• $p^+$ is a congruent set of formulas;
• $p^+ = \mathrm{Rad}_M(p^+)$ for every $\mathcal{L}$-structure $M$ with $V_M(p) \ne \emptyset$.

Proof. Indeed, since $p$ is realized in some model $M$ of $T$, its positive part $p^+$ satisfies the assumptions of Lemma 3.2, hence it is congruent. It follows that $p^+$ determines a congruence $\theta_p$ on $T_{\mathcal{L}}(X)$. Since $p$ is complete one has $p^+ = \mathrm{Rad}_M(p^+)$. □

Definition 4.2. Let $X$ be a finite set of variables and $p$ a complete atomic type in variables $X$. Then the factor-algebra $T_{\mathcal{L}}(X)/\theta_p$ of the free $\mathcal{L}$-algebra $T_{\mathcal{L}}(X)$ is termed the algebra defined by the type $p$, and the tuple $(x_1/\theta_p, \dots, x_n/\theta_p)$ is called a generic point of $p$.

Clearly, any complete atomic type $p$ in variables $X$ in a theory $T$ is realized in the factor-algebra $T_{\mathcal{L}}(X)/\theta_p$ at the generic point $\bar x = (x_1/\theta_p, \dots, x_n/\theta_p)$, so

$$\mathrm{atp}^{T_{\mathcal{L}}(X)/\theta_p}(\bar x) = p.$$

Indeed, for any atomic formula $t_1 = t_2$, where $t_1, t_2 \in T_{\mathcal{L}}(X)$, one has $(t_1 = t_2) \in p$ if and only if $t_1 \sim_{\theta_p} t_2$, which is equivalent to the condition $T_{\mathcal{L}}(X)/\theta_p \models (t_1 = t_2)$ under the interpretation $x_i \mapsto x_i/\theta_p$. The generic point $(x_1/\theta_p, \dots, x_n/\theta_p)$ satisfies the following universal property. If $p$ is realized in some $\mathcal{L}$-structure $M$ at $(m_1, \dots, m_n) \in M^n$ then the map $x_1 \to m_1, \dots, x_n \to m_n$ extends to a homomorphism $T_{\mathcal{L}}(X)/\theta_p \to M$.

Lemma 4.3. Let $T$ be a universally axiomatized theory in $\mathcal{L}$. Then for any finitely generated $\mathcal{L}$-structure $M$ the following conditions are equivalent:

1) $M \in \mathrm{Mod}(T)$;
2) $M = T_{\mathcal{L}}(X)/\theta_p$ for some complete atomic type $p$ in $T$.

Proof. Let $X = \{x_1, \dots, x_n\}$ be a finite set and $\langle X \mid S \rangle$ a presentation of an $\mathcal{L}$-structure $M$, i.e., $M \cong T_{\mathcal{L}}(X)/\theta_S$. If $p = \mathrm{atp}^M(\bar x)$, $\bar x = (x_1, \dots, x_n)$, then $[S] = p^+$ and $T_{\mathcal{L}}(X)/\theta_p \cong T_{\mathcal{L}}(X)/\theta_S \cong M$. Therefore, 1) implies 2). To prove the converse, let $p$ be an atomic type in $T$. We need to show that $T_{\mathcal{L}}(X)/\theta_p \in \mathrm{Mod}(T)$. Since $p$ is a type in $T$ there exists a model $N \in \mathrm{Mod}(T)$ and a tuple of elements $\bar y = (y_1, \dots, y_n) \in N^n$ such that $p = \mathrm{atp}^N(\bar y)$. If $N'$ is the substructure of $N$ generated by $y_1, \dots, y_n$ then $T_{\mathcal{L}}(X)/\theta_p \cong N'$. Since the theory $T$ is axiomatized by a set of universal sentences one has $N' \in \mathrm{Mod}(T)$. Hence, $T_{\mathcal{L}}(X)/\theta_p \in \mathrm{Mod}(T)$. □

4.3. Equationally Noetherian algebras The notion of equationally Noetherian groups was introduced in [4] and [7]. Let B be an algebra. For every natural number n we consider Zariski topology on Bn. A subset Y 1 then the amalgamated product

$$\langle a \rangle *_K \langle b, c;\ [b, c] = 1 \rangle$$

where $K = \langle a^p \rangle = \langle b^q \rangle$ is not CT. The situation for HNN extensions of CT groups is not as general but can be carried through for extensions of centralizers of abelian malnormal subgroups.

Theorem 2.4. Let $B$ be a CT group and $K$ an abelian malnormal subgroup of $B$. Then the HNN extension

$$B_1 = \langle t, B;\ \mathrm{rel}(B),\ t^{-1} k t = k \text{ for all } k \in K \rangle$$

is a CT-group.

3. Commutative Transitivity, CSA and Universally Free Groups

Myasnikov and Remeslennikov, in their study of fully residually free groups, introduced the concept of a CSA group (conjugately separated abelian group). First we recall the following necessary definition.

Definition 3.1. Let $G$ be a group and $H$ a subgroup of $G$. $H$ is malnormal in $G$ or conjugately separated in $G$ provided $g^{-1} H g \cap H = 1$ unless $g \in H$.

Using this we define the concept of a CSA group.

Definition 3.2. A group $G$ is a CSA-group or conjugately separated abelian group provided the maximal abelian subgroups are malnormal.

Each CSA group must be CT. The converse however is not true in general.

Lemma 3.1. The class of CSA groups is a proper subclass of the class of CT groups.

Proof. We first show that every CSA-group is commutative transitive. Let G be a group in which maximal abelian subgroups are malnormal and suppose that $M_1$ and $M_2$ are maximal abelian subgroups in G with $z \neq 1$ lying in $M_1 \cap M_2$. Could we have $M_1 \neq M_2$? Suppose that $w \in M_1 \setminus M_2$. Then $w^{-1}zw = z$ is a non-trivial element of $w^{-1}M_2w \cap M_2$ so that $w \in M_2$. This is impossible and therefore $M_1 \subset M_2$. By maximality we then get $M_1 = M_2$. Hence, G is commutative transitive whenever all maximal abelian subgroups are malnormal.

We now show that there do exist CT groups that are not CSA. In any non-abelian CSA-group the only abelian normal subgroup is the trivial subgroup 1. To see this suppose that N is any normal abelian subgroup of the non-abelian CSA-group G. Then N is contained in a maximal abelian subgroup M. Let $g \notin M$. Then

$$N = g^{-1}Ng \cap N \subset g^{-1}Mg \cap M.$$

The fact $N \neq 1$ would imply that $g \in M$, which is a contradiction. Now let p and q be distinct primes with p a divisor of q - 1. Let G be the non-abelian group of order pq. Then it is not difficult to prove that


the centralizer of every non-trivial element of G is cyclic of order either p or q. Thus G is commutative transitive. However, the (necessarily unique) Sylow q-subgroup of G is normal in G. Hence from the argument above G cannot be CSA. $\Box$

Although the class of CSA groups is a proper subclass of the CT groups, in the presence of full residual freeness (in fact even in the presence of just residual freeness) they are equivalent.

Definition 3.3. A group G is residually free if for each non-trivial $g \in G$ there is a free group $F_g$ and an epimorphism $h_g : G \to F_g$ such that $h_g(g) \neq 1$. Equivalently, for each non-trivial $g \in G$ there is a normal subgroup $N_g$ such that $G/N_g$ is free and $g \notin N_g$. The group G is fully residually free provided to every finite set $S \subset G \setminus \{1\}$ of non-trivial elements of G there is a free group $F_S$ and an epimorphism $h_S : G \to F_S$ such that $h_S(g) \neq 1$ for all $g \in S$.

Lemma 3.2. If G is fully residually free then CT $\equiv$ CSA.
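The counterexample in the proof of Lemma 3.1 (the non-abelian group of order pq) can be checked by brute force for small primes. The sketch below is only an illustration under stated assumptions: it realizes the group as affine maps $t \mapsto at + b$ modulo q, and the helper names (`affine_group`, `is_ct`, `is_csa`) are hypothetical, not taken from the source. It uses the fact that in a CT group the maximal abelian subgroups are exactly the centralizers of non-trivial elements.

```python
from itertools import product

IDENTITY = (1, 0)

def affine_group(p, q):
    """Non-abelian group of order p*q as maps t -> a*t + b (mod q), with a^p = 1 mod q."""
    r = next(a for a in range(2, q) if pow(a, p, q) == 1)  # element of order p mod q
    units = sorted({pow(r, i, q) for i in range(p)})
    return [(a, b) for a in units for b in range(q)], q

def mul(x, y, q):
    """Composition 'x after y' of the affine maps x = (a, b) and y = (c, d)."""
    return ((x[0] * y[0]) % q, (x[0] * y[1] + x[1]) % q)

def inv(x, q):
    a_inv = pow(x[0], -1, q)  # modular inverse (Python 3.8+)
    return (a_inv, (-a_inv * x[1]) % q)

def centralizer(x, G, q):
    return [g for g in G if mul(g, x, q) == mul(x, g, q)]

def is_abelian(H, q):
    return all(mul(a, b, q) == mul(b, a, q) for a, b in product(H, repeat=2))

def is_ct(G, q):
    """CT holds iff the centralizer of every non-trivial element is abelian."""
    return all(is_abelian(centralizer(x, G, q), q) for x in G if x != IDENTITY)

def is_malnormal(M, G, q):
    """Check g^-1 M g meets M trivially for every g outside M."""
    members = set(M)
    for g in G:
        if g in members:
            continue
        conjugate = {mul(mul(inv(g, q), m, q), g, q) for m in M}
        if (conjugate & members) != {IDENTITY}:
            return False
    return True

def is_csa(G, q):
    """For a CT group: test malnormality of all centralizers of non-trivial elements."""
    return all(is_malnormal(centralizer(x, G, q), G, q) for x in G if x != IDENTITY)

G21, q21 = affine_group(3, 7)  # p = 3 divides q - 1 = 6
print(len(G21), is_ct(G21, q21), is_csa(G21, q21))
```

For p = 3 and q = 7 this reports a group of order 21 that is CT but not CSA, in line with the argument above: the normal Sylow 7-subgroup is a maximal abelian subgroup that fails to be malnormal.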

The proof of this depends on a beautiful theorem due independently to Gaglione and Spellman [GS] and Remeslennikov [R] tying together full residual freeness and the property of being universally free, which we will explain shortly (see [R] for a proof of Lemma 3.2).

In the 1960's G. Baumslag [GB 1] proved that a surface group is residually free, answering a question of Magnus. To do this he introduced what is now called extensions of centralizers. This concept became one of the main tools used by Kharlampovich, Myasnikov and Remeslennikov in their structure theory of fully residually free groups and by Kharlampovich and Myasnikov in their solution to the Tarski problems. As somewhat of an offshoot of this B. Baumslag [BB 1] proved:

Theorem 3.1. If G is residually free then the following are equivalent:
(1) G is fully residually free
(2) G is CT

From this a truly amazing result developed that is in some sense the beginning of the solution of the Tarski problem. This was done independently by Gaglione and Spellman and Remeslennikov. Recall first some ideas from logic and model theory. We start with a first-order language appropriate for group theory. This language which we denote by $L_0$ is the first-order language with equality


containing a binary operation symbol $\cdot$, a unary operation symbol ${}^{-1}$ and a constant symbol 1. A sentence in this language is a logical expression containing a string of variables $\bar{x} = (x_1, \ldots, x_n)$, the logical connectives $\vee, \wedge, \sim$ and the quantifiers $\forall, \exists$. A universal sentence of $L_0$ is one of the form $\forall \bar{x}\{\phi(\bar{x})\}$ where $\bar{x}$ is a tuple of distinct variables, $\phi(\bar{x})$ is a formula of $L_0$ containing no quantifiers and containing at most the variables of $\bar{x}$. Similarly an existential sentence is one of the form $\exists \bar{x}\{\phi(\bar{x})\}$ where $\bar{x}$ and $\phi(\bar{x})$ are as above. If G is a group then the universal theory of G consists of the set of all universal sentences of $L_0$ true in G. We denote the universal theory of a group G by $\mathrm{Th}_\forall(G)$. Since any universal sentence is equivalent to the negation of an existential sentence it follows that two groups have the same universal theory if and only if they have the same existential theory. The set of all sentences of $L_0$ true in G is called the first-order theory or the elementary theory of G. We denote this by $\mathrm{Th}(G)$. We note that being first-order or elementary means that in the intended interpretation of any formula or sentence all of the variables (free or bound) are assumed to take on as values only individual group elements, never, for example, subsets of, nor functions on, the group in which they are interpreted.

The Tarski conjectures, solved independently by Kharlampovich and Myasnikov (see [KHM 1-5]) and Sela (see [Se 1-5]), say essentially that all countable nonabelian free groups have the same elementary theory. The following was well-known and much simpler.

Theorem 3.2. All nonabelian free groups have the same universal theory.

As we will see all finitely generated fully residually free groups will also have the same universal theory as the class of nonabelian free groups.

Definition 3.4. A universally free group G is a group that has the same universal theory as a nonabelian free group.
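As an illustration of the definitions above, commutative transitivity itself can be written as a single universal sentence of $L_0$. The display below is a sketch of one such formulation; the choice of variable names and the use of an implication arrow (expressible through $\vee$ and $\sim$) are ours.

```latex
\forall x\,\forall y\,\forall z\;
\Bigl( \bigl( y \neq 1 \;\wedge\; x \cdot y = y \cdot x \;\wedge\; y \cdot z = z \cdot y \bigr)
\;\rightarrow\; x \cdot z = z \cdot x \Bigr)
```

Since free groups are commutative transitive, this sentence belongs to the universal theory of a nonabelian free group, so every universally free group in the sense of Definition 3.4 satisfies it as well.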
Gaglione and Spellman [GS] and independently Remeslennikov [Re] were able to extend the theorem of B. Baumslag to show that fully residually free is equivalent to universally free and that these (in the presence of residual freeness) are equivalent to both being CT and being CSA. As mentioned this is in some sense the starting point for the solution of the Tarski conjectures.

Theorem 3.3 ([GS], [Re]). If G is residually free then the following are equivalent:

(1) G is fully residually free
(2) G is CT
(3) G is CSA
(4) G is universally free

Since this result a complete structure theory and algorithmic theory of the fully residually free groups has been developed. An important aspect of this development is that elements of fully residually free groups can be expressed as infinite words on a generating system. These infinite words can be manipulated and handled in an analogous manner to ordinary words in free groups (see [KMRS]). Commutative transitivity becomes essential in building examples of fully residually free groups and in classifying them via the following construction.

Definition 3.5. Let G be a CT group, let $u \in G \setminus \{1\}$ and let $M = Z_G(u)$ where $Z_G(u)$ is the centralizer of u in G. Suppose A is an abelian group. Then the group

$$H = \langle G, A;\ \mathrm{rel}(G),\ \mathrm{rel}(A),\ [a, z] = 1 \text{ for all } a \in A,\ z \in M \rangle$$

is called the centralizer extension of G by A. If $A = \langle t \rangle$ is infinite cyclic then

$$H = \langle G, t;\ \mathrm{rel}(G),\ t^{-1}zt = z \text{ for all } z \in M \rangle$$

and is called the free rank one extension of the centralizer M of u in G.

Theorem 3.4. (Baumslag, Myasnikov, Remeslennikov [BMR3]) Let G be a fully residually free group and A an abelian fully residually free group. Then a centralizer extension of G by A is again fully residually free.

The proof of this result, which is fundamental in all further considerations of fully residually free groups, depends on the fact that the result can be reduced to free rank one extensions of centralizers and then on the following "big powers" argument. It is not hard to see that in a free group F if $b_0 t^{n_1} b_1 \cdots t^{n_k} b_k = 1$ for infinitely many values of $n_1$, infinitely many values of $n_2$, ..., infinitely many values of $n_k$ then t must commute with at least one of $b_0, \ldots, b_k$. Hence the family of homomorphisms $\phi_k : F(u, t) \to F$ from the rank one extension of the centralizer $C_F(u)$ into F, defined for every positive k by $\phi_k(t) = u^k$ and $\phi_k|_F = \mathrm{id}$, is a discriminating family, as required.
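The role of large exponents in the maps $\phi_k$ can be seen in a toy computation. The following sketch assumes F is free on a, b and takes u = a; words are strings in which an upper-case letter denotes the inverse of the corresponding lower-case letter. The function names and the sample elements are illustrative choices, not part of the source.

```python
def reduce_word(word):
    """Freely reduce a word; 'A' is the inverse of 'a', and so on."""
    stack = []
    for ch in word:
        if stack and stack[-1] == ch.swapcase():
            stack.pop()  # cancel an adjacent pair x x^-1
        else:
            stack.append(ch)
    return "".join(stack)

def inverse(word):
    """Inverse of a word: reverse it and invert every letter."""
    return word[::-1].swapcase()

def phi(k, word, u="a"):
    """Image in F(a, b) of a word in a, b, t under t -> u^k (k positive), identity on a and b."""
    u_k = u * k
    pieces = []
    for ch in word:
        if ch == "t":
            pieces.append(u_k)
        elif ch == "T":
            pieces.append(inverse(u_k))
        else:
            pieces.append(ch)
    return reduce_word("".join(pieces))

# Three elements of the rank one extension of the centralizer of u = a:
# t a^-1, t a^-3 and the commutator t b t^-1 b^-1.
elements = ["tA", "tAAA", "tbTB"]

for k in (1, 3, 4):
    print(k, [phi(k, w) for w in elements])
```

Here $\phi_1$ kills $t a^{-1}$ and $\phi_3$ kills $t a^{-3}$, but once k exceeds the exponents appearing in the chosen finite set (k = 4 suffices here) all three images are non-trivial, which is the discriminating behaviour described above.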


As mentioned earlier, G. Baumslag [GB 1] used this type of argument to show that the orientable surface groups $S_g$ with $g \geq 2$ are all residually free. This answered a question posed by Magnus. Recall that for $g \geq 2$ the group $S_g$, which is the fundamental group of an orientable surface of genus g, has the presentation

$$S_g = \langle a_1, b_1, \ldots, a_g, b_g;\ [a_1, b_1] \cdots [a_g, b_g] = 1 \rangle.$$

Baumslag observed that each $S_g$ embeds in $S_2$ and residual freeness is inherited by subgroups, so it suffices to show that $S_2$ is residually free. He actually showed more. If F is a nonabelian free group and $u \in F$ is a nontrivial element which is neither primitive nor a proper power then the group K given by

$$K = \langle F * \bar{F};\ u = \bar{u} \rangle$$

where $\bar{F}$ is an identical copy of F and $\bar{u}$ is the corresponding element to u in $\bar{F}$, is residually free. A one-relator group of this form is called a Baumslag double. In our terminology he proceeded by embedding K in the free rank one extension of centralizers

$$H = \langle F, t;\ t^{-1}ut = u \rangle \quad\text{by}\quad K = \langle F, t^{-1}Ft \rangle.$$

The group H is then residually free and hence K is residually free. Therefore every Baumslag double is residually free. The group

$$S_2 = \langle a_1, b_1, a_2, b_2;\ [a_1, b_1] = [a_2, b_2] \rangle$$

is a Baumslag double, answering the original question.

The class of finitely generated fully residually free groups was introduced in a different direction by Sela in his proof of the Tarski problems. In Sela's approach these groups appear as limits of homomorphisms of a group G into a free group. In this guise they are called limit groups. Therefore a limit group is a finitely generated fully residually free group. The paper by Bestvina and Feighn [BeF] gives a nice description of the equivalence of the two approaches.

4. The Commutative Transitive Kernel

Commutative transitivity and universal freeness raised certain questions about the relationship between fully residually free and n-free.


Definition 4.1. A group G is n-free if any subset of n or fewer elements in G generates a free group.

Observe that if G is an n-free group then G is also an m-free group for all $1 \leq m < n$. The 1-free groups are precisely the torsion-free groups. 2-free groups are commutative transitive.

Lemma 4.1. If G is an n-free group for any $n \geq 2$ then G is commutative transitive.

Proof. Suppose that $G \neq 1$ is a 2-free group and let $u \in G$ be non-trivial. Let $M = C_G(u)$. We claim that M is locally cyclic and therefore abelian. Suppose $a, b \in M$. Since G is 2-free, a, u generate a free group. Since a, u commute this must be cyclic and hence they are both powers of a single element g. Thus $a = g^{\alpha}$, $u = g^{\beta}$ for some $\alpha, \beta$. Similarly, since b and u commute and b, u generate a free group, we have an element h with $b = h^{\delta}$, $u = h^{\gamma}$ for some $\delta, \gamma$. Now consider $\langle h, g \rangle \subset G$. This is free because G is 2-free; however, it has the relation $g^{\beta} = h^{\gamma}$. Therefore $\langle h, g \rangle$ must be cyclic and h and g are both powers of a single element. Hence a and b are both powers of this element and any 2-generator subgroup of M is cyclic. A straightforward induction then shows that any n-generator subgroup $\langle a_1, \ldots, a_n \rangle \subset M$ is also cyclic. Thus, centralizers of non-trivial elements are locally cyclic and 2-free groups are commutative transitive as claimed. $\Box$

The concept of n-freeness arises in many places. From straightforward topological considerations it follows that an orientable surface group of genus $g \geq 1$ is $(2g-1)$-free while a nonorientable surface group of genus $g \geq 2$ is $(g-1)$-free. These results were generalized in an algebraic manner by Fine, Gaglione, Rosenberger and Spellman (see [FGRS 1]).

In order to further study commutative transitivity, Fine, Gaglione, Rosenberger and Spellman [FGRS 2] introduced a subgroup T(G) called the commutative transitive kernel that carries the information about commutative transitivity in much the same way that the derived group G' carries the information about commutativity.

Definition 4.2. Let N be a normal subgroup of G and let $T_0(G, N) = N$ and inductively define $T_{n+1}(G, N)$ to be the subgroup of G generated by $T_n(G, N)$ together with those commutators $[a, c]$ such that $a R_n c$, where $R_n$ is the transitive closure of the relation on $G - T_n(G, N)$ given by $xy \equiv yx \mod T_n(G, N)$. Now let

$$T(G, N) = \bigcup_{n \geq 0} T_n(G, N).$$

Then $T(G, N) \subset NG'$ and $T(G, N)$ is normal in G.

Lemma 4.2. Let N be normal in G. Then G/N is CT if and only if N = T(G, N). Further G/T(G, N) is CT.

Definition 4.3. The commutative transitive kernel of a group G, denoted T(G), is T(G, 1).

The commutative transitive kernel behaves in much the same way as the derived group. In particular we have the following theorem.

Theorem 4.1. Let G be a group. Then
(1) T(G) is a characteristic subgroup of G and contained in G'
(2) G/T(G) is CT
(3) G is CT if and only if T(G) = 1

Unfortunately the commutative transitive kernel is not canonical for commutative transitivity in the same sense as the derived group is for commutativity. To make this precise let F be a covariant functor on the category of groups. We say that F is a T-like functor if for each group G
[1] F(G) is a characteristic subgroup of G and is contained in G'
[2] G/F(G) is commutative transitive
[3] G is commutative transitive if and only if F(G) = {1}.

The set of T-like functors can be partially ordered by $F_1 \leq F_2$ if $F_1(G) \subset F_2(G)$ for every group G. A canonical subgroup for commutative transitivity would be equivalent to a minimum T-like functor. In [FGRS 1] the following was proved.

Theorem 4.2. The family of T-like functors has no minimum.

C. Delizia and C. Nicotera further studied the commutative transitive kernel for locally finite groups [DN 1,2]. They also developed an analog of the commutative transitive kernel for power commutative groups [DN 3,4] and studied the structure of this kernel for locally nilpotent groups. In another direction they considered groups where nilpotency is transitive on 2-generator subgroups [DN 5].
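The inductive construction of Definition 4.2 (with N = 1) can be followed step by step on a small finite group. The sketch below is an illustration only: it represents group elements as permutation tuples, builds the relation $R_n$ with a union-find structure, and stops when the chain $T_n$ stabilizes, which for a finite group gives T(G). All function names and the two sample groups are our own choices.

```python
from itertools import product

def compose(p, q):
    """Product of permutations given as tuples: apply q first, then p."""
    return tuple(p[i] for i in q)

def inverse(p):
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def commutator(a, b):
    return compose(compose(inverse(a), inverse(b)), compose(a, b))

def generate(gens, degree):
    """Subgroup of Sym(degree) generated by gens (finite closure under products)."""
    identity = tuple(range(degree))
    elems, frontier = {identity}, [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = compose(x, g)
            if y not in elems:
                elems.add(y)
                frontier.append(y)
    return elems

def ct_kernel(group, degree):
    """Iterate T_0 = 1, T_1, T_2, ... as in Definition 4.2 until the chain stabilizes."""
    T = {tuple(range(degree))}
    while True:
        outside = [x for x in group if x not in T]
        parent = {x: x for x in outside}

        def find(x):
            while parent[x] != x:
                parent[x] = parent[parent[x]]
                x = parent[x]
            return x

        # x and y are related when they commute modulo T; union-find gives the transitive closure.
        for x, y in product(outside, repeat=2):
            if commutator(x, y) in T:
                parent[find(x)] = find(y)
        gens = set(T)
        for a, c in product(outside, repeat=2):
            if find(a) == find(c):
                gens.add(commutator(a, c))
        T_next = generate(list(gens), degree)
        if T_next == T:
            return T
        T = T_next

# S3 acting on {0, 1, 2}, and S3 x C2 acting on {0, 1, 2} and {3, 4}.
S3 = generate([(1, 0, 2), (1, 2, 0)], 3)
S3xC2 = generate([(1, 0, 2, 3, 4), (1, 2, 0, 3, 4), (0, 1, 2, 4, 3)], 5)

print(len(ct_kernel(S3, 3)), len(ct_kernel(S3xC2, 5)))
```

The symmetric group $S_3$ is CT, so its kernel is trivial, in agreement with Theorem 4.1 (3). In $S_3 \times C_2$ the central factor links all non-trivial elements under $R_0$, and the computed kernel is the derived subgroup of order 3, which is consistent with Theorem 4.1 (1) and (2).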


5. RG Groups and a Classification of One-Relator CT Groups

In presenting examples of infinite CT groups we mentioned that one-relator groups with torsion are CT. The question then arises as to how to classify both the one-relator CT groups and the one-relator CSA groups. In this section we describe a recent classification of such one-relator groups done by Fine, grosse Rebel, Myasnikov and Rosenberger [FgRMR]. This classification is related to two more general questions.

(1) The general classification of one-relator fully residually free groups.
(2) The Gersten conjecture that a torsion-free one-relator group is hyperbolic if and only if it does not contain any Baumslag-Solitar group $B_{m,n} = \langle x, y;\ y x^m y^{-1} = x^n \rangle$, $mn \neq 0$.

Recall that one-relator groups with torsion are hyperbolic. In order to present the classification we need another concept introduced by Fine and Rosenberger [FR] and independently by M. Cohen and M. Lustig [CL].

Definition 5.1. A group G is a restricted Gromov or RG group if given $g, h \in G$ either $\langle g, h \rangle$ is cyclic or there exists a positive integer t with $g^t \neq 1$, $h^t \neq 1$ and $\langle g^t, h^t \rangle = \langle g^t \rangle * \langle h^t \rangle$.

In particular free groups and torsion-free hyperbolic groups are RG.

Theorem 5.1 ([FR]). Torsion-free RG groups are CT.

Now the classification. For one-relator groups with torsion, combining results of Fine and Rosenberger [FR] and Gildenhuys, Kharlampovich and Myasnikov [GKM] we obtain the following result ([FgRMR]).

Theorem 5.2. ([FgRMR]) Let G be a one-relator group with torsion. Then the following are equivalent:
(1) G is CSA
(2) G is RG
(3) G does not contain a copy of the infinite dihedral group $\langle x, y;\ x^2 = y^2 = 1 \rangle$

For torsion-free one-relator groups the situation is different. A torsion-free one-relator group fails to be a CSA group if and only if it contains


a copy of some nonabelian Baumslag-Solitar group $B_{1,n}$ with $n \neq 1$ or a copy of the group $F_2 \times \mathbb{Z}$, the direct product of a free group of rank 2 and an infinite cyclic group (see [FgRMR]). In [FgRMR] it was proved that a one-relator group fails to be an RG-group if and only if it contains a copy of one of the Baumslag-Solitar groups $B_{1,n}$. Recall that $B_{1,1}$ is a free abelian group of rank 2. It follows that if G is a torsion-free one-relator group which does not contain a free abelian subgroup of rank 2 then the following are equivalent.

(1) G is a CSA group
(2) G is an RG-group
(3) G does not contain a copy of any $B_{1,n}$ with $n \neq -1, 0, 1$.

Further if G is a torsion-free one-relator group then G is CT if and only if it does not contain a copy of $F_2 \times \mathbb{Z}$ or a copy of the Klein bottle group $B_{1,-1}$. Combining all these we get the following results.

Theorem 5.3. ([FgRMR]) Let G be a torsion-free one-relator group which does not contain a copy of $\mathbb{Z} \times \mathbb{Z} = B_{1,1}$. Then the following are equivalent:
(1) G is CSA
(2) G is RG
(3) G does not contain a copy of one of the Baumslag-Solitar groups $B_{1,m} = \langle x, y;\ yxy^{-1} = x^m \rangle$ with $m \in \mathbb{Z} \setminus \{-1, 0, 1\}$.

Theorem 5.4. ([FgRMR]) Let G be a torsion-free one-relator group. Then G is CT if and only if G does not contain a copy of $F_2 \times \mathbb{Z}$ or a copy of the Baumslag-Solitar group $B_{1,-1} = \langle x, y;\ yxy^{-1} = x^{-1} \rangle$ (the Klein bottle group).

Notice that since one-relator groups with torsion are commutative transitive, this fact together with Theorem 5.4 provides the total classification of one-relator commutative transitive groups.

6. Commutative Transitivity and Discriminating Groups

As an outgrowth of the development of the theory of algebraic geometry over groups (see [BMR2] [BMR1]), Baumslag, Myasnikov and Remeslennikov introduced the concept of discriminating groups. This class of groups developed a life of its own and has been extensively studied (see [FGMRS] and the references there).

Definition 6.1. Let G and H be groups.
G separates H provided that to every nontrivial element h of H there is a homomorphism $\varphi_h : H \to G$ such that $\varphi_h(h) \neq 1$. G discriminates H if to every finite nonempty set S of nontrivial elements of H there is a homomorphism $\varphi_S : H \to G$ such that $\varphi_S(s) \neq 1$ for all $s \in S$. The group G is discriminating provided that it discriminates every group it separates.

Algebraic geometry over groups was created as a tool to attack the celebrated Tarski conjectures on the elementary theory of free groups. By analogy with classical algebraic geometry we may view the discrimination of H by G as an approximation to H much like the localization of a ring at a prime. (Think of a set of generators for H as a set of variables.) The following is the main criterion for determining whether a group is discriminating.

Lemma 6.1. ([BMR]) A group G is discriminating if and only if G discriminates $G \times G$.

In the application of algebraic geometry it is important to know both whether a group is discriminating and whether it is not discriminating. Hence there are general questions on nondiscrimination of various classes of groups. The general idea for showing nondiscrimination is to exhibit some universal property which is true in G but cannot be true in $G \times G$, or to find a number (dimension, etc.) which is additive, so that this number cannot hold in both $G \times G$ and in G. Here commutative transitivity plays a large role.

Lemma 6.2. A nonabelian CT group is nondiscriminating.

It follows that all the following classes of groups are nondiscriminating:
(1) Any torsion-free hyperbolic group and in particular any nonabelian free group is nondiscriminating.
(2) Nonabelian free solvable groups and their nonabelian subgroups are nondiscriminating.
(3) The free product of two nondiscriminating groups is nondiscriminating.

In subsequent work it was shown that a nilpotent group is discriminating if and only if it is torsion-free abelian (see [FGMS]).

7. An Extension of Commutative Transitivity

In [BFGRS] a study was initiated to extend the concept of commutative transitivity. This begins in the following direction.


Definition 7.1. Let V be a variety of groups that contains the abelian variety A. Then we say that a group G is a centralizer-V group or CV-group if for each $g \in G$ the centralizer of g is in V.

In this context CT-groups are centralizer abelian or CA-groups. Since varieties are closed under subgroups it is clear that any group G in V is a CV-group. Further since we are assuming that the variety V contains the abelian variety it follows that any CT-group is CV. There are certainly CV-groups that are not CT-groups. For example the group

$$G = \langle a, b;\ aba^{-1} = b^{-1} \rangle$$

is not CT but is CM where M is a variety which contains the abelian and metabelian variety. We now show that there are nontrivial examples of CV-groups, that is groups G that are not in V or commutative transitive but which are CV. We call a group a nontrivial CV-group if it is CV and not in V and not CT.
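The failure of commutative transitivity in the example $G = \langle a, b;\ aba^{-1} = b^{-1} \rangle$ can be made explicit. The short computation below is a sketch that uses only the defining relation together with the standard facts that $a^2 \neq 1$ and that b has infinite order in this group.

```latex
a^{2} b a^{-2} \;=\; a\,(a b a^{-1})\,a^{-1} \;=\; a\, b^{-1} a^{-1} \;=\; (a b a^{-1})^{-1} \;=\; b,
\qquad\text{so}\qquad [a^{2}, b] = 1 \ \text{ and } \ [a^{2}, a] = 1,
\qquad\text{while}\qquad a b a^{-1} = b^{-1} \neq b .
```

Thus the non-trivial element $a^2$ commutes with both a and b, although a and b do not commute with each other.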

Theorem 7.1. A free product of nontrivial CV-groups is a nontrivial CV-group.

Theorem 7.2. Let G and H be nontrivial CV-groups. Then the unrestricted wreath product G wr H is a nontrivial CV-group.

Mimicking methods of Levin and Rosenberger [LR] we can show that the class of CV-groups is closed under certain types of group amalgams.

Theorem 7.3. Let A and B be CV-groups and $A \cap B = K$ proper and malnormal in both A and B. Then the amalgamated free product $A *_K B$ is a CV-group. In particular a nontrivial free product of CV-groups is again CV.

Theorem 7.4. Let B be a CV-group and K a nontrivial abelian malnormal subgroup of B. Then the HNN group

$$B_1 = \langle t, B;\ t^{-1}kt = k \text{ for all } k \in K \rangle$$

is a CV-group.


8. References

[AAR] P. Ackermann, V. grosse Rebel and G. Rosenberger, On Power and Commutation Transitive, Power Commutative and Restricted Gromov Groups, Cont. Math., 360, 2004, 1-4
[GB 1] G. Baumslag, On generalized free products, Math. Z., 78, 1962, 423-438
[BFGS] G. Baumslag, B. Fine, A.M. Gaglione and D. Spellman, Reflections on discriminating groups, J. Group Theory, 10, 2007, 87-99
[BFGRS] G. Baumslag, B. Fine, A.M. Gaglione, G. Rosenberger and D. Spellman, On Centralizer Varietal Groups, in preparation
[BMR1] G. Baumslag, A.G. Myasnikov, V.N. Remeslennikov, Algebraic geometry over groups I. Algebraic sets and ideal theory, J. Algebra, 219, 1999, 16-79
[BMR2] G. Baumslag, A.G. Myasnikov, V.N. Remeslennikov, Discriminating and co-discriminating groups, J. of Group Theory, 3, 2000, 467-47
[CL] M. Cohen and M. Lustig, Very small group actions on $\mathbb{R}$-trees and Dehn twist automorphisms, Topology, 34, 1995, 575-617
[DN 1] C. Delizia and C. Nicotera, On the Commutative Transitive Kernel of Locally Finite Groups, Alg. Colloquium, 10, 2003, 567-570
[DN 2] C. Delizia and C. Nicotera, On the Commutative Transitive Kernel of Certain Infinite Groups, JP Journal of Algebra, Number Theory and Applications, 5, 2005, 421-427
[DN 3] C. Delizia and C. Nicotera, On the Power-Commutative Kernel of Locally Nilpotent Groups, Int. J. of Math. and Math. Sciences, 17, 2005, 2719-2722
[DN 4] C. Delizia and C. Nicotera, On the Power-Commutative Kernel of Finite Groups, Journal of Science, Ferdowsi University of Mashad, 5, 2005, 3-8
[DN 5] C. Delizia and C. Nicotera, Groups in Which the Bounded Nilpotency of Two-Generator Subgroups is Transitive, preprint
[F] B. Fine, On Power Conjugacy and SQ-universality for Fuchsian and Kleinian Groups, in Modular Functions in Analysis and Number Theory, University of Pittsburgh Press, 1983, 41-55
[FGMS] B. Fine, A.M. Gaglione, A.G. Myasnikov and D. Spellman, Discriminating groups: A Comprehensive Overview, CRM Preprint, 2006


[FGMS1] B. Fine, A.M. Gaglione, A.G. Myasnikov and D. Spellman, Discriminating groups, J. of Group Theory, 4, 2001, 463-474
[FGRS 1] B. Fine, A.M. Gaglione, G. Rosenberger and D. Spellman, The Commutative Transitive Kernel, Algebra Colloquium, 2, 1997, 141-152
[FGRS 2] B. Fine, A.M. Gaglione, G. Rosenberger and D. Spellman, Free Groups and Questions About Universally Free Groups, in Proceedings of Groups St. Andrews/Galway 1993, London Mathematical Soc. Lecture Notes Series 211, 1993, 191-204
[FR 1] B. Fine and G. Rosenberger, On Restricted Gromov Groups, Comm. in Alg., 20, 8, 1992, 2171-2182
[FgRMR] B. Fine, V. grosse Rebel, A. Myasnikov and G. Rosenberger, A Classification of CSA, Commutative Transitive and Restricted Gromov One-Relator Groups, Result. Math., to appear
[GS] A.M. Gaglione and D. Spellman, Even More Model Theory of Free Groups, in Infinite Groups and Group Rings, World Scientific, 1993, 37-40
[GKM] D. Gildenhuys, O. Kharlampovich and A. Myasnikov, CSA Groups and Separated Free Constructions, Bull. Austral. Math. Soc., 52, 1995, 63-84
[K] M. Kasabov, On Discriminating Solvable Groups, preprint
[LR] F. Levin and G. Rosenberger, On Power Commutative and Commutation Transitive Groups, in Proc. Groups St Andrews 1985, Cambridge University Press, 1986, 249-253
[LS] R.C. Lyndon and P.E. Schupp, Combinatorial Group Theory, Springer-Verlag 1977
[MKS] W. Magnus, A. Karrass, and D. Solitar, Combinatorial Group Theory, Wiley 1966, Second Edition, Dover Publications, New York 1976
[MRe] A.G. Myasnikov and V.N. Remeslennikov, Exponential groups 1: Foundations of the theory and tensor completion, Omsk University, N9 1993 1-25
[MRe 1] A.G. Myasnikov and V.N. Remeslennikov, Exponential groups, preprint New York, 1994
[MRe 2] A.G. Myasnikov and V.N. Remeslennikov, Exponential groups 2: Extensions of centralizers and tensor completion of CSA groups, preprint
[MS] A.M. Myasnikov and P. Shumyatsky, On Discriminating Groups and c-Dimension, J. of Group Theory, 200


[Ne] B.B. Newman, Some results on One-Relator Groups, Bull. Amer. Math. Soc., 74, 1968, 568-571
[P] S.J. Pride, The two generator subgroups of one-relator groups with torsion, Trans. Amer. Math. Soc., 234, 1977, 483-496
[Su] M. Suzuki, The Nonexistence of a Certain Type of Simple Groups of Odd Order, Proc. Amer. Math. Soc., 8, 1957, 686-695
[W] L. Weisner, Groups in which the normalizer of every element but the identity is abelian, Bull. Amer. Math. Soc., 31, 1925, 413-416
[Wu] Y.F. Wu, Groups in which Commutativity is a Transitive Relation, J. of Algebra, 207, 1998, 165-181

GROUPS UNIVERSALLY EQUIVALENT TO FREE BURNSIDE GROUPS OF PRIME EXPONENT AND A QUESTION OF PHILIP HALL

A. Gaglione
Department of Mathematics, U.S. Naval Academy, Annapolis, MD 21402

S. Lipschutz
Department of Mathematics, Temple University, Philadelphia, PA 19122

D. Spellman
Department of Mathematics, Temple University, Philadelphia, PA 19122

Subject Classification: Primary 20E26; Secondary 20E05, 20E06

Abstract: This paper proposes the Tarski Problem for free groups in a Burnside variety, $B_n$, where n is a sufficiently large odd integer so that Adian's results hold. We note that just as in the case of absolutely free groups it is easy to show that the nonabelian free groups in $B_n$ for n as above are all universally equivalent.

1. Introduction

We bear in mind two problems which resisted solution for decades and succumbed only to the fresh attacks of supremely talented mathematicians. For our purposes we shall dub the questions (1) and (2) below as The Tarski Problem and The Burnside Problem respectively.

(1) Are the nonabelian free groups elementarily equivalent?
(2) Must the finitely generated nonabelian free Burnside groups of fixed finite exponent be finite?


Of course the answers to (1) and (2) are now known to be (1) yes and (2) no respectively. (1) was solved independently by Kharlampovich and Myasnikov on the one hand and by Sela on the other. Kharlampovich and Myasnikov applied algebraic geometry over groups invented by G. Baumslag, Myasnikov and Remeslennikov while Sela invented Diophantine geometry in groups to solve the Tarski problem. Let's focus on algebraic geometry over groups. By way of analogy let us consider a Noetherian integral domain R. The closed subsets in the Zariski topology on affine n-space over R are precisely the affine algebraic subsets. In order for the analog of this scenario to go through in groups the proper notions of domain and Noetherian had to be defined. One consequence of the correct definition of domain is that every nonabelian CSA group (see Section 3) is a domain. Since free groups are CSA, nonabelian free groups are domains. The correct notion of Noetherian is equationally Noetherian (see Section 6). Happily free groups are also equationally Noetherian. One felicitous consequence of this confluence of facts is the existence and uniqueness (in the usual sense) of the decomposition of a closed set into a finite union of irreducible affine algebraic subsets. (We need to reserve the word variety for an equational class in this paper.)

While the groups free in the variety of all groups are elementarily equivalent (provided their rank exceeds 1) it is easy to see that the corresponding result is false for groups free in many other varieties. For example, one can distinguish the free nilpotent groups (of fixed class) of different finite ranks by first order sentences. Perhaps the variety of all groups is unique in this regard? Or is it!

We now turn our attention to Question (2). A negative answer was first provided by Adian who showed that, for all sufficiently large odd n, the nonabelian free groups in the variety $B_n$ determined by the law $x^n = 1$ were all infinite.
Soon afterward Sirvanjan proved that, just as in the variety of all groups, the free group of countably infinite rank in the variety $B_n$ embeds in its group free of rank 2 for all sufficiently large odd n. From this it easily follows that the nonabelian free groups in the varieties $B_n$, for all sufficiently large fixed odd n, have the same universal theory in the sense of first order logic. For our purposes we can say the most when n = p is a sufficiently large prime. In any event, if n is a sufficiently large odd integer, the free groups in $B_n$ are CSA; hence, the nonabelian free groups in such varieties are domains. We do not know whether or not they are equationally Noetherian. The Tarski problem relativized to such $B_n$ will require new techniques for its solution. This paper provides some halting first steps and should be viewed as an invitation to our colleagues to ponder possible approaches.

2. Preliminaries

We let $\omega$ be the first limit ordinal, which we identify with the first infinite cardinal $\aleph_0$. If H and G are groups we say that G is H-inclusive provided it contains a subgroup isomorphic to H; G is H-exclusive provided it is not H-inclusive. For each positive integer n, shall be a group cyclic of order n. If G is a group and H is a nonempty class of groups, then we say that H separates G provided for every $g \in G \setminus \{1\}$ there is a group $H_g \in H$ and a homomorphism Hg such that G, (g, h) f---+ gh. A universal sentence of La is one of the form 'v'x 0, then V is discriminated by an r-generator member; hence, it is finitely discriminable.

Suppose V is finitely discriminable. Suppose r is a positive integer and $G = \langle b_1, \ldots, b_r \rangle \in V$ discriminates V. Let $F_r(V)$ be freely generated relative to V by $a_1, \ldots, a_r$. Then we get an epimorphism $\psi : F_r(V) \to G$, $a_i \mapsto b_i$, $1 \leq i \leq r$. Suppose that $w_k(x_1, \ldots, x_n)$ are finitely many words such that none of the equations $w_k(x_1, \ldots, x_n) = 1$ is a law in V. Then there are elements $g_i = u_i(b_1, \ldots, b_r)$, $1 \leq i \leq n$, in G such that $w_k(u_1(b_1, \ldots, b_r), \ldots, u_n(b_1, \ldots, b_r)) \neq 1$ for all k. It follows that $w_k(u_1(a_1, \ldots, a_r), \ldots, u_n(a_1, \ldots, a_r)) \neq 1$ in $F_r(V)$ for all k. Hence, $F_r(V)$ discriminates V. $\blacksquare$

Definition 3.3. Let V be a finitely discriminable variety of groups. Then $\min\{1 \leq r < \omega : F_r(V) \text{ discriminates } V\}$ is the index of discrimination of V. If m is the index of discrimination of V, then $D(V) = \{F_r(V) : m \leq r < \omega\}$.

Theorem 3.1. [GS] Let V be a finitely discriminable variety of groups. Let $r \geq 1$ be a cardinal. Then $F_r(V) \equiv_\forall F_s(V)$ for all cardinals $s \geq r$ if and only if $F_r(V)$ discriminates V. In particular, if m is the index of discrimination of V, then $F_m(V) \equiv_\forall F_s(V)$ for all $m \leq s \leq \omega$.

137

Theorem 3.2. Let V be a finitely discriminable variety of groups with index of discrimination m. Let G E V be Fm(V) inclusive. If D(V) discriminates G, then G ='1 Fm(V). Proof: Assume D(V) discriminates G. It will suffice to show that, if Uj(XI, ... ,X n )

=

1

Vk(XI, ... , Xn)

-I-

1

has a solution (gl, ... ,gn) E Gn, then it has a solution over Fm(V). Since D(V) discriminates G there is an integer r ~ m and a homomorphism 'lj; : G ----) Fr(V) such that 'lj;(Vk(gl, ... ,gn)) -I- 1 for all 1 ::; k ::; K. It follows that the primitive sentence 3XI, ... ,xn(Aj(uj(XI, ... ,Xn) = 1) A Ad Vk (Xl, ... , Xn) -I- 1)) is true in Fr (V). But since Fr (V) ='1 Fm (V) the above primitive sentence must also be true in Fm (V). Hence, the system has a solution over F m (V) . •

Corollary 3.1. Let $V$ be a finitely discriminable variety of groups with index of discrimination $m$. Let $G \in V$ be finitely presented relative to $V$ and suppose that $G$ is $F_m(V)$-inclusive. Then $G \equiv_\forall F_m(V)$ if and only if $D(V)$ discriminates $G$.

Proof: One direction follows immediately from the theorem. Suppose $G \in V$ is finitely presented relative to $V$, is $F_m(V)$-inclusive and $G \equiv_\forall F_m(V)$. Suppose $\langle a_1, \dots, a_n ; R_1, \dots, R_J \rangle_V$ is a finite presentation of $G$ relative to $V$. Let $w_k(a_1, \dots, a_n)$, $1 \le k \le K$, be finitely many nontrivial elements of $G$. Then the primitive sentence $\exists x_1, \dots, x_n \big( \bigwedge_j (R_j(x_1, \dots, x_n) = 1) \wedge \bigwedge_k (w_k(x_1, \dots, x_n) \ne 1) \big)$ holds in $G$; hence, it holds in $F_m(V)$ and there is $(b_1, \dots, b_n) \in F_m(V)^n$ such that

$$R_j(b_1, \dots, b_n) = 1 \quad (1 \le j \le J), \qquad w_k(b_1, \dots, b_n) \ne 1 \quad (1 \le k \le K).$$

It follows that the assignment $a_i \mapsto b_i$, $1 \le i \le n$, extends to a homomorphism $\psi : G \to F_m(V)$ such that $\psi(w_k(a_1, \dots, a_n)) \ne 1$, $1 \le k \le K$. ∎

4. The Variety $\mathcal{O}$ of All Groups

In this section we merely review known results about universally free groups (see Definition 4.1). These results will be contrasted later with results for the groups $G \equiv_\forall F_2(B_p)$ where $p$ is an Adian-Sirvanjan prime. If $\mathcal{O}$ is the variety of all groups and $r \ge 1$ is a cardinal, then we write $F_r$ for $F_r(\mathcal{O})$. $F_\omega$ embeds in $F_2$. For example, the commutator subgroup $[F_2, F_2]$ of $F_2$ is free of countably infinite rank. Now let $2 \le r \le \omega$. Then $F_\omega \cong [F_2, F_2] \le F_r \le F_\omega$, from which it follows that $F_r \equiv_\forall F_s$ for all cardinals $2 \le r < s$. Thus 2 is the index of discrimination of $\mathcal{O}$. The universal equivalence of the nonabelian free groups suggests the possibility of their elementary equivalence. Of course their universal equivalence is a far cry from their elementary equivalence.

Suppose $R$ is a commutative ring with 1. Within the category of unital $R$-modules an object $P$ is projective just in case every short exact sequence

$$0 \to N \to M \to P \to 0$$

splits. This is easily seen to be equivalent to $P$ being a direct summand in a free $R$-module. Now suppose $V$ is a variety of groups. We define a group $P \in V$ to be projective relative to $V$ provided every short exact sequence of groups in $V$,

$$1 \to K \to G \to P \to 1,$$

splits. Essentially the same proof shows that this is equivalent to $P$ being a retract of a group free in $V$. By the Nielsen-Schreier subgroup theorem, a group $P$ is projective relative to the variety of all groups if and only if it is free. The Nielsen-Schreier subgroup theorem also implies that a group is freely separated (discriminated) if and only if it is residually (fully residually) free.

Suppose $G$ is a nonabelian residually free group. Suppose $gh \ne hg$ in $G$. Then their commutator $[g, h] = g^{-1}h^{-1}gh$ is nontrivial. Thus, there is a free group $F_r$ and an epimorphism $\psi : G \to F_r$ such that $[\psi(g), \psi(h)] = \psi([g, h]) \ne 1$. Hence, $F_r$ is nonabelian and $r \ge 2$. Since $F_r$ is projective it is a retract in $G$ and $F_2 \le F_r \le G$; so, $G$ is $F_2$-inclusive. Nonabelian free groups, of course, satisfy the existential sentence $\exists x, y\,(xy \ne yx)$. Other properties of nonabelian free groups are that they are CT and even CSA. Here a group is CT or commutative transitive provided the centralizers of nontrivial elements coincide with the maximal abelian subgroups. That is rendered by the universal sentence

$$\forall x, y, z\,\big(((y \ne 1) \wedge (xy = yx) \wedge (yz = zy)) \to (xz = zx)\big).$$

Moreover, a group is CSA or conjugately separated abelian provided maximal abelian subgroups are malnormal. That is equivalent to being CT and satisfying the universal sentence

$$\forall x, y, z\,\big(((x \ne 1) \wedge (xy = yx) \wedge (z^{-1}yzx = xz^{-1}yz)) \to (xz = zx)\big).$$

Free groups are CSA.
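The commutator $[g, h] = g^{-1}h^{-1}gh$ and the CT condition can be tried out on explicit words in $F_2$. The following is a minimal sketch, not part of the original text: it assumes words are stored as lists of (generator, exponent) pairs with exponent $\pm 1$, and the helper names `reduce_word`, `multiply`, `commutator` and `commute` are illustrative choices.

```python
# Words in the free group F_2 on {a, b}: lists of (generator, exponent) pairs
# with exponent +1 or -1. All names here are illustrative assumptions.

def reduce_word(word):
    """Freely reduce a word by cancelling adjacent inverse pairs."""
    out = []
    for gen, exp in word:
        if out and out[-1] == (gen, -exp):
            out.pop()              # x * x^-1 cancels
        else:
            out.append((gen, exp))
    return out

def inverse(word):
    """Inverse word: reverse the order and negate every exponent."""
    return [(gen, -exp) for gen, exp in reversed(word)]

def multiply(*words):
    """Concatenate the given words and freely reduce the result."""
    result = []
    for w in words:
        result.extend(w)
    return reduce_word(result)

def commutator(g, h):
    """[g, h] = g^-1 h^-1 g h, as in the text."""
    return multiply(inverse(g), inverse(h), g, h)

def commute(g, h):
    """Two elements commute exactly when their commutator is the empty word."""
    return commutator(g, h) == []

a = [("a", 1)]
b = [("b", 1)]

# One instance of the CT sentence with y = a, x = a^2, z = a^-3:
x, y, z = multiply(a, a), a, inverse(multiply(a, a, a))
ct_instance = (not (commute(x, y) and commute(y, z))) or commute(x, z)
```

Checking finitely many instances of course proves nothing about the universal sentence; the sketch only makes the quantifier-free part concrete.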


Lemma 4.1. [E] A CT residually free group is CSA.

Proof. Suppose $G$ is CT and residually free. Let $a \in G \setminus \{1\}$ and suppose $b, g^{-1}bg \in C_G(a) = \{x \in G : ax = xa\}$. Suppose, to deduce a contradiction, that $g \notin C_G(a)$. Then $[g, a] \ne 1$. Thus there is a free group $F$ and an epimorphism $\psi : G \to F$ such that $[\psi(g), \psi(a)] = \psi([g, a]) \ne 1$. But this is impossible. From $[\psi(g), \psi(a)] \ne 1$ we have $\psi(a) \ne 1$. Moreover $\psi(b), \psi(g)^{-1}\psi(b)\psi(g) \in C_F(\psi(a))$ and $F$ is CSA. Hence, $\psi(g)$ commutes with $\psi(a)$, a contradiction. Hence $g \in C_G(a)$ and $G$ is CSA. ∎

Theorem 4.1. [E] Let $G$ be a nonabelian residually free group. The following three conditions are equivalent in pairs.

(1) $G$ is fully residually free.
(2) $G$ is CT.
(3) $G$ is CSA.

Proof. Lemma 4.1 has already established the equivalence of (2) and (3). It will suffice to show that (1) and (2) are equivalent. Suppose $G$ is fully residually free. Let $b \in G \setminus \{1\}$ and suppose $a, c \in C_G(b)$. Assume, to deduce a contradiction, that $ac \ne ca$. Then $[a, c] \ne 1$. Thus there is a free group $F$ and an epimorphism $\psi : G \to F$ such that $\psi(b) \ne 1$ and $[\psi(a), \psi(c)] = \psi([a, c]) \ne 1$. But this is impossible: $\psi(a)\psi(b) = \psi(b)\psi(a)$, $\psi(b)\psi(c) = \psi(c)\psi(b)$ and $F$ is CT; hence, $\psi(a)\psi(c) = \psi(c)\psi(a)$, a contradiction. Therefore $ac = ca$ and $G$ is CT.

Now suppose $G$ is CT. The proof will proceed by induction on the cardinality $n$ of a finite subset $S = \{g_1, \dots, g_n\} \subseteq G \setminus \{1\}$, the claim being that some epimorphism from $G$ onto a free group annihilates no element of $S$. The case $n = 1$ holds because $G$ is residually free. Suppose $n > 1$ and the result is true for all $1 \le k < n$. Suppose first that $S$ is not contained in an abelian subgroup of $G$. Then some pair of elements of $S$, which we may take to be $g_{n-1}$ and $g_n$, does not commute. Thus $T = \{g_1, \dots, g_{n-2}, [g_{n-1}, g_n]\}$ is contained in $G \setminus \{1\}$ and, by inductive hypothesis, there is a free group $F$ and an epimorphism $\psi : G \to F$ such that $\psi$ does not annihilate any element of $T$. But then $\psi$ cannot annihilate any element of $S$ either.

It remains to treat the case where the $g_i$ commute in pairs, which hypothesis we now assume. Since $G$ is a residually free CT group it is CSA by Lemma 4.1. We claim that there is some $g \in G$ such that $g^{-1}g_n g$ does not commute with $g_{n-1}$. Otherwise, since $G$ is CSA, $g_{n-1}$ would be central in $G$. But a nonabelian CT group must be centerless; so, we have arrived at a contradiction. The claim is established. Pick one such $g$. Hence $U = \{g_1, \dots, g_{n-2}, [g_{n-1}, g^{-1}g_n g]\}$ is contained in $G \setminus \{1\}$. By inductive hypothesis there is a free group $F$ and an epimorphism $\psi : G \to F$ such that $\psi$ does not annihilate any element of $U$. But then $\psi$ cannot annihilate any element of $S$ either. That completes the induction. ∎

Definition 4.1. A group G

=' be a presentation of $G$. Suppose that one of the relators, say $r_1$, is a primitive element in $F_n$ with basis $\{x_1, x_2, \dots, x_n\}$. Let $\phi$ be an automorphism of $F_n$ such that $\phi(r_1) = x_1$. Now $\langle x_1, x_2, \dots, x_n \mid x_1, \phi(r_2), \phi(r_3), \dots \rangle$ is another presentation of $G$. We can now eliminate $x_1$ from the generating set and therefore $G$ has rank less than $n$. Hence we may assume that in any presentation of $G$ with $n$ generators no relator is primitive. Let $\phi$ be an automorphism of $F_n$ such that $\phi(r_1)$ is cyclically reduced and has no cut vertex. Now $G$ has a presentation of the form $\langle x_1, x_2, \dots, x_n \mid \phi(r_1), \phi(r_2), \dots \rangle$. Without loss of generality let us assume that $\phi(r_1)$ starts with $x_1$ to some positive power. Clearly $\{x_1\phi(r_1), x_2, \dots, x_n\}$ generates $G$ and thus we have the following result.

Theorem 0.1. Let $G$ be a group of rank $n$ which is not free and let $X = \{x_1, x_2, \dots, x_n\}$ be a set of generators of $G$. There exists a new set of generators for $G$, $\{u_1, u_2, \dots, u_n\}$, where each $u_j$ is a word in $X$, such that $\{u_1, u_2, \dots, u_n\}$ is not a basis for $F_n$, the free group with basis $X$. In particular they do not generate $F_n$. Moreover we can make $u_1$ be a non-primitive word in $F_n$.

References
1. Lyndon, R., Schupp, P. "Combinatorial Group Theory." Springer-Verlag, (1977).
2. Whitehead, J. H. C. "On certain sets of elements in a free group." Proc. London Math. Soc. 41, (1936), 48-56.

Matrix Completions over Principal Ideal Rings

William H. Gustafson
Texas Tech University, Lubbock, Texas

Donald W. Robinson
Brigham Young University, Provo, Utah

R. Bruce Richter
University of Waterloo, Waterloo, Ontario

William P. Wardlaw
U. S. Naval Academy, Annapolis, Maryland

Dedicated to Anthony M. Gaglione on his sixtieth birthday and to the memory of William H. Gustafson

Abstract We show that if A is a k x n matrix over a principal ideal ring R, with k < n, and if d is any element of the ideal generated by the k x k minors of A, then A forms the top k rows of an n x n matrix of determinant d. This parallels a 1981 result of Gustafson, Moore, and Reiner, and continues a program initiated by Hermite in 1849. Then we use these results to obtain an extension of a 1997 result of Richter and Wardlaw for good matrices.

1. Introduction

If $A$ is a $k \times n$ matrix with $k < n$, the matrix completion problem initiated by Hermite asks if $A$ can be completed to an $n \times n$ matrix with prescribed determinant $d$. Gustafson, Moore, and Reiner, at the beginning of [5], give a brief summary of the history of the problem of completing a $k \times n$ matrix with $k < n$ over certain commutative rings to an $n \times n$ matrix over the same ring with appropriate determinant. They also include references to some of the principal players in this program initiated by Hermite in 1849.

In our contribution below, we show in Theorem 1 that principal ideal rings are among the rings over which this matrix completion is always possible. Theorem 2 states the relationships between six properties of a $k \times n$ matrix over a commutative ring. It extends a similar theorem in [9] by giving a best possible exposition of these relationships. Finally we show in Theorem 3 that if such completions are always possible over each ring in a given collection of rings, then they are also always possible over the unrestricted direct product of the collection.

Throughout this paper, $R$ will denote a commutative ring with identity. If $A$ is a $k \times n$ matrix over $R$ with $k \le n$, then $D_k(A)$ denotes the ideal of $R$ generated by the $k \times k$ subdeterminants of $A$. We say that $A$ has left block form if $A$ is equivalent over $R$ to a matrix $E = [L \;\; 0]$, where $L$ is a $k \times (k + 1)$ matrix over $R$ and $0$ is the $k \times (n - k - 1)$ zero matrix over $R$. That is, there are matrices $P \in GL(k, R)$ and $Q \in GL(n, R)$ such that $PAQ = [L \;\; 0]$. Note that if $k = n - 1$ or $k = n$, the $0$ block is missing and $E = L$; indeed, we can take $A = E = L$.

2. Results

The following lemma was proved but not explicitly stated in [5], and was used to prove their main result. For the sake of completeness, we include a proof here.

Lemma. Let $A$ be a $k \times n$ matrix over the commutative ring $R$ with identity, let $k < n$, and let $d \in D_k(A)$. If $A$ has left block form over $R$, then $A$ enlarges to an $n \times n$ matrix $A^*$ over $R$ whose determinant is $d$ and whose top $k$ rows form the matrix $A$.

Proof. Let $P \in GL(k, R)$ and $Q \in GL(n, R)$ be such that $PAQ = E = [L \;\; 0]$, where $L$ is $k \times (k + 1)$. Clearly, $D_k(A) = D_k(E) = D_k(L)$. Let $c_j = (-1)^{k+1+j} \det(L_j)$, where $L_j$ is the $k \times k$ submatrix of $L$ obtained by deleting the $j$th column of $L$. Thus we can write $d \in D_k(A)$ as a linear combination $d = \sum a_j c_j = \det(L^*)$, where $L^*$ is the $(k+1) \times (k+1)$ matrix obtained from $L$ by adding $[a_1 \; a_2 \; \cdots \; a_{k+1}]$ as its last row. Let $p = \det(P)$ and $q = \det(Q)$, and multiply the last row of $L^*$ by the unit $pq$ to obtain the matrix $M^*$ with $\det(M^*) = pdq$. Now let $E^*$ be the direct sum of $M^*$ and the $(n - k - 1) \times (n - k - 1)$ identity matrix $I_{n-k-1}$; thus,

$$E^* = \begin{bmatrix} M^* & 0 \\ 0 & I_{n-k-1} \end{bmatrix} = \begin{bmatrix} E \\ F \end{bmatrix}$$

is an $n \times n$ matrix over $R$ with $\det(E^*) = pdq$ whose first $k$ rows form the matrix $E = PAQ$. It follows that the matrix

$$A^* = \begin{bmatrix} P^{-1} & 0 \\ 0 & I_{n-k} \end{bmatrix} E^* Q^{-1} = \begin{bmatrix} P^{-1}EQ^{-1} \\ FQ^{-1} \end{bmatrix} = \begin{bmatrix} A \\ A' \end{bmatrix}$$

has $\det(A^*) = d$ and its first $k$ rows form the matrix $A$. □
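The key step of the lemma, appending the row $[a_1 \; \cdots \; a_{k+1}]$ to $L$ so that the determinant becomes $d = \sum a_j c_j$, can be checked on a small integer example. This is a hedged sketch rather than the authors' computation: it takes $R = \mathbb{Z}$ and $k = n - 1$ (so $P$ and $Q$ are identities), the matrix `L` and the row `a` are made-up data, and the function names are illustrative.

```python
def det(M):
    """Determinant by Laplace expansion along the first row (exact for ints)."""
    if not M:
        return 1               # determinant of the empty 0 x 0 matrix
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def signed_minors(L):
    """c_j = (-1)^(k+1+j) det(L_j) for j = 1..k+1, L_j = L without column j."""
    k = len(L)
    return [(-1) ** (k + 1 + j) * det([row[:j - 1] + row[j:] for row in L])
            for j in range(1, k + 2)]

def complete(L, a):
    """Append the row a to L; the determinant is then sum_j a_j c_j."""
    return [list(row) for row in L] + [list(a)]

# Hypothetical data: a 2 x 3 integer matrix whose 2 x 2 minors generate (3).
L = [[1, 2, 3], [4, 5, 6]]
c = signed_minors(L)
a = [0, 1, 1]                  # chosen by hand so that sum a_j c_j = 3
L_star = complete(L, a)
d = sum(aj * cj for aj, cj in zip(a, c))
```

Here `det(L_star)` agrees with `d` because expanding along the appended last row produces exactly the signed minors $c_j$.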

Our first main result is

Theorem 1. Suppose that $R$ is either a Dedekind domain or a principal ideal ring, and that $A$ is a $k \times n$ matrix over $R$ with $k < n$. If $d$ is any element of $D_k(A)$, then there is an $n \times n$ matrix $A^*$ over $R$ with determinant $\det(A^*) = d$ whose first $k$ rows form the matrix $A$.

Proof. In view of our lemma, we need only establish that $A$ has a left block form over $R$ for each of the two cases. When $R$ is a Dedekind domain, Theorem 1 is the main result of [5], where they proved a lemma that every $k \times n$ matrix over a Dedekind domain $R$ has a left block form over $R$. They comment that this lemma was established in a more general form by Levy [6] in 1972. When $R$ is a principal ideal ring, W. C. Brown shows in [2, Thm. 15.24, p. 194] that every matrix over $R$ has a Smith normal form. When $A$ is $k \times n$ over $R$ with $k < n$, its Smith normal form is a left block form for $A$ over $R$. □

Since every principal ideal domain is also a Dedekind domain, Theorem 1 only extends the result of [5] when $R$ is a principal ideal ring with nonzero divisors of zero. We were especially interested in the connection between Theorem 1 and the 1997 result [9] regarding good matrices. In [9], $R$ was a commutative ring with identity and an $r \times n$ matrix $A$ over $R$ was defined to be left good if, for every vector $x$ in $R^{1 \times r}$, the ideal $(xA)$ generated by the entries in the vector $xA$ is the same as the ideal $(x)$ generated by the entries of the vector $x$. Our lemma allows us to extend the Main Theorem of [9] to our second main result.

Theorem 2. Consider the following statements about an $r \times n$ matrix $A$ over the commutative ring $R$ with identity.

(1) The rows of $A$ extend to a basis of $R^{1 \times n}$.
(2) $A$ can be enlarged to a matrix $A^* \in GL(n, R)$.
(3) $A$ has a Smith normal form $[I_r \;\; 0]$.
(4) $A$ has a right inverse over $R$.
(5) $D_r(A) = R$.
(6) $A$ is left good.

Then

(a) The statements (1), (2), and (3) are equivalent over any commutative ring $R$ with identity.
(b) The statements (4), (5), and (6) are equivalent over any commutative ring $R$ with identity.
(c) The statement (3) implies the statement (4) but in general they are not equivalent.
(d) If $A$ has left block form then all six statements are equivalent.

Proof. Theorem 2 (a), (b), and (c) were proved in [9], except for the implications (2) $\Rightarrow$ (3) and (5) $\Rightarrow$ (4), and the fact that (4) $\not\Rightarrow$ (3). The statement (2) means that there is an $(n - r) \times n$ matrix $A'$ over $R$ and an $n \times n$ matrix $B^*$ over $R$ such that

$$A^* = \begin{bmatrix} A \\ A' \end{bmatrix}$$

and $A^* B^* = I$ is the $n \times n$ identity matrix. But then it is clear that $AB^* = [I_r \;\; 0]$ is a Smith normal form for $A$. That is, (2) $\Rightarrow$ (3).

The implication (5) $\Rightarrow$ (4) is immediate from [8, Cor. 1.28, p. 84]. However, for the sake of completeness we give the following elementary proof. If $M$ is any $m \times n$ matrix over $R$ and $v = (c_1, \dots, c_r)$ a vector of column indices of $M$, so that $1 \le c_j \le n$, we let $M(v)$ denote the $m \times r$ submatrix of $M$ whose $j$th column is the $c_j$th column of $M$. It is easy to see that if $I_n$ is the $n \times n$ identity matrix, then $M(v) = M I_n(v)$. Now each $r$-subset $\{c_1, \dots, c_r\}$ of $\{1, 2, \dots, n\}$ with $1 \le c_1 < c_2 < \cdots < c_r \le n$ corresponds uniquely to a vector $v = (c_1, \dots, c_r)$, and we can number these vectors (perhaps lexicographically) $v_1, v_2, \dots, v_N$ with $N = \binom{n}{r}$.


Let $d_j = \det(A_j)$ with $A_j = A(v_j)$. Then $D_r(A) = R$ implies $1 = \sum b_j d_j$ for scalars $b_1, b_2, \dots, b_N$ in $R$. Now, for each $j = 1, 2, \dots, N$, let $B_j$ be the $n \times r$ matrix $B_j = I_n(v_j)\,\mathrm{Adj}(A_j)$, and let $B$ be the $n \times r$ matrix $B = \sum b_j B_j$. Then

$$AB = \sum b_j A B_j = \sum b_j A I_n(v_j)\,\mathrm{Adj}(A_j) = \sum b_j A_j\,\mathrm{Adj}(A_j) = \sum b_j d_j I_r = I_r$$

and (4) $A$ has a right inverse $B$. That is, (5) $\Rightarrow$ (4).

The following example from [4], attributed to Kaplansky in [1, p. 7], shows that (4) $\not\Rightarrow$ (3) in Theorem 2 (c), and hence the result of Theorem 2 (c) is the best possible. Let $R$ be the ring of polynomials in $x, y, z$ over the real numbers modulo the ideal generated by $x^2 + y^2 + z^2 - 1$. This is the ring of polynomial functions on the standard 2-sphere in 3-space. The $1 \times 3$ matrix $A = [x \; y \; z]$ has a right inverse $A^T$. If it had a Smith normal form $[1 \; 0 \; 0]$, then there would be a matrix $Q \in GL(3, R)$ such that $AQ = [1 \; 0 \; 0]$. Assume such a $Q$ exists with last column $q = [f \; g \; h]^T$. Then $Aq = xf + yg + zh = 0$ for all points on the 2-sphere. Thus $q$ provides a tangent vector field to the 2-sphere which, because of independence of the columns of $Q$, is never zero on the 2-sphere. But no such vector field exists, as is shown in [3, p. 70]. This contradiction shows (4) $\not\Rightarrow$ (3). (In fact, the same argument shows directly that $A$ does not have a left block form, since that would require an invertible $Q$ with $AQ = [u \; v \; 0]$.) This completes the proof of Theorem 2 (a), (b), and (c).

To establish (d), we first observe that when $r = n$, the implication (4) $\Rightarrow$ (2) is a tautology. Then we use our lemma to show that (5) $\Rightarrow$ (2) when $r < n$ and $A$ has a left block form over $R$. It is clear from (5) that $1 \in R = D_r(A)$. By our lemma, $A$ can be enlarged to a matrix $A^*$ with determinant 1 when $r < n$. It is well known that a matrix over a commutative ring $R$ with identity is invertible over $R$ if and only if its determinant is a unit in $R$. (See [7, Thm. 50, p. 158].) Hence (5) $\Rightarrow$ (2). □

We remark that if $A$ is an $(n - 1) \times n$ matrix over $R$, then it is already in left block form, so statements (1) - (6) of Theorem 2 are equivalent. In particular, if $A$ has a right inverse, then it extends to an $n \times n$ matrix which is invertible over $R$. (The latter was shown using an outer product argument in [9].)
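The elementary proof of (5) $\Rightarrow$ (4) is constructive, and the formula $B = \sum b_j I_n(v_j)\,\mathrm{Adj}(A_j)$ can be run directly. The sketch below is an illustration under stated assumptions, not code from the paper: it works over $R = \mathbb{Z}$, finds the scalars $b_j$ with the extended Euclidean algorithm, and uses a made-up $2 \times 3$ matrix; all function names are hypothetical.

```python
from itertools import combinations

def det(M):
    """Determinant by Laplace expansion along the first row."""
    if not M:
        return 1
    return sum((-1) ** j * M[0][j] * det([row[:j] + row[j + 1:] for row in M[1:]])
               for j in range(len(M[0])))

def adjugate(M):
    """Classical adjoint, so that M * adjugate(M) = det(M) * I."""
    n = len(M)
    def cofactor(i, j):
        minor = [row[:j] + row[j + 1:] for k, row in enumerate(M) if k != i]
        return (-1) ** (i + j) * det(minor)
    return [[cofactor(j, i) for j in range(n)] for i in range(n)]  # transpose

def egcd(a, b):
    """Return (g, x, y) with a*x + b*y == g."""
    if b == 0:
        return a, 1, 0
    g, x, y = egcd(b, a % b)
    return g, y, x - (a // b) * y

def bezout(values):
    """Return (g, coeffs) with sum(coeffs[j] * values[j]) == g, g = +-gcd."""
    g, coeffs = 0, []
    for v in values:
        g, x, y = egcd(g, v)
        coeffs = [c * x for c in coeffs] + [y]
    return g, coeffs

def right_inverse(A):
    """B = sum_j b_j I_n(v_j) Adj(A_j), assuming the r x r minors generate Z."""
    r, n = len(A), len(A[0])
    subsets = list(combinations(range(n), r))            # the vectors v_j
    subs = [[[row[c] for c in v] for row in A] for v in subsets]  # A_j = A(v_j)
    g, b = bezout([det(S) for S in subs])
    if abs(g) != 1:
        raise ValueError("D_r(A) is not all of Z")
    b = [g * bj for bj in b]                             # now sum b_j d_j = 1
    B = [[0] * r for _ in range(n)]
    for bj, v, S in zip(b, subsets, subs):
        adj = adjugate(S)
        for i, c in enumerate(v):    # I_n(v_j) puts row i of Adj(A_j) in row c_i
            for k in range(r):
                B[c][k] += bj * adj[i][k]
    return B

def matmul(X, Y):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

A = [[2, 0, 3], [0, 1, 0]]           # hypothetical example; minors are 2, 0, -3
B = right_inverse(A)
AB = matmul(A, B)
```

The right inverse is not unique, so the particular `B` depends on which Bezout coefficients the Euclidean algorithm happens to return; only the identity $AB = I_r$ is guaranteed.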
Recall that in the proof of Theorem 1 we observed that if $R$ was either a principal ideal ring or a Dedekind domain, then every $r \times n$ matrix over $R$ with $r \le n$ had a left block form. Thus we have the following corollary to Theorem 2.

Corollary. If $R$ is a principal ideal ring or a Dedekind domain, then statements (1) - (6) of Theorem 2 are equivalent.

This corollary extends the Main Theorem of [9] from principal ideal rings to rings which are either principal ideal rings or Dedekind domains. Our next theorem allows further extension of the class of rings for which certain properties mentioned above hold. Let $R$ be a commutative ring with identity. Then $R$ has property L if every $r \times n$ matrix $A$ over $R$ with $r \le n$ has a left block form over $R$. $R$ has property C if every $r \times n$ matrix $A$ over $R$ with $r < n$ has, for each $d \in D_r(A)$, an $n \times n$ completion $A^*$ with $\det(A^*) = d$. $R$ has property G if statements (1) - (6) of Theorem 2 are equivalent for every $r \times n$ matrix $A$ over $R$ with $r \le n$. Note that L $\Rightarrow$ C $\Rightarrow$ G, by our Lemma and Theorem 2.

Theorem 3. Let P be any one of the properties L, C, or G, and let $R = \bigoplus_j R_j$ be the unrestricted direct sum of the commutative rings $R_j$ ($j \in J$), where each $R_j$ has identity $1_j$. Then $R$ has property P if and only if each $R_j$ has property P.

Proof. We consider $R$ to be an internal direct sum, so each $R_j$ is a subring and an ideal of $R$. For each $a \in R$, $a_j = a 1_j$ denotes the projection of $a$ into $R_j$; we call $a_j$ the $j$-component of $a$. Thus $(a_j)_k = 0$ if $j \ne k$ and $(a_j)_j = a_j$ for all $j, k \in J$. If $A$ is a matrix over $R$, then we let $A_j = 1_j A$ be the matrix of the same size over $R_j$ obtained by replacing each entry in $A$ by its $j$-component. We write $A'_j$ to denote a matrix chosen with entries in $R_j$, to distinguish it from the $j$-component $A_j = 1_j A$ obtained from a matrix $A$ already chosen with entries in $R$. In the proofs below, we will often define a matrix $A$ over $R$ by first specifying a matrix $A'_j$ over $R_j$ for each $j \in J$, and letting $A$ be the matrix of the same size over $R$ with $j$-component $A_j = 1_j A = A'_j$.

Now suppose that $R$ has property L and that $A'_j \in (R_j)^{r \times n}$ with $r \le n$. Since $A'_j \in R^{r \times n}$, there are matrices $P \in GL(r, R)$ and $Q \in GL(n, R)$ such that $P A'_j Q = E = [L \;\; 0]$, with $L \in R^{r \times (r+1)}$. But $A'_j = 1_j A'_j$ implies that $P A'_j Q = P(1_j A'_j)Q = (1_j P)(A'_j)(1_j Q) = P_j A'_j Q_j = E = E_j = [L_j \;\; 0]$ with $P_j \in GL(r, R_j)$ and $Q_j \in GL(n, R_j)$. Thus, $R_j$ has property L.

On the other hand, suppose that for each $j \in J$, $R_j$ has property L, and that $A \in R^{r \times n}$ with $r \le n$. Then $A_j \in (R_j)^{r \times n}$ for each $j \in J$, and so there are matrices $P'_j \in GL(r, R_j)$ and $Q'_j \in GL(n, R_j)$ such that $P'_j A_j Q'_j = [L'_j \;\; 0]$ with $L'_j \in (R_j)^{r \times (r+1)}$. Let $P \in R^{r \times r}$ be the matrix with $j$-component $1_j P = P'_j$ and let $Q \in R^{n \times n}$ be the matrix with $j$-component $1_j Q = Q'_j$ for every $j \in J$. It is easy to see that $P \in GL(r, R)$, $Q \in GL(n, R)$, and $PAQ = [L \;\; 0]$ with $L \in R^{r \times (r+1)}$ such that $1_j L = L'_j$ for each $j \in J$. Thus, $R$ has property L.

Now suppose that $R$ has property C and that $A'_j \in (R_j)^{r \times n}$ with $r < n$ and $d \in D_r(A'_j)$. Since $A'_j \in R^{r \times n}$, there is an $A^* \in R^{n \times n}$ whose first $r$ rows form the matrix $A'_j$ and with determinant $\det(A^*) = d$. But the first $r$ rows of $1_j A^* = (A^*)_j \in (R_j)^{n \times n}$ also form the matrix $A'_j$ and $\det((A^*)_j) = d = d_j$. Thus, $R_j$ has property C.

On the other hand, suppose that for each $j \in J$, $R_j$ has property C, $A \in R^{r \times n}$ with $r < n$, and $d \in D_r(A)$. For each $j \in J$, $1_j A = A_j \in (R_j)^{r \times n}$ has $1_j d = d_j \in D_r(A_j)$ and has an $n \times n$ completion $(A_j)^*$ over $R_j$ with $\det((A_j)^*) = d_j$. Let $A^*$ be the $n \times n$ matrix over $R$ with $1_j A^* = (A^*)_j = (A_j)^*$ for each $j \in J$. Since $\det((A^*)_j) = d_j$ for each $j \in J$, it follows that $\det(A^*) = d$. Since the first $r$ rows of $(A^*)_j$ form the matrix $A_j$ for each $j \in J$, it follows that the first $r$ rows of $A^*$ form the matrix $A$. That is, $A^*$ is an $n \times n$ completion of $A$ with determinant $d$. Hence, $R$ has property C.

Suppose $R$ has property G and that $A'_j, (B'_j)^T \in (R_j)^{r \times n}$ satisfy $A'_j B'_j = (I_r)_j$, which is statement (4) of Theorem 2 for the ring $R_j$. Let $E = [I_r \;\; 0] - [I_r \;\; 0]_j$, $A = E + A'_j$, and $B = E^T + B'_j$. Note that $A_i = [I_r \;\; 0]_i$ if $i \ne j$, $A_j = A'_j$, and similarly for $B$. Then $AB = I_r$ shows that $A$ satisfies (4) for the ring $R$. Since $R$ has property G, $A$ must also satisfy (2), so $A$ has an invertible completion $A^*$ over $R$. It follows that $(A^*)_j \in GL(n, R_j)$ is an $n \times n$ completion of $A_j = A'_j$ over $R_j$. Thus, (4) $\Rightarrow$ (2) in $R_j$, so $R_j$ has property G.

Finally, suppose for each $j \in J$ that $R_j$ has property G and that $A, B^T \in R^{r \times n}$ satisfy $AB = I_r$. Then $A_j B_j = (I_r)_j$ for each $j \in J$, and so property G ensures that each $A_j$ can be completed to an $(A_j)^* \in GL(n, R_j)$. Now let $A^*$ be the $n \times n$ matrix over $R$ with $j$-component $1_j A^* = (A^*)_j = (A_j)^*$. Then $A^* \in GL(n, R)$ and its first $r$ rows form the matrix $A$. Thus (4) $\Rightarrow$ (2) in $R$, so $R$ has property G. □

References
1. H. Bass, Introduction to some methods of algebraic K-theory, CBMS 20, Amer. Math. Soc., Providence, RI, 1974.
2. W. C. Brown, Matrices over Commutative Rings, Dekker, New York, 1992.
3. M. J. Greenberg, Lectures on Algebraic Topology, W. A. Benjamin, New York, 1967.
4. W. H. Gustafson, P. R. Halmos, and J. M. Zelmanowitz, The Serre Conjecture, Amer. Math. Monthly 85 (1978), 357-359.
5. W. H. Gustafson, M. E. Moore, and I. Reiner, Matrix completions over Dedekind rings, Linear and Multilinear Algebra 10 (1981), 141-144.
6. L. S. Levy, Almost diagonal matrices over Dedekind domains, Math. Z. 124 (1972), 89-99.
7. N. H. McCoy, Rings and Ideals, Mathematical Association of America, Washington, 1965.
8. B. R. McDonald, Linear Algebra over Commutative Rings, Dekker, New York, 1984.
9. R. B. Richter and W. P. Wardlaw, Good matrices: matrices which preserve ideals, Amer. Math. Monthly 104 (1997), 932-938.

A primer on computational group homology and cohomology using GAP and SAGE

David Joyner

Department of Mathematics, US Naval Academy, Annapolis, MD, [email protected].

Dedicated to my friend and colleague Tony Gaglione on the occasion of his sixtieth birthday

These are expanded lecture notes of a series of expository talks surveying basic aspects of group cohomology and homology. They were written for someone who has had a first course in graduate algebra but no background in cohomology. You should know the definition of a (left) module over a (non-commutative) ring, what $\mathbb{Z}[G]$ is (where $G$ is a group written multiplicatively and $\mathbb{Z}$ denotes the integers), and some ring theory and group theory. However, an attempt has been made to (a) keep the presentation as simple as possible, (b) either provide an explicit reference or proof of everything.

Several computer algebra packages are used to illustrate the computations, though for various reasons we have focused on the free, open source packages, such as GAP [Gap] and SAGE [St] (which includes GAP). In particular, Graham Ellis generously allowed extensive use of his HAP [EI] documentation (which is sometimes copied almost verbatim) in the presentation below. Some interesting work not included in this (incomplete) survey is (for example) that of Marcus Bishop [Bi], Jon Carlson [C] (in MAGMA), David Green [Gr] (in C), Pierre Guillot [Gu] (in GAP, C++, and SAGE), and Marc Roder [Ro]. Though Graham Ellis' HAP package (and Marc Roder's add-on HAPcryst [Ro]) can compute cohomology and homology of some infinite groups, the computational examples given below are for finite groups only.

1. Introduction

First, some words of motivation.


Let $G$ be a group and $A$ a $G$-module$^a$. Let $A^G$ denote the largest submodule of $A$ on which $G$ acts trivially. Let us begin by asking ourselves the following natural question.

Question: Suppose $A$ is a submodule of a $G$-module $B$ and $x$ is an arbitrary $G$-fixed element of $B/A$. Is there an element $b$ in $B$, also fixed by $G$, which maps onto $x$ under the quotient map?

The answer to this question can be formulated in terms of group cohomology. ("Yes", if $H^1(G, A) = 0$.) The details, given below, will help motivate the introduction of group cohomology.

Let $A_G$ be the largest quotient module of $A$ on which $G$ acts trivially. Next, we ask ourselves the following analogous question.

Question: Suppose $A$ is a submodule of a $G$-module $B$ and $b$ is an arbitrary element of $B_G$ which maps to 0 under the natural map $B_G \to (B/A)_G$. Is there an element $a$ in $A_G$ which maps onto $b$ under the inclusion map?

The answer to this question can be formulated in terms of group homology. ("Yes", if $H_1(G, A) = 0$.) The details, given below, will help motivate the introduction of group homology.

Group cohomology arises as the right higher derived functor for $A \mapsto A^G$. The cohomology groups of $G$ with coefficients in $A$ are defined by

$$H^n(G, A) = \mathrm{Ext}^n_{\mathbb{Z}[G]}(\mathbb{Z}, A).$$

(See §4 below for more details.) These groups were first introduced in 1943 by S. Eilenberg and S. MacLane [EM]. The functor $A \mapsto A^G$ on the category of left $G$-modules is additive and left exact. This implies that if

$$0 \to A \to B \to C \to 0$$

is an exact sequence of $G$-modules then we have a long exact sequence of cohomology

$$0 \to A^G \to B^G \to C^G \to H^1(G, A) \to H^1(G, B) \to H^1(G, C) \to H^2(G, A) \to \cdots \tag{1}$$

$^a$ We call an abelian group $A$ (written additively) which is a left $\mathbb{Z}[G]$-module a $G$-module.
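The first question above can fail already for very small modules. The following sketch is an illustration that is not taken from the notes: it assumes $G = C_2$ acting on $B = \mathbb{Z}/4$ by negation, with $A = \{0, 2\}$, and simply enumerates fixed elements. The variable names are illustrative.

```python
# Illustrative example: B = Z/4 with the generator s of G = C_2 acting by
# b -> -b, and A = {0, 2} the submodule of even residues.
MOD = 4
B = range(MOD)
A = frozenset({0, 2})

def act(b):
    """Action of the nontrivial element s of C_2 on B."""
    return (-b) % MOD

def coset(b):
    """The coset b + A, viewed as an element of B/A."""
    return frozenset((b + a) % MOD for a in A)

B_fixed = [b for b in B if act(b) == b]                 # B^G
quotient = {coset(b) for b in B}                        # B/A
quotient_fixed = {x for x in quotient
                  if {act(b) for b in x} == set(x)}     # (B/A)^G
liftable = {coset(b) for b in B_fixed}                  # image of B^G in B/A
not_liftable = quotient_fixed - liftable
```

Here the coset $1 + A$ is $G$-fixed but has no $G$-fixed representative. This is consistent with the criterion in the text: $G$ acts trivially on $A \cong \mathbb{Z}/2$, so $H^1(G, A) = \mathrm{Hom}(C_2, \mathbb{Z}/2) \ne 0$.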


Similarly, group homology arises as the left higher derived functor for $A \mapsto A_G$. The homology groups of $G$ with coefficients in $A$ are defined by

$$H_n(G, A) = \mathrm{Tor}_n^{\mathbb{Z}[G]}(\mathbb{Z}, A).$$

(See §5 below for more details.) The functor $A \mapsto A_G$ on the category of left $G$-modules is additive and right exact. This implies that if

$$0 \to A \to B \to C \to 0$$

is an exact sequence of $G$-modules then we have a long exact sequence of homology

$$\cdots \to H_2(G, C) \to H_1(G, A) \to H_1(G, B) \to H_1(G, C) \to A_G \to B_G \to C_G \to 0. \tag{2}$$

Here we will define both cohomology $H^n(G, A)$ and homology $H_n(G, A)$ using projective resolutions and the higher derived functors $\mathrm{Ext}^n$ and $\mathrm{Tor}_n$. We "compute" these when $G$ is a finite cyclic group. We also give various functorial properties, such as corestriction, inflation, restriction, and transfer. Since some of these cohomology groups can be computed with the help of computer algebra systems, we also include some discussion of how to use computers to compute them. We include several applications to group theory.

One can also define $H^1(G, A)$, $H^2(G, A)$, ..., by explicitly constructing cocycles and coboundaries. Similarly, one can also define $H_1(G, A)$, $H_2(G, A)$, ..., by explicitly constructing cycles and boundaries. For the proof that these constructions yield the same groups, see Rotman [R], chapter 10.

For the general outline, we follow §7 in chapter 10 of [R] on homology. For some details, we follow Brown [B], Serre [S] or Weiss [W]. For a recent expository account of this topic, see for example Adem [A]. Another good reference is Brown [B].

2. Differential groups

In this section cohomology and homology are viewed in the same framework. This "differential groups" idea was introduced by Cartan and Eilenberg [CE], chapter IV, and developed in R. Godement [G], chapter 1, §2. However, we shall follow Weiss [W], chapter 1.


2.1. Definitions

A differential group is a pair $(L, d)$, $L$ an abelian group and $d : L \to L$ a homomorphism such that $d^2 = 0$. We call $d$ a differential operator. The group

$$H(L) = \mathrm{Kernel}(d)/\mathrm{Image}(d)$$

is the derived group of $(L, d)$. If

$$L = \bigoplus_{n=-\infty}^{\infty} L_n$$

then we call $L$ graded. Suppose $d$ (more precisely, $d|_{L_n}$) satisfies, in addition, for some fixed $r \ne 0$,

$$d(L_n) \subset L_{n+r}.$$

We say $d$ is compatible with the grading provided $r = \pm 1$. In this case, we call $(L, d, r)$ a graded differential group. As we shall see, the case $r = 1$ corresponds to cohomology and the case $r = -1$ corresponds to homology. Indeed, if $r = 1$ then we call $(L, d, r)$ a (differential) group of cohomology type and if $r = -1$ then we call $(L, d, r)$ a group of homology type. Note that if $L = \bigoplus_{n=-\infty}^{\infty} L_n$ is a group of cohomology type then $L' = \bigoplus_{n=-\infty}^{\infty} L'_n$ is a group of homology type, where $L'_n = L_{-n}$ for all $n \in \mathbb{Z}$.

For the impatient: For cohomology, we shall eventually take $L = \bigoplus_n \mathrm{Hom}_G(X_n, A)$, where the $X_n$ form a chain complex (with $+1$ grading) determined by a certain type of resolution. The group $H(L)$ is an abbreviation for $\bigoplus_n \mathrm{Ext}^n_{\mathbb{Z}[G]}(\mathbb{Z}, A)$. For homology, we shall eventually take $L = \bigoplus_n \mathbb{Z} \otimes_{\mathbb{Z}[G]} X_n$, where the $X_n$ form a chain complex (with $-1$ grading) determined by a certain type of resolution. The group $H(L)$ is an abbreviation for $\bigoplus_n \mathrm{Tor}_n^{\mathbb{Z}[G]}(\mathbb{Z}, A)$.
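For a finite differential group the derived group $H(L) = \mathrm{Kernel}(d)/\mathrm{Image}(d)$ can be found by brute force. The following minimal sketch is an illustration under assumptions not made in the notes: it takes $L = \mathbb{Z}/n$ (ungraded) with $d$ equal to multiplication by a constant $c$, and the function name `derived_group_order` is hypothetical.

```python
def derived_group_order(n, c):
    """Order of H(L) = Kernel(d)/Image(d) for L = Z/n and d(x) = c*x mod n."""
    def d(x):
        return (c * x) % n
    # A differential operator must satisfy d^2 = 0.
    if any(d(d(x)) != 0 for x in range(n)):
        raise ValueError("d is not a differential: d^2 != 0")
    kernel = {x for x in range(n) if d(x) == 0}
    image = {d(x) for x in range(n)}
    # Image(d) is a subgroup of Kernel(d), so the quotient has this order.
    return len(kernel) // len(image)

order_8_4 = derived_group_order(8, 4)   # kernel {0,2,4,6}, image {0,4}
order_4_2 = derived_group_order(4, 2)   # kernel and image are both {0,2}
```

For $L = \mathbb{Z}/8$ and $d = 4\cdot$ the derived group has order 2, while for $L = \mathbb{Z}/4$ and $d = 2\cdot$ it is trivial, since there the kernel and the image coincide.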

Let $(L, d) = (L, d_L)$ and $(M, d) = (M, d_M)$ be differential groups (to be more precise, we should use different symbols for the differential operators of $L$ and $M$ but, for notational simplicity, we use the same symbol and hope the context removes any ambiguity). A homomorphism $f : L \to M$ satisfying $d \circ f = f \circ d$ will be called admissible. For any $n \in \mathbb{Z}$, we define $nf : L \to M$ by $(nf)(x) = n \cdot f(x) = f(x) + \cdots + f(x)$ ($n$ times). If $f$ is admissible then so is $nf$, for any $n \in \mathbb{Z}$. An admissible map $f$ gives rise to a map of derived groups: define the map $f_* : H(L) \to H(M)$ by $f_*(x + dL) = f(x) + dM$, for all $x \in L$.

2.2. Properties

Let $f$ be an admissible map as above.

(1) The map $f_* : H(L) \to H(M)$ is a homomorphism.

(2) If $f : L \to M$ and $g : L \to M$ are admissible, then so is $f + g$ and we have $(f + g)_* = f_* + g_*$.

(3) If $f : L \to M$ and $g : M \to N$ are admissible then so is $g \circ f : L \to N$ and we have $(g \circ f)_* = g_* \circ f_*$.

(4) If

$$0 \to L \xrightarrow{\ i\ } M \xrightarrow{\ j\ } N \to 0 \tag{3}$$

is an exact sequence of differential groups with admissible maps $i$, $j$ then there is a homomorphism $d_* : H(N) \to H(L)$ for which the following triangle is exact:

$$\begin{array}{ccc} H(L) & \xleftarrow{\ d_*\ } & H(N) \\ & {\scriptstyle i_*}\searrow \quad \nearrow{\scriptstyle j_*} & \\ & H(M) & \end{array} \tag{4}$$

This diagram$^b$ encodes both the long exact sequence of cohomology (1) and the long exact sequence of homology (2).

$^b$ This is a special case of Théorème 2.1.1 in [G].

Here is the construction of $d_*$: Recall $H(N) = \mathrm{Kernel}(d)/\mathrm{Image}(d)$, so any $x \in H(N)$ is represented by an $n \in N$ with $dn = 0$. Since $j$ is surjective, there is an $m \in M$ such that $j(m) = n$. Since $j$ is admissible and the sequence is exact, $j(dm) = d(j(m)) = dn = 0$, so $dm \in \mathrm{Kernel}(j) = \mathrm{Image}(i)$. Therefore, there is an $\ell \in L$ such that $dm = i(\ell)$. Define $d_*(x)$ to be the class of $\ell$ in $H(L)$, i.e., $d_*(x) = \ell + dL$.

Here's the verification that $d_*$ is well-defined: We must show that if we defined instead $d_*(x) = \ell' + dL$, some $\ell' \in L$, then $\ell' - \ell \in dL$. Pull back the above $n \in N$ with $dn = 0$ to an $m \in M$ such that $j(m) = n$. As above, there is an $\ell \in L$ such that $dm = i(\ell)$. Represent $x \in H(N)$ by an $n' \in N$, so $x = n' + dN$ and $dn' = 0$. Pull back this $n'$ to an $m' \in M$ such that $j(m') = n'$. As above, there is an $\ell' \in L$ such that $dm' = i(\ell')$. We know $n' - n \in dN$, so $n' - n = dn''$, some $n'' \in N$. Let $j(m'') = n''$, some $m'' \in M$, so $j(m' - m - dm'') = n' - n - j(dm'') = n' - n - dj(m'') = n' - n - dn'' = 0$. Since the sequence $L \to M \to N$ is exact, this implies there is an $\ell_0 \in L$ such that $i(\ell_0) = m' - m - dm''$. But $d(i(\ell_0)) = i(d\ell_0) = dm' - dm = i(\ell') - i(\ell) = i(\ell' - \ell)$, and $i$ is injective, so $\ell' - \ell = d\ell_0 \in dL$.

(5) If $M = L \oplus N$ then $H(M) = H(L) \oplus H(N)$.

Proof: To avoid ambiguity, for the moment, let $d_X$ denote the differential operator on $X$, where $X \in \{L, M, N\}$. In the notation of (3), $j$ is projection and $i$ is inclusion. Since both are admissible, we know that $d_M|_L = d_L$ and $d_M|_N = d_N$. Note that $H(X) \subset X$, for any differential group $X$, so $H(M) = H(M) \cap L \oplus H(M) \cap N \subset H(L) \oplus H(N)$. It follows from this that $d_* = 0$. From the exactness of the triangle (4), it therefore follows that this inclusion is an equality.

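□

(6) Let $L, L', M, M', N, N'$ be differential groups. If

$$\begin{array}{ccccccccc}
0 & \to & L & \xrightarrow{\ i\ } & M & \xrightarrow{\ j\ } & N & \to & 0 \\
 & & \downarrow f & & \downarrow g & & \downarrow h & & \\
0 & \to & L' & \xrightarrow{\ i'\ } & M' & \xrightarrow{\ j'\ } & N' & \to & 0
\end{array} \qquad (5)$$

is a commutative diagram of exact sequences with $i, i', j, j', f, g, h$ all admissible then

$$\begin{array}{ccc}
H(L) & \xrightarrow{\ i_*\ } & H(M) \\
\downarrow f_* & & \downarrow g_* \\
H(L') & \xrightarrow{\ i'_*\ } & H(M')
\end{array}$$

commutes,

$$\begin{array}{ccc}
H(M) & \xrightarrow{\ j_*\ } & H(N) \\
\downarrow g_* & & \downarrow h_* \\
H(M') & \xrightarrow{\ j'_*\ } & H(N')
\end{array}$$

commutes, and

$$\begin{array}{ccc}
H(N) & \xrightarrow{\ d_*\ } & H(L) \\
\downarrow h_* & & \downarrow f_* \\
H(N') & \xrightarrow{\ d_*\ } & H(L')
\end{array}$$

commutes. This is a case of Theorem 1.1.3 in [W] and of Théorème 2.1.1 in [G]. The proofs that the first two squares commute are similar, so we only verify one and leave the other to the reader. By assumption, (5) commutes and all the maps are admissible. Representing $x \in H(M)$ by $x = m + dM$, we have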

$$h_* j_*(x) = h_*(j(m) + dN) = hj(m) + dN' = j'g(m) + dN' = j'_*(g(m) + dM') = j'_* g_*(m + dM) = j'_* g_*(x),$$

as desired. The proof that the last square commutes is a little different from this, so we prove this too. Represent $x \in H(N)$ by $x = n + dN$ with $dn = 0$ and recall that $d_*(x) = \ell + dL$, where $dm = i(\ell)$, $\ell \in L$, and $j(m) = n$, for $m \in M$. We have

$$f_* d_*(x) = f_*(\ell + dL) = f(\ell) + dL'.$$

On the other hand,

$$d_* h_*(x) = d_*(h(n) + dN') = \ell' + dL',$$

for some $\ell' \in L'$. Since $h(n) \in N'$, by the commutativity of (5) and the definition of $d_*$, $\ell' \in L'$ is an element such that $i'(\ell') = gi(\ell)$. Since $i'$ is injective, this condition on $\ell'$ determines it uniquely mod $dL'$. By the commutativity of (5), we may take $\ell' = f(\ell)$.


(7) Let L, L', M, M', N, N' be differential graded groups with grading +1 (i.e., of "cohomology type"). Suppose that we have a commutative diagram, with all maps admissible and all rows exact as in (5). Then the following diagram is commutative and has exact rows:

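This is Proposition 1.1.4 in [W]. As pointed out there, it is an immediate consequence of properties 1-6 above. Compare this with Proposition 10.69 in [R].

(8) Let $L, L', M, M', N, N'$ be differential graded groups with grading $-1$ (i.e., of "homology type"). Suppose that we have a commutative diagram, with all maps admissible and all rows exact, as in (5). Then the following diagram is commutative and has exact rows:

$$\begin{array}{ccccccccccc}
\cdots \to & H_{n+1}(N) & \xrightarrow{d_*} & H_n(L) & \xrightarrow{i_*} & H_n(M) & \xrightarrow{j_*} & H_n(N) & \xrightarrow{d_*} & H_{n-1}(L) & \to \cdots \\
 & \downarrow h_* & & \downarrow f_* & & \downarrow g_* & & \downarrow h_* & & \downarrow f_* & \\
\cdots \to & H_{n+1}(N') & \xrightarrow{d_*} & H_n(L') & \xrightarrow{i'_*} & H_n(M') & \xrightarrow{j'_*} & H_n(N') & \xrightarrow{d_*} & H_{n-1}(L') & \to \cdots
\end{array}$$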

This is the analog of the previous property and is proven similarly. Compare this with Proposition 10.58 in [R].

(9) Let $(L, d)$ be a differential graded group with grading $r$. If $d_n = d|_{L_n}$ then $d_{n+r} \circ d_n = 0$ and

(6) is exact.

(10) If $\{L_n \mid n \in \mathbb{Z}\}$ is a sequence of abelian groups with homomorphisms $d_n$ satisfying (6) then $(L, d)$ is a differential group, where $L = \oplus_n L_n$ and $d = \oplus_n d_n$.

2.3. Homology and cohomology

When $r = 1$, we call $L_n$ the group of $n$-cochains, $Z^n = L_n \cap \mathrm{Kernel}(d_n)$ the group of $n$-cocycles, and $B^n = L_n \cap d_{n-1}(L_{n-1})$ the group of $n$-coboundaries. We call $H^n(L) = Z^n/B^n$ the $n$th cohomology group.

When $r = -1$, we call $L_n$ the group of $n$-chains, $Z_n = L_n \cap \mathrm{Kernel}(d_n)$ the group of $n$-cycles, and $B_n = L_n \cap d_{n+1}(L_{n+1})$ the group of $n$-boundaries. We call $H_n(L) = Z_n/B_n$ the $n$th homology group.
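To make the definition $H_n(L) = Z_n/B_n$ concrete, here is a minimal Python sketch for the case $r = -1$. It makes two assumptions that are not in the text: coefficients are taken in $\mathbb{Q}$, so only the ranks of the homology groups are computed (torsion is lost), and the example complex (the boundary of a triangle, with the hypothetical names `d1` and `circle`) is chosen purely for illustration. It uses $\dim H_n = (\dim L_n - \mathrm{rank}\, d_n) - \mathrm{rank}\, d_{n+1}$.

```python
from fractions import Fraction


def rank(matrix):
    """Rank of an integer matrix over Q, by Gauss-Jordan elimination."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def betti_numbers(dims, boundaries):
    """dims[n] is the rank of L_n; boundaries[n] is the matrix of
    d_n : L_n -> L_{n-1} for n >= 1 (columns indexed by a basis of L_n)."""
    ranks = {n: rank(m) for n, m in boundaries.items()}
    # dim Z_n = dims[n] - rank(d_n) and dim B_n = rank(d_{n+1})
    return [dims[n] - ranks.get(n, 0) - ranks.get(n + 1, 0)
            for n in range(len(dims))]


# Illustrative complex: vertices v0, v1, v2 and edges e01, e12, e02,
# with d1(e_ij) = v_j - v_i (rows are vertices, columns are edges).
d1 = [[-1, 0, -1],
      [1, -1, 0],
      [0, 1, 1]]
circle = betti_numbers([3, 3], {1: d1})
print(circle)
```

Working over $\mathbb{Q}$ keeps the sketch short; recovering the integral groups themselves would need a Smith normal form computation instead of a rank.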


3. Complexes

We introduce complexes in order to define explicit differential groups which will then be used to construct group (co)homology.

3.1. Definitions

Let $R$ be a (possibly non-commutative) ring, for example $R = \mathbb{Z}[G]$. We shall define a "finite free, acyclic, augmented chain complex" of left $R$-modules. A complex (or chain complex or $R$-complex with a negative grading) is a sequence of maps

$$\cdots \to X_{n+1} \xrightarrow{\partial_{n+1}} X_n \xrightarrow{\partial_n} X_{n-1} \xrightarrow{\partial_{n-1}} X_{n-2} \to \cdots \qquad (7)$$

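for which $\partial_n \partial_{n+1} = 0$, for all $n$. If each $X_n$ is a free $R$-module with a finite basis over $R$ (so is $\cong R^k$, for some $k$) then the complex is called finite free. If this sequence is exact then it is called an acyclic complex. The complex is augmented if there is a surjective $R$-module homomorphism $\varepsilon : X_0 \to \mathbb{Z}$ and an injective $R$-module homomorphism $\mu : \mathbb{Z} \to X_{-1}$ such that $\partial_0 = \mu \circ \varepsilon$, where (as usual) $\mathbb{Z}$ is regarded as a trivial $R$-module. The standard diagram for such an $R$-complex is

$$\begin{array}{ccccccccccc}
\cdots \to & X_2 & \xrightarrow{\partial_2} & X_1 & \xrightarrow{\partial_1} & X_0 & \xrightarrow{\ \partial_0\ } & X_{-1} & \xrightarrow{\partial_{-1}} & X_{-2} & \to \cdots \\
 & & & & & \downarrow \varepsilon & & \uparrow \mu & & & \\
 & & & & & \mathbb{Z} & = & \mathbb{Z} & & & \\
 & & & & & \downarrow & & \uparrow & & & \\
 & & & & & 0 & & 0 & & &
\end{array}$$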

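Such an acyclic augmented complex can be broken up into the positive part

$$\cdots \to X_2 \xrightarrow{\partial_2} X_1 \xrightarrow{\partial_1} X_0 \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0$$

and the negative part

$$0 \to \mathbb{Z} \xrightarrow{\ \mu\ } X_{-1} \xrightarrow{\partial_{-1}} X_{-2} \xrightarrow{\partial_{-2}} X_{-3} \to \cdots$$

Conversely, given a positive part and a negative part, they can be combined into a standard diagram by taking $\partial_0 = \mu \circ \varepsilon$.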


If $X$ is any left $R$-module, let $X^* = \mathrm{Hom}_R(X, \mathbb{Z})$ be the dual $R$-module, where $\mathbb{Z}$ is regarded as a trivial $R$-module. Associated to any $f \in \mathrm{Hom}_R(X, Y)$ is the pull-back $f^* \in \mathrm{Hom}_R(Y^*, X^*)$. (If $y^* \in Y^*$ then define $f^*(y^*)$ to be $y^* \circ f : X \to \mathbb{Z}$.) Since "dualizing" reverses the direction of the maps, if you dualize the entire complex with a $-1$ grading, you will get a complex with a $+1$ grading. This is the dual complex. When $R = \mathbb{Z}[G]$ then we call a finite free, acyclic, augmented chain complex of left $R$-modules a $G$-resolution. The maps $\partial_i : X_i \to X_{i-1}$ are sometimes called boundary maps.

Remark 3.1. Using the command BoundaryMap in the GAP CRIME package of Marcus Bishop, one can easily compute the boundary maps of a cohomology object associated to a $G$-module. However, $G$ must be a $p$-group.

Example 3.1. We use the package HAP [E1] to illustrate some of these concepts more concretely. Let $G$ be a finite group, whose elements we have ordered in some way: $G = \{g_1, \ldots, g_n\}$. Since a $G$-resolution $X_*$ determines a sequence of finitely generated free $\mathbb{Z}[G]$-modules, to concretely describe $X_*$ we must be able to concretely describe a finite free $\mathbb{Z}[G]$-module. In order to represent a word $w$ in a free $\mathbb{Z}[G]$-module $M$ of rank $n$, we use a list of integer pairs $w = [[i_1, e_1], [i_2, e_2], \ldots, [i_k, e_k]]$. The integers $i_j$ lie in the range $\{-n, \ldots, n\}$ and correspond to the free $\mathbb{Z}[G]$-generators of $M$ and their additive inverses. The integers $e_j$ are positive (but not necessarily distinct) and correspond to the group element $g_{e_j}$. Let's begin with a HAP computation.

------------------------------ GAP ------------------------------
gap> LoadPackage("hap");
true
gap> G:=Group([(1,2,3),(1,2)]);;
gap> R:=ResolutionFiniteGroup(G, 4);;
-----------------------------------------------------------------

This computes the first 5 terms of a $G$-resolution ($G = S_3$).

The boundary maps $\partial_i$ are determined from the boundary component of the GAP record R. This record has (among others) the following components:

• R!.dimension(k) - the $\mathbb{Z}[G]$-rank of the module $X_k$,
• R!.boundary(k, j) - the image in $X_{k-1}$ of the $j$-th free generator of $X_k$,
• R!.elts - the elements in $G$,
• R!.group - the group in question.

Here is an illustration:

------------------------------ GAP ------------------------------
gap> R!.group;
Group([ (1,2), (1,2,3) ])
gap> R!.elts;
[ (), (2,3), (1,2), (1,2,3), (1,3,2), (1,3) ]
gap> R!.dimension(3);
4
gap> R!.boundary(3,1);
[ [ 1, 2 ], [ -1, 1 ] ]
gap> R!.boundary(3,2);
[ [ 2, 2 ], [ -2, 4 ] ]
gap> R!.boundary(3,3);
[ [ 3, 4 ], [ 1, 3 ], [ -3, 1 ], [ -1, 1 ] ]
gap> R!.boundary(3,4);
[ [ 2, 5 ], [ -3, 3 ], [ 2, 4 ], [ -1, 4 ], [ 2, 1 ], [ -3, 1 ] ]
-----------------------------------------------------------------

In other words, $X_3$ is rank 4 as a $G$-module, with generators $\{f_1, f_2, f_3, f_4\}$ say, and

Now, let us create another resolution and compute the equivariant chain map between them. Below is the complete GAP session:

------------------------------ GAP ------------------------------
gap> G1:=Group([(1,2,3),(1,2)]);
Group([ (1,2,3), (1,2) ])
gap> G2:=Group([(1,2,3),(2,3)]);
Group([ (1,2,3), (2,3) ])
gap> phi:=GroupHomomorphismByImages(G1,G2,[(1,2,3),(1,2)],[(1,2,3),(2,3)]);
[ (1,2,3), (1,2) ] -> [ (1,2,3), (2,3) ]
gap> R1:=ResolutionFiniteGroup(G1, 4);
Resolution of length 4 in characteristic 0 for Group([ (1,2), (1,2,3) ]) .
gap> R2:=ResolutionFiniteGroup(G2, 4);
Resolution of length 4 in characteristic 0 for Group([ (2,3), (1,2,3) ]) .
gap> ZP_map:=EquivariantChainMap(R1, R2, phi);
Equivariant Chain Map between resolutions of length 4 .
gap> map := TensorWithIntegers( ZP_map);
Chain Map between complexes of length 4 .
gap> Hphi := Homology( map, 3);
[ f1, f2, f3 ] -> [ f2, f2*f3, f1*f2^2 ]
gap> AbelianInvariants(Image(Hphi));
[ 2, 3 ]
gap> GroupHomology(G1,3);
[ 6 ]
gap> GroupHomology(G2,3);
[ 6 ]
-----------------------------------------------------------------

In other words, $H_3(\phi)$ is an isomorphism (as it should be, since the homology is independent of the resolution chosen).

3.2. Constructions

Let $R = \mathbb{Z}[G]$.

3.2.1. Bar resolution

This section follows §1.3 in [W]. Define a symbol $[\cdot]$ and call it the empty cell. Let $X_0 = R[\cdot]$, so $X_0$ is a finite free (left) $R$-module whose basis has only 1 element. For $n > 0$, let $g_1, \ldots, g_n \in G$ and define an $n$-cell to be the symbol $[g_1, \ldots, g_n]$. Let

$$X_n = \bigoplus_{(g_1, \ldots, g_n) \in G^n} R[g_1, \ldots, g_n],$$

where the sum runs over all ordered $n$-tuples in $G^n$. Define the differential operators $d_n$ and the augmentation $\varepsilon$, as $G$-module maps, by

$$\varepsilon(g[\cdot]) = 1, \quad g \in G,$$
$$d_1([g]) = g[\cdot] - [\cdot],$$
$$d_2([g_1, g_2]) = g_1[g_2] - [g_1 g_2] + [g_1],$$
$$d_n([g_1, \ldots, g_n]) = g_1[g_2, \ldots, g_n] + \sum_{i=1}^{n-1} (-1)^i [g_1, \ldots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \ldots, g_n] + (-1)^n [g_1, \ldots, g_{n-1}],$$

for $n \geq 1$. Note that the condition $\varepsilon(g[\cdot]) = 1$ for all $g \in G$ is equivalent to saying $\varepsilon([\cdot]) = 1$. This is because $\varepsilon$ is a $G$-module homomorphism and $\mathbb{Z}$ is a trivial $G$-module, so $\varepsilon(g[\cdot]) = g\varepsilon([\cdot]) = g \cdot 1 = 1$, where the (trivial) $G$-action on $\mathbb{Z}$ is denoted by a $\cdot$. The $X_n$ are finite free $G$-modules, with the set of all $n$-cells serving as a basis.

Proposition 3.1. With these definitions, the sequence

$$\cdots \to X_2 \xrightarrow{d_2} X_1 \xrightarrow{d_1} X_0 \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0,$$

is a free $G$-resolution.

Sometimes this resolution is called the bar resolution$^c$. There are two other resolutions we shall consider. One is the closely related "homogeneous resolution" and the other is the "normalized bar resolution".

This simple-looking proposition is not so simple to prove. First, we shall show it is a complex, i.e., $d^2 = 0$. Then, and this is the most non-trivial part of the proof, we show that the sequence is exact. First, we need some definitions and a lemma.

Let $f : L \to M$ and $g : L \to M$ be $+1$-graded admissible maps. We say $f$ is homotopic to $g$ if there is a homomorphism $D : L \to M$, called a homotopy, such that

• $D_n = D|_{L_n} : L_n \to M_{n+1}$,
• $f - g = Dd + dD$.

$^c$ This resolution is not the same as the resolution computed by HAP in Example 3.1. For details on the resolution used by HAP, please see Ellis [E2].


If $L = M$ and the identity map $1 : L \to L$ is homotopic to the zero map $0 : L \to L$ then the homotopy is called a contracting homotopy for $L$.

Lemma 3.1. If $L$ has a contracting homotopy then $H(L) = 0$.

proof: Represent $x \in H(L)$ by $\ell \in L$ with $d\ell = 0$. But $\ell = 1(\ell) - 0(\ell) = dD(\ell) + Dd(\ell) = dD(\ell)$. Since $D : L \to L$, this shows $\ell \in dL$, so $x = 0$ in $H(L)$. □

Next, we construct a contracting homotopy for the complex $X_*$ in Proposition 3.1 with differential operator $d$. Actually, we shall temporarily let $X_{-1} = \mathbb{Z}$, $X_{-n} = 0$ and $d_{-n} = 0$ for $n > 1$, so that the complex is infinite in both directions. We must define $D : X \to X$ such that

• $D_{-1} = D|_{\mathbb{Z}} : \mathbb{Z} \to X_0$,
• $D_n = D|_{X_n} : X_n \to X_{n+1}$,
• $\varepsilon D_{-1} = 1$ on $\mathbb{Z}$,
• $d_1 D_0 + D_{-1}\varepsilon = 1$ on $X_0$,
• $d_{n+1} D_n + D_{n-1} d_n = 1$ in $X_n$, for $n \geq 1$.

Define

$$D_{-n} = 0, \quad n > 1,$$
$$D_{-1}(1) = [\cdot],$$
$$D_0(g[\cdot]) = [g],$$
$$D_n(g[g_1, \ldots, g_n]) = [g, g_1, \ldots, g_n], \quad n > 0,$$

and extend to a $\mathbb{Z}$-basis linearly. Now we must verify the desired properties. By definition, for $m \in \mathbb{Z}$,

$$\varepsilon D_{-1}(m) = \varepsilon(m[\cdot]) = m\varepsilon([\cdot]) = m.$$

Therefore, $\varepsilon D_{-1}$ is the identity map on $\mathbb{Z}$. Similarly,

$$(d_1 D_0 + D_{-1}\varepsilon)(g[\cdot]) = d_1([g]) + D_{-1}(1) = g[\cdot] - [\cdot] + D_{-1}(1) = g[\cdot] - [\cdot] + [\cdot] = g[\cdot].$$

For the last property, we compute

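$$d_{n+1} D_n(g[g_1, \ldots, g_n]) = d_{n+1}([g, g_1, \ldots, g_n])$$
$$= g[g_1, \ldots, g_n] - [gg_1, g_2, \ldots, g_n] + \sum_{i=1}^{n-1} (-1)^{i-1} [g, g_1, \ldots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \ldots, g_n] + (-1)^{n+1} [g, g_1, \ldots, g_{n-1}]$$

and

$$D_{n-1} d_n(g[g_1, \ldots, g_n]) = D_{n-1}(g\, d_n([g_1, \ldots, g_n]))$$
$$= D_{n-1}\Big(gg_1[g_2, \ldots, g_n] + \sum_{i=1}^{n-1} (-1)^i g[g_1, \ldots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \ldots, g_n] + (-1)^n g[g_1, \ldots, g_{n-1}]\Big)$$
$$= [gg_1, g_2, \ldots, g_n] + \sum_{i=1}^{n-1} (-1)^i [g, g_1, \ldots, g_{i-1}, g_i g_{i+1}, g_{i+2}, \ldots, g_n] + (-1)^n [g, g_1, \ldots, g_{n-1}].$$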

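All the terms but one cancel, verifying that $d_{n+1} D_n + D_{n-1} d_n = 1$ in $X_n$, for $n \geq 1$.

Now we show $d^2 = 0$. One verifies $d_1 d_2 = 0$ directly (which is left to the reader). Multiply $d_k D_{k-1} + D_{k-2} d_{k-1} = 1$ on the right by $d_k$ and $d_{k+1} D_k + D_{k-1} d_k = 1$ on the left by $d_k$:

$$d_k D_{k-1} d_k + D_{k-2} d_{k-1} d_k = d_k = d_k d_{k+1} D_k + d_k D_{k-1} d_k.$$

Cancelling like terms gives $D_{k-2} d_{k-1} d_k = d_k d_{k+1} D_k$, so the induction hypothesis $d_{k-1} d_k = 0$ implies $d_k d_{k+1} D_k = 0$. Since every $(k+1)$-cell lies in the image of $D_k$ and the $(k+1)$-cells generate $X_{k+1}$ as a $G$-module, this implies $d_k d_{k+1} = 0$. This shows $d^2 = 0$ and hence, by Lemma 3.1 applied to the contracting homotopy $D$, that the sequence in Proposition 3.1 is exact. This completes the proof of Proposition 3.1. □

The above complex can be "dualized" in the sense of §3.1. This dualized complex is of the form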

$$0 \to \mathbb{Z} \xrightarrow{\ \mu\ } X_{-1} \xrightarrow{d_{-1}} X_{-2} \xrightarrow{d_{-2}} X_{-3} \to \cdots$$

The standard G-resolution is obtained by splicing these together.
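The identities used in the proof of Proposition 3.1 can be checked by machine in small cases. The following Python sketch is only an illustration under stated assumptions: the group is taken to be $S_3$ (an arbitrary small choice), a chain is stored as a dictionary sending a pair (group element, cell) to its integer coefficient, and the helper names `mul`, `add_term`, `plus`, `d`, `eps`, `D` are hypothetical. It tests $d_1 d_2 = 0$, $d_2 d_3 = 0$, $\varepsilon d_1 = 0$ and the homotopy identity $d_{n+1} D_n + D_{n-1} d_n = 1$ for $n = 1, 2$.

```python
from itertools import permutations, product

# Elements of S_3 as tuples p, with p[i] the image of i; mul(p, q) is "p after q".
G = list(permutations(range(3)))
E = (0, 1, 2)  # identity element


def mul(p, q):
    return tuple(p[q[i]] for i in range(3))


def add_term(chain, key, coeff):
    """Add coeff * key to a chain stored as {(g, cell): integer}."""
    value = chain.get(key, 0) + coeff
    if value:
        chain[key] = value
    else:
        chain.pop(key, None)


def plus(a, b):
    out = dict(a)
    for key, coeff in b.items():
        add_term(out, key, coeff)
    return out


def d(chain):
    """Bar differential d_n (n >= 1), extended Z[G]-linearly from the n-cells."""
    out = {}
    for (g, cell), c in chain.items():
        n = len(cell)
        if n == 0:
            continue  # on X_0 the map is the augmentation, handled by eps
        add_term(out, (mul(g, cell[0]), cell[1:]), c)
        for i in range(1, n):
            merged = cell[:i - 1] + (mul(cell[i - 1], cell[i]),) + cell[i + 1:]
            add_term(out, (g, merged), (-1) ** i * c)
        add_term(out, (g, cell[:-1]), (-1) ** n * c)
    return out


def eps(chain):
    """Augmentation X_0 -> Z: each g[.] is sent to 1."""
    return sum(c for (g, cell), c in chain.items() if cell == ())


def D(chain):
    """Contracting homotopy: g[g_1,...,g_n] -> [g, g_1,...,g_n]."""
    out = {}
    for (g, cell), c in chain.items():
        add_term(out, (E, (g,) + cell), c)
    return out


def cells(n):
    return product(G, repeat=n)


dd_zero = all(d(d({(E, c): 1})) == {} for n in (2, 3) for c in cells(n))
eps_d_zero = all(eps(d({(E, (g,)): 1})) == 0 for g in G)
homotopy_ok = all(plus(d(D({(g, c): 1})), D(d({(g, c): 1}))) == {(g, c): 1}
                  for n in (1, 2) for g in G for c in cells(n))
print(dd_zero, eps_d_zero, homotopy_ok)
```

A finite check of this kind is no substitute for the proof, but it is a quick way to catch sign or indexing slips in the formula for $d_n$.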


3.2.2. Normalized bar resolution

Define the normalized cells by

$$[g_1, \ldots, g_n]^* = \begin{cases} [g_1, \ldots, g_n], & \text{if all } g_i \neq 1, \\ 0, & \text{if some } g_i = 1. \end{cases}$$

Let $X_0 = R[\cdot]$ and

$$X_n = \bigoplus_{(g_1, \ldots, g_n) \in G^n} R[g_1, \ldots, g_n]^*, \quad n \geq 1,$$

where the sum runs over all ordered $n$-tuples in $G^n$. Define the differential operators $d_n$ and the augmentation map exactly as for the bar resolution.

Proposition 3.2. With these definitions, the sequence

$$\cdots \to X_2 \xrightarrow{d_2} X_1 \xrightarrow{d_1} X_0 \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0,$$

is a free $G$-resolution.

Sometimes this resolution is called the normalized bar resolution.

proof: See Theorem 10.117 in [R]. □

3.2.3. Homogeneous resolution

Let $X_0 = R$, so $X_0$ is a finite free (left) $R$-module whose basis has only 1 element. For $n > 0$, let $X_n$ denote the $\mathbb{Z}$-module generated by all $(n+1)$-tuples $(g_0, \ldots, g_n)$. Make $X_n$ into a $G$-module by defining the action $g : X_n \to X_n$ by

$$g : (g_0, \ldots, g_n) \mapsto (gg_0, \ldots, gg_n), \quad g \in G.$$

Define the differential operators $\partial_n$ and the augmentation $\varepsilon$, as $G$-module maps, by

$$\varepsilon(g) = 1,$$
$$\partial_n(g_0, \ldots, g_n) = \sum_{i=0}^{n} (-1)^i (g_0, \ldots, g_{i-1}, \hat{g}_i, g_{i+1}, \ldots, g_n),$$

for $n \geq 1$, where $\hat{g}_i$ means that the entry $g_i$ is omitted.

Proposition 3.3. With these definitions, the sequence

$$\cdots \to X_2 \xrightarrow{\partial_2} X_1 \xrightarrow{\partial_1} X_0 \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0,$$

is a $G$-resolution.
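As a sanity check on the homogeneous differential, the short Python sketch below verifies $\partial\partial = 0$ and the $G$-equivariance of $\partial$ on basis tuples. The assumptions are illustrative only: $G$ is taken to be $\mathbb{Z}/3\mathbb{Z}$ written additively, $\partial$ is the alternating sum over all positions of a tuple, and the names `K`, `boundary`, `act` are hypothetical.

```python
from itertools import product

K = 3  # illustrative choice: G = Z/3Z written additively as {0, 1, 2}


def boundary(chain):
    """Alternating sum of the faces obtained by omitting one entry of each tuple."""
    out = {}
    for simplex, c in chain.items():
        for i in range(len(simplex)):
            face = simplex[:i] + simplex[i + 1:]
            value = out.get(face, 0) + (-1) ** i * c
            if value:
                out[face] = value
            else:
                out.pop(face, None)
    return out


def act(g, chain):
    """Diagonal action g.(g_0,...,g_n) = (g+g_0,...,g+g_n), extended linearly."""
    return {tuple((g + x) % K for x in simplex): c for simplex, c in chain.items()}


tuples = [t for n in (3, 4) for t in product(range(K), repeat=n)]
dd_zero = all(boundary(boundary({t: 1})) == {} for t in tuples)
equivariant = all(boundary(act(g, {t: 1})) == act(g, boundary({t: 1}))
                  for g in range(K) for t in tuples)
print(dd_zero, equivariant)
```

Equivariance is what makes each $\partial_n$ a $G$-module map, and it is immediate here because the action is applied entry by entry.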

Sometimes this resolution is called the homogeneous resolution. Of the three resolutions presented here, this one is the most straightforward to deal with.

proof: See Lemma 10.114, Proposition 10.115, and Proposition 10.116 in [R]. □

4. Definition of $H^n(G, A)$

For convenience, we briefly recall the definition of $\mathrm{Ext}^n$. Let $A$ be a left $R$-module, where $R = \mathbb{Z}[G]$, and let $(X_i)$ be a $G$-resolution of $\mathbb{Z}$. We define

$$\mathrm{Ext}^n_{\mathbb{Z}[G]}(\mathbb{Z}, A) = \mathrm{Kernel}(d^*_{n+1})/\mathrm{Image}(d^*_n),$$

where

$$d^*_n : \mathrm{Hom}_G(X_{n-1}, A) \to \mathrm{Hom}_G(X_n, A),$$

is defined by sending $f : X_{n-1} \to A$ to $f d_n : X_n \to A$. It is known that this is, up to isomorphism, independent of the resolution chosen. Recall $\mathrm{Ext}^*_{\mathbb{Z}[G]}(\mathbb{Z}, A)$ are the right-derived functors of the left-exact functor $A \mapsto A^G = \mathrm{Hom}_G(\mathbb{Z}, A)$ from the category of $G$-modules to the category of abelian groups. We define

$$H^n(G, A) = \mathrm{Ext}^n_{\mathbb{Z}[G]}(\mathbb{Z}, A). \qquad (8)$$

When we wish to emphasize the dependence on the resolution chosen, we write $H^n(G, A, X_*)$. For example, let $X_*$ denote the bar resolution in §3.2.1 above. Call $C^n = C^n(G, A) = \mathrm{Hom}_G(X_n, A)$ the group of $n$-cochains of $G$ in $A$, $Z^n = Z^n(G, A) = C^n \cap \mathrm{Kernel}(\partial)$ the group of $n$-cocycles, and $B^n = B^n(G, A) = \partial(C^{n-1})$ the group of $n$-coboundaries. We call $H^n(G, A) = Z^n/B^n$ the $n$th cohomology group of $G$ in $A$. This is an abelian group. We can also define the cohomology group using some other resolution, the normalized bar resolution or the homogeneous resolution for example. If we wish to express the dependence on the resolution $X_*$ used, we write $H^n(G, A, X_*)$. Later we shall see that, up to isomorphism, this abelian group is independent of the resolution.

The group $H_2(G, \mathbb{Z})$ (which is isomorphic to the algebraic dual group of $H^2(G, \mathbb{C}^\times)$) is sometimes called the Schur multiplier of $G$. Here $\mathbb{C}$ denotes the field of complex numbers.

We say that the group $G$ has cohomological dimension $n$, written $\mathrm{cd}(G) = n$, if $H^{n+1}(H, A) = 0$ for all $G$-modules $A$ and all subgroups $H$ of $G$, but $H^n(H, A) \neq 0$ for some such $A$ and $H$.

Remark 4.1.
• If $\mathrm{cd}(G) < \infty$ then $G$ is torsion-free$^d$.
• If $G$ is a free abelian group of finite rank then $\mathrm{cd}(G) = \mathrm{rank}(G)$.
• If $\mathrm{cd}(G) = 1$ then $G$ is free. This is a result of Stallings and Swan (see for example [R], page 885).

4.1. Computations

We briefly discuss computer programs which compute cohomology and some examples of known computations.

4.1.1. Computer computations of cohomology

GAP [Gap] can compute some cohomology groups$^e$. All the SAGE commands which compute group homology or cohomology require that the package HAP be loaded. You can do this on the command line from the main SAGE directory by typing$^f$

sage -i gap_packages-4.4.10_3.spkg

Example 4.1. This example uses SAGE, which wraps several of the HAP functions.

------------------------------ SAGE ------------------------------
sage: G = AlternatingGroup(5)
sage: G.cohomology(1,7)
Trivial Abelian Group
sage: G.cohomology(2,7)
Trivial Abelian Group
------------------------------------------------------------------

$^d$ This follows from the fact that if $G$ is a cyclic group then $H^n(G, \mathbb{Z}) \neq 0$, discussed below.
$^e$ See §37.22 of the GAP manual, M. Bishop's package CRIME for cohomology of $p$-groups, G. Ellis' package HAP for group homology and cohomology of finite or (certain) infinite groups, and M. Roder's HAPCryst package (an add-on to the HAP package). SAGE [St] computes cohomology via its GAP interface.
$^f$ This is the current package name - change 4.4.10_3 to whatever the latest version is on http://www.sagemath.org/packages/optional/ at the time you read this. Also, this command assumes you are using SAGE on a machine with an internet connection.

4.1.2. Examples

Some example computations of a more theoretical nature.

(1) $H^0(G, A) = A^G$. This is by definition.

(2) Let $L/K$ denote a Galois extension with finite Galois group $G$. We have $H^1(G, L^\times) = 1$. This is often called Hilbert's Theorem 90. See Theorem 1.5.4 in [W] or Proposition 2 in §X.1 of [S].

(3) Let $G$ be a finite cyclic group and $A$ a trivial torsion-free $G$-module. Then $H^1(G, A) = 0$. This is a consequence of properties given in the next section.

(4) If $G$ is a finite cyclic group of order $m$ and $A$ is a trivial $G$-module then

This is a consequence of properties given below. For example, $H^2(GF(q)^\times, \mathbb{C}) = 0$.

(5) If $|G| = m$, $rA = 0$ and $\gcd(r, m) = 1$, then $H^n(G, A) = 0$, for all $n \geq 1$. This is Corollary 3.1.7 in [W]. For example, $H^1(A_5, \mathbb{Z}/7\mathbb{Z}) = 0$.

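5. Definition of $H_n(G, A)$

We say $A$ is projective if the functor $B \mapsto \mathrm{Hom}_G(A, B)$ (from the category of $G$-modules to the category of abelian groups) is exact. Recall, if

$$P_{\mathbb{Z}} = \cdots \to P_2 \xrightarrow{d_2} P_1 \xrightarrow{d_1} P_0 \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0$$

is a projective resolution of $\mathbb{Z}$ then

$$\mathrm{Tor}_n^{\mathbb{Z}[G]}(\mathbb{Z}, A) = \mathrm{Kernel}(d_n \otimes 1_A)/\mathrm{Image}(d_{n+1} \otimes 1_A), \qquad (9)$$

where $d_n \otimes 1_A : P_n \otimes_{\mathbb{Z}[G]} A \to P_{n-1} \otimes_{\mathbb{Z}[G]} A$.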


It is known that this is, up to isomorphism, independent of the resolution chosen. Recall $\mathrm{Tor}_*^{\mathbb{Z}[G]}(\mathbb{Z}, A)$ are the left-derived functors of the right-exact functor $A \mapsto A_G = \mathbb{Z} \otimes_{\mathbb{Z}[G]} A$ from the category of $G$-modules to the category of abelian groups. We define

$$H_n(G, A) = \mathrm{Tor}_n^{\mathbb{Z}[G]}(\mathbb{Z}, A). \qquad (10)$$

When we wish to emphasize the dependence on the resolution, we write $H_n(G, A, P_{\mathbb{Z}})$.

Remark 5.1. If $G$ is a $p$-group, then using the command ProjectiveResolution in GAP's CRIME package, one can easily compute the minimal projective resolution of a $G$-module, which can be either trivial or given as a MeatAxe$^g$ module.

Since we can identify the functor $A \mapsto A_G$ with $A \mapsto A \otimes_{\mathbb{Z}[G]} \mathbb{Z}$ (where $\mathbb{Z}$ is considered as a trivial $\mathbb{Z}[G]$-module), the following is another way to formulate this definition. If $\mathbb{Z}$ is considered as a trivial $\mathbb{Z}[G]$-module, then a free $\mathbb{Z}[G]$-resolution of $\mathbb{Z}$ is a sequence of $\mathbb{Z}[G]$-module homomorphisms

$$\cdots \to M_n \to M_{n-1} \to \cdots \to M_1 \to M_0$$

satisfying:

• (Freeness) Each $M_n$ is a free $\mathbb{Z}[G]$-module.
• (Exactness) The image of $M_{n+1} \to M_n$ equals the kernel of $M_n \to M_{n-1}$ for all $n > 0$.
• (Augmentation) The cokernel of $M_1 \to M_0$ is isomorphic to the trivial $\mathbb{Z}[G]$-module $\mathbb{Z}$.

The maps $M_n \to M_{n-1}$ are the boundary homomorphisms of the resolution. Setting $TM_n$ equal to the abelian group $M_n/G$ obtained from $M_n$ by killing the $G$-action, we get an induced sequence of abelian group homomorphisms

$$\cdots \to TM_n \to TM_{n-1} \to \cdots \to TM_1 \to TM_0.$$

This sequence will generally not satisfy the above exactness condition, and one defines the integral homology of $G$ to be

$$H_n(G, \mathbb{Z}) = \mathrm{Kernel}(TM_n \to TM_{n-1})/\mathrm{Image}(TM_{n+1} \to TM_n)$$

for all $n > 0$.

$^g$ See for example http://www.math.rwth-aachen.de/~MTX/.

5.1. Computations We briefly discuss computer programs which compute homology and some examples of known computations.

5.1.1. Computer computations of homology

Example 5.1. GAP will compute the Schur multiplier $H_2(G, \mathbb{Z})$ using the AbelianInvariantsMultiplier command. To find $H_2(A_5, \mathbb{Z})$, where $A_5$ is the alternating group on 5 letters, type

------------------------------ GAP ------------------------------
gap> A5:=AlternatingGroup(5);
Alt( [ 1 .. 5 ] )
gap> AbelianInvariantsMultiplier(A5);
[ 2 ]
-----------------------------------------------------------------

So, $H_2(A_5, \mathbb{Z}) \cong \mathbb{Z}/2\mathbb{Z}$. Here is the same computation in SAGE:

------------------------------ SAGE ------------------------------
sage: G = AlternatingGroup(5)
sage: G.homology(2)
Multiplicative Abelian Group isomorphic to C2
------------------------------------------------------------------

Example 5.2. The SAGE command poincare_series returns the Poincaré series of $G$ (mod $p$) ($p$ must be a prime). In other words, if you input a (finite) permutation group $G$, a prime $p$, and a positive integer $n$, poincare_series(G,p,n) returns a quotient of polynomials $f(x) = P(x)/Q(x)$ whose coefficient of $x^k$ equals the rank of the vector space $H_k(G, \mathbb{Z}/p\mathbb{Z})$, for all $k$ in the range $1 \leq k \leq n$.

------------------------------ SAGE ------------------------------
sage: G = SymmetricGroup(5)
sage: G.poincare_series(2,10)
(x^2 + 1)/(x^4 - x^3 - x + 1)
sage: G = SymmetricGroup(3)
sage: G.poincare_series(2,10)
1/(-x + 1)
------------------------------------------------------------------

This last one implies

$$\dim_{GF(2)} H_k(S_3, \mathbb{Z}/2\mathbb{Z}) = 1,$$

for $1 \leq k \leq 10$.
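To read the ranks off a Poincaré series by hand, one expands the rational function as a power series. The following Python sketch does this for the two quotients printed above; the function name `series_coefficients` and the coefficient-list encoding of polynomials are assumptions made for this illustration, not part of SAGE or HAP.

```python
def series_coefficients(numerator, denominator, n):
    """Coefficients c_0, ..., c_n of numerator/denominator as a power series.

    Polynomials are coefficient lists [a_0, a_1, ...] and denominator[0]
    must equal 1, so that c_k = a_k - sum_{j >= 1} b_j * c_{k-j}.
    """
    assert denominator[0] == 1
    coeffs = []
    for k in range(n + 1):
        c = numerator[k] if k < len(numerator) else 0
        for j in range(1, min(k, len(denominator) - 1) + 1):
            c -= denominator[j] * coeffs[k - j]
        coeffs.append(c)
    return coeffs


# (x^2 + 1)/(x^4 - x^3 - x + 1), the series printed for SymmetricGroup(5), p = 2
s5 = series_coefficients([1, 0, 1], [1, -1, 0, -1, 1], 10)
# 1/(-x + 1), the series printed for SymmetricGroup(3), p = 2
s3 = series_coefficients([1], [1, -1], 10)
print(s5)  # [1, 1, 2, 3, 3, 4, 5, 5, 6, 7, 7]
print(s3)
```

The entry in position $k$ of each list is the coefficient of $x^k$; for the second series every coefficient is 1, which is the statement made above for $S_3$.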

Example 5.3. Here are some more examples using SAGE's interface to HAP:

------------------------------ SAGE ------------------------------
sage: G = SymmetricGroup(5)
sage: G.homology(1)
Multiplicative Abelian Group isomorphic to C2
sage: G.homology(2)
Multiplicative Abelian Group isomorphic to C2
sage: G.homology(3)
Multiplicative Abelian Group isomorphic to C2 x C4 x C3
sage: G.homology(4)
Multiplicative Abelian Group isomorphic to C2
sage: G.homology(5)
Multiplicative Abelian Group isomorphic to C2 x C2 x C2
sage: G.homology(6)
Multiplicative Abelian Group isomorphic to C2 x C2
sage: G.homology(7)
Multiplicative Abelian Group isomorphic to C2 x C2 x C4 x C3 x C5
------------------------------------------------------------------

The last one means that

$$H_7(S_5, \mathbb{Z}) \cong (\mathbb{Z}/2\mathbb{Z})^2 \times (\mathbb{Z}/3\mathbb{Z}) \times (\mathbb{Z}/4\mathbb{Z}) \times (\mathbb{Z}/5\mathbb{Z}).$$

------------------------------ SAGE ------------------------------
sage: G = AlternatingGroup(5)
sage: G.homology(1)
Trivial Abelian Group
sage: G.homology(1,7)
Trivial Abelian Group
sage: G.homology(2,7)
Trivial Abelian Group
------------------------------------------------------------------

5.1.2. Examples Some example computations of a more theoretical nature.


(1) If $A$ is a $G$-module then $\mathrm{Tor}_0^{\mathbb{Z}[G]}(\mathbb{Z}, A) = H_0(G, A) = A_G \cong A/DA$.

proof: We need some lemmas. Let $\varepsilon : \mathbb{Z}[G] \to \mathbb{Z}$ be the augmentation map. This is a ring homomorphism (but not a $G$-module homomorphism). Let $D = \mathrm{Kernel}(\varepsilon)$ denote its kernel, the augmentation ideal. This is a $G$-module.

Lemma 5.1. As an abelian group, $D$ is free abelian generated by $G - 1 = \{g - 1 \mid g \in G\}$. We write this as $D = \mathbb{Z}(G - 1)$.

proof of lemma: If $d \in D$ then $d = \sum_{g \in G} m_g g$, where $m_g \in \mathbb{Z}$ and $\sum_{g \in G} m_g = 0$. Thus, $d = \sum_{g \in G} m_g (g - 1)$, so $D \subset \mathbb{Z}(G - 1)$. To show $D$ is free: If $\sum_{g \in G} m_g (g - 1) = 0$ then $\sum_{g \in G} m_g g - \sum_{g \in G} m_g = 0$ in $\mathbb{Z}[G]$. But $\mathbb{Z}[G]$ is a free abelian group with basis $G$, so $m_g = 0$ for all $g \in G$, $g \neq 1$. □

Lemma 5.2. $\mathbb{Z} \otimes_{\mathbb{Z}[G]} A = A/DA$, where $DA$ is generated by elements of the form $ga - a$, $g \in G$ and $a \in A$.

Recall $A_G$ denotes the largest quotient of $A$ on which $G$ acts trivially$^h$.

proof of lemma: Consider the $G$-module map, $A \to \mathbb{Z} \otimes_{\mathbb{Z}[G]} A$, given by $a \mapsto 1 \otimes a$. Since $\mathbb{Z} \otimes_{\mathbb{Z}[G]} A$ is a trivial $G$-module, it must factor through $A_G$. The previous lemma implies $A_G \cong A/DA$. (In fact, the quotient map $q : A \to A_G$ satisfies $q(ga - a) = 0$ for all $g \in G$ and $a \in A$, so $DA \subset \mathrm{Kernel}(q)$. By maximality of $A_G$, $DA = \mathrm{Kernel}(q)$. QED) So, we have maps $A \to A_G \to \mathbb{Z} \otimes_{\mathbb{Z}[G]} A$. By the definition of tensor products, the map $\mathbb{Z} \times A \to A_G$, $m \times a \mapsto ma + DA$, corresponds to a map $\mathbb{Z} \otimes_{\mathbb{Z}[G]} A \to A_G$ for which the composition $A_G \to \mathbb{Z} \otimes_{\mathbb{Z}[G]} A \to A_G$ is the identity. This forces $A_G \cong \mathbb{Z} \otimes_{\mathbb{Z}[G]} A$. □

See also #11 in §6.

(2) If $G$ is a finite group then $H_0(G, \mathbb{Z}) = \mathbb{Z}$. This is a special case of the example above (taking $A = \mathbb{Z}$, as a trivial $G$-module).

(3) $H_1(G, \mathbb{Z}) \cong G/[G, G]$, where $[G, G]$ is the commutator subgroup of $G$. This is Proposition 10.110 in [R], §10.7.

proof: First, we claim: $D/D^2 \cong G/[G, G]$, where $D$ is as in Lemma 5.1. To prove this, define $\theta : G \to D/D^2$ by $g \mapsto (g - 1) + D^2$. Since $gh - 1 - (g - 1) - (h - 1) = (g - 1)(h - 1)$, it follows that $\theta(gh) = \theta(g)\theta(h)$, so $\theta$ is a homomorphism. Since $D/D^2$ is abelian and $G/[G, G]$ is the maximal abelian quotient of $G$, we must have $[G, G] \subset \mathrm{Kernel}(\theta)$. Therefore, $\theta$ factors through $\theta' : G/[G, G] \to D/D^2$, $g[G, G] \mapsto (g - 1) + D^2$. Now, we construct an inverse. Define $\tau : D \to G/[G, G]$ by $g - 1 \mapsto g[G, G]$. Since $\tau(g - 1 + h - 1) = g[G, G] \cdot h[G, G] = gh[G, G]$, it is not hard to see that this is a homomorphism. We would be essentially done (with the construction of the inverse of $\theta'$, hence the proof of the claim) if we knew $D^2 \subset \mathrm{Kernel}(\tau)$. (The inverse would be the composition of the quotient $D/D^2 \to D/\mathrm{Kernel}(\tau)$ with the map induced from $\tau$, $D/\mathrm{Kernel}(\tau) \to G/[G, G]$.) This follows from the fact that any $x \in D^2$ can be written as $x = (\sum_g m_g (g - 1))(\sum_h m'_h (h - 1)) = \sum_{g,h} m_g m'_h (g - 1)(h - 1)$, so $\tau(x) = \prod_{g,h} (ghg^{-1}h^{-1})^{m_g m'_h} [G, G] = [G, G]$. QED (claim)

Next, we show $H_1(G, \mathbb{Z}) \cong D/D^2$. From the short exact sequence

$$0 \to D \to \mathbb{Z}[G] \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0,$$

we obtain the long exact sequence of homology

$$\cdots \to H_1(G, D) \to H_1(G, \mathbb{Z}[G]) \to H_1(G, \mathbb{Z}) \xrightarrow{\ \partial\ } H_0(G, D) \xrightarrow{\ f\ } H_0(G, \mathbb{Z}[G]) \xrightarrow{\ \varepsilon_*\ } H_0(G, \mathbb{Z}) \to 0. \qquad (11)$$

Since $\mathbb{Z}[G]$ is a free $\mathbb{Z}[G]$-module, $H_1(G, \mathbb{Z}[G]) = 0$. Therefore $\partial$ is injective. By item #1 above (i.e., $H_0(G, A) \cong A/DA \cong A_G$), we have $H_0(G, \mathbb{Z}) \cong \mathbb{Z}_G = \mathbb{Z}$ and $H_0(G, \mathbb{Z}[G]) \cong \mathbb{Z}[G]/D \cong \mathbb{Z}$. By (11), $\varepsilon_*$ is surjective. Combining the last two statements, we find $\mathbb{Z}/\mathrm{Kernel}(\varepsilon_*) \cong \mathbb{Z}$. This forces $\varepsilon_*$ to be injective. This, and (11), together imply $f$ must be 0. Since this forces $\partial$ to be an isomorphism onto $H_0(G, D) \cong D/D^2$, we are done. □

(4) Let $G = F/R$ be a presentation of $G$, where $F$ is a free group and $R$ is a normal subgroup of relations. Hopf's formula states: $H_2(G, \mathbb{Z}) \cong (R \cap [F, F])/[F, R]$, where $[F, R]$ is the subgroup generated by the commutators $frf^{-1}r^{-1}$ with $f \in F$ and $r \in R$. See [R], §10.7. The group $H_2(G, \mathbb{Z})$ is sometimes called the Schur multiplier of $G$.

$^h$ Implicit in the words "largest quotient" is a universal property which we leave to the reader to formulate precisely.
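The isomorphism $H_1(G, \mathbb{Z}) \cong G/[G, G]$ can be compared with the machine output quoted in Examples 5.1-5.3 by computing the order of the abelianization directly. The Python sketch below does this by brute force for $S_5$ and $A_5$; it is only an illustration, and all of the helper names (`compose`, `inverse`, `is_even`, `generated_subgroup`, `abelianization_order`) are hypothetical. Permutations are stored as tuples of images.

```python
from itertools import permutations


def compose(p, q):
    """The permutation 'p after q', with permutations stored as tuples of images."""
    return tuple(p[i] for i in q)


def inverse(p):
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)


def is_even(p):
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return inversions % 2 == 0


def generated_subgroup(generators, n):
    """Closure under right multiplication; in a finite group this is the subgroup."""
    identity = tuple(range(n))
    seen, frontier = {identity}, [identity]
    while frontier:
        x = frontier.pop()
        for g in generators:
            y = compose(x, g)
            if y not in seen:
                seen.add(y)
                frontier.append(y)
    return seen


def abelianization_order(group, n):
    """Order of G/[G,G], where [G,G] is generated by all g h g^-1 h^-1."""
    commutators = {compose(compose(g, h), compose(inverse(g), inverse(h)))
                   for g in group for h in group}
    return len(group) // len(generated_subgroup(commutators, n))


S5 = list(permutations(range(5)))
A5 = [p for p in S5 if is_even(p)]
print(abelianization_order(S5, 5), abelianization_order(A5, 5))
```

The two orders are consistent with the SAGE output above, where G.homology(1) is C2 for SymmetricGroup(5) and trivial for AlternatingGroup(5).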

6. Basic properties of $H^n(G, A)$, $H_n(G, A)$

Let $R$ be a (possibly non-commutative) ring and $A$ be an $R$-module. We say $A$ is injective if the functor $B \mapsto \mathrm{Hom}_G(B, A)$ (from the category of $G$-modules to the category of abelian groups) is exact. (Recall $A$ is projective if the functor $B \mapsto \mathrm{Hom}_G(A, B)$ is exact.) We say $A$ is co-induced if it has the form $\mathrm{Hom}_{\mathbb{Z}}(R, B)$ for some abelian group $B$. We say $A$ is relatively injective if it is a direct factor of a co-induced $R$-module. We say $A$ is relatively projective if

$$\pi : \mathbb{Z}[G] \otimes_{\mathbb{Z}} A \to A, \quad x \otimes a \mapsto xa,$$

maps a direct factor of $\mathbb{Z}[G] \otimes_{\mathbb{Z}} A$ isomorphically onto $A$. These are the $G$-modules $A$ which are isomorphic to a direct factor of the induced module $\mathbb{Z}[G] \otimes_{\mathbb{Z}} A$. When $G$ is finite, the notions of relatively injective and relatively projective coincide$^i$.

(1) The definition of $H^n(G, A)$ does not depend on the $G$-resolution $X_*$ of $\mathbb{Z}$ used.

(2) If $A$ is a projective $\mathbb{Z}[G]$-module then $H_n(G, A) = 0$, for all $n \geq 1$. This follows immediately from the definitions.

(3) If $A$ is an injective $\mathbb{Z}[G]$-module then $H^n(G, A) = 0$, for all $n \geq 1$. See also [S], §VII.2.

(4) If $A$ is a relatively injective $\mathbb{Z}[G]$-module then $H^n(G, A) = 0$, for all $n \geq 1$. This is Proposition 1 in [S], §VII.2.

(5) If $A$ is a relatively projective $\mathbb{Z}[G]$-module then $H_n(G, A) = 0$, for all $n \geq 1$. This is Proposition 2 in [S], §VII.4.

(6) If $A = A' \oplus A''$ then $H_n(G, A) = H_n(G, A') \oplus H_n(G, A'')$, for all $n \geq 0$. More generally, if $I$ is any indexing family and $A = \oplus_{i \in I} A_i$ then $H_n(G, A) = \oplus_{i \in I} H_n(G, A_i)$, for all $n \geq 0$. This follows from Proposition 10.81 in §10.6 of Rotman [R].

(7) If

$$0 \to A' \to A \to A'' \to 0$$

is an exact sequence of $G$-modules then we have a long exact sequence of cohomology (1). See [S], §VII.2, and properties of the Ext functor [R], §10.6.

(8) $A \mapsto H^n(G, A)$ is the higher right derived functor associated to $A \mapsto A^G = \mathrm{Hom}_G(\mathbb{Z}, A)$ from the category of $G$-modules to the category of abelian groups. This is by definition. See [S], §VII.2, or [R], §10.7.

$^i$ These notions were introduced by Hochschild [Ho].


(9) If

$$0 \to A' \to A \to A'' \to 0$$

is an exact sequence of $G$-modules then we have a long exact sequence of homology (2). In the case of a finite group, see [S], §VIII.1. In general, see [S], §VII.4, and properties of the Tor functor in [R], §10.6.

(10) $A \mapsto H_n(G, A)$ is the higher left derived functor associated to $A \mapsto A_G = \mathbb{Z} \otimes_{\mathbb{Z}[G]} A$ on the category of $G$-modules. This is by definition. See [S], §VII.4, or [R], §10.7.

(11) If $G$ is a finite cyclic group then

$$H_0(G, A) = A_G, \quad H_{2n-1}(G, A) = A^G/NA, \quad H_{2n}(G, A) = \mathrm{Kernel}(N)/DA,$$

for all $n \geq 1$. To prove this, we need a lemma.

Lemma 6.1. Let $G = \langle g \rangle$ be a cyclic group of order $k$. Let $M = g - 1$ and $N = 1 + g + g^2 + \cdots + g^{k-1}$. Then

$$\cdots \to \mathbb{Z}[G] \xrightarrow{\ M\ } \mathbb{Z}[G] \xrightarrow{\ N\ } \mathbb{Z}[G] \xrightarrow{\ M\ } \mathbb{Z}[G] \xrightarrow{\ \varepsilon\ } \mathbb{Z} \to 0,$$

is a free $G$-resolution.

proof of lemma: It is clearly free. Since $MN = NM = (g - 1)(1 + g + g^2 + \cdots + g^{k-1}) = g^k - 1 = 0$, it is a complex. It remains to prove exactness. Since $\mathrm{Kernel}(\varepsilon) = D = \mathrm{Image}(M)$, by Lemma 5.1, this stage is exact.

To show $\mathrm{Kernel}(M) = \mathrm{Image}(N)$, let $x = \sum_{j=0}^{k-1} m_j g^j \in \mathrm{Kernel}(M)$. Since $(g - 1)x = 0$, we must have $m_0 = m_1 = \cdots = m_{k-1}$. This forces $x = m_0 N \in \mathrm{Image}(N)$. Thus $\mathrm{Kernel}(M) \subset \mathrm{Image}(N)$. Clearly $MN = 0$ implies $\mathrm{Image}(N) \subset \mathrm{Kernel}(M)$, so $\mathrm{Kernel}(M) = \mathrm{Image}(N)$.

To show $\mathrm{Kernel}(N) = \mathrm{Image}(M)$, let $x = \sum_{j=0}^{k-1} m_j g^j \in \mathrm{Kernel}(N)$. Since $Nx = 0$, we have $0 = \varepsilon(Nx) = \varepsilon(N)\varepsilon(x) = k\varepsilon(x)$, so $\sum_{j=0}^{k-1} m_j = 0$. Observe that

$$x = m_0 \cdot 1 + m_1 g + m_2 g^2 + \cdots + m_{k-1} g^{k-1}$$
$$= (m_0 - m_0 g) + (m_0 + m_1) g + m_2 g^2 + \cdots + m_{k-1} g^{k-1}$$
$$= (m_0 - m_0 g) + (m_0 + m_1) g - (m_0 + m_1) g^2 + (m_0 + m_1 + m_2) g^2 - (m_0 + m_1 + m_2) g^3 + \cdots + (m_0 + \cdots + m_{k-1}) g^{k-1} - (m_0 + \cdots + m_{k-1}) g^k,$$

where the last two terms are actually 0. This implies $x = -M(m_0 + (m_0 + m_1) g + (m_0 + m_1 + m_2) g^2 + \cdots + (m_0 + \cdots + m_{k-1}) g^{k-1}) \in \mathrm{Image}(M)$. Thus $\mathrm{Kernel}(N) \subset \mathrm{Image}(M)$. Clearly $NM = 0$ implies $\mathrm{Image}(M) \subset \mathrm{Kernel}(N)$, so $\mathrm{Kernel}(N) = \mathrm{Image}(M)$. This proves exactness at every stage. □

Now we can prove the claimed property. By property 1 in §5.1.2, it suffices to assume $n > 0$. Tensor the complex in Lemma 6.1 on the right with $A$:

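$$\cdots \to \mathbb{Z}[G] \otimes_{\mathbb{Z}[G]} A \xrightarrow{M_*} \mathbb{Z}[G] \otimes_{\mathbb{Z}[G]} A \xrightarrow{N_*} \mathbb{Z}[G] \otimes_{\mathbb{Z}[G]} A \xrightarrow{M_*} \mathbb{Z}[G] \otimes_{\mathbb{Z}[G]} A \xrightarrow{\varepsilon_*} \mathbb{Z} \otimes_{\mathbb{Z}[G]} A \to 0,$$

where the new maps are distinguished from the old maps by adding an asterisk. By definition, $\mathbb{Z}[G] \otimes_{\mathbb{Z}[G]} A \cong A$, and by property 1 in §5.1.2, $\mathbb{Z} \otimes_{\mathbb{Z}[G]} A \cong A/DA$. The above sequence becomes

$$\cdots \to A \xrightarrow{M_*} A \xrightarrow{N_*} A \xrightarrow{M_*} A \to A/DA \to 0.$$

This implies, by definition of Tor,

$$\mathrm{Tor}_{2n-1}^{\mathbb{Z}[G]}(\mathbb{Z}, A) = \mathrm{Kernel}(M_*)/\mathrm{Image}(N_*) = A^G/NA,$$

and

$$\mathrm{Tor}_{2n}^{\mathbb{Z}[G]}(\mathbb{Z}, A) = \mathrm{Kernel}(N_*)/\mathrm{Image}(M_*) = A[N]/DA.$$

See also [S], §VIII.4.1 and the Corollary in §VIII.4.

(12) The group $H^2(G, A)$ classifies group extensions of $A$ by $G$. This is Theorem 5.1.2 in [W]. See also §10.2 in [R].

(13) If $G$ is a finite group of order $m = |G|$ then $mH^n(G, A) = 0$, for all $n \geq 1$. This is Proposition 10.119 in [R].

(14) If $G$ is a finite group and $A$ is a finitely-generated $G$-module then $H^n(G, A)$ is finite, for all $n \geq 1$. This is Proposition 3.1.9 in [W] and Corollary 10.120 in [R].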
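The two computations at the heart of Lemma 6.1, namely $MN = 0$ and the explicit preimage under $M$ of an element of $\mathrm{Kernel}(N)$, are easy to check numerically. In the Python sketch below an element $\sum_j m_j g^j$ of $\mathbb{Z}[G]$, with $G$ cyclic of order $k$, is stored as the list $[m_0, \ldots, m_{k-1}]$; the function names and the sample element `x` are assumptions chosen for illustration.

```python
def multiply(x, y):
    """Product in Z[C_k]; x[i] is the coefficient of g^i."""
    k = len(x)
    out = [0] * k
    for i, a in enumerate(x):
        for j, b in enumerate(y):
            out[(i + j) % k] += a * b
    return out


def M_and_N(k):
    """M = g - 1 and N = 1 + g + ... + g^(k-1), for k >= 2."""
    M = [-1, 1] + [0] * (k - 2)
    N = [1] * k
    return M, N


def preimage_under_M(x):
    """For x with coefficient sum 0, return y with M*y = x.

    Following the proof, y = -(m_0 + (m_0+m_1) g + ... ), built from partial sums.
    """
    partial, total = [], 0
    for m in x:
        total += m
        partial.append(-total)
    return partial


k = 4
M, N = M_and_N(k)
x = [3, -1, -2, 0]          # coefficient sum 0, so N*x = 0
y = preimage_under_M(x)
print(multiply(M, N), multiply(N, x), multiply(M, y))
```

For a trivial module such as $A = \mathbb{Z}$, the map $N_*$ is multiplication by $k$ and $M_*$ is zero, so the formulas in (11) give $H_{2n-1}(G, \mathbb{Z}) = \mathbb{Z}/k\mathbb{Z}$ and $H_{2n}(G, \mathbb{Z}) = 0$ for $n \geq 1$.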


(15) The group $H^1(G, A)$ constructed using resolutions is the same as the group constructed using 1-cocycles. The group $H^2(G, A)$ constructed using resolutions is the same as the group constructed using 2-cocycles. This is Corollary 10.118 in [R].

(16) If $G$ is a finite cyclic group then

$$H^0(G, A) = A^G, \quad H^{2n-1}(G, A) = \mathrm{Kernel}(N)/DA, \quad H^{2n}(G, A) = A^G/NA,$$

for all $n \geq 1$. Here $N : A \to A$ is the norm map $Na = \sum_{g \in G} ga$ and $DA$ is the augmentation ideal defined above (generated by elements of the form $ga - a$).

proof: The case $n = 0$: By definition, $H^0(G, A) = \mathrm{Ext}^0_{\mathbb{Z}[G]}(\mathbb{Z}, A) = \mathrm{Hom}_G(\mathbb{Z}, A)$. Define $\tau : \mathrm{Hom}_G(\mathbb{Z}, A) \to A^G$ by sending $f \mapsto f(1)$. It is easy to see that this is well-defined and, in fact, injective. For each $a \in A^G$, define $f = f_a \in \mathrm{Hom}_G(\mathbb{Z}, A)$ by $f(m) = ma$. This shows $\tau$ is surjective as well, so case $n = 0$ is proven.

Case $n > 0$: Applying the functor $\mathrm{Hom}_G(*, A)$ to the $G$-resolution in Lemma 6.1 to get ...

... $(A_\lambda \mid \lambda \in \Lambda)$ of normal subgroups of $A$ is termed a solvable filtration of $A$ if $A/A_\lambda$ is solvable for every $\lambda \in \Lambda$ and $\bigcap_{\lambda \in \Lambda} A_\lambda = \{1\}$. We shall say that $H$ is solvably separable in $A$ if $\bigcap_{\lambda=1}^{\infty} H A_\lambda = H$, that is if and only if $\bigcap_{\lambda=1}^{\infty} A_\lambda \subset H$. Now let $H \leq A$, then $(A_\lambda \mid \lambda \in \Lambda)$ is called an $H$-filtration of $A$ if $\bigcap_{\lambda \in \Lambda} H A_\lambda = H$. Let $\phi : H \to K$ be an isomorphism between subgroups $H$ of $A$ and $K$ of $B$. Two equally indexed filtrations $(A_\lambda \mid \lambda \in \Lambda)$ and $(B_\lambda \mid \lambda \in \Lambda)$ of $A$ and $B$ respectively are termed $(H, K, \phi)$-compatible if $(A_\lambda \cap H)\phi = B_\lambda \cap K$ ($\forall \lambda \in \Lambda$).

The following Proposition of Baumslag [2] will help us to prove one of the results: let $(A_\lambda \mid \lambda \in \Lambda)$, $(B_\lambda \mid \lambda \in \Lambda)$ be solvable $(H, K, \phi)$-compatible filtrations of the residually solvable groups $A$ and $B$ respectively. Suppose


(A_λ | λ ∈ Λ) is an H-filtration of A and (B_λ | λ ∈ Λ) is a K-filtration of B. If, for every λ ∈ Λ,

{A/A_λ * B/B_λ; HA_λ/A_λ = KB_λ/B_λ}

is residually solvable, then so is G = {A * B; H = K}.

3. Doubles of residually solvable groups

In this section we prove the theorems concerning the doubles of residually solvable groups.

3.1. Meta-residual-solvability

Let X be a group property. Then a group G is meta-X if there exist A and Q of property X and a short exact sequence 1 → A → G → Q → 1. Here we prove that in general the amalgamated products of doubles of residually solvable groups are meta-residually-solvable.

Proposition 3.1. Let A be a residually solvable group, C be a subgroup of A, and a ↦ ā be an isomorphic mapping of A onto Ā. Then the generalized free product of A and Ā amalgamating C with C̄,

G = {A * Ā; C = C̄},

is an extension of a free group by a residually solvable group.

Proof. Let φ : G → A; then K = ker φ = gp(a ā^{-1} | a ∈ A). K is free by the theorem of Hanna Neumann mentioned in subsection 2.1, since A ∩ K = {1} = Ā ∩ ker φ. Therefore G is an extension of a free group by a residually solvable group. □

Corollary 3.1. G is meta-residually-solvable.

Proof. Since free groups are residually solvable, by Proposition 3.1, G is residually solvable-by-residually solvable. That is to say, G is meta-residually-solvable. □

3.2. Effect of solvable separability on the amalgamated subgroup and residual solvability; Proof of Theorem 1.1

We now prove that if we impose the solvable separability condition on the amalgamated subgroup of doubles of residually solvable groups then the resulting group is residually solvable.


Proof. Assuming C is solvably separable in A and A is residually solvable, we want to show that G is residually solvable. That is, we must show that for every non-trivial element d ∈ G (d ≠ 1), there exists a homomorphism φ from G onto a solvable group S, φ : G → S, such that dφ ≠ 1. We consider two cases:

Case 1: Let 1 ≠ d ∈ A. There exists an epimorphism φ from G onto A, so that dφ = d. Since A is residually solvable, there exists λ ∈ N such that d ∉ δ_λ A, where δ_λ A is the λ-th derived group of A. Now put S = A/δ_λ A, a solvable group of derived length at most λ. Note that the canonical homomorphism θ from A onto S maps d onto a non-trivial element in S. Now consider the composition of these two epimorphisms, θ ∘ φ, which maps G onto S. The image of d in S is non-trivial:

θ ∘ φ(d) = dθ = d δ_λ A ≠_S 1.

Case 2: Let 1 ≠ d ∉ A but d ∈ G. Now d can be expressed as follows:

d = a_1 b_1 a_2 b_2 ··· a_n b_n (a_i ∈ A − C, b_i ∈ Ā − C̄).

Since the equally indexed filtrations {δ_λ A}_{λ∈N} and {δ_λ Ā}_{λ∈N} of A and Ā are compatible, we can form G_λ, in the notation of Baumslag's Proposition:

G_λ = {A/δ_λ A * Ā/δ_λ Ā; C δ_λ A/δ_λ A = C̄ δ_λ Ā/δ_λ Ā}.

Note that G_λ is residually solvable (by mapping it to one of the factors and noting that the kernel of the map is free). Consider the canonical homomorphism θ from G onto G_λ. Since C is solvably separable in A, i.e.

∩_{λ∈N} C δ_λ A = C,

λ ∈ N can be so chosen that

a_i ∉ C δ_λ A, b_i ∉ C̄ δ_λ Ā (for i = 1, ..., n).

Hence

a_1 δ_λ A b_1 δ_λ Ā ··· a_n δ_λ A b_n δ_λ Ā ≠_{G_λ} 1.

This completes the proof of the theorem, using Baumslag's Proposition [2], which we recalled in Section 2.3. □

Note also that if G is residually solvable then C is solvably separable in A, since solvable separability is equivalent to ∩_{λ=1}^∞ δ_λ A ⊂ C.


3.3. Solvable separability is a sufficient condition for residual solvability; Proof of Theorem 1.2

Note that the condition of solvable separability of the amalgamated subgroup in the factors, in the case of doubles, is necessary. The following theorem shows that the amalgamated product of doubles is not residually solvable where the factors are residually solvable groups.

Proof. D is meta-residually solvable by Corollary 3.1. By Lemma 2.1, there exists an epimorphism from D onto A. Let K be the kernel of this epimorphism. Since C is normal in A, by using Lemma 2.3

Now we want to show that D is not residually solvable. We proceed by contradiction. Suppose D is residually solvable. Let d be a non-trivial element in [K, D]. The existence of such an element is guaranteed by Lemma 2.2. Now assume μ is a homomorphism of D onto a solvable group S, so that dμ ≠_S 1. Since μ is an epimorphism, Cμ is a normal subgroup of S, and by (*),

If we can show that S = Cμ, then [Kμ, S] = 1, which implies that dμ =_S 1, a contradiction. We now need to show that S = Cμ. We have that D → S induces a homomorphism from D/C onto S/Cμ. Since A/C and Ā/C̄ each have a perfect subgroup, this induces a homomorphism from A/C to 1. So S/Cμ = 1 and hence Cμ = S. □

References

1. G. Arzhantseva, P. de la Harpe and D. Kahrobaei, The true prosoluble completion of a group: Examples and open problems, Journal of Geometriae Dedicata, Springer Netherlands 124 (2007), 5-26.
2. G. Baumslag, On the Residual Finiteness of Generalized Free Products of Nilpotent Groups, Trans. Amer. Math. Soc. 106 (1963), 193-209.
3. G. Baumslag, Positive One-Relator Groups, Trans. Amer. Math. Soc. 156 (1971), 165-183.
4. G. Baumslag, A Survey of Groups with a Single Defining Relation, London Math. Soc. Lecture Note Ser. Camb. Univ. Press, Proc. of Groups St Andrews 121 (1986), 30-58.
5. G. Baumslag, Topics in Combinatorial Group Theory, Birkhauser-Verlag, 1993.
6. M. Burger and S. Mozes, Finitely Presented Simple Groups and Products of Trees, C.R. Acad. Sci. Paris Ser. 1 Math. 7 (1997), 747-752.
7. R. Camm, Simple Free Products, J. London Math. Soc. 28 (1953), 66-76.
8. K.W. Gruenberg, Residual Properties of Infinite Solvable Groups, Proc. London Math. Soc. 7 (1954), 29-62.
9. P. Hall, The Splitting Properties of Relatively Free Groups, Proc. London Math. Soc. 4 (1978), 343-356.
10. D. Kahrobaei, A Simple Proof of a Theorem of Karrass and Solitar, Contemp. Math. Soc. 372 (2005), 107-108.
11. D. Kahrobaei, On Residual Solvability of Generalized Free Products of Finitely Generated Nilpotent Groups, J. Group Theory (accepted) (2007).
12. D. Kahrobaei, Residual Solvability of Generalized Free Products, PhD Thesis, CUNY Graduate Center (1994).
13. P.H. Kropholler, Baumslag-Solitar Groups and Some Other Groups of Cohomological Dimension Two, Comment. Math. Helv. 65 (1990), 547-558.
14. B.H. Neumann, An Essay on Free Products of Groups with Amalgamation, Philos. Trans. Roy. Soc. London Ser. A 246 (1954), 503-554.
15. H. Neumann, Generalized Free Products with Amalgamated Subgroups. ii, Amer. J. Math. 71 (1949), 491-540.
16. P. Neumann, SQ-universality of Some Finitely Presented Groups, J. Aust. Math. Soc. (1973).
17. J.P. Serre, Trees, Springer-Verlag, (1980).

AN APPLICATION OF A GROUP OF OL'SHANSKII TO A QUESTION OF FINE ET AL.

Seymour Lipschutz

Department of Mathematics, Temple University, Philadelphia, Pennsylvania, 19122

Dennis Spellman

Department of Mathematics, Temple University, Philadelphia, Pennsylvania, 19122

Dedicated to Tony Gaglione on his Sixtieth birthday.

1. Introduction

The classical commensurator of a subgroup may be found in [KM] where it is called the virtual normalizer. Proposition 2.4 of [BBFGRS] asserts that any nontrivial finitely generated subgroup H ≠ {1} in a free group F has finite index in its commensurator. Following the above proposition the authors of [BBFGRS] define a group G satisfying the commensurator condition as one for which every finitely generated subgroup H ≠ {1} has finite index in its commensurator. The authors of [BBFGRS] asked for a finitely generated infinite group G with the commensurator property that is not hyperbolic. In [O] Ol'shanskii constructed a 2-generator infinite simple group with the property that every nontrivial proper subgroup is infinite cyclic. We show that any such group satisfies the commensurator condition but is not hyperbolic. In [BBFGRS] the authors define for finitely generated groups a condition called property R. We give a definition, which coincides with theirs in the case of finite generation, but which applies to arbitrary groups. We consider operators which preserve property R as well as those that do not; moreover, we reflect upon the relationships (or lack thereof) between property R and various other finiteness conditions. Finally we introduce a generalization of property R which we call weak property R or property WR. The paper is divided into six sections beginning with this Introduction (Section 1). In Section 2 we introduce the commensurator condition. In Section 3 we recall


various conditions on maximal abelian subgroups and apply these to argue that Ol'shanskii's group is an example of the type we seek. In Section 4 we study property R. In Section 5 we introduce property WR. In Section 6 we pose some questions.

2. The Commensurator Condition

In this section we introduce the commensurator of a subgroup and the commensurator condition.

Definition 2.1. Let G be a group and H a subgroup in G. The commensurator of H in G, denoted by comm_G(H), consists of the elements g ∈ G such that

|H : H ∩ H^g|
... in ⟨x_1, ..., x_k⟩. This contradicts the fact that G satisfies property R. The contradiction shows that [⟨Y⟩ : H] < ∞. Hence, G' satisfies property R and property R is preserved under homomorphic images.

Finally suppose that Λ is an upward directed set and (G_λ)_{λ∈Λ} is a family of property R groups indexed by Λ. Suppose that, for each λ, μ ∈ Λ with λ < μ there is a homomorphism ρ_{λ,μ} : G_λ → G_μ and suppose further that, for each λ, μ, ν ∈ Λ with λ < μ < ν, one has ρ_{λ,ν} = ρ_{λ,μ} ρ_{μ,ν} where we write our maps to the right of their arguments and compose accordingly. Let G be the direct limit determined from this data and, for each λ ∈ Λ, let ρ_λ : G_λ → G be the limit map. It will suffice to show that, if Γ = {γ_1, ..., γ_k} is any finite subset of G, then the subgroup of G generated by Γ satisfies property R. Fix such a subset Γ. Then there is λ ∈ Λ and g_1, ..., g_k ∈ G_λ such that γ_j = g_j ρ_λ, j = 1, ..., k. Now ⟨γ_1, ..., γ_k⟩ is a subgroup of the homomorphic image G_λ ρ_λ. Hence, ⟨γ_1, ..., γ_k⟩ satisfies property R. □

Let G be a finitely generated group. We wish to examine the relationship between property R and various other finiteness conditions for G. Specifically we consider (1) The maximum condition. (2) The minimum condition. (3) Residual finiteness. (4) Hopficity. The following will be useful for our purposes.

Lemma 4.1. Let G be an infinite group generated by finitely many torsion elements. Then G does not satisfy property R.

Proof. Suppose X = {x_1, ..., x_k} generates G and x_j^{n(j)} = 1 where each n(j) ≥ 2, j = 1, ..., k. Then H = {1} has infinite index in G yet x_j^{n(j)} ∈ H, j = 1, ..., k. □

For example, the infinite dihedral group presented as D = ⟨a, b; a^2 = 1, a^{-1}ba = b^{-1}⟩ is generated by {a, ab} and each of a and ab has order 2 in D. Thus D violates property R. In the same paper in which Ol'shanskii constructed the group in Section 3 he showed how to modify the construction to create a 2-generator infinite group in which every proper subgroup is finite cyclic. Since such a group clearly satisfies both the maximum and minimum conditions, neither one (nor both) implies property R, as a consequence of Lemma 4.1 applied to Ol'shanskii's modified construction. Conversely property R implies neither condition. The groups B_N of Example 4.1 violate the maximum condition as the normal closure of ⟨a⟩ is isomorphic to the additive group of the ring Z[1/N] and is not finitely generated as a group. The infinite cyclic group, for example, satisfies property R but violates the minimum condition. The infinite dihedral group D = ⟨a, b; a^2 = 1, a^{-1}ba = b^{-1}⟩ is easily seen to residually be in the family of finite dihedral groups D_n = ⟨a, b; a^2 = 1, b^n = 1, a^{-1}ba = b^{-1}⟩ of order 2n. Hence, D is a finitely generated residually finite group and thus also Hopfian. This shows that the Hopf property or even residual finiteness does not imply property R. On the other hand Ol'shanskii's group of Section 3 is not residually finite as the only finite image is the trivial group {1}. Ol'shanskii's group is, however, Hopfian as it is easy to see every simple group must be.
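The claim about the infinite dihedral group can be checked by hand in a concrete model. The following is a minimal sketch, assuming the standard realization of D as the affine maps x ↦ ±x + k of the integers; the pair encoding (eps, k) and the helper names mul and power are illustrative choices, not notation from the paper.

```python
# Hypothetical model: an element of the infinite dihedral group D is the
# affine map x -> eps*x + k on the integers, stored as the pair (eps, k)
# with eps = +1 or -1.

IDENTITY = (1, 0)


def mul(f, g):
    """Return the composite map x -> f(g(x))."""
    e1, k1 = f
    e2, k2 = g
    return (e1 * e2, e1 * k2 + k1)


def power(f, n):
    """Return f composed with itself n times (n >= 0)."""
    result = IDENTITY
    for _ in range(n):
        result = mul(result, f)
    return result


a = (-1, 0)      # the reflection x -> -x
b = (1, 1)       # the translation x -> x + 1
b_inv = (1, -1)  # the translation x -> x - 1
ab = mul(a, b)

# Both elements of the generating set {a, ab} are torsion elements of order 2.
assert a != IDENTITY and power(a, 2) == IDENTITY
assert ab != IDENTITY and power(ab, 2) == IDENTITY
# The defining relation a^{-1} b a = b^{-1} holds (a is its own inverse).
assert mul(a, mul(b, a)) == b_inv
# b = a * (ab), so {a, ab} generates the same group as {a, b}.
assert mul(a, ab) == b
# b^n is translation by n, so b has infinite order and D is infinite.
assert all(power(b, n) == (1, n) for n in range(1, 50))
```

With H = {1}, each generator in {a, ab} has a positive power in H while H has infinite index, which is exactly the situation described in the proof of Lemma 4.1.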

Proposition 4.3. Property R is not preserved in unrestricted direct products or even unrestricted direct powers.

Proof. Let N be the set of positive integers. Since the infinite dihedral group D is residually in the family {D_n : n ∈ N} of finite dihedral groups, D embeds in the unrestricted direct product ∏_{n∈N} D_n. If ∏_{n∈N} D_n satisfied property R then so would the subgroup D, a contradiction. Each D_n, being finite, satisfies property R. Let S_∞, the infinite symmetric group, be the group of all permutations of N which move only finitely many integers. For each n ∈ N fix an embedding of D_n into S_∞. Now S_∞ satisfies property R since it is the direct limit of the family {S_n : n ∈ N} of finite symmetric groups. But ∏_{n∈N} D_n embeds in the unrestricted direct power S_∞^N. Hence, S_∞^N violates property R. □

5. Weak Property R

Proposition 5.1. The following conditions on a group G are equivalent.
(WR1) If G_0 is any subgroup of G and G_0^* is any homomorphic image of G_0, then the set of torsion elements in G_0^* forms a locally finite subgroup of G_0^*.
(WR2) If X is any finite subset of G and N is any normal subgroup of ⟨X⟩, then N has finite index in ⟨X⟩ if and only if for each x ∈ X there is a positive integer n(x) such that x^{n(x)} ∈ N.

Note that the set of torsion elements in a property S group forms a locally finite subgroup. However, property S is not in general preserved in homomorphic images. This latter fact can be seen from the result that free groups satisfy property S.

Proof. (of Proposition 5.1)

WR1 ⟹ WR2

Assume G satisfies WR1. Let X = {x_1, ..., x_k} be a finite subset of G and let G_0 = ⟨X⟩. Let N be a normal subgroup of G_0. Suppose that for j = 1, ..., k there is a positive integer n(j) such that x_j^{n(j)} ∈ N. Then {Nx_1, ..., Nx_k} is contained in the set T of torsion elements of the homomorphic image G_0/N of G_0. Thus, G_0/N = ⟨Nx_1, ..., Nx_k⟩ is a finite group and so [⟨X⟩ : N] = [G_0 : N] < ∞. Hence G satisfies WR2.

WR2 ⟹ WR1


Suppose G satisfies WR2. Let G_0 be a subgroup of G and let φ : G_0 → G_0^* be an epimorphism. It will suffice to show that, if T is the set of torsion elements in G_0^* and (g_1, ..., g_k) ∈ T^k is a finite tuple of elements of T, then the subgroup ⟨g_1, ..., g_k⟩ of G_0^* is finite. For each j = 1, ..., k choose a preimage x_j ∈ G_0 of g_j under φ. Let H = ⟨x_1, ..., x_k⟩ ∩ Ker(φ). Now since each x_j maps into T there is a positive integer n(j) such that x_j^{n(j)} ∈ H, j = 1, ..., k. Furthermore, letting X = {x_1, ..., x_k}, H is normal in ⟨X⟩ since Ker(φ) is normal in G_0 and H = ⟨X⟩ ∩ Ker(φ). Since G satisfies WR2 we must have that H has finite index in ⟨X⟩. Then

⟨X⟩/H = ⟨X⟩/(⟨X⟩ ∩ Ker(φ)) ≅ ⟨X⟩Ker(φ)/Ker(φ) ≅ ⟨x_1φ, ..., x_kφ⟩ = ⟨g_1, ..., g_k⟩

is finite. Hence G satisfies WR1. □

Definition 5.1. A group satisfies weak property R or property WR provided it satisfies either one (and therefore both) of the conditions WR1 or WR2 of Proposition 5.1.

Definition 5.2. A group is a U-group if roots, when they exist, are unique.

Definition 5.3. A group is an FC-group provided every conjugacy class is finite.

Proposition 5.2. (B.H. Neumann [N]): A torsion free FC-group is abelian.

Proposition 5.3. Let G be a torsion free property WR group. Then G is a U-group.

We note that both property WR1 and the fact that torsion freeness implies uniqueness of roots within the category are well-known properties of nilpotent groups.

Proof. (of Proposition 5.3) Suppose that G is a torsion free property WR group. Suppose n is a positive integer and x, y ∈ G satisfy x^n = y^n. We must show that x = y. We may assume without loss of generality that G = ⟨x, y⟩ since property WR is clearly inherited by subgroups. Let x^n = z = y^n. Then z is central in G so ⟨z⟩ is normal in G. Moreover, since x^n = y^n = z and {x, y} generates G we must have [G : ⟨z⟩] < ∞ as G satisfies property WR. Now let w ∈ G be arbitrary. Then


from [G : ⟨z⟩] = [G : C_G(w)][C_G(w) : ⟨z⟩] we see that [G : C_G(w)] < ∞. It follows that G is an FC-group. By Proposition 5.2, G is abelian. But then, in the torsion free abelian group G, x^n = y^n implies (xy^{-1})^n = 1 which in turn implies x = y. □

We note that a slight variation of the proof shows that torsion free property S groups are U-groups.

6. Questions

Question 6.1. Must every finitely generated property R group be Hopfian?

Question 6.2. Are property R groups closed under finite direct products? Are they closed under finite direct powers?

Question 6.3. Does property WR imply property R?

Question 6.4. Must every torsion free property WR group embed in a property WR group admitting roots? What about torsion free property R groups? What about torsion free property S groups?

7. References

[BBFGRS] G. Baumslag, O. Bogopulski, B. Fine, A.M. Gaglione, G. Rosenberger and D. Spellman, On some finiteness properties in infinite groups, Alg. Colloquium, 15(1), 2007, 1-22.
[FR] B. Fine and G. Rosenberger, On restricted Gromov groups, Comm. In Alg., 20(8), 1992, 2171-2181.
[KM] I. Kapovich and A.G. Myasnikov, Stallings foldings, J. Alg., 248, 2002, 608-668.
[M] A.I. Malcev, Nilpotent torsion free groups, Izv. Akad. Nauk. SSSR, 67, 1949, 347-366.
[N] B.H. Neumann, Groups with finite classes of conjugate elements, Proc. London Math. Soc., 3, 1951, 178-187.
[O] A.Yu. Olshanskii, Infinite groups with cyclic subgroups, Soviet Math. Dokl., 20(2), 1979, 343-346.

Quotient Isomorphism Invariants of a Finitely Generated Coxeter Group

Michael Mihalik

Mathematics Department, Vanderbilt University, Nashville TN 37240, USA

John Ratcliffe

Mathematics Department, Vanderbilt University, Nashville TN 37240, USA

Steven Tschantz

Mathematics Department, Vanderbilt University, Nashville TN 37240, USA

1. Introduction

The isomorphism problem for finitely generated Coxeter groups is the problem of deciding if two finite Coxeter matrices define isomorphic Coxeter groups. Coxeter [4] solved this problem for finite irreducible Coxeter groups. Recently there has been considerable interest and activity on the isomorphism problem for arbitrary finitely generated Coxeter groups. In this paper we describe a family of isomorphism invariants of a finitely generated Coxeter group W. Each of these invariants is the isomorphism type of a quotient group W/N of W by a characteristic subgroup N. The virtue of these invariants is that W/N is also a Coxeter group. For some of these invariants, the isomorphism problem of W/N is solved and so we obtain isomorphism invariants that can be effectively used to distinguish isomorphism types of finitely generated Coxeter groups. We emphasize that even if the isomorphism problem for finitely generated Coxeter groups is eventually solved, several of the algorithms described in our paper will still be useful because they are computationally fast and would most likely be incorporated into an efficient computer program that determines if two finite rank Coxeter systems have isomorphic groups.


In §2, we establish notation. In §3, we describe two elementary quotienting operations on a Coxeter system that yield another Coxeter system. In §4, we describe the binary isomorphism invariant of a finitely generated Coxeter group. In §5, we review some matching theorems. In §6, we describe the even isomorphism invariant of a finitely generated Coxeter group. In §7, we define basic characteristic subgroups of a finitely generated Coxeter group. In §8, we describe the spherical rank two isomorphism invariant of a finitely generated Coxeter group. In §9, we make some concluding remarks.

2. Preliminaries

A Coxeter matrix is a symmetric matrix M = (m(s, t))_{s,t∈S} with m(s, t) either a positive integer or infinity and m(s, t) = 1 if and only if s = t. A Coxeter system with Coxeter matrix M = (m(s, t))_{s,t∈S} is a pair (W, S) consisting of a group W and a set of generators S for W such that W has the presentation

W = ⟨S | (st)^{m(s,t)} : s, t ∈ S and m(s, t) < ∞⟩.

We call the above presentation of W the Coxeter presentation of (W, S). If (W, S) is a Coxeter system with Coxeter matrix M = (m(s, t))_{s,t∈S}, then the order of st is m(s, t) for each s, t in S by Prop. 4, p. 92 of Bourbaki [3], and so a Coxeter system (W, S) determines its Coxeter matrix; moreover, any Coxeter matrix M = (m(s, t))_{s,t∈S} determines a Coxeter system (W, S) where W is defined by the corresponding Coxeter presentation. If (W, S) is a Coxeter system, then W is called a Coxeter group and S is called a set of Coxeter generators for W, and the cardinality of S is called the rank of (W, S). A Coxeter system (W, S) has finite rank if and only if W is finitely generated by Theorem 2 (iii), p. 20 of Bourbaki [3]. Let (W, S) be a Coxeter system. A visible subgroup of (W, S) is a subgroup of W of the form ⟨A⟩ for some A ... 2} such that an edge (s, t) is labeled by m(s, t). Coxeter diagrams are useful


for visually representing finite Coxeter groups. If A ⊂ S, then Δ(⟨A⟩, A) is the subdiagram of Δ(W, S) induced by A. A Coxeter system (W, S) is said to be irreducible if its C-diagram Δ is connected. A visible subgroup ⟨A⟩ of (W, S) is said to be irreducible if (⟨A⟩, A) is irreducible. A subset A of S is said to be irreducible if ⟨A⟩ is irreducible. A subset A of S is said to be a component of S if A is a maximal irreducible subset of S or equivalently if Δ(⟨A⟩, A) is a connected component of Δ(W, S). The connected components of Δ(W, S) represent the factors of a direct product decomposition of W. The presentation diagram (P-diagram) of (W, S) is the labeled undirected graph Γ = Γ(W, S) with vertices S and edges

{(s, t) : s, t ∈ S and m(s, t) < ∞}

such that an edge (s, t) is labeled by m(s, t). Presentation diagrams are useful for visually representing infinite Coxeter groups. If A ⊂ S, then Γ(⟨A⟩, A) is the subdiagram of Γ(W, S) induced by A. The connected components of Γ(W, S) represent the factors of a free product decomposition of W. For example, consider the Coxeter group W generated by the four reflections in the sides of a rectangle in E^2. The C-diagram of (W, S) is the disjoint union of two edges labeled by ∞ while the P-diagram of W is a square with edge labels 2. Let (W, S) and (W', S') be Coxeter systems with P-diagrams Γ and Γ', respectively. An isomorphism φ : (W, S) → (W', S') of Coxeter systems is an isomorphism φ : W → W' such that φ(S) = S'. An isomorphism ψ : Γ → Γ' of P-diagrams is a bijection from S to S' that preserves edges and their labels. Note that (W, S) ≅ (W', S') if and only if Γ ≅ Γ'. We shall use Coxeter's notation on p. 297 of [5] for the irreducible spherical Coxeter simplex reflection groups except that we denote the dihedral group D_2^k by D_2(k). Subscripts denote the rank of a Coxeter system in Coxeter's notation. Coxeter's notation partly agrees with but differs from Bourbaki's notation on p. 193 of [3]. Coxeter [4] proved that every finite irreducible Coxeter system is isomorphic to exactly one of the Coxeter systems A_n, n ≥ 1, B_n, n ≥ 4, C_n, n ≥ 2, D_2(k), k ≥ 5, E_6, E_7, E_8, F_4, G_3, G_4. For notational convenience, we define B_3 = A_3, D_2(3) = A_2, and D_2(4) = C_2. The type of a finite irreducible Coxeter system (W, S) is the isomorphism type of (W, S) represented by one of the systems A_n, B_n, C_n, D_2(k), E_6,


E_7, E_8, F_4, G_3, G_4. The type of an irreducible subset A of S is the type of (⟨A⟩, A). The C-diagram of A_n is a linear diagram with n vertices and all edge labels 3. The C-diagram of B_n is a Y-shaped diagram with n vertices and all edge labels 3 and two short arms consisting of single edges. The C-diagram of C_n is a linear diagram with n vertices and all edge labels 3 except for the last edge labelled 4. The C-diagram of D_2(k) is a single edge with label k. The C-diagrams of E_6, E_7, E_8 are star shaped with three arms and all edge labels 3. One arm has length one and another has length two. The C-diagram of F_4 is a linear diagram with edge labels 3, 4, 3 in that order. The C-diagram of G_3 is a linear diagram with edge labels 3, 5. The C-diagram of G_4 is a linear diagram with edge labels 3, 3, 5 in that order.
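The two diagrams of Section 2 can be read off mechanically from a Coxeter matrix. The sketch below is illustrative only: the dictionary encoding of the matrix and all function names are assumptions, and the C-diagram edge rule m(s, t) > 2 is the usual convention, assumed here because the defining sentence is incomplete in this copy. It reproduces the rectangle example: two disjoint edges labeled ∞ in the C-diagram and a square with labels 2 in the P-diagram.

```python
import math
from itertools import combinations

INF = math.inf


def check_coxeter_matrix(S, m):
    """Check that m is symmetric and m(s, t) = 1 exactly when s == t."""
    for s in S:
        for t in S:
            assert m[s, t] == m[t, s]
            assert (m[s, t] == 1) == (s == t)


def c_diagram_edges(S, m):
    """Edges of the C-diagram: pairs with m(s, t) > 2 (assumed convention)."""
    return {(s, t): m[s, t] for s, t in combinations(S, 2) if m[s, t] > 2}


def p_diagram_edges(S, m):
    """Edges of the P-diagram: pairs with m(s, t) < infinity."""
    return {(s, t): m[s, t] for s, t in combinations(S, 2) if m[s, t] < INF}


def components(S, edges):
    """Connected components of the graph with vertex set S and given edges."""
    parent = {s: s for s in S}

    def find(s):
        while parent[s] != s:
            s = parent[s]
        return s

    for s, t in edges:
        parent[find(s)] = find(t)
    groups = {}
    for s in S:
        groups.setdefault(find(s), []).append(s)
    return sorted(groups.values())


# Reflections in the sides 1, 2, 3, 4 of a rectangle, listed in cyclic order:
# adjacent sides are perpendicular (label 2), opposite sides are parallel
# (label infinity).
S = [1, 2, 3, 4]
m = {}
for s in S:
    for t in S:
        if s == t:
            m[s, t] = 1
        elif (s - t) % 2 == 0:
            m[s, t] = INF
        else:
            m[s, t] = 2

check_coxeter_matrix(S, m)
c_edges = c_diagram_edges(S, m)
p_edges = p_diagram_edges(S, m)
```

Here components(S, c_edges) has two parts, matching a direct product of two infinite dihedral groups, while components(S, p_edges) has a single part, so the P-diagram gives no free product splitting.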

3. Elementary Quotient Operations

In this section we describe two types of elementary edge quotient operations on a Coxeter system (W, S) of finite rank. The first we call edge label reduction and the second we call edge elimination. Suppose s and t are distinct elements of S with m(s, t) < ∞. Let d be a positive divisor of m = m(s, t), with d < m, and let N be the normal closure of the element (st)^d of W. Then a presentation for W/N is obtained from the Coxeter presentation for (W, S) by adding the relator (st)^d. As m = (m/d)d, the relator (st)^m is derivable from the relator (st)^d and so the relator (st)^m can be removed from the presentation for W/N. Assume d > 1. Then the presentation for W/N is a Coxeter presentation whose P-diagram is obtained from the P-diagram for (W, S) by replacing the label m on the edge (s, t) with the label d. We call the operation of passing from the Coxeter system (W, S) to the quotient Coxeter system (W/N, {sN : s ∈ S}) the (s, t) edge label reduction from m to d. For example, if we reduce the 4 edge of F_4 to 2, we obtain the Coxeter system A_2 × A_2. Now assume d = 1. We delete from the presentation for W/N the generator t and the relator st and replace all occurrences of t in the remaining relators by s. Suppose r is in S and k = m(r, s) < ∞ and ℓ = m(r, t) < ∞. Then we have the relators (rs)^k and (rs)^ℓ in the presentation for W/N. Let d be the greatest common divisor of k and ℓ. Then there are integers a and b such that d = ak + bℓ. This implies that (rs)^d is derivable from (rs)^k and (rs)^ℓ and so we may add the relator (rs)^d to the presentation for W/N. Then (rs)^k and (rs)^ℓ are derivable from (rs)^d and so we can eliminate the relators (rs)^k and (rs)^ℓ from the presentation for W/N. We do this for each


r in S such that m(r, s) ... W^{(i)} is the quotient homomorphism, then

N^{(i+1)}(W) = η_i^{-1}(N(W^{(i)}))

for each i = 1, ..., ℓ − 1, and W^{(ℓ)} has no basic subgroups of rank greater than 2, and ℓ is as small as possible. We have that W^{(i+1)} = (W^{(i)})^{(2)} for each i = 1, ..., ℓ − 1. Therefore the isomorphism type of W^{(i)} for each i = 1, ..., ℓ is an isomorphism invariant of W. It follows from the Basic Matching Theorem that ℓ does not depend on a choice of Coxeter generators for W, and so ℓ is an isomorphism invariant of W. We call ℓ the spherical rank 2 class of W. We have ℓ ≥ 1 with ℓ = 1 if and only if W has no basic subgroups of rank greater than 2. Figure 2 shows the P-diagrams of a sequence W^{(1)}, ..., W^{(ℓ)} with ℓ = 4 for the Coxeter group W = W^{(1)}. Define N_2 = N^{(ℓ)}(W). Then N_2 is a characteristic subgroup of W such that W_2 = W/N_2 has no basic subgroups of rank greater than 2. The isomorphism type of W_2 is an isomorphism invariant of W which we call the spherical rank 2 isomorphism invariant of W. Let η : W → W_2 be the quotient homomorphism, and let S_2 = η(S). Then (W_2, S_2) is a Coxeter system that can be obtained from (W, S) by a finite series of elementary edge quotient operations.

Figure 2. P-diagrams of a sequence W^{(1)}, ..., W^{(ℓ)} with ℓ = 4 (edge labels 2 and 3).

9. Conclusion

Let (W, S) be a Coxeter system of finite rank. In this paper, we have described three characteristic subgroups N_b, N_e, N_2 of W each leading to a quotient isomorphism invariant of W. It is interesting to note that N_2 ⊆ N_e ⊆ N_b,

and so the quotient isomorphism invariants corresponding to N_b, N_e, N_2 are progressively stronger. The algorithm for finding a P-diagram for the system (W_b, S_b) starting from a P-diagram of (W, S) is computationally fast. The algorithm for finding a P-diagram for the system (W_e, S_e) is slower since it has to determine the bases of (W, S) of type D_2(4q + 2) that satisfy the conditions of Theorem 5.4; but this algorithm is only slightly slower since the conditions of Theorem 5.4 are easy to check. The algorithm for finding a P-diagram for the system (W_e, S_e) would most likely be incorporated into an efficient computer program that determines if two finite rank Coxeter systems have isomorphic groups, since the even isomorphism invariant would usually determine that two random finite rank Coxeter systems have nonisomorphic groups. The algorithm for finding a P-diagram for the system (W_2, S_2) is the slowest, but it is not much slower, since it only has to find a subdiagram of the P-diagram of (W, S) of type A_3, C_3 or G_3 before it performs an edge quotient operation on an edge of the subdiagram, and therefore reduces the complexity of the P-diagram. If the subdiagram is of type A_3 or G_3, then the edge with label 2 is eliminated. If the subdiagram is of type C_3, then the 4 edge label is reduced to 2. The algorithm then repeats the routine of searching for a subdiagram of type A_3, C_3 or G_3 and performing the corresponding edge quotient operation. The algorithm for finding a P-diagram for the system (W_2, S_2) would most likely be useful in an efficient program that determines if two finite rank Coxeter systems have isomorphic groups, since the solution of the isomorphism problem for finite rank Coxeter systems that have no bases of rank greater than 2 is considerably simpler than any general solution of the isomorphism problem.
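The edge label reduction step used by these algorithms is simple to express on a Coxeter matrix. The following is only a sketch under stated assumptions: the dictionary encoding and the names reduce_edge_label and f4_matrix are hypothetical, and only the case d > 1 of Section 3 is covered (edge elimination, the case d = 1, is not implemented). It checks the example from Section 3: reducing the 4 edge of F_4 to 2 leaves two commuting copies of A_2.

```python
import math
from itertools import combinations


def reduce_edge_label(m, s, t, d):
    """Return a new Coxeter matrix with the (s, t) label reduced to d.

    Requires m(s, t) finite, d dividing m(s, t), and 1 < d < m(s, t).
    The input matrix is left unchanged.
    """
    old = m[s, t]
    if old == math.inf or not (1 < d < old) or old % d != 0:
        raise ValueError("d must be a proper divisor of m(s, t) with d > 1")
    new = dict(m)
    new[s, t] = new[t, s] = d
    return new


def f4_matrix():
    """Coxeter matrix of F_4: a linear diagram with edge labels 3, 4, 3."""
    S = [1, 2, 3, 4]
    labels = {(1, 2): 3, (2, 3): 4, (3, 4): 3}
    m = {}
    for s in S:
        for t in S:
            if s == t:
                m[s, t] = 1
            else:
                # Pairs not joined in the C-diagram commute, so the label is 2.
                m[s, t] = labels.get((min(s, t), max(s, t)), 2)
    return S, m


S, m = f4_matrix()
reduced = reduce_edge_label(m, 2, 3, 2)

# C-diagram edges (labels greater than 2) after the reduction: two disjoint
# edges labeled 3, that is, the system A_2 x A_2.
c_edges = {(s, t): reduced[s, t]
           for s, t in combinations(S, 2) if reduced[s, t] > 2}
```

The function returns a fresh matrix rather than mutating its argument, so a search routine that tries several reductions can keep the original system for comparison.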


References

1. P. Bahls, Even rigidity in Coxeter Groups, Ph.D. Thesis, Vanderbilt University, 2002.
2. P. Bahls and M. Mihalik, Reflection independence in even Coxeter groups, Geometriae Dedicata 110 (2005), 63-80.
3. N. Bourbaki, Groupes et algebres de Lie, Chapitres 4, 5, et 6, Hermann, Paris, 1968.
4. H.S.M. Coxeter, The complete enumeration of finite groups of the form R_i^2 = (R_i R_j)^{k_ij} = 1, J. London Math. Soc. 10 (1935), 21-25.
5. H.S.M. Coxeter, Regular Polytopes, Dover, New York, 1973.
6. G. Maxwell, The normal subgroups of finite and affine Coxeter groups, Proc. London Math. Soc. 76 (1998), 359-382.
7. B. Mühlherr, The isomorphism problem for Coxeter groups, In: The Coxeter Legacy: Reflections and Projections, Edited by C. Davis and E.W. Ellers, Amer. Math. Soc., (2006), 1-15.
8. M. Mihalik, J. Ratcliffe, and S. Tschantz, Matching theorems for systems of finitely generated Coxeter groups, Algebr. Geom. Topol. 7 (2007), 919-956.

Localization and IA-automorphisms of finitely generated, metabelian, and torsion-free nilpotent groups

Marcos Zyman*

Department of Mathematics, The City University of New York-BMCC, New York, New York 10007, USA
E-mail: [email protected]

Given a nilpotent group G and a prime p, there is a unique p-local group G_(p) which is, in some sense, the "best approximation" to G among all p-local nilpotent groups. G_(p) is called the p-localization of G. Let IA(G) be the group of automorphisms of G that induce the identity on G/[G, G]. IA(G) turns out to be nilpotent so its p-localization exists. Two groups are said to be in the same localization genus if their p-localizations are isomorphic for all p. We prove that if two finitely generated, torsion-free nilpotent, and metabelian groups lie in the same localization genus, their IA-groups also lie in the same localization genus. The method of proof involves basic sequences and commutator calculus.

Keywords: Nilpotent groups, p-localization, I A-automorphisms.

1. Introduction

The objective of this paper is to investigate the interaction between the IA-automorphisms and p-localization of finitely generated, torsion-free nilpotent, and metabelian groups. In particular we prove that if two such groups lie in the same localization genus, their IA-groups also lie in the same localization genus. The proof requires an understanding of basic commutators and commutator calculus. Before stating the main theorem precisely, we discuss some initial notions and facts (see Refs. 1, 2, and 3). A group G is called p-local if the map x ↦ x^n from G to itself is a bijection for n relatively prime to p. For every nilpotent group G there is a homomorphism of nilpotent groups e : G → G_(p) which p-localizes G. This means that G_(p) is p-local, and for every p-local nilpotent group K, the map e^* : Hom(G_(p), K) → Hom(G, K) given by e^*( ...

where $\mathrm{wt}(b_k) \le c - 1$ for each $k$, and $v(i)$ is relatively prime to $p$. Direct computations involving commutator calculus give

where

$$D_{i1} = \prod_{k>l} \big([b_k, A_l][A_k, x_l]\big)^{*}.$$

In general, for each $i$, we may construct a sequence of elements of $G'$:

(1)

where

$$D_{i1} = \prod_{k>l} \big([b_k, A_l][A_k, x_l]\big)^{*}, \qquad D_{ij} = \prod_{k>l} \big([b_k, D_{l(j-1)}][D_{k(j-1)}, x_l]\big)^{*} \quad \text{for } j > 1,$$

$\Gamma_{j+2}(G_{(p)})$, for $j = 1, 2, \ldots,$

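We can therefore choose $\alpha_i \in G'$ such that $\bar{\alpha}_i = A_i^{s}$, and $D_{ij} \in \Gamma_{j+2}(G)$ such that $\bar{D}_{ij} = \delta_{ij}^{d_j(s)}$ for each $j = 1, 2, \ldots, c-2$, where $\bar{g} = e(g)$ for $g \in G$. Using Lemma 3.2 yet again we see that

$$\varphi^{s}(\bar{x}_i) = \bar{x}_i \, A_i^{s} \, \delta_{i1}^{d_1(s)} \cdots \delta_{i(c-2)}^{d_{c-2}(s)}.$$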

Let

$$\beta_i = \alpha_i D_{i1} \cdots D_{i(c-2)} \in G'.$$

Define the following map on the generators of $G$:

$$f(x_i) = x_i \beta_i.$$

It is routine to show that $f$ can be extended to an element of $IA(G)$ (see p. 47 of Ref. 6). Then $f_p$ and $\varphi^{s}$ coincide on the $p$-generators of $G_{(p)}$ and will therefore be equal as IA-automorphisms. The proof of Theorem 2.2 is now complete, and Theorem 1.1 readily follows.

4. Examples

4.1. An example where $\mathrm{Inn}\,G \neq IA(G)$

Let $\zeta$ denote the center of $G$. If it were the case that $\mathrm{Inn}\,G = IA(G)$ for all nilpotent groups $G$, Theorem 1.1 would be trivial since we would have

due to the facts that $(G/\zeta)_{(p)} \cong G_{(p)}/\zeta_{(p)}$ (see Ref. 2) and $H/\zeta(H) \cong \mathrm{Inn}\,H$ for any group $H$. To demonstrate that Theorem 1.1 is nontrivial in general, we compute $\mathrm{Inn}\,G$ and $IA(G)$ where $G$ is free nilpotent of class 2 and rank 3. It will then be clear that $\mathrm{Inn}\,G \neq IA(G)$. Let

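$$G = \langle x, y, z \rangle$$

be free nilpotent of class 2 on the generators $\{x, y, z\}$. Put $c_{12} = [x, y]$, $c_{13} = [x, z]$, and $c_{23} = [y, z]$ (we will use similar notations in §4.2). Then

$$x,\ y,\ z,\ c_{12},\ c_{13},\ c_{23}$$

is a basic sequence of basic commutators on $\{x, y, z\}$. Since $G$ is free nilpotent, every $g \in G$ can be uniquely written as

$$g = x^{e_1} y^{e_2} z^{e_3} c_{12}^{e_{12}} c_{13}^{e_{13}} c_{23}^{e_{23}}.$$

If $g' = x^{e_1'} y^{e_2'} z^{e_3'} c_{12}^{e_{12}'} c_{13}^{e_{13}'} c_{23}^{e_{23}'}$ is another element of $G$, standard commutator calculus gives

$$gg' = x^{e_1 + e_1'} \, y^{e_2 + e_2'} \, z^{e_3 + e_3'} \, c_{12}^{e_{12} + e_{12}' - e_1' e_2} \, c_{13}^{e_{13} + e_{13}' - e_1' e_3} \, c_{23}^{e_{23} + e_{23}' - e_2' e_3}. \tag{7}$$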
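The multiplication rule (7) can be checked mechanically. The following is a minimal Python sketch, not part of the original paper, that stores an element of G as its six exponents in the normal form and multiplies by (7). The names `mul`, `inv`, `comm`, and `conj` are illustrative assumptions, and the inverse formula is derived here from (7) rather than taken from the source.

```python
# An element x^e1 y^e2 z^e3 c12^e12 c13^e13 c23^e23 of the free nilpotent
# group of class 2 and rank 3 is stored as (e1, e2, e3, e12, e13, e23).
IDENTITY = (0, 0, 0, 0, 0, 0)
X = (1, 0, 0, 0, 0, 0)
Y = (0, 1, 0, 0, 0, 0)
Z = (0, 0, 1, 0, 0, 0)


def mul(g, h):
    """Product g*h computed with formula (7); h plays the role of g'."""
    e1, e2, e3, e12, e13, e23 = g
    f1, f2, f3, f12, f13, f23 = h
    return (e1 + f1, e2 + f2, e3 + f3,
            e12 + f12 - f1 * e2,
            e13 + f13 - f1 * e3,
            e23 + f23 - f2 * e3)


def inv(g):
    """Inverse of g, obtained by solving mul(g, h) == IDENTITY for h."""
    e1, e2, e3, e12, e13, e23 = g
    return (-e1, -e2, -e3,
            -e12 - e1 * e2,
            -e13 - e1 * e3,
            -e23 - e2 * e3)


def comm(g, h):
    """Commutator [g, h] = g^-1 h^-1 g h."""
    return mul(mul(inv(g), inv(h)), mul(g, h))


def conj(a, g):
    """Image of a under the inner automorphism a -> g^-1 a g."""
    return mul(mul(inv(g), a), g)
```

With this encoding, `comm(X, Y)`, `comm(X, Z)`, and `comm(Y, Z)` return the tuples for $c_{12}$, $c_{13}$, and $c_{23}$, and `conj(X, g)` has the form $x\,c_{12}^{e_2} c_{13}^{e_3}$ for every $g$, so its $c_{23}$ exponent is always zero. A map sending $x$ to $x\,c_{23}$ (a hypothetical illustration, not necessarily one of the elements considered in the paper) therefore cannot arise from conjugation, which is consistent with the claim that $\mathrm{Inn}\,G \neq IA(G)$.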

Consider the following nine elements of I A( G):
