Threshold Graphs and Related Topics, Volume 56 (Annals of Discrete Mathematics)

THRESHOLD GRAPHS AND RELATED TOPICS ANNALS OF DISCRETE MATHEMATICS General Editor: Peter L. HAMMER Rutgers Universit...

Author: N.V.R. Mahadev | U.N. Peled

8 downloads 531 Views 21MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

THRESHOLD GRAPHS AND RELATED TOPICS

ANNALS OF DISCRETE MATHEMATICS

General Editor: Peter L. HAMMER Rutgers University, New Brunswick, NJ, USA Advisory Editors: C. BERGE, Universit6 de Paris, France R.L. GRAHAM, AT&T Bell Laboratories, NJ, USA M.A. HARRISON, University of California, Berkeley, CA, USA V. KLEE, University of Washington, Seattle, WA, USA J.H. VAN LINT, California Institute of Technology, Pasadena, CA, USA G.C. ROTA, Massachusetts Institute of Technology, Cambridge, MA, USA T. TROTTER, Arizona State University, Tempe, AZ, USA

56

THRESHOLD GRAPHS AND RELATED TOPICS

N.V.R. MAHADEV Northeastern University Boston, MA, USA

U.N. PELED University of Illinois at Chicago Chicago, IL, USA

1995 ELSEVIER AMSTERDAM

�9L A U S A N N E

�9N E W Y O R K �9O X F O R D �9S H A N N O N

�9T O K Y O

ELSEVIER SCIENCE B.V. Sara Burgerhartstraat 25 P.O. Box 211, 1000 AE Amsterdam, The Netherlands

ISBN: 0 444 89287 7 �9 1995 Elsevier Science B.V. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior written permission of the publisher, Elsevier Science B.V., Copyright & Permissions Department, P.O. Box 521, 1000 A M Amsterdam, The Netherlands. Special regulations for readers in the U.S.A. - This publication has been registered with the Copyright Clearance Center Inc. (CCC), 222 Rosewood Drive, Danvers, MA 01923. Information can be obtained from the CCC about conditions under which photocopies of parts of this publication may be made in the U.S.A. All other copyright questions, including photocopying outside of the U.S.A, should be referred to the publisher. No responsibility is assumed by the publisher for any injury and~or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions or ideas contained in the material herein. This book is printed on acid-free paper. Printed in The Netherlands

Dedicated to my mother Prasuna who was my first teacher and to my father Nadimpalli V. S u b r a h m a n y a m ~ho inspired me to do m a t h e m a t i c s NVRM

Dedicated to my dear mother Malka and in m e m o r y of my father Ze'ev who gave me all that I have UNP

n~9~l In~Y)~ 17 7N~ n"~ ~'~N 7 ~

~9D 13q3-,qIN

"Pl

This Page Intentionally Left Blank

Preface Threshold graphs have a beautiful structure and possess many important mathematical properties such as being the extreme cases of certain graph properties. They also have applications in many areas such as computer science and psychology. Their many characterizations can be relaxed in different directions to obtain new and important classes of graphs. For this reason, interest in these graphs gained momentum during last 20 years. In 1980, Golumbic devoted a chapter to them in his book Algorithmic Graph Theory and Perfect Graphs [Go180]. Since then many new results related to this topic were discovered, and by now more than 100 articles have been published in various fields. Even as we started to write this book, significant results were discovered as late as this summer. We believe that this subject will continue to attract much attention and there is a need for a coherent presentation of the existing results to serve as a reference. In writing this book, we unified several scattered results and rewrote some proofs, occasionally giving new ones. Because of space considerations we could not include every result related to threshold graphs, and we had to exercise our personal bias. In particular, we cover very little from the vast fields of hypergraphs and Boolean functions. This book is self-contained, except for very few places where we use known results from the literature. However, some chapters assume general background from linear programming or complexity theory. We tried to organize the book as much as possible so that every chapter could be studied independently after Chapter 1 and in some cases Chapter 2. Occasionally, we repeated some definitions for the convenience of the reader. We included many open problems and research ideas to make the book attractive to graduate students and researchers interested in graph theory. We typeset this book using [4TEX 2e under emTEX, and we thank Eberhard Mattes for making the excellent em~-F~X system available to the general vii

viii

Preface

public. The pictures were prepared with a combination of TEXCAD and PI~These tools enabled us to produce a camera-ready copy very easily and efficiently. Several people read parts of the manuscript and gave valuable comments. They include Waleed A1-Jasem, Srinivasa Arikati, Chris Brown, Kristine Cirino, Yee-Hong Lui, Francois Margot, Thomas Raschle, Ron Shamir, Andrea Sterbini, and Julin Wu. We owe them a debt of gratitude for their help. We are grateful for the kind support of the Swiss Federal Institute of Technology in Lausanne for inviting the first author to give seminars on the topics of this book, and of the Department of Mathematics at Northeastern University and the Mathematics, Statistics, and Computer Science Department of the University of Illinois at Chicago for enabling us to spend time together to accomplish what was not possible with e-mail alone. We thank Arjen Sevenster of Elsevier Science for his help in publishing this manuscript. Special thanks are due to Peter Hammer, who introduced both of us to the subject of threshold graphs, invited us to write this book and encouraged us throughout this project. Our deepest thanks go to our wives Aparna and Ofra and our children Maya, Shilpa, Tsoni and Benny for their sacrifices and endurance through all these long years. Without their constant love and support we could not have completed this undertaking.

June 199,5

Contents Preface

vii

Basic Terminology 1

2

Threshold Graphs 1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . 1.2 Basic Characterizations . . . . . . . . . . . . . . . . . . . . . . 1.3 Minimizing Integral Weights . . . . . . . . . . . . . . . . . . . 1.4 Perfect Graphs and Algorithms . . . . . . . . . . . . . . . . . 1.4.1 Perfect Graphs . . . . . . . . . . . . . . . . 1.4.2 Algorithms . . . . . . . . . . . . . . . . . . 1.5 Threshold and Split Completions . . . . . . . . . . . . . . . . 1.6 Longest Cycles and Hamiltonicity . . . . . . . . . . . . . . . . 1.6.1 Hamiltonicity . . . . . . . . . . . . . . . . . 1.7 Total Coverings and Total Matchings . . . . . . . . . . . . . . 1.7.1 Main Results . . . . . . . . . . . . . . . . . 1.7.2 Related Results . . . . . .................

. . . . . .

. . . . . . . . . . . .

. . . . . . . . . . . .

7 7 8 14 17 17 19 19 22 25 27 28 31

Ferrets Digraphs and Difference Graphs

33

2.1 2.2 2.3 2.4

33 38 46 52

Introduction . . . . . . . . . . . . . Ferrets Digraphs, Characterizations T h e Ferrets Dimension . . . . . . . Difference Graphs . . . . . . . . . .

. . . . . . . . . . . . . . . ............... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

3 Degree Sequences 3.1 3.2 3.3

59

Graphical Degree Sequences . . . . . . . . . . . . . . . . . . . Threshold Sequences . . . . . . . . . . . . . . . . . . . . . . . The P o l y t o p e of Degree Sequences . . . . . . . . . . . . . . . . ix

59 72 75

Contents 3.3.1

3.4

4

5

6

7

E x t r e m e Points

. . . . . . . . . . . . . . . . . . . . . .

76

3.3.2

Adjacency . . . . . . . . . . . . . . . . . . . . . . . . .

78

3.3.3

Linear D e s c r i p t i o n a n d Facets

. . . . . . . . . . . . . .

86

. . . . . . . . . . . . . . . . . . . . . . .

92

Difference Sequences

Applications

97

4.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

97

4.2

A g g r e g a t i o n of I n e q u a l i t i e s . . . . . . . . . . . . . . . . . . . .

97

4.3

Synchronization . . . . . . . . . . . . . . . . . . . . . . . . . .

99

4.4

Cyclic S c h e d u l i n g . . . . . . . . . . . . . . . . . . . . . . . . .

105

4.5

G u t t m a n Scales . . . . . . . . . . . . . . . . . . . . . . . . . .

109

Split Graphs

111

5.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

111

5.2

Basic P r o p e r t i e s . . . . . . . . . . . . . . . . . . . . . . . . . .

111

5.3

H a m i l t o n i a n Split G r a p h s

. . . . . . . . . . . . . . . . . . . .

116

5.4

T h e S p l i t t a n c e of a G r a p h

. . . . . . . . . . . . . . . . . . . .

117

The Threshold Dimension

123

6.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

123

6.2

B o u n d s for t h e T h r e s h o l d D i m e n s i o n

. . . . . . . . . . . . . .

126

6.3

Dimensional Properties . . . . . . . . . . . . . . . . . . . . . .

128

6.4

Operations Preserving the Threshold Dimension . . . . . . . .

134

6.5

Restricted Threshold Dimension . . . . . . . . . . . . . . . . .

135

NP-Completeness

139

7.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

139

7.2

The Partial Order Dimension

7.3

7.4

7.2.1

Preliminaries

7.2.2

The Reduction

. . . . . . . . . . . . . . . . . .

139

. . . . . . . . . . . . . . . . . . . . . . .

139

. . . . . . . . . . . . . . . . . . . . . .

141

Related NP-complete Problems

. . . . . . . . . . . . . . . . .

146

. . . . . . . . . . . . . . . . . . . .

147

. . . . . . . . . . . . . . . . . . . . . . . . . .

148

7.3.3

Cubicity . . . . . . . . . . . . . . . . . . . . . . . . . .

149

7.3.4

Threshold Dimension . . . . . . . . . . . . . . . . . . .

149

7.3.1

Interval Dimension

7.3.2

Boxicity

Other Complexity Results

. . . . . . . . . . . . . . . . . . . .

150

7.4.1

Preliminaries

. . . . . . . . . . . . . . . . . . . . . . .

150

7.4.2

Largest Threshold Subgraph . . . . . . . . . . . . . . .

152

Contents

8

xi

7.4.3

Weighted 2-Threshold Partition

7.4.4

R e d u c t i o n f r o m N M T S to M O N 3 P A R T

7.4.5

r-Cyclic Scheduling and H T r H

7.5

T h e Split D i m e n s i o n

7.6

Polar Graphs

.........

.............

154 158 160

. . . . . . . . . . . . . . . . . . . . . . .

163

. . . . . . . . . . . . . . . . . . . . . . . . . . .

168

2-Threshold Graphs

175

8.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

175

P r o p e r t i e s of 2 - t h r e s h o l d G r a p h s

8.2

8.2.1

9

.............

As P e r f e c t G r a p h s

. . . . . . . . . . . . . . . .

176

. . . . . . . . . . . . . . . . . . . .

176

8.3

Bithreshold Graphs

. . . . . . . . . . . . . . . . . . . . . . . .

179

8.4

8.3.1 Decomposing Bithreshold Graphs ............ Strict 2 - T h r e s h o l d G r a p h s . . . . . . . . . . . . . . . . . . . .

179 186

8.5 8.6

Recognizing Threshold Dimension 2 ............... R e c o g n i z i n g Difference D i m e n s i o n 2 . . . . . . . . . . . . . . .

190 213

8.7

Intersection Threshold Dimension 2 ...............

224

The Dilworth N u m b e r

241

9.1

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .

241

9.2

G r a p h s of D i l w o r t h N u m b e r 2 . . . . . . . . . . . . . . . . . .

245

9.2.1

T h r e s h o l d Signed G r a p h s . . . . . . . . . . . . . . . . .

245

9.2.2 Forbidden Subgraphs . . . . . . . . . . . . . . . . . . . 9.3 T h e D i l w o r t h N u m b e r a n d P e r f e c t G r a p h s . . . . . . . . . . . .

250 252

10 B o x - T h r e s h o l d Graphs

257

10.1 I n t r o d u c t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.2 E l e m e n t a r y P r o p e r t i e s . . . . . . . . . . . . . . . . . . . . . .

257 258

10.3 A T r a n s p o r t a t i o n M o d e l . . . . . . . . . . . . . . . . . . . . . 10.4 F r a m e s of B T G r a p h s . . . . . . . . . . . . . . . . . . . . . . .

262 265

11 Matroidal and M a t r o g e n i c Graphs

271

11.1 I n t r o d u c t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2 M a t r o i d a l G r a p h s . . . . . . . . . . . . . . . . . . . . . . . . . 11.2.1 11.2.2

271 272

Forbidden Configurations . . . . . . . . . . . . . . . . . P4-vertices . . . . . . . . . . . . . . . . . . . . . . . . .

272 274

11.2.3 2 K 2 - v e r t i c e s a n d C4-vertices . . . . . . . . . . . . . . . 11.2.4 T h r e s h o l d Vertices . . . . . . . . . . . . . . . . . . . .

282 284

11.2.5

292

P r o p e r t i e s of M a t r o i d a l G r a p h s

.............

xii

Contents 11.3 Matrogenic Graphs . . . . . . . . . . . . . . . . . . . . . . . . 11.4 Matrogenic Sequences . . . . . . . . . . . . . . . . . . . . . .

12 Domishold Graphs 12.1 12.2 12.3 12.4

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . Notation and Main Results . . . . . . . . . . . . . . . . . . . . Equidominating Graphs . . . . . . . . . . . . . . . . . . . . . Pseudodomishold Graphs . . . . . . . . . . . . . . . . . . . . .

13 The D e c o m p o s i t i o n M e t h o d 13.1 13.2 13.3 13.4 13.5

Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . The Canonical Decomposition . . . . . . . . . . . . . . . . . . Domishold Graphs and Decomposition . . . . . . . . . . . . . Box-Threshold Graphs and Decomposition . . . . . . . . . . . Matroidal and Matrogenic Graphs and Decomposition . . . . .

296 303

313 313 313 320 323

327 327 328 335 341 348

14 Pseudothreshold and Equistable Graphs 14.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.2 Pseudothreshold Graphs . . . . . . . . . . . . . . . . . . . . . 14.3 Equistable Graphs . . . . . . . . . . . . . . . . . . . . . . . . 14.3.1 Strongly Equistable Graphs . . . . . . . . . . . . . . . 14.3.2 Some Strongly Equistable Perfect Graphs . . . . . . . . 14.3.3 Pseudothreshold and Outerplanar Graphs . . . . . . . 14.3.4 Equistability, Strong Equistability, and Substitution . . . . . . . . . . . . . . . . . . . . .

351

15 T h r e s h o l d W e i g h t s a n d M e a s u r e s 15.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15.2 Threshold Weights . . . . . . . . . . . . . . . . . . . . . . . . 15.3 Threshold Measures . . . . . . . . . . . . . . . . . . . . . . . . 15.4 Threshold and Majorization Gaps . . . . . . . . . . . . . . . . 15.4.1 The Threshold Gap . . . . . . . . . . . . . . . . . . . . 15.4.2 The Majorization Gap . . . . . . . . . . . . . . . . . .

375 375 376 386 401 403 409

16 Threshold Graphs and Order Relations 16.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.2 Biorders . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.3 Bidimensions . . . . . . . . . . . . . . . . . . . . . . . . . . .

435 435 437 444

351 351 356 357 359 363 370

Contents

xiii

16.4 Relations of Bidimension 2 . . . . . . . . . . . . . . . . . . . . 16.5 Multiple Semiorders . . . . . . . . . . . . . . . . . . . . . . . .

17 E n u m e r a t i o n 17.1 I n t r o d u c t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17.2 E n u m e r a t i o n of Threshold Graphs . . . . . . . . . . . . . . . . 17.3 E n u m e r a t i o n of Difference Graphs . . . . . . . . . . . . . . . .

455 460

467 467 468 477

18 E x t r e m a l P r o b l e m s 483 18.1 I n t r o d u c t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . . 483 18.2 Large Interval and Threshold Subgraphs of Dense Graphs . . . 484 18.3 Maximizing the Sum of Squares of Degrees . . . . . . . . . . . 488 19 O t h e r E x t e n s i o n s 19.1 I n t r o d u c t i o n . . . . . . . . . . . . . . . . . . . . . . . . . . . 19.2 G e o m e t r i c E m b e d d i n g s of Graphs . . . . . . . . . . . . . . . . 19.3 Tolerance Intersection Graphs . . . . . . . . . . . . . . . . . . 19.4 Universal Threshold Graphs . . . . . . . . . . . . . . . . . . .

497 . 497 498 505 509

Bibliography

513

List of N o t a t i o n s

528

Author

530

Index

Index

533

This Page Intentionally Left Blank

Basic Terminology We present here the basic terminology and notations used t h r o u g h o u t the book. Additional definitions and notations are introduced in the book as needed. For sets A and B, A C_ B indicates t h a t A is a subset of B, whereas A C B indicates t h a t A is a proper subset of B. Let C be a collection of sets. A set A C C is maximum if I A I _ IBI for all B G C and minimum if IAI _ IBI for all B C C. The collection C is nested if for every two sets in C, one is a subset of the other.

a poset (partially ordered set) is a pair (P, _>), where P is a set and _> is a reflexive, a n t i s y m m e t r i c and transitive relation on P. If x _> Y and x -r y hold, we write x > Y. If x >_ Y or y >_ x, x and y are comparable. Otherwise, x and y are incomparable, and we denote this condition by x I] Y- A poset with no i n c o m p a r a b l e elements is said to be total. A chain is a set of m u t u a l l y c o m p a r a b l e elements, and an antichain is a set of m u t u a l l y i n c o m p a r a b l e elements. T h e Dilworth Theorem states t h a t the largest cardinality of an antichain equals the smallest cardinality of a set of chains partitioning P. An element x is maximal if there is no element Y such t h a t y > x. Similarly, x is minimal if there is no element Y such t h a t x > Y. An element x covers an e l e m e n t y i f x > y and there is no z such t h a t x > z > y. T h e Hasse diagram of a finite poset is a drawing where each element is represented by a point, and if x covers y, x is drawn above y and is joined to it by a line. a preorder is a pair (P, > ) , where P is a set and > is a reflexive, transitive relation on P. I f x > y and Y > x, we denote this condition b y x ~ y. If x > Y and y ~ x, we denote this condition by x > y. T h e terms comparable, incomparable, total, chain, antichain, and m a x i m a l and minimal element are defined for preorders as for posets, and the Dilworth T h e o r e m carries through. T h e set of real numbers is denoted by N, and we put R + - {x C R " x >

Basic Terminology

O} and R + - {x E 1 R ' x _> 0}. For a real-valued function f, we use the notation f(S) - E f(s).

sES

A function f 9 {0,1} n + {0,1} is called a Boolean function. a l , . . . , a , and t, the Boolean function f defined by

For reals

n

f ( x l , . . . ,Xn) -- 0 ~

E aixi ~ t i=1

is called a threshold function. The support of a vector ( X l , . . . , X n ) is the set {i 9 x~ r 0}. If S C_ { a l , . . . , a n } , then the characteristic vector of S is the vector ( X l , . . . , X n ) given by 1, if ai E S xi -0, otherwise. For a real x, [xJ denotes the largest integer i < x and Ix 1 denotes the smallest integer i > x. A graph G is an ordered pair (V, E), where V - V(G) is a set of elements called vertices, E - E(G) is a set of elements called edges, and each edge is an unordered pair of vertices (its ends or end-vertices or end-points). If the two ends are the same, then the edge is called a loop. Note that we do not allow parallel edges, i.e., all edges are distinct. The graph is said to be finite if V is a finite set. Unless otherwise indicated, all graphs considered from now on are finite and loopless. A graph with n vertices and e edges is referred to as an (n, e)-graph. An edge with ends a, b is denoted by ab. Two vertices a and b are adjacent (or neighbors) if ab is an edge, and non-adjacent (or non-neighbors) otherwise. In the latter case, ab is a nonedge. If a vertex is an end of an edge, then they are incident. Two edges are adjacent if they share a c o m m o n end. The adjacency matrix (aij) of a graph is defined by 1, if the i-th and j - t h vertices are adjacent

a i j - { O, otherwise.

Similarly, the edge-vertex incidence matrix (bij) is defined by 1, if the i-th edge and j - t h vertex are incident

b i j - { O, otherwise.

Basic Terminology

3

The neighborhood Na(v) of a vertex v in a graph G is the set of all neighbors of v, and its closed neighborhood is Na[v] - Na(v)U {v}. When G is understood, we omit the subscript G. For subsets A and B of V,

NA(v) - N(v) n A,

U NA(v). vEB

Similarly, N(v) denotes the set of all non-neighbors of v, and NA(V) -- N(v)N A. A vertex is isolated if its neighborhood is empty, and is dominating if its closed neighborhood is the entire set of vertices. We define a binary relation on V by a ~ b z---->,N[a] ~ N(b). This relation is a preorder and is called the vicinal preorder of G. A graph H - ( W , F ) i s a subgraph of G - (V,E) if W C_ V and F C_ E. We then say that G contains H. If W - V, then H is a spanning subgraph of G. If F is the set of all edges in E with both ends in W, then H is the subgraph of G induced by W, and is denoted by G[W]. The set of edges of G[W] is denoted by E(W). Similarly, if W is the set of all ends of edges in F, then H is the subgraph induced by F. Also, G - W - G[V - W] and

G-F-(V,E-F). Two graphs G1 - (V1,E1) and G2 - (V2, E2) are iso?rtoFphic if there is a bijection f 9 V1 --+ V2 such that for all a, b C V1 we have ab C E1 " ;" f(a)f(b) C E2; in other words, G1 and G2 are two labelings of the same graph. We denote this condition by G1 " G2. A property of graphs that is preserved under isomorphism is called a graph property. A graph property P is hereditary if, whenever a graph has property P, all its induced subgraphs also have property P. The degree of a vertex v is deg(v) - IN(v)[. The degree sequence of a graph with vertices V l , . . . , v~ is d - ( d e g ( v l ) , . . . , deg(vn)). Every graph with the degree sequence d is a realization of d. A degree sequence is unigraphic if all its realizations are isomorphic. It is strongly unigraphic if there is a unique graph (V,E) with V - { 1 , . . . , n } and deg(i) - di for all i. A graph is a unigraph if its degree sequence is unigraphic. A strong unigraph is defined similarly. For a graph property P, a degree sequence is potentially P-graphic if it has a realization with property P, and forcibly P-graphic if all its realizations have property P. The degree of an edge is the unordered pair of degrees of its ends. The terms edge-degree sequence, edge-unigraph etc. are defined similarly. For more information on degree sequences, see [TCC88a, TCC88b, TCC89, RaoSl].

4

Basic Terminology

The complement of a graph G - (V,E) is the graph G - (V,F) such that ab E F ~ ab ~ E for each pair a, b of distinct vertices. The graph G - (V, E) is edgeless if E - O. The complement of an edgeless graph is a complete graph. A subset W of V is a stable set (or an independent set) if G[W] is an edgeless graph, and is a clique if G[W] is a complete graph. A k-clique is a clique of size k. A proper coloring (or simply a coloring) of G is a partition of V into stable sets, called color classes. A clique partition of G is a partition of V into cliques. The size a(G) of a m a x i m u m stable set is the stability number of G, the size a;(G) of a m a x i m u m clique is the clique number of G, the size )~(G) of a minimum coloring is the chromatic number of (7, and the size n(G) of a minimum clique partition is the clique cover number of G. A subset W of V is a vertex cover if V - W is a stable set. The size p(G) of a minimum vertex cover is the vertex cover number of G. A path P~ is a graph of the form (V,E), where V - { 1 , . . . , n } and E - {12, 2 3 , . . . , ( n - 1)n}. We say that P~ is a path joining 1 and n. For n >_ 3, a cycle C~ is a graph of the form (V,E), where V - { 1 , . . . , n } and E - { 1 2 , 2 3 , . . . , ( n - 1)n, nl}. a graph G is connected if for every two vertices a and b, G contains a path joining a and b; G is disconnected otherwise. A maximal connected subgraph of a graph (with respect to graph containment) is a connected component. A graph G is 2-connected if every two vertices belong to a cycle of G. A maximal 2-connected subgraph is a

block. A matching in a graph is a set of mutually non-adjacent edges. A vertex is saturated by a matching if it is an end of one of its edges, and is unsaturated (or missed) otherwise. A matching is perfect if it saturates every vertex of the graph.

A bipartite graph is a 2-colorable graph. We indicate a 2-coloring of a bipartite graph by (A, B; E), where A and B are the color classes and E is the set of edges. Hall's Theorem states that a bipartite graph (A, B; E) has a matching saturating every vertex of A if and only if every subset X of A satisfies IX[ t

if and only if

uvEE.

(1.5)

12

Threshold Graphs

Proof. 1) =~ 2): If G has an alternating 4-cycle a, b, c, d with ab, cd E E and ad, bc ~ E, then Wa + Wb > t

Wa + Wd t

Wb + W~ 6): If m = 1, then D1 is a clique and Equation (1.3) is satisfied. So assume m > 1. From Condition 5 it follows that two vertices are equivalent in the vicinal preorder if and only if they are in the same set Dj. Choose representatives xj E Dj, j = 1 , . . . , m. Then X 1 ~

.."

-'~ X m .

(1.7)

It follows from (1.6) and (1.7) that X 1 is adjacent to x.~ and therefore D.~ C_ N ( x l ) and also N[xm] = D1 U . . . U D ~ and N[D~] = D1 U . . . U Din.

1.2

Basic Characterizations

13

It then follows t h a t N ( X l ) = Dm, since if Xl were adjacent to x i for some j < rn, then N[xi] would also equal to D1 U -.. U Dm and x/ would have to belong to Din, a contradiction. From this we obtain N(D1) = Din. If rn = 2 we are done. So assume rn > 2. It now follows from (1.6) and (1.7) t h a t x2 is adjacent to xm-1 and therefore Dm U Din_ 1 C N ( D 2 ) . Thus N [ D m _ I ] z D2 U . . . U D m and hence N(D2) = D~ U Din_ 1. Proceeding like this we establish Condition 6. 6) => 7): The result follows from (1.2)if k < [~J and f r o m ( 1 . 3 ) i f k > [~J. 7) => 6): From (1.4) with k - m and the value of 5m+l we have that - I D 0 1 = IDII + .-. + I D m l - 1. Since Do is the set of isolated vertices, Equation (1.3) holds for k = m. If m = 1 we are done, so assume t h a t m > 1. From (1.4) with k = 0 and the value of 50 w e have t h a t (51 --- IDm I and now E q u a t i o n (1.2) holds for k = 1. If m = 2 we are done, so assume rn > 2. Now from (1.4) with k = m - 1 w e h&ve (5m_1 = IDol + -.- + IDm[-1. Since N ( D 1 ) = Din, Equation (1.3) holds for k = r n - 1 and hence by (1.4) with k = 1 Equation (1.2) holds for k = 2. Proceeding like this we establish Condition 6. 6) ~ 8): Assign the weight j to every vertex of Dj for j = 0 , . . . , rn and let t=rn. 8) => 2): The proof is the same as 1) => 2). ,, We note that Condition 7 of the theorem does not refer to the case k = [~J. However, it follows easily from Condition 6 that =

(Sk+l -- (Sk - } - I D ~ - ~ I -

1

for

k-

Lind.

(1.8)

Exercises 1. Let d -

(dl >_ " " _> d~) be a degree sequence, and define ei-max{j'dj

>_ i }

f - max{j'dj

i-1,...,n >_ j }.

Prove t h a t d is a degree sequence of a threshold graph if and only if ei = eli-k 1 for i = 1 , . . . , f . 2. Show t h a t if eLgraph G has exactly one pair of vertices with equal degrees, then G is a threshold graph and the equal degrees are equal to both f and d / o f Exercise 1 (Ruch and Gutman[RG79]).

14

1.3

Threshold Graphs

Minimizing Integral Weights

Although we defined in Section 1.2 a threshold graph G = (V, E) as a graph having a separator with non-negative real coefficients, it is clear that we could equivalently require the separator to have non-negative integral coefficients. In that case, the vertex-weights w satisfy

w(U)

_ t + 1

U c_ V stable,

(1.9)

U C_ V non-stable.

(1.10)

Orlin [Or177] proved the surprising result that for every threshold graph, if we minimize t subject only to the inequalities (1.9), (1.10) and w _> 0, then the optimum solution (w, t) is unique and integral. Moreover, it minimizes each individual w(vj) at the same time. T h e o r e m 1.3.1 (Orlin) Let G - (V, E) be a threshold graph with the vicinal preorder V 1 ~ " ' ' ~ V n o f its vertices. Consider the linear program

minimize t subject to (1.9), (1.10), w >_ O.

(1.11)

The optimum solution (w,t) of (1.11) is unique and integral, and also minimizes each w(vj). It is given by the following computation. Let s be the smallest index j such that vjvj+l C E. If no such index exists, set s - n. Then f

o,

w(vj)

_

t

-

Ei
t + 1 whereas w(v)+~,~.~, w(vi) _ 1 + 2~,-~, w(v~), and this lower bound is met by (1.12). Further, since S is stable, (1.9)imposes the lower bound t _> ~ < ~ w(v~), which is also met by (1.12). Again, for vj C K, (1.10) imposes the lower bound w(vj) >_ t + 1 - w(vf(j)), which is met by (1.12) (note that vf(j) C S). To see that (1.12) gives a feasible solution, note first that it satisfies 0 _< W(Vl) _< -.. _< w(v~). Every edge of G has at least one endpoint inK. For each vj C K, since w(vj) + w(vf(j)) - t + 1, it follows that w(vj) + w(v) >_ t + 1 for every neighbor v of vj. Thus (1.10) is satisfied. To verify (1.9), it is enough to consider the maximal stable sets. G has exactly n - s + 1 maximal stable sets, namely S, which satisfies (1.9) with equality, and the sets {vj, v ~ , . . . , vf(j)_~} for j > s. Since vf(j) ~- vf(j)-l, each of the other maximal stable sets also satisfies (1.9) with equality. Finally, we show the uniqueness of the optimum (1.12). Denote by (w, t) the optimal solution given by (1.12) and let (w', t) be any optimal solution. Then w _< w' as we have shown. Therefore, since S is stable, (1.9) on (w', t) implies B

IUI, where

x is any vertex of Din;

9 for m even, VU C_ S - y, INK-~(U)I >_ IuI, where x is any vertex of Dm and y is any vertex of D ~2. Conditions 1,2,3 of the Theorem are special cases of Hall's conditions with U - Do U ... U Dk, k - 0 , . . . , [-~-~], and in addition for m even with U - Do U ... U Din__ - y. On the other hand these special cases imply the 2 full Hall's conditions by the nesting of neighborhoods (Equation (1.2)). 9 Chvs [Chv72] has given a sufficient condition for a general graph to be Hamiltonian" Let dl __j, where (,) follows from the existence of the special matching. Hence by the assumption dj _< j, we have dj - j - I K I - rsrcontradicting j < ~. Therefore rn is odd, or rn is even and j r y. Then we have (o) dj (*-) I N K ( { 1 , . . . , j } ) I - 1 + INK_~({1,...,j})I--> 1 + j , where (,) follows from the nesting property (Equation 1.2)), and (o) follows from the existence of the special matching. This contradicts the assumption dj _<j. Case 2" j C K. Then dj _> [{j + 1 , . . . , n } l - n j > ~, contradicting the assumption dj < j < ~. "If"" This part follows from Chv/~tal's general sufficiency condition (1.23), but the proof below does not use his result. Since no special matching of size IS[ exists, Hall's conditions are violated. Therefore there exists a subset U C_ S (y ~' U in case rn is even) such that < Iu[. Let j be the largest element of U. We may extend U to { 1 , . . . , j} because this does not change NK-x(U). Thus there exists an element j E S such that ]NK_~(j)I- [NK_~({1,..-,J})l < J,

(1.24)

and therefore dj ~_ j. Choose the smallest j satisfying (1.24). We now have (,) d j - 1 - I N K _ ~ ( j ) I >_ I N K _ ~ ( j - 1)1 >_ j -

1 _> d j - 1,

where (,) follows from the minimality of j. Therefore dj - j. It remains to show t h a t j < ~. I f j >_ ~ , t h e n dj >_ ~, and since IS] >_j, IKI >_ dj, and I S ' I + I K ] - n, we h a v e j - dj - ~ - I S ] II(]. Therefore NK(j) -- K_ and j is adjacent to x. By the minimality of j, there exists a matching of

1.7

Total Coverings and Total Matchings

27

{ 1 , . . . , j - 1} into K - x, which can be augmented by the edge j x to form a special matching, a contradiction. Therefore j < ~. 9 R e m a r k . The "only if" part of the above theorem can just as easily be proved using the existence of a Hamiltonian cycle and not the special matching. We conclude with the following result" 1.6.10 Let G be a graph with LCL 1 and with no induced P4 or C4. Then G has cycles of length 3 , . . . , l .

Theorem

P r o o f . If C is a cycle of length k + 1 > 4, consider four consecutive vertices a, b, c, d along C. Then either ac or bd is an edge of G, for otherwise G has a P4 or C4. Therefore by removing b or c from C, we obtain a cycle of length k. 9 In particular, Theorem 1.6.10 applies to threshold graphs by condition 2 of Theorem 1.2.4. Thus a Hamiltonian threshold graph G = (V, E) is also pancyclic (i.e., has cycles of all lengths 3 , . . . , IVI). We note that the graphs without induced P4 or 6'4 are studied by Golumbic [Go178] under the name "trivially perfect graphs".

1.7

Total Coverings and Total Matchings

In this section we define total coverings, total matchings and related parameters and determine these parameters for threshold graphs. All the members of V(G) U E(G) are called elements of G. A vertex v of G is said to cover itself, all edges incident to v, and all vertices adjacent to v. Similarly an edge e of G covers itself, the two end-vertices of e, and all edges adjacent to e. A set At C V U E is called a total cover of G if every element of (7 is covered by some element of At, and At is minimal. Following the notation of [ABLFN77], we call the number c~2(G) - min{ I A t l ' A t is a total cover of G } the total covering number of G, and the sets achieving the minimum are called minimum total covers. Two elements in G are called independent if neither one covers the other. A set Mt C V U E is called a total matching of G if the elements of Mt are pairwise independent, and Mt is maximal. The number ~2(G) - max{ I M t l ' M t is a total matching of G }

28

Threshold Graphs

is called the total matching number of G, and the sets achieving the m a x i m u m are called maximum total matchings. In addition, two other related numbers are defined: a~(G) - max{ I & l ' A t is a total cover of G }, /3;(G) - min{ IMt['Mt is a total matching of G }. ! l We determine a2(G), ~2(G), a2(G ),/32(G ), and related parameters for a threshold graph G.

1.7.1

Main Results

We start with some results about total coverings, complete graphs, and threshold graphs. F a c t 1.7.1 ( [ A B L F N 7 7 ] ) If G is a connected graph of order n > 2, then

1 _ n - 1 . G has a v e r t e x Vl adjacent to every other non-isolated vertex of G. Let At = V(G) - {Vl}. Obviously At covers G, but no proper subset of At covers G, so At is a total ! cover with [Atl - n 1, and a2(G ) >_ n - 1. .. T h e o r e m 1.7.12 If G is a connected threshold graph of order n >_ 2, then

Z;(a)P r o o f . By Fact 1.7.1 and Theorem 1.7.8, we have fl;(a) >_ a2(a) = [w(G)/2]. Recall the proof of Theorem 1.7.8. It is not hard to see that the total covering constructed there is a total matching of size [w(G)/2] in

a, so C o r o l l a r y 1.7.13 If G is a threshold graph, then

t 3 ; ( G ) - [w(G)/2] + number of isolated vertices of G.

1.7.2

Related

Results

By restricting the total coverings and matchings of G = (V, E) to subsets of V or E, we obtain the vertex and edge covering numbers, stability number, and m a x i m u m matching. We give below the values of these parameters. The proofs are simple and are omitted. Let

ao(G) - min{ ]At['At is a total cover of G and At C_ V ( G ) } , fl0(G) - max{ [Mtl" Mt is a total matching of G and Mt C_ V(G) }. F a c t 1.7.14 If G -

(V, E) is a threshold graph, then

ao(G) - w ( G ) - 1 + number of isolated vertices of G,

Z0(a) - I V l -

1.

32

Threshold Graphs

Let c~0(G) - max{ IAtl" At is a total cover of G and At C_ V(G) }, ) - min{ IMtl" Mt is a total matching of G and Mt C_ V(G) }.

r

F a c t 1.7.15 If G is a connected threshold graph of order n >_ 2, then !

%(G) - n - 1, ~0(G) - 1. Let Ctl(G ) /~1((~)

--

--

min{ I & l ' &

is a total cover of G and At C_ E(G)},

max{ I M t l ' M t is a total matching of G and Mt C_ E(G) }.

F a c t 1.7.16 If G is a threshold graph, then the size of a maximum matching of G is/31(G) = [(w(G) + / ) / 2 ] , where 1 is number of edges in a maximum

matching of the bipartite graph obtained by dropping the edges of a maximum clique of G. Consequently, Ol(G) - - - I V ( a ) l 1)/2J C o r o l l a r y 1.7.17 If G is a threshold graph with partition V - P U Q as in Fact 1.7.7, then G has a perfect matching if and only if IV] is even and IN(u~)] _> I P [ - i + 1 for 1

w,.t x of

has a loop in D;

2. for i < F~], no vertex of Di has a loop in D; 3. i f m is odd, every vertex of in D;

Dim1 with at most

4. if m is even, every vertex of

D[~ 1 with

one exception has a loop

at most one exception has no

loop in D. Thus the number of symmetric Ferrets digraphs D satisfying U(D) = G is 1+

ID[~ 1 I

P r o o f . Since D is a symmetric digraph satisfying U(D) = G, the only freedom left in choosing it is to choose which vertices have loops, and this freedom must be used to eliminate all alternating 4-anticircuits. Assume that x C Di, y E Dj, i > j, and xy C E. Then in D we have IF+(x)I > IF+(y)I and (y, x) E A. Since the out-neighborhoods in D are nested, we have (x, x) E A, i.e., x must have a loop in D. For i > [~] and for every x C Di, there exists a y E D [ 2 ], and by Condition 6 of Theorem 1.2.4, we have xy C E. This shows that Condition 1 of the theorem is necessary. Similarly Condition 2 is necessary. If x and y are distinct vertices of D [ ~ ] , then by Condition 6 of Theorem 1.2.4, xy C E if and only if m is odd. Therefore to avoid the forbidden configurations (d) and ( e ) i n Figure 2.2, at most one of x and y can have a non-loop if m is odd, and at most one can have a loop if rn is even. This shows that Conditions 3 and 4 of the theorem are necessary. It is easy to see that these conditions are also sufficient for D to be a Ferrers digraph. " D e f i n i t i o n 2.1.5 ([Fis85]) Let :Y be a finite family of intervals on the line,

let G -

(Z, E) be the graph defined by 1112 E E

if and only if

I1 M I2 # 0,

I1,/2 G 2-

and let D = (2", A) be the digraph (binary relation) defined so that (/1,/2) C A if and only if every point of I1 is to the left of every point of I2. Then G is

2.1

Introduction

37

m

said to be an i n t e r v a l g r a p h , its complement G is said to be a c o i n t e r v a l g r a p h , and D is said to be an i n t e r v a l o r d e r . Interval graphs and interval orders were characterized by Gilmore and Hoffman, and by Fishburn" T h e o r e m 2.1.6 ( [ G H 6 4 , Fis70, Fis85]) 1. A graph G is a cointerval graph if and only if G is a comparability graph (has a transitive orientation) and has no induced subgraph isomorphic to 2K2. 2. A digraph D is an interval order if and only if D is transitive and acyclic (has no directed cycles) and does not have configuration (a) of Figure 2.2. The next result follows easily from Theorem 2.1.6 and the forbidden configurations of Figure 2.2. T h e o r e m 2.1.7 Let D be a loopless digraph. Then D is a Ferrets digraph if and only if D is an interval order. In that case U(D)is a cointerval graph. Conversely, if G is a cointerval graph, then every transitive orientation of G is a loopless Ferrers digraph. D e f i n i t i o n 2.1.8 A graph G - (V, E) is called a difference g r a p h if there exist real numbers av, v C V and T such that 1. [a~] < T

v 6 V;

2. for every pair of distinct vertices u, v C V, uv C E

if and only if

l a u - a v I > T.

It follows immediately that if V is partitioned as V - X U Y, where X { v E V ' a v >_ 0 } and Y - { v 6 V ' a v < 0 }, then X and Y are stable sets in G, i.e., that G is a bipartite graph with bipartition (X, Y). In Section 2.4 we show that for a bipartite graph G with bipartition (X, Y), G is a difference graph if and only if the vicinal preorder of G is total on X, or equivalently on Y. The following result follows therefore from Condition 3 of Theorem 1.2.4. T h e o r e m 2.1.9 A bipartite graph G with bipartition (X, Y) is a difference graph if and only if adding to G all possible edges with both ends in X yields a threshold graph.

38

Ferrers Digraphs and Difference Graphs

Another easy result connecting Ferrers digraphs and difference graphs is the following: T h e o r e m 2.1.10 A digraph D is a Ferrers digraph if and only if its bipartite representation B ( D ) is a difference graph.

Figure 2.4 illustrates the difference graph B ( D ) of the Ferrers digraph D of Figure 2.3. Figure 2.4: The difference graph B(D), where D is the Ferrers digraph of Figure 2.3.

e+ c+

- a~

b-

d+

c-

a+

d-

b+o

m e-

Ferrers Digraphs, Characterizations

2.2

In this section we present several characterizations of Ferrers digraphs, based on Cogis [Cog82a]. The development is analogous to the characterizations of threshold graphs in Section 1.2. The directed analog of the degree partition is given by the following definition. D e f i n i t i o n 2.2.1 (resp in-degrees) 5+ -- 0 (resp. 5o exists). Let D + -

Consider a digraph D whose distinct positive out-degrees are 5+ < ... < 5+ (resp 51 < "'" < 5m-)' and let -- O) (even if no vertex of in-degree 0 or out-degree 0 { v e V ' I F + ( v ) I - 5+} f o r i - 0 , . . . , m + (resp. D~ {v e v - I r - ( v ) l - 67} for i - o, ,m-) The sequence D + D+ (resp. D o , . . . , D ~ _ ) is called the o u t - d e g r e e p a r t i t i o n (resp. i n - d e g r e e p a r t i t i o n ) of D. 9

m+

.

"

.

.

.

~"

9 9 ~

m+

Ferrers Digraphs, C h a r a c t e r i z a t i o n s

2.2

39

To state the appropriate directed analog of the concept of "adding an isolated vertex to a graph", consider a digraph D - (IT, A) and the poset (P+, c_), where P+ - {F+(v), v E V} U {0, V}. Now add a new vertex z to D such that r - ( z ) - ~ and N1 C_ F+(z) C_ N2, where N2 covers N1 in P+. We then note that z has been added as an in-isolated vertex to D. Similarly, to state the directed analog of "adding a dominating vertex to a graph", consider the poset ( P - , C_), where P - - {V-(v), v E V} U { e , V}, and add a new vertex z to D such that P + ( z ) - V and N~ C_ F - ( z ) C_ N2, where N2 covers N1 in P - We then note that z has been added as an out-dominating vertex to D. T h e o r e m 2.2.2 ([Cog82a]) Let D - (V,A) be a digraph with out-degree partition and in-degree partition as in Definition 2.2.1. Then the following are equivalent: 1. D is a Ferrets digraph; 2. the out-neighborhoods F+(v), v C V are nested, or equivalently the inneighborhoods F-(v), v C V are nested; 3. D can be constructed from the empty digraph by repeatedly adding an in-isolated or an out-dominating vertex;

~o (a) m+ -- m - (= m,

ay);

(b) 5+ - 5+1 + [D~-_~+11

i - 1 , . . . m; i - 1 , . . . m;

.

(a) m + - m - (= m, say); (b) f o r x e D + and y e D 7, (x,y) e A ~ i + j > m ;

6. there exist real (equivalently non-negative integer) weights w+(v) and w - ( v ) for each v e V and a real (equivalently non-negative integer) threshold t such that

(a)

_ j, D has an arc from every vertex of Di+ to every vertex of D~;. It follows that

r+(D1+) c .-. c r+(D m++ ),

(2.1)

r-(D:)

(2.2)

c .-- c V-(DT~-).

From (2.1), the nesting of the out-neighborhoods, and the definition of Do, we have P + ( D + + ) - D : v --. u D~_. Fromthis and (2.1)it follows that F+(D++_I) C_ D~- U -.. U D~_. However, F+(D++_I) ~ D~- U ... U D~_, since otherwise F - ( D 1 ) F-(D;)F+(D++), contradicting (2.2). Therefore r +(D++_I)

- D 2 U -.. U D~_.

Continuing in this way, we find m

F+ (D+) -

U

D~-

i - 1 , . . . , m +,

(2.3)

D~-

j - 1,..., m-.

(2.4)

k=m+ - i + 1

and similarly m+

F-(D)-) -

U k=m--j+l

From (2.3), it follows that for x E D + and y C D~-, (x, y) r A ~ i + j > m +. Similarly from (2.4), for x C D + and y r D 7, (x,y) C A ~ i+ j > m - . This proves part (a) of Condition 4 and Condition 5, and by taking cardinalities in (2.3) and (2.4), we obtain parts (b) and (c) of Condition 4. 4) =:v 2)" Parts (b),(c)of Condition 4 can be formulated as follows" m

C-

E

ID;I

i-O,...,m

(2.5)

ID+]

i-0,...,m.

(2.6)

k = m - i-t-1

5~-=

~ k=m-i+l

42

Ferrers

and

Digraphs

Difference

Graphs

We show by simultaneous induction on i that + x E Din_ i

m

:- 1-'+(x) -

~

D~-

i-

0,...,m

(2.7)

k=i+l m

r-(y) -

y E D~+~

U D+ k=m-i

i -

O, . . . , rn - 1

(2.8)

((2.8) is trivial for i = - 1). These equations clearly imply the desired Condition 2. By (2.5) and (2.6), both sides of the equations in (2.7) and (2.8) have the same cardinality, and therefore it is enough to show the inclusions r + ( x ) C_ Ukm__i+lD~- for (2.7) and r - ( y ) __DUk~__m_iD + for (2.8). Basis: i - 0. If x E D +, then by definition of Do, I'+(x) C_ Ukm__~D~-, which proves (2.7) for i = 0. If y E D1, then by (2.7) y E r + ( x ) , i.e., x E r - ( y ) , for each x E D +. Therefore D + C_ ['-(y), which proves (2.8)for i - 0 . I n d u c t i o n s t e p : Assume (2.7) and (2.8) hold for i = 0 , . . . , r and prove t h e m for i = r + 1. Let x E Dm-~-l. By (2.8), F - (D~-+I) - Uk~__m_iD + for i - 0 , . . . , r. By taking unions and using F - ( D o ) - e , we obtain F - (U; +1 D~-) - Uk~=m_~ D + ~ x. Hence r + ( x ) C u~n__~+2D~-, which proves (2.7) for i = r + 1. Let y E D~-+2. By (2.7) we have I'+(D+_i) - Uk~__i+l D~- for i - 0 , . . . , r + 1. m .. . , r + l , we h a v e y E F+( D i + n _ i ) for such i. S i n c e y E Uk=i+ID~fori-0, + Hence, since by (2.7) all vertices of D i n _ i have the same out-neighborhood we have I ' - ( y ) _D D+_~ for i - 0 , . . . , r + 1. Taking unions, we obtain Iir+l + rn F - (y) __ ~ t.)i=0 Dm-i - Uk=m-r-1 DTk, which is (2.8) for i - r + 1. 5) =~ 6): If rn = 0, then D has no arcs, and hence zero weights and threshold satisfy Condition 6. Therefore we assume that rn > 0. Put p = ~ ] and b = 2IV[ + 1, and define the following non-negative integer weights and threshold: For every vertex v E D~:,

w+ (v) -

{b

b p+2 -

b ~+l-k

k- 0

k-p+l

~''"

~

p

m

(2.9)

and t-

b p + 2 - 1.

(2.10)

To show that these weights and threshold satisfy Condition 6, we prove the following facts:

2.2

Ferrers Digraphs, Characterizations

43

(a) for ~), ~ C {+, - } , and for x C D/~ and y C D~, wC)(x) + w e ( y ) >

t r

i + j > m;

(b) w+

D~

+w-

D~

p, then

P r o o f of (a)" A s s u m e i + j > m. generality that i > p. If j _< p, then

Then we may assume without loss of

WC)(X) -~- W ( } ( y ) -- b p+2 -- Dm + l - i -~ b j > b p+2 > t,

and if j > p, then

~ (x) +

~

(y) -

bp+ 2 _ b m + l - i + b p+2 - - b m + l - j

> 2b p + 2 - 2b m-p > b p+2 > t.

Conversely, assume i + j p, then w~(x) + w~

- b~ + b p+2 - b m + l - j < b i -t- b p+2 - b i -

P r o o f of (b)" w+

D+

+ w-

D~-

-

p

E

(ID+l + ID;I) z

k=O

b "+1 - 1

21vI b - 1

= b p+I -

1 < t.

t + 1.

44


P r o o f of (c)-

) m-i

bp+2 - bm+l-i -~- E

bk

(IV+l + IV~-I)
m, and from fact (a) w+(x) + w - ( y ) > t, and therefore w+(X)+ w - ( Y ) > t. Conversely, assuming that (X x Y ) N A - e and also w+(X) t, contradicting our assumption w+(X)

t

w+(3) + w - ( 1 )

>

t

w+(3) + w-(2)

>

t

Multiplying the first two inequalities by 2 and adding, and also adding the last four inequalities, we obtain 4w+(1) + 2w+(2)+ 2w+(3)+ 2 w - ( 1 ) + 2 w - ( 2 ) + 4w-(3) < 4t 4t < 2w + (2) + 2w + (3) + 2w-(1) -4- 2w-(2). Therefore 4w + (1) + 4w- (3) < 0, contradicting non-negativity. 2. Recall Theorem 1.3.1. We say that an integer t is a feasible threshold for a threshold graph G - (V, E) if there exists a feasible solution (w,t) for the linear program (1.11). Theorem 1.3.1 asserts that the minimal feasible threshold uniquely determines the corresponding integer weights w for G. Analogously, a non-negative integer t is said to

Ferrers Digraphs and D i f f e r e n c e G r a p h s

46

be a feasible threshold for a Ferrers digraph D = (V, A) if there exist integer weights w+(v), w-(v) for each v C V satisfying Condition 6 of Theorem 2.2.2. Here the minimal feasible threshold does not determine the corresponding weights uniquely, as illustrated by the following two examples: 9

t=5;

w+ -(1,2,4),

w- - ( 2 , 4 , 5 )

or

w + = (1,3,5), w - = (1,3,5). 9

11; w + = (1,2,4,8), w - = (4,8,10,11)

t =

or

w + = (1,2,6,11), w - = (2,6, 10, 11).

2.3

T h e Ferrers D i m e n s i o n

In this section, based on Cogis [Cog82b], we define the Ferrers dimension of a digraph, and show that it generalizes the dimension of a partially ordered set (poset). Known results of Dushnik and Miller and of Ore about the poset dimension are extended to the Ferrers dimension and given independent proofs. It is also shown that computing the Ferrers dimension of a digraph is polynomially equivalent to computing the dimension of a poset. We use here the terminology of digraphs and equivalently of binary relations, whichever is more convenient. We assume that all digraphs are finite. Recall that a p r e o r d e r is a reflexive and transitive relation, a p a r t i a l o r d e r is an antisymmetric preorder, and a l i n e a r o r d e r is a total partial order. D e f i n i t i o n 2.3.1 An embedding of a digraph D = (V, A) in the Cartesian product D1 x ... x Dk of digraphs Di = (V/,Ai) is a mapping f : V V1 • "" x Vk with f ( v ) = ( f l ( v ) , . . . , f k ( v ) ) such that for every x , y C V,

(x,y) E A ~

(fi(x), fi(y)) E Ai for each i.

The following result and definition [DM41] are well-known. T h e o r e m 2.3.2 ( D u s h n i k a n d M i l l e r ) Every partial order P is the intersection of all the linear orders (on the same elements) containing P.

2.3

The Ferrers Dimension

47

D e f i n i t i o n 2.3.3 ( D u s h n i k and Miller) The d i m e n s i o n do(P) of a partial order P is the smallest number of linear orders whose intersection is P. Ore [Ore62] characterized the dimension of a partial order in the following way, which motivates the term "dimension":

T h e o r e m 2.3.4 ( O r e ) For every partial order P, do(P) is the smallest number of linear orders such that P has an embedding in their Cartesian product. Bouchet [Bou71] generalized "partial order" to "digraph" and "linear orders" to "Ferrers digraphs" in Theorem 2.3.2, Definition 2.3.3 and Theorem 2.3.4. These results are as follows.

T h e o r e m 2.3.5 Every digraph D is the intersection of all the Ferrets digraphs (on the same vertices) containing D. P r o o f . Every digraph D = (V, A) satisfies

D-

A (~,y)~tA

Each digraph under the intersection sign is a Ferrers digraph, because it has just one non-arc, and thus cannot contain any of the configurations of Figure 2.2. If the intersection is empty, then D itself is a Ferrers digraph for a similar reason. 9

D e f i n i t i o n 2.3.6 The Ferrers d i m e n s i o n dF(D) of a digraph D is the smallest number of Ferrers digraphs (on the same vertices) whose intersection is D. T h e o r e m 2.3.7 For every digraph D, dF(D) is the smallest number of Ferrets digraphs such that D has an embedding in their Cartesian product. Moreover, each fi in the embedding can be required to be an injection, or in other words, the Ferrets digraphs can be required to have the same set of vertices as D.

P r o o f . For this proof only, denote the smallest number in the theorem by dE(D). If D = O 1 N . - - N Dk, then f, defined by fi(v) = v for each i, is an

48


embedding of D in D1 x ... x Dk. Therefore dE(D) do(P(D)). Assume that D = O1 n ... n Dk where each D~ = (V, A~)is a Ferrers digraph (by Theorem 2.3.7 we may assume that the vertex set of each Di is V). We exhibit an embedding f of P(D) = (12(D), C_) in the Cartesian product of k copies of the linear order (N, >) (N denotes the non-negative integers) as follows. For each R 6 I;(D) and for each i, put

fi(R) - min r6R

IVZ( )l

where F + refers to the Ferrers digraph Di, and put

f(R)-(fl(R),...,A(R)). To show that f is an embedding, we need to show that for every R, S C V(D), R C_ S if and only if I(R) >_ f ( S ) , i.e., if and only if f~(R) >_ f~(S) for all i. The "only if" part follows directly from the definition of fi(R) as a minimum. For the "if" part, we have for some ri C R and si C S"

Ir+(,,)l- min lr+(r)l- A(R) > f~(S)- min Ir+(s)lr6R

--

s6S

Therefore, since the out-neighborhoods of Di are nested, we have for all i and all r E R,

2 Hence for all r G R, -

N

D

i

N

(2.12)

i

where F refers to D. Now there are two cases regarding S. C a s e 1" S - 1/1(v) for some v E V. Then s~ E 1/1(v) for each i, which means that nj F+(si) - F+(si) _D F+(v). By taking the intersection over i we have

i

i

d

which gives with (2.12) F+(r) D r+(v) for all r E R. This means that r C 1/1(v) - S for all r E R, giving the required conclusion R C S. C a s e 2" S V 2 ( v ) - r - ( v ) f o r some v E V. Since si E S, we have (si, v) C A C_ Ai for all i. By definition of si, and since the out-neighborhoods of Di are nested, this implies that (r, v) C Ai for all i and all r C R. Since

52


D - D1 ffl ... f'l Dk, this gives (r,v) E A for all r E R, and therefore R c_ s. Since P(D) has an embedding in the Cartesian product of k linear orders, it follows from Theorem 2.3.4 that do(P(D)) _

do(P(D)). Now we show the reverse inequality dr(D) < do(P(D)). Let f be an embedding of P(D) - (12(D), C_) in the Cartesian product (C, _ t. P r o o f . Use Theorems 2.4.3 and 1.2.4. We only sketch the argument that if G x is a threshold graph, then G satisfies Condition 4. We may assume without loss of generality that Xo - Y0 - O. Then Do - ~ and each Di is entirely contained in X or in Y, where D o , . . . , Dm is the degree partitiori of G x . For x r Di C_ X and y r Dj C_ Y we have by Condition 6 of Theorem 1.2.4 that xy C E if and only i f j - s , s - 1 , . . . , s - i + 1, in other words i + j > s. Similarly xy E E if and only if i + j > t. Hence s - t and the result follows. 9 C o r o l l a r y 2.4.5 A bipartite graph is a difference graph if and only if every induced subgraph without isolated vertices has on each side of the bipartition a dominating vertex, that is, a vertex adjacent to all the vertices on the other side of the bipartition.

56


P r o o f . If G is a difference graph with bipartition (X, Y) and without isolated vertices, then by Condition 3 of Theorem 2.4.4, the vertices of largest degrees in X and Y are dominating in G. Since by Condition 2 of Theorem 2.4.4 every induced subgraph of a difference graph is itself a difference graph, it has the same property. Conversely, the condition in our theorem implies Condition 2 of Theorem 2.4.4, which means that G is a difference graph with bipartition (X, Y). In the following, we give another forbidden subgraph characterization for a bipartite graph to be a difference graph. P r o p o s i t i o n 2.4.6

1. A bipartite graph is a difference graph if and only if it has at most one non-singleton connected component, and this component, if any, is a difference graph. 2. A connected bipartite graph is a difference graph if and only if it has no induced Ps, the path on five vertices. P r o o f . (1) follows easily from Condition 2 of Theorem 2.4.4. The necessity part of (2) also follows easily from the same condition. We prove the sufficiency part of (2) by showing that forbidding P5 implies Condition 2 of Theorem 2.4.4. Let Xl, X2 ~ X , Yl, Y2 E Y satisfy XlYl, x2y2 E E, xly2, x2yl ~ E. Since G is connected, it has a path from Xl to y2; let P be a shortest such path. Then P is an induced path having at least two edges distinct from X l y 1 and x2y2. But then P and G have an induced Ps, contradicting the given condition. .. The following is a forbidden subgraph characterization for an arbitrary graph to be a difference graph. P r o p o s i t i o n 2.4.7 A graph is a difference graph if and only if it has no triangle, no induced 2K2, and no induced pentagon. P r o o f . If G is a difference graph, then G is bipartite, so G has no triangle and no induced pentagon. From Condition 2 of Theorem 2.4.4, G has no induced 2K2 either. Conversely, if G has no triangle or induced 2K2 or pentagon, then it follows easily that G is bipartite and satisfies the conditions of Proposition 2.4.6, so G is a difference graph. []

Difference Graphs

2.4

57

T h e following characterizations of difference graphs is analogous to Condition 3 of T h e o r e m 1.2.4. 2 . 4 . 8 Let X N Y - 0 . A graph G on X U Y is a difference graph with bipartition (X, Y) if and only if there exist partitions ( X 1, X 2) of X and ( y 1 , y 2 ) of Y such that Proposition

o

The subgraph of G induced by X I [.J y1 is complete bipartite with bipartition ( X 1, y 1 ) .

2. N ( X 2) ~ ]/1 and

N(Y 2) C X l ;

3. the vicinal preorder is linear on X 2 and on y2. P r o o f . " O n l y if"" Using Condition 4 of T h e o r e m 2.4.4, choose any integer r such t h a t 0 _< r _< t, and set X 1 - Xr U "'" U Xt, y1 _ Yt+l-r U "'" U Yt, X 2 - X - X 1, y2 _ y _ y1. T h e n Conditions 1 and 2 are satisfied. Condition 3 is also satisfied by Condition 2 of T h e o r e m 2.4.4. " I f " " Note t h a t G is bipartite with bipartition (X, Y) by Conditions 1 and 2. We show t h a t Condition 2 of T h e o r e m 2.4.4 is satisfied. Assume xl, x2 E X , Yl, Y2 E Y and XlYl, x2Y2 E E. By s y m m e t r y , there are three cases to consider: 1 9 Xl, X2 ~ X 1 , yl E Y 1 , y2 E y 1 U y 2 . In this case, x2yl E E. 2. Xl,X 2 ~ X 1, Yl, 92 C y2. W i t h o u t loss of generality, assume N ( y l ) C_

N(y2). T h e n y2xl ~ F__.,. 3 9 Xl E X 1 , x2 C X 2 , y2 E y1 , yl E }f 2. In this case xly2 C E. In each case, xly2, x2yl ~ E is impossible, proving the required condition.

9

Chapter 3 Degree Sequences 3.1

Graphical Degree Sequences

A g r a p h i c a l d e g r e e s e q u e n c e (or just a g r a p h i c a l s e q u e n c e or a d e g r e e s e q u e n c e ) is the sequence of vertex degrees of a simple graph, i.e., a finite undirected graph without loops or multiple edges. Every graph is said to be a realization of its degree sequence. The theory of graphical degree sequences is intimately connected to degree sequences of threshold graphs. In this section we list eight criteria for a graphical degree sequence. One criterion is that the sequence is non-negative and integer and is majorized in the sense of HardyLittlewood-Pdlya by a threshold sequence. In Section 3.2, the threshold sequences are characterized as those graphical sequences that satisfy many of the inequalities in the above criteria with equality, as well as by having a unique labeled realization. In Section 3.3 we study the convex hull of graphical degree sequences of a given length, and the threshold sequences turn out to be its extreme points, among other results. The final section, Section 3.4, shows that the difference sequences play among bipartite degree sequences a role analogous to the one played by the threshold sequences among all graphical sequences. To state the criteria we need some definitions, starting with majorization. For many characterizations and applications of majorization, see HardyLittlewood-Pdlya [HLP52] and Marshall and Olkin [MO79]. D e f i n i t i o n 3.1.1 Let a = ( a l , . . . , an) and b = ( b l , . . . , b~) be real vectors of length n. Denote the i-th largest component of a (resp. b) by a[~] (resp. b[~]). 59

60

Degree

Sequences

We say that a m a j o r i z e s b, denoted by a ~ b, if k

k

a[i]>~b[i I i=1

fork-l,...,n,

i=1

with equality for k - n. The majorization is s t r i c t , denoted by a ~- b, if at least one of the inequalities is strict, namely if a is not a permutation of b. The relation a ~ b is a precise formulation of the intuitive idea that the common sum of the components of a and b is distributed more evenly in b than in a. Let us remark that our notation is a little different than the one used in [HLP52, MO79]. These authors use ~- for majorization and do not use a special notation for strict majorization. The main result about majorization in integers that we use here was proved by Muirhead [Mui03] in 1903. To state it we define unit transformation. D e f i n i t i o n 3.1.2 If a is a vector and ai >_ aj+2, then the following operation is called a u n i t t r a n s f o r m a t i o n from i to j on a" subtract 1 from ai and add 1 to aj. Clearly if a ~ is obtained from a by a sequence of unit transformations, then a ~- a'. Muirhead's result is that the converse is also true for integer vectors. T h e o r e m 3.1.3 ( M u i r h e a d L e m m a ) If a and b are integer vectors and a ~ b, then some permutation of b can be obtained from a by a finite sequence of unit transformations. (Had we allowed ai > aj + 1 in the definition of a unit transformation, then b itself could have been obtained, because any two components could now be exchanged by repeated unit transformations.) P r o o f . We generally follow the proof of 5.D.1 of [MO79]. Another proof can be obtained by modifying the proof of 2.B.1 in [MO79]. Assume without loss of generality that a l > ' ' ' >_ a n and bl _> ... >_ b~. Also assume that a -~ b, for otherwise we are done. Then there is a largest index 1 for which l

l

i=1

i=1

3.1

Graphical

Degree

Sequences

61

Consequently, since a ~ b, we have l+l

l+l

E b,- E i=1

i=1

Hence bl+l > al+l, and there is a largest index k _< 1 for which bk < ak. Therefore ak >

bk >_ b l + l

> al+l.

Let a' be obtained from a by a unit transformation from k to 1+ 1. We show below that a ~ ~ b, and hence a repetition of the argument with a ~ replacing a will eventually obtain b. Clearly }-'~i=1 ai' > -- ~ i = 1 b i h o l d s f o r a l l r < k a n d all r > 1. If this inequality failed for some r - k , . . . , l, namely Y'~r'~i=Iai! < Eir=.l bi, then since at+ 1 ' i}l.

62

Degree Sequences

The D u r f e e n u m b e r

of d is given by f - f(d) -I{j'dj

> J}l.

It is convenient to visualize d by means of the Ferrers diagram of d, an n x n 0/1 matrix whose row sums are d l , . . . , d ~ and in which the l's in each row are left-justified. For example, the Ferrers diagram of the sequence d = (4, 2, 2, 0, 0) is illustrated in Figure 2.3. The conjugate sequence is then the sequence of column sums of the Ferrers diagram, in this case d* = (3,3, 1, 1 , 0 , . . . ) , and the Durfee number is the length of the largest principal submatrix that is full of l's, in this case f = 2. The complementary submatrix of this principal submatrix, i.e., the submatrix of rows and columns f + 1 , . . . , n, is entirely zero. We say that a vector d = ( a l l , . . . , dn) is a proper sequence if 1. the di are integers; 2. n - 1

>_ dl>_ . . . >_dn >_0.

Clearly when trying to characterize graphical degree sequences, we may restrict attention to proper sequences. A second convenient way to visualize a proper sequence d, which turns out to be more useful for us here than Ferrers diagrams, is by means of its corrected Ferrers diagram [Ber76, FF62], defined as follows. D e f i n i t i o n 3.1.6 Let d - ( d l , . . . , d n ) be a proper degree sequence. The corrected Ferrers d i a g r a m of d is an n z n matrix C - C ( d ) of 0 's, 1 's a n d , 's (special kind of 0 's) such that 1. all the diagonal entries and no others are , ' s ; 2. the row sums are d l , . . . ,

dn;

3. the 1 's in each row are to the left of the 0 's in that row. The corrected conjugate sequence of d is the sequence d' of column sums of C. It is given by the formula d~ - I{i " i < k and di >_ k - 1 } l

+ l{i " i > k and di >_ k}l.

3.1

Graphical

Degree

63

Sequences

The c o r r e c t e d D u r f e e n u m b e r

of d is defined by ~ j - 1}1.

m - re(d) - I { j ' d j

An example is shown in Figure 3.1. Figure 3.1: The corrected Ferrers diagram C(d) of the proper sequence d - (2, 2, 2, 2, 2). The corrected conjugate sequence is d' - (4, 4, 2, 0, 0). The corrected Durfee number is ram3.

*

1

1

0

0

1

*

1

0

0

1

1

*

0

0

1

1

0

*

0

1

1

0

0

*

Again m is the size of the largest principal submatrix of C that is full of l's and *'s, and the complementary submatrix is all O's and *'s. It is easy to see from the definitions of f and m that d]+l < f. If equality holds, then m - f + 1 and d~+ 1 - d/+l; otherwise m - f. The corrected conjugate sequence d t need not be proper, because it can have an increase, but the only possible increase occurs when m = f, namely d~ - f - 1 and d~+ 1 - f. When a corrected Ferrers diagram C(d) is symmetric, it is the adjacency matrix of a graph G with degree sequence d. By the definition of C(d), the vicinal preorder of G is total, and so, by Condition 5 of Theorem 1.2.4, G is a threshold graph and d is a threshold sequence. Every threshold sequence arises in this way by the same result. We are now ready to state the eight criteria for a proper sequence to be graphical. Our proof here is adapted from [SH91]. Criterion 9 is from Peled and Srinivasan [PS89b]. T h e o r e m 3.1.7' ( S i e r k s m a a n d H o o g e v e e n ) Let d - ( d l , . . . , d n ) proper sequence with ~i~=1 di even. Then the following are equivalent:

be a

Degree

64

Sequences

1. d is a graphical sequence;

2. the R y s e r Criterion" there exists a bipartite graph in which the degrees of each color class are all,..., tin, where cli-[ di+l,

[

di,

fori-l,...,f for i - f +

(3.1)

1,...,n;

3. the B e r g e Criterion: k

k

di 5): Given k and r, every proper sequence d satisfies n

k(k-1) + ~

min{k, di} -

i=k+l n~r

n

k(k - 1) + ~ i=k+l

i=n-r+l

n-r

k(k-1)+

min{k, di } 6)- Given k, there exists some 1, 0 _< l _< k, such that d l , . . . , d/ > k - 1 ~nd dl+l,..., dk < k - 1. Substituting 1 for k in 5), we have l

n

~-~ di < l ( n - r - 1 ) - + i=1

di

for each r - 0 , . . . , n - 1.

i=n-r+l

Choose r - n -

k in the above to obtain l

n

E di < l ( k - 1 ) + ~ i=1

di.

i=k+l

Therefore k

n

k

i=k+l

i=I+1

~ di+l(k-1)+ ~

i=1

n

=

~ i=k+l

di

k

di+~min{k-l,

di}.

i=1

6) ~ 7)" Inequality (3.5) is seen to be equivalent to the following inequality" n

k

k

k

di >_~_. di + ~ m a x { - d i , - k + 1} - ~ max{0, d i - k + 1} i=k+l

i=1

i=1

i=1

3.1

Graphical

Degree

Sequences

67

k

k

= y ~ ( m a x { k - 1 , d i } - k + 1) - ~--~max{k - 1, di} - k ( k i=1

1).

i=1

7) ~ 8)- Put l - d~. Since k _< f, we have k _< 1 _< n, and dl,..., dl > k, d r + l , . . . , d~ _< k - 1 (refer to the associated Ferrers diagram). For the same reason d~ - d ~ - 1 for i - 1 , . . . , k. We then have, using (3.6) for 1 to get the first inequality, k

k

k

n

d'i - y~(d* - l) - ~ d: - k - kl + ~_~ d i - k i=1

i=1

i=1

i=/+1

l

>_ k ( 1 - 1 ) + ~ m a x { / -

1, di} - l ( 1 -

1)

i=1

l

= (k- l)(1- 1)+ ~max{/-

1, di}

i=1

k

l

>_ ( k - l ) ( 1 - 1 ) + ~ di + ~ i=1

(l-l)

i=k+l

k

= Edi. i=1

8) =~ 9): Set e = d and let C be the corrected Ferrets diagram of e. We present an algorithm that transforms C and with it its vector e of row sums. The algorithm is shown in Figure 3.2 and an example is illustrated in Figure 3.3. We remark that in the intermediate steps of the algorithm e need not be proper, since e l , . . . , e / m a y be unsorted and e 2 , . . . , e/_l may exceed n - 1, as can be seen by running the algorithm on the proper degree sequence (9, 9, 9, 9, 8, 4, 4, 4, 4, 4, 4). However, we prove below that at termination C(e) becomes symmetric, and this guarantees that e is proper, since the original e is proper and the column sums e ll , . . . , e} can only decrease during the algorithm. We show below that the algorithm is not stuck when i = 1: the problem is to show that when i = 1, s is a non-negative even integer (if s were negative or odd, the algorithm would be required to transfer l's above the first row, an impossibility). To see this, we show that e satisfies the H/isselbarth Condition after each iteration with i > 1. Denote by e and e' the row and column sums at the beginning of the iteration and by ~ and ~' these sums at the end of the iteration. We assume by induction on f - i that e satisfies (3.7) and show

68

Degree Sequences

Figure 3.2: An algorithm transforming a graphical sequence into a threshold sequence. for i := f d o w n t o 1 do ! 8 "-- e i -- el,

if s < 0 t h e n transfer the last Isl l's of row i to the end of row i - 1; if s > 0 t h e n do transfer the last [~J l's of c o l u m n / t o the end of row i; if s is odd t h e n transfer the last 1 of column i to the end of row i - 1 od od

t h a t ~ does t h e s a m e . Since ek -- ek a n d e^'k - e 'k for k - 1 , . . . , i - 2, as well as e~ ^ - ek for k - i , . . ., f , we o n l y h a v e to show t h a t ~2k=li-1 ek __~ ~-'~k=li-1e~. ^ If s < 0, t h e n i-1

i-1

i-1

i

E ~J- E ~ + I~1- E ~J + ~ - ~'~- E ~J- ~'~

j=l

j=l

j--1

i

i-1

j=l

i-1

-< E ~ - ~'~- E ~ - E ~. j=l

i-1

j=l

i-1

j=l

i - 1 ^!

If s > 0 is e v e n , ~ j = l ej -- ~'~'~j=l e j a n d ~-~j=l ej c o n c l u s i o n is d i r e c t . If s > 0 is o d d , we h a v e i-1

i-1

i-1

i-1 ! ~--~j=l ej, a n d

SO t h e

i-1

E e J - E e J + l < E e ~ + l - E e ~ +1. j=l

j=l

j=l

j=l

E q u a l i t y c a n n o t h o l d here, since this w o u l d i m p l y , t o g e t h e r w i t h t h e e q u a l i t i e s ~j - ~} for j - i , . . . , f , t h a t E j n = l ~j - E j n = l d j is o d d , c o n t r a r y to t h e a s s u m p t i o n of t h e t h e o r e m . T h e r e f o r e ~ satisfies (3.7) in this case too. It n o w follows t h a t w h e n i - 1, s is an e v e n n o n - n e g a t i v e i n t e g e r , a n d t h e r e f o r e t h e a l g o r i t h m does t e r m i n a t e : t h a t s is e v e n follows f r o m t h e f a c t s t h a t ~ kn= l ek is e v e n a n d e !k - ek for k - 2 , . . . , m; t h a t s _> 0 follows f r o m (3.7) for k - 1. A t t e r m i n a t i o n t h e l ' s of rows 1 , . . . , m of C are to t h e left of t h e 0's of t h e s e rows, a n d t h e l ' s of c o l u m n s 1 , . . . , m are a b o v e t h e 0's of t h e s e

3.1

Graphical Degree Sequences

69

Figure 3.3: Illustrating the algorithm in the proof that 8) =~ 9) on the example d (8, 8, 3, 3, 3, 3, 3, 3, 3, 1). Blank entries are O's or ,'s. The row sums e and column sums e ~ are shown in the margins of C. The horizontal and vertical separating lines indicate that f - 3. The moving l's are shown in boxes. 19

8

8

3

2

2

2

2

2

1

1

1

1

1

1

1

1

1

1

1

1

0

8

5

3

3

3

2

2

2

.

1

1

1

1

1

1

1

1

1

.

1

1

1

1

1

1

1 ~-~

1

1

.

1

1

1

1

1

1

1

1

1

1

1

1

~19

1

1

1 [-i~

1

1

1

1 [-i~

1

1

1

1 [-i-]

1

1

1

1

i=3

i-2

s-5

s=-I 8

5

3

.

1

1

1

1

1

1

1

.

1

1

1

1

1

1

1

.

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

1

I 9

3

1

i=1 s-0

3

2

2

2

1

1

1

1

1

1

1

70

Degree Sequences

columns. This fact and the equalities ei - ei~ for i - 1 , . . . , m guarantee that C is symmetric, and hence it is the adjacency matrix of a graphs G with degree sequence e. Clearly the vicinal preorder of G is total, so G is a threshold graph, and hence e is a threshold sequence. The only operations the algorithm performs on C are to repeatedly transfer a 1 from a row to a higher row. Since d is proper, the net effect is the transfer of l's from components of d into larger or equal components, and consequently e N d. 9) =~ 1): This follows from Corollary 3.1.4. .. F a c t 3.1,8 1. The Hh'sselbarth Criterion is part of the Berge Criterion, but is fully equivalent to it, as follows f r o m the theorem. It could also be formulated as k

k

i=1

i=1

for each k -

1, . . . , m

because, as we have noted, m - f or else m - f + 1,din - d~. 2. As we have seen in the proof above, the Bollobds and Griinbaum Criteria (3.5) and (3.6) are just two f o r m s of the same inequality. 3. If in (3.6) we make the substitution

di - n - 1 -

dn+a-i,

where d is the proper complementary sequence, and perform some sireple manipulations like the ones used in the proof of 6) ~ 7), we obtain n - - ]r

n

di_ deg(k), yet k has a neighbor 1 that is not a neighbor of j, contrary to Condition 6 of Theorem 1.2.4. Since the majorization e ~ d is strict, r > 0, and thus the realization G ~ of d is a non-threshold graph. Therefore d cannot have any realization that is a threshold graph, because threshold sequences have a unique realization by Theorem 3.2.1. 8) =~ 1)" Assume that, if possible, d satisfies 8) but is not a threshold sequence. By 9) of Theorem 3.1.7, there exists a threshold sequence e satisfying e ~ d, and since d itself is not a threshold sequence, the majorization must be strict. But this contradicts 8). .. Characterization 8 above is also discussed in [RG79].

3.3

The Polytope of Degree Sequences

This section discusses the special role played by the threshold sequences in the polytope of degree sequences. It is based on [PS89b] and uses linearprogramming duality and the structure of threshold graphs to show that the threshold sequences are the extreme points of this polytope, to characterize adjacencies on it, to provide a totally-dual-integral system of linear inequalities for it, and to describe its facets. Some of these results were obtained earlier by Koren [Kor73], using the ErdSs-Gallai inequalities. Our approach here fits naturally the modern trend of using LP duality and some combinatorics to obtain polyhedral results. We use here the shorter terminology "degree sequence" for "graphical degree sequence". D e f i n i t i o n 3.3.1 Let n be a positive integer. We denote by T)Sn the collec-

tion of all graphical sequences of length n, not necessarily proper, i.e., ~ ) ~ n -- { ( X l , - . . ,

Xn) " (Xl,... ,Xn) is a graphical sequence}.

76

Degree Sequences

The p o l y t o p e of d e g r e e s e q u e n c e s ~)n is defined as the convex hull of DS~, 7?n = conv(DSn).

3.3.1

Extreme Points

Consider the following combinatorial optimization problem: given a real vector (Cl,...,c~), n

maximize

~

CiX i

i:1

subject to

(3.10)

(xl,...,x~)ED3~,

or equivalently n

maximize

~ cixi i=1

subject to

(3.11)

(Xl,...,x~)CD~.

Figure 3.4 presents a very simple greedy algorithm producing an optimal solution for the problems (3.10) and (3.11). Figure 3.4: The Greedy Algorithm for problem (3.11). v := { 1 , . . . , n } ;

E :=O; for all {i, j} C_V do if ci + cj > 0

then E := E U ij od; (Xl,..., Xn) : - - degree sequence of G = (V, E).

Fact 3.3.2 1. It is clear that the algorithm produces an optimum for problem (3.10) and therefore also for problem (3.11), because adding the edge ij to E adds ci + cj to the objective function.

2. If ci + cj = 0 for some {i,j} C_ V, we have the freedom of not adding the edge ij to E and still obtain an optimal solution. This freedom is sufficient to obtain all graphs with optimal degree sequences. 3. In particular, the following conditions are equivalent:

3.3

77


(a) c~ + cj 7~ 0 for all {i,j} C_ V; (b) there is only one optimal degree sequence; (c) there is only one graph on the vertex set { 1 , . . . , n} with an optimal degree sequence. .

.

The algorithm can be thought of as the usual greedy algorithm of the uniform matroid on the edges of the complete graph with edge costs ci + cj. The graph G produced by the algorithm is always a threshold graph. Indeed it satisfies Condition 5 of Theorem 1.2.~, namely the vicinal preorder on V is total and coincides with the numerical order of the components ci.

We now have enough information to characterize the extreme points of Dn, a result originally proved by Koren [Kor73]. T h e o r e m 3 . 3 . 3 A degree sequence is an extreme point of T)~ if and only if

it is a threshold sequence. P r o o f . " O n l y if"" Let d be an extreme point of T~n. Then for some objective function Ein__=l CiXi, d is the unique optimum of the linear program (3.11), so the Greedy Algorithm must produce d on input (Cl,..., c~). But by Part 5 of Fact 3.3.2 the algorithm always produces a threshold sequence. Hence d is a threshold sequence. " I f " : Let d = ( d l , . . . , dn) be a threshold sequence, and let G = (V, E) be a realization of d with degree partition D o , . . . , Din. Choose c = ( c l , . . . , c~) as follows: for each vertex i E Dj, take 2j-m-l,

ci-

2j-m,

if j_ ~.

This choice of c is illustrated in Figure 3.5 for the cases m = 5 and m = 6. It is easy to check that the Greedy Algorithm with input (Cl,...,cn) outputs precisely the graph G = (V, E) and its degree sequence d. Moreover, by Part 3 of Fact 3.3.2, d is the unique optimum of the linear program (3.11). Therefore d is an extreme point of T)n. 9 As a corollary, we give a new proof of Theorem 3.2.1 without using the theorem of Havel-Hakimi. This was originally shown by Koren [Kor73].

78

Degree Sequences

Figure 3.5: Examples illustrating the proof of the "if" part of Theorem 3.3.3. A line between sets Di and Dj indicates that every vertex of Di is joined to every vertex of Dj. The ovals on the left indicate cliques.

Ira-51

Ira-01 -6

-4

3

6

|

@

-7

@

-5 -3

I

2

-1

3.3.4 (Theorem 3.2.1) A degree sequence is threshold if and only if it has a unique labeled realization.

Corollary

P r o o f . Let d be a degree sequence having realization G - (V, E) with V = { 1 , . . . , n}. Then the following statements are equivalent" 9 d is a threshold sequence; 9 d is an extreme point of 7Pn; 9 d is the unique o p t i m u m of the linear program (3.11) for some c; 9 G is the unique realization of d on the vertex set V (by Part 3 of Fact 3.3.2).

3.3.2

Adjacency

We now develop criteria for two extreme points of ~)n to be adjacent and produce a formula for the number of threshold sequences adjacent to a given one.

3.3


79

L e m m a 3.3.5 Let T1 = (V, E1) and T2 = (V, E2) be two threshold graphs with degree sequences f and g, respectively. If IE1/~ E21 >_ 2, then f and g are not adjacent extreme points of ~Dn (A denotes symmetric difference). P r o o f . Assume that, if possible, f and g are adjacent extreme points of Z)n. Then by Theorem 3.3.3 and the definition of adjacency there exists a c - ( C l , . . . , c~) such that c. x - E ~ I c~x~ is maximized over the threshold sequences only by f and g. Hence the Greedy Algorithm on input c will produce one of T1 and T2, say T1. Let e = ij C E1/~ E2. We have ci + cj = 0 (for otherwise e must be in all or none of the optimal graphs), and hence e C E1 according to the Greedy Algorithm. Put E3 = E1 - e . We assert that T3 = (V, E3) is a threshold graph. This is a contradiction, since T3 is different from both T1 and T2, hence its degree sequence h differs from both f and g by Corollary 3.3.4, yet h maximizes c.x. To prove the assertion, assume T3 is non-threshold. Then by Condition 2 of Theorem 1.2.4 T3 has an alternating 4-cycle, but T1 has none, so the alternating 4-cycle must involve the nonedge ij. In other words, there exist two other vertices k, 1 with ik, j l C E3 and ij, kl ~ E3. This means that ik, j l C E1 and kl C} E l , and by the Greedy Algorithm we have ci + ck >_ 0, cj + cl _> 0, ck + cl < 0, and, as noted above, ci + cj = 0. This gives the contradiction ci + cj + ck + ct >_ 0 > ci + cj + ck + cl. To prove the converse of Lemma 3.3.5, we determine in the next lemma which single edges can be dropped from or added to a threshold graph to obtain another one. L e m m a 3.3.6 Let G = (V,E) be a threshold graph with degree partition D 0 , . . . , D m , and let x C Di, y C Dj be distinct vertices. 1. If e = xy C E, the graph G - e obtained by dropping e from G is a threshold graph if and only if i + j = rn + 1. 2. If e = xy ~ E, the graph G + e obtained by adding e to G is a threshold graph if and only if i + j = rn. P r o o f . Recall that by Condition 6 of Theorem 1.2.4, xy C E if and only if i+j>rn. We begin by proving Part 1. Given that i + j = rn + 1, assume that, if possible, G - e is not a threshold graph. By Theorem 1.2.4, G - e has an alternating 4-cycle, but G does not. Therefore there exist two other vertices z , w C V satisfying xz, yw C E, zw ~ E. Let z C Dk, w C Dl. Then

80

Degree Sequences

i+k_> m+l,j+l_> m + l , and k + l < m+l. Adding the first two of these inequalities on the one hand, and adding the third one to i + j - m + 1 on the other hand, results in a contradiction. Conversely, given that i + j > r e + l , it follows that ( m + l - i ) + ( m + l - j ) < m + 1. Since e is an edge of G, we have 1 _< i , j 2 (for if ID[~] I - 1, then the single vertex of D [ ~ 1 has the same degree as the vertices of D [ ~ j , contrary to the definition of a degree partition). Similarly there exists a vertex w E Dm+l-j such t h a t w =fig and hence yw C E. If we can choose z and w as distinct vertices, then we also have zw (} E, and G - e has the alternating 4-cycle z z w y , as required. The only possible obstacle to choosing z and w as distinct vertices occurs when i - j and IDm+~-~l- 1. In that case 2i > m + 1 _> 2, hence i _> 2 and Dm+2-i is defined and distinct from Do. We may then choose a v E Dm+2-i such that v r x (the only possible obstacle to doing t h a t is m + 2 - i - i and I D i l - 1, which is not the case since both x and y are in Di). Then c e r t a i n l y w C v , a n d w v ~ E a s ( m + l - i ) + ( m + 2 - i ) _ < m + l , so G - e has the alternating 4-cycle x v w y , as required. We now derive the second part of the l e m m a by applying the first one to the complement G of G, which is also a threshold graph. To do so, we need to determine the degree partition C o , . . . , Ct of G. There are two cases to consider. C a s e 1" D 0 - r e . Then ( C 0 , . . . , C t ) ( e , D m , . . . , D 0 ) , hence l - m + l and Di -- Cm+l-i for i - 0 , . . . , m. Adding to G an edge between Di and Dj results in a threshold graph if and only if dropping from G an edge between Cm+l-i and Cm+l-j results in a threshold graph. By Part 1, the latter holds if and only if (m + 1 - i) + (m + 1 - j) - 1 + 1, i.e., i + j - m. C a s e 2: Do - e . Then ( C o , . . . , C t ) - ( D m , . . . , D 1 ) , hence 1 - m - 1 and Di - Cm-i for i - 1 , . . . , m. Again, adding to G an edge between Di and Dj results in a threshold graph if and only if dropping from G an edge between Cm-i and C~_j results in a threshold graph, which holds if and only if (m - i) + (m - j) - 1 + 1, i.e., i + j - m. " We are now ready to characterize adjacency on ~ . T h e o r e m 3 . 3 . 7 Let T~ - (V, Ea) and T2 - (I7, E2) be the (unique) realizations on the vertez-set V of the threshold sequences f and 9, respectively.

3.3

The P o l y t o p e of Degree Sequences

81

Then f and g are adjacent extreme points of Z)~ if and only if IE, /~ E2I = 1. P r o o f . The "only if" part is Lemma 3.3.5. We prove here the "if" part. Given that E 1 / ~ E2 = {e}, we may assume without loss of generality that E1 = E2 U {e}. Let D o , . . . , Dm be the degree partition of T2, and let e = xy with x C Di and y C Dj. Since adding e to T2 results in the threshold graph T1, we have i + j = m by Part 2 of Lemma 3.3.6. We consider the c given in (3.3.1) and distinguish the following two cases. C a s e 1: i ~ ~. Then one of i and j is larger than [~J and the other one is not, and therefore c~ + cy = 2i - m + 2j - m - 1 = - 1 . In this case define c~ by ,+1 ifv-xorv-y Cv Cv otherwise. C a s e 2- i - j - ~. Then c~ - cy - - 1 . In this case define c' by

_{ 0, ifv-xorv-y Cv

cv, otherwise.

Figure 3.6 illustrates c~in Case l for m = 5, i = 4 a n d j = 1. The graph produced by the Greedy algorithm on input d is also shown. In both cases it is easy to verify that the Greedy Algorithm on input c~ will produce T1 and its degree sequence f. Furthermore, since c~ + c'v - 0 if and only if {u, v} = {x, y}, the only other optimum of (3.10) with objective function c'. x is g by Part 2 of Fact 3.3.2. Therefore f and g are adjacent extreme points of Z)~. .. We now develop a condition for adjacency of two threshold sequences that does not refer to their realizations as threshold graphs. L e m m a 3.3.8 Let f = ( f l , . . . , f ~ ) be a threshold sequence with the unique realization T = (V, El) and g = (91,... ,in) a degree sequence with a realization G = (V, E2), where V = { 1 , . . . , n } . If for some i r j we have

for k - i , j otherwise,

(3.12)

P r o o f . The proof is by induction on min(gi,gj). positive components of g.

Let q be the number of

fk

_ ~ gk + l, ( gk,

then ij E El.

82

Degree

Sequences

Figure 3.6: An illustration of cI and the graph produced by the Greedy Algorithm for m - 5, i - 4, and j - 1, with the conventions of Figure 3.5.

(

t

Do )

-6

y)

-4

D1 -

1

-3 3 3~1 -2

3.3


83

gj) - - 0, say gj - - O. Then the number of positive components of f is q + 1 (resp. q + 2) if g~ is positive (resp. zero), and by Condition 4 of Theorem 1.2.4 the largest degree in T is q (resp. q + 1). On the other hand, the largest degree in G is at most q - 1, and so fk < q - 1 for each k -r i, j. Therefore q (resp. q + 1) - max(fi, fj), and one of the non-isolated vertices i , j of T has the largest degree in T. Therefore ij C E1 by Condition 4 of Theorem 1.2.4. I n d u c t i o n step: min(gi,gj) > 0. In this case the number of positive components of f is also q. If one of f~ and fj is q - 1, then c l e a r l y i j E El. If not, then fl - q - 1 for some 1 5r i,j by Condition 4 of Theorem 1.2.4. Then gl - q - 1, so 1 is adjacent to all non-isolated vertices in both T and G. Delete 1 from both T and G and apply induction to the resulting induced subgraphs. 9 R e m a r k . One might wish to generalize Lemma 3.3.8 by relaxing one of its assumptions. Instead of assuming Condition (3.12), we wish to assume only fk >__gk for all k, and to conclude that by dropping zero or more edges in the unique realization of f we can obtain some realization of g. Under the additional assumptions that (a) g is a threshold sequence and (b) fl >_ "'" >_ f~ and g l ~ "'" ~ gn, the conclusion is true [HIS78]. However, (a) is not enough without (b), as can be seen from the example f - (5, 5, 5, 4, 4, 3), g - (4, 1, 0,2,2,3).

B a s i s : min(gi,

T h e o r e m 3.3.9 The threshold sequences f - ( f l , . . . , f~), g - ( g l , . . . ,gn) are adjacent extreme points of T)n if and only if there exist indices i 7~ j such that 1, f o r k - i , j f k - gk O, otherwise Or

gk - f k -

1, f o r k - i , j O, otherwise.

P r o o f . The "only if" part follows easily from Theorem 3.3.7. For the "if" part, assume without loss of generality that Condition (3.12) holds. Let T1 = (V, El) and/12 - (V, E2) be the unique realizations of f and g respectively on the vertex set V - { 1 , . . . , n } . Then ij E E1 by Lemma 3.3.8, and from the uniqueness of realizations of threshold sequences, it follows that E2 E1 {i j}. The result now follows from Theorem 3.3.7. 9

84

Degree Sequences

The next theorem exhibits the threshold sequences adjacent to a given threshold sequence d without referring to its realization, except for its degree partition, which is easily read from d itself. T h e o r e m 3.3.10 Let f be a threshold sequence of length n and D o , . . . , Dm the degree partition of its realization on the vertex set { 1 , . . . , n}. The threshold sequences g adjacent to f on Dn are precisely the ones obtained in one of the following two ways: 1. choose some r set

0 , . . . , L~] and distinct indices i e D~, j E Dm_~ and fk,+l, fk

gk -

fork-i,j otherwise;

2. choose some r = 1 , . . . , [m@lJ and distinct indices i C DT, j e Dm+l-r and set fk - 1 , for k - i , j gkfk, otherwise.

l

P r o o f . This follows from Lemma 3.3.6 and Theorem 3.3.7. ,, The next result counts the extreme points of D~ that are adjacent to a given one. T h e o r e m 3.3.11 Let f = (f 1 , . . . , f n ) be a threshold sequence with a realization having degree partition D o , . . . , Din. Then the number of extreme points of l?,~ adjacent to f is equal to

ID~lIDm-~l + r=0

ID~IIDm+I-~I + r=l

IDql 2

where q P r o o f . This follows from Theorem 3.3.10. In particular, the binomial coefficient accounts for the cases where i and j are in the same block of the degree partition. " We can now determine the threshold sequences with the largest and smallest number of adjacent sequences. T h e o r e m 3.3.12 Let f = ( f l , . . . , fn) be a threshold sequence with a realization G having degree partition D o , . . . , Din.

3.3

85

T h e P o l y t o p e of D e g r e e Sequences

~

The number of extreme points adjacent to f is a maximum, equal to

if aria only if

of

foUo

g hota "

fi-O

for all i;

9 G is stable, i.e., 9 G is a clique, i.e., fi-n-1

for all i;

9 G is a star, i.e., for some j

fi

_~

fori-j fori #j;

n-l, 1,

9 G is a costar, i.e., for some j fi

,

fori-j

_ J" o,

~-2, fo~i#j.

The number of extreme points adjacent to f is a minimum, equal to n, if and only if [D~ I - 1 for i # 0, [~1 IDFmll

(3.13)

- 2 m

iDol

- n-

y ~ iDol. i=1

P r o o f . 1- By T h e o r e m 3.3.7, the n u m b e r of neighbors of f equals the number of ways in which we can add or drop an edge to or from G and preserve its thresholdness. This is obviously at most (~), and this upper bound is achieved if and only if every present edge can b e dropped and every absent edge can be added. By L e m m a a.a.6 this happens if and only if every present edge joins Di to Dm+l-i for some i and every absent edge joins Di to D,~-i for some i. It follows from this condition that m _< 2, for otherwise G has edges between Dm and Din-1 e v e n though m + ( m - 1) > m + 1. Moreover, if m - 2 we must have ID21 - 1, otherwise there are edges within D2 e v e n though 2 + 2 >~- m + 1. In case m - 0, G has no edges and f - 0. In case m - 1, [D01 must be 0 or 1, for otherwise there are absent edges joining Do to itself even though 0 + 0 < m; thus G is a clique or a costar. In case m - 2

86

Degree Sequences

we must have ] D 2 1 - 1 as observed above, and also ID01- 0, for otherwise there are absent edges between Do and DI even though 0 + 1 < m; thus G is a star in this case. It is easy to see that conversely, the degree sequences of a stable set, a clique, a star and a costar do meet the upper bound. 2" We denote d i - ]Di] and consider the parity of m. C a s e 1" m - 2p. Then dp _> 2 (otherwise the single vertex of D; has the same degree as the vertices of Dp+l). From Theorem 3.3.11, the number of extreme points adjacent to f is

E did2p-i + i=0

did2p-i+l + i=1

= dod2p -Jr-E did2p_i + i=1

did2p_i+l -nti=1

>_ d o + ( p - 1 ) + ( p + l ) + l =

d0+2p+

1.

This lower bound is achieved if and only if equalities (3.13) hold. In that case the value of the bound is do + 2 p - 1 - (n - ~i=1 2; di) + 2p + 1 - n. C a s e 2" m - 2p + 1. Then dp+l > 2 (otherwise the single vertex of D;+I has the same degree as the vertices of D;). The rest of the analysis is similar to Case 1. 9

3.3.3

Linear Description and Facets

In this subsection we use linear programming duality and the structure of threshold graphs to provide a linear description of Dn. Lemma

3 . 3 . 1 3 Let S and T be disjoint subsets of { 1 , . . . , n } .

Then the

inequality xi- ~xi

_< ] S [ ( n - 1 - I T ] )

(3.14)

iES lET is valid for I)n. Moreover, the degree sequence of a graph G satisfies (3.1~) with equality if and only if 1. S is a clique of G; 2. T is a stable set of G; 3. every vertex of S is adjacent to every vertex of S U T in G;

3.3

87

T h e P o l y t o p e of D e g r e e Sequences

~. no vertex of T is adjacent to any vertex of S U T in G.

P r o o f . To show that (3.14) is valid for T~, we may assume that x is the degree sequence of a graph G. Set C - {ij 9 i E S , j E T}. It is easy to see that

x~ _< ISI(n - 1 - I T I ) + ICI,

(3.15)

iES

equality holding if and only if Conditions 1 and 3 of the lemma hold; ,

- ~ xi _ O.

Our remaining task is thus to show that the linear equations (3.25)-(3.26) have a non-negative solution. For each j E { 1 , . . . , n}, let Pj be the set of all indices i such that YS,,T, appears in the j-th equation of (3.25)-(3.26). Then by (3.22)-(3.24) we h~w, forj CI,

Pj -

{i.jcT~}-{iEI'i>j}U{iCK'ij~E}

=

{iEI'i>_j}U{iCK'ci
_ [cil}.

_<j}U{i E I'cj

>_ Ic l}

(3.28)

Let 7r(1),..., ~r(n) be the permutation of 1 , . . . , n that sorts the components of c in nondecreasing absolute values, resolving ties as follows: within I reverse the original order, within K maintain the original order, and place all components of I before equal components of K; in other words, 1. Icr(1)l _~ - ' ' ~ 2. if

Ic (01 -Ic (j)l

for some i -7(:j, then < 0 > 0

if r e I if 7c(i),~r(j) e K,

(a)

(i - j)(Tr(i) - 7r(j))

(b)

7r(i) < 7r(j) if 7r(i) C I,~r(j) e K.

Then P r ( j ) - {Tr(1),...,Tr(j)} by (3.27) and (3.28). We now rearrange the n equations (3.25)-(3.26) so that the t-th new equation is the 7r(t)-th old equation, and also rearrange the n unknowns YS~,T~ so that the t-th new unknown is Ys,(,),T,(~). The new system has the form A y - b where the coefficient matrix A has zeros above the main diagonal and ones everywhere else, and the components of b are nonnegative and nondecreasing. This system has a unique, nonnegative solution y, as required.

C o r o l l a r y 3.3.15 The system (3.14) of linear inequalities is totally dual integral; that is, whenever C l , . . . , cn are integers, the dual of the linear program (3.17) has an optimal solution in integers. 9 For more information on totally dual integral systems, see [EG84, PayS0, Sch86]. In order to identify the facets of ~ , we need to know its dimension and the dimension of a related polytope. For m > 1, n > 1, let Km,~ denote the complete bipartite graph with bipartition { 1 , . . . , m } , {m + 1 , . . . , m + n}, and let Z)m,~ denote the convex hull of the degree sequences of the spanning subgraphs of Km,n (just as T)n is the convex hull of the degree sequences of the spanning subgraphs of K~, the complete graph on { 1 , . . . , n}).

3.3

91


Lemma 3.3.16

1. dim ~)1 -- O, dim 792 - 1;

2. dim/)m,~ - m + n - 1 for m >_ 1, n >_ 1; 3. dim:Dn - n f o r n > 3. P r o o f . 1" This is clear. -~m+n 2: Since/)m , n is contained in the hyperplane ~ i m= l xi __ xz_.i=m+l xi, we have dim/)m,n _< m + n--1. To prove the reverse inequality, consider any spanning tree of Km,n and let A be its edge-vertex incidence matrix. The m + n - 1 rows of A are linearly independent, and each of them lies in /)m,n. Hence dim/)m,~ >_ m + n - 1. 3" Koren [Kor73] proved this fact by showing that/:)n has interior points for n >_ 3. We prove it as follows. For n odd, let H be any Hamiltonian cycle in Kn, and for n even, let H consist of any cycle through n - 1 vertices and a single edge at the remaining vertex. Let A be the edge-vertex incidence matrix of H. In each case A is nonsingular and its rows lie i n / ) ~ . Hence dim Dn > n. Theorem

3 . 3 . 1 7 For n >_ 4, the facets

of ~) n

are

the following:

1. inequalities (3.1~) f o r all sets S, T such that S 5r 0 , T 5r ~g, S N T - 0 , S U T C_ { 1 , . . . , n } , I S u T I - 2 , . . . , n - 3 , n; 2. xi > 0 f o r i - 1, . . . , n; 3. xi < n -

l fori-

1,...,n.

The facets of 2)a are only of the f o r m 1 above.

P r o o f . By Theorem 3.3.14, every facet of 7Pn has the form of inequality (3.14) for some sets S, T with O :/- S U T C_ { 1 , . . . , n}, S N T - O. Let 7PS,T denote the convex hull of the degree sequences satisfying (3.14) with equality. Thus by Part 3 of L e m m a 3.3.16, (3.14) is a facet of Dn if and only if dim DS, T -- n - 1. From L e m m a 3.3.13 it follows that dim DS, T -- dim DIsI,IT I + dim Z)ISuTI, where dim Z)ISI,ITI is understood to be 0 if either S or T vanishes and dimD0 is 0. Thus for S - e , (3.14) is ~ facet of Z)n if and only if ITI 1 and ITI >_ 3, which means that n >_ 4 and T - {i} for some i, and gives

92

Degree Sequences

the facet xi >_ O. Similarly for T - O, (3.14) is a facet if and only if Isl 1 and Isl _ 3, which implies that n _> 4 and gives the facet xi _< n - 1. Finally i f S - ~ e and T r e , (3.14) is a facet if and only if IS'l + I T I - 1 + dim Z)lsur I = n - 1, or equivalently dimZ)lsur I = IS u TI, which holds if and only if IS t2 T I - 0, 3, 4 , . . . , n - 2 by Lemma 3.3.16. This case corresponds to 1 of the lemma. 9 We mention in closing this subsection the work of Stanley [Sta91], which discusses all the faces of Z)~.

3.4

Difference Sequences

In this section we characterize the degree sequences of difference graphs and show that they play among bipartite degree sequences a role similar to the one played by the threshold sequences among all graphical sequences. A s p l i t s e q u e n c e is the degree sequence of a split graph. Note that if one realization of a degree sequence is a split graph, then all its realizations are split graphs. This follows from Theorem 3.1.11, since every alternating 4-cycle of a split graph G ( K , S ) must alternate between K and S, and a transfer along it leaves K as a clique and S as a stable set. The role of split and threshold sequences among all graphical sequences is depicted in the following result. T h e o r e m 3.4.1 Let d - (dl,... ,dn) be a proper sequence with an even s u m and a corrected Durfee n u m b e r rn. Then 1. d is a graphical sequence if and only if it satisfies the first rn Erd5sGallai inequalities (3.3); 2. d is a split sequence if and only if it satisfies the first rn E r d S s - G a l l a i inequalities, the rn-th one with equality; in this case d has a split realization G( {1, . . . , rn } , { rn + 1, . . . , n } ) ; 3. d is a threshold sequence if and only if it satisfies the first rn ErdSsGallai inequalities with equality.

P r o o f . Part 1 is Theorem 3.1.9 and Part 3 is Condition 4 of Theorem 3.2.2. Part 2 follows from Part 1, the fact that the m-th Erd6s-Gallai inequality can be written as Eirn=l d i -Ein.=m+l di ~ ?Tt(?Tt- 1) (see Equation 3.9), and L e m m a 3.3.13. 9

3.4

Difference

93

Sequences

To formulate analogous results for bipartite graphs, we say that a pair of integer sequences x = ( x l , . . . , x p ) and y = ( y l , . . . , y q ) is b i p a r t i t e resp. d i f f e r e n c e r e a l i z a b l e if there exists a bipartite resp. difference graph whose color classes have degrees x and y. The following famous characterization of bipartite realizable pairs of sequences is known as the Gale-Ryser Theorem. Several proofs can be found in [BerT6, FF62, MO79]. We give here a short proof using Theorem 3.4.1. Theorem

3.4.2 A pair of non-negative integer sequences X-

(X 1 ~__ " ' "

~

Xp) and

y-

(Yl ~__ " ' "

_~

Yq)

with Yl 1, yq > 1, and Yl _~ p - 1, for otherwise delete xp or yq or delete yl and decrease each xi by 1 without affecting the bipartite realizability or the majorization. Clearly x and y are bipartite realizable if and only if the sequence xl+p--1

>_ "" > _ X p + p - - 1 > yl > -- ... >_yq

(3.29)

is a split sequence. For this sequence m - p, since Yl __~ P - 1. Also since xp _> 1, the first p Erd6s-Gallai inequalities for this sequence can be written as k

p

i=1

q

~

k+~min(k,

i=k+l

i=1

~-~(xi+p-1)__ 2 and permuting the components of z so that Zl _> " - >_ zp, we can take

z ' - (Zl + C a s e 2" Yl - - P" Then by Corollary 2.4.5 (x, y) is difference realizable if and only if (x",y")is difference realizable, where x " - ( x ~ - 1 , . . . , x p - 1) and y " - (y2,..., yq). By induction (x", y " ) i s difference realizable if and only if there does not exist a bipartite realizable (z", y") with z" ~- x". This is true

96

Degree Sequences

in turn if and only if there does not exist a bipartite realizable (z, y) with z ~- x (if z" exists, take xi - z it! + 1; conversely if z exists, then zp _> 1 and take z it l - z i - 1). ,, We conclude this section by showing how to recognize difference sequences when the partition is not given. T h e o r e m 3 . 4 . 6 L e t d - ( d a , . . . , dn) be a sequence of positive integers. Let di - m a x ( d l , . . . , d n ) and put eo - n - di. Then 1. if d is a difference sequence, then there exists some j 5r i such that dj - Co; 2. if there exists some j r i such that dj - Co, let d' be obtained f r o m d by dropping two entries equal to di and dj and subtracting 1 f r o m the remaining entries. Then d is a difference sequence if and only if d' is.

Proof.

1. Assume that there is a difference graph G with bipartition (X, Y) and degree sequence d. Since the degrees di are positive, G has no isolated vertex. By Corollary 2.4.5, each one of X and Y has a dominating vertex. Hence the result. 2. If G is a difference graph with bipartition (X, Y) and degree sequence d, we may assume without loss of generality that di is a degree of a vertex in X. By Corollary 2.4.5, X has a dominating vertex x, whose degree must be di, and Y has a dominating vertex y, whose degree must be e0. Then G - {x, y} is a difference graph with degree sequence d'. The converse is shown in a similar way.

From Theorem 3.4.6, one can easily give a linear-time algorithm to test whether a given sequence is a difference sequence. The details are omitted.

Chapter 4 Applications 4.1

Introduction

In this chapter we consider some applications of threshold graphs. Section 4.2 deals with aggregations of inequalities in 0-1 programming. Section 4.3 treats synchronizing competing processes in a complex system such as a large computer. Section 4.4 discusses job scheduling. Section 4.5 introduces G u t t m a n scales in psychology.

4.2

Aggregation of Inequalities

Chvgtal and Hammer [CH73, CH77] coined the name "threshold graphs". Their motivation for introducing these graphs was the aggregation of linear inequalities in integer programming. Consider the set-packing problem whose constraints are A x < 1,~ x binary,

(4.1)

where A is an m x n 0/1 matrix and lm is the m-vector of l's. If m is large compared to n, it takes much storage to store the matrix A of coefficients. This motivates the search for a t x n matrix B, not necessarily 0/1 but with t smaller than m, such that the constraints (4.1) are equivalent to the constraints B x < It (4.2) x binary, 97

98

Applications

in the sense that both constraints have exactly the same solutions. The constraints (4.2) are referred to as an aggregated form of the constraints (4.1). It is important to emphasize that both sets of constraints use the same set of binary variables. We can always obtain a single equation equivalent to (4.1) by adding slack variables to convert (4.1) into equations and taking an appropriate linear combination of these equations, but this uses more variables. Naturally, one is interested in the smallest possible t such that (4.2) is equivalent to (4.1). It is denoted by t(A). In particular we wish to know whether t(A) - 1, namely whether the constraints (4.1) can be aggregated into a knapsack constraint in the same binary variables. Consider the intersection graph G - (V, E) of the columns of A, namely V - { 1 , . . . , n} and ij C E if and only if the i-th and j - t h columns of A have a 1 in a common row (every graph can be obtained in this way from its own edge-vertex incidence matrix and possibly from other 0/1 matrices). Clearly, a binary vector satisfies the constraints (4.1) if and only if its support is a stable set of G. Thus the constraints (4.1) can be aggregated to a knapsack constraint if and only if G is a threshold graph. It turns out that t(A) depends only on G and not on A itself. We define the threshold dimension t(a) of ~ graph G - (V, E) as the smallest integer k such that E can be covered by k threshold spanning subgraphs of G. The relation of this definition to the more usual usage of the term "dimension" in connection with intersections is that t(G) is the smallest k such that the complementary graph G is the intersection of k threshold graphs. Chapters 6 and 7 treat the threshold dimension in more detail. Our purpose here is to present the result of Chvs and Hammer that t ( A ) - t(G). T h e o r e m 4.2.1 ([CH77]) Let A be a 0/1 matrix with intersection graph

G. Then t(A) - t(G). P r o o f . Assume that G is covered by t threshold subgraphs G1,..., Gt. Each G~ has by definition a separating inequality b(~). x .B~ + Brj > 1. If ij C E~, then the characteristic vector of {i,j} violates (4.2), hence it also violates (4.1) and so ij C E. Thus each G~ is a subgraph of G. Likewise, if ij ~ E~ for each r, then the characteristic vector of {i,j} satisfies (4.2) and (4.1) and so ij ~ E. Thus the G~ cover E. We now show that each G~ is a threshold graph. Indeed if not, then by Condition 2 of Theorem 1.2.4 G~ has an alternating 4-cycle i, j, k, 1 with

Bri + Brj > 1

Bri A- Brk < 1 B~j + B~t < 1.

These inequalities are clearly contradictory and so G~ is indeed a threshold graph. Therefore E can be covered by as few as t threshold subgraphs of G, i.e., t(G) < t(A). ,,

4.3

Synchronization

In this section, based on the work of O r d m a n [Ord85, Ord86, Ord89], we consider the problem of synchronizing processes running in parallel in a large system such as a computer. The processes require some resources such as an output device or access to a memory location, and it may happen that several processes cannot use a resource at the same time. We then need a mechanism that will not allow a process to enter a phase wherein it uses the resource until such time as this usage will not conflict with that of other processes. Such mechanism is said to ensure mutual exclusion. We ignore other problems here, such as preventing deadlock of the whole system and ensuring a finite waiting time for each process.

100

Applications

What is usually done is that each process has a section of its program, called the critical section, where it uses the resources in competition with other processes. Before it enters its critical section, the process executes an entry routine, designed to verify that it is safe to do so. After it completes its critical section, the process executes an exit routine, to record the fact. The communication between the processes is done entirely in terms of integer shared variables. Access of the shared variables is done via operations known as synchronizing primitives. In the entry routine of the test-and-set synchronizing primitive, the process keeps examining a shared variable until the latter reaches a fixed set of values, and then resets the value of the shared variable, and control passes to the next statement. This may be written in the following way: test V until V C S then V := function(V). The exit routine does not need the test of V, but it can formally be described by the same kind of statement by letting S contain all possible values of the variable V. A restricted special case of test-and-set is the P V-chunk synchronizing primitive. It can be described by a statement of the form test V until V > c then V := V - c , where c is a constant that can depend on the process in question. The variable V is initialized to some positive value, and the above statement ensures it never becomes negative. The exit routine can be described by the same kind of statement with a negative c, and has the effect of simply increasing V. Thus V can be thought of as a semaphore indicating the available quantity of some resource. The process needs at least c units of the resource, and as soon as this quantity is available, the process enters its critical section and decreases V by c. Upon exiting it increases V by the same amount to signal that it no longer uses the resource. The minimal sets of processes that cannot be in their critical sections simultaneously form the edges of a certain hypergraph on the set of processes. We consider the case that this hypergraph is in fact a graph G, namely that each minimal set of processes that cannot be in their critical sections simultaneously consists of exactly two processes. The graph G is called the exclusion graph of the processes. If G is a threshold graph with separator }-'~in_=lWiXi ~ t, then mutual exclusion can be achieved by using one PV-chunk variable V initialized to

4.3

Synchronization

101

the value t. The entry routine of processor i is to test V until V >_ wi ~Lndthen decrease V by wi; its exit routine is simply to increase V by wi. Conversely, if a single PV-chunk variable suffices, then G is a threshold graph. This was recognized by Henderson and Zalcstein [HZ77], who defined threshold graphs by this property and found many of their properties. They called these graphs PV-chunk definable graphs. Clearly, if the exclusion graph G of the processes is a threshold graph, the minimum number of resource units needed to ensure mutual exclusion by means of PV-chunk operations is equal to the Orlin threshold t of G as given in Section 1.3. The state of the resource is expressed by the PV-chunk variable V, which can assume the t + 1 values 0 , . . . , t. We show below that when the exclusion graph is a threshold graph, even if mutual exclusion is performed by means of the more powerful testand-set synchronizing primitives, the shared variables must be able to assume at least t + 1 collective states. If the exclusion graph G is not a threshold graph, we have to use several PV-chunk variables, say V I , . . . , V~. Each variable Vk represents a separate resource type and each process i requires a certain quantity wki of this resource. A set of processes with a characteristic vector z can be in their critical sections simultaneously if and only if ~in=l WkiZi 3)" Since G is triangulated it does not contain C4 or Cs, and since G is t r i a n g u l a t e d and 2K2 is C4, G does not contain 2K2. 3) =~ 1)" Let K be a m a x i m u m clique in G with the fewest possible edges in V - K . Assume that, if possible, two vertices x, y in V - K are adjacent. If there are two vertices z, w in K such t h a t xz, yw E E but xw, yz ~ E, then those four vertices induce a 6'4, a contradiction. It follows t h a t either x ~ K Y or y ~ K X. Assume without loss of generality t h a t x ~ K Y. Now, if x has two non-neighbors z, w in K , then x, y, z, w induce a 2K2, a contradiction. If y has at most one non-neighbor in K , say z, then K - z + { x , y } is a larger clique t h a n K in G, a contradiction. It follows t h a t x has exactly one non-neighbor in K , say z, and y has at least one more non-neighbor in K , say w. Now z must have a neighbor u in V - K t h a t is not a neighbor of x, for otherwise K ~ - K - z + x is also a m a x i m u m clique and V - K ~ has fewer edges t h a n V - K , a contradiction. Now, yu E E or else x, y, z, u induce a 2K2 in G. Applying to u and y the above a r g u m e n t about x and y, we find t h a t u ~ K Y and u has exactly one non-neighbor w ~ in K. Now, x, y, u, z, w ~ induce a C5, a contradiction. .. Theorem

5.2.2 ( [ F H 7 7 a ] )

For every split graph G ( K , S ) ,

conditions are equivalent: 1. G is an interval graph; 2. G is a cocomparability graph;

J. D(S) IK N V(H)[ , and 0 otherwise. The deficiency of B is the sum of the deficiencies of its components. Notice that the deficiency of B can be computed easily. T h e o r e m 5.3.2 ([BH80]) Let G ( K , S ) be a Hamiltonian split graph with IXl < IKI. Let X be any subset of S and Y any subset of N ( X ) . Let B be the bipartite graph having partition ( X , N ( X ) - Y) and all edges of G between X and N ( X ) - Y, and let B have h critical components and deficiency k. Then 21Y I > 2k + h. Furthermore if h - O , then 21Y I >_ 2k + 1.

5.4

117

T h e S p l i t t a n c e of a Graph

Let C, C + and C - be as in the Lemma 5.3.1. If D is a deficient component of B with deficiency d, then there are at least d edges of C oriented from D n S to Y and at least d edges of C from Y to D N S. It follows that at least 2k edges of C are between the S-vertices of the deficient components and Y. If R is the set of S-vertices of a critical component, then there is at least one C-edge either from R to Y or from Y to R. Otherwise by L e m m a 5.3.1, the critical component equals the entire G, contradicting the assumption that IsI < IKI. It follows that there are at least h edges of C between the S-vertices of the critical components and Y. Thus there are at least 2k + h C-edges between X and Y. Hence 2k + h _< 21YI. This proves the first part. For the second part, assume that h = 0 and, if possible, 2k = 21Y [. Let D* be the set of vertices of the deficient components of B. Then Proof.

C+(D * N S) c_ (D* O K) U Y, C - ( D * n S) c_ (D* N K ) U Y. Also since 2k =

21YI, we

have

ID q SI =

IN n

It" I + k =

IN n

h'l + IYI.

(5.1)

Hence by Lemma 5.3.1 it follows that V = D* U Y. Therefore S = D* N S and K = (D*O It')U Y. But then [Xl- I1(1 by ({5.1), a contradiction to

ISI
d~, be a graphic sequence.

m(d) - m a x { k " dk >_ k - 1 }

We

(5.3)

and, f o r 0 _ < k _ < n ,

1(

crk(d) - -~ k ( k - 1 )

k di + ~n di ) .

- ~

i=1

(5.4)

i=k+l

When there is no danger of confusion, we simply write m and crk in place of rn(d) and crk(d), respectively. Re 11 that r e ( d ) i s the corrected Durfee number of d as in Definition 3.1.6.

L e m m a 5.4.2 2. O"0 > ' ' "

1. ak > O for O < k < n. > O'm_ 1

and

(Tin < " "

< ~n.

3. The sequence {crk } has a single m i n i m u m at k - rn if dm > m - 1, and two adjacent m i n i m a at k - m 1 and k - m, if din - m 1.

5.4

The Splittance of a Graph

119

P r o o f . Part 1 is a consequence of the Erd6s-Gallai inequalities [EG60] k

n

~_~di _ dm >_ m - 1 > k-1; and positive for m + 1 j, d) i = j E C c + l , 8 -- a = t, or e) i = C a s e 4: x : u'is, y : tttjt. (x, y) E L~ if a) i E C~,j i<j,c) i,jEC~+2, i>j,d) i=jEC~+l,s=a,t=b,

(s, t) because x, y are i,j E C~+I, i < j, c) j E Cc+2, s = t = b. ~ C~, b) i , j E Cc+l, ore) i=jEC~+2,

s=b,t=a. C a s e 5: x = u'i~, y = ujk. In this case (y,x) E P, contradicting the incomparability of x and y, so the case is impossible. C a s e 6: x = u'i~, y = u'jk. (x,y) E L~ if a) i E C~,j ~ C~, or b) i , j E C~+I. C a s e 7: x = u~k, y = uj~ or y = u'5~ or y = u'jl. (x,y) E L~ if j ~ C~. C a s e 8: x = uik, y = ujt. In this case k = 1 because x, y are incomparable. Then (x, y ) E L~ if j C C~. C a s e 9: x = u'ik, y = ujt. In this case i -r j because x, y are incomparable. Then (x,y) E L~ if a) i E C~,j q~ C~, b) i,j E C~+l, i < j or c) i , j E C~+2,

i>j. C a s e 10: x = u'ik, y = u'jt. (x,y) E L~ if i E C~. C a s e 11: x = u'ik, y = ujl. In this case (i,k) = (j, 1) or k l . In each possible case, one of the alternatives leading to (x, y) E L~ must hold for some c. [] Theorem

7.2.4 It is NP-complete to determine if the dimension of a given

partial order is at most 3.

7.2

T h e P a r t i a l Order D i m e n s i o n

P r o o f . This follows from Lemma 7.2.1, Lemma 7.2.2, and Lemma 7.2.3.

145 9

T h e o r e m 7.2.5 It is NP-complete to determine if a given bipartite graph

can be covered by at most 3 difference subgraphs. P r o o f . Again this follows readily from Lemma 7.2.1, Lemma 7.2.2, and Lemma 7.2.3. 9 T h e o r e m 7.2.6 For every fixed k >_ 3, the problem of determining whether ch(B) _ 3, it is NP-complete to determine if the boxicity of a graph G is at most k, even if G is the complement of a bipartite graph.

7.3.3

Cubicity

A unit interval graph is the intersection graph of unit intervals (closed intervals of length 1) on the real line. The cubicity c(G) of a graph G is the minimum number of unit interval graphs whose intersection is G. It exists since Kn and K n - e are unit interval graphs. Clearly b(G) _ 3, it is NP-complete to determine if the cubicity of a graph G is at most k, even if G is the complement of a bipartite graph.

7.3.4

Threshold D i m e n s i o n

Recall that the threshold dimension t(G) of a graph G is the minimum number of threshold subgraphs required to cover E(G). L e m m a 7.3.7 Let B be a bipartite graph whose vertices are partitioned into stable sets X and Y. Let B' be obtained from B by adding all edges between vertices in X (i.e., making X into a clique). Then ch(B) - t(B'). P r o o f . t ( B ' ) < o h ( B ) : Let B~,... ,Bk be difference subgraphs of B that cover its edges, and let B ~ , . . . , B~ respectively be obtained from them by making X into a clique. Then by Theorem 2.1.9, the B~'s are threshold subgraphs of B' that cover its edges. Therefore t(B') Y. A Hamilton cycle in a r-hypergraph is a cyclic ordering of all its vertices such that every r consecutive vertices form an edge. If G - (V, E) is a graph with a weight function w" E ~ Z, and if T is a subgraph of G, then w ( T ) - ~ e E ( T ) w ( e ) . We consider the following recognition problems: LARGEST

THRESHOLD

SUBGRAPH"

I n s t a n c e : a graph G - (V, E) and a positive integer h _< IEI. Q u e s t i o n " Does G have a threshold subgraph r - (v, E') with IE'I >_ h?

7.4

Other Complexity Results

151

SMALLEST THRESHOLD SUPERGRAPH: I n s t a n c e : A graph G = (V, E) and a positive integer h. Q u e s t i o n : Is there a threshold graph T = (V, E') that can be obtained by adding at most h edges to G? WEIGHTED 2-THRESHOLD PARTITION: I n s t a n c e : A graph G = (V, E) with two weight functions Wl, w2 from E to Z and an integer W. Q u e s t i o n : Can G be partitioned into two threshold subgraphs T1, T2 (i.e., E(T1)U E(T2) = E(G) and E(T1)NE(T2) = O)such that wl(T1)+ w2(T2) >_

W? WEIGHTED 2-THRESHOLD COVER: I n s t a n c e : A graph G = (V, E) with two weight functions Wl, w2 from E to Z and an integer W. Q u e s t i o n : Can G be covered by two threshold subgraphs T1, T2 (i.e., E(T1)U E(T2) = E(G)) such that wl(T1)+ w2(T2) >_ W? MONOTONE 3-PARTITION (MON3PART)" I n s t a n c e : A set U of 3k elements and a size t(u) C Z + for each u C U. 1 Q u e s t i o n : Let T - -~ ~ueu t(u). Can U be partitioned into k ordered triples Vi = (ai, bi, ci), i = 1 , . . . , k, such that i- 1,...,k t(ai) -~- t(bi) + t(ci) = T, i=l,...,k-1 t(ai) 3. Construct a graph G - (V, E) with two weight functions wl, w2 and a constant W as follows: v - {x,,...

,Xn} u

u

7.4

Other

Complexity

Results

E -

155

u @ {x xj, i,j

Wl (XiXj)

x xj},

i:/:j

-/

t

0, ifxi, xj appear i n c k f o r s o m e k , 1, otherwise, 1 if xi appears in ck, 1 + g--~, 1, otherwise,

Wl(XiCk) -

W 1 (Xi-X j ) -- W 1 (-Xi-Xj) Wl('XiCk)-

-- 1

for all i r j,

1,

w2 (e) - 0 for all e C E, W - n(n-

1 1) + n m + -g.

Thus G has 2 n ( n - 1)+ 2rim edges, namely all possible edges except the edges between clauses or between complementary literals. Clearly, using rational rather than integer values for Wl, w2 and W is just a convenience, which does not affect the answer of the question. We show below that W E I G H T E D 2-THRESHOLD PARTITION on G with Wl, w2, W has an answer "yes" if and only if the given instance of ONEIN-THREE 3-SAT has an answer "yes". The proof is based on the following two lemmas. L e m m a 7.4.4 If T -

(V, E') is a threshold subgraph of the graph G defined 1)+ nm. Equality holds if and only if T has a split partition ( K , S ) such that If C_ { X l , . . . , X n , - ~ l , . . . , g n } , I1(I- ~, NT(C ) K for k - 1 , . . . , m , and there is an ordering { U l , . . . , u n } of the vertices of S - {C1,... , Cm} such that INT(Ul)[ - O, I N r ( u 2 ) l - 1 , . . . , INz(u )l - 1.

above, th n IE'I_

P r o o f . We describe a collection of threshold subgraphs T of G with n ( n 1) + n m edges each. Choose literals U l , . . . , Un containing exactly one of xi, x-~ for each i. Put S - { U l , . . . , u ~ , c ~ , . . . , c m } and If - V - S - { g l , . . . , g ~ } . Let T have all the edges uiuj with i 7~ j as well as all the edges cigj. In addition T has the edges uigj if and only if i > j. We now show that the graphs T described above are all the largest threshold subgraphs of G. Let T' be any threshold subgraph of G. We show that T' has at most n ( n - 1) + n m edges. In G, every edge between two literals conflicts with (i.e., forms an alternating 4-cycle with) exactly one other edge between literals. Thus by Condition 2 of Theorem 1.2.4, T' can have at most 1 [(2~) _ n], - n ( n - b e1)t wedges een literals. Next we show that T' has at most n m edges joining clauses to literals. Indeed, in G the edges xick and

156

NP-Completeness

gicl conflict if k -~ l. Therefore if NT,(Xi)N C 7~ 0 and NT,(g~)N C 7~ 0, then NT,(Xi)N C - NT,(gi)N C - {ck} for some k, in which case T' has exactly 2 edges between {x~,g~} and C. Otherwise NT,(X~)N C - ;g or NT,(-&)NC - r in which case T' has at most m edges between {x~,g~} and C. Since m > 2, T' has at most nm edges between literals and clauses, and T' attains this bound only if from each pair {xi,gi}, one literal, call it ui, is joined in T ~ to every clause, and the other one to none. Therefore T' can be a largest threshold subgraph only if T' has exactly n ( n - 1) edges between literals and exactly nm edges between literals and clauses. Let T' be a largest threshold subgraph and put K - { g l , . . . , u--~}, where the ui are as defined above. Clearly K is a clique of T', for if uiuj is not an edge of T', then ~iCl conflicts with -5jc2 in T', an impossibility. Put S - { U l , . . . , u ~ , c l , . . . , c m } . T' cannot contain any edges uiuj, because this edge conflicts with gigj of K. Thus S is a stable set in T'. By Condition 3 of Theorem 1.2.4 we may reindex the variables so that NT(Ul) C_ ... C_ NT(U~). Since u~g~ is not an edge, it follows that INT( n)I _< 1, and hence INT(Ur,_I)I < n - 2 , . . . , [Nz(ul)l 3, otherwise NMTS is trivial. We construct the following instance of MON3PART: Put F - m a x { a i , bi, ci 9 1 _ 2 in G[B]. ,, T h e o r e m 7.6.4 POLAR is NP-complete. P r o o f . Clearly P O L A R is in NP. We observe that the following problem is NP-complete by Corollary 7.6.2 and Lemma 7.6.3: ADDIT: I n s t a n c e : A graph G without subgraphs isomorphic to H I , . . . ,/-/4 (not necessarily induced). Question: Is G C C? We reduce ADDIT to POLAR. Let G be an instance of ADDIT. We construct in polynomial time a matching X meeting every triangle of G as follows: 1. if two triangles of G have a common edge e, then put e in X; 2. if two triangles of G have exactly one vertex v in common, then from each of these two triangles put the edge that is not incident to v in X; 3. if a triangle has no common vertex with any other triangle, put exactly one arbitrary edge of the triangle in X. Clearly X meets every triangle of G. Using the fact that G does not contain H 1 , . . . , / / 4 as a subgraph, it is straightforward to verify that X forms a matching.

NP-Completeness

172

We now transform the graph G to an instance H of P O L A R as follows. For each edge e - ala2 C X , delete e and introduce new vertices a 3 , . . . , a l l so that a l , . . . , axl induce the gadget H~ shown in Figure 7.4. Further, put Figure 7.4" A gadget He replacing the edge e -

a la2

E X.

a7

a8

al0

a5

a6

a3

a4

--

I

-- a9 al

a2

"

"

I

all

He

Me

-

NG(al)-

NG[a2] ,

We -

NG(a2)- N G [ a l ]

,

and join ax0 to every vertex of Me and all to every vertex of Are. Finally, add a new component F isomorphic to K1,2. The resulting graph is H. Obviously, H can be constructed in polynomial time. It remains to show that G C C if and only if H is polar. Assume that (G, A , B ) C C. We assert that for each edge ala~ - e C X , if al C B then M~ C_ A. Indeed, e is contained in a triangle T, and T has a vertex in B other than a l . Hence if M~ N B r ~, then degB(al ) >_ 2, a contradiction. We now exhibit a polar partition (H, W1, W2). Let A C_ W1, B C_ W2, and let any one vertex of F be in W1 and the other two in W2. For each edge e - axa2 E X , we show how to assign the new vertices of He to W1 and W2 so that W1 remains stable and W2 remains a disjoint union of cliques. We distinguish the following two cases. C a s e 1" al, a2 C B. Assign a3, a 4 , a s , a9 t o W1 and the other vertices of H~ to W2. Clearly a3, a 4 , a s , a9 form a stable set and do not have any neighbors in W1. In W2, the new vertices form an isolated triangle on as, a6, aT, and two isolated vertices al0, all (al0, all are isolated because Me U Are C_ A C_ W1

7.6

Polar Graphs

173

by the assertion). Thus W2 remains a disjoint union of cliques. C a s e 2- al C B, a~ C A (the case al C A, a2 C B is similar). Assign the vertices a3, a6, as, all to W1 and the other new vertices of H~ to W2. Again the vertices a3, a6, as, all form a stables set and have no neighbors in W1 (all does not have any neighbors in W1 because N~ C_ B C_ W2 as a2 C A). The new vertices of W2 form in W2 isolated cliques asa-t, a4, a9, and al0 (al0 has no neighbors in W2 because M~ C_ A C_ W1 by the assertion). Hence (H, W1, W2)is polar, as required. Conversely, assume that (H, W1, W2) is polar. Then V(F) N W1 7/= ;g, for otherwise H[W2] will contain an induced subgraph K1,2. It follows that H[W~- V(F)] is edgeless, for otherwise H[W1] will contain an induced [~1 (-J K2. Now we distinguish the following two cases. C a s e 1" H[W1] contains an edge. Since H[W1] is a complete multipartite graph with an edge, it is connected, and since F is a component, H[W1] C_ F. Hence H - F C_ H[W2], and since H[W2] is a disjoint union of cliques, H~ cannot exist and hence X is empty. Hence G is triangle-free. Thus G - G f"l H[W2] C 1C2, where/Cp denotes the family of graphs all of whose components are cliques Kn with n _< p, and (G, 0, V(G)) C C. C a s e 2" H[W1] is edgeless. Put A - V(G) N W~ and B - V(G) N W2. We show that (G,A,B) C C. Clearly H[A] is edgeless and H[B] is a disjoint union of cliques. We show that G[A] is edgeless and G[B] C 1C2,proving that

(G,A,B) EC. To show that G[A] is edgeless, assume that, if possible, G[A] has an edge e. Since H[A] is edgeless, this edge must be of the form e - ala2 E X, with al, a2 C A C_ W~. Then a3, a4 E W2 (since H[W1] is edgeless), and hence exactly one of as, a6 is in W2, say as. Then aT is in W2 and hence H[W2] contains an induce K1,2 on a3, as, a~, a contradiction. It follows that G[A] is edgeless. To show that G[B] E lC2, we show equivalently that G[B] has neither triangles nor induced K1,2's. Assume first that G[B] contains a triangle T. Exactly one of the edges of T is in the matching X because G - X is trianglefree. But then K1,2 is isomorphic to H[T] C_ H[B] C_ H[W2], a contradiction. Assume now that G[B] contains an induced K1,2 on the vertex set {a~, a2, w} with center al. Clearly exactly one of its two edges, say e - ala2, is in the matching X, because H[W2] is K1,2-free. Then the vertices al, w along with the vertices a8, al0 of He induce a C4 in H, because w E Me. Since al, w C W2 and H[W2] is K~,2-free, it follows that a8, al0 C W~, contradicting Case 2. It follows that G[B] C 1C2. 9

Chapter 8 2-Threshold Graphs 8.1

Introduction

Recall from Chapter 7 the result of Yannakakis that the problem of determining if the threshold dimension of a graph is at most k is NP-complete for every fixed k >_ 3. This problem is clearly polynomial for k - 1. In this chapter we study the graphs of threshold dimension at most 2, called

2-threshold graphs. Recall that two edges of a graph G are said to conflict if their endpoints induce in G a C4, a P4 or a 2K2. Also recall that the conflict graph G* of G is constructed as follows:

v ( a * ) - E(a) E(G*)

-

{ef'eandfconflictinG}.

Let H be a subgraph of G and let e and f be edges of H. Clearly, if e and f conflict in G, they also conflict in H. Therefore if H is a threshold graph, no two of its edges conflict in G, and the vertices of G* corresponding to the edges of H form a stable set. This shows that the chromatic number of G* is bounded by the threshold dimension of G:

x(G*) < t(G).

(8.1)

Chvs and Hammer [CH77] observed that the inequality (8.1) often holds with equality, and asked whether any graph satisfies it with a strict inequality. Cozzens and Leibowitz gave an example of a family of graphs satisfying (8.1) as a strict inequality for every value of x(G*) >_ 4, as we have seen in 175

2-Threshold Graphs

176

Section 6.1. We do not know the answer for the case x~(G*) = 3. The case x(G*) _< 2 has been studied extensively because it is necessary for G to be 2-threshold. Partial results on its sufficiency have been obtained by several authors. Ibaraki and Peled [IP81] and independently Cogis [Cog791 and Doignon et al. [DDF84] proved that if x~(G*) is bipartite and G is a split graph, then G is 2-threshold. Ibaraki and Peled also proved that if x~(G*) is bipartite and has at most two non-singleton connected components, then G is 2-threshold. They conjectured that, in fact, a graph G is 2-threshold if and only if G* is bipartite. This conjecture has recently been proved by Raschle and Simon [RS95]. It implies a polynomial-time algorithm to recognize 2threshold graphs. An earlier independent work of Ma [Ma93] using results of Ma and Spinrad [MS94] gave another polynomial-time algorithm to recognize 2-threshold graphs, reducing the problem to recognizing difference dimension at most 2. These results are discussed in Sections 8.5, 8.6 and 8.7. In the first few sections we study properties of 2-threshold graphs and interesting subclasses of them. In Section 8.2 we present the result of Hammer et al. [HMP89] that the 2-threshold graphs are weakly triangulated. We also present the results of Chv~tal et al. [CHMd87] and of Hammer and Mahadev [HM85] that these graphs and their complements are perfectly orderable. In Section 8.3 we present the results of Hammer and Mahadev [HM85] on the class of bithreshold graphs, motivated by Boolean biregular functions, and show that their complements are 2-threshold graphs. In Section 8.4 we consider a special class of 2-threshold graphs that are also comparability graphs, called strict 2-threshold graphs and studied by Mahadev and Peled IMP88].

8.2 8.2.1

Properties of 2-threshold Graphs As Perfect Graphs

A graph G is called weakly triangulated if neither G nor its complement contains an induced cycle of size 5 or more. A perfect order for a graph G is a total order < on V(G) such that no induced P4 with edges ab, bc, cd satisfies a < b and d < c. A graph possessing a perfect order is called perfectly orderable. The weakly triangulated graphs and the perfectly orderable graphs are known to be perfect [BC84]. We show below that the 2-threshold graphs and their complements are both weakly triangulated and perfectly orderable.

8.2

Properties of 2-threshold Graphs

177

T h e o r e m 8.2.1 ( [ H M P 8 9 ] ) The 2-threshold graphs are weakly triangulated. Proof. Observe first that Cs, 6'6 and P6 are not 2-threshold graphs, as their conflict graphs are not bipartite (see Figure 8.1). Since every induced subgraph of a 2-threshold graph is also a 2-threshold graph, it follows that the 2-threshold graphs do not contain Cs, 6'6 or P6 as induced subgraphs. Hence the 2-threshold graphs do not contain any induced cycles of size five or more. Figure 8.1: The conflict graphs of C5, C 6 and P6.

v

v

~

v

v

We now prove that for n >_ 5, C~ is also forbidden for 2-threshold graphs, by showing that for n _> 5, the conflict graph of C~ contains an odd cycle. Let V l , . . . , v~ be the vertices along a cycle C=. Clearly, V l V 3 : V2V4~ V3V5~ 9 . . ~ V n - l Vl~ VnV2~ V l V 3

is an n-cycle in C~*, which settles the case where n is odd. For n - 4k + 2, V l V 2 k + 2 ~ V2V2k+3~ V 3 V 2 k + 4 ~ 9 9 9 ~ V 2 k + l V 4 k + 2 :

is a (2k + 1)-cycle in Cn*

9

V2k+2Vl

For n - 4k

VlV2k+2: V2V2k+3: V3V2k+4~ 9 . . : ?22k-lV4k~ V2kVl~ V2k+lV2~ V2k+2Vl

is a (2k + 1)-cycle in Cn*. T h e o r e m 8.2.2 ( [ C H M d 8 7 ] ) able.

The 2-threshold graphs are perfectly order-

178

2-Threshold

Graphs

P r o o f . Consider an arbitrary 2-threshold graph G - (V, E) and let it be the union of threshold graphs T1, T2 defined on the same vertex set. E n u m e r a t e the vertices of V as X l , . . . , x ~ so that x, ~- ..- ~ x~ in T1. Let k be the largest of all the subscripts i such that T~ has an edge xixj with i < j. We may assume that k exists, for otherwise G - T2 is a threshold graph, which is obviously perfectly orderable. Write A - { X l , . . . , x k } and B - V - A. Thus X k X k + 1 C::_.E(T1) and B is stable in T1. Enumerate the vertices in B as y l , . . . , Yn-k SO that Yl ~ " ' " ~ Yn-k in T2. Now define the linear order < by Xl
1, the induction hypothesis implies { x 0 , . . . , Xh-1, Y o , . . . , Yh-1 } N {v0, vl} -- e and a double AP6Vo, V~,Yh-1,...,Xh-~. Therefore {xh,Yh}gl {VO, Vl} -- O . From this and the fact that Xh-lYh-1 II XhYh w e obtain the double AP6 Vo, V l , Y h - l , Y h , Xh, Xh-1, whose complementary double AP6 VO, Vl , Xh, Xh-1, Yh-1, Yh satisfies our claim. F a c t 8 . 5 . 8 L e t v o , . . . , v5 be an AP6 and v2v5 - x o y o II II XhYh a path in G* satisfying {x h, Yh} N {V0,721} r • . Then a double AP6 exists.

P r o o f . We assume that { x 0 , . . . , Xh-1, Y o , . . . , Yh-1 } ~ {Vo, Vl } "-- ~ without loss of generality. Thus Fact 8.5.6 guarantees either an AP6 Vo, vl, Y h - l , . . . , Xh-1 or an AP6 VO, V l , X h - I , . . . ,Yh-1. Because of s y m m e t r y it suffices to discuss the first possibility. Figure 8.12" A configuration in the proof of Fact 8.5.8.

Yh -- VO ~ ~ _ h ~

l _~
1 and the set L = {v E V : x v C E - Ei, yv C E - Ei} is not a clique. Observe that the vertices in L can be marked in O(IVl)-time. To obtain a nonedge in L, if any, simply build and use the adjacency lists of a[L], which can be done in O(IEI + I V l ) - t i m e . . .

8.6

8.6

R e c o g n i z i n g Difference D i m e n s i o n 2

213

R e c o g n i z i n g Difference D i m e n s i o n 2

In this section we present the necessary background from Ma and Spinrad [MS94] for Ma's algorithm for recognizing threshold dimension 2, to be discussed in the next section. Recall from Section 2.4 that a difference graph, also known as a chain graph, is a bipartite graph in which the neighborhoods of the vertices in each color class are nested. Given a bipartite graph G, its intersection difference dimension idime(G) is the minimum number of difference graphs whose intersection is G, and its union difference dimension udimg(G) is the minimum number of difference graphs whose union is G. We consider that udimd(G) = 1 if G has no edges. Note that udimd(G) is the same as ch(G) of Section 7.2. Our problem here is to recognize when idima(G) _< 2. Since the class of difference graphs is closed under bipartite complementation, an equivalent problem is to recognize when udime(G) _< 2. Other equivalent problems are the recognition when a given split graph can be covered by 2 threshold subgraphs, solved by Ibaraki and Peled [IP81]; the recognition of Ferrers dimension 2, solved by Cogis [Cog79]; and the recognition of bidimension 2, solved by Doignon et al. [DDF84]. The solution of Ma and Spinrad reduces the problem to recognizing poset dimension 2. Recall from Theorem 16.4.4 that a poser has dimension 2 if and only if its incomparability graph is transitively orientable, which gives a polynomial-time recognition algorithm for poset dimension 2. In fact, there are O(n2)-time recognition algorithms for poset dimension 2, and the reduction to be presented here can be done in the same time bound, but we would not go into these details, since the recognition of threshold dimension 2 has a larger time bound. A linear extension of a strict poset P = (X, R) is a strict poset (X, L), where L contains R and is complete. A realizer of P is a set of linear extensions whose intersection is P, and the poser dimension of P, do(P), is the minimum size of a realizer of P. D e f i n i t i o n 8.6.1 Let P - (X, R) be a strict partial order. The b i p a r t i t e r e p r e s e n t a t i o n of P is the bipartite graph B ( P ) = ( X , X ' ; E) with color classes X and X', where X ' = {x' : x C X } is a disjoint copy of X , and xy' C E if and only if y f p x, i.e., x
1. 9 We remark that the estimates in the proposition are crude. Additionally, one can show that if G has a NTRM, it has one in which the boxes form a m a x i m u m clique, that G has at most n 2 / 4 maximum cliques, and that these cliques can be obtained efficiently. See [Ma93] for the details. From now on we assume that the vertices of G are already partitioned for us into boxes B, segments S and points P, with B forming a maximal clique, the vertices that become isolated in G - B forming the set P, and the remaining vertices forming the set S and inducing a bipartite graph. We ask whether we can position these boxes, segments and points to form a NTRM so that their intersections represent the adjacencies of G. We may now assume that no two boxes are twins, namely have the same closed neighborhood, for otherwise we can delete one of them without affecting the answer (if we find a N T R M without this box, we can then add it as an exact or approximate copy of its twin box). Similarly we may assume that no two points are twins, namely have the same open neighborhood. As for the segments, we assume for the moment that the bipartite graph that they induce is connected, and therefore the segments partition uniquely into two sets that we may assign arbitrarily as horizontal and vertical segments. We may then assume that no two horizontal and no two vertical segments are twins, for similar reasons. The search for a NTRM is done in two stages. In the first stage we ignore the tips of the segments and try to place the other key-points (corners of boxes, points, and bases of segments) in the plane so as to reflect the adjacencies of the corresponding vertices, except for adjacencies between segments. In the second stage we arrange the segments, without disturbing their intersections with the boxes, to reflect their adjacencies with each other. Let us define a binary relation ~ on Ii{2 as follows" ( x l , y e ) __ ( x 2 , y 2 ) , when X 1 ~ X 2 and y~ _< y2. We say that ( x i , y l ) d o m i n a t e s (x~,y2), written as (xl,yl) -~ (x2, y2), when (xl, yl) ~___ (x2, Y2)and (xl,Yl) -r (x2,y2). Two different points that do not dominate each other are said to be incomparable. Clearly ~ is a partial order on Ii{2, and it is 2-dimensional, since it is the

230

2-Threshold Graphs

intersection of the linear orders of the separate z and y coordinates. The strict partial order -~ is also 2-dimensional. We are interested in finding domination relations between some of the key-points in every NTRM of the given graph G, namely between the corners b of the boxes, the bases s of the segments and the points p (since we are now in the first stage, we ignore the tips of the segments). It turns out that to a large extent these can be deduced directly from the neighborhoods of G. Every NTRM of G must satisfy the following, where we use the letters b, s, p to denote both the key-points of the geometrical objects and the corresponding vertices of G" 9 For a box corner b, a base s and a point p we have s-. s c:_ N(b)

(8.11)

p--< b ~

peN(b).

(8.12)

N[bl] C N[b2]

(8.13)

9 For box corners bl a n d b2 we have b1 ~ b2 ~

(the ~ the ~

part follows from b~ C b~ and from the absence of box twins; part follows from Step 5 of the normalization of the TRM).

9 For a point p and a box b we have b-.~ p ~

Yb' E N(p)

(8.14)

b-~ b'

follows from Step 3 of the normalization of the TRM). Hence (the < by (8.13) (8.15) b -~ p r Vb' e N(p) N[b] C N[b']. 9 For points pl and p2 we have

Pl ~ P2 ~

(8.16)

N(pa) D N(p2)

(the ~ follows from the absence of point twins; the < Step 3 of the normalization of the TRM).

follows from

9 For a point p and a segment s we have s -.~ p ~

(the ~

N(p) C_ N ( s ) N B

follows from Step 3 of the normalization of the TRM).

(8.17)

8.7

Intersection Threshold Dimension 2

231

9 For segment bases 81,82 E ~x or 81,82 E ~y we have the implication 81 "~ 82 ,r

N ( S l ) N B D N(s2) N B.

(8.18)

Let Kb, Kp, Kx, Ky be the sets of box corners, points and bases of the vertical and horizontal segments, respectively. Let R be the binary relation on K = Kb O I(p O K~ O Ky, R such that (a, b) C R if and only if a -~ b according to (8.11)-(8.18). The relation R can be computed from the neighborhoods of G. If G and the partition of its vertices into B, P, S~, Sy have a NTRM, then P = (K, R) is a strict poset of dimension 2, which we know how to check in polynomial time. If it turns out that P has a realizer, we can use the positions of the elements of K in the two linear extensions of P as coordinates of the corresponding key-points in R 2. However, we want to ensure that the bases of the vertical segments are lower than any other key-point and that the bases of the horizontal segments are to the left of any other key-point. Therefore we solve the restricted poset dimension 2 problem given by P and the following two sets of incomparable pairs of P:

{(a,b) : a C Kx, b ~ Kx,(a,b) ~ R} R2 = {(a,b) : a C Ky,b ~ Ky,(a,b) ~ R}

1~ 1

:

(i.e., we ask whether R is the intersection of two linear extensions containing RU R1 and Rt.J R2, respectively; this can be determined in polynomial time by Theorem 8.6.14). If the answer is negative, the current B is discarded. If the answer is positive, the two linear extensions are used to place the box corners, the points and the segment bases in the plane. Since the bases of the vertical segments are lower than any other key-point, we can lower them down further to the x axis without changing their intersections with the boxes or any domination relations, and similarly for the bases of the horizontal segments. We then obtain a NTRM for G except for the intersections between segments, since we have not yet placed the segment tips. Now we enter the second stage and try to arrange the segments bases and tips without changing the intersections between segments and boxes, so as to reflect the intersections between segments. Let us ignore the intersections between segments and boxes for the moment. We say that a bipartite graph has a segment intersection model when it is the intersection graph of vertical and horizontal segments in the first quadrant whose bases are on the coordinate axes. The question is then whether the bipartite subgraph of G induced by the segment vertices has a segment intersection model.

232

2-Threshold Graphs

P r o p o s i t i o n 8.7.2 A bipartite graph has a segment intersection model if and only if it is the intersection of two difference graphs with the same color classes. Proof. " O n l y if"" Consider a segment intersection model and its bipartite intersection graph Q. Consider the grid defined by the lines containing the segments. The horizontal and vertical lines of the grid correspond to the rows and columns of the adjacency matrix A of Q, and A has a 1 in a given position if and only if the corresponding horizontal and vertical segments intersect. Thus A is the intersection (component-wise product) of two adjacency matrices A1 and A2:A1 has l's in the positions corresponding to the grid points on the horizontal segments, and similarly for A2 and the vertical segments. Therefore Q is the intersection of the the bipartite graphs/-/1 and /-/2 whose adjacency matrices are A1 and A2, respectively. Now //1 is a difference graph, since every two rows of A1 are comparable by inclusion according to the lengths of the corresponding segments. Similarly//2 is a difference graph, since every two columns of A2 are comparable. Figure 8.26 illustrates this argument. "If": We assume that the bipartite graph Q is the intersection of difference graphs //1 and //2 having the same color classes. Let the color classes be X : { X l , . . . , x k} and Y : {yl,. 9 9 , Yl}, where N , l (xl) D " ' " D N , 1 (Xk)

(8.19)

N , 2 ( Y l ) _~ " ' " __~ NH~(y,).

(8.20)

Consider the adjacency matrices A of Q, A1 of HI and A2 of//2 as having columns corresponding to Xl,..., xk from left to right and rows corresponding to y l , . . . , y t from bottom to top. By (8.19) and (8.20), the l's in every row of A1 are consecutive from the first column on, and the l's in every column of A2 are consecutive from the first row on. We construct horizontal segments corresponding to the l's in the rows of A1 and vertical segments corresponding to the l's in the columns of A2. The i-th vertical segment intersects the j-th horizontal segment if and only if AI(j, i) = A2(j, i) = 1, which is equivalent to A ( j , i ) = 1 since A is the component-wise product of A1 and A2. In other words, the i-th vertical segment intersects the j-th horizontal segment if and only if xi and yj are adjacent in Q. Therefore we have constructed a segment intersection model for Q. ,, Proposition 8.7.2 reduces the recognition of bipartite graphs having a segment intersection model to the recognition of intersection difference di-

8.7

233

Intersection Threshold Dimension 2

Figure 8.26: A segment intersection model and its intersection graph Q. The adjacency matrix A of Q is the intersection of adjacency matrices A1 and A2 of difference graphs.

Y4

Yl

Ys

Y2

Y3

y

Y2

Y4

Yl X1

Xl

X2

X3

X2

X3

y4

0

Y4

1

1

0

y4

0

1

0

ys

0

Y3

1

0

0

y3

1

1

0

y2

1

Y2

1

1

1

y2

1

1

1

yl

0

yl

1

1

0

yl

1

1

1

Xl

X2

X3

Xl

X2

X3

Xl

X2

A

X3

A1

A2

2-Threshold Graphs

234

mension 2 (or equivalently to the recognition of union difference dimension 2 of the bipartite complement). In turn, this problem is reduced by Theorem 8.6.12 to recognizing poset dimension 2. However, we must also take into account the domination relations between segment bases imposed in the first stage by the intersections of segments with boxes. We partition the sets I(= and h'y of segment bases on the x and y axes into equivalence classes called windows. Two elements of I(~ or two elements of I(y are in the same window when they intersect the same boxes. After the first stage we are free to exchange the positions of segment bases only if they are in the same window; if 81,82 E I(= are in different windows, then the first stage has imposed a definite order relation between them, say sl -4 s: (because N(Sl)M B D N(s2) M B). As the proof of Proposition 8.7.2 shows, the segments are induced by the ordering of the roots, whereas the roots on each axis are ordered by the neighborhood containment relations in one of the two difference graphs. We can therefore formulate the problem of finding a segment intersection model for the bipartite subgraph Q = (S=, Sy; E) of G induced by the segment vertices as follows. The color classses S= and Sy are partitioned into windows X0,... ,Xp and Y0,..., Yq, respectively (corresponding to the partitions of the segment bases into windows). We seek difference graphs HI a n d / / 2 with color classes S=, 5'y such that H1 ('1/-/2 = Q and such that

Va C Xi, b E Xi+l,

NH, (a) D___NH, (b)

Va C Y/, b C Y/+I, NH2(a)D___NH2(b). Equivalently we can consider the bipartite complement Q - (S=, Sy; E) of Q and seek difference graphs H1 and H2 with color classes S=, Sy such that H1 U/-/2 - Q and such that

Va e X,, b e X,+,,

Va C

b C Y,+,,

NT, ' (a) C_ NT, ' (l,)

(a) C_

(t,).

Construct a new bipartite graph Q' - (S= u U, S v u V; E U Eu U Ev), where U = {ua,...,uq} and V = {v~,...,vp} are sets of new vertices and

Eu = { u ~ y : y C Y j , i < _ j } Ev

=

{vix:xCXj,i a l > ' ' ' > am > 0 such that x i x j is an edge of G1 if and only if ai q- aj > S. We may assume that S - al >_ 2 (otherwise multiply all these integers by 2), hence we can insert an integer T such that S > T > a l > ' " > am > O. We now continue the construction to obtain the bj's. Let G2 be the graph induced by Y. G2 also is a threshold graph, hence there exists a permutation r of { 1 , . . . , n} such that each yr(j) is either an isolated vertex or a dominating vertex in the subgraph induced by y~(~),..., yr(j). The initial step of this part of the construction is as follows. If y~(1) is not adjacent to any xi, set S " - S + 2,

T " - T + 2,

ai " - ai q- 1 for all i,

br(1) "- 1,

and if yr(1) is adjacent to X l , . . . ,Xj and non-adjacent to x j + l , . . . ,x,., then do not change S, T and the ai, and put br T - aj. We continue by constructing br b,(~). Note that we may assume for each j that v ( 1 ) , . . . , r ( j ) are consecutive indices among 1 , . . . , n. Therefore at the general step of this part of the construction, we have constructed thresholds S, T and weights a l , . . . , a m , bk+l,...,bs-1, and must now construct either bk, where the next vertex yk to be introduced is adjacent to each of y k + l , . . . , y~-l, or b~, where the next vertex y~ to be introduced is adjacent to none of them. C a s e 1: yk is introduced. It is adjacent to X l , . . . , x i and non-adjacent to Z i + l , . . . , Zm for some i, 0 _ T

(9.6)

4(bk+l - ai) - 1 < ;- S - T + 1

>bk+l-a~.

(9.22)

250

The Dilworth Number

Inequality (9.22) is a relaxation of (9.21), and by repeating these steps as necessary, we eventually find a suitable value for b~. In the extreme case i = m, the argument is the same with am+~ = 0. In the extreme case i = 0, the argument is even easier, since inequality (9.16) drops and there is no lower bound for b~ except for the strict lower bound 0. Therefore we change the existing weights and threshold as follows: aj bj b~ S T

"""-

aj + 1 bj -]- 1 1 S+2 T+2.

j- 1,...,m j-k+l,...,s-1

This completes the construction in all cases. 9 The next Theorem follows directly from Lemma 9.2.3 and Lemma 9.2.4. T h e o r e m 9.2.5 A graph is a TS graph if and only if its Dilworth number is at most 2. C o r o l l a r y 9.2.6 Every TS graph is a comparability graph. P r o o f . Let G - (V, E) be a graph of Dilworth number at most 2 with its vertex set partitioned into two sets X - { x ~ , . . . , x m } and Y - { y ~ , . . . , y ~ } such that x l ~- " . ~ xm and yl ~ "'" ~- y~. Orient each edge ab as (a, b) if a - xi, b - yj for s o m e i , j , o r a - x i , b - xj w i t h i < j , o r a yi, b - yj with i < j. The resulting orientation is transitive. 9 9.2.2

Forbidden

Subgraphs

By the Dilworth Theorem, a graph has Dilworth number at most 2 if and only if it does not contain an antichain of size 3. Thus by obtaining all the minimal graphs with an antichain of size 3, we get a characterization of TS graphs by forbidden subgraphs as stated below. A proof of the next theorem can be found in [BHd85b]. T h e o r e m 9.2.7 A graph G is a TS graph if and only if it does not contain any of the 21 graphs listed in Figure 9.2 as induced subgraphs. Using Theorem 9.2.7, one can characterize the split graphs of Dilworth number 2 as follows.

9.2

251

G r a p h s of D i l w o r t h N u m b e r 2

Figure 9.2: The forbidden subgraphs for TS graphs; G2i+l - G2i, i - 1,..., 10.

_

i I

G1

G2

G4

G6

I I I

X>

alo

G8

a12

. k/Z/_ v

v

w

Q14

v

w

G18

G16

w

v

w

w

G2o

v

252

The Dilworth Number

T h e o r e m 9.2.8 ([FH77a, B H d 8 5 b ] ) For each graph G, the following ditions

con-

are equivalent:

1. G and G are interval graphs; 2. G is a split TS graph; 3. G contains as induced subgraphs neither 2K2, C4, C5, nor Gs, Gg, G14, G15 of Figure 9.2. Proof. 1) =~ 3)" As we have mentioned after Definition 1.4.1, G is an interval graph if and only if G is triangulated and G is a comparability graph. Since G and G are interval graphs, G contains neither C4, C5 (being a triangulated graph) nor Gs, G14 (being a comparability graph) nor their complements 2K2, Gg,

G15. 3) =~ 2)" Since G does not contain 2K2, C4, C5, it is a split graph by Theorem 5.2.1. For the same reason, the only induced subgraphs of Figure 9.2 that G might possibly contain are Gs, Gg, G14, G15. But since by assumption G does not contain them, it is a TS graph by Theorem 9.2.7. 2) =~ 1)" Since G is a split TS graph, it is both triangulated and a comparability graph (by Corollary 9.2.6). But G is also split, and it is a TS graph by Theorem 9.2.5. So G is also triangulated and a comparability graph. Thus G and G are triangulated and cocomparability graphs, and therefore both are interval graphs by the above-mentioned characterization of interval graphs.

9.3 T h e D i l w o r t h N u m b e r and P e r f e c t G r a p h s In this section we examine the relationship between the perfectness of a graph and its Dilworth number. A graph is called minimally imperfect if it is not perfect, but every proper induced subgraph is perfect. L e m m a 9.3.1 If G is a minimally imperfect graph, then

D(G) -IV(G)I. Proof. The proof amounts to showing that V(G) is an antichain. Assume that, if possible, V(G) is not an antichain, and let x ~ Y- If x and y are

9.3 T h e D i l w o r t h N u m b e r and Perfect Graphs

253

not adjacent, then color the perfect subgraph G - {x} with w(G) colors, and then assign to x the color of y, proving that x(G) - c0(G), a contradiction to minimal imperfectness. If x and y are adjacent, then they are not adjacent in G, which is also minimally imperfect [Lov72], and for which y ~ x, so the previous argument applies. Therefore V(G) is an antichain, as required. 9 C o r o l l a r y 9.3.2 ([Pay83]) If D(G) deg(E). By the incomparability, there exist vertices B and D such that A B and E D are edges and AD and E B are nonedges. Since deg(A) > deg(E), there e x i s t s a vertex C such t h a t AC is an edge and E C is a nonedge. Thus G contains the configuration .T'(A,B, C , D , E ) and is not matrogenic. An example of a non-matrogenic BT graph is illustrated in Figure 10.2. 9 Figure 10.2: A non-matrogenic BT graph.

I I I I The following two results for BT graphs generalize similar results for threshold graphs.

Proposition 10.2.3 The complement of a B T graph is BT. Proof. Complementation does not alter vertex incomparabilities or equality of degrees. 9

Theorem 10.2.4 B T is a forcible property of a degree sequence. In other words, if G and H are graphs with the same degree sequence and G is BT, then H is also BT. We may thus unambiguously define a degree sequence to be B T if (all) its realizations are B T graphs.

Box-Threshold Graphs

260

P r o o f . Since there is a degree-preserving bijection between the vertices of G and H, we may assume that G and H have the same vertices and that each vertex has equal degrees in G and H. Then by Corollary 3.1.11, H can be obtained from G by a sequence of transfers, where a transfer involves dropping the two edges xu and zy and adding the two nonedges x z and yu of an alternating 4-cycle. We may therefore assume that H is obtained from G by a single transfer as above. Moreover, by the symmetry of the four vertices of the alternating 4-cycle, it is enough to show that if x II P in H, where p is a fifth vertex, then deg(x) = deg(p). Since x II p in H, x and p are opposite corners of an alternating 4-cycle A in H. If neither x u nor x z is a side of A, then A is also an alternating 4-cycle in G and the conclusion follows since G is BT. If both o f x u and x z are sides of A, then x II Y II p i n G, and hence deg(x) = deg(y) = deg(p). If xu is a side of A and x z is not, then the configuration in G and H is as illustrated in Figure 10.3, where the vertices of A are x, u, p, q. Figure 10.3" Configurations in the proof of Theorem 10.2.4. q

x

z

q

x

z

T

9

9

9

i I I

I I I

I I I

I I I

I ,k v

v

p

u G

y

I

I

i

9

~k

.L

v

v

v

p

u

y

H

If pz is an edge, then x I] P in G; otherwise x II Y It P in G. Therefore the conclusion follows in both cases as before. Finally, if xz is a side of A and x u is not, we repeat the argument with pu replacing pz. 9 The next theorem gives an easy test of the BT property. To state it, we use the following definitions. D e f i n i t i o n 10.2.5 A bipartite graph with bipartition ( X , Y ) is said to be c o l o r - r e g u l a r ( C R ) if all the vertices of X have the same degree and all the vertices of Y have the same degree. Two disjoint sets X and Y of vertices in a graph G are said to be c o m p l e t e l y c o n n e c t e d , C R c o n n e c t e d , or

10.2

Elementary Properties

261

c o m p l e t e l y d i s c o n n e c t e d when the edges joining them in G form a com-

plete, a CR, or an edgeless bipartite graph, respectively. A similar definition applies in the case of a connection of X to itself, where the subgraph induced by X should be complete, regular, or edgeless, respectively. Note that complete connection and complete disconnection are special cases of CR connection. D e f i n i t i o n 10.2.6 Assume that a graph G has the degree sequence

d? i.e., exactly m i vertices have degree di, with d l > of degree di constitute the i - t h b o x Bi of G.

"'" > d~. The rni vertices

is B T if and only if for each subscript i there is some k(i) such that Bi is completely connected to B I , . . . ,Bk(i)_l, Ct~ connected to Bk(i), and completely disconnected from

Theorem

10.2.7 A graph with boxes B I , . . . , B ~

Bk(0+l,. 9 9 B~. P r o o f . " O n l y if"" Recall that N(x) denotes the set of neighbors of a vertex x. For every vertex x C B~, let k(z) be the largest k such that N ( z ) N Bk r ~. Then by the BT property (10.2), {x} is completely connected to B 1 , . . . , Bk(x)-I and completely disconnected from Bk(x)+l,..., B~. Moreover, if y is any other vertex of B~, then since deg(z) = deg(y), we must have k(x) = k(y) and IN(x) N Bk(~)l = IN(y)N Bk(y)l. This means, first, that k is a function of i alone and not of the particular x in Bi, and may be denoted by k(i); and, second, that for j = k(i) and trivially for each j 5r k(i) all the vertices x E Bi have the same IN(x)N Bj]. In particular, all the vertices x C Bk(~) have the same IN(z)N Bil. Therefore Bi and Bk(~) are CR connected. " I f " : Let x II Y for some x C B~, y E Bj. We have to show that i = j. First of all, k(i) = k(j), for if, say, k(i) > k(j), then N(y) C_ N(z), contradicting non-comparability. Let us denote by p the common value of k(i) and k(j). The only way that x and y can be non-comparable is for each of them to have a neighbor in Bp that is not a neighbor of the other. Therefore Bi and Bj are neither completely connected to Bp nor completely disconnected from Bp. This implies that k(p) is equal to both i and j. " Note that a BT graph is a threshold graph if and only if every two boxes are either completely connected or completely disconnected. This means that


262

the adjacency matrix is equal to the corrected Ferrers diagram of the degree sequence. Deleting a vertex of degree 1 from the graph of Figure 10.2 results in a non-BT graph. This shows that BT is not hereditary, and hence cannot be characterized by forbidden configurations. However, the next theorem shows that the BT property is preserved when an entire box is deleted from the graph. The results of Section 10.4 can be interpreted as a characterization of the BT property in terms of forbidden configurations of boxes. T h e o r e m 10.2.8 If G is BT and H is obtained from G by deleting an entire

box, then H is also BT. P r o o f . let the vertices of the deleted box have degree d in G. Assume that, if possible, H is not BT. Then it has vertices x, y, z such that degu(x ) > degH(y), yet z is adjacent to y but not to x. Since G has the same configuration and is BT, dega(x ) _< dega(y). Hence d e g a ( y ) - degH(y) > dega(x ) - d e g u ( x ) , which means that in G, Y has more neighbors of degree d than x has. By Theorem 10.2.7, y is adjacent in G and H at least to all vertices u such that dega(u) > d, whereas x is adjacent in H at most to these vertices. Hence degu(y ) > degH(x), which is a contradiction. ..

10.3

A Transportation M o d e l

Theorem 10.2.4 raises the question of characterizing BT sequences without constructing an arbitrary realization and testing it for the BT property. Here we give such a characterization, which does not even assume that the sequence is realizable. A transportation network consists of suppliers A1,...,A~ with supplies al,. 9 a~ of some commodity, consumers B 1 , . . . , B~ with demands b l , . . . , b~, and capacities c~j of the route from A~ to Bj. A flow is a matrix (f~j) such that 0 deg a y and show that z ~ y in G. Some third vertex z is adjacent to z but not to y. We assert that for each vertex w of G, i f t ~ C B, then wz C B. Indeed, 22 C B U R and 2~ ~ B. If zx C B, then t~2 C B by Condition 2 as asserted. If 22 C R, then 29 ~ R by Condition 3, i.e., 2~ is not an edge of F, hence xw C B U R by Condition 1. But t~ 7~ 2 because t ~ is an edge of F and 2~ is not, hence xw ~ R by Condition 3, and therefore 2t~ C B as asserted. Returning to the proof that x ~- y, we assume that, if possible, w r x, y, wy is an edge of G and wx is not. Then t~2 ~ B, so t ~ ~ R by the assertion, hence wy C R. Therefore t~2 ~ R by Condition 3, i.e., t~2 is not an edge of F. Since zx is an edge of F, we have 2 :/- t~, and since wy C R, we have zy ~ R by Condition 3. Hence zy C B by Condition 1, and this contradicts the fact that z is not adjacent to y in G. " T h e o r e m 10.4.4 Let F be a graph with loops whose edges are colored black or red. Then F with its colors is the frame of some B T graph if and only if 1. F is a threshold graph with loops; 2. the black edges of F form a threshold graph with loops; 3. the red edges of F form a matching with loops; ~. every two vertices of F differ in their number of incident black edges or in their number of incident red edges. We say that the vertices differ in their black degree or their red degree, where a loop contributes 1 to the degree. Moreover, in that case no three vertices of F can have the same black degree.

268


P r o o f . " O n l y if"" Assume that F is the frame of a BT graph G. By Theorem 10.4.3, Conditions 1 through 3 hold. To prove Condition 4, assume that ~ and ~ are distinct vertices of F, say deg a z > deg a y, and that ~ and have the same black degree. By Condition 2, for every z, z z E B if and only if ~2 E B. But some z is adjacent to x and not to y in G, hence zy ~ B and 22 E R. By Condition 3, ~,~ ~ R, i.e., 29 is not an edge of F. We assert that no red edge is incident with 9, which shows that 2 and 9 differ in their red degrees. Indeed, if @~ E R, then w x ~ R by Condition 3 and @2 ~ B by the above, and this contradicts Condition 1, thus proving t h e assertion and Condition 4. Moreover, if 2 and r have the same black degree, then by Conditions 3 and 4, their red degrees differ by exactly 1, and so no three vertices can have the same black degree. "If"" We construct a BT graph G whose frame is F. For each vertex v of F, let G have four vertices v0, vl, v2, v3. For each black edge v w E t3 of F with v 7~ w, join each vi to each wj. For each red edge v w E R of F with v -~ w, join each vi to wi and Wi+l (indices modulo 4). For each black loop vv E B of F, join each vi to each vj, i 7~ j . For each red loop vv E R of F, join each Vi to Vi+I and v i - , . This construction is illustrated in Figure 10.5. Note that each red edge or loop of F contributes 2 to the degrees of the vertices of G corresponding to its ends, a black loop contributes 3, and a black edge that is not a loop contributes 4. It is sufficient to show that F is the frame of G, for this implies that G is BT by Theorem 10.4.3. Clearly for each vertex v of F, all the vi have the same degree in G. Therefore it only remains to show that if v and w are distinct vertices of F, then deg a v0 =fi dega w0. If v and w have the same black degree, then by Condition 2, either both have a black loop or both do not, so the black edges incident to v and w contribute equally to the degrees of v0 and w0. Also by Condition 4, v and w have different red degrees, hence deg a v0 r deg a w0 as required. Assume now that the black degree of v is larger than the black degree of w. Then by Condition 2, for each vertex x of F, z w E B implies x v E B , but not conversely. Therefore the black edges contribute more to deg a v0 than to dega w0. We now prove that for each vertex y of F, y w E R implies yv E R U B , which shows that deg C v0 > dega w0 as required. Assume that, if possible, y w E R and yv ~ R U B. Let x be a vertex of F such that x v E B and x w ~ B. Then x-~ y (sinceyv ~ B ) , hence by Condition 3, x w ~ R, i.e., x w is not an edge of F. This contradicts Condition 1. .. The construction used in the "if" part of the proof of Theorem 10.4.4 can be generalized to yield all the degree sequences of BT graphs with a given frame.

10.4

Frames of BT Graphs

269

Figure 10.5: The construction of a BT graph with a given frame.

vo . . . . ~

wo

V0

W0

Vl

Wl

-

Wl

V2 ~

W2

.,.

W2

V3 -

. W3

?-)3

.,.

vwEB

W3

vwER

V2

"

-

131

V2

"

V3

...

-

VO

Y3

.

vv E B

~

Vl

-

vv E R

VO

Chapter 11 Matroidal and Matrogenic Graphs 11.1

Introduction

We have seen in Theorem 1.2.4 that a graph G - (V, E) is threshold if and only if G does not contain any alternating 4-cycle. Therefore, if we call a set of edges I C_ E "independent" when no two edges of I induce an alternating 4-cycle, then a spanning subgraph (V, I) is a threshold graph if and only if I is independent. Obviously the independent subsets of E form an independence system, i.e., a subset of an independent set must be independent. The circuits of this independence system, namely the minimal dependent sets of edges, are the sets of two edges whose endpoints induce an alternating 4-cycle in G. We call these sets couples. We are interested here in those graphs for which the above independence system is a matroid. As is well-known, one definition of a matroid is an independence system such that if C and C ~ are distinct circuits and e C C N C', then C U C ' - { e } is dependent. In the present situation, where every circuit has cardinality 2, this condition simply reads as follows" if a, b, c are distinct edges and {a, b}, {b, c} are couples, then {a, c} is a couple.

(11.1)

An equivalent condition is the following. Define a reflexive and symmetric binary relation R on E by having aRb if and only if a = b or {a,b} is a couple in G. Then our independence system is a matroid if and only if R is transitive. In that case we say that G is a matroidal graph. Although the 271

272

Matroidal and Matrogenic Graphs

corresponding matroids are uninteresting, the matroidal graphs themselves have a nice structure, which permits us to find their threshold dimension and show that they are perfect graphs. Peled [Pe177] introduced these graphs, characterized them by two forbidden configurations, the chordless pentagon and configuration ~ of Definition 11.2.1, and found their structure. We present his results in the next section. FSldes and Hammer [FH78b] introduced a variation of the above concept. Instead of an independence system on the edge set E, they considered an independence system on the vertex set V whose circuits, i.e., minimal dependent sets, are the sets of four vertices that induce an alternating 4-cycle of G. If this independence system is a matroid, the graph G is said to be matrogenic. The matrogenic graphs are characterized by forbidding configuration 9r alone. Furthermore, a matrogenic graph can have at most one chordless pentagon, and hence the structure of matrogenic graphs is almost the same as that of matroidal graphs, even though the two are defined in an apparently different way. We characterize the matrogenic graphs and present their structure Section 11.3. In Section 11.4 we present the work of Marchioro et al. [MMPS84] that characterizes the degree sequences of matrogenic graphs. These results were also obtained independently by Tyshkevich [Tys84].

11..2

Matroidal Graphs

11.2.1

Forbidden Configurations

We begin our study of the matroidal graphs by characterizing them by forbidden configurations. Definition 11.2.1 A configuration ~ consists of vertices A, B, C, D, E such that A is adjacent to B, C but not to D, whereas E is adjacent to D but not to B, C. As with all configurations, unspecified edges (in this case among B, C, D and between A and E) may or may not exists. See Figure 11.1. We denote the fact that A, B, C, D, E generate a configuration jz in this order by 5(A,B,C,D,E).

T h e o r e m 11.2.2 A graph is matroidal if and only if it contains neither the configuration ~" nor a chordless pentagon.

11.2

M a t r o i d a l Graphs

273

Figure 11.1: The forbidden configuration $'(A, B, C, D, E). A

B

E

P r o o f . The "only if" part is immediate: if .T(A, B, C, D, E), then the edge D E forms a couple with both A B and AC, which do not form a couple with each other, contradicting transitivity of the binary relation R; similarly each edge of a chordless pentagon forms a couple with two adjacent edges. To prove the "if" part, assume that R is not transitive, so that there exist edges a, b, c satisfying aRb, bRc, but not aRc. It follows that a, b, c are distinct edges having a total of five or six endpoints, according as a and c have a common endpoint or not.

Five endpoints. Put a = AB, b = CD, c = B E with AC and B D nonedges (Figure ll.2(a)). If E C is a nonedge, we have ~ ( B , A, E, D, C), so assume that E C is an edge. Since b, c form a couple, B C and D E are nonedges (Figure ll.2(b)). Now if AD is a nonedge, then JZ(B, A , E , C, D), and if AD is an edge, then A, B, E, C, D induce a chordless pentagon if A E is a nonedge and $'(A, B, E, C, D) otherwise. Six endpoints. Put a = AB, b = CD, c = E F with AC, BD, CE, D F nonedges (Figure 11.3). Since a r c does not hold, some endpoint of a is adjacent to some endpoint of c. By symmetry we may assume that B E is an edge, and then we have .T(B, A, E, D, C). 9

C o r o l l a r y 11.2.3 The complement of a matroidal graph is matroidal.

P r o o f . This follows immediately from the fact that 5 and the chordless pentagon are self-complementary. 9

274


Figure 11.2" Configurations in the proof of Theorem 11.2.2.

c

A a

AT

b

B

a

D

/

B

---

c

b D

,,~,

E

E

(~)

(b)

Figure 11.3" A configuration in the proof of Theorem 11.2.2.

j

A

C

//

B

/%

/

D

/

/

/

/

/

/

c

11.2.2

F

P4-vertices

In this subsection we describe the s t r u c t u r e of those m a t r o i d M graphs for which every vertex belongs to an induced P4 (a p a t h on 4 vertices). 1 1 . 2 . 4 Let vertices A1, A2, B1, B2 induce a P4 with edges Al131, BIB2, B2A2 (Figure 11.~). We then say that A1,A2, B1,B2 i n d u c e a P4 in t h i s o r d e r and denote this fact by 7 9 4 ( A ~ , A 2 , B~,B2). We call the A~ the e x t e r n a l v e r t i c e s and the Bi the i n t e r n a l v e r t i c e s of this P4, and similarly call the edges AiBi the e x t e r n a l e d g e s and the edge B1B2 the i n t e r n a l e d g e of the P4. Definition

1 1 . 2 . 5 Assume that ~P4(A1,A2, B1,B2) in a matroidal graph G, and let X be a fifth vertex of G. If X is adjacent to A1, then X is adjacent to A2, B1 and B2. If X is adjacent to B1, then X is adjacent to B2.

Lemma

11.2

Matroidal Graphs

275

Figure 11.4: P4(A1, As, B1, B2). B1 "

i

"" B2

\

/ \

/ x

/ /

"

\ \ \

A1

A2

P r o o f . If X is adjacent to A1 but not to A2, then we have the configuration S'(A1, B I , X , B2, A2). If X is adjacent to A1 and A2 but not to B1, then in case X is not adjacent to B2 the five vertices induce a chordless pentagon, and in case it is we have ~ ( X , A2, B2, B1,A1). Finally, if X is adjacent to B1 but not to A1, A2, B2, then we have JZ(B1,A1,X, A2, B2). " 1 1 . 2 . 6 In a matroidal graph, if edges a and b form a couple of the form P4 and edges b and c form a couple, then the latter couple is also of the Lemma

P r o o f . We may obviously assume that a, b and c are distinct edges. Assume that P4(A1,A2, B1, B2) with a = AIBI and b = A2B2. Since c forms a couple with each of a and b, there are a fifth vertex X and a sixth vertex Y with c = X Y . If the couple bc is of the form 2K2, then by L e m m a 11.2.5 X and Y are not adjacent to B1, and hence X Y forms a couple with BIB2, contradicting transitivity. If the couple bc is of the form C4, then by L e m m a 11.2.5 X and Y are both adjacent to A2 and B2, a contradiction. " C o r o l l a r y 1 1 . 2 . 7 Let R' be the binary relation defined on the edges of a

graph G so that e R ' f if and only if e = f or e and f form a couple of the form P4. If G is matroidal, then R' is an equivalence relation. The next l e m m a says that if a vertex in a matroidal graph belongs to some P4's, then it is either external in all of them or internal in all of them, and we may call it an external vertex or an internal vertex unambiguously. The same applies to edges too. Lemma

1 1 . 2 . 8 A vertex or an edge in a matroidal graph cannot be internal

in one P4 and external in another P4.

276


The result for edges follows from the one for vertices, which we prove below. Assume that P4(A1,A2, B1,B2), and assume that, if possible, A1 is an internal vertex in another P4. Then A1 must have neighbors in the new P4, which by Lemma 11.2.5 are also adjacent to B1 and A2. It follows that B1 cannot belong to the new P4, so A1, having degree 2 in the new P4, must have two new neighbors X and Y in it. But one of X and Y is external in the new P4, and since it is adjacent to A2, A2 is also adjacent to the internal vertex A1 of the new P4 by Lemma 11.2.5 applied to the new P4. This is a contradiction.

L e m m a 11.2.9 In a matroidal graph, all the internal vertices form a clique and all the external vertices form a stable set. P r o o f . We prove the result for the internal vertices, and the result for the external vertices then follows by Corollary 11.2.3. If BIB2 and B2B3 are two internal edges, then B1 is adjacent to B3 by Lemma 11.2.5. Now assume that B1B2 and B3B4 are two internal edges and show that the B i form a clique. If some of B1, B2 are adjacent to some of B3, B4, the result follows from Lemma 11.2.5. If not, let A1 be an external vertex adjacent to B1. By Lemma 11.2.5 A1 is not adjacent to B3 and B4, and therefore B3B4 forms couple with each of B1A1 and BIB2, contradicting the transitivity of R. 9 In order to describe the structure of the equivalence classes of the binary relation R ~ of Corollary 11.2.7, we introduce the following terminology. Definition 11.2.10 Let A 1 , . . . , A p , B 1 , . . . , B p be 2p distinct vertices for some p > 2. We say that these vertices induce (in this order) a net (resp. a n e t - c o m p l e m e n t ) of o r d e r p and denote this fact by A f ( A ~ , . . . , Ap, B 1 , . . . , Bp) (resp. A T ( A ~ , . . . , Ap, B 1 , . . . , B p ) )

if all the Ai form a stable set, all the Bi form a clique, and each A~ is adjacent to Bi and to no other Bj (resp. each Ai is adjacent to all the Bj except for Bi). Figure 11.5 displays a net and a net complement of order 3. If N ' ( A I , . . . , A;, B I , . . . , Bp) holds in G, then ~?(B1,...,/3p, A1, . . . . Ap) holds in G, and conversely; and if 794(A1, A2,/~1~/~2), then Af(A1, A2,/31,/32) and A?(A1, A2,/~2,/~1 ). L e m m a 11.2.11 In a matroidal graph, two or more edges forming an equivalence class under the binary relation R' of Corollary 11.2.7 induce a net.

11.2

Matroidal Graphs

277

Figure 11.5: A net and a net-complement of order 3. A1

A1

B

2

_

A2

A3 (a) a net

A2

B1

A3

(b) a net-complement

P r o o f . Let A 1 B 1 , . . . , A p B ; be the distinct edges of the equivalence class with the Ai external and the Bi internal vertices. Clearly if Ai = Aj, then Bi = Bj and as the p edges are distinct, we have i = j. Similarly Bi = Bj implies i = j. Also by Lemma 11.2.8 A~ r Bj for all i,j. Thus we have 2p distinct vertices. By Lemma 11.2.9 the Ai form a stable set and the Bi form a clique. Finally for i -r j, A~ is not adjacent to Bj since we have 794( Ai, Aj, Bi , Bj ). .. Although the equivalence classes of R' are edge-disjoint, they may share vertices. The next lemma is crucial in determining the situation in these cases. L e m m a 11.2.12 In a matroidal graph G, let E and E ~ be distinct equivalence classes under the binary relation R' of Corollary 11.2.7, each having two or more edges. Assume that some vertex is an endpoint of an edge of E and of an edge of E'. Then each of E and E' has exactly two edges and these four edges induce in G a net-complement of order 3. P r o o f . Let us first consider the case that the common vertex is internal. Let 7)4(A1,A2, B1,B2) where AIB1 and A2B2 belong to E, and let the edge XB1 belong to E'. Then X must be external, since it follows from Lemma 11.2.5 that internal edges do not belong to couples, and so X -r B2 by Lemma 11.2.8. Also X r A1 since E -r E', and X :/- A2 since A2 is not adjacent to B1. Therefore X is a fifth vertex, which is adjacent to B2 by Lemma 11.2.5, but not to A1 or A2 by Lemma 11.2.9. By assumption ~P4(X, Y, B~, Z) for some

278


vertices Y, Z. By Lemma 11.2.8 Z -~ X, A1, A2 and by definition of a couple Z -~ B1, B2. Thus Z is a sixth vertex, which is adjacent to B1 and B2 by Lemma 11.2.9 but not to X by definition of a P4 (see Figure 11.6). Figure 11.6" A configuration in the proof of Lemma 11.2.12.

al

B1

B2

A2

A

A /

/ \

/ /

u

We assert that Y = A2. Clearly Y r Z, B1, B2 by Lemma 11.2.8; Y -~ A1 since A1 is adjacent to B1 and Y is not; and Y -J= X. Therefore if Y -r A2, then Y is a seventh vertex. In the latter case, since YZ forms a couple with XB1, it cannot form a couple with XB2 by transitivity, and therefore Y must be adjacent to B2. But then Y is adjacent to B1 by Lemma 11.2.5, contrary to the assumption that YZ forms a couple with XB1. This proves the assertion Y = A2, from which it follows that Z is adjacent to A2 and thus also to A1 by Lemma 11.2.5. We have therefore shown that ~(A1,A2, X, B2,B1,Z). Moreover, since A2 was an arbitrary external vertex other than A1 belonging to an edge of E, it follows from the assertion that E has only two edges A1B1 and A2B2. By symmetry E' also has only two edges XB1 and A2Z, and the lemma is proved in this case. We now examine the other possibility, that the edges of E and E' share an external vertex but not an internal one. We show that this is impossible. Let 7)4(A1, A2, B1, B2) where A1B1 and A2B2 belong to E, and let the edge XA1 belong to E'. Then X must be internal, and therefore by assumption X -r A2, B1,B2. By Lemma 11.2.5 X is adjacent to A2, B1,B2. By assumption E' contains another edge that forms a P4 with XA1. This edge cannot contain B1 or B2 by assumption and it cannot be of the form A2Y, since Lemma 11.2.5 would imply that Y is adjacent to A1, contrary to the definition of P4. Thus this edge is of the form YZ, where Y is an internal sixth

11.2


279

vertex and Z is an external seventh vertex (see Figure 11.7). We now apply Lemma 11.2.5 several times to the two P4's. Since B1 is adjacent to A1, it is adjacent to Z. Since Z is adjacent to BI, it is adjacent to B2. Since B2 is adjacent to Z, it is adjacent to A1. But this contradicts the definition of P4.

Figure 11.7: A configuration in the proof of Lemma 11.2.12. z Y x v

A1

B1

w

B2

v

A2

In order to generalize Lemma 11.2.12 to more than two equivalence classes, let us consider a new abstract graph G whose vertices are the equivalence classes of two or more edges under R'. By Lemma 11.2.11, the edges of G that belong to a single vertex of G induce a net in G. Two vertices of G shall be adjacent when the corresponding two nets of G have vertices in common. L e m m a 11.2.13 For a matroidal graph G, let X l , . . . , 2 q be a sequence of

vertices of G such that X~ is adjacent to some of X 1 , . . . , X~-I for each i 2 , . . . , q, and q >_ 2. Then each )[i contains exactly two edges of G and these 2q edges induce a net-complement in G. P r o o f . The proof is by induction on q. The basis q - 2 is the content of Lemma 11.2.12. Assume that q > 2 and that the lemma holds for q - 1 . Then each of J ( 1 , . . . , Xi-1 contains exactly two edges of G and these 2q - 2 edges induce in G a net-complement N ' ( A ~ , . . . , Ap, B 1 , . . . , Bp). By assumption Xq is adjacent to 2 i for some i - 1 , . . . , q - 1. Hence by Lemma 11.2.12 J~q contains exactly two edges of G, which induce in G a net N(Q, T, R, S), and Xi contains exactly two edges of G, which induce in G a net N(A~, A~, B~, B~). It further follows from Lemma 11.2.12 that these two nets share exactly one internal vertex and exactly one external vertex, and their six distinct vertices induce in G a net-complement of order 3. Hence by renaming Q, T, R, S if ,v

280


necessary we may assume that Q = A~ and S = B~, as illustrated in Figure 11.8. Figure 11.8: A net-complement in the proof of Lemma 11.2.13.

Q=Ar

R

w

As

8

v

S = Br

v

T

Clearly T is distinct from B I , . . . , Bp and R is distinct from A I , . . . , Ap. Assume that T is one of A I , . . . ,Ap, say T = At. Then R -r Bs because B~ is adjacent to T and R is not. If R r Bt, then the fact that R is adjacent to A~ but not to At contradicts Lemma 11.2.5 as applied to 7)4(As, At, Bt, B~). Therefore R - Bt and the vertices of G in ) ( 1 , . . . , 32q are just A I , . . . , Ap, B 1 , . . . , Bp, which induce a net-complement of order p in G, as was to be proved. Now assume that T is distinct from A 1 , . . . , Ap. Then R is distinct from A I , . . . , Ap, because by Lemma 11.2.5 R = Bt would imply that R is adjacent to T. Then for each t -r s we can apply Lemma 11.2.5 to P4(At, A~, B~, Bt) to conclude that T is adjacent to Bt but not to At and R is adjacent to both At and Bt. Thus we have a net-complement of order p + 1, ~?'(A1,..., Ap, T, B 1 , . . . , Bp, R), as was to be proved, tt C o r o l l a r y 11.2.14 For a matroidal graph G, the vertices belonging to a

single connected component of G induce in G a net or a net-complement. These nets and net-complements are vertex-disjoint. P r o o f . If a connected component C of G contains exactly one vertex of G, then the corresponding vertices of G induce in G a net by Lemma 11.2.11. If C contains two or more vertices of G, then these vertices can be arranged in a sequence satisfying the conditions of Lemma 11.2.13, and so the corresponding vertices of G induce a net-complement. Assume that C1 and C2

11.2


281

are connected components of G such that the corresponding nets or netcomplements N1 and N2 of G share a vertex X. Then X belongs to some P4 contained in N1, hence X is an endpoint of an edge of G belonging to a vertex X1 of G belonging to C1. Similarly X is an endpoint of an edge of G belonging to a vertex X2 of G belonging to C2. By definition of G, X1 is adjacent to 22, hence C1 - C2, hence N1 - N2. 9 Thus we have shown that in a matroidal graph G, the vertices that belong to P4's partition into disjoint sets, which we now call cells, such that each cell induces in G a net or a net-complement, and each P4 in G is induced by the vertices of a single cell. We number the cells as 1 , . . . , k and denote by E i (resp. P) the set of external (resp. internal) vertices of the i-th cell. We say that cell i dominates cell j, where i =/= j, when each vertex of E i is adjacent to each vertex of I j, and no vertex of E j is adjacent to any vertex of I i. L e m m a 11.2.15 The domination relation between the cells of a matroidal

graph is a linear order. P r o o f . By definition the domination relation is irreflexive and antisymmetric. We show that it is transitive and total. Totality. Consider any two cells, say cell 1 and cell 2, and let AiBi be any external edge of cell i, with Ai external and Bi internal (i = 1, 2). We assert that either A1 is adjacent to B2, or else A2 is adjacent to B1, but not both. If neither holds, then by Lemma 11.2.9 we have JP4(A1, A2, B1, B2), contrary to the fact that the vertices of each P4 are contained in a single cell. If both hold, then by Lemma 11.2.5 each vertex of E 1 is adjacent to each vertex of 12 and each vertex of E 2 is adjacent to each vertex of I 1. Let "]')4(Ai, A~, Bi, B~) in c e l l / - 1,2. Then we have 7'4(A1,A2, B;,B~), again contrary to the fact that the vertices of each P4 are contained in a single ceil. This proves the assertion, say A1 is adjacent to B2 and A2 is not adjacent to B1. Then it follows from Lemma 11.2.5 that cell 1 dominates cell 2. Transitivity. In view of the totality, it is enough to show that it is impossible for cell 1 to dominate cell 2, cell 2 to dominate cell 3, and cell 3 to dominate cell 1. Assume that, if possible, this is so, and let A1 E E 1, B2 C 12 , Aa E E 3, and B3 E 13, with A3 adjacent to B3. By Lemma 11.2.9 B2 is adjacent to B3 and A1 is not adjacent to A3. By our assumption we have 7?4(A1, A3, B2, B3), contrary to the fact that the vertices of each P4 are contained in a single cell. 9

282


Lemma 11.2.15 allows us to renumber the cells so that cell i dominates cell j if and only if i < j. This determines completely the structure of the subgraph induced by the edges of couples of the form P4 in a matroidal graph. We summarize the results of this subsection by the following theorem. T h e o r e m 11.2.16 In a matroidal graph G, the vertices belonging to couples of the form P4 partition into disjoint sets called cells. Each cell induces in G a net or a net-complement. All the internal vertices form a clique and all the external vertices form a stable set in G. The cells can be indexed so that the external vertices of cell i are adjacent to the internal vertices of cell j if and only if i < j. We call a matroidal graph all of whose vertices belong to couples of the form P4 a cell graph.

11.2.3

2I(2-vertices and C4-vertices

In this subsection we complete the description of those matroidal graphs for which every vertex belongs to some edge in some couple. In particular, we show that every vertex of such a graph belongs either only to couples inducing a P4, or only to couples inducing a 2K2, or only to couples inducing a 6'4.. Moreover, a matroidal graph cannot contain couples of the form 2K2 and 6'4 at the same time. As usual, by a perfect matching inK2 we mean a graph with 2m vertices, all of degree 1. Figure 11.9 displays a perfect matching 3K2 and its complement 3K2. Figure 11.9: A perfect matching and its complement.

3K2

3K2

11.2


283

T h e o r e m 11.2.17 Let G be a matroidal graph in which the set M of vertices belonging to couples of the form 2K2 is not empty. Then M induces a perfect matching in G and the vertices of G outside M partition into a clique C whose vertices are adjacent to every vertex of M and a stable set I whose vertices are adjacent to no vertex of M . P r o o f . Let edges A B and C D form a couple of the form 2K2, and let X be a fifth vertex. We observe that X is adjacent to all of A, B, C, D or to none of them. Indeed, if X is adjacent to A, then by the transitivity of R, X A cannot form a couple with CD, and therefore X is adjacent to both C and D (Figure 11.10). Figure 11.10: A configuration in the proof of Theorem 11.2.17. A

C \

/ \

X

/ x

/

\

/

B

D

A repetition of this argument shows that X is also adjacent to B. From this observation it follows that the binary relation "equals to or forms a couple of the form 2K2 with" is transitive, and thus an equivalence relation. By definition each equivalence class induces in G a perfect matching, and from the assumption of the theorem there is at least one equivalence class g with at least two edges A B and CD. Let M' be the set of endpoints of the edges of g. By the above observation the vertices of G outside M ~ partition into a set C of vertices adjacent to every vertex of M ~ and a set I of vertices adjacent to no vertex of M ~. Any two vertices X, Y of I are not adjacent, otherwise the edge X Y would belong to g. Every two vertices X, Y of C are adjacent, otherwise X A would form a couple with both Y C and Y D , contradicting transitivity. Thus I is a stable set and C is a clique. It follows that the vertices of M ~ belong only to couples formed from two edges of g, and that C U I induces a split graph, which has only couples of the form P4, if any. Therefore M ~ = M. 9


284

C o r o l l a r y 11.2.18 Under the conditions of Theorem 11.2.17, G has no couples of the form C4, C contains all the internal vertices and I contains all the external vertices of G. P r o o f . We have seen that G has no couples of the form C4. Let B1 be any internal vertex of G. Then we have 794(A1,A2, B1, B2) for some A1,A2, B2, and It follows from Lemma 11.2.5 that the edge B I ~ 2 does not belong to any couple of G. Therefore at least one of B1, B2 must be in C. If B1 is in I and B2 is in C, then A1 cannot be in C since it is non-adjacent to B2, and cannot be in I U M since it is adjacent to B1, a contradiction. Therefore B 1 must be in C and C contains all the internal vertices. It follows from this that A1 and A2 must be in I, so I contains all the external vertices. .. By taking the complement of G we obtain from Theorem 11.2.17 and Corollary 11.2.18 the following analogous results. T h e o r e m 11.2.19 Let G be a matroidal graph in which the set M of vertices belonging to couples of the form C4 is not empty. Then M induces a complement of a perfect matching in G and the vertices of G outside M partition into a clique C whose vertices are adjacent to every vertex of M and a stable set I whose vertices are adjacent to no vertex of M. C o r o l l a r y 11.2.20 Under the conditions of Theorem 11.2.19, G has no couples of the form 2K2, C contains all the internal vertices and I contains all the external vertices of G. The following summarizes the results of this subsection. D e f i n i t i o n 11.2.21 A matroidal graph G all of whose vertices belong to couples is called a s t r i c t m a t r o i d a l g r a p h . T h e o r e m 11.2.22 A graph G is strict matroidal if and only if the vertices of G partition into subsets M and S, M induces a perfect matching or a complement of a perfect matching, S induces a cell graph, and the vertices of M are adjacent to every internal vertex and no external vertex of S. 11.2.4

Threshold

Vertices

We now have to consider those vertices of a graph G that do n o t b e l o n g to any couples. We call these vertices threshold vertices, since they induce

11.2


285

a threshold subgraph of G. Recall the definition of the vicinal preorder on the vertices of G. We extend its definition to subsets of vertices so that X~Yifandonlyifx~yforallxCX, yEY. If G is matroidal, then as seen in the previous subsection, its vertices belonging to couples partition into sets M, I 1, E l , . . . , [k E k, where M is the set of vertices that belong to couples of the form 2K2 or C4, I t is the set of internal vertices of cell l, and E l is the set of external vertices of cell l. These vertices induce in G a strict matroidal graph S, relative to which

[k ~ ... ~_ I 1 ~ M ~ E 1 ~ "'" ~- E k.

(11.2)

The threshold vertices of G induce a threshold graph T, and they can be labeled as X 1 , . . . , Xt so that relative to T

X , ~ ... ~ Xt.

(11.3)

Moreover, (11.2) and (11.3) remain true relative to G, because they could only be violated if a vertex of T belonged to a couple of G (there is an exception in the case that Xi ~ Xj relative to T but Xi ~- Xj relative to G; in that case we adopt the convention that i < j). Thus the sets M , I ~ , E 1 , . . . , I k , E k, { X 1 } , . . . , {Xt} are totally ordered by ~ relative to G in a way that is a common refinement of both (11.2) and (11.3). Conversely, assume that S is a strict matroidal graph satisfying (11.2) and T is a threshold graph disjoint from S and satisfying (11.3). If S and T are connected by edges so that relative to the resulting graph G the sets M, I 1, E l , . . . , I k, E k, {X1},..., {Xt} are totally ordered by ~- in a way that is a common refinement of both (11.2) and (11.3), then G is matroidal and its threshold vertices are precisely the vertices of T. Indeed, the Xi are comparable with every vertex of G and hence do not belong to couples of G; the other vertices of G do belong to couples since S is strict; further, R is transitive on S by assumption and hence remains transitive on G. It remains to see how S and T are to be connected by edges so as to satisfy the above necessary and sufficient requirements. By Theorems 11.2.17 and 11.2.19, if a vertex of T is adjacent to some vertices of M, it is adjacent to all vertices of M. By Lemma 11.2.5, if a vertex of T is adjacent to some vertices of E t, it is adjacent to all vertices of E t t_J I t, and if it is adjacent to some vertices of I l, it is adjacent to all vertices of I t. It therefore follows that a vertex of T cannot be equivalent to a vertex of S' under the vicinal preorder. The remaining question is to which sets M, I 1, E l , . . . , I k, F_,k a given vertex of T should be adjacent.


286

We begin to consider this problem by assuming that S is a single cell, i.e.,M-Oandk-1. T h e o r e m 11.2.23 Let S be a cell with external vertices E and internal vertices I. Let T be a threshold graph disjoint from S, whose vertices X 1 , . . . , Xt satisfy (11.3) relative to T. If S and T are joined by edges so that the resulting graph G is matroidal and its threshold vertices are precisely X 1 , . . . , Xt, then the following conditions hold: 1. there is an index r - O , . . . , t such that { X r + l , . . . , X t } is a stable set, or equivalently such that X~+I is not adjacent to X~+2 (provided r + 2 - X j , where j is the smallest index other than r such that X j and X~ are not adjacent, and i is the largest index such that Xi and X~+I are adjacent (if i or j is not defined, the corresponding condition on s is omitted; Xo :'- XN is considered to be true); 3. each vertex of I is adjacent to X1, . . . , X~, but not to X r + l , . . . , X t / ~. each vertex of E is adjacent to X1, . . . , X~, but not to X s + l , . . . , X t. Conversely, r and s can always be chosen to satisfy 1 and 2, and if S and T are connected according to 3 and ~, then the resulting graph G is matroidal and its threshold vertices are precisely X 1 , . . . , Xt. P r o o f . Assume that G is matroidal and its threshold vertices are X1, . . . , Xt. Then there are indices 0 _< s - A, X 1 , . . . , X ~ are adjacent to B and hence to every vertex of I. Since A >-- X r + l , . . . , Xt, X r + l , . . . , Xt are not adjacent to B' and hence not to any vertex of I. Condition 4 has a similar proof.Assume that 1 fails and there is an edge connecting two vertices among X ~ + I , . . . , Xt. Then by 3 this edge forms a couple with any internal edge of S, contradicting the assumption that X 1 , . . . , Xt are threshold vertices of G. It remains to prove 2. We assert that I ~ Xj. Indeed, otherwise we would have

11.2


287

Xj ;,-- I, and since the vertices of I are adjacent to X~, Xj would be adjacent to X~, contradicting the definition of j. Similarly we see that Xi ~ I. Since Xi ~- I ;,- Xj and s is the largest index such that X~ ~ I, we have i _ r + 1 would contradict the definition of i. Thus it is always possible to choose r and s to satisfy 1 and 2, often in different ways. Now assume that this was done and S and T were joined by edges according to 3 and 4. We complete the proof by showing that (11.4) holds relative to G. First we show that if u _< v, then X~ ~ X~ relative to G. Indeed, let B be an internal vertex adjacent to X~. Then by 3 v _< r, hence u _< r and by 3 B is adjacent to X~. A similar use of 4 results in the same conclusion for external vertices. It remains to show that X~ ~- E ~- X~+I and X~ ~- I ~- X~+I. X~ ~ E: Let Z be any vertex adjacent to some vertex of E. Then Z is in I o r T. In the first case Z is adjacent to X~ by 3 and we are done. In the second case we have Z - Xq for some q and q _< s by 4. If j is defined, then by 2 Xq ~ Xj relative to T, hence q < j, and by the definition of j we have that Z coincides with or is adjacent to X~. If j is undefined, then X~ is adjacent to all other vertices in T, so again Z coincides with or is adjacent to XT. E ~- X~+I" Let Z be any vertex adjacent to X~+I. Since s < r by 2, and by 3 and 4, Z must be in T, and so Z - Xq for some q. Then i is defined and satisfies q < i, and by 2 i < s. Therefore q < s and so by 4 Z is adjacent to the vertices of E. Xs ~ I: Let Z be any vertex adjacent to some vertex of I. If Z is in S, then by 3 and 4 and the fact that s < r, Z is adjacent to X~ and we are done. If Z is in T,we have Z - Xq for some q and q < r by 3. If j is defined, we distinguish the cases s 7~ r and s - r. In the first case, since by 2 X~ ~- Xj relative to T and therefore s < j, Xs is adjacent to XT by the definition of j, and therefore X~ coincides with or is adjacent to Xq. In the second case X~ ~- Xj by 2 so that r < j, by the definition of j { X 1 , . . . , X~} is a clique, and in particular since q, s _< r we have that Xq coincides with or is adjacent to X~. Again, if j is undefined, then X~, and hence X~, is adjacent to all

288


other vertices in T, so t h a t Xq coincides with or is adjacent to X~. I ~- Xs+l" Let Z be any vertex adjacent to Xs+l. If Z is in S', then by 4 Z is internal, and therefore Z is adjacent to all other vertices of I and we are done. I f Z i s i n T , then Z - X q for s o m e q . Since b y 2 i_< s < s + l , and by the definition of i, X~+I and X~+I are not adjacent. This means t h a t either s + 1 - r + 1, in which case q _< r by 1, or else X~+I and XT+I are distinct non-adjacent vertices, in which case Xq ~- XT+I relative to T, hence again q _< r. Thus in both cases Xq is adjacent to the vertices of I by 3. .. We illustrate T h e o r e m 11.2.23 with the following example. Let T be the threshold graph illustrated in Figure 11.11, relative to which one has X1 ~- X2 "~ X3 ~- X4. Table 11.1 summarizes all the possibilities of m a t r o i d a l graph whose threshold vertices are X1, . . . . X4 and whose other vertices induce a cell. These graphs are illustrated in Figure 11.12 for the case t h a t the cell is a P4. Figure 11.11" A threshold graph.

X3

X1

X2

X4

Table 11.1" A summary of the possibilities for the graph of Figure 11.11. Case

r

i

j

s

(a) (b)

1 2

1 1

4 3

1 1

(c)

3

2

0

(d) (e)

3 4

2 1

1 0

Order of vertices X1 X1 I X1 I

~ ~ ~ ~~-

I I X1 I X1

~ ~ ~ ~~-

E X2 22 X2 X2

}.- 2 2 }.- f_, ,.-, 2 3 ,.~ X 3 ,~ X 3

~ ~ ~.~:,-

X3 X3 ~, E X4

~ ~~~~-

24 24 24 X4 E

Having e x a m i n e d the case where S' is a cell, we now generalize to the case t h a t S is a cell graph. The next theorem describes the situation in this case. Theorem

1 1 . 2 . 2 4 Let S be a cell graph with cells 1 , . . . , k satisfying Ik;, - . . . ~ - I 1 ~- E 1~- . . . ~ - E k

(11.5)

relative to S. Let T be a threshold graph disjoint from S with vertices X 1 , . . . , X t satisfying (11.3) relative to T. If S and T are joined by edges

tl.2

Matroidal Graphs

289

Figure 11.12: Matroidal graphs corresponding to Table 11.1.

X3

Xl

X2

X4

X3

X1

(~)

X3

X1

X2

X4

X2

X4

(b)

X2

X4

X3

X1

(c)

(d)

X3

X1

X2

X4

~v

v

v

v

(e)


290

so that the resulting graph G is matroidal and its threshold vertices are precisely X 1 , . . . , Xt, then for each cell l there exist indices rl, st satisfying 1 of Theorem 11.2.23 with respect to I t and E l, and furthermore Sk ~

"'"

< 81

~

rl ~

...

~ rk.

(11.6)

Conversely, r~ and st can always be chosen to satisfy 1 and 2 as well as (11.6), and if the connecting edges satisfy 3 and ~ with respect to I t and E ~, then the resulting graph is matroidal and its threshold vertices are precisely X1, . . . , Xt.

P r o o f . If G is matroidal with threshold vertices X 1 , . . . , X t , then the sets 11, E l , . . . , I k, E k, {X1 } , . . . , {Xt } are totally ordered by ~ relative to G in a way that is a common refinement of both (11.5) and (11.3). For each cell l, I l U E t U { X 1 , . . . , Xt } induces a matroidal graph whose threshold vertices are X 1 , . . . , Xt and whose other vertices induce a cell. By Theorem 11.2.23 there are indices rl and st satisfying 1-4 of that theorem with respect to I t and E I. Using (11.4) applied to rt, st, I t and E t and also (11.5), we obtain (11.6). Conversely, assume that S and T are given as above. Then by Theorem 11.2.23, for each cell 1 it is possible to choose indices rt, st satisfying 1-2 of that theorem. Furthermore, it is also possible to satisfy (11.6), for example by choosing rl . . . . . rk and Sl . . . . = sk. We assert that if the vertices of I t are joined to X 1 , . . . , Xr~ and the vertices of E l are joined to X 1 , . . . , X~ so as to satisfy 3-4 of the theorem, then the resulting graph G is matroidal and its threshold vertices are X 1 , . . . , X t . To show this, it is sufficient to demonstrate that (11.3) and (11.5) continue to hold relative to G and that (11.4) holds relative to G with rl, sl, I l and E t It is immediate to obtain (11.3) from 3 and 4 of Theorem 11.2.23. As for (11.5), 11 ~- E 1 follows from Theorem 11.2.23. To show that 12 ~ 11, assume that Xq is adjacent to the vertices of I ~. Then q _< rl by (3), hence q _< r2 by (11.6), and Xq is adjacent to the vertices of 12 by 3. Similarly all the relations of (11.5) can be proved. It remains to prove the relations Xr z > E t > X~z+1 and Xs~ ~- I l ~ Xs~+1. Let us prove X~.t ~ E l , the other relations being similar. Let Z be any vertex adjacent to the vertices of E z. If Z is in T, then Z coincides with or is adjacent to X~ z by Theorem 11.2.23. If Z is in S, then by definition of a cell graph and by (11.5), Z must be in 11 U ... U I k. By 3 Z is adjacent at least to X 1 , . . . , X m , where m - m i n ( r l , . . . , r k ) . But m - rt by (11.6), and therefore Z is adjacent to X~. 9

Corollary 11.2.25 Let G be a matroidal graph all couples of which are of the f o r m P4. Then G is a split graph.

2


291

roof. We use the results and notations of Theorem 11.2.24. Let h be the rgest index such that Sl _< h _< rl and { X ~ , . . . , Xh} is a clique. Then if < rl, Xh is not adjacent to Xh+x. Therefore in all cases { X h + l , . . . , X~ } is a able set. We assert that in fact { X I , . . . , Xh} is a clique and { X h + l , . . . , Xt } a stable set. Indeed, if sl < h, the first statement is obvious; if Sl - h, follows from the facts that X1 >- I x, . . . , X~ >- I ~ and that X~ ~ are adjacent to all the vertices of 11. If h < rl, the second statement obvious; if h - rl, it follows from the fact that X~+I and X~+2 are not tjacent. Since X 1 , . . . , X~ 1 are adjacent to the vertices of 11 U . . . U I k, u U 11 U ... U I k is a clique. Since X~ l + l , . . . , X t are not tjacent to any vertex of E 1 tJ . . . t.J E k, { X h + l , . . . , X t } U E 1 U " " [_J E k a stable set. .. We conclude the description of the matroidal graphs by answering the ~estion which vertices of T are adjacent to M. h e o r e m 11.2.26 Let S be a cell graph and T a threshold graph as in Theo:m 11.2.2~. Let M be a perfect matching inK2 or a complement of a perfect ~atching inK2 disjoint from S and T, with rn > 2. If the edges joining M S and T are such that the resulting graph G is matroidal with threshold ~rtices X1, . 9 Xt , then 1. the vertices of M are adjacent to all the internal vertices and to none of the external vertices of S; 2. if h is the largest index such that Xh is adjacent to the vertices of M (h - 0 if none), then Sl < h < rl, { X l , . . . , X h } U 11 U . . . U I k is a clique and { X h + l , . . . , X t } t2 E 1 tJ . . . U E k is a stable set. ~onversely, it is possible to choose an index h such that S 1 < h < r l , X I , . . . , X h } is a clique and { X h + l , . . . , X t } is a stable set. If the vertices f M are joined to the vertices of { X 1 , . . . , X h } U 11 U . . . U I k and to no ther vertices, then the resulting graph G is matroidal with threshold vertices ~'1,''', Xt.

'roof. Condition 1 has been proved in Subsection 11.2.3, together with the mt that the vertices of S partition into a clique whose vertices are adjacent ) M and a stable set whose vertices are not. If h - 0, then the vertices of { X 1 , . . . , X t } U .~1 I,_J . . . I,.J E k are not djacent to M, hence they form a stable set, Sl - 0, and 2 follows. The

292


case h - t is similarly easy, and therefore we now assume that 0 < h < t. Since (11.3) holds relative to G, the vertices of M are adjacent to X 1 , . . . ,Xh. It follows that { X 1 , . . . , X h } t0 I 1 tO ... tO I kis a cliqueand { X h + l , . . . , X t } tO E 1 U ... U E k is a stable set. Since M is comparable with every vertex of S and T relative to G, it satisfies in particular 11 P- M >- E 1 and Xh >- M >Xh+l relative to G. By transitivity of >- we have Xh >- E 1 and 11 >- Xh+l, from which it follows that Sl _< h _< rl and 2 is proved. Conversely, by Corollary 11.2.25 there is an index hi _< h _< rl such that { X 1 , . . . , X h } tO I1 U ... tO I k is a clique and {Xh+~,...,Xt} U E 1 U ..- tO E k is a stable set. Assume that M is joined to the vertices of { X 1 , . . . , X h } U I 1 U " " U I k and to no other vertices of S U T. In order to prove that the resulting graph G is matroidal with threshold vertices X 1 , . . . , Xt, it is sufficient to show that M is comparable with every vertex of S U T relative to G. We show below that each cell l satisfies I 1 >- M >- E t and t h a t X 1 > - M f o r q _ < h a n d M > - X q forq_>h+l. I t >- M: Let Z be any vertex adjacent to some vertex o f M . I f Z i s i n M or in S, then Z is adjacent to all vertices of I t. If Z is in T, say Z - Xq, then q h + 1" This is proved similarly. 9

11.2.5

Properties of Matroidal Graphs

The structure of matroidal graphs has been completely described in the previous three subsections. Each matroidal graph can be decomposed into a cell graph S, induced by the vertices that belong to P4's and described by Theorem 11.2.16, a perfect matching or a complement of a perfect matching M, induced by the vertices that belong to 2K2's or C4's, and a threshold graph T induced by the threshold vertices. The vertices of M are adjacent to all the internal and to none of the external vertices of S'. The edges joining T to S' are described by Theorems 11.2.23 and 11.2.24, and the edges joining T to M are described by Theorem 11.2.26. We can now determine the threshold dimension t(G) of a matroidal graph G, which is the original

11.2


293

reason of interest in these graphs. T h e o r e m 11.2.27 Let G be a matroidal graph and let E l , . . . ,Eq be the

equivalence classes of the relation R of page 271. Then

t(G) = max(IE, I,..., levi). P r o o f . Put t = m a x ( l E l l , . . . , IEql). If fewer than t subgraphs of G cover all its edges, then one of these subgraphs H has two edges from the same equivalence class Ei. These two edges form a couple in G and therefore in H too, and so H cannot be a threshold graph. This shows that t(G) >_ t. To prove the reverse inequality, we exhibit t threshold subgraphs of G covering its edges. We construct one of these subgraphs, called H, as follows. Since G is matroidal, it decomposes into S, M and T as above. Consider a cell of S. If it is a net of order p, J~f(AI,...,Ap, BI,...,Bp), then its p external edges AiBi form one equivalence class, and each internal edge BjBk forms an equivalence class by itself. There are p ways to select one edge from each of these equivalence classes, giving p threshold subgraphs of the net covering its edges. We select one of these p threshold subgraphs and put its edges in H. If the cell is a net-complement of order p, A/'(A1,..., dr, B 1 , . . . , Bp)), then each pair AjBk, AkBj, j ~ k, of external edges forms an equivalence class, and each internal edge BjBk forms an equivalence class by itself. The collection of the external edges AjBk with j < k and all the internal edges is a threshold subgraph of the net-complement containing one edge of each of these equivalence classes. This collection and a similar one obtained with j > k cover all the edges of the net-complement. We select one of these two and enter its edges in H. Now consider M, and assume it is a perfect matching inK2. Then each of its m edges forms an equivalence class by itself and is a threshold subgraph of M, and these m subgraphs obviously cover the edges of M. We select one of these edges and put it in H. If M is a complement inK2 of a perfect matching induced by the vertices A 1 , . . . , Am, A 1 , . . . , A" with all edges present except for AiA~, i - 1 , . . . , m, then each edge X Y forms an equivalence class with ZU, where Z is the unique vertex of M not adjacent to X and U is the unique vertex of M not adjacent to Y. The collection of edges AjA'k with j < k and all edges AjAk forms a threshold subgraph of M having one edge of each of these equivalence classes. This collection and its complement, which has a similar form, cover all the edges of M. We select one of these two collections and put its edges in H.

294


Each of the remaining edges of G (namely the edges of T and the edges joining S, M, T and the different cells of S) forms an equivalence class by itself. We put all of them in H. The H constructed in this way has exactly one edge of each equivalence class Ei, and it is clear that we can construct t H's of this form that will cover all the edges of G. Moreover, we assert that each H constructed in this way is a threshold subgraph of G, in other words, no two edges a and b of H form a couple in H. Indeed, if both a and b are in the same cell of S, or if both are in M, then the assertion follows by construction. Otherwise a and b do not form a couple in G, because G contains enough edges so that the endpoints of a and b induce a triangle in G, namely the edges of T and the edges joining S, M, T and the different cells of S. But these edges are in H as well, so the endpoints of a and b induce a triangle in H too, and so a and b do not form a couple in H. This proves that t(G) 0, then b/rn _ D(G[X]) and in particular

D(G) >_ max(D(G[C]), D(G[Q])). On the other hand, no antichain of G can meet both C and Q, since every vertex of C is comparable to every vertex of Q. Hence

D(G) < max(D(G[C]), D(G[Q])). If T is a spanning threshold subgraph of G and G[X] is an induced subgraph of G, then T[X] is a spanning threshold subgraph of G[X]. Therefore t(G) >_ t(G[X]), in particular t(G) >_ On the other hand, if Tc is a spanning threshold subgraph of G[C] and T o is a spanning

11.4

Matrogenic Sequences

303

threshold subgraph of G[Q], then the spanning subgraph of G consisting of the edges of Tc U TQ together with all the edges of G[K] and all the edges of G joining C to K is a threshold subgraph of G. This can be easily verified by noting that no alternating 4-cycle of this subgraph can meet both C and Q. If we apply this fact to a minimum collection of spanning threshold subgraphs of G[C] covering its edges and similarly for G[Q], we see that

t(G) 1 a n d d~ _> 1. F r o m u - 1 - A1 - d~ - dl - 1 we o b t a i n dl - u a n d t h e r e f o r e :r

~

du+ 1 -

.

.. = dn_ 1 - 0.

3. B y 1 above dm - rn, a n d t h e r e f o r e d~n_ 1 >_ rn. H e n c e by 2, r n - 1 _< u. B u t m c a n n o t be u + 1, since by 1 we have Am - - 1 5r u - 1 - A , + I . T h e r e f o r e m < u.

11.4


309

4. F r o m 3 we h a v e - 1 -

A.+I

-

d; - d,+l

-1

-

A,+2

-

d~,+l-du+2

-1

-

A~

-

d;_~-dn.

T h e n f r o m 2 we h a v e du+2 = ".. = dn - 1 a n d 1 _< d , + l - d; - u + 1. F r o m t h e l a t t e r e q u a t i o n we d e d u c e t h a t d; _> u a n d t h e r e f o r e d~ _> u. B u t f r o m 2 we h a v e u - dl >_ d,. T h e r e f o r e d. - u. 5. Since d~ - u, we have m >_ u by definition of m , b u t by 3 we h a v e m _< u. T h e r e f o r e m - u. H e n c e by definition of m d , + l < L,, f r o m w h i c h it follows t h a t d; _ u, a n d so d; - u. T h e first e q u a t i o n of 4 now i m p l i e s t h a t d,+l - 1. 6. T h u s d has r e m 5.4.4, it u v e r t i c e s of a s t a b l e set. u, G is a net

b e e n c o m p l e t e l y d e t e r m i n e d as d - (u ", 1~). B y T h e o follows now t h a t G is a split g r a p h . Since m - u, t h e first G f o r m a m a x i m a l clique, a n d so t h e last u v e r t i c e s f o r m Since t h e last u degrees are 1 a n d t h e first u degrees are of o r d e r u.

Condition a: We h a v e n - 2 - / ~ 1 - d~ - d 1 ~ (?% - 1) - 1. It follows t h a t d~ - n - 1 a n d dl - 1, a n d t h e r e f o r e d - (1 ~) a n d G is a p e r f e c t m a t c h i n g . Condition 5" W e h a v e 2 AI - d ~ - dl. If d 1 - 3 a n d dl - 1, t h e n d - (14, 0), w h i c h gives t h e w r o n g A. T h e only o t h e r p o s s i b i l i t y is d~ - 4, dl - 2. F u r t h e r m o r e , 2 - /k2 d ~ - d2. If d~ - 3 a n d d2 - 1, t h e n we would h a v e t h e a b s u r d d3 > d2. T h e only o t h e r p o s s i b i l i t y is d~ - 4, d2 - 2. T h i s d e t e r m i n e s d as d - (2s), a n d t h e r e f o r e G is a p e n t a g o n . 9 1 1 . 4 . 6 Let G ( K , S ) be a split graph with a clique K , a stable set S, and a proper degree sequence da. Let H be a graph with a proper degree sequence dH. Then the A sequence of G (9 H is given by

Lemma

A - (AK, AH, As), wh e re

A.As

-

k-tKI (/Nk+l(dG),.

. ., /Xk+s(dG)),

-

I 1"


310

P r o o f . Put d = dH, e = da, f = da| diagram of e is given by

Since G is split, the corrected Ferrers

-IkQ 0s) where Jk is the all-1 matrix and Ik is the identity matrix of order k, O~ is the all-O matrix of order s, P is k x s, and Q is s x k. On the other hand, the corrected Ferrers diagram of f is given by

C(f) =

(

Jk--Ik

Jkx~

P

)

C(d)

Q

0~•

,

0~

where J, I and 0 are all-l, identity and all-O matrices of the indicated orders, respectively, and n = Iv(g)l. From the structure of C ( f ) , it is apparent that !

!

9 for i - 1 , . . . , k, f[ - fi - ei + n - ei - n - e i - ei; 9 fori-k+l

,..., k+n,

f" . fi . n + d.~ _ k -.n

9 fori-k+n+l,...,k+n+s,f[-fi-ei_

di-k

' k - di-k ; di_

-ei-~.

If d is a proper sequence with boxes B I , . . . , Br and a A sequence A = A(d), we denote by ABk the subsequence of A corresponding to the box Bk. We call a p r i m i t i v e sequence of length p (not necessarily a prime) a sequence of length p of the following types: (p-1,-1,...,-1),

(1,...,1, I-p),

(0,...,0).

We are now ready for the characterization of matrogenic split sequences. T h e o r e m 11.4.7 A proper sequence d is the degree sequence of a m a t r o g e n i c split graph if and only if all its ABk are p r i m i t i v e sequences and 1. the n u m b e r q of non-zero p r i m i t i v e sequences is even; 2. f o r k = 1 , . . . , q/2, the k-th and ( q - k ) - t h are equal.

non-zero p r i m i t i v e sequences

11.4


311

P r o o f . " I f " : Let G be a realization of d = (dl,. 9 dn) with boxes B 1 , . 9 9 , B r and A = Aa. The graph G is BT by Theorem 11.4.4, since the sum of the elements of every primitive sequence is 0. We use induction on r, the number of boxes of G. If r = 1, then A = 0 by Condition 1. Hence G is threshold and thus matrogenic split. For the induction step, we distinguish three cases: C a s e 1: G has dominating vertices (dl = n and S = 0. C a s e 2: G has isolated vertices (d~ = 0).

1). In this case, let K = B1

In this case, let K = O and

S=B~. C a s e 3: G has no dominating and no isolated vertices. K = B1 and S = B~.

In this case, let

In Case 3, B~ must be stable, otherwise by the BT property the vertices of B1 would be dominating in G. Similarly, each vertex of B~ has some neighbors in B1, otherwise the vertices of B~ would be isolated in G. Therefore B1 is a clique. Moreover, each vertex of B2 U ... U B~-I is adjacent to each vertex of B1 and to no vertex of B~. In all cases, the subgraph induced by K U S is a split graph G'(K, S), and if H denotes the subgraph induced by v(a)-v(a'), then G = G'(I'(,S)QH. Hence A = (AK, AH,/kS)

by Lemma 11.4.6. In Case 1, AK = 0 since the vertices of K are dominating, and As is empty, hence AH satisfies the conditions of the theorem. Similarly in Case 2, AK is empty and As = 0, and AH satisfies the conditions of the theorem. In Case 3, AK and As are not equal to the zero sequence as B1 and B~ are neither completely connected nor completely disconnected. Hence by Condition 2, Air = As, and again AH satisfies the conditions of the theorem. Since in Case 3 AK and As are equal primitive sequences, G' is a net or a net-complement (or a graph on two vertices) by Lemma 11.4.5. In all cases, H is a split matrogenic graph by the induction hypothesis, and hence G is split matrogenic by Theorem 11.4.3. " O n l y if"" Let G be matrogenic split with boxes B , , . . . , B~. We use induction on r. If r = 1, G must be complete or edgeless, hence A is the zero sequence. For the induction step, we have by Theorem 11.4.3 that G = G' @ H, where H is again a matrogenic split graph and G'(K, S) is a

312


split graph which is (a) a complete graph or ( b ) a n edgeless graph or (c) a net or (d) a net-complement. By Lemma 11.4.6, A = (A/,-, AH, As), where in Case (a) As is empty and A/s- = 0, and in Case (b) A/,- is empty and As = 0. By Lemma 11.4.5, both A/s- and As are primitive sequences and if both are non-zero, they are equal. Moreover, in Case (a) in Case (b) B1 U B~, in Case (c)or (d).

B1, v(a')

-

B,,

The result then follows by induction.

Chapter 12 Domishold Graphs 12.1

Introduction

Recall from Definition 9.1.3 that a dominating set of a graph G = (V, E) is a subset S of V such that every vertex not in S has a neighbor in S, and that the domination number 7(G) of G is the size of a smallest dominating set of G. We have seen in that section that the domination number is bounded above by the Dilworth number. Also recall that the threshold graphs are defined as the graphs for which the characteristic vectors of the stable sets are separated from the characteristic vectors of the nonstable sets by a hyperplane. In a similar spirit Benzaken and Hammer [BH78] introduced the domishold graphs as the graphs for which the characteristic vectors of the dominating sets are separated from the characteristic vectors of the nondominating sets by a hyperplane. They characterized the domishold graphs and showed that the threshold graphs form a proper subclass of domishold graphs. They also introduced the concept of domistable graphs. Later, Payan [PayS0] introduced a class of equidominating graphs, and Cherynak and Chernyak [ccg0], introduced a class of graphs called pseudodomishold graphs. We study these results in this chapter. In addition, some enumeration problems and other characterizations of domishold graphs are discussed in Chapter 13.

12.2

Notation and Main Results

We begin with the definitions of domishold graphs and domistable graphs. 313

314

Domishold Graphs

1. A graph G - (V, E) is called a d o m i s h o l d g r a p h if there exist a positive real-valued function w on V and a real 0 such that for all S C V,

D e f i n i t i o n 12.2.1

S is dominating if and only if

>_ o. xES

2. A d o m i s t a b l e g r a p h is a graph all of whose minimal dominating sets

are stable sets. 3. A graph G - (V, E) is called e q u i d o m i n a t i n g if there exist a positive integer-valued function w on V and a positive integer 0 such that for all Q c_ V, Q is a minimal dominating set if

only if

w(x) -- 0 xEQ

Clearly, in the definition of domishold graphs one can assume without loss of generality that w and 0 are positive integers. We refer to w(x) as the weight of x and to 0 as the threshold value. The domishold graphs and the domistable graphs were defined in [BH78] and the equidominating graphs were defined in [PayS0]. The results and concepts in this section are from [BH78]. As usual, N(x) denotes the neighborhood of vertex x and we put M(x) = V - ( N ( x ) U {x}). a stable dominating pair is a set of two nonadjacent vertices each of which is adjacent to every other vertex. Let Sk denote an edgeless graph on k vertices, Kk denote a complete graph on k vertices, and J2k denote the complement of a perfect matching on 2k vertices (kK2). The graph J2k is called a cocktail-party graph. We write Q d o m G to indicates that Q is a dominating set of G. Let us now define a binary relation ~d on the vertex set V of G as follows" D e f i n i t i o n 12.2.2 For x, y E V, x Ld Y if and only if for all Q c V - { x , y}, (Q u {y}) dom G ~ (Q u {x}) dom G. L e m m a 12.2.3 The relation ~d is reflexive and transitive, i.e., it is a preorder.

P r o o f . The reflexivity is obvious. To prove the transitivity, we assume that i, j, k are distinct, i ~e j and j ~d k, and show that i Ld k. Let Q c_ V - { i , k}

12.2

315

Notations and Main Results

be such that ( Q U { k } ) d o m G . I f j ~ Q, t h e n Q U { j } ) d o m G a n d h e n c e (Q u {i}) dom G. If j C Q, put Q' = Q - j. Then Q u {k} = ( ( Q ' u {k}) u {j }) dom G, and hence ((Q'u { k}) U {i}) dom G, and the latter set contains k but not j. So ((Q'U {j}) u {i}) = Q u {i} dom G. Hence i ~d k. 9 In view of Lemma 12.2.3, the relation ~d is called the dominal preorder of G. We now state the main result of this section. T h e o r e m 12.2.4 For every graph G equivalent:

(V, E), the following conditions are

1. G is domishold; 2. the dominal preorder ~d is linear; 3. G can be built from the empty graph by repeated application of the operation G' ~ G", where G" = (G' U S;) | Kq | J2~ and p + q + r > O; in other words, G can be obtained from an empty graph by repeatedly adding an isolated vertex, a dominating vertex, or a stable dominating pair; ~,. V can be partitioned into sets 1/1, 1/2 and 1/3 inducing the graphs SIVll, KIV21, Jig31 respectively, with the following properties: (a) each vertex in 1/2 is adjacent to each vertex in 173; (b) for each i E Vx, g ( i ) N V3 induces a cocktail-party subgraph of G; (c) the neighborhoods of the vertices of V1 are nested; 5. G can be obtained from some threshold graph with split partition (K, S) by substituting cocktail-party graphs for some vertices of K; 6. the graphs of Figure 12.1 are not induced subgraphs of G. Proof. 1) =v 2): It is enough to show that for every two vertices x, y with w(x) > w(y), we have x ~d Y. This is indeed the c~se, since for every

QcV-{x,y}, Q u {y} dom G ==~w(Q) -4- w(y) >_ 0

w(Q) + w(x) > 0 :=~ Q u {x} dom G, where w(Q) - EzeQ w(z). 2) =V 3)" First we show that if x is any maximal vertex in the linear order

Domishold Graphs

316

Figure 12.1" The forbidden induced subgraphs of domishold graphs.

H1

Ha

H2

H4

Hs

~d, Q is any maximal stable set, and x ~ Q, then x is adjacent to every vertex in Q. Since Q is a maximal stable set, Q dom G. Now for every y C Q, Q = ((Q - y ) u {y}) dom G and hence ((Q - y ) u {x}) dom G, since x ~d Y. But the only neighbor of y in ( Q - y ) u {x} is x. Thus x is adjacent to y, as required. We now show that G contains an isolated vertex, a dominating vertex, or a stable dominating pair. The result then follows by induction since removing isolated vertices or a dominating set preserves Condition 2, as is easy to verify. Assume that G contains no isolated vertex or dominating vertex. Let x be a maximal vertex in ~d and y C M(x) (y exists since x is not dominating). We show that X(y) = N(x). Suppose that, if possible, zx C E and zy q~ E. Extend {y, z} to a maximal stable set Q, which will not contain x. Then, as shown above, x is adjacent to every vertex in Q, contradicting xy ~ E and y C Q. This shows that N(x) C_ N(y). It follows that y is also a maximal vertex in Le and hence, by symmetry, N(y) C_ N(x). It follows that every vertex in M(x) has the same set of neighbors as x. Thus if z C N(x) (z exists since x is not isolated), then {z, y} dom G and hence {x, y} dom G as x is a maximal vertex. Thus G has a stable dominating pair {x, y}, as required. 3) =~ 4): Let Go,..., Gt be a sequence of graphs with Go = 0, Gt = G and

G~+I - (G~ u Sp,) | Kq, @ &~,,

i-O,...,t-

i.

12.2

Notations

and

Main

317

Results

Putting i

i

i

and using an easy induction, we see that this partition has the desired property. 4) =~ 5)" Identify each vertex of V3 with its unique nonneighbor in V3 to obtain a clique V~. The resulting graph is clearly a threshold graph with split partition (1/2 U V3', 1/1), 5) =~ 6)" The graphs of Figure 12.1 are not threshold graphs, nor do they contain a stable dominating pair. Hence they cannot be obtained from a threshold graph by substituting cocktail-party graphs for some vertices in the clique. The result now follows since if G satisfies Condition 5, so does every induced subgraph of G. 6) => 3)" It is enough to show that G contains an isolated vertex, a dominating vertex, or a stable dominating pair. If G is not connected, then G contains isolated vertex since H1 is forbidden. So assume that G is connected. Then by a well-known result about P4-free graphs (Lemma 14.3.6), G is not connected since H2 is forbidden. If a connected component of G has one vertex, then it is a dominating vertex in G, and if it has two vertices, then they form a stable dominating pair in G. Otherwise, G contains two components, each with at least three vertices. Let xl, x2, x3 be vertices of a component such that x2 is adjacent to the other two. Similarly, let yl, y2, y3 be vertices of another component with y2 adjacent to the other two. Then, depending on the adjacency between xl and x3 and between yl and y3, G contains Ha,/-/4 or Hs as an induced subgraph, a contradiction. 3) =~ 1)" The empty graph is obviously domishold. Assuming that G' = (G u Sp) | Kq @ J2r and that G is domishold, we show that G' is domishold too. Let wl represent the weight of each 1 E V(G), and 0 the threshold for G. Let w* - mint wt/2. We may assume that 2w* < 0, since otherwise G is a clique, and we can change all weights wt for 1 C V(G) as well as 0 into 1 to achieve2w* < 0. Let us also put W - 1 + ~ t e y ( a ) wl and let us define 0 ~ 0 + p W and

i , wi -

W 0 + pW O+pW-w*

v(G),

iESp, i E Kq, i G J2~.

318

Domishold Graphs

The weights w~ and the threshold O' characterize the dominating sets of G'. Indeed, every minimal dominating set D' of G' is of one of the following three forms: 1. D ' = {k}, k e Kq; 2. D' = { x , y } , x C J2r, x r y, y C J2,. U Sp t2 V(G);

3. D' = D tO Sp, where D is a minimal dominating set of G. It is straightforward to verify that these are also precisely the minimal sets Q' of G' satisfying w'(Q') > 0'. tt Since each of Ha,//4, Hs of Figure 12.1 contains an induced 6'4, it follows that all threshold graphs are domishold graphs. The next theorem gives other characterizations of threshold graphs among the domishold graphs. T h e o r e m 12.2.5 For a domishold graph G, the following conditions are equivalent: 1. G is a threshold graph; 2. G is a C4-free graph; 3. G is a split graph; .~. G is an interval graph; 5. G is a domistable graph.

P r o o f . 1) =~ 2,3,4,5): This follows from Theorem 1.2.4 characterizing threshold graphs, Theorem 2.1.6 characterizing interval graphs, and the fact that a minimal dominating set of a threshold graph considered as a split graph G(K, S) is S itself or consists of one vertex from K and all its nonneighbors in S. 3) =~ 2): This follows from Theorem 5.2.1. 4) =~ 2): This follows from Theorem 2.1.6. 5) =~ 2): Since G is a domishold and domistable graph, it must contain a dominating vertex or an isolated vertex (otherwise, as in the proof of 3) =, 4) in Theorem 12.2.4, if x is any maximal vertex with respect to the dominal preorder, y C M ( x ) and z C N(x), then {y, z} is a minimal dominating set that is not stable). Delete any dominating or isolated vertex from G, and the

12.2

Notations and Main Results

319

resulting graph is clearly again a domistable and domishold graph. Hence this process can be repeated until all the vertices are removed. Since no vertex of an induced C4 can become isolated or dominating, it follows that G is C4-free. 2) => 1): This follows from Condition 6 of Theorem 12.2.4 and Condition 2 of Theorem 1.2.4. 9 A linear inequality n

E

WiXi ~__ 0

i=1

with Wl >_ " " >_ Wn > 0 is called domigraphic if W l , . . . , w ~ are weights and 0 is a threshold of a domishold graph. The condition n

y~'~wi >__O i=1

is obviously necessary for the inequality to be domigraphic since V(G) is a dominating set. In the case n - 1, it is sufficient too. For n - 2, this condition and w2 < 0 => wl < 0 are again sufficient. The following theorem gives a recursive characterization of domigraphic sequences for n >_ 3. Theorem

12.2.6 A linear inequality n

E wixi >_ 0 i=1

with W 1 ~ " ' " ~___W n > 0 and n >_ 3 is domigraphic if and only if one of the following conditions holds: 1.

W 1 ~

0 and ~i~2 wixi >_ 0 is domigraphic;

2. w~ < O, E ~ 2 wi < 0 and ~i~2 wixi >_ 0 - Wl i8 domigraphic; 3. Wl < O, W 1 -~- W2 ~__ 0 and ~i~3 wixi >_ 0 is domigraphic. P r o o f . Let the inequality ~-~iL1W i X i ~-- 0 be domigraphic and let G be the associated graph. Let x be a maximal vertex of the dominal preorder. Without loss of generality, we may assume that w(x) - w I (&s in the proof of 1) => 2) of Theorem 12.2.4). If x is dominating, then Condition 1 is satisfied. If x is isolated, then Condition 2 is satisfied. If G has no isolated or dominating vertex, then there exists another maximal vertex y such that {x, y} is a stable

Domishold Graphs

320

dominating pair (as in the proof of 2) ~ 3) of Theorem 12.2.4). Without loss of generality, we may assume that w(y) = w2. Then Condition 3 is satisfied. Conversely, suppose one of the Condition 1 - Condition 3 is satisfied. Let G' be the domishold graph associated with the domigraphic inequality in each case. Then, clearly, the linear inequality of the theorem corresponds to a domishold graph obtained from G' be adding an isolated vertex (Condition 2), or a dominating vertex (Condition 1), or a dominating stable pair (Condition 3). "

12.3

Equidominating Graphs

Payan [PayS0] defined equidominating graphs as follows: D e f i n i t i o n 12.3.1 A graph G = (V, E) is said to be e q u i d o m i n a t i n g when there exist a weight function w: V ~ Z + U {0} and a number t E Z + U {0} such that for every S C_ V, S is a minimal dominating set ~

~

w(x)-

t.

xES

The class of equidominating graphs neither includes nor is included in the class of domishold graphs. As evidence, observe that 2K2 is an equidominating graph but not a domishold graphs. To see that it is equidominating assign a weight of 1 to each vertex of one edge and a weight of 2 to each vertex of the other edge, and let t = 3. From Theorem 12.2.4, 2K2 is forbidden for domishold graphs. On the other hand, K2,z is a domishold graph (Theorem 12.2.4), but not an equidominating 'graph. To see this, let (A, B) be the bipartition of K2,3 into stable sets with ]A I = 2. Then A and any pair {x, y} with x C A and y C B are minimal dominating sets, and hence every vertex should receive the weight t/2. But then B is a minimal dominating set with total weight exceeding t. The following theorem characterizes the equidominating graphs that are also domishold graphs. T h e o r e m 12.3.2 ([Pay80]) The following conditions are equivalent for every graph G: 1. G is both a domishold graph and a equidominating graph;

12.3

Equidominating Graphs

321

2. G has weights and threshold that characterize simultaneously the dominating sets by inequality and the minimal dominating sets by equality; 3. G can be built from the empty graph or from a cocktail-party graph (the complement of a perfect matching) by repeatedly adding an isolated vertex or a dominating vertex; ~. G has no induced graph isomorphic to H i , . . . , H4 of Figure 12.2.

Figure 12.2: The forbidden induced subgraphs for domishold and equidominating graphs.

H1

H2

H3

H4

P r o o f . 2) =~ 1)" This is obvious. 1) =~ 3)" We proceed by induction on IV(G)]. Let G - (V, E) be a domishold graph with weight function w~ and threshold t~, as well as an equidominating graph with weight function w~ and threshold t~. By Theorem 12.2.4, G has an isolated vertex, a dominating vertex, or a stable dominating pair. C a s e 1" G has an isolated vertex x 9 For i - 1, 2, let WG_ ~ x be the restriction of w~ to G - x, and let t ia_~ - t ~ - wb(x). It is easy to check that G - x is domishold and equidominating with these modified weights and thresholds. Hence by induction G - x satisfies Condition 3, and so G does too. C a s e 2: G has a dominating vertex x. The argument is similar to Case 1 with t~_~ - t~. C a s e 3" G does not have an isolated or a dominating vertex, but has a stable dominating pair x, y. Observe that a set S C V(G) is a minimal dominating set of G if and only if S is a minimal dominating set of G - {x,y}, or S - {x,y}, or for some v e V ( G ) - {x,y}, S - {x,v} or 5' - {y,v}. Since we m a y assume that IV(G)I >_ 3, it follows that w~ - t2c/2. Hence S is a minimal dominating set of G if and only if ISI - 2. We assert that G is a cocktail-party graph. Since every two vertices of G form a dominating set, G does not contain any induced K1 tO K2 or an i n d u c e d / 3 . Hence G does not contain any isolated vertices or induced K1,2 or induced K3. Therefore every 9

322

D o m i s h o l d Graphs !

q

connected component of G has exactly two vertices, and hence G forms a perfect matching, and G satisfies Condition 3. 3) =~ 2)" The empty graph trivially satisfies Condition 2 with null w a and ta - O. The cocktail-party graph satisfies Condition 2 with wa - 1 and to - 2. Assume now that x is an isolated or dominating vertex of G such that G - x satisfies Condition 2 with weights wa-~ and threshold t a - ~ . Then we obtain wa and ta as follows: 9 If x is an isolated vertex, then S is a minimal dominating set of G if and only if x 6 S and S - {x} is minimal dominating set of G - x. We then set for all v 6 V -

wa(v)

--

wa_~(v)

wa(x)

=

y~ wa_~(v) + 1 - ta_~ v~V-{~}

ta

=

E w a - x ( v ) + 1. v~V-{~}

{x}

9 If x is a dominating vertex, then S is a minimal dominating set of G if and only if S - {x} or S is a minimal dominating set of G - x. Then we set for a l l v C V - { x }

-

a(x)

-

ta_

ta

--

tG-x.

In each case it can be verified that G is both a domishold graph and a equidominating graph with weights wa and threshold ta, and hence condition 2 is satisfied. 3) =~ 4)" This follows from the fact that the graphs H 1 , . . . , / / 4 neither are cocktail-party graphs nor do they contain isolated or dominating vertices. 4) =~ 3)" Assume that G contains no induced subgraph isomorphic to the graphs of Figure 12.2, and assume further that G contains no isolated or dominating vertices. We complete the proof by showing that G must be a cocktail-party graph. G is connected, for otherwise G contains HI. Hence by a well-known result about P4-free graphs (Lemma 14.3.6), G is not connected since G is H2-free. Each connected component of G has at least two vertices since G has no dominating vertices. If each component of G has exactly two

12.4

P s e u d o d o m i s h o l d Graphs

323

m

vertices, then G is a cocktail-party graph, as required. Otherwise, G has a component with at least two vertices Xl,X2 and another with at least three vertices yl, y2, y3 such that Y2 is adjacent to both yl and y3. But then, depending on whether or not 91 and y3 are adjacent, these five vertices induce Ha or//4 in G, a contradiction. .. The following corollary follows from Condition 3 or 4 of Theorem 12.3.2.

Corollary 12.3.3 The threshold graphs are equidominating (and domishold).

12.4

P s e u d o d o m i s h o l d Graphs

Chernyak and Chernyak [CC90] call a graph G - (V, E) a pseudodomishold graph if there is a function w" V , R + U {0}, w ~ 0, and a nonnegative real t such that for every S c_ V, SdomG

?, ~ w( x ) >_t xES

--(SdomG)

:, ~ w ( x ) _ O,

P4}.

Let tn denote the number of non-isomorphic n-vertex graphs in M without isolated vertices, dominating vertices, or stable dominating pairs (tn -- 0 for n < 4), and put tnX n n>l

For n > 6, it is possible to evaluate tn from the definition of M by partitioning the vertex set of the complementary graph into cycles and paths of length 3 or more. Let Un denote the number of non-isomorphic n-vertex HP graphs, and put u(x) - ~ unx n. n>l

The following result expresses u ( x ) i n terms of t(x). C o r o l l a r y 12.4.2 t ( ~ ) + x - ~3 u(x)

-

1-

2x-

x 2 + x 3-

x 4"

L2.4

325

Pseudodomishold Graphs

Figure 12.3" Forbidden configurations and induced subgraphs for HP graphs.

(a) Forbidden configurations. Dashed lines indicate nonedges and solid lines indicate edges. Other edges may or may not exist.

w

v

"i__ '. i I I I w

w

w

v

w

(b) Forbidden induced subgraphs

w

w

Domishold Graphs

326 An estimate of un is given by the following result.

Corollary 12.4.3

For

n > 6, (2.32) n-1 < un < (2.37) n-1.

We can compare the above estimates with an estimate of the number d~ of non-isomorphic n-vertex domishold graphs. Put

d ( x ) - E dnxn" n>l Then we have the following result.

Corollary 12.4.4 XmX 3 d(x) Further,

dn ~" c r n, w h e r e r ..~

-

1 - 2 x - x 2 + x 3"

2.25

is t h e m a x i m u m

root (in absolute

value)

of the equation x 3 - 2x 2 - x + 1 -

O.

We see that un is larger than dn, consistent with the fact that all domishold graphs are HP graphs, but not conversely.

Chapter 13 The Decomposition Method 13.1

Introduction

In Definition 11.4.2 we saw how to compose a split graph with an arbitrary graph, and in this chapter we investigate the opposite operation of decomposition. This subject was studied by Tyshkevich [Tys84, TysS0], Chernyak and Chernyak [CC90, COS1]and Tyshkevich and Chernyak [TC85b, TC85a]. It turns out that every graph has a unique decomposition into indecomposable components. Tyshkevich and Chernyak [TC85a] also introduced a generalization from the class of split graphs to the larger class of polar graphs that we saw in Section 7.6, and obtained a hierarchy of decompositions and a unique decomposition at each level of the hierarchy. The decomposition method proves to be a unifying approach to the study of many graph problems, since many properties of graphs hold for a graph if and only if they hold for each indecomposable component, perhaps with a small change which is easy to pinpoint. It simplifies the description of the structure of several graph families, leads to their enumeration, and serves as a tool to explore them. Section 13.2 presents the decomposition, the hierarchy, and the unique decomposition result. Section 13.3 applies this theory to threshold and domishold graphs, Section 13.4 applies it to box-threshold graphs and enumerates them, and Section 13.5 does the same for matroidal and matrogenic graphs. 327

The Decomposition Method

328

13.2

The Canonical D e c o m p o s i t i o n

In Section 7.6 we defined the class of polar graphs and its subclasses (c~,/3), and discussed the complexity of their recognition problems. In this section we present a theory of decomposition of polar graphs based on Tyshkevich and Chernyak [TC85b]. To review the definition, a graph G = (V, E) is said to be polar if V can be partitioned into subsets A and B, possibly empty, such that each of the graphs G[A] and G[B] is a union of cliques. This partition is called a polar partition. If in addition every connected component of G[A] (resp. G[B])is (a clique)of size at most c~ (resp./3), we say that G belongs to the class (c~,/3) of polar graphs. The polar partition may not be unique: for example, if G is the path P3 with consecutive vertices a, b, c, then we may take A = {b}, B = {a,c} or A = {a,b}, B = {c}, putting G in (1, 1)in both cases; if G is the cycle C4, we may take A to consist of three vertices, putting G in (2, 1), or we may take A to consist of two adjacent vertices, putting G in (1, 2), or we may take A to consist of two non-adjacent vertices, putting G in (2, 1) again. Other examples are G = C6 with A consisting of three mutually adjacent vertices and G E (1, 3), and G bipartite, with G C ( ~ , 1). The class (1, 1) is precisely the class of split graphs. We indicate a polar graph with its polar partition as G(A, B), and use the (c~,/3) notation for it as well. We say that polar graphs G(A,B) and G'(A', B') are polar isomorphic if there is an isomorphism of G to G' mapping A onto A' (and hence B onto B'). It can be checked that the graph in Figure 13.1 is not polar. Figure 13.1" A non-polar graph.

w

v

w

Clearly if G(A,B) r (o~,/3), then G(B, A) C (~, c~). Another easy result is the following one, establishing a hierarchy among the polar graphs. P r o p o s i t i o n 13.2.1 If a < (~' and/3 < /3', then (a,/3) C_ (a',/3'), and the

inclusion is proper if at least one of the inequalities is strict.

13.2

The Canonical Decomposition

329

P r o o f . The inclusion is trivial. We show that the graph

C = (/3' + I)K~, U (c~' + I )K~,, which clearly belongs to (c~',/3'), cannot belong to (c~',/~'- 1) if /3' > 1. Indeed, if G(A, B) E (c~',/~'- 1), then A must contain at least one vertex of each of the c~' + 1 copies of KZ,. Consequently, G[A] contains a stable set of size c~' + 1, which is not the case by G(A, B) E (c~',/3'- 1). Similarly if c~' > 1, G cannot belong to (c~'- 1, fl). 9 We now define the operation of graph composition, which specializes to the operation of Definition 11.4.2 for the class (1, 1). D e f i n i t i o n 13.2.2 The classes of polar and ordinary graphs are denoted by

7) and G, respectively. We define the c o m p o s i t i o n operator

O:Pxg~g as follows. If G(A,B) C 79 and H C ~ are vertex-disjoint, then G(A, B ) 0 H is obtained from G U H by adding every edge between A and H. The graphs G and H are called the c o m p o n e n t s of the composition. Note that if in the above definition H is polar and H(C, D) E T', then the composition Q = G O H is also polar Q(A U C, B U D). Thus O is a binary operation on the class of polar graphs. In general, composition is associative when all components except possibly the rightmost are polar. Therefore the algebra (T ~, O ) i s a semigroup, and each class (c~,/3) is a subsemigroup. D e f i n i t i o n 13.2.3 A graph G that can (cannot) be written in the form F(A, B ) 0 H is called d e c o m p o s a b l e ( i n d e c o m p o s a b l e ) . If in addition

we require that F(A, B) E (c~,/3), G is said to be decomposable 5ndecomposable) at t h e level (c~,/3). If in addition G is polar, G(C,D) E (c~,/3) and H E (c~,/3), then G(C,D) issaid to be decomposable (indecomposable) in t h e class (c~,/3). It can be verified that Cs is an indecomposable polar graph and the graph of Figure 13.1 is an indecomposable non-polar graph. To state the decomposition result below, we define an equivalence relation on the set of polar graphs as follows: 1. each polar graph G(A,B) is equivalent to itself (but when A and B are non-empty, not to any other polar graph even if they are polar isomorphic);

330


2. every two graphs of the form G(O, B) are equivalent, and every two graphs of the form G(A, rg) are equivalent. It is easy to see that the composition Q is commutative on every class of equivalent graphs. T h e o r e m 13.2.4 1. every graph from the class (a, fl) can be represented as a composition of components (necessarily from (a, fl)) that are indecomposable in the class (a, fl). The decomposition is unique up to permutations of consecutive equivalent components.

2. every graph not from the class (a, fl) and decomposable at the level (a, fl) decomposes uniquely as H(C, D) @ F, where H(C, D) C (a, fl), F ~ (a, fl) and F is indecomposable at the level (a, fl). Before proving the theorem, we use it to make the following definition. D e f i n i t i o n 13.2.5 Let G(A,B) be decomposable in the class (a, fl). Consider its decomposition into indecomposables in this class. If in this composition we replace each maximal set of consecutive equivalent components by their composition, we obtain a representation of G in the form

G(A, B)

-

Xl

(~ . . .

(~ X n ,

X k --

Gk(Ak, Bk) e (a, fl),

(13.1)

where Xk and X k + l a r e not equivalent and each Xk satisfies one of the following three conditions: 1. Ak - O C Bk; 2. Ak r O - Bk; 3. Ak, Bk r 0 and Xk is indecomposable in the class (a, fl). The decomposition (13.1) is referred to as t h e c a n o n i c a l d e c o m p o s i t i o n of G ( A , B ) in t h e class (a, fl). Likewise, if F does not belong to the class (a, fl), its c a n o n i c a l d e c o m p o s i t i o n at t h e level (c~,/3) has the form F

= X 1 (~ . . .

(~ X n

(~

H,

where the Xk are as above, H ~ (a, fl) and H is indecomposable at the level

13.2

The Canonical Decomposition

331

13.2.4. The existence of the decompositions in question follows from the finiteness of the graphs, and what we need to prove is their uniqueness. P A R T 1" Let G(A, B) C (o~,/~). For each a C A, we let d(a) denote IN(a)N BI, and for each X C_ A, d(X) - ~-~aEXd(a). Similarly for each b E B, d(b) denotes IN(b)N AI, and for each Y C_ B, d(Y) - ~be~" d(b). Since the induced subgraph G[B] is a union of cliques of cardinalities at most /3, there exists a unique partition B - B1 tO -.. tO BZ, where G[Bj] ji(j and the nj are non-negative integers. Thus Bj uniquely partitions into cliques of cardinality j" Bj - Bj~ U ... U Byn,. Similarly A - A 1 U . . . U Ac~, where G[Ai] - m i K i and the m i are nonnegative integers, and Ai uniquely partitions into stable sets of cardinality i" Ai -- A l l I..J . . . I..J Aim,. Observe that since the Air are stable sets, for each canonical decomposition (13.1), each All is contained in one of the Bk, and similarly each Bjt is contained in one of the Bk. We represent G(A, B) in the form Proof of Theorem

/..

G(A,B) - X1 6) H(C,D),

Xa - GI(A1,B~),

where X1 is the first term of a canonical decomposition (13.1). We want to show that X1 is uniquely determined by G(A, B), from which it follows that H(C,D) is also determined, and the result follows by induction. We distinguish three exclusive cases. Case 1" A - / ~ and some Bjk satisfies d(Bjk) - O. Denote the union of all such Bjk by N. We must have A1 - 0, for otherwise (0, Bjk) is a further component of X1. Similarly B1 - N, and so G1 - G[N] and X1 is uniquely determined by G(A, B). Case 2" B r e and some Bik satisfies d(Aik) - i . IBI. Denote the union of all such Aik by N. In analogy to Case 1 we must have A1 - N, B 1 - O , G1 - G I N ] a n d X1 is uniquely determined. C a s e 3: The situations under the previous cases do not occur. We may also assume that A, B r O, for otherwise the theorem is trivial, and therefore we have that A1, B1 -r O and X1 is indecomposable. We note that /..

t..

/..

^ ^ d(Bjk) { < j" JAil, if Bjk C Be, j I./~ll, if Bjk ~ B1.

(13.2)

If, for example, the first line of (13.2) is false, then (;g, Bjk) is a right component of X1. The second line follows similarly because in that case Bjk C_ D. Equation (13.2) shows that B1 is uniquely determined by A1.

332

The Decomposition M e t h o d

Assume that G(A, B) can also be represented in the form ~..

G(A, B) - X~ fi) H'(C', D'),

A!

X~ - Gi (A'I, J~l),

where X~ is the first A term ofh another canonical decomposition. We assert that one of the setsA ! B1 and B~ contains the other one. For assume that, if A h possible, x C B a B 1 and y C B ~ B1. Then the assumptions on x imply that h A! ~. x C B~ND' and therefore Aa C_ ANN(x)C_ Aa. Similarly the assumptions on y imply that A1 C_ A 1. Thus A1 A-! A 1. But as noted above, A1 determines B1 h by (13.2), and therefore B1 -- BJ' contrary to assumption. This proves the assertion. Next we assert that .B 1 -- B 1. Assume that, if possible, B 1 C B1. A! A A Then by theA argument ofA ! the t .previous assertion, A 1 C_ A1, Aand since A1 ! determines B1, we ..have A 1 C A1. Moreover, every., vertex of A 1 of adjacent to every vertex of/31 - B[, and every vertex of B~ of non-adjacent to every v e r t e x of A 1 - A 1. This shows that (AI,Ba)is a component of (A1,_B1) , contradictinfi^ the indecomposability of X1. This proves our second assertion that B1 - B~, and therefore D - D'. We also note that

d(Ail)

IDI, if A~, c_ A1, _ i.

(13.3)

for otherwise X1 decomposes. Equation (13.3) shows that D uniquely deterA ~ A! ! mines A1, and thus A1 - A 1. Therefore X1 - X 1. P A R T 2" Assume that G ~ (c~,/3) and G has two decompositions

G - H(C,D) Q F - H'(C',D') (!) F', with H ( C , D ) , H ' ( C ' , D ' ) C (c~,~)and F , F ' ~ (c~,~). It follows that the set R - V ( F ) N V(F') is not empty, for otherwise F ' is an induced subgraph of H and so F ' E (c~,/3), or symmetrically. Assume that , if possible, R' = V ( F ' ) - R 7/=r We represent R' in the form R ' - C1 U D1 with C1 C_ C and D1 C_ D. But then we have the decomposition F ' - F(R')(CI,D1)fi)F(R), which contradicts the indecomposability of F ' at the level (c~,/3). This shows that R ' - e , i.e., V(F') C_ V(F). By symmetry we also have V(F) C_ V(F'), and so V(F') - V(F). Since F is an induced subgraph of G, V(F) uniquely determines F, and also C and D as the sets of vertices of G adjacent to all and to none of the vertices of V(F), respectively. The sets C and D uniquely determine H. 9

13.2

T h e Canonical Decomposition

333

C o r o l l a r y 13.2.6 The canonical decomposition of the graph G(A, B) in the class (a,/~) coincides with its canonical decomposition in every class (c~',/3') containing (a, fl). In other words, a polar graph has a unique canonical decomposition. Proof. This follows from Part 1 of Theorem 13.2.4. C o r o l l a r y 13.2.7 Every non-polar graph G has a unique representation of the form G = H(C,D) 0 F, where H(C,D) is polar and F is an indecomposable non-polar graph. Proof. This follows from Part 2 of Theorem 13.2.4. P r o p o s i t i o n 13.2.8 If (a',fl') D (c~,fl), then there exist graphs indecomposable at the level (a,/~) but decomposable at the level (a', fl'). Proof. We may assume without loss of generality that a' > a. Let F be a non-polar indecomposable graph, such as the graph of Figure 13.1. Define a graph G by its canonical decomposition at the level (a',/Y) as G = S~,(A,O)O F, where S~, is an edgeless graph on a' vertices. We assert that G is indecomposable at the level (a,~). Assume the contrary, and let G1(A1, B1) be the first component of the canonical decomposition of G at the level (c~, fl). Let G2(A2, B2) be the first component in the canonical decomposition of GI(A1,B1)in the class (a', fl'). Then G2(A2, B2)is the first component of the canonical decomposition of G at the level (a',/3'). Since A1 does not contain a stable set of size c~', the same is true of A2. But then G2(A2, B2) cannot be the same as S~,(A, rg), contradicting Theorem 13.2.4. Now we specialize to the class (1, 1), the class of split graphs. Two polar isomorphic split graphs are said to be split isomorphic. We would like to use the canonical decomposition at the level (1, 1) to give a criterion for split graphs to be isomorphic. We need the following lemma, which spells out the freedom in choosing a split partition of a split graph. L e m m a 13.2.9 Let G(A,B) and G'(A',B') be isomorphic split graphs on the same vertices. If they are not split isomorphic, then there exists a vetrex x such that A = A ' U { x } (and therefore B' = B U { x } ) , we have the decompositions

G(A,B) G'(A',B')

= H(A- {x},B)O/(l({X},~), = H ' ( A ' , B ' - {x})O SI( e , {x}),

334


and H ( A - { x } , B ) and H ' ( A ' , B ' - { x } ) are split isomorphic, or the same holds with reversing the role of the primed and unprimed letters. P r o o f . The sets A - A ' = A N B' and A ' - A = A ' N B are both cliques and stable sets, hence their cardinality is at most 1. If they are both empty, then G(A,B) and G'(A',B') are split isomorphic, which is not the case by assumption. If both sets have cardinality 1, say A - A ' = {x} and A ' - A = {y}, then N(x) = N ( y ) = A N A', except that x and y are possibly adjacent; thus x and y are twins and by swapping them we turn (A, B ) i n t o (A', B'), so again G ( A , B ) a n d G'(A',B')are split isomorphic, contrary to assumption. Therefore one of the sets, say A - A', consists of a single vertex x and the other one, say A ' - A, is empty. Then the required properties are seen to hold. ,, We are now ready for characterizing split isomorphism, first proved by Tyshkevich.

Theorem 13.2.10 ([Tys80]) Let G(A,B) G'(A', B')

=

al(A1,B1)

@

-

G'1(A'I, B~) @

am(Am,Bm)

...

@

-..

@ G~n(A'n, B;)

be the decompositions of split graphs G and G' into indecomposable components in the class (1, 1). Then G and G' are isomorphic graphs if and only

if 1. m = n ; 2. Gi(Ai, Bi) and G'i(A~,B~) are split isomorphic for i -

1,...,m-

1;

3. either G,~(Am, Bin) and G~(A~, B~) are split isomorphic, or else one of them is K1 ({x }, ;g) and the other one is S1 (;g, {x }). P r o o f . The sufficiency is clear. For the necessity, we have two cases. If G and G' are split isomorphic, then Part 1 of Theorem 13.2.4 implies that our conditions hold with the first alternative in Condition 3. If G and G' are not split isomorphic, then Lemma 13.2.9 and Theorem 13.2.4 imply that our conditions hold with the second alternative in Condition 3. 9

Theorem 13.2.11 ([Tys80]) Let :

~'

-

~I(A1,B1)@ -

...

QGm(A,~,Bm)Q H

~'1 (A'I, B'I) @ "'" @ Gin (din, B'n) @ H'

13.3

Domishold Graphs and Decomposition

335

be the decompositions of non-split graphs G and G' with Gi(Ai, Bi) and G~(A~,B~) indecomposable in the class (1,1) and H and H' non-split indecomposable at the level (1, 1). Then G and G' are isomorphic graphs if and only if 1. m - n ; 2. G~(A,, B,) and G~(A~, B~) are split isomorphic for i = 1 , . . . , m; 3. H and H' are isomorphic. P r o o f . This is Part 2 of Theorem 13.2.4 specialized to the class (1, 1).

13.3


We apply the theory of canonical decomposition presented in Section 13.2 to enumerate and classify the domishold graphs, which we saw in Chapter 12. First we do the same with the simpler threshold graphs. This section is mainly based on Tyshkevich and Chernyak [TC85b]. We use the notation I 1, graphs of the form (13.4) and (13.5) are not isomorphic. Graphs of the form (13.4) (or (13.5)) corresponding to different compositions n -- rtl - J r " .

Jr- n s

and

n -

ml

-Jr'"-Jr

are isomorphic if and only if s - t + 1, ns - 1, for i - 1 , . . . , t - 1, or vice versa.

mt

1 and ni

n t - - m t --

-

mi

T h e o r e m 13.3.3 Let cm denote the number of non-isomorphic threshold graphs without isolated vertices that have m edges, with Co - 1 by convention. Let D ( m ) denote the set of partitions of m into distinct (positive integer) parts. Then 1.

2.

E cm x m m>O

l-I(l+xJ) 9 j_>l

Proof. The two parts of the theorem are equivalent by the well-known generating function for the number of partitions of integers into distinct parts. We prove Part 1 by exhibiting a suitable bijection [TC85b]. Part 2 has been proved by different techniques by Peled [Pel80], see Chapter 17. The partition m -- ml

- - I - ' ' ' nt- rrtk

with

ml

>

'''

>mk

:>

0

is mapped to the graph K1 @

Sml--m2--1 (~ ''" (~

K1 @

Sink_l--ink--1 (~

K1 @ Sm k C

D(m),

13.3


337

where empty graphs So are understood to be omitted. By Part 3 of Theorem 13.3.2, this mapping is injective. Moreover, for the same reason, every threshold graphs without isolated vertices can be represented in the form K~ Q Sl~ Q . . . Q K~, Q Sl, with positive ni and li, and it is easy to reconstruct the partition that maps to it. Thus the mapping is surjective. .. Turning now to domishold graphs, we denote by D the set of polar graphs in the class (2, 1) that are domishold. We easily deduce the following theorem from Theorem 13.2.4 and Condition 3 of Theorem 12.2.4: Theorem

13.3.4

1. All domishold graphs belong to the class (2, 1).

2. A graph in the class (2, 1) is domishold if and only if all its indecomposable components in this class have one or two vertices. .

o

The algebra (~, 5)) is a free semigroup generated by the polar graphs K l ( { X } , e ) , SI(•, { x } ) a n d S2(;g, {x,y})with x,y non-adjacent. The set of domishold graphs in the class (1, 1) coincides with the set of threshold graphs.

For the classification of domishold graphs, it is convenient to work with their complements, the codomishold graphs, which by Part 3 of Theorem 13.3.4 are the compositions of components of the form KI({X}, e ) , S l ( e , {x}) and M ( e , {x, Y}) with x, y adjacent. Using the notation Mtm - S~ Q M m - S[ Q M TM, we see that the codomishold graphs are the graphs that have one of the following forms of decomposition" M/lrn I @ I(n I @ M/2rn 2 @ tin 2 @ . . . @ MItrnt @ lin t

(13.6)

Mllrnl Q linl Q Ml2rn2 @ l(n2 @ "" Q l(nt_l @ Mltrrtt

(~a.7)

I'(nl @ Ml2m2 @ I'(n2 @ Ml3m3 @"" @ Mltmt @ I~nt

(13.8)

I(nl Q Mt~m~ Q K~2 Q Mt3.~3 Q " Q

K~_I Q Mt,.~,

(~a.9)

with t >_ 0. Here, as above, Kn denotes K~. We call a composition of the form (13.6) or (13.8) with nt >_ 3 or of the form (13.7) or (13.9) with It -t- rat > 2 &

standard decomposition.

338


T h e o r e m 13.3.5 Every codomishold graph with at least three vertices has a unique standard decomposition. P r o o f . Observe that in each standard decomposition, every component with the possible exception of the last two is uniquely determined. For instance, if the graph has isolated vertices or edges, then [1 and ml are the numbers of isolated vertices and edges, respectively, n l is the number of dominating vertices remaining after the previous vertices are deleted, and so on. It follows that in any two standard decompositions, the numbers of components differ by at most 1. The following are the only cases. Case 1" One standard decomposition ends with Mtm q) K~. Then n >_ 3, and n is the number of vertices of degree at least 2 in the graph induced by the vertices of these last two components. Therefore n is uniquely determined, and with it the last two components. C a s e 2" One standard decomposition ends with K~ Q Mzm. Then l + m >_ 2, and n is the number of dominating vertices in the graph induced by the vertices of these last two components. So, again, n and the last two components are uniquely determined. C a s e 3" All standard decompositions have exactly one component, namely Mllml or K ~ . This case is trivial because the graph has at least three vertices. 9 Every threshold graph is a unigraph, i.e., determined up to isomorphism by its degree sequence (in fact, a threshold sequence has a unique labeled realization). In contrast, non-isomorphic domishold graphs can have the same degree sequence. For example, the complete bipartite graph K2,3 and the pentagon with one chord have the same degree sequence (3, 3, 2, 2, 2). The first one is domishold, since its complement has the standard decomposition M2,0 (~ K3; the second is not, since it contains an induced P4, which is forbidden by Theorem 12.2.4. However, a domishold graph without isolated vertices is an edge-unigraph, i.e., determined up to isomorphism by the list of degrees of its edges , where the degree of an edge is defined as the unordered pair of degrees of its endpoints. To show this, we make the following definition. D e f i n i t i o n 13.3.6 Let a graph G have an alternating ~-cycle t with edges ab, cd and nonedges ad, bc, and let deg(a) = deg(c). We then say that G admits a swap t, and the graph t G is obtained from G by swapping the edges and nonedges oft. See Figure 13.2.

13.3


339

Figure 13.2: A swap.

a

d

b

a

c

d I

I

I I Ark

I I

b

d e g ( a ) - deg(c)

c

d e g ( a ) - deg(c)

Chernyak and Chernyak [CC81] have proved that if two graphs without isolated vertices have the same list of degrees of edges, then one can be obtained from the other by a sequence of swaps. Therefore it is enough for us to prove the following result. P r o p o s i t i o n 13.3.7 If G is a domishold graph admitting a swap t, then G

is isomorphic to tG. m

P r o o f . Clearly the complement G also admits t as a swap, and so it is enough to show that G and tG are isomorphic. By Theorem 13.3.5, G has a unique standard decomposition in one of the forms (13.6) - (13.9). Without loss of generality we may assume that the first component of this decomposition contains one of the vertices a, b, c, d of t, for we may delete all preceding components, if any. Considering the polar graph G(A, B), we examine the possible cases of which of the sets A, B contains each vertex of t, taking into account the fact that A is a clique and B induces a matching with isolated vertices. C a s e 1" a, b, c, d C B. Since a and c have the same degree, they must be in the same component, which must also contain their neighbors c and d. But then clearly tG is isomorphic to G. C a s e 2" a,b,c E B, d C A (the case a,c,d C B, b C A is similar). The neighbors a and b are in the same component, and c must be in a later component, since the vertex d of A is a neighbor of it but not of a. Therefore a, b C Miami, hence deg(a) - 1, and it follows that deg(c) - 1. This implies


340

that d E K ~ , nl - 1, and c E Mt2m2. But now swapping a and cgives a graph isomorphic to G and also equal to tG. C a s e 3" b,c,d E B, a E A (the c a s e a , b,d E B, c E A i s similar). Since a E A is adjacent to b E B but not to d E B, d (and with it its neighbor c E B) must be in an earlier component than b. Hence c,d E Mt~n~, and therefore deg(c) - 1, from which deg(a) - 1. Hence b is the only vertex in its component, and this component is the last one. But this contradicts the definition of the standard decomposition, so this case is impossible. C a s e 4: a,b E A, c,d E B (the case c,d E A, a,b E /3 is similar). We must have c, d E Mt~nl, so deg(c) - 1, hence deg(a) - 1. This implies that A - {a, b}. Now if a and b are in the same component, then this component is K ~ and is the last component, contradicting the definition of the standard decomposition. If a and b are in different components, then since deg(a) - 1, the component containing a has no other vertices and is the last component, again contradicting the definition of the standard decomposition. Therefore this case is impossible. C a s e 5" a, c E A, b, d E /3 (the case b, d E A, a, c E B is similar). Since a is a neighbor of b and c is not, the component containing a is earlier than the one containing c. But this contradicts the fact that d is a neighbor of c but not of a, so this case is impossible. 9 We conclude this section by classifying the forcibly domishold degree sequences, i.e., those degree sequences all of whose realizations are domishold. We prove, in fact, that a forcibly domishold sequence must be unigraphic. As usual, we denote a star on n + 1 vertices by KI~. 13.3.8 A forcibly domishold degree sequence is unigraphic, and its realization is of one of the following types:

Theorem

1. a threshold graph; 2. rnK2 U Kin with rn, n > 1; 3. T(A, B ) Q inK2 U Kin with rn, n >_ 1, where T is a threshold graph. Conversely, every graph of the one of the above types is domishold and a unigraph. P r o o f . Let d be a forcibly domishold sequence with a realization G. We have to show that G i s o f t y p e 1 - 3 . I f G E (1,1), then G i s o f t y p e 1 b y Part 1 of Theorem 13.3.4, and we are done. Assume now that G ~ (1, 1).

13.4

Box-Threshold Graphs and Decomposition

341

We show below that if G is indecomposable at the level (1, 1), then it is of type 2, from which it follows that if G is decomposable at the level (1, 1), then it is of type 3. Since all realizations of d are domishold, and since every transfer (cf. Corollary 3.1.11) done on the complete bipartite graph K2,3 yields the nondomishold pentagon with a chord, G has no induced K2,3. Consider the standard decomposition of G. Since G is indecomposable at the level (1, 1), the first component of the decomposition is of the form M0ml, rrtl ~ 1. If there are no other components, then rn~ >_ 2 because G ~ (1, 1), and hence G is of type 2. Assume now that there are other components. Then G is triangle-free, since G is (K2 U K3)-free. Hence in the polar representation G(A,B), IAI _< 2. By the definition of a standard decomposition, the last component must then be of the form Mr,0 with It >_ 1. Further, t - 2 and nl 1 since G is triangle-free. This means that -

-

G

--

M0rrt

I

@ K1 @ M I 2 0

--

rnlK2 U Kll2,

so G is of type 2, as required. To prove the converse, it is enough to notice that if G is of type 1 3, then it is domishold, and every transfer done on it yields an isomorphic graph, hence it is a unigraph by Corollary 3.1.11. 9

13.4

B o x - T h r e s h o l d Graphs and D e c o m p o s i tion

This section is adapted from Tyshkevich and Chernyak [TC85a]. We characterize box-threshold (BT) graphs in terms of their decomposition into indecomposable components at the level (1, 1), and then use these results to enumerate them. All the polar graphs in this section are in the class (1, 1) (split) and the decompositions will always be at the level (1, 1). We use the notation Na(x) for the set of neighbors of a vertex z in a graph G, and ~ a for the vicinal preorder ~ on the graph G. If A and B are sets of vertices of a graph, we use NA(B) to denote A N ( U N(b)), and A ~- B to indicate bEB

that a ~- b for all a C A and b C B, and similarly for A ~ B. L e m m a 13.4.1 A composition is BT if and only if each component is BT.

342


P r o o f . Let F = G(A, B) Q H. We have to show that F is BT if and only if G and H are BT. Note that A ~ V ( H ) ~ B. " O n l y if"" Assume that F is BT. To show that H is BT, consider vertices x,y of H with degH(x ) > degH(y ). Since Na(x) = Na(y), we must have degF(x ) > degp(y). Since F is BT, this implies that x L~ Y, and therefore x ~H Y, so H is BT. To show that G is BT, consider vertices x,y of G with dega(x ) > dega(y ). If x and y are both in A or both in B, then since NH(X) = NH(y), we must have degr(x ) > degp(y). Since F is BT, this implies that x ~ p y, and therefore x ~ a y, as required. If not, then necessarily x C A and y C B, which already implies that x ~ a y. " I f " : Assume that G and H are BT. To show that F is BT, assume that degF(x ) > degF(y ). If y E A, then x E A and dega(x ) > dega(y ). Since G is BT, we have x ~ a y and hence x ~ p y. If y E V(H), then x E V(H) or x C A. In the first case x ~ p y since H is BT, and in the second case x ~ p y since A ~ V(H). If y C B, we similarly reach the same conclusion x LF YTherefore F is BT. .. The next two lemmas identify the indecomposable BT graphs. Observe that if a split graph G(A, B) is biregular, then all the vertices of A have the same degree and all the vertices of B have the same degree. L e m m a 13.4.2 A split graph with more than one vertex is an indecompos-

able B T graph if and only if it is biregular and has no dominating or isolated vertices. P r o o f . " O n l y if": Let G(A,B) be an indecomposable graph in the class (1, 1) with more than one vertex and also a BT graph. By being indecomposable with more than one vertex, G has no dominating or isolated vertex. We show that it is biregular. Let A' be the set of vertices of A with largest degrees, and let B ' = N B ( A - A'). Since G is BT, we have A' ~ A - A' and therefore B' is completely joined to A'. Unless A ~ = A, we then have the decomposition

G(A, B) = G'(A', B - B') Q H ( A - A', B'), contradicting the indecomposability of G. So A' = A and all the vertices of A have the same degree. Similarly all the vertices of B have the same degree. " I f " : Let G(A, B) be a split biregular graph with no isolated or dominating vertices. To show that G is BT, notice that if deg(x) > deg(y), then x C B and y E A, which implies in turn that x ~ y. To show that G is indecomposable, assume the opposite and let G = G'(A', B ~) Q H be a decomposition

13.4

Box-Threshold Graphs and Decomposition

343

of G with G ~ and H non-empty. There must exist a vertex x C A' such that NB,(X) 7/= rg, for otherwise the vertices of B' are isolated in G, and if B' = rg then the vertices of A ~ are dominating in G. Similarly there must exist a vertex y C B' such that N ( x ) is a proper subset of A'. Now pick any vertex h of H, and we must have dega(x) > dega(h ) > dega(y ). But this contradicts the biregularity of G. L e m m a 13.4.3 A non-split graph with more than one vertex is an indecomposable B T graph if and only if it is regular and has no dominating or isolated vertices. P r o o f . " O n l y if"" Let G be an indecomposable non-split graph with more than one vertex and also a BT graph. By being indecomposable with more than one vertex, G has no dominating or isolated vertices. To show that G is regular, let A be the set of vertices with the largest degree in G, let H = (1 N(a), and let B = V(G) - (A U H). For each a E A and x E B U H aG_.A

we have deg(a) > deg(x), and so A ~ BU H. Each b e B has a non-neighbor in A, for otherwise b C H; therefore by A ~- B U H, b has no neighbors in B U H . I f H r 0 , then by A ~- H, A is a clique and G decomposes as a = G ' ( A , B ) O G[HI, where G[H 1 is the subgraph induced by H. This contradicts the hypothesis, and so H = O. If A is a clique, then G is a split graph G(A, B), contrary to hypothesis. Therefore there is a nonedge uv in A. S i n c e u ~- B, v has no neighbors in B. Let b b e a v e r t e x o f B , if any. Since it is not isolated, it has a neighbor z C A. Since A >- b, the closed neighborhood N[z] contains A U {b}. However, N[v] is a proper subset of A, and thus deg(z) > deg(v), contradicting the definition of A. This proves that B = 0 , and so G is induced by A and is therefore regular. " I f " : Let G be a regular graph without dominating or isolated vertices. By being regular, G is BT. To show that G is indecomposable, assume the contrary and let G = G'(A, B ) Q F, where G' and F are non-empty graphs. It follows that A ~- B, and hence deg(a) > deg(b) for all a E A and b E B. Now A r 0 , for otherwise the vertices of B are isolated, and B 7~ 0, for otherwise the vertices of A are dominating. This contradicts the regularity of G. " The next theorem characterizes BT graphs in terms of their indecomposable components at the level (1, 1).

The Decomposition M e t h o d

344

T h e o r e m 13.4.4 Let G = Xx Q ... Q X~ be the decomposition of a graph G into indecomposable components at the level (1, 1). Then G is B T if and

only if 1.

X I

, . . . ,

Xn-1 are split biregular graphs;

2. Xn is a split biregular or non-split regular graph. P r o o f . Note that if Xi has more than one vertex, then it cannot have dominating or isolated vertices by being indecomposable. The result then follows immediately from Lemmas 13.4.1, 13.4.2 and 13.4.3. [] We now enumerate the BT graphs using the above results. Let us denote by ap the number of non-isomorphic indecomposable split graphs on p vertices, by bp the number of non-isomorphic indecomposable non-split graphs on p vertices, and by cp the number of non-isomorphic BT graphs on p vertices. We also use the generating functions of these sequences, namely

a(x) - ~ apx p,

b(x) - ~ bpx p,

p>_l

p>_l

c(x) - ~ cpx p. p>_l

By the uniqueness of the canonical decomposition and Lemma 13.4.1, we have the following relation: 4x) =

1 -a(x)

Below we show how the ap and bp can be computed. C o m p u t i n g ap: Clearly a~ - 1 and for p > 1, by Lemma 13.4.2, ap is the number of non-isomorphic biregular split graphs on p vertices with no dominating or isolated vertices. Therefore ap --

Z Amn, m+n=p m,n>O

where A,~n is the number of non-isomorphic biregular split graphs with no dominating or isolated vertices, having the form G ( A , B ) with IAI = m, It l = S u c h ~ graph G ( A , B ) i s specified by the adjacency matrix M whose rows correspond to the vertices of A and whose columns correspond to the vertices of B. It is an m x n 0/1 matrix with constant row sums r, 0 < r < n, and constant column sums s, 0 < s < m. Let us denote by

13.4

Box-Threshold Graphs and D e c o m p o s i t i o n

345

Ad~n~ the set of all such matrices, and by Sm and S~ the symmetric groups of degrees m and n respectively. We define an action of the direct product Sm x S~ o n . / ~ m n by

(7r,a)M = PTrMP~ -1 ,

~ESm,

aES~,

where P~, Po are the permutation matrices of 7r, a, respectively. Then Amn is the number of orbits of this group action. By Burnside's Lemma, we have 1

A.~ =

~

F(rr, a),

TIt! TL! (Tr,o')e"~mXSn

where F(~r, a ) i s the number of matrices M C JPlmn that are fixed by (re, a), that is (Tr, a ) M = M. To compute F(r~, or), we write rr and cr as products of disjoint cycles 71- - -

where rri is a cycle of length

7f'l

9 9 9 7Ck~

0"

~

O" 1 9 9 9 0 " I ~

mi and aj is a cycle of length nj,

77"t1 :> " ' "

~

?Ytk~

Using the notation m = ( m l , . . . , m k ) a n d n = ( n l , . . . , n t ) , we observe that F(rr, a ) d e p e n d s only on m and n and not on the particular rc and a, and we may denote it by F ( m , n). There are m!/ml...mk permutations rr corresponding to m and similarly for n. Hence rtl >

"'"

~

rtl, m

-- ml

+'..

+ ink,

rt - - rtl + . . .

1

Am,~ - }-~. 77"tl . . .

?Tt k Tt 1 . . .

+ rtl.

r(m, n), TLl

where the summation extends over all the partitions m , n of the integers rn, n into positive parts. To compute F(m, n), consider the action of (rr~, aj) on the submatrix M~j of M consisting of the m i rows cycled by rri and the nj columns cycled by aj. The entries of Mij are partitioned into dij = g c d ( m / , nj) cycles, each one of length rninj/dij (i.e., the least common multiple of m i and nj). Each such cycle meets each row of Mij nj/dij times, and each column of Mij mi/dij times. The matrix M is fixed by (rr, a) if and only if in each submatrix Mij, each cycle is all-0 or all-1. If exactly t cycles of Mij are all-l, then Mij has constant row sums tnj/dij and constant column s u m s tmi/dij. We denote by e~...~k;,~..., , the number of m x n 0/1 matrices M fixed by (rr, a) for which the rows cycled by rri have constant row sums ri and the columns cycled by


346

O'j have constant column sums sj. Then by the above discussion we obtain the generating function

E

"''Yk~kZ~ a ' ' ' z ~ ~ - -

Crl,...,rk;sl,...,sty~l

rl ,...,rk 81 ,...,3l

H

l t

~(~) + ~(~)

>_ t.

But these inequalities are clearly inconsistent with t > 0. The following theorem characterizes pseudothreshold graphs. T h e o r e m 14.2.7 For every graph G equivalent:

(V, E), the following conditions are

1. G is pseudothreshold; 2. P* n Q* - 0 and G is 3K2-free; 3. V can be partitioned into sets P, Q, R such that (a) every vertex in P is adjacent to every vertex in P U R, (b) no vertex in Q is adjacent to another vertex in Q u R, (c) there are no three mutually nonadjacent vertices in R. P r o o f . 3) =~ 1)" An appropriate assignment is

w(u)-

..

0, i f u c Q 1, i f u E R 2, if u C P

t-2.

14.2

Pseudothreshold Graphs

355

1) =~ 2): This follows from Lemma 14.2.4 and Lemma 14.2.6.

2) =~ 3): We prove this by means of a simple algorithm that terminates in O(rt 4) steps either by showing that Condition 2 does not hold or by constructing the partition described in Condition 3. The algorithm is as follows. First of all, find P* and Q*. (This can certainly be done in O(n 4) steps.) Then find out whether P * N Q* = ~. (If not, stop: Condition 2 does not hold.) Then set S = V - (P* U Q*); note that by the definition of P* and Q*, every vertex in S is adjacent to all the vertices in P* and to no vertex in Q*. Let So consist of all the vertices in S that are adjacent to no other vertex in S; define

P=P*,

Q=Q*USo,

R=S-So.

Find out whether there are three mutually nonadjacent vertices in R. If not, stop: by Lemma 14.2.5 P, Q and R have all the properties described in Condition 3. If, on the other hand, there are three mutually nonadjacent vertices Ul,U2,U3 C R, then each ui is adjacent to some vi E R. Using the fact that R N (P0 U Q0) = o , we may now easily verify that the set {Ul, u2, u3, Vl, v2, v3} induces a 3K2 in G. (To see this, note that the vj's are distinct, mutually nonadjacent and that each vj is adjacent to exactly one ui.) Hence Condition 2 does not hold. " The corollary below follows from the proof of the above theorem.

Corollary 14.2.8 If G is pseudothreshold, Equation (14.1) can be satisfied with t = 2 and w(u) C {0, 1,2} for all u.

"

We conclude this section with the following remarks.

Remarks 1. The degree sequences of pseudothreshold graphs on n vertices lie on the boundary of the convex hull of the degree sequences of all graphs on n vertices, because they satisfy the facet inequality

Z iEP

x ~ - ~ x~ _< IPl(n- 1 - I Q I )

(14.2)

iEQ

with equality, where P, Q, R is the partition of V as in Condition 3 of Theorem 14.2.7. This follows from Lemma 3.3.13.

356

Pseudothreshold and Equistable Graphs

2. The converse is not true. For instance, the graphs K1@K3,3 and K1@C6 have the same degree sequence (6, 4, 4, 4, 4, 4, 4) satisfying (14.2) with equality, where P is a singleton corresponding to the vertex of K1, Q is empty, and R consists of the other six vertices. But the first graph is not a pseudothreshold graph and the second one is. 3. The complement of a pseudothreshold graph need not be pseudothreshold. For instance, 3K2 is pseudothreshold. In view of the above remarks, the following questions are interesting.

Problem 1. Which degree sequences lie on the boundary? 2. Which degree sequences are potentially pseudothreshold?

14.3

Equistable Graphs

Payan [Pay80] introduced the equistable graphs, and later Mahadev, Peled and Sun IMPS94] obtained several interesting results on these graphs. We present the results of IMPS94] in this section. Definition 14.3.1 A graph G - (V,E) is said to be equistable if there exists a mapping ca" V --. N + such that for all S C_ V, S is a maximal stable set ~

c a ( S ) - ~ c a ( v ) - 1. vES

Payan [PayS0] showed that the threshold graphs and the domishold graphs are equistable, and announced that the P4-free graphs are equistable. In the next subsection we introduce a subclass of equistable graphs the strongly equistable graphs. We show that the strongly equistable graphs are closed under disjoint unions and joins, and therefore the P4-free graphs are strongly equistable. Later in this section, necessary conditions for equistability and sufficient conditions for strong equistability are obtained and used to characterize the equistability of some classes of perfect graphs, pseudothreshold, and outerplanar graphs. The structure of these equistable graphs is also given.

14.3

Equistable Graphs

357

Finally, we show that for a wide family of graphs, equistability implies strong equistability, and discuss the closure of various classes of strongly equistable graphs under graph substitution. An induced path on four vertices is denoted by P4(a, b, c, d) to indicate that its vertex set is {a, b, c, d} and its edges are ab, bc, cd.

14.3.1

Strongly Equistable Graphs

Let G = (1,1,E) be a graph. We denote by $ the set of maximal stable sets of vertices of G, and by 7- the set of all other nonempty sets of vertices of G. Put A - A(G) - {~v" V ~ R+'~v(S) - 1 for all S C $}. Thus A(G) is a set defined by a finite number of linear equations and nonnegativity inequalities, i.e., it is the intersection of an affine set with the first orthant. Since each vertex belongs to some maximal stable set, A ( G ) is bounded and therefore it is a polytope, possibly empty.

Definition 14.3.2 A graph G is said to be strongly equistable if for each T E 7- and each c 2, Vi

(14.11)

-

0

v j _> 2, vi.

( 4.12)

(14.9)

14.3

Equistable Graphs

365

To prove that G is strongly equistable, let T E T and c _< 1 be given, and assume that, if possible, w(T) - c for all w E ~4. Then the equation aJ(T) - c is a consequence of Equations (14.9)-(14.12), and therefore a linear combination of them. Therefore the equation w ( T ) - c takes the form t

E (~p[O,.)(p)-~-(.u(NQ(p))] Jr E ")/i[('u(~il ) + ("g(]r pEP t

-t- ~

+ (,M(Q)]

i=1 t

~_~ aij[w(kij) - co(kii)] + ~ ~ a'ij[w(k~j ) - w(k~l)]

i = 1 j_>2

(14.13)

i = 1 j_>2 t

pEP

i=1

' Since w(p) appears only once in the sysfor some multipliers 5p, ~/i, aij, aij. tern (14.9)-(14.12), and with a coefficient of 1, we have 5p = 1 if p E P N T, 5p=0ifpE P-T. Similarly for j _ > 2 , aij = 1 ifkij E K i n T , aij = 0 i f k~j E K~ - T, a~j - 1 if k~j C K[ N T, a}j - 0 if k~j E K~ - T. Therefore, in order for the coefficient of w(k~l) in the linear combination (14.13) to assume its correct value (1 if k~l E T, 0 if not), 3'~ must be equal to ]Is n T I. Similarly 7i must also be equal to IK[ N T I. Thus we can rewrite (14.13) in the following way: t

pEPnT

[~(p) + ~(xq(p))] + E ~ ( Q ) i=1 t

+ ~(/q n T) + ~(/(~

n T ) --

IP n TI + ' ~ ~,

(14.14)

i=1

where

~ - IIq n T I Note that

IK[ n TI.

(14.15)

t 1 ~/i - 71R n T I.

(14.16)

i=1

Therefore 1 IP N T] + ~]R n T] - c.

(14.17)

Further, in order for w(q), q E Q, to assume its correct coefficient in (14.14) (lifqEONT, 0ifqEO-T),wemusthave 1

INpnr(q)l + -~lR A T

]e 1 I - ~[ 0

for q E Q N T forqEQ-T.

(14.18)


366

From (14.15)-(14.17), c is an integer, and from c _< 1 we obtain c - 0 or C--1. C a s e 1" c - 0. Then by (14.17) P A T - R A T - 0 . Then the left-hand side of (14.18) is 0, which implies Q n T - O. Therefore T - O, contradicting TET. C a s e 2" c - 1. From (14.17), either P n T - {p} and R n T - O, or else P A T - O and R A T - {kij, k~l } (by (14.15)). In the first possibility, (14.18) can be rewritten as [I.N{p}(q).-

1 forqCQNT 0 for q E Q - T,

which means that Q N T - NQ(p). Thus T - { p } U N Q ( p ) , which is a m a x i m a l stable set, contradicting T C T. In the second possibility, the left-hand side of (14.18) is 1, which implies Q - T - o . Thus T - {kij, k~l} U Q, which is a m a x i m a l stable set, contradicting T E T. 9 D e f i n i t i o n 1 4 . 3 . 2 0 A planar graph is said to be o u t e r p l a n a r if it can be embedded in the plane so that every vertex lies on the boundary of the infinite region. Obviously, a graph is outerplanar if and only if each of its blocks is outerplanar. The following l e m m a is used to prove Theorem 14.3.23 below. L e m m a 1 4 . 3 . 2 1 Every 2-connected outerplanar graph with more than two vertices is a Hamiltonian graph. P r o o f . E m b e d the 2-connected outerplanar graph in the plane so t h a t all vertices lie on the b o u n d a r y B of the infinite region. Let C - V l , . . . , v k be ~ cycle all of whose edges lie on B. If C is not Hamiltonian, there is a vertex Ul ~ C adjacent to some vertex of C, say Vl. Since Ul is on B, Ul is in the exterior of C. Let v~ be any vertex of C, r :/: 1. By 2connectivity there is a cycle C' - V l t t l U 2 . . . containing the edge VlU 1 and the vertex v~. Let rn be the smallest index such that u,~ coincides with a vertex on C, say vj. Note that U l , . . . , u m - 1 are in the exterior of C and t h a t 2 _< j _< k. If j - 2, then V l , U l , . . . , u m , v 3 , . . . , v k is a cycle enclosing the interior of the edge VlV2, so that edge cannot lie on B, contradicting the choice of C. A similar contradiction occurs if j - k. If 2 < j < k, then either the cycle V l , U l , . . . , u m , v j + l , . . . , v k encloses v2 or else the cycle Vl, U l , . . . , Urn, V j - I , . . . , ?22 encloses vk, so either v2 or vk is not on B, a contradiction. 9

14.3

Equistable Graphs

367

D e f i n i t i o n 14.3.22 A flower Fn

-

Fn(?tl,...,?.tn;Yl,...,Vn)

is a graph consisting of an even cycle UlVlU2V2...UnVn whose chords are precisely the short chords UxU2, U2U3, . . . , U~Ul. Figure 1~,.3 illustrates the flowers F2 and F6.

Figure 14.3: Flowers.

V2

vx

U2

Y3 ..

U

V4 v

U

2

.Yl

6

,., V6

Ul

?,)2

~

~

U

2/ V5

F2

F6

T h e o r e m 1 4 . 3 . 2 3 Let G be an outerplanar graph. Then the following conditions are equivalent:

1. G is equistable; 2. G is strongly equistable; o

(a) each block of G is a flower, a square, or a complete graph on at most 3 vertices; (b) in a block of G that is a flower or a square, no vertex of degree 2 in the block is a cut vertex:

368


(c) in a block of G that is a complete graph, not all vertices are cut vertices.

P r o o f : 3) =~ 2)" By Theorem 14.3.4 we may assume that G is connected. If a block of G is a square, then by Condition 3b, this block is the entire G, and it is strongly equistable by being P4-free. If the block is a single vertex, then it is the entire G by connectivity and the same conclusion holds. We may therefore assume in addition that each block of G is a flower or a complete graph on two or three vertices. We construct a set S of vertices as follows" from each block that is a flower, put in 5' all the vertices of degree 2 in the block; from each other block put in S exactly one vertex that is not a cut vertex. Clearly the vertices of S in each block from a maximal stable set in the block, and further, by Condition 3b, S contains no cut vertex of G. Therefore S is a maximal stable set of G. It is easy to verify that S satisfies the condition of Theorem 14.3.10, and hence G is strongly equistable. 2) =~ 1)" This follows from Lemma 14.3.3. 1) =~ 3)" First we prove Condition 3a. Every block B of G is outerplanar. If B has at most four vertices, then B is clearly F2, a square, or a complete graph on at most three vertices. So assume that B has at least five vertices. By Lemma 14.3.21 B has a Hamiltonian cycle C - Zl . . . zk. We assume that G is embedded in the plane so that every vertex lies on the boundary of the infinite region. Therefore every chord of C lies in the interior of C. We now show the following two facts" F a c t 1 No f o u r consecutive vertices along C induce a P4. P r o o f . Assume without loss of generality that C has a P4(Xl, x2, z3, x4). By Theorem 14.3.8, G has a vertex v adjacent to both x2 and x3, but to neither Xl nor x4. Clearly v is in B, and hence v - xi for some i _> 6. Note that zi is the only vertex of G adjacent to both x2 and x3 by planarity. If x4xi-1 ~ E , extend { X l , X 4 ~ X i _ I } t o & maximal stable set S, and S will violate Theorem 14.3.8 with respect to P4(xl,x2, x3, x4). Therefore x 4 x i - , E E. See Figure 14.4. Now consider P4(x2, xi, xi-1, x4). The only vertex that can possibly be adjacent to both xi and xi-1 is x3, which is adjacent to x2, a contradiction to Theorem 14.3.8. This proves Fact 1. F a c t 2 Between every four consecutive vertices along C, there is exactly one chord, and this chord joins two vertices at distance 2 along C. P r o o f . Consider for example Xl,X2, X3, X4. By Fact 1, there is a chord between them, and it is enough to show that XlX4 is not a chord. If XlX 4 is & chord, then between x2, z3, x4, xs or between xk, x l , x 2 , x3 there is no chord,

14.3

Equistable Graphs

369

Figure 14.4: Illustrating the proof of Fact 1. Dashed lines indicate nonedges. X4 X3

9

X 2

Xl

Xi-1

contradicting Fact 1. This proves Fact 2. From Fact 2, it follows that k is even and B has a spanning flower fn(tll,

. . . , tln;

Yl,.

. . , Yn),

where n - k/2 >_ 3. It remains to show that the cycle U l . . . t t n is chordless in B. Assume for example that U l U i C E , where 2 < i < n. Then the P4(vl, Ul, u~, v~) and any maximal stable set S containing { V l , . . . , v~} contradict Theorem 14.3.8. This proves Condition 3a. To prove Condition 3b, consider first a block B that is a flower fn(tll,

. . . , tln;

Yl,

. . . ~ ~)n)

and assume for example that Vl is a cut vertex. Let x be any neighbor of t~1 outside B. Then the P4(x, vl, u2, v2) and any maximal stable set S containing {x, v2, v~} contradict Theorem 14.3.8. Similarly, consider a block B that is a square abcd, and assume for example that a is a cut vertex. Let x be any neighbor of x outside B. There is no vertex adjacent to both a and b, yet there is P4(x, a, b, c), again contradicting Theorem 14.3.8. The proof of Condition 3c is similar to the proof of Condition 3b. 9

370

14.3.4


Equistability, Strong Equistability, and Substitution

In Subsection 14.3.2 we saw among other things that for split graphs and block graphs, equistability is equivalent to strong equistability. Below we show that more generally, the same is true for all perfect graphs. Recall the definitions of .A, S, and T at the beginning of Subsection 14.3.1. Theorem

14.3.24

1. If G is a perfect graph and .A 7~ 0, then .A has an integer-valued weight function w. 2. A(G) has an integer-valued weight function if and only if G has a clique meeting all maximal stable sets. 3. If G is equistable, and if for every T C T there exists some co E A such that w(T) is not in the open interval (0,1), then G is strongly equistable. ~. In particular, if G is an equistable perfect graph, and more generally if G is an equistable graph with a clique meeting all maximal stable sets, then G is strongly equistable. P r o o f . 1" Consider the clique matrix M of the complementary graph G, whose rows are the incidence vectors of the maximal stable sets of G. By Ldvasz's Perfect Graph Theorem [Lov72], G is perfect. Hence by Chvs Theorem [Chv75], the extreme points of the polytope defined by Ma~ _< 1, aJ _> 0 are integer-valued. Note that A is a face of this polytope, and since by assumption A r 0 , .4 contains an extreme point of the polytope. 2: Let co be an integer-valued weight function in .4. Since every maximal stable set S satisfies aJ(S) - 1, S must contain exactly one vertex x with w(x) - 1. Let C be the set of all vertices x such that aJ(x) - 1. Clearly C meets all maximal stable sets. Moreover, C is a clique, otherwise some two vertices of C can be extended to a maximal stable set S with a~(g) >_ 2. Conversely, if C is a clique meeting all maximal stable sets, let aJ be the characteristic vector of C. Clearly w is an integer-valued weight function in

A. 3: Since G is equistable, there exists a non-negative weight function a / s u c h that J ( S ) - 1 if and only if S E $. To show that G is strongly equistable,

14.3

Equistable Graphs

371

let T E 7 " a n d c_< 1 be given. B o t h w a n d w ~ are in A. I f c < 0 there is nothing to prove. If c - 0, then co(T) r 0 since T r e (any vertex x C T can be extended to a set 5' C $; if w'(x) - O, then co'(S- x) - 1 even though S-x ~,5'). If0 < c < 1, then co(T) r c. If c - 1, t h e n a / ( T ) r csince

T s. If G is equistable, then ~4 -~ ~. If in addition G is perfect, then by Parts 1 and 2 above, G has a clique meeting all maximal stable sets. If G is equistable and has such a clique, then by Parts 2 and 3, G is strongly equistable. [] Note that the flower Fs is a non-perfect equistable graph having a clique meeting all maximal stable sets. The strongly perfect graphs [BC84] are the graphs in which every induced subgraph has a stable set meeting all maximal cliques. Their complements then have a clique meeting all maximal stable sets. Moreover, the strongly perfect graphs are perfect [BC84], and so are their complements. A clique of the form N[v] for some vertex v is said to be a simplicial clique. Clearly a simplicial clique meets all maximal stable sets. We now characterize the graphs satisfying the sufficient condition of Theorem 14.3.10 using simplicial cliques. 4:

T h e o r e m 14.3.25 For every graph G - (V, E), the following conditions are equivalent:

1. G has a maximal stable set S such that for every u, v C V - S, uv C E if and only if Ns(u) n N s ( v ) # ~; 2. every edge of G is in a simplicial clique. P r o o f : 1) ~ 2)" For every s C S, N[s] is a clique by the "if" part of Condition 1. These simplicial cliques cover E by the "only if" part of Condition 1. 2) =~ 1)" Let N[v~],... ,N[vk] be all the distinct simplicial cliques, and let S - {Vl,..., vk}. The set S' is stable, since if vivj C E, then N[vi] C_ N[vj] and N[vj] C_ N[vi], contradicting the distinctness of N[vi] and N[vj]. Further, S is maximal stable, because every vertex belongs to some simplicial clique by Condition 2. Again by Condition 2, if u, v C V - 5' and uv ~ E, then u and v cannot have a common neighbor vi, because N[vi] is a clique.

C o r o l l a r y 14.3.26 If G - (V, E) is a graph satisfying Condition 1 of Theorem 14.3.25, then for each x C V there exists some w C A with co(x) - 1.

~

~

~

C~

~

9 r bo

~

II ~

~ ~

Ch

~'~ ~ "

~"

~.~.~

~i,

9

-

mr
1, we have the same conclusion, otherwise wp~ could be decreased below 1, contradicting the optimality of the threshold assignment. Similarly wq + w~ _< t - 1. On the other hand, wp + wq + Wpq ~ t together with wpq < 1 imply that w p + w q > t - 1, and similarlyw~ + w ~ > t - 1. These four inequalities are inconsistent. ..

15.2

T h r e s h o l d Weights

379

The following result follows directly from the constraints (15.1). It is very useful in proving that certain graphs possessing symmetry properties are heavy. L e m m a 15.2.5 Let ab be an edge and cd a nonedge of a graph G. Every feasible threshold assignment (w; t) for G satisfying wc + Wd > Wa + Wb must satisfy Wab >_ 1. In particular, if e -- xy is an edge, zy is a none@e, and Wz > wx, then we > 1. The following corollary is an example of applying Lemma 15.2.5. C o r o l l a r y 15.2.6 All cycles of length 4 or more and their complements are heavy. Proof. Let G be the k-cycle Ck. Consider any optimal threshold assignment (w;t) of G. Each of the k cyclic permutations of the vertex weights of w around the cycle accompanied by a corresponding permutation of the edge weights is again an optimal threshold assignment. The average (w*; t) of these k optimal threshold assignments is yet another optimal threshold assignment (this follows from the convexity of the set of optimal threshold assignments). But (w*;t) enjoys the property that its vertex weights are constant. Let e be an arbitrary edge of G and let f be a nonedge adjacent to e, which exists since k > 4. By applying Lemma 15.2.5 to (w*; t), we see that w~ > 1. Thus G is heavy. The proof for the complements of cycles is similar. 9 The heavy graphs cannot be characterized by forbidden configurations, since they are not closed under taking induced subgraphs. In fact, every graph with at least one edge has an induced threshold subgraph with one edge, which is not heavy. The following theorem gives a convenient characterization of heavy graphs in terms of the optimum value of a linear programming problem having fewer variables and constraints than (15.1). T h e o r e m 15.2.7 A graph G [El

>

max

(V, E) is heavy if and only if

E dizi

iEv

s.t.

for each maximal stable set iES

Scv

z>_O,

(15.2)

T h r e s h o l d W e i g h t s and M e a s u r e s

380

where di is the degree of vertex i. Proof. The linear programming dual of (15.1) is max

~-~ ys S

s.t. S

e

Vi C V S ~i

(15.3)

e ~i

y~ _< 1

Ve C E

y>_O. The constraints of (15.3) imply that E s ys _ di

VicV

S ~i

y>_O. In other words, again using linear programming duality, if and only if IE]

_> min E Ys

=

max

s

s.t.

~ ys >_di Vi

E dizi iEv

s.t.

~ z i O.

A direct proof of Theorem 15.2.7 not using LP duality can be found in [MP91]. We apply Theorem 15.2.7 to show that several types of graphs are heavy. L e m m a 15.2.8 Ps (the path on 5 vertices) is heavy.

15.2

Threshold Weights

381

P r o o f . Denote the vertices as 1 , . . . , 5 along the path. Let z be feasible for (15.2). Then Zl + z3 + zs _< 1, z2 + z4 _< 1, and z3 _< 1. After multiplying these inequalities by 1, 2, and 1, respectively, and adding, we obtain Zl + 2 ( z 2 + z3 + z4) + zs _< 4, which is the condition for a heavy graph in Theorem 15.2.7.

T h e o r e m 1 5 . 2 . 9 If G - ( V , E) is a regular graph and x(G) < G is heavy.

IVI/2,

th~

P r o o f . Let r be the degree of every vertex, and let S 1 , . . . , S X be a partition of V into X - x(G) stable sets. If z is feasible for (15.2), then x k=l iESk

iEV

which is the condition of Theorem 15.2.7 for G to be heavy. T h e o r e m 1 5 . 2 . 1 0 Let G = (V,E) have a proper vertex-coloring in which each color has at least two vertices and all the vertices of the same color have the same degree. Then G is heavy. In particular, the complete k-partite graph K ~ .....~nk is heavy if mj >_ 2 for each j, the perfect matching inK2 is heavy if m >_ 2, and so on.

P r o o f . Let '--q'l,-.., Sk be a partition of V into stable sets, and let Dj be the common degree of the vertices of Sj for each j - 1 , . . . , k. If z is feasible for (15.2), then k

iEV

k

Dy j=l

E Dj iCSj

j=l

d /2 -IEI, iEV

which is the condition of Theorem 15.2.7 for G to be heavy. 9 Recall that N ( x ) denotes the open neighborhood of a vertex x, namely the set of all neighbors of x. The next theorem has a construction that preserves heaviness. T h e o r e m 15.2.11 Let G be a heavy graph, let x be a vertex of G, and let H be obtained from G by adding a new vertex y such that N(y) C N ( x ) in H (y is not adjacent to x). Then H is heavy. In particular, if v is a non-isolated vertex of a heavy graph and we add a new vertex of degree 1 adjacent to v, the graph remains heavy.

Thre s hold Weights and Measures

382

P r o o f . Let di denote the degree of vertex i C V ( H ) in H. The degree of vertex i E V(G) in G is then given by

d'i

if i ~ N(y) _ f di, d i - 1, if i c N(y).

In particular, d2 - d~ >_ dy. Let z be a feasible solution of (15.2) for H. Then the following constitutes a feasible solution of (15.2) for G:

, {

zi -

zi, if/=fix z~ + zu, i f i - x .

By applying the constraints of (15.2) to z on H and Theorem 15.2.7 to z / on G, we obtain

E

di Zi --

iEV(H)

iq~N(y)U{x,y}

_ 0, v C V and a threshold r < 1 such that ~ v e S Wv 1 for each edge u v C E . In other words, the minimum of r is smaller than 1 when subject to the maximal stable set and the minimal non-stable set constraints and the non-negativity of w. We therefore regard the minimum of r subject to the above constraints as a certain measure of the non-thresholdness of G. This motivates the following definition.

15.3

387

Threshold Measures

1 5 . 3 . 1 The t h r e s h o l d m e a s u r e v ( G ) of a graph G - (V, E) is the optimal value of the following linear programming problem.

Definition

min

s.t.

r

~ wv _ l for each edge uv C E w>O.

To emphasize the dependence of the optimum w on G, we sometimes denote it by w(G). Thus the optimum value of 15.5 is smaller than 1 if and only if G is a threshold graph. In that case, the minimum can be found by the very efficient and elegant algorithm of Section 1.3. The optimum value is equal to 1 if and only if G is a pseudothreshold graph. We have seen in Section 14.2 an algorithm to recognize these graphs and a characterization of their structure, from which it follows that when the optimum of (15.5) is 1, there is always an optimum solution for which the components of w have values of 0, 1/2 and 1. Figure 15.2 illustrates optimal solutions of (15.5) for a threshold graph and a non-threshold graph. Like (15.1), the linear programming problem (15.5) has exponentially many constraints, since in general there are as many maximal stable sets. This indicates that finding T(G) may be hard. We show below that this is indeed the case. We begin our general discussion of the threshold measure by determining the threshold measure of a disjoint union of two or more graphs. 15.3.2 w(G1 [..J G2) --~ (w(G1),w(G2)). Consequently T(G1 U G2) -- 7"(G1) -k- T(G2). This generalizes immediately to the union of any number of graphs.

Theorem

P r o o f . For k = 1,2, it is convenient to denote by Ak and Bk the edge-vertex incidence matrix and the maximal-stable-set-vertex incidence matrix of Gk. In other words, Ak has a column for each vertex and a row for each edge of Gk, and the corresponding entry is 1 or 0 according as the vertex is or is not an endpoint of the edge. Similarly Bk has a column for each vertex and a row for each maximal stable set, and the corresponding entry is 1 or 0 according

Threshold Weights and Measures

388

Figure 15.2" Threshold measures of (a) a threshold graph and (b) a non-threshold graph. Tz

8 9

5 4

T-1 4

1

1

9

l

9

~ ~1

12 / '

(b) as the vertex belongs or does not belong to the maximal stable set. We also let e k and fk denote all-1 column vectors of appropriate dimensions. Then the linear programming problem (15.5) for Gk can be written as follows, where tt k and u k denote row vectors of dual variables corresponding to the vector constraints of (15.6)" min #k

s.t.

Vk

A k w k >_ ek - - B k w k + rk.f k > 0 Wk>_O.

(15.6)

The linear programming dual of (15.6) is the following" T(Gk) lVk

-

max

13,k e k

s.t.

ttkAk -- v k B k _ O,

(15.7)

~'k >_ O.

To formulate the corresponding linear programming problems for G1 U G2, we use the fact that its maximal stable sets are precisely the unions of maximal

15.3

Threshold Measures

389

stable sets of G1 and G2. If i and j index the maximal stable sets of G1 and G;, respectively, then the maximal independent set constraint of (15.5) for G1 tO G2 reads ( B l W l ) i -~- (B2w2) j ~ T. We can therefore write (15.5) for G1 [-J G2 as follows, where jttl, jM2, Pl, P2 and uij are the dual variables to the corresponding constraints of (15.8), and where ~1 and rl2 are auxiliary column vectors introduced for convenience: r(GIUG2)

--

rain T s.t.

/-t2 Pl P2 Pij

A l W l ~ el A2w2 k e2 - B l W l + T~I -- 0 - B 2 w 2 + r12 - 0

(15.8)

--(?]1)i- (?]2)j -t'- T ~ 0 W 1 ~ 0,

W2 ~ 0.

The linear programming dual of (15.8) is as follows" 7-(G1 [,.J G2) Wl w2 (?]1)i

-

max s.t.

I-I,l e l -~ ~2e2 ~t 1A1 - PlB1 < 0 t t 2 A 2 - p2B2 < 0 ( P l ) i - E j 17ij = 0 (m)j

T

-

aj

= 0

~ i j vii = 1 tt 1 > 0 , tt 2 > 0 ,

aj >_ O.

The theorem asserts that if wl, r~ and w~, r~ are optima of (15.6) for k - 1,2, respectively, then (w~, w~), r~ + r~ defines an optimum of (15.8) when the auxiliaries rl~, rl~ are determined from the equations of (15.8). To show this, note that the constraints of (15.8) are satisfied, and then consider optima t t ; , v ; of (15.7). They satisfy the following complementary slackness conditions: tt*k(Akw*k -- ek) = 0 (15.10) v*k(--Bkw*k + r[~f k ) = O.

(15.11)

We define variables for (15.9) as follows:

It is easy to see that these variables define a feasible solution of (15.9). They also satisfy all the required complementary slackness conditions: (15.10),

390


(15.11), and the condition .;

+

-(,;),-

- 0

By theorem 15.3.2 we may assume without loss of generality that G is connected. We now consider how the threshold measure of a join of two graphs is related to their threshold measures. Unlike the case of the union, we only give estimates, for a reason that will be shown soon. T h e o r e m 1 5 . 3 . 3 Let G1 and G2 be disjoint graphs with largest stable set sizes OL1 and a2, respectively. Then C~lt~2 ~1 -t'- C~2

~ T(G1 (~ G 2 ) ~

~1 max(o~ 1 , c~2 ) 9

P r o o f . The upper bound for T(G10 G2) follows from the fact that assigning to every vertex a weight of g1 and taking T -- 71 max(a1, a2) defines a feasible solution of (15.5) for G1 | G2. To prove the lower bound, let Sk be a stable set of size ak in Gk for k = 1, 2. For every feasible solution w, T of (15.5) for G1 @G2, Sk possesses a vertex vk satisfying Wvk

max

wTb

s.t.

Ab < e

(15.14)

b binary, because A b

max

wTb

s.t.

A b w

b_>O

(15.15)

#>_0.

Since A T >_ O, the inequality in (15.15) is equivalent to the existence of tt >_ 0 satisfying AT# _> w and r - eTpt. Therefore (15.13) is equivalent to the following: T(G)

--

min

7"

s.t.

Aw>e

(15.16)

ATIt >_ w eT# - r

#>o,

w>O.

The constraints involving w in (15.16), namely A w >_ e, ATtt >_ w and w >_ 0, can be replaced by A A T # >_ e, thus eliminating w. Indeed, this condition is clearly necessary, and conversely, if it holds and tt >_ 0, then w defined by w - A T # will satisfy its constraints in (15.16). This proves (15.12) and the moreover statement of the theorem. ..

15.3

Threshold Measures

393

There are several variations of Theorem 15.3.6. By the symmetry of A A T, the linear programming dual of (15.12) is given by "r(G)

-

max s.t.

err A A T v 0.

(15.17)

Further, if G has edges, then w(G) > 0, which permits us to transform the variables/~ and r - e T # of (15.12) according to 1~-#

t T

1 T

and rewrite (15.12) as follows" =

max

t

=

s.t.

AAT)~ >_ te er,~ -- 1 ,~>0

max

min

( A A r,~)i

s.t.

eT~- 1 ,~>0.

(15.18)

Similarly, we can transform the variables v and r - e r u of (15.17) according to p=-- v S - - - -1 T

T

and rewrite (15.17) as follows" 1

=

min s.t.

s A A T p O

=

min

max

o

3

s.t.

"(AATp)j "---e Tp -1 p>_O.

(15.19)

The problem (15.19) is the linear programming dual of (15.18). The formulations (15.18) and (15.19) exhibit ~ 1 as the value of a two-person zero-sum game with payoff matrix A A T. The pure strategies of the players are the edges of G, and the payoff to the first player is the number of common endpoints of the two chosen edges. The optimal mixed strategies are the optimal ,~ and p.

:394

T h r e s h o l d W e i g h t s and M e a s u r e s

A vertex cover (or transversal) is a set of vertices meeting every edge. Using (15.17) and essentially reversing the steps in the proof of Theorem 15.3.6, we can obtain the following theorem, which together with Definition 15.3.1 gives a combinatorial rain-max relation for bipartite graphs:

T h e o r e m 15.3.7 If G -

( V , E ) is a bipartite graph with an edge-vertex incidence matrix A, then r ( G ) is the optimal value of the following linear programming problem:

max s.t.

r

~

zv >_ r

for each minimal vertex cover C c_ V

vec

(15.20)

zu+z~0.

Moreover, if v is an optimum of (15.17), then z of (15.20).

A T v is an o p t i m u m

P r o o f . By arguments similar to those of Theorem 15.3.6, we obtain the following from (15.17)" r(G) - m a x { r " A z < e, A T v < z, e T v -- r , z > O , v > 0}.

The existence of v > 0 satisfying A T v < z and e T v -- r is equivalent to T _ 1 red and b >_ 1 blue vertices. Assign a weight of ir to each red vertex and a weight of-g1

396

Threshold

Weights

and Measures

to each blue vertex. Then G is equitable if and only if the largest total weight of a stable set is 1.

P r o o f . Let i and j index the red and blue vertices of G - (V, E), respectively. We can write the linear programming problem (15.12) as follows, using the vertex variables w = ATtt, which by Theorem 15.3.6 constitute an o p t i m u m of (15.5): r(G)

-

min

~#~j i,j

s.t.

E

#ij -- Wi

J E

#ij -- Wj

(15.21)

i Wi nt- Wj ~ 1

ijcE

#~j >_ 0

ijEE

#~j - 0

ij~E.

If G is equitable, then (15.21) has an o p t i m u m satisfying wi + wj = 1 for all ij C E. Then by the connectivity of G all the wi are equal and all the wj b for all i are equal. Since ~ i wi - ~ i j pij - ~ j wj, it follows that wi - -;-4-g r wj = ~+b for all j, and r(G) - r+b" ~b Since w is a feasible solution of (15.5) every stable set S must satisfy ~ e s Wv < ;~b" This means that with the vertex weights given in the theorem, the total weight of every stable set is 1 or less. Since the stable set of all red vertices has a total weight of 1, the "only if" part of the theorem follows. Conversely, if G satisfies the condition of the theorem, then

1

~

max

s.t.

(

l~xi+

1 )

7" i

-b j

xi nt- xj ~ 1

~xj

ij E E

(15.22)

x binary. Since the constraint matrix of (15.22) is totally unimodular and there are no isolated vertices, the condition "x binary" can be relaxed to x >_ 0. Then

15.3

Threshold

397

Measures

by linear programming duality there exist

)~ij, ij

C E, satisfying

Aij

b

ijEE

1

~j >_ j:ijEE

for all i

r

(~5.23)

1

for all j

i:ijEE

A~j > 0

ij C E.

But every solution of (15.23) must satisfy all its inequalities except for the non-negativity constraints with equality, as can be seen by summing the /-inequalities or the j-inequalities. Therefore the vector A*=

rb A>O r+b -

satisfies AATA * - e, which proves the "if" part. 9 Next we characterize equitable bipartite graphs by a generalization of Hall's condition for the existence of a perfect matching in a bipartite graph. T h e o r e m 15.3.10 Let G be a connected bipartite graph having r >_ 1 red and b >_ 1 blue vertices. Then G is equitable if and only if every set X of red vertices satisfies 1

IN(X)l >_ -~ F ~h~

(15.24)

N ( X ) d ~ o t ~ th~ ~ t of ~ighbo~ of th~ w~tic~ of X .

P r o o f . Denote by R and B the sets of red and blue vertices, and assign weights of !~ and g1 to their vertices, respectively. Condition (15.24) states that the total weight of N ( X ) is at least the total weight of X, for each XcR. To prove the "if" part, let S be any stable set of vertices, and take X - S N R, so that S and N ( X ) are disjoint. Then B includes the set ( S N B ) U N ( X ) , whose total weight is at least the total weight of ( S N B ) U X = S by (15.24). Hence B is a stable set of maximum weight, and so G is equitable by Theorem 15.3.9. To prove the "only if" part, assume that G is equitable, and therefore the weight of every stable set is at most 1 by Theorem 15.3.9. For every set


398

-IXl+w(b-IN(X)l

1 X C R, the set X U ( B - N ( X ) ) i s stable, and hence 1 _> r giving (15.24). 9 From Theorem 15.3.10 and Hall's condition we immediately obtain the following.

Corollary 15.3.11 Let G be a bipartite graph with an equal number of red and black vertices. Then G is equitable if and only if G has a perfect matching. To characterize equitable trees, we need some notation. We continue the convention that i, i0 index red vertices and j, jo index blue vertices. For a tree T and any edge e = ij of T, deleting e from T produces two subtrees. We denote by Tij the subtree containing i and by Tji the subtree containing

j. P r o p o s i t i o n 15.3.12 Let T = (V, E) be a tree and let x be an assignment of vertex-weights, possibly negative, satisfying

E xi - E xj. i

(15.25)

j

Then there exist unique edge-weights Aij, ij E E satisfying ~ij

-

xi

for all i

-

xj

f o r aU j.

(15.26)

j:ijEE i:ijEE

An explicit formula for the ~iojo = E ieTio3o

xi-

,~ij i8 given by E

xj-

JET/0~ 0

E

xj-

jET30i 0

y~ xi.

(15.27)

iET~0,0

P r o o f . The existence and uniqueness of the Aij follow from an algorithm that solves (15.26) "from the leaves up". That is, choose a leaf (a vertex of degree 1), say i ~, and its unique neighbor j'. Set ~i,j, = xi, as dictated by (15.26), delete i' and the edge i'j' and replace x~ by x j , - x~,. The result is a smaller tree that still satisfies (15.25), to which we can apply induction. The basis of the induction is the one-edge tree, where (15.25) is utilized. To prove the explicit formula (15.27), we show below that the Aij defined by (15.27) satisfy

E j:iojEE

Xo.

(15.28)

15.3

399

Threshold Measures

Figure 15.5: Illustration of the proof of Proposition 15.3.12. i0

Tjxio

~/j2io

Tjkio

A similar proof holds for the other equation of (15.26). Let j l , . . . ,jk be all the blue neighbors of i0, as in Figure 15.5. Then

k

k(

E ~,oj- E ~,o~- E r=l

j:ioj6E

r=l

j

i

E x~j6Tj,.i o

Zx )

iETj,., o

xo)

proving (15.28). " We are now ready for a simple combinatorial characterization of equitable trees. T h e o r e m 1 5 . 3 . 1 3 Let T be a tree having r > 1 red and b >_ 1 blue vertices. For each edge e - iojo of T, define _ ~(~) r

r

b(~) b '

(15.29)

Tiojo containing io, respectively. Then T is equitable if and only if ~ >_ 0 for each edge e. In that case )~ is an optimum of the threshold measure problem (15.18).


400

P r o o f . Formula (15.29) is obtained from (15.27) by setting x~ xj __ ~. 1 Therefore by Proposition 15.3.12, )~ satisfies 1

A~j -

-

j:ijEE

all i

r

Aij

1 ~

-

1?- and

(15.30) all j,

i:ijEE

where T - (V, E). Thus if )~ _> 0, then )~ satisfies (15.23), and the argument of the "if" part of Theorem 15.3.9 shows that T is equitable. Conversely, if T is equitable, then as in the "only if" part of Theorem 15.3.9, the equations (15.30) admit a solution )~' _> 0. But by Proposition 15.3.12 these 1 equations have a unique solution given by (15.27), where xi - 1 and xj = -g, namely the ,k defined by (15.29). Therefore ,V - ,k and ,k _> 0 [] We conclude this section with two examples of equitable trees. T h e o r e m 15.3.14 Let T be a tree in which all red vertices have degree 2 (equivalently, T is obtained by subdividing each edge of a tree with a new vertex). Then T is equitable. Proof. To see vertex from j

T satisfies b - r + l . A l s o , each edge e - iojo of T satisfies r(e) - b(e). this, use the bijection mapping each red vertex i of Tiojo to the blue j of Tiojo such that i is the immediate successor of j along the path to i0. Therefore formula (15.29) gives A(e)-r(e)

(1 1) r

r + l

>0"

A complete k-ary tree is a rooted tree in which every internal vertex has exactly k children, and all the leaves have the same distance from the root. Theorem

15.3.15 Every complete k-ary tree is equitable.

P r o o f . Let T - (V, E) be a complete k-dry tree, and assume without loss of generality that its root is blue. Let h be the height of T (the distance from the root to each leaf) 9 The total number of vertices of T is IV] - k h]+~ l l l 9 The numbers r and b of red and blue vertices of T are functions of k and h. Specifically, if h is even, then b - P(k, h) "- 1 + k 2 +

k 2]Ch -

-

Q ( k , h) . -

kh + 2 - 1

k 4 --~-""" + ~h __

IVI - P ( k ,

1

h) ]~2

__1

~

1

15.4

401

Threshold and Majorization Gaps

and if h is odd, then k h+l -- 1 k:-i

b - R(k,h) "- g ( k , h - 1 ) -

k h+l - 1

- s(k, h) . - IVI - R(k, h) - k k~---=-V To compute the A~ of (15.29) for a given edge e, it is convenient to always use that subtree of T - e that contains the root, using either (15.29) or its analog with red and blue switched. Let 1 be the height of this subtree. If h is even, then

_

P(k,l) Q(k,1) V~(k~h) - Q(k, h)'

R(k,~)

S(k,1)

-d-(-i l h ) - P ( k, h ) '

for k even for k odd,

and if h is odd, then

~ -

P(k, 1) Q(k,l) 5 ( ; , ~ - R(k,h)' R(k,~) S(k,t)

for k even = 0,

for k odd.

In all cases, one obtains Ae _> 0.

15.4

Threshold

and

Majorization

Gaps

In this section we introduce two measures of non-thresholdness of a degree sequence: the threshold gap, based on the work of Hammer, Ibaraki and Simeone [HIS78, HISS1], and the majorizaion gap, based on the work of Arikati and Peled lAP94]. It turns out that these two measures coincide, and we investigate their properties and the relations between them. Recall from Section 3.1 that a vector d - ( d l , . . . , d~) with integer components is called a proper sequence if n - 1 >_ dn >_ " " >_ dl > 0. Recall also the definition of the corrected Ferrers diagram C(d) representing a proper sequence d (Definition 3.1.6), and of the corrected conjugate sequence d' and the corrected Durfee number m of d. Also recall (Theorem 3.2.2) that a proper sequence d is threshold if and only if C(d) is symmetric (i.e., d' - d), in which case C(d) is the adjacency matrix of the unique labeled realization of d. We define a distance between proper sequences as follows.

402

T h r e s h o l d Weights and M e a s u r e s

D e f i n i t i o n 15.4.1 The distance between proper sequences d and e of the same length n is defined as n

l i d - ell -

(15.31)

I & - e l. i=1

In other words, the distance is just 1/2 of the Ll-norm of the difference d - e . If ~'~'~in._=ldi and ~i~1 ei are even, as is the case when d and e are graphic sequences, then the distance lid- ~11 is an integer. The reason is that

9

di~ei

di>_ei

i

di<ei

Since 2in___l di and ~i~a ei are even, so are ~d,>~, di--~-~d,<e, di and ~d,~, ei, and therefore ~i~1 I d i - ei] is an even integer. We denote by 7~T,~ the set of proper threshold sequences of length n. The threshold gap of d is defined as follows. D e f i n i t i o n 15.4.2 If d is a proper sequence of length n, its t h r e s h o l d g a p is eE D"I'n

lid-

Thus the threshold gap of d is a measure of the non-thresholdness of d. In Subsection 15.4.1 we determine the threshold gap of a proper sequence and all the threshold sequences that realize it. Recall the definitions of the majorization ~ and strict majorization ~relations between sequences of length n (Definition 3.1.1). Recall also the definition of unit transformation (Definition 3.1.2). If a is obtained from b by a unit transformation, we say that b is obtained from a by a reverse unit transformation. By Condition 9 of Theorem 3.1.7, every degree sequence d is majorized by some threshold sequence, and by Condition 8 of Theorem 3.2.2 the majorization is strict if and only if d is not a threshold sequence. From this and from the Muirhead Lemma (Theorem 3.1.3) it follows that if d is any degree sequence, then some threshold sequence can be obtained from d by a finite number of successive reverse unit transformations. This motivates the following definition of the majorization gap of d as a measure of the non-thresholdness of d.

15.4


403

D e f i n i t i o n 15.4.3 The m a j o r i z a t i o n g a p R ( d ) of a graphical sequence d is the smallest number of reverse unit transformations required to transform d into a threshold sequence. Thus R(d) = 0 if and only if d is already a threshold sequence. In Subsection 15.4.2 we give a formula for R(d) and show that it is equal to the threshold gap of d. We also determine its m a x i m u m for a fixed number of edges or vertices, as well as exhibiting all the sequences that realize this maximum, which can be thought of as the most non-threshold in this sense. We also discuss the related questions of the difference gap and some others.

15.4.1

The Threshold Gap

Before we discuss the threshold gap, we introduce two binary operations on the proper sequences of length n. D e f i n i t i o n 15.4.4 If d and e are proper sequences of length n, their j o i n and m e e t are defined as the proper sequences d V e and d A e given by (dV e)i

-

max(di, ei)

i - 1,...,n

(dA e)i

-

min(di, ei)

i - 1,...,n.

It turns out that DT-~ forms a lattice under the operations of join and meet, having the largest element ( n - 1, n - 1 , . . . , n - 1) and the smallest element (0, 0 , . . . , 0). This follows from the following proposition. Proposition

15.4.5 DT~ is closed under joins and meets.

P r o o f . For integers k _> 1 and h >_ 0, we denote by S(k, h) the set of the smallest h positive integers excluding k, that is to say

S(k,h)-

{1,...,h}, {1, ,k-l,k+l,...,h+l},

ifhk.

If d is a proper threshold sequence whose unique labeled realization on the vertices 1 , . . . , n is G, then the neighborhood of each vertex i in G is Na(i) = S(i, di). Conversely, if a graph G on the vertices 1 , . . . , n has neighborhoods satisfying this condition, then G is a threshold graph with degree sequence d, so d is a threshold sequence.

404

Threshold

Weights and Measures

We prove the closure of 2)7",~ under joins; the results for meets can be proved similarly or deduced from the closure under complements. Let d, e, C 2)Tn, and let G and H be the threshold graphs on the vertices 1 , . . . , n in which vertex i has degree d~ and e~, respectively. Thus N o ( i ) = S(i, d~) and NH(i) = S(i, ei) for each i. Put f = dV e and F = G tO H. Then for each i we have

NF(i) -- S(i, d~) U S(i, e~) - S(i, max(d~, e~)) - S(i, f~). Hence f is a threshold sequence, i.e., f C ~)']"n. 9 We now define an easily-computed quantity t(d) associated with a proper sequence d, and show that it is equal to the threshold gap of d. D e f i n i t i o n 1 5 . 4 . 6 If d is a proper sequence of length n and corrected Durfee

number rn, we put 1

m

t(d) - -~ ~

!

1

d~[ -

Id~ -

7-51

n

E

!

Id - d l.

( 5.a2)

~=m+l

To justify the equality between the two expressions in the definition of t(d), consider the corrected Ferrers diagram C - C(d), and divide it into four submatrices M, A, B, 0 as illustrated in Figure 15.6. Figure 15.6: The corrected Ferrers diagram of a proper sequence. The submatrix M is full of l's except on its main diagonal; the submatrix O is full of O's. 1

m

m+l

n

M

B

A

O

m

m+l

Since each row and column sum of M is m - 1, d~ - di is equal to the difference between the i-th column sum of A and the i-th row sum of B. Since

15.4


405

the l's of A are at the top of the columns and the l's of B are at the left n ~" 1, if Cij r Cji of the rows, we have Id~- di I - ~j=~+~ u~j, where u~y - ~ O, otherwise Consequently m

m

i=1

n

i=1 j = m + l

A similar argument involving the rows of A and the columns of B shows that n

m

E Id:-d i=m+l

n

l-E E

Uij~

i=1 j = m + l

and therefore the equality in Definition 15.32 has been justified, and we also have an interpretation of t(d) as a measure of the deviation of C(d) from symmetry. Furthermore, we have ~i~=~ di - m ( m - 1) + a + b, where a and b are the numbers of l's in the submatrices A and B, respectively. Therefore if ~in=.l di is even, as is the case when d is graphic, a and b have the same parity, and it follows that t(d) is an integer. As a first step in proving that t(d) is the majorization gap of d, we associate with each proper sequence d of length n with corrected Durfee number m the sequences d and d defined by

d i - { d'i' f o r i - 1 , . . . , r n

di,

for i - m

+ 1,..., n

cli- { di, f o r / - 1 , . . . , r n d}, for i - m + l , . . . , n .

Referring to Figure 15.6, we see that C(d) is obtained from C(d) by replacing the submatrix B with A T, whereas C(d) is obtained by replacing A with B r. Therefore d and d are proper sequences and their Ferrers diagrams are symmetric, i.e., d, d C 7PTn. To find the corrected Durfee numbers r'n - re(d) and rh - re(d) of d and d, we notice that for C - C(d) we always have Cm+l,~n = 0 (for if Cm+l,,~ = 1 one would also have Cm,m+l = 1 by dm >_ din+l, and m could be increased by 1). In contrast, x - C~n,m+l can be either 0 or 1. From the definition of d it is clear that d - C(d) satisfies C',~+~,,~ = 0 and therefore rh - m. On the other hand, d' - C(d) s a t i s f i e s C m + l , m = x and Cm+2,m+l = 0, and therefore rh is m or m + 1 according as x is 0 or 1. We have therefore verified the following formula: ~-m

~_{ '

m, re+l,

if d i n - m - 1 if d i n > m - 1 .

(15.33)

Thre s hold Weights and Measures

406

It follows directly from (15.31) and (15.32) that for each proper sequence d, the proper threshold sequences d and d satisfy

lid- rill - l i d - rill -

t(d).

(15.34)

T h e o r e m 15.4.7 E v e r y proper sequence d of length n satisfies t(d) -

min l i d - cll.

cEDTn

P r o o f . Let g: C 77T~ be optimal, i.e., lid- ~11- mincev~rn lid- ell. Let d be the corrected Ferrers diagram of ~: and rh its corrected Durfee number. First we show that rn _< rh _< rh, where rh is the corrected Durfee number of d, given by (15.33). Assume that, if possible, rh < m. Put h = ~:,~+1 < rh, and define the sequence c by { ~i+1, fori-h+l,...,rh ci ~, for i - rh + 1 g:i, otherwise. Since C is symmetric and h < rh, c is a proper threshold sequence, and its symmetric corrected Ferrers diagram is obtained from ~' by enlarging its corrected Durfee,square from size rh to size rh + 1. We have C~+l - h < c~+1 - r h _< r n - 1 _< dm _< drh+l and therefore Ic~+l - d~+l I < 1~=~+1- d~+a I - ( ~ - h).

(15.35)

In addition, we have in any case that (i-

h+ 1,...,~).

(15.36)

By adding the inequalities (15.35) and (15.36), we obtain IIc-dll < II~-dll, contradicting the optimality of ~:. This proves that m < ~n, and a similar argument shows that rh < rh. We have shown that m < rh < rh, and therefore by (15.33) rh is either m or rh. We show below that l[~:- dll - l i d dll if r~ - m, and it can be similarly shown that I1~- dl[ - l i d - dll if ~ - ~ . Consequently, at least one of J and d is an optimal threshold sequence, and the required conclusion

that l i d - ~11-

t(d) will follow from (15.34).

As stated above, we consider the case ~ - m and have to show that I 1 ~ - d l l - lid-dll. Note that ~ ~nd d ~re proper threshold sequences with the

15.4

T h r e s h o l d a n d M a j o r i z a t i o n Gaps

407

same corrected Durfee n u m b e r m. If g: - d, we are done. Otherwise there must be some i - m + 1 , . . . , n such that g:i r a~i, because rows m + 1 , . . . , n of the s y m m e t r i c corrected Ferrers diagram determine it completely. Let k be the largest such i satisfying g:i > di, if any. Put h - g:k and define the sequence c' by

, ~ c.i-1, ci - ~ ci,

fori-k,h otherwise.

The sequence c' is proper, because by the maximality of k we have g:k > dk _> dk+l >_ ck+l, and by the s y m m e t r y of its corrected Ferrers diagram it is a threshold sequence. Clearly its corrected Durfee n u m b e r is again rn. Since c~ - g : k - 1 >_ dk - dk, we have I c ~ - d k [ - Ig:k--dkl-- 1. Hence I I c ' - d l l _< I]g;- dll because in any case I c ~ - dh] _< Ig:h- dhl + 1. It follows that c' is another optimal threshold sequence and we may replace g: with c'. R e p e a t e d application of this procedure eventually allows us to assume that ci _0 0, otherwise.

5~

1, i f d ' p - d p > O 0, otherwise,

Similarly, - 5~

-

-1, ~ ( e ) - 5~(d) -

ifd'~-d~>l O, otherwise.

I

1

l)+,~nd

412

Threshold

Weights and Measures

Figure 15.7: Illustrating a transfer from p to 7r. Solid lines represent l's and possibly . ' s .

T

~T

I

1 ....

I

C a s e 2: 7r 7~ T and p - or. Here, ~ r ( e ) - ~ ( d ) and 8 ; ( e ) - ~ ( d ) Case 1. Also 6 p ( e ) - ( e ' p - ep) + - ( d ' p - d p + 2) +. Hence

8 p ( e ) - 6p(d) -

2, if d'p - dp > 0 1, if d'p - - d p - - - - 1 O, otherwise.

C a s e 3" 7 r - T and p5r a. H e r e S p ( e ) - S p ( d ) )+ - ( d rI - d ~ - 2 ) Case 1. S i n c e 6 r ( e ) - ( G - e/r --2,

6r(e)-6r(d)

-

are as in

-1, O,

and 5o(e)-5~,(d) + we obtain

are as in

if d'~ - dr > 2 if d" - d~ - 1 otherwise.

C a s e 4" 7r - ~- and p - ~r. We now h a v e S r ( e ) - S r ( d ) as in Case 3, and 5 p ( e ) - 6p(d) as in Case 2. To complete the proof, note that 5i(e) - 5i(d) except for i E {~r, p, ~, ~-}. For future reference, note the following from the proof of L e m m a 15.4.12. Necessary and sufficient conditions for 6(e) - 5 ( d ) - 2 in Cases 1, 2, 3 are: 1. if 7r -~ T, p -~ or, then (a) d ' -

d~> 1,

15.4


(b)

r

413

0,

(c) f ~ - d ~ < O , (d) s

d~>_ 1.

2. if 7r 7~ r, p - (r, then (a) d ' -

(b)

d~>_ 1,

l

(c) d ' -

d~_> 1.

3. if 7c - r, p -r a, then

d'-

2,

(b) d ' p - d p < O ,

d'-

O.

L e m m a 1 5 . 4 . 1 3 Let d be a proper sequence. For 1 ( i , j (_ n, if di < j - 1, ! then dj < i. P r o o f . The proof is simple. L e m m a 1 5 . 4 . 1 4 Let d C D~ and let p be the largest index i such that di < d~. Then p < n. Further, 1. A s s u m e d'p (_ p -

!

1 and let q - d;. Then dq - p -

!

1 and dq ( p -

2.

2. A s s u m e dpt >_ p and let q - dpt + l. Then dq - p and dql _ d~, so p < n by definition of p. The fact that p < n justifies our mentioning of column p + 1 (of C) below. In both cases q is the position of the last 1 in I ! column p, so Cqp - 1. The assumption on p implies dp+ 1 _< dp+l _< dp < dp. ' 1 < dp, ' implying that Cq,p_t_ 1 r 1. We prove the two parts as Hence dp+ follows" !

1. See Figure 15.8. If d; _ q and Cq,p+l r "k. It follows that Cq,p_t. 1 -- 0, and therefore (since Cqq - * and Cqp - 1) dq - p - 1. Also dp < d; - q gives Cpq - 0, and again C(q, q) - * implies d; _< p - 2.

T h r e s h o l d Weights and M e a s u r e s

414

Figure 15.8: Illustrating Part 1 of Lemma 15.4.14. q

P

~ * ~ 1 0 0

*

!

2. In this case, apply L e m m a 15.4.13 with i - p and j - 1 - dp. T h e n ! j - dip + 1 - q and dq < p, as required. In our case p < q. See Figure 15.9. We have seen t h a t Cq,;+l r 1, b u t Cqp - 1, so dq _> p. If q > p + 1, t h e n Cq,p+l = 0 and dq - p, as required. If q - p + 1, t h e n Cqp - Cp+l,p = 1 and Cqq - , . On the other h a n d , Cq,q+l m u s t be 0 if ! it exists, for otherwise Cp;;+l would be 1, c o n t r a d i c t i n g dq < p. T h u s again dq - p.

Figure 15.9: Illustrating Part 2 of Lemma 15.4.14.

Lemma 1,...,p-

p

q

,

0

1

*

Let d C Dn and let p be as in L e m m a 15.~.1~. FOr r 1, dr >_ p implies dr < dlr.

15.4.15

-

-

15.4


415

Figure 15.10" Illustrating the proof of Lemma 15.4.15. r

p

t

~ * ~ I 0

0

*

P r o o f . A s s u m e to the c o n t r a r y t h a t d~ > d'~. Let t - d~ + 1 > p (since d~ > r, t is the position of the last 1 in row r. See Figure 15.10). By c o n s t r u c t i o n C~t - 1, and by a s s u m p t i o n Ct~ - O, so d't ~_ r and dt < r - 1. B u t t h e n dlt > dt c o n t r a d i c t i n g the definition of p. 9 Lemma

15.4.16

Let d be a proper sequence. If there exists a k such that

d'k < d'k+l, then 1. d'k - k - l , d ' k +

~-k;

2. k is unique. P r o o f . 1" If d~ < k - 1 or d~ > k, then d~+ 1 < d~ since the l's of C(d) are left-justified. Hence d~ - k - 1. T h e n again by this reason and the a s s u m p t i o n t h a t d~ < d~+l, we obtain d~+ 1 - k. 2" T h e s a m e p r o p e r t y of C(d) implies t h a t the conditions d~ - i - 1 and d~+ 1 - i are possible for at most one i. ,, 1 5 . 4 . 1 7 (1) Let d C Dn have corrected Durfee number m, and let C(e) be obtained from C(d) by deleting the last 1 in row i, where 1 di+l; (b) if i - r n , then dm >_ m. Then e C D , . Further, if d} > d,, then 5(e) > 5(d). (2) Let d G Dn, and let C(e) be obtained from C(d) by adding a 1 to the end of column i, where 1 1, then d~ < d~_l;

Lemma

416


(b) if i = m, then dm >_ m. Then e e D , . Further, if d~ >_ d~, then 5(e) > 5(d). P r o o f . 1: The a s s u m p t i o n s on i g u a r a n t e e t h a t e is a proper sequence. The c o l u m n of the 1 t h a t is removed is j = 1 + di > m. This implies t h a t e E Dn. Further, if d~ > di, then Cji - 1, and thus dj ~_ i. But the a s s u m p t i o n s on i i m p l y d~ - i. Therefore by equation (15.38),

5(e) - 5(d) - (d'i - di + 1) + - (d'i - di) + + (d~ - dj - 1) + - (d} - dj) + - 1 + O. 2: This is similar to Part 1.

..

Let d c Dn with d~ > 0 and m - 2. Then S(d) 0 implies n _> 2. Using d2 < dl _< n - 1, d~ - 1, d2 _> 1, and di - 1, d~ - 2, for i - 3 , . . . , d2 + 1, we obtain n

~(d)

-

~ (d'~ - d~)+ i=1

=

(n-

_< ( n -

1-dl)+

-~-(d2- 1)(2-

1)

1 -d2) + + d2- 1

=

n-l-d2+d2-1

"~

rt ~2.

We need three more l e m m a s to prove T h e o r e m 15.4.10. Lemma

1 5 . 4 . 1 9 Let d C D~. A s s u m e that dn > O, d~ < d'a, and let p be the

largest i such that di < d~. Let q be such that dq > dqt and either (a) q < n or (b) q - n , dn > 2. Then Sk(d) < X k ( d ' ) - 1,

for k -

p,...,q-

1.

Since d C Dn, Sk(d) S~(d')d~ > d~,

( a s s u m p t i o n on p) ( a s s u m p t i o n on q) ( a s s u m p t i o n on p)

I dq >_ dq-+-i

(d~ < dl ~ d'n - 0 , d. > 0)

d. > d ' + l .

d~ > d~,

i-k+l,...,q-1 i-q+l,...,n-1

15.4

417


Adding all these inequalities, we obtain a contradiction, S~(d) > S ~ ( d ' ) + 1. W h e n q - n, the inequality for dq drops, but by assumption the inequality for dn can be strengthened by 1, and the same contradiction is obtained.

1 5 . 4 . 2 0 Let d C Dn, a s s u m e that dl < d'1, dn > O, and let p be as in Lemma 15.~. 19. Then

Lemma

Sk(d) p + 1. From L e m m a 15.4.14, dq - p. S i n c e p < q,

Threshold Weights

418

and

Measures

dp _> dq - p. Thus d l > . . . > dp. Using L e m m a 15.4.15, we obtain di _< d~ for i - 1 , . . . , p - 1. This and the assumption d l < d~ complete the proof. 9 P r o o f of T h e o r e m 15.4.10. From L e m m a 15.4.12 we have R(d) >_ [ ~ 1 , as a reverse unit transformation can decrease 5(d) by at most 2. We show that if 5(d) > 0, we can always construct a proper sequence e such that 1. e is obtained from d by a reverse unit transformation, 2. e is a degree sequence, and 3.

= e(d)-

2.

It then follows that 5(d)is even and R(d) - 6(d) 2 " To show that e is a degree sequence we use the Berge Condition of Theorem 3.1.7, i.e., & ( e ) is even and Sk(e)_< Sk(e')for k = 1 , . . . , n . To show that 5(e) = 5 ( d ) - 2 we use the observation made after the proof of L e m m a 15.4.12. Without loss of generality, we may assume that dn > 0, for if dn = 0, then we work with c - ( d l , . . . ,d~-l). We may also assume that d 1 < d~, for if d 1 - d i ( - 7 " t - 1), then we work with c - ( d 2 - 1 , . . . , d ~ - 1). We distinguish two cases. C a s e 1" There exists an i > 1 such that di < d~. Let p be the largest such i. The basic idea is to transfer the 1 at the end of column p to the end of row 1. Let s = dl + 2 be the destination column for the moving 1. We have two subcases now. ! ! C a s e 1.1" dp _< p - 1 . Then q - dp is the source row for the moving 1. Also dq = p - 1 by L e m m a 15.4.14 and hence p is the source column for the moving 1. Define e = d - uq + Ul. Then for i = 1 , . . . , n , (a) ei = di except for e l - - d l q- 1 and eq - - d q - 1, and (b) e Ii - - d Ii except for epI - d Ip - 1 and Cs,

1.

We first prove that e is a degree sequence. Since 5's(e') = S'n(e), it sumces to prove S'k(e) _< S'k(e'), for k = 1 , . . . , s - 1. This is done as follows: Sk(e)=Sk(d)+l_< Sk(d') = S k ( e ' ) for k = 1 , . . . , q - 1 (by L e m m a 15.4.21), Sk(e) = Sk(d) < Sk(d') = Sk(e') for k = q , . . . , p - 1,

15.4


Sk(e)=Sk(d) p. From L e m m a s 15.4.20 and 15.4.21 we have Sk(d) < Sk(d'),

k-

1,..., n-

1.

(15.39)

Let q - dp! + 1 be the source row for the moving 1, and define e as in Case 1.1. We first show that Sk(e) _< Sk(e') for k = 1 , . . . , s - 1. It follows from 1 as before, and I / ! dq - dq ~ - 2 since dq - p ~ 2 and dq - d s - O. C a s e 2: i - 1 is the only index with the property di < d}. Then dn - 1, for dn >_ 2 implies d~ - n - 1 - d~ > dl >_ d2, contradicting the assumption of the case. We assert that d~ >_ dl -t- 2. Indeed, we show that d~ - dl ~- 1 implies that S~(d) is odd, contradicting the assumption that d is a degree sequence. We have d~ - d 1 + 1 , d~ 0}, A' - {i " 1 2

_< d'l -d~ + ~; d'~

(~n~ dl _> d~ ~na d~, d~ _> 0).

i>2

If dl > 2, then 5(d) < Ei>l d ~ - 2 - 2q - 2. If dl - 1, then d - 12q0r, and so 5(d) - 2 q - 2. Conversely, let d satisfy ~ > l ( d } - d ~ ) + - 2 q - 2 . Assume that, if possible, dl>

1. T h e n

(d i - d l ) + 2 (d~-d~)+

dl-dl _< dl _< d~,

--

i>_2.

Adding these inequalities, we obtain 2q - 2q, hence all the above inequalities hold as equalities. Therefore for i >_ 2, if d} > 0, then di - O. But this fails for i - 2, as d~ >_ dl - 2. [] Let us now consider the majorization gap of the degree sequences of a fixed length n. We make use of the following functions:

f(n)

f(n)-l, -

L~J

[~],

g(n) -

f(n),

ifn-3mod4 otherwise.

We reproduce the definition of Dn for convenience. For positive integers n, Dn - { d -

( d l , . . . , dn)" d is proper, Sk(d) < Sk(d') for k - 1 , . . . , n}.

proper degree sequence of length 5(d) _ 5, equality holds if and only if

Theorem

15.4.25

Let d be a

for n ~ 3 mod 4

d-

F~I

~

o~

d-l~J

n.

Then

~

n-3 2 '

for n - 3 mod 4 or

To prove Theorem 15.4.25, we find it convenient to prove the following theorem:

15.4


Theorem

15.4.26

423

For d C D~, 5(d) 5, equality

holds if and only if d -

[~-~1 n or d -

We need several results to prove these theorems. about the functions f ( n ) and g(n).

1. f ( n -

1) < f ( n ) for n > 1, f ( n -

2. f ( n ) > n 3. g ( n - 1 )

First, some simple facts

1) < f ( n ) for n > 3;

2 for n _> 2, with strict inequality for n > 5;

< g(n) for n > 4, g(n) > n + l f o r n > 7 .

Next, three lemmas about corrected Ferrers diagrams. 1 5 . 4 . 2 7 Let 0 r d C D n . Put s - dl + 1, p - d~s, q - d~p + 1, r - ds. Thus s > _ r e > p > 1 and q > m > r > 1. Assume that the following hold (See Figure 15.11): Lemma

1. s > m + 1, and if equality holds, then p < m; 2. q < s ; 3. r 5(d)+2;

2. there exists a sequence f c D~ such that (a) (i) f l

-

dl -

1 or (ii) f l

-

dl

and f'~ - 1;

(b) 5 ( f ) > 5(d) + 1; (c) S n ( f ) and Sn(d) have the same parity. P r o o f . We begin by proving s t a t e m e n t 1. First note that p _< m _< q and r < p. Secondly, we m a y assume that the first r columns of C(d) are full, i.e., d~i - n - 1, i - 1,...,r, (15.41) for otherwise we can fill column 1, then 2, and so on up to r (by adding l's at the end of these columns) without going out of D~, and increase 5(d) at each step, by L e m m a 15.4.17.

424


Figure 15.11: Illustrating L e m m a 15.4.27. r

p

m

q

s

*

0

m

0

Counting in two ways the number of l's in rows 1 , . . . , s and columns r + 1 , . . . , p of C(d), we obtain p

~

d:-(p-r)(q-1)+ i=r+l

(15.42)

(di-r).

i=q+l

Since d e Dn, ~p(d) ~ ~p(d'). This implies, by (15.41) and (15.42), p(s - 1 )

p

dr+ 2 >_ 9.. > dp, i < p _< q, a contradiction. Also r < i < p, then q - 1 _< d~ < d~+1 -

15.4

425


di-s-1

I a d'~,

Hence

i - r + 1,...,p.

(15.44)

I I l l Again by Lemma 15.4.16, dlq+l ~ dq+ 2 ~ . . . ~ ds, f o r if d i < di+ 1 f o r some q q >_ m, a contradiction. Therefore for i - q + 1 , . . . , s , di _ d'~ - p . Hence

d'~ > d,,

i-

q + 1,...,s.

(15.45)

Using (15.41), (15.44) and (15.45), we have q

5(d) - r(n - s) + ~

(d'i - di) + + ~

i=p+l

(d'i - di).

(15.46)

i=q+l

Define a sequence e such that C(e) is obtained from C(d) by making the first p columns full and deleting the s-th column, i.e., , ei-

f n-l, / 0, d~

fori-l,...,p fori-s otherwise

It is easy to check (using s > m + 1) that e C Dn by Lemma 15.4.17. Further, s--1

~(~) -

~(~,,-

~)+

i=1 q

= p(n-s+l)+

s-1

y~ (d'i - d i ) + +

y~ (d'i - p )

i=p+l

=

i=q+l q

~(~ - ~) + ( p - ~)(~ - ~) + p + ~

(d'~ - d~) + +

i=p+l

(d'~ - p)

(~s d' - p)

i=q+l q

=

r(n-s)+(p-r)(n-s)+p+

Z

(d',- d,) + +

i----p+l

(d'i - di) i--q+l

=

~

( p - di)

i-q+l

~(d) + ( p - ~)(~ - ~) + p -

~ i=q+l

( p - d~)

(by (15.46))


426

=

5(d)+(p-r)(n-s)+p-

)_2 ( P - r ) + i=q+l

=

~(d) + (p - ~)(~ - ~) + p -

~_~ ( d i - r ) i=q+l

(~ - q ) ( p - ~) + ~

(d~-

~)

i=q+l

> 6(d)+ ( p - r)(n - s ) - r(n - 1 - s) = ~(d) + ( p - ~)(~ - ~ ) - r(~ - ~ ) + >

~ ( d ) + ~.

( ~ ~ < p-

(by (15.43))

~)

So 5(e) >_ 5(d)+ 2. This proves Statement 1. We now prove Statement 2. If S'~(e) has the same parity as S~(d) (before the achievement of (15.41)), then f = e has the required properties. If the parities differ, take f = e + u l . In this case C ( f ) can be obtained from C(d) by making the first p columns full and deleting all the l's except the first 1 in the s-th column. So f C Dn by Lemma 15.4.17. The other required properties of e follow from S~(I) = S~(e)+ 1, 5~(f) = 5 1 ( e ) - 1, and 5~(f) = 5~(e) for i = 2,...,n. ,, L e m m a 15.4.28 relaxes one of the assumptions of L e m m a 15.4.27. Lemma

15.4.28

Under the conditions of Lemma 15.4.27 except r 3, then there exists a sequence and 5(e) > 5(d) + 1;

e C Dn such that el - d l - 1

2. if m > 4, then there exists a sequence f c Dn such that (a) (i) f l = d l -

1 or (ii) f l - - d ,

and f~ = 1;

(b) 5 ( f ) > 5(d)-4- 1; (c) S ~ ( f ) and Sn(d) have the same parity. P r o o f . We begin by proving Statement 1. First, observe that m < n - 1, because m = n - 1 and p = m would imply Sin(d) > Sm(d'), contradicting d E Dn. Secondly, assume without loss of generality that (15.41) holds as in

428


t h e p r o o f of L e m m a 15.4.27. Define a s e q u e n c e e such t h a t C ( e ) is o b t a i n e d f r o m C(d) by m a k i n g t h e first m - 1 c o l u m n s full, if necessary, a n d d e l e t i n g t h e c o l u m n s - rn + 1, i.e.,

ei! m

rtml 07

d~, Then e E

Dn

~ for i - 1 , . . . , r n for i - m + 1 otherwise.

1

by L e m m a 15.4.17. Using t h e facts d} - n -

di

m, d~ - m - 1 for i - r + 1 , . . . , definition of m a n d r), we o b t a i n

5(d) - r ( n -

1 - m)+ m-

m,

d i 'n +

r < (m-

1 -

m,

1 for i - 1 , . . . , r, < 1 (by _ m -

dm+l - r

1)(n - 1 - m ) +

m - r.

Also, 5(e) - ( r n -

1)(n - 1 - ( r n

- 1)) - (rn - 1)(n - 1 - r n ) + m - 1.

It is now clear t h a t if r > 1, t h e n 5(e) > 5(d). If r = 1, t h e s a m e c o n c l u s i o n holds, since m - 1 > 1 a n d n - 1 - m > 0. T h i s proves S t a t e m e n t 1. W e now prove S t a t e m e n t 2. If Sn(d) before t h e a c h i e v e m e n t of (15.41) a n d S~(e) h a v e t h e s a m e parity, t h e n f = e has t h e r e q u i r e d p r o p e r t i e s . O t h e r w i s e t a k e f = e + Um+l. O n c e again, C ( f ) can be o b t a i n e d f r o m C(d) by m a k i n g t h e first m - 1 c o l u m n s full a n d d e l e t i n g all t h e l ' s e x c e p t t h e first 1 in c o l u m n m + 1. T h e r e f o r e f C Dn by L e m m a 15.4.17. F u r t h e r , (~(f) = ( ~ ( e ) - 1, as (~l(f) = ( ~ l ( e ) - 1, a n d 5~(f) = 5~(e) for i = 2 , . . . , r n . Hence 5 ( f ) = (rn - 1)(n - 1 - rn) + m - 2. N o w it is clear t h a t if r > 2, t h e n 5 ( f ) > 5(d). If r - 2, t h e s a m e c o n c l u s i o n holds since m - 1 > r ( b e c a u s e m >_ 4) a n d n - 1 - m > 0. Finally, if r - 1, t h e n t h e conclusion holds again, since

5(f)

-(m-2)(n-l-rn)+n-l-m+m-2 >_

(m-2)(n-l-rn)+m-1

>

n-l-rn+m-1

(asn-2>rn)

(asm>_4andn-l-m>O)

=

W e are now r e a d y to prove T h e o r e m s 15.4.25 a n d 15.4.26.

15.4


429

P r o o f o f T h e o r e m 1 5 . 4 . 2 6 . T h e s t a t e m e n t is true for n - 1, so a s s u m e n > 2. If d n - 0, t h e n by induction on n, 8(d) < f ( n 1) < f ( n ) , and f ( n - 1) < f ( n ) for n > 3. We m a y therefore assume t h a t d~ > 0, and c o n s e q u e n t l y m > 2. If m 2, then 5(d) < n - 2 by L e m m a 15.4.18, and hence 8(d) _< f(n). E q u a l i t y holds if and only if d~ - d2 and n - 2 - f ( n ) , which implies n < 4. Hence we m a y assume m _> 3. P u t s - dl + 1, p - d~s, q - dp~ + 1, r - ds, and observe as before t h a t p _< m < q, and m < s Proposition. If s > m + 1, then there exists a sequence e C Dn such that el

-

-

dl

-

1 and 5(e) > 5(d). Consequently we may assume that s - m.

To prove the proposition, first assume t h a t q > s. Define a sequence e such t h a t C(e) is o b t a i n e d from C(d) by m a k i n g the first p columns full, and deleting the s-th column. T h e n e C D~ and 5(e) > 5(d) by L e m m a 15.4.17. Now a s s u m e q < s. T h e n r d~ < dq+l m + l or p < m, t h e n the required sequence e exists by L e m m a 15.4.28. If s - m + 1 and p - m, t h e n the required sequence e exists by L e m m a 15.4.29. This proves the proposition, and we m a y assume t h a t s - m. F u r t h e r , we m a y a s s u m e t h a t the first m - 1 columns of d are full, for otherwise we m a k e t h e m full w i t h o u t leaving D~, t h e r e b y increasing 8(d) by L e m m a 15.4.17. Then d - ( m - 1) n ,

5(d)- (m-

1)(n-

1 -(m-

1)),

and 5(d) reaches a m a x i m u m when n--1

m-

2 ' n ~ 7-1,7,

1 -

nodd neven.

(15.47)

Therefore 5(d)_
7, and ~i(d) - g(n). If d n - - 0 , then ~(d) 0, and therefore m > 2. Also m - 2 implies, by L e m m a 15.4.18, that ~i(d) _< n - 2 < g(n), again contradicting our assumption, so we assume that m > 3 . P u t s - d l + l . Proposition. If s > m + 2, then there exists a sequence f E En such that

(~(f) > ~(d), contradicting ~(d) - g(n). Consequently we may assume that s-m ors-m+ l. We prove the proposition by showing that, when s > m + 2, 1. if d's - 1, then there exists an f E En such that 5(f) > 5(d); 2. if d'~ >_ 2, then there exists an f E E~ such that 6(f) >_ 5(d), and if equality holds, then fl - d~ (so that f and d have the same s) and f;-1. C a s e 1" d's - 1. The basic idea is to work with the sequence ( d l 1, d 2 , . . . , d~) and introduce a 1 at the end of the first row if the parity becomes odd. Put t - s - l , p - d ~ t , q d tp + l . T h e n p _ < m_< q. Assume that q >_ t. Define a sequence f such that C ( f ) is obtained from C(d) by deleting the last 1 in column s - 1 and the 1 in column s. Then f C E~ and 5(f) > 5(d) by L e m m a 15.4.17. Therefore we may assume q < t. Put r = dr. See Figure 15.13. C a s e 1.1: t > m + 1 or p < m. Define a sequence c such that C(c) is obtained from C(d) by deleting the last 1 in the first row. Note that c E D~ by L e m m a 15.4.17 and, further, c satisfies all the hypotheses of L e m m a 15.4.28. By the latter, there exists a sequence g E Dn such that 5(g) >_ 5(c) + 1. The required sequence f is defined by f

_ J" g, (g~ + 1 , 9 2 , . . . , g ~ ) ,

\

Sn(g) even S~(g) odd.

15.4


431

Figure 15.13" Illustrating Case 1 of the Proposition in the proof of Theorem 15.4.25. r

m

p

m

ts

.jrm

0

To see this, recall that in the proof of L e m m a 15.4.28, C(g) is obtained from C(c) by making the first p columns full and deleting the t-th column (if r _ p - r). Note that when S~(g) is odd, C(f) could be obtained from C(c) by making the appropriate columns full and then deleting all but the first 1 in column t. Therefore f C Dn in this case by L e m m a 15.4.17. Clearly f E Dn also holds when S~(g) is even. Further, Sn(f) is even in both cases. Thus f C E~. From the construction of c and f it is clear that 5(c) - (5(d)+ 1 and (5(f) _> ( 5 ( g ) - 1. Therefore 5(f) >_ 5 ( g ) - 1 _> (5(c) - 5 ( d ) + 1. 1.2" t - m + 1 and p - m. This is similar to Case 1.1, except that here we use L e m m a 15.4.29 instead of L e m m a 15.4.28. Case

C a s e 2" 2 _< d', _< rn. Put p - d'~ and q - d'p + 1. If q _> s, then we may remove the last two l's in the s-th column without leaving En and thereby increase (5(d) by L e m m a 15.4.17. We therefore assume that q < s. Then the required f exists by L e m m a 15.4.28.

This completes the proof of the proposition and hence we may assume that s - m or s - m + 1.

Threshold Weights

432

and

Measures

It is convenient here to define a new function. Let a _> 6 be a fixed integer such t h a t a - 2 m o d 4. For 0 _< k _< a, define k(a

h~(k) -

k(a-

-- k),

k)-

k

l,

even

k odd,

i.e.,

h~(k)- 2 L"~-"~ 2 J. Note t h a t the m a x i m u m of ha(k) occurs at k -

~2

1, 7, 7 + 1~, ~

and the

m a x i m u m is ~2-4 4 " Proposition. if and only if

If s - m or s - m + 1, then 5(d) _< h n - l ( m -

d

-

(m-l)

d

-

(m -

~

1), with equality

formodd

1) n - 1 ( m -

2)

or

d - m(m

-

1) n - 1

f o r m even.

To prove the proposition, consider first the case t h a t s - m. If columns 1 , . . . , m - 1 are m a d e full in this order, then d stays in Dn and 5(d) increases at each step by L e m m a 15.4.17. For odd m, the resulting d - ( m - 1) n also belongs to E~, and so it is the only sequence of E~ satisfying s - m t h a t m a x i m i z e s 5(d). T h e m a x i m u m in this case equals ( m - 1 ) ( n - 1 - ( m 1)) hn-1 (m1 ) . For even m, the above d does not belong to E~, and so the only sequence of E~ satisfying s - m t h a t maximizes 5(d) is the previous sequence in the filling-up process, n a m e l y d - ( m - 1 ) ~ - l ( m - 2). T h e m a x i m u m in this case equals ( m - 1)(n - 1 - ( m 1 ) ) - 1 - h~-i (m - 1). Now consider the case t h a t s - m + 1. We assert t h a t if d'~ _> 2, t h e n there exists f C E~ such t h a t 5 ( f ) > 5(d), and consequently we m a y a s s u m e t h a t d'~- 1. To prove the assertion we distinguish two cases. ! C a s e 1" 2 _< d~ < m. P u t p - d~, r - d~, q - dp + 1. We m a y a s s u m e t h a t q < s, for otherwise the last two l's in the s-th column of d m a y be r e m o v e d w i t h o u t leaving E~, t h e r e b y increasing 5(d) by L e m m a 15.4.17. T h u s q - m. If r _ p - r. Using dli > rn and di - rn for i -- 1 , . . . , r, d~ - m - 1 and di - rn ' for i - r + 1 , . . . , p, d~ - di - r n - 1 for i - p + 1 , . . . , m , din+ 1 - - p, d m + l - r and d} - 0 for i - rn + 2 , . . . , n , we obtain 5(d) - ~ ( d ' i - di) + p i=1

r.

(15.48)

15.4

Threshold

and Majorization

Gaps

433

Let f be a sequence such that C(f) is obtained from C(d) by (a) deleting the s-th column and (b) adding a 1 at the end of the (r + 1)-st column if p is odd. Then f C En by L e m m a 15.4.17 and 7-

5(f)

>

~-~(d'~- ( d ~ - 1)) i=1 r

=

r

i=1

>

5(d).

(by (15.48) a n d r > p - r )

C a s e 2" d~, - m. If m > 4, then the required f exists by L e m m a 15.4.29. Now assume the special case m - 3. Then d - 32d'2-21n-d'2-1 with d~ > 2 (see Figure 15.14).

Figure 15.14: Illustrating a special case in the proof of Theorem 15.4.25. m

* 1

1 1

1 1 * 1 1 0

1

1

m s

:

d;-t-1

*

:

1 1 1

n

8

1 1

0

1

If d~ - 2, then Sn(d) - n + 6 is odd, since n - 3 mod 4, contradicting d C En. Therefore d~ >_ 3. Hence 5(d) - ( n - 4 ) + + ( d ; - 3 ) + + ( 2 3) + + ( 3 - 2 ) + - n+d~-6 _< 2 n - 7 , and writing n - 4 k + 3 , we have 5(d) '
O. P r o o f . Assume that (f; t l , . . . , t i n ) iS a constant threshold representation for T ~. Since there is only a finite number of strictly positive values f ( x ) - f ( y ) tj, we can find an e > 0 that is a lower bound for all these values. We then have

xPjy ---.. f(x) >_ f(y) + tj + and also

xPjy - .

f(x) >_ f(y) - tj

Threshold Graphs and Order Relations

462

It follows that f and the following weights for G(T') satisfy Condition 1 of Lemma 16.5.4: w(x~ y) -- I tj--tj,-4- e, ifif xPjy xP~y. Therefore every cycle of G has a non-positive weight. Thus m

m

E pj(tj + ~)+ E p}(-tj) < O, j-1

j=l

which implies m

E qjtj > O. j=l

Conversely, since there is only a finite number of simple cycles from 7', we can find an e > 0 such that each cycle satisfies rn

m

E qjtj - e E p j >_ O. j=l

j=l

Using the same weights on G and taking the argument in reverse order, one derives the existence of a mapping f such that (f; t ~ , . . . , tin) is a constant threshold representation for 7'. 9

binary relation P is irreflexive and for all a, b, c, d C X,

Corollary 16.5.6 A

a

semiorder if and only if P is

(aPb A cPd) ~

(aPd V cPb),

(aPb A bPc) ~

(aPd V dPc).

Figure 16.~ shows the configurations forbidden by these conditions. P r o o f . The "Only if" part follows from the definition. To prove the "if" part, we assume that P does not have the configurations of Figure 16.4, and show that the 1-tuple (1) is a constant threshold vector for P. By Lemma 16.5.5, what we need to show is that q > 0 for every simple cycle from P. We show this by induction on the length of the cycle. The statement is true for cycles of length 1, 2 or 3 since the configurations G1, G2 and G3 of Figure 16.4 are forbidden. Let C be a simple cycle of length >_ 4 with q _< 0, if that were possible. If all arcs of C are in P, then we must have G3 or G2, a

16.5

Multiple Semiorders

463

Figure 16.4" The forbidden configurations for a semiorder.

G1

G2

G3

G4

G5

contradiction. Hence C has arcs in P'. Assume that C has two consecutive arcs in P. Then it has consecutive vertices a, b,c, d with aPbPcP'd, and hence aPd holds since Gs is forbidden. We can use the arc (a, d) to shortcut C and obtain a shorter cycle with the same q _< 0, contradicting the induction hypothesis. The only remaining possibility is that C has no consecutive arcs in P, and since q _< 0, its arcs actually alternate between P and P' and q = 0. Consider consecutive vertices a, b, c, d with aPbP'cPd. Then aPd holds since G4 is forbidden, and we can use the arc (a, d) to shortcut C and obtain a shorter cycle with q = 0, again contradicting the induction hypothesis. 9 The following theorem characterizes multiple semiorders. T h e o r e m 16.5.7 Let T' - ( P 1 , . . . , P r o ) be an m-tuple of binary relations on X , with m > 1. There is a constant threshold representation for ~P ( P 1 , . . . , Pro) if and only if no m-cyclone from 7:) is balanced. P r o o f . " O n l y if"" Assume that P admits a constant threshold representation ( f ; t l , . . . , t m ) . Then

xpjy ~

f ( x ) - f(y) > tj,

xpjy ~

f ( x ) - f(y) >_ -tj.


464

If we consider any k-cyclone, write these implications for all its arcs and sum the resulting inequalities, we obtain m

m

m

o > E pJJ + E p}(-tJ)- E(pJ- p})tj. j=l

j= l

j=l

Moreover, the inequality is strict since at least one arc from some Pj is used by the cyclone. This clearly implies that pj r p} for at least one j. Hence the cyclone is not balanced. "If"" Assume that no cyclone from P is balanced. Observe that each Pj is either reflexive or irreflexive (otherwise we construct a balanced 2-cyclone by taking one loop in Pj and another one in P~). Consider first the case m - 1. If P1 is irreflexive, we show that the 1-tuple (1) is a constant threshold vector for P1. By Lemma 16.5.5, it is enough to show that q > 0 for each cycle. If C were a cycle with q _< 0, we could add Iq[ loops from P~ to obtain a balanced 1-cyclone, a contradiction. Similarly, If P1 is reflexive, then ( - 1 ) is a constant threshold vector for P1. Now assume that m >_ 2. Since there is only a finite number of values f ( x ) - f(y), we may always look for a nonzero threshold tl, and by an appropriate change of scale, even assume that tl - +1. We treat the case that P1 is irreflexive, setting tl - 1 (the other case is similar). By Lemma 16.5.5, we have to show the existence of a common solution ( t 2 , . . . ,try) to all the inequalities

q2t2 + ' " + qmtm > --ql associated with simple cycles from P. By Helly's Theorem (see [Chv83, p. 266]) applied to the convex sets defined by these inequalities in the Euclidean space of all ( m - 1)-tuples (t2, t 3 , . . . , tin), it is sufficient to show that every m of these inequalities have a common solution, say

qi2t2 + " " + qimtm > -qil

(16.9)

with i - 1 , . . . , m. By the Farkas Lemma, this is equivalent to the following assertion" given real numbers 11,..., lm with li >_ 0 for each i, and li > 0 for at least one i, m

-

i=1

0

j -

2,...,

(16.10)

16.5

Multiple Semiorders

465

implies m

liqil

> 0.

(16.11)

i=1 !

Since qij -- Pij - - P i j is an integer, we only have to verify the assertion for rational numbers l~ (because the real tuples ( / 1 , . . . , lm) satisfying l~ >_ 0 and (16.10) form a polyhedron with rational extreme points and directions). It follows that we need only consider li's that are natural numbers. Now each of the equations in (16.9), for fixed i, comes from a cycle Ci from 7). Assuming that li is a natural number, we consider the cycle C[ obtained by traversing the cycle Ci li times, and then the union U of these C~. Then U is an m-cyclone that uses ~i~__1lip}j arcs from P~ and ~i~=1 lipij arcs from Pj. Equation (16.10) tells us that for j - 2 , . . . , m , these two quantities are equal. Since by assumption U is not balanced, we deduce m

liqil 7~ O. i=1

In order to establish (16.11), and thus complete the proof, it remains to show that (16.10) together with m

liqil < 0

(16.12)

i=1

lead to a contradiction. If (16.12) were true, we would of course have one of the qil strictly negative (because all li are nonnegative), and thus the mcyclone U would use at least one arc (x,y) from P1. Now we form a new m-cyclone U' by adding m

I ~ liqil I i=1

times the arc (x, x) to U. Since P~ is irreflexive, (x, x ) i s in P~, so the new arcs increase the left-hand side of (16.12) to zero. Thus U' is a balanced m-cyclone, a contradiction. 9 Let us make the following remark. The proof above used only a restricted version of Helly's Theorem, namely the convex sets were open half spaces in I~ m-1 (a similar result for closed half spaces can be found in [Chv83, p. 146]). This restricted version follows easily from the Farkas L e m m a as follows. Assume that A x > b is unsolvable, where A has m - 1 columns. Then Am > tb, t > 0 is also unsolvable. Hence by the Farkas L e m m a there exist y > O, Yo > O, (y, yo) r (0, 0), satisfying y T A -- O , - - y r b + Yo -- O, and,

466

moreover, we m a y find such (y, eliminating yo, we are left with c o m p o n e n t s and satisfying y r A most m inequalities among Ax


Y0) with at most m positive components. By a non-zero y >_ 0 having at most rn positive -- 0, yTb >_ O. Therefore a subsystem of at > b is unsolvable.

Chapter 17 Enumeration 17.1

Introduction

In this chapter we look at enumeration results of some families of graphs. For convenience we assume that all our graphs have the vertex set N = {0,...,n1}. Recall that two graphs G = (N,E) and H = (N,F) are the same unlabeled graph when they are isomorphic, i.e., when there is a permutation 7r of N such that ij C E if and only if ~r(i)Tr(j) e F. We say that G and H are the same labeled graph when E = F. Many enumeration results are expressed in terms of ordinary and exponential generating functions. The ordinary generating function (ogf) of a sequence a0, al, a 2 , . . . is ~ > 0 a~ x~ and its exponential generating function (egf) is ~ > 0 anxn/n!" The elementary properties of these functions are discussed in most introductory combinatorics books. In this chapter we have occasion to use the Burnside Lemma and other basic enumeration results, which can be found in the same books. We have already seen in Section 13.4 and 13.5 that decomposition techniques can be used to find the egf of the number of unlabeled box-threshold, matroidal and matrogenic graphs. See also Section 12.4 for the egf of the number of unlabeled domishold and hereditary pseudodomishold graphs. In Section 17.2 we discuss enumeration of threshold graphs, both unlabeled and labeled. It turns out that the enumeration of unlabeled threshold graphs can be used to derive the q-binomial theorem, and the enumeration of labeled threshold graphs can be used to derive a theorem of Frobenius connecting the Eulerian polynomial with Stirling numbers of the second kind. In 467

468

Enumeration

Section 17.3 we enumerate difference graphs, both unlabeled and labeled. Recall that difference graphs are bipartite. We refer to a bipartite graph G = (N, E) together with a fixed partition of N into two stable sets as bipartitioned. Section 17.3 discusses labeled and unlabeled difference graphs, either bipartitioned or not.

17.2

E n u m e r a t i o n of Threshold Graphs

We begin with the enumeration of unlabeled threshold graphs, based on Peled [Pel80]. Denote by r,~mk the number of unlabeled threshold graphs having n vertices, m edges, and m a x i m u m clique size k, and consider the generating function

Tn(x, q) - ~ 7"nmkqmxk. m,k We evaluate theorem. Lemma

T~(x, q) in two ways, and compare them to obtain the q-binomial

17.2.1

Tn(x, q) = x(1 + qx)(1 + q2x)... (1

+

qn-lx).

(17.1)

P r o o f . For each binary vector u = ( u 0 , . . . , Itn-1) ~ {0, 1}" with u0 = 1, we construct a graph G(u) - ( N , E ) on the vertex set N - { 0 , . . . , n 1} where E is such that for all i < j we have ij E E if and only if uj = 1. By Condition 4 of Theorem 1.2.4, G(u) is a threshold graph for each u, and every threshold graph can be obtained in this way. Moreover, G(u) is isomorphic to G(v) if and only if u - v. Indeed, the "if" part is obvious. Conversely, assume that u r v yet G(u) is isomorphic to G(v), and let j be the largest subscript such that uj --fi vj. Then G(u') is isomorphic to G(v'), where u ' = ( u 0 , . . . , uj) and v ' = ( v 0 , . . . , vj), since removing an isolated or a dominating vertex does not affect isomorphism. But this is impossible, since one of G(u') and G(v') has an isolated vertex and the other one does not. From this we already see that the number of unlabeled threshold graphs on n vertices is 2 "-1. The number of edges of G(u) is ~2j=0 n-1 jUj. Further, the set K - {j 9 uj 1} is a m a x i m u m clique of G(u). Indeed, K is clearly a clique containing 0, and consider any other clique K ~ % K. Then K ~ contains some element i > 0

17.2

E n u m e r a t i o n of T h r e s h o l d Graphs

469

with ui = 0, and so i is adjacent only to elements j C K such that i < j, i.e., to fewer than IKI elements. Hence IK'I _< IKI. We conclude that the unlabeled graphs enumerated by Tnmk are in oneto-one correspondence with the binary sequences u such that n-1

u0-1,

n-1

m-Eju

j,

k-Eu

j =o

j,

j =o

and (17.1) follows. 9 We remark that by putting q = i in (17.1) and extracting the coefficient of x k, we find that the number of unlabeled threshold graphs having n vertices n--1 and maximum clique size k is ~m r~,~k - (k-l)" This result can also be obtained by counting symmetric corrected Ferrers diagrams having n rows and corrected Durfee number k (Definition 3.1.6). The diagram is determined by the sequence of its first k row sums, which are non-increasing, do not exceed n - 1, and the last of them is equal to k - 1. The number of such [ n-1 \ sequences is [k-l), as can be seen by standard techniques. The next lemma makes use of the Gaussian polynomials [n.]_

[3J

(q)~

,

where(q)n-(1-q)(1-q2).-.(1-qn).

(q)j(q)n-j

L e m m a 17.2.2 T~(x,q) -

.

J

1

"

P r o o f . If G is a split graph having a split partition (K, S) with K a clique and S stable, and if K is a maximal clique, then K is in fact a maximum clique, since no vertex of S can be adjacent to all the vertices of K. If in addition G is a threshold graph, then by Condition 3 of Theorem 1.2.4, G is determined up to isomorphism by n, IK[, and the degrees of the vertices of S. Hence the unlabeled threshold graphs enumerated by rnmk are determined by the partitions of the integer m - (k2) (the number of edges outside IKI)into at most n - k positive parts (the positive degrees in S) not larger than k - 1. It is well-known [And76, p. 33] that the ogf for the number of partitions of integers into at most A positive integers not larger than B is [A+B]. Therefore rn~nk is the coefficient of qm-(~) in In-l], k-1 which is equal to the coefficient of nl q m x k in t h e s u m ~ j =

[ j-1 n-l] q (~)xj 9

This proves 17 92 9

"

Enumeration

470

By comparing (17.1) with (17.2), changing the index of summation and increasing n by 1, we obtain the following. C o r o l l a r y 17.2.3 (The q - B i n o m i a l T h e o r e m )

(1 + x)(1 +

qz)...

(1 +

q~-ax) - j~o

q(~)xJ"

(17.3)

Identity (17.3) goes back to Cauchy [Cau93] and probably to Euler. Many important identities are easily derivable from (17.3) and vice versa. For example, by letting n --+ cx~, we can obtain the following formal-power-series identity of Euler" (1 - x)(1

- qx)(1

-

q2x) . . . .

q('~)(-x)J

j~0 (1 - q)(1 - q~ :- (1 -

qJ)"

(17.4)

From (17.4), again using (17.3), we obtain the companion identity of Euler:

1 (1 - x)(1

- qx)(1

oo -

q2x)...

xj

= j~0 (1 - q)(1 - q2)...

(1 -

qi)"

(17.5)

From (17.4) and (17.5) we can get, again using (17.3), the following identity of Heine: (1 - ax)(1 - qax)(1 - q 2 a x ) . . . = ~-,~ (1 - a)(i - qa~_2..!l_. - qJ-la)x j (1-x)(1-q/)(1-q2x)... ~o ( 1 - q ) ( 1 - u --). ( 1 - q J ) (17.6) Andrews [And76] derives many series-product identities from (17.6), including the q-binomial theorem itself. Knuth [Knu71] discusses an interesting connection between the fact that [~] is the number of k-dimensional subspaces of GF(q) ~ and the fact that it is the egf for the number of partitions of integers into at most k parts not exceeding n - k. The connection is via matrices in row-echelon form, whose shape can be interpreted as a threshold graph. We now present the work of Beissinger and Peled [BP87] on enumerating labeled threshold graphs and the identity of Frobenius. Let tnjk denote the number of labeled threshold graphs with n vertices, j isolated vertices, and exactly k distinct vertex-degrees. For convenience we set tojk = ~jo~ko,

(17.7)

17.2

E n u m e r a t i o n of T h r e s h o l d G r a p h s

471

where ~ is the Kronecker symbol. We denote by

tn - ~ tnjk

(17.8)

j,k

the total number of labeled threshold graphs on n vertices. We determine the generating function n

F(x,y,z)- ~ t~jk~.yJz k.

(17.9)

n,j,k

Theorem

17.2.4

The generatingfunction defined by (17.9) is given by 1/z+e~Y-1 F(x,y,z)- (1 - xz)1/z - e~ + 1" (17.10)

P r o o f . It is clear that for n >_ 2 one has t~jk - 0 unless 0 _< j _< n and 1 _< k _l

(17.11)

since any number of isolated vertices can be added to or removed from a threshold graph without affecting its thresholdness, as follows from Condition 4 of Theorem 1.2.4.

tnok- ~tnjk

for n >__2

(17.12)

j>__l

because, by Theorem 1.2.4, the complement of a threshold graph with n >_ 2 vertices, no isolated vertices, and k distinct vertex-degrees is a threshold graph with n vertices, one or more isolated vertices, and k distinct vertexdegrees. Equations (17.11) and (17.12) can be combined to obtain the recurrence

The boundary conditions (17.14) and (17.15) below apply to all classes of graphs, not just to threshold graphs. t~,~_,,k - 0

(17.14)

472

Enumeration

tnnk -- 5kl

(17.15)

for n > 1.

As an aid in determining F(x, y, z), consider the generating function

XnZk

yj(x, z) - E E t~j~ ~!

(17.16)

n>0 k>0

Then r 0 ( x , z) Xn

E E t~o~-fzk n)>0 k > 0

Xn

= 1 + o + E E t~o~., Z k

by (17.7),(17.14)

n>2 k>0

Xn

= 1+ E E Et~J~-~z k

by (17.12)

n>_2k>_Oj>_l

X~zk

= 1 + ~ ~ ~ ~njk n! j>_l n>_2 kkO

= 1 + ~-~ ( F j ( x ' z ) - O>_k~ t ~

k>_O ~-'~tljkxzk

-- l-Jr-j~>l ( F j ( x ' z ) - - E ~ j Ok>_Oe ~ k o Z k - -

by (17.16)

k>_O

by (17.7),(17.14),(17.15). Thus

Fo(x, z)

-

~

j>l

Fj(x, z)+ 1 - xz.

(17.17)

It follows from (17.11) and (17.16)that

xj Fj(x, z) - z~Fo(x, z) for j _> 1.

(17.18)

By summing (17.18) for j > 1 and using (17.17), we obtain

Fo(x,z)- 1 + xz - z(e ~ - 1)F0(x,z) and therefore F 0 ( x , ~) -

1-xz 1 - ( e ~ - 1)z

x-l/z e~ - (1 + l/z)"

(17.19)

17.2

E n u m e r a t i o n of T h r e s h o l d Graphs

473

By (17.18) we have

F(x,y,z)-

~ Fj(x,z)y j - Fo(x,z)(1 + z(e ~ y - 1)) j>o

and (17.10) follows. 9 To expand functions with denominators like that of (17.10), we use the Eulerian polynomials An(u), defined as follows (cf. Comtet [Com74] or Riordan [Rio58]): u-

1

U -- e v(u-1)

1

v~

U n>l

n'

= 1 + - ~ An(u)--r,

A o ( u ) - 1.

(17.20)

One combinatorial interpretation of the Eulerian polynomials is that for n _> 1, the coefficient of u m in An(u) is the number of permutations 7rl,..., 7r~ of 1 , . . . , n having m - 1 ascents, that is to say, positions j such that 7rj < 71"j+ 1 [GKP89, Equation (7.56)]. By substituting u - 2 in (17.20), we obtain 1

Xn

-

2

~

(17.21)

gn

n>O

where gn - A~(2)/2 for n >_ 1 and go - 1. The well-known numbers g~ were studied by Cayley in the 19th century and several references to them can be found in Knuth [Knu73] and Good [Goo75]. gn is the number of ordered partitions of the set { 0 , . . . , n - 1}, that is to say, the number of ways to partition the set into non-empty blocks and to arrange the blocks in a sequence. Equivalently, g~ is the number of those n-letter words over the alphabet { 1 , 2 , . . . } that for some k contain precisely the letters { 1 , . . . ,k}, repetitions allowed. To see this interpretation of gn, we only have to write 1

2-e ~

=

1

1 - ( e ~ - 1)

=

~-~(eX

-

1) k

k>0

and note that (e ~ - 1)k is the egf for the number of words over the alphabet { 1 , . . . , k} with each letter appearing at least once. Because of this interpretation of g~, we have

g n - ~ kk=0 !

nk '

(17.22)

474

Enumeration

where (~} denotes the Stirling number of the second kind, i.e., the number of partitions of { 0 , . . . , n - 1} into k unordered non-empty blocks. This agrees with the well-known expansion (see Comtet [Com74] or Riordan [Rio58])

1)

E k>0

"

k

"

Knuth [Knu73, Ex. 5.3.1.4] presents the following asymptotic for gn: g--5~-1 ( 1 ) nl9 -- 2(ln2)n+ 1 + k>, ~ Re (ln2 + 27rik) ~+1

n > - 1"

(17.23)

It is not hard to use (17.23) to obtain that

1

(1)

We can now enumerate the labeled threshold graphs. T h e o r e m 1'/'.2.5 The number of labeled threshold graphs on n vertices is given by 1, if n - 0,1 t~ 2(g~ - - ngn-1), if n _> 2. (17.24) P r o o f . According to (17.9) and (17.8), F(x, 1, 1) = E~>0 tnxn/n!" evaluate F(x, 1, 1)using (17.10), we obtain (1 - x)e ~ (2 2-e ~ -(l-x)2-e

) ~

1

If we

x~

-l-at-x-~-2E(gn-ngn_l)----(, n>2

~"

and the result follows. .. We now see a combinatorial argument that gives another proof of Theorem 17.2.5, as well as of the theorem of Frobenius. We denote by Tnk the set of labeled threshold graphs with the vertices { 0 , . . . , n - 1}, no isolated vertices, and exactly k distinct vertex-degrees, so that t~0k = IT~kl.

(17.25)

We also denote by Dnk the set of ordered partitions ( B 1 , . . . , Bk) of { 0 , . . . , n 1} into non-empty blocks B~ such that IB~] >_ 2. It is easy to see that

,D~k, - k ' { nk} - n(k - 1)'{k " - 1 1} "

(17.26)

17.2

475

E n u m e r a t i o n of Threshold Graphs

L e m m a 17.2.6 There exists a bijection between Dnk and Tnk. P r o o f . Both sets are empty if n - 1, so we may assume that n > 2. Given a partition d - ( B 1 , . . . , Bk) E Dnk, we construct a labeled graph G on the vertices 0 , . . . , n - 1 by joining each vertex of Bi to all the other vertices of B1 U - . . U Bi if i + k is odd, and to none of them if i -+- k is even. Then G is a threshold graph by Condition 4 of Theorem 1.2.4, and it has no isolated vertices, since the vertices of Bk are dominating. We show that G has exactly k distinct vertex-degrees by showing that two vertices v C Bi and w C Bj have the same degree if and only if i - j. The "if" part is obvious. Conversely, assume that i < j. If i + j is even, then i -t- 1 < j and every vertex of Bi+l is adjacent to exactly one of v and w, hence v and w have different degrees. If k + i is even and k + j is odd, then v has some neighbor in B1U---UB~ (because IB~I > 2), whereas w has no such neighbor; therefore v and w again have different degrees. A similar argument shows the same conclusion if k -4- 1 is odd and k -4-j is even. Therefore G C Tnk. Given G C Tnk, we construct an ordered partition d as follows. Let G1 - G. If Gi has been constructed as a threshold graph with two or more vertices, then it has either isolated vertices or dominating vertices, but not both, and we remove all these vertices to obtain Gi+l. By an argument similar to the one in the previous paragraph, we can show that two vertices of G have the same degree if and only if they are removed at the same time. Since G has exactly k distinct vertex-degrees, the last non-empty graph is Gk, and it has at least two vertices. We put the vertices of Gi - Gi+~ into the block Bk+~-i for i -- 1 , . . . , k and obtain an ordered partition d - ( B 1 , . . . , Bk) C Dnk. It is easy to see that the two mappings defined above between Dnk and Tnk a r e inverses of each other, and the proof is complete. 9 The second proof of Theorem 17.2.5 can be seen as follows. From (17.25), (17.26) and Lemma 17.2.6, we obtain " k

"

1} 1

(17.27)

for n >_ 1, and this equation also holds for n - 0 by our convention that took - ~k0. If we sum (17.27) over k, the left-hand side becomes t~/2 for n >_ 2 by (17.12), the right-hand side becomes g n - ng~_~ by (17.22), and (17.24) results. The reason we presented the first proof of Theorem 17.2.5 is that it enables us to prove the theorem of Frobenius [Frol0] (see also Comtet [Com74]),

IV

0

~.

I

....

I

._

~ ~,

~

I

I

~

~

IV

I

I

I

I

I--t,

~

~

I ~

oc--F-'

0

~

"~

i,~

~

~

9

~ ~

~

,~ o

o

~

I

CD

0

~

~.~

t'~

IV ~

I

9

~

.-

I

~

~

I

'ov

o

~

I I

;._,

o

~ ~

I

"--,.1

-~

0

~

O

c--c-

0

I

"

O0

-q

9

II

'--'

~1~

+

II

I

~ I

I

I

II

I

c~

I

~-~

I

o~

9

II

I

II

~

~

~

:~

~ 9

~ 9

~o~

IV

~

'ov

o ~ "~

i.-~~

~

0

~

~

o-'

~'~

0

~-

~

~

~

~

l:r'

0

eo

~

O~ ~

I

~

~

~. .

to

~

~

0 ~ o0

0~

0

""

~
0 , . ( k " k+l

}

-

"

k + l

'

we find

xs

1 + E ~2Ej! s>_l

" j_>0

{;} -

E k>0

([e~( e ~ - 1)]k) ~ - E k_>0

~.k!

{r + k -~

and therefore 2

j>0 J!

{ ~ } _ ~ > 0 ( : ) k ~ > ok?2 { r + l } { s - r + l } k+ 1 k+ 1 _ _

T h e o r e m 17.3.2

s>l.

The number of labeled difference graphs on n vertices is

(l+gn)/2. Proof. By Theorem 2.4.4 every difference graph consists of zero or more isolated vertices and a connected difference graph, and conversely. As in Theorem 17.3.1, the egf for the number of labeled non-empty bipartitioned connected difference graphs is ~k>l(e ~ - 1) 2k. Hence the egf for the number of labeled non-empty connected difference graphs is 71 E k ~ l ( ex - 1) 2 k where 1 the factor g comes from exchanging the two color classes. By adding 1 we also include the empty graph, and by then multiplying by e~ we allow for the

17.3

E n u m e r a t i o n of Difference G r a p h s

479

isolated vertices. Thus the egf for the number of labeled difference graphs is given by

(

e~ l + a . y -k>l ~(e~_l)2k 1

= z e~

(1+

) 1(

)

_ 2 e~ 1 + k>0 ~(eX-

1 )l(e~ 1 _ (e~_ 1)2 - ~

-+

1) 2k

1 ) 2_e~

and the number of labeled difference graphs on n vertices is the coefficient of W., namely (1 + gn)/2. " In the rest of this section we consider the enumeration of unlabeled difference graphs. We denote by | the set of unlabeled threshold graphs on n vertices, and by A, the set of unlabeled bipartitioned difference graphs on rz vertices. By (17.1) we have I O n l - 2n-l, as already noticed. We use the following connection between IZX~Iand IO~l. X n

L e m m a 17.3.3 [An[- IOnl + IAn_ll

7/

>_ 1.

Proof. Consider the mapping f 9 An -+ | given by f ( D ) - T, where D - (X,Y; E) C A~ is a bipartitioned difference graph with bipartition (X, Y), T - ( X U Y, E U E x ) , and E x is the edge-set of the complete graph on X. By Theorem 2.1.9 f is onto | but it is not one-to-one. To find f - x ( f ( D ) ) , we note that T and D are determined by their degree partitions according to Theorem 1.2.4 and Theorem 2.4.4. In T, the degree of every vertex of Y/ is [Xk U . . - U Xk-i+x[ and the degree of every vertex of Xi is IYk tO... U Yk-i+x [q-IX0 U . . . LI Xk[- 1. Since Xi and Y/are non-empty for i >_ 1, it follows that the degree partition of T is simply (Y0,..., Yk) unless one of the following conditions holds in D" 1. x 0 - o , IYkl - 1, in which case the degree partition of T is ( Yo , . . . , rk-1,

Yk

I,.,J X l

, X2 , . . . , Xk

);

2. I X 0 [ - 1, in which case the degree partition of T is ( ro , . . . , rk - l , Yk

[,-J X o

, X l , . . . , X k).

It follows that f - l ( f ( D ) ) - {D} unless Condition 1 or 2 holds for D, in which case f - l ( f ( D ) ) - {D,D*}, where D* is obtained from D by moving

480

Enumeration

the singleton Yk to X in Condition 1 and the singleton X0 to Y in Condition 2. Note that if Condition 1 holds for D, Condition 2 holds for D* and conversely. It follows that IOnl - Izx f- Izx'~l, where A" denotes the set of unlabeled bipartitioned difference graphs satisfying Condition 1 or 2. We complete the proof by indicating a bijection from A~ to An-l- It is given by deleting the singleton Yk in Condition 1 and the singleton X0 in Condition 2. The resulting bipartitioned difference graph D' has isolated vertices in X in Condition 1 and does not have them in Condition 2. Therefore we can uniquely reconstruct D from D' by adding a singleton Yk to Y if D 1 has isolated vertices in X and adding a singleton X0 to X if D' has none. This shows that indeed we have a bijection. 9 Theorem

17.3.4

The number of unlabeled bipartitioned difference graphs

on n vertices is 2 n. P r o o f . By Lemma 17.3.3 and the fact that IO~1- 2 n-l, we have [ A n [ - 2 n-1 q- 2 n-2 + " "

q- 20 + [ A o [ - 2n - 1 + 1 - 2 ~.

Another way to see this result is to modify the bijection between | and the set of binary words of length n - 1 into a bijection between A~ and the set of binary words of length n. We omit the details. 9 17.3.5 The number of unlabeled difference graphs on n >_ 1 vertices is 2~-2 + 2[~J -1.

Theorem

P r o o f . Again, according to Theorem 2.4.4, a difference graph consists of zero or more isolated vertices and a connected difference graph. Hence the required number is ~ - - 0 c~, where c~ is the number of unlabeled difference graphs with no isolated vertices on s vertices. According to Theorem 2.4.4, a bipartitioned difference graph with no isolated vertices is determined up to isomorphism by the cardinalities of the blocks X 1 , . . . , Xk and Y1,..., Yk, so the number of unlabeled bipartitioned difference graphs with no isolated vertices on s vertices is equal to the number of compositions of s into an even number of parts, i.e., sequences ( x l , . . . , x k , Y l , . . . , Y k ) of positive integers whose sum is s. For s _> 2, there a r e (;[21) compositions of s into 2k parts, and therefore a total of ~k (~k-~l) - 2s-2 compositions of s into an even number of parts. Since we are counting non-bipartitioned difference graphs

17.3

481

E n u m e r a t i o n of Difference Graphs

with no isolated vertices, Cs equals the number of inequivalent compositions of s into an even number of parts, where the composition ( x l , . . . , xk, yl,. 9 9 yk) is considered equivalent to itself and to ( y l , . . . ,xk, zl,... ,xk). We find c~ by the Burnside Lemma. All 2 ~-2 compositions into an even number of parts are invariant under the identity permutation. A composition into an even number of parts is invariant under the permutation 7r that switches color classes if and only if it has the form (if:l,...,3g.k, X l , . . . , X k ) , and ( x a , . . . , x k ) must then be a composition of s/2. Hence there are (~s even)compositions of s into 2k parts that are invariant under 7r, where the Iversonian (s even) stands for 1 if s is even and for 0 otherwise. By summing over all k >_ 1, we find that there are 2~/2-1(s even) compositions of s into an even number of parts that are invariant under 7r. Hence by the Burnside L e m m a c~ = 89(2 ~-2 + 2~/2-1(s even)) for s >_ 2. Clearly Co = 1 and cl - 0. therefore the number of unlabeled difference graphs on n >_ 2 vertices is given by n

~ - ~ Cs s~O n

1+ 0+ ~

(2 s-3 +

2s/2-2(s

even))

s~-2 n

= 2+ ~

(2 s-3 + 242-2(s even))

s----3

=2+2~-2-1+

(1+2+4+...2[~J

-2)

- 1 + 2 '~-2 + 2[~J -~ - 1 - 2 ~-2 + 2[~J -~. This value is also correct for n - 1.

9

Chapter 18 Extremal Problems 18.1

Introduction

Consider the following problems. How many connected threshold subgraphs can a graph have? What graphs with n vertices and rn edges have the largest number of connected threshold subgraphs? These problems still remain open. Paul ErdSs conjectured that for a fixed number of vertices, the complement of a matching contains the largest number of threshold subgraphs (private communication). Erd6s et al. [EGOZ89] consider the following problems. Given a graph G with n vertices and m edges, what is the largest number of edges that a chordal subgraph of G can have? What about an interval subgraph or a threshold subgraph? We report the results of these authors on interval subgraphs and threshold subgraphs in Section 18.2. Boesch et al. [BBB+90] considered the following problem. Which graphs on n vertices and m edges maximize the sum of squares of the degrees? It turns out that these graphs must be threshold graphs. However, not every threshold graph maximizes the sum of squares of the degrees. Only partial results are known in characterizing these threshold graphs. We report these results in Section 18.3. 483

484

Extremal Problems

18.2

Large Interval and Threshold Subgraphs of Dense Graphs

Given a graph with n vertices and m edges, what is the largest number of edges a chordal subgraph can have? What about an interval subgraph or a threshold subgraph? ErdSs et al. [EGOZ89] studied this problem for m >_ n2/4 + 1. They gave a complete answer for chordal subgraphs, and partial answers for interval and threshold subgraphs. We report their results for interval and threshold subgraphs. All results in this section are from [EGOZ89]. Let G be the class of graphs with n vertices and m edges, and 7 / b e a class of graphs closed under vertex deletion. Then there is a number f such that every member of ~ contains a member of 7-/with at least f edges, and there is a member of G containing no member of 7t with more than f edges. In practice, f may be hard to find, and it may be possible to bound it by finding a lower bound fl (there must be a member of 7-t with at least fl edges) and an upper bound f2 (there need not be a member of 7-t with more than f2 edges). The questions above are only interesting if the graphs contain sufficiently many edges. The complete bipartite graph Kn/2,n/2, for example, has no triangles and thus contains no interval graphs larger than a path ( n - 1 edges) and no threshold graph larger than a star (n/2 edges). We only consider graphs with n vertices and at least n2/4 + 1 edges (sufficient to guarantee the existence of a triangle), and refer to them as dense graphs. We define the size of a graph as the number of edges in the graph. The results in this section can be summarized as follows (all graphs are assumed to be dense graphs)" 9 Every graph has an interval subgraph of size > (1 + c)n. 9 There is a graph with no interval subgraph of size > 3n/2 - 1. 9 Every graph has a threshold subgraph of size > (1 + c)n/2. 9 There is a graph with no threshold subgraph of size > (1 + 0.5)n/2. The two constants c remain to be determined. By the star of a vertex a we mean the graph consisting of all vertices in the closed neighborhood of a, along with the edges joining a to all its

18.2


485

neighbors. By the star of an edge ab we mean the union of the star of a and the star of b. We call a graph G an edge-star or an e-star if G has an edge ab such that G is the star in G on the edge ab. We need the following Lemmas. L e m m a 18.2.1 Let G be a threshold graph with no K4. Then G has an edge ab (called the r o o t edge) such that every edge of G is either incident to a (the r o o t v e r t e x ) , or is of the form bc for some c such that abc is a triangle of G. Proof. Let a, b be the first two vertices in the non-increasing order of degrees in the threshold graph G. " It is a known principle that if a graph has average degree A, it has a vertex of degree considerably larger than A or almost no vertices of degree much below A. The next lemma makes this more formal. L e m m a 18.2.2 For every 1 > e > 0 there is a d > 0 (depending only on e) such that every graph with n vertices and average degree A has a vertex of degree exceeding (1 + d)A or fewer than en vertices of degree at most (1 - e)A. Proof. Choose d < e2/(1 - e ) . Suppose the conclusion is false. Then the en smallest degrees do not exceed ( 1 - e)A each, for a total not exceeding e ( 1 - e)An, and the remaining ( 1 - e)n degrees do not exceed (1 + d)A each, for a total not exceeding (1 - e ) ( 1 + d)An. Thus the grand total degree for the whole graph is at most (1 + d - e 2 - ed)An < An, contradicting the fact that the total degree is An. 9 Another commonly-used fact is that in the neighborhood of the largestdegree vertex there must be a vertex of "not too small" degree. The next lemma states this more formally. L e m m a 18.2.3 Let G have average degree exceeding n/2. Let the largestdegree vertex of G have degree (1 + c)n/2. Then in its neighborhood there must be a vertex of degree exceeding

1-c+

2c 2 ) n i + c 2"

Proof. If not, then each of the (1 + c)n/2 vertices in the neighborhood has degree not exceeding that, while each of the (1 - c ) n / 2 vertices not in that

Extremal Problems

486

neighborhood has degree not exceeding (1 + c)n/2. Again, the total degree of all the vertices is no more than n2/2, a contradiction. .. Yet another general result is that if not too many vertices of small degree are deleted from a dense graph, the resulting graph is still dense. The next l e m m a formalizes this. L e m m a 18.2.4 Let 1 > e > O. If G is a dense graph on n vertices, and at most en vertices of degree at most ( 1 - e ) n / 2 are deleted from G, the resulting graph is still dense. P r o o f . The number of deleted vertices is k < en and the total number of edges deleted is at most k(1 - e ) n / 2 . The number of remaining edges is therefore at least

~ -~-+ 1 - k(1-

n

( ~ - k) ~

e)~ -

4

k(2~+ 1+

4

k) > ( ~ - k) ~ -

4

+ 1.

L e m m a 18.2.5 ([Edw]) If G has n vertices and at least n2/4 + 1 edges, then G has an edge that is on at least n/6 triangles. T h e o r e m 18.2.6 There is a constant c > 0 such that for sufficiently large n, every dense graph on n vertices has an interval subgraph with at least (1 + c)n edges. P r o o f . Let e - 0.1 and d - 0.01. If there is a vertex with at least (1 + d ) n / 2 neighbors, then the largest-degree vertex a has degree (1 + f ) n / 2 , where d _< f < 1. Then by Lemma 18.2.3, a has a neighbor b with degree exceeding (1 - f + 2_]_!_~ ~ The e-star on edge ab is an interval graph with more than l+fJ~" d~ < -l+f ~ this (1 + -l +~f ) n 1 edges. For sufficiently large n ~ and for c < V -e-star has at least (1 + c)n edges. Otherwise, by L e m m a 18.2.2, there are e2 fewer than en vertices of degree at most (1 - e)n/2 (because d < i-~-~, as required in the proof of L e m m a 18.2.2). If all these vertices are deleted, the resulting graph G ~ is still dense by Lemma 18.2.4. By L e m m a 18.2.5, G' has an edge ab lying on at least ( n - en)/6 - 0.15n triangles. Let abc be one of these triangles. The set of edges consisting of these triangles, the star on a, and the star on c form a graph H with at least n(0.45 + 0.45 + 0.15) - 2 edges. This is at least (1 + c)n for sufficiently large n and c < 0.05. Further, it is easy to construct an interval model for H. 9

18.2


487

T h e o r e m 18.2.7 There is a constant c > 0 such that every dense graph on n vertices has a threshold subgraph with at least (1 + c)n/2 edges. P r o o f . Let 0 < e < 1/4 and 0 < d < e2/(1 - e) be two constants. If there is a vertex a of degree at least (1 + d)n/2, then the star on a is a threshold graph with at least (1 + c)n/2) edges for each c, 0 < c _< d. Otherwise, by Lemma 18.2.2, the number of vertices of degree at most (1 - e)n/2 is k < en. Delete all these vertices to obtain G ~. By Lemma 18.2.4, G ~ is dense. Hence, by Theorem 18.2.5, G' has an edge ab lying on at least ( n - k)/6 triangles. The union of these triangles and the star on a is a threshold graph whose number of edges is at least

n ~(1-r provided c _< ( 1 -

n-k 6

n >~(1-r

n-en 6

2 = 5 ~(1 - r

n ->(l+c)g'

4e)/3.

F a c t 18.2.8 There is a dense graph on n vertices whose largest interval subgraph has exactly (3/2)n - 1 edges. P r o o f . Consider the dense graph G obtained by adding an edge ab to the complete bipartite graph Kn/2,n/2. Let H be any interval subgraph of G. Delete one vertex from H, making sure it is a or b if H contains one of them. The resulting induced subgraph of H is still an interval subgraph of a bipartite graph on n - 1 vertices, and hence is a forest and has at most n - 2 edges. Adding back the star on the deleted vertex adds at most (n/2) + 1 edges, showing that H has a total of at most (3/2)n - 1 edges. On the other hand, let abc be any triangle in G. It is easy to see that the union of the stars of a, b and c is an interval graph with exactly (3/2)n - 1 edges. .. F a c t 18.2.9 For n sufficiently large and any fixed e > O, there is a dense graph on n vertices having no threshold subgraph with at least (1.5 + e)n/2 edges. (Hence the c in Theorem 18.2.7 cannot exceed 1/2.) P r o o f . Let A and B be vertex sets of size (1 + c)n/2 and ( 1 - c)n/2 respectively and add all ( 1 - c 2 ) n 2 / 4 edges from A to B, where c = 1/3. Now divide A into two equal parts A1 and A2 and add (cn)2/4 + 1 edges between those two parts, distributed as regularly as possible. The degree of a vertex in B is simply (1 + c)n/2, while each vertex in A1 has (1 - c)n/2 edges leading to

488

Extremal Problems

B and at most just over ((cn)2/4)/((1 + c)n/4) = nc2/(1 + c) edges leading to A2; so no star can be a big enough threshold graph. Consider a threshold subgraph containing a triangle. The graph is tripartite so has no K4; hence by Lemma 18.2.1 it has a root vertex and a root edge. We distinguish three cases: (i) root edge from A1 to A2, root vertex in either set; (ii) root edge from B to A1, root vertex in B; (iii) root edge from B to A1, root vertex in A1. All other cases are equivalent to these. (i) The root edge is from A1 to A2 and lies on at most ( 1 - c)n/2 triangles (the other vertex of the triangle must be in B); the root vertex has in addition about nc2/(1 +c) edges from A1 to A2. The total size of this threshold graph is about (1 - c)n + nc2/(1 + c), which equals (1.5)n/2 (since c = 1/3). (ii) The root edge joins B to A1. It lies on about c2n/(1 + c) triangles, and the degree of the vertex at B is at most (1 + c)n/2, of which c2n/(1 + c) edges have already been counted. The total size of this graph is thus about (1 + c)n/2 + c2n/(1 + c), which again equals (1.5)(n/2). (iii) The root edge again joins A1 to B, but we consider also the edges from the vertex in A1 to B, of which there are (1 - c)n/2. The total size of the threshold graph is about 2nc2/(1 + c) + (1 - c)n/2 = n/2. ..

18.3

M a x i m i z i n g the S u m of Squares of Degrees

Recall that a graph having n vertices and e edges is called an (n, e)-graph. In this section we consider the following problem. Given n and e, which (n, e)graphs have the largest sum of squares of the degrees? We refer to these graphs as a-optimal. Boesch et al. [BBB+90] consider this problem and show that these graphs must be threshold. However, not every threshold graph is a-optimal. For every valid choice of n and e, the authors define two threshold (n, e)graphs, which always exist and are unique, and prove that at least one of them is a-optimal. This enables us to compute the optimal value for any valid choice of n and e. We give examples showing that not both need be a-optimal, as well as a-optimal graphs that are not of this form. This leaves open the problem of characterizing the threshold graphs that are ~r-optimal. In this section we report the results of [BBB+90]. If r is a function from graphs to integers, then a graph G is called r

18.3

M a x i m i z i n g t h e S u m of Squares of Degrees

489

optimal if r >_ r for all graphs G' with the same number of vertices and edges as G. One such function of interest here is a(G) - ~ n d~, where ( d l , . . . , dn) is the degree sequence of G. Throughout this section we assume that n and e denote, respectively, the number of vertices and edges of a graph. Thus 0 deg(a), consider the graph G' obtained from G by replacing edge ac with bc. It is easy to see that a(G') > a(G). Thus if a graph G is a-optimal, then for every two vertices a, b, deg(a) >_ deg(b) if and only if N[a] D_ N(b). Therefore G is a threshold graph by Condition 5 of Theorem 1.2.4. 9 Lemma

18.3.2 A graph G is a-optimal if and only if its complement G is

a-optimal. P r o o f . The result is immediate, since a(G) - ~"~in__l[(rt -- 1) - d~]2 or(G), where k(n, e)is a constant depending only on n and e. Lemma

18.3.3 Let G -

+

K1 0 H be an (n, e)-graph. Then

1. If G is a-optimal, then H is a-optimal. 2. Conversely, if H is a-optimal and if there is a connected a-optimal (n, e)-graph, then G is a-optimal. P r o o f . Part 1 follows from the fact that a(IQ | H) = k(n, e)+ a(H), where k is a constant depending only on n and e. To prove Part 2, let G t be a connected a-optimal graph. Then G ~ is a threshold graph of the form K1 | H', and both H and H' are ( n - 1, e - n + 1)-graphs. By part 1, H' is ~r-optimal, hence a(H) = a(H'), a(G) = cr(G'), and G is a-optimal. 9 D e f i n i t i o n 18.3.4 For fixed n and e, G1 and G2 are the following threshold

(n, e)-graphs (Figure 18.1)" (~l(p, q, r) --

I~p (~ ( ~q [,_JKl,r),

490

Extremal Problems

where n - p + q + r + l, e - ( ~ ) + p ( q + r + l ) + r ,

G2(p, q, r) - Sp I.J (I(q (]~ KI,~),

Figure 18.1: The (7, 6)-graphs of the forms G1 and G2.

GI(0, 0, 6) L e m m a 18.3.5

G2(2, 0,4)

1. G~ (p, q, r) is the complement of G2(p, q, r).

2. For every valid n and e, there exist (n, e)-graphs of the form G1 and G2.

~. If G1 (p, q, r) and G1 (pt qt rt) are (n, e)-graphs, they are isomorphic, and similarly for G2.

P r o o f . 1" This is obvious from the definition. 2" By part 1, it is enough to show that G2(p, q, r) exists whenever 0 < e < (2). Let k be the largest integer satisfying (k2) < e. Then k < n. If k - n~ then e -

(2) and we can t a k e q -

n-l,p-r-

0. I f k _< n - l ,

take

qe - ( k 2 ) _> 0, r - k - q > 0 (for otherwise e > ( ~ ) + k - (k+l), contradicting the maximality of k), and p - n - k - 1 > 0. 3: Let G1 (p, q, r) and GI(p', q', r') be (n, e)-graphs. If p - p', then clearly q _ ql and r - r' and we are done. Now assume that p' - p + 5 and 5 > 0. When p is increased by 1, (~) + p ( n - p)increases by n - p 1. Therefore

p,+,n

p 1,+,n

2,+

p

18.3

Maximizing the Sum of Squares of Degrees

491

Hence e - r' >_ (e - r) + (n - p - 1) - e + q, i.e., q + r' _< O, and we conclude that q - r' - 0 as well as 6 - 1. Therefore G1 (p, q, r) - G1 (p, 0, r) - Kp+a | S~_p_a,

G~ (p', q', r') - G~ (p', q', O) - Kp, | Sq, - Kp+, | Sn-p-1. We frequently use the following operation in the proof of Theorem 18.3.7 below (see Figure 18.2). Let G be a graph, a and b two vertices of G of degrees a and/3, respectively, T1 a set of m neighbors of a, all different from b and having the same degree tl, and T2 a set of m non-neighbors of b, all different from a and having the same degree t2. Then G ( a , b, Tx, T2) is obtained from G b y d e l e t i n g all edges from a to T1 and adding all edges from b to T2. A simple calculation shows that

a(G(a, b, T1, T2)) - ~r(G) - 2m(/3 - a + t2

-

tl

-~- m

-t- 1)

(18.1)

Figure 18.2: An operation used in the proof of Theorem 18.3.7.

G

T~

a

T~

T2

b

T2

G(a,b, T1,T2)

F a c t 1 8 . 3 . 6 If e < 3 and n is arbitrary, then both G1 and G2 are or-optimal

(n, e) -graphs. P r o o f . This follows from L e m m a 18.3.1, since if e _< 3, all the threshold (n, e)-graphs have the same ~r. 9 Theorem

1 8 . 3 . 7 For every fixed n and e, at least one of the (n, e)-graphs

G l a n d G2 is a-optimal.

492

Extremal Problems

P r o o f . We prove the result by induction on n. Obviously, the result holds for n - 1. Assume that for every e' q + r - 1), and

18.3

493

Maximizing t h e S u m of Squares of Degrees

Figure 18.3: The graph G used in the proof of Theorem 18.3.7.

s~

Y

G -

9 H'-

Is

@

G2(p, q, r)

H(a,b,T~,T2).

T h e n c r ( H ' ) - a ( H ) - 0 by (lS.1). Note t h a t H ' has the form ((/s I,.JS;)@ K~) U Sp_q+l, where K ' and S' are cliques and stable sets of the i n d i c a t e d cardinalities, all positive. P u t m - min(q + r - 2 , p - q + 1), and observe t h a t m > 0 by (18.3) and our a s s u m p t i o n . In H', let 9 b be the s a m e v e r t e x b as before, now of degree 2q + r - 1, 9 c be a v e r t e x in K~+r_ 1 of degree q + r - 1, 9 T; be a set of m vertices

*' in I(q+r_ 1 other

9 T~ be a set of m vertices

in Slp_q_t_l of

9 H"-

H'(c,b,T~,T~).

t h a n c, of degree q + r -

degree 0,

1

494

Extremal Problems

Then a( H") - (r( H') = 2 m ( m + 2 - r ) by (18.1). I f m = q + r - 2 , then ( r ( H " ) - a ( H ' ) = 2mq > 0 since we are in Case 1. I f m = p - q + l , then ~r(H")-a(H') = 2m(p+3-q-r) > 0 because p > q + r 1. This p roves (18.4). We now show t h a t r = 0. For otherwise, let m = min(p, r) > 0 by (18.2), and in G let 9 a be the vertex y of degree n - 1, 9 b be the vertex x of degree q + 1, 9 T1 be a set of m vertices from Sp of degree 1, 9 T2 be a set of m vertices from Kr of degree q + r, and

9 G ' - G(a,b, T1,T2). Then ~r(G')-a(G) = 2re(re+q-p) by (18.1). If m = p, then a(G')-cr(G) = 2mq > 0, and if m = r, then a ( G ' ) - a ( G ) = 2m(r + q - p ) > 0 by (18.4), so in both cases we have a contradiction to the a - o p t i m a l i t y of G. This proves t h a t r = 0. Thus H is of the form Kq+l Ig Sp. Now if p = 1, then G is in the form G2, as required. Otherwise p _> 2 by (18.2), and we let 9 a be the vertex y of degree n - 1, 9 b be a vertex of Sp of degree 1, 9 T1 - Sp - {b}, a set of vertices with degree 1, 9 T2 be a set of p - 1 vertices in Kq (which exist by (18.4) and r - 0) of degree q + 1, and

9 G ' - G(a,b, T1,T2). Then c r ( G ' ) - a ( G ) - 0 by (18.1). Thus G' is also a-optimal. Clearly, G' is an (n, e)-graph of the form G ~ ( p - 1, p, q - p + 1). C a s e 2" q - 0. Then H is of the form K~ t2 Sp+l, which is isomorphic to a2(p + 1, 1, 0). By (18.3) and q - 0 we certainly have r - 1 >_ 1, and we are again in Case 1. 9 R e m a r k . For (7, 6)-graphs, G2 is not a-optimal, since a ( G l ) > or(G2) (see Figure 18.1). By considering the complements of these two graphs as well,

18.3

Maximizing the Sum of Squares of Degrees

495

we conclude that neither G1 nor G2 need always be a-optimal. Moreover, the (6, 7)-graph G of Figure 18.4 is not of the form G1 or G2, yet is a-optimal along with G1 and G2, since all these graphs have the same cr value. Figure 18.4: Examples of it-optimal (6, 7)-graphs.

GI(1, 2,2)

G2(1, 1,3)

G

Chapter 19 Other Extensions 19.1

Introduction

In this chapter we discuss three unrelated concepts on threshold graphs. In Section 19.2 we look at a generalization of threshold graphs based on geometrical embeddings of graphs. The threshold graphs can be thought of as the graphs for which there exist a non-negative real weight w~ associated with each vertex u and a positive real threshold t such that u v is an edge if and only if w u w v > t . If we allow the weights and threshold to be any reals, we obtain the generalized threshold graphs characterized by Reiterman et al. in [RRS89]. If the weights are replaced by vectors and the product is replaced by the scalar product, we obtain a further generalization of the threshold dimension. In Section 19.3 we introduce a new class of graphs called C-tolerance intersection graphs, due to Jacobson et al. [JMM91]. Golumbic et al. [GM82, GMT84] defined the tolerance graphs by assigning to each vertex u an interval I~ and a non-negative tolerance t~, such that u v is an edge if and only if IIu n Iv I _> min(t~, t~). These graphs clearly generalize the interval graphs. Monma et al. [MRT88] defined the threshold tolerance graphs by assigning to each vertex u a non-negative weight w~ and a tolerance t~ such that u v is an edge if and only if w~ + w~ > min(t~, t~). Both these classes are special cases of C-tolerance intersection graphs. In Section 19.4 we study the universal graphs for the class T~ of threshold graphs on n vertices. A d-universal graph for a class C of graphs is a graph G such that every graph in d is an induced subgraph of G. Hammer and 497

498

Other Extensions

Kelmans [HK94] completely characterize those T~-universal graphs with the least number of vertices that are themselves threshold graphs.

19.2

Geometric Embeddings of Graphs

Given a simple graph G, a representation of G is a function that assigns to each vertex x of G a vector ~ C R d, such that two vertices x and y are adjacent in G if and only if x and y satisfy some specified geometrical condition. Representations of graphs of this type have been considered by numerous authors. Reiterman et al. [RRS89] consider the representation of a graph G = (V, E) such that for some real threshold t, xyCE

if and only if

~y_t.

The scalar product dimension dp(G) is the minimum number d such that G admits such a representation in R d. The authors show that dp(G) t , or equivalently, using w' - e ~, t' xy E E

-

e t,

if and only if

w~wy~ ~ >_ t',

wherew t:V--+N +,t ~EN +,andN +={xCN:x>0}. Relaxing f r o m N + to N, we have the following generalization of threshold graphs. D e f i n i t i o n 19.2.1 A graph G = (V, E) is a g e n e r a l i z e d t h r e s h o l d g r a p h when there are a threshold t C N and a label x C N 1 associated with each vertex x, such that for all distinct x, y C V, xy C E

if and only if

x y >_ t.

(19.1)

Thus the threshold graphs are precisely those generalized threshold graphs that can be represented by a positive labeling and a positive threshold. Generalized threshold graphs are not necessarily threshold. For instance, the

19.2


499

graph 2K2 is a generalized threshold graph, as shown in Proposition 19.2.2. However, if in Definition 19.2.1 the labeling and the threshold are nonnegative, then G = (V, E) is in fact a threshold graph, as is easy to see. We can always ensure, by slightly decreasing t if necessary, that the inequality in (19.1) never holds with equality. P r o p o s i t i o n 19.2.2 A graph is a generalized threshold graph if and only if it is the disjoint union of two threshold graphs or the complement of a difference graph. P r o o f . Suppose G - (V, E) is the disjoint union of threshold graphs G1 = (V1, EI) and G2 - ( 89 E2), and let x ~ x C R (x e V), ti > 0 represent G~ for i - 1, 2, where x >_ 0 (x E V). By a simple rescaling we may assume that t 1 t 2 t . Then the labeling x ~ x ~ defined by x'-

~" x, ( -x,

for x E V~ for x C 1/2

together with the threshold t represent G. Suppose the complement G is a difference graph with color classes V1 and V2. By Theorem 2.4.3, if we add to G all the edges joining vertices of V1, it becomes a threshold graph T. Let x H x be a positive labeling and t > 0 a threshold representing T, such that x y =/= t for all distinct x, y. Then G is represented by x ~ x ~ and t ~, where x'-

~ x, [ -x,

for x C V1 for x E

t ! --

--t.

Conversely, let G be a generalized threshold graph represented by x H x and a threshold t such that the inequality in (19.1) never holds with equality. Put

{xc

0. Then there is no edge between V1 and 1/2. The induced subgraph G1 - (VI, El) is a threshold graph as the labeling is non-negative on V1. As for the induced subgraph G2 - (1/2, E2), we use the non-negative labeling x ~ - x and the same threshold t to see that G2 is also a threshold graph. Next, assume t _< 0. Then V1 and 1/2 are cliques and the non-negative labeling x ~ [x[ together with the threshold Itl represent a threshold graph

500

Other Extensions

(V, E") such that for all x C 1/1 and y C 1/2, we have xy E E if and only if xy ~ E". Thus by Theorem 2.4.3 G is the complement of a difference graph.

C o r o l l a r y 19.2.3 The generalized threshold graphs have Dilworth number at most 2. Proof. This follows from Propositions 19.2.2 and 2.4.2. 9 The following theorem gives a characterization of generalized threshold graphs by forbidden induced subgraphs. The proof can be found in [RRS89]. T h e o r e m 19.2.4 ([RRSS9]) A graph is generalized threshold if and only if it does not contain any of the graphs G1,..., G10 of Figure 19.1 as induced subgraphs.

Figure 19.1: The forbidden induced subgraphs for generalized threshold graphs.

": 9 i i i I t ! G1

G2

Ga

G4

G5

G6

G7

G8

G9

GlO

We now consider the dimension of generalized threshold graphs. Recall that the threshold dimension t(G) of a graph G = (1/, E) is the minimum number t of threshold spanning subgraphs G1,..., Gt of G whose union is E; in other words, two vertices form a nonedge in G if and only if they form a nonedge in each of G1,..., Gt. By taking exponents, we can say equivalently

19.2


501

that the threshold dimension t(G) of G is the minimum number t with the property that there are a function f " V ~ IRt with f ( x ) > 0 (x E V) and a vector c > 0 in IRt, such that for distinct x, y C V,

xy~E

if and only if

f(x),J'(y) t.

The s c a l a r product d i m e n s i o n dp(G) of a graph G is the minimum number d such that G admits a representation in N d. The assignment x ~ x, where x is the row of the vertex-edge incidence matrix of G corresponding to x, together with the threshold t - 1, provide the simplest example of a representation of G. This shows that dp(G) is well-defined and gives the trivial upper bound dp(G) _< n, where n is the number of vertices (since the vectors a generate a subspace of dimension _< n over N). A better upper bound for dp(G) is established in the following theorem, which shows that the scalar product representation is more efficient than the threshold one, in the sense that it represents more graphs using small dimensions.

T h e o r e m 19.2.6 Every graph G satisfies dR(G) < t(G). P r o o f . Let G - (V,E), t(G) - s, and E - U~=IEk , where the (V, Ek) are threshold graphs. Let V - Dok U ... U D mkk be the degree partition of (V, Ek), k - 1 , . . . , s, as in Definition 1.2.3. Recall that for every distinct x , y C V with x C D~, y C D~,

xyCEk

if and only if

i+j>_mk+l.

502

Other Extensions

We construct a labeling x ~ x with values in IR~ by p u t t i n g 1

(x)k - M i- ~'~k f o r k - 1 , . . - , s ,

ifxcD/k,

where M > s is arbitrary. If xy C E, then xy C Ek for some k and x C D/k, y E D) for some i , j satisfying i + j _> mk + 1. Hence 1

1

x y > xkyk -- M i - ~ m k M j-~mk - M i+j-mk >_ M.

If xy ~ E for distinct x, y, then for every k - 1 , . . . , s we have xy ~ Ek. Let x E D~, y E D). Then i + j - i n k < 1. Thus xkyk -- M ~+j-mk _< 1, and hence xy-

~-~xkyk < S < M. k=l

This proves t h a t our labeling together with the threshold t - M represent G in N s, and hence dp(G) _ n/2 for n _> 4. W h e n G is the disjoint union of two or more graphs each of which has at least one edge, then again dp(G) is much smaller than t(G), as can be seen from the proposition below. Proposition

1 9 . 2 . 7 If G is the disjoint union of graphs G I , . . . , Gn, then

dp(G) m a x / t i is for the mom e n t unspecified. We show that if t is sufficiently large, then the assignment

19.2

503


x ~ x', where ~ e ' - Ai(x + d i) for x C V/, i - 1 , . . . , n, represents G in IRm with the threshold t. Indeed, if x E !Vi and y C Vj, then

x'y' - A i ( x + d i ) A J ( y + d j) - ( x + d i ) T A iT A J ( y + d j) - ( x + d i ) T A J - i ( y + d j) m-2 (j -i)Tr ~(j = ~ xkyk + Xm-lY~n-1 COS ~ + ~ / t - tjxm-1 sin k=l

-

T/

v/t - t~ sin ~ (jy -m -i)Tr 1

i)Tr ?~

+ ~ / ( t - t i ) ( t - tj)cos (j - i)Tr

n

n

Now if i - j, then x'y' - x y + t - ti, and hence x y > t if and only if x y > ti, which happens for x :fi y if and only if xy is an edge of Gi. If i -r j, then obviously lira x y -- cos (j - i)Tr < 1. t ---*oo

t

r/

Hence x y < t for all x in Gi, y in Gj if t is sufficiently large. This verifies that we have defined a correct representation for G. 9 R e m a r k . In general, the upper bound of Proposition 19.2.7 cannot be improved. For instance, d e ( K 2 ) = de(2K2) = 1, but dp(3K2) > 1. On the other hand, it is not known whether de(V, UE~) 4. 2. d p ( P n ) - 2

>__ 5

3. dp(nK2) - 2 for every n >_ 3. ~. dp(Kn,~) - n for every n >_ 1. P r o o f . 1" Let Xl , . . . , X n be vertices of Cn along the cycle. For i - 1 , . . . , n, the vectors 27ri 27ri) R2 cos~,sin-C n

n

represent Cn in IR2 with the threshold t - cos 2_~. Hence dp(Cn) __min(t~,tv).

508

Other Extensions

In particular, if t~ _< IS~ l, then vertex 1 is dominating. If t 1 ) ISll and vertex 1 is not isolated, let u be adjacent to 1. Then tu

Observe that the Sv are nested sets. i _< j _< [~], then ISuNSvl-i+l

if j < [~J ifj >

Further, for u E Di and v C Dj, if

<m+l-min(tu,

tv),

so u and v are not adjacent. If i _> j > [~J, then

n Xvl-

+ 1 _> min(t~, t~),

so u and v are adjacent. Finally, if/_< [~J < j, then ISunSvl-i+l_>rn-j+2

if and only if

i+j>m.

Since by Condition 6 of Theorem 1.2.4, u and v are adjacent if and only if i + j > m, G is a min-tolerance chain graph with the assigned nested sets and tolerances. 2" Let G be a max-tolerance chain graph on the vertices V l , . . . , v ~ . We may assume without loss of generality that the associated nested sets are { 1 , . . . , r i } for i - 1 , . . . , n , where 1 _< r~ _< ... _< r~. Let the corresponding tolerance of vertex vi be ti. We may also assume that ti _< ri for all i, for if ti > ri, then vi is isolated and we can disregard vi. Consequently, we can associate the interval Ii - [ti, ri] with each vertex vi. Now vertices vi and vj are adjacent if and only if min(ri, rj) >_ m a x ( t i , t j ) . This holds if and only if Ii N Ij 5r 0 , and thus G is an interval graph. Conversely, if G is an interval graph with associated intervals [ti, ri], i - 1 , . . . ,n, we may assume without loss of generality that the ti and ri are positive integers. Then the sets { 1 , . . . ,ri} along with the tolerances ti represent G as a max-tolerance chain graph.

19.4

509

Universal Threshold Graphs

3: Recall that the complements of threshold tolerance graphs (coTT graphs) have the adjacency defined by

ij C E

if and only if

wi + wj < min(ti, tj).

Again we may assume that the ti are positive integers. Hence the coTT graphs are the same as the sum-tolerance chain graphs with nested sets { 1 , . . . , t i } and tolerances wi + 1/2. 9 R e m a r k . There is no known characterization of threshold tolerance graphs by forbidden subgraphs. However, a polynomial-time algorithm for recognizing this class of graphs is given in [MRT88].

19.4


In [HK94], Hammer and Kelmans studied the universal graphs for threshold graphs. Given a class C of graphs, a graph G is called a C-universal graph, or simply a C-UG, if every graph in C is isomorphic to an induced subgraph of G. If in addition G has the least number of vertices among all C-UGs, then G is called a minimum C-UG, or simply a C-MUG. Observe that if d is closed under complements, then G is a C-UG whenever its complement is, and similarly for C-MUGs. Likewise, if C* is the class of all induced subgraphs of the graphs in C, then every C-UG is a C*-UG, and similarly for MUGs. Let T~ denote the class of all threshold graphs on n vertices. We characterize all the :m-MUGs that are themselves threshold (i.e., the threshold T~-MUGs), a result originally proved by Hammer and Kelmans [HK94]. N o t a t i o n . We denote by (G, xy) the graph obtained from a graph G by adding two adjacent new vertices x, y, and joining y to all vertices of G. In other words, (O, xy) = (O U x) @ y. Similarly, we denote by (G,g-~y) the graph obtained by adding two nonadjacent new vertices x, y, and joining y to all vertices of G. In other words, (a,

= (G 9 y) u x.

Note that (G, xy) = (G, yx). Theorem

19.4.1 For every graph G, the following conditions are equivalent:

1. G is a (threshold)2/-~-UG;

510

Other E x t e n s i o n s

2. (G, xy) is a (threshold)~n+I-UG; 3. (G,-~y) is a (threshold)~n+I-UG. P r o o f . By Condition 4 of Theorem 1.2.4, if one of the three graphs is threshold, so are the other two. 1) =~ 2): Assume that G is a ~ - U G . Let T be any threshold graph on n + 1 vertices. If T contains an isolated vertex x, then T - {x} is isomorphic to an induced subgraph of G, and hence T is isomorphic to an induced subgraph of (G, xy). Otherwise T contains a dominating vertex y, and the proof is similar. 2) ~ 1): Assume that (G, xy) is a Tn+I-UG. Let T be any threshold graph on n vertices. The graph T' = T U {a} is isomorphic to an induced subgraph of (G, xy) on some vertex set X. Clearly, X does not contain y, since T' contains an isolated vertex. We may assume without loss of generality that x is the isolated vertex. It follows that T is isomorphic to the subgraph induced by X - {x} in G. The fact that 1) ,z---->, 3) follows from 1) ~ 2) by a simple complementarity argument. 9 Notation. Let a ~ = ( a l , . . . , a t ) b e a 0-1 sequence. Let H = G(a ~) be the graph constructed from G as described below: 1. s e t H : = G ; 2. for i = 1 , . . . , r , do H "-

(H, xiyi), (H, x-~),

if ai = 1, if ai = O.

1. Observe that G(a ~) is isomorphic to G(b ~) if and only if a ~ = b ~, and that, as with threshold graphs, there is a 1-to-1 correspondence between the set of 0-1 sequences a ~ and the set of graphs of the

F a c t 19.4.2

form 2. From Theorem 19.~. 1, it follows that G(a n) is a T~+k- UG if and only if G is a Tk- UG. 3. Since K1 is a threshold ~ - U G , K l ( a ~-a) is a threshold Tn-UG for each 0-1 sequence a n-1. Thus there are at least 2 n - 1 threshold T~-UGs, each with exactly 2 n - 1 vertices.

19.4


511

In fact, the 2 n-1 threshold Tn-UGs K l ( a n - l ) are all the threshold T~-MUGs, as we show below. T h e o r e m 1 9 . 4 . 3 If G is a Tn-MUG, then

1. the vertices of G can be partitioned into a stable set { S l , . . . , s n } and a clique { C l , . . . , Cn-1}. Hence G contains exactly 2n - 1 vertices. 2. If the si are indexed so that deg(sl) _ k - 1 for k - 1 , . . . , n - 1. Similarly, G contains as induced subgraphs all threshold graphs of the form I~UKn_~ for r - 1 , . . . , n - 1 . Now setting r - 1 we obtain deg(s~) _< 1. Then setting r - 2 we obtain deg(s2) _< 2. Continuing in this way, we obtain deg(sk)

Threshold Graphs and Related Topics, Volume 56 (Annals of Discrete Mathematics)

Threshold Graphs and Related Topics (Annals of Discrete Mathematics, Volume 56)

Threshold Graphs and Related Topics

Topics on Perfect Graphs (Annals of Discrete Mathematics, Volume 21)

Eulerian Graphs and Related Topics: Part 1, Volume 2 (Annals of Discrete Mathematics, Volume 50)

Eulerian Graphs and Related Topics: Part 1, Volume 1 (Annals of Discrete Mathematics, Volume 45)

Eulerian Graphs and Related Topics: Part 1, Volume 1 (Annals of Discrete Mathematics, Volume 45)

Studies on Graphs and Discrete Programming (Annals of Discrete Mathematics)

Topics on Steiner Systems (Annals of Discrete Mathematics, Volume 7)

Eulerian Graphs and Related Topics

Eulerian Graphs and Related Topics

Eulerian Graphs and Related Topics

Eulerian Graphs and Related Topics

Graph Theory and Applications (Annals of discrete mathematics, Volume 38)

Topics in Discrete Mathematics

Convexity and Graph Theory (Annals of Discrete Mathematics, Volume 20)

Fourth Czechoslovakian Symposium on Combinatorics, Graphs and Complexity (Annals of Discrete Mathematics, Volume 51)

Algorithmic Graph Theory and Perfect Graphs, Volume 57, Second Edition (Annals of Discrete Mathematics)

Fourth Czechoslovakian Symposium on Combinatorics, Graphs and Complexity (Annals of Discrete Mathematics, Volume 51)

Theories of Computational Complexity (Annals of Discrete Mathematics, Volume 35)

Theories of Computational Complexity (Annals of Discrete Mathematics, Volume 35)

Lukasiewicz-Moisil Algebras (Annals of Discrete Mathematics, Volume 49)

Combinatorics '86 (Annals of Discrete Mathematics)

The Steiner Tree Problem (Annals of Discrete Mathematics, Volume 53)

The Steiner Tree Problem (Annals of Discrete Mathematics, Volume 53)

Lukasiewicz-Moisil Algebras (Annals of Discrete Mathematics, Volume 49)

Knot Groups. Annals of Mathematics Studies. (Annals of Mathematics Studies, No. 56)

Fourth Czechoslovakian Symposium on Combinatorics, Graphs and Complexity (Annals of Discrete Mathematics)

Topics in Finite and Discrete Mathematics

Algorithmic Graph Theory and Perfect Graphs, Second Edition (Annals of Discrete Mathematics 57)

Discrete Mathematics of Neural Networks: Selected Topics

Threshold Graphs and Related Topics, Volume 56 (Annals of Discrete Mathematics)

Threshold Graphs and Related Topics (Annals of Discrete Mathematics, Volume 56)