Analysis and Optimization of Systems: Proceedings of the 9th International Conference, Antibes, June 12-15, 1990 (Lecture Notes in Control and Information Sciences) (English and French Edition)

Lecture Notes in Control and Information Sciences Edited by M.Thoma and A.Wyner 144 A. Bensoussan, J. L. Lions (Editors...

Author: A. Bensoussan | J.L. Lions

13 downloads 559 Views 37MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

Lecture Notes in Control and Information Sciences Edited by M.Thoma and A.Wyner

144 A. Bensoussan, J. L. Lions (Editors)

Analysis and Optimization of Systems Proceedings of the 9th International Conference Antibes, June 12-15, 1990

Springer-Verlag Berlin Heidelberg New York London Paris Tokyo Hong Kong

Series Editors M. Thoma • A. Wyner

Advisory Board L. D. Davisson • A. G. J. MacFarlane • H. Kwakernaak J. L. Massey • Ya Z. Tsypkin • A. J. Viterbi

Editors A. Bensoussan INRIA - Universite Paris IX Dauphine J. L. Lions College de France - CNES, Paris INRIA Institut National de Recherche en Informatique et en Automatique Domaine de Voluceau, Rocquencourt, B.P.105 78153 Le Chesnay/France

ISBN 3-540-52630-7 Springer-Verlag Berlin Heidelberg New York ISBN 0-387-52630-7 Springer-Verlag New York Berlin Heidelberg This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in other ways, and storage in data banks. Duplication of this publication or parts thereof is only permitted under the provisions of the German Copyright Law of September 9, 1965, in its current version, and a copyright fee must always be paid. Violations fall under the prosecution act of the German Copyright Law. © Springer-Verlag Berlin, Heidelberg 1990 Printed in Germany The use of registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Printing: Mercedes-Druck, Berlin Binding: B. Helm, Berlin 2161/3020-543210 Printed on acid-free paper.

FOREWORD

These Proceedings of the 9th series of the I n t e r n a t i o n a l Conference "Analysis and Optimisation of Systems" gather more than 100 papers covering b o t h Theory and Applications in the field of Systems Analysis and Control (1 invited paper - 96 contributed papers - 10 papers presented at the two Invited Sessions).

This year again, we have noticed countries represented among the This vitality shows the complete continues to progress and allows promising results and applications

a large international participation through 18 speakers. maturity of this discipline which, regularly, scientists and engineers to expect the most for the future.

We would like to express our thanks to the Organizations which have sponsored this Conference : AFCET, CNRS, IEEE-CSS, IFAC, IIASA, INSERM, SEE, SIAM and SMAI.

We also would like to extend our gratitude to : the authors who have shown a real interest in this Conference the many reviewers who have accepted the hard task of selecting papers the chairpersons for having run with efficiency all the sessions of the Conference all the members of the Organization Committee the Scientific Secretaries who deserve a special mention the staff of the Public Relations Department for their excellent work i n making this Conference really happen. Professor M. THOMA and the Editor SPRINGER VERLAG who have accepted to publish this series in the Lecture Notes in Control and Information Sciences.

A. BENSOUSSAN

J.L. LIONS

THIS CONFERENCE IS UNDER THE SPONSORSHIP OF : AFCET

Association Franqaise pour la Cybern6tique Economiquc et Technique

CNRS

Centre National de la Recherche

IEEE-CSS

The Institute of Electrical and Electronics Enginecrs, Inc. Control System Society "Participating Societies"

IFAC

International Federation of

IIASA

International Institute for Applied

Scientifique

Automatic

Control

Systems

Analysis

INSERM

Institut National de la Sant6 et de la Recherche M6dicale

SEE SIAM

Soci6t6 des Electriciens et Electroniciens Society for Industrial and Applied Mathematics

SMAI

Socidt6 pour les Mathdmatiques Appliqu6es et Industrielles

ORGANIZATION COMMITTEE K.J. A. A. P. P. A. A. I.D. J.L. M. R. J.C.

ASTR6M BENSOUSSAN BENVENISTE BERNHARD FAURRE FOSSARD ISIDORI LANDAU LIONS THOMA VINTER WILLEMS

Lund Institute of Technology INRIA Universit6 Paris-Dauphine INRIA - IRISA - Rennes INRIA - Sophia-Antipolis SAGEM, Paris ENSAE, Toulouse Universita di Roma I.N.P.G., Grenoble Cotl6ge de France/CNES, Paris Technische Universit~t, Hannover Imperial College, London GrOningen U n i v e r s i t y

SCIENTIFIC SECRETARIES L. J.F. A.

BARATCHART BONNANS SULEM

INRIA INRIA INRIA

-

-

Sophia-Antipolis Rocqucncourt Rocquencourt

CONFERENCE SECRETARIAT Th. S. E.

BRICHETEAU GOSSET MANY

INRIA-France INRIA-France INRIA-France

AUTHORS

ABU EL ATA S AKIAN M. ANANTHARAM V. ANTOULAS A.C. AUBIN J.P. BAIRD C.R. BARATCHART L. BARBOT J.P. BARDI M. BEHAEGEL E. BENSOUSSAN D. BERGER W.A. BLANCHON G. BONGERS P.M.M. BONILLA E.M. BONNAN$ J.F. BOSE A.K. BRDYS M.A. BROGLIATO B. BULGAKOV A. YA. BYRNES C.I.

851 113 937 297 821 467 477 314 103 145 873 341 423 307 279 423 632 674 798 95 821-861

CANPONT D. CARVALHO M. TEIXEIRA. M. CHA I T.Y. CHANGYOU L. CHANG SHI-CHUNG CHANTRE P. CHAVENT G. CHEN CHUN-HUNG CHERNOUSKO F.L. COIC A. COMMAULT C.

37 900 684 985 133 851 452 133 580 85 t 361

DESCUSSE J. DE KEYSER R.M.C. DE LUCA A. DELFINI P. DI BENEDETrO M.D. DION J.M. DODU J.C. DONG CAO DULUC R.

288 77 833 623 843 361 423 755 549

EL-ANSARY M.

890

FALCONE M. FAYAZ M.A. FELIACHI A. FEVO"I'TE G. FLANDOLI F. FLIESS M. FLORCHINGER P . FONG I-KONG FRANKOWSKA H. GABASOV R. GAUBERT S. GAUVR1T M.

I 03 611 208 37 694 778-851 2 2 8 - 258 133 519 570 957 912

INDEX

GEORGES D. GOMES G. GOUZE J.L. GRIZZLE J.W. GRORUD A. GUGLIELMI M.

600 912 324 401-843 704 497

HAMAM Y. HAMMOURI H. HAN CHI-GEUN HANUS R. HARCAUT J.P HEJMO W. HEUBERGER P.S.C. HOANG NGOC MINH V. INSTALLE M. IOFFE A. ISIDORI A.

600 391 413 664 27 714 307 381 755 442 821-861

JACOB G. JODAR L.

381 87

KAMEN E.W. KARATZAS i. KEANE M.A. KINNAERT M. KIRILLOVA F.M. KOKOTOVIC P.V. KOWALEWSKI A. KOZA J.R. KULC"ZYCKIP.

655 3 47 880 570 861 174 47 714

LAKRORI M. LALL H.S. LAMNABHI-LAGARR1GUEF. LANARI L. LEBRET G. LEHOCZKY J.P. LEPSCH': A. LE GLAND F. LESCH K. LOBRY C. LOISEAU J.J.

37 123 611 833 37 1 3 218 228 590 623 37 1

MALABRE M. MARCHETTI C. MESSAGER F. MIAN G.A. MILHEIRO de OLIVEIRA P. MONACO S. MORAAL P.E. MORDUKHOVICH B.S. MYSLINSKI A.

279 187 778 218 198 788 401 539 164

NAVARRO E. NIKOUKHAH R. NORMAND-CYROT D.

87 67 788

OLIVI M. OTHMAN S.

477 391

OUSTALOUP A. OZVEREN C.M.

767 947

PARDALOS P.M, PENG S. PENG Y. PERCELL P.B. PEREZ A. PERRY R.J. POMET J.B. POMMARET J.F. POURTALLIER O. PRALY L.

413. 724 664 123 361 341 808 271 745 808

RENEKE J.A. ROCHA P. RUBIO J.E. RUNCHAL A.K.

632 332 154 145

SACHS G. SASTRY S.S. SHREVE S.E. SPIZZICHINO F.

590 861 3 238

TAKEUCHI Y. TALAY D. TAPIERO C.S. THEPOT J. TSIOTRAS G.

248 704 967 735 967

ULANICKI B. ULIVI G. URYAS'EV S.P.

674 833 432

VIARO U. VIGNERON C. V1NTER R.B.

218 549 529

WAGNEUR E. WAHNON E. WAYNE BEQUETTE B. WANG D. WANG P,K.C. WENBIN LIU WIELONSKY F. WILLEMS J.C. WILLSKY A.S. XU GAN-L1N XU HONG YAO.

977 487 57 600 145 154 477 297-332 947 3 467

YANG WE1LIN YE YINYU YONG J1ONGMIN YOU YUNCHENG

133-351-924 413 559 642

ZHANG SIY1NG ZI-QIANG LANG

351-924 507

TABLE OF CONTENTS

INVITED

CONFERENCE

Optimality conditions for utility maximization in an incomplete market I. K a r a t z a s , J.P. Lehoezky, S.E. Shreve, Gan-Lin

NUMERICAL

Xu

........................

ALGORITHMS

Nonlinear control of missiles through a geometric approach J.P. H a r c a u t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

27

Non linear control of a batch evaporative crystallization using an algorithm of "L/A" type G. Fevotte, D. Canpont, M. Lakrori . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

37

Genetic breeding of non-linear optimal control strategies for broom balancing J.R. Koza, M.A. Keane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

47

Process control using nonlinear programming techniques B. W a y n e Bequette . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

57

Construction of autonomous boundary-value linear systems from acausal input-output functions R. Nikoukhah ...............................................................................

67

PC-TACT : personal computer tool for advanced control techniques R.M.C. De Keyser . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

77

Solving singular regular systems A2X" + A1X' + AoX = F(t) without increasing the dimension of the problem L. J o d a r , E. Navarro . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

87

Matrix spectrum dichotomy and generalized Lyapunov matrix cquation A. Ya. Bulgakov . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

95

Discrete approximation of the minimal time function for systems with regular optimal trajectories M. Bardi, M. Faicone . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

103

Analyse de l'algorithme multigrille FMGH de rdsolution d'6quations d'Hamilton-Jacobi-B ellman M. Akian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

113

A dynamic programming based gas pipeline optimizer H.S. Lali, P.B. Percell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

123

Two effective approaches for hydroelectric generation scheduling

Shi-Chung I-Kong

Chang,

Fong,

Weilin

Chun-Hung

Yang, Chen

........................ . ..............................

133

VII

CONTROL OF DISTRIBUTED P A R A M E T E R S Y S T E M S Iterative solution of a free-boundary problem arising in microscopic particle manipulation inside a liquid layer P.K.C. W a n g , E. Behaegei, A.K. R u n c h a l .......................................... 145 Optimality conditions for elliptic variational inequalities Liu Wenbin, J.E. Rubio . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

154

Minimax shape optimization problem for Von Karman system A. M y s l i n s k i ..............................................................................

164

Optimality conditions for a parabolic time delay system A. Kowalewski . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

174

LINEAR

AND

NONLINEAR

FILTERING

A note about singular perturbations in the filtering of a Markov chain C. M a r e h e t t i . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

187

Filtres approch6s pour un probl~me de filtrage non lindaire discret avec petit bruit d'obscrvation P. M i l h e i r o De Oliveira . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

198

;.................

Decentralized filtering for interconnected systems A. F e l i a c h i . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

208

Split forms of z-domain algorithms for linear prediction and stability analysis A. Lepschy, G.A. Mian, U. Viaro . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

218

Time-discretization of the Zakai equation for diffusion processes observed in correlated noise P. F l o r c h i n g e r , F. Le Gland . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

228

Finite-dimensional stochastic filtering in discrete time : the role of convolution semigroups F. S p i z z i c h i n o . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

238

On the decompositions of observations with non-Gaussian additive noise and their innovations processes Y. T a k e u c h i . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

248

Nonlinear filtering with dependent noises. The case of unbounded coefficients P. F l o r c h i n g e r . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

258

VIII A L G E B R A I C AND

GEOMETRIC S Y S T E M T H E O R Y

Group theory and controllability of partial differential control systems J.F. Pommaret ..............................................................................

271

Algebraic characterization of invariant zeros at infinity for generalized systems E.M. Bonilla, M. Malabre . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279 The dynamic block decoupling problem : a minimal solution by precompensation J. Descusse . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

288

Minimal rational interpolation and Prony's method A.C. Antoulas, J.C. Willems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

297

Discrete normalized coprime factorization P.M.M. Bongers, P.S.C. Heuberger

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307

A forward accessibility algorithm for nonlinear discrete time systems J.P. B a r b o t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

314

Stabilisation globale de syst~mes non-lin6aires par un contr61e positif J.L. Gouz~ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

324

Controllability of delay-differential systems P. Rocha, J.C. Wiilems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

332

Geometric approach to parametric sensitivity and gain suppression W.A. Berger, R.J. Perry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

341

A kind of nonlinear system and its reduction and structural control Weilin Yang, Siying Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

351

Rejet de perturbation dans les syst~mes structur6s C. Commault, J.M. Dion, A. Perez . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

361

A new canonical form for descriptor systems with outputs J.J. Loiseau, G. Lebret . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 371 Evaluation transform and symbolic calculus for nonlinear control systems V. Hoang Ngoc Minh, G. Jacob . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

381

Immersion in infinite dimension H. Hammouri, S. Othman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

391

On observers for smooth nonlinear digital systems J . W . Grizzle, P.E. Moraal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

401

NONLINEAR

PROGRAMMING

An interior-point algorithm for large-scale quadratic problems with box constraints P.M. P a r d a l o s , Yinyu Ye, Chi-Geun Han . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

413

IX Optimisation des r6seaux 61ectriques de grande taille G. Bianchon, J.C. Dodu, J.F. B o n n a n s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

423

Adaptive variable metric methods for nondifferentiable optimization problems S.P. U r y a s ' e v . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

432

Composite optimization : second order conditions, value functions and sensitivity A. Ioffe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

442

A new sufficient condition for the well-posedness of non-linear least square problems arising in identification and control G. Chavent . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .; . . . . . . . . . . . .

452

SIGNAL PROCESSING An identification technique for adaptive systems in the case of poor excitation Hong Yao Xu, C.R. Baird . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

467

Asymptotic properties in rational L2-approximation L. Baratchart, M. Olivi~ F. Wielonsky . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477 A min-max testing approach to failure detection and identification E. Wahnon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

487

On-line detection of minimal order for linear pieccwise stationary systems M. Guglielmi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

497

A simple method for identification of linear dynamics with hysteresis nonlinear input Lang Zi-Qiang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

507

DETERMINISTIC

CONTROL

A priori estimates for operational differential inclusions and necessary conditions for optimality H. F r a n k o w s k a . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

519

Relaxed controls, for time delay systems R.B. Vinter ..................................................................................

529

Maximum principle for nonconvex finite difference control systems B.S. M o r d u k h o v i e h . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

539

Functional R. Duluc,

viability constraints C. Vigneron . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

549

Maximum principle of optimal controls for a nonsmooth semilinear evolution system J i o n g m i n Yong . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

559

X New algorithms of solving extremal problems R. Gabasov, F.M. Kirillova .............................................................

570

Constrained controls in linear oscillating systems F.L. Chernousko ...........................................................................

580

Fuel savings by optimal aircraft cruise with singular and chattering control G. Sachs, K. Lesch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

590

Optimal path planning of manipulators with singular configurations, workspace and collision-free constraints D. Wang, D. Georges, Y. Hamam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

600

On singular output tracking in multivariable nonlinear systems M.A. F a y a z , F. L a m n a b h i - L a g a r r i g u e . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

611

CONTROLLABILITY AND STABILIZATION DISTRIBUTED PARAMETER SYSTEMS

OF

Formal controllability and physical controllability of linear systems C. L o b r y , P. Deifini . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

623

Conditions yielding weak controllability for a class of linear hereditary systems J.A. Reneke, A.K. Bose . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

632

Nonlinear exponential stabilization of Boussinesq equations Y u n e h e n g You . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

642

STOCHASTIC

AND

ADAPTIVE

CONTROL

Direct adaptive control in a state-space setting E.W. K a m e n . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

655

Pole placement via generalized predictive control Y. Peng, R. H a n u s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

664

Separation principle in optimizing control of state-constrained dynamical systems under bounded uncertainty M.A. Brdys, B. Ulanicki ................................................................

674

Unification of some MIMO adaptive control algorithms and global convergence analysis T.Y. Chai .....................................................................................

684

Boundary control of a stochastic parabolic equation with nonsmooth final cost F. Flandoli ..................................................................................

694

XI Approximation of Lyapunov exponents of stochastic differential systems on compact manifolds A. Grorud, D. Talay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 704 Synthesis of closed-loop system controlling a random object Hejmo, P. Kuiczycki . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 714

W.

Maximum principle for stochastic optimal control with non convex control domain S. Peng . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

MULTIDECISION

CONTROL AND

724

GAMES

Nash vs Stackelberg strategies in a capital accumulation game J. Th~pot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

735

A two player dynamical game with imperfect information O. P o u r t a l l i e r .............................................................................

745

An interactive multiple criteria decision supporting tool with application to a simplified regional development problem Cao Dong, M. Installe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

755

CONTROL OF FINITE D I M E N S I O N A L S Y S T E M S CRONE control : principle synthesis, performances with non-linearities and robustness-input immunity dilemma A. O u s t a l o u p . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Vers une

767

stabilisation non-lingaire discontinue

M. Fliess, F. Messager . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

778

A combinatorial approach of the nonlinear sampling problem S. Monaco, D. N o r m a n d - C y r o t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

788

Non-linear MRAS in robots motion control B. Brogliato .................................................................................

798

Adaptive control of feedback equivalent systems J.B. Pomet, L. Praly . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

808

NONLINEAR

ZERO DYNAMICS

AND

OUTPUT R E G U L A T I O N

Viability kernels, controlled invariance and zero dynamics for nonlinear systems J.P. Aubin, C.I. Byrnes, A. Isidori . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

821

Output regulation of a flexible robot arm A. De Luca, L. Lanari, G. Ulivi . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

833

An analysis of regularity conditions in nonlinear synthesis M.D. Di Benedetto, J.W. Grizzle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

843

XII Discontinuous predictive control, inversion and singularities. Application to a heat exchanger M. Fliess, P. C h a n t r e , S. Abu El Ata, A. Coic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

851

The analysis of singularly perturbed zero dynamics A. I s i d o r i , P.V. Kokotovic, S.S. Sastry, C.I. Byrnes,

861

........................

ROB U S T N E S S Elargissement des objectifs de robustesse des syst~mes de commande D. B e n s o u s s a n . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

873

Right half plane poles and zeros and robustness limitations in feedback systems M. Kinnaert .................................................................................

880

Stabilizing control of a singularly perturbed system driven by wide-band noises M. E I - A n s a r y . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

890

Control o f uncertain dynamical systems using strictly positive real systems M. Carvalho Minhoto Teixeira . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 900 Parameter robust control design based on parametric identification quality M. Gauvrit, G. Gomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

912

Rotation symmetric structure in control systems and related stability analysis Weilin Yang, Siying Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

924

DISCRETE EVENTS SYSTEMS A hydrodynamic limit for a lattice caricature of dynamic routing in circuit switched networks V. A n a n t h a r a m ............................................................................

937

Aggregation and multi-level control in discrete event dynamic systems C.M. O z v e r e n , A.S. Willsky . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

947

An algebraic method for optimizing resources in timed event graphs S. G a u b e r t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

957

WIP and CSP-1 quality control in a two-stages queue like production C.S. T a p i e r o , G. Tsiot~ras . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

967

Modulo'fds and pseudomodules : 3 - the lattice structure problem E. Wagneur ..................................................................................

977

Modelling and analysing a class of flexible manufacturing system L. C h a n g y o u . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

985

I N V I T E D CONFERENCE

OPTIMALITY CONDITIONS F O I l UTILITY MAXIMIZATION IN AN INCOMPLETE MAltKET T Ioannis Karatzas Columbia University New York, NY 10027 **

John P. Lehoczky** Steven E. Shreve *#

Gan-Lin Xu Carnegie Mellon University Pittsburgh, PA 15213

ABSTRACT The problem of maximizing expected utility of final wealth in an incomplete market is investigatsd~ The incomplete market is modelled by a bond and a finite number of stocks, the latter being driven by a d--dimensional Brownian motion. The coefficients of the bond and stock price processes are adapted to this Brownian motion, and the number of stocks is less than or equal to the dimension of the driving Brownian motion. It is shown that there is a way to "fictitiously" complete this market so that the optimal po~tfoho for the resulting completed market coincides with the optimal portfolio for the original incomplete maxket. A number of equivalent characterizations of the fictitious completion are given, and examples are provided. SECTION 1. INTj~.ODUCTION This paper utudies the problem of an agent who receives a deterministic initial capital, and must invest it in an incomplete market so as to maximize the expected utihty from wealth at a prespecified terminal time. The market consists of a bond and m stocks, and stock prices are driven by a d---dimensional Brownian motion. Incompleteness in this market arises when m is strictly smaller than d. The market coefficients, i.e., the interest rate, the rates of stock appreciation, and the stock volatihty coefficients, are random processes adapted to the full

Research supported by the National Science Foundation under Grant DMS--87-2307g. Research supported by the National Science Foundation under Grant DMS-87-02537. Vpresented at the gth Intsr'natwnal Conference on Analysis and Optimization of Systems (Antibes, June 1990), and at the MS[ Workshop on the Mathematical Theory o/Modern Financial Markets (Cornell University, July 1989).

d---dimensionat Brownian motion. When m < d, it is impossible to construct a portfolio consisting of the bond and the m available stocks so as to completely hedge the risk associated with these coefficient processes. In Sections 2 through 5, we define the utility maximization problem faced by the agent. In Section 6 we present the solution when the market is complete (m = d), and complete hedging is possible. This solution proceeds in three steps. First, on the underlying probability space one determines a new measure P* which discounts the growth inherent in the market; under this measure, the expected value of the final wealth attained by any "reasonable" portfolio is equal to the initial capital. Secondly, among all random variables whose expectation under the new measure is equal to the initial endowment, a most desirable one is determined. Thirdly, it is shown that a portfolio can be constructed which obtains this most desirable random variable as terminal wealth; this portfolio is optimal. A complete market is one in which the agent can construct a portfolio which attains as terminal wealth any random variable whose P * - expectation is equal to the initial capital. Because such a construction is possible, it is said that the agent can hedge against the risk associated with this market. Mathematically, the construction of a portfolio uses the fact that any martingale with respect to a Brownian filtration can be represented as a stochastic integral with respect to the Brownian motion; the integrand in this representation leads directly to the portfolio that we are seeking, if m=d. However, if there are fewer than d stocks, (m < d) this line of argument fails. In Section 7 we introduce a convenient way of thinking about an incomplete market: fictitious completion. When m < d, then one introduces new, fictitious stocks so as to create a complete market. If these fictitious stocks have a high appreciation rate, then under an optimal portfolio the agent wiU hold a long position in them, but if they have a low (even negative) appreciation rate, then he will hold a short position. Thus one would expect to be able to adjust the appreciation rates of the fictitious stocks so that the agent, by optimal choice, does not invest in them at all. These judiciously chosen fictitious stocks allow us to write down the complete market solution for the utility maximization problem, but they are superfluous in the actual implementation of the optimal portfolio which must then also be optimal for the original incomplete market. The fictitious completion with the above property is the least favorable to the agent, because the portfolio which is optimal under this completion is available to him under every other fictitious completion. We thus have the notion of a ~inimax fictitious completion: for every fictitious completion we compute the portfolio which maximizes the expected utility of final wealth, and then choose the completion that makes this maximum expected utility as small as possible. As explained in Section 7, a convenient way to parametrize fictitious completions of an incomplete market is by a certain space of continuous local martingales, each of them being the Radon-Nikodym derivative process of the new measure P*, mentioned in the discussion of complete markets. One would like to be able to characterize the local martingale corresponding to the minimax fictitious completion, and to show that it gives rise to an optimal portfolio in the original incomplete problem. This program is carried out in Section 8. Section 9 provides two examples in which the minimax fictitious completion can be computed fairly explicitly. In the first example, the utility function is logarithmic, and it is discovered that the fictitious stocks in the minimax completion should have rates of appreciation

equal to the interest rate of the bond. This is a very general result, insensitive to the nature of the dependence of market coefficients on the driving Brownian motion. In the second example, it is assumed that the utility function is of the power form, and that the driving Brownian motion splits into two independent parts; the first part drives the stock price processes, whose coefficients are adapted solely to the second part. The minimax local martingale is exhibited as the solution to a martingale representation problem, and the optimal portfolio is found to be given by the formula already known to be correct for deterministic model coefficients. Our model for the financial market can be traced back to Merton (1969, 1971) and Samuelson (1969). The modern mathematical approach to portfolio management in complete markets, built around the ideas of equivalent martingale measures and the creation of portfolios from martingale representation theorems, began with Harrison & Kreps (1979) and was further developed by Harrison & Pliska (1981, 1983), in the context of option pricing. Phska (1986), Cox & Huang (1987a, 1987b) and Karatzas, Lehoczky & Shreve (1987) adapted the martingale ideas to problems of utility maximization. Much of this development appears in Section 5.8 of Karatzas & Shreve (1988), from which Section 6 of the present paper is drawn; see also the review article of Karatzas (1989) for a survey of financial economics problems in complete markets. A first step toward a martingale analysis of incomplete markets was taken by Pages (1987), who considered a Brownian model in which the number of stocks was strictly less than the dimension of the driving Brownian motion. However, the coefficients of the bond and stock prices in this model were allowed to depend on the underlying Brownian model only through the bond and stock prices themselves. Thus, the vector of bond and stock prices formed a Markov process. This specialization created an essentially complete market, and thus it avoided the more interesting case of a market with genuinely unhedgeable risk. However, Pages did characterize the class of equivalent martingale measures which could arise in an incomplete model, and this laid the groundwork for further developments. A more substantive step was taken by He & Pearson (1988) in a discrete-time, finite probability space model, where the authors proposed finding the optimal intermediate consumption and terminal wealth corresponding to each of the equivalent martingale measures, and then searching over those policies to find a pair yielding the minimum expected total utility. Using separating hyperplane arguments, they were able to show that the total utility obtained by this two-step "minimax" process is the optimal total value for the incomplete problem. He & Pearson have also studied the incomplete problem in a continuous-time, Brownian model. In an early version of He & Pearson (1989), the authors consider Pag~s's characterization of the family of equivalent martingales measures and search over this family for a "minimax" equivalent martingale measure which would lead them to the optimal consumption and portfolio processes just as in a complete market. The martingale associated with this measure would be the "Arrow-Debreu" state prices in the incomplete model. However, the continuous-time model is more subtle than one might expect, and although it is now clear that Azrow-Debreu state prices exist for the incomplete model under some assumptions, it is not clear that they are associated with a martingale. The present paper uses local martingales rather than martingales to address the issue of market incompleteness in continuous-time models. This work was motivated by the aforementioned previous version of He & Pearson (1989), and by the use of local martingale

methods introduced by Xu (1990) and Xu & Shreve (1990) in the study of incompleteness induced by a prohibition on the short--selling of stocks. Using the stochastic duality theory of Bismut (1973), Xu formulated a dual problem whose solution could be shown to exist and could then be used to obtain existence and characterization of the solution of the original problem. As this paper shows, such duality methods can also be used in the traditional incomplete Brownian market model. While we still do not know if the minimax equivalent martingale measure sought by He & Pearson exists in any generality, we show here that the solution to Bismut's dual problem is a "least favorable local martingale" which can be used to generate a sequence of equivalent measures. The existence of this least favorable local martingale is sufficient for the study of many models. A notable exception is the incomplete model in which the agent's endowment is a stochastic process; we do not know how to obtain existence and characterization of the optimal policy for such a model in terms of a least favorable local martingale, unless it is actually a martingale. tie & Pearson (1989) have incorporated Xu's local martingale techniques into their original work. He & Pearson (1989) report the existence of an optimal portfolio for the terminal wealth utility maximization problem when the index of relative risk aversion is everywhere greater than or equal to one, and they report similar results for the problem with intermediate consumption and consumption at the terminal time when the index of relative risk aversion is everywhere less than or equal to one. Our paper deals only with the case of terminal wealth utility maximization when the index of relative risk aversion is everywhere less than or equal to one; the generalization to also allow for intermediate consumption is straight-forward. Whereas He & Pearson (1989) assume that some augmentation of the market model will result in Markov prices, we allow general It5 price processes. He & Pearson (1989) do not provide the full set of equivalent conditions contained in our Theorem 8.4, nor do they use our assumption (8.2), which plays a fundamental role in our proof of Theorem 8.4. For a fuller account of this theory, and for proofs of the results which are not included hcre, we refer the interested reader to the paper Karatzas, Lehoczky, Shreve & Xu (1989). SECTION 2. 3?tIE MARKET MODEL We assume a model for the financial market consisting of one bond with price P0(t) given by (2.1)

dP0(t) = r(t)P0(t)dt,

P0(0) -- 1,

and m stocks with prices-per--share Pi(t), i = 1,...,m, satisfying the equations

(2.2)

d dPi(t) = Pi(t)[bi(t)dt + Z aij(t)dW(J)(t)],

j=1

i = 1,...,m.

ttere W = r~(W~U,...,W~dj)* _ t~ is a d-dimensional Brownian motion on a probabihty space ( f l , J , P ) , and we denote by {.~t} the P-augmentation of the filtration generated by W. We suppose that d > m, i.e., the number of stocks does not exceed the number of sources of uncertainty in the model.

7 The interest rate r(t), the vector bit ) = (bl(t),...,b~(t)) of stock appreciation rates, and the volatility matrix act ) = {crij(t)}l< m are the coefficients of the model. They are assumed to l< j< d be progressively measurable with respect to {~'t}, and it is also assumed that r(.) and o(.) are bounded uniformly in (t,w), and that /*

(2.3)

T

[ J

J]b(t)J[dt 0, let X x'~r denote the wealth process corresponding to a portfolio ~r defined by Xxjr(0) = x and (3.2)

dXX'~r(t) = r(t)xX'Tr(t)dt + xX'~r(t)Tr*(t)[(b(t) - r(t)l)dt + a(t)dW(t)]

= r(t)Xx3r(t)dt + xx'~r(t)~r*(t)a(t)dW0(t). In other words,

(3.3)

~rCs)a(s)dWo(s)+ 0

I t ( s ) - 8 9 I1~ * (s)~s)ll2]ds} '

SECTION 4. UTILITYFUNCTIONS We introduce a utility function U: (0,~) -~ ~, which is strictly increasing, strictly concave, twice continuously differentiable, and satisfies (4.1)

U'(0)-A l i m U ' ( x ) = | x~ 0

U'(|

limU'(x)=0. x-~ |

We denote by I: (0,| -~ (0,~) the (continuous, strictly decreasing) inverse of U ' , and consistent with (4.1) we note that (4.2)

I(~) A lira I(y) = 0, I(0) ~ lira I(y) = ~. y-~| y~ 0

We introduce the Legendre transform of U: (4.3)

Off) _Am i n [xy - U(x)] = yI(y) - U(I(y)), x>0

0 < y < ~.

The function lJ is strictly increasing, strictly concave and satisfies (4.4)

U'(y) = I(y),

0 < y < ~.

It is easily verified that (4.5)

U ( x ) = m i n { x y - lJ(y)} = x U ' ( x ) - 0 ( U ' ( x ) ) ,

y>0 From (4.3) and (4.5) we have the useful inequalities

0 < x < ~.

9 (4.6) (4.T)

u(I(y)) _> u(~) + y[I(y) - x], v x > 0, y > 0, 0 ( u ' ( x ) ) _> 0(y) + x[U' (x) - y], v x > 0, y > 0.

SECTION 5. THE UTILITY MAXIMIZATION P R O B L E M The problem at hand is, for a given initial wealth x > 0, to choose a portfolio a" that maximizes E u ( x X ' r ( T ) ) . If U(0) > - | this quantity is defined (but may be | for all portfolios r, whereas it may be undefined if U(0) = - | and U(co) = |

Thus, we define ,/#(x) to be the set

of portfolios ~r for which E max{0,-u(xX'Tr(T))) < |

(5.1) and set

V(x) =a

(5.2)

sup n~(x)

E u(xX'rr(T)).

We assume throughout that

(5.3)

v(,:) < |

v x > 0.

A portfolio process rr which attains the supremum in (5.2) is called optimal. In Section 8 we give conditions which ensure the existence of an optimal portfolio, and provide several characterizations for it. SECTION 6. THE COMPLETE M A R K E T SOLUTION The market model of Section 2 is complete if and only if a(t) is invertible for all t, i.e., d = m. Under this condition, the solution of the utility maximization problem has a simple solution, which we present below. Define the nonnegative local martingale

(6.1)

it , 1 I t z0(t ) =A e~p { - 0 O (s)dW(s) - , ~ 0 IIO(s)ll2ds},

a~d note from (3.3) that for any portfolio process ~r, flCt)Z0(t)xX'~(t) = x e~p { f t

( / C s) ~(s) - 0* (s))dWCs)- 21 f t

0

0

II~ . (s)~Cs) - 0(s)ll 2 ds}

is a nonnegative local martingale, hence a supcrmartingale. It follows that

10 (8.2)

E [Z(T) Z0(T ) xX'r(T)] 0,

so we may define a function ~o: (0,| -~ (0,~) by (6.4)

~o(Y) = E[~(T)Z0(T)I(y~(T)Z0(T))].

The function ~$0 inherits from I the property of being a continuous, strictlydecreasing mapping

of (0,| onto itself, and so ~'0 has a (continuous, strictly decreasing) inverse ]/o from (0,| itself. We define (6.5)

onto

~ a= I(]/o(x)fl(T)Z0(T)).

6.1 LSMMA. The random variable ~ satisfies (6.6)

E[fl(T)Z0(T)~] = x,

(6.7)

E ma~{0,-U(~)}
0

by analogy with (6.4). If

(7.7)

~V(y) < |

we may define ~ v to be the inverse of ~ ,

v y > 0,

and set

~x =~ I(~/~(x)fl(T)Zv(T))

(7.8)

by analogy with (6.5) If the fictitious stocks introduced in this section were really available, then EU(~,) would be the maximal expected utility of final wealth (Theorem 6.2). Since these stocks are not available, we have

(7.0)

v(x) ~= s u p

EU(XX'~(T)) _<E u ( c ) ,

and equality holds if there exists a portfolio process r such that

(7.10)

Xx'~(T) = C'

i.e., if the terminal wealth ~

can be financed without investment in the fictitious stocks. In light

of (7.9), such a z would be optimal for the problem of utility maximization in the incomplete market. In the next section, we discuss properties which 7r and v must have in order that they be related by (7.10). SECTION 8. ~

T I

0

CONDITION~ FOR OPTIMALITY IN THE INCOMPLETE

Let us define L2[0,T] to be the set of {~'(t)}-adapted ~d--valued processes r satisfying H~t)[I2dt < ~, a.s. We decompose L2[fl,T] into the orthogonal subspaces

15 K(c) A={v E L2[0,Tll a(t)u(t) = 0 V t E [0,T], a.s. P},

K~(a) ~ {~ e 1,2[0,T]I ~(t) e l~nge(a*(t)) V t e [0,T], a.s. P}. Note that the process # of (2.4) belongs to K• whereas the process u of (7.4) belongs to K(r On the other hand, if v E K((~) is given, then (7.4) can be solved for the fictitious stock appreciation rate process a. Thus, K(c) provides a parametrization space for fictitious completious of the incomplete market. We define KI(a) A_ {g E K(cr); v satisfies (7.7)}, and we recall definition (7.8). Then

E[#(T)

(8.1)

Z (t)r

= x.

We shall assume on occasion that the function I(.) has the property

(8.~.)

I ( ~ ) _ o

for some a E (0,1) and fl E(1,m). One consequence of (8.2) is that if JFv(y ) < m for some ~, E K(r and some y > 0, then ,.~ (y) < | for a]l y > 0, so that v E Kl(g). Let x > 0 and ~r E ,A(x) be given, and consider the statement that ~- is optimal for the incomplete market utility maximization problem: (A) Ootimalitv of ~r: Eu(xX,~'(T)) < Eu(xX,~(T)), V ~r E ~4(x). We shall characterize condition (A) with the help of the following conditions. For A E KI(c), consider the statements: (B) Finaneiliby of ~

There exists a protfolio ~r E ~ ( x ) such that Xx'~(T) = ~ ~, a.s.;

(C) Minimality of A: E U ( ~ ) < EU(~)

V L, E gl((r);,

(D) Dual optimality of A: For all v E K~((~), E0(~A(x)fl(T)Z~,(T)) < E0(~/A(x)fl(T)ZA(T)); (E) Parsimony of A: E[~7(T)Zy(T)~] _<x,

V ~, E Kl(a).

The relations among these conditions are expounded in the following two theorems. In the interest of brevity, we omit the lengthy proof of Theorem 8.1. The examples in Section 9 use only Theorem 8.2, whose proof we provide. 8.1 THEOREM. Assume that U(0) > ---m. Then conditions (B)--(E) are equivalent and imply (A) with the same ~- as in (B). Conversely, suppose that ~"E ..4(x) satisfies (A) and (8.2) holds; then there exists

A E Kl(r for which (B)--(E) hold.

The assumption U(0) > --m in Theorem 8.1 rules out the important special case of a logarithmic utility function. In the absence of this assumption, the following weaker rcsult can be established.

16 8.2 Tazo~r.M. Conditions (B) and (E) are equivalent, and imply both (C) and (A) with the same ~r as in (B). PIt00F: (B) ~ (E): The inequality E[fl(T)Zv(T)XX'~(T)] < x follows from (6.2) applied to the fictiously completed market corresponding to v. (B) v CA): The argument for this is given in the last paragraph of Section 7. (B) ~ (C): Under (B), we have (A), which means that V(x) = E U ( ~ ) . Now use

(7.9). (E) ~ (B): If ~r can be found for which xX'~r(T) = ~ , then the proof of (6.7) can be applied to show that ~r c M(x). Assume (E), and consider the Brownian martingale M(T) A=E[fl(T)ZA(T)~[~t]. From (8.1) we have M(0) = x. Imitating the argument preceding Theorem 6.2, we produce r e L2[0,T] such that t

M(t) = x + fl(T)~ =

= x +

f0

. r (s)dW(s),

- ' 3 . + M t O*.t + A 0

Decompose r as r = r + r where r E K• A E K((r), we may simplify the above equation to r 1 /~(T)~ -- x + J0 " g ~

+ IT ~ 0

(t))]

9 [dW(t) + O(t)dt + ,~(t)dt].

and r E K(cr). Because O E K• r) and

* * (~b'(t) + M(t)0 (t))dWo(t)

[r .

+ M(t) A. (t)][dW(t) + 0(t)dt + A(t)dt].

We will show that (8.3)

r

+ M(t)A(t) = 0 V t E [0,T], a.s.

We will then define

s

~

~t 1 * [x + 0 Z ~ -o(r

* + M(s)0 (s))dW0(s)].

Because r + M(t)0(t) e Kx(c), this d-dimensional vector can be written as r = ~ (t)a(t) for some m-dimensional a. Indeed,

+ M(t)0(t)

17 (8.4)

a(t) -- (a(t)a (t))'la(t)(r

§ M(t)0(t)).

We can define

~(t) A

(8.5)

~(t) f~(t)zA(t)x(t)

and it is then easily verified that 2(0) = x, :Xi(T) = ~ ,

'

and d•

= r(t)~:(t)dt +

X(t)~*(t)a(t)dWo(t) (cf. (3.2)). In other words, once we prove (8.3), then (8.4), (8.5) will define ~" e ..4(x) such that (B) holds. In order to establish (8.3), consider an arbitrary u E K((r) and introduce the sequence of stopping times {vn}:: l given by

rn = T h inf {t E [O,T]; M(t) + I fi~*(s)dW(s)l +

I t (llr

~ + IIr

0

+ IIA(s)ll ~ + 114s)ll~)ds _>n}. Denote un(t) A y(t) l[0,vn](t). Then for e E (-1,1),

rthrn , (8.6) ZA%eun(t) -- ZA(t)exp{- eJ0

e2 [tA~'n

u (s)(dW(s) + A(s)ds) - - 2 - j0

e--nlel < ZA§

(8.7)

ZA(T)

H4s)l[2ds},

(T) < en] el -

9

Because of (8.7) and the fact that A e K,(cr), we have A + e Vn e K,(~). It follows from (8.6), (8.7), the Dominated Convergence Theorem, and condition (E) that

(..8)

o = ~ E I~(T) Z~§176

;-E[~(T)Za(T) ~i

e=0 = ~E~(T) ~ ~ Z~§176

~=0

I?",, (s)(dW(s) + a(s)ds)] =-E[M(T~) s?", (s)(dW(s) + a(s)ds)].

According to ItS's rule, (8.9) M(rn) 0 u (s)(dW(s) + A(s)ds) =

v (t)(M(t)A(t) + r

+

(t)dW(t)

18 gTn

+ J|0

Ft ,

*

M(t)[|0uj(s)(dW(s) + A(s)ds)](C~(t)

*

§ r

From the definition of rn, we see that the two stochastic integrals on the right-hand side of (8.9) have expectation zero, so (8.8) implies that E is arbitrary, [M(t)A(t) +

r

u (t)(M(t)A(t) + r

= 0. Since u e K(cr)

= o. Letting n -~ | we obtain (8.3). []

8.3 Remark. Using Theorem 8.2, one can prove Theorem 8.1 by establishing the implications CA) (B), (C) v (D) and (D) v (E). In each of these, the hypothesis is a statement of optimality, and in each case, the proof of the implication is a variational argument similar to that just used for the implication (E) ~ (B). We have the following result concerning the existence of A E Kl(g) which satisfies (D), and by inference, the existence of an optimal portfolio process. 8.4 TlfE01tBM. Assume that (8.2) and U(0) > - co hold, as well as T

(i)

E I llO(t)ll~dt ) the second argument and returns -1 otherwise. The use of real-valued logic allows arbitrary composition of functions in the function set. The set of atoms for this problem consisted of velocity v, angle 0 , and angular velocity co. In addition, the set of atoms contained a smattering of random constants between -1.0 and +1.0. When a particular control slrategy (i.e. LISP S-expression) evaluates to any positive value for particular values of state variables v, 0 , and co at a particular time step, the force F is applied from the positive direction. Otherwise, the force is applied from the negative direction. Note that the control strategy can be viewed as defining a surface that separates the state space into parts. The environment consists of 10 randomly generated initial starting condition cases. Position x is chosen between -0.5 and +0.5 meters. Velocity v is chosen between -0.5 and +0.5 meters/ second. Angle 0 is chosen between -0.5 radians and +0.5 radians. Angular velocity co is chosen between -0.5 and +0.5 radians/second. We do not know of an analytic solution to this three dimensional broom balancing problem. However, an approximate optimal control strategy for a conventional linear approximation to

52 the solution has been computed by Keane. This analytic solution to the linear approximation calls for application of the positive force when co >if2, where A B (Sinh K(t + z ) - 2Sinh Kt)

I2=

K

and, otherwise, a negative force. Here t=z+

vA c

and

where the quantity A is computed as follows:

If

( v / C ) 2 ' then A = Sign O

>--- T --

Otherwise A : Sign (v/C). D, B, C, K are computed as follows: D= ~[4 B=

mc me +mp -F

D[m c + m,] C=

F [mo+mp]

K:~/D Since this solution to the linear approximation to the problem involved hyperbolic functions, we replaced the hyperbolic sine function by terms (up to the cubic level) from the Taylor series expansion to obtain an approximation to this approximation. The resulting "pseudo optimal strategy" serves as a benchmark for the genetically produced results below. Note that this

B3 "pseudo optimal strategy" is not used by the genetic algorithm in any way. Note also that the state transitions of the systems are controlled by Anderson's equations for 9 (t) and a(t) above. Time was discretized into 400 time steps of .02 seconds. The total time available before the system "times out" for a given control strategy is thus 8 seconds. If the square root of the sum of the squares of the velocity v, angle 0 , and angular velocity to (i.e. the "norm") is less than 0.07 (the "criterion"), the system is considered to have arrived at its target state. If a particular control strategy brings the system to the target state for a particular initial starting condition case in the environment, its fitness for that environmental case is the time required (in seconds). If a control strategy fails to bring the system to the target state before it "times out", its fitness for that environmental case is set to 8 seconds. The "fitness" of a control strategy is the average time for the strategy over all 10 initial starting condition cases in the environment. The process starts with the random generation of 300 random control strategies recursively composed from the available functions and atoms above. Examples of such random control strategies are linear control strategies such as v + 0 , co 0",/'~.57,and 0 > co and non-linear control strategies such as 02 + to. and 0 to31vl. As can be seen, the initial population of random control strategies in generation 0 includes many highly unfit control strategies, including strategies that totally ignore the state variables, strategies that repetitively apply the force in only one direction, strategies that are correct only for a particular few specific starting condition cases in the environment, strategies that are totally counter-productive, and strategies that cause wild oscillations and meaningless gyrations. In one run, the average time consumed by the initial random strategies in the initial random population averaged 7.79 seconds. In fact, many of these 300 random individuals "timed out" at 8 seconds for all 10 of the environmental cases (and very likely would have timed out even if more time had been available). However, even in this highly unfit initial random population, some control strategies are somewhat better than others. The best single control strategy for the initial random generation was the simple non-linear control strategy v 0 , which averaged 6.55 seconds. This control strategy correctly handled two (of the simpler cases) environmental cases and timed out for 8 of the environmental cases. Note that this control strategy is partially blind in that it does not even consider the state variable to in specifying how to apply the "bang bang" force. Nonetheless, in the land of the blind, the one-eyed man is king.

54 The genetic crossover operation is then applied to parents from the current generation selected with probabilities proportionate to fitness to breed a new population of offspring control strategies. Although the vast majority of the new offspring control strategies are again highly unfit, some of them tend to be somewhat more fit than others. Moreover, some of them are slightly better than those that came before. For example, the average population fitness improved from 7.79 to 7.78, 7.74, 7.73, 7.70, 7.69, 7.66, 7.63, 7.62 seconds per environmental case in generations 1 through 8, respectively, This generally improving (but not necessarily monotonic) trend in average fitness is typical of genetic algorithms. In generation 6, the best single individual was the non-linear control strategy t9 > v 3 I 0 + v I. It required an average of 6.45 seconds. In generation 7, the best single individual in the population correctly handled (i.e. did not time out) 4 of the 10 environmental cases. It required an average of 5.72 seconds. In generation 8, the best single individual was the non-linear control strategy [0 + m + 2.762v + 03 ]3. It required an average of only 2.56 seconds. This individual is not partially blind and considers all three state variables. It correctly handles 10 of the 10 environmental cases. Note that the number of correctly handled environmental cases ("hits") is a statistic that we find useful for monitoring runs, but this statistic is not used by the genetic algorithm. In generation 14, the average time required by the best single individual in the population dropped below 2 seconds for the f'trst time. In particular, the non-linear control strategy 0 + co + v + v 2 [ co - 0.734] 2 > v 0)2 required an average of only 1.74 seconds. It, too, correctly handled all 10 of the environmental cases. Two histograms provide a graphical picture of the learning of the population from generation to generation. The "hits histogram" shows the number of individuals in the population that correctly handle a particular number of environmental cases. The "fitness histogram" shows the number of individuals in the population with whose fitness value lies in a particular docile of

55 the fitness values. Both of these histograms have an undulating left-to-right "slinky-like" motion as the population progressively learns. In generation 33, the best single individual in the population was the non-linear control strategy 3 v + 3 0 + 3o~+v 0 2 > v r After its discovery, this single best control strategy was retested on 1000 additional random environmental starting condition points. It performed in an average of 1.76 seconds. In another test, it averaged 3.20 seconds on the 8 comers of the cube, but could not handle 2 of the 8 comers of the cube. The benchmark pseudo optimum strategy averaged 1.85 seconds over the 1000 random environmental starting condition cases in the retest. It averaged 2.96 seconds for the 8 comers of the cube. It was unable to handle the 2 worst comers of the cube. These results (in seconds) are summarized in the table below: P E R F O R M A N C E FOR THREE DIMENSIONAL BROOM BALANCING PROBLEM CONTROL STRATEGY

WORST 2 CORNERS

1000 POINTS

Benchmark Pseudo Optimum

1.85

2.96

Infinite

3v+3 0+3r

1.76

3.20

Infinite

5.0 CONCLUSION In summary, we used the new "computing paradigm" to discover a non-linear control strategy for the three dimensional broom balancing (inverted pendulum) problem which slightly outperformed our benchmark "pseudo optimum" solution for the problem. We believe our nonlinear control strategy is a very close to the time-optimal solution to the problem.

6.0 REFERENCES Anderson, Charles W. Learning to control and inverted pendulum using neural networks. IEEE

Control Systems Magazine. 9(3). Pages 31-37. April 1989.

56 Barto, A. G., Anandan, P., and Anderson, C. W. Cooperativity in networks of pattern recognizing stochastic learning automata.In Narendra,K.S. Adaptive and Learning Systems. New York: Plenum 1985. Booker, Lashon, Goldberg, David E., and Holland, John I-I.Classifier systems and genetic algorithms. Artificial Intelligence 40 (1989) 235-282. Davis, Lawrence (editor) Genetic Algorithms and Simulated Annealing London: Pittman 1987. Goldberg, David Eo Computer-Aided Gas Pipeline Operation Using Genetic Algorithms and

Rule Learning. PhD dissertation. Ann Arbor: University of Michigan. 1983. Goldberg, David E. Genetic Algorithms in Search, Optimization, and Machine Learning. Reading, MA: Addison-Wesley1989. Holland, John H. Adaptation in Natural and Artificial Systems, Ann Arbor, MI: University of Michigan Press 1975. Koza, John R. Hierarchical genetic algorithms operating on populations of computer programs.

Proceedings of the 11th International Joint Conference on Artificial Intelligence (IJCAI). San Mateo, CA: Morgan Kaufman 1989. Koza, John R. Econometric modeling by genetic breeding of mathematical functions. Proceedings of International Symposiumon Economic Modeling. Urbino, Italy: 1990a. In Press. Koza, John R. Genetic computing: a paradigm for genetically breeding populations of computer program to solve problems. Machine Learning. In press 1990b. Koza, John R. and Keane, Martin A. Cart centering and broom balancing by genetically breeding populations of control strategy programs. Proceedings of International Joint

Conference on Neural Networks, Washington, January 1990. Volume I. Schaffer, J. D. (editor) Proceedings of the Third International Conference on Genetic

Algorithms (ICGA). San Mateo, CA: Morgan Kaufmann Publishers Inc. 1989. Widrow, Bernard. Pattern recognizing control systems. Computer and Information Sciences

(COINS) Symposium Proceedings. Washington, DC" Spartan Books, 1963. Widrow, Bernard. The original adaptive neural net broom balancer. 1987IEEE International

Symposium on Circuits and Systems. Volume2.

P R O C E S S C O N T R O L USING N O N L I N E A R P R O G R A M M I N G T E C H N I Q U E S

B. Wayne Bequette Department of Chemical Engineering, Rensselaer Polytechnic Institute Troy, NY 12180-3590, USA

ABSTRACT Some of the most difficult problems associated with process control are due to process nonlinearity, manipulated variable constraints, uncertain parameters and unmeasured variables. In this paper a nonlinear programming approach is developed to estimate process parameters, unmeasured state variables and process disturbances.

A constrained optimization-based

procedure is also used to maintain a desired output variable trajectory, similar to techniques that have proven successful for linear systems. The process model, characterized by a set of nonlinear differential equations, is transformed into algebraic equations using orthogonal collocation on finite elements. A system with inverse response characteristics and a bioreactor model are used as examples. INTRODUCTION Within the past 10 years there has been an increasing use of advanced multivariable control system techniques which are known as model predictive control methods. A survey of the field has been performed by Garcia e t al. (1989). One of the most common predictive control methods is Dynamic Matrix Control (DMC) (Cutler and Ramaker, 1980). DMC is based on selecting a set of L future manipulated variable moves (control horizon), to minimize an objective function based on a least squares minimization of the differences between model predicted outputs and a desired output variable trajectory over a prediction horizon, R, as shown in Figure 1. Although the DMC optimization is performed for a sequence of future control moves, only the next control move is implemented. Model uncertainty and process disturbances are handled by calculating an additive disturbance as the difference between the process measurement and the model prediction at the current time step. It is assumed that the future disturbances are equal to the current disturbance, and a new trajectory is calculated. Notice that predictive control is really an open-loop optimal control strategy with feedback provided by the re-optimizing procedure. Predictive control strategies have been well-received by industry because they are intuitive and explicitly handle constraints. One limitation to the existing methods is that they are based on linear systems theory and may not perform well on highly nonlinear systems.

58 desired trajectory (future)

Y a

c I

t I

1

u 1

I

a 1

I

~ I

I

I

1

I

1

I

I

R t k current time

Prediction Horizon

maximum I

H

. . . . . .

I

,

I .............. I ["-1 1.... 1........................................ minimum

I

1

1

I

I

1

[

1

1

~I~,~-----

t

I

past control moves

1

1

I

1

I

Control Horizon Figure 1. Predictive Control Approach For these types of processes we feel that it is important to use nonlinear predictive control techniques with nonlinear dynamic models that have a physical basis. In this paper we first discuss dynamic process models in general. Next, we present a strategy for the regulation of the system, using a predictive control approach that accounts for constraints on manipulated and state variables. It is important to have a model that is representative of the current state of the process, so we then present a method to perform parameter estimation of the nonlinear dynamic system. Nonlinear Process Models This paper is based on the control of process systems that are described by lumped parameter models, that is, systems that can be modeled by sets of first order ordinary differential equations. In addition, we include deadtime on process inputs or output variables. Deadtime can be due to transport (flow through pipes) or measurement delays (analytical instrumentation, etc.). Mathematically, the representation is

59 dx

dt - f(x,u(t-0),p,l)

dynamic modeling equations

(1)

gl(x,u,p) = 0

algebraic equilibrium relationships, etc.

y = g2(x(t-qS))

state-output relationships

(2) (3)

where state variables

X

tl

=

manipulated variables

p

=

parameters

1

=

load disturbances (measured or unmeasured)

y

=

0

=

output variables deadtime between manipulated and state variables

00

=

deadtime between state and measured variables

This type of formulation may also be used for distributed parameter systems that have been "lumped" by discretizing the partial differential equations and forming ordinary differential equations. In order to si,alplify the solution of this problem, we use orthogonal collocation on finite elements to reduce the differenti',d equations to a set of algebraic equations. The dynamic model can then be stated as Ax - f(x,u,p,l,0) = 0

(4)

where A is a matrix consisting of collocation weights (Finlayson, 1980) and x is now a vector of state variable values at each collocation point. NONLINEAR PREDICTIVE

CONTROL

(NLPC)

The linear predictive control techniques can be extended to handle nonlinear systems by formulating a nonlinear programming problem (Eaton et al., 1989).

A single input-single

output system is shown as an illustration.

rain

tk+Tp k+RN c q~ (u,x) = fe 2 dt = Y. Y. Wj(Ysp(i,j)-Ypred(i,j)) 2 tk i=k j=l

u(k),...,u(k+L- I)

subject to Ax - f (x,u,p,l,0) = 0 U mni _-tt ~

a

else /z ~ -

also

the

the

accuracy

condition of

input

possible accuracy give is

up

of

some t h r e s h o l d

condition

number,

i.e.

number the

very

error

important

of closure of

is

the

the

value

IIAx - fll

approximated

due

the

connected

together with

connected

solution x

(by

solution to

value

w i t h t h e n u m b e r o f d i g i t s in a ceil o f t h e c o m p u t e r used). T h e r e a t , a

the

computation

number data

(or

with

this

simply,

the

e r r o r o f c l o s u r e ) , It is t h i s p a i r w h i c h g u a r a n t e e s t h e e s t i m a t i o n : fix - xll/lIxll ~ /a(A)IIAx - fll/llfll. Not too simple is the question: and as t h e e r r o r try

to

what

must

be c o n s i d e r e d as c o n d i t i o n n u m b e r

o f c l o s u r e in o t h e r c o m p u t a t i o n p r o b l e m s .

i n d i c a t e such notions f o r t h e c l a s s i c a l p r o b l e m -

In t h i s r e p o r t

we will

the p r o b l e m on d i c h o t o m y

of a m a t r i x s p e c t r u m by a i m a g i n a r y a x i s , i m a g i n a r y in its ample m e a n i n g , when it

96 is n e c e s s a r y to m a k e s u r e t h a t the m a t r i x axis

(such

maximal

matrices

invariant

are

called

subspaces

A h a s no e i g e n v a l u e s on t h e a n i m a g i n a r y

exponent~ally

L § (A)

and

dichotomic),

L (A),

but

corresponding

also

to

the

to

indicate

parts

of

the

s p e c t r u m lying in t h e l e f t and r i g h t h a l f - p l a n e , r e s p e c t i v e l y .

Thus,

for

separation

of

example, a

in

maximal

the

theory

invariant

of

automatical

regulation,

L § (A) p l a y s

a

subspace

very

theproblem

impotent

of

role.

If

a r i s e s w h e n c o n s t r u c t i n g a n o p t i m a l c o n t r o l u(t), w h i c h m i n i m i z e s t h e v a l u e CO

S o

( llCx(t)ll2+

in t h e s o l u t i o n s of t h e s y s t e m d x / d t

llBu(t)ll z) d t

: Ax. It is w e l l - k n o w n

that

to c o n s t r u c t

such

u(t), a s u b s p a c e L,(W) f o r t h e m a t r i x

[ ;.]

W=

A

-C'C

is n e c e s s a r y to be found.

It would s e e m q u i t e n a t u r a l

to solve it by f i r s t

computing the eigenvalues

W

lying in t h e l e f t h a l f - p l a n e a n d t h e n by c o m p u t i n g e i g e n - a n d p r i n c i p a l v e c t o r s .

It c a n be e a s i l y s h o w n t h a t such a s c h e m e if r e a l i z e d on a c o m p u t e r c a n p r o v e non s o l v a b l e . eigenvalues dealing

The

is

with

thing

not

the

is t h a t one

a computer

the

which

problem

can

we m u s t

matrix

Ao,

where of

close

to

it.

We know

computation

be c o n s i d e r e d

consider

owing to w h i c h we c a n only g u a r a n t e e ,

of

its

the

computer.

such

Thereby,

about

instead

det(A-Al)=O we deal

for

which

there

with exist

of

this

co-called

matrix

eigenvalues

a spot a

correctly

A general

stated.

"indefiniteness

In f a c t , principle",

t h a t i n s t e a d o f a m a t r i x A we h a v e a c e r t a i n

c is a s m a l l p o s i t i v e n u m b e r c h a r a c t e r i z i n g

equation

as

of m a t r i x '

of

that

IIA - Aoll -< cllAoll,

the number of

its

matrix

only

the

of d i g i t s

matrix

"c-spectrum", B(A),

so

in a cell

A satisfying i.e.

that

with

the

a set

of

IIB(A)II < r

d e t ( A + B(X) - M)=O.

It spots means

of

is

well

known:

"e-spectrum" that

the

conditioned.

This

if

self-adjoint

possesses o f

problem is

A is

not

of the

matrix

a diameter

eigenvalues case w i t h

non

of

which the

of

(A=A'),

does n o t

each

of

e x c e e d cllAll,

self-adjoint

self-adjoint

then

matrices.

matrices There

its

which

is

well

exist

the

97 e x a m p l e s w h e n t h e s p o t s of t h e " c - s p e c t r u m "

can have rather

c a n lead to t h e v a g u e n e s s or t h e d i m e n s i o n s of s u b s p a c e s typical

example

can

be

taken.A

two-parameter

family

large diameters which

L § (A) a n d L (A). Here, of

matrices

A (t)

of

a the

twenty first order: -20

-20.5L

0

A

b)

-19

-19.5L

(t)= 0

-2

0

-2.5L

0

0

-i

0

0

0

0

i

has a characteristic equation 20

20

I IC-j-k) -

(l-~)(

19

2

5 ~t

I--I (-j-o.s}) = o. J:t

J=l Let E)

=

1.5/-19

2tz(A~(l))=0.5

Ptt(A~(t))=l, dimension

( a t l=4,

of

L (A-(L)) --

l e t ~ = 3 2 -39 = 3 . 6 c - I 0 , are

is not

eigenvalues less

than

of 2,

at

l=10 l e t E) = 1.5e-19) t h e n

the

matrix

A-(1),~

whereas

the

dimension

consequently of

'.~

-

e q u a l to 1. It is to be n o t e d t h a t

the

L (A (l))

is

0

A-(l)o

do

of

the

determine

the

A, F a d d e e v D.K.

and

t h e e l e m e n t s of t h e m a t r i c e s Ao(l) a n d

n o t d i f f e r m o r e t h a n by t h e value ~ ( a t t = i0, ~ = 1.5e-19).

The

existence

eigenvalues quality

of

problem"

such for

examples

arbitrary

of s t a b i l i t y of a s e p a r a t e

is

connected

with

non-self-adjoint

matrices.

e i g e n v a l u e 2, of t h e m a t r i x

F a d d e e v a V.N. (4) p r o p o s e d to use one n u m e r i c a l p a r a m e t e r b e t w e e n t h e e i g e n v e c t o r s of m a t r i c e s this

parameter

is too

large,

"the

instability To

connected with the angle

A a n d A" c o r r e s p o n d i n g to }t. If t h e v a l u e o f

the problem

of

the

determination

of t h i s

eigenvalues

is unstable. Unfortunately, this parameter is not used as a measure of stability of eigenvalues

in the concrete corriputations.

Probably,

it

is connected with the

fact, that no efficient algorithm to compute this parameter was advanced.

There

exist

ill-conditioned, conditioned.

the

some

matrices

subspaces

L + (A)

A

whose and

separate

L (A)

eigenvalues

corresponding

to

though them

are

being well

98 The p r e s e n c e of t h e like e x a m p l e s compute

the

bases

preliminarily

computed

(Kublanovskaya adjusted,

V.N.)

and

Balzer)

of

a

was

sign-functions

methods

subspaces.

all

necessary

in

of

certain

and

the

special

worked

In

invariant

led t o t h e d e v e l o p m e n t of a l g o r i t h m s

subspaces eigenvalues.

It

are

applications.

be

destined

In

method

must

likelihood,

For

it

is

this

the

in

compute

the

first

the

A.A,

QR-algorithm were

Kublanovskaya

V.N.,

that

on

having

V.V.)

theoretically

maximal

and

without

(Voevodin

mind

information

case,

purpose,

(Abramov

to

L (A)

method

born

only

and

this

orthogonal-power

sign-functions

out.

L + (A)

which

these

invariant subspaces

foremost

matrix that

problem

the

is

becomes

their stable computation.

Before

separating

non-self-adjoint

matrices

the

projectors

onto

let

us c o n s i d e r

maximal

a more

invariant

particular

subspaces

and important

of

Hurwitz

p r o b l e m , h e r e a s p e c i a l a t t e n t i o n m u s t be p a i d to t h e L y a p u n o v m a t r i x e q u a t i o n .

As a c o n d i t i o n n u m b e r of t h e H u r w i t z p r o b l e m i t w a s s u g g e s t e d in w o r k (5) t o use

a

stability

quality

parameter

~:(A).

K(A)=2-11AII-IIHII, if t h e Lyapunov m a t r i x matrix

of

dimensions

determined

H,

NxN),

otherwise

has

~(A)=o,.

s o l u t i o n s of t h e s y s t e m d x / d t

Its

algebraic

expression

looks

as:

e q u a t i o n A*H + HA + I = 0, (I b e i n g s i n g l e

only

one

This

solution

parameter

is

in

the

directly

class

of

positively

connected

with

the

= Ax by t h e e q u a l i t y (6): T

It is obvious, dx/dt

that

tlx(t)ll 2 dt ))

K(A) = 211All (Sup ( m a x T->O IIx ( 0 ) I1=1

J" o

t h e less is ~(A) t h e m o r e

rough

is t h e

stability of the

system

= Ax w i t h r e s p e c t t o t h e d i s t u r b a n c e s o f t h e e l e m e n t s in t h e m a t r i x A.

As

an

error

of

closure

in

the

numerical

solution

of

the

Hurwitz

closure

of

the

Lyapunov

problem

o

IIA X + XA + Ill

is

used

equation.

is

the

True

which

is

theorem

the on

error the

of

error

of

closure

(7):

if

matrix

X=X" > O,

= IIA'X + XA + Ill < 1 t h e n t h e H u r w i t z ' A a n d

IIH-XJI/IIHII ~ A,

It Hurwitz

is

the

problem

theorem

which

Im(A)-2-11AIJ.IIXIIJ/~(A) "< /~.

allowed

(6) up to t h e a l g o r i t h m

to

close

the

algorithm

with a guaranteed

a n d d e s c r i b e d in i t s f i n i t e f o r m in w o r k (7).

of

solution

accuracy,

of

the

substantiated

99 Here

we

errors

in t h e

played

a

like

to

note

that

computation algorithm of

very

computation equation

would

important

algorithm

role

of

in

as

detailed

analysis

of

the

matrix

function

exp(tA) a t

estimating

positively

A ' H + HA + I = 0,

a

it

the

determined was

shown

influence

solutions in

(8).

the

of

of

computational

these

the

Thereat

Hurwitz A

errors

Lyapunov

in

a

matrix

estimation

of

the

m a t r i x e x p o n e n t i a l exp(tA) w a s amply m a d e use of:

Ilexp(tA)l] -< v/K(A)'.exp(-tllAII/~(A)),

t>--0.

Now, let us p a s s over to t h e s e p a r a t i o n o f m a x i m a l

invariant subspaces of a

non-self-adjoint matrix.

If

A is

the

exponentially dichotomic,

then

for

the

sought

for

projector

G

o v e r L + (A) a w e l l - k n o w n i n t e g r a l p r e s e n t a t i o n 00

1 G = ~-~ S (izI - A ) - l d z -CO

is valid. The value of the n o r m IIH;I o f t h e s y m m e t r i c a l m a t r i x 1 co H = ~-~ J" (A ~ + i z I ) - ! ( A - i z I ) - i d z

(1)

-00

can

be

there If

used

exists

as

a

only

dichotomy one

numerical

eigenvalue,

IIHII is n o t too large,

characteristic.

then

IIHll = ~,

If

since

the

i t can be e a s i l y o b t a i n e d t h a t

and H c h a n g e v e r y l i t t l e a t

small v a r i a t i o n s

of

A for

on

the

imaginary

integral

(1)

no

axis sense.

estimation showing that

G

w h i c h t h e p r o j e c t o r G and

t h e m a t r i x H have been built. This c o n t i n u i t y is c o n n e c t e d : i) w i t h t h e g u a r a n t e e of c o r r e c t n e s s o f t h e c o m p u t i n g p r o c e s s ; 2) w i t h t h e n e c e s s i t y t h a t t h e s o l u t i o n a d m i t o f a p r a c t i c a l u s a g e in p r o b l e m s .

It

is

interesting

to

note

that

if

A is

exponentially dichotomic,

then

for

H

f r o m (1) t h e p r e s e n t a t i o n C0

H = J" G ' ( t ) G(t) d t -00

is

valid.

Here

G(t)

is

d / d t ( G ( t ) ) = AG(t) + 6(t) determined

the

the

parameter

~(A) = 2.1IAII.IIHII.

Green

bounded of

matrix at

the

all

determined t.

Using

exponential

as

this

the

solution

equation,

dichotomy

of

of

equation

S.K.Godunov

(9)

the

A:

matrix

100 So, S.K.Oodunov proposed to

simultaneously compute the matrix

H and the

p r o j e c t o r O on L § (A) while s t u d y i n g t h e e x p o n e n t i a l d i c h o t o m y of A. In (10) i t

is s h o w n t h a t

to e s t i m a t e

matrices

G and

Ft, a s y s t e m

of

matrix

e q u a t i o n s c a n be used Gz-

G = O, GA - AG = 0, ~

HA

+

A

(GH) ~

~

H

+

GH = 0, (2)

9

G CG

w h i c h g e n e r a l i z e s t h e Lyapunov m a t r i x

(I-G)

-

C(I-G)

= 0

e q u a t i o n f o r t h e c a s e of t h e n o n - H u r w i t z A.

We h a v e p r o v e d t h e f o l l o w i n g t h e o r e m .

THEOREM: a) i f f o r some C=C'> 0 t h e r e e x i s t m a t r i c e s G a n d H(C)=(H(C)) ~ > 0 satisfying

system

if a m a t r i x

(2),

A has

then matrix

A h a s no e i g e n v a l u e s

no e i g e n v a l u e s

on t h e

imaginary

on t h e i m a g i n a r y

axis,

then

system

~

axis;

(2)

at

b) any

9

C=C > 0 h a s t h e only one s o l u t i o n w i t h r e s p e c t to G a n d H(C)=(H(C)) > 0 t h e r e a t 0O

G = l / ( 2 n i ) s (zI-A)- ldz;

where

~- is

eigenvalues

a A

closed

contour

having

located

strictly

in

theorem

presents

a

equation.

In t h e

same

error

of c l o s u r e

the algorithm control

and

the

computations

determination

of

of

either

correct

of

left

a

half-plane.

known

problem,

theorem

is o b t a i n e d ,

occurring

projectors

error

of c l o s u r e

In (11) t h e r e errors

over

the

was

exercise

assertion

in t h e

carried

clear

is the

to

important theorem

L (A)

and

based

(9). T h e

that

role

of

of

work

~:(A)>~:~

i.e.

that

matrix on t h e

in t h e in for

projectors

the

over

analysis

destined variant

of

for

the

of

the

in t h i s a l g o r i t h m

H a n d t h e v a l u e ~(A) w i t h t h e that

this

computations

a complete

on a

the

that

is u s e d

results

out

all

Lyapunov

in one a l g o r i t h m

by S.K.Godunov

G, m a t r i x

It

in t h e c o m p u t a t i o n s

L § (A) a n d

method suggested

in t h e p r o j e c t o r

contains

analogous

of

the

which

on

which plays an v a l u e ~(A). T h i s

subspaces.

or

the

self-crossings

G, H a n d t h e

computational

signs,

no

(lO) a t h e o r e m

of t h e e r r o r

the

orthogonal-power results

work

of matrices

invariant

influence

generalization

in t h e H u r w i t z

and play the role maximal

H = l / ( 2 ; r ) J ' ( A ' + i z l ) - l C ( A - i z I ) - 1dz

there

indication exist

no

" p r a c t i c a l " d i c h o t o m y of t h e s p e c t r u m A by t h e i m a g i n a r y a x i s .

L e t us p r e s e n t

one of t h e p o s s i b l e a l g o r i t h m s

of G a n d H s u g g e s t e d by A.N.Malyshev (12).

of a simultaaleous d e t e r m i n a t i o n

101 i) W i t h t h e h e l p o f t h e T a y l o r s e r i e s

A = exp(xA'),

B = -I,

0

and using

the

Choletsky

LL 9 =~. T h e r e a f t e r ,

2). AdBI;

We

Az, B2; ...;

a

we find

recurrent

by m e a n s

I

Aj_IO

0

X - -

m

A

)-ill = ~

approximations inequalities

) C,

m

invertability m

m

- B

= r to

O

~.

L by d e c o m p o s i n g

procedure

of

which

the

is r e d u c e d

transformation

0

A

matrix

pairs

at

every

P, s u c h a s

B

J

J

m --~

B*(A* - B') -t

m

determination

]I

" 9• !

Bj_I

It turns out that at

II(A - B

the

•

]=

to

Thereat

of

triangular

B o= L-XBo = L -I

to t h e c h o i c e o f t h e o r t h o g o n a l

P[Bj-tAj-IO

m

a lower

A 0 = L-IAo ,

successive

Are,Bin; ...

dt

0

decomposition

out

step of the recursion

e) = 211All f e x p ( t A * ) . C . e x p ( t A )

0

we s u p p o s e t h a t

carry

a t 9 = 1/(211AII)

G,

n~

(A - B )-ICA* - B* )-I m

at

m

rather

IIAII.IIHII a n d

"f2-'-IIAII.H

is

rn

large the

estimated

) 2UAII.H.

m

m

follows

velocity

of

beginning

from the

the

fact

convergency

from

some

m

that of

the

by

the

of the form:

II El" (A" - B * )-i in m tn

II (A - B rn

rn

G II :i C o n s t V/KCA)' exp(-2m/~c(A)),

)(A -B ) - 2 - t l A I I . H Ill

EP.

II --~:" providing a "actual"

102 absence

of the dichotomy of the m a t r i x

computation

of

the

matrices

G,

spectrum

H and

the

by the

value

imaginary

~:(A) with

axis,

indicated

or

the

the true

signs.

BIBLIOGRAPHY:

1. T u r i n g A.M. Rounding-off e r r o r s in m a t r i x processes. Quart.

J. Mech., 1, 1948,

287-308. 2. J.von Neuman and H.H.Goldstine. Numerical inverting of matrices of high order. - B u l l . Amer. Math. Soc., 1947, v.S3, no.ll, 1021-1099. 3.

Guaranteed

accuracy

Godunov S.K.,

for

the solving of

Antonov A.G.,

linear

K i r i l j u k O.P.

systems

and

in Euclidean

Kostin V.I.

-

spaces./

Novosibirsk:

"Nauka", 1988. (Russian). 4.

F ad d eev

D.K.

and

Faddeeva

V.N. Computational

methods

of

linear

algebra.-

Gos.lzdat.Fiz.Mat.Lit., Moscow, 1963. (Russian). 5. Bulgakov A.Ya. - Sibirsk.

Math. Zh.21(1980),

No 3, (32-41); English Transl.

in

Siberian Math. J. 21 (1980). 6.

S.K.Godounov

(Godunov)

calculatives dans calculatif (Proc.

du

Fifth

A.J.Boulgakov

(A.Ya.Bulgakov),

le probleme de Hurwitz et methodes a l e s

probleme Intcrnat.

Information

and

Sci.,

de

Hurwitz),

Conf.

vol.

Analysis

(Versailles,

44,

1982)),

and

surmonter

Optimization

Lecture

Springer-Verlag,

Difficultes

Notes

1982,pp.

(aspect

of

Systems

in Control

846-851.

and

(English

ab s t ract , p.845.) 7.

A.Ya.Bulgakov and Lyapunov's

S.K.Godunov,

equation,

Calculation

Computational

of

Methods

positive definite of

linear

solutions

algebra,

of

"Nauka",

Novosibirsk, 1985,pp. 17-38. (Russian). 8. Bulgakov A.Ya. Computation of the stable

matrix,

Computational

exponential

Methods of

linear

function of an asymptotically algebra,

"Nauka",

Novosibirsk,

1985, pp.4-17. (Russian). 9. Godunov S.K. - Sibirsk.Math. Zh. 27 (1986), no.5, pp.24-37. 10. Bulgakov A.Ya. - Sibirsk.Math. Zh. 30 (1989), no. 4, pp.30-39 11.

Bulgakov

A.Ya.

non-self-adjoint

Guaranteed

accuracy

matrix,Numerical

of

calculating

analysis,"Nauka",

or

invariaat

subspaces

Novosibirsk,

1989,pp.

12-93, (Russian). 12.

Malyshev

A.N.

Preprint

Novosibirsk, 1988 (Russian).

no.6,

Inst.Math.,

Siberian

Branch

Acad.

Sci.

USSR,

Discrete approximation of the minimal time function for systems with regular optimal trajectories

M. Bardi

M. Falcone

and

Dipartimento di Matematica P. e A. Universi~ di Padova via Belzoni 7 35131 Padova - Italy

Dipartimento di Matematica Unlversi~ di Roma "La Sapienza" P. Aldo Moro 2 00185 Roma - Italy

Abstract. In this paper we prove an estimate of the rate of convergence of the approximation scheme for the nonlinear minimum time problem presented in [2]. The estimate holds provided the system have time-optimal controls with bounded variation. This estimate is of order

v with respect to the

discretization step in time, if the minimal time function is H61der continuous of exponent v. The proof combines the convergence result obtained in [2] by PDE methods, with direct control-theoretic arguments.

1. I n t r o d u c t i o n . In this paper we continue the study of an approximation scheme for the classical minimum time problem for nonlinear systems which we began in [2]. We consider the continuous-time controlled dynamical system in IRN (1.1)

~ y' = b(y, (x) t y(0) = x

and the corresponding discrete-time system, with time step h > 0,

(I .2)

xj+ X0

= xj + h b(xj,aj) =

X

where the controls are taken in a given set A~IR M. For a given compact target set T we are interested in the minimum times T(x) and hNh(x) taken respectively by systems (1.1) and (1.2) to reach T , where Nh(x ) indicates the minimum number of discrete steps. Note that T is finite only on the set R of points controllable to the target T in finite time, that it tends to + ~ near D R and that R is not known a priori. We recall that the Dynamic Programming method provides time-optimal controIs in feedback form for the discrete-time problem, once the discrete Bellman equation for N h is solved. In [2] we considered the new unknown functions

104

(1.3)

v(x) :=

1 - e -T(x), if T(x)0. The general structural assumptions on b and T we need are the following b:]RNxA---~IR N is c o n t i n u o u s , l b ( x , a ) - b ( y , a ) ] < Llx-y I

(At)

and lb(y,a)l < K(l+lyl), V x , y e IR N and "v' ae A ; T i s compact ;

(A2)

there exists 7, 0 0 there e~-tsts an a h ~ Ph such that R

(2.2)

~

c~(s) - (Xh(S) ds < h V(a,[O,R])

q

0

Proof. Set Ik:=[kh, (k+l)h[ for k=0,1 ..... n , n such that (n+l)h < R < (n+2)h and In+l:=[(n+l)h,R]. We can define c~h in Ik as any value taken up by (x in the same interval, or for instance

ah(S):= lim a(t) forse Ikt--.->kh+ Then f Ik

~(s) _ O~h(S) ds < h V(a,I k) ,

and (2.2) follows easily. In the previous paper [2] we have proved the following result (the definition of v and v h is (1.3)):

108

Theorem 2,2 Assume (A1), (A3), (A4). Then v h converge to v uniformly on compact subsets of IRN. Corollary 2.3 Under the assumptions of Theorem 2~2 for any given compact subset K of R there exist -h and T, such that K~P,h

a n d hNh(x) The spatial domain occupied

by a spherical particle with normalized radius rp centered at xp is specified by B(xp) = {x 9 R 3 : [lx-xp[[ _< rp}, where [1. [[ denotes the Euclidean norm. Let S = {x 9 R 3 : 0 < x3 < ~3,(Xl,X2) 9 FB}, where ~3 is a specified positive number. For the single particle case, the spatial domains occupied by the liquid layer and the gas are given by s

= (f/-NS)\B(xv)

and ~2g = f~+ f3 S respectively. In f~g, the steady-state motion of the gas is described by the following dimensionless Navier-Stokes equations: 3

~--~uiDiu+ V p g - v a V 2 u = O ,

divu=O,

(1)

i=1.

where u = (ul,u~, u3) denotes the gas velocity (normalized by pgv~/h~); and D~ = O/Ozi. IIere, we have neglected the gravitational effects. Similarly, in fl/, the steady-state motion of the liquid is described by 3

C viDiv + V p / -- V / V 2 V = --gVX3,

divv = O,

(2)

i=1

where v = (vl, v2, v3) denotes the liquid velocity normalized by p/v//ho, p~ the liquid pressure (normalized by plv~/h~); and g the acceleration due to gravity (normalized by vl/ho).2 3 Assuming no slip at the rigid bottom plane $ 8 and at the particle surface, we have the following boundary conditions: v

where O ~ ( x . ) = { x ~ ~

= o

on

SsuOt~(xp),

(3)

: IIx - x~ll = ~ } .

When the liquid is confined to a container, we may also impose no slip conditions at the lateral surface SLy" = { x 9 /~3 : 0 < ~3 < h ( ~ l , ~ 2 ) , ( ~ , ~ )

9 ors}, V = 0

i.e. On SLF.

(4)

At the top boundary surface ST = {x E R 3 : x3 = k3,(xl,x2) 9 FB}, the gas velocities at the jet entrance and suction port are specified, i.e. u = (O,O,~,s)

on Ss,

u = (O,O,-~,s)

o,, Ss,

(5)

148 where u j and us are given positive constants; S j and S s are subsets of ST representing respectively the jet and suction port surfaces given by

s, = {x

-

, , , i i , , , r l l , i i

U

7 "C

"'bt?" ((t \ !if ,J ii

GAS JET

? i

/ i

/ i

i

i

i

i

i

i

LIQUID LAYER

k "" i

I

PART lCLE

"ql"]

Xi

J I m

P

Fig.la Sketch of streamlines for the gas and liquid flows induced by the gas jet and the suction port.

SUCTION PORT

GAS JET

GAS JET

U INTERFACE ,. - 7 ~ r ~ ' = - - - . ~

,

~

,,'/ / .................

BOTTOM PLANE

.

- - - - -4- - - L

LAYER

......

I

Q

.

-- _

~4~S~-i.

,t~,~

../4,t

...

---

~,,-"

'~

;T ......

PARTICLE 0

' ....

J ..,'T

t

.............

X! ~

Fig.lb Sketch of the streamlines for the gas and liquid flows induced by the displaced gas jet and suction port.

1,52

GA.S Js

"''Li/I

~V/II Ill/ l/

ID./#1

~9.. 49

/,[/tl

1,~',';_~.,./III/

t'/ ~t i :

, l

i\ ~''--

/11/#

~ +l~'''" ......

~ I t ~ # ', *

l l ~ . -

-#

lit

I

........

-~-------,,',,\\\~..lz,[ . x \\

\x-'--..--

t

~\\\\'~'-'~,

'

/

-%%\~'~'--#a

i 39, 8E

39.165

3~

1 39.99

t

I i

I (JO. 1S

I qO. 32

]

I

gO. ~i9

VEUTORSI~II,.IEI 5.00E-Ol UNITS i

i

I

2?~''

-7, ; : . . . .

tt

"

I

~..-//t

. . . . . . .

"~

"r

.~,,

,

-411,.._" .

O

GAS ,/ET

~'IOR SOtL.~i ~'.OOE+O0UNITS SUC'CZON PORT

,~ , ''"

I - - 1

.,~\X..,

. . . . . .

'

i"

' ~"\

i "

"

,

"

Fig.2 Velocity vector field of liquid and gas with particle along the suction port axis.

153

G~

JET

~'TOR

S(~LEI

2.0(~+00 UNITS SUCTIOI~ POI~T

V i i i . / i / I

t/,7, /1

l

J""

GAS .TET

9 , , ~ "~ ", \

x/l/I/.,

,k~--.~

,,\\\\~

I i I ~ --

l,'l,

. .

.

.

•i ~ 3 . q9

t~//

%

.

.

.

.

t 39.8~

39165

~C'f~

~1.s

'I

.

.

.

I 39.99

S.OOE-OI

i

~.%

"/

#

/

I

.

.

I hlO. I S

I ~ 0 . 3 :>

I

i

YO.~9

IJIT$

I

;,,,-\\~

--////. - A " ' , , ,, , - ' M , --

_277~ .

9

~ ,

,

,

,

. . . . . .

'

9

/

l't

I

i

'

"

! i. , , , .... . . . . . .

!

::;iX'ix.

pARTICLE

.

,

i!!!i!!i ..:~

.....

'~'9. q9

I 39.65

i

i

,

~ ~ "

i'''l'','''i''','''I'' 39.82

39.99

; IJO. 15

'

i

iiO. 3e

liO. li9

Fig.3 Velocity vector field of gas and liquid with displaced particle.

OPTIMALITY CONDITIONS FOR ELLIPTIC VARIATIONAL INEQUALITIES

Liu Wenbin And J.E. Rubio School of Mathematics, University of Leeds Leeds LS2 9JT, U.K.

Suwa~ary. By

the

penalty method,

Ekeland's

Variational Principle

and

necessary conditions

of

lower-semicontinuity of some set-valued mappings, optimal

control for some abstract elliptic

obtained

in

the

conditions.

The

cases idea

where is

to

the find

convex

variational sets

optimality

inequalities are

satisfy

conditions

some

smoothness

first

for

some

penalized problems by Ekeland's Variational Principle then to pass limits to obtaln the optimality conditions. some

known

optimality

It is shown that these conditions lead to

conditions

in

necessary conditions for some problems.

many

cases.

They

also

yield

new

Our results give uniform forms for

several known optimality conditions for elliptic variational inequalities.

1.Introduction.

Optimal control problems for elliptic variational inequalities have been much

studied.

conditions,

One

of

the

main

difficulties

even though optimality

important cases (see for example,

conditions

is

to

obtain

optimality

have been obtained in some

[3],[4],[6],[7],[I0],[12],[13])o Recently,

Shi Shuzhong got some optimality conditions for abstract variational inequalities [14] [IS], which covers very wide range of problems. seem very complicated to apply.

But his results

In this paper we will, by using the

penalty

method, Ekeland's Variational Principle and lower semicontinuity of some set-valued

mappings,

problems. Then

establish

we show that

general

optimality

in many cases,

conditions

the conditions

for

this

lead to

known

results. We also show that they give new necessary conditions for a problem posed in [7] and [13]. It seems that our optimality conditions tend to unify known results. The methods used in this paper can be applied to obtain some necessary conditions for abstract strongly monotone variational inequalities

155 (see [14],[15]) without essential changes and we shall discuss this

problem

in some later papers.

2. Some studies about the lower limits of set sequences.

m

let V be a Banach space and V

its dual space.

As in iS], we give the

following definition: DEFINITION 2.1. Let A cV for n=l,2,..-. The lower limit of {A } is the set, n n denoted as lim A n-~

n=l,2,-- and

n'

given by lim A ={ y~V: there is {yn } such that yn ~ A n for n n-~

yn~y strongly in V}.

A well-known result is that lim A is always a closed set iS]. Let K be n n-~m a closed nonempty convex set in V; we will see later that lim(K-Yn)~(Yn-K), n-~ where Yn~K for n=l,2,-- and llyn-YoiIv~O,plays an important role. We note that li__mm(K-Yn)n(Yn-K] is not empty. The problem whether llm(K-Yn)n(Yn-K)=(K-Yo)n n-~ n-J

O, there is P>O such that IIUlIu~P if (y,u)gKXUad and g(y)+h(u)~Q. TI~OREM 3.1.

[4] (OCV) has at least one solution.

[]

We consider now the optimality conditions for (OCV). hong

In [14], Shi Shuz-

gave some optimality conditions for it but unfortunately,

conditions complicated

one of his

is wrong as shown by V. Barbu and his corrected results seems very to use

[IS]. Nevertheless,

the

idea in [14]

seems applicable.

157 By using a different treatment,

we give the following conditions for (0CV).

THEOREM 3.2. Let (Y0,U0) be a solution of (OCV).If the mapping y~(K-y)n(y-K) is l.s.c at Y0'

then there are P0~V and ~e 0g(y0)cH

(ag is the generalized

gradient in [8]) such that for any (z,v)~K>4Jad,

(3.3)

b(Yo,Z-yO)-(f+Buo,Z-yo)ZO,

(3.4)

b(s,PO)=(~,s),

(3.s)

-(B(V-Uo), p o ) - h (Uo, V-Uo),

(3.6)

b(Yo,Po)-(Po,f+BUo)=O,

(3.7)

b(Po,Po)~(~,po),

f o r any se U k(K-Yo)n(Yo-K)H,

Ca. 89

b3J : < c I a.e.

Uad

functional This

intensity

optimization

in

~Cv9

1 X vC• 0 C2..79, cI

satisfies

The

[

solution.

Following

Id v / d x 2 1

].

symmetric

condition

v ,~tntmtztn&

cost

ape

respectively

+ 1C@~

the

lwCx,v) l

:

is

lot

C~.4]

and

HIcc/)

the

a unique

the

constants 9 the

Ca. Sb

]:

on

has

function

that

and

indicating

to v ~ U a d

the

C~.39

suitable

Ca. 79

consider

stlbject

Ce. 5]

V rl ~ H ~ C CD

acting

satisfies

problem

shall

forms

+ kbCw,r

parameter

forces and

- UllZ~Dro.]dx

< Cx I , x~]

=

the

described

aCf,D] R

+ CUl~.Z I

~;s i s

H~CnD

form

= bCf,w,r

~

C2.33

dx

C~..'~)

~)a

formulation

aCw,r

tensile

+ ~Ul~Zlyb

dx

element,

well

pl a t e

equations

:

- u2.2zl]r I

} where

variational

~ar man

as

constant.

as

The

[~

~

Ca. 6D

a.~ > ~

that

].

In

: [ Hoc)CCZ) ]3

dx

a given

a given

continuous

+ u2,~z22

s [CUl~Z ~ QC v3

I dz QC vD

bC . . . . . b

~ R give n by

bCu,zD

x~

the

,

I CUllZll L%C v)

d 6 L2C{)a ) i s

We

~

=

ICz9

k

~

aCu,zD

bCu,z,r)

a~

x. H I CCD

is

assumed

Ca.89

functional

to

,

= c~ and be

cs

are

given

nonempty.

approximates is

>

stiffness

nondifferentiable.

of

167 Lemma

2.1

C DE. l E D

]

optimal Proof

For

X

does

solution

shal i

v

given

section

more

3

we

t.

ping

of

C)t

[

].

Let

us

shape

vector

field

E

~- C C [ O , ~ , )

:

TtCVD

*%

f)

for

there

order

C)t

}

=

an

an

condition to

do

it

we

depending image

DO .

f)t w e

To

of

a

descri

shall

use

problem

this

exists

(2".g).

In

as

variational

and

:

0 such

implicit

of

~

Ca'q-.ir.~).

= ft ~ Tt

two

canonical

employing

C w t , f t)

parameter

C w t , f t)

system ft

derivative

solution

< e t HeC O D -

consists

material

to

constant

differentiability

this

pair

the

exists

- w

:

Ca. iK~)

o%" t h e

the

of

the

respect

= w 0 E H 0 Cfl),

there

Proof

calculate

continuity

Ca. 169

system

be

to

~-

write

ft we

t

C3.3D

are can

--~ w t

problem

uncoupled.

show ~

Frechet

H 02 C O b

with

differentiability

of

(3.3). D

Lemma

3. S

: The

functions (a.

16)

wt

are

aC~,r

material E

given

derivatives

HOOCh%t) by

+ a'Cw,cb)

and

ft

~

w

~

Ho2Cf~L D

HaCOO

and

~" ~

satisfying

HaCOD

the

~y~tem

:

= bCf,w,r

+ bC~',w,r

+ b'Cf,w,r

+ kbCw,r

0 a C f , O) where

Cd,r

+ a'Cf,~))

of

= - b C 'w,w, D) - b C w, w, o) - b 'C w, w, D)

= Idiv

[dVCO)]Cdx

, VCO)

= VCO,X),

C3.4) 2 V D ~ HoC~'I3 a'C. , . ) , b ' C .

,.),

Q

b'C ..... D bC . . . . . ) = O.

These

are

Frechet

derivatives

,respectively, forms

are

with

given

by

respect :

of

the to

forms

parameter

aC. ,.), t at

a

bC. ,.), point

t

170

a'Cw,~D

= J

[ div

VCO)

- ~C

DVCOD

+

DVCODD]AwA~

dx

+ C3.5)CaD

f Q b'Cw,~D

dim

Cdiv

= g [ dim f~

VCOD

- CDVCOD

VC03

+ ~DVCODD]CAw~r

- CDVCOD

+ ~DVCODD]

+ X/wAr

~w~

dx

C3. S D C b D

C3. 6)Cc9 ~ZC~DVCOD~Zf

CDVCO) denotes

Proof

: Subtracting

domain t,

the

the

~] [ 1 9

passing

account

VC~fD~DVCODI~x

+ ~DVCODD

DVCOD

transforming

-

jacobian

from

] ,dividing

to

the

]Vf

+ ATV~w

matrix

of

the

integrals

both

the

t

C2. i O D - C ~ - . i ~ D , C 3 . 2 D , C 3 .

the as

SD

we

div

VCOD,

the

f]t i n t o

0

[

>

A~

as

+

C2.7D over

equations taking

C3.4D

dx

integrals

resulting

-

= ~ii

equations

the

well

obtain

VCOD

+ ~DVCOD)V~

matrix

of

--+

C div

C2.16)

domain

sides

with

~

CDVCO)

equations

over

limit

-

[ i~

by

into

]. o

Next

we

H o C ~ t)

determine

• H C Q t)

detern~nes the

the

increment

Definition function Cw t)

where

~

= w

II

H 2 C R ~'9 i s

shape

the [

system

the

parameter

19

]

HoCQtD

:

is

function

t value

The

shape

determined

the

C~.7).

The

pair

I IHeCC e

prolongation

~-

Cw t,ftD

shape

derivative

with

respect

value

to

only. W

derlvative

by

E

H2Cf)O

of

the

:

t W + oCtD

+

oCt)

of

of

derivative

the

increment of

3.3 wt

a

~atisfying

C~.~)

/t

-~

of

the

0

for

t

--~ O,

function

wt

w0

=

w

~ H ~ C Q t)

~

H COD,

into

wt

the

space

H'~ ~ 2 D . Let the

us

recall

functlon

holds W

[ l~

wt

~

] that

Ho~CQt D

if

the

exists,

shape then

derivative the

W ~ H~CC/]

following

of

condition

: = w

- ~w

VCOD

C3. VD

9

w

~

and

H~CCO

ia

~w

a

is

Integrating

the

,nate~'ial

gradient by

of

pa~ts

derivative

function the

w

system

of with C3.4D

the

function

respect two

to

wt

~

H

C~tD

xo

times,

taking

into

171 account

C3. VD

obtain

the

CW,FD of

~

the

and

eliminating

system

H~CC/9

x

system

of HeC{iD

CS. VD

derivative~

equations of

the

of

determining solution

the

shape

Cwt,ftD

VCOD

matrix

the ~

we

derivative

HO2Cs

D

x

HO2Cs

D

:

aCW,~D

= bCF,w,~D

+ bCf,W,~)

aCF,))>

= -~bCW,w.'Q)

+ kbCW,,~D

+ IICW, w,~D

HO2C~D

V 9~ ~

C3.83 I

li ( w ' w ' ~ j Taking that

into W and

W

= F

OW/"On where ing

of

point

[W

~-~A~

account

=

the

5

W

as

+ AwVCODnDA~

well

following

as

] dx

C3.7D

boundary

it

is

easy

conditions

[-O~'w.'3n a

]VCODn

,

OCOw."gh'~D/On.

integral

f] i n

the

OF/'On From

i i in

CS. 89.

dJff~,%O

of

direction

V,

=

[-@~'f/On 8

C3.~),CS.

the

cost

defined

IO>

]VCODn

it

functional

by

= max

+ I F

:

(3.113COD

C3.11DCaD by

we

introduce

an

- -~

.[i,

adjoint

: - $ n

6C•

dx

V )) a

ill,

+

C3. i ~ 9

A q ~h9 d y an

measure

llgCa)

C3.1iJCbD

V ,~ ~ HCfI)

d~

Dirac

[wC'~b [

CS.

>

x > 0 x = O x < 0.

eliminate [HoCCD]~-

max

defined

if if if

= bCp,w,)))

is

=

d J C Q , VD

in in

dx

v

E

U

Remark

2.. i a s

derivative at

a

point

of v

~

well

the U

in

as

cost

C3. 1 5 )

we

functional

direction

~

~

U

:

< C sgn

wCx))C-

I

WlCX)

XlVCXa)~b

]

CAwZkp

+ ZlfAq)XlVCX2D~n

[ x E

Q~

I

dx

+

>

References

[1]

Adams

(2]

Begis D. , G l o w i n s k i R. c I g ? J ) , Application de la elementes finis a l'approximation d'un probleme optimal. Applied ~thematics and Optimization ~

[3]

Bendsoe M.P. , O l h o f f N. , S o k o l o w s k i J.Clg853, Sensitivity analysis of problems of elasticity with unilateral constraints. Journal of Structural Mechanics 13 : 2-Ol-a~.

[4]

Cea J.Clg71), Paris.

R.A. c I g T J ) ,

Sobolev

Optimisation.

Spaces,

Theorie

Academic

et

Press,

algorithmes.

New

York.

methode des de domaine : 130-16g.

Dunod,

Chenais D. C l g S 7 ) , Optimal design of midsurface of shells : di f f e r e n t i a b i l i ty proof and sensitivity computation. Applied Mathematics and Optimization 16 : g3-133.

173 [6]

Ciarlet Lecture

[73

Delfour M. , Z o l e s i o 5. P . ( 1 0 8 8 3 , v i a m i n m a x differentiability. Optimization a 6 : 834-86~..

[83

F u j i N. C 1 0 8 6 3 , N e c e s s a r y conditions zation problem in elliptic boundary Journal on Control and Optimization

[0]

H a u g E.J. , C e a ]. e d s C l g 8 1 ) , Optimization oF distributed parameter systems. NATO A d v a n c e d Study Institute Series, S e r i e E, 40, S i j t h o f f and Noordhoff, Alphen aan den ~ijn, Amsterdam.

[I0]

Hlavacek I., N e c a s J. Clg82.3, elliptic unilateral boundary sent method. RAIRO Numerical

[11]

bILchlin S.G. C l g 7 O D , physics. Mir, M o s c o w

[1~]

Miyoshi T. , ( 1 0 7 6 ) , A m i x e d f i n i t e e l e m e n t method solution oF the Karman equations. Numerische $5 : 255-360.

[13]

M u r a w i e O L.A. C I g 8 4 5 , On the existence of the solutions to variational problems in domains with free boundary. Doklady of the Academy os ~ciences os the USRR, Mathematics 248 : 541-S44 (in Russian).

[14]

Myslinski A. , S o k o l o w s k i 5. 0

Q) .

The

equations of

V

(x,t)

~: Q

9

V

{~

i

~

R

(7)

i=i are

a .lJ .(x,t)

of

side

condition

n a

where:

the

(6)

(4]

(i)

+

real

(5)

C OO

functions

constitute

is w r i t t e n

in

the

aij(x't)

c~

defined

a Neumann

following

on

[closure

The

problem.

left-hand

form

n Oy

_

O))A

Oy(x,t)

-

Ox.

i,j= 1

q[x,t)

C8)

3

a where:

is

a normal

derivative

at

r

directed

towards

the

0)? A

exterior being

of

the q(x,t)

First

we

unique

, c o s ( n , x i)

=

the

shall

For

this

introduce

case

the

F

prove of

Sobolev =

a Hilbert

an

y(x,t-r)

mixed

any

direction

~]

cosine

of

n

n

-

and (9)

conditions

for

initial-boundary

the value

existence problem

of

(i)

+

a (5)

L2(Z).

pair

space

of

Hr'S(Q)

HOc0,T;Hr(~]))

space

i-th to

+ v(x,t]

sufficient

v 6

for

is

exterior

the

where

purpose

Hr'S(Q)

Js

at

c(x,t)

solution

for

which

Q

normal

normed

n

real

numbers

([I0}o

p.

HSc0,T;H0(~]))

by

6)

r,s defined

~

0

we

by (I0]

176

II y ( t )

I[ 2

dt

+ II y II 2

H r (~]) where:

the

Chapter

The

spaces

i

[9]

of

problem

the

(5)

the

we

H s ( 0 , T ; H ~ (~]))

are

for

initial-boundary

defined

in

on

cylindeer first,

on

Q2

. In t h i s

way

introduce

15.2

the

=

Qj

Theorem

the

the Q

mixed can

solving ' etc. the

be

proved

(i)

until

+

the

solution

using

(5)

on

procedure

in t h e

a the

covers

previous

step

one.

((j-l)r.jr)

the

solution

in t u r n Q

next

simplicity,

Using

and

i.e.,

and

cylinder

determines

Ej

+

method, Q1

whole

For

a unique

(i)

constructive subcylinder

Hr(~2)

respectively.

existence

value

(11)

H s ( 0 . T ; H ~ (~]))

following

Q x Ej

([I0],

p.

, Ej

811)

notations: = F

• Ej

can

we

for

prove

j =

the

1 ..... K

fol l o w i n g

result. Lelrm~a i Let u ~ H -I/2'-I/4

f]

(Q)

~ H -I/2,-I/4

v e

L 2 (E)

(12)

(Q])

(13)

where f.(x,t) J yj_l(', qj

u(x,t)

(j-l)~)

- b(x,t)yj_l(x,t-r)

~ H I/2

(~])

(14)

e L2(Zj)

(15)

where qj(x,t)

=

Then,

there

exists

mixed

initial-boundary

+ v(x,t)

c(x,t)yj_l(x,t-r

a unique

solution

value

problem

~ H3/2"3/4(Qj)

yj (i),

(4),

for

the

(14).

Proof We

observe

that

for

j = 1

,

yj_llQo

(x,t--r)

=

9

o

(x,t--T)

and

177

yj_iIE,

(x,t-r)

respectively

= 9o(X't-r)

Then

the

assumptions

O

(13) 9 O

(14)

,

H3/2

E

and

3/4(Qo)

assumptions solution

are

(15)

are

Yo

HI/2(Q)

Yl ~ H 3 / 2 " 3 / 4 ( Q I )

f2

H-I/2"-I/4(Q2)

,

easy

to n o t i c e

the

that

Yl 6 H3/2"3/4(QI) 3.1

([9],

the

p.19)

we

[0,r] Trace

that

~

exists the

a unique

Theorem Let

,

for

that

and

ylIXI

follows

2 1

p.

9)

Qj

E

2)

a

uniqe

to v e r i f y L2

(X 1 )

the

using

that

fact

that

Theorem

implies

that

continuous HI/2(~).

(Ql)

. We

implies

Then,

shall

from

mapping

L 2 ( X I)

j = 3 ..... K

from

Again

H 3/2"3/4

is

It

the

continuous

ylIXl

Y2 e H 3 / 2 " 3 / 4 ( Q

~ yl ~

linear,

Thus,

for a n y

from

is yl(-,T)

,

[I0] a

~-

These

of

have

Then

that

L 2 ( X o)

Yl ~ H S / 2 " S / 4 ( Q I )

hence .

~

assume

existence we

HI/2(~) (13)

we

j = 2

yl(-,t)

is

solution

result

now

of there

sun~narize

.

1 and

9o ~ H3/2"3/4(Qo) there

exists

a unique

initial-boundary

value .

j = 1 ..... K

2. P r o b l e m

shall

Neumann

formulation.

now

formulate

problem.

horizon

The

time

The

performance

Icy)

:

u

be

" 9o ~ L 2 ( X o)

for

We

prove

(Theorem

yo,9o,9o,V

Then.

can

, H I ' I / 2 ( X I)

foregoing

~o

the

u ~ H-I/2"-I/4(Q)

YlIZ 1

H3/2"3/4(Q I)

~-

c HI/2(Q)

Theorem

Yl

and

ensure

condition

t

~ H3/4(~)

if

" Next

yl(-,r)

and

mapping

the

to

sufficient

fulfilled

problem

the

(I) +

optimal

T

is f i x e d

functional

by

t,v)

and

u ~

(5).

Moreover,

~

HI/2(Q)

H-I/2"-I/4(Q). for

the

y(-,jr)

mixed

~ HI/2(~)

Theorems

boundary control problem U = L2(X)

in o u r

is g i v e n

Yo

y ~ H3/2"3/4(Q)

Optimization

us d e n o t e

with

, v ~ L2(E)

solution

Let

SQly(x

given

the

space

of

Iz

(Nv) vat dt

for

the

controls.

problem.

by

- Zdl 2 dxdt

+

(16)

178 where: N

k I k

is

0

and

a strictly

Finally,

we

kl+k 2

positive

assume

the

> 0)

;

linear

zd

is

a

operator

following

on

constraint

given element L 2 (E) into on

controls

in L~(Q); L 2 (Z) . v

e

Uad

where Uad

Let

is

a

closed,

y(x,t;v)

value v E

denote

problem

Uad

(i)

We

~

control

problem

I (v o)

I (v)

Then

from

a unique by

the

from

the

c is

V

the

(x,t)

the

-

(17)

of

the

mixed

1 that

initial-boundary

to

for

a

any

is

given v

e

v 9

simplify Uad

solution

, we of

8p(v) at

p(:

0

characterized

condition

of

the

Z

0

V

performance

v 6

Uad

(18)

functional

(16)

we

can

express

(18)

form

Zd) (y(v)

(19),

we

define the

- Y(Vo))

dxdt

+ k 2 LNVo(Vg

- v o)

introduce the

the

adjoint

adjoint

variable

equation p

= p(v)

dr

dt

Uad

and

h

0 (19)

for

= p(x,t;v)

every as

the

equation

+ A

(t)

=

ap (v) at

Uad

wel l-defined

V v 9 To

control

p

(v - v o)

form

U

(16)

control

following

k I ~Q(y(v ~

Uad

of

corresponding

Theorem

L2(Q)

Theorem

following

i17 t h e

solution

at

equivalent

v 9

optimal

I' (v o) Using

(5)

subset

functional

H3/2"S/4(Q)

~

the

+

note

performance y(v)

convex

p(v)

kl(Y(v)

+ b(x,t+T) -

p(x,t+r);v)

zd)

x

= ~

~

,

t

9

(0,T-T)

(20)

(T-T,T)

(21)

*

+ A

=

(t)

0

p(v)

=

kl(Y(v)

-

z d)

x 9 r~ , t e

x

,-s_- f )

(22)

179 ap(v)

(x,t)

=

c(x,t+'r)

(x,t)

=

0

p(x,t+T;V)

X ~ F

, t ~

(0,T-r)

(23)

x

,

(T-r,T)

(24)

A ap(v)

E

F

t

~

A where n

A

(t)p

ap(v)

00

__ ~ i,j=l

(x,t)

,

existence

=

apply

the

Lemma

1

solution

zd

(with

v

t = T

and

in

of

Q

change time,

(20)

on

(20)

+

first,

this

t'

on

be

to

solved

solving

(20)

until

purpose, to

the

is e a s y

can

etc.

of v a r i a b l e s )

i.e.,

(24)

It

(24)

QK-I

. For

+

method.

, i.e.,

turn

cylinder

sense

problem

. problem

an o b v i o u s

reversed

the

a constructive

from

QK

whole

for

using and

startffng

the

(x,t)

+ tile

we

may

problem

(20)

= T-t)

2 hypothesis

z d ~ L2(Q)

We

given

covers

(with

the

p(v)

c o s ( n , x i) --~--]

proved

subcylinder

Theorem

(24)

Let

for

be

in t i m e

on

Op (v)

aji(x,t)

a unique

can

that

procedure

+

of

Q

backwards (24)

(25)

i,j= 1

cylinder notlce

[ a i j ( x , t ) a--~i lap

n ]~

A

The

~a

and

of

any

~ HS/2"3/4(Q)

simplify

purpose (20),

(19)

setting (21)

and

Q

(21)

we

x

get

X1

f

for

using

(y(v)

(T-T,T)

(Y(Vo)

1

v ~ L2(Z)

v = v

by

Theorem

the

the

satisfied.

there

exists

problem

adjoint

in

o - Y(Vo))

respectively

- z d) (y(v)

be

(20)

+

and

- Y(Vo))

unique

+

(24).

multiplying

both

+

(20)

integrating

then

adding

dxdt

glven

solution

(24).

equation

then

for

a

(20)

(24),

Then,

over both

~ sides

For

this

sldes x

of

(0,T-T) of

(20),

=

Q aP(v o ) . ] 0t + A ( t ) p ( v o) (y(v)

[ =

IQ

Y(Vo))

dxdt

+

T--T + f

f o

b(x,t+~)

p(x,t+T;Vo)

(y(x,t;v)

- y(x,t;Vo))

dxdt

=

180 =

#

p ( v o) ~

(y(v)

-Y(Vo))

dxdt

+

- Y(Vo))

dxdt

+

q + ~

A*(t)

Q

P[Vo)(Y(v)

T--T + f

f o

Using

the

(26)

b(x,t+r)

p(x,t+T;v

) (y(x,t;v)

f) equation

can

be

(i),

rewritten

~ Q p ( v o) - ~

-y(x,t:Vo))

dxdt

(26)

o

(y(v)

the

first

integral

on

the

right-hand

side

of

- Y(Vo))dxdt

-

as - Y(Vo)]dxdt

=

-~QP(Vo)A(t)

(y(v)

T - I

# P ( x ' t ; v o) b(x,t) ( y ( x , t - T . v ) - y(x,t-r;Vo)) dxdt = o ~] T-r = - f P ( V o ) A ( t ) (u - Y(Vo))dxdt -I J" P(x'L'+~" ;Vo)b(x't'+r) Q --T ~ (y(x,t' ;v)

The

second

Green's

- y(x.t' ;Vo]]

integral

formula,

A

on the

right-hand

can be e x p r e s s e d

(t)p(Vo](Y(V)

(27)

dxdt

side

- u

=

lay (v)

oy (Vo) ]

o Using

the

boundary

right-hand

side

T

of

condition (28)

[~y (v)

0 optimal

problem (34)

condition

following

Theorem

exists

to

and

(35)

be

+

(39).

also

performance

with

constraints

v

in

the

form

(33)

wit]]

(17),

there

fulfilled.

the

control

rewritten

o

which

functional on

control

satisfies

the

maximum

(32).

Re#erences [I]

[2]

[3]

[4]

[5]

[6]

[7]

[8] [9]

[I0]

[ii]

[12]

Knowles. G . , "Time-Optimal Control of Parabolic Systems with Boundary Conditions Involving Time Delays". J. Optim. Theory AppI., Vol. 25. 1978, pp. 5 6 3 - 5 7 4 . I I,

(ek)

P k + /JkC'+ Pk+IG'B = (Pk+ PkO')'A+ (Pk+i+ Pk+IC')B + (pk+ PkC')H~

I t e m ark

If A be the s u p r e m u m of the real p a r t s of the n o n z e r o eigenvalues (AI) of the m a t r i x B , then, for any t,

IlCz - P~)s'~ll

_< c, ~'

w h e r e G is a p o s i t i v e c o n s t a n t , and k < O. T h e v e c t o r P k ( f . ) ( I -- PB)r ~B belongs to R ( B ) , and is called t h e k-th o r d e r b o u n d a r y layer term.

T e n d i n g 9 to O, w e get

(S_~)

0 = po B po(o) + Po(o)

(so)

Po

PoG"

= p

= poA + p i B

+ p o H ~I

= PoC'-A+ P o C ' H 9

a n d , for k > 1, p~(o) + pk(o)

= o

Pk = P k ' A + Pk+1 B + P k H ~

(Sk)

/6k(7" = Pk6"'A-+ P ~ G ' H 9 Using the decomposition of ~ n

p = p+ p w e get finally

with

for every p, w e have

~'B = 0

and

PPB = PPB = "P,

193

(s_~)

0

po(O) + ?o(O) (So)

= /3oB =

p

/~o = poA + ,6tB + PoHi)

? oC" = ['oc'-E + PoC "H a n d , for k >_ 1,

p~(o) + ?k(o)

(sk)

Pk

= 0 =

pkA

+

Pk+lB

+

pkH~l

?~c" = ?kC"T+ i'kC'U~ in I t b - f o r m , we get

And

po(0) + 2;0(0) (So)

=

p

dpo

= poAdt + p t B d t + poHdy

dPo

=

P o C ' A C -tdt + P o C S l t C - ' d Y

and, for k >_ 1,

pk(o)

+ ?k(o)

dpk

(Sk)

= o = pkAdt + @k+iBdt + pkHdy

di'k = @kC'AC-'dt + ['kC'~C-'dy Remazks

9

C o n s i d e r t h e s y s t e m ( S _ I ) ; a n d , we h a v e , for e v e r y t

poCO = FoCt) and,

d'fo

= ~oAdt + ~lBdt +'PoHdY

a n d , if we a p p l y t h e o p e r a t o r Pn~ we o b t a i n

d~o = "~oAPBdt + poH PBdy Now,

by the unicity of the decomposition

of semimartingales, w e have

~'oH = FoH pn. This m e a n s

that w e m u s t verify the following equality

PDH

= PBHPB

or

MIT

= MHPB.

1 94 F o r i n s t a n c e , t h i s e q u a l i t y is v e r i f i e d if t h e m a t r i x H is d i a g o n a l c o n s t a n t on e v e r y b l o c k of the matrix

B:

A s PB a n d B h a v e t h e s a m e b l o c k - s t r u c t u r e ,

we o b s e r v e e v e r y b l o c k of B .

P~ and H commute.

In the sequel, we shall assume that. 9

We

need to a s s u m e the invarlance of k e r B by the operator A in order to justify the

existence of the equations in ]~k. 9

T h e c o m p o n e n t Pk is obtained by solving the Poisson-equation

d (pk-l-- fo'pk-l(s)Hdy)--pk-tA Let J~fn be the inverse of B in the subspace R ( P n -- I) :

HBB

= B I I B = P D -- I

and

HBPD

= P D I I B = O.

Then

~k=--~d( P k - 1 - - fo'Pk-l(S)Hdy)~ + pk-l.4~v. 9

T h e c o m p o n e n t P'k satisfies the following equation

d'~k = p k A P B d t + ~'ktt P B d y , obtained by applying the operator PD.

5

Asymptotic

Result

In this section, we give an approximation result for the (m + 1) th order. Towards this end, we consider the following system of equations

p0(0)+ P0(0) (So)

a n d , for 1 < k < m ,

=

p

dpo

ffi poAdt + p l B d g .4- p o H d y

dPo

= P o G * A G - S d t + PoG*HG-Sd~/

195

pk(o) + e~(o) ( s k)

=o

tips, = p k A d t + p k + t B d t + p k H d y = PkC 9149

dPk We choose

Pm+ls u c h

+ PkC*HC- 9

that

p.~+~(0) dpm+t Proposition

= pm+J.Adt + Pm+lHdy

G.I

F o r a n y t E [O, T1, w e h a v e

E~Clp'(~) - ~ "kCPkCt) + PkC~)c'C~))123 = oC.~I",+~)) k=O

Proof We d e f i n e , for a n y t E [0, T],

r~.C~) =

p'CO - ~ ~kCp~C~) + PkCOc'ctl)) - ~ ' + I p . , + I C O k=O

It can be s h o w

that r m9 is solution of the following equation B

dr" =

r'~(O) = Thus

, ~ C - + Ale~+ r~,Uey

0

we have

(t(r:(s)(B+ A), B u t , for every p, we have C P ( ~ + A L p )

fot

fot

< 0, so we obtain

~(Id.(~)l ~) -< ~'('+~)o~ + I1~ 11'

/: ~ClrkCs)l~d.

Using proposition 4.3, and then the G r o a w a l t - B e llman inequality, we obtain, for any t E I0, T]

~.(l~c,)l ~) _< ~(..+,o and we get the desired result.

196

Decentralization and aggregation

6

We consider the systems (SO) and (Sk)

vo(O)+

?o(O)

(So)

= p

dpo

= poAdt + P t B d t + p o H d y

d~'o

= fioC*AG-*dt + PoC*HC-*dy

and, for k > 1,

pk(0) + ?k(0) (Sk)

dpk

=0 = p~Adt + ~k+xBdt + p ~ H d y

dPk = P k C * A G - C d t + P k G * H C - * d y If we apply the o p e r a t o r PB to the different equations of the systems (So) and (Sk), we get (S'o)

P~176 d~o

=F = p o A P B d t + poHPl3dy

and, for k _> I,

(sk)

LCo)

=0

d~k = pkAPmdt + p k H P B d Y B y subtraction, we have Po(o)

--

d]30 = p o G C A G - ' d t + ~6oC~HC-Cdy and, for k > 1,

~(o) + ?k(0) -- 0 (Sk)

d~k

= ~ k A ( I - - P B ) d t + Pk+IB d t + p k I t ( I -

dP k

= PkC*AC-'dt + Pk_xC*HC-*dY

PD)dy

Thus, by the block-diagonal structure of B, /5k and /Sk m a y be calculated by solving all equations on every block of B. The dimension of these systems is reduced.

Now, consider the systems (~'o) and (S'k) ~o(O)

=

CSo) d'fo = "foAPBdt + "foH PBdy

197 a n d , for k > 1, ~k(o)

CTk)

= o

d'pk = p k A P f l d t + pkH PBdy or,

cT,~)

~k(O)

P u t p ~ = PkQ , P ~ =

PkQ a n d A ~ = M A Q .

d'~

= 0

= "fkAPBdt + ~ k A P n d t + "fkH PBdy + ~kH PBdy T h e n we o b t a i n t h e a g g r e g a t e s y s t e m s

(s~) dp a = paAadt + paHdy a n d , for k > 1, p~(0)

dp~

= 0

= p~Aadt + ~ k A Q d t + "pkHdy

Remark T h e s e d i f f e r e n t e q u a t i o n s a r e of d i m e n s i o n

r, w h e r e r is t h e n u m b e r of t h e e r g o d i c s e t s

of t h e m a t r i x B . References [1] C o d e r c h - W i l l s k y - S a s t r y - C a s t a n o n ,

Hierarchical Aggregation of Linear S y s t e m s with Mttl-

tiple T i m e Scales , I E E E T r a n s . on A u t . C o n t r o l , A C - 2 8 , 1 0 1 7 - 1 0 2 9 , 1983. [2] C o d e r c h - W i l l s k y - S a s t r y - C a s t a n o n ,

Hierarchical Aggregation o/ Singularly Perturbed Fi-

nite Stats Markoe P r o c e s s e s , S t o c h a s t i c s , 8, 2 5 9 - 2 8 9 , 1983. [3] C o u r t o i s , Decomposability, A C M M o n o g r a p h [4] D e l e b e e q u e ,

S e r v i c e s , 1977.

A Reduction Process for Perturbed Markoe Chains , J. S i a m A p p l .

Math,

43, 2, 325-350, 1983. [5] D e l e b e c q u e - Q u a d r a t ,

actions , A u t o m a t i o n ,

Optimal Control Markoe Chains A d m i t t i n g Strong and Weak Inter17, 2 8 1 - 2 9 5 , 1981.

[6] K a t o , Perturbation T h e o r y / o r Linear Operators, S p r i n g e r - V e r l a g , 1966. [7] M a r c h e t t i ,

Mgthodss de Perturbations Singuli~res en Filtrage non Lingaire , T h ~ s e , U n i -

v e r s i t 6 d e P r o v e n c e , 1986.

FILTRES APPROCHI S POUR UN PROBLI ME DE FILTRAGE NON LINI AIRE DISCRET AVEC PETIT BRUIT D'OBSERVATION P a u l a Milheiro de Oliveira * INRIA centre de Sophia Antipolis BP 109 06561 Valbonne Cedex (France)

Abstract W e study the ~ymptotic behaviour of a nonlinear one-dimensional filtering discrete time problem, as

some parameter r tends to O. We treate the case of a nonlinear discrete time problem coming from a continuous time one with small observation noise. Finite dimensionM aproximate filters are proposed and results concerning estimations of their performance are stated and proved. For the result concerning the error between the approximate filter and the optimal one, only a sketch of the proof will be given. It makes use of probability changes and differentiation with respect to the initial condition. FinMly, we present the results obtained when applying those filters to an example and we notice that the propositions previously stated are verified numerically.

1

Introduction

On consid~re le prohi~me de filtrage non lin6aire unidimensionnel, eu t e m p s discret, suivant: On a un signal {X~} v6rifiant

x~+, = x~ + b (x~) zx, + . v ~

~ + , , x0 = ~

(1)

et on dispose d ' u n e observation y donn6e par g

o~ {wk} et { ~ } sont des bruits blancs gaussiens s t a n d a r d s ind6pendants et ~ une v.a. ind6pendante de {w~} et {~h~}. Le param~tre e eat suppos6 petit et A t = ~ , cz > 0. Le probl~me de filtrage consiste b. calculer, pour toute fonction mesurable r l'esp6rance conditionnelle de r 6taut donn6e l'observation yt j u s q u ' k l'instant k E {0, 1 , . . . , K}. On remarque qu'on peut regarder ce probl~me comme 6tant la discr6tisation d'un probl~me en temps continu, n s'agit, k cheque instant t E [0, T], d'estimer le signal {X,} solution de l'6quation diff6rentieUe stochastique dXt = b ( X t ) d t + a d w : , X0 = ~ (3) au vu de l'observation {Yt} donn6e par

dY~ = h(Xt) dt + ~dw~ , Y0 = 0 "et Faculdade de Eugenlmria da Universidade do Porto, Run dos Bragas, 4099 Porto Codex (Portugal).

(4)

199

oh wl,w 2 sont des processus de Wiener standards ind6pendants et ~ est une variable al6atoire ind6pendante de w 1 et w 2. (cf. [Le Gland], [Picard]). I1 est naturel de penser que si a est grand (donc At petit) le comportement de (1)-(2) est proche de (3)-(4) alors que, si a est petit, il s'en 6carte. I1 se trouve en falt que le cas critique est a = 1. Plus pr6cis6ment on s'int6ressera au calcul de l'estimation du signal {Xe} sachant l'observation j usqu'~, l'instant k, )(k = E[Xe[Y0k], laquelle minimise l'erreur quadratique moyenne. Vu les difficult6s num6riques qu'un tel calcul pr4sente, on cherche des approximations de {)(a} (filtre optimal) donn4es par des sch6mas r6cursifs. Le comportement asymptotique de ces filtres lorsque r tend vers 0 fern objet de notre 6tude.

2

Construction

des filtres approch6s

Pour des raisons li6es g l'application d'une m6thode de changement de probabilit6s, pour obtenir les estimations de l'erreur quadratique moyenne, il sera plus commode de consid6rer notre observation sous la forme suivante:

~+~ = h(x~) + ~

~+,,

(5)

off {va} est un bruit blanc gaussien standard ind6pendant de {wk}. Formellement, l'observation ffk+~ contient la m6me information que l'observation yk. On suppose que ~'~ est la tribu des observations jusqu'~ l'instant k:

Alors 17d = y0~-I ce qui signifie que le probl~me de filtrage pour 17"~ correspond en fait ~t une pr6diction pour Y0k. On consid6re le processus Me+t=Me+b(Me)At+0(~e+l-h(Me)),

Mo=mo

(6)

et on veut 6tudier la fa~on dont il approche Xk = E[XklY~]. Des rdsultats qu'on obtiendra sur l'erreur de pr4diction Xk - Me on d6duira immediatement les r6sultats 6quivalents sur l'erreur de filtrage, puisque, voir le paragraphe 2.1, on a

I s - .eel < c a t , o~ Re = E[Xe_d~?]. Pour la construction du flltre approch6 on peut s'inspirer de ce qui se passe dans le cas lin6aire et ainsi, par g6n6ralisation, choisir une approximation du gain. L'estimation de Xe - Me sera obtenue b. partir de l'6tude d'une suite r$currente. Par contre, pour estimer )~k - Mk, on introduira d'abord deux changements de probabilit6s et on obtiendra une expression pour Xe - M~ qui fait intervenir les d4riv6es de Xk par rapport ~ la condition initiale. Sur cette expression, on conditionnera les diff6rents termes par rapport ~ la tribu ~k des observations et on proc~dera ~ des estimations asymptotiques. Selon la valeur de c~, les changements de probabilitds choisis seront diffdrents et on aboutira, naturellement, ~. des expressions asymptotiques diff~rentes pour Xe-Me. Darts une premiere section on 6tudie le cas a > 1 (section 2.1) et ensuite le cas a = 1 (section 2.2). Le cas cr < 1 offrant certaines difficult6s dans l'application de cette m.4thode de d6monstration, on se limitera k faire une remarque proposant un filtre approch6 "performant".

200 HypothSses:

On suppose que

(H1) ( est une variable aJdatoire de loi de probabilitd po telle que I

poet', fl ol'Po( )d

_ eh > O, Vz. (It3) best une fouction C 2 ~. d~riv&s born~es, [b'(z)[ _< lib'I[, Vz. (H4) A t = ~ , 2.1

Le cas

a>_ l e t t , . = k A t . a>

1

On considSre d'abord le cas oh At = ~', avec a > 1. On propose un filtre approch~ unidimensionnel et, pour ce filtre, on estime XL. - M~ et )(~ - M,.. Le filtre a p p r o c h d On prendra 8 = a At darts le sch+ma (6), i.e. M,.+~ = Mk + b(M~) A t + a A t (Ok+x

h(M,.))

Mo = mo

(7)

On obtient facilement le r6sultat suivant: P r o p o s i t i o n 2.1 Mk dtant donng par (7), on a l'estimas r

tk+l 1

E[(Zt.+t - M,+,)2I 5 c e x p t - c T ~

+ ce.

(8)

Preuve On peut utiliser les d~veloppements de Taylor des fonctions b e t h, b(X~) = b(M,.) + b'((X)(x~ - M~) et h(X~) = h(M~) + h'((~)(X~ - Mk) , pour obteuir l'~galitd Xk+l - M,.+, = [1 - ~r At h,({x) + b,(c.x) At] (Xk - M~) + a v / ~ { w k + , - v k + 0 .

(9)

E

On utilise les hypothb.ses pour dfimontrer que, pour v "assez petit" (e _< (ah)-~l{"-l)), B EI(X%+, - M~+,) 21 _ (1 - d)k+'E[(Xo - M0) 21 + ~ - , o~t

At A --a- 1 - ( 1 - c h - - + t l b ' l l A t ) 2

qi.

G i has Then

no

free

D i can

be

A compatible partitioning

Gi2]0

G i D i -- Gil Dil + Gi2 Di2 -- M i

(22)

Since Dil is invertible: Gil = [Mi-GizDi2] Di}

(23)

Gil is used to satisfy GiD i = Mi, a n d Gi2 is obtained by optimizing Ji.

The

matrix G i can be written: G i -- Gil E l + Gi2 E 2

(Z4)

where E 1 and E 2 are selection matrices. Substitute

(23) into (24):

G i -- H i + Gi2 T i where

(25)

H i = M i Di~ 1 E l T i = (E 2 - Di2 Di~ 1 E 1 ) It remains

propagates

as:

to determine

K i and

Gi2 that minimize

Ji"

The

error

covariance

214

# = FiP + PF~ + BiQB I + KiRiK ~ + (Hi + Gi2T i) Si(Hi + Gi2Ti) T

(26)

P(o} = Po The Hamiltonian corresponding to the optimization problem defined by equations 17, 19, 26 is given by: B = trace (UiP + [FiP + PF[ + BiQB l + KiRiK ~ + (Hi+Gi2Ti)Si(Hi+GizTi)T]A T where

A, m a t r i x of L a g r a n g e

multipliers, is determined

(27)

by:

(28)

=- aB =- [Ui + F i' A + ,%Fi]

A(Tf) = V i

(Z9)

K i and Gi2 are obtained from the necessary optimality conditions (30)

al~/aK i = 0 a n d a~/aGi2 = 0

These conditions give: K i = p R~ ~

(31)

Gi2 =

(32)

- H i S i TiT(Ti S i T~)-*

The gain matrix, given by equation (31) is the Kalman gain associated with the following subsystem:

z i = A i zi + [Bi

Gi][ wvi ]

(33)

Yli = zi + ;Ji S i n c e (Ai, I i) i s o b s e r v a b l e

(34) f o r a n y A i t h e n F i : [ A i - K i] i s s t a b l e . Q.E.D.

S u f f i c i e n t c o n d i t i o n f o r t h e s t a b i l i t y o f t h e l o c a l f i l t e r c a n be s t a t e d in t e r m s o f the observabilty

~t

of the original system.

Ei = [ Li

Theorem 2:

0]

If (A, L i) is observable then the local filter is stable.

Proof: Consider equation (6). Define :

=

[ -Alp

-MN] ;

~i = [li

O]

215 where

(A, Li)

is observable if and only if

(F, C i) is observable since they are

related by a similarity transformation (T i in eq. 7).

p( [ ~ Write

-T-r F Ci

(~I )n-l~lr

...

Therefore:

) = n

[Ai o]+[o

F =

0

0

Then:

ci~k [li~ where

•k(1,2)

o]

+

[o

li.k(1,21]

is t h e b l o c k (1,2) o f n k c o m p a t i b l e

with the partition

used

here.

The observability matrix corresponding to the system (F,Ci) is:

Ci F

[ Ii

0

i Ai

0 :

:

i

Am-I

0

0

0

a1(1,2)

o

0

am_l(1,2)

o

LG

nn_l(1,2)

+

J: : ".11-I

~i ~ n-I

The Therefore

n

.

columns

Ai

of

the

above

observability

matrix

are

t h e m c o l u m n s of t h e f o l l o w i n g m a t r i x a r e l i n e a r l y

[ I i T AI

""

independent.

(A~I~-I]T

This corresponds local filter designed

linearly independent.

to t h e o b s e r v a b i l i t y using

equations

m a t r i x o f t h e local s u b s y s t e m .

(33) a n d

(34) is s t a b l e .

Q.E.D. V.

Consider

the 3rd order

EXAMPLE

system:

[120] ill

x=

2

4

5

1

I

3

x

+

2

1

w

Hence the

216 Let x I be the observed variable at Station variables at Station

(I), a n d x 2 and x 3 be the observed

(2), i.e.,

Zl = Xl

Therefore,

it is s o u g h t , to e s t i m a t e z i a t S t a t i o n S i.

C o n s i d e r S t a t i o n (1).

The measurement

is g i v e n

by:

Ytl = zl + Pl Let the required

additional

m e a s u r e m e n t on t h e i n t e r f a c e

variable

z I = x 2 be

given by:

Y12 = [1 Let:

2] r z{ +

v1

Q = R 1 = I; S 1 = Zx2 identity matrix Using the notations of the paper: A 1 = I,

M I = 2,

Then:

GIDI=M1

gives

Let

Dll = I

Then

G l l = (2 - 2 G12) G 1 = [GII

=

E 1 = 2 [1 1]

-.

G1 = [2

0]

+ G12 [ - 2

2 [1

0] 0]

=

0]

+

=

[2 [-2

G12 [0

I]

0] 1]

1]

[:][ [:]1

0l - [Z 01 -

G 1 = [.4

It remains

2]

GII [I

T 1 = [0

GI= [2

Finally:

= [I

GIIDII + GI2DI2 = GII + 2G12 = Z

GI2]

H1 = M1 D [ }

And :

DII r

[ - 2 1] -

[ - 2 11

.S]

to s o l v e f o r K 1. U s i n g e q u a t i o n

(31}, t h e

T h e n , t h e local f i l t e r a t s t a t i o n (1) is g i v e n by:

z 1 = - 1 . 6 7 z 1 + 2 . 6 7 Y l l + [ .4 S i m i l a r l y , a local f i l t e r is d e s i g n e d

. 8 ] Y12

for station

{2).

steady-state

g a i n is 2.67.

217 VI.

The p a p e r

a d d r e s s e s t h e d e s i g n o f d e c e n t r a l i z e d f i l t e r s for l a r g e - s c a l e i n t e r -

connected systems. models.

The

variables. unbiased

These filters are

decoupling

These and

CONCLUSION

is

independent

stable.

designed

achieved filters

use

measurements

local

Sufficient condition for

o b s e r v a b i l t i y of original s y s t e m .

u s i n g i n d e p e n d e n t local dynamical

through

of

the

information only

and

interface they

are

s t a b i l i t y is s t a t e d in t e r m s of t h e

An example is g i v e n .

ACKNOWLEDGEMENT

This research was sponsored in par$ by the National Science Foundation under grant

ECS-8707139,

Research

Center

in part

under

by the

grant

West

PEL-21-88,

virginia University Energy and

in part

by

ONR

and

under

;r

contract

NO0014-K-0651.

REFERENCES

[1]

J.L. S p e y e r , "Computation a n d T r a n s m i s s i o n R e q u i r e m e n t s f o r D e c e n t r a l i z e d LQG C o n t r o l P r o b l e m s , " IEEE T r a n s . A u t o . Control, Vol. AC-24, No. 2, p p . 266-269, 1979.

[2]

C.Y. Chong, " H i e r a r c h i c a l E s t i m a t i o n , " P r o c e e d i n g o f 2 n d I~IIT/ONR C3 W o r k s h o p , M o n t e r e y , CA, J u l y 1979.

[3]

A.N. Willsky, et.al., "Combining a n d U p d a t i n g of Local E s t i m a t e s a n d Regional Maps Along S e t s of O n e - D i m e n s i o n a l T r a c k s , " IEEE T r a n s . Aurora. Control, Vol. AC-27, No. 4, pp. 799-813, 1982.

[4]

A.T. Alouani, J.D. Birdwell, " D i s t r i b u t e d Estimation: C o n s t r a i n t s on t h e Choice of t h e Local Models," IEEE T r a n s . A u t o m a t i c Control, Vol. AC-33, No. 5, pp. 503-506, 1988.

[5]

C.W. S a n d e r s , E.C. T a c k e r , T.D. L i n t o n , "A New Class of D e c e n t r a l i z e d F i l t e r s f o r I n t e r c o n n e c t e d S y s t e m s " , IEEE T r a n s . A u t o m a t i c Control, Vol. AC-19, pp. 259-262, 1974.

[6]

A. Feliachi, R.K. S p e r r y , " D e c o u p l e d F i l t e r s f o r I n t e r c o n n e c t e d S y s t e m s , " P r o c e e d i n g s o f 27th IEEE-CDC, A u s t i n , TX, p p . 2367-2368, December 1988.

[7]

E.C.Y. Tse, J.V. Medanic, W.R. P e r k i n s , " G e n e r a l i z e d H e s s e n b e r g T r a n s f o r m a t i o n f o r R e d u c e d - O r d e r Modelling of L a r g e - S c a l e S y s t e m s " , I n t . J. C o n t r o l , Vol. 27, No. 4, pp. 493-512, 1978.

SPLIT FORMS OF z - D O M A I N A L G O R I T H M S FOR LINEAR PREDICTION A N D S T A B I L I T Y A N A L Y S I S

Antonio Lepschy, Gian Antonio Mian and Umberto Viaro Department of Electronics and Informatics, University of Padova via Gradenigo 61A - 35131 Padova, Italy

A b s t r a c t - Many algorithms in linear prediction and s t a b i l i t y analysis may be expressed by the same general t w o - t e r m recursion involving two polynomials of consecutive degrees and the reciprocated polynomial of one of them. The properties o f these algorithms are interpreted from a geometrical point of view that refers to the loci described by the zeros of the polynomials in the related sequences as a characteristic real p a r a m e t e r varies. An analysis of all possible threeterm (split) forms involving only the symmetric and/or the a n t i s y m m e t r i c parts of three consecutive polynomials generated by the same t w o - t e r m recursion, is carried out. 1. I N T R O D U C T I O N A number of algorithms in linear prediction and s t a b i l i t y theory can be expressed by a recurrence relation for generating a sequence of polynomials of either ascending or descending degree [ I , 2] . The Levinson algorithm is perhaps the most popular of these. As is known, it was conceived in the c o n t e x t oF mean square estimation to find in a c o m p u t a t i o n a l l y e f f i c i e n t way an all-pole predictor. However, it may be used for d i f f e r e n t purposes; for instance, a tight relation exists between this algorithm and the s t a b i l i t y c r i t e r i a of Lehmer, Schur, Cohn, Marden, and Jury [ 3 ] , as well as the Nevanlinna-Pick problem ELI) 9 Another f a m i l y of z-domain algorithms comprises the so-called Routh-type methods, obtainable through a suitable variable transformation from the original s-domain three-term Routh procedure or from its t w o - t e r m form [ 5 ] . In particular, if the usual bilinear transformation is adopted, the algorithms for discrete-time systems modelling presented b y B i s t r i t z in E6] are obtained, whereas if the transformation s=(z +z- )12 is used, the B i s t r i t z s t a b i l i t y - t e s t procedure ~ 7 , 8 ] is obtained. Other families of z-domain algorithms may be found by transforming the s-domain Euclid-type a l g o r i t h m presented in ~ 9 ] . Much interest has been paid in the recent l i t e r a t u r e to unifying interpretations of some of the mentioned procedures. This has been done, for instance, by Benidir and Picinbono [ 10], by Delsarte et al. E 4 ] , and by Vaidyanathan and M i t r a [ I I ] according to either a network-synthesis approach or to an interpolation approach. Another a t t e m p t has been made in [ 12] for s-domain algorithms by adopting a different point of view; specifically, it has been shown that many algorithms can be regarded as particular cases of a unique general recursive procedure from which new interesting algorithms may be generated as well. A similar approach can be applied to zdomain algorithms as will be briefly outlined in Section 2 . In this way, it is also possible to give a geometrical c h a r a c t e r i z a t i o n of the various procedures along lines analogous with those followed in E13-] for the s-domain. The considered algorithms may be given either of the following recursive forms: ( i ) the t w o - t e r m form by which every polynomial in the related sequence is expressed by a suitable combination o f the preceding polynomial and its r e c i p r o c a t e d one, and [ i i ] the t h r e e - t e r m form which relates the s y m m e t r i c or a n t i s y m m e t r i c part of three consecutive polynomials in the sequence. The latter have been called in the literature split forms E14, 15], or immitance-domain forms

219

[ 16J as opposed to s c a t t e r i n g - d o m a i n ones. Such f o r m s are a t t r a c t i v e because they e n t a i l c o m p u t a t i o n o f a s m a l l e r n u m b e r o f c o e f f i c i e n t s and a l l o w one t o i n t e r p r e t the a l g o r i t h m s terms of reactance functions.

the in

In S e c t i o n 3 we c l a s s i f y and analyse f r o m the p o i n t o f v i e w o f the c o m p u t a t i o n a l c o m p l e x i t y all possible split r e l a t i o n s a s s o c i a t e d w i t h the same g e n e r a l t w o - t e r m r e c u r s i o n d e r i v e d in Section 2.

2. G E N E R A L

TWO-TERM

FORM

L e t us d e n o t e the i - t h degree p o l y n o m i a l in a s e q u e n c e by i

P.(z] = ~, a.. z j i iI j=0 and its r e c i p r o c a t e d p o l y n o m i a l by i P.(z) = z i P.(z -1] = ~ a.. z i-j . I

I

(1]

(2]

U

j=O The Common f o r m o f the t w o - t e r m

s t e p d o w n ( b a c k w a r d ) r e c u r s i o n s m e n t i o n e d in S e c t i o n 1 is

Ri(z) Pi I (z) = F.(z) P . [ z } + G.{z) P.(z) -

I

I

I

(3}

1

w h e r e Ri(z), Fi(z), Gi(z) are p o l y n o m i a l s o f s u i t a b l e d e g r e e (which is, in g e n e r a l , d i f f e r e n t From i and m a y d i f f e r From step to step) such t h a t Pi(z) is a Schur p o l y n o m i a l (all its zeros are inside the unit c i r c l e ) i f and o n l y i f Pi_l(Z) is Schur and the p a r a m e t e r s belong to c e r t a i n d o m a i n s . In p a r t i c u l a r , i f R.(z) = r

(4) = - ai0/aii w h e r e r i m a y be v i e w e d as a scaling f a c t o r ( w h i c h is equal to I - k 2 in the s t a n d a r d f o r m ) , r e c u r sion (3) r e d u c e s to the ( b a c k w a r d ) L e v i n s o n r e l a t i o n : i

z

i

'

F.Cz} = 1 I

"

Gi(z) = k i

=

-

PiC0)/Pi(O)

r z P (z) = P.(z} + k . P . ( z ) (S) i i-1 I I ; in which k i is the s o - c a l l e d i - t h r e f l e c t i o n c o e f f i c i e n t . As is k n o w n Pi(z) is Schur i f and o n l y i f P i _ l ( z ] is Schur and I k i I < 1 . A g e o m e t r i c a l insight i n t o this a l g o r i t h m can be g a i n e d by c o n s i d e r i n g t h a t the zeros o f both sides o f iS) b e l o n g t o the r o o t locus f o r the e q u a t i o n : P.(z) + c P.(z) = 0 1

(6)

i

f o r c = k i . This locus is s y m m e t r i c w i t h r e s p e c t t o the unit c i r c u m f e r e n c e and its " i " b r a n c h e s [which include the real axis) i n t e r s e c t this c i r c l e f o r ] c I = I . C l e a r l y , f i x i n g a r o o t o f (6) o,1 this locus c o r r e s p o n d s to assigning the v a l u e o f c and, thus, l o c a t i n g all o t h e r r o o t s too. The l e f t - h a n d side o f (5) has a zero at the o r i g i n and, t h e r e f o r e , the z e r o s o f Pi_l(ZJ lie on the o t h e r " i - 1 " b r a n c h e s o f the locus f o r the same v a l u e o f p a r a m e t e r c, i.e., ki, as the o r i g i n . The a l g o r i t h m m a y be e x t e n d e d t o the m o r e g e n e r a l case [4"] o f R.(z) : r .

(z-q),

I

q real,

] q I < I .

(7)

I

C o n s e q u e n t l y , the zeros o f Pi_1(z) (which c o r r e s p o n d t o the same v a l u e o f c as q) lie inside the unit c i r c l e i f Pi(z) is a Schur p o l y n o m i a l . The s i t u a t i o n is d e p i c t e d in Fig. I f o r a Schur p o l y n o m ial Pi(z) w i t h i = L I . If, instead, R.(z) = r. ( z + l } 2 , I

where

i

F.(z) = l+h. + (1-h.) z , I

I

a g a i n r. is a s c a l i n g f a c t o r and !

I

G . ( z ) = ( - I ) i h. ( z - l ) I

I

(8)

220 Im z

/

/

\

!

t

\

i

)o

9 o"~o

O

Sk

Ak

9

O

O

If, instead, only s y m m e t r i c or only a n t i s y m m e t r i c parts a r e o f interest, the same split relation (either that corresponding to column 1 or that corresponding to column 2, respectively) will be used in all steps ( s t r i c t l y recursive split procedures). On the other hand, it may be shown that the relations corresponding to columns 3 and 4 (as well as those corresponding to columns 1 and 2, to columns 5 and 6, and to columns 7 and 8) can be given a common structure so that one might consider as being s t r i c t l y recursive also the procedure involving s y m m e t r i c and a n t i s y m m e t r i c parts alternately. Observe that the split algorithms must be p r o p e r l y i n i t i a l i z e d by computing the s y m m e t r i c or a n t i s y m m e t r i c part of Pn-I from the given original polynomial Pn" To this purpose, use can be made o f equations (22a) and (22b). Also, if the complete polynomial Pr' r < n , is to be recovered, as is the case in model reduction, both Sr and A r are to be computed from the same pair of parts of Pr+l and Pr+2 " These considerations must be taken into account when evaluating the computational c o m p l e x i t y of the algorithm. For b r e v i t y , we shall limit attention to the split relations corresponding to columns 1 and3,respectively. They turn out to be of the following form: Li Si-2 + Mi Si-1 + N.J S.z = 0

(23)

L'., Si-2 + M'.n Ai-1 + N'.z Si = 0

(2q)

and

where tile combination polynomials L., M., N. and L'., M'., N~ are as specified in the Appendix. I

I

I

I

I

I

Clearly, the computational complexity, i.e., the number of a r i t h m e t i c a l operations (additions/ subtractions and multiplications/divisions) necessary to compute every coefficient of Si_2, d i r e c t ly corresponds to the number of addenda in the combination polynomials and to the number of their coefficients d i f f e r e n t from + 1, as will be shown with reference to some specific case. The simplest forms of equation (23) are therefore obtained (possibly, after the e l i m i n a t i o n of factors common to L i, M i and N i) when Ni(z) = n i = const., and Li(z) and Mi(z) are (symmetric) polynomials of degree not greater than 2 and 1, r e s p e c t i v e l y . In particular, L i may also be o f degree 1; in this case Li(z) = I i z (which may be regarded as a degenerate second-degree polynomial) and Mi(z) = m i ( z + l ) . Indeed, this is the case for the split Levinson algorithm and for the B i s t r i t z algorithm presented in ]'8~ . Specifically, by r e c a l l i n g (5), the split Levinson algorithm takes the form: r i ri_ 1 z Si_ 2 - r i (z+l) Si_ 1 + n.z S.s = 0

(25)

with n i = (l-ki_1)(1+k i} and, by recalling (17) and the fact that R i = Ria i f R i is considered as a (degenerate) third-degree polynomial , the B i s t r i t z a l g o r i t h m is:

225 1

1

r i r i _ I z Si_ 2 -

~ r i fi (z+1)Si_ 1 +S. = 0 . I

(26)

Of course, one can exploit the scaling factors to further s i m p l i f y the computations, e.g., by f o r c ing to + I two of the coefficents I i, m i and n i. In particular, it is possible to set I i = I and m i = -I in (25) (i.e., r k = I, V k ), thus obtaining the simplest version of the split Levinson algorithm } 15~; the simplest version of (26) is obtained by choosing I i = n i = I (i.e., r k = 2, V k ). The number of a r i t h m e t i c a l operations that are needed to compute a c o e f f i c i e n t of Si_ 2 from those of Si_ I and Si is the same in the two cases: 2 additions and I m u l t i p l i c a t i o n , in g e n e r a l . In fact, referring for instance to the simplest split version of (26) (Bistritz algorithm) and denoting by bh, k = bh,h_ k the c o e f f i c i e n t of z k and z h-k in Sh(Z), h = i-2, i - I , i , we have: bi_2, 0 = fi.(bi_1,0 + bi_1, I) bi_2, k = fi.(bi_1,k + bi_1,k+ I) - bi,k+ I ,

(27a) 0 < k < i-2 .

(27b)

Notice that, although the computational c o m p l e x i t y of the simplest versions of (25) and (26) is the same, they do correspond to d i f f e r e n t t w o - t e r m recursions and, thus, to different sequences of complete polynomials Pk ' k < n, starting from the same original polynomial Pn" In other words they d i f f e r in the i n i t i a l i z a t i o n rule. From the previous considerations, it is also apparent that the s i m p l i c i t y o f a specific split form is not s t r i c t l y related to the s i m p l i c i t y of the corresponding non-split algorithm. The main feature of the z-domain Routh-type a l g o r i t h m (8) is to retain, except for the factor (z+l), either the symmetric part or the a n t i s y m m e t r i c part, in the same way as in the s-domain version either the even or the odd part is retained. It follows that equation (23) and the corresponding equation relating the a n t i s y m m e t r i c parts only, are a restatement of this simple rule. Equation (24) becomes instead r i ri_ I (z+1) 2 Si_ 2 + r i { hi_ I ~I - (-I) i'] + h.i ~1 + (-1)i~}(z-1) Ai_ I - S.j = 0

(28)

I

in which L i is a non-degenerate polynomial o f degree 2 . To construct the entire sequence, one also need the companion relation corresponding to column 4 of Tab. I, which allows one to obtain Ai_ 2 from Si_ I and A i. Obviously, the simplest form of (29,1 and of its companion relation is obtained for r k = I, V k . As a consequence of the bilinear transformation, starting from a Schur polynomial Pn' the zeros of polynomials Sk and (z+1)Ak_ I , V k , a l t e r n a t e along the unit circle, like the zeros of Sk and A k. It could be shown that a similar property also holds for the s y m m e t r i c and a n t i s y m m e t r i c parts of t w o consecutive polynomials generated according to (2S) and (26). ~. CONCLUSIONS A common form has been considered for the t w o - t e r m recurrence relations corresponding to: ( i ) the Levinson (Nevantinna-Pick) algorithm, ( ii ~ the z-domain algorithm obtained from the classical Routh algorithm via the usual bilinear transformation, ( i i i ) t h e B i s t r i t z algorithm,and ( iv ) other Euclid-type s t a b i l i t y - t e s t algorithms, A geometrical interpretation of their properties has been given. An analysis o f the eight possib|e t h r e e - t e r m (spiitJ forms associated with the general t w o t e r m form, has been carried out, Particular a t t e n t i o n has been paid to the relation involving the s y m m e t r i c (antisymmetric) parts only and to the relations involving the s y m m e t r i c and a n t i s y m m e t r i c parts, alternately. The conditions leading to the c o m p u t a t i o n a l l y most e f f i c i e n t formulae have been p o i n t e d out and discussed,

226 APPENDIX With reference to the notation adopted in (22), the combination polynomials in (23) and (24) take the forms: Li = F(Fis- Gis) R i a - ( F i a - G i a )

Ris] Ri-I Ri-1

(At)

Mi = E(Fi-l,s - Gi-l,s) Ri-l,a - {Fi-l,a - Gi-l,a) R i - l , s ] (Fi ~'i - Gi (~i) Ni = ~(Fi-1 ,a + Gi-l,a) R i - l , a - ( F i - l , s

(A2)

+ Gi-l,s) Ri-l,s'][-(Fis-Gis) R i a - ( F . , a - G.la } R.zs] +

+ E(F.. . ) R l.-.l , a - { F .l -. l , a - G .l .- l , a ) R .l--l,S . ] ~Fia-Gia)Ria-(Fis-Gis)Ris] I--I,S - G .I--I,S

(A3)

and L'., = [(F.la - G.,a) R.la - {Fis - Gis) Ris] Ri_ I Ri_ I-

(A4}

M'i = [(Fi_1, s + Gi-l,s) Ri-l,s - (F i-l,a + Gi-1, a } R i - l , a ] (Fi F:i - G i (]i)

(AS}

N'i = [{F.,s - G.,s) R.,s - (F.,a-G ia)Ria] [{F i-I ,a- G i- I ,a) R i - l , s - {Fi-i ,s- G i-1 ,s) Ri-l,a -] + +[{Fi i -

,a

+G

i-l,a

)R

i-l,a

-(F

i-l,s

+G

i-I,s

)R

. ] [ ( F . - G )R. - { F . - G . ) R . ] . i-~,s is is la Ja la ts

(A6)

REFERENCES

['1 ] T. Kailath, "A view of three decades of linear filtering theory", IEEE Trans. Inf. Theory, vol. IT-20, No. 2, pp. lZ16-181, 1974. [-2] J. Makhoul, "Linear prediction: A tutorial review", Proc. IEEE, vol. 63, No. 4, pp. 561-580, 1975. ~3] E. Jury, Theory 1964.

and Application

of the z-Transform Method. J. Wiley & Sons., New York,

F4] P. Delsarte, Y. Genin and Y. Kamp, "On the role of the Nevanlinna-Pick problem in circuit and system theory", Int. J. Circuit Theory Appl., vol. 9, pp. 177-187, 1981. [5]

K. ~striSm, Introduction to Stochastic Control Theory. Academic Press, New York, 1970.

[6]

Y. Bistritz, "Direct bilinear Routh stability criteria for discrete systems", Systems Lett., vol. 4, No. 5, pp. 265-271, 1984.

Control

]-7] Y. Bistritz, "A new unit circle stability criterion", in Mathematical Theory of Networks and Systems ( Proc. MTNS-83 Int. Symp., Beer-Sheva, Israel, June 20-24, 1983 ), pp. 69-87. [8]

Y. Bistritz, "Zero location with respect to the unit circle of discrete-time linear system polynomials", Proc. IEEE, vol. 72, No. 9, pp. 1131-1142, 1984.

Eg] A. Lepschy, G.A. Mian and U. Viaro, "Euclid-type algorithm and its applications", Int. J. Systems Sci., vol. 20, No. 6, pp. 945-956, 1989. [-10] M. Benidir and B. Picinbono, "Comparison of some stability criteria of discrete-time filters", IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-36, No. 7, pp. 993-1001, 1988. [11] P. VaidyanathanandS. M i t r a , " A unified structural interpretation of some well-known stability-test procedures for linear systems", Proc. IEEE, vol. 75, No. 4, pp. 478-496, 1987. ['12] W. Krajewski, A. Lepschy, G.A. Mian and U. Viaro,"A unifying frame for stability-test algorithms for continuous-time systems", IEEE Trans. Circuits Syst., vol. CAS-37, No. 2, 1990.

227 E13] A. Lepschy, G.A. Mian and U. V i a r o , " A s t a b i l i t y test for continuous systems", Systems Control Lett., vol. 10, No. 3, pp. 175-179, 1988. ~14] P. Delsarte and Y. Genin, "The split Levinson algorithm", IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, No. 3, pp. 470-477, 1986, [15]

P. Delsarte and Yo Genin,"On the splitting of classical algorithms in linear prediction theory", IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-35, No~ 5, pp. 645653, 1987.

[16]

Y. Bistritz, H. Lev-Ari andT. K a i l a t h , " I m m i t a n c e - d o m a i n Levinson algorithms", Proc. IEEE ICASSP-86, Tokyo, pp. 253-256, 1986.

[17]

Y. B i s t r i t z , " C o m m e n t on 'Zero location with respect to the unit circle of discrete-time linear system polynomials' '% Proc. IEEE, vol. 74, No. 12, pp. 1802-1803, 1986.

[18]

A. Lepschy, G.A~ Mian and U. V i a r o , " A common setting for some classical z-domain algorithms in linear system theory", Int. J. Systems Sci., vol. 21, ~990.

TIM E - D I S C R E T I Z A T I O N OF THE ZAKAI EQUATION FOR DIFFUSION PROCESSES OBSERVED IN CORRELATED NOISE* Patrick FLORCHINGER Universitd de Metz D~parternent de Math~matiques Ile du Saulcy F-57045 METZ C5dex

Franqois LE GLAND INRIA Sophia-Antipolis Route des Lucioles F-06565 VALBONNE C~dex

Abstract A time dlscretization scheme is provided for the Zakai equation, a stochastic PDE which gives the conditional law of a diffusion process observed in white-noise. The case where the observation noise and the state noise are correlated, is considered. The numerical scheme is based on a Trotter-like product formula, which exhibits prediction and correction steps, and for which an error estimate of order ~f is proved, where 8 is the time discretization step. The correction step is associated with a degenerate second-order stochastic PDE, for which a representation result is available in terms of stochastic characteristics [8,9,11]. A discretization scheme is then provided to approximate these stochastic characteristics. Under an additional assumption on the correlation coefficient, an error estimate of order v ~ is proved for the overall numerical scheme. This has been proved to be the best possible error estimate by ElllottGlowinski [5].

1

Introduction

T h e purpose of this paper is to present a computable time discretization scheme for the Zakai equation of nonlinear filtering with correlated noises, and to provide an estimate of the rate of convergence. In the case of independent noises, the problem has been studied by Newton [13], KorczllogluMazziotto [6], B e n n a t o n [1], D i M a s i - P r a t e l l i - R u n g g a l d i e r [4], e i c a r d [14], Bensoussan-GlowinskiRascanu [2] and LeGland [12]. Some of these authors have actually considered the associated Zakai equation. T i m e discretization schemes have been provided with a rate of convergence of order 6, where 6 is the time discretization step. In the case of correlated noises, the problem has been studied by Elliott-Glowinski [5]. T h e bcst approximation of the continuous filter based on the values of the observation process at a regular

9Rcseareh partially supported by USACCE under Contract DAJA45-87-M-0296.

229 partition (with mesh 5) hm been considered, and it has been proved that the rate of convergence is of order V"~. However, no algorithm is provided to actually compute this approximation. The paper is organized as follows. In Section 2, the nonlinear filtering problem is presented. Some results are recalled in Section 3, on the Zakai equation, and on a related degenerate secondorder stochastic PDE. A Trotter-like product formula is then considered, with aa error estimate of order 5. However, this numerical scheme is not computable. In Section 4, a representation result in terms of stochastic characteristics is presented for the degenerate second-order stochastic PDE. This part follows mainly the work of Krylov-Rozovskii [7,8] - see also Kunita [11]. Under an additional assumption on the correlation coefficient, a time discretization scheme is presented in Section 5 based on an approximation of the stochastic characteristics - with an error estimate of order v/~. In addition, this numerical scheme is actually computable, as far as time discretization is concerned.

2

The

filtering

problem

On the probability space (~, F , P), consider the stochastic differential system dXt = b( X,) dt + a( X,) dW, + p(Xt) dV, dY~ = h(X,) dt + dVt

where {IV,, t _> 0} and {Vt, t > 0} are independent Wiener processes, with covariance matrix I (identity) and r respectively. For the clarity of exposition, it is assumed throughout the paper that r=I. Throughout the paper, the coefficients will satisfy the following hypotheses (i) b is a measurable and bounded function from R"* to Iq.'~, (ii} cr is a continuous and bounded[unction on R"*, with a ~=a n ' , such that 9 a is a uniformly elliptic m x m matriz, i.e. a(x) > cd, Oa~,~ 9 ~ ~= j=a ~ is a measurable and bounded function on R"*, (ii O p is a continuous and bounded[unction from I t " to the space of m x d matrices, with A C = pp', such that

i, ~ Op'k

9 ak =

is a measurable and bounded [unction on R'*,

9 ~ =~j=l ~Od,i

is a measurable and bounded function on R'*,

(in) h is a measurable and bounded function from It. '~ to R d.

With the diffusion signal {Xt, t _ O} are associated the two partial differential operators L =

' g i,j=l

L o A=1 ~

i,j=l

+ c"q

02 OxiOxi

+

.. i=l

a '." ' J ~ + ~ b -i O. OxiOxj i=a Oxl

Ozi '

230 An other family of partial differential operators to be considered is m

9 0

- ' B~ =z~ h~ + ~ P'~-Oxl

l 0 the original probability measure P is equivalent on [0, T] to the reference probability measure p t with Radon-Nikodym derivative ZT, so that under p t

dX,

= [b(X,) -

p(X,)h(X,)] dt + ~(X,) dW~ + a(X,) dY~

(2.1)

where {Wt, t >__ 0} and {Yt, t _> 0} are independent Wiener processes, with covariance matrix I (identity). By the Bayes formula E t ( f ( X , ) Z, l Y,)

E(f(S,)[y~)

=

In addition

Et(Z, [Yd f

E t ( f ( X ~ ) Z , l Y,) = J f ( z ) p t ( x ) dx ,

where the unnormalized conditional density {pt, t > O} satisfies the Zakai equation

d p =$ L * p~dt

+B~p~dYt k

The purpose of the next section is to recall existence, uniqueness and regularity results for this equation and related stochastic partial differential equations.

3

T h e Zakai e q u a t i o n , a n d r e l a t e d s t o c h a s t i c P D E ' s

For all n_> 0, let H " denote the usual Sobolev space, with norm [[. [[,. The following shorthand notations will be used throughout the paper: I" 1 -~ t[" ][0 and ![" I[ ~ I]" Ill 9 Consider the Zakai equation dpt = L*p~ dt + B~p, dY~~ .

(3.1)

The following existence, uniqueness and regularity result is proved in Krylov-Rozovskii [7] and Pardoux [15]. T h e o r e m 3.1 Let n>_O be fixed. Assume that the coefficients satisfy 9 a, c and p have bounded derivatives up to order (n+l), 9 b and h have bounded derivatives up to order n, and that the initial condition satisfies po E H". Then the equation (3.1) has a unique solution p E M2(O, T ; H"+I). In addition

9 pC L2(~; C([O,T]; H")),

231 . the following estimate holds Et[ sup [Iptll~] -< cllpoll~ 9 o 0}, using some classical approximation scheme. For instance, using the backward Euler scheme would result in the following global approximation scheme --t i

with same error estimate.

6

Conclusion

A time-discretizatiou scheme of the Zakai equation for diffusion processes observed in correlated noise has been proposed, based on the stochastic characteristics introduced in [8] and [11]. Under an additional assumption on the correlation coefficient, it has been shown that the rate of convergence of this approximation is of order v~, where 6 is the time discretization step. The same rate of convergence has been obtained in EUiott-Glowinski [5] for a different approximation 9 on one hand, the approximation considered in [5] has a probabilistic interpretation, which is not the case for the time discretization scheme presented here, 9 on the other hand, the latter is actually computable, whereas no numerical algorithm is provide to compute the approximation considered in [5]. Another point of interest would be to study some particle approximation for the degenerate second-order stochastic PDE, adapting the results presented in Raviart [16] for deterministic firstorder PDE's.

237

References [1] J.F. BENNATON, Discrete time Galerkin approximations to the nonlinear filtering solution, J.Math.Anal.Appl. 110 (2) 364-383 (1985). [2] A. BENSOUSSAN, R. GLOWINSKI and A. RASCANU, Approximation of Zakai equation by the splitting-up method, in: Stochastic Systems and Optimization (Warsaw-1988), (ed. J.Zabczyk) 257-265, Springer-Verlag (LNCIS-136) (1989). [3] Yu.N. BLAGOVESCHENSKII and M.I. FREIDLIN, Certain properties of diffusion processes depending on a parameter, Soviet Math. 2 633-636 (1961). [4] G.B. DI MASI, M. PRATELLI and W.J. RUNGGALDIER, An approximation for the nonlinear filtering problem with error bound, Stochastics 14 (4) 247-271 (1985). [5] R.J. ELLIOTT and R. GLOWINSKI, Approximations to solutions of the Zak~i filtering equation, Stoch.Anal.Appt. 7 (2) 145-168 (1988). [6] H. KOREZLIOGLU and G. MAZZIOTTO, Approximations of the nonlinear filter by periodic sampling and quantization, in: Analysis and Optimization of Systems, Part 1 (Nice-198.~), (eds. A.Bensoussan and J.L.Lions) 553-567, Springer-Verlag (LNCIS-62) (1984). [7] N.V. KRYLOV and B.L. ROZOVSKII, On the Cauchy problem for linear stochastic partial differential equations, Math. USSR Izvestija 11 (6) 1267-1284 (1977). [8] N.V. KRYLOV and B.L. KOZOVSKII, Characteristics of degenerating second-order parabolic Ito equations, J.Soviet Math. 32 (4) 336-348 (1982). [9] H. KUNITA, Stochastic partial differential equations connected with nonlinear filtering, in: Nonlinear Filtering and Stochastic Control (Cortona-1981) (eds. S.K.Mitter and A.Moro) 100169, Springer-Verlag (LNM-972) (1982). [10] H. KUNITA, Stochastic differential equations and stochastic flows of diffeomorphisms, in: Ecole d'Ete de Probabilites de Saint-Flour XII (1982) (ed. P.L.Hennequin) 144-303, Springer-Verlag (LNM-1097) (1984). [11] H. KUNITA, First order partial differential equations, in: Stochastic Analysis (Katata and Kyoto-1982) (ed. K.Ito) 249-269, North-Holland (1~984). [12] F. LE GLAND, High order time discretization of nonlinear filtering equations, in: #Sth IEEE COG (Tampa-1989) 2601-2606 (1989). [13] N.J. NEWTON, Discrete approximations for Markov-chain filters, Ph.D Thesis, Imperial College (1983). [14] J. PICARD, Approximation of nonlinear filtering problems and order of convergence, in: Filtering and Control of Random Processes (ENST/CNET-1983), (eds. H.Korezlioglu, G.Mazziotto and J.Szpirglas) 219-236, Springer-Verlag (LNCIS-61) (1984). [15] E. PARDOUX, Stochastic partial differential equations and filtering of diffusion processes, Stochastics 3 (2) 127-167 (1979). [16] P.A. RAVIART, An analysis of particle methods, in: Numerical Methods in Fluid Dynamics (Como-1983) (ed. F.Brezzi) 243-324, Springer-Verlag (LNM-1127) (1985).

F I N I T E - D I M E N S I O N A L S T O C H A S T I C F I L T E R I N G IN D I S C R E T E T I M E : TIIE ROLE OF CONVOLUTION SEMIGROUPS Fabio Spizzichino Dept. of Mathematics, University " La Sapienza " Piazzale A. Moro, 5 - 00185, R o m e

Summary.

We consider a stochastic dynamic system in discrete time with an observable

component Yn C ~ C ~Rw (observation), and a non-observable component X n E ~ C ~}~v (signal). The signal is a Markov chain and, at any step, the conditional density of the observation given the signal is fixed in the exponential class. We face the problem of finding transition kernels for X n for which the system admits filters of dimension v; we reduce this problem

to an analytical procedure in which the first step can be solved in terms of

convolution semigroups of probability distributions.

1. Introduction W e consider a stationary discrete-time stochastic process ({Xn}, {Yn})n=l,2,...(Xn E :~ C ~v, Yn e qJ C ~w) whose dynamics is described by the following properties:

{Xn} is a Markov chain with given

transition densities pn(x [ x'), Yn is conditionally independent on X (n-l) = (Xl, X2, ... , Xn-1) and y(n-1) _~ ( y t , y 2 ..... Yn-I). given X n (n = 2 , 3, ...), the conditional densities fn(Y 1 x) ----fyn(y [ X (n-l) = x(n-1);

y(n-l)=

y(n-1); Xn = x)

are given, n = 1, 2 , . . . . Let Pl be the probability density function of X1, pn(x [ y(n-1)) be the conditional density of X n given y(n-z) = Y (n-l) conditional density of X n given y(n) = y(n)

(prediction density),

(filtering density).

7rn(x ] y(n)) be the

The sequences of filtering

densities and prediction densities obey the "Dynamical Bayes Formula": 7rl(x ]Yl) oc Pl(X) 9 f l ( y l [ x),

...,

zrn( x [ y(n))

c< pn(X [ y(n-1)) . fn(Yn I x )

(i.i) pn(X [ y (n-l)) = J

Zr(n_l)(x' [ y (n-l)) 9 pn_l(x] x') dx'

the symbol cx indicates that two functions differ for a multiplicative quantity which does not depend on tile a r g u m e n t x E ~'.

239 In this paper we consider the time-homogeneous case in which pn(x t x') and fn(Y }x) do not depend on n. One is usually interested in finding out conditions on the triple Pl, p(x [ x'), f(y [ x), under which the recursive formula (1.1) actually allows for feasible computation of rrn and Pn. For this problem the following definition is of interest. Let cO. be an index set and B (~ be a ~r-algebra of subsets of ql.

Definition A family ~2 ---- {~u(x); x E E) of probability density functions on :s indexed by uE~

filter-conjugate

is

if there exists a measurable function T : ~ x ctJ --* % such that

[[- ~u(X') 9 p(x [ x') d x ' ] - f(y [ x) c< ~o(u.y)(x)

V u E ~

V y E ctJ.

(1.2)

E As it can be immediately checked by a direct application of (1.1): ( p l ( x ) = f ~uo(X') . p ( x ] x ' ) dx' , E imply

~Uo E ~ ,

~ filter-conjugate)

~rn(x [ y(n)) = ~Un(Uo;y(n))(x ) e ~, '~ y(n) e cLjn where u n : ~ x ~n .., cU. are recursivelly defined by ut(uo; Y,) = ~~

The ~

Yl), u2(uo; y(2)) = ~(ul ( uo, y,), y~), ...

(1.3)

stochastic process ul(u0,Y1) , u2(u0,Y1,Y2), ... is the "filter-process" and it

results to be a Markov chain as can be easily checked. The definition of a filter-conjugate family is the suitable translation of the concept of "conjugate family" in bayesian Statistics (see e.g. [2]) to the present case, where we must replace the usual Bayes formula with the dynamic Bayes formuta (1.1); see also the discussion presented in [6]. Filtering theory motivates the search for a filter-conjugate family with the following properties:

a) dim ~ < co,

b) There exists an algorithm for obtaining ~ and (u from the

knowledge of the elements E, ~, f(ylx), p(x[x'). W e consider here an exponential family of "observation densities" {f(y [ X)}xE t

(see

Sect. 2). Our aim is to build "compatible" transition kernels p(x ] x') which allow for the existence of filter-conjugate families with dim ~ = dim E = v and, in particular to show the role that convolutions semigroups of probability distributions can have in the problem. The results we obtained can be extended to the case in which p(x [ x') and f ( y [ x) depend 011 n. It has been preferred to treat the time-homogeneous case so as not to obscure the substancial aspects of the problems.

240 2. Sufficient conditions for the existence of a v-dimensional filter-conjugate family. W e assume that f (y ] x) is of the form f(Ylx) = a(x) 9 b(y) 9 exp { < x , d ( y ) > } x 6 ~" c ~v, y E ~ c ~v

(2.1)

with suitable functions a : t ~ RA', b : ~ -~ N4-, d : ~ -~ Nv and < 9 , 9 >

denoting the scalar

product. As shown in [1], [4], [7], under different kinds of conditions, the exponentiality of the family {f(y [ x), x 6 ~} is a necessary condition for the existence of a filter-conjugate family with % contained in a Euclidean space. Consider a quadruple ( ~ , %c~V,K:~xt~

K, c, p) with

+ U {0},c:~

and satifying the following conditions: I

e x p { } 9 K(x;x') dx' cr e x p { < c ( u ) , x > } c(u) + d ( y )

6 ~

e(x') = ~ 9

V u ~ ~,

V x 6 t , V u 6 91.

Vy 6 ~

(2.2) (2.3)

I p(x) ~ 9 K (x;x') dx

(2.4)

t

V x' 6 :E and for some constant 7 > 0. I

p(x') e x p { } dx' < ~o

u u 6 %

(2.5)

The elements t , y, a, d are those appearing in the representation (2.1). Remark

If K(x;x') satisfies (2.2) and xl, x 2 6 X are such t h a t xl-Fx 2 q t , it follows

V u 6 ~

J'~ e x p { } 9 K(x I -4-x2 ; x ' ) d x ' =

= r

9 [ ; ~ e x p { } 9 K(xl;x' ) dx'] 9 [ ; ~ e x p { } - K(x2';x' ) dx]

where r

is a positive quantity depending on u, ~, c but non depending on xl,x 2.

V x, x' 6 t ,

define

p(x Ix')

7.

-

)

X(x; x,)

By the condition (2.4), it results

(2.6)

f t p(x I x ' ) dx = 1

V x' 6 it~ and so {p(x I x')}xE % can

be seen as a stationary transition kernel for the signal process {Xn}. Define, moreover, p(x) 9 e x p { } ~~215 -- I ~ p(0) 9 e x p { }

de

V x e ~,

~ -- {~.(x); x e ~ } , , ~

(2.7)

241 The

following result

extends

and

summarizes

arguments

contained

in the

paper

by Bather [1] and it is the starting point for the subsequent discussion.

2.1 T h e o r e m ~P is filter-conjugate with respect to the pair {f(y ] x)}, {p(x I x')}. Proof Our task is simply to check the validity of Eq. (1.2). Fix u E qJ., y E o~. By taking into account (2.1), (2.6) and (2.7), we have [J'a~ ~u(x') 9 p(x]x') d x ' ] 9 f(y I X) r [.rE p ( x , ) , e x p { } 9 a(•

g ( x ; x') dx'] a ( x ) . b ( y ) . e x p { < d ( y ) , x > } or

i r e e x p { } - K(x; x')dx'] p(x) 9 e x p { < d ( y ) , x > } cr p(x). e x p { < c ( u ) + d ( y ) , x > } c< r

~o(u,y) = c(u) + d(y) E ~

by the condition (2.3) and then ~o(u,y)(X) E 9.

[] For

a given f(y[x)

of the

form

(2.1),

the

problem

of finding

a filter-conjugate

family with dim ~ = v can then be solved by the following procedure: -

find ~ c Nu, c : q.t --* N v, K : E x ~: -* R + U { 0 } , satisfying the conditions (2.2) and (2.3) find p : t --* R-{- satisfying the conditions (2.4) and (2.5)

-

-

obtain the transition kernel p(xlx') by means of (2.6) and the conjugate family 9 by means of (2.7). In the terminology introduced by Bather, p(x) is an

invariant function and c(u) is a

connecting function. From a fixed K(x ; x') satisfying (2.2) we can obtain respectively one or infinitely many transition kernels p(x I x') depending on the sum of multiplicities of positive eigenvalues with positive eigenfunctions for the integral operator in (2.4) being one or greater than one. The procedure described just above will be illustrated in detail in the example shown in the last Section. Tile first step of the procedure asks for finding qJ., c, K satisfying the conditions (2.2) and (2.3). The form of the solutions for the Eq. (2.2), was already studied in detail by Bather [1] in the case with v = l , c linear. In terms of the symbols introduced here, one of his results can be stated as follows: if K(x;x') solves Eq. (2.2) with ~, ~

R open intervals and with c(u) = r.u

+ k (r > 0, k E ~), then it must result K(x;x') = g(x'-r-x)-exp{k.x} for some function g: ~

-*

~+ u

{o}.

W e are rather interested in obtaining sufficient conditions for (~

to satisfy Eq. (2.2).

This will be made in general in the Sect. 3; now we concentrate attention on the particular case c ( u ) = r. n (r e ~ ) .

242

For functions g, g: Nv _, R + U { 0 } , we shall use the notation Zg _= {x 9 Rv[ g ( x ) > 0 } . If Z1, Z 2 are two subsets of Nv, the symbol Z I + Z 2 denotes the set {x 9 ~v[ x

=

z 1--I- z2, zl 9 Z 1,z 2 9 Z2}

r . Z - - {z' 9 ~ V l z ' = r . z , z 9

and, for r 9 ~ , x 9 ~v, Z C ~v,

ZWx-

{z' 9 ~Viz'=z-J-x,z

9 Z}

2.2 L e m m a Fixr > 0andg:

~v._,~+U

{0} such t h a t

Zg + r . ~ c

~ ' a n d put

c (u) = r.u, %tO {u 9 Nv I J'Nv g(z) 9 e x p { l d z < + c~}, K(x;x') ~ g(x'- r.x) The triple (c,~

(2.8)

K) satisfies the condition (2.2).

Proof J ' t e x p { } 9 K(x;x') dx' = J'~. g(x'- r.x) 9 e x p { } dx'. For any fixed x 9 i , g(x'- r 9 x) is positive if and only if x' 9 Z g + r 9 x and we can write J'~ e x p { } 9 K(x;x') dx' =

J'tn(zg

+ r.x)

exp{ } 9 g(z) dz (2.9) The hypotesis Z o + r-x c :F, V x 9 ~:, implies that the set {[IM(Zg + r.x)] - r-x} does coincide with Zg, V x 9 ~; and so (2.8) entails J'~ e •

9 K(x;x') dx' = e x p { < r - u , x > } . J ' Z 9 e x p { } - g(z) d z .

This completes the proof since the integral in the right hand side does not dcpcn on • [] The arguments contained in this Section can be resumed by the following s t a t e m e n t s 2.3 T h e o r e m Let the family of observation densities f(y[x) be of the form (2.1) and g (g: ~v_.. ~ + U { 0 } ) be such that Z g + r .~" c ~ for some r > 0. Suppose that for some 7 > 0 we can find a non null

p(x)

solution p(x) _> 0 for the equation p(x') = 7 9 J'~" ~

9 g(x'- r.x) dx

Suppose moreover we can find a (non empty) set % c ~.v such that r {u 9 ~v I J ' t p(x') 9 e x p ( } dx' < ~ , d ( ~ ) -I- r. ~ c ~

J'Zg g ( x ' ) . e x p { } d x ' < co} (2.10)

p(x)

F o r p ( x Ix')----- 7 " atx~./x,~.g(x'-~ J ~ ) r.x), there exists a f i l t e r - c o n j u g a t e f a m i l y g i v e n by

243 9~ = {~u(X); x E ~'}uEOd

with

p(x)-exp(} ~Ep(9).exp{}d 0 , ~o(u,y) = r.u + d(y).

~u(X) = Corollary

Consider t h e a s s u m p t i o n s of T h m . 2.3; suppose m o r e o v e r t h a t t h e "initial" density of X 1 is given by p l ( x ) cx p(x) . e x p { } ' r

I YI . . . . . Yn) or p(x) 9 e•

pn(X

l Yl . . . .

with

u1 e ~

x E E, t h e n

{ )

Un =

rn-Z" Ul +

n-i rn_h E1 h " d(Yh)

3. C o n v o l u t i o n semigroups and solutions of Eq. (2.2) In this Section we w a n t to o b t a i n triples ( c , % , K ) satisfying t h e condition (2.2) with c : ~v _, ~ v of more general form. First we shall briefly recall, some well-known definitions and results (see e.g. [3]) a b o u t infinitely divisible d i s t r i b u t i o n s whose s u p p o r t is c o n t a i n e d in [0,+oo). This t h e o r y has indeed a f u n d a m e n t a l role in o b t a i n i n g our results (as it m a y be already clear by t a k i n g into a c c o u n t the R e m a r k presented in t h e Sect.2). Let M be a probability d i s t r i b u t i o n on t h e real line; M is infinitely divisible if for all n E N t h e r e exists a probability d i s t r i b u t i o n M n such t h a t M = Ma *...* Mn = (Mn) n* where * d e n o t e s t h e convolution operation. A d i s t r i b u t i o n M r is infinitely divisible if and only if it is the

probability

distribution

of X s + r - X s

(s>0)

for a

time-homogeneous

i n d e p e n d e n t i n c r e m e n t s {Xt}. T h e family of p r o b a b i l i t y d i s t r i b u t i o n s { M r } r > 0

process

with

associated to a

fixed t i m e - h o m o g e n e o u s stochastic process with i n d e p e n d e n t i n c r e m e n t s forms a convolution semigroup: Mt~+t2 = Mr1. Mt2. For a p r o b a b i l i t y distribution M on [0,+oa) d e n o t e by ~M its Laplace t r a n f o r m :

A

~M(s) = I 0 real f u n c t i o n

exp{- s 9 z} dM(z) ~

on

[0,+oo)

is t h e

Laplace

transform

of a p r o b a b i l i t y

distribution.

M if a n d only if ~(0) = 1 and it is completely monotone, i.e. it possesses derivatives ~(n) of all orders a n d (-1) n. ~(n)(s) > 0, s > 0. M is infinitely divisible if a n d only if, for n = 2, 3, . . . , the positive n - t h root !0n=(~M) 1/a is the Laplace t r a n s f o r m of a p r o b a b i l i t y d i s t r i b u t i o n ; this happens r

if a n d only if ~M = e x p { - r

where r has a completely m o n o t o n e d e r i v a t i v e a n d

= 0. If { M r } is a convolution semigroup a n d ~M1(S) = e x p { - r ~ M r ( s ) = e• Given r

{- ~-. r

V ,- > 0

we shall henceforth d e n o t e by M r (r

then (3.1)

t h e d i s t r i b u t i o n with Laplace t r a n s f o r m

244 exp {- r. r

In order to follow the line of the present discussion, the following remark is

now of interest Remark Let M r (r

be an infinitely divisible distribution on [0,+co)

0. T h e Eq. (3.1) can be written I

exp{- s. x ' ) m r ( r

admitting a density m r ( r

v ;,

') dx' = exp{- r. r

0

i.e. the function K(x;x') = mx(x') satisfies (2.2) with t stress t h a t M r (r

-- [0,-t-co) and c(s) = - r

We

can be seen as the distribution, at time t = r, of a process with

independent increments {Xt (~)

We recall finally that an infinitely divisible M is (strictly) slable if, in particular, M ---(Mn)n*where, V n, Mn is of the same type of M: M(z) ---- Mn( c~ )

(3.2)

for constants c n > 0. The norming constants arc necessarily of the form c n ~- n 1 / a with 0 0

and define c(u) -- - r

-u ), % -

(-co, 0],

x' K(x ; x') -- J" q(x'- z) dM• 0

(3.4)

3.1 T h e o r e m T h e triple (c,~

K) defined in (3.4) satisfies the condition (2.2) with t = [0,+co).

Proof T h e proof is now straightforward. V x > 0, K(x;x') is the density of the convolution of Q and Mx (r then, by the convolution theorem for Laplace transforms~ for s > 0 +00 4-00 +oo ]" exp {-s 9 x'} 9 K(x ; x ' ) d x ' = J" exp{-s 9 x'} 9 q(x') d x ' - f exp{-s.x'} dMx(x') o

= ~Q(s)

0

9 exp{-•

- r

r

exp {-x 9 r

0

= exp {x 9 c(-s)}.

[]

245

We remark that if r e.g. [3] pg. 295):

= r.s

Mr(r

(r>O), then { M r (r

= l[r.r ' +oo)(Z)

r>0, r > 0

K(x;x') ----q(x'- r-x); so we reobtain, for the trivial case 9 -

is a "translation semlgroup" " (see and K ( x ; x ' )

in (3.4) reduces to

[0,+oo) Z9 c [0,+oo), the result of

Lemma 2. When Mx(~b)admits a density mx (r

K(x ; x') in (3.4) can be replaced by

Xf

K(x ; x ' ) ~ f

mx(r

'-z)

dQ(z)

(3.5)

0

which shows that, in this case, we can consider also non absolutely continous measures Q. If in particular r K(x;x') = ~

= - s ~ (0 < a < 1), then by (3.3) T h m . 3.1 applies with 9 J'X'mt(e) ( x'-____s ) dQ(z) 0

(3.6)

x 1/a

Remark. We see that when we build K ( x ; x ' ) elements: the convolution semigroup {M• (r {Mx (r

by means of (3.4), we must choose two

and the measure Q. The form of the semigroup

is directly determined by the choice of the "connecting function" c(u); the choice of Q

has consequences on the form as well on the existence of the " i n v a r i a n t function" p(x). The latter aspect is clarified by means of the example presented in the next Section.

Analogously to what we did in T h m . 2.3 we can now draw the following conclusions: Let f(y[ x) be of the form (2.1) with d(y) _< 0, t -= [ 0 , + o o ) and let M r (r_>0) be a convolution semigroup of probability distributions on [0,+oo). Suppose we can find a density q on [0,+oo) such that, for some 3' > 0, the equation -~oo

p(x') = 7 " ]'o

e(x)

X'

a(x) "[~0 q(x'- z)dMx(z)] dx

as a non null solution p(x)_>0. Suppose moreover we can find so_> 0 such that

;

--}-oo p(x') 9 e x p { < - s , x ' > } d x ' < oo, o

-[-oo J" q(x') . e x p { < - s , x ' > } d x ' < oo V s > so. o

p(x) x' With respect to the transition kernel p(x [ x') = 3' 9 a(xi - p ( x ' ) J'0 q(x'- z) dMx(z) there

exists a filter conjugate family given by 9 ~ {~u(X)}uEq.t with ~ = (-co, -So] and ~u(~x) cr p(x) 9 exp{u - x}, indeed ~ satisfies the Eq. (1.2) with the expression -{-r

~0(u,y) = log [ f

exp{u 9 x'} dMl(x')] + d(y). 0

246 If 2e = (.co, X] or :E = [~, +oo) for s o m e X e ~ we can apply Thin. 3.1 in order to obtain triples (c,qs

satisfying the condition (2.2), by merely considering a linear transfor-

m a t i o n of the signal process; when X -- {x 6 ~v[ x = (xl,...,Xv) , xj > ~j, j =

1,2,...,v}

for a

fixed ~ E ~v we m a y use the theory of processes with i n d e p e n d e n t i n c r e m e n t s with s t a t e space ~ + v . Indeed let M z (~j) (7->0, j = l , . . . , v ) be t h e (absolutely continous) probability distribution at time t = v of a process {Xt (r

with i n d e p e n d e n t i n c r e m e n t s with s t a t e space ~ + v and, for

x -- (xl,...,xv), d e n o t e by K(x;x') the j o i n t p r o b a b i l i t y density function of the distribution MxI(T1)* ... * Mxv(!PV)(x'). T h e n

~+v

e x p { - < s , x ' > } 9 K(x;x') dx' = l]~ S

~+v

1-Ij exp{- r In

analogy

exp{-<s,x'>}

dMxj(r

x/ ---- e x p { - < r with

(3.5)

further

solutions

of

the

Eq.

(2.2)

can

be

obtained

by

c o n v o l u t i o n with positive measures Q on ~ + v .

4. Discussion of a particular case T h e p r o c e d u r e described in Sect. 2. and the results of Sect. 3. are now illustrated

by

discussing a p a r t i c u l a r case. Let f(Y I x) = ~

9 x 6 9 y6-i . exp {-x 9 y}

(4.1)

+oo x > 0, y > 0, 6 > 0, r(6) = ;

s 6-z - e x p {-s} ds. 0

We have

q.J = [0,+co,

E c [0, + c o ) ,

a(x) = x 6, d(y) = -y. Since d(q.]) = (-oo, 0], we need

cO. = (-oo, u0] for s o m e u 0 in order to satisfy the condition d ( ~ ) + r - eLI. c ~ with r > 0. W e put E - [0, + c o ) and consider the c o n v o l u t i o n s e m i g r o u p associated a stable process z-3/2

{Xt}t_>0 with cr = 1/2 ; the density of X t is m(1/2)(z) = ~ +co

f

0

exp {-s 9 z} 9

By (3.2) we h a v e Q(0) = 0,

Z-3/2 ,/~rr

9

exp {- ~ z } dz =

9 exp {- ~ z } :

exp {- -/2 9 s 1/2}

7-2 m r ( 1 / 2 ) ( z ) -- J 2 -1 ~ " z 3~2 exp {- ~ }. If we put in (3.5) Q(z) = 1

V z > 0

(4.2)

we o b t a i n K ( x ; x') = m x ( l / 2 ) ( x ') = J2. 7r

,

X

x,a/---5

X . ~2

9 exp {- 2-x' }

(4.3)

247 E q u a t i o n (2.4) then becomes ? ~176

p(x')oc3'.

0

4~-~r'

x (1-~) x ' ' 3 / 2 "

X2

"

exp { - 2.x-------3} dx

this a d m i t s solutions of the form p(x) oc x -6-1 which do not satisfy the condition (2.5). W e t h e n consider the measure Q such t h a t d Q ( z ) = z -1/2 dz. W i t h such a choice, (3.4) yields K(x ; x ' )

=

x , t1/ 2

. exp {- 2.x' x----~-2}" F u n c t i o n s of the form p(x) or x "6 are positive

eigenfunctions of the integral o p e r a t o r in (2.4); p(x) 0r x "6 satisfies the m e n t i o n e d condition (2.5) when 6 < 1, so we find t h a t , when f(y [ x) is given by (4.1), (0 < 6 < 1) and the t r a n s i t i o n s kernel is

X2

p(x I x') (x p ( x 'p(x) ) 9 a(x) 9 K ( x ; x ' ) = x -2"5 9 x '6-1/2 9 exp { - 2.x' }

a filter-conjugate family of densities 9 ~o(x) = (su ) 1 4

" x -6 9 exp {u 9 x};

--- {~u(X)}uEOd. exists

x > 0 . It results ~,(u,x) = -

with qJ. =

(-co,0)

and

J 2 . lu] - x. T l f f s i s a s l i g h t

extension of t h e situation considered in t h e e x a m p l e iv) of [1], where 6 -- 1 Aknowledgments I like to t h a n k M a r t i n J a c o b s e n and W o l f g a n g J. R u n g g a l d i e r for valuable discussions.

References [1] B a t h e r J.A.

Invariant Conditional Distributions. A n n . M a t h . S t a t . 36 (1965), 829-846

[2] Diaconis P. and Ylvysaker D.

Conjugate priors f o r exponential f a m i l i e s A n n .

Star. 7"

(1979), 217-226. [3] Feller W. [4] F e r r a n t e

P r o b a b i l i t y T h e o r y and its Applications (Vol. 2). Wiley (1970) M. and t~unggaldier W.J. On necessary conditions f o r the existence o f f i n i t e dim-

ensional f i l t e r in discrete time. To a p p e a r on Systems and C o n t r o l Letters. [s] Levine J. a n d Pignie G.

Exact Finite D i m e n s i o n a l Filters f o r a class o f N o n l i n e a r Dis-

c r e t e - T i m e S y s t e m s . Stochastics 18 (1986) , 97-132 [6] l~unggaldier W . J . and Spizzichino F. F i n i l e - d i m e s i o n a l i t y in discrete time non linear filtering f r o m a Bayesian Statistics viewpoint. S t o c h a s t i c Modelling and Filtering, G e r m a n i A. (Ed.),

Lecture notes in Control and I n f o r m a t i o n Sciences. 91. (1987) Springer Verlag

[7] Sawitzki G. Finite D i m e n s i o n a l Filter S y s t e m s in Discrete Time. Stochastics 5_ (1981), 107114 [8] V a n S c h u p p e n J.H. Stochastic filtering theory: a discussion on concepts, methods and resuits. S t o c h a s t i c control theory a n d s t o c h a s t i c differential systems.

K o h l m a n n M. a n d Vogel

W. (Eds.). Lecture notes in C o n t r o l and I n f o r m a t i o n Sciences 1 6 (1979), Springer-Verlag.

ON THE DECOMPOSITIONS OF OBSERVATIONS WITH NON-GAusSIAN ADDITIVE NOISE AND THEIR INNOVATIONS PROCESSES* YOSHIKI TAKEUCHI A-2361

Summary.

IIASA Laxenburg,

Austria ~

This paper is concerned with information structure of the observation with

additive non-Gaussian noise under the assumption that the noise belongs to a class of continuous martingales.

It is known that such an observation is decomposed into

a process with additive Gaussian noise and the quadratic covariation process of the additive noise.

It is also known that the innovations process is decomposed into a

standard Brcwnian motion process and the quadratic covariation process.

In this

paper, a number of sufficient conditions are obtained for the observation to have the information structure such that the information in the quadratic covariation process is not contained in the additive Gaussian part of the observation and/or the Brownian motion part of the innovation process.

I. INTRODUCTION Up to the present, many researches were devoted to nonlinear filtering and its associated innovations problems If] - [ 1 7 ]

Mainly,

they are concerned with the case

of additive Gaussian noise Ill - [12], and relatively small number of researches have been reported on the case of additive non-Gaussian noise. For the class of observations in which the additive noise is a non-Gaussian continuous martingale, we reported results on nonlinear filtering problems [13] and innovations problems[14]f" - [17]

In [13], it was shown that if the additive noise

is a non-Gaussian continuous martingale,

the observation is decomposed into two com-

ponents: one is a process with additive Gaussian noise and the other is the quadratic covariation process of the additive noise.

Also, the author showed in [15] that

the innovations process of such an observation process is a non-Gaussian martingale with the same quadratic covariation process as the additive noise.

The innovations

also decomposed into two components: one is a standard Brownian motion process and the other is the quadratic covariation process, and the latter is common in both decompositions of the observation and innovations processes. In this paper, we will give a number of sufficient conditions for having the unusual information structure such that the information in the quadratic covariation process is not contained in the additive Gaussian part of the observation and/or the Brownaian motion part of the innovations.

~

The first condition is that the family of

On leave from Dept. of Industrial Engineering, Kanazawa Institute of Technology, 7-i Ohgigaoka, Nonoichi-machi, Ishikawa 921, JAPAN

249 ~-algebras t.

generated by the observation or innovations

The second one is the stochastic

process is discontinuous

independence between

and the quadratic covariation process of the additive noise. the existence of a process which is computable

in

the Brownian motion part The third condition is

from the observation and contains a

martingale part that does not belong to the space of martingales Brownian motion part of the observation noise.

generated by the

These conditions cover a large class

of observations with non-Gaussian additive noise, and we will see that usually, information structure of such observation is different with additive Gaussian noise. understanding

We will also give a number of examples for better

of the results reported in this paper.

In this paper, mathematical denotes

symbols are used in the following way.

tile transpose of a vector or a matrix.

a nonsingular

square matrix, A -I denotes

The prime

The Euclidean norm is I.lo

the inverse matrix of A.

(~,~, P) is a complete probability space where ~ i s a s a m p l e

If A is

The triplet

space with elementary

events ~, ~ is a ~-algebra of subsets of ~, and P is a probability measure. denotes

the

from that of the observation

the expectation and E{.I~} , ~ C ~

the conditional

expectation,

E{.}

given ~, with

respect to P. ~[.} is the minimal sub-~-algebra

of ~ with respect to which the fami-

ly of >-measurable

is measuraie.

sets or random variables

{~

If ~I and ~2 are sub-

~-algebras of ~, then ~ I V ~ 2 denotes the minimal ~-algebra which contains b o t h ~ 1 and ~2"

Also, for ~ C ~ a n d

AE~,

{~t; 0 E t S T }

be a non-decreasing

{xt; 0 K t K T}

is said to be

~AA

denotes

the family { B A A ,

family of ~-algebras.

adapted to

E or

F-a~pted

all t E [0,T].

It is assumed that all random variables

~-measurable.

Unless otherwise stated, stochastic

BE~}.

Let F E

A stocastic process x E

if xt(~) is ~t-measurable

for

and stochastic processes are

properties are that with respect

to P.

2. OBSERVATION AND INNOVATIONS PROCESSES AND THEIR DECOMPOSITIONS Let y E {Yt; 0 J t iT}

be an m-dimensional

yt = O + f~hs(~)ds + n t ,

observation

process given by

t E [0,T] ,

where h E [ht; 0 ~ t i T }

is an m-dimensional

notes an m-dimensional

noise process.

(i)

signal process and n E {nt; 0 E t i T }

Most discussions

de-

on the innovations processes

have been made for the case where the noise n is a Gaussian process,

especially a

Brownian motion process.

In this paper, we are concerned with a more general case

where n is a non-Gaussian

continuous martingale with an absolutely

continuous quad-

ratic covariation process. Let (~,~, P) be a complete probability decreasing and right-continuous

space and F E { ~ t ; 0 K t K T }

family of sub-~-algebras

of 5.

We will assume that

all the stochastic processes including y, h and n are adapted to F. we will consider the case where n belongs that n has the form:

be a non-

In this paper,

to a class of continuous martingales

such

250 nt = fO Rs (~ where w - {wt; 0< t denote

the quadratic covariation process of m and ~, i.e., <m,w> is an m-dimensional Fadapted process such that {mt~ t- <m,~>t; 0 i t iT} well-known that there exists f ~ 2 ( F ) dition m ~ 4 2 ( ~ , F )

such that

is an F-martingale [I0] <m,G> t= f~ fs(~)ds.

is equivalent to <m>t ~ 0, where m E {Nt(~); 0! tiT}

nent of m orthgnnal to ~2(w,F),

i.e.,

It is

Then, the conis the compo-

255 ~t(~) ~mt(~ ) -f~f~(~)dOs,

(46)

and is the quadritic variation process of ~, i.e., is an F-adapted process such that {~t 2- <m>t; 0 i t iT}

is an F-martingale [II]

4. PROOFS OF THEOREMS 5 AND 6 In this section, we will use the following notation for simplicity. and ~2(y)

Let IM2(F)

respectively denote the space of square integrable F- and Y-martingales.

Let ~2(y) denote the set of ~m-valued Y-adapted f E[ft(~); 0~ t i T} with the property (41).

We can define ~i2(v,Y), i.eo, the space of square integrable martingales

generated by ~ similarly as (42). Now, we have the following lemma. Lemma 7.

Let E~$2(F;Y) and x be defined by

x t ~ t - ~ 0 - f ~ E { a s ( ~ ) I ~ s } d s E f~[as(~ ) -E{as(~ ) I~s}]ds+mt(~ ). Then, x E ~ 2 ( Y )

and xE$2(F;Y).

If m ~ 2 ( F )

(47)

is its martingale part, then we have

<x>tI~t = <m>tI~ t

(48)

t[%t = <m,~>tr> t.

(~9)

and

Proof: Noting (39), we see that It$'s stochastic differential formula implies xt2 = x02 + 2f~xsdxs +<m>tI~t. By (50), it is clear that <m>tI~ t is ~t-measurable~

(50) Since

x02 + 2f~xsdxs = xt2 - <m>tl~ t (51) and the left-hand side of (51) is a Y-martingale, we have (48) by the definition and the uniqueness of .

Similarly, by applying It$'s stochastic differential formula

to the F-adapted process {xtvt; 0N t ST}, we have xt~t=Xo~O+f~sdxs+S~xsd~s+<m,~>tI~t . (52) Since the sum of the first three terms in the right-hand side of (52) is a Y-martingale by

v, xE~2(Y), we have (49). This completes the proof.

L e n a 2.

If ~ is an orthogonal base of ~2(y),

then w is an orthogonal base of

~2(F;Y) ~{mE~2(F); m is a martingale part of x ~ 2 ( F ; Y ) } .

(53)

]M2(~,Y)=~(Y)

(54)

Namely, > ~2(~,F) ~ 2 ( F ; Y ) ,

where ~(Y)~

Proof:

{mE~i2(Y); m0(~) =0}.

Let x~$2(F;Y), and assume that its martingale part mE~I2(F;Y)

fies <m>tl~t# 0 and <m,w>tI~t = 0, ~(Y).

(55)

By eemma i, we have <x>tI~t#0 and <x,v>tI~t=0.

consequently, ]M2(v,Y) ~ ( Y ) .

satis-

Then, the process x defined by (47) belongs to Hence,

x~2(~,Y)

and

This completes the proof.

The following lemma is easily shown by the well-known results for the observation with additive Gaussian noise.

256 Lemma 3.

If v is not an orthogonal base of ~2(y),

i.e., if ~ 4 2 ( v , Y ) ~ ( Y ) ,

then we have (22).

Proof:

If ~t=q]t for all tE [0,T], then v is the Gaussian innovations process

for the observation 9 with additive Gaussian noise. Theorem 3.1], we have ]M2(~,Y) = ~ ( Y ) .

Hence, by (C- 3) and [i0;

This completes

the proof.

Now, we will give the proof of Theorems 5 and 6.

Proof of Theorem 5:

By Lemma 2, we have

~2(w,F) ~ 2 ( F ; Y )

> ]M2(~,Y) ~ ( Y ) .

Hence, if there exists xE$2(F;Y) ~2(w,F),

whose martingale part m does not belong to

then v is not an orthogonal base of ~ ( Y ) .

Hence, by Lemma 3, we have

(22).

Proof of Theorem 6: $2(F;V),

First, let us show (22).

Since q/tC~t, tE [0,T], if x E

Hence, S2(F;V) CS2(F;Y)

and we have (22) by Theorem 5.

then xES2(F;Y).

In order to show (23), note that from (i), (4), (14) and (16), we have (56)

v t = 0 + S~(hs (w)- h(s)}ds + S~ R~(s,~)dw s and vt =

ft ~-89 O+~ K 0 ~s,~){hs(~)-

fi(s)}ds+wt.

(57)

Let us regard v as an observation process and ~ as its component with additive Gaussian noise.

Then, we see that the innovations process and its Brownian motion

component of "observation v" are respectively v and ~ since E{ht(~)- fi(t)I~/t} = E{E{ht(~)- fi(t) I%t } 12/t} = 0.

(58)

Hence, we can apply Theorem 5 by replacing y and y by v and v, respectively. if there exists xE$2(F;V) ~2(w,F),

whose martingale part m E ~ 2 ( F ; V )

then we have (23).

Remark 5.

Namely,

does not belong to

This completes the proof.

As we can see from the proof of Theorem 6, for the observation with

feedback: yt=8+S~{gs(~)-~(s)}ds+f~ where ~(t) ~E{gt(~)i~t},

we have v = y

R~(s,~)d~s,

and the innovations informational

(59) equivalence

to the observation always holds, provided that (59) has a unique strong solution.

5. CONCLUDING REMARKS We obtained sufficient conditions for (22) and (23) and it was shown that for a large class of non-Gaussian martingales n, the information in Z is not contained in y and v.

We were not concerned with the innovations informational equivalence

to the observation in this paper.

It should be noted that the condition of innova-

tions informational equivalence is mainly related with the stochastic properties of h and {Ro89

0 ~ t~T}

whereas the condition for (22) and (23) is described

by the properties of R 0 and ~ and is almost independent of h.

257 REFERENCES [i] T. Kailath, "An innovations approach to least-squares estimation--Part I: Linear filtering in additive white noise," IEEE Trans. on Automat. Contr., vol. AC-13, pp. 646-655, 1968. [2] P.A. Frost, Estimation in Continuous Time Nonlinear Systems, P h . D . Dissertation, Stanford Univ., Stanford, Calif., 1968. [3] P.A. Frost and T. Kailath, "An innovations approach to least-squares estimation --Part I~: Nonlinear estimation in white Gaussian noise," IEEE Trans. on Automat. Contr., vol. AC-16, pp. 217-226, 1971. [4] T. Kailath, "A note on least squares estimation by the innovations method," SIAM J. Contr., vol. I0, pp. 447-486, 1972. [5] J. M. C. Clark, "Conditions for the one-to-one correspondence between an observations process and its innovations," Center for Computing and Automation, Imperial College, London, U.K., Tech. Rep. i, 1969. [6] V. E. Benes, "Extension of Clark's innovations equivalence theorem to the case of signal Z independent of noise, with fZs2dS< ~ a.s.," Mathematical Programing Study, vol. 5, pp. 2-7, 1976. [7] D. S. Cirelson, "An example of a stochastic differential equation having no strong solution," Theory Probability Appl., vol. 20, pp.427-430, 1975. [8] V. E. Bene~, "Nonexistence of strong nonanticipating solution to stochastic DEs: Implications for functional DEs, filtering and control," Stochastic Processes and their Appl., vol. 5, pp. 243-263, 1977. [9] D. F. Allinger and S. K. Mitter, "New results on the innovations problem for nonlinear filtering," Stochastics, vol. 4, pp. 339-348, 1981. [I0] M. Fujisaki, G. Kallianpur and H. Kunita, "Stochastic differential equations for the nonlinear filtering problem," Osaka J. Math., vol. 9, pp. 19-40, 1972. [ii] H. Kunita, Estimation of Stochastic Processes, Sangyo-Tosyo, Tokyo, 1976. (in Japanese) [12] R. S. Riptser and A. N. Shiryayev, Statistics of Random Processes I: General Theory, Springer-Verlag, New York, 1977. [13] Y. Takeuchi and H. Akashi, "Least-squares state estimation of systems with statedependent observation moise," IFAC J. Automatica, vol. 21, pp. 303-313, 1985. [14] Y. Takeuchi, "Generalized innovations equivalence for observations with nonGaussian noise," Proc. International Sessions 26th SICE Annual Conference, Hiroshima, Japan, July 15-17, vol. 2, pp. 1069-1072, 1987. [15] Y. Takeuchi, "On non-Gaussian innovations processes for observations with nonGaussian noise," Proc. 25th IEEE Conference on Decision and Control, Athens, Greece, Dec. 10-12, pp. 1035-1036, 1986. [16] Y. Takeuchi, "Innovations informational equivalence for observations with nonGaussian noise," Proc 26th IEEE Conference on Decision and Control, Los Angeles, California, Dec. 9-11, pp. 1163-1168, 1987. [17] Y. Takeuchi, "Innovations informational equivalence for a class of observations with independent non-Gaussian noise," submitted to IEEE Trans. on Inform. Thoery.

NONLINEAR THE

FILTERING

CASE

OF

WITH

UNBOUNDED

Patrick UF~A C N P ~

No

paper

noises

concerns

and

a one

coefficients. respect define

to

prove

paths

of

the

a formal

the

Malliavin

density

for

filter

is

the

the

formula.

unique

problem

continuity

Then,

we

filter.

Metz

by

prove

Finally,

solution

of

of

filter means

of

the

filter

With

this

and

we

~iven

of

a

the

with

aim,

we

prove

a result

that

Zakai

unbounded

the

existence

prove

correlated

with

process.

the

we

with

process

unnormalized

calculus, the

de

filtering

observation

continuous

Kallianpur-Striebel

Universit~

observation

we

the

COEFFICIENTS.

LORRAINE

a nonlinear

dimensional

First,

NOISES

FLORCHINGE~

399,

INDIA

This

DEPENDENT

a by

regular

unnormalized

equation.

INTRODUCTION

The

aim

compute given It

the

is

of

the of

known

that

minimize

that

the

nonlinear

history

well

which

in

"'at b e s t "

the

of

compute

this

obtain,

for

the

an

of

the

best

with

{yt}

is

to

the

following

process.

a

functional

to

to

any

t.

{x L }

of

xt

expectation

field

time

:

process

conditional

for

conditional

the

this

sigma

up

expectation

the

on

the

process

is

stochastic

estimation

respect

conditional t,

theory unknown

risk

observation

each

an

observation

quadratic

functional

paths

filterin~

paths

~enerated Notice

functional

distribution

by

that

to

reduces

of

xt

to

given

{ys,OSsst}. Some be

theorical

solved

9 The

are

continuity

observation 9 The state

an The of

scalar aim,

we

discuss

questions following

of

the

in

nonlinear

filterin~

which

have

to

:

filter

with

respect

to

the

paths

of

the

process.

existence

9 Gompute

case

the

the

of

uniqueness purpose

of

a nonlinear observation present the

a regular

general

main

form result this

ideas

the

paper

process of

and

for

nonlinear

for

filtering

a survey

density

of

is

the

solutions to

solve

problem

with

with the

unbounded previous

results

of

filter.

filtering to

equations

these

those

questions

dependent

works

on

[i?]

topics

[i8].

the

and

With

these and

in

noises

coefficients.

[i63,

and

equations.

a

this and

259 I STATEMENT

OF THE

We consider process

P~OBLEM. on a probability

- observation

stochastic

(xt,yt)~nx~P

differential

system

(O,~,P)

solution

the of

pair the

system

followin~

:

t=l

~=1

dyt=h(xt)dt

+ dvt

x o donn~

loi

de

space

(i.l)

mo

Yo=O where

w and

~m and Notice the

v are

two

independent

~P respectively. that since the process

system

process

and

is c o r r e l a t e d . Furthermore, assume 9 X o is a vector

the

the

field

9 XI,..,Xm,XI,..,Xp 9 h is a function

Wiener

v t appears

observation

followin~ on

are

in c ( ~ n , ~ p)

with

in

definitions

both

process,

hypothesis

~n with

bounded

processes

less

vector

such

than

X"o +

system

on the

in

of Ci.l)

coefficients

linear

fields

that

the

values

:

~rowth.

on ~n

~ hs163 h a s

less

than

t=l

linear For

~rowth

any

fin

t~s

associated

with

~o(s

order

and

any

the

system

p)

2 CONTINUITY

In this

we

OF THE

obtain

are

a paths remains

process of

of the notion

we are

function

~,

as

the

of

define

functional

= Es

t)

interested

This means that on the measures by

paths valid

i s used.

interest

by Chaleyat-Maurel Next, we study the paths That

solutions

(l~ filter

the

:

/ F~]

a.s.

FILTER.

section,

formula

observation which

(l.i)

> ~t~(y)

o f t h e f i l t e r ~t" sense errors made filter

bounded

explosive

O(Ys/OSs~t).

F~ =

hand,

avoid

>

y where

to

The

in nonlinear

in s continuity

of

in

continuity

properties

the model drives in the of the observation. On the formula when

which

an

different filterin~ the

filter

implies of

the of the continuity

been

discussed

approximation notions have with

right other

that

respect

observation p r o c e s s f o r t h e B a n a c h norm. of continuity in nonlinear filterin~

has

Co

the been

260 introduced bounded

by Clark

[8]

coefficients.

in t h e

We aim

F mappin~

Co([O,T],~P)

in ~,

norm

Co(s

such

on

That

notion

of

continuity

allows to estimate observation. The

continuity

where been

the

of

the the

studied

first

of

uncorrelated

the

continuous

that

filter

= ~t~(y) by

paths

one

has

with

by S u s s m a n n

for

by Fleming-MiLLer

[153

last results have continuity of the

been improved by Sussmann filler with respect Lo the that

Vc>O B Kc>O /

Banach

formula only

"cubic

Lwo the

such

the

which

a

unique system~

coefficients

and

h is a C 2 f u n c t i o n

h is

the

to

uncorrelaLed

unbounded

[303

when

with

with

a funcLional

a.s.

a paths

associated

of

respect

whenever

is s c a l a r

systems

existence wiLh

F(y)

~ives signal

observation

case

to prove

sensor

a polynomial

has

problem"

function.

These

[31] who proved Banach norm when

:

[Lh[

§ ~

~ X t h l 2 < c h 2 § Kc

L=I

The

of

continuity

the

(i.e.3~,i~p,s.t.

filler

X ~0)

associated

has

been

with

handled

correlated

by Davis

[9]

systems

in

the

case

t

of

a scalar

observation

with

bounded

Kolhman C123 w h e n t h e v e c t o r f i e l d s notice that Davis-Spathopoulos [113 of correlated system with unbounded Here,

we aim

to study

the case

observation

is scalar

Assume

XI,... ,Xm,X I are

that

function Denote

with

b y ~t

less the

with

than

of

the

~t ( x ) ' x § Moreover,

assume

that

a correlated

by

E11iott-

system

where

the

coefficients.

C 3b v e c t o r

of

and

X~, l ~ p commute. Moreover, have studied a particular case coefficients.

unbounded

exponential

solution

coefficients

and

fields

that

h Is

a

equation

:

Cs

~rowth.

followin~

~(~

integral

)(x)ds.

condition Tn

(H)

Vr>O,Vc>O,3K~>O

/[Xh~+

sup

[GO

L defined

denoted

a.

for

some

constant

such

that

:

~ in

[0,I[.

L,j=i

D.h (C~)

For every

i,j,iSi,jsn,

the f u n c t i o n s

D~Dih

~

I/2 and

(I + h x) are

i + hz

in C~(~n).

Thus, a c c o r d i n ~ w i t h t h e o r e m 5.2 from [4], we can f o l l o w i n g u n i q u e n e s s r e s u l t to t h e s o l u t i o n o f e q u a t i o n

THEOREM

equa*ioa

4.2

: Se~

(4.1)

~(x)

has a u n i q u e

l" b (~n))

in C ( s

: i + h 2zf x )

n

~or

solution

p

every

such

C ( [ O , T ] ; C 2,1" b (~n,Z))

x in ~n.

state (4,1)

the :

Suppose

that

~ha*

e x p ( M ' ~ M) p.

for

every

H' 0.

These algorithms obviously satisfy : qf0 D ~.d0 D q f l D ~ 1 ~ ... D Clf~tD r

D ...

Hence the respective limits, qf* and r are equal. This limit is well known as being the supremal (E,A,B) invariant subspace contained in ffrv IV 81], [O 85]. Both algorithms will be used hereafter in order to describe some precise structure of (1). A

A

A

A

When dealing with (E, A, B, C), the associated recursions (8) and (10), say e~eit and ~0it obviously satisfy (see for instance (12)): ~ 0 D q~St0= ~V'1 = ~ 1 = ~ 2 ~... A

A

A

A

Hence, for (E, A, B, C), we shall only deal with : (11) ~ , e 0 = ~ , ~ , e I t + l = 5 % c ' ~ - 1 ( h n B + E ~ f I t )

It>0.

We fwst have the following result : Lcmma 1 : The following algorithms are equivalent to (11) (see (4)-(5)) : (12) ~ , e 0 = ~ ,

c~eit+l=,~nA-l(imB+E~lfit + .~)

(13) ~ l f 0 = ~ , c ~ e I t + l = f f ( , n [ i - l ( I m B + E C ~ e I t ) r Proof :

32 ]

It20, It>0.

i) the equivalence between (11) and (12) is direct since Ker P0= . ~ . ii) for the equivalence between (12) and (13), we shall first prove that, for any c~c ~(~ :

(14) A - I [ ( I m B + E ~ ) ~ . C ~ I ] = A - I ( I m B + E ~ ) $

$2 "

For that, let x e A-l[(Im B + E~) 9 -~1 ] ' then, because of (5), there exist Xle g l @ Ker A, x2e 32, t ~ ~, b ~Im B and XlE_~ 1 such that : x = Xl~ x 2 and AXl+ Ax2 = (b+Et) $ x 1, that is : Xle A'l(Im B + Er

and x2e A'I(_.~_1 )=: Ker A 9 82 , which imply :

A-l[(Im B + ECg) ~ -~1 ] c A-l(Im B + EcC) + Ker A 9 ~2 = A-l(Im B + E~) $ 82 . On the other hand, A'l(Im B + E~g) 9 82 c A'l(Im B + Ec~) + A-I(_.~I )c A-l(Im B + ECg+_~l ), which ends the proof of (14) and therefore that of ii) * The relation between algorithms (8) and (13) is described in the following Lemma :

282 Lemma 2

:

(15) ~ 0 nr Proof :

~cr c!,eIt nqATix+l=,cf(, n A -1 [( Im B + ECV'it-1)n( Im B + E r

Ix > 1.

First note that q.e 0 n~121=: ~(~ nSes = ,5~. Next, from (8) and (13), it follows :

q e Ix n~.eIt+l=5% n [A -1 ( Im B + E q ? I x - 1 ) l n [ a -1 ( Im B + E ~?It) @ g2 ]" On the other hand, let x 9 [A -1 ( I m B + EqfIt-1)] n [A -1 ( Im B + E r s 11 e A - l ( I m B + E q

?It-1) C 81 ~ K e r A , Sl2 9 A " I ( I m B + E

$ 8 2 ], there then exist

~eIt) c 81 ~ K e r A a n d s 2 9

that : x = s 11 = s12 + s2, which implies : s2= s 11 - s12 9 (81 ~ Ker A ) n $2 =: 0 (remember (5)). Therefore, x e A -1 ( I m B + Eq2 I t - l ) n A -1 ( I m B + E ~ i t ) and (15) follows,

2 STRUCTURE ALGORITHM AND ZEROS AT INFINITY : In the proper case (rank E = n), the infinite zero structure only depends on the transfer matrix of the system (see for instance [CD 82]). In that case, this structure at infinity has been characterized algebraically with the help of the famous Silverman's Structure Algorithm [S 69] (see [SK 83]). In the case of generalized systems, there exist zeros and poles at infinity, and the infinite structure is not completely characterized by the transfer matrix ; more precisely, the invariant zeros at infinity are not only transmission zeros but also include (possibly) some input / output decoupling zeros at infinity [R 74]. The Structure Algorithm has been extended by Lewis to this class of generalized systems (see [LB 85]) and it is thus possible to define some kind of structure of zeros at infinity directly from this Structure Algorithm in a similar way to that of [SK 83]. A preliminary result in that direction has been given in [M 89]. Meanwhile, as a part of a new canonical form ILL 89], four different types of zeros at infinity have been geometrically characterized as follows. Their respective orders, say mi,ni,Pi and q i ' satisfy : I c a r d {m i = g + l } = d i m [ ( K e r E n r (16)

q c a r d {n i = I x + l } [card

It) / (Ker E n q,e Ix+l)[ VIx >_ 0

= dim[(ImB n E"~ It-1)/(imBn

{Pi = g+1}

[,.card {qi = It+l}

Eq, e It)]VIx_> 1

d i m [ ( I m B n Eq,~ It) / (Im B n E~fa9- Ix)[

Vit >_ 0

d i m [ ( K e r E n q,e Ix) / (Ker E n r

VIx _> 0

Ix)[

The number of n i equal to 1 is given by : card{n i = 1 } = dim [ Im B / ( Im B n Im E)] = dim ( ~ ). For more details, see [LL 89]. Recall however that, for the time being, there exists no dynamic interpretation of these lists mi,ni,Pi and qi in terms of transmission or input / output decoupling properties. The main objective of this paper is to provide an algebraic characterization of the four lists described in (16) with the help of the following Refined Structure Algorithm (see Corollary 1) : Refined Structure Algorithm : Initialization (k=O) :

283 (17.a) T 0 [ E A B ] = [ E1 A1B1]

with TO a maximal row compression on E

(18.a) S0[~O0 ~COc]=[ DI00~0'Cl]

w~ ~o, ma~m~ ~ow com,~,~o~ ~176 [~0 ]

(17.b) I[ E A ; ] : [ E'I AI BI] (18.b) I [ O C ] = [ O ^

^

A

C1]

^

A 0, 13(i, D 1, and C 1 are empty. Iteration k :

j

(19.a) 0 (20.a)

0

Sk [ D k

Ek+i Ak+l Bk+l] withTk amaximal row compression on

0

7 Dk+x ck+l ]

with Sk a maximal row compression on []/Dk[

LB--kj

Ak ^ A A ^ Bk E k + 1 Ak+ 1 Bk+ 1 with Tk a maximal row compression on (19.b) ,', TkL 0~ 0 = 0 Ak B__kJ 0

(20.b)

(19.c)

A [o,c,I D,,, ,+i

AAEA 1

Sk B-k Akj ^ Ek Ek+l

0

Ck+ I

0

0

[A1

^ . Dk with Sk a maximal row compression on ^

L--BkJ

] =Zk

(20'c) ]I ^= D Ak 1L D k + Note that this proposed algorithm is exactly Lewis' one, but applied simultaneously (in parallel) to both systems (2) and (6). Then, the following hold for k >_.0 and directly follows from [M 89] and [LB 87] : Fac~ l, : f Ker Ek+ I = Ker E ~ qf, k (21) Ker Dk+ l = B ' I ( E qfk) qy, kr KerCk+ I = q,ok+l

284

Ker Ek+l (22)

= K e r E n c~ek

Ker Dk+l

= B-I(E

~/'k~_0)

~l-ekn Ker C^ k + 1 = d~ek+l A

for (22), note that Ker E = Ker E and Ker P0 = -~---0" We are now in a position to establish our main result : Theorem 1 : For any k _> 1, the following hold : A ,'~.k- 1 (23) Ker E k n Ker E k + l = Ker E n (24) Ker D k n Ker [ ) k + l = B-I(E ~ k - 1 ) A k (25) qfk n Ker C k + l = The following Corollary is thus immediate and provides an algebraic characterization (through our Refined Structure Algorithm) for the different types of zeros at inf'mity of (I) recalled in (16) : Corollary_ 1 : card { m i = ~t+l } = rank (El.t+2) - rank ( Z ~ + l )

"v' ~ > 0

card { n i = ~+1 } = rank ( Dlx+l ) - rank ( Atx) card {n i = 1 } = rank B(_B0)

k/~t > 1

card { Pi = p.+l } = rank ( A~t+l ) - rank ( D g + l )

V l.t > 0

card { qi = p.+l } =rank (E~+I) - rank ( Ep.+l )

'v' ~ > 0

The proof of Theorem 1 deeply relies on the following Lemma which depicts some interesting distributive properties and the proof of which is given in Appendix : Lemma 3 : For any k > 0, the following hold : (27) Ker E n (qfk + ~ , k + l ) = Ker E n cv'k + Ker E n ~ f k+l (28) Im B n (EC~"k + E~ ek+l) = I m B n Eq7 k + I m B n E ~ k+l (29) , ~ k = clpkn~lek+l Proof of Theorem 1 : i) ii)

(23) follows from (21.a), (22.a) and (29) 0 From (21.b) and (22.b), it follows : A

Ker D k n Ker Dk+ 1

-----

1

A

B- [ Eclpk-ln (E q ek ~ .r A

A

A

On the other hand, for any x e E%ek-ln (EClf'k ~ ~,q3), there exist v ~ q ek-1, v E c f k and -~)~-~:0 A

A

such that x = Ev = Ev 9 -~0" Then : -~0 = Ev - Ev ~ Im E n ~

=: 0 (remember (4')).

Therefore : E q f k - l n (E~ ek ~ ~...~_0) = E q / ' k ' l n EC~ k , which implies : KerDk nKer Dk+l

= B - l ( E q f k - l n E~l'ek) = B - I [ E ( q f k - l n c~k)] =

B-1 E ~ k - 1

, because of (27) , because of (29),

285 which establishes (24) 0 iii)

From (22.c), it follows :

clTk nc~ekc~ Ker

Ck+l= q2k n c~k+1

^ 1= then : cv'k n Ker Ck+

~k

(since

~k

c

~k

for all k and because of (29)), which ends the proof of

Theorem 1 .

CONCLUSIQN : We have presented here a refined version of the Structure Algorithm introduced by Lewis (See [LB 85]) in the case of generalized systems. This refinement amounts to performing simultaneously Lewis'Algorithm onto the given system and on some projection of this system, where the algebraic part of the state equation is eliminated. In this way, we have been able to characterize algebraically all the four types of invariant zeros at infinity of the system, which are geometrically described in iLL 89]. Furthermore, we have extended the well-known algebraic characterization of q ek (see (21.c)) [LB 87] to ~fa9-k (see (25)) : each step of these algorithms (8) and (I0) can be computed as the Kernel of some map derived from this Refined Structure Algorithm. Connections between the geometry of the system and that of its projection have also been established (see for instance Lemma 1 and (29)). Our future objective will be to use these results in order to derive a left inverse of a given (left invertible) system with a minimal pole structure at infinity, that is with the least possible number of differentiators. This minimal pole structure at infinity is in correspondence with the tran~mis~i0n ?:eros at infinity of the system (recall [CP 84]), which have to be further characterized from a geometric point of view.

REFERENCES : [CD 82] C o m m a u l t C., Dion J.M. : "Structure at infinity of linear multivariable systems : a geometric approach", IEEE Trans. on Automat. Contr., AC-27, n~ pp. 693-696, 1982 [CLM 89] C o m m a u i t C., Lafay J.F., Malabre M. : "Structure of linear systems : geometric and transfer matrix approaches", IFAC Workshop "System Structure & Control : State Space & Polynomial Methods", September 25-27, 1989, Prague [CP 84] Conte G., Perdon A.M. : " Infinite zero module and infinite pole module", Proceedings of the 6th Int. Conference on Analysis and Optimisation of Systems, Lecture Notes in Control and Information Sciences, Springer Verlag, Vol. 62, pp. 302-315 [CPW 88] Conte G., P e r d o n A.M., W y m a n B.F. : " Fixed poles in transfer function equations", SIAM J. Control & Optimization, Vol. 26, n ~ 2, pp. 356-368, March 1988 [LB 87] Lewis F., Beauchamp G. : "Computation of Subspaces for Singular Systems", MTNS' 87, Phoenix, June 1987 iLL 89] Loiseau J.J., Lebret G. : "A new canonical form for singular systems with outputs", in this book of Proceedings. [ L O M K 89] L o i s e a u J.J., Oz~aldiran K., M a l a b r e M., K a r c a n i a s N. : "A feedback classification of singular systems",IFAC Workshop "System Structure & Control : State Space & Polynomial Methods", September 25-27, 1989, Prague [M 89] Malabre M.: "On infinite zeros for generalized systems", MTNS'89, Amsterdam, June 1989 [M 73] Morse A.S. : "Structural invariants of linear multivariable systems", SIAM J. Contr.& Opt., Vol. 11, n~ pp. 446-465, 1973 [O 85] Oz~;aldiran K.: "Control of descriptor systems", P h . D . Thesis, Georgia Institute of Technology, May 1985 [R 74] Rosenbrock H.tl. : "Structural properties of linear dynamical systems", INT J. of Control, vol. 20, n ~ 2. pp. 191-202, 1974 iS 69] Silverman L.M. : "Inversion of multivariable linear systems", IEEE Trans. on Automat. Contr., AC- 14, n~ pp. 270-276, 1969

286 [SI~ 83] ~iiverman L.M., Kitap~i A. : "System Structure at infinity", Systems & Control Letters, 3, pp. 123-131, 1983 [V 81] Verghese G.C.: "Further notes on singular descriptions", JACC, TA4, Charlottesville, 1981 [WS 85] W y m a n B.F., Sain M.K. : " On the design of pole modules for inverse systems", IEEE Trans. on Circuits and Systems, CAS-32, n ~ 10, pp. 977-988, October 1985

A P P E N D I X : P r o o f of L e m m a 3 : Consider the generalization of Morse's group [M 73] corresponding to all possible changes of bases in the domain ~ (noted V), in the codomain . ~ (W), in ~U' (G) and in ~ (H) and all possible proportional state feedbacks (F) and proportional output injections (R). With each (E,A,B,C) system is associated an orbit under this group action. All equivalent systems within the same orbit have some common structural properties which are more nicely described on the particular element of the orbit called the canonical fomL As any element of the orbit, this canonical form (which is fully described in [LL 89]) is derived from (E,A,B,C) through an element of the transformation group, say (V c, Wc,Gc,Hc,Fc,Rc), namely :

(A.1)

Fc = ~A c = |B c = [C c =

W c E Vc ; where W c and V c are isomorphisms W c (A+BFc+RcC) V c Wc B Gc 9 where G c is an i s o m o r p h i s m Hc E Vc " where H c is an i s o m o r p h i s m

We shall first prove this Lemma 3 for systems in the canonical form : L~mm~ A,1 : For any k > 0, the following hold : (A.2)

Ker E c n (flec k + ~,eck+l ) = Ker E c n fleck + Ker E c n ~ c k+l

(A.3)

Im B c n (Ecfleck + E c ~ c k+l) = Im B e n Ecflec k + Im B c n Ecq~eck+l

Proof : The proof is based on both following properties : i) First of all, it is very easy, working on the canonical form (Ec,Ac,Bc,C c) (see the description of the foml in ILL 90]), to show that all the subspaces which appear in (A.2) and (A.3) can be spanned exclusively by vectors of the canonical bases of ~(~ and . ~ . ii) Next, it is relatively easy to see that for any triple ( ~ , 3 , ~ ) of subspaces of a vector space ~ , a basis of which can be chosen as a subset of a given basis o f ~ , i.e. : ff~, = {{ei }},S = {{ej }}, and cC= {{e k }}, where {ei }, {ej }and {ek }are elements of the basis of ~ , then distributlvity always holds, that is :

5%n($+~)=SZng+~n~. Therefore (A.2) and (A.3) directly follow , Now, a series of three results, which make the connection between Lemrna A.I and Lemma 3, is established : Fact A.1 : (A.4) (A.5)

Im E c = W c Im E Im B c = W c Im B

(A.6)

KerCc=Vc-1KerC

(A.7)

Ker E c = V c- 1 Ker E

Proof :

Directly from ( A . 1 ) ,

(=Vc "1 5ec,,)

287

Fact A,2 : With ~ (a.8) then :

defined in (4), let :

" ~ 0 = Wc " ~ {~

= Im Ec * ~:..c 0

~ 0 : =

Vc-1 cv'k

(A.10)

qfc k

(A.11)

~ag-ck

=

Vc-1

(A.12)

C~fck

=

Vc-I c~fk

gO. k

Proof : Only (A. 12) will be proved here, since (A. 11) and (A.10) are similar. First note that: r 0 = Vc-1 ~ f 0 . Now, suppose that (A.12) is true for some k _> 0, then, from (A.1), Fact A.1 and Fact A.2, it follows : ~fck+ 1= Vc-1 if.c n Vc-1 (A+B Fc+RcC )- 1W c- i (WcEd~fk+Wci m B+Wc._..~) = Vc-lc~ok+ 1 , We are now able to end the proof of Lemma 3 : Proof Qf Lemm~ ~ : i) (27) follows from (A.2), (A.7) and Fact A.3 0 ii) (28) follows from (A.3), (A.5), (A. 1) and Fact A.3 0 iii)

First of all, note that (29) holds for k = 0 since r -0 = q f 0 n e~;,1 . Now, suppose that (29) holds for

some given k > 1, that is : r = qfk-I n d~ek, then, from Lemma 2, (28) and (27) : q f k n c~ek+l = , ~ n A-l[( im B + Eqfk-l) n ( i m B + E~k)] = fff, n A - l [ I m B + E % ~ k - l n ( i m B + E ~ k ) ] = ~ e t . d k

,

THE DYNAMIC BLOCK DECOUPLING PROBLEM: A MINIMAL SOLUTION BY PRECOMPENSATION J. DESCUSSE Laboratoire d'Automatique de Nantes, URA CNRS 823, ENSM ,1 Rue de la No~, 44072 Nantes C~dex 03, France

Abstract : The purpose of the present paper is to provide a new dynamic solution to the Block Decoupling Problem based upon a procedure which extends to the block case that one previously given in [14] for Morgan's Problem. This procedure is minimal in the sense that it leads to the least infinite zero structure for any decoupled system reachable from the original one. Kevwords: Block decoupling problem ; block essential orders ; infinite zero structure ; minimal dynamic compensator.

1) Introduction . During the last 25 years, a great deal of interest has been brought to the theory of input-output decoupling for linear time invariant systems. Since the early works of Morgan[12] and of Falb and Wolovich [9], many authors have tackled this famous problem. It is rather difficult to give an exhaustive bibliography on this topic, so many papers have been published ! (Many references can be found in [10], [13] as well as in [7] for very recent ones). The theory is still in progress. Till a recent past , the general block decoupling problem remained unsolved without particular assumptions made on the system or on the feedback laws (see for instance [1], [5], [11], [13]). A complete solution with static state feedback is given in [18] When it is not achievable one can use dynamic state feedback [16] or precompensation.(See [10] for details on static or dynamic state feedback as well as precompensators). The scope of the present paper is to provide a new dynamic procedure which generalizes that one of Wang [14] for Morgan's Problem. Two of its main interests can be noticed : it leads to the least infinite zero structure for the dynamically decoupled system. Secondly, it can be extended with some arrangements to the nonlinear case [8]. Preliminary results are also presented in [6]. The paper is organized as follows : next section is devoted to notation and basic concepts. In the third section, we introduce the block decoupling problem with static state feedback. In the last one, we provide a new algorithmic dynamic procedure which can be used when static state feedbacks are not available. At last , we recall the definition of the "block essential orders" [17] , [18]. We also recall that they are the least infinite zero orders reachable for decouplable systems. 2) Notation and basic concepts We shall consider, in the sequel, linear time invariant systems described by :

289 ,T-,0 I ~ = Ax + Bu

[ y = Cx where xe ,,~ = ~n, u 9 ~ = ~m, y 9 ~ = [p,p; [p, denotes the field of real numbers. The set of natural integers will be written as I]t. Associated with 7_.0we shall consider a k-partition of the output y into k nonempty subsets of components Yi, each of dimension Pi (0 < Pi, '~ie k Pi = P). This partition induces a corresponding partition of C, written as {Ci} k. We shall assume, without loss of generality, that B is monic. The same symbols will be used to denote a map or any of its matrix representations in particular bases. Let 3 the image of 1 3 , a n d K e r C t h e kernel ofC. Let B b e a m a t r i x , t h e n sp{B}will denote 3 . For any space ~.,~, dim ~,~ is its dimension. If ely c ff-.~, ff.~/cV will denote the factor space of ~ - byC~. A set of p elements will be written as {.}p and k will be used for the (not necessarily ordered) set of integers {1, 2 ..... k}. The number of elements of a given list {.} is noted as card {.}. The symbol U will denote the union of sets with repeated common elements. The rank of (C, A, B) defined as the rank of its transfer matrix will be written as p ; Pi will denote the rank of (C i, A, B). In what follows, we shall use cFi* the maximal (A,B) - invariant subspace contained in Ker C i , also written as sup I(A, B, ker Ci), fi~i~ that one contained in nj~ i Ker Cj, while $~'i" will denote the maximal controllability subspace contained in ~i*Recall that %Pi* and "-~i* are the respective limits of "I.S.A." and"C.S.A." defined by : I.S.A. C.S.A. flYi 0 = ~

..~i 0 = 0

cFiP- = Ker Cin A "1 ( s

"1 + 3)

SLip. = ~i" 'q (A ,..%ip.-1 + 3)

We shall also use "C.I.S.A." defined by: 80 = o 8p. =

3

+ A(Ker

C N ,~,p._t)

of which the limit is written 8 * = i n f 0-.t (C, A, 3 ) Now, we recall possible definitions of the infinite zero structure of (C,A,B) (See for instance[2], [7]). Define P~+I = d i m ( 3 nCVp./ 3 n q P * ) # > 0 (2.1) = dim(C~/ where

{~PZ}

and %P* are

C~p.) related

p. > 0

(2.2)

to sup I(A, B, kerC).

The infinite zero structure of (C,A,B) is the list of integers {pp. } , or equivalently {n'i} defined by n' i = card {pp. ~ i}. has Pl infinite zeros of which the orders are the n'i's. Recall that Pl = P.

290 3) The Static State Feedbaqk Block Dec0vDling Problem (SSFBDP) SSFBDP is defined as follows: "For a given k-partition of the output {Ci} k find necessary and sufficient conditions under which there exist static state feedbacks u = Fx + ~-i~k Gi vi such that v i controls Yi without affecting yj, j#i, i ek". This problem remains unsolved in its most general setting. Up to now, the only available results deal with special situation when the static state feedback is "regular", which means that G = [G 1 I...I Gk] has rank m, or when it is "restricted", in other words when u = GFx + Gv, [5],[11]. Theorem 3.1 1"13] : Let ,T-.0 and a k-partition of the output be given. Then the regular SSFBDP is solvable if and only if : = )lie k ~ c~ ~i

(3.1)

As an immediate consequence, we have Corollary :~,1 : Let T.0 and a k-partition of the output be given. Then the regular SSFBDP is solvable if and only if : (3.2) = Zi~k ~ n n j ~ i 97*J The proof is not difficult to perform and is left to the reader. Lemma ~).1 : Let E,0 and a k-partition of the output be given. Then, if P = T-ie k Pi we have : = ,T-,i~k ~ n ~i r

ely* = c~iek clYi

(3.3)

Proof : It is quite similar to that one given in [3], for right invertible systems, turning Pi into pi.E] Now suppose that (3.1) or (3.2) fails. One can search for dynamic solutions. For this purpose an auxiliary system is joined to ~0. It is defined as T.a ,

/ xa = D l x + D2x a + Ev 7.a

] [ u = F l x + F2x a + Gv

where x a e "~a = I~n~ and v e Ig m. We shall say that decoupling is achievable with dynamic compensation if it is possible to find ,T-,asuch that the composite system T.0 x ,T-,adefined on [pn+no can be decoupled using static state feedbacks. Note that this problem is always solvable, under non restrictive conditions[13] , [15].

291 Our purpose here is to provide a new solution to this problem, when p = ,T_,iek Pi. This new solution will be based upon an algorithmic procedure, which generalizes that one proposed in [14] for Morgan's Problem (k=p). From it, it will be possible to define the essential orders of the output related to the given k-partition {Ci} k. 4 I The Dvnamic Block Decoupling Problem Consider the following [14] available when k=p.

Dynamic Algorithm (D.A.), which generalizes that one of

Steo 1 " From the given data ({Ci} k, A, B) compute ~1 = -T-,iek$ n nj~ i c]yj . If ~1 = ~, Stop ! If ~1 c ~,write ~ =

~1 @ ~2- Define Gl and G2 such that ~ I = I m B G 1

and ~ 2 = I m B G 2 Steo 3 : Consider ,T-,0and write u = G1 Vl + G2 v2 = A x + BG 1 v 1 + BG2v 2 assume that dim v 2 = q, and put integrators in series with the q corresponding inputs of v 2. One obtains rewriting v 1 as w 1 :~

~x + BG 2 v 2 _--

"w1 +

0

y = C(,T_,1

=Cx

Steo 4 - Go to step 1 and resume the procedure with T.1 instead of ,T-,0 Theorem 4.1 9 Let .T_,0 and a k-partition of the output be given. Then, if p = ,T-,i~kPi, "D.A." "converges" after a finite number of "loops", say N, towards an extended ,T-,Nfor which SSFBDP is solvable. Proof - The central idea of the proof amounts to show that dim n i c k c~i* (,T_,l) < dim ni~kClYi*(,T_,l+l), where I denotes the I-th loop of "D.A.". Indeed, if this inequality is true, after a finite number of steps say N, we shall fulfil clP*(T.N) = ~i~ k_ ClPi (,T-,N), and lemma 3.1 does the rest. To show the above inequality it is sufficient to perform the ;roof for I = 0. This is done in Appendix 1. [3 We shall now pay attention to the structural properties of TN, and mainly to its infinite zero structure. For this purpose, consider the infinite zero structure of the

292 subsystem ( Ci(T.N), A(,T_.N) + B(T.N)Fi(T.N), B(T.N)Gi(.T-,N)), where Fi(T.N) is any friend of ..9~i*(.T_,N)and Gi(,T_,N) satisfies Im B(,T_,N)Gi(,T_,N)= ~(T.N)r~,-q:~,i*(T.N). We shall note it simply as ~ (Ci(,~,N), ..%i*(T.N)). It can be computed using either (2.1) or (2.2). We shall note it { 5i1~ (,T-,N)} or equivalently {13iv(,T_,N)} with 13iv(,T_,N)= card { 5il~(,T_,N) > v}. The infinite zero structure of ,T-,Nwill be noted 0 i ~k (Ci(7-N), "-'%i*(,T-,N)) and, with the above notation , is defined by {T.i~kSit~(T.N)} or equivalently , from the above counting procedure , by 0 i ~k{ 13iv(,T--N)}" Lemma 4.1: Assume that p = T.ie kp i. Let N denotes the final loop of "D.A.". Then the infinite zero structure of ,T-,Nis not different from ui ~k (Ci('7--'0), "9~i*('T--0))' Proof: since .T-,N is decouplable using regular static state feedback, it follows that [4] ,T_,~(7.N) = Oi ~k,T_,~ (Ci (T.N), ,-gLi*(,T_,N)) Assume that C i ,.9:~i*P.(,~,0)=C i (T.N) ..9~.i*#(~N)

(4.1)

From (4.1) one can deduce that SIP'+1 (,~-.0) = 5il~+1 (,T-,N)

p2.0

which states the result. The proof of (4.1) is given in Appendix 2.F] Definition 4.2117]_ . [18]: The infinite zero structure ~

(C i , --%i*(.T_.0)) is said to be the

essential structure of the output Yi, i~k. It is written as ,T_,e(Ci,A,B).

The essential

structure of (C, A , B), relatively to the given output k-partition, is defined as ui~ k (Te(Ci,A,B)). The terminology of "essential orders" comes from the following property. ProPosition 4.1 [17]. [18]: Let ~

and a k-partition of the output be given. Assume that

P = .T-,iekPi. Then if SSFBDP is solvable for some feedback u = Fx + ~iek Gi vi then .T_~o(Ci,A+BF,BGi) >_T.e(Ci,A,B )

REFeReNCES

[1] Commault C., Descusse J., Dion J.M., Lafay J.F., Malabre M. " Influence de la structure & I'infini des syst~mes lin~aires sur la solution de probl~mes de commande. APII, vol. 20, pp. 207-252, 1986

293 [ 2]

Commault C., Descusse J., Dion, J.M., Lafay J.F., Malabre M. : About new decoupling invariants : the essentiel orders. Int. Journ. of Contr., vol. 44, 3, pp. 689-700, 1986

[ 3]

Descusse J. : Sur la structure & rinfini des syst~mes lineaires d~couplables : le cas des syst~mes inversibles & droite. Outils et ModUles Math~matiques pour I'Automatique, I'Analyse des Syst~mes et le Traitement du Signal, vol. 3, Editor I.D. Landau. Ed CNRS, Paris 1983

[ 4] Descusse J., Lafay JoF., Malabre M. : On the structure at infinity of linear block decouplable systems: the general case. IEEE Trans. on Automat. Contr., AC-280 12, pp. 1115-1118, 1983 [ 5]

Descusse J., Lafay J.F., Kucera V. : Decoupling by restricted static state feedback: the general case. IEEE Trans. on Automat. Contr., AC-29, 1, pp. 79-81, 1984

[ 6]

Descusse J. : A new approach for solving the linear decoupling problems. Proc. of the 10th World Congress on Automatic Control, IFAC, July 27-31, 1987, Munich, FRG

[7]

Descusse J., Lafay J.F., Malabre M. : A survey on Morgan's Problem. Proc. of the 25th IEEE Conf. on Decision and Control, Dec 10-12, 1986, Athens, Greece ; Solution to Morgan's Problem. IEEE Trans. on Automat. Contr., AC 33, 8, Aug. , pp. 732-739, 1988

[ 8] Descusse J. : Towards a dynamic solution to nonlinear block decoupling . Preprints of the International Conference "Automatique Non Lin~aire" of C.N.R.S. , June 13-17 , Nantes France , 1988; "New Trends in Nonlinear Control Theory", LNCIS, vol. 122, Springer Verlag.

[ 9] Falb P.L., Wolovich W.A. : Decoupling in the design and synthesis of multivariable control systems. IEEE Trans. on Automat. Contr., AC-12, 6, pp.651-669, 1967 [10] Hautus M.L.J., Heymann M. : New results in linear decoupling. Proc. of the 4th INRIA Conf. on Analysis and Optimization of systems. Springer Verlag, vol 44, 1980 [11] Kamiyama S., Furuta K. : Decoupling by restricted state feedback. IEEE Trans. on Automat. Contr., AC 21, pp. 413-415, 1976 [12] Morgan B.S. :The Synthesis of linear multivariable systems by state feedback, JACC 64, pp. 468-472, 1964 [13] Morse A.S., Wonham W.M. : Status of non interacting Control. IEEE Trans. on Automat. Contr., AC-16,6, pp. 568-581, 1971 [14] Wang S.H. : Design of precompensator for decoupling problem. Electronics letters, 6, pp. 739-741, 1970 [15] Wonham W.M. : Linear multivariable control : a geometric approach, 2nd edition, Appl. of Mathematics, Springer Verlag, vol. 10, 1979

294 [16] Commault C., Dion J.M., Torres J.:lnvariant spaces of linear systems. Application to block decoupiing. Proceedings of the 27-th IEEE CDC , Austin , USA ,Dec 1988.

[17] Commault C., Descusse J., Dion J.M., Torres J.: Block decoupling invariants. Geometric and transfer matrix characterization.-IFAC Workshop on Systems Structure and Control; 25-27 september, Pague, Tch~coslovaquie.

[18] Descusse J.:Block noninteracting control with (non)regular static state feedback. A complete solution. -IFAC Workshop on Systems Structure and Control; 25-27 september, Pague, Tch~coslovaquie.

Vl>0"

"dim niek cF i (7-1+1)< dim nie k cF i (ZI) ,

It is enough to perform the proof for I = 0. To proceed we need several auxiliary lemmas. Let P 9 ,~, x ~'a ~ ~ x '~'a the projection into ~, along "~'a, where ~. x '~'a is related to T.1 , then t

*

P c[Pi (7-1) = cFi (,T-,0)

i ek

The proof of this lemma is not difficult to perform and we shall omit it. For 7.1 = (Ae, Be, Ce) above defined, we have 9 niek flPi*(Z1) = P n i e k c/Pi*(Z1) Proof 9 Start from Ae n i e k flPi*(7.1) c flYi*(,T_,l)+ ~e c nie k (Vi*(T.1) + ~e)

x[-4] :,,,n[ {[Xo] l+

,o,1,,Jll

where sp{(Xli/UlI)} =flYi*(Z1). From the respective definitions of B 1 ( $1 = 7.iek ~ *

n nj~ i clYj ) and q'Pi (7.0) = sp {(Xli)}, by lemma A1 , we have also, using a repeated action of the modular distributivity rule "

Ix4

A e n ~. (Y. 1) c n sp { iek

I

iek

} + Sa

(A. 1 )

295 Now assume that r~ie k fiYi (,7-,1) = P n i e k c~Yi*(-T-,1)-This means that there exist

onto~.generat~of niek_ctYi*(Y4)written as [~I" From (A.1) we then obtain, by projection B 2 U e hie k 1,f'i*(~O) + @1

c $ r~c~iekqYi*(.~.,0)+ $1 = $1 which is impossible by the respective definitions of ~1 and ~ 2 13

Lemma A.:~ " Assume that SSFBDP is not solvable for ,T_,0. Then, for ,7_,1 = (C e, A e, Be) above defined, we have 9 dim hie k cFi*(,T_.,1)< dim nie_k (s

PrQQf " From the preceeding lemma we know that the generators of hie k c~Yi*(,T_,l)are written asIXI 1, where 0 ~ X I e hie k cFi*(.T_.0).

LU j So

dim nie_k ~qYi*(T.1) < dim hie k cV'i*(,7_,0)

(A.2)

Assume that equality holds. Then, from (A.1), one obtains A X I + B 2 U 1e hie k cFi*(T.0) + $1 which leads to A hie k flYi*(T.0) c rhiek cFi*(T_.0) + 9 The latter implies that chic k clYi*(,T_.0) is (A,B) - invariant, and so decoupling would be achievable for TO, which is impossible by assumption. Thus, (A.2) holds as a strict inequality. [] APPENDIX 2 "PI+I "-q~'i# (~1+1) = "-%ip" (E,I)

~' I> 0,

I~> 0,

e k"

It is enough to show that this equality holds for I = 0. The rest will follow by transitivity.

i) "P .~i I~ (,7_.1)c ..,%i# (-~,0)" The proof is performed by induction. This inclusion is true for I~ = 0. Assume that it is also true for I~ - 1. Then, from CSA, "-q~'p" i (TI) = ~'i* (,7-,1) n (A e ..%i1~'1 (,7-,1) + ~e)

296 By projection on to ~. along '~'a, we have 9 P --,~ip" (7-,1) = P (~i* (7-,1) c~ (A e ,~q,iIx-1 (,~,1) + Se))

= P ~'i* (7-,1) c~ P (A e ..~i #-1 (7-,1) + ~e)

1"'3 since Ker P =/D- I [q~l

It follows, from Lemma A.1, that 9 P ~"iix (7-,1) = ~i (T0) c~ (Sp {A X I + B 2 UI} + 51)

= P ~i (7,1) c~ P (A e -~,iix -1 (7_,1 ) + ~e)

sinceKerP;t]

It follows, from Lemma A.1, that : P -..~iP" (.7_,1)= ~i* (T.0) n (Sp {A Xl + B2 UI} + 51) where

(A.3)

sp

fx,]

(A.3)

} ---b21~ ~'1 (21)

From (A.3) we have : P ..~i # (T.1) c ~'i* (-7--0) n (A -.~i p--1 (7_,0) + 5) = ..~i p- (7,0) ii) "..~i p- (,T_,0) c P ...~i I~ (7_,1 )" It is enough to show that V X I e '~iix (7,-,0), there exists U I such that

The proof is still perform by induction. The inclusion is true for Ix = 0 . A s s u m e that it is a l s o true for g-1. Then V R I a v e c t o r of ~ i I x -1 , t h e r e exist X 1 , U I, U 1 such that :

"%iIx ) RI = A X I + B2U I + B1U I e ~di* (7,0) Moreover, from lemma A.1, there always exists W I such that :

RI = Ae

+

I+

Wl

(~1)

By assumption

and so"

R--L] E ~i (~I) ('h (A e,~i -I (~'.I) +~e ) = '-'~Pi(7,1) Wi which ends the proof. 0

Minimal rational interpolation and Prony's method A. C. Antoulas Rice University & E.T.H. Ziifich and

J. C. Willems University of Groningen

Abstract. A new method is proposed for dealing with the rational interpolation problem. It is

based on the reachability of an appropriately defined pair of matrices. This method permits a complete clarification of several issues raised, but not answered, by the so-called Prony method of fitting a linear model to given data.

1. Introduction.

Given the array of scalar pairs points

(xi, Yi), i E N , xi~xj, i ~ j ,

(1)

with finite entries, we seek all rational interpolants, that is rational functions

y(x) = n(x) d(x)'

(2.1)

n, d: coprime,

(2.2)

=y,, i

(2.3)

where

which interpolate the array (1):

N_

Moreover, we wish to keep track of the complexity of interpolants, which is defined to be their McMillan degree:

deg y := max {deg n, deg d}. The following problems arise: (a) Find the admissible degrees of complexity, i.e. those positive integers ~ for which there exist interpolants y(x) with deg y = ~c (b) Given an admissible degree ~r construct all corresponding solutions.

298

The first complete answer to the above questions was provided by Antoulas and Anderson [1986] using the so-called L6wner matrix as the main tool. For further developments, see Antoulas and Anderson [1989], [1990] as well as Anderson and Antoulas [1990]. In the present paper we will discuss a novel approach to answering the above questions. This allows a complete analysis of Prony's method of modeling, which for example, although widely used in digital signal processing is not well understood. Suppose that there exists an interpolant y ( x ) = n(x)/d(x) having complexity ~; let n(x) := no + n l x + . . .

+ n ~ ~, d(x) : = d o + d l x + " ' " + d)cx ~,

with either n): or d K different from zero. The c o e f f i c i e n t v e c t o r s corresponding to n, d

are

defined as:

c~(n) := (n0

nl

...

n~),, c K ( d ) : = ( d o d I . . .

d,)'.

Conditions (2.3) imply that the coefficients ni, d i satisfy the following sets of linear equations: n(xi) - yid(xi) = O, i ~ N.

In order to write the above system of equations in matrix form, the N•

constant matrices I~:

and I ~ which depend just on the data, are defined; their i th row is [I~]i := ( 1 x i . . . x~ r ), [ l ~ j i : = - y i ( 1 x i . . . x ~ ). With the notation introduced, the above system of linear equations can now be expressed as follows:

f~,c,(n) + ft,c,(d) = O,

(3)

while the coprimeness condition n, d: coprime,

(2.2)

must be fulfilled. It readily follows that every interpolant of complexity 1< must satisfy (3), (2.2) and vice versa. The former being a linear constraint it is straightforward to deal with. The difficulty lies in dealing with constraint (2.2), since as already mentioned we are interested in keeping track of interpolant complexity at the same time. A very common approach to rational interpolation is sometimes referred to as Prony's method. It is based on the assumption that N = 2rn+l data pairs are always interpolated by a rational function of complexity m, since such a rational function contains exactly 2m + 1 free parameters. This assumption however is true for g e r e r i c , i.e. randomly generated, data only. In such a case, (I~ra l~m) has size Nx(N+I) with full row rank; therefore, up to a non-zero constant, there are unique c~:(n), c~:(d) which solve equation (3). Moreover, the coprimeness constraint (2.2) is generically always satisfied. Consequently in the generic case, one interpolant (of minimal complexity m) can be computed this way. For n o n - g e n e r i c data however, trying to fit 2m + 1 data pairs with a rational function of complexity m, may or may not give a solution. Even if a solution is obtained

299

this way, questions (a) and (b) formulated above, remain unanswered and no understanding of the coprimeness constraint is achieved. In the sequel we will expand on the approach which is based on equation (3) and constraint (2.2), and show how the system-theoretic concept of reachability allows us to take care of this coprimeness constraint. 2. The main results.

elii

Consider the pair of NxN, Nx2 matrices defined in terms of the data array (1):

X2

q

F :=

G : = ( g t g2): = x/v

Y2

(4)

N

A closer examination of the matrices 1~ and 1~: defined in the previous section, shows that they are partial reachability matrices of the above pair, containing ~ 1c2, can be represented in the f o r m

where

p, q

c~:(n) = Spc~l(Oll ) + Sqc~2(012),

(17.1)

c~(a3 = Spc~1(021) + Sqc~(Oz2)

(i 7.2)

are coprime polynomials of degrees ~c-~ct, ~c-~c2 respectively, and satisfy the

cop rimeness constraint (11).

303

Remarks (i) In view of the above theorem, here is the solution procedure involving equation (3) which is the starting point for Prony's method. First, construct the full reachability matrix R N of (F, G); as already mentioned, this is the same as the matrix (I/N I1N) which enters in (3), up to column permutation (this column permutation is crucial in the sequel). Proceeding from left to right, determine the first column, say FKIg2 of R N which becomes linearly dependent on the pre-

vious ones (i.e. linearly depend

on

FJgi, j < 1'~1,i =

1, 2).

The corresponding vector, after permuta-

tion of its entries, will be [ c~'(011) ]. %(021) J Because of reachability of the pair (F, G) the next linear dependence which is needed is that of

F':2gt, where lc2 : = N - ~cl, on the previous columns of R w. The corresponding vector, after permutation of its entries, will be [%(012)

r 1" The linear dependencies of all intermediate or subsequent columns are of no interest. All solutions to the problem are now parametrized by means of (17.1,2). (ii) The kernel of (ll N I~N) has dimension N + 2. But out of all coefficient vectors in this kernel, only two are needed. Their choice is dictated by the teachability property of the associated pair (F, G). Moreover, these two coefficient vectors are the basis for parametrizing interpolants of complexity below, equal to, and above N. (iii) The matrix (Sp Sq) is very closely related to the Sylvester resultant of p, q. The difference between the two lies in the fact that the Sylvester resultant has more columns. Therefore, because of the coprimeness of p, q, (Sp Sq) in the theorem above, must have full rank. [] 5. An example. Consider the data array containing 4 pairs: (0, 0), (1, 3), (2, 4), (1/2, 2), According to (4):

i0 i [!0] 1

-3

, G:=

2

-4

1/2 It follows that the fuU teachability matrix of the above pair is

-2

304 iI R4 =

]

0 -3

0 0 1 -3

0 1

0 -3

0 1

0 -3

0 1

0 1 -3

-4

2

4

-16

8

-32

16

-64

-8

-2 112 -1 1/4 -1/2 1/8 -1/4 1/16 -1/8

Recall remark (i) of the previous section. The first linear dependence occuring in R 4 is that of the fourth column: Fg2+6Fg 1+ g2=0.

It follows that x:t = 1. Therefore according to the same remark, ~c2 = N - ~cI = 4 - 1 = 3. This means that the next linear dependence is that of the 7 th column on the previous ones. It turns out to be 2F3gl - 9F2gl + 4Fgl - 2Fgl + g2 = O. The corresponding coefficient vectors are c1(011) = (0 6)', cl(O21 ) = (1 1)',

c3(012 ) = ( 0

4 -9

2)',

c3(022 ) = (1 - 1)'.

One column reduced version of O is therefore

e(x)= [ 6x

x(x - 4)(2x - 1) 1.

x+l

-

(2.x-

1)

J

By (i) of the main theorem, since 6x and x + 1 are coprime polynomials, there is a unique minimal interpolant with McMillan degree 1, namely y~in(x) =

6x x+l "

By (i) of the same theorem, there are no interpolants of degree 2. The next family of interpolants has McMillan degree 3. It can be parametrized in terms of the second degree polynomial p

as

follows: y(x) =

x(x - 4)(2x - 1) + (.o2x2 + p l x + po)6X -

(2x - I) + (p2x2 + p ~ x +po)(X+ 1)

The coefficients of p must satisfy the constraints (11), which in this case turn out to be po + 1 s O , 2p2 + 2pi + 2 p o - 1 s O , 4P2 + 2Pl + Po - 1 s O , P2 + 2pl + 4Po ~ O.

By letting p 2 = p i = 0

and p 0 = 2

in the above family of interpolants, we obtain the Lagrange

interpolating polynomial for our data array l(x) = 3 ( 2 x 2 - 9 x + 16).

The next family o f interpolants has McMillan degree 4. It can be parametrized in terms of a third degree polynomial p and a first degree polynomial q , as follows:

305

y(x)=

(p3x3 +p2 x2 + p l x + p0)6x + (qt x + qo)x(x - 4)(2x - 1) (P3X3 +p2 x2 + pl x + po)(X + 1) - (qlx + qo)(2x - 1)

Firstly, p, q must be coprime, i.e. p3q 3 -p2q2oql + p l q o q 2 - p o q 3 ~ O.

Then, constaints (11) must also be satisfied: po + qo ~ O, 2p3 + 2p2 + 2pl + 2po - ql - qo ~ O, 8p3 + 4p2 + 2pl + po - 2ql - qo ~ O, p3 + 2p2 + 4pl + 8po ~ O.

Notice that the first constraint is a non-linear one while the latter four are linear ones, as mentioned in remark (ii) of section 2. 5. Generalizations and concluding remarks. The above results can be generalized considerably. Consider the array consisting of the data

xi, vl, ri, i ~ N_, of size l / l , rixp, rixm respectively, satisfying the constraints x i ~ x j , i~:j, and r a n k V i = r i < p.

The former constraint is the so-called distinct-point constraint. The more general multiple-point interpolation problem involves information about derivatives. As it turns out, the latter constraint can be made without loss of generality. The left tangential or left directional interpolation problem consists in finding all p x m rational matrices Y(x) satisfying viY(xi) = Yi, i ~ N_,

keeping track of their complexity at the same time. The right tangential or right directional interpolation problem, as well as the bi-tangential or bi-directional interpolation problem, can be defined similarly. The procedure presented in the preceeding sections for the scalar interpolation problem, can be used to answer questions (a) and (b) formulated in the introduction, for all of the interpolation problems mentioned above. The complete solution of these problems, using the idea of defining appropriate reachable and observable pairs of matrices, is given in Antoulas, Ball, Kang, and Willems [1990]. In conclusion, the observation that the matrices 1~K and 1~ can be regarded as partial reachability matrices of a reachable pair (F, G), allows us to obtain complete insight into the rational interpolation problem. As a consequence, it becomes clear in what respect Prony's method of data fitting is deficient, and how to modify it so that it yields the complete answer.

306

6. References. A. C. Antoulas [1981], "On canonical forms for linear constant systems", Int. J. Control, 33: 95-122. B. D. O. Anderson and A. C. Antoulas [1990], "Rational interpolation and state variable realization", Linear Algebra and Its Applications, Special Issue on Matrix Problems, in press. A. C. Antoulas and B. D. O. Anderson [1986], "On the scalar rational interpolation problem",

IMA J. of Mathematical Control and Information, 3: 61-88. A. C. Antoulas and B. D. O. Anderson [1989], "On the problem of stable rational interpolation", Linear Algebra and Its Applications, 122-124:301-329. A. C. Antoulas and B. D. O. Anderson [1990], "State-space and polynomial approaches to rational interpolation", Proc. MTNS '89, Birkh/iuser, Boston. A. C. Antoulas, J. A. Ball, L Kang, and J. C. Willems [1990], "On the solution of the minimal rational interpolation problem", Linear Algebra and Its Applications, Special Issue on Matrix Problems, in press. T. Kailath [1980], Linear systems, Prentice Hall.

Discrete Normalized Coprime Factorization Peter M.M. Bongers, Peter S.C. Heuberger Laboratory for Measurement and Control Department of Mechanical Engineering and Marine Technology Delft University of Technology, Mekelweg 2, 2628 CD Delft, The Netherlands

ABSTP~ACT

In this paper a reliable algorithm is developed to perform a normalized coprime factorization of proper discrete time finite dimensional linear time invariant systems. Instead of using the bilinear transform the factorization is calculated directly. The system is allowed to have a singular state-space matrix. It is shown that a modified discrete time Riccati equation plays a crucial role to obtain a state-space realization for the factorization.

I. INTI~ODUCTION

IlL {,he theoretical work of Desoer et al. (1980), Vidyasagar et a1.(1982), Vidyasagar (1984) the benefits of using coprime representations in stability analysis of controlled systems are shown. In the continuous time case Nett et. al. (1984), Meyer and Franklin (1988) and Vidyasagar (1988) derived state--space representations for the normalized coprime factors. Glover and McFarlane (1988), (1989), McFarlane (1988) established the importance of normalized coprime factors in the Hoo controller design. They explicitly solved a continuous time four block Hr control problem by using a normalized coprime representation of the plant. In practical situations a continuous time plant is controlled by a discrete controller using sampling and zero order hold circuits. So in order to design accurate discrete time controllers the controller design procedure has to be done in discrete time. The first step in discrete Hoo control design with normalized coprimc factors is to establish whether or not in discrete time normalized coprime factors exists and can be represented in state--space forms. Chu (1988) gave state-space representations for discrete coprime factors with an inner numerator under the condition that the system has no poles in the origin. Poles in the origin are of major importance since in discrete time very often time--delays have to be incorporated in the system model (appendix 1). In this paper we will show the existence of a normalized coprime factorization of a discrete time plant with possibly poles in the origin.

3O8 II. PB.ELIMINAILIES

Stable multivariable linear systems can be studied by considering them as transfer function matrices having all entries belonging to the ring 7/. Moreover, in many cazes (e.g. convolution operators) the ring 7/is commutative and is an integral domain (i.e. 7/has no divisors of zero). The class of possibly unstable systems are elements of the quotient field F of 7/. Throughout this paper we let (Vidyasagar et.al. 1982, Desoer et.al. 1980): :7: := { a/b [ a e 7/, b E ~ 0 }, a quotient field of 7/ G := a (not necessarily commutative) ring with identity. 7/:= a subring of G which includes identity I := { h ~ 7/I hq e G }, the set of multiplicative units of G 3" := { h e 7/[ h-l e 7/}, the set of multiplicative units of 7/ Note that:

Jczc~c~c7

(1)

In the sequel of this paper we will study real rational finite dimensional discrete time invariant systems. The ring {~ is identified with [RL~ the space of proper real-rational functions with no poles on unit circle with norm H-f]| []J(z)Hoo= sup ~[j(eJ0)]. The subring 7/is identified with U~IIco 00

The procedure to obtain a normalized right coprime factorization for the plant G is to solve the fticcati equation (12 c,d) to obtain Q, calculate F and choose an It. The equivalent for the normalized left coprime factorization is a direct result from theorem 1 and is given in the following corollary. COItOLLAR.Y 1 Given a minimal realization (10): G(z):=D+C(zI-A)'IB:= [ ~ ]

E5r

and define:

then [1q IN] is a normalized left coprime factorization of G(z) iff: a)

K = (APCt+BDt)(I+CPCt+DDt)q

b) c)

RtR = (I+CPCt+DDt)q P - A P A L- BBt+(APCt+BDt)(I+CPCt+DDt)-I(CPAt+DBt) = 0

d) p = pt > 0 Plt00F Let G = i'r

with (IQ,I~) E 2' a nlcf of G, then G t = ~@[-t with (~lt,l'r

a nrcf of G t, so the

realization of [IQ N] follows from theorem 1. cl I~.EMMtK Note that we don't need the assumption that A is invertible, as is the case in Chu (1988). Using proposition 2.1 it is straight forward to show that this assumption is indeed superfluous. This is of major importance since in discrete time very often time-delays are incorporated in the system. In order to solve the normalized coprime factorization for the plant in discrete time by means of standard techniques, the equation (12@ Q - AtQA - CtC + (AtQB + CtD)(BtQB + DtD + I)-I(BtQA + DtC) = 0 can be written as a standard Riccati equation. Define:

A,: [A

E0IJ,

:[0

The standard Riccati equation with AI,Bt,CI,RI,Q1 is : Q I - A I t Q t A I - CttCl + AItQIBI(BltQLBI + Rl)-lGtQiA1 = 0

312 Substituting the definitions of Al etc. gives:

[77]-r-ooo1[o evaluating this equation gives (12c). Which means that using standard techniques the discrete time normalized coprime factorization problem can be solved. Note that the sufficient conditions for the existence of a positive solution of the Riccati equation are still valid.

IV. CONCLUSIONS

Theorem 1 and Corollary 1 show that with standard mathematical tools the normalized coprime factorization can be calculated, which is necessary to design discrete time controllers,that satisfy H~o robustness bounds. Since in practical applications one will in general be dealing with a discrete time problem, this is an important step towards the solution of the Hoe control problem in discrete time.

V. LITEKATUgs

Anderson, B.D.O., J.B. Moore (1979), Optimal Filtering, Prentice Hall, Inc., Englewood Cliffs, N.J. Chu, C.C. (1988), On discrete inner-outer and spectral factorizations,Proc. Amer. Control. Conf., Atlanta, Georgia Desoer, C.A., R-W. Liu, J. Murray, It. Seaks (1980), Feedback System Design: The Fractional Representation Approach to Analysis and Synthesis, IEEE Trans. Automat. Contr., AC---25, 399---412 Glover, K., D. McFarlane (1988), Robust stabilization of normalized coprime factors: an explicit Hoe solution, Prec. Amer. Control. Conf., Atlanta, Georgia Glover, K., D. McFarlane (1989), Robust stabilization of normalized coprime factor plant descriptions with Ho0---boundeduncertainty, IEEE Trans. Automat. Contr., AC-33, 821-830. Heuberger, P.S.C (1990), PhD Thesis (to appear), Delft University of Technology, The Netherlands. Huang, Q., R. Liu (1987), A necessary and sufficient condition for stability of a perturbed system, IEEE Trans. Automat. Contr., AC---32, 337-340. McFarlane, D. (1988), Robust controller design using normalized coprime factor plant descriptions, PhD. thesis, University of Cambridge. Meijer, D.G., G.F. Franklin (1988), A connection between normalized coprime factorization and linear quadratic optimal regulator theory, IEEE Trans. Automat. Contr., AC--32, 1041-1047

313

Nett, C.N., C.A. Jacobson, M.J. Balas (1984), A connection between state---space and doubly coprime fractional representations, IEEE Trans. Automat. Contr., AC--29, 831---832. Rosenbrock, H.H. (1970), State-space and mullivariable Theory, Thomas Nelson and sons Ltd. London, Great Brittain. Vidyasagar, M. (1984), The Graph Metric for Unstable Plants and Robustness Estimates for Feedback Stability, IEEE Trans. Automat. Contr., AC--29, 403--418. Vidyasagar, M. (1988), Normalized coprime factorizations for nonstrictly proper systems, IEEE Trans. Automat. Contr., AC,---33, 300-301. Vidyasagar, M., H.Schneider, B.A. Francis (1982), Algebraic and Topological Aspects oI" Feedback Stabilization, IEEE Trans. Automat. Contr., AC-27, 880---894.

APPERDIX 1 In this appendix we show the existence of a singular state--matrix due to time-delays. Let the discretized plant be given by: Xk§ ---- A x k 4- Bvk Yk = Cx k 4- Dvk

In general the discrete time controller will have one time---step delay between the measured outputs and computed plant inputs. Using discrete time control design algorithms the time-delay has to be incorporated in the plant description. Let the time-delay be described by: Zk§ = Uk Vk ---- Zk

Then the augmented plant can be written as:

[x< -Zk*lJ

[::] +

Yk = [C D] Zk Hence the augmented state matrix is singular.

A FORWARD FOR

ACCESSIBILITY

NONLINEAR

DISCRETE

ALGORITHM TIME

SYSTEMS

J.P.BARBOT LABORATOIREDES SIGNAUX ET SYSTEMES CNRS-ESE PLATEAU DE MOULON, 91190 GIF SUR YVETTE, FRANCE.

ABSTRACT In this paper we present a SAMDI algorithm (Symbolic Algebraic Manipulation for Discrete time system), which permits us to know whether a system is accessible for nonlinear discrete time. This algorithm uses the symbolic language REDUCE. In our algorithm we use a set of Gij vector field, which allows us to work with the same geometrical tools as for the continuous time system. Therefore, the SAMDI algorithm uses the same procedures of the SAM (Symbolic Algebraic Manipulation) programme, which was realized for continuous time systems.

INTRODUCTION The local controllability of a linear analytic system is based on the concept of infinitely small displacements in the neighbourhood of a state point. This displacement is generated during an infinitely short period o f time, which is impossible to realize for discrete time systems, because there the sampling period is fixed. In order to create this infinitely small displacement in the neighbourhood of a state point, M. Fliess and D. Normand-Cyrot [1] have chosen a very short input variation. This new concept allowed us to introduce the notion of weak accessibility in discrete time [2], [3], which is an equivalent relation on the state manifold and allows us to use of the orbit's theorem [4]. Thus we obtain a criterion of weak accessibility in terms of the tangent state space at a point of the Lie algebra. This is generated by a set of Gij vector field, introduced by S. Monaco and D. Normand-Cyrot [5]. We may consider this vector field as a derivation of the input of a causal flow (which is the difference equation of the discrete time system) and a non-causal flow (which is the inverse of the difference equation of the discrete time system), or vice versa. This

315

composition of functions draws all the field vectors back to the initial point. For reasons of computing and because of its more physical character [6], we use in this algorithm the concept of "forward accessibility" [7], which is a restriction to the accessibility in the causal, followed by the non-causal, displacements. From the above-mentioned works, we will introduce below a SAMDI (Symbolic Algebraic Manipulation for Discrete time system) algorithm of forward accessiblity for the discrete time system. This programme has been realized with the symbolic language REDUCE [8] and also using some procedures of SAM (Symbolic Algebraic Manipulation) [9] (for continuous time systems). In order to simplify our calculations the SAMDI algorithm has only been realized for discrete time systems with a single input. Following theory is only used to introduce the algorithms easily and the reader can turn to the references to get a more extensive account on them.

Theory

on the Gid vector

fields

Consider a linear analytic discrete time system Z: X(k+l) A__f(X(k)) +g(X(k)) U(k) =Ah(X(k),U(k)) where X e R n, U e R and f, g, h are the vectorial analytic functions of the appropriate dimension. Definition 1 is the set of inputs U, such that h(.,U) is inversive for all X. Remark In this paper we consider the set of ~ , which is not empty and where all the inputs U belong to f2. We have to remind the fact that, if in a point X, we apply a composed function with its inverse, we remain in that same point. For the analytic discrete time system S. Monaco and D. Normand-Cyrot [5] introduced the following set of vector fields:

G(U) I XO = ~ h'l("U)'h("U+e)oe I XO, e = 0

316

We can interprete these vector fields graphically in the following figure:

/

( "

7,0 /'

Figure 1

\ xs" "

Prooosition 1 The differential operator G(U) applied to function h is equal to the Lie derivative of this function h on the vector G(U).Id (see [5]). From the G(U) vector fields we deduce of new set of vector fields Gij (where i e N andj Z). Thus

Gi,01x 0 =

~G(U) ~ U i [U--0,x0

with G0,0Ix 0 = G(U)Iu=0, x 0 In the same way we find

G0,j [X 0 =

f-J h(.,u)-l.h(.,U+e).fJ [ U=0, ~0,X 0 ~ e

and Gij Ix 0 = ~i+l f-j h(.,U)-l.h(.,U+e).fj [O=0,e--0, X0 ~UiOe

Remark The fJ function is equal to the fJth function of f and f0 is equal to the identity function.

317

FORWARD ACCESSIBILITY As a result of the research on weak accessibility of discrete time systems [1], [2], and [3], and on the notion of action [11], B.Jakubczyk and E.D.Sontag [7] introduced the notion of forward accessibility. Below you will find the definition of a linear system with discrete time system 2; with a scalar input. We assume that the elements of input A are open in ~ and zero inputs in element A. We first define A+(Xo ) as a forward accessible set of X0. A+(X0)= { X 9 R n / 3 k 9 N , ~ / i 9 k , 3Ui+,Ui - 9 Asuchthat X = h(.,U0" )'l....h(.,Uk')-l.h(.,Uk+)....h(.,U0 +) Ix 0 } From A+(X0) it is possible to define the forward accessibility. Dgfinition 2 A system is said to be forward accessible if, and only if, there exists V(X0) in the neighbourhood o f X 0 such that A+(X0) n V(X0) = V(X0). From this definition and the field vectors Gij [5], B.Jakubczyk et E.D.Sontag formulated the following theorem: Theorem 1 [7] The discrete time system 2; is forward accessible in X0 if, and only if, the vector fields generated in X 0 by the smallest Lie algebra F+(~) contains all the elements of Gij where id 9 N, is the dimension of n. R~mark The forward accessibility in X0 (r+(X0)) is the dimension of the general field vector generated in X 0 by F+(Y.). We find the proof of this theorem in [7] or in [6] with the mathematical tools introduced in [2]. The set of vector fields used in the above theorem is the only difference with the one given by D. Normand-eyrot [2]. TheQr~m 2 [21 The discrete time system s is weakly accessible at X 0 if, and only if, the vector fields generated in X 0 by the smallest Lie algebra F(Y.), which contains all the elements of Gij with i e N andj 9 Z.

318

Remarks a. F+(Y.) is included in F(~) and therefore forward accessibility means weak accessiblity. b. In a recent paper of A. Mokkadem [11], he introduces the concept of quasiaccessibility (on a Zariski topology) and studies the links of these concepts of accessibility. Thus, for some systems the results of our algorithm can be applied to the quasi-accessibility. COMPUTATION OF THE Gkj.VECTOR FIELDS As shown in [5], the Gi,j, with i, j e N, are easy to compute. Thus, like before, the vector fields G0,0 can be defined as follows

G0,01 X0 =

0 FG(.,0)-I.FG(.,e) de I X0,~0

which is equal to GO,O IX0 = j(f-1) I f(X0) -g [ x 0 where j(f-l) is the Jacobian matrix of function f-1. As a result of the following equation 3f-l.f

I X0= j(f-1) I rcx0) J(0 t x0 = Id

where Id is the matrix identity, we get: IS0,01 x 0 = J(f)-I ] x0.g [ X0l where the vector fields Gi,0 are equal to:

Gi,01 x 0 =

~i[J(f)+J(~).U]-I 0U i I X0,U=0 .g Ix 0

The properties of the derivatives of the matrix give us the following equation: [Gi,0 ]X 0 = (-1) i i! ([J(f)'l.j(g)]iJ(f)-l.g) I X0]

In the same way, the vector fields Gk,1 are equal to:

Gk,l IX 0 =

0k+l f-1 FG(.,U)-I.FG(.,U+e).fl 0 Uk 0 e [U--O'~O'Xo

319 which gives Gk,1 IX 0 = [j(f)]-I

1x0.Gk,01 r(x0) ]

Finally, from their recurrent property, we obtain: [Gk,j+l ] X0 =

[J(f)]-I I x0.Gkd l r(x0) ]

INTRODUCHTON OF THE "SAMDI" ALGORITHM AND CRITERIA TO STOP As shown above, the vector fields Gi,j are products of the Jacobian matrix, which are easily computable in the symbolic language REDUCE [8]. As an example we given the following little programme: FOR I:=1 : n DO BEGIN FOR J:=l : n DO BEGIN JF(I,J):=DF(F(I),X (J)); END; END; where DF(F(I),X(J)) is the symbolic function of the derivation of the first component of the vectorial function F, compared to the variable X(j). Note that the inverse of the matrices and their product are basic functions of the REDUCE language. Moreover, the calculation of the Lie brackets and the dimension of the vectorial space generated by a certain number of vectors are given in [9]. Therefore we limit ourselves to the following introduction of some stopping criteria. In this algorithm the first step must be to calculate the dimension of the space generated by the vector fields Gi,0 and X0 with i E N, for a finite number of vector fields. Therefore, as a result of equation (2) and the Cayley-Hamilton theorem, we find the following stopping criterion: CRITERION 1 We stop the calculation of Gj,0 when rank [Gj,0....G0,0] I x 0 = rank [Gj-I.0..... G0,0] ix0. The second step consists of calculating the dimension of the vector fields generated in X0 by the smallest Lie algebra 1-'+(~), which contains all the elements of Gi,j. For this we use the property of involution, see [12] and [9].

320 CRITERION 2 Consider a set of 13vector fields. If ~ Lf a 13and "~ Lg ~ 13we have [Lf,Lg] Ix ~ EB(X) where EB(X) is the vectorial space generated by the vector fields 1] en X. From this we can conclude that the dimension of EB(X) is equal to the dimension of space vector X, by means of the smallest Lie algebra containing 13. The following step is to calculate the vectorial space of the Gk,l where k and 1 are the integers. In order to stop this calculation we use criterion 1 for exponent k and the Cayley-Hamilton theorem for exponent I. However, in equation (4), the matrices are function of the exponents. For this, we need to assume that X0 is not a unique point. Also, the rank of a Jacobian matrix is a function for which there exists an open set of 7 state points, such that x ~ 7, and that the rank is maximal. Therefore we can introduce the following criterion: CRITERION 3 We stop the calculation of Gj,p as soon as rank [Gj,p+l ...... Gj,0] Ix 0 = rank [Gj,p ...... Gj,0] I X0.

The last criterion is evident, but we should not omit to use it in a programme: CRITERIQN 4 Whatever the vectorial space Eli generated by vector fields of n dimension, the dimension of El3 is smaller or equal to n. With the help of these criteria we can realize a SAMDI algorithm, represented in Figure 2. EXAMPLES The following examples are arbitrarily chosen to illustrate some properties of forward accessibility. Examole 1 Consider the following discrete time linear system: X(k+l) = A X(k) + B U(k) where X e R n, U ~ R and A, B are the appropriate dimensions of the matrices.

321

The forward accessibility is different from the usual discrete time commandability [12], by the fact that matrix A must be regular (see [6] for a comment on this subject). If matrix A is regular, all the inversive inputs are equal to R~ The forward accessiblity rank r + is then r+= rank [B,AB ........An'IB]

Consider the following linear analytic system Y= X1 (k) 3+X2(k) X(k+l) = FG(X(k),U(k)) - ~.X ] ( k + l ) = - [X2(k+l) = X2(k)+U(k) This system is inversive on levels XI>0 and XI0

G0,t = (9 X12 (X13+X2)2) "l (-3(XI3+X2)2-1, 9 X12 (X13+X2)2) T

The system is forward accessible, but we find a set of singularities for (X13+X2) 2 equal to zero, which correspond to Xl(k+l) equal to zero. Note that all the singularities found for this system are a result of the too restrictive definition of forward accessibility. Indeed, whatever the values of X(0), with the help of U(0) = - (X13+X2)3-X2 and U(1) = (X13+X2) 3 we find that state X(2) is equal to (0, 0)T. In the same way, from (0, 0) it is possible to go to any point (XI, X2) with the help of control law U(0) = X1 and U(1) = X2-X1.

CONCLUSION The above presented algorithm of forward accessibility is a symbolic tool which is useful when we study a linear analytic discrete time system with scalar input. Indeed, if the system is not forward accessible in an open set which is not empty, it is of no use to look for control law. However, if the system is forward accessible, it is not always possible to control all the states. We also have to limit ourselves to inversive systems, but this restriction does not hold for sampling systems [ 14]. Finally also note that the extension of SAMDI algorithms to linear analytic discretetime systems with multi-input only give some computational problems (sufficient memory, time of calculation, increase of the number of vector fiels index in order to take into account eachinput .... ).

322

ACKNOWLEDGEMENT I especially wish to thank M rs Doroth~e Normand-Cyrot and M r Salvatore Monaco for their precious help and suggestions. Without their publications [1], [2], [3] and [5], this article could not have been written. The research has been made possible by a grant from the European M.R.E.S.

,

t

GT,0 C,)HPIJTI~T I OH TR I RI IGU LP,~ 1Z,~T 1 C+

X ) ' ~ . 'i.~-

,+,,'+'O_.~F:~TE~XO~

I ~ '..t.o

-.,,

,p-

[

~/~-.

vm

'v~

TIlE

,f.

-,II--C}-~;:R I T E R I

fK' ~RITEF.~ON~ #

-'-...5/

C~r

.o ~".... I L

I,

-

INIjOLUT i C,N R~-IC'l T ~ IRNGULRR Z Z ~ T I O N

"NO

~

TIIE SYSTEM IS F011NAKD ACCESSIBLE

SVSTEH ISII'T 1

FOnHanD A C C E S S [riLE

Figure 2

323

REFERENCES

(1)

M.Fliess, D.Normand.Cyrot (1981) " A Lie-theoretic approach to nonlinear discrete-time controllability via Ritt's formal differential groups" Systems and Control Letters, Vol 1, N~ pp 179-183

(2)

D.Normand.Cyrot (1983) " Throrie et pratique des syst~mes non linraires en temps discret" Thesis of Doctorat d'Etat, University of Paris Sud, Orsay

(3)

B.Jakubczyk, D.Normand.Cyrot (1983) " Orbites de pseudo groupes de difffomorphismes et commandabilit6 des syst~mes non linraires en temps discret" C.R. Acad. Sciences Pads

(4)

H.J.Sussmann (1973) " Orbits of families of vector fields and integrability of distributions "Transactions of the American Mathematical Society Vol 180 pp 172-188

(5)

S.Monaco, D.Normand.Cyrot (1984) "Developpements fonctionnels pour les syst~mes non lineaires en temps discret" Report of I.A.S.I, C.N.R Italien (in French) and "Functional expansions for nonlinear discrete time Systems" Math. System Theory 21 pp 235-254 (1989).

(6)

J.P.Barbot (1989) " MEthodes de calcul appliqudes aux syst~mes non linEaires sous 6chantillonnage" Thesis of Docteur en Science, University of Paris Sud, Orsay

(7)

B.Jakubczyk, E.D.Sontag (1988) " Controllability of nonlinear discete time systems: A Lie-algebraic approach" Proposed to S.I.A.M J. Control and Optimization

(8)

A.C.Hearn (1984) " R.E.D.U.C.E User's manual " The Rand Corporation, Santa Monica USA

(9)

R.Marino, G.Cesaro (1982) " Nonlinear control theory and Symbolic Algebraic Manipulation " Ed Springer-Verlag Proc. Beer-Sheva Israel

(10) E.D Sontag (1986) "Orbit theorems and sampling " M.Fliess and M. Hazewinkel Eds Algebraic and Geometric Methods in nonlinear control theory pp 441-483 D.Reidel Publishing Company (11) A.Mokkadem (1989) "Orbites de semi groupe de morphismes rrguliers et systrmes non linfaires en temps discret" to appear in Forum Mathrmafique. (12) A.Isidori (1989) " Nonlinear Control Systems : An introduction " Communications and control engineering series (2nd Edition) Springer-Verlag (13) R.E.Kalman (1960) " On the general theory of control systems" Proc. First International Congress on Automatic Control, Moscou. London, Butherworth 61, Vol 1 pp 481-492 (14) S.Monaco.D.Normand.Cyrot (1985) "On the sampling of a linear analytic control system" 24 th Conf. Decision and Control Ft Lauderdale pp 1457-1462.

S t a b i l i s a t i o n globale de s y s t ~ m e s n o n - l i n ~ a i r e s p a r un contr61e positif. Jean-Luc Gouz4 INRIA Sophia-Antipolis 06565 Valbonne Cedex, France A b s t r a c t : We give sufficient conditions for global stabilization in a fixed domain around an equilibrium of a class of controlled systems of non-linear ordinaa-y differential equations in n-space, that can be expressed as the difference of a quasi-monotone increasing and a monotone increasing system. These systems are usual in biological modelling. We consider the cases when the control is scalar, or scalar and positive. We apply the theory to LotkaVolterra systems. R 4 s u m 4 : Nous donnons des conditions suffisantes de stabilisation globale dans une r4gion fix4e par un contr61e autour de l'4quilibre pour une classe de syst~mes diff6rentiels nonlin6aires en dimension n qui sont diff6rence d'un syst~me quasi-monotone croissant et d'un syst~me monotone croissant, fr4quents dans la mod41isation biologique. On consid~re le cas d'un contr61e scMaire et d'un contr61e positif scMaire. On prend pour exempte les modules de Lotka-Volterra.

I

Introduction

Nous nous int4ressons dans cet article au probl~me de la stabilisation globale vers l'4quilibre, dans une r4gion fix~e born4e de l'espace, d'une classe de syst~mes diff~rentiels non-lin4aires, par un contrSle en boucle ferm4e. L'approche la plus usuelle, et sans doute la plus g4n4rale, clans ce type de probl~me, consiste k utiliser des fonctions de Liapunov (voir [8]); mais il est souvent difficile, quand le syst~me est sp4cifi4, d'exhiber une telle fonction. D'autres approches existent, qui utilisent par exempte la th6orie de la vari4t4 centrale ([1]), mais elles sont surtout locales. Nous utiliserons ici une autre approche globale, ne faisant pas appel aux fonctions de Liapunov, mais nous restreindrons la classe des syst~mes non-lin4aires 4tudi4e. Ces syst~mes soar issus, dans notre cas, de probl~mes de mod41isation biologique, par exemple en dynamique des populations. I1 est alors normal que les signes de certaines grandeurs jouent un rSle important, car ils peuvent s'interpr4ter biologiquement (voir par exemple [7] et [2]). Nous consid4rons donc la classe de systbmes non-lin4aires n-dimensionnels: ~ = f(x)

- g(x)

(1)

avec les conditions suivantes vMables pour x dans un ouvert convexe W C R~: ~Ofi , (1:) >_0 pouri # j e t

( x ) > _ O ( i , j = l . . . . ,n)

(2)

Les fonctions f e t g sont suppos4es C 1. On dit que f e s t quasi-monotone croissante et que g est monotone croissante si elles v4rifient (2)(volt [13]). S i g -- 0, o11 dit aussi clue le syst~me

325 (1) est coop~ratif (voir [4]). Des conditions de ce type i n t e r v i e n n e n t souvent dans les syst~mes difffirentiels modfilisant des ph~nombnes biologiques ([7]), car, par exemple, le signe de ~ ( x ) traduit si la variation de l'esp~ce j influence positivement ou nfigativement 13 variation de l'esp~ce i. Remarquons que cette classe de syst~mes est d~jk tr~s vaste ([3]). Un exemple i m p o r t a n t est c e h i des modNes de Lotka-Volterra (voir [12,11]) qui s'fierivent:

,i:i = xi(bi + f i cqx~)

i =

1 ....

, n

(3)

j=l

Les syst~mes coop~ratifs et comp~titifs (en inversant le temps) ont ~t~ beaucoup ~tudi~ m a t h ~ m a t i q u e m e n t (voir [5,9]). Nous avons utilis~ ces r~sultats pour obtenir u n critbre surfisant de convergence globale pour le syst~me (1) vers un ~quilibre suppos~ unique ([3]). Nous appliquons ici ce critSre k u n probl~me de stabilisation globale dans une r~gion fix~e de ce systSme par u n contr61e scalaire et par un contr61e scalaire positif (la positivit~ dn contrSle a yant souvent un sells dans les probl~mes biologiques). Plus pr~cis&nent, nous cherchons k stabiliser (en boucle ferrule ou par des contr61es constants) le syst~me : 2 = f ( x ) -- 9(x) - ud(x)

(4)

o~ u est un scalaire et d une application que nous prendrons d'abord trbs simple pour l'~tendre ensuite. Dans la suite, quand nous parlerons de stabilisation globale, cell voudr3 dire que nous savons exhiber une r~gion born4e conten~m dans W, c o n t e n a n t un seul point fixe a s y m p t o t i q u e m e n t stable dont le bassin d ' a t t r a c t i o n contient la r4gion. Nous l y o n s limit~ notre exposition au cas u scalaire, mais on peut facilement ~tendre l' dtude au cas multivariable (volt la premiere remarque de la section 6). Dans la section 2, nous citons le crit~re de convergence globale vers l'~quilibre, que nous appliquons au probl~me de stabilisation dans la section 3, avec u n contr61e positif en section 4. Nous terminons en section 5 par une application k la stabilisation du module de LotkaVolterra et quelques remarques et interpr4tations biologiques. Notations: Pour x E R ~, on ~crit x >__ 0 si xi >_ 0 i = 1 . . . . , n et x > 0 si xi > 0 i = 1 , . . . , n . On note R~_ = {x E R"; x >_ 0} (c'est l ' o r t h a n t non-n~gatif fermi) et, si z < y, [z,y] = {z ; x < z < y} (intervalle ferm~ m d i m e n s i o n n e l ) . De m~me, on notera par e ~ le vecteur ( e ~ ' , . . . , e~"), par In x le vecteur (In a'l . . . . . In x,~). Si W C R " est ouvert, h : W ~ R " est C 1, et x E W, on note par Dh(x) 13 matrice jacobienne Ohi/O.rj(x). Soil x : J --+ R ~, o~l J e s t u n intervalle rdel, on note if: la d~rivfie de x pat" rapport an temps t. Si W C R ~ est ouvert, h : IV ~ R " est C 1, pour l'~quation diff~rentielle .i: = h(x) et une condition initiale x(0) = xo dans W, nous noterons .v(t, xo) ou parfois x(t) la solution maximale issue de x0. Enfin, on note (ei), i = 1 . . . . . n la base canonique de R ". 2

Un

crit&re

de

convergence

globale

Nous citons ici le crit~re de convergence globale en renvoyant ~ [3] pour t'exposition. Soil donc le syst~me (1) v~rifia.nt d,~ns I.V ouvert COllvexe les conditions (2), avec f et g suppos~es

326

C 1. Nous construisons alors un syst~me dans R2~:

{v

= f(v)

-

g(~)

= f(~)

-

g(v)

(.~)

ddfini pour (y, z) E W x W. On a Mors le th6or~me suivant: T h 4 o r ~ m e 2.1 S'il existe (yb,zb) E IV x W tels que :

z b 2. Extension

et exemples

biologiques

On peut 6tendre les r6sultats des sections 3 et 4 au cas de plusieurs contr61es E uiei. Si l'on a k contr61es, on supposera que les (n - h) in6galit6s restantes des th6or6mes 3.1 et 4.1 sont v6rifi6es, et l'on choisira ui fonction de xi. On peut 6tendre les r6sultats de la section 5 au cas o{t d(x) est une application diagonale (Dd(z) est une matrice diagonale) n6gative pour x _< x* et positive pour a" >_ x*, et donc S~allnulant en x ' . On va prendre comme exemp!e les modNes de Lotka-Volterra (3). On suppose que le systhme a un 4quilibre unique x* > 0 (voir ([11]), et, apr~s le changement de variable = In x, on obtient:

= ~ + ce~

(10)

On pose C = F - G, off F est une matrice & 416ments en dehors de la diagonale positifs et G une matrice positive. On suppose que l'on peut exercer un contrSle sur la premi&re composante, le syst6me boucl6 s'6crit donc: = F(d - d') - a(e ~ - d') - u(~)q

(11)

et le syst6me associ4 (7) s'6crit:

~

u

,,(z)e~ )

(12)

avec A=

-G

F

et

V = e z - e ~"

(13)

Nous allons choisir u(~) = k(e ~* - e~;), k constante positive; de plus, nous supposerons d bijective (volt ([3])); le syst~me (12)s'6crit alors:

~"

U

off le d6ternfinant de Al(k) est u n polyn6me du deuxi~me degr6 en k; si les hypotheses sur les in6galit6s du th6or6me 3.1 sont remplies, on pourra donc de plus choisir k pour

330

que Al(k) soit bijectif, donc que le syst~me (14) ait (~*,~') comme unique @quilibre. On pourra donc appliquer le th@or~me. Donnons deux exemples num@riques, le premier avec un contr61e u de signe variable, le deuxi~me avec un contr61e de signe fix@ : - Le syst~me (10) avec C =

4 -5

et b =

1

peut @ire stabilisfi vers son @quilibre instable (0, 0) avec un contr61e sur la premiere composante u(4) = 3 ( d ' - d r ) dans l'intervalle (dans R 2) ] - oo, ln2]. - Le syst~me (10) avec

50 1 ) ( : : ) C =

0

-2

3

I

i

--3

et b =

(16)

admet un @quilibre instable en (0,0, 0); il peut ~tre stabilis~ globalement dans l'intervalle (darts R a) [-ln(0.9, 0.88, 0.2), ln(1 + 10 -a, 1 + 2.10 -a, 1 + 10-3)] avec un contr61e positif sur la premiere composante que l'on peut prendre nul si ~1 _< ~ et @gal ~ u(~l) = 6.104(e ~ - e ~) si ~1 _> ~[- Remarquons que ce contr61e n'est pas d@rivable en ~*, mais on v@rifie que le syst~me boucld reste C 1 partout. * Biologiquement, ~ repr~sente un taux de croissance; l'exemple des modules de LotkaVolterra nous dit que, dans certains cas, on pourra stabiliser un syst~me biologique si on salt agir sur le taux de croissance d'une esp~ce dans les deux sens; et, dans d'autres cas, si on ne salt agir que dans un sens (en la chassant ou en augmentant le taux de reproduction par exemple). R e m e r c i e m e n t s " L'auteur tient & remercier les rapporteurs pour leurs remarques judicieuses. lZdfdrences [1] D. Aeyels, Stabilization of a class of nonlinear systems by a smooth feedback control, Systems control Lett. 5 (198.5), pp. 2S9-294. [2] J.L. Gouz6, Structure des modbles matMmatiques en biologie, in: A.Bensoussan et J.L. Lions, eds., Analysis and optimization of systems, Lecture Notes in Control and Information Sciences No. 111, Springer-Verlag, Berlin (1988) [3] J.L. Gouz4, A criterion of global convergence to equilibrium for differential systems, Rapport Inria No. 894, (1988) [4] M.W. Hirsch, Systems of differential equations which are competitive or cooperative I: Limit sets, SIAM J. Math. Anal.,13 (1982),pp. 167-179.

331

[5] M.W. Hirsch, Systems of differential equations which are competitive or cooperative II: Convergence almost everywhere, SIAM J. Math. Anal., I6 (1985), pp 432-439. [6] M.W. Hirsch, The dynamical approach to differential equations, Bull. Amer. Math. Soc., ll (1984), pp 1-64. [7] R.M. May, Stability and complexity in model ecosystems, Princeton University Press, Princeton (1974) [8] N.Rouche et J.Mahwin, Equations diff~rentielles ordinaires, tomes 1 et 2, Masson, Paris (1973) [9] It.L. Smith, Systems of ordinary differential equations which generates an order preserving flow. A survey of results, SIAM Review, 30 (1988), pp. 87-113 [10] H.L. Smith, Competing subcommunities of mutualists and a generalized Kamke theorem, SIAM J. Appl. Math, 46(1986), pp 8.57-874. [11] Y. Takeuchi, N. Adachi and H. Tokumaru, Global stability of ecosystems of the generalized Volterra type, Math. Biosci., 10 (1980), pp 119-136. [12] V. Volterra, Variations and fluctuations in the numbers of coexisting animal species, (1927) in: F.M. Scudo and J.R. Ziegter, eds., The golden age of theoritical ecology: 1923-1940, Lecture notes in biomathematics No. 22, Springer-Verlag, Berlin (1978)

[13]

w. Walter, Differential and integral inequalities, Springer-V'erlag, Berlin (1970)

CONTROLLABILITY OF DELAY-DIFFERENTIAL SYSTEMS Paula Rocha Technical University of Delft P.O. Box 356 2600 AJ Delft

Jan C. Willems University of Groningen P.O. Box 800 9700 AV Groningen

A b s t r a c t : The concept of controllability is introduced and investigated for the class of AR delay-differential systems with separable AR descriptions. For this class of systems, it is shown that a system Z described by the AR equation

R(al,o'2)w=o (with a I the

differentiation- and a 2 the delay-operator) is controllable if and only if rank R(A,e-~) is constant for all A9 s

1. INTRODUCTION The aim of this paper is to investigate the concept of external controllability for delay-differential systems. The contributions to the area of delay-differential systems are vast, and can be divided into two main streams. On the one hand, the infinite dimensional state space approach of, among others, Bhat and Koivo, and Manitius Trigianni, [BK], [MT], and on the other hand the algebraic approach developed, for instance, in [L]], [M] and IS], which views delay-differential systems as systems over rings. Both these approaches are concerned with internal system properties, such as, for instance, controllability and observability of the state space. Recently, an external characterization of approximate controllability of the standard observable realizations

of delay-differential

transfer

functions has been given by Yamamoto, [Y]. However, although externally characterized, this controllability

property is defined on an

internal level,

as it concerns state

space realizations. Our purpose is to investigate for the particular case of delay-differential systems the concept of controllability introduced by Willems [W2] for general dynamical systems. In the behavioral framework of [W2], a system is viewed essentially as a set of external trajectories,

and thus controllability is defined as a property of these trajectories.

333

Here, we will consider the class of the s o - c a l l e d AR d e l a y - d i f f e r e n t i a l systems, and derive a controllability test in terms of the system AR representations. 2. CONTROLLABILITY OF DYNAMICAL SYSTEMS

We start with the basic definition of a dynamical system introduced in [W1].

D e f i n i t i o n 1. A dynamical system Z is defined as a triple Z := (T,W,~), where T c_R is the time set, W is the signal set and ~9 _c Wr := {w:T-+W} is the system behavior. Thus, a system is essentially characterized by its behavior, i.e. by the family of all trajectories in the domain W"T which are compatible with the system laws. Note that in this definition no distinction is made between inputs and outputs among the system variables. In fact, our approach is more general than the input/output framework, as input/output systems can be viewed as dynamical systems in the sense of Definition 1, if instead of inputs u and outputs y we speak of system trajectories = col(u, y).

In the sequel we will consider excIusively continuous time systems, i.e., systems with time set T=R. Intuitively, we will say that a system is controllable if it has limited range memory, i.e., if the values of the system trajectories on two intervals ( - c o , t ] and [t+d, +co) are in some sense independent provided that d > 0 is sufficiently large. Given two trajectories wl,w2eW R we define the concatenation of tox with w~ at time t e R as the trajectory w e W R given by w[(_|

and wl[t+|

We will

denote w := w 1 A w 2. D e f i n i t i o n 2. A dynamical system E = ( R , W , ~ ) is said to be controllable if the following condition holds. {Wle~ , W2e~ , teR}.~{3d>0

3w*e~B s.t. WlAUr*t+AdW2EB }

3. AR AND MA DELAY-DIFFERENTIAL SYSTEMS

The dynamical system E=(R,Rq,B) will be called an AR delay-differential system if its behavior B can be described as the solution set of a finite number of linear equations on the derivatives and delays of the trajectories, as indicated below.

334

~'*~ ~'~

d*

k=kl ~ ~=el Rke

VteR

--

dt k w(t - ~ ) = 0

with k l < k 2 positive integers, ~l

. .

ni

Bsz=block-diag

L 'sJ 4---- hi-1 -"-->

9

?J B61=0

Now, if one applies this result to systems without input but with outputs (E,A,0,C), one obtain the following result that is deduced by duality9 Split in two of the lists {qi} and {vi} under the action of ..q'R={(W,V,0,S,0,R)}:

375 Any triple (E,A,C) can be given, through the action of E'R, the following equivalent form: [ sE-A ]= block-diag [ sEi -Ai]

C = [Cij]

i=1 to 2, j=l to 6

* [ sE 1 -A 1 ]= block-diag [Lkij] with Lkii unchanged and Cil =0 'Vi * [ sE 2 -A 2 ]= block-diag [L~i] with

Ls unchanged and Ci2 =0 'v'i

* Associated to the two types of {qi} there are two types of blocks:

[ sE 3-A 3 ]= block-diag [L~i]

with L(i =

is 1 . -11, . 9 ~i

Ci3=0 Yi

L 1J* 4--- ~i-1 --~

[ sE 4-A 4]= block-diag [L~i]

with L~i =

I

s

1`

1

is $ 4--+

4--

.ol]}

014---- block-diag { [ 0

024=0

* Associated to the two types of {vi} there is two types of blocs:

[ sE 5-A 5]= block-diag [Lmi]

with

S . Lm i =

.

Ci5=0 Vi

| mi

sl -J $ is1 [sE6-A6]=block-diag[Lqi ]

.

11,

~

with Lq,= L

.

jqi-1

sl 4--

qi

C26= block-diag {[1 0 . .

4.

"-')'

0 ]}

C16=0

Let us now come back to general systems with input and output equations; The reduction procedure of ~l(s), ~2(s) and s lead to a split in two of the lists {~i} and {qi}, and a split in four of the list {vi}; the list of the finite elementary divisors is unchanged.

376 We are now able to state our main result : Theorem 2,1 9 Any (E,A,B,C) quadruple (1.1) can be given through the action of ,.q"the following equivalent block-diagonal form :

II

II

_~

j i

-,Z"

3 m

3

/

I / ~ Oj

II

I ,_S

377 One can remark the following important particular cases : oi =1 corresponds to a zero column in sEc- Ac and Cc [,i =1 corresponds to a zero row in sEc- Ac and Bc qi =1 corresponds to a zero column in sEc- Ac but a non zero column in Cc ni =1 corresponds to a zero row in sEc- Ac but a non zero row in Bc Note again than some zero columns (respectively zero rows) appear in Bc (resp. in Cc) when B is not monic (resp. if C is not epic).

11.2 INVARIANTS. CANONICAL FORM : Let us now introduce the following algorithms. (2.2.1)

q2"~

q/~+]=KerCc-~-](E~/~+lmB)

(2.2.2)

ff.,~0=,,~'C "t~#+l=KerCc'~A'i(E~'ro~-~'+lmB)

J

(2.2.3)

8~

8#+i=E'l(A(KerC~3~)+ImB)

l,U, oc$oc:,U,1 c ... ~j,~S~..."

(2.2.4)

flJ'~

"lJ'~'+l=E'l (A(Ke rCc~fl.Y~)+l m B)

J

(2.2.5)

-~'~

%~+l=E(KerCnA'l(~'-#+lmB)) l ,,~o ~q/,oD~l ... ~ p - _ ~ . . .

(2.2.6)

~o=,,~,

,,~+l=E(KerC,qA-1(,~#+lmB)) J

(2.2.7) _8~

l ~2"0Dff-~'0D"V1...Dq/'#D'V'~'...

-

-

~#+l=A(KerCc~E "18_~)+lmB

(2.2.1), (2.2.2) and (2.2.4) were introduced by 0zs [6]; (2.2.3), (2.2.5) and (2.2.7) were defined by Malabre [7]. The convergence of (2.2.6) is a consequence of the above mentioned inclusions. We shall now characterize the indices which appear in theorem 2.1 in terms of the dimensions of the steps of these algorithms. Prooosition 2.1 9 {(S-Or.i)kij} is the list of the invariant factors of (Eq/*/E ,,,,,,,,~*[IA+BF+RCII"V*/,.%*), the map induced in the, quot,ient subspa,ces,~"/,.,%* and Eq/' /F_.,~ where F and R are such (A+BF+RC)'],/' c:E'12" and ~ ='1,/" n3 9

-

.

[ W ' * n . ~ '-I ]

- card {oi > P.}=d'm tfl),,n flj,~._ll

V #>1

card {~>__p.}=dim Lfl), n8 ~-I

V p.>l

378

-

card {r~i> i~}=dim I --~+'~#-tl

9 rKe rE n ff.,.9.~-i] -card{mi=~}=omlKerEnfls I

- card {pi = #}=dim

~'lmBnE~"-'l

llmBnEff~_] j

Particular case:

v

1

V ~>1

V p.> 1

card {~i> I~}=dim I--~*+-~-' l

V I1> 1

card {hi = p.}=dim I ImBnE%~rL-2IV i imBnE1s j 1~>-2

card

= ,}=dim

IKerEn f17~-' 1

V

>_ 1

card {ni = ;}=dim ,J ImB "[ llmBnlmEJ

Proof: The characterization of the polynomials {(s-oq)kij} is due to Malabre [7]. The other relations can be obtained by direct counting of the number of vectors in bases of the considered subspaces (see for ideas Loiseau [8]). We can remark that the algorithms (2.2.1) to (2.2.6) are invariant under feedback, output injection and changes of basis. This shows us that the indices {cri}, {7i}, {~J, {~}, {mi}, {ni}, {Pi}, {qi}

and the polynomials {(s-ai)kij} constitute a complete set of invariants for quadruple (E,A,B,C) under the transformation group S. In other words, this proves the canonicity of the form described in Theorem 2.1 . It is now important to compare this form with other existing ones and particulary the canonical form of Van Der Weiden and Bosgra.

III COMPARISON WITH THE CANONICAL FORM OF VAN DER WEIDEN AND BOSGRA; Van der Weiden and Bosgra (VDW & B) introduced [1], for somes quadruples (E,A,B,C), a canonical form which looks like our form. We shall compare both canonical forms. Two facts must be pointed out : 1)In the VDW & B's process, only the dynamical part of Dg is considered as the "actual" state space. Thus the system is described by the so-called "strictly generalized state space form" where E has a particular description "

o] From our point of view, the system is in a general form (see (1.1)): the state space is .,%. This is not a restrictive constraint : one can also, in our case, consider systems in strictly generalized state space form ( this is just a matter of basis transformations.).

379 2)The authorized transformations, in both cases, are in close connextion with the underlying state space. - This is why VDW & B use the" strict pencil equivalence" : PVDW&a(S)= M P(s) N with

[ o u=~,o//-A=I

Mll M12RI]~-sI-A11"A12B1]rNll

-

0 y~2]

-A=2B2/|N21 N22

=Pvow~.a(s)

0 X2 S JL -01 -02 0 JL F1 0 R~ and F1 only act on the dynamical part of 3(5 X2 and Y2 allow combinations between the other part of ~ and the inputs or the ouputs. In our case, the authorized transformations can be formalized by :

wl, w12RllEs A1 AI , :tE v11v12i]

W021W22 R2 0 S

-A21 -A22 -C1 "C2

V21 V22 F1 F2

=

Po(S)

R21 and [ F1 F2] act on the full state space 3P.,. JR' Only internal transformations (changes of bases) are allowed on the inputs and outputs. This second point shows that the "strict pencil equivalence" and our "proportional equivalence" are really different. We shall now illustrate this difference with an example. Consider the following systems which are equivalent in the sense of VDW & B :

l-s lolo- I 9 [-sE1-A1B1

I0s

P2cs~;EsE2A2~I

111 1

~'IS~=L ci o]=Ioo ~IoI

02

L. q-5-5 o[6-_.1

Esl~ o silo

=

0 0 1 I1 100

Indeed, they have the same VDW & B canonical form:

,oo iooiI oiOOI

r,oOOoqro,_, 0

s

sll

010

o., .,Jl., 0 1 0 j l 0 0 1 ~oo ~ 1 7060 0 0 0 0

s01 = 1 0 0 1 0 =PlvDW&B(S) L1 0 0 0

loolioooj lOOI

rio oo O]ro s,o 1-1 0

L~176 I~ oo,

P1VDW&B(s)=P2vow&B(S)

loo roso, oll II oo,,1 -L~olo _-P2vow,o~s~ 00 0 / L 0 0 0 -1

0 00

Now look at their canonical form under the action of our group (W,V,G,S,F,R).

380 s01 ~1 ro '~176 =Plo(s)

LSOo'tJL? o Oo'jLOo o Oo;, ~J--I oo,o

Pl =2 "ml =1

L1 o o o

o1 Fo,oo sl0

?JL?'o'oJL~ oj=Looo,

= P2o(S)

ql =3 ;nl =1

1000

Since they do not have the same lists of invariants, these systems do not belong to the same orbit. This implies that both transformation groups are actually different: and the associated canonical form too.

IV CONCLUSION: We have introduced a new canonical form for descriptor systems with an output equation. In other words, we have characterized by nine lists of invariants the orbit of any (E,A,B,C) quadruple under the action of the proportional group. It generalizes the form given by Morse [3] in the strictly proper case. The considered systems have been taken in a general form without any restriction of regularity, regularizability, controllability. E and A are not even assumed to be square. More properties of this canonical form will be investigated in future work. AcknowledgemQn(; The authors gratefully thank Dr. M. Malabre for helpful conversations.

References: Van Der Weiden ,A.J.J. & Bosgra,O.H., "The determination of structural properties of a linear multivariable system by operation of system similarity.", Int.J.Control, 1980, Vo1.32, N~ pp 489-537. [2] Gantmacher.F.R (1959), "The theory of matrices, Vol.2, Chelsea, New York. [3] Morse,A.S. "Structural invariants of linear multivariable systems", SIAM.J.Control, Vol.11, N~ August 1973. [4] Loiseau,J.J.;C)zs & Karcanias.N, "A feedback classification of singular systems", IFAC WORKSHOP "System Structure & Control :State Space & Polynomial Methods" sept 25-27,1989 Prague. [5] Jaffe,S. & Karcanias,N. (1981), " Matrix pencil characterization of almost (A,B)-invariant s..ubspaces: A classification of geometric concepts", Int.J.ControI,Vol.33, n~ 1, pp.51-93. [6] Ozs of descriptor systems", PHD thesis Georgia institut of technology, May 1985. [7] Malabre,M., " A structural approach for linear singular systems", International Mini Symposium on Singular Systems, Atlanta, 4-6 December 1987. [8] Loiseau.J.J., "Some geometric coniderations about the Kronecker normal form", Int.J.Control, Vol42, n~ pp. 1411-1431, (1985). [1]

E v a l u a t i o n T r a n s f o r m a n d S y m b o l i c C a l c u l u s for N o n l i n e a r

Control Systems

V. Hoang Ngoc Minh ~ G. Jacob, L.I.F.L. - U.A. 369 C.N.I~.S. Universit6 Lille I, 59655 Villeneuve d'Ascq, France. A b s t r a c t : Given a nonlinear control system, one can view its output function as a signal, parametrized by the primitives of the input functions. This signal can be formally described by its Fliess' power series, that is a formal power series on noncommuting variables. The temporal behaviour of the system can be derived h'om this symbolic description by a transform, that we call "Evaluation transform" and that generalizes the inverse Laplace transform to the nonlinear area. We developpe here the basic tools of that symbolic calculus by introducing a "kernel" for our Evaluation transform. This kernel can be viewed as some "temporal memory" of the systena in the Volterra's meaning as well as in the programmation meaning.

1o I n t r o d u c t i o n

and notations

Following M. Sliess ([2]) the output Y(~) of any nonlinear control system can be symbolically described by its generating series S = ~ < SIw > w (also called Fliess' wEZ ~ power series) that is a formal power series on noncommuting variables belonging to an alphabet % -- {z0, Zl,...,zm}. This series is nothing else as the adequate generalization of the Heaviside calculus ([2], [3], [8], [9]) in the nonlinear area : the notions of transfer function and of impulsive response, coding signals produced by linear or multilinear control systems, axe generalized to generating series and Volterra series, coding signals produced by nonlinear control systems. The inverse Laplace transform allows to recover the signal symbolically described by some transfer function. In the same way, the Evaluation transform ([5], [6]) allows in return to easy derive the temporal behaviour of the system from its symbolic description by computing the Evaluation Eu(w)

of each word w, for the i n p u t a :

integral

( a z~

a z~

...

a ~m), as b e i n g tile i t e r a t e d

/0'

~uw. It follows from the Fliess' f u n d a m e n t a l f o r m u l a (also called Peano-

Balker formula) which presents the o u t p u t y(t) as the E v a l u a t i o n of the g e n e r a t i n g series S : y(t) =

~ < S l w > g~,(w)(~). So the o u t p u t y(t) can b e viewed as a signal wEZ9

d e p e n d i n g on the parameters ~(~) =

/0'

a~(~-)dT,

z E Z ([5], [6]).

We develop here, in c o n t i n u a t i o n of [5] a n d [6], the basic elements of this symbolic calculus using the notations of E v a l u a t i o n via kernel functions which plays the rule of t e m p o r a l m e m o r y of systems in the Volterra's m e a n i n g as well as in the p r o g r a m m a t i o n m e a n i n g . So the Evaluation transform becomes a f u n c t i o n a l d e p e n d i n g o n the kernel a n d on the inputs. This systematic t r e a t e m e n t has been used in [7] to write M A C S Y M A programs by use of its recursive definition a n d of the recursive i n t e r n a l r e p r e s e n t a t i o n by the b i n a r y tree of the linear c o m b i n a t i o n s of the n o n c o m m u t a t i v e r a t i o n a l fractions describ ed by (Coz j o ) * P ~ z q ( c l zj~ )*P~zi~ (c2 zj~ )'1'~... zi~_~ (ck-1 z j ~ _ ~ )*P~-~ z i k (cl, zj~ )*pk, where p o , 9 9 9 Pk a r e integers, Co, .. 9 ck are complex n u m b e r s , Zjo , . . . , z j k a n d Zio , 9 9 9 zi~ are letters in Z.

382

Recall that a f o r m a l p o w e r series o n t h e a s s o c i a t i v e v a r i a b l e s z E Z ( n o n c o m m u t i n g if card Z > 2) with coefficients in/it(([1]), is any m a p p i n g : S:Z*

~ ~( w l

~< S I w >,

and the set of ~11 formal power series over Z is denoted by Zr( >. A formal power series S will be written as a formal sum E < SIw > w, where w6Z" < SIw > is the coefficient of the word w in S. A formal power series S E ~r( > will be said q u a s i r e g u l a r if and only if its constant term < S]~ > is equal to O. The s u m of two formal power series S and T is the formal power series S + T in ~f > defined by : V w E Z*,

< S + T I w > = < S [ w > + < TIzu > .

The C a u c h y p r o d u c t , noted by ".", of two formal power series S mad T is the formal power series S.T in iK > defined by : V w E Z*,

< S.TIw > =

~ ltjvEZ*

< S[u > < T I v > . ~It l J : l o

The symbol "." will be omitted when there is no ambiguity. For aa~y quasiregular formal power series S in/T*2 >, S* represents classically the formal power series E S'~' In commutative variables, it coincides with the rational n>0

fractio~ 1/(~

-

s ) , ~nd in ~his case ,,~ have S "~ = (1/(~ - S)) ~. 2. S y m b o l i c c a l c u l u s

2.1. E v a l u a t i o n o f f o r m a l p o w e r s e r i e s Let Z = {zo, z l ; . . . , zm} be a finite alphabet. D e f i n i t i o n 2.1.1. : W e wii1 call i n p u t r e l a t i v e to Z t/~e given of a vector a = ( a z~ a z' ... a z~ ) of" p i e c e w i s e c o n t i n o u s rea~ vaJued f u n c t i o n s defined on a c o m p a c t intez~val [0, t] C ~ + . ConventionaJly t h e O_component o f a n y i n p u t is a z~ ~ I. F o l l o w i n g K . T . Chen ([2]), we will coil p a t h a s s o c i a t e d t o t h e i n p u t a --(a

z0

a"

...

a zo,

), the time depenae~t vector r = ' ( ~,o

~,,

...

~,m ) de~ned

as follows :

/"

/o"

In all the sequel, we suppose that the function f is Stieltjes integrable with respect to { ~ } ~ z defined over [0,t], (t _> 0), and f vanishes at zero. The unit step (vanishing at zero) is noted " u n " : v , E ]O,t],

~(~)

= 1,

~(0)

= O.

383

D e f i n i t i o n 2.1.2. : T h e E v a l u a t i o n o f t h e w o r d w in Z* w i t h r e s p e c t t o t h e k e r n e l f , for the i ~ p u t a = ( a ~~ a ~ . . . a ~" ) related to t h e finite a l p h a b e t Z , is defined by i n d u c t i o n on the l e n g t h o f w as follows :

&(f;w)(O

=

,

&(f;v)(r) d~,(r)

if

w = vz.

This definition is extended to g ( > in the following way : D e f n i t i o n 2.1.3. : W e will call E v a l u a t i o n o f t h e f o r m a l p o w e r series S in g~" > w i t h r e s p e c t to t h e k e r n e l f , for theinpu~ a = ( a ~~ a ~* . . . a ~=) related to the finite a l p h a b e t Z , w h e n it is defined, the functionaJ : Ca(f;S) =

E

< SIw > Sa(f;w).

wEZ ~

In particular, for f = u n , the E v a l u a t i o n o f t h e f o r m a l p o w e r series S in f f ( < < Z > > , f o r the i n p u t a = ( a ~~ a ~ . . . a == ) r e l a t e d to the finite a j P h a b e t Z , is the function ([5], [6]): & ( s ) = & ( ~ ; s).

According to the Fliess' fundamental formula, we see the Evaluation of a formal power series S can be viewed as a transform that associates to S the signal depending on the primitives {(~ },~z of the inputs functions, and the Evaluation transform is notlfing else as a generalization of inverse Laplace and Fourier transforms ([5], [6]). The two following lemmas can be obtained easily by induction : L e m m a 2.1.1. : L e t u, v be two w o r d s in Z * . T h e n : Ca(f; UV) -'- Ca(~Ta(f ; u); V).

L e m m a 2.1.2.

:

L e t z be a letter in Z . For any n ~_ 1, we h a v e :

C,(/; z'~)(t) =

fo' f ( r ) (r

7n_~ - r ~).v

d~,(r).

I~ particular, for z = z0, we h a v e : Ea(f;z'~)(t) =

~

t

(t --

f(r)

T) n-1

(n-1)!

dr.

From the lemma 2.1.2., we can see that tile Evaluation transform is an "extension on noncommutative variables" of the exponential transform of the "ordinary generating

series" S = E

c n z " , which associates to S the "exponential generating series" ga(S) =

n>0

c,, ~ n>O

well known in "combinatorics" ([4]):

384

C o r o l l a r y 2.1.1. : Let z be a Jetter in Z. T h e n for any n >_ O, S~(z'~)(t) = ~"n-~. " zn p ~ r t i c ~ , ~or z = zo, ~hen ~o~ ~ y ~ >__ o, s a ( ~ ) ( t ) = ~t,~.

T h e o r e m 2.1.1. : Let S and T be two formal p o w e r series in g~ >. Let r be a scalar. T h e n : Z~(f; S + r.T) = E~(f; S ) + r.E=(f; T).

P r o o f : The proof is immediate since the linearity of the ordinary integrals

9

T h e o r e m 2.1.2. : Let S and T be two #orrnaJ power series in ~ ( >. T h e n : E~(f; S . T ) = E~(E~(f; S); T).

P r o o f : Actually, by the definition 2.1.3., we have : &(E=(f;S);T)=

~

~

< S]u > < T l v > E ~ ( E ~ ( f ; u ) ; v )

uEZ* v6Z*

= ~

~

< Sl~ >< TI~ > eo(f;~)

(lemma 9LL)

uEZ* vEZ*

\ u E Z * vEZ*

= Eo(f; S . T )

9

The introduction of the kernel f for the Evaluation function G ([5], [6]) allows to give a notion of memory for the system. This kernel can be viewed as the temporal memory of the system in the Volterra's meaning as well as in the p r o g r a m m a t i o n meaning ([7]), that justifies our approach. So the Evaluation transform becomes a functional depending on the kernel f and on the inputs {aZ}zEz. For implement these functionals, we used the A-notation of MACSYMA and the recursive internal representation by binary trees of the linear combinations of the noncommutative rational fractions described by

(co Zio )*P~ z , (Cl z~, )*p, z,2 (c~ zj~ )*~.~.. z~ _, (c~_~ zi~ _, )*~-'z,~ (ok zj~ )*~, where p 0 , . . . , ;k are integers, Co, 9

ca are complex numbers, Z j o , . . . , zjk and Zio , . . . , zik are letters.

T h e o r e m 2.1.3. : (Convolution theorem) L e t z be a letter in Z. Let H be the formal p o w e r series on the only letter in z. T h e Evaluation with respect to the kernel f , for the input a = ( a z~ a zl ... a zm ) related to the finite aiphabet Z , of the formal p o w e r series H is : E~(f; H ) ( t ) =

/o'

h(~z(t) - ~z(~-))df(r),

385

where h(~z(t)) the EvaIuation of the formal power series H. In particular, if H is a formal power series in g( >, then :

g~(f; I t ) ( t ) =

/'

h(t - , ) d r ( r ) .

Proof. : (i) F i r s t ease : H = z", n _> 0 : For n = 0, the result is i m m e d i a t e . If n > 0 then we have : 8 ~ ( f ; zn)(t) =

f ( T ) ((~(t)--U ( n - -~-z(r))"-I 1)! d~z(r')

= /'f(~)d[ - (4z(t)-~-~('))"]~,

(leumaa 2.1.2.)

J

(integration by parts)

=

/'

h(~,(t) - ~ , ( r ) ) d f ( r )

(ii) G e n e r a l case : H = E

( / ( 0 ) = 0)).

H " z " ' we have :

n>__o

E=(f; H)(t) = ~

H , e ~ ( f ; zn)(t)

n>O

~(~))~

(~(t)

-~i

df(~) (case (~))

n>o

=

i'

h((,(t) - ~(r))df(r).

In p a r t i c u l a r , for z = zo, since ~ 0 ( t ) = t, t h e n we have result

9

2 . 1 . 2 . : Let G E f~2 > be a s series, and let H be a power series on the only letter in z. T h e Evaluation with respect to the kernel f , for the i n p u t a = ( a ~~ a ~ . . . a ~'~ ) r e l a t e d ~o t h e / ~ n i t e alphabet Z, of the formM power series C H is : Corollary

s

~ ( a H ) ( t ) = < al~ >

h(~(t)) +

/' h(~.(t) -

~(T))dEo(a)(,-),

where h((~(t)) the E v a / u a t i o n o f the f o r m a / p o w e r series H. /'ormai power series in g ( >, then :

e ~ ( c ~ ) ( t ) = < cl~ >

h(t) +

In particular, i f 2I is a

/' h(t-,)aeo(C)(,).

386 P r o o f : Actually, since G H = < Gle > H + G I H , where G1 is the quasiregular formM power series Lr.2 >. Hence by the theorem 2.1.1, the Evaluation of the formal power series G H , for the input a = ( a ~~ a z~ ... a ~ ) related to the finite alphabet Z, is < G]e > h ( ~ ( t ) ) + g ~ ( G l t I ) ( t ) . Using the Convolution theorem in the particular case where the kernel f is the Evaluation of the quasiregular formal power series G1, we have the expected result * 3. C a l c u l u s e x a m p l e s Example

3.1.

In this example, we c o m p u t e the Evaluation of the series z *'~ for any letter z and for any integer n > 0. Lemma

3.1.1. : For any integer n > 1, we have g~((c~z)*~)(t) = exp(~(~(t))g~(c~(z(t)) ,

where the g,, are polynomials and verifie the following inductive equations :

{

g,,(a(~(t)) =

/o

g~-l(a(~(t)) + a

g~-l(a((~(t) - (~(r)))d~0-)

if

n > 1.

P r o o f : Since ga((az)*)(t) exp(c~(z(t)), we can write gl((~(z(t)) = 1. ~Ve suppose that the result is true for any u, 0 _< u < n - 1. For v = n, we have ( a z ) * ' * = ( a z ) * ( a z ) *~-~. By the induction hypothesis and by the corollary 2.1.2, we obtain : =

sa((~z)*")(t)

=

exp(~(z(t))j._l(~4z(t)) + ]o exp(~((~(t) - (~(~)))g.-l(~((~(t)

= exp(c~,(t)) g , - l ( a ~ ( t ) ) hence we have the expected result

Lemma

3.1.2.

:

9

+ a

- (~0-)))dexp(~(~0-))

g , , - l ( a ( ~ ( t ) - ~('r)))d~z(T) ,

_i()

The f~ilj, j.(~(,(t)) = j----0 ~ '~ -J i (~(,)J(~) j! , f o r n >_ I, is the

unique solution o f the inductive, equations :

~,,(~(t)) =

I

j,,-l(,~(t)) +,~

/

1 if ~ = 1 t

9,,-~(~(~(t)- ~O-)))d~(T)

if ,, > i.

387

Proof:

Given a I = 1 E •(

1, we have g , ( a ~ ( t ) ) = E~(Gn)(t), with G,, E f/( > . Thus we have (see the corollary 2.1.2.) E~(G,,) = $~(G,,-1)+S~(azG~-l). This equation is true if G~ = (1 + az)G~_l. In other words : n--1

~n : ('l'JvOlZ)n--i = ~ ( ~ j=O Since $~((az)J)(t) =

j!

1}(O~Z)J.

, j >_ 0, tl~en we have for any integer n >__1 : n--1

By the l e m m a 3.1.1. and the l e m m a 3.1.2., we have the following proposition : Propositon

: For any positive integer n, for any complex number a, we

3.1.1.

.,/]ave :

e~[(c~z)*~](t) =

exp[c~z(t)]~.; n J- i [c~G(t)]Jj! if

n > O.

In particular,/'or z = z0, we have : C

,.~[(aZo)

*n

](t) =

n--1

exp(at) ~

j=o

1 (at) j

n-

if

7z = 0

if

n>O.

J

By the theorem 2.1.2. and the proposition 3.1.1, we deduce the following theorem Theorem

3.1.1. : For any positive integer n, for any complex number a, we have

f(t)

[a(4~(t) j~ ~ ( r ) ) ] j df (7)

if

n=0

if n > O .

In particulaz, for z = zo, we have :

g~[f; (azo)*"](t) =

~ j=O

n J- 1

exp[a(t - ~-)] [a(t J[ - T)] j df(-r)

if

n=O

if

n>O.

388

E x a m p l e 3.2. In this example, we compute the Evaluation of the formal power series S that verifies the following polynomial equation :

S + #lSz + fl2Sz 2 + ... fl, Sz" = G, where z is a letter of Z, and G a formal power series of ~" >. We have S K = G, 71

where K is the formal power series of Zrx"> defined by E

flkzk with/30 = 1.

k=0

Since the constant term < Kle > = fl0 = 1 does not vanish, then the formal power series/x~-1 exists and is a formal power series in the single commutative variable z. Suppose that K admits r complex distinguished roots # 1 , . . . , #r of respective multiplicity order m l , . . . , mr. One can express mfically K -1 under partial fraction decomposition form

~_~ Ern~ (_-"~-l)k ~l,k f~l,k,

where for any I e [1..r] and for any k e [1..rn,], A,,k e ~T, and

l=l k=l

f V' " Let hl,k({~(t)) be the Evaluation of Hi,k:

each Hl,k is \-fiT]

hl,k(~z(t))=exp(~l))~(k--1

So S = GK -1 can be expressed as ~ m ,E

1

(proposition 3.1.1.).

Al,k GHl,k. By the theorem 2.1.1. and by (_#l)k

l=l k=l

the corollary 2.1.2, we obtain the Evaluation of S :

Za(S)(t) =

(_#l)k < Gle > hl,k(~z(t)) -t-

hl,k(~z(t)

--

G(r))dg.(G)(r)

.

1=1 k = l

Let us indicate that in the particular case z = z0, the above calculation corresponds mutadis mutandis to the inverse Laplace transform in the study of linear control systems. As conclusion, we can say that if the Fliess' series is considered as a symbolic encoding of input/output behaviour of the nonlinear control systems, then the Evaluation transform allows in return to easy derive the temporal behaviour from this symbolic description, (see [6] for a simple obtaintion of the Taylor expansion of the Volterra kernel). Thus, we get a generalization of the notion of transfer function (generating series on one variable) and impulse response, encoding signals produced by linear or multilinear systems, and the Heaviside calculus (Fourier and Laplace transforms) as already pointed out by M. Fliess and al. ([2], [3], [8], [9]). In the symbolic calculus for linear control systems area, the integration operator is noted by ,,1,, ~ . Here, it coincides with the letter z0. And the letters z of the Fliess' encoding alphabet Z plays an analogous

389 part : they encode the "Stieltjes integration operators". And we have the following Evaluations of some usual formal power series : S

$.(f; S) f(t)

d Zn

~f

~ (n - 1) f t exp({z(t)- {~(r)) ({*(t) -{*(r))Jdf(r) ,=o J ,o J! f ' (r - ~(,-))~ an ni df(r)

z*",n > 1

E

e/o-)

E

CnZ n

n>O

Jo

n>O

then, in particular Z* ~ ~

Z ~"

' e~p(r

n_~O

/

n:>O

~--~(_1)% 2"

'(r

t

~z(,))df(~)

-

-

~(~))

e~p(r

-

r

cos(~.(t) - (~(r))df(r)

n~O

~--Lj(__ 1)- z2n+ i

t sin({,(t) -

~.(r))df(r)

n>O

In particular, for f = un, we have the following Evaluations ([5], [6], [7]):

s

&(s) 1

s

Zn ,n

z ,n>_l

exp(~z(t)) ~

C(t)

~ - ~ Cn Zn n>O

n>O

then, in particular Z* ~ ~

Zn

exp(~(t))

n>O

Z

nZ n

4~(t) exp(r

n>O

~-~j(-1)~z 2~

cos(r

n)O

sin(~(t)) n~O

j

j[

390

4. R e f e r e n c e s [1] [2] [3]

[4] [5]

[6] [7] [8] [9]

J. Berstel and C. R e u t e n a u e r , S6ries rationelles e~ leurs langages, Masson, Collection E.r~.I., 1984. M. Fliess, Fonctionnelles causales non lindaires et ind~termin~es non commutative, Bull. Soc. Math France, 109, 1981, pp. 3-40. M. Fliess, M. L a m n a b h i and F. Lamnabhi-Lagarrigue, An algebraic approad~ to nonlinear functional expzmsions, IEEE Trans. Circ. Syst., CAS-30, 1983, pp. 554-570. D. Foata, La s~rie g~n~ratrice exponentielle dans les probl~mes d'~num~ration, S4mlnaire de Math. Sup., Presses de L'Universi[5 de Montr4al, Montr6~tl, 1974. V. Hoang Ngoc Minh, ElEments d'un calcul symbolique pour les syst6mes dynamiques non lindaires, Journ6es-S6minaire "TraJtements Alg6briques ct Informatiques des S~ries Formelles Non Commutatives", Lille, December 1988. V. Hoang 1Ngoc Minh and G. Jacob, Symbolic calculus and Volterra series, IFAC Symposium "Non Linear Control Systems Design", Capri, Juin 1989. V. Hoang iNgoc Minh and G. Jacob, Transformation d'Ewlua~ion et Calcul Symbolique pour les Systbmes Non Lindaires, LIFL technical report, IT 168, 1989. M. Lamnabhi, A new symbolic calculus for the reponse of nonlinear systems, Systems ~z Control Letters, 1982, pp. 154-162. M. Lamnabhi, Functional za2aJysis of nonlineaa" circuits : a generating power series approach, IEE proceeding, Vol. 133, Pt H, N ~ 5, pp. 375-384.

IMMERSION IN INFINITE DIMENSION H. Hammouri & S. Othman Laboratoire d'Automatique et de G6nie des Proc~d~s Universit~ Claude Bernard Lyon I, 43 Bd de 11 Nov. 1918; 69622 Villeurbanne FRANCE.

Abstract

:

The input-output map is the intrinsic object of a dynamical system . The immersion problem consists of sending a dynamical system to another one via a transformation which preserves the input-output maps. Many authors have studied the immersion of nonlinear systems into a finite dimensional state affine systems ( up to output injection ). This result can be applied to the synthesis of observers of nonlinear systems. In this paper we prove that under some assumptions, any autonomous single output system can be immersed into an infinite dimensional linear system on some Banach space. Keyw0rds : Nonlinear systems, infinite w

dimensional systems, immersion.

Introduction.;

The immersion problem is introduced by Fliess and Kupka [3] . They proved that if the observation space of some nonlinear system is finite dimensional then we can immerse the nonlinear system into a state affine system. We recall the definition of the immersion. Let (.T_.I), (~2) be the two following systems : 1

(T.1) k= F(u,x) y

(T_.2) ~'=

x(t) ~ M 1 , y(t) ~ Bm , u(t) c Rp

Ht(x) F2(u'z)

z(t) r M2 , w(t) c Rrn , u(t) c B p

W = H2(Z)

Where M 1 and M2 are two manifolds. We say that (T-l) is immersible into (~,2) if there exists a continuous function : T" M 1 ~ M2 SuChthat: If x(u,xO,t) and z(u,T(xO),t) are the unique trajectories of (~1) and (]~2) which start at xOand :(x O) respectively at t=to, then y(u,xO,t)=Hl(x(u,xO,t)) coincides

392 with W(U,T(X0),t)=H2(z(u,T(X0),t)) for all x0 and all admissible controls u(.) along some interval of t. Note that to construct an observer system, many authors classify the systems which are immersible into state affine systems up to output injection. See [1], [4], [5], [6], [7], [8]. All these immersions are finite dimensional. Under some assumptions, the authors in [2] constructed an infinite dimensional unitary immersion. In this paper we give an immersion of some autonomous nonlinear system into two infinite dimensional linear systems. The generalization to the non autonomous case can be obtained in the same w a y . w 2 An infinite dimensional immersion : A)

Banach

immersion

:

Let (T_.) be the system : {~( = f(x) y = h(x)

x(t) c M

y(t) E R

Where M is an analytic manifold, f a vector field and h a real analytic function Such (T_.) is said to be analytic. Let O be the observation space ( i.e. the R-vector space generated by the family { Lkf(h), kr N }, where Lkf(h) is the kth Lie derivative and L0f(h) = h ). Remark that if this family is not free then O is finite dimensional and by Fliess-Kupka's theorem [3], we can immerse (~) into a finite dimensional linear system. Along this paper the family { Lkf(h) , kc N } is free and satisfies the following assumptions : (A1) I I Lkf(h) II oo = Pk < oo Where I I I loo is the norm of L~176 functions on M ). (A2)

sup~ o

~k+l

( the space of essentially bounded mesurable

= ;k < +oo

IJk Now set e k = Lkf(h) and consider the R-vector space : E=

ale i

s.t:

~lailPio ~

li = I(ei) .

< +oo. ~ui

b)

III II = suPi~ o

where II i II is the norm of the continuous linear form .

In this way, we identify a continuous linear form on E with

. ..

an

ordered

lail

i['[0ai ( supi~o ._~ < +o= ) and we denote the norm by II II, 9 =

~.Li

N o w let L be the shift ooerator on E defined bv L(e=~ = e=..l,

for i r N

sequence

394 Remark 2 : 1) L is well defined and continuous on E §

§

§

Since L ( ~ ale i ) = % aiei+ 1 and II i=0

i=0

,.I- ee

+~

,Y_.,aiei+ 1 II = ,Y_.,la~l#~§ < ~,% la~l#~ i=0

i=0

i=0

~li+ 1

where

;~ =supiz o - -

which is finite by (A2) .

2) L is exactly the extension of the Lie derivative Lf to the functional this m e a n s that L restricted to O coincides with Lf.

space E,

NOW let A be the adjoint operator of L, A is a linear operator of H into itself defined by : = "=i--~oai+l

A(~C[ ) i = oai Let

T " M

T(x) : E - +~,o

--, H be the valuation function defined by : 9B +ee

for every x r M

,

i

where e i = Lf(h).

aie i ----, ~ aiei(x) I-0

i=0

By claim 1 part (i) T is well defined. In this paper we consider on H two topologies : (i) Its natural Banach topology as above. (ii) T h e *-weak topology. The second topology is defined by ' A s e q u e n c e {fn}ncN of H converges to fcH if and only if 9 V zcE ;

limn-.,o~ (fn-f)(z) = 0

Definition

3 :

We say that a function g : M ----. H is "-weakly continuous if and only if g is continuous with respect to the "- weak topology. Lemma 4 : (i) (ii)

The operator A is continuous. T h e function T is "-weakly continuous.

395 Proof (i) By a s s u m p t i o n

(A2) :

supi~0

~'li+ 1

= ;~ < +oo

Pi Let ~ E H ;

~= ~

,

i=0

II ~ IIH = sup=z 0 p. I

I ~:i+~l II A~; II H =suPiz0 ~

I ~i+1 I ~i+1 _< supiz0 _ _ 'Ui+l

~i

_ _ _0

;

3 No

,

V N>_N0

;

0 < ~ lanlPn < n=N - "~

A n d then : +~

oo

NO

I (~xo) - T(x))( T. ae i ) I i=0

-
0 such that if d(x n, x ) < q ( d ( . , . ) is a metric on M which is compatible with the topology of M ) then I Lfi(h)(xn) - Lfi(h)(x) I -< d K for 0 ~; i < N O , wh e re q depends on K and ~. +oo

Set K =

~ l a n l , w e can choose n=0 This m e a n s that :

Let ~ C H ;

~=

.=

~;~ ,

N 1 E N, such that

II ~ IJH = supi~0

Pi

Vn_>.N 1 ; d(Xn,X)_< q .

396 +

eo

I ("Kx.) - ~x) X)-'. aie i ) I ~ r

[]

i=O

Now consider t h e system :

(S)=

(t)=Cg t

~tcH

; w(t) c R

Where C is the continuous linear form : C : H - -

-'[a i i=O

.B ._, a o

Because A is a linear bounded operator, it becomes an infinitesimal generator of an analytic group e tA . And V ~0r H ; :3 ~(t) a unique analytic solution of ~t = A ~;t such that ~(0) = ~0 Theorem 5 ; Under the assumptions (A1), (A2), the system (~) is immersible into (s) for the "-weal~ topology on H . Proof of theorem 5 : Let xOr M and y(t) the output of the system (T_.) with respect to the initial state x O . Let w(t) be the output of (S) with respect to T(x0) . Let us prove that : y(.) - w(.) in some interval [0,T] . By analyticity of (T.) : +=o

13

+~

rl

t (n)/n ~ t Ln h o y(t) = ~: ~y ~v, = :~ ~ f()(x) n=o n=o Since

on some interval

[O,T1], TI>O .

w(t) is analytic, 13

+co

w(t) = :~ -~w(n)(o) n=O

on some interval

ill

More generally : Since C is a continuous linear form :

[O,T2], T2>O 9

397 n

d w = CAn~(t)

,

where A n is defined as follows :

dt n

for n>l for

An~ = A(Anq~)

V~r H

A ~ is the identity operator.

n=O

This gives " w(n)(0) = CAn~(0) = CAnT(x 0) + oo

= C [ ' ~ T i + n ( X 0) i=O + ~ i+n

=Cl-[Lf

0

(h)(x)

i=O

= Lnf(h)(x0) This implies that w(t) = y(t) on [0,T] where T = inf (T1,T2). [7 In order to construct an observer for the above infinite dimensional linear system, sometimes we minimize some quadratic convex criterion, for this we need a smooth norm. For this reason we have to embed H into a Hilbertian vector space FI for which the old immersion can be extended. B)

Hilbert

Now Let

l

immersion

H be the Hilbert space :

§ a i s.t 9 +~ ~ "=

:

i=O

Jaij2 < +co / 2

P i (i+1)

2

The scalar product and the norm of H are defined by : + ,=

+ .

+ ~

akb k

N 1 ;

2

I Lfk(h)(xn) - Lfk(h)(x) J < d M

for 0< k 2n + V"2"n, we have that

~(~+,,~+~ ~+,,~k+1) ___600.

n 100 200 300 400 500 600 700 800 900 1000

Itr. 14.90 15.10 15.40 15.70 16.10 16.00 16.00 16.00 16.00 16.00

Time 0.57 3.27 10.98 25.02 47.72 83.99 129.29 197.37 274.35 372.31

Table 1: fl=0.99, p=n Ls, cond=6, deg=6, na(x')=n/2.

3.2.

Computational l~esults with Sparse Matrices

Next, we use our algorithm for solving large-scale practical engineering problems. The obstacle problems, elastic-plastic torsion problems, and journal-bearing problems can be formulated as (1.1). Using instances of tile obstacle problem, we are going to test our algorithm. T h e obstacle problems are generated in the manner explained in [11]. For these problems, Q is a special block tridiagonal matrix. Superdiagonal and subdiagonal blocks of (2 are - I E/~,~x,~, where I is an identity matrix, and diagonal blocks are also tridiagonal matrices whose diagonal elements are 4 and their superdiagonal and subdiagonal elements are -1. All blocks of Q have a same size rn by m, where rn 2 = n. Note that Q is a very sparse syrmnetric positive definite matrix. Let c=-h

2 where h =

1 m + I"

Upper bounds and lower bounds of variables are given by (sin(9.2al) sin(O.3a2)) 3 < x, __ 0 et 3' > 0 song choisis trSs grands par rapport s c, La forme (2.6) correspond ~ la minimisation des pertes actives, alors que la forme (2.7) correspond h la recherche du plan de tension le plus ~lev~ possible. 0 u t r e les infigalit~s (2.4)(2.5) et les in~galitds V " s'5crivent : Pi~ + s = vi(O, V), Pi~ = VI(0, V),

V i e G,

Vi r G,

Q,g + Ci - S, - QI~ = r Ci - Si - Qir = r

S: L-y"

p? z~2 -~ + L2 n + ~. / ( : )

__ - tl, en i ( s ) = i a n d go t o S t e p III.

8.

i = i + 1, return to point 2.

-

psiII~g s.

II: + A * ' c :

S t e p III

x ' + l = z~,g "+t = 9~.

S t e p IV

s = s + 1, return to Step II.

Let us introduce some additionM designations. Let

T ( x ) = {y E an: f ( y ) f(alX + c~2y) for all cq, a2, x, y such that al+a2=

1,

al >0,

a2>O,

xET(x~

yET(x~

We next formulate a theorem about the convergence of Algorithm 4.2. Theorem

4.2 Lct a function f: R '~ ---* 12 be strictly convex (possibly nonsmooth) on the set T(x~

be given, and let the sequence {Aj}, j = 0, 1 , . . . of positive numbers satisfy the conditions

i=0

i=0

let v > 0

438 Then there ezists 5 such that

f(x ~) - f* _ ( g , Y - z) + ( ( z , y )

for all y E X

is nonempty, where ((x, y) is uniformly small with respect to [Ix - yl] on each compact subset I( C X, i.e. for each e > 0 there exists 6 > 0 such that I1~;(=, y)ll/ll= - yll < for =, y ~ so, 11= - yll < ~.

L e m m a 6.1 Let the function f : R n --~ 1~ be convez on 1~", and the set 13 E R nx,~ be convez.

Then the

function (p(B) = f ( x - pBBT~) is weakly convez on 13 and O~( B ) = { - p ( ~ ( ~ + r

e Of(= - pnnrr

L e m m a 6.1 gives a formula for the subdifferential of tile function qo,(B). For the a d a p t a t i o n of matrix B ' , the following gradient method can be used: B~+, = B~ + %1 (~k'9 "T + 9 " ~ r ) B~,i = 0, 1 . . . . .

(16)

where ~' denotes the normalized vector g~ E Of(x~) (see (12)). Analogous to Algorithm 4.2, we write an algorithm with t h e m a t r i x modification formula (16). Algorithm Step I

6.1 Ini tialization s = 0, Bo 1 = f , i = - 1 , j = - 1 , z ~ = xinlt,g ~ E Of(z~

S t e p II

1.

B8 =

2.

i=0.

S, B.%l,

if IIB$+~g'll = o otherwise

440 3.

P,i = argmino> o f (x" - pB~B~rg').

4.

j=j+l,j(s,i)=j.

5.

x~ = x" --psiB~B~Tg ,.

6.

Compute g~ e Of(x~) such that (g~, 23~B~'g ") < O.

7.

Bt+l = B Z + ) U ( ~ g ~ r

8.

If ]]x~ - x'll :> v, then i(s) = i and go to Step III.

9.

i = i + 1 and return to point 3 of Step II.

~_

g s ~is T )B~"

Step III

x~+x = -,,~z'~ a~+l = g~.

Step IV

s = s + 1 and return to Step II.

We formulate a theorem about the convergence of Algorithm 6.1 for smooth objective functions. T h e o r e m 6.1 Let the function f: 12'~ ~ R be strictly convex and smooth, Ll be a Lipschitz constant of the function f on the set T(x~

and L2 be a Lipschitz constant for gradient V f ( x ) on the set

T~(xo) d=ef[~'x:~eT(~0)min[Ix--y]l 0 and a sequence of positive numbers {A./} satisfying co

./=o

j=o

Then for Algorithm 6.1 there exists ~ such that

IIg~ll ~ 2~L=. R e m a r k As was mentioned before the exact steepest descent (see Step II-2 of Algorithm 6.1) is not implementable. Rough approximations are used for practical purposes.

7 1. 2. 3. 4.

5. 6. 7. 8.

References EAVES, B.C., ZANGWJLL, Generalized Cutting Plane Algorithms, SIAM Journal on Control, 9, pp. 529-542, 1971. BERTSEKAS, D.P., At~o ~r S.K., A Descent Numerical Method for Optimization Problems with Non-Differentiable Cost Functionals, SIAM Journal on Control, 11, pp. 637-652, 1973. WOLFS, P., A Method of Conjugate Subgradients for Minimizing Nondifferentiable Functions, Mathematical Programming Study 3, pp. 145-173, 1975. LSMAltECIIAL,C., STrtODIAT, J.J., AND BHIAIN, A., On a Bundle Algorithm for Nonsmooth Optimization, Nonlinear Programming 4, Edited by O.L. Mangasarian, R.R. Meyer, and S.M. Robinson, Academic Press, New York, I981. MI~F'LIN, R., A Modification and an Extension of Lemareehal's Algorithm for Nonsmooth Minimization, Mathematical Programming Study 17, pp. 77-90, 1982. KIWlEL, K.C., Methods of Descent for Nondifferentiable Optimization, Springer-Verlag, Berlin, 1985. DI~M'JANOV, V.F., AND VASlL'I~V,L.V., Nondifferentiable Optimization, Springer, New York, 1985. ROBINSON, S.M., Newton's Method for a Class of Nonsmooth Functions, SIAM Journal on Numerical Analysis, to appear.

441

9. Sltolt, N.Z., On a Structure of the Algorithms for Numerical Solution of the Optimal Planning and Designing, Kiev, PhD Thesis, 1964 (in Russian). 10. DENNIS,J.N., AND Moltl~, J.J., Quasi-Newton Methods, Motivation and Theory, SIAM Review, 19, pp. 46-89, 1977. 11. SltoR, N.Z., Minimization Methods for Non-Differcntiable Functions, Springer-Verlag, 1985. 12. POLJAR,B.T., Subgradient Methods: A Survey of Soviet Research, Proceedings of a IIASA Workshop "Nonsmooth Optimization," Edited by C. Lemarechal and R. Mifflin, 1977. 13. UR.YAS'EV,S.P., Stochastic Quasigradient Algorithms with Adaptively Controlled Parameters, IIASA, Laxenburg, Austria, WP-86-32, 1986. 14. EJtMOLIEV, Yu., AND WETS, R.J.-B., EBS. Numerical Teehiques for Stochastic Optimization, Springer-Verlag, 1988. 15. U~tYAS'SV,S.P., Adaptive Variable Metric Algorithms for Nonsmooth Optimization Problems, IIASA, Laxenburg, Austria, WP-88-60, 1988. 16. ROCKA~'ELLAR,R.T., Convex Analysis, Princeton Mathematics, Vol. 28, Princeton Univ. Press, 1970. 17. PSIIENYClINYI,B.N., Necessary Conditions for an Extremum. Dekker, New York, 1971. 18. NESTEROV,YU.E., Minimization Methods for Nonsmooth Convex and Quasiconvex Functions, Economika i mat. metodi, USSR, XX, pp. 519-531, 1984 (in Russian). 19. NIJRMINSK1, E., Numerical Methods for Solving Deterministic and Stochastic Minimax Problems, Naukova Dumka, Kiev, 1979 (in Russian). 20. HOVFMAN,A., Weak Convex Functions, Mullifunclions and Optimization, 27. IWK d. TH Ilmenau, tieft 5, pp. 33-36, 1982.

COMPOSITE

OPTIMIZATION:

SECOND

VALUE FUNCTIONS

problems

=

By

~(F(x)),

not

constraints.

is

Such

the of

a

all)

paper

we

composite

we

map

Composite minimization

problems

and

~ appear

give

Formula

for

Further

F

given on

is a s m o o t h

point

convex

(in

reduction

optlmlzatlon the

It

general

Forms

for

problems

wlth

order

lower

second

apply

f(X)

Form

to

obtaln

new

a neigborhood

of a

INTRODUCTION.

is the n ~ m e

involving

for a class

functions

of

of u n c o n s t r a i n e d

the f o r m

(I)

map from a Banach

space

x ~ X) into a n o t h e r B a n a c h

space

of

is

X Y

(or f r o m and g

is a c o n v e x

function

Y. The

role

important

composite

classes

programming, some

the

optimization

In composite optimization.

r(x) = g ( F ( x ) ) where

of

a

as

of

and

of is

classes

a

class

Functions

typically

Functlons

optimization

a

important

second order conditions and sensltlvlty estlmates

i.

mean

composite

smooth

problems

practically

In

epi-derlvatlve

Involvlng

F

Function.

(If

AND SENSITYVITY

opt|mlzatlon

constraints

where

non-smooth) many

composlte

without

CONDITIONS,

Alexander Ioffe of M a t h e m a t i c s , T h e T e c h n i o n H a i f a 32000, I s r a e l

Department

ABSTRACT.

ORDER

problems

equivalently,

of

optimization

semi-infinite of

optimization

shape

As an example,

problems

programming,

optimization

in o n e or a n o t h e r

sense,

let us c o n s i d e r

determined

-

with

relaxed see

reduced

the p r o b l e m

[Ill,

by

the

constraints optimal

that

many

(mathematical

control

[I2],[RI],

to a p r o b l e m

fact

problems,

[DZ])

can

of s u c h sort.

be

443

(p)

fo: X ~ R,

where

fo(X)

minimize

G(x) E K,

subject to

Set

G: X ~ U, E: X ~ Y

Y = RxUxV,

K = {u:O(u)

~ O}

and

take a continuous

and

dist(K,u)

E(x) =0, K

is a closed convex cone in

sublinear function

s ce+(u)

for some

function of the intersection of the polar of K

~(u)

U.

such that

c > 0 (say,

the support

w i t h the unit ball in

U~ )

and consider the functions

g1(y) = g1(=,u,v)

= max {~,e(u),llvll},

L

g2(y) = max {~,O(u)} + kllv]l; + Ilvll)

g3(y) = ~ + k(~+(u) +

(where

~

:

max { ~ , 0 } ) .

Finally,

we fix an

x ~ X

and set

F(x) = (fo(X) - fo(~),G(x),E(x)), fi(x) = Ei(F(x)),

i =1,2,3.

Then the following "reduction principles" I. If

x

is a strict local minimizer

hold:

to eiteher of

f.

and an admissible

1

element for II. local

(P), then

is an isolated local solution to the problem;

If the equality constraint

in

(P)

solution

f

attains

sufficiently III.

x

to

large

(P),

then

is an

in

(P)

are

f

Q(x)

r

=

{x:

G(x)

~ G}.

If

(cf.

[Ii]),

C

if, say, x

x ~ x

s u f f i c i e n t l y close to

=

x

(i.e.

x

x

x,

x

close to

{0},

then

at

is a x

for

the cost then

x

function is

is

a local

x.

means

that

x

we have

x ~ G and

a

fIT]

sufficient is that

condition

Q'(x)

is an isolated local solution of

we have (other than

for

is onto.

[P].)

satisfying

G(x)

f (x) > f (x). 0

of a neigborhood of

x

~ r.dist(O,Q(x)),

(For a survey of regularity results see Indeed

and

minimum

(P),

at

regularity given by the theorem of Ljusternik

for any admissible

at

in

~ C

such that for any dist(A,x)

A

local

x

attains a local minimum at

3

(Regularity of the constraint there is a constant

regular

admissible element

solution to (P) if and only if

where

a

at

k;

If the constraints

Lipschitz and

2

is regular

~ K, E(x)

(P), then

= O) which

is

This means that for any

x

0

x

itself)

either the inequality above

444 holds,

IIECx)II > 0

or

or

@(GCx)) > 0 which means that

f (x) > 0 , whereas 1

f (x) = O. On the other hand, if the latter is true and I than fo(X) > fo(X) which means that x is an isolated (P). This proves Principle Let

us

minimizer

prove for

f

valid and let

x

admissible other hand,

also

I for

Principle

III.

It

is

such

that

fo

since

~ fo(U)

of

local

to

that

CP).

solution

x

Suppose

is the

x. By regularity,

(r+l)(dist(K,G(x))

Ux - ull ~

is Lipschitz,

~(x) = s

obvious

solution

be any point close enough

u

is admissible, of

g1"

if it is a local

3

x

a

local

latter

is

there is an

+ llE(x)ll). On the

we have

~ fo(X)

+ lllx - ull

fo(X) + 1(r+l)(dist(K,G(x)) + llE(x)U) ~ f3(x). Using principle (P)

from

I, we can obtain sufficient

sufficient

conditions

for

either

and Ill, we can do the same for necessary f3

([BP],

[Ii],

[12],

reduction

to

procedures

for problems

[R2]).

composite

Thus, of

composite

unconstrained

optimization

advantages; of certain this

theory.

it also

connection

we

This

offers

difficulties

a good

number

of questions

(e.g.

condition

of

of [R2],

why

by

epi-derivative

of

instrument

technical

many

point"

them

problems

describing

the standard

sometimes

that

contain

a

composite

to many and

problems,

in specific formula

function

aesthetical the nature

better a pair

branches

seen.

In

of

"twin"

that

cover

[I4] and actually clarify a

Hessian

must be solved

a

4 and 5 stated

from which

are

[K2],

numerical

optimization.

and the other sufficient, [KI],

complicated

pair of second order conditions begin

has

second

order

conditions

or why the necessary

[KI] cannot have a sufficient

what kind of technical

We

approach

3

develop

[W]. Theorems

a unified

2 and

one necessary

cannot work in sufficiently order

of

to

via composite

"observation out

Theorems

the most recent developments

[FS],

analysis

approach

and ways

mention

second order conditions,

optimization

offers

for

f.; using principles II 1 associated with f and 2 makes it possible to use

III

with constraints

optimization

of

(of any order)

conditions

Principle

in section 4 show a way to sensitivity

conditions

"twin"

in order

ete.).They to obtain

second

also

show

a workable

cases. for

which

in proofs of all the above mentioned

the is

the

results.

lower

second

principal

order

technical

445 2.

Let

LOWER SECOND ORDER E P I - D E R I V A T I V E OF A COMPOSITE FUNCTION

f

be a function

order e p i - d e r i v a t i v e

X

which

is finite

at

x

along

h E X

f

at

with respect

The

is

limit as

x" = O, we write basic

if

t 9 0

0

for

[RI]

of

the

lower

all

second

nonzero

h,

then

A = F'(~),

= {y* 9 a g ( y ) :

K(x') = {h: g ' ( y ; A h ) (we write simply

~

and

K

if

f

w(h)

A'y"

c)

is

C 2 near

then

S

be

where

if

L

at

containing

defined by

x

a

local

x.

(I) . We set

= F"(~)(h,h);

s <x',h>}.

x;

is the linear subspace of and

7 + L

the collection

converging to zero. For any #s(h,w)

attains

x" = O) and assume that

L + Im A = Y

continuos with respect to Let

that

Y.

(regularity condition)

dom g - Y,

epi-derivative

= x'},

a) g is finite and lower semicintinuos at b) F

f

linear manifold

In what follows we consider functions 7 = F(~),

order

is the following:

m i n i m u m on e v e r y f i n i t e d i m e n s i o n a l

~(x*)

).

f"(x;h).

property

>

- t<x~,h'>)

of the epigraphs of the functions (see

its role in o p t i m i z a t i o n

f'~(x;h)

to

second

this means that the e p i g r a p h of the function h ~ f " ( x , x ~ ; h )

h ~ t-a(f(x + th) - f(x) - t<x",h>)

determines

lower

is

the upper Kuratowski

If

The

[RI]:

f'i(x,x';h)- = lim inf t-2(f(x + th') - f(x) h'~ h t 9+0 Geometrically,

x.

x" 6 X ~

defined as follows

of

on

at

there

z ~ X

such

spanned b y that

g

is

y + Az

of sequences

s E S

is a

Y

and any

= tim sup inf t 2(g( 7 + tn Ah" ng~ h'gh

s =

{t } of p o s i t i v e n h e X, w E Y, we set + t n% )

- (7)

- tn < x ' , h ' ~ ) ,

numbers

446 lim sup inf = sup lim sup n 9 m h'9 h e > 0 n ~ ~ i. The following

THEOREM

- either or

-

f"_(x,x~;h)

f2(x') ~ ~

= - ~

alternative for some

In

the

last

holds:

h;

and

f"(x,x';h)

+~

case

min ~s(h,w)). sES w -> ~s(h,w) is either =

the f u n c t i o n

or it is convex

continuous

with

in case w h e n g(-)

is a continuous

of the first

convex

is continuous,

f"_(x,x';h)

part

function.

g ( F ( x + th)) = g ( y +tAh w(.)

identically

equal

to

d o m ~;(h, ") = ~(x').

We shall give a sketch of the proof

and since

inf IIh'-hll<e

In this case

+ t2w(h))

it follows

of the theorem

+ o(t 2)

that

= lim inf t-a(g(y +tAh + t2w(h')) h'-) h

-g(y)

- t<x',h'>)

t ->+0 min

=

s ~S

lim sup tn~(g(y

+tnAh"

+ t2w(h))n - g(Y)

- tn<x%h'>)

h'-~h n-~oo

=

If

min #s(h, w(h) ). s E S

f[(~,x';h)

> - m

for any

lim inf t-1(E(~

h,

then

+tAh + t2w(h))

- g(y)

- t<x~,h>)

z 0

t 4+0 and,

consequently,

~(x ~) ~ ~.

g'(y;Ah)

In a similar

z <x',h>

for any

way we conclude

that

h

which

implies

h ~ K(x ~)

if

that

f[(~,x';h)

is

finite.

3. SECOND O R D E R

It is not a difficult second

order necessary

THEOREM

is that

2.

~ ~ ~

to d e r i v e

from Theorem

1 the f o l l o w i n g

condition.

A necessary and

matter

CONDITIONS

condition

#s(h,w(h))

~ 0

for

x

for all

to be a local h ~ K,

s E S

minimum or,

of

f(.)

equivalent-

447 ly, that for any

h E K, s ~ S

there is a

A

y* E ~

- ~[(h,y*)

such

that

~0.

For the "twin" sufficient condition we need some attraction property for

the cone

K

of critical directions.

(AP) ~ ~ z

and any bounded

This is a convenient one:

{h n}

sequence

a subsequence converging to a certain We observe

THEOREM 3

(AP)

for

and all nonzero h E K

0

for all

condition).

If

provided

that

f(.),

h E K

there is a

a

h

if

has

~ ~ z.

dim X < m.

holds,

(AP)

~s(h,w(h))

or, equivalently,

y* E ~

g'(y;Ah n) ~ 0

that

h E K.

is trivially satisfied if

(sufficient

minimizer

local

g'(y;Ahn)

that

The property

such

then

x

is a strict

> 0

for

a11

s E S

that for any s E S and any nonzero

such that

- #~(h,y*)

> 0

(2)

It follows from Theorems 2 and 3 that in every specific s i t u a t i o n we must be

able

to

calculate

@s

second order conditions. case

when

the

programming any

original

say,

problem

Since

with the Lagrangian, has

If,

#g

to

non-positive

(P)

order

to

be for

obtain

is a polyhedral is a

standard then

is p r e c i s e l y

a workable

function problem

~(h,y*)

the H e s s i a n

observed

in

this

connection

> O,

V

h E K,

condition but a weaker

natural necessary counterpart

that

calculated. terms.

in finite dimensional so called piecewise of them the

g

function

on

~

for

associated

is

always

(3)

In fact,

in

[RI]

(that is both

linear quadratic

g(').

is actually

(3)

>

by

ill]).

in some specific

Rockafellar

spaces

it does not have a

(by which we mean that replacing in

is either affine or quadratic.) f(-)

= 0

h ~ 0

the f u n c t i o n

situations

considered

dim X < ~

composite and

#s can

this has been functions

dim Y < m) with

(The latter means that the domain

can be broken into finite number of polyhedral g(.)

is the

mathematical

#~(h,y*)

It is possible to mention a few more cases in w h i c h

in different

of

function

one because

does not make the condition necessary - see

done

(which

of

y* ~ ~. Therefore the standard Lagrangian c o n d i t i o n

is still a sufficient

be e x p l i c i t l y

pair

we obtain the standard second order conditions.

a

g

in

with finitely many constraints),

h E K,

It

or

pieces and on each of

It turns out

epi-differentiable

and

that

in this case

a formula

for

the

448 epi-derivative class

was

obtained.

of functions

(see

This

[C2],

hold already for a function on quadratic functions.

result

[14]) R2

hidden

in

function of tiability

[K2]

extended

too more

to a more

general:

is the m a x i m u m

general

it may

of a linear

not

and a

A formula for the e p i - d e r i v a t i v e of the m a x - f u n c t i o n

in

t (and

of

not

which

f(x) =

is

can be

but

case

x

when

f(t,x)

max

f

has

a

polynomlal-type

is finite dimensional).

general

max-functions

(involving

behavior

a

A c r i t e r i o n for epi-differenmaximization

over

arbitrary

compact spaces and also defined on infinite dimensional Banach spaces) ven in [14]. We also mention the p a p e r b y Darkowskii and Levitin similar work was done in case when m a x i m i z a t i o n

as

is gi-

[DL] where a

is a p r o b l e m of concave prog-

ramming. In all

these

cases

epi-differentiable. involved

infinite

~s

does not

actually

This, unfortunately, number

possible

condition

sequences

not

taking

s

and

into

there

account

the

on

s

and

f(.)

is

is not a typical case when problems

of constraints

is considered,

semi-lnfinite p r o g r a m m i n g or optimal control. all

depend

is

such

as

problems

of

In general we have to scan over no

variety

hope of

that,

say,

second

a

order

necessary

directional

behaviors of a convex function can be s u f f i c i e n t l y powerful.

4. SENSITIVITY ANALYSIS

We now c o n s i d e r what happens Namely,

if we begin to slightly perturb

p(u) = inf g(F(x) x (For

simplicity

infimum above fixed small We

the map

F.

we consider the "value" function

shall

we

consider

is calculated

the

case

subject

of

a

+ u). "local"

to the condition

value

function

when

llx - xll < ~

for

the some

e > O. This does not cause any real loss of generality.) formulate

second order sufficient order directional

two

theorems

with

estimates

condition is satisfied,

derivatives of

p

at

and,

in

case

exact formulas for

when

the

the first

zero. (It should be remarked at this

449 point

that

order

derivatives

classical

we

typically of

problems

need

value

with

second

order

functions

equality

in

information

extremal

constraints

standard problems of mathematical

to

calculate

problems

and

-

see

first

lIT]

[ B ] , [ G ] , [GJ],

for

iS], for

programming.)

We set

p [ ( O , u ) = lim inf t-*(p(tu) - p ( O ) ) t ~ +0

and f o r

any

s

e S,

t~1(p(tn) -

p~+C0;u) : lim sup n9 THEOREM 4. For any

s ~ S

p(0))

~

we have

p~+(O;u) ~ inf ~s(h,w(h) + u) h If the sufficient condition of Theorem 3 is satisfied, p~(O;u) = Therefore,

if

Cs

differentiable

at

does 0

depend

along every

p'(u) = To formulate

Cs(h,w(h) + u)

min min s~Sh~K

not

on

u

~s(h ) = {ym ~ ~:

s,

then

p

is

directionally

and

r

min h~K

the second theorem,

then

+ u).

we set

for

h ~ K

- r

z 0}.

THEOREM 5. We always have

p~(O;u) ~

This

becomes

Moreover,

if

an

equality

Cs

(3.7)

and their norms by : lt("(t)ll = ~ ,

lb(O}l = IlA{t)l} ( 1 - < . , iiy(oil~.

A(t)

>=)~

(a.s)

At points to of a pa.th where V(to) --- 0, one may still define v(t) and a(t) by continuity provided that { T h e fight-hand sides in (3.7) are defined (3.9) V(to) = 0 =, and have a limit when t --+ to, t 7~ to Notice that (3.9) holds automatically as soon as ~o'(x) is injective for any z in C! Now that 90(C) is equipped with C2-pathes with respect to the arc-length, we define the r a d i u s o f c u r v a t u r e p(t) of a path P = ~o([z,y]) at parameter t by : p ( 0 = lla(t)lV ~ =

Itv(t)ll=cx- < v, A(t) ilA(t)ll" ~

>=)-~, > [IV(~)ll 2 -IIA(t)II -

(3.x0)

-

Following [51, we introduce also the g l o b a l r a d i u s o f c u r v a t u r e pa(t,t') of P at ~ seen from ~', which is given by the formula 0

pa(t,t') =

Sgn(f - t ) ( 1 - < v,v' >=)89 Sgn(t'-Q

ifSgn(t'-t)

2 0

>0

ifSgn(t'-*)

>0

and = 1. Geometrically, pa(l, *') is the distance of P ( t ) to the intersection of the two haK spaces normal to each end of the subpath of P located between P(t) and P(t'). It is related to the usual radius of cm'vature p(t) by limp(t,t') = p(t) (3.12) t~t I

455

We denote also by 5(t,t') the arc l e n g t h between the two points P(t) and P(t') of the path P = ~([~, v]) :

a(t,t') = / " IlV(r)lldr

(3.13)

and by O(t, t') the d e f l e c t i o n between the path directions at the two saane points : @(t,t') = c o s - l ( < v(t),v(t') >)(in radian),

(3.14)

which statisfles (cf [5]) :

O(t,t') 2 )rdr A(C,%P) (3.21)

< e,~(c,~,p), e >_ e(c,~,~)

In view of (3.20) it is reasonable to suppose that R a n d / ~ has been chosen such that as to satisfy

o

(4.4)

~(C)closed in F

(4.5)

Then : i) The NLLS problem (2.3) i~ Q-wellposed on 0 = {z 9 FId(z,~(C)) < / ~ }

(4.6)

for ~he p.~eudo-dis~ance 6(x,y) on C defined in (~.1). ii) More precisely, if zj 9 O,j = O, 1 satisfy

Iiz0 - z, lir + m ~ d(zj,~(C)) < d < ~ i=0,1

(4.7)

457

for some d, then the corresponding solution~ &j,j = O, 1 of (2.3) ~atisfy the s t a b i l i t y e s t i m a t e :

6(:~0,:~1) _< (1

d/R)-'ltzo - =,ll~

-

(4.8)

P r o o f : Hypothesis (2.1) (3.9) and (4.4) imply, using theorem 5.12 of [5], that r is strictly quasiconvex with a neighborhood v~ given by (4.6), which, together with the fact that ~(C) is closed, implies existence, uniqueness and Lipschitz stability of the projection on r all over ,9, and absence of local minima by theorems 3.6, 3.9 and 3.5 of [5]. [] Of course, if the strong property (4.2) on the derivative c2'(x)-the so-called "sensitivity Matrix" in the finite dimensional case-holds, then one obtains stability for the ]Ix - Y[IE distance on C, and c2(C ) is necessarily closed : C o r o l l a r y 4.1 Let hypothesis (2.1) and (.{.2) hold, and R >_ Ra be lower bounds ~o R(C,~,79) and RG(C,~,79) a~ defined in (3.16) (3.17) and (3.21).

If: RG > 0

(4.9)

Then T(C)

is closed in F

(4.10)

and conclusion~ i) and ii) of theorem (4.2) hold, with (4.8) replaced by o',,,ll:~o - :~,IIE

< ( I --

dlR)-'llzo

-

~,11,~

( 0, Ik,"(=)(y, y)ll~ < #llyll :v= e C, Vy e c Then we have the

(4.1~)

458 C o r o l l a r y 4.2 Let hypoihe~i~ (2.1) and (4.2) hold, and define : r: =

,~1~

,

e

=

(~l,~..)diam c

(4.17)

and : P~ =

R 0 < O 0. (This of course requires that the "variety" r itself has a "bounded curvature"). The condition Re: > 0 then can ahvays be satisfied by reducing the size of C, and is hence less critical. Let us first consider the case where E is finite dimensional, for example in parameter estimation in O.D.E.s, or in P.D.E.s after discretization has been performed. This case is very important practically, as it is the only one which one can actually attempt to solve on a computer ! One may then try to use the above geometrical theory to determine if the setting of the parameter estimation problem is satisfying, ie if the knowledge of C and ~ allows for a unique, stable determination of the parameter, and if so, which accuracy is required on the data. This requires the estimation of lower bounds R a n d / ~ to the smallest radius of curvature (3.16) and global radius of curvature (3.17), which for sure is not an easy task. When the dimension of E is not too large, one can try a numerical determination of R using (3.16) (3.10) and Ro using (3.17) (3.11), and use theorem 4.1 which given the most precise sufficient condition. This includes intensive computation (namely, along all segments [x, y] with extremities x and y located on the (relative) boundary OC of C[), which may quickly become unaffordable when the number of parameters is larger than a few units... But the reward for this computational effort is a treasurable ixfformation on the wellposcdncss of the NLLS problem and the absence of local minima in the objective function for any z in O, which is practically very useful both for the engineer who has set the

459

parameter estimation problem ("do I have enough information for recovering my parameter in a unique and stable way ?") and the numerical analyst in charge of the computations ("is my optimization routine going to be stuck in local m i n i m a 7"). W h e n this numerical approach is impossible, one may think of calculating analytically (i.e. with paper and pencil) a lower bound R using (3.16) (3.10), upper bounds @ using (3.25) and A using (3.24) (all these quantities are expressed by simple formula involving only r - x ) and ~"(z)(y - z , y - z), and then use theorem 4.2 to get information on the wellposedness of the NLLS problem. There are yet no example where this approach has been used, but the corresponding theory is just being released now in [5] and in this paper, so we hope that some application along this line will show up in the future. Let us turn now to the case where E is infinite dimensional, as for example in parameter estimation in P.D.E.s. We expect here that the generic (in an imprecise sense ...) situation is R(C,~o,'P) = 0, so that the above geometrical theory does not apply. As a support for this assertion, we refer to [2] where it was shown, for the model problem of the estimation of a diffusion coefficient function in a 1-D elliptic equation, that one can find, when the discretization is refined, a sequence of pathes on which the smallest radii of curvature tends to zero (these small radii of curvature are obtained for perturbations of the diffusion coefficient having smaller and smaller support containing one stationary point of the solution to the elliptic PDE). It would however be maybe possible to prove some wellposedness results in the somewhat academic case where the solution to the elliptic equation does not possess any stationary point. A bright singular point in this dark picture of the situation for the ilffinite dimensional case is given in [6], where tl~e estimation of the shape (i.e. a function) and the phase (a number) of a plane wane is discussed, and analysed using the above geometrical theory. To conclude on ilffinite dimensional E, let us mention that the geometric theory may reveal as a useful tool for analysing how the well-poscdness of the NLLS problem deteriorates when E is approximated by larger and larger finite dimensional spaces - multiscale aa~alysis of functions should play a crucial role here (see [2] and [71 for vet3 preliminmT results).

5

Well-posedness of the regularized N L L S problem

We investigate in this paragraph the wellposedness of the regularized problem (2.3) under the minimum set of hypotheses (2.1) (2.4). Of course, as we have not required any compact injection from s into E, this minimum set of hypothesis does not ensure in general even the existence of a solution .~, in opposition to the linear case (i.e. ~2 E s F)) where the same hypotheses ensure the existence of a unique f&. We shall be able in this paragraph to quantify the natural intuition that "a m i n i m m n amount" of regularization should be added in order to compensate for the non-linearity of ~o, mad restore a situation similar to that of the linear case. As mentioned at the end of paragraph 2, the study of the wellposedness of the regularized problem (2.3) can be made very simply by applying all results of paragraph 4 to the NLLS problem (2.3) with a proper choice for E, C, F, ~ m~d z as exptaned in (2.5). For sake of simplicity, we shall explicit this approach only for the case of corollary 4.2. Using hypothesis (2.1) and (2.4), we know that :

3&M > 0, II~,'(~).yllr -< aMII'JII~,W E C n C,V,j e C

(5.1)

3B > 0, IW'(z).(y, Y)IIF -< Bllyllg,w E c n E, vu E x

(5.2)

(notice that if E itself happens to be aa~ Hilbert space and the choice e = t3 is made, then &M coincides with aM defined in (4.2) and r coincidcs with/3 defined in (4.16)).

460 As in corollary 4.2, we shall need the size of C in the parameter space g :

m~(C,g)

= sup II= - Y l l ~

(5.3)

aa,yEC

but the position of the a-priori guess x0 with respect to C with also play a role through the "radius of C seen from x0 in g" : rad(C, xo, $) = sup

a:EC

11~ -

~01l~

(5.4)

In order to express the results in a simple form, we introduce the dimensionless quantities : (5.5) (5.6) (5.7) (5.8)

q g d(z,z') r

= = = =

rad(C, z0, s e/(/~diam(C, $))

S)

d(z,z')/(fldiam(C,s &M/(3diaan(C,$))

(position index of x0 w.r.t. C) (adimensional regularization pm-ameter) (adimensional distance in data space) (adimensionat upper bound to sensitivity)

Notice that 1/2_ ~

(5.zo)

,-ad(C, zo,~).

Using the adimensional variables (5.5 thru 8), we rewrite (5.23) (which implies/Q;.~ > 0!) as : g > 77

rsin 1/z + ((1 + ~ / ~ ) ~ / :

- 1) cos 1/g > r~

if 2/rr _< g if 1/~r < g < 2 / ~

(5.20)

and (5.22) as : ~

= ~{(~sln 1/~ + ((1 + r

- 1) cos 1/~) ~ - ,7;}'/~

(~.21)

if l/;r < ~ < 2/7r Hence we see that if g satisfies (5.24), then the regularized problem (2.3)~ is Q-wellposed on the neighborhood O of c2(C ) of adimenslonal size ~n.x given by (5.25), which is the announced result. [] \u have illustrated on figure 1 the q -+ g.fin function for various values of (, which makes clearly visible that the choice > m ~ { 2 / , ~ , ~} (5.22) ensures the Q-wellposedness of the regularized problem independently of the sensitivity index (. Figures 2 and 3 illustrate how the size dm~ of the neighborhood of ~p(C) depends on ~ (ta.kcn larger than g.~. of course !), ~7 and (. These curves can be used for example to determine, given an estimation of an upper bound dm~ of the measurement and model error, the smallest mnount of rcgularization g to be used in order to restore wellposedness of the NLLS problem and suppress local minima on a neighborhood of ~(C) large enough to contain the expected data,.

462

.p._t,o,' _~, .

I

1.0 from top to bottom : zeta = 10 zeta = 5 zeta= 1

O.B

/ /

"

/

0.7 2/.rr.

0.5 '

'

+

'

0.5

I

'

'

'

0.7

'

I

'

'

O.B

'

'

1.0

Figure 1: T h e n f i n l m u m value ~ m i n of t h e r e g u l a r i z a t i o n p a r a m e t e r as f u n c t i o n of the positio1~ index q of t h e a-priori guess and t h e sensitivity i n d e x ~ of t h e ~ m a p p i n g

I 25.0

2.2

/

20. o

/ /

from left to fight : eta = 1/'2, zeta = 0 eta = 0.57, zeta = 0 eta = 2/pi ~ eta=0"76~an zeta eta=0.88/ Y eta=l.OI

.ts. 0

10.0

-

/ r

/ / / /

/

from ]eft to fight : eta = 1/2, zeta = 0 eta = 0.57, zeta = 0 eta = 2/pi / eta = 0.76~ an zeta eta=0.88/ Y = .

1. 5

r r

ops_bor

'

/

#/// //7/// ////// ////// //////

0. 7

5.0

0.0

''l'l''J'l''''l''''l~l''

0.0

1.0

2.0

0.0 3.0

4.0

5.0

0.0

0.5

1.0

1.

.p~_bor Figure 2: T h e d.,~x f u n c t i o n giving Lhe size of t h e c y l i n d r i c a l n e i g h b o r h o o d for t h e regularized p r o b l e m , as f u n c t i o n of 7, for five values of t h e p o s i t i o n i n d e x 77 of t h e initial guess z0, a n d a zero value of t h e s e n s i t i v i t y p a r a m e t e r s ~ for t h e two curves 77 = .5 a n d q = .57 c o r r e s p o n d i n g to ,7 < 2 / 7 r . Left : general

overview

; Right

: close

up on the

[0,1.5]

interval.

463

0.40

,-~~

I

0. 3 0 eta = 1/2

0. :30

0. 2 0

eta

from left to right :

0, 2 3

zeta = 0 z e t a = 1/2 zeta = 1 zeta = 5

zeta = 10

~

0. 15

0. 1 0

0. 0 0

/

0.57

from left to right : zeta = 0 z e t a = 1l'2 zeta = 1 zeta = 5 zeta= 10

0. 0 8

I I ~ 0. 5 0

0. 55

I I 0. 60

0. 0 0

1 I 1 I I I 0. G5

[

.p,_b~

0. 50

O. 7 0

0. 55

]

0. 60 I

0. G5 0. 70 ,P,-~o' ]

Figure 3: Influence of the sensitivity parameter ( on ~/-~x for values of the position index 71 of the in]tiM guess smaller than 2/~.

References [1] C. CHAVENT,"Local Stability of the Output Least-Square Parameter Estimation Technique'", Mat. Apllc. Comp., Vol 2 no1, pp 3-22, (1983) [2] G. CHAVENT, "New Trends in Identification of Distributed Parameter Systems", Proceedings of the IFAC World Congress, Mnchen, Pergamon Press, (1988) [3] G. CHAVENT, "On the Uniqueness of Local Minima for General Abstract Non-linear LeastSquare Problems", Inverse Problems 4, pp 417-433, (1988) [4] G. CHAVENT, "Quasiconvex Sets and Size • curvature condition ; Application to non linear inversion", INR.IA Report 1017, April 89 (Submitted to JAMO) [5] G. CHAVENT, "New size x curvature conditions for strict quasiconvexity of sets", INRIA Report to appear, 1989 [6] W. W. SYMES, "The Plane Wave Detection Problem", to appear [7] G. CHAVENT, J. LIU, "Multiscale parametrization for the estimation of a diffusion coefficient in elliptic and parabolic problems", Preprint IFAC Symposium on Control of Distributed Parameter Systems, Perpignan, France, June 26-29, 1989

AN IDENTIFICATION T E C H N I Q U E FOR A D A P T I V E S Y S T E M S IN THE CASE OF POOR EXCITATION

Hong Yao Xu and Co Robert Baird Department of Electrical Engineering Technical University of Nova Scotia Halifax, Nova Scotia, Canada B3J 2X4

Abstract. An Adaptive Identification Technique (AIT) is presented primarily for adaptive systems with insufficient persistent excitation. In this technique, the P(t-1) matrix is reset reasonably back to P*(t-1), and a new forgetting factor ~.(t) is designed to track the parameters of slowly timevarying processes. In addition, a weighted filtering algorithm is proposed such that a modified least squares algorithm is presented in terms of the procedures mentioned above. Theoretical analyses and simulation results have demonstrated that the proposed AIT methodology can overcome the bursting problems of adaptive systems in the case of poor excitation. Key words:

Identification; adaptive systems; least squares algorithm; bursting; excitation;

simulation.

1

Introduction

Many identification techniques have been proposed for adaptive systems in recent years. However, the estimated parameters may burst from instability in adaptive systems, if the preconditions of these identification techniques cannot be satisfied. It is well-known that when persistent excitation is gone in adaptive systems, most existing identification algorithms will not only fail to identify the parameters correctly, but also exhibit a bursting phenomenon [1], or so-called "estimator windup" [2]. A variable forgetting factor was presented [3] to solve this problem, which can be only used in deterministic systems, so the identification problem is still unsolvable. Isermann has proposed a simple remedy so-called "switch off" when identification conditions are violated [4]. However, the method [3] is not suitable for stochastic systems, and to "switch off" the identification is not effective [5]. If there is poor persistent excitation and when the adaptive control systems run for a certain time, the bursting phenomena may occur sometimes in practical applications. The proposed AIT has a strong

468 robustness to avoid this problem and has a good convergence property. Simulation examples are given to illustrate its results.

2

Bursting

Phenomena

In

Adaptive

Systems

Consider a linear system described as follows: A(q "1 )y(t)=B(q" 1)u[t-d(t)]+C(q "1 )0~(t)

(2.1)

where, A(q'l)= a o + alq "1 + . . .

+ anq-n

B(q'l)= b o + blq-t + . . . + bnq -n , b o ~ 0 C(q-1)= 1 + c l q -1 + . . .

+cnq -n

and y(t), u(t) and ~(t) represent the output, input and noise sequence, respectively, t is the sampling instant and d(t) denotes variable time delay of the system. The noise sequence {~(t)} is a Martingale difference sequence defined on a probability space (~, E, P) and adapted to (E t, t e h0, where Et is generated by the observations up to t and Eo includes initial condition information. Considering the system (2.1), we propose: Assumption set A : (i)

(ii) (iii)

E{('~

Et-t} = 0

a.s.

(2.2)

E{ o)(t) 2 [ F.t-1} =0"2< oo

a.s.

(2.3)

N lim 1 N._~=,sup~ ~ (0(02 0

(3.5)

where Equations (3.3)_(3.5) are described similarly to Sin and Goodwin [10] in which do(t)=1 and X(t)=l. r(t-1)=r(t-2)[;L(t-1)+~[t-d

o(t)] TP(t-2)~[t-d

o(t)],

=:(-1)>0

(3.5)

If [r(t-1)13minP1(t-1)
p., V i e [ t-M+1 . . . . .

t ]

(3.24)

check the diagnosing condition (3.10) and the filtering condition (3.25) so as to decide if some proper actions need to be taken correspondingly, and keep e*t(t ) and P*(t-1) constant until (3.21) is satisfied. Step 5. The weighted filtering procedures of e(t), y(t) and u(t) are implemented through a detecting criterion such that the system can be protected to avoid such a bursting phenomenon.

(1) If the condition A

Q(t) = inf II e(k)-e*t(t-M) {I2 > E max > 0

for E: max chosen

(3.25)

ke [t-M,t] is fulfilled , we obtain the filtered parameters: k e ( t ) = ~ ) f ( k ) = e * t ( k - M ) + 1--~ ,~, {Ae(i)-e*t(t-M)} M. i=k-M+l

y(t)=yf(k)={

k ~,e'~ i=k-M+l

u(t)=uf(k)={

k ~L, e "~i u(i)}/{ i=k-M+l

y(i)}/{

k ~/_, e'F,i}, i=k-M+l

V k ~ [ t-M+1 . . . . .

> 0

k ~/_~e-~i}, i=k-M+l

> 0

ibid

ibid

t ]

(3.26)

(3.27)

(3.28)

(2) If we have A

R(t) = sup II e(k) - e*t(t-M){{ 2 < s rain

forE; min chosen

(3.29)

k e It-M+1, t ] where 0 < Emi n < E.max, k = t-M, t-M+1 . . . . .

t. Hence, no filtering procedure is needed.

A

(3) Otherwise, we only require e(t) to be filtered using Equation (3.26). Equations (3.28) and (3.29) show the weighted filtering, where e-~ i represents a weighting factor, its decreasing rate is depended on ~. M denotes the length of measurements, M=10~20, ~=0.01~0.1 are recommended here. In (3.27) and (3.28), we use a weighting factor e-~ i to emphasize old data which indicate that the system operated better in the past. Simulation results have shown that the variations of e(t), y(t) and u(t) are reduced obviously by (3.26)~(3.28).

473 It must be pointed out that in Equations (3.13), (3.14), (3.21), (3.24), (3.25) and (3.29), ^

since the same term {{0(i)-e*t(t-M){{ 2 {s calculated only once every M sampling intervals, a number of computational efforts can be saved. The MRLS algorithm has good convergence properties. Its theoretical analyses will be derived from the following lemma 2 and lemma 3. Lemma 2. For the system (2.1), the MRLS algorithm (3.1)_(3.15) is used in order to achieve:

II~(t)-ell 2

(1)

_< K l l l ~ ( 0 ) - e l l

2,

t > 1

(3.30)

where K 1 = condition number of [P(-1) " l ] - - A l . ) m a x P ( - 1 ) ' l / ' O m i n P ( - 1 ) ' l = ' o m a x and.Orni n denote the maximum and minimum eigenvalue, respectively.

lira " ~ (2) N--,o~ t=l

e(t) 2 ;L (t) +,~ [ t . d o (t)]T p (t. 2),~ [t_do ( t ) ]

40, ~(t) was very largely reduced to 0"2=0.005 which implied that the persistent excitation became poor. As a result, all the estimated A

parameters e(t) shifted to the wrong area. ~2, one of the estimated parameters, is shown in Figure 1. ^

During 500 < t < 1000, several bursting phenomena occurred and then e(t) all became false value: e(2500) = [ 0.41, -0.302, 0.005, -1.01 ]. However, in the same case, the proposed AIT can demonstrate its strong robustness, one of its results is shown in Example 2. Example 2. Consider that the system and simulation conditions are the same as those in Example I. Assumed that at t > 40 the persistent excitation o)(t) was numerously decreased to be insufficient. In this case, we selected M=15, s s 6ma• 6min =0.015 and ,u.=0.0015. The A

estimated parameters 0(2000)=[-0.428, -0.344, 1.096, 0.303] almost exactly converged to the true values 0=[-0.43, -0.34, 1.10, 0.30] for the minimum variance control of STR. This indicates

475 that the AIT can still identify the real parameters even if the input exciting suddenty becomes insufficient. One of the estimated parameters, 92, is shown in Figure 2.

5

Conclusion

Adaptive identification becomes a very difficult task of an adaptive system in the case of poor excitation. The proposed AIT can fulfill it and overcome the bursting problem. Theoretical analyses and simulation studies have demonstrated the AIT methodology that is basically established by four proposals: 1 ) The P(I-1) matrix is reset as P*(t-1) based on Equations (3.5)~(3..12); 2) Weighted filtering is given to 8(t), y(t) and u(t) by (3.25)~(3.29); 3) Remembering units are employed to store some historical data through (3.21)~(3.24) for filtering and resetting; 4 ) A forgetting factor function ~.(t) is designed according to (3.13) ~ (3.15). However, it is recognized that if system excitation is too insufficient, the AIM may not work well. Therefore, further research work needs to be done to find out the minimum persistent excitation that the AIM can tolerate.

6

References

I . Anderson, B. D. O. (1985). "Adaptive Systems, Lack of Persistency of Excitation and Bursting Phenomena". Automatica, Vol. 21, No. 3, pp.247-258. 2 . Astrom, K. J. (1983). "Theory and Applications of Adaptive Control--A Survey". Automatica, Vol.19, No.5, pp.471-486. 3. Fortescue,T. R., L. S. Kershenbaum and B. E. Ydstie (1981)."Implementation of Selfquning Regulators with Variable Forgetting Factors". Automatica, Vol. 17, No. 6, pp. 831- 835. 4 . lsermann, R. and K.H. Lachmann (1985). "Parameter-adaptive Control with Configuration Aids an~ Supervision Functions". Automatica, Vol. 21, No. 6, pp.625-638. 5. Foley, M., K. Walgama, Y. H. Jin and G. Fisher (1988). "Adaptive Control Based on a Modified Kalman Filter Predictor". Preprints of the 8th IFAC Identification and System Parameter Estimation. Beijing, PRC. Vol.1, pp.342-347. 6. Goodwin, G. C. and K. S. Sin (1984). Adaptive Filtering, Prediction and Control Prentice Hall, New Jersey. 7. Goodwin, G. C., P. J. Ramadge and P. E. Caines (1980). "Discrete-time Multivariable Adaptive Control". IEEE Trans. Automatica Control, AC-25, pp.449-456. 8. Anderson, B. D. O. and C. R. Johnson, Jr. (1982). Exponential Convergence of Adaptive Identification and Control Algorithms ". Automatica, Vo~.t8, pp. I.

476 9. Xu, H. Y. (1988). "Robust Protection-Time Delay Tracing Adaptive Control and Its Application". Preprints of the 8th IFAC Identification and System Parameter Estimation9 Beijing, PRC. Vol. 2. pp.1064-1069. 10. Sin, K. S. and G. C. Goodwin (1982). "Stochastic Adaptive Control Using a Modified Least Squares Algorithm". A u t o m a t i c a , Vo1.18, pp. 315-321.

1

t32

9 .

1.00,50.0-0.5 -1.0 -1.5

'

I

100

'

I

'

200

I

'

300

I

'

400

I

9

500

I

'

600

I

9

700

|

'

800

I

"

/

K

900 1000

Fig. 1. The bursting parameter [32 of the RLS in Example 1

0.6

132

0.4 0.20.0-0.2 -0.4

9

0

I

|

50

100

9

I

150

'

I

200

9

I

250

9

I

300

9

I

350

9

I

400

9

I

450

9

I

500

Fig. 2. The convergent parameter ~2 of the AIT in Example 2

Asymptotic

Properties

in Rational

/2-approximation

L. Baratchart, l~f. Olivi, F. |Vielonsky Institut National de Recherche en Informatique et Automatique Avenue E. Hughes, Sophia-Antipolis 06560 Valbonne (France)

Abstract: This paper is concerned with the problem of best rational approximation of given order n in the Hardy space/:/2. We show that, generically, all critical points converge to the fnnction in tI~ as n increases to infinity. This property shows in turn that local maxima can appear only for a finite range of orders. This has consequences on an algorithm to find local minima previously described by some of the authors [3].

I

Introduction

Let us recall briefly the/2-approximation problem as described, for instance, in [2]. Consider the real Hardy space H~,I~ of functions f analytic in the complement of the closed unit disk 0 , vanishing at infinity, that can be written f ( z ) = F_.k~__l.f~z -k with .fk E R. and such that the norm Ilfll 2 = g:k .t2 is finite. The assertions that the coefficients fk are real and square integrable are respectively equivalent to the facts that the function f maps the real line into itself and satisfies a growth condition at the frontier T of U: sup- 1 r>l

27~

fo ~,"]f(rei~

< c~.

As in [2], let P~ be the set of real polynomials of degree at most n, and T'~ the subset of monic polvnomials, of degree n whose roots are in U. Moreover, let ",x'l C H~,I~ consist of all rational fractions h = p/q, where p E P,,-1 and q E T'~. The problem is to minimize, for arbitrary n, the criterion F)(h) = I[/- hll ~ where h is in x-1 The relevance to system theory of this question arises from the need of describing the input-output behaviour of a given system (i.e. its transfer function) by a finite dimensional model, whose transfer function is therefore rational. For more details, and a stochastic interpretation of the 12 norm, we refer the reader to [1]. Of course, in this context, our problem can only appear as the prototype of an integral criterion, which can admit many variations and generalizations, putting for instance weight functions, aclditional constraints, a.nd going over to the multi-input multi-output case. In the sequel, however, we shall restrict ourselves to the simple formulation given above. Apart from a classical formulation using the so called normal equations [4], this problem has been mostly tackled using differentiation, namely a gradient algorithm [6]. In [:3], an algorithm is described, which converges to a local minima by integrating numerically a differential equation. In all three cases, critical points (i.e. points where the derivative is zero) are on the saddle, because they are the ones that can be computed, and the trouble comes of course from the possibility of having several local minima. 9a . ~ r L

9

478

Let us take a closet" look at ti~e criterion F}(p/q) that we want to minimize. As q is settled, the polynomial p is uniquely determined as the orthogonal projection of f onto the n-dimensionM linear subspace 1.~ of H~,it defined by lq = P~-l/q. We shall denote by L'~(q) this projection and replace tile former criterion by the following one:

~L,~(q) = f

L}(q) ~" q

At each order, we get a set of critical points for this criterion and the goal of this paper is to describe their asymptotic behavior. These critical points may be of two different kinds: either L~(q) and q are coprime, in which case the point q is sa.id to be irreducible, either they share a common factor and we get a reducible point. When q is reducible, the variational argument (due to Ruckebush) used in [1] to show that it cannot be a minimum applies as wel! to show it cannot be a maximum either unless .f is already rational. In the sequel, we shall suppose that the element f of H~.it to be approximated is not already a rational function a.nd we shMl make the mild assumption that it is analytic on a domain strictly containing the complement of the open disk U. Then we show that, generically, critical points converge in /2-norn- to the function f as the order (i.e. tile degree of the polynomials) growths to infinity. Using this fact, we deduce that criticM points which are local maxima can only appear for a finite range of orders.

2

P r o p e r t i e s o f t h e c r i t e r i o n d,i a n d its c r i t i c a l p o i n t s 9

,;'7

As in [2], we introduce a.n other Ilardy space H+lZ, which contains analytic functions g in the unit disk U, naapping tile real axis into itself and such that

sup

Ig(rJe)!=g0 < ~ .

r =

As 5 = z -~ on the unit circle T and the coefficients in the power series expansion of 5 are real, we also have I ,;" ; , I dz < f ' g > = 2 i T r J ~ .f~-:JJ(-)--. z z This scalar product verifies the two following obvious properties which will be used in the sequel: 1) for all/," E Z, the multiplication by z k is an isometry of L~.~(T), i.e. for IT' ), all .f, 5' E L2.r~t< zkf, z~g > = < f, 9 >; (1) -

-

2) for al! f , g, h in L2,rc(T) such that f9 and f h are in L=,Iz(T).

< fg, h > = < s, .fh > .

(:2)

479

Let f be the function of H~,rt to be approximated and let g = f~, its image in the space FI2,1L" + W i t h the assumption made on f, there exists a real ,\ > 1 such that g is analytic in the open disk Ua centered at 0 of radius ,\. Let q in 79~ and (7 defined as in [2] in the following way: ]

~(:) = ~-q(:). Z

T h e Weierstrass division theorem (cf [5]) or more precisely its one dimensional version (also known as the Hadamard representation, cf [7]), applied to the function cq~ of/:f~rt shows ~hat there exists a unique function v(g , q) analytic in Ua and a unique polynomial w(g, q) of degree n - t such that: 9~ = o(g, q)q + w(9, q). It follows from [2] that if you seek in the space of rational fractions Vq for the minimum Lj(q)/q of ~by, you get Lf(q) = w(g, q). The quotient v(g,q) of the former division which we shall simply denote by v when no confusion can arise, possesses the helpful property to give the value of the criterion 02 at ;he corresponding point q: p r o p o s i t i o n 1 Let q be eLpoint of 79~ and v the correspondin 9 quotient, II~ll ~ = eTfq).

Proof: Using (1) and (2), the value of the criterion at q is:

Z(q) ' ' _ , g - L(q) _ >.

L ( q ) , f _ L(q) > - - < g -

62(q)= .

This set of derivatives vanishes iff for all polynomials P in C ~-1 [z]

< f -L(q) q

'

pL(q) >=0. q2

From the definition of the scalar product, we get

or

q

q

If a is a root of order m of the polynomial q, this implies f ,_, L ( q ) , , , dz Vl ~ {L...,m},jT v t , ) - - ~ - - t ~ j i : _ a)~ - 0. Then, hy the residue formula, the following derivatives should vanish:

W 6 {O,...,n~- i}, rvZ(~)] ~''

L--~J

B y induction, o' is a zero of order m

of

vL(q).

(~) = 0.

This proves the equivalence of the two

assertions and proposition 2.

3

A s y m p t o t i c behaviour of critical points

We denote by Cn the subset of T'~ containing the critical points at order n, and we put C = U,C,. By choosing a point in C, for each n, we construct a sequence of quotients (v,~). In order to prove that the family of functions (v,,) is normal, we use the integral representation:

1 f~ g(r162

v ( z ) = 2i---~..,

q(()

de

(-

z

where it is any real such that 1 < ;L < .\. Using this expression we prove the l e m m a 1 Let /

a real number such that 1 < #' < ~, there exists on the open set Uu, an uniform bound for the set of functions (v~) which depends only on the function g.

Proof: On the unit circle T, the quotient i[/q is of modulus 1. Then by using the maximum principle over the complement of the unit disk U, we get

v~ ~ c-u, lq-~I ~(() _ N , W e :r.,, Iw.(:) - w.~(~)l < Iw.m(z)l. By Rouch~'s theorem, w, and wum will have the same number of zeros in the open set U,,, but using proposition 2, a quotient corresponding to an irreducible critical point of order n has at least n zeros in U. As the order of points in the subsequence (wp) tends to infinity, wti,,, must be equal to zero. This is a contradiction with the assumption made on the circle T~,,. By letting IF vary continuously, we get a compact circular annulus containing infinitely many zeros for wti~ and thns this limit n-lust vanish on the open disk Uu. We just showed that every convergent snbsequence of (v,a) converges to zero uniformly on every compact set of U,,. Since it is normal, this is true for the sequence (t,,) itself. By using tim proposition 1, we get the F-convergence of any sequence of irreducible critical points to tile function to be approximated as the order of points tends to infinity. In order to generalize this fact to sequences containing also reducible points, we shall have to restrict ourselves to functions with tile property that Cn is finite for each n. This can be proved to be generic in various contexts. For instance, it is shown in [1] that such functions form a set of first category in the disc algebra of U u where # > 1. We first prove the p r o p o s i t i o n 3 Let p E 79~, a critical point such that the fraction Ly(p)/p is irreducible. Let r ET)~ a n d q = p r . Then (i) LI(cl) = rLl(p) ~ff r divides v(g,p). (ii) if (i) is verified, we have the following equivalence: q is a critical point iff p is a critical point and r divides the quotient v(g,p)/p.

Proof:

Apply the above mentioned division theorem to gc7 and gl'5:

J~7 = v(j, q)q + rz(q ) and ~ = ~,(g, p)p + LAp). Multiply the second equation by F:

g~7 = v(g,p)~p +/:LAp). Let us denote v(g,p) by vp and divide vpF by r:

vpF = v(vp, r)r + L~g(r). Plugging this expression in (3), we get

s~ = v(,,,, r)~ + (Lo#(r)p + ~L:(?))

(3)

482

and the second term on d~e right-hand side is of degree strictly lower than that of q. Thus we have t,~F = v(9 , q)r + Lff.(r)

(4) In order to prove (i), suppose first that L/(q) = rL/(p) holds. Applying to the second equation of (4) the assumption that p divides Lj(q), and hence that + divides Lf(q) we get (nsing the nsual notation for division) dpL~(~). As roots of/5 lie in the complement of the unit disk, r divides L~g(r). But L~-(r) is the remainder of a division by r and then it must be zero. The previous pair of equations beconles

I zj(q) = ,'ZAp),

(5) t,pr

= v(g,q)r.

The second equation of (5) shows that r divides vp. Conversely, suppose that @% The first equation of (4) implies that rIL~7,(r ). As the degree of L~7,(r) is strictly lower than that of r, it must be zero and (4) reduces as in the former case to (5). Suppose now that the assertions of (i) are verified and let prove (ii). By proposition 2, the fact that q is a critical point means that qlu(9, q)Ll(q) i.e.

p,-1~,(7)~G(p) or

pq~,~&(,,)

which yiekls that p]vpL1(p) i.e. p is critical. This yields also that r ] ( ~ ) L f ( p ) . Moreover as (1) is verified, r ] ( ~ ) p . As LZ(P) and p are relatively prime, we deduce that r]-~. Conversely, this last relation implies that q[vp. By the second equation of (5), we get q]u(g, q)r~. As roots of ? lie in the complement of the unit disk, q divides v(9, q)r and in particular v(9, q)rLl(p). Then using the first equation of (.5), q divides v(fl, q)Lf(q) and q is critical. O If critical points are irreducible, there exists an order over which the corresponding quotients v have more than any preassigned number of zeros. To get our generalization, we prove that such an order exists even in the case of reducible points. Following proposition 3, these points are generated by adjoining to irreducible critical points q of lower order, zeros from v/q. \u show that for a fixed order, the number of such zeros is bounded from above. Let I,~ be the subset of C, containing irreducible critical points of order n and let q in I , . We denote by Z(v/q), the number or zeros of the quotient u/q in the disk U. Then Z(v/q) is finite. Indeed, with the assumption made on f , the quotient v is a.nalytic on the open set U.\ which conta.ins the compact disk C. If Z(v/q) is not finite, v vanishes in U which means that the function f to a.pproximate is already a rational fraction, but we discarded this case ill the introduction. Let us set one more notation:

R. = max{Z(,,/q),q ~ z.}, then R , is finite. It is obvious when I , itself is finite. Otherwise, let suppose that R , is not finite, then we can select a sequence of critical points (qt) in In whose corresponding

483 quotients (vl) have a n u m b e r of zeros growing to infinity. From this sequence, we can extract as before a subsequence which tends to zero. But this means that there is a. sequence of critical points of order n which converges to the function f and then f is again rational. Indeed, we have

gqt = vzql + Lf(ql).

(6)

T h e functions vt converge uniformly to zero on ~" and the polynomial.~s ql and (7~ are also b o u n d e d on/_7 as their degree and their coefficients are. T h e n by (6), Lf(q~) is bounded. Vve can successively extract two subsequences such that Li(qt ) and ql will converge respectively to some polynomials p and q, uniformly on U. By taking the limit, the equation (6) becomes gq = p on U and thus f is equal to/3/(~. As a conclusion, at order n + R~, quotients v corresponding to irreducible critical points as well as reducible ones which come fl'om irreducible points of order n have all a.t least n zeros. At order maxp=0

or

gi E {0 .... , n -

1},< f

L(q__i),.o~7(L(q)) . > . < f _. L(q), ziL(q) > = 0 . q q q q=

As L_~ is the orthogonal projection of f on tile space l,~, we know that q Vk E { O , . . . , n - 1 } , < f

L(q) z k --, q

q

>=0,

(7)

so the last equality reduces to Vi E { O , . . . , n - 1 } , < f

L(q) -iL(q) - -q, - q2 > = O.

(s)

484

Combining (7) and (8), we get q- - < f -

q

>

>

J~q/

~

>

and using (9), we gct

2 < 7' q

q

Oqi

> "

But

=

[Cl2Z'~-i _ 2(lqz i] =

2q~13,

so by (9) again, we obtain 05

OqiOqj < f -

_L _( q ) , f _ q

L(q) > = 2 [ < - -v,, - -vj> + < f q q q

L(q),9(tzivj > J . q ~ q3

Now, the variation of the criterion in a neighbourhood of the critical point q following a direction given by tile real vector (.\1, ..., An) in the space 7~ is

;~ As the family of polynomials (ui) is independent, we choose the numbers ,\i such that

486

The value of (})Aq(,\I,..., ,\,) becomes

[ ui u,_>+

As the order of q growths, f _ L_~ tends to zero, then the variation following the chosen II q Ii . . . . direction becomes positive which means that the critlcat pomt q ma.y not be a maxnnum. t h e o r e m 2 Let f , be a function as in theorem 1. maxima can only appear for a finite range of orders.

Then, critical points which are local

References [1]

L. Baratchart, Sur l'approximation rationnelle l a pour les syst~rnes dynamiques lin4aires, Th~se de doctorat d'dtat, Universitd de Nice, September 19S7.

[2]

L. Baratchart and M. Olivi, Index of critical points in /=-approximation, Systems & Control Letters 10 (19SS), 167-174.

[3]

L. Baratchart and M. O!ivi, New tools in rational L 2 approximation, proc. of tile 8th IFAC Syn~positun on Identification and System Parameter Estimation, Vol. 2, Beijing 198S.

[4] J. Delia Dora, Contribution ~ l'approximation de fonctions de la variable complexe au seas de Hermite Paddet de Hardy, th~se d'dta.t Univ. Sc. et Mdd. de Grenoble, 1980. [5]

R.C. Gunning and H. Rossi, Analytic functions of several complex variables, PrenticeHall, Englewood Cliffs, N J, 1965.

[6]

B. Wahlberg, On system Identification and Model Reduction, rep. LiTHISY-I-0847, LinkSping University, 1987.

[7]

J.L. Walsh, Interpolation and Approximation by Rational Functions in the Complex Domain, AMS colloq. Pub. XX, 1969.

A M i n - M a x T e s t i n g A p p r o a c h to Failure D e t e c t i o n and I d e n t i f i c a t i o n Elias Wahnon IRISA/Centre IINRIA de Rennes 35042 Rennes Cedex, France

Abstract The problem of failure detection and identification in stochastic linear timeinvariant systems is approached as a Min-Max Hypothesis Testing problem. A statistical test is proposed and a Min-Max testing technique is followed, leading to a linear transformation of the residual vector, which results in a new statistical decoupling of failure influences. The resulting problem is to detect a change of the mean in a gaussian vector sequence and to estimate the random time of failure occurrence.

1

Introduction

Detection and identification of actuator and sensor failures is an important problem in engineering systems which rely on sophisticated control systems to achieve high Performances. The methods to deal with the Failure Detection and Identification (FDI) problem are generally classified in two main categories : 1. Redundancy Methods : the simplest way is the hardware redundancy, i.e., several identical measurement and/or control units, such as the quadriplexed digital flight control system, where the failure decision is generally made by a majority voting scheme. However, this passive m e t h o d increases the system cost and complexity. An active way is to exploit the analytical redundancy or parity checks, i.e., temporal relationships between the actuator inputs and sensor outputs which are, for perfectly known deterministic systems, zero when the system is operating normally and deviate from zero when a failure occurs, (see [7]). 2. Filtering Methods : known also as Detection Filters and originally proposed by Beard[2], this methodology is based on the design of a full-order observer where the gMn is chosen so that the direction of the observer output error vector in the output space, allows to identify the failed component. White and Speyer[ll] reformulated this theory as an eigensystem assignment problem, developing the required linear implicit equations for the observer gain and the solvability conditions. Massoumnia

488

et al. [8], relying on geometric concepts and exploiting the dual relationship between the FDI and the Control Decoupling problems, presented a method for designing a dccoupling filter. Viswanadham et ai.[10] proposed a frequency d~,main approach to develop generalized parity space techniques based on the theory of stable factorizations and showed the equivalence of this generalized parity vector approach with the concept of detection filters. Essentially, the so far cited references concern the problem of FDI for deterministic linear-time-invariant (LTI) systems. However, as was recently pointed out by Kerr[5], the original motivation was the problem of failure detection in navigation applications which are described by stochastic LTI models. In this paper, the FDI problem for stochastic LTI systems is approached as a Min-Max Hypothesis Testing problem. A 'detection filter', i.e., a linear filter, based on the no-failure hypothesis H0, where the gain matrix is chosen so that, upon the failed system hypothesis Hi, the mean of the filter residual vector propagates along distinct but fixed directions for each failure, is implemented. A statistical test is proposed and a Min-Max hypothesis testing technique is followed (see [9], [6], where this technique was proposed for the detection of changes in the AR part of a multivariable ARMA process). This technique leads to a linear transformation of the original residual vector which results in a statistical decoupling of failure influences. The problem, finally being a problem of detecting a change of the mean in a non-white gaussian random vector sequence and the estimation of the random time of occurrence, simple well-known detectors can be used. Two methods are also discussed for the special case where the number of failures is higher than the dimension of the measurement vector. In Section 2, the FDI problem for stochastic LTI systems is formulated. The Min-Max hypothesis testing approach is developed in Section 3. Extension to the special case of more failures than measurements is presented in Section 4. Simulation results of a working example are discussed in Section 5. 2

Problem

Formulation

Consider the normally operating system described by the following state space model :

x(k+l)

= Ax(k)+Bu(k)+w(k) y(k) = Cx(k) + v(k)

where x E R" and y E R "~.

w(k)~N(O,Q) v(k) ~ N(0, R)

(1)

489 The failed system is modeled as :

x(k + 1) = Ax(k) + Bu(k) + w(k) + ~ I{k>_n}fimi(k)

i=l

(2)

y(k) = Cx(k) + v(k) where fi 6 R '~, assumed known, is the failure direction associated with the i th actuator failure, rni(k) is a general time-varying unknown scalar which may be a function of x(k) or u(k), 7/ represents the random time of the i-th failure occurrence and IA is the indicator function of the event A. The FDI problem can be stated as the problem of detecting on-line a failure and to isolate, statistically, the influence of the i-th failure from the j-th, j # i, so that an identification of the failed component should be possible. For simplicity of notation, take 7i = T, i = 1 , . . . , r. Assume now that a linear filter, based on the nominal model, Eq. (1), has been implemented where the gain matrix, K, should be later specified :

u(k) = ,.,(k)-C~o(k) ko(h + 1) = A~:o(k) + Bu(k) + IC,(k)

(3)

Let H0 be the hypothesis which states that the system is described by the nominal model, Eq. (1), and H1 the hypothesis that the system is described by the Eq. (2). Under Ho, u(k) is a zero mean non-white gaussian sequence with variance matrix, V(k), given by :

v(k) = cP(k)c T + R P(k + 1) = (A - I(C)P(k)(A - KC) T

(4)

+ KRK T + Q and assume that the steady state value of V(k) has been attained, denoted here by V. Under HI, decompose u(k) into : ~(k)

=

y(k) - c ~ 0 ( k )

=

v(k) - c ~ ( k )

+ c [ ~ , ( k ) - ~o(k)V{k>,}

~ ( k ) + ~(k)hk_>, } where ux(k) ,,, N(O, V) and/~(k) is given

by:

o~(k -I- 1) -- (A - KC)o~(k) -F f, ml(k) ,(k) = Ca(k) ; ~(k_~,)e~,(k)

where ei is the i-th m 9 1 unit vector, leads to the following equation for ~he mean variation :

o,(I~ + 1) = (A - Z(C)~(k) - K~i6,(k) The presence of the i(ei as a failure direction is a problem since K is not known. However, as pointed out by White and Speyer[11], taking gi being any direction such that e{ : Cg~, this equation can be written as Eq. (5) and #(k) can be made to lie in the plane generated by CKel and ei. Therefore, we use the actuator failure case, Eq. (1) as the general problem. The F D I problem can be formulated as the following hypothesis testing problem :

U,~de,'Ho: ~,(k) ~ iV(O, V) UnderHl : 3% k < T : V ( k ) ~ N ( O , V ) k >_ ~: . ( k ) ~ N ( , ( ~ ) , V) 3

The

Min-NIax

Testing

Approach

The statistic :

(G)

to 2 .r(k)v-l.(~) allows to test beLween H0 and Hi. Furthermore, Eo(to) =

m

El(to) = I m

m +

,~(k)v-b(k)

kT

where Ei(') is the expectation under Hypothesis Hi. Now, assume that there exists a gain matrix K such that the failures associated with the fl, i = 1 , . . . , r, in the system described by the mean Eq. (5), are dctcctablcs according to the definition given by Beard [2], i.e., #(k) maintain a fixed direction in the output space due to fi and all eigenvalues of A - K C can be arbitrarily specified, except for the constraint on the conjugate symmetry. This gain matrix ensures the decoupling of the failure set, { f l , . . - , fr}, influences in tlle output space and can bc computed, for cxamplc, by the method presented in

491

[11]. Therefore, assuming that a such gain matrix has been found, we have that for each fi,i = 1 , . . . , r :

rank[Cfi, C(A - [(C)fi, C(A - KC)2fi,...] = 1 and

assume

(7)

also :

rank[CfI,Cf2,...,Cf~] = r, r _ r) , given by :

Under the condition in Eq. (7), #(k) maintain a fixed direction, Cf~, in the output space due to {mi(1);l = k - 1, h - 2,...,7"},i = 1 , . . . , r . Therefore,

(adl,. .,Cf,mx){[k 1,] + +

i A21rnl(k - 3)

;~2~mr(k 3)

i

+ ....

2,1

(Cft,Cf2,...

Cf~)

I

3.42

and 7, Eq. (10), can be formally writen as : =

....

We look now to isolate, statistmally, the different failure influences by building, for each failure, a test which should be insensitive to the other failures. Take, for example, the first failure fl and partition the matrix )c =zx [ 3S cTl i - T t2] ,whereSll F22 is a scalar and ~c22 is a ( r - 1) * ( r - 1) matrix. We shall follow a Robust Min-Max hypothesis testing technique (see [9], [61), by considering for the failure f;, the least favorable case for the remaining failures ( f 2 , - . . , f.)- For a fixed test leve! c~, given by : PHo(to > A) < a', where, A is the threshold fixed by c~, the power of the X2-test to, Ptil(to > A) is an increasing function of the non-centrality parameter 7, Eq. (10). By the Min-Max approach we are looldng to find :

min

(~,-..,~,)

7 =

min

(~,...,~r)

[.All, (.s

, .s

[-7-11 F12 ] ] .T-r .T.22

3,41 A'42 :

3~I,

492

The minimum, obtained for : (.M2,...,.M~) r = --~-~19rT.A41, is given b y : min 3' = 3Al(Scm - ~129r~)'Su~)-A41 (~4~,...,M,)

-~175axl ] = 3/li{(1,-.7-i2.7-~i)(cfl,..., C f ~ ) r v - i } v {y-l(c

f l , . . .,

cA)(1,-f~27~)r}ax~

Tlicrefore, considering the following transformation of the original residual vector sequence, u( k ), : ~,,(k) = ( 1 , - 7 1 2 : r ~ ) ( c f ~ , . . . ,

cA)rv-l,(k)

(11)

we have that ul(k) is a non-white gaussian scalar sequence, zero mean under H0 and witll a non-zero mean under Hi (for k > r) given by :

E(vt(k))

= (1,-.7"i2.Y~i)(cfi,...,Cf,.)Tv-1/.z(k)

= (l'-212"~)(CI~'""C/~)rW-~(C/~'""C/~) =

(~,-j:,~:#)j:

J(4.

~

and, V a r ( vl ( k ) ) = .T m - .T12.T~I.T'f~2. i.e., ul(k) is only sensitive to the failure fl and is not affected by the remaining

failures (f2,.-., A)Finally, returning to the original problem stated in Eq. (2), given the failure set {f~,.-., fr} with the unknown random times of occurrence {T1,..., Tr}, we build for eacli failure, f/, the required transformation, ui(k), of the residual sequence u(k) which is essentially a statistical decoupling of failure influences. The problem being now of detecting a change of the mean and the estimation of the r a n d o m time of occurrence, simple detectors can be proposed. In particular, the well-k/iown Hinldey detector can be useful, implementing for each ui(k) two sucti detectors, to detect an increase or a decrease in the mean of ui(k). The Hinldey detector, introduced in [4] and analysed in [1] is given (for example, to detect an increase

493

in the mean of/]i(k)) by the following recursive procedure : TO

~

0.

~ffk

:

T k _ 1 "-~ Y i ( ] r

Mk

=

Alarm

5

- - ry

rain Tj

(12)

l<j

where A is the threshold and 5, fixed apriori, is the minimum jump to be detected. Remarks: 1.) The Min-Max technique leads to a linear transformation of the residual vector v(k) which hides, for each failure, the subspace spanned by the remaining failures in the output space. Consequently, the existence of a gain matrix satisfying the condition in Eq. (7) is required. Otherwise, different failures m a y span the same subspaces and no hiding should be possible. 2.) Even though the Hinkley's detector was developed to detect a change of the mean in a white gaussian sequence, the experience shows that this detector is robust with respect to the whiteness assumption. 4

More

Failures

than

Measurements

In the special case, r > m, i.e., more failures than measurements, the assumption, Eq. (8), cannot be satisfied and some modification are required for the identification of the failed components. Two methods are here discussed : 1. Partial Filters : Take, for simphcity, rn = 2 and r = 3, i.e., (fl, f2, f~). Then, if we assume that the probability of simultaneous failures is very low (i.e., zero), then by building two filters, according to Section 3, the first for f; and f2, and the sccond for fa and, for example, ft, we obtain, by adding an heuristic reasonning, a complete failure identification system. However, if simultaneous failures are assumed, we are not generally able to identify the failed components, except for the case where the mean jump size m a y be used to make the failure decision. 2. Clustering : Define the following distance measure between fi and fj (see an analogous definition in [3]) :

(Cfl)T(cfJ) d(fi, fj) = l - pii = l - { Cfi ll C f j l

(13)

and make the required clustering, i.e., replace by a single failure vector fi the failure vectors fj which are close in the sense of the distance measure, Eq. (13), such that we stay with the required number of failure vectors. Note that pij is nothing but the cosine angle between the images of fi and fj in the measurement space. Let us r c m ~ k that with the clustering method, some failed components m a y not be

494

well identified since the failed component direction in the output space has a main projection in the clustered direction but also a smalt component in other failure directions.

5

Simulation Results

Given an stochastic LTI system described by the Eq. (1), where : A=

034 123

-3

; /3 = [fl, f2] ; fl ----

0 2 5

1 0

=

11 ] -i/2 i/2

1

C=

[0101 0 0 !

; Q = G G T;

G=

1/2

t/3

0

.05

the condition in Eq. (8), rank [Cfi, C f2] = 2 = dim y, is satisfied and the gain matrix K, calculated by the method presented in [11], which ensures the condition 25/8 91/24 in Eq. (7) is given by : f ( = 5/4 53/12 . Furthermore, the closed-loop matrix 2 29/6 A - I ( C is stable and its eigenvalues are freely chosen ( here, Ii = 1/2, A2 = 1/4, and Aa = 1/6). The steady-state variance matrix, V, has been computed according to Eq. (4) and the required transformation of the residual vector is given, according to Eq. (11), b y :

where z,r(k) isolates the influence of {ml(j))~=r and u.~(k) the influence of {m:(j))~: T. Since the system matrix A is not stable a feedback control has been used : r .4 l.S9 4 . a 7 ] u(k) : - D ( x ( k ) - Xre/), where D = ] j and ~.T : (1, 1, 1). 1.27 8.52 17.33 "~ef L Fig. 1 presents the simulation results for the case where at the instant 300 the second actuator faiied (a constant bias failure of 10). For the nominal residual vector, both elements of this vector jump at the failure instant while for the transformed residuals, only u~(k) jumps while u~(k) rests insensitive to the failure f2Therefore, the transformed residual sequence performs a statistical decoupling of failure influences, where u~(k) is only sensitive to the failure fl and u~(k) to the failure f2-

495

6

Conclusion

A Min-Max hypothesis testing approach for failure detection and identification in stochastic LTI systems has bcen presented. A statistical test has been proposed and a Min-Mmn approach has been developed. This approach leads to a linear transformation of the original residual vector which results in a statistical decoupling of failure influences. Simulation results of a working example have also been presented. The problem of failure isolation by the Min-Max approach, when the optimal gain matrix is used in the filter which generates the innovation sequence, is actually investigated.

Acknowledgment I would like to thank Dr. A. Bcnveniste and Dr. M. Basseville from IRISA/INRIA, Rennes, France, for the fruitful discussions.

References [1] M. Basseville, 'Edge detecticn using sequential methods for change in level Part II : Sequential detection of change in the mean', IEEE Trans. Acoust. Speech Sig. Process., vol ASSP-29, 1981, pp. 32-50. [2] R.V. Beard, 'Failure accommodation in linear systems through self reorganization', Mass. Inst. Technol., Cambridge MA, Rep. MVT-7I-1, Feb. 1971. [3] A. Benveniste, M. Basseville and G.V. Moustalddes, 'The asymptotic local approach to change detection and model validation', IEEE Trans. Automat. Control, vol AC-32, 1987, pp. 583-592. [4] D.V, Hinkley, 'Inference about the change-point from cummulative sum tests', Biometrika, voI 58, 1971, pp. 509-523. [5] T.H. Kerr, 'A critique of several failure detection approaches for navigation systems', IEEE Trans. Automat. Control, vol AC-34, July 1989, pp. 791-792. [6] G. Le Vey, 'Analyse modale et surveillance vibratoire des machines tournantes', Thhse de l'Univ. Rennes 1, 1988. [7] X.C. Lou, A.S. Willsky and G.C. Verghesse, 'Optimally robust redundancy relations for failure detection in uncertain systems', Automatica, vol 22, 1986, pp. 333-344.

496

[8] M.A. Massoumnia, G.C. Verghesse and A.S. Willsky, 'Failure detection and identification', IEEE Trans. Automat. Control, vol AC-34, March 1989, pp. 316-321. [9] A. Roug6e, M. Basseville, A. Benveniste and G.V. Moustakides, 'Optimum robust detection of changes in the AR part of a multivariable ARMA process', IEEE Trans. Automat. Control, vol AC-32, 1987, pp. 1116-1120. [10] N. Viswanadham, J.H. Taylor and E.C. Luce, 'A frequency-domain approach to failure detection and isolation with application to GE-21 turbine engine control systems', Control Theory and Advanced Technology, vol 3, MITA Press, March 1987, pp. 45-72. [11] J.E. White and J.L. Speyer, 'Detection filter design : Spectral theory and algorithms', IEEE Trans. Automat. Control, vol AC-32, July 1987, pp. 593603.

1

0.5 0 -5 -0.5 -I0

0

200

400

i

i

15

600

-1

0

*,,(k) : N.mi.al II~idual

200

400

600

*.~(k) : 'l'r,.~h.rm.M Ih:~id..I

10 5 0

0

-5 -10

9

0

a

t

200

400

_~

600

0

l

a

200

400

600

Fig 1 Time histories of Nominal and Transformed Residuals for the case of a Constant Bias Failure of the Second Actuator.

I

ON-LINE DETECTION OF MINIMAL ORDER FOR LINEAR PIECEWISE I STATIONARY SYSTEMS

t

M. GUGLIELMI Laboratoire d'Automatique de NANTES U.R.A. 823 Ecole Nationale Sup6rieure de M6canique 1 rue de la No~ 44072 NANTES Cedex 03 Membre du GRECO Traitement du Signal et des Images Abstract: This work deals with the detection of minimal order for linear systems with piecewise constant coefficients. In this case, it is impossible to use off-line algorithms. We propose, here, a real-time method able to detect the minimal order of a linear system. The basic mathematic tool is given by a simple divisibility condition of the charateristic polynomial of the differential equation for stationary systems. This necessary and sufficient condition is used to define an algorithm able to give the reduced equation. Then, we show that this result can be directly extended to linear piecewise stationary systems for which we search a change if the order. The real-time method that we obtain, is checked in simulation on deterministic systems and on systems with measured noisy informations. I- INTRODUCTION: The knowledge of the minimal order for dynamical systems is a very important information. In control systems (where supplementary modes may be a drawback) as in the pattern recognition problem in the aim of classification (diagnostic...), it is fundamental to describe the linear observed system by a lowest order differential equation. For stationary system, it exists well-known algorithms able to get the exact minimal order. In the determistic case, the principle of these algorithms, like the HO's one, are based on the calculation of the rank of the HANKEL matrix [1] build with the input/output of the system. In the stochastic environment, the order is also obtained by using algorithms which compute the rank of matrix build with the variance/covariance informations [2]. The mimal dimension of the system can also be directly obtained by using the AKAIKE's criterion [3]. Finally, some parametric identification methods can be implemented with a on-line test to check the process order (BURG-LEVINSON [3]). The most important drawback of these methods is that they need a large sample of signal, so they are unable to work on real time. In many cases, it is very interesting to have a mean able to get the order information of a dynamical system from real time input/output information, specially if we turn our attention on systems for which the

498

order can decrease. The reducibility of a differential equation can be studied by the means of the one-parameter groups [4]. But this approach is not quite easy to use in an algorithm based on input/output informations. Our paper deals with the definition of areai time algorithm able to detect a change in the order for piecewise stationary linear systems. Its organization is: after a simple example, in the paragraph 1I, we give the necessary and sufficient condition which must be satisfied by a nth linear stationary system to be represented by a n-1 th differential equation. This condition is a relation concerning the initial values, which uses the quotient of the division of the characteristic polynomial of the differential equation by its factors. From this condition, we obviously get a generalization (paragraph III) for a preducibility greatest than one. In the IV paragraph, we propose a algorithm able to detect on-line a change in the order when the linear system has piecewise stationary coefficients. Two examples are given in paragraph V. I[: N E C E S S A R Y and S U F F I C I E N T R E D U C I B I L I T y C O N D I T I O N : II-1: A simole example: Consider the following second order linear stationary system: y(2)(t) + aly(1)(t ) + a 0 y(0)(t) = 0 diy(t) . dt I Among all the trajectories obtained in the phase plane @~ ) when the initial conditions y(0)(0) and y(1)(0) are varying, two of them are straight lines. In this case, the trajectories can be represented by a first order differential equation. For example, the figure I shows that for the system described by: y(2)(t) + 3 y(1)(t) + 2 y(0)(t) = 0 In the paper, we note: y(i)(L) =

.............................. p 09, -2.00

-1.50

-1.00

Notice that the two straight line trajectories are the solutions of the 2 nd order differential equation: y(2)(t) + a!y(D(t ) + a 0 y(0)(t) = 0

-0.50

- 1 . 0 0 -"

(only if the initial conditions verify y(1)(0) + 2 y(0)(0) = 0 case [~ or y(1)(0) + y(0)(0) = 0 case [~]). Here, we

- 2 . 0 0 "3

~3.00 -~ [rojectolce de :~ + AY + BY = 0 (A=3, B=2) en fonctlon des conditions inltio}es

FIGURE 1

propose a method which detects directly, in the general n th order case, the presence and which deletes the supplementary useless dynamics in the differential equation.

499

II-2: Reducibility of 1:

Let's have the linear homogeneous system (Y) described by the linear differential equation with constant parameters: y(n)(t)+ aly(n-1)(t)+ a2y(n'2)(t) + ......... + any(0)(t) = 0

(eq. tI-1)

With the n initial conditions y(n-1)(0), y(n-2)(0) ..... y(0)(0) We suppose that y(t) belongs to C ~176 diy(t) We note: y(i)(t) - dt i -T :~ (t) = [ xl(t ), x2(t) ......... Xn_l(t)] T = (t) = [ Xl(t), x2(t) ......... Xn_!(t), Xn(t) ] P()~) is the characteristic polynomial of the differential equation II-1 When it will be necessary and when no ambiguity will be possible, we shall use equally, in the paper,P@) like a differential operator or like a polynomial in of X.When P(X) is the fifferential operator, we note: P(X) o x(t)= x(n)(t)+ alx(n-1)(t)+ a2x(n-2)(t) + ......... + anX(0)(t) II-2-1:Theorem: The trajectory described by the equation (II-1) can be obtained by a n-1 th order linear differential equation if and only if it exists a set of n-1 coefficients {a i ) such as if the polynomial: Q(X) = ) n - l + o~ ~n-2+ c~2 )~n-3 § ....... + O~n.1 1 we have: y(n'l)(0)

+ Cr

divides

) + cc2y(n'3)(0) + ....... + c~n_ l y(0)(0) = 0

In other words:

y(n'l)(0) = -

where

y T ( 0 ) = [y(n'2)(0), y(n-3)(0)...y(0)(0)]

P()~)

(eq ii-2)

~Ty(0)

Demonstration: 1 ~ Necessary_ condition: Suppose that y(t) can be represented by the n-lth order differential equation: y(n-1)(t)+ C~l y(n'2)(t)+ (z2y(n-3)(t) + ........ Then, the characteristic polynomial:

+ Ctn_ly(0)(t) = 0

(eq. II-3)

5OO

Q(X) = xn-l+ oc1 ~n-2+ a2 )~n-3 + ....... + an_ 1 divides P(X). From the division of P(~,) by Q(X) which gives: P(X) = (7~ - ~0) Q 0 0 + K

where K is a constant.

we have: P()0 o y(t)= [(X- X0 ) Q(L) + K] o y(t) Thanks to the properties of the differential operator, we get: 0 = [ (X- %0) Q(X) ] o y(t) + K o y(t) 0 = (7~ - 7~0) o[Q(X) o y(t) ] + K o y(t) Thus:

0= 0 + K o y(t) = K y(t) Finally K=0 so

Q(X) divise P(X)

In the other hand, the condition II-2 is obviously satisfied: Q.E.D. 2 ~) Sufficient condition: Let's have the differential equation with the characteristic polynomial Q(X). By hypothesis, we suppose that: Q(~) = ~n-l+ o~1 ~n-2+ cc2 ~n-3 + ....... + ~n-1 divides P(X) If w(t) is a trajectory defined by: w(n-1)(t)* a 1 w(n-2)(t) + ........ +con_ lW(0)(t) = 0

(eq II-5)

with: w(i)(0)= y(i)(0) for i varying from 0 to n-2. We define e(t) by:

e(t) = y(t) - w(t).

From the expression: e(n)(t)+ a l e ( n - 1 ) ( t ) § a 2 e ( n - 2 ) ( t ) § .........

+ ane(O)(t)

We can easily show (Annex 1) that e(t) satisfies: i=n i=n-1 e(n)(t) + ~ ai e(n-i)(t) = - anW(0)(t ) + ~ (~ - ai)w(n-i)(t ) i=1 i=1 The second member of this equation is equal to 0 due to the fact that the characteristic polynomial of this part is:(7~ Q(X) - P(X)). Using the hypothesis, we have:

P()0 = (7~ - X0)Q(X).

Thus: (Q(X) - P(X)) ow(t) = {9~ - (9~ - 9~0) 1o [Q(k) o w(t)] = ~0 [Q(X) o w(t)] = 0 So e(t) -- 0

t , if and only if all the initial conditions are equal to zero. For

i from 1 to n-2, it is satisfied by definition of e(t).

501

To get e(n-l)(0) =0 we have: e(n-1)(0) = y(n-1)(0) - w(n-1)(0) y(n-1)(0) = w(n-1)(0 ) = . {o~1 w(n-2)(0) + ........ +t:Zn_lW(0)(0)} = - {(z1 y(n-2)(0)+ ........ +an_ly(0)(0)} = -

-T ~ Y(0)

So, the trajectory w(t) is the same that y(t) if the above condition is satisfied and we can say that: y(t) has a model described by a n-1 o r d e r d i f f e r e n t i a l e q u a t i o n Q.E.D.

II-3: Reducibility of order >1; The above result can be enlarged: 11-3-1 :Theorem: The following linear differential homogeneous system: y(n)(t)+ aly(n-1)(t)+ a2y(n-2)(t ) + ......... + any(0)(t) = 0 can be represented by a linear differential homogeneous equation of order 1 (l O, k E f_fl(to,T) such that for almost all t E [to,T] the map F(t,.) is k(t) - Lipschitzian on y(t) + f i b Ha) The function t~-.~ dist(g(t),F(t,y(t))) belongs to s )

From [2, Theorem 8.2.8 and Corollary 8.2.13] we deduce that the function t ~ dist(g(t), F(t, y(t))) is measurable whenever assumptions H1), II~) are satisfied. Theorem

=

1.2 ([15]) Let 6 > 0, M = sup~e[o,T_to ]HG(t)[I. Assume that l i t ) - II3) hold true and set

re(t):

(. Z

,(t)= re(t)(, + Z (s)ds)

If , ( T ) < fl, then for all xo E X with I[yo-xo[I O, there exist x e S[to.T](X0) and f Es T; X ) satisfying (g), (10) such that for all t E [to, T] fix(t)

-

y(t)l I
0 such that for all 71 near y(to) we have

d i s t c x r ((y,g), { ( x , f ) is a trajectory-selection pair o[(8) on [to,T] ,x(to) = ~}) _< LILT]- Y(to)]l We compare next trajectories of (8) and of the convexifted (relaxed) differential inclusion:

z'(t) e Ax(t) + ~'~ F ( t , z ( t ) )

(12)

We say that a set-valued map U : [to,T] "-~ X has an integrable selection if there e ~ s t s an integrable single-valued map [to,T] ~ t ~ u(t) E U(t). T h e o r e m 1.4 Let (y,g) be a trajectory-selection pair of the relaxed inclusion (12) on [to,T]. Assume that F and y satisfy all the assumptions of Theorem 1.2 and that F(.,y(.)) has an integrable selection. Let ~](.) be defined as in Theorem 1.2. If rl(T ) < fl then for every s > 0 there exists a mild trajectory x of (8) on [to,T] satisfying x(t0) = y(to) and Itx - Y][c O, k E s and for almost every t E [to,T], F(t,.) is k(t)-nipschitz on xo + 5B. Then for every u e eaF(to,xo) dist ( a ( h > o + hu, ~(to + h, to)xo) = o(h) whcre limh~o+ o(h)/h = O. Fix u e ~ F(to, x0) and set y(to + h) = C(h)zo + f,~+t, a ( t o + h Set r = sup,e[O,h] dist(u, ~6F(to + s, y(to + s))). Then, by continuity of F , Theorem 1.2 yield that for some C > 0 and every small h > 0, there exists ]ly(to + h) - xh(to + hll < Cr Applying Theorem 1.4 we end the proof.

Proof--

s)uds = a ( h ) x o + l~u + o(h). limb-o+ ~(h) = 0. This and co X xh E $[to,to+h](o) such that El

C o r o l l a r y 2.2 Under all assumptions of Theorem 2.1 ~6 -/;'(to, Xo) C lim inf /~(to + h, to)xo - G(h)xo h~o+ h

Consequently, if xo E Dora A, then Axo + e-O F(to,xo) C liminf R(to + h, t o ) x o - xo h~o+ h We introduce next an analog of the variational equation for differentia] inclusions. For this we need to extend the notion of derivative to set-valued maps. D e f i n i t i o n 2.3 Let ~ be a set-valued map from a Banach space X to another Y and lel y E .T(x). d e r i v a t i v e d ~ ( x , y) is the set-valued map from X to Y defined by

v e dT(x,y)(u)

~=.

hn~+_ d ( "~,g ( x + h u h )h- - Y )

= 0 for some uh -+ U

When .T is locally Lipsehitz at x then the above definition may be rewritten as v e d.T(x,y)(u)

~

lim dist h~O+

v,

= 0

The

524 We refer to [2] for properties and to [8] - [12] for the applications of set-valued derivatives in the finite dimensional context. Below dF(t; x,y) denotes the derivative of the set-valued map F(t,., .), i.e. its partial derivative with respect to the state variable. Let (y,g) be a trajectory-selection pair of the differential inclusion (8) defined on the time interval [t0,T]. We "linearize" (8) along (y,g) replacing it by the " v a r i a t i o n a l i n c l u s i o n " :

w'(t) E Aw(t) + dF(t;y(t),g(t))(w(t)), w(to) = u Consider the solution map Sc,z from X to the space C(to,T;X) • s

(15)

T ; X ) defined by

5cx(~) = { ( z , / ) i s a trajectory-selection pair of (8) on [t0,T], z(t0) = ~ } The following result was proved in [15]. T h e o r e m 2.4 ( V a r i a t i o n a l i n c l u s i o n ) If F and y verify assumptions HI) - It3), then every trajectoryselection pair (w,~r) of the linearized inclusion (15) on [t0,T] satisfies (w, ~) E dSc,L(y(to),(y, g))(u). A stronger result may be proved when the map t ~ F(t,y(t)) contains an integrable selection. Considcr the "convex" linearization of (8) along (y,g):

w'(t) E Aw(t) + d~-6 F(t;y(t),g(t))(w(t)), w(to) = u

(16)

In the theorem stated below we consider the solution map Sc(() = 3[t0.Tl(~) as the set-valued map frmn X to the space C(to, T; X). T h e o r e m 2.5 Under all assumptions of Theorem 1.2 assume that F(.,y(.)) has an integrable selection. Then every mild trajectory w of the lincarized inclusion (16) defined on [to, T] satisfies w E d Sc(Y( to), y)(u). Proof--

By Theorem 1.4 we may replace F by ~ F . Then the result follows from Theorem 2.4.

[]

The derivative of the set-valued map ~ F(t,x) has the following useful property: If F(t,.) is locally Lipsctfitz on a neighborhood of x, then for every y E F(t,x)

dF(t;x,y) + T~F(t,~)(Y) C d-~'hF(t;x,y)

(17)

where T-~f(t,~)(y ) denotes the tangent cone of convex analysis to "~F(t, x) at y. See [15] or [2] for the proof. We provide next two comparison theorems for trajectories of the control system (2) and the differential inclusion (1). Let Z be a complete separable metric space, X be a separable Baaach space and f : 1Z+ x X • Z ~ X be such that for all (x, u) E X • Z the map f(-, x, u) is measurable and for every t E 1%+, f(t,., .) is continuous. Consider a set-valued map U : It+ x X .-~ Z with closed nonempty images and define the set-valued nlap

F : IZ+ • X...* X by V ( t , x ) ~ It+

x

X, F(t,z) = I ( t , x , U ( t , z ) )

T h e o r e m 2.6 If U is measurable with respect to t and continuous with respect to x, then solutions of (1)

and (2) do coincide. Proof-It is enough to show that for every trajectory-selection pair (x,v) of the differential inclusion (1) defined on [t0,T] there exists a measurable selection u(t) E U(t,x(t)) such that for almost every t E [t0,T], v(t) = f(t,x(t),u(t)). By [2, Theorem 8.2.8] the map t ",-* U(t,x(t)) is measurable and for every u E Z the single-valued map f(.,x(.),u) is measurable. Applying [2, Theorem 8.2.9] we end the proof. [] We observe that in general the images of the map F defined above are not closed, while the calculus of differential inclusions developed in previous sections deals only with closed-valued maps. For this reason we provide one more comparison theorem. Denote by s the set of maps f : It+ ~ X such that for

all 0 O

suchthat Vhe[O,e], z+hB~(w) CK}

respectively. See [2, Chapter 4] We apply results obtained in the previous sections to derive necessary conditions for optimality for three optimal control problems. Consider a continuous, Gs differentiable function ~p : X • X ~ R. aald closed subsets K0, K1 C X. For every T > 0 set UT = {u : [0,T] ~ Z I u(t) e U(t) is measurable}.

526 3.1

Problem

with fixed end time and free end point

Consider T > 0 and the optimal control problem minimize qo(x(0), x(T))

(18)

over mild solutions of the semilinear control system

z'(t) = Am(t) + f(t,x(t),u(t)), u E LtT, z(O) E I(o

(19)

Let (z,~) be a trajectory-control pair of (19). We associate with it the linear equation

z'(t) = AZ(t) + ~(t,z(O,~(t))z(t)

(20)

and denote by S~(t; s) the solution operator of (20). That is the only strongly continuous solution of the operator equation

VO<s a.e. in [0,T] ,~eU(t)

(22)

and the transversality condition: p(O) e ff'~xl(Z(O),z(T)) + TKo(z(O))-

(23)

where Tlco(z(O))- denotes the negative polar cone to the contingent cone Ti(o(z(0)). ProofDefine the set-valued map F : [0,T] x X -,~ X by F(t,x) = f(t, x, U(t)). From [2, Theorem 8.2.8] follows that F is measurable with respect to t. By the assumption a) it is also k(t)-Lipschitz with respect to x. Consider the differential inclusion

x'(t) E Ax(t) + F(t,x(t)), z(O) E Ko

(24)

By Theorem 2.7 and continuity of ~v, the trajectory-selection pair (z(.),f(.,z(.),~(.))) is optimal for the problem (18), (24). Thus we may replace the control system (19) by the differential inclusion (24). Consider the linear control system w'(t)

=

Aw(t)+ ~

y(t)

e

Tazi(t,~(t),cr(,))(f(t,z(t),~(t)))

w(O) E TKo(z(0)) (25)

The reachable set/~L(~) of (25) by the mild trajectories from ~ at time T is given by RL(() =

{

S~(T;0)~+

/;

S~(T;s)y(s)ds l Y E s

e T~/(,M,),u(,))(f(s,z(s),g(s))) a.e.

}

From Theorem 2.5 we deduce that every mild trajectory w E C(0, T; X) of the "linear" differential inclusion

w'(t) E Aw(t) + d'd-S F(t;z(t),f(t,z(t),g(t)))(w(t))

527

on [O,T] verifies w E dSc(z(O),z)(w(O)). From the definition of the derivative for almost every t E [0,T]

V w E X , ~x(t,z(t),-ff(t))w E dF(t;z(t),f(t,z(t),~(t)))(w) I[ence, using (17) we deduce that every solution w E C(O,T;X) of the linear control system (25) verifies w E dSc(z(0), z)(w(0)). Since z is optimal, using Corollary 1.3 we deduce that for every trajectory w of (25) we have (U(p(z(0),z(T)),(w(0), w(T)) I >_ O. The last inequality yields that for every integrable selection y(t) E "d'5f(t,z(t), U(t)) and every ~ E T1r we have

Define p(.) by (21). Setting y(t) = f ( t , z ( t ) , g ( t ) ) i n the above we get p ( 0 ) - -~7(z(O),z(T)) E TK0(z(0))-. llenee the transversality condition (23) holds true. On the other hand applying (26) with ~ = 0 we obtain: for every integrable selection y(t) E ~ f(t, z(t), U(t))

fo T (3~(T; t)*p(T), y(t)) dt = < p(t), f(t, z(t),g(t)) > almost everywhere in [0, T]. This yields the ma.ximum principle (22). t::l 3.2

Problem

w i t h free e n d t i m e a n d f r e e e n d p o i n t

Consider the optimal control problem minimize { (p(x(O),x(T))iT > 0 }

(27)

over mild solutions of the semilinear control system (19). T h e o r e m 3.2 Let (z,~) be an optimal trajectory-control pair of the above problem and T denote the cor-

responding optimal time. Then the same conclusions as in Theorem 3.1 are valid. Furthermore if f(.,x, u) and U are continuous at T and z(T) E Dora A, then sup (p(T),f(T,z(T),u))

~ (p(T),-Az(T))

~eu(o P r o o f - - By the proof of Theorem 3.1, z is optimal for the problem (27), (24) and it is enough to prove the last statement. By Corollary 2.2 and optimality of z, for every u E ~5 F(T, z(T)), 0~2 (z(0), z(T))(Az(T)+ u) _> 0. tfence the result. [] 3.3

Problem

w i t h fixed e n d t i m e a n d e n d p o i n t c o n s t r a i n t s

Consider T > 0 and the optimal control problem minimize { V(x(O),x(T)) [ x(T) E Kx }

(28)

over mild solutions of tile semilinear control system (19). T h e o r e m 3.3 Let (z, ~) be an optimal trajectory-control pair and let Q C DKr(z(T)) be a convex cone with nonempty interior and P C T1~'o(z(0)) be a convex cone. Further assume that for every (t,x) E [0, T] x X

the set f ( t , x , U ( t ) ) is closed. Then there exist A > O, ~o ~ P - , ~T E Q- not vanishing simultaneously such that the map pit)=

S~T;t)* (-)~ ff--~x (z(O),z(T)) - ~T)

(29)

satisfies the maximum principle (22} and the transversality condition (p(0), -p(T)) The proof can be found in [15].

=

~V~(z(0),z(T)) + (r

~r)

(30)

528 R,eferences [1] AUBIN J.-P. ~: CELLINA A. (1984) DIFFERENTIALINCLUSIONS.Springer-Verlag, Grundlehren der Math. Wissenschaften, Vol.264 [2] AUBIN J.-P. & FI~ANKOWSKA H. (1990) SET-VALUED ANALYSIS,Birkh~,user, Systems and Control: Foundations and Applications [3] CASTAING C, & VALADIEI~ M. (1977) CONVEX ANALYSIS AND MEASUR.ABLEMULTIFUNCTIONS. Lecture Notes in Mathematics, n ~ 580, Springer Verlag, Berlin [4] CLAR,KE F. (1983) OPTIMIZATIONAND NONSMOOTII ANALYSIS.Wiley Interseience [5] DUNFOR, D N. & SCIIWAI~TZ J.T. (1967) LINEAIL OPERATORSPart I: General theory. Interscience Publishers, Inc., New York [6] FATTORINI II. ~z FR,ANKOWSKA H. (to appear) Necessary conditions for infinite dimensional control problems. Mathematics of Control, Signals, Systems [7] FILIPPOV A.F. (1967) Classical sobttions of differential equations with multivalued right hand side. SIAM J. Control & Optimization, 5, 609-621 [8] FR,ANKOWSKA H. (1987) The maximum principle for an optimal solution to a differential inclusion with end point constraints. SIAM J. Control ~z Optimization, 25, 145-157 [9] FR,ANKOWSKA II. (1987) Local controllability and infinilesimal generators of semi-groups of set-valued maps. SIAM J. Control ~: Optimization, 25,412-432 [10] FI~ANKOWSKA ]I. (1989) Set-valued analysis and some control problems. Proceedings of the International Conference 30 years of Modern Control Theory, Kingston, June 3-6, 1988, E.R,oxln Editor, Marcel Dekker [11] FI~,ANKOWSKA H. (1989) Local controllability of control systems with feedbacks, Journal of Optinfization Theory and Applications, 60, 277-296 [12] FI~ANKOWSKA tI, (1989) Contingent cones to reachable sets of control systems. SIAM J. Control &: Opt. 27, pp. 170-198 [13] FR.ANKOWSKA lI. (1989) Optimal trajcctories associated to a solution of contingent lIamillonJacobi equations. Applied Math. & Optim., 19, pp. 291-311 [14] FII.ANKOWSKA II. (to appear) Some inverse mapping theorems, Ann. Inst. IIenri Poincar~, Analyse Non Lin~aire, [15] FR,ANKOWSKA II. (to appear) A priori estimates for operational differential inclusions, J. Diff. Eqs. [16] HILLE E. & PIIILLIPS R,.S. (1957) FUNCTIONALANALYSISAND SEMI-GKOUPS.American Mathematical Society, Providence, l~hode Island [17] PAZY A. (1978) SEMI-GR,ouPs OF LINEAI~.OPER.ATOItSAND APPLICATIONSTO PARTIALDIFFEItENTIAL EQUATIONS.Springer, Berlin [18] TOLSTONOGOV A.A. (1986) DIFFERENTIALINCLUSIONSIN BANACfl SPACES. Nauka, (in Kussiaa) [19] WAZEWSKI T. (1963) On an optimal control problem. In: Differential Equations and Appfications, Proc. Conf. Prague, 1962 [20] QIJI ZIIU (to appear) On the Solution Sets of Differential Inclusions in Banach Space, J. Diff. Eqs.

Relaxed Controls for Time Delay Systems by l's

B. Vinter

(Department of Electrical Engineering, Imperial College, Exhibition Road, London SW7 2BT, England)

Despite the attention timt optimal control problems involving time delays have received over the last two decades, little attention has been paid to the important question of how such problems should be relaxed in order to assure existence of minimizers. Recently Warga has proposed a relaxation procedure for fully nonlinear problems with delays in the dependent variables and in the controls, and showed that the resulting relaxed problem has a solution. We show, through an example that this relaxation procedure can fail to give a p r o p e r e z t e n s i o n , i.e. the effect of relaxation can be to reduce the infimum cost. Sufficient conditions are given for a proper extension. A new relaxation prot:edure is provided for which the extension is proper in certain situations where the former extension is not,. Abstract:

1. Introduction. There is a substantial literature treating optimal control problems with time delays. In a number of respects our understanding of problems with time delays matches that of delay free problems. Wc note in particular that necessary conditions of optimality, akin to the Pontryagin Maximum Principle for delay free prob[enls, have also reached a high level of refinement [4] for problems with time delays. Itowcver one important strand in the delay free theory is the study of relaxation procedures and of conditions for c•

of minimizers. This is an area which, in the context of time delay problems,

has received very little attention. Our objcct here is to describe some of the difficulties involved, and to indicate how, in some respects, they can be overcome. The optimal control problem we address is Minimize g(x(1)) subject to

(e)

x(t) = f(t,x(t), u(t), u(t-01), u(t-02)),

a.e. [0,11,

x(0) = •

u(t) ~ ~,

a.e. [--02,11.

The data associated with this problem are: real numbers 0 < 0i.< 0 2 < 1, a point x0E Rn, a set f~ C Rm and functions g: Ru

R f R 1+n Win3

Ru

We have chosen this problem for simplicity of exposition, and to keep notationM complexity to a minimum.

No difficulties arise in extending the theory to follow to apply to control problems where

delay terms are present also in t.he state variable. We could also have looked at problems involving more than two delays. Our reason for clmosing precisely two delays is that cert,ain difficulties encount.ered with two delays typify those for several delays, and are not shared by problems wit.h one delav. Notice lhat we have adopted dynanfical equations where there isageneral .onlinear dependence

530 on u(t), u ( t - - 0 t )

and

u(t--02). (In the special case that the dependence is 'separable',i.e. f is

expressible as f(t,x(t), u(t), u ( t - 0 1 ) , u ( t - 0 2 )

= fl(t,x(t), u(t)) + f2(t,x(t), u(t--01) ) + f2(t,x(t), u(t--O2)

the task of finding an appropriate notion of relaxation is a simple adaptat, ion of standard delay free theory). T h e hypotheses we impose are f(-, x, u0, Ul, u2) is measurable and f(t, x,.,.,.) is continuous, f2 is compact, g is continuous, 3 a constant c and a function k(-) E L 1 such that Ill(t, x, u o, u 1,

u2)ll _< c(

t + Ilxll), for all (t, x, (u0, ux, u2) ) E [0,1] • R n x f23 , and

Hf(t, x, u 0, Ul, u2) - f ( t , y, u0, Ul, u2)l[ < k(t) l l x - y l l for all (t, (u0, ul, u2) ) E [0,1] x f23 and x , y E N n. W a r g a [5] was the first, to call attention to the speciat difficulties encountered in addressing existence questions for problems with non-separable time delays in the control. relaxation procedure ( 'weak' rela.xation, as we shall call it,).

Warga also proposed a

The contributions of this article are,

firstly to illustrate through an example that weak relaxat.ion can have the undesirable effect of reducing the m i n i m u m cost (in other words weak relaxation m a y fail to be proper), to give a simple testable criterion for properness of weak relaxation and, finally, to supply a more refined notion of rela~,:ation which is proper in certain situations where this criterion fails.

2. Delay Free Problems. For purposes of comparison we briefly review a standard procedure for relaxing delay free problems. Throughout this section then we look at Minimize g(x(1)) subject to

.~(t) = f(t,x(t), u ( t ) ) ,

a.e. [0,q,

x(0) = x0, u(t) e f~,

a.e. [0,1].

The hypotheses on the data are those of the previous section, when tile delay free problem is regarded as a special ease of (P). A measurable function u: [0,1] ~

R m satisfying u(t) E f2 a.e. is called an ordinary control. A

pair (x(.), u(-)), comprising an ordinary control u(.) and an absolutely continuous function x(-) which satisfies the differential equation is called an ordinary process. called admissible.

If x(0) = x 0 the ordinary process is

The optimization problem, posed over admissible ordinary processes, is called the

original problem (Poriginal)"

Two questions now arise. When does rite original problem have a

minimizer and. in the event that there is no minimizer, how can we supplement the class of admissible ordinary processes to ensure existence of a minimizer? 13orb have very satisfactory answers. As for the first question, a simple geometric condiLion "f(q x, .Q) is convex for all (t, x) E [0,1] x ~n,, is I~nowll to assure existence of an optimal admissible ordinary process (See, e.g. [3]). An existence

531 theory, applicable for nonconvex 'velocity sets' f(t, x, f~), involves introduction of the notion of relazed process.

A relazed control is a measurable essentially bounded function, often written ~ or t---~ ~t' taking vMues in the space of regular probability measures on ft. ('Measurable' here is understood in the sense that t - - < g , # t >

is measurable for each g E C(fl).)

We denote the set of relaxed controls by Jtl,. The

set .46 can be regarded as a subspace of the topological dual space (L[0,1; C(fi)])*; it acts on elements r in the primal space according to 1

r ~f

J" O(t,u) d#t(u ) dt. 0

We equip it with the relative weak* topology. A relazed process is a pair (x, #) comprising a relaxed control and an absolutely continous function x which is a solution to the differential equation, in the sense that :~(t) = f f(t, x(t), u) d , t ( u ). It is admissible if x(0)=x 0. (Under the hypotheses, given /~, there is a unique x such that (x,#) is an admissible relaxed process.) An ordinary control u(.) can be regarded as an element in .21,; we identify it with the point t ~ 6 ( t ) , where 6~ denotes the unit measure concentrated at the point a.We find that .At, is a non-empty, compact set. .At, = closure{ordinary controls} The mapping

u - - x(-) : .Ag ~

C([0,1];~I n) is continuous, where (x,,u) is the admissible

relaxed process associated with /~. The problem posed over admissible relaxed processes is called the relazed problem

(Prelaxed).

We

deduce immediately from the preceeding assertions:

Theorem 2.1 The relaxed problem has minimizer, and tile infimum cost for the original problem coincides with the minimum cost for the relaxed problem. In symbols Inf{ Poriginat } = Min{Prelaxed}. The fact that the infimum costs coincide is summarized by the statement "tile relaxed problem is a

proper extension of the original problem".

Properness of the extension is desirable for a number of

reasons. It means that there is a close connection between the problem of primary interest (the 'original' problem) and the problem which replaces it.

It also suggests a methodology for finding

ordinary admissible processes which come close to achieving the infimum cost in situations where existence of a minimizing ordinary admissible process is not assured: we should solve the relaxed problem and then approximate the 'relaxed' minimizer by an ordinary one.

A full account, of these

ideas is to be found in Warga's book [4].

3. Relaxation Via a Reduced Problem. We return now to problem (P). In relation to this problem we term ordiT~ary controls measurable fuuctions u(.): [ - 0 2 , 1 ] ~ n such that u(.) E f~ a.e.

Ordin~Lr~

532 processes and admissible ordinary processes have the obvious meanings (c.f. Section 2). Problem (P), posed over admissible ordinary processes, is the original problem (Poriginal). A well-known technique (see e.g. [4]) reduces a control system with commensurate time delays to a delay free problem. ('Commensurate' means that 0t/0 2 is a rational number.)

To avoid some minor

technical difficulties, in describing this technique we assume a little bit more than commensurability, namely 9

01 and 0 2 are rational.

This means that there exist integers 0 < N 1 < N 2 < N such that 01 = N1/N and 0 2 = N2/N.

We

write A = 1/N. The basic idea is to section ordinary controls and the corresponding trajectories into segments of length A, and to stack these segments to form higher dimensional

vector valued functions

on the interval [0,A]. The resulting functions satisfy a delay free differential equation. The reduced problem is expressed in terms of a partitioned state vector Y~ = col [x0,.., XN_l] and control vector fi = col [U.N 2 .... u 0 .... UN_l]. It is

Minimize ~(2(A)) subject to subject to ~,(s) = f(s,~(s), fi(s)) , (R,P)

a.e. [0,~],

fi(t) E f2, a.e. [0,A], together with the mixed endpoint conditions x0(0 ) = x 0 and xi(0 ) = Xi.l(1), for i = 1,2,..,N-1 .

The d a t a for this problem is constructed from that for (P): = ~ (N2+N), ~(.~) = g(xN_i) and the components {fi}iN-~ of f are fi(s,~.fi) = f(iA+s, xi, ui, Ui_N1 , Ui_No).. It is a straightforward matter to show that (RP) and (P) (posed over ordinary processes

are

equivalent, in the sense that there is a one to one mapping from ordinary admissible processes for (P) and ordinary processes for (RP) which satisfy the mixed boundary conditions. This mapping carries an ordinary process (x,u) for (P) into (2,fi) where ~(s) = col[x(s), x ( A + s ) , x(2/',+s),..,x([N-1]A+s] and fi(s) = col[u(-N2A+s ) .... x([N-IlA+s]. This reduction procedure, together with the theory of the preceeding section, suggests a relaxation procedure for the time delay problem.

We can relax (R.P), which has no delays, along the lines of

Section 2. It is not difficult to show that the relaxed extension of (RP) has a solution and is a proper extension (the a r g u m e n t s involved in establishing riffs latter property are a little bit different to those underlying Section 2, since we must ensure that. given an 'admissible' relaxed process for (RP), we can find a suitable ordillary process approximating it and which satisfies the mixed botmdary comlitions). The procedure of replacing (P) by the reduced problem (liP) and relaxing (R.P) we shall refer to ;us 'relaxation

via the reduced problem'. This procedure in certain respects achieve~, the desired

533 objectives. The effect of relaxtion via the reduced problem is to guarantee existence of minimizers, and the relaxed problem is a proper extension of (P). IIowever it is unsatisfactory ill certain respects. Notice first of all that the dimension of the state and control spaces in (P~P) can be very large ((Nxn) and ([N2+N]xm respectively). The fact that we have posed (P) on the time interval [0,1] is merely a normalization procedure, and the above technique for eliminating delays works on an arbitrary finite time interval; however we note that, for fixed time delays 01 and 02, t.he dimension of the spaces involved increases rapidly with the length of the underlying time interval. Apart from this, in passing to the reduced problem on the time interval [0,A] the connections with the original problem are somewhat obscured.

Finally, relaxation via the relaxed problem is only possible for commensurate

delays.

4. Weak Relaxation. We now describe another relaxation procedure for the time delay problem, due to Warga [5]. Take an ordinary process (x(-), u(.)), and define u0(t ) = u(t), ul(t ) = u ( t - 0 1 ) and u2(t ) = u ( t - 0 2 ) , for t e [0,1].

Then (x(-), (u0(-), u l ( ) , u2(.)) ) satisfies C .~(t) = r(t,x(t), u0(t), Ul(t), u2(t)),

a.e.

[0,1]

(4.1) /(u0(t.), ul(t), u2(t)) ~ f~3,

-

x(0) = x 0

together with Ul(t ) = u 0 ( t - 0 t )

,01 Min (Pw_relaxed).

(strict inequality). But problem (Ps_relaxed) is an extension of problem (Poriginal) so Inf (Poriginal) _> Min (Ps_relaxed)

> Min (Pw_relaxed).

We summarize our findings:

Proposition 7.1 The data for problem (P) can be chosen so that 0 i and 02 are rational, 02= 201_ , and (Pw-relaxed) is not a proper extension of (Poriginal). Observe that Proposition 6.1 predicts that, in this example, (Ps.relaxed)

is a proper extension of

(Poriginal)" There is one further respect in which this example is revealing. Define

11(01,02) : = {the infimum cost of the original problem for delay parameters O1 and 02}. It is of interest to know if this function is continuous, for if it is not the well-posednness of problem (17) is called into question in circumstances where the delay parameters have uncertain values,

l:;xamining

can be discoa~inuous. Let 0 be a fixed rational number, 20 and {0(I)},{0( ')} be sequeuces of rational points such that 0(1)~ 0 and 0(.2)

variants on the above example tells us that r/ 0 < 0 < 0.5.

Let

for which 0f'e)~90(. I) for i = 1,2,.. l

---

1

For each i sufficiently large, the function #t = u is a weakly

relaxed control in tile above example when we adopt

0 (.I) and 0(e) as delay parameters, and the cost is 1 1

zero. Since the cost cannot be negative, anud in view of Theorem 6.1, we must have

,1(0[12,0(e)) = 0, llowever ~/(fl, 20) > 0

as

for all i sufficiently large.

we have shown. Tiros ~l is discontinuous at (0,20).

538 REFERENCES

[1]

T. Andrews, J. Rosenblueth and R. Vinter, Relazation Procedures for Time Delay Systems, in preparation.

[2] [3]

C. Dellacherie and P. A. Meyer, Probabilit~s et Potentiel, Iterman, St,rasbourg, 1975 W. It. Fleming and R. W. Rishel, Deterministic and Stochastic Optimal Control, Springer, New York, 1975.

[4]

J. Warga, OpIimal Control of Differential and Functional Equations, Academic Press, New York,

[5]

J. Warga, Nonaddilively Coupled Delayed Controls, preprinL

!966.

MAXIMUM

PRINCIPLE DIFFERENCE

FOR

NONCONVEX

CONTROL

FINITE

SYSTEMS

Boris S. Mordukhovich Department

of Mathematics,

Wayne

State University

Detroit, M i c h i g a n 48202, U S A Optimal control problems are considered for a family of finite difference systems with constraints on the control and phase variables. Such systems arise in computer calculations of optimal control systems with continuous time and aiso in the simulation of discrete large-scale optimization problems. We regard discrete approximation systems as a process with decreasing time quantization period. New necessary optimality conditions for such systems are obtained in the form of ~he approximate maximum principle which is fulfilled the more precise the less is the time quantization period. We consider arbitrary nonlinear control systems without any asssumptions about convexity. Constructive methods are proved for approximating the state constraints so thai, the stability of the Pontryagin maximum principle is ensured in computer calculations of nonconvex control systems with coutinuous {~ilne. The results obtained are used for the analysis o[ qualitative aspects and the development of numerical algorithms of opt,treat control. 1. I n t r o d u c t i o n The paper deals with optimal control problems for finite difference systems of the form

z(t + h) = z(t) + hf(x(t),u(t),t), v(t) E U(t),t E T/, = {to, to + h , . . . , t l - h},

(t.l)

(1.2)

with various constraints on the state variables. Such problems arise in computer work when the time derivatives are replaced by finite differences : ~:(t) ~ [x(t + h) - x(t)]h -~, et,c. It is then natural to consider the problems ~ a constructive interpretation of the corresponding problems for continuous dynamic systems without using the abstraction of continuous time. We note also that finile difference dynamic systems can be used for modeling a variety of static problems of discrete optimization of large dimensiouality. Optimization problems for systems (1.1), (1.2), regarded as a process as h ; 0, occupy an intermediate position between optimal control problems for systems with continuous and discrete time (with fixed h) and lead to the appearance of essentially new effects some of which are analyzcd below. I~ is well-known that the Pontryagin maximum principle [P 61] gives some necessary conditions for an extremum in general optimal control problems for continuous-time systems described by ordinary differential equations. But, for its discrete analogue to hold as necessary optimality conditions, certain assumptions about convexity [B 73, CCP 70] are required, since wil,hout ~hem the

540 discrete maximum principle is invalid even in the simplest problems with a fl'ee right-hand end. The connection between convexity and maximum type conditions in the theory of extrernal problems in reflected in the classical Kuhn-Tucker theorem [R 70] and follows naturally from the general results of nonsmooth analysis [C 83, Mot 88] (the d~aracteristic form of the normal cone to the convex set). On the other hand, the unconditional satisfaction of the maximum principle in systems with continuous time is linked with the "hidden convexity" property inherent to such systems fIT 74], which can be traced back to Lyapunov's theorem on convexity of the values of a nonatomic vector measure [L 40]. It is important to emphasize that, in order to utilize the Pontryagin maximum principle correctly in computations of actual control systems with continuous time on computers operating discretely, we have to prove the stability of necessary optimality conditions supplied by the maximum principle in difference approximations of nonconvex systems with constraints on control and state variables. Our main result is to prove the necessary optimality conditions, spccifically for finite difference control systems with constraints on the trajectory, in the form of the approximate maximum principle which is valid when the perturbation of constraints is constructively matched with the step of a difference mesh without any assumptions about convexity. In control problenls where the right-hand end of trajectory is free and the left-hand end is fixed, our result is in accord wit,h the quasimaximum principle of [CK 71]. During the proof of the approximate maximum principle, an approximate analogue of the hidden convexity property is found to be inherent to finite difference systems so that we can solve the problem posed in [Mo 71] of finding the reasons why finite difference systems with a small quantization period behave like any discrete systems with a priori convexity conditions. We shall use the following standard notations: R" is the n-dimensionalEuclidean space of column vectors (they are written in rows) with the norm II "II and the scalar product < -,. >; coX, d X are the convex hull and the closure of the set X; E is tile identity matrix; o(h) and O(h) a.re vector quantities such that

o(h)/h

--, 0, IlO(~dll _< K~ as z, s 0. In addition to the usual notation for a

matrix product, we put

.l:=i

( AiAi-1 ... Aj for i > j,

fiat= J

J E 0

for i = j - 1 , for i < j - 1

2. P r o b l e m F o r m u l a t i o n and Definitions. Wc take the control problem

5c = f(x,u,t), u(t) E U(t), t E T = [to, t1],

(2.1)

~ ( z ) < O,i= 1 , . . . , m ; ~ ( z ) = O,i= m + 1 , . . . , r e + p ,

(~.2)

I = ~Oo(Z)~ min,

(2.3)

541 where z = (xo, xx) = (x(to), x(t~)) E R 2~ and T is a fixed intervM. For each positive integer we consider the quantization step hN = (q - to)/N and construct, a sequence of finite difference problems of minimizing the functional (2.3) with z = ZN = (XN(t0), zN(t~)) and the constraints

xN(t + hN) = XN(t) + hNf(xN(t),uN(t),t),

(2.4)

UN(t) ~ U(t),t ~ TN = {to, t0 + hN,... ,t, -- hU),

(2.5)

(PI(ZN) 0,i = 0,1,...,m,()~oN) 2 + . . . + (.~+p,N) 2 = 1,

(2.~3)

where z} = (x~(to),x~(t,)),ei(h) --+ 0, e(t,h) + 0, as h .L 0 uniformly with respect to t C TN, N = 1,2,.... It follows from (2.10), (2.12) and (2.13) that e(t, hN) 0 there exist No such that

~oi(x/v(to),ZN(tx)) --7i/v _ No, N e A.

(4.2)

We note that the concept of an essential constraint in a sequence of finite difference optimization problems corresponds to the concept of an active inequality-type constraiat in usual problems of mathematical programming and optimal control.

It is easy to see that, if the constraint is

not essential along the sequence A = {N}, then it is inessential along the sequence A, C A. IIence it can be assumed without loss of generality that, for the sequence of optimM trajectories {x~v(')} in problems (2.3) - (2.6), the first g constraints in (2.6) are essential and the remaining

m -g.,O < g < m, are inessential along all N = 1 , 2 , . . . . From the optimal trajectory increment A z } = (Az~v(10) , Az~v(q)) we construct the (g + 1) -dimensional column vector

e/v(Az~) .

((-~e~ . .

and wc consider the vectors b/v = Ax~

.

,(Tgz ~~176

(~.3)

g/v(b/v, O, u) = g/v(b/v, A0,.x}(t,)), and the set

C-'/V = {aN(b,,O,u) : bN e B/V,O < 0 < N -- 1,u e U(ro)},

(4.4)

where B/V = {x E /{", [[xl[ _< h/v}. We form the convex hull COGN of the set (4.4). The following lemma shows that the set coG/v can be shifted by an amount of order o(h/v) as hN ~. 0 in such a way that it does not intersect the negative orthand in Re+x:

Y = {y = ( v ~

e R TM : v' < 0,i = 0 , 1 , . . . , e } .

(4.5)

L e m m a 4.2. There exists a sequence of (g + 1)-dimensional quantities of order o(h/v), h/v J. O, such that

( c o a ~ + o(h/v)) a z = r N > No

(4.6)

Using this result, we can prove the approximate maximum principle in problems (2.3) - (2.6) in the case of any perturbations ")'iN of inequality-typed constraints.

546 T h e o r e m 4.3.

Let {x~

be optimal processes in problems (2.3)- (2.6) where the

inequalties in (2.6) arc essential along the sequence {x~

N = 1 , 2 , . . . , for i = 1 , 2 , . . . , g and are

inessential for i = g + 1 , . . . , m. Then the relations of the approximate maximum principle (2.8) (2.13) hold for these processes, and

e,(hN) = O(h,v),i = 1,2,...,g;AiN = 0, i = g + 1 , . . . , m , N

>_ No.

(4.7)

P r o o f . By Lemma 4.2, the separability theorem for convex sets and the structure of sets (4.4), (4.5), there exist vectors #N = ( # ~ . . . #~) E R ~, (#~ i rtOtPi/ o',

O~al

+ . . . + (#~)2 = 1, N > No, such that

o

ttNK~xo~ZN), b~) + (3-~-(z,-,,, ao,,x~(t,))] + o(1,,) ___0

(4.S)

for any bN e BN, IIbNII _< hN, and parameters {0(N), u(N)} of one-needle-shaped variation, where It'y >_ O,i = 1,...,g. We put Auv = #'N,i = O, 1,...,g;.~iN ----0, i = e + 1 , . . . , m , and obtain sets of Lagrange mutipliers which satisfy (2.12), (2.13), (4.7), while el(h) = O(h) follows from (4.1). Using thc standard transformation of the method of increments [GK 71] we obtain from (4.S) the relation

O~~ o O~t o (AoN-~x (zN) + . . . + ,~eN-~x (zg) -- ~b~

bN) -- hN[H(x~

(4.9)

v

r b-0+,), ,,,-0) - ~r(~o (To), r

,6(,-0), ~-0)] + o(I,.) >__o

along the trajectories (2.8), (2.9) corresponding to {x~

With bN = 0 we get (2.10) from

(4.9), and with u = u~(ro) we obtain (2.11) by reduction ad absurdum. [] 5. T h e A p p r o x i m a t e M a x i m u m P r i n c i p l e in G e n e r a l C o n t r o l S y s t e m s w i t h E q u a l i t y Type Constraints Let us study the discrete approximations (2.3)- (2.7) of problem (2.1)- (2.3) with cquality type constraints. Our example below shows that if 51N = 0 or 5iN J. 0 too rapidly, the approximate maximum principle may not be satisfied in ( 2 3 ) - (2.7). E x a m p l e 5.1 We consider the two-dimensional control problem =

v,y(0) = 0,.~ = w,s(0) = 0, x = (y,s) E R 2,

=

(,,, ~) e

(5.1)

m , t e [o, 11,

I = ~o(x(1))

= s(1)

--~ min

(5.2)

with fixed left-haad end and the constraint at the right-hand end of the trajectory ~,,(x(1)) = y(1) = 0

(5.3)

with the quantized control domain in (2.1):

V = {(0, 0), (0,-1), (1, -2), (vq, -3))}

(5.4)

547

It is easy to show that, in the corresponding finite-difference approximation (2.3) - (2.5), (2.7) of the problem (5.1) - (5.4), there exists the unique sequence of solutions u~v(t) = ( 0 , - 1 ) , x ~ ( t ) = (0,--t),t e T N , N = 1 , 2 , . . . , where r

(5.5)

- (--~lN,--MN) for the corresponding trajectories of the adjoint system (2.8), (2.9)

with Lagrange multipliers ~dmissible in (2.13). The Hamilton-Pontryagin function along {X~v('), r h ~ the form [IN(U, t) = - - & N V - - ~ o u W , and for the optimal control (5.5) we obtain H N ( u ~ ) = ~o~v. It can be shown directly that max{I-IN(u) : u e U} - HN(U~ in

(2.13),

>__ 1 for any {-'~0N,/~IN} a,dmissible

i.e. condition (2.10) does not hold.

We introduce a coordination condition between perturbations 51u and the step hN: . 5~1v lm - - = oo, i = m + l , . . . , m + p ,

(5.6)

n~oo hN

and we show that in the caze of (5.6) the relations (2.8)- (2.13) are necessary for optimality in (2.3)

-(2.7). T h e o r e m 5.2.

Let the processes {x~(-),u~(.)} be optimal in problems (2.3) - (2.7) with

any perturbations 7iN and the coordination condition (5.6) for &N- Then approximate maximum principle (2.8)- (2.13) holds in (2.3)- (2.7); moreover ei(hN) = O ( h s ) , i =

1,... ,m.

P r o o f . We can assume for simplicity that ~ol -= 0 in (2.6), since these constraints were considered in Theorem 4.3. We put ~a+N(z) = ~a~(z) - 5m, ~a~-u= -~o~(z) - 5,v, i = m + 1 . . . . , m + p, and write constraints (2.7) as

~+,u(ZN) _ 0 9 T(t)~,: The function of C(] - oo, 01, X ) defined by: (T(t)~)(s)=~(t+s)

for all.~ 0. Thence, we will study the set s = h?. M Ca In this section we will first prove results about the simplest case which happens when the dynamics are just constrained by one delay. T h e n we will see that the other case can be generalize without great efforts. 1.1

The

one

delay

case:

We suppose that p = 1 and that:

;c -- {~ e c(] - oo, 0], x )

I ~o(0) E M ( ~ ( O ) ) } , 0 < 0 is given

No other assumptions need to be made about the set valued-map M in the first result we shail obtain. But the generality of the proposition does not provide us a useful way to compute iPA:(c;). T h a t is why we wiI1 need further results.

552 Proposition

1.1 Dpc(%0) = [D(M o %0)(0,%0(0))](1)

Proof-The next proposition involves the contingent derivative of the set-valued map M o %0.3 Let v E D~c(%0). By definition, we know that it is equivalent to: 3hl ~

0+ and %01E C ( ] - o o , h i ] , X )

such that:

{ 0 0 because, in fact, it is the set that will be useful in the application of the theorem (0.2). A new proposition will help us to do so. P r o p o s i t i o n 1.5 ~D~(~o)fl I n t B x ( A ) C 2?~c,(~) C g)~(~o) Cl Bx(A) Bcforc thc proof, let's state a preliminary result.

l ~ m m ~ 1.6 ~o~(90) = Bx(A) Proof-Let v E 7?jc(~o) Cl I n t B x ( A ) , then 3hl ~

0 , vl ~

Vk=l,...,p,

v such that:

0 O,

,(.)eu

//

/~

> -C, -

> 1. -

Now, we let tl~ be the set of all controls u(.) E U, such that the corresponding solution Za(') of (3.7) satisfies constraint (3.15). Then, by the construction of Q,, we see that

(3.17)

~(.) E Ua~d, Vn >_ 1.

565 Next, let us denote (3.18)

J,,=

inf

,,(.)eu2~

J,,(u(.)).

Then, we have the following result.

Theorem 3.4. Let (H1),(H2) a~d (H3) hold. The~, (3.19)

nm J: = J(~(.)) = J. f/,----+ OO

Pro@ First of all, by (3.17), we have J,, < Jn(fi(')). Thus, it follows from Theorem 3.3 that

~l~ J,, < J".

n-..-+ (:~

Now, suppose (3.19) is not the case. Then, one can find a constant 6 > 0 and a sequence

{urn(.)} c u~, such that Jm(um('))
0,

v~(.) 9 U.

Now, we let 2,,(.) be as in Section 3 and 2 % ) be defined as in (4.2) replacing (x,(.), u(.)) by (2,~(.), a(-)). Then by Theorem 3.4, we know that as n - , oo, (4.4)

0 < a(n) = Fn(~(.)) = d(_oo,j'_,/a](J,(~(')) _I.

568

Substitute (4.1S) in to (4.11), we have (see [9]) v~T (4.20)

_> (r

~,,(T)) + r

o 0

= /oT[H,~(t, xn(t),u(t),r162

_ Hr,(t,z,~(t),un(t),r162

This can be regarded as an approximating maximum principle. Then, it is natural to expect that we can take the limits to get the final result. The rest of the proof is to achieve this. We note that (by (4.5)) (4.21)

{ Iz•(t)- ~(t)] ~ 0 ,

ir

/

_

r o 0,

uniformly in t E [0,T],

r

• r

On the other hand, from the equation (4.18) and the Gronwall's inequali~, we see that (4.22)

Ir

_< c,

Vn >_ 1,t 6 [0, T].

Thus, by the reflexivity of X, we may assume that

{

/~(.,x"(.),u"(.))*r

(4.23)

~ r

in L2(GT;X),

fo,,(.,x"(.),u"(.)) L A(.), in L2(O,T;X), r ~ r in L2(O,T;X),

for some functions r A(.) and 0(.). Next, we will derive the equation which the function r satisfies. To this end, we need the following result. Lemma 4.1. Let eAt he a compact semigroup on reflexive Banach space X. Let gu(') 6 L~176 with

(4.24)

Ig,(.)l~(o,r~x) _ 1.

Then, there exists a subsequence of ga(') (stiU denoted it by itself) and a g(.) E L ~176 (0, T; X), such that for all t 6 [0, T],

~m I

(4.25)

/:

eX(~-r)Cg,~(r) - g(r))&" l = 0.

By noting the fact that the semigroup eAt is continuous in the uniform operator topology for t > 0 ([11]), one can easily prove Lemma 4.1. Now, by the above lemm% we can assume the following (see (4.18), (4.22) and

(4.23)): (4.26)

r

o r

strongly in X,

Yt 6 [0, T].

569 Then, for t < T, we have (4.27) ir ) - ea'(r-t)~b(T) _< I~P.Ct) - r

/"

ea'(,-t)--,, ~,~ ~-, ~"(~-), ~(~), r176r

+ leA'(r--t)CCr, CT) -- r

+ ML

+ Omens(u" # a) + 01r ~ - r176 I -~ o,

.

-*

f

e~Cf-t)lr

) - ~p(T)ldr

oo.

While, by [4] (Proposition 1.12 and Lemma 5.4), we know that if (4.28)

H~"(',z"('),~('),r176

~s'('),

inL~176

then, (4.29)

;(r) eO~H(r,~(r),~(r),r176162

a.e. r E [0,T].

Thus we obtain (2.12). While, the rest conclusions follow easily.

[]

References

[1] N. U. Ahmed and K. L. Teo, Optimal Control of Distributed Parameter Systems, North Holland, New York, 1981. [2] E. hsplund, Averaged norms, Israel J. Math., 5 (1967), 227-233. [3] A.V. Balakrighnan, Applied Functional Analysls, Springer-Verlag, New York, 1976. [4] V. Barbu, Optimal Control of Variational Inequalities, Pitman, Boston, 1984. [5] F. H. Clarke, Optimization and Nonsmooth Analysis, Wiley, New York, 1983. [6] I. Ekeland, Nonconvez minimization problems, Bull Amer. Math. Soc. (New Series), 1 (1979), 443--474. [7] H. O. Fattorini, A unified theory of necessary conditionz for nonlinear nonconvez. control systems, Appl. Math & Optim., 15 (1987), 141-185. [8] X. Li and Y. Ya~, Maximum principle of distributed parameter systems with time lags, Distributed Parameter Systems, Lecture Notes in Control and Information Sciences, Springer-Verlag, New York Voi.75,410--427, 1985. [9] X. Li and J. Yong, Necessary conditions of optimal control for distributed parameter syatems, submitted [10] J. L. Lions, Optimal Control of Systems Governed by Partial Differential Equations, Springer-Verlag, New York, 1971. [11] A. Pazy, Semigroups od Linear Operators and Applications to Partial Differential Equations, Springer-Verlag, New York, 1983. [12] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze and E. F. Mischenko, Mathematical Theory of Optimal Processes, Wiley, New York, 1962.

NE~ ALGORITHMS O~ SOLVING EXTREN~L PROBLEMS Rafail Gabasov Byelorussian State University,

220080,f/insk,USSR

Faina M.Kirillova Institute of )~thematics,

22060~,

rvinsk, USSR

Algorithms os solving mathematical programming and optimal control problems using a new approach to the solution of linear programming problems are discussed. This approach has been developing in Einsk (USSR) from the begimling of the 70th. Numerous experiments by computer carried out for linear programming problems (in three computer centers independently) have demonstrated advantages of the adaptive method (R.Gabasov, F.N.Kirillova,

O.I.Kostuykova,

1980) and

another methods regarding to the simplex method and other known methods. 2rinciples of the adaptive method were used while algorithms of solving mathematical programming and optimal control problems were constructed. The paper gives a brief account of recent results obtained by participants of the Y~nsk Seminar on constructive problems.

theory of extremal

They deals with exact finite algorithms for solving optimal

control and nonlinear programming problems tion of nonlinear functions), ciples.

(using network interpreta-

support maximum and

6 -maximum prin-

Consider the classical optimal control problem

1 .

J(u) i

(1) - ~ Cx, u , ~ : ) ,

~(~(~),-~)

~0

h (.:~ C~'))

-

~(~.)

=:r.

,

(2) (3) (4)

,-t. e T , o,

(5) , Piecewise-continuous

R"

Rt

vector-functions

R"

~(s

k

s E T , which take on the

values from the set (5) generate continuous trajectories ~(%), s ~ T, of the system (2) satisfying the phase

(3) and the terminal constraints

571 (3),(4).

They are usually called admissible.

is called optimal if the cost function Complexity

I) infinite number of nonlinear dimensionality

are chosen;

3) nonlinear nature

Lhe problem

(I)-(5)

constraints

in finding

of transformations.

can be considered

efficient

methods

to the problem

of formation

which allow to solve

and numerous

lems are known.

methods

operation gradient

~rom the general point of

of passing feature

to finite-dimensional

the limit

(or its analogue) processing

dimensional

simplicity

of the basic method

consists

in calculating

of current

information

On the other hand,

finite-dimensional methods as well.

we introduce

problems However,

are always finite and determined

are used in the mathein our methods

interval

They accumulate

of the problem

and linear-quadratic

only by optimal control differ fundamentally

optimal

(I)-(5).

optimal

intervals

2. ;~&ile solving the problem

,

d ' 6 ~ ~,

Various methods known among

6.

,

(I).-(5) in

shortening

the time

The methods are finite for linear

control problems,

(I)-(5)

i.e. their realization

6"e A"

,

linear programming

A ~ R""

of solution are known for the problem

them is the simplex method.

of primal and

the principal part is ful-

filled by a linear model which is an interval problem:

d,

They

structures~

from the tradi-

by computer needs a finite number of full integrations conjugate systems and a finite storage.

c,x,

do

to solve the extremal problem.

As a result we obtain a method of solution of the problem approximations

a finite-

of solution.

unlike with mathemati-

the sizes of auxiliary problems

not depend on the accuracy needed

tional ones.

the

and a step along it. Instead for a com-

matical programming

which successive

is

of these principles.

linear problem with an efficient algorithm

cal programming

prob-

of the suggested approach

which in many known approaches

prehensive

(I)-(5) with a given

to the object being opti-

method of realization

On the one hand we abandon

consists

is not new: both various linearization

Characteristic

defined by a particular

(I)-(6)

of this comple-

of linear finite-dimensio-

the problem

mized and a finite storage of computer. view the idea of the approach

By virtue

as one of the ultimate

accuracy using finite number of references

methods

on output .signals ( ~ ) ,

ofthe space from which elements ~ s

The main idea of our approach

nal problems

control

the optimal value.

of the extremal problem is defined by three factors:

2) infinite

xity.

An admissible

(I) reaches

(6)

(6). The most

This is a cause for many

572 scientists

to confine

their investigations

reduces to the classical ting effective constrained

algorithms~

The same conclusions

optimal control problems

zation problems when methods

reduce

(the penalty function

of solution are grounded

However,

of solving complex optimization

For example

(mathematical

and finally

there exists a nt~ber of optimal control

package

LP ASY realizing

problems

such peculiar programming

solution

the known methods

software

of the phenomenon

me-

as methods

The fact is that an increase

which caWt be solved using the most powerful

source

to finite-dimenprogramming

to a more complicated

problems

programming

minimi-

The same is said

on a reduction

problems.

of solution leads

auxiliary problems

become helpless.

are often used when

in reality such a way cannot be considered

of a given accuracy of "simplified"

"the problem

to unconstrained

methods).

sional nonlinear programming problems thods).

to the phrase

L2 problem which can be solved by the exis-

and perfect

the simplex method

[d]

is clear enough for specialists.

generated from optimal

The Linear

control problems,

have

features which are seldom occured in general.linear

problems

and they are not considered

by the simplex

me-

thod created in the middle of the 40th. In this connection the problem method

a new method was created in ~ n s k

(6). Its characteristic

called a support preserves ons: The support unlike

are as follows:

the feasibility

is similar to the basis

of vectors

result

of decreasing

all the variables varies;

iteration

at iterati-

solution and

element;

a suboptimality

which do not satisfy

rion vary at the iteration. variable

~

of the simplex method but

in the method as an independent

is based on a principle

3) an iteration estimate.

the optimality

solution is improved at an

but the support is improved

5) the solution can be stopped using the suboptimality s -optimal

As a

crite-

As for the simplex method there only one

4) not only a feasible

(as in the simplex method),

on fo~ any

I) the

~ ; 2) a special construction

the latter itTis separated from the feasible

participates

well;

features

starts with any initial vector

to solve

feasible

solution with a preassigned

9 The method called "adaptive"

as

criteriaccuracy

~ 0

experiments

programmed

both on general problems

random-number

generators

mal control problems. published [ L , 3 ] . 3. The continual tion elements

~ny

and

and on special problems results

by different

obtained from opti-

of the experiments

constraint-inequality

of extremal problems.

~ested in numerous

(6) constructed

have been

is one of the complica-

Such a constraint

can be taken

573 into account

using the semi-infinite

;

d. -/mz,

(18)

the maximum condition

~'(r

=

m~

~'ce) 6 u

,teT

,

(19)

is true. The prototype Lagrange

of such optimality

multipliers

of variational

calculus

carried over to nonlinear programming method and the adaptive methods ther form.

conditions

which has been later

problems.

the optimality

They come not from the multipliers

is the rule of In the simplex

conditions

have ano-

but from the basis and

the support. The Lagrange multipliers (called simplex multipliers, potentials) are obtained from the basis and the support. The support

of the problem

(17) is closely connected with

579 locally controllability of the system. The support $,up = ~ s ~ p , ~ p ~ r e p r e s e n t s such a set of moments from the i n t e r v a l T and a set of natural numbers assigned to each moment that the system (17) is locally controllable. The Support ~aximum Principle: for each normal optimal control there exists such a support that along the accompanying trajectory of the conjugate system (1~)the maximum condition (19) is true. The statement extends the Pontryagin maximum principle: in the Support ~iaximum principle only special multipliers generated by the support are used. The situation is the same as in classical result of the simplex method, i.e. among optimal feasible solutions there are always basic feasible solutions. By virtue of this fact we can speak of the Lagrange basic vectors. Algorit]~ms of solving nonlinear optimal control problems are given in [ 5].

~EFEREUC ]~S [I] Gabasov R., liirillova F . M . 1984). Consideration of Optimal Control Problem Specifity of Generalizing 1~lathematical Programming.- In: Preprints of IFAC 9th Triennial World Congress, v.5, Budapest, 264-269. 2 ] Oabasov R. Xirillova F.~I. (19~I). Constructive ~iethods of Parametric and Functional Optimization.- In: Preprints of IFAC 8th Triennial World Congress, v.4, Kyoto, 111.-116. [ 31 Gabasov I{., Kirillova F. ~,{., Kostyukova 0.I., Pokatayev A.V. (1987). Optimal Program Controls and Flexible Feedback.- In: Preprints of IPAC Xth Trieilnial World Congress, v.8, Munich, 119-124. 4] Abstracts of International Soviet-Polish Seminar on ~hthematical J r/lethods of Optimal Control and their Applications (1989)~ Institute of ~athematics, f~insk, USSR. [ 5 ] Oabasov R., Kirillova F.~L (co-authors Thatyusl~in A.I., Kostyukova 0.I., Raketski V.M., Pokataev A.V.)(1984.-1990). Constructive ~ethods of Optimization, parts I-5. Linear Problems. Control Problems. Network Problems. Convex Problems. Nonlinear Problems. University Press, ~Insk,USSR. [6] Gabasov i~., Kirillova F.M., Pokataev A.V., Kalinin A.I. (1990). General Approach to Construction of Algorithms of Optimization of Nonlinear Control Systems with Nonsmooth Characteristics. In: Preprints of IFAC Xlth Triennial V~orld Congress, Tallinn, USSR (to appear).

CONSTRAINED

CONTROLS

IN L I N E A R

OSCILLATING

SYSTEMS

F.L.Chernousko Institute

f o r P r o b l e m s i n M e c h a n i c s , USSR Academy o f p r . V e r n a d s k o g o l O 1 , M o s c o w , 1 1 7 5 2 6 , USSR

~_U_~.@_EX.- We c o n s i d e r linear state

controlled by

The

means

of

Nell-known

control

as

a

linear

obtained is

Some

to

a

open-loop

the

system,

example

are

control

obtained;

also

of

arbitrary by

on

as

a

well

depend

on

initial

in

of

the under

which

constraints. number

a

absence

of

This linear

scalar

constrained

as

time

the

of

eigenfrequencies

and

which

obtaining

in

given

of

law

considered

are

imposed

a

constraints.

motions

applied

of

to a t e r m i n a l

for

natural is

the

they

and

applied

conditions

system

transfer

state

geometrical

is

controlled

on c o n s t r a i n t s

is

to

technique

satisfies

control.

An

initial

subject

sufficient

(pendulums)

process

point-to-point

combination

control applied

a

technique

this

oscillators

control

of

a given

control

system;

of c o n s t r a i n t s .

approach

a

from

Kalman's

non-controlled

the

a problem

system

Sciences

terminal

the

states.

system

has

the of An

zero

eigenvalues. INTRODUCTION Property systems paper

of

controllability

without [i]

proposed

Kalman to

consider

on

control.

satisfied. number control.

of

a

We

use

conditions These linear

with

Kalman's under

results

are

oscillators

on

as

solutions

systems

usually

considered

control.In

conditions

control

(natural)

linear

is

imposed

obtained

find

non-controlled we

sufficient

constraints

of linear

of

the

method

for the

applied controlled

combination

system.

control

a by

In

this

constraints

imposed to

linear

well-known

controllability

a

geometrical

which

for

his

and

scalar

give

of

of paper

imposed

constraints

system a

and

some are

arbitrary

constrained

581 GENERAL RESULT Ne c o n s i d e r = A(t)x lu(t)l Here

x

The

functions

problem

(i)

x(t

(2)

to all

satisfies

0

x~

fixed

or

x~

x(T)

are

control (1) (2)

n

vector, x m

u

is

matrices

a control, which

are

A and

find t

a

control

u(t)

satisfying

c

[t o , T]

and

such

that

and

terminal

conditions

the

= X1

given

the

solution

(3)

vectors,

t

is

fixed,

T may

be

either

o

free

(T

> t

). Me

denote

by

~t)

the

n~

matrix

defined

o

by

= A(t)m, where

E

~t 0) is

a

= E

(4)

unity

n

x

n

matrix.

The

solution

of

(i)

n

satisfying

x(t)

the

initial

= r176

condition

(3)

is

(5)

+ ~ ~-l(s)S(s)u(s)ds~ t o

Substituting

(5)

into

the

terminal

condition

(3)

we

obtain

T

~-1(t)B(t)u(t)dt

(6)

= X*

t o

where X)K :

~-I(T)x%

The c o n t r o l ( see

obtain

Xo

(7)

must satisfy

(6)

and (2).

Ne s e e k

it

as

[1] )

u(t) where

u(t)

-

= OT(t)c, c

O(t)

is a c o n s t a n t

B

piecewise

t

initial

) = x O,

Here

n,

of

is

a constrained

u c Rm),

a state

x

for

with

> 0

is n

system

c Rn ,

a = const

is time,

constraint of

(x

respectively

continuous

dynamic

+ B(t)u,

~ a,

t

are

a linear

(8)

= ~-1(t)B(t) n-vector.

Substituting

(8)

into

(6)

we

582

(9)

R( T )c = x * where

R(T)

is

a

symmetric

n

x n

matrix

T

R(T)

(lO)

Q( t )Or( t ) d t

= t o

Consider

a quadratic

f o r m (v

is a c o n s t a n t

n-vector)

T

(R(T)v,v)

= ~

IQT(t)vI z dt

(11)

> 0

t o

It f o l l o w s

from

the

(i)

system

v = O, the

see

R(T)

is c o n t r o l l a b l e

[2],

linear

and

R(T)

is a n o n - n e g a t i v e then

matrix.

is p o s i t i v e

definite.

In

If

for

all

this

case

solution

x

%.bAgr__e_m_. Let (i)

(ii)

is p o s i t i v e

s y s t e m (9) has a u n i q u e

c = R 1 (T)

system

(II) that

(i2)

the is

matrix

R(T)

be

controllable)

and

positive for

definite

(the

n-vector

the

any

inequalities T

IQ

(t)

K(T)

IR(T) K(T) hold

for

vl

vl

g ~(T)Ivl,

V t,

t

e [t

o

, T]

(13) (14)

Z %(T)Ivl

some

T > t

~T)

and

. Here

K(T)

is

n x n

matrix

(det

K

o

0), arbitrary

Ix*l then

~-I(T)

control

u(t)

The c o n t r o l

According lu(t)l =

to =

lOT(t)

(8),

are

positive

scalars,

v

is

an

If

< a %(T)

the

Ey_o.~.

n-vector.

%(T)

(8),

(15)

given

by ( 8 ) ,

(12)

satisfies

(12)

IQT(t)R-1(T) K(T) K-I(T)

x*l

=

R-I(T)

x*l.

(12) (2)

satisfies by i t s

(2),(3). definition.

583 Using

(13)

lu(t)l

we o b t a i n

~

/ZT)IK-I(T)

Now we s u b s t i t u t e lu(t)l

R-~(T)

x

= R(T)

~ ~T)~-I(T)I

The p r o o f

is

over.

R#_m#.!K.

The

matrix

matrix:

K(T)

= E

K(T)

R(T)

v

K(T)

K(T) In

x*l.

in

this

and use

vl

= ~T)

(13),

case

(14)

and (15)

~1 ( r ) l x

(14) MT)

is

can

be

a

lower

I*~ a

equal

to

bound

unity

for

the

n

minimal

eigenvalue

of

the

matrix

SYSTEM Ne

consider

a system

+ z

= u

I

Here

i

~i

are

describe

by

means

constraint

(2)

oscillators

..,

is

of

pendulums

of

springs.

a scalar

control

(16)

constants

a control

boundary

with

n

,

u

and

= ~l'o

~l ( 0 )

'"

coot dinates

a system

body

OSCILLATORS

linear

i = 1 '

eigenfrequencies,

a

of

OF

R(T).

or

e

subject a

to

system

> 0 (2).

of

u(t)

Control

are Equations

masses must

(16)

attached satisfy

to the

conditions

%i(o ) = ~l' o

i

= 1, ....

n, (17)

1

~(T)

. ~,,

We a s s u m e

&l(T)

that

.

all

n~,~

i

are

positive

= 1 .....

n.

and

different.

Denote

I

e

=

0

o

and

Q =

min

(~

1§

- ~ ) > 0 1

(18) 0 = ~

The of

0

system change

< e

1

0, U.~i~< U~ < Um~ and j = 1 , 2 , . . . , N - 1

i=0,1,.--,N-1

Assuming that the second-order Kuhn-Tucker and constraints - qualification conditions hold for the constrained optimization, a local saddle point of the Lagrangian may be found if ~ large positive value of c is chosen.

6

Algorithm

and

Simulation

The above method was applied for the optimal path planning of the first three finks of robot PAMIIL In the joint space defined by Q3 = {-~r > 0, 11Y(t) II IlY(t)ll"0

-l IlY(t)ll"0

-l 0 there is a partition s of [T - r,T] such that if {tp } refines s then NH( 2 n

p=0

L(, tp)[Htgt](p))

dt

Notice that ~--{f o sin(t - s) du(s) = f 0 cos(t - s) du(s), i.e., if f =

L;J

is in Ag(I - P0)

G H then fz = fl". The projection 1-I of (I - Po)GH onto clAg(I - P0)GH can be represented as follows:

,,-, where ~ =

,tl..,-:,:,_ -1 cosh T

,..,.:

de

639

If f =

is not Ap(I - Po)GH, i.e., f2 :~ f ; then f - FI f is orthogonal to Ap(I - Po)GH.

Clearly, for any 0 < r < T, (I - PT-r)GH is not contained in clAp(I - Po)GH. On the other hand, the first condition for weak controllability does hold. Since rank [13, oc 13] = rank [ 0

1] = 2 the first condition must hold for 0 < r < T.

Still computing L and checking the column independence condition for L ( , T - r) is instructive. If 0 < s < t _ 0

(1-p)liutII2+ (29)

small.

(1-e-el-e2)ilUxxll2

inequality +

1 ~ . we h a v e

K(/],e)llull 2 From ( 2 6 )

and

(2S)

we h a v e

+ (b+p(a-1)-K(~,e)-C2(~.e2))tlull2

+ (6 -CI(K'el)-C2(~'e2))

u6dx

~ G(t)

=

G(t;p,a,b,c.d)

9

(l+p)llutll2+(l+e+el+e2)llUxxll2+(b+p(a+l)+K(p,e)+C2(~,e2)) + ( ~d ' + C 1 ( K . e l )

Ilull2

+ C2(~,e2))lu6dx

fl From ( 2 7 ) N(t) (30)

and

(28),

it

s

that

= N(t;p,a,b.c,d)

~ (a-p

)llutll2

+ (p-pc

- ~3 p e l - P e 2 - e 3 ) l l U x x

+ (pb-pcCs-PC2(~,e2)-PK(~,e))llull2 + ( (l-p) c-C3(~, e3) )Zu~dx

9 + (pd - ~1 p3 C

Now we r e q u i r e

(K,el)-PC2(~,e2)-Cd(~,e3)-pcCS)~U

that 1 > p > 0 1

-

e

-

e I

a

-

p

> 0

-

,

e 2

>

0

6dx

-

II2

649 3 - ~el-e2)

p(1-e

-e 3 > 0

,

b+p(a-1)-K(p.e)-C2(~,e2)

(31)

b -

cC 5 - C 2 ( ~ . e 2 )

(l-p)c d

- Cl(~,el)

choose

parameters

requirements. e.el,e2.e and

e3

First

3

small 3 32

-

-

K(~,e)

in

take

C2(~.e2)

enough

' so

order 1

p

= ~

and

fix

-

-

> 0

,

> 0 ,

- ~3 p C 1 ( ~ , e l ) _ P C 2 ( ~ ,

pd

We c a n

- C3(~,e3)

-

> 0 ,

> 0

,

e2)-C4(~,e3)-pcC

5

as

follows

the

so

that

the

to 6

0

and 3 - ~

p(l-e Then

take

a

=

1

1 > p

= ~

eI -

.

Choose c =

Next

choose

b

> 0

such

b = max(K(~.e) with

c

chosen

and

e2)

-

e3

c

> 0

2C3(~,e3)

+

=

1/d such

i

> 0 that

.

that

+ C2(~,e2).cC5 fixed

as

above.

+ C2(~,e2) At

last

+ K(p,e)}

we c h o o s e

+ 1 d > 0

satisfying 3

d=max{6Cl(~,el)+6C2(~,e2),~Cl(~,el)+C2(~,e2)+cCs+2C4(~,e3)}+3 With obtain

such

from

a

(29)

(non-unique)

choice

and

following

(30)

gI { l l u t l l 2 (32)

the

of

+ IlUxx 112 + llu[I 2 +

~ max(3/2,2b,d/3){llutll2 ~ (2b+d){llutll2

all

the

involved

parameters,

relations,

fuedx) ~ G ( t ) + IlUxxll2 + Ilul[ 2 + f u 6 d x }

+ IlUxxll2 + Ilull 2 + f u 6 d x }

,

n and

(33)

1,ut,2 + 1,Uxx,2 + 89 ~1{ n u t l t 2

+

llUxx

]12 + llull2

i; q d x § 3;uedx + fu~dx + fuedx) +

[2

9

we

650 Substitute

the

relations

(32 / a n d

(33)

into

the

identity

(19).

we

obtain I

d

It follows

1

dt G(t)

2

+ 4(2b+d~

G(t) also

Define

d

~ 2

dt G(t)

shows

that

an equivalent

(34)

and

where

(32)

C(Uo,Vo)

u t = v~

, and

By

G(t)

is a L y a p u n o v - l i k e

norm of

yields

= G(O)

H

functional

of s o l u t i o n s .

by for

(~)

6 H

that

is d e f i n e d

the p a r a m e t e r s

the c o n t i n u a t i o n

of h y p e r b o l i c

as mild

solutions

remain

strong

these

solutions

solutions

of

by

(u,v)

(17)

as

chosen.

exist

[5],

uniquely

(Uo.Vo)

H

at a u n i f o r m

the [5, Thm.

(u,~/

1.1]

of

with

semilinear

since

in

and

of

by

equations

(11),

decay

fixed

theory

evolution

solutions

Finally.

^

.

t II(: )II~ ~ 2G(t I ~ 2G ( U o , V o l e x p (- 2 ( 2 g + d ) ) t o

(35)

and

= 0

t 2(2b+d))

~ G(O)exp(-

H(~)JI~ = ,,UxxH2 + ,,uH 2 + Hvl, 2 Then

+ N(t)

that

(34) (32)

1

G(t)

and

(11)

evolution

(35) for

6 E

t = O, u = u ~

.

implies

with

that

t 6 [0.~) Moreover.

exponential

its

equations

(u.ut)

and (35)

that

they

shows

that

rate.

references,

initial

[g]

for

two

states

(Uo.Vo)

and

^

(Uo,Vo)

respectively,

there

is a relation t

(361

H (vu((ttl)) -

for

t

in

continuity and

( ~^ I ) tH) H t)

( ^u~ v~

the

common i n t e r v a l

of

g : H~(O) ~ L 2 ( ~ )

the G r o n w a l l

there

~ I I (uv ~ o

exists

corresponding

a

inequality,

sequence

of

solutions sup

+ f "g(u(s))-g(:(s)) 0

of existence. , the

From t h e

denseness

it f o l l o w s

that

points

of

~)

local

in

(Uo.Vo)

s H

such

that

the

of

the

satisfy

If(u(t)v(t)) - (vn(t)un(t))llH ~ 0

as

n ~

te[u.T] for

each

compact

interval

[O,r]

in

the

existence

interval

ds

Lipschitz

E = ~(A)

for any I c E

ILL2(

H ,

I

651 solution

(u,v)

continuity

os

globally initial

starting Go(-,.

)

in

C([O,m);H)

data

(Uo,Vo)

from

(Uo,Vo)

implies

that

Besides in

H o

.

the

Thus

(35)

solution

(35)

holds

The p r o o f

for

is

together

(u,v) all

with

exists solutions

with

completed.

REFERENCES [13

O.P. Bhutani, P. Mital, and G. Chandrasekaran, On invariant solutions of the generalized Boussinesq equaitons, Intern. J. Engr. Sci., 26 (1988), pp. 307-310.

[2]

P. C o n s t a n t l n , C. F o i a s , B. N i c o l a e n k o , a n d R. Temam, Integral and Inertial Manifolds for Dissipative Partial Differential Equations, Sprlnger-Verlag, New Y o r k , 1 9 8 8 .

[3]

J.M. Chidaglia and R. Temam, Attractors for damped nonlinear hyperbolic equations, J. Math. Pures Appl., 66 (1987), pp. 273-319.

[4]

J. Hale, Asymptotic behavior of Dissipative systems, Math S u r v e y s & M o n o g r a p h s , V o l . 2 5 , AMS. P r o v i d e n c e , 1988.

[~]

A. H a r a u x a n d E. Z u a z u a , D e c a y e s t i m a t e s f o r some s e m i l i n e a r damped hyperbolic problems, ARch. R a t i o n a l Mech. Anal., 100 (2) (1988). pp. 191-206.

[G]

R. H i r o t a . Exact N-soliton solution long waves in shallow-water and in Math Phys., 17 ( 1 9 7 3 ) , p . 8 1 0 - 8 1 4 .

sol the nonlinear

wave equation lattices, J.

[7]

J.L. Lions, Control of Distributed Grauthier-Villars, Paris, 1985.

Singular

Systems,

[s]

H.P.

Appl.

McKean, Math.,

Boussinesq's equation on 34 ( 1 9 8 1 ) , p p . 5 9 9 - 6 9 1 .

the

circle,

Comm.

A. P a z y . S e m i g r o u p s partial differential

[10]

J. Sprekels. One-dimensional thermomechanical phase transitions with non-convex potentials of Ginzburg-Landau t y p e , IMA P r e p r i n t , ~505 (1989).

[11]

F. U r s e l l . The l o n g - w a v e p a r a d o x i n t h e waves, Proc. Cambridge Philos, Soc., 49

[12]

Y. You, T e c h n i c a l University. 1989.

of linear operators and applications equations, Springer-Verlag, 1983.

~102,

Center

of

Pure

[9]

Report

to

theory of gravity (1953), p. 685-694.

for

Appl.

the

Math.,

Purdue

DIRECT ADAPTIVE STATE-SPACE

CONTROL IN A SETTING*

Edward W. Kamen Department of Electrical Engineering University of Pittsburgh Pittsburgh, PA 15261 (USA)

SUMMARY Direct invarlant

adaptive

n-dimensional

2n-dimensional difference

adaptive

discrete-time

state representation

equation.

simple procedure

minimal

control of a single-input

The central

system is approached

constructed

result

The only assumption

is not required to be minimum phase.

(4n-dimensional) relationship

regression

estimation

this paper Elliott

schemes.

The estimation

exciting

system i3 exponentially desired poles

The approach

parameters

estimation

is that the

the given system

taken in the paper is based of an expanded

vector of the linear

parameters.

In this way the

is reduced to a

The particular

parametrization

of the parametrization

of controller

parameters

If the expanded

(as opposed to persistently

large values

constructed

in

derived by

i3 based on the standard

regression

exciting),

stable with the closed-loop

for sufficiently

of a

in

problem which can be solved using existing

version

least squares algorithm.

sufficiently

in particular,

vector with the coefficient

is a "state-space"

[I].

recursive

2.

on the system model

the desired controller

of the controller

linear-in-the-parameters parameter

parameters

of the system input as a linear function

containing

determination

in terms of a

from the input/output

the controller

order of the system is known a priori;

on the expression

directly

linear time-

of the paper is the derivation

for directly determining

pole placement.

single-output

poles

vector is

the closed-loop

located close to the

of the time index.

Preliminaries Consider

the single-input

single-output

time system defined by the input/output

y(k)

=

n Z aiY(k-i) i=l

In (i), k is the integer-valued output,

and u(k)

is

the

+

n Z b.u(k-i)~ i=0 discrete-time

real-valued

linear time-invariant

difference

control

equation

o index, input.

discrete-

(ARMA) model

(I) y(k)

is the real-valued

Note that

if

the

*This work was supported in part by Eglin Air Force Base under Contract F08635-90-K-0210.

No.

656 coefficient

b 0 is n o t

zero,

there

is a d i r e c t

feed between

the input

and

output. The coefficients assumptions

on the

is t h e m i n i m a l transfer

a. a n d b.

in

system model

order

of the

(i) a r e

unknown

(1) a r e t h a t

system.

the

fixed

order

Here minimal

constants.

n is k n o w n

order means

The only

and that

that

the

function n Z b,z -i 1 a(z)

n -i 7. a.z i=l

i-

does

not have Now

let #(k-l)

#T(k-l) where

any common

-

denote

[y{k-1)

"T" d e n o t e s

system

at t i m e

poles

the

a n d zeros.

the 2 n - e l e m e n t

y(k-2)

...

transpose

the

y(k-n)

vector

u{k-l)

operation.

u(k-2)

Defining

defined

by

... u(k-n)]

the

state

representation

(see

x(k)

of t h e

k to b e x(k)

we h a v e

regression

following

x(k+l)

= ~(k-l)

2n-dimensional

= Ax(k)

,

state

[2, p. 75])

+ bu(k) (2)

y(k)

~ cx(k)

+ b0u(k)

,

where

A=

b -

bI

b2

9 . .

0

0

0

0

~

0

0

...

1

0

0

0

.,.

0

0

0

...

0

0

0

0

9 . .

0

0

...

0

0

1

0

.,,

0

0 0

:

:

a2

...

1

0

...

0

0

0 0

an_ 1

b n 84

an

"aI

0

0

...

0

0

0

0

0

0

...

0

0

0

0

[b 0

0

...

0

1

0 th

...

0

bn_

...

1

0

0

1

0

th +(n+l) position

0] T

(n+l) position

c = Now

[a I

consider

a2

9 -.

an_ 1

the control

u(k)

an

bI

b2

generated

... b n _ 1 by

b n]

n

657 n

n

u(k) = - Z diu(k-i) i=l

-

Z ciY(k-i) i=l

,

(3)

where the c. and d. are the real-valued controller parameters. Note that in l l (3) we are not allowing the input u(k) at time k to depend on y(k) at time k since there may be a direct Let 8

c

feed from u(k)

denote the 2n-element

to y(k)

in the given system

column vector of controller

(i).

parameters

defined by 8~ = [c I

c2

... Cnd I

d2

Then the feedback

control

where x(k)=~(k-l)

is the state defined

u(k) = -e~x(k)

feedback

control

... d n]

(3) can be written

in the form

(4)

, in the previous

(4) into the state representation

state description

of the resulting

x(k+l)

closed-loop

section.

Inserting

the

(2), we have the following

system:

= (A-b0~)x(k)

(5) y(k) = cx(k)

+ b0u(k)

The 2n poles of the closed-loop of the closed-loop following

system are of course equal to the zeros

polynomial

l:

If n is the minimal

order of the system

all 2n poles of the closed-loop state feedback of the form

(ii)

det(zI-A+bsT). c

We then have the

result.

Proposition (i)

characteristic

the state representation teachability U = [b

system

(I), then

(5) are assignable

by using

(4);

(2) is reachable;

that is the 2nx2n

matrix Ab

...

A2n-lb]

is invertible. Proof:

From known results,

control

of the form

system

(I).

all 2n closed-loop

Thus if n is the minimal

assignable

by using state feedback

Since pole assignability

implies

order,

in [2, pp. 75-77];

manipulations

teachability,

however,

we also have

(2) is pole which proves (ii).

(i).

9

of the state representation

(2) is

the above proof avoids the matrix

{Pi:

i=I,2,...,2n}

within the unit disk of the complex plane, there i8 a unique controller det(zI-A+bST)c In addition,

by using a

given in [2].

Now given a desired set

reachable,

the state model

of the form u(k)=-8~x(k),

It should be noted that teachability proved

poles are assignable

(3) if and only if n i8 the minimal order of the given

=

(z-Pl) (z-P2)

using Ackermann's

formula

of closed-loop

if the state representation

parameter ... (see

poles

vector 8

c

located (2) is

satisfying

(z-P2n) [3]), we can express

(6) 8

c

in the

658 following

form.

~(z)

First,

=

writing

...

(z-pl) (z-p2)

(z-P2n)

2n-i Z ~i zi i=0

= z 2n +

define ~(A)

2n-1 ~ ~'iA i i=0

= A 2n +

w h e r e A is the s y s t e m m a t r i x of the s t a t e solution

8

satisfying

c

is the i n v e r s e

In the next be e 3 t i m a t e d

3.

Direct Again

0

...

0

(7)

i]

(8)

matrix

defined

we show that the c o n t r o l l e r

from measurements

Determ/nation consider

[0

of the t e a c h a b i l i t y

section

directly

Then the u n i q u e

(2).

8 T = ~U-I~(A) , c row v e c t o r g i v e n b y =

and U

representation

(6) is g i v e n b y

w h e r e ~ is the 2 n - e l e m e n t -i

,

of the

of the C o n t r o l l e r

the s y s t e m

system input

Parameter

(i) with t h e s t a t e

x(k+l)

= Ax(k)

+ bu(k)

y(k)

= cx(k)

+ b0u(k)

in P r o p o s i t i o n

parameter

vector 8

i. can

e

a n d output.

Vector

representation

(9) w h e r e A, b, c are as d e f i n e d controller Theorem

i:

parameter Suppose

in S e c t i o n

vector 8

c that n is the m i n i m a l

state m o d e l

(9) is r e a c h a b l e

closed-loop

characteristic

denote

Then the s y s t e m

u(k) where

8

= z

= x(k)

input u(k)

is the c o n t r o l l e r c d e f i n e d b y (8).

Proof:

With ~ defined by ~U-iAib-

the s t a t e

the

k e y result.

of the s y s t e m

vector

defined

2n-i + • ~ix(k+i-2n) i=0

Given

(I) so that the

a desired

parameter

vector

(9) y i e l d s

.

(10)

,

given by

(ii) (7) a n d ~ i3 the row

of U we have

i=0,1,2,...,2n-2

= 1 .

by

in the f o r m

- 8Tx(k)

(8), b y d e f i n i t i o n

equation

to d e t e r m i n i n g

2n-I i Z ~i z , i=0

can be e x p r e s s e d

0 ,

aU-iA2n-lb Solving

+

= ~u-l~(k+2n)

vector

order

(i.e. U i3 invertible) .

the 2'n-element column

~(k)

Our approach

polynomial 2n

~(z) let ~(k)

2.

is b a s e d o n t h e f o l l o w i n g

(12) (13)

659 .

x(k+i)

Multiplying

i-l.,

= Alx(k)

+

b o t h sides of

E A~-3-1bu(k+j) j=0 (14)

,

i=l,2,...

on the left b y ~U

-i

(14)

and using

(12) a n d

(13), we

obtain ~U-ix(k+i)

= ~U-IAix(k)

~U-ix(k+2n) Multiplying

,

= ~U-IA2nx(k)

i=0,i,2,...,2n-I

(16)

b o t h sides of the ith e q u a t i o n

in

~U -I ~i x (k+i) = ~U-iZiAix(k) Adding

(16)

and

Ix

~U -I

(k+2n)

+

(15)

+ u(k) (15) b y Zi yields

(17)

i=0,1,2, ...,2n-I

(17), we have

n

7. (k+i) i=0Zi x

]

~U-I

2nl

2n

=

]

7. ziA~ x(k)| i=0 ]

+

+ u(k)

.

Then s e t t i n g

~(k)

= x(k)

~(A)

and rearranging Throughout order

terms,

+

= A 2n +

2n-1 7. ~i Ai i=0

8T . ~U-I~(A) c we o b t a i n (ii).

the r e m a i n d e r

of this

so that the state m o d e l N o w let w(k)

2n-i 7. ~ i x (k+i-2n) i=0

denote

,

9 section

(9) is r e a c h a b l e

the e x p a n d e d

we a s s u m e

that n is the m i n i m a l

and the f o r m

4n-element

regression

(II) exists. vector defined by

~(k) ] w(k)

=

Lx(k-2n)] a n d let 8 d e n o t e

the 4 n - e l e m e n t

parameter

vector defined

by

= [~u-I _8T]

eT

c Then

(ii)

can be r e w r i t t e n

in the f o r m

u(k)

= 8Tw(k+2n)

,

or u(k - 2n) From

(18)

we see that 8 can be e s t i m a t e d

s y s t e m inputs direct

and outputs,

procedure

parametrization Elliott's

and since

for e s t i m a t i n g (ii) or

parametrization

The e s t i m a t i o n parameter

= 8Tw(k)

estimation

(18)

8

directly

from measurements

is c o n t a i n e d

o the c o n t r o l l e r

turns

(18) of the

in 8, we t h e r e f o r e

parameter

vector

out to be a " s t a t e - s p a c e "

8 . c version

have a

The of

[I].

of 8 b a s e d on schemes

(18)

c a n be c a r r i e d

such as g r a d i e n t - t y p e

out u s i n g e x i s t i n g

algorithms

or r e c u r s i v e

660 least squares

(RLS).

We shall use the RLS a l g o r i t h m

which is defined as

follows. Given a 4nx4n positive sequence

definite matrix Q, let P0,PI,P2,...

of 4nx4n symmetric matrices

denote the

defined by

P0 = Q

PkW(k+l)wT(k+l)Pk Pk+l " Pk -

,

k~0

(19)

.

1 + wT(k+l)PkW(k+l) Then the RLS estimate

~(k)

of 8 is given by

~(0) = e

o

= initial estimate

PkW(k+l) ~(k+l)

= ~(k)

+

[u(k-2n+l)

-wT(k+l)~(k)]

, k>0

. (20)

l+wT(k+l)PkW(k+l) Defining ~(k-2n) we can interpret time k-2n.

~(k-2n)

= wT(k)~(k-l)

as the estimate

,

at tim~ k of the system input u a t

We can then define the "input estimation e(k) = u(k-2n)

Rewriting

(20) in terms of e(k),

- ~(k-2n)

error"

.

we have

PkW (k+l) ~(k+l)

= ~(k)

+

e(k+l)

,

k>0

(21)

l+w T (k+l) Pk w (k+l) From known properties Proposition

~:

of the RLS ~igorithm,

The estimate

k*~, where 8* is an element

~(k) defined by of 4n-dimensional

By the above proposition, e*.

Unfortunately,

we have the following

(19) and

(20) converges

Euclidean

the RLS estimate

~(k)

result.

to 8* a3

space.

always

converges

to some

e* is not equal to 8 in general.

Thus, letting 8* denote c to 8 , we have that 8* is not equal to 8 in c c c if the expanded regression vector w(k) is sufficiently

the part of 8* corresponding general. exciting

However,

(SE), we can get 8* to be as close to 8 as desired. c c

First,

w(k)

SE if for some k~4n, k Z w(i)wT(i) i-0 Then applying

is positive

known results

(e.g. see

that w(k)

definite. [4]), we have the following

Theorem 2:

Suppose

is SE and that P0=Q=~I,

algorithm.

Then given any C>0, if ~ is sufficiently

lle*-ell< where

l J l J is the Euclidean

norm.

~>0, large,

result.

in the RLS

is

661 in T h e o r e m 2 i m p l i e s

The result

that we can get

as desired by taking the initialization that w(k)

is SE).

[4] to guarantee of ~(k)

corresponding

The Adaptive

to be

large

small

as

(assuming

One could also use the modified RLS algorithm developed that ~(k)~e as k~m and thus ~

in

(where ~ (k) is the part c c c that w(k) is SE. Thus it is possible to

to 8 ) assuming c parameter vector

identify the controller

4.

Ile~-el I

P0 to be sufficiently

8

c

(k)~8

directly

from input/output

data.

Controller

Again let ~

(k) denote the part of the ~(k) corresponding to the C controller parameter vector 8 . Then the adaptive controller is simply c realized by setting u(k) = _~T(k)x(k) . c closed-loop system is given by the state equation

The resulting

x(k+l) which can be rewritten x(k+l)

= Ax(k)

- b~T(k)x(k)

,

(22)

in the form = (A-becT)X(k)

Since the desired pole locations

+ b[sT-~T(k)c c ]x(k)

were chosen

to be within the unit disk,

the

system x(k+l) is exponentially

stable.

=

(A-bS~)x(k)

Then if

11%is sufficiently

small for k large,

II it follows

(22) is exponentially

from standard

closed-loop

system

closed-loop

poles will be "close to" the desired poles.

then have the following Theorem 3: (19),

Suppose

(20).

and in the limit as k ~ ,

the

Using Theorem 2, we

result.

that w(k)

is SE and P0=Q=~I,

Then if ~ is selected

(22) is exponentially

stable

results that the

sufficiently

~>0 in the RLS algorithm large,

stable with the closed-loop

the closed-loop

system

poles located close to the

desired poles in the limit. As noted in the previous algorithm

in

closed-loop

5.

section,

if w(k)

i3 SE and if the modified

RLS

[4] is used, ~ (k) will converge to e . In this case the c c poles will be assigned to the desired locations in the limit as

A n Example

Consider difference

the discrete-time

system given by the second-order

equation y(k) = y(k-l)

- y(k-2)

+ u(k-1)

+ 2u(k-2)

input/output

662

The poles of the system are located at 0.5+j.866 circle,

and thus the system is unstable.

which is outside the unit

The system is also not m i n i m u m phase

since there is a zero at z=-2. Letting x(k)

the coefficients

A =

denote the state vector d e f i n e d by T x (k) = [y(k-1) y(k-2) u(k-l) u(k-2)] of the state model

,

(2) are given by

1 0 0

c

-

-i

[i

1

2]

The teachability matrix is

U = [b

Ab

A2b

The objective

A3b] = 0

0

1

0

is to design the adaptive

closed-loop poles are at z=0,0. parameter vector 8

which accomplishes

c

8T = ~U-IA4

c

=

controller so that the

By Ackermann's

the controller

this is given by

[ 2

1

-7

-7

AS discussed in the previous section,

formula,

1

2

(23)

7]

the adaptive

controller

is realized by

setting u(k) = -~T(k) x(k) , (24) c where ~ (k) is the estimate of 8 generated from the RLS algorithm (19) and c c (20). Since w(k+l) depends on y(k-5) and u(k-5), to evaluate (19) and (20) for k>0 we need initial values of y(k)

for k=-5,-4 and initial values of u(k)

for k=-5,-4,-3,-2,-i.

(20) and the control law

Equations

(19),

(24) were

p r o g r a m m e d with the initial conditions y(-5) u(-5)

- 0, u(-4)

= S, y(-4)

= 0, u(-3) P0 = ~(0)

It should be n o t e d that u(-3) (Since ~(0)=0,

if u(-3)=0 by

The computer Figures

8 . c

- 0, u(-l)

= 0

(104) I =

o

was set equal to 1 to insure that ~(i)~0. (19) we see that ~(k)=0

implementation

1 and 2, respectively.

converge to zero.

= 0

- I, u(-2)

for all k>0.)

resulted in the control and output shown in Note that both the control and ~he output

Shown in Figure 3 is the estimate of the third element

This estimate does converge to the correct

value

(see

(23)).

The

of

663 e3timates

of t h e other e n t r i e s

Thus the c l o s e d - l o o p desired

poles

of e also c o n v e r g e to the correct c are p l a c e d (after an initial t r a n s i e n t

values. at the

locations. ACKNOWLEDGEMENT

The a u t h o r

wishes

tO the work p r e s e n t e d relevance

to t h a n k John C h i a s s o n in this paper,

of A c k e r m a n n ' s

for s e v e r a l

a n d in p a r t i c u l a r

discusslons

for p o i n t i n g

relating

out the

formula.

REFERENCES [i]

H. Elliott, "Direct A d a p t i v e P o l e P l a c e m e n t with A p p l i c a t i o n to N o n m i n i m u m Phase Systems," IEEE Trans. A u t o m a t i c Control, Vol. AC-27, 720-722, 1982.

[2]

G.C. G o o d w i n and K.S. Sin, A d a p t i v e Filtering, P r e n t i c e Hall, E n g l e w o o d Cliffs, NJ, 1984.

[3]

T.K.

[4]

E.W. Kamen, Convergence Processing,

Kailath,

Linear Systems,

Prentice-Hall,

Prediction

Englewood

and Control,

Cliffs,

NJ,

1980.

"A R e c u r s i v e P a r a m e t e r E s t i m a t o r Y i e l d i n g E x p o n e n t i a l u n d e r S u f f i c i e n t E x c i t a t i o n , " Circuits, Systems a n d S i g n a l Vol. 8, No. 2, pp. 207-228, 1989.

y (k)

8

u (k) '

1.2

6 9

~

4

0.4

2

. . . . . . . . . . . . e~--~ k 9 o9 -0.4 ) -0.0 "o -i. 2 9

Figure i. The control signal.

:=::: -2

......... 6 ......

9

Figure 2. The output response.

1.0~ 0.8 0.6 0.4 0.2'

.99 "::-.-e.

00,0,00

00000 4 ...................

Figure 3. Estimate of third entry of 8c.

pp.

k

Pole Placement via Generalized Predictive Control Youbin PENG"and Raymond HANUS Laboratoire d'Automatique, C.P.165,'Universit~ Libre de Bruxelles 50, Avenue F.D. Roosevelt, 1050-Brussels, Belgium Abstract: T h i s paper presents a tuning strategy for generalized predictive control(GPC). I t is shown that the GPCdesign can be made to be equivalent to the usual pole-placement technique by the introduction of suitable f i l t e r s and the selection of particular horizons. Hence, the desired closed-loop behaviour can be obtained and the s t a b i l i t y of the closed-loop system is guaranteed. The resulting GPC proves to be robust even i f overparameterization or near pole-zero cancellation is encountered. Two simulation examples i l l u s t r a t e the good performances of the proposed algorithm.

I. Introduction The basic idea behind long-range predictive controllers, is to build a predictive model of the plant over a range which is larger than the plant delay and to use this model in the control law design. Several long-range predictive control designs have been developed by many authors, for instance, Mosca et al.(1984),De Keyser et ai.(1985) and Clarke et ai.(1985). In particular, the generalized predictive control(GPC) approach derived in Clarke et ai.(1985) is able to handle control problems such as unknown and possibly variable time-delay, non-minimum phase behaviour and plant-model mismatch. GPC is essentially an algorithm in which the output y(t) is predicted over f i n i t e horizon and a sequence of control signals are calculated to minimize a quadratic cost function subject to an assumption made about the future control increments. I t offers several possible tuning knobs such as costing horizons and control horizon. However, there is no straightforward relationship between the cost-function used in GPC and the characteristic polynomial of the resulting closed-loop system. Usually, the desired dynamic behaviour can be approached by introducing suitable design f i l t e r s or polynomials. In particular, Gorez et ai.(1987) imposes the desired closed-loop poles by f i l t e r i n g the input, output and reference signals and using the f i l t e r e d signals in the cost function. The f i l t e r poles coincide with the desired closed-loop poles. However, there is s t i l l one part of the poles of the closed-loop system which depend on the horizons of GPC. These poles are not necessarily inside the unit circle, and the s t a b i l i t y of the closed-loop system is not always guaranteed. Here, we show that these poles can be assigned at the origin of the q-plane by the selection of particular Un leave from the Department of Automatic Control and Computer Engineering, Huazhong University of Sciences and Technology, Wuhan, China

665 horizons of GPC without influencing the poles assigned with the f i l t e r i n g process. In this case, the GPC design is equivalent to the usual pole placement technique. The major theoretical and algorithmic problem of the pole placement technique lies in the solution of the Diophantine equation. For instance, i f the system model equation is overparameterized, then a common factor between the numerator and denominator polynomials of the system transfer function w i l l certainly occur and the Sylvester resultant determinant w i l l be n u l l . In this case, the algorithm is ill-conditioned. When GPC is made to be equivalent to the pole placement, the same ill-conditioning problem w i l l occur i f overparametrization is encountered and there is no precaution. However, i f we use the recursive matrix inversion algorithm proposed by Favier(1987) in GPC, such drawback can be avoided by simply tuning the control horizon. Clarke et al.(1987a,b) and Mclntosh et ai.(1989) also derived conditions on the horizons of GPC so as to assigne the poles of closed-loop system at the origin of the q-plane. However, their conditions are less general. Furthermore, overparameterization can not be tolerated when a deadbeat control is achieved. This paper is organized as follows. Section I I defines the modelling assumptions and predictive model. Section I l l recalls the generalized predictive controller. Section IV investigates the closed-loop performances. Section V presents a new tuning strategy of GPC. Section VI shows the simulation results, and conclusions are given in section VII.

II. Modelling Assumptions and Predictive Model Consider a single-input single-output(SISO) discrete linear time-invariant system described by the following CARIMAmodel:

A(q~)y(t) =B(q") u (t)+C (q") ~ ( t ) / &

2. I

where { y ( t ) } and { u ( t ) } are sequences of outputs and inputs respectively. { ~ ( t ) } is a sequence of random variable with E{~(t)/F,.~)=O, E{~(t)2/F,.~):o ~ and F,., is the sigma algebra generated by {y(t-1) . . . . . y(O)} together with i n i t i a l conditions. A:l-q" is the differencing operator. A(q"), B(q") and C(q") are polynomials of degrees n,, nb and no in the backward s h i f t operator q": A(q"):l+a,q% ... +ao.q''~ B(q") = b,q"+ . . . +b,bq~b C(q'1)=1+c,q"+ . . . +c=q~ The optimal j-step-ahead predictor is given as follows. See Goodwin and Sin(1984). Theorem 1: For the system described by (2.1), given measured input and output data up to time t and any given u(t+j) for j>O, the optimal j-step-ahead predictor y ' ( t + j / t ) satisfies

C(q")y" ( t + j / t ) =Fj(q")y (t) +Ei(q")B(q") Au(t+j)

2.2

where

y" (t+j/t)=ECY(t+j)/F,}=y (t+j) - Ej{q") ~(t+j)

2.3

and

C(q") :Ej(q") A(q")+q JFj(q' )

2.4

666

where A(q"):AA(q"):I+A,q"+...+A~q ~, ni:n,+l, and Ej(q"):eo+e,q"+...+eF~qJ+', Fj(q")=~+~.,q"+...+~q', n,=n,, also,

I-~

E{ [y(t+j) -y* (t+j/t) ]2)=E{E[Ej(q")~ (t+j) ]~F,)= Z e,2o2

2.5

VVV

I-0

For the predictor (2.2) to be optimal for all t, i t is necessary that the initial conditions for y ~ be appropriately chosen. However, since C(q") is asymptotically stable, the effect of arbitrary i n i t i a l conditions will diminish exponentially. Thus we can simply define a prediction 9(t+j) with any i n i t i a l conditions as C(q-1))(t+j)=Fj(q~)y(t)+Ej(q1)B(q-')Au(t+j)

2.6

In the sequel, we will replace y * ( t + j / j ) by ) ( t + j ) . For j~k(k is the system time-delay), the prediction 9(t+j) depends entirely on available data, but for j>k, assumptions need to be made about the future control actions. It is generally assumed that the controls are to be performed in open-loop by ignoring the future noise sequence {~(t+j)}. III. G e n e r a l i z e d P r e d i c t i v e C o n t r o l

The generalized predictive controller(GPC) minimizes a multi-stage cost function of the form: N2 N2 J=E{Z [y(t+j)-w(t+j)]%>,Y-~u(t+j-l) 2} 3. I l-N1

1-1

where N, i s the minimum c o s t i n g horizon, N2 i s the maximum c o s t i n g horizon and ;~ i s a

nonnegative weighting factor, {w(t)} is the sequence of the reference signal. The expectation in (3.1) is conditioned on data up to time t assuming no future measurements are available. One can then ignore the future noise signals and use the prediction ~(t+j) to generate the control law. The prediction .~(t+j) can be separated in two parts as follows: Partioning Ej(q")B(q") according to Ej(q") B(q") =q"C(q") Gj(q-')+q+'Qj(q~)

3.2

where Gj(q")=go+g,q"+...+gj.,q-j§ Qj(q~)=qj.o+qj.lq"+...+qj.,qq"~, nq=nb-2 Substituting (3.2) in (2.6) leads to ~ (t+j)-Gj(q") Au(t+j- l)+~(t+j/t)

3.3

where ~(t+j/t) is the prediction of the output assuming there are no future changes in Au(t), i.e. C(q")) (t+j/t) :F~(q")y (t)+Qj(q")Au (t- I)

3.4

In the GPC, a control horizon Nu is introduced. That is, after an interval NuNu. Taking this hypothesis into account, the minimization of (3.1) with respect to the future auxiliary control sequence {Au(t) . . . . . Au(t+Nu-1)} yields

Or=[G~TG,+~I]"G,T(~-? (/t))

3.5

667 where

~r=[9(t+N,) . . . . . ~(t+N2) ] OT:[Au(t) . . . . . Au(t+Nu-I) ]

W%[w(t+N,). . . . . w(t+N2) ] ~(/t)r= [~(t+N,/t) . . . . . 9(t+NJt)]

3.6

and a (N2-N,+I)xN~ matrix

I gN,-, "'" gm-Nu gm 9gNI-Nu+I GI = 9 gN2-, " g:Nu

3.7

in which gj=O i f j,I]"G,r as KT=[KN,. . . . . KN2] and using (3.3) and (3.4) results in R(q") u(t)=T(q" )w(t )-S (q')y (t) N~

4. I N2

where R(q")=[C(q")+~ KjqIQ~(q~)]D(q"), S(qI) = Z K~Fi(q"), T(q")= ~.=KjC(q") I'N1

J-N1

4.2

I-NI

Taking (2.4) and (3.2) into account, the closed-loop polynomial can be written as

A'(qi):C(q ') [A(q")+q" N~ Kjqj(B' (q") -A(q")Gj(q") ]

4.3

J-N|

Hence, the closed-loop system has the poles corresponding to the roots of C(q") and that of Am(q")=A(q")+q' ~2KJqj(B' (q")-A(q")Gj(q"))

4.4

l-N1

By modelling assumption, C(q") is asymptotically stable. However, Am(q") is not always stable and more analysis must be undertaken. We note that Am(q") is depending on the choice of N~, N2, Nu and ~. In fact, i t is not a simple task to adjust the weighting factor X in such a way that Am(q") is stable. Here, we only discuss the case when ~=0. I f ~ is increased, the controller is progressively detuned. However the

668 closed-loop behaviour caused by non null I ,

if

it

is good and known, can be

alternatively obtained by introducing suitable design f i l t e r . This w i l l be discussed at the end of this subsection. As the calculation of K involves the inverse of matrix G~TG~, the rank of G, should be equal to N,. We now show when this condition is satisfied. The proof can be found in Peng(1990). Theorem 2: Two polynomials ~(q") and B(q") of order na and nb are r e l a t i v e l y prime i f and only i f rank(G,)=N, with either of a)N,>%, N2>_n~+N,-I, N.:nl; b)N,=%, N2>Nu+nb-1, Nu>nA. VVV Now, l e t us show when A'(q ~) is reduced to C(q"). Theorem 3: For the GPC approach described above, i f A(q") and B(q") are r e l a t i v e l y prime and i f N,>nb, N2>_%+N,-I, No=%, >.=0, or N,=%, N2>_Nu+n:-l, N,>%, >.:0, the closedloop c h a r a c t e r i s t i c equation (4.3) is reduced to A'(q")=C(q"). Proof: From theorem 2, G,TG, is i n v e r s i b l e . Let

~ Kjq~B' (q"):B~

"~. KiqJA(q-')G1(q"):A.(q~)+a,(q)

|-NI

4.5

i-~1

where B.(q") and A.(q") are polynomials without positive powers of q, Bp(q) and Ap(q) are polynomials with positive powers of q only. From the fact that the f i r s t j terms of Gj(q~) are the f i r s t j terms of the impulse response of B'(q;')/A(q'~), we deduce that Ap(q)=Bp(q). Hence, (4.4) can be rewritten as Am(q')=A (q") - q"A. (q") +q~B. (q '' )

4.6

A.(q") can be derived from (4.5) as N2

A.(q")= Z KjqJA(q")G~(q")-A.(q) J-N1 N2

= ~. Kj[(gF,~,+...+g~.~)+.. "+ (gF,a.A-,+gF2aJ - q~.2+ gH~.Aq-.~+,]

4.7

I'NI

Noting that KTG,=[I N2

0.....

0], we have

N2

Kjgj.,=1,

~ Kjg~=O, i=2 . . . . ,N.

4.8

I f Nu>nl, we can substitute the f i r s t nA equations of (4.8) into (4.7). This yields A.(q')=q(A(q")-l). Moreover, i f N,_>%, we have B~

Hence, for the values of N,, N2

and N, mentioned in the theorem statement, A~(q")=1 and A'(q")=C(q~).

W/V

I f C(q") is taken as unit, which is often the case of adaptive control, the selection of particular horizons described in theorem 3 w i l l lead to a deadbeat control law(i.e. A'(q')=1). In this case, the system output has generally a large over-shoot to reference and disturbance changes. In order to penalize this over-shoot, a desired characteristic polynomial can be assigned. This is the case of the pole placement control I er. The GPC algorithm can be easily adapted to provide a standard pole placement controller by considering the augmented system: A(q")y,(t) =B(q")Au,(t) +P(q") C(q~)~(t)

4 .g

where y,(t)=P(q")y(t), u,(t}:P(q")u(t) and P(q~) is a suitable polynomial.

669

We can derive the control law by minimizing the cost function which uses the f i l t e r e d signals: N2

N2

J=E{~ (y,(t+j) -w,(t+j) )2+lT_~u,(t+j- I )2) J-N1

4.10

l-I

Since the role of P(q")C(q") in the augmented system is similar to that of C(q") in original system, all the subsequent results of the original GPC s t i l l hold only with C(q") replaced by P(q")C(q") and y ( t ) , u(t) and w(t) replaced by y,(t), u,(t) and w,(t). However, the control law is s t i l l valid for y ( t ) , u(t) and w(t) since the common factor P(q") may be dropped in both sides of (4.1). Hence, the closed-loop characteristic equation is A" (q")=P (q'~)C (q") Am(q ~)

4.11

where Am(q" ) is the same polynomial as t h a t defined by (4.4). We can choose P(q") so as to assign part of the closed-loop poles to a desired l o c a t i o n . The closed-loop system has also the poles of Am(q" ) which is independent of P(q')C(q"). However, i f we take N,>nb, N~>n~+N,-I, No:nA, I=0, or N~:%, N=>N,+%-I, N,>nA, X=O(i.e. Am(q':)=1), the closed-loop system has only the poles of P(q')C(q") and poles at the o r i g i n of the q-plane. In t h i s case, the GPC design is equivalent to a standard pole placement c o n t r o l l e r where C(q") is the optimal observer polynomial. I t is c l e a r t h a t more general alternatively

Am(q") caused by non n u l l

X, i f

it

is good and known, can be

obtained by introducing i t to P(q").

V. Tuning Strategy of GPC The condition required when solving the Diophantine equation for a pole placement controller, is that A(q') and B(q~) are relatively prime. Here, we see from theorem 2 that this condition is also necessary for assuring the i n v e r t i b i l i t y of G,TGI. However, we can avoid the singularity of GITG~by simply ruducing the control horizon N. until G,TG, is regular. To this end, we use the recursive algorithm for inverting G,TG~with respect to N~. This recursive algorithm was f i r s t proposed by Favier(1987) in order to decrease the computation time of GPC. Here, i t is also used for avoiding the singularity problem of the matrix inversion. We summarize the idea as follows: The matrix M=G,TG, has the following particular form: N2-1

N2-1

I-NI-I

i-NI-I

7. g Z

N2-1

Z g,.,g,

M =

T. g,g,.,

N2-1

...

N2-1

Z g,.2

X g~g~-.u+,

I-NI*I N2-1

...

~' g~-Ig~N.*,

I-NI-1

I-N1-1

I-N1-1

N2-1

N2-1

N2-1

Z g~N.+,g, i=Nl-I

or in a concise form

2 g~.u.,g~, --i-N1-1

~ g~,=.2 I-N1-1

5.1

670

i M, : 9 : M :

with

5.2

MN~ 1

.2-1 F MH M, =,-NI-17" g2, MN:M and M'=|'NL"L

N2-1

with

M2

.........

N2-I

N,=[ -~a igH+'g} . . . . . j i-

N2-1

~" gH+'gH+2 ] ' LI=N'T and ~,= T. gj_i+,2

I-NI-I

where Mr, is a ( i - 1 ) x ( i - 1 ) matrix and e, is a scalar.

J-NH

matrix,

L, is a ( i - 1 ) x l

column matrix, N, is a I x ( i - l )

By using the block matrix inversion f o r m u l a [ K a i l a t h , algorithm f o r i n v e r t i n g M.

row

1980], we get the recursive

N2-1

I . Calculate MI'= [ 7. g2 ].i; I-NI-I

2. For i : 2 1 . . . , N , ,

calculate M,I by using the formula M,.,"+M,.II L~N~M,.I-,/#' _M,.I-,LJ#,"

M,I :

5.3 -NjM,.,"/#,

1/#,

where #h=e,-N,M,.I~L~ is known as the Schur complement of Mrl. The above recursive algorithm is v a l i d i f and only i f M,I e x i s t s f o r i : l , . . . , N

u.

In f a c t , M,~ e x i s t s i f and only i f MHI e x i s t s and #,~0. Hence, we can f i n d the maximum achievable value of Nu by examining when #N.+, is n u l l . We now present a tuning strategy of GPC: I . Choose P(q17 as the desired c h a r a c t e r i s t i c polynomial and use the augmented system (4.97 to derive the control law by minimizing the cost f u n c t i o n (4.107. 2. Assign the f o l l o w i n g values to the design parameters: NI>%,

N2>%+N,-I,

3.N, is set as f o l l o w s : replaced by N,+I.

~=0

5.4

Select N, so that #N. is non n u l l but ceases to be i f N, is

For p r a c t i c a l implementation, we require R(q17, S(q I ) to have bounded c o e f f i c i e n t s and hence near s i n g u l a r i t y of M must also be avoided. This is necessary f o r numerical

stability. To avoid near singularity of M, we proposed to stop step 3 when #,.+, is smaller than a lower bound EMI, where ~ is a small positive number for numerical robustness. Remark: From theorem 2, we have some interesting results when N,>%, N2_>%+NI-I and ~,=0: i) I f A(q" 7 and B(q") are relatively prime, M,~I exists (i.e. GI has full rank) i f and only i f N. (Feasible control set) Given set XcR n and a time stage k, a set of feasible controls at the k-th time stage for given X is defined as

uok(x) ~ {~cuk: x~xok(,~,~+ 1 . . . .

,%_~), ~+~uk+ 1 . . . .

If a given control law u(.) is feasible, that is u ( - ) C ~ f UDk(~)

(~9)

u~_lcuN_ 1] , then U k ( % )

g

for every q s Q. We shall also introduce a set of estimation type of notions

we have introduced so far by defining prediction ~ + 1 ( % , U k )

of measurement ~ + I

the k+1-th time stage which is consistent with information available at the k-th stage in a form of (%,Uk).

at

680 Def.6

(Measurement prediction)

zk+l(%, ~ ) ~= (Zk+ 1 a s 9 ( satisfy

the state

qeQ) (Xo,Wo,W1 , . ,-

wk ,

Vl,V 2 . . . .

Vk+l,~,UkJ

(20)

a n d measurement e q u a t i o n s f o r i r O,k .

We a r e now i n a p o s i t i o n

to f o r m u l a t e t h e main r e s u l t .

Theorem 1 An o p t i m a l v a l u e F o f t h e s y s t e m p e r f o r m a n c e i n d e x (10) i s g i v e n by :

= min

max

Uo~Uo

[g ( z . , u )

ZIBZl(Uo).

o

I

The f u n c t i o n H l ( Z l , U o) i s g i v e n by t h e l a s t

~(~) subject

where

to

= rain

step of the recursive

max [gk(Zk+l,Uk) + ~ + l ( ~ k , Z k + l , U k ) ] ~+1

uk r

Un_ 1

max Zn,X n

, for k ~ 1,N-2

(22)

(23)

[gN(ZN,UN_l) + f(xN) ]

subject to UN_I~UDN_I(X(~N_I)) , ~eZN(~N_I,UN_I),

If

procedure

),

, and Z k + l r

~_1(~N_1) = rain

XNr

(21)

+ Hl(Zl,Uo)]

o

: XN = fN-1 'UN-1 'WN-1)C

and (24)

~-I~X(~N-1)'WN-lCWN-1 ] "

uA(") is the CDP solution, then the following hold uAo = Arg

for every

min[ UoeUo

max

(25)

[go(Zl,Uo) + Hl(Zl,Uo) ]] ;

ZleZl(Uo )

%, Uk(~k) = Arg

subject to (22), for kcl,N-2

minimax

J

(26)

+ f(~)] ]

(27)

[gk(Zk+l,Uk) + ~ + l ( ~ , Z k + l , U k ) ]

,

and for every ~N-I' ~-1(~N-I ) = Arg

rain [ max Un_ 1 ~ ' ~

[gN(~,~_l)

subject to (24) A proof of the theorem is given in [4 ] A structure of the set Q (see Eq.4), Def.2 and Def.6 imply that

~ + I ( ~ % ) = [zk+1~s: ( ~ x ( ~ )

A ( 3wk~Wk) ^

( ~ Vk+leVk)Zk+ 1 = ~ + l ( X k + l ,Vk+ 1 ), where Xk+ 1 = fk+l (Xk'Uk'~k)] Therefore, Eqn.(20) can be w r i t t e n in the f o l l o w i n g form :

(28)

681

z.,k+~(~k,,.,k) = Zk+l(X(~),~)

(29)

Let us notice now that due to (29) the equation and inclusions (21) .. (27) do not depend explicitly on ~k; the information,vector ~k affects them only through that state estimation set X ( ~ ) .

We shall formulate this fact in terms of a sufficient

information function [2]. Corollary I The mapping ~

~ X(~)

is the sufficiently informative function for the CDP.

The Theorem and Corollary I together constitute the separation principle. The principle says that at any time stage k the closed-loop minimax control signal can be generated in the two following steps : first, the state estimate X ( ~ )

is pro-

duced and then based on this estimate a deterministic minimax control problem is solved and the control signal ~ ( ~ )

is obtained.

It is not difficult to show that the separation principle doesn't hold if the k-th time stage cost function depends explicitly on xk. Therefore, it is not applicable if

N-I

F (z,~,u)

= F(x 1 . . . .

xN,u 1 . . . .

UN_1) = ~ g k ( x k , ~ ) k=0

+ f ( x N)

It is well known that for such strucfures of the performance indices the separation principle is applicable in the case of stochastic systems. This is an important difference between stochastic and set-bounded modelling of uncertainty in optimal control of dynamic uncertain systems. An applicability of a separation principle can be achieved by introducing a new state variable Xn+ I in a standard way [2] Xn+1,k+ 1 = Xn+l, k + gk(Xk,Uk), with initial condition Xn+l, ~ = 0 The performance index then becomes equal to Xn+1, N and the Corollary 1 and consequently the separation principle is applicable to the extended system. Another interesting problem arises where some of the components of the vector Wk, k ~ O,N-I are constant but unknown parameters. Let us denote the corresponding vector of parameters by' a. Clearly, there is now a time structure in uncertainty associated with the state equations ~id the previous model is no longer valid. Again, by introducing new state variables in a form of the parameters, the applicability of the theory presented in the paper is achieved where the additional state equations are as follows : ak+ I = ak with an initial condition a ~ = a.

The exact solution of

the problem in this case and some suboptimal schemes in a form of adaptive algorithms will be reported elsewhere. 5.

Perfect State Information. If a state vector is available we have a qualitatively simpler case. We do not

now need to define a safe tube through the family [ ~ k ]. As previously, we define

682 recursively the following sets: XD k and

= xz ~

XD N = ~

[~n

(3~Uk)fk(~,~,Wk)C_XDk+1 ] for k ~ 0,N-I

(3O)

.

Clearly, if the state vector belongs to the set XD k then there exists at the k-th time stage control

uk

which independently on uncertainty realization guarantees that

X k + 1 ~ k + I and the same applies for k+1~ ..., N-I. The feasible control set at the k-th stage has the form :

~k(x) : [ ~ U k : fk(x,~,Wk)c_~k+1~ and the set is non-empty if

xgXD k .

The formulae (21) .. (24) are now simplified to where the operation max is taken with respect to the variable wk subject to WkeW k and where the state estimates X({ k) do not occur. For example, ~ ( x k) :

subject to

min uk

Ukr

and t h e r e s u l t i n g 6.

max wk

the formulae (22) and (23) are now as follows :

[gk(xk+q,Uk) + Gk+1(fk(Xk,Uk,Wk)) ]

k) and Wkr k , A A o p t i m a l c o n t r o l law i s a f u n c t i o n o n l y o f Xk, t h a t i s uk = Uk(X~).

Conclusions A problem of design of closed-loop minimax control strategies for state-

constrained dynamical discrete-time systems, with uncertain initial conditions, parameters (possibly time-varying) and exogenous input signals, and with noisy measurements has been considered. The uncertainty has been assumed to be modelled through bounds which define fixed sets within which lie unknown constants (certain of the parameters and initial conditions) and tubes for time-varying quantities (other parameters, plant disturbances and measurement noise).

A DP type recursive procedure has

been derived to design the control rules. It has been shown that the separation principle holds for .the class of problems considered in the paper. Based on applicability conditions of the separation principle and the conditions for the separation principle derived for stochastic problems, an important difference between deterministic set-bounded and stochastic modelling of uncertainty has been pointed out. 7. [1]

References D.P.Bertsekas and l.B.Rhodes (1971). On the minimax reachability of target sets and target tubes. Automatica, Vol.7, pp 233-247.

[2]

D.P.Bertsekas and I.B.Rhodes (1973). Sufficiently informative functions and the minimax feedback control of uncertain dynamic systems. AC-18, No.2, pp 117-124.

IP;~;~;Trans.on Aut.Gontr.,

683 ~]

M.A.Brdys and S.Nowacki(1982). Design of safe controls and feasible control rules for uncertain dynamical systems under state constraints with applications to water systems. Technical Report, Institute of Automatic Control, Warsaw University of Technology (in Polish).

[4]

M.A.Brdys and B.Ulanicki (1989). Separation principle in optimizing control of state-constrained dynamical systems under bounded uncertainty. Research Memorandum No.20, School of Electronic & Electrical Engineering, University of Birmingham.

[5]

J.D.Glover and F.C.Schweppe (1971). Control of linear dynamic systems with set constrained disturbances. IEEE Trans.on Aut.Contr., AC-16, No.5, pp 411-423.

[6]

W.Findeisen (1982). Some definitions and notions of feasible control. Unpublished notes. Institute of Automatic Control, Warsaw University of Technology.

[7]

A.B.Kurshanski and I.Valyi (1988). Set values solutions to control problems and their applications. In 'Lecture Notes in Control and Information Sciences INRIA'. A.Bensoussan and J.L.Lions (Editors). Analysis and Optimisation of Systems. Springer-Verlag, pp 987-988.

[8]

A.B.Kurzhanski (1988). Identification - A Theory of Guaranteed Estimates. Working Paper. International Institute for Applied System Analysis. Laxenburg, Austria.

[9]

J.P.Norton

(1987). Identification and application of bounded parameter models.

Automatica, 23, pp 497-507. [10]

F.C.Schweppe (1968). Recursive state estimation : Unknown but bounded

errors

and system inputs. I~;~;~iTrans.Automat.Contr., Vo.AC-13, pp 22-28. [11]

A.J-Subbotin and A.G.Chencow (1981)- Optimization of Guarantee in Control Problems. Nauka (in Russian).

UNI~?CATION OF

5ONE

HI~*IO ADAPTIVE

CONTROL

ALt~ORI~I~ISAND GLOBAL CONVERGENCE ANALYSIS

T i a n You C h a i Department of Automatic Control, Northeast University o f T e c h n o l o g y , S h e n y m l g 1 1 0 0 0 6 , L i a o n i n g , P. R. C h i n a

Su~taary This

paper

schastic some

presents

multivariable

HIMO a d a p t i v e

cases.

the general

control

structures

wide variety

of adaptive

viewpoints.

tile

adaptive

adaptive

control.

~daptive

For t h i s

scheme

approaches

arc

special

schcxae u s i n g a m o d i f i e d

control

theorist

which includes

(1980),

Allidina

and

has been significant Dugard

control,

and

arbitrary

peper,

other

progress

Dion

schemes and extensions

this

control

effort

least

adaptive

a

in extending

(1985) pointed open,

out

form

many

one of challenges

a unifying

view

of

has been devoted to determining

adaptive

schc~es and presenting

and self-tuning

SISO d e s i g n s that

in particular,

in

results

albeit

in

this

the fact

t o t h e Mlr~} c a s e .

multivariable

global

a

schemes in Egardit

Ilewcver,

systems have not been given yet,

some p r o b l e m s a r c s t i l l

In

out that

Hughes ( 1 9 8 3 ) a n d Chai ( i 9 f f 7 ) .

for multivarlable

addition,

have been developed

i s Cite p r o b l e m o f p r o v i d i n g

between the seemingly disparate

algorithm

direction

algorithms

Ljung and Anderson (1984) pointed

general

certain

by s p e c i f i c

.

linear

Introduction

control

F o r 5150 a d a p t i v e

tile relationship

tbcrc

matrix

control

algoriClu~s is also given.

different to

scheme which can

interactor

dcrived

convergence of this

1. A

adaptive

systems with arbitrary

A proof of global

squares

direct

stability

that In

adaptive analysis

of

o f known s c h e m e s c a n a n d m u s t b e c o n s i d e r e d .

general

adaptive

s c h c m e s f o r Nlr~0

systems

interactor matrix is presented and the relationship bctwocn

having

the

the

scheme

proposed and other scheraes developed by specific approches is discussed. The global convergenoa analysis for the sche~ao will be established.

685 2 , General Adaptive Scheme Consider the ~It~ system described by the linemr vector difference equation A(z-' )y(t)=B(z" ' )u(t)+B, (z")v(t)+C(z- ' )w(t) where

u(t),

(2.1)

y(t) and v(t) arc n-vectors defining the system input output

and measurable disturbance vector respectively,

and A,

matrices in unit delay operator z-' such that A(O)=I, (I

is the (nxn) identity matrix).

B, ,

B(O)=O,

C are

polynomial

B, (0)=0, and C(O)=I

The nxl disturbance sequence w(t) is assumed to

be a real stochastic p r o c e s s defined on a probability the

B,

vectors

D-- algebra generated by the observations

space (l),~

,

F). Ft denotes

up to and including time t.

It

is

a s s u m e d that E(w(t)/Ft - ,)=0

a.s.

E(w(t ) w ( t ) ' / F t - , ) = ~ , with trace ~ = T

Under quadratic

of

+

these

llm n~ ~

minimize

J(u)

T

V x ~ H.

T,n

we

consider

the

followin~

linear-

problem:

~

*

lu(t)!

}dr

*

2 over

all

Vn { N ,

~ P T,n*1

u s N

(O,T;U), w

subject

to

the

state

equation

(T),y(T)>

696 (1.2)

Here

y(t)

w =

=

I

(w

probabillty

+

wm)

... space

expectation. ~s H and

S(t)~

(~,~,p),

Equation

(1.3)

E O such

c(~)

condition ~ T;

~

/

C&(CO,T);L(H,D(A*~))),

that

(T-t) K (2.2),

and

0 in

the

! t sense

< T; that

P(t)

~ PT

there

698 jT i[i)

'2

Vx 6 H,

IG*A~-P(t)S(t-s)xl

ds

is finir.e

0 iv)

PC.)

satisfies

(2.8)

P(t)

n

(t)]x,x> conver~es

uniformly

continuous

and

v)

convergence

of

of

of

1 hold

the

part

to P(-)x.

This

also

t=T.

Theorem

result

property

at

true.

previous

ill),

we

proposition,

alon~

readily

obtain

part

convergence

stated

in

(2.20).

Nhen Corollary of

(2.20),

iv)

Integrabillty

from

proof

the

P~(.)x

is s t r o n g l y

Parts

Usin~

the

that

P(.)

6.

=

n

follows

COROLLARY

that

n

n

that

of the continuous

this

n

finally

]S(s-t.)ds-

1/2

(t)]xl IP(t)-P

implies

J

P(s)D

inequality

n

if

have

function

2

IEP(t)-P

J

inequality.

every

fixed

D:t

j=1

monotone

a theorem From

from

for

pointwise

By

s

S(~-t):=[M

t

follows

steps

.

and

jT

*

A -t

is c o m p a c t ,

3.

Usin~

Corollary as

this 4 to

claimed

is ~ t a n d a r d

3.

SOLUTION OF THE L - Q - R the

obtain

hypotheses

the

convergence

in p a r t

to

have

obtain

It

Nith

we

v)

the

an of

stron~ one

can

identity the

(more

Theorem.

differential

easily)

in p l a c e

of

Finally,

RIccatl

repeat

the

from this

equation

the

inequality

(2.1).

PROBLEN and

notations

introduced

in s e c t i o n

1.2,

we

have: THEOREN J(u)

7.

over

(3.1) where

Given

u*(t) P

=

Is ~ i v e n

correspondln~ (3.2)

to

~tep

d

(u) n

1.

= E

and

there u*

u~.

Let

exists

a unique

is c h a r a c t e r i z e d ,

-G:=A~P(t)y*(t)

by Theorem

P~oo~.

x E H,

N~(O,T;U).,

1, a n d

control

by

the

u :t m i n i m i z i n g

feedback

formula

t ~ [O,T),

y:r i s t h e

solutlon

of

(I.2)

Moreover, =

J(u*),

J~(u)

be

the

<My(t),y(t)>

cost

*

functional

lu(t)l

)dr

defined

+

E. T n

in

identity

702 Equat, i o n of

(2-3)

is t h ~

min[m[z[n~

gqua~lon Since

Jn(U)

(1.2).

J(u)

Riccati

over

Thus

equation

all

< J(u),

we

correspondinK

u 6 ~(O,T;U),

the

[bmtn~bmax

=

the

the

states

Obviously,

Function

reach

curve

bmax.

D

a selection

SELECTION.

nature

B value.

Fig. oF

2).

the

720 The p r e s e n t e d

instances

consecutively

chosen

will

steps

be u s e d

for

in a d a p t i v e

an o p e r a t i o n

o~

system

this

synthesis

as t h e

system.

ill

T

X

Fill. 2 l EXAMPLE 4 . 2 .

Assumptions:

w h i c h has a d e n s i t y ~orm N = [ b n , b 9

the set

mQx

b is

a random v a r i a b l e

E u n c t i o n ~; ]

C [O,K -

the support of

~],

"

I

> O,

d > O.

- c - (B -

simulations

[8]

I

and i s

(B)

L follows

+ b m~x

E

if

{-c.(B

(14)

o~ t h e

>: ~ i n t ( N ) ;

)/2.

b d O,

i~ B -

b ~ O,

from p r a c t i c a l Taking

-

into

bm ~ x ) , d . ( B

together

be g i v e n

1 [bmtn'bm~x

c

B -

in

L(B,b) of

point

(B)

with

(7b)

b y B = bm~ n , = I

in

~J

(b

mt

--

n

description.

numerical of

view.

account the form o f

the mini-max loss Punction w i l l

inf B

if

f r o m some r e s u l t s

strate~.

= sup

c < d t h e n Prom

Similarly~

b)

recommended

function

mini-max d e c i s i o n w i l l

mLn

> 0 if

(13>

nO Appl~cat~o~ o / M[ns density

b)

Any o t h e r f o r m u l a can be used

The a b o v e s i m p l e f o r m o f

(b

m~x

~(x)

oG

= d-(B -

If

mtn

;

Eunction is

we d e ~ i n e by:

L i B , b)

support

 d t h e n B = bm ~ x .

Instead,

the

be g i v e n by:

by c = d,

it

~ollows B =

721 bD A p p L ~ c u ~ o ~ o~ Ba~es struLe~y. T h e by

(Sa)

will

take

here

the

function

o~ the

Bayes

loss

given

~orm: b

B

ib(B)

=

m~•

I

d- ( B - b ) ~

from

(16)

b)db

- I m

b After

calculating

i

(B)

b

=

0

c- ( B - b ) ~ ( b I d b .

we

(16)

get:

B

Using value

(171

o~

B.

I

~(b)db

mL

n

we c a n

= c/(c

realize

Equation

(17)

+

the can

d).

(171

Bayes

be

strategy,

replaced

by

selecting

the

respective

equivalent

~ormula:

b ~ x

I

B

~(b)db /

I

B

B(b

~

~(b)db = dlc.

9

(18)

b

B:b

F;~.3 EXAMPLE

4.3.

Assumptions:

Let

the

distribution

uniGorm i n N = [bm~n'bmax] ~ [O,K. be g i v e n t h e r e f o r e x(b)

=

the

bmtn
1 ~ I i = T i . lim x i ( t ) e" r i t t-~oo

=

lim t-~

q>i ( t ) e" r i t

(7)

(8)

(g) (i0) (11) = O.

(12)

737

9 The

three policies

liable to

be used along the equilibrium path

are determined by relations (9)-(11) : ( i ) Policy 1 = no investment policy : (I i = O) : firms i does not invest ~ i t s capital stock is kept constant since no capital depreciation is allowed ; this policy has to be implemented along the equilibrium path when x i < 1. ( i i ) Policy2 : steady state policy : (x i = 1) : The capital stock is determined by the condition that its marginal value (measured by adjoint variable x i) equals the inves-

Policy3 : maximum investment policy : firm i invests at

tment cost. ( i i i )

the maximum level ~i ; capital stock increases ; i t has to be implemented along the equilibrium path only when xi > 1. A sequence of policies over the horizon [0,+~] w i l l be called a strategy 9 A duopoly regime is defined as a combination of policies ; regime ( k - l ) stands for firm i using policy k and firm 2 policy I. Clearly all the regimes encompassing policy

1 or 3 are determined by the open loop condi-

tions (obtained by dropping partial derivatives of investment rates in relations (7) and ( 8 ) ) , since the upper and lower bounds o f 11 and 12 are independent on Kz and Kz. Regime (2-2) d e f i n e s the steady s t a t e o f the game which n e c e s s a r i l y holds w i t h I I = 12 = O.Consequently, open loop and closed

loop

s o l u t i o n s coincide.

Accordingly,

the c h a r a c t e r i s t i c s

of the p o l i c i e s

are summarized in t a b l e i :

Policy

ii

I

0

feasibility conditions

equilibrium conditions

none

xi I

Table 1 : characteristics of the policies. - Regime (2-2) defines the Nash long run equilibrium point (KI(N),K2(N)) of the duopoly where KI(N) and K2 (N) are solutions of the system :

anl - r I ' aR2 - r 2 " ~K1

aK2

(13)

738

These relations define the classical reaction functions K2 = RI(K1) and K2 = R2(KI) associated with each firm. A perfect equilibrium path is determined by a sequence of duopoly regimes along which equilibrium conditions given in table I, transversality conditions (12) are satisfied. I t is quite clear that adjoint variables m~and~ have to be continuous along the equilibrium because of the regularity conditions of the problem and the absence of the

constraints on the state.

All these conditions ensure the unicity and

s t a b i l i t y of the equilibrium path as i t will be seen now. Let C be the

upper envelope of RI and R2. I t is quite clear that any equilibrium path starting above C will necessarily remain at the i n i t i a l point. Since there is no depreciation, steady state (KI(N),K2(N))is only on the equilibrium path when the i n i t i a l capital stocks are on line ZIN. I t remains then to characterize the equilibrium path for other i n i t i a l conditions. This will be done in determining the equilibrium path ending at some point (R1,~2) of C. The path is univocally defined by Proposition i and 2.

Proposition 1 : Let capital stocks ~l and kz such that ~(i >I KI(N) and 7{2 = R2(TCI). (i) There exists an unique perfect Nash equilibrium path of the game ending at point (~1,? 6.

753 We notice that ~,(Y) is equalizing : against it, any 7Us strategy gives the same value of ~ ( Y ) . More generaly this computation proves that all strategies satisfaying Ci.~ are equalizing. Computation

of V~_1(Y, z)

V~_t(Y, y - r) = r

M~,_,. ~,_~(Y, y - r) 0

( LIZ[)[y ] = r

0

P'(~):-

+

a,

i -- (2r -- i)# #

4:),it]

=

1 P~ i

1 ,r,-- + r m

=

P, (1G, + p,(1-~,_l(Y)(-r))

=

(--or, + p, + ~,_i(Y)(-r)(l -- p,)

=

p,+, + ~ , + ~ , _ ~ ( Y ) ( - ~ ) ,

+ ~,_l(Y)(-r)

m

and similarly :

Vt-l(Y,y+u) Vt_t(Y,y+i)

= p,+, +cr,+iqot-,(Y)(u) =

p,+,

for u E U fori=r+l

. . . . . 3r.

The proposition is therefore proved until time l = 1, that remains to deal with now. 3.2.3

timet=l=T-(T-1)

Let's notice that there is not flying bulle*, at this time. Game m a t r i x : M = pT_2(ll)~xm + O'T-2N2[O]. O p t i m a l i t y of Pl 1 M , l ( 0 ) = (pr-2 + --~T-2)(1)n ~Z hence (Ih(O) is an argument of :

mineez- P ' M ~ i ( O ) .

O p t i m a l i t y of r r

=

r

p~--~(~).x,,. + ~.r-~ N~[o] ) i

=

pr-,(~)",

m

hence qi(O) is an argument of :

maxoez~ ~i(O)'MQ

eayo~

and furthermore :

= ~(O)'M,~(o) =

PT-1

9

754

4

Conclusions

1- F e a s i b i l i t y of (A2.1) : The following simple strategy satisfies (A2.~) :

r

= (1/m,...,1/,,), rn

r

1 0

ifi=j otherwise.

foreachi,jE

{-2r,...,+2r}.

Observe that we have pure controls as soon as t _> 2. The fact that 7-{ used mixed control at t = 1 is enough to avoid giving more information to ~ . 2- F e a s i b i l i t y of (A2.2) : we have exhibited a ~ ' s strategy that satisfies (A2.2) for each r > 1, and whatever the duration of the game is. In the case r = 1, such a strategy exists only for a game duration, T, not superior to 5. 3- F e a s i b i l i t y of (A~) : We cannot raise a general method to study the existence of strategy satisfying (At). We have therefore showed that in case where r = 1, it is impossible to compute such a strategy as soon as T > 5 4- L o w e r b o u n d o f g a m e payoff: The impossibility to find (or nevertheIess to prove existence of), ~ ' s strategy satisfying (A1), doesn't make the above developpement useless. As a matter of fact, at each step of time, the ~ ' s chosen control being equMizing, the found value of the payoff exhibited is a lower bound when A1 cannot be satisfied. Futhermore we can see that this lower bound is swiftly increasing toward 1, and so the 7~'s strategy as not so bad.

Bibliography - [1]- T.Ba~ar, "On a class of zero-sum discrete games with delayed information", Lecture notes in control and information sciences 119. pp9-34, Springer, Berlin, t989. - [2]- P.Bernhard, "information and strategies in dynamic games 982, 1989.

", rapport

de recherche I.N.R.I.A.

- [3]- P.Bernhard et A.L.Colomb, "Saddle point conditions for class of stochastic dynamical games with imperfect information", IEEE trans, on Automatic control, AC 10-23, pp. 98-101, i98. - [4]- P.Bernhard, A.L.Colomb et G.Papavassilopoulos, "Rabbit and Hunter game : two discrete stochastic formulation", Comput. Math. Applic. 13, No 1-3, pp. 205-225, 1987. - [5]- A.L.Colomb, "l~tude de jeux k deux joueurs en information incomplete ", thb.se, Universi~5 de Provence, Marseille, France, 1986.

AN INTERACTIVE MULTIPLE CRITERIA DECISION SUPPORTING TOOL WITH APPLICATION TO A SIMPLIFIED REGIONAL DEVELOPMENT PROBLEM CAO DONG and M. INSTALLE Laboratoire d'Automatique, de Dynamique et d'Analyse des Systemes Bg.timent Maxwell, Place du Levant, 3 B-1348 Louvain-la-Neuve - BELGIUM

ABSTRACT In this paper, an interactive multiple criteria decision supporting algorithm is proposed which combines a Pareto solution determination method based on the reference point technique of Wierzbicki (1982) with the use of an interactively updated explicit utility function. This function which is recursively updated by the decision makers of the problem is used to represent their preferences among the various objectives of the problem to be solved. The maximization of the non linear utility function is done through an original procedure which works in a progressively enlarged optimization space of small dimension. This procedure makes the algorithm efficient even for large scale probiems. Finally, an application of the algorithm to a simplified regional development problem is presented.

INTRODUCTION The solution of regional developement problems asks usually for the consideration of many objectives among which some may be in conflict. Those objectives represent the interests of the actors or decision makers involved in the development of the region. In this paper, a decision supporting tool is proposed that implies - through an interactive procedure - those actors in a multiple criteria decision making (MCDM) procedure. In the past, interactive MCDM methods have been proposed which allow for the intervention of the DM in the course of the planning procedure in order to progressively identify their preference structure and find eventually a satisfycing compromise solution. Among those algorithms , the one developped by Rosenthal (1985) combines a Pareto solution determination method with the use of an explicit function which was developped by Keeney and Raiffa (1976) from global utility theory. In this algorithm, the explicit utility function is interactively and iteratively updated by the DM on the basis of the sequentially generated Pareto solutions and the associated trade-off informations. Hence, the practical problem of the "a priori" determination of the utility function representing the preference structure of the DM is avoided.

756 However, this algorithm suffers from an important drawback : since each time that the utility function has been updated, the corresponding Pareto set of decisions optimizing this utility function has to be determined, one has to solve a great number of non-linear constrained optimization problems. Hence, for practical dynamical problems like regional development ones with a constraints set of great order, the search for a satisficing compromise solution may be very time-consuming. In the second part of this paper, a new interactive algorithm is described which can be used for large scale MCDM problems with linear constraints and linear objective functions. At each iteration of this algorithm, an explicit utility function is first interactively updated through information given by the DM. This information consists in a minimum acceptable value and in a full satisficing value for each objective of the problem. Then, the maximization of this updated utility function provides a feasible solution vector in the criterion space. Finally, this solution vector is projected on the efficient surface through the reference point method developped by Wierzbicki (1982). The maximization of the utility function is done through an original optimization procedure. In the third part of this paper, the algorithm is applied to a simplified regional development planning problem. DESCRIPTION OF THE ALGORITHM A linear MCDM problem is represented as : max Z(x) = [ z 1(x), z2(x) ..... Zp(x)]T = [C1x,...CpX]T s.t x~X = { x / A x _ o } [P.1] where x is an n-dimensional vector of decision variables, zi(x) is the ith objective function, A is a given m x n matrix and X is the feasible set of the constrained decisions. Generally, for this vectorial optimization problem, one cannot obtain a solution which maximizes simultaneously the p objectives because of the presence of conflicts. However, a often used solution to this problem is a Pareto or efficient solution. In the next sections, it will be shown how : 1) the DM interact with the optimization procedure through the recursive updating of a utility function representing their preference structure; 2) a reference solution Z is determined by maximizing the corresponding updated utility function; 3) a Pareto solution Z corresponding to the reference solution Z may be found; 4) the complete algorithm is organized.

757 Interactive updating of the utility function A well-known additive utility function is chosen in order to represent the preference structure of the DM "

u(z, ..zp)=

1-e I]~(zr-11)

=

u,

where ~i is the minimum acceptable value of the ith objective, 5 i is the full satisficing value of the ith objective and ~i < 0 is the preference coefficient parameter. ui is the "attribute" of the ith objective and takes values in [0, 1] for ~'i < zi -< 8i. This utility function is relevant with the general decision making principle: The more, the better for each objective and the choice of the exponential says that the smaller the objective value, the greater the increasing tendency of U(zl ..... Zp). For the interactive determination of the parameters describing the preference structure U(zl .... Zp), the following information was used : at the current Pareto solution, the DM are asked to give their "margin of allowance", that is, the ~ and 8i for each objective. The J3i are then automatically updated such that for the middle value of the allowance interval, ui has a value equal to (~ where 0.5 < ~ < 0.9.

Maximization of the utilitv function As stated before, the reference solution is obtained by maximizing the function U(Zl...Zp). To overcome the computation inefficiency caused by a constraint set (Ax ~ b) of great order with a given non-linear preference structure U(zl...Zp), a reference solution is first determined in the approximated criterion subspace formed by the convex combination of the sequentially generated Pareto solutions [z i, i = 1 ,...k and the extreme points (zJ, j = 1 ...p) obtained from the individual optimization of each objective. Hence, the problem is : max U(z 1...zp) X,z

s.t. Z= ~, ~7'+ i=l...k

and

~

~.i =1

[Umaxl] ~_, ~

:~i

j=l...p

X.~>0

i=l...k+p

where k is the iteration number. We call the solution of [Umaxl] : [z~. If this solution is identical to one of the former Pareto solutions, the dual information is then used in order to extend the criterion subspace through the cutting planes defined below. Hence, the new problem is 9

758

max U(z, ..... zp) z

p

p

i=1

i=1

[Umax2]

where PJi, ZJi are respectively the dual information and the Pareto solution obtained at the jth iteration (j = 1 ..... k). W e call the solution of [Umax2] : ~2.

Generation of the Pareto solutions AS written before, the reference point method is used to obtain the Pareto optimal solution corresponding to the solution

Z maximizing U(Zl,..., Zp).

This method is based on the

minimization of a generalized Tchebychev distance from a given reference point Z. : min Jr max 1:i(zi-z,) ] + (g,...,g)(z- z ) } z ~.Li=l...p s.t. Z--- [ClX ..... CpX]T xeX

[P.2]

or its equivalent problem " min {(z + (8,...,E) (Z - Z)} z.cx s.t. cc>'~i(Zi-Z i) (i=l,...p), z i - c i x = 0

[P.3] 0 = 1 .... p), A x ~ b

where cc is any real number, the weighting coefficient "q > o (i = 1,...P) and E is a very small positive value. The supplementary term (E.... ~) (~. [z) guarantees the efficiency of the solution found by [P.3]. Two theorems based on the duality theory are easily derived " Theorem l "The optimization gf [p.3J gives a Pareto solution Theorem 2 "The ootimal solution of [P,3] ootimizes at the same time the followina dual oroblem 9 P max - ,~.pizi s.t. z i = cix and Ax < b i-1

and where the variables #i, i = 1..... p are the dual variables corresponding to the objectives zi = cix, i = 1 ..... p. The geometrical interpretation of [P.3] is i~ustrated at the next figure for a problem with 2 objectives -

759 Z2 Cutting plane

Criterion set

Zl p

p

The equation - ~_,lAz~ = - ~ . ~ z i i-1 ~1 generates a cutting plane of the efficient subspace which pass through the efficient point [z.

The comolete algorithm The organization of the algorithm is as follows : steel : For i = 1 ..... p, each objective is optimized independently and a pay-off table with ~i is obtained. steo 2 : Given this pay-off information, the DM are asked to specify their margin of allowance "yi and 8i for each objective, k is set equal to zero. steo 3 : The problem [Umaxl] is solved and Zl is obtained.

This point is projected on the

efficient region by [P.3] in order to have an initial Pareto solution [z. steo 4 : k = k+l. The DM decide whether the last Pareto solution ~,k is satisficing. If yes, the problem is terminated. If no, the step 5 is executed. step 5 : The DM modify their "margin of allowance" % 8i in order to have a newly updated U (zl ..... Zp).

steo 6 : The problem.[Umaxl] is solved. If ~1 is different of ~k, Z.1 is projected on the efficient region by [P.3] in order to have a new Pareto solution ~k and the algorithm goes back to step 4. If ~lis equal to ~Tk,step 7 is executed. steo 7 : The problem [Umax2] is solved. If ~2 is equal to ~1, the algorithm goes back to step 5. IfZ 2 is different of Z1, the step 8 is executed. steo 8 : ~2 is projected on the feasible region through [P.3]. The efficient solution ~k is used to extend the criterion subspace. The algorithm goes back to step 6. The next figure illustrates the various steps of the algorithm 9

760 Z2

~

Z~

The convergence of the algorithm towards a solution maximizing a given U(zl ..... Zp) is guaranteed by the fact that the combination of [Umax2] with [P.31 is essentially a cutting plane method applied to a non linear optimization problem where the following requirements are satisfied : 1) the real polyhedral objective region {Z(x) s.t.xe X} is bounded; 2) the function U(zl,..., Zp) is quasiconcave (Luenbergher, 1984).

ASIMPLIFIED REGIONAL DEVELOPMENT PROBLEM A simpliefied regional development problem within a chinese context with a planning horizon of 6 years is considered where the production system is composed of 2 sectors : the agricultural sector and the rural industry sector which receives the raw materials from the agricultural sector and does a value-adding transformation. In the agricultural sector, two types of land are considered : type 1, irrigated flatland and type 2, dry flatland. By building the appropriate hydraulic installations, lands of type 2 can be transformed into lands of type 1. Five production activities can be performed as follows : Activity/land :

type 1

type 2

Soy beans

X

X

Rice

X

Fruits

X

X

where "X" indicates that on the land of type i the production activity j is possible. Finally it is supposed that the rural industry sector of the studied region absorbs all the harvested soybeans in order to transform them into a value-added, commercialized product.

Descriotion of the model Let Xij(k) be a state variable which represents the area of land of type i devoted to the jth activity during year k. Dynamic equations relating the state variable Xij(k) to the land allocation decisions are the following ones :

761 Xij(k+l) = Xij(k) + U~ij(k) - U~ij(k)

(E.1)

where the decision variables U+ij(k), U-ij(k) are respectively the area of land of type i added, substracted, at the kth year for the activity j. The transformation of the land of type 2 into that of type 1 is described by the following equations : X l ( k + l ) = X](k) + TR(k) X2(k+l) = X2(k) - TR(k)

(E.2)

where Xl(k), X2(k) are respectively the total area of land of type 1 and 2 and TR(k) is the area of land of type 2 transformed into land of type 1 at year k. The static constraints associated with the land allocations are the following ones :

EJ Xij(k) < Xi(k)

(i = 1.2)

(CA)

For the industrial sector, the assets of the rural industry are imbedded into a state variable Xr(k) with which the production capacity is associated. The dynamic equation describing the evolution of those assets is as follows : X,(k+l) = o~X,(k) + U,(k)

(E.3)

where Ur(k) is the investement level in the industry at year k and CCris a redemption factor. Directly associated with this equation is the limit of fabrication capacity of the industry : ~11X11(k) + J321X21(k) -< PrXr(k)

(C.2)

where ~r is the fabrication yield (Ton/kS) and ~}11, ~21 are agriculture yields(Ton/Ha). Finally, the dynamic equation for the regional cash-flow is the following : Xa(k+l ) = asXB(k) + ,7_.~oijXij(k)- 7~rTR(k) - Ur(k) - Up(k)

(E.4)

where Xa(k) is the available cash-flow at year k, ZPijXij(k) is the net profit obtained from the production activities Xij(k), ~rTR(k) is the cost for the land improvement, Ur(k) is the investment in the industry, Up(k) is the budget for public welfare (the income budget for the workers is included in this term) and cq3 is a money erosion factor. Other static constraints are the following ones : non-negativity of the state and decision variables. - minimum quantity of produced rice : -

~12X12(k) > 4Q(k)

(C.3) (C.4)

762 where Q(k) is the quantity of rice necessary for the population at year k (prediction). - necessity of preinvestment in production activities : ~,'~jXij(k) + 7trTR(k) + Ur(k) < XB(k) (C.5) where "~1jXij(k)are costs of production. limit on available man-power for production : 7_,~ijXij(k) + ~rTR(k) ~ MP(k) (C.6) where =r,/qj " man-hours/Ha and MP(k) is the predicted available man-power on year k. limit on available water for irrigated crops : -

-

3

(C.7)

~__,~Xij(k) -<Wa(k) j=l

where % is the water demand coefficient for culture j (M3/Ha) and Wa(k) is the prediction of available water resource at year k. constraint that guarantees the continuity of the production at the end of the planning period of 6 years: XB(6) -> XB(5) (C.8) -

Definition of the ob!ective~ In this regional development problem, the planning horizon is divided into 2 development stages of 3 years each. At each development stage, 3 objectives are considered which represent the different interests of involved parties in this production system : objective 1 maximize the total profit of production; objective 2 - maximize the budget for public welfare and objective 3 - minimize the erosion caused by the production activities. At each development stage, these objectives are measured by their average over the years covered by the stage. Hence, the problem contains 6 objectives given by the following expressions : 3

6

objl" max k__~l{ ~,RjXij(k)t/3 ij ~

obj4 9max ~ , t ~__,PijXij(k)l/3 k=4~ ij J

2

5

obj2" max~__,{Up(k)}/3 k=0

obj5 : max ~_.~{Up(k)}/3

3

6

obj3 " min ~__~f~/___,eijXii(k)l/3 k=lL

ij

J

obj6 " min ~/_.,I ~_~eijXfi(k)t/3 I:~--4L ij

J

In these expressions, the eij, i = 1,2; j = 1, 2, 3, are erosion coefficients which may be given, for example, in M 3 of soil removed per hectarea of land of type i with an activity j.

763 Solution of the oroblern The above problem was resolved on a PC/AT computer with 640 KB memory. First, a pay-off table was computed :

Z1

Z2

Z3

Z4

Z5

Z6

max Zl

18700

8580

-5550

5910

9110

-1770

max Z2

12800

14800

-3830

5910

2840

-1770

max.Z3

5380

7430

-1610

5930

2880

-1780

max.Z4

18700

5400

-5540

26400

8970

-7470

max.Z5

17400

5400

-5220

16600

24000

-5010

max.Z6

16300

5400

-4910

5750

16100

-1730

The diagonal elements of the above table are the maximum values of each objective and were chosen as the initial full satisficing values 5i for each objective. Furthermore, the initial minimum acceptable value of each objective were chosen as follows : Y1 = 10000; )'2=7000; )'3 = - 5500; )'4= 11500; )'5 = 10000; )'6= -6000. Then, three iterations were made interactively in order to obtain a psychologically satisficing Pareto solution. The table at the next page gives, for each iteration, the margins of allowance that have been modified by the DM as well as the reference and Pareto solutions. The underlying course of actions giving rise to the last, satisficing solution is the following one: at the first stage, because of the budget limitations, the main production activity consists in fruit production which doesn't need great investment. Gradually, the investment is put in the transformation of land of type 2 into that of type 1. On the transformed land, the most profitable production activity, that is, soybeans is taken.

CONCLUSION In this paper, an interactive multiple criteria decision supporting algorithm and its application to a simplified regional development problem have been presented. The reduced dimension of the non-linear problems [Umaxl] and [Umax2] makes it possible to work in an interactive mode even for a large scale linear problem. This interactive mode was implemented through the use of an adjustable utility function which represents progressively an approximation of the preference structure of the decision makers.

764 obj.1

obj.2

obj.3

obj.4

obj.5

obj.6

51

18700

24000

-1730

10000 16400 16800

-1610 -5500

26400

Y1 z1 zl

14800 7000

11500

10000

-6000

8070 832O

-4910 -4810

16300 16800

13800 16200

-4770 -4620

i

52

t e r 2

72 z2 z2

id. id. 17200 17200

id. id. 7710 7720

id. id. -4960 -4960

id. 18000 18800 18900

20000 id 14700 16300

id. id. -5210 -5210

83 "t'3 z3 z3

id. id. 17500 17500

id. id.

id. id.

id. 20000

id. id.

id. id.

7210 7210

-5080 -508O

20500 20500

14700 16400

-5700 -5700

~t e r 1

t e Ir 3

REFERENCES Dyer,J.S. (1973) A time-sharing computer program for the solution of multiple criteria problems. Management Sci., 19, 1369-82. Keeney, R.L. and H. Raiffa (1976) Decision with multiple objective : preference and value trade-off. John Wiley, New York. Luenbergher, D.G., (1'984) Linear and non linear programming, Addison-Wesley, Reading, M.A. Rosenthal (1985) Concepts, theory and techniques : principles of multiobjective optimization. Decision Sciences 16, 133-152. Wierzbicki, A.Po (1982) A mathematical basis for satisficing decision. Mathematical Modeling, 3, 391-405.

CRONE CONTROL : PRINCIPLE, SYNTtlESIS, PERFORMANCES WITIt NONLINEARITIES AND ROBUSTNESS-INPUT IMMUNITY DILEMMA A. OUSTALOUP Equipe Systrmes et Commande d'Ordre Non Entier L.A.R.F.R.A. - ENSERB - Universitd de Bordeaux I 351, cours de la Librration - 33405 Talence Cedex Abstract The area of this paper concerns the robustness of stability degree, and more particularly the robustness of the damping of the control versus the parmneters of the plant. The approach of the CRONE Control is presented as resulting from the non integer order differential equation which represents the dynamic model directing a natural robust relaxation. A frequency illustration of robustness is given in the Nickol-Black plane through an open loop frequency template. Such a template is synthesized by means of a CRONE variable phase regulator from the phase diagram of an agricultural mobile robot using a highly non linear error detector. The corresponding robustness performances are presented. At last, a dilemma is established by considering two templates of different lenghts. The imput ot the plant is given for step responses to the reference input. It shows that the greater the robustness, file lower the input immunity. 1 - Introduction

Robustness is a very wide concept, even in a same domain such as the automatic control one. In fact, robustness is a notion which always translates the same idea, namely insensitivity. In automatic control, it is frequent to consider the robustness of stability (case of Hod approach). In the CRONE Control, a French abbrevation of "Commande Robuste d'Ordre Non Entier", namely "Non Integer Order Robust Control", the robustness considered is much stricter, that is to say the robustness of stability degree. More precisely, the robustness which is at stake translates the insensitivity of the damping factor or the stability degree of the control to the parameters of the plant. That amounts to saying that the robustness considered is the robustness of the control damping versus the plant parameters. A frequency illustration of damping robustness is deduced from the non integer order differential equation which represents the dynamic model governing a natural robust relaxation, that of water on a porous dyke. Particularly, in the Nichol-Black's plane, robustness is illustrated by an open loop frequency response locus which is reduced to a vertical straight line segment of abscissa between - x / 2 and - x around the axis OdB. This segment, defined for the nominal parametric state of the plant, is called open loop frequency template (or more simply template ). In this paper, the template is synthesized by means of a CRONE variable phase regulator from the phase diagram of a mobile robot using a highly non linear error detector. The performances obtained through a numerical simulation show that the control is robust, not only versus the open loop gain, but also versus the non-linearity of the error detector, whether the non-linearity is intrepreted as an increasing gain or a decreasing gain. In order to study the price of robustness, the last part of the paper deals with the robustness-input immunity dilemma. For two different lengths of the template, the noted performances reveal that the input of the plant is higher in the case when the template is longer and consequently when the robustness is higher. 2- From strategy

t h e r o b u s t n e s s o f s t a b i l i t y d e g r e e in n a t u r e : The non integer approach of the C R O N E Control.

2.1- O b s e r v a t i o n

to a n e w r o b u s t

control

of a natural robust relaxation

Our approach, the aim of which is the conception and the application of a new robust control strategy, makes use of the observation of a natural relaxation, that of water on a porous dyke. Already in the 17th century, the constructors of dykes had noted the damping properties of the very disturbed dykes and particularly those forming air pockets which can be compressed by the advance of water. Otherwise, an attentive observation of the relaxation of water on fluvial or coastal dykes, reveals that in the case of very damping (or absorbing) dykes through a porous volurnic structure and a rough surfacic structure : - the natural frequency of the relaxation is different whether the dyke isfluvial or coo.~tal ; - the damping of the relaxation seems to be independent of the dyke, whether it isfluvial or coastal.

768 Given that the fluvial and coastal tests can be distinguished by very different carried water masses, the observation seems to show that the relaxation is characterized by a natural frequency which depends on the motion water mass and by a damping which is independent of it. Although it should be paradoxical in the integer approach of mechanics where any relaxation presents a damping linked to the carried mass, this result reveals the insensitivity of the damping factor to a parameter, in this case the motion water mass. So, it translates the robustness of the relaxation phenomenon as for stability degree. 2.2- Non integer o r d e r differential equation as a dynamic model governing the relaxation After trying to determine the mathematical origin of the robustness of such a phenomenon, it appears that it resides in non integer derivation. Indeed, by taking into account the fractality of porosity and the corresponding recursivity, we show [5] that the relaxation satisfies a linear differential equation of non integer order n' between 1 and 2, namely : zn'(d~tt)n'p(t) + P(t) = 0 ;

(1)

P(t) designates the dynamic pressure at the water-dyke interface ; x is a transitional time constant which is a function of the water mass. 2.3- F r e q u e n c y t e m p l a t e The differential equation (1) leads to a symbolic equation of the form : ('r

+ P(S) = O,

(2)

from which one draws :

(3) This operational equation is translated by the fonctional diagram shown in figure 1. Because of a unit feedback, the direct chain determines an open loop transmittance of the form : ~(s) = ( 1 ) n' ,

(4)

which is the transmittance of a non integer integrator and which defines an open loop frequency response of the form :

in which mu = 1/'c designates the unit gain frequency. Given that arg ~3(jm) = -n' ~/2 with 1< n'< 2, the black locus of [3(jr~) is a vertical straight line of abscissa between - ~/2 and -~.

Figure 1 - Functional diagram defining an open loop transfer Physically, a water-dyke interface cannot be characterized by a non integer derivation in all frequency domain [5]. In fact such a derivation is limited to a range of medium frequencies. So, the vertical straight line so defined is reduced to a vertical straight line segment (fig. 2). This segment is called open loop frequency template (or more simply template). When the water mass M changes, the frequency cou is modified in conformity with the relation C0u = (K / M) 1/n', (6)

769

where K is a constant depending on the water-dyke interface. That amounts to saying that the template so defined slides on itself at the time of a variation of the water mass. Such a vertical displacement of the template insures the constance of the phase margin and, consequently, the invariance of the corresponding damping factor in time domain, so translating the robustness of the damping.The longer the template, the greater the robustness.

A

-7~

I I

I ~(j~)l dB

I I

0~u I

0 dB

-n'rd2 -r~ / 2

0

arg ~(jf.0)

I

-%-

t I

Figure 2 - Illustration of robustness in the Black plane: theform aM the vertical sliding of the template AB insure the robustness of tt,e damping through a constant phase margin ~)rn 2.4- The template in automatic control In automatic control, the aim is to obtain such a behavior, namely : - an open loop frequency response Black locus which forms the template so defined for the nominal parametric state of the plant ; - a vertical sliding of the template at the ume of a reparametration of the plant. Trying to find the synthesis of such a template defines the approach (said non integer) that the CRONE control uses.

3- Synthesis

of

the template

3.1- I n t r o d u c t i o n Although several synthesis methods of the template exist, here, we simply present the method which corresponds to the application considered in this paper, namely the wire guidance of a mobile robot. Let 13(jr and G(j~) be the open loop frequency response of the control and the frequency response of the plant. We assume that the phase diagram of G(jo) (fig. 3) is invariant, which is the case when only the variations of gain are considered (case of our application).

0i

argGfjr

argl3(j(o)

I .n t~/"2 -

I .

/D

.

~

Figure 3 - Synthesis of the template from the phase diagram of the plant.

770 To pass from the argument of G(jc0) to the argument of 13(jco) for coA < co < C~ the observation of figure 3 shows that the regulator CN(Jco) placed in cascade with the plant must provide both a phase delay and a phase advance which should be functions of frequency. 3.2- Idea of the synthesis : smoothing of crenels with variable cyclical ratio One considers that the phase diagram of the regulator results from a smoothing of crenels constituting a phase asymptotic diagram. So, one must introduce the notion of phase smoothing curve which replaces thephase smoothing straight line one usually used in the non integer approach. In order to obtain a smoothing whose value should be a function of frequency, the idea is to vary locally the cyclical ratio of the crenels. This is translated by the consideration of crenels whose cyclical ratio is a function of their ranks (fig. 4). The distribution of the zeros and the poles is defined by the relations:

ai = coi/co'i and with

rli = co'i+I/coi,

cq rli = 0~'i+l/co'i = cte > I

(7)

(8)

k/i,

where a i andrli are called recurrent factors. phase smoothing curve

OJl

r

0,~

0)2

0)2

0)'i

0)i

0)5+1

_L__2_L2__0)N 0)'N-t

CON-I CON fOB

Figure 4 - Obtaining of a phase smoothing curve through a modulation of the cyclical ratio of the crenels of a phase asymptotic diagram. 3.3

-

Synthesis

process

The synthesis process successively consists : 1 - in fixing a recursive distribution.of the zeros co'i, namely co'i+l/co'i = oti Tli = cte V i ;

(9)

2 - in considering as data, the values of the phase that the regulator must provide at the frequencies co'i, namely r = arg CN(.jco'i)= (Pd - arg G(jr

(10)

where (Pd = -n' n/2 is the open loop desired phase that the phase locking defines around cou ; 3 - in determining the distribution of the poles, that is to say that of the recurrent factors a i given that ai = coi/co'i. 3.4 - Exact algebraic method The algorithm which results from this exact method is that used by the CRONE software. A regulator frequency response in conformity with the phase asymptotic diagram shown in figure 4, is defined by an expression of the form : (1 + jco / co'l).-.(1 + jco / co'i)...(1 + jco / co'N)

CNOco) = Co

,

(I I)

(1 +ji0/col)...(1 + jco/coi)...(1 +JCO/CON) the corresponding argument satisfying the relation : N

N

arg CN(jco) = ]L tan -1 co/ co'i - ~] tan'l co/ coi. i=l i=l

(12)

771 For the frequency ~ = t0'i, which suggests to replace the index i by the new index j, the phase of the regulator becomes : N N arg CN(Jm'i)= ]~ tan "1 o ' i / 0 ; j - ~ tan "I to'i/o~j, j=l j=l

(13)

from where one draws, taking into account (10) : N

t a n ' l to'i/c0j = A(o'i),

(14)

j=l in putting : N

(15)

A(to'i) = ~ tan -1 ~o'i/o'j - q~d + arg G(j0~'i). j=l

So, one obtains the system of equations : tan -1 o' / 0h + tan "l o' 1 / co2 + ... + tan "I to'l / ~ tan -I 0;2/ro I + tan-1

o'2/a,, 2 + ... +

= A(to'l)

tan-I t 0 ' 2 / o N

= A(to'2)

tan- 1 ~'N / Ol + tan" 1 o, N / c02 + "-- + tan" 1 to,N / m N = A(to'N),

(16)

which constitutes a non linear system of N equations, the N unknowns of which are the toi. One puts ct0ai = C and xi=l/c q. By taking the tangente of each m e m b e r of each equation, the line of rank i admits the reduced writing : N

tan (]~ tan -I aij) = tan A(m'i) = A i with aij = Ci-Jxj,

(17)

j=l

of which one tries to determine the expression of the first member, namely: N PN q~ tan ( • tan -1 aij) = Z (-1)P s2p+l(i) / Z (-1)q S2q(i), j=l p=0 q=o

(i8)

expression in which the different magnitudes are defined as follows : PN = IN - 1 / 2], integer part of N - 1 / 2 ; qN = [N / 2], integer part of N / 2 ; S0(i) = 1 ; Sl(i) = ~ aijl;

Jt I~jl-AIB1): template (2); along AIB 1 or A2B2 , [ flO'co)[ dB= (cou/co)n' dB; along C1DI Icu6oo)l aB +lcoo )l dB. or C2D2, I oco)l

5.2. Performances obtained through a numerical simulation A simulation has been carried out with the regulators defined by relations (28) and (29), without considering the non-linearities of the error detector. Figure 9 gives the open loop Bode diagrams corresponding to the two regulators. It shows that the template obtained with the regulator CN2(S) is longer than that obtained with the regulator CNI(S). Figure 10 gives the variations of the input of the plant corresponding to the step responses of the control successively obtained with the regulators C N 1(s) and CN2(S). It appears that the input is greater for the regulator CN2(S). I1

lit

1era

et

Ii

I1

d II

,Q--~

-1~ d B

-ZT~

-2~

x \,b

i

111-'t

-1111

i

Illlll

I

1~ - Z

I

tlllll

I

llI -I

I

t[llll

I

I0 R

I

--368

Iltlltl

1D I

1B Z

11t]

Frcquencg [112:]

Figure 9 - Bode diagrams of the plant and of the open loop control ." gain; ..... phase ;(a) plant ; (b) plant corrected by the regulator CNI(S) ; (c) plant corrected by the regulator CN2(S)

777

0 r.s

0 1.Sf

t ,tSB

1.21

m ~8~

.91

t

.6~ .31

1~

81 -li]~

,

1

-..J~

I

.1 Ti~e(~)

II

.81

.82

Tiae(~)

Figure 10 - Variations of the input of the plant corresponding to the unit step responses of the control obtained with the regulator CN1(s)(a) and the regulator CN2(s)(b ) : the initial value is much greater with CN2(S)

6. C o n c l u s i o n The approach of the CRONE control arises from the non integer order differential equation which represents the dynamic model directing a natural robust relaxation, that of water on a porous dyke. A frequency illustration of robustness is deduced from such an equation. Particularly, in the Nichol-Black's plane, robustness is illustrated by an opeu loop frequency response locus which is reduced to a vertical straight line segment of abscissa between - re/2 and - ~ around the axis OdB. The template that this segment defines, slides on itself when the parametric state of the plant changes. In this paper, the template is synthesized by means of a CRONE variable phase regulator from the phase diagram of a mobile robot using a highly non linear error detector. The performances obtained through a numerical simulation show that the control is robust, not only versus the open loop gain, but also versus the non-linearities. A new dilemma, the robustness-input immunity dilemma is established by taking into account two templates of different lengths. The input of the plant is given for step responses to the reference input. It indeed reveals that the greater the robustness, the lower the input immunity, since the initial value of the input varies by an important factor when the template lengtht increases towards the high frequencies. References [I] - I. Horowitz and M. Sidi - Synthesis of feedback systems with large plant ignorance for prescribed time domain tolerances. Int. I. Control, vol 16, n~ 287-309, 1972 [2] - A. Oustaloup - Linear feedback control systems of fractional order between 1 and 2.IEEE Int. Symposium on Circuits and Systems, Chicago (USA), April 27-29, 1981 [3] - A. Oustaloup - Syst~mes asservis linraires d'ordre fractionnaire : Th6orie et Pratique - Ed. Masson, Paris, 1983 [ 4 ] - A. O u s t a l o u p and B. B e r g e o n - Frequency space synthesis of a robust dynamic command.IFAC'87, 10th World Congress on Automatic Control, Munich (FRG), July 27-31, 1987 [5] - A. O u s t a l o u p - From fractality to non integer derivation: a fundamental idea for a new process control strategy.8th Int. Conf. "Analysis and optimization of systems".INRIA, Antibes (FRANCE), June 810, 1988 [6] - A. Oustaloup - From fractality to non integer derivation through recursivity, a property common to these two concepts.Survey, session "Fractality and non integer derivation", 12th IMACS World Congress on Scientific Computation, Paris, July 18-22, 1988 [7] - A. Oustaloup - From the robustness of stability degree in nature to the control of highly non linear manipulators.Proceedings edited by Springer Verlag, Colloque Int. CNRS "Automatique Non Linraire", Nantes (France), 13-17 Juin, 1988 [8] - P. Baylou - L'automatisation de la cueillette du safran.Production des Plantes Aromatiques et Mrdicinales. Nions (France),Octobre 1988, 1988 [9] - A. Oustaloup, A. El Yagoubi, P. Baylou, J.F. Samson and P. Melchior - An algorithm of the CRONE software: Application to the guidance of a mobile robot. Eighth IFAC Workshop "Control Applications of Non linear Programming and Optimization", Paris, France, June 7-9, 1989. Proceedings Pergamon Press

V e r s u n e s t a b i l i s a t i o n non l i n 6 a i r e d i s c o n t i n u e Michel FLIESS I, Franqois M E S S A G E R

1z 3

Laboratoire des Signaux & Syst~mes. CNRS-ESE, Plateau de Moulon 91192 Gif-sur-Yvette Cedex (France). TEl : (1) 69.41.80.40 - Fax : (1) 69.41.30.60 2

Applications MathEmatiques et Logiciel. 6, rue AmEdde BollEe 92500 Rueil-Malmaison (France). TEl : (1) 47.49.14.00 - Fax : (1) 47.51.10.89

ABSTRACT : Since Sussmann [18], it is now well-known that, in opposition to the timeinvariant linear case, controllability of a nonlinear system does not necessarily imply the possibility of stabilizing it by a smooth feedback. This paper approaches the stabilization from another point of view, the discontinuous control, and therefore abandons the constraint of smooth feedbacks. It is interesting to see that this technique has already existed for a long time for relay systems (Cypkin [6]), sliding modes (Utidn [19]) and, of course, optimal control. Our method is based on a generalized controller canonicaI form [9, I0] which has recently been derived by using the analogue of the theorem of the primitive element in differential algebra (cf. Kolchin [14]). This canonical form is used to obtain an equation which defines a linearizing state feedback. By choosing the right parameters we are able to express a discontinuous stabilizing state feedback. The introduction of switching curves permits to take the structure of the system into consideration. We illustrate this technique by applying it to an example taken from Aeyels [I], which was shown not to be smoothly stabilizable.

1. I N T R O D U C T I O N I1 est aujourd'hui connu (cf. Sussmann [18]) que, contrairement au cas linEaire stationnaire, la commandabilit6 d'un syst~me non linEaire n'implique pas nEcessairement la possibilit6 de le stabiliser par un bouclage d'Etat rEgulier ou lisse, c'est9h-dire indEfiniment diffErentiabte. De nombreux travaux, souvent illustrEs par des exemples fort instructifs, ont depuis pennis de mieux cerner ce probl~me ( Brockett [3], Aeyels [1], Kawsky [13], Sontag [17], Dayawansa et Martin [7] .... ). Parall~lement, Byrnes et Isidori [4,5] ont entrepris une recherche de grande ampleur pour, gr~ce ~t la dynamique des zEros,.obtenir des conditions de stabilisation par bouclage rEgulier. Cette communication aborde la stabilisation par un autre point de vue, h savoir la commande discontinue et, donc, abandonne l'exigence de boucle rdguli~re. Quoique Sussmann [18] ait ddjh dcrit, il y a plus de dix ans, que "the introduction of disconfinous vector fields is unavoidable", en dehors, bien entendu, de la commande optimale bangbang, une tetle approche n'a re~u chez les thEoriciens qu'une attention fort limitde. Elle est cependant fort ancienne comme en t6moigne le livre de Cypkin [6], et se perpdtue, aujourd'hui, surtout par Ies modes glissants (cf. Utkin [19]), bien que certains travaux de 3

Travail de th5sc effectualdans lc cadre d'unc convcntion CIFRE entre lc CNRS ct AML.

779

nature plus th6orique paraissent de loin en loin (cf. Arstein [2], Hermes [12], Sontag [ 16], et aussi [ 11 ]). Rappelons encore que, sous certaines conditions de commandabilit6, Sussmann [18] a d6montr6 que la stabilisation est possible avec une commande analytique par morceaux. Nous proposons une approche de la stabilisation par bouclage discontinu en utilisant une forme canonique de commande [9,10], g6n6ralisant celle bien connue pour les syst6mes linEaires stationnaires monovariables, obtenue par des m6thodes d'alg~bre diffErentielle que l'un des auteurs a dEveloppEes [8]. Nous illustrons notre approche par un exemple qui, d'apr~s Aeyels [1], ne peut 6tre stabilis6 par bouclage r6gulier. II. R A P P E L S D ' A L G E B R E D I F F E R E N T I E L L E

(voir Ritt [15] et Kolchin [14])

On suppose le lecteur familier avec les propri6tds de base des corps commutatifs. Pour simplifier, les corps consid6r6s sont de caract6ristique nulle. Undcorps diff6rentiel est un corps commutatif K muni d'une d6rivation unique notee ~ - = , qm sausfait les propnetes usue les : d V a , b e K, ~ - ( a + b ) = a + b ?

IIIM

.

.

.

.

I

p

1

d (ab) = ~ + ab dt Une constante clans K est un 616ment c e K tel que c = 0. L'enscmb~e dc~; constantes de K est un sous-corps de K appeltd te corps de constantes. Une extension diffdrentielle L / K est donn6e par deux corps diff&entiels K, L avec K C L. Un 616ment de L e s t dit diffdrentiellement algdbrique sur K si, et seulement si, il satisfait une 6quation diff6rentielle algdbrique ~ coefficients dans K. L'extension L / K est dite diff&eutiellement alg6brique si, et seulement si, tousles 616ments de L sout diff~rentiellement alg6briques sur K. Dans le cas o~ L / K est liniment engendr~e, on a l e r~sultat fondamental suivant: T H E O R E M E : Pour une extension L / K finiment engendrde, les deux conditions suivantes sont dquival~ntes : (i) L/K est diffdrentiellement algdbrique; (ii) le degrd de transcendance de L/K est fini. Rappelons que ce degr6 de transcendance peut s'interpr6ter comme le nombre minimal de conditions initiales n6cessaires pour intdgrer le syst~me diff6rentiel repr6sent6 par L/K. I I I . I N T R O D U C T I O N DE L A F O R M E

CANONIQUE GE.NERALISEE

Soit k un corps diff6rentiel de base. On note k k le corps diff6rentiel engendr6 par k et les composantes de l'entrEe u = (u 1. . . . . urn). Une dynamique est une extension diff6rentiellement alg6brique K/k finiment engendrEe. Soit x = (x 1. . . . . x n) une base de transcendance de K / k , qui, comme on l'a vu, est de degr6 de transcendance t2ni. Les d6rivdes x sont k-alg6briquement d6pendantes de x. I1 en ddcoule (D)

Ai ~x ," i . -~,,u . . . . . . . . . . .,,(m)~=O J

( i = l ......n),

780

oh les A i sont des polyn6mes ~tcoefficients dans k. En invoquant des spEcialisations dans les reels ou les complexes, on voit que l'dcriture explicite (D')

x i = ai(x,u,~l ..... u (m)) = 0

(i = 1..... n),

a seulement une valeur "locale". (D) et (D') redonnent une image plus famiti~re de Ia notion de dynamique. Une autre base de transcendance x = (xx ..... xn) de K / k conduit, 6videmment, ~ des expressions semblables. On sait que les composantes de (resp. x) sont k-algdbdquement dEpendantes de celles de x (resp. x). Ce sont les formules de passage d'un 6tat ~ l'autre qui font intervenir l'entrde et ses dErivEes. Pour des extensions algEbriques non diffErentielles usueUes, qui sont finiment engendrEes, la notion d'gldment primitif est bien connue. I1 en existe un analogue diffErentiel (cf. Kolchin [14]) qui affirme l'existence d'un ElEment ~ 9 K, dit dldment primitifdiffdrentiel, tel que K = k. C'est dire que K est diffErentiellement engendrd par k et ~. Le degrE de transcendance n de K/k est le olus petit entier n tel que ~(n) soit k-algEbriquement dependant de ~,~ .... ~(n-1). Posons ql = ~ ..... :'(n-l) ' . . q~ = ~ . Alors, q = (ql ..... qn) est une base de transcendance de K / k , qm fouralt une gdadralisation implicite de la forint canonique de conmamlde :

I]

ql = q2

qn-1

=

qn

k.C(Ctn,q,u,u ..... u(n)) = 0 06 C est un polynEme ~ coefficients dans k. On en ddduit donc une forme explicite "locale":

!l = q2

I

qn-1

=

qn

ctn = c(q,u,u ..... u(n))

IV.

METHODE

DE

STABILISATION

PAR

RETOUR

D'ETAT

DISCONTINU Considdrons la dynamique, de dimension n : (Z)

{~~ = f l ( ~ ' u )

fn(~,u) 06 ~ = (~I ..... ~n) et u = (u 1..... urn). On suppose, pour simplifier, que fl ..... fn sont des fonctions polynomiales des arguments h coefficients reels. I1 est facile de ramener l'dtude de (E) ",i celle des corps diffdrcntie!s ~ S c e ~ !'idda! diffdrentiel premier [14], qui lui

781

correspond. L'analyse du paragraphe prdcddent montre l'existence d'un dl6ment primitif x I qui conduit &la forme canonique suivante : Ix~ = x2

(Y'g)

/Xn_ 1 = Xn k.G(xn,X,U,U ..... u(m)) = 0

o ~ a x = ( x 1..... xn). Parun argument du type "thdor~me des fonctions implicites", il vient :

I

xI = x2

v

(Eg)

IX--1 =

Xn

1

k.x.n = g ( x , u , u ..... u(m))

C'est la forme canonique explicite, qui, en gdndral, n'est valable que "[ocalement". Dbs lors, il est possible d'expfimer le bouclagc lindarisant, cn 6cfivmlt : (z)

n

p

g(x,u,u ..... u(m)) = i~t aixi + i~l bivi

off les a i e t b i sont des constantes rdelles et v = (v t ..... vp) est la nouvelle entrde. I1 s'agit alors de rdsoudre rdquation (E) en prenant l'entr6e u c o m m e inconnue. On obtient la stabilisation en injectant cette commande dans le syst6me (Zg') durant un petit intervalle de temps At et on rdsoud & nouveau (E) & l'instant t + At. V. E X E M P L E

DE S T A B I L I S A T I O N

DE SYSTEME

Le systSme ?~ entr6e monodimensionnelle v : x=x+ (2)

3t

y3

f(x,y) + v

off f(0,0) = 0, n'est pas, d'apr~s Aeyels [1], stabilisable par retour d'dtat r6gulier. V.1. E t u d e p r d l i m i n a i r e (Y.) peut, par un bouclage r6gulier de la forme u = fix,y} + v, 6tre mis sous la forme :

(]~')

{i = xy+3 u

11 faut distinguer quatre cas selon le signe de x et de x, donc selon la position du couple (x,y) par rapport ~ la courbe x + y3 = O.

782

Y

B' A

x A A'

0 et x + y3 > 0} {(x,y) ~ I R 2 t e l q u e x > 0 e t x + y 3 < 0} {(x,y) ~ ] R 2 t e l q u e x < 0 e t x + y 3 < 0} {(x,y) e ] R 2 t e l q u e x < 0 e t x + y 3 > 0}

Remarque : Les fl~ches repr6sentent le sens de d6placemcnt du couple (x,y) dans le plan avec une entrde u nulle. Si (x,y) e B U B', x et x ont des signes opposds. Aeyels [1] d6montre l'existence d'une commande u guidant l'6tat (x,y) de (T,') vers l'origine. Si (x,y) e A U A', x et x ont marne signe; le syst~me ne peut 6tre stabilis6 en restant darts A U A'. Par un argument analogue ~ celui du paragraphe pr6c6dent, il existe une commande qui permettra au couple (x,y) de passer de la r6gion A (resp. A') ~t la rdgion B (resp. B'). V.2. Calcul et choix des c o m m a n d e s

a) R6gions A et B a.1) (x,y) E A On peut v6rifier tr~s simplement que (x - u) est un 61dment primitif diff6rentiel, ce qui permet de mettre ( Z ' ) sous la forme ddcrite prdc6demment, en effectuant le changement de variable suivant : (F~)

XI=

X - U

x2

x1

x + y3 _ u

d' off

(Zgl)

r~xl = x2

Lx2

-u + (x2+u) §

(xx+u))2/3

Exprimons ensuite le bouclage lin6~arisant en dcrivant : (El)

-'6 + (x2+h) + 3u(x2+h - (xl+u))2/3 = axx I + a2x 2 + bw

off a l, a 2 et b sont des constantes rdeUes et w une nouvelle entr6e, qui sera supposde nul!e. Pour des raisons ~videntes de stabilisation, a 1 e t a 2 seront pris r6els et n6gatifs.

783

On consid~re (El) comme une Equation en u, u et "u que l'on rEsoud aux instants d'dchantillonnage : U=-X tl=-

1 X2

"11= - a l x 1 - a 2 x 2

Connaissant les conditions initiales de u ~ l'instant d'Echantillonnage t=nAt, nous calculons les param~tres A, B e t C de la fonction u(t)=At2+Bt+C, pour nAt0

(1-3)

and

BI(x)(Y) = ~ DX ( Y ) = ~

(1-4)

1 bk D X k ( y ) ~.~ (bk, Bernoulli numbers )

E 1 and B 1 are Lie series with only one occurrence of the variable Y. In (1-3) the fraction means the simplification of the series numerator by the denominator, this represents a compact expression which will be extensively used lateron. In this paper, the formal equality (1-3) is first extended to higher order derivatives with respect to t and then to the exponential term:

789

X

+Z

tk ~T".rYk

(1-5)

k>l

instead o f X + tY, where the Yk, k > 1 denotes a family of formal variables. For, we define repeatedly a family of Lie series denoted: Ei(x,Yjl ..... YJi-1)(YJi), Ji > 1 and we show that each derivative (1-5) with respect to t, of order p, satisfies a homogeneous, of degree p, polynomial in terms of these elements E i. These combinatorial equalities which are variant and extensions of the well-known BakerCampbell-Hausdorff formula are directly applied in this paper to the sampling problem and to explain relations between nonlinear differential equations and their discretized analogues. The present work goes further than the results proposed in [7] and leads to compact solutions which are much more appropriate for digital computing making use of formal languages. Moreover, we can show that the same algebraic manipulations can be used to analyze functional expansions associated to nonlinear discrete time equations, this gives strong analogies between nonlinear continuous, discrete and discretized systems which are extensively studied in [10, 11] The use of formal algebraic methods in nonlinear control theory were initialized in the continuous time case [4] and further developed in both mathematics and control domains [2, 5]. The paper is organized as follows: some definitions and notations are recalled in section 2, section 3 presents a combinatorial study of parametrized exponential series. The results are applied in section 4 to the sampling problem of a nonlinear differential equation.

2 - P R E L I M I N A R I E S AND F O R M A L N O T A T I O N S Consider a family (or an alphabet) denoted {Xl .... Xk} of elements of an associative algebra. Some definitions and notations from the theory of formal languages are recalled. We denote: Xil...Xip; ij E {1 . . . . . k}, p > 1 any word of length p in terms of the indeterminates {X 1,...,Xk }; 1 is the word of length zero or neutral element and: Xik = Xi... Xi (k times) denotes the word composed by the concatenation k times of the same letter Xi. According to these notations the following exponential type and logarithmic type series are easily defined. We note by: e Xi = I + Xi + (1/2!)XiXi + ... + (1/k!)Xi k + ...

(2-1)

the exponential series and in the same way: Log (I + Xi) = Xi - (1/2[)XiXi+ ... +((-1)k/k!)Xi k + ...

(2-2)

the logarithmic series. Stating by convention that any element Xi is said of degree i, any homogeneous of degree p polynomial with real coefficients is written as:

P=~ m>l

Z

ail"'aimXil'"Xim

(2-3)

il,...,im>l m

with: all.. aim • R and ~

ij = p.

j=1

We note DXi = adXi the formal derivative operator which acts on any word Xi 1..... Xij according to the rule: DXi(Xil ..... Xij) = XiXil ..... Xij - Xil ..... XijXi (2-4)

790 The shuffle product [12] is defined in a recursive way as follows: lmXi = Xi ml = Xi XiO)Xj = Xj(.oXi = XiXj + XjXi

(2-5)

Xjl .... Xjpf-OXil .... Xij = Xjl(Xj2 .... Xjpr-OXit .... Xij) + Xit(Xjl .... Xjp(OXi2 .... Xij)

3 - SOME COMBINATORIAL FORMULAE Hereafter, we will successively denote as {X,Y} and {X,Y i, i > 1} formal variables and t any element of R. The aim of this section is to generalize the formal equalities (1-1) et (I-3) to higher order derivatives with respect to t and then to the exponential term referred as the multivariable case:

tk X + kaY-'~1 ' " Yk The monovariable case: Considering equality (1-3), or equivalently at t r 0 the equality:

EI(x+tY) =

1 - e-D(X+tY) (- 1)k n r X + D(X + tY) =k>~_o(k+l)! ~" ty)k

(3-1)

we define: (3-2)

E2(X,Y) = (d/d0/t = 0 EI(x+tY)' and iteratively for i > 3: Ei(x,Y,...,Y) = (d/d0/t = 0 Ei-I(X + tY, Y,...,Y)

(3-3)

In this way, we define a family of Lie series denoted by Ei(x,Y,...,Y)(Y). Since each series Ei(X,Y,...,Y)(Y) contains i occuring of the variable Y, it will be said of "degree i". For the First elements, we obtain the compact expressions generalizing (1-3):

el(x) =

l-e -DX

D------g---= ~

(- 1)k ~ v k

~"'~

k~O

E (X,Y) =

,~

E3(X,Y,Y) =

r~-z2

20

(~ ~) = 2 , ( ~

DXk(mDY)

(3 -4)

I-DX+(DX2/2!) -e'DX . . . . . 2 x-', (-1) k+2 " ~ tc0t)x) = L ~ DXk(mDY)2 k_>0

and finally for i > 1: Ei(x,Y,...,Y) =

1-DX+(DX2/2!)+...+((-1)i'I/(i'I)!)DXi'I-e'DX.(mDY)i-I DX i

(3-5)

With these notations, it becomes possible to extend (I-1) and to express any derivative as a homogeneous polynomial in terms of these Lie series. We state:

791 Theorem 1: The following formed equality of series is verified:

e - x e X + tY = 1 + ~

( t k / k ! ) a k ( X , Y ..... Y)(Y)

(3-6)

k_>l

where:

ak(X,Y,...,Y)(Y) = (d/dtk)/t = 0 e -X e X + tY

can be decomposed as a homogeneous polynomial of degree k in the formal indeterminates of "degree i": E i ( x , Y ..... Y)(Y). Several techniques to prove this theorem are possible. W e will choose the more intuitive one which consists in the successive derivation with respect to t, expressed at t = 0, of the equality (1-1). In fact, according to (1-3) and (3-I), we can rewrite equality (1-1) at t ;~ 0 as: (d/dt)/t * 0 (e X + tY) = e X + t Y E l ( X + t y ) ( y ) = e X + t Y a l ( X + t Y ) ( y )

(3-7)

Taking into account the non-commutativity of the formal variables X and Y and applying the usual rules of derivation, with respect to t, of products of formal objects, we alternatively compute the polynomials ai (X,Y,...,Y)(Y) as follows: By definition and because of (3-7): (d/dt2)/t = 0 eX + tY = e X a 2 ( X , y ) ( y ) = (d/d01 t = 0 ( e x + tYEI(X+:Y)(Y)) = (d/dt)/t = 0 (e X + tY)EI(X)(y) + e X ( d / d 0 / t = 0 (EI(X+tY)(Y)) = e X E I ( X ) ( y ) E I ( X ) ( Y ) + eXE2(X,Y)(Y) in other words: a2(X,Y)(Y) = {E 1(X)(Y) }2 + E 2 ( X , y ) ( y )

(3-8)

from the definition: E2(X'Y) (Y) = (d/dt)/t = 0 E l (X+tY)(Y) Proceeding this way we can d e c o m p o s e successively the elements ak(X,Y ..... Y)(Y) as a homogeneous polynomial. For k = 3, we obtain: (d/dt3)/t = 0 eX + tY= e X a 3 ( X , y , y ) ( y ) = ( d / d 0 / t = 0 (eX + tYa2(X+tY'Y)(Y)) = (d/dt)/t = 0 (eX + t Y ) ( E I ( X ) ( y ) E t ( X ) ( Y ) + E2(X'Y)(Y)) + eX(d/dt)/t = 0 (EI(X+tY)CY))(EI(X)(Y)) + eXEX(X)(Y)(~Vdt)/t = 0 (EI(X+tY)(Y)) +

e X(a/dt)/t = 0 (E2(X+tY'Y)(Y)) which is: a3(X,Y,Y)(Y) = {EI(X)(Y)} 3 + 2 E I ( X ) ( y ) E 2 ( X , Y ) ( Y ) + E2(X)(y)EI(X,Y)(Y) + E3(X,Y,Y)(Y) (3-9) under the definition: E3(X,Y,Y)(Y) = (d/d0 / t = 0 E2(X+tY'Y)(Y)

792

For the higher order derivatives, we obtain the recurrent formula: (d/dtk)/t = 0 eX + tY = eXak(X,y,...,y)(y) = (d/dt)/t = 0 (eX+tYak'l(X+tY'Y"'"Y)(Y)) =

eXEI(X)(Y)ak_I(X,Y,...,Y)(Y) + eX(d/dt)/t= 0 ak.l(X ?- tY,Y,...,Y)(Y)

(3-10)

Denoting with L the Lie algebra generated with X and Y, L0 the Lie ideal of L generated by Y and successively with Lop , for p > 1, the decreasing series of ideals defined as: Lo 1= Lo, Lop= [Lo,L0P-1], p > 2. We verify: Corollary_ 1: Each element Ei(x,Y,...,Y)(Y), i _> 1 is a Lie element of degree i, belonging to L0 i, and including i occuring in the variable Y. Theorem 2: Using the compact notation: Ei(x,Y,...,Y)(Y) = Ei(DX), we verify the following equality of series: e-XeX+tY=l+ ~ (tk/k!) ~ k>l

~

c01 ..... im)Eil(DX)-..Eim(DX)

(3-11)

k>l il,...,im21

= 1 + T(Eil ..... Eim) = S(Eil ..... Eim), with ~. ij = k. Moreover, the coefficients c(il,...,im) satisfy the recurrent formulae of a shuffle product, j=l

which is f o r k > 1: C(il)C(i2) = c(il,i2) + c(i2,il) = C(ilf.Oi2) C(il)C(i2,i3) = C(il,i2,i3) + c(i2,il,i3) + c(i2,i3,il) = C(ilO~i2i3)

The following corollaries are easily deduced from Friedrichs theorem [12] or related ones. Corollary 2: The series Log( 1 + T(Eil ..... Eim)) is a Lie element in the variables Eil(DX) ..... Eim(DX) if, and only if, the coefficients c(i 1..... im) satisfy the recurrent formulae of a shuffle product. This Lie element corresponds to the Baker-CampbeU-Hausdorff exponent of the exponential product: e-XeX + tY. Log (1 + T(Eil ..... Elm) )= BCH(-X, X + tY)

(3-12)

1 + T = e BCH(-X, X + tY)

(3-13)

that is:

Corollary 3: Denoting with E i et F i two families of indeterminates, we verify: T(Eil+Fil,..,Eim+Fim) = T(Eil ..... Eim)+T(Fil,..,Fim)+T(Eil,..,Eim)coT(Fil,..,Fim) (3-14) and in a similar way: S(Eil+Fil ..... Eim+Fim) = S(Eil ..... Eim)(oS(Fil ..... Fim).

(3-15)

793

The multivariabl~ case: Let us now generalize these results to the exponentia! expansion associated with the multivariable series which is nonlinear in t : X + ~ (tk/k!)Yk. k>l

We will exactly follow the same procedures as in the previous case, in terms of a new family of formal Lie series, defined with degree i and denoted in a compact form by EI(DX) for i > 1. Because of the non linearity with respect to t, each EJ(DX), forj > 1, is now expressed as a J linear combination of Lie series of degree i = ~ ip, denoted by EJ (X,Yi 1..... Yij. 1)(Y ij), which are p=l

straightforward extensions of the elements defined in (3-5). In fact, for ij > 1, we easily define the elements: EJ(x,Yil ..... Yij.1)(Yij) (of degree ~ij), by the compact expression: 1 - DX + (DX2/2!) +...+ ((-1)j'I/(j-1)I)DXJ -1 - e -DX DX i coDYi1...mDYij_1(Yij)

(3-16)

In the first terms, we compute : El(X)(Yi) =

1 - e -DX DX (Yi) said of degree i

E2(X,Yi)(Yj) =

1 - DX-e -DX DX 2 (mDYi)(Yj) said of degree i + j

(3-17)

Besides these elements, we define by successive derivatives with respect to t the series Ek(DX) of degree k. According to (3-1), we can write at t = 0:

E1(DX) = EI(X)(Y1) =

1 - e -DX

DX

d X + ~ (ti/iI)Yi) (Y1) =di"/t = 0 (e-x e i_>1

and equivalently at t # 0, 1 - e-D(X+~

E t ( x + ~ (ti/i!)Yi)( ~ (ti/i!)Yi+l) = i>1

(ti/i!)Yi)

i_>l D ( X + E (ti/i!)Yi) i_>l

i~0

( ~ (ti/i!)Yi+l) (3-18) leO

For higher order terms, proceeding as previously, we compute the successive derivatives with respect to t and we define for example for k = 2: E2(DX)=(d/dt)/t = 0 E I ( X + ~ k_>l

(ti/iI)Yi)(~

(ti/i!)Yi+1)

i>0

= E2(X,Y1)(Y1) + EI(X)(Y2)

(3-I9)

ProPosition: Any element Ek(DX), k _> 1, defined of degree k, is a linear combination of elements of degree k of the form EJ(x,Yil ..... Yij_l)(Yij). We obtain: k

Ek(DX) = ~ j=l

~

c~(il...ij) EJ(x,Yil ..... Yij_l)(Yij),

il,...,ij> l

(3-20)

794

with ct(il...ij ) e R, Y~ij = k, EI(DX) = EI(X)(YI) = al(X)(Y1) and according to the recurrent law : k

Ek+I(DX) = EJ(x,Yil+I ..... Yij-1)(Yij) +"'+ Z

Z

j=l

a(il...ij)

i 1 ..... ij_>l

k

EJ(x,Yil ..... Yij_l)(Yij+l) + ~ j=l

~

a(il...ij) Ej+I( X, YI, Yil ..... Yij-1)(Yij)

(3-21)

il,...,ij_> 1

For the t-u-stterms, we compute: E2(DX) = E2(X,Y1)(Y1) + EI(X)(Y2) (3-23) E3(DX) = E34X,Y1,Y1)(Y1) + 2 E2(X,Y1)(Y2) + E2(X,Y2)(Y1) + EI(X)(Y3), E4(DX) = 2E3(X,Y2,Y1)(Y1)+2E3(X,Y1,Y2)(YI)+3E3(X,Y1,Y1)(Y2)+3E2(X,Y2)(Y2) +2E2(X,Y 1)(Y3)+E2(X,Y3)(Y1)+E1(X)(Y4)+E4(X,Y 1,Y 1,Y1)(Y1)+E2(X,Y1)(Y3). (3-24) Remark: When only one indeterminate is involved Y1, we recover the equality: Ek(DX) = Ek(x,Y1 ..... Y1)(Y1). According to these notations, the multivariable analogues of results stated previously can be written with respect to the generalized elements Ek(DX) for k > 1: Theqrem 3 : The following equality of series is verified: e-X

e (X + ~" :"""2.ttVx!)Yi)= 1 + ~

(tk/k[)Ak(X,Y1 ..... Yk)

(3-25)

k>l

with : Ak(X,Y1 ..... Yk)= (dk/dtkt/t = 0 e-X e(X +E (ti/i!)Yi) which can be rewritten as an homogeneous polynomial of degree k in the formal indeterminates: Ei(DX), i>l. Generalizing (3-8) and (3-9) according to the notations (3-23) and (3-24), we directly obtain: A 1(X,Y1)=EI (DX)=E1 (X)(Y 1) A2(X,Y 1,Y2)= {E 1(DX) }2 + E2(DX) A3(X,YI,Y2,Y3)= {EI (DX) }3+2E I (-DX)E2(DX)+E2(DX)E I(DX)+E3(DX)

(3-26) (3-27) (3-28)

Denoting with L the Lie algebra generated by X and Yi. i >1, with iLo the Lie ideal of L generated by Yi and successively denoting by Lo p, p>l, the decreasing series of ideals of "degree p" defined as: Lo 1= 1Lo, Lo 2= [1Lo, lLo]+2Lo, Lo 3= [[1Lo,[1Lo,ILo]]+[2Lo, ILo]+3Lo =[Lo2,1Lo]+3Lo, (3-29) Lo 4= [1Lo,[ iLo,[ ILo,1Lo]]]+[1Lo, {2Lo, ILo]]+[ 1Lo,3Lo]+[2Lo,2Lo]+4Lo = [Lo3,1Lo]+[Lo2,2Lo]+4Lo. We obtain: Corollary 5: Each element E'(DX), for i >_1 is a Lie element of "degree i" that is belonging to Loi.

795 Theorem 4: The following equality of formal series is verified: e-XeX+Y'(ti/it)Yi= 1+ ~

(tk/k!) ~

k_>l

m>l

~

c01 ..... im)Eil(DX) ... Eim(DX)

(3-30)

il,...,im>_l

= 1 + T(Ei! ..... Eim) = S(Eil ..... Eim), with ij > 1, ]~ ij = k. Moreover, the coefficients c(i 1..... ira) satisfy the recurrent formulae of the shuffle product.

4 - NONLINEAR SYSTEMS UNDER SAMPLING When we consider a nonlinear differential equation driven by controls, which are constant over time intervals of amplitude 5, the question of finding a nonlinear difference equation, so that its state evolution corresponds, at sampling times, to the solution of the nonlinear differential equation, when the same initial states are assumed, is referred to as the sampling problem. With respect to usual numerical integration methods, the solution of this problem leads to an implicit sampled representation described by a nonlinear difference equation. On the basis of these equations we can design various digital control schemes and study the preservation of special properties under sampling [8, 9]. This problem, solved in the linear analytical case (equation 1-1) [7] is posed here in a strictly formal way. This approach enables the extension to differential equations including non-linearities with respect to the controls; the solution appears as a direct consequence combinatorial formulae proposed in section 3. It has to be noted that the recursive properties of the solutions are very adequate to the use of symbolic programming language as REDUCE language for example already tested in [1]. Consider a nonlinear differential equation:

x(t) = f(x(t) + ~

(ui(k)/i!)gi (x(t)

(4-1)

i>1

defined on R n by the vector fields f and gi : Rn .... R n, assuming the controls constant over time intervals of amplitude 5 : u(t)=u(k), pour k5 < t 0, the question which is posed is to find a nonlinear difference equation of the form:

x(k+l) = F(x(k)) + ~

(ui(k)/i!)Gi(x(k)

(4-2)

i~l

such that the state evolutions of (4-I) and (4-2) coincide, at sampling times, when x(t -- 0) = x(0); that is x(t) = x(k), for t = k& k >- I. Denoting by Lf and Lgi, the formal directional derivatives associated respectively to the vector fields f and gi, we define an alphabet of formal variables on the basis of which computing rules established in section 2 and 3 can be applied. More precisely, elements Lf and Lg i are called of degree 1 and i respectively; the operation of concatenation of two letters corresponds to the composition of differential first-order operators and consequently exponential and logarithmic expansions, Lie and shuffle products are defined in a very natural way with respect to this composition.

796 Integrating during time interval [kS,(k+l)5[, the solution, at time t = (k+l)8, of the equation (4-1) can be expressed as the flow associated to the constant vector field : f(x(k)) + ~ (ui(k)/il)g i (x(k)). i>l We obtain the formal solution : x(k+l) = eS(L~Y, (ui(k)/i!)Lgi)(Id)/x(k) i_>l

(4-3)

The results proposed in section 3, in particular th e application of theorem 4 to (4-3), directly lead to a compact solution of the sampling problem. Comparing it with (4-2), we can write: F(x) = eSLf(Id)/x Gk(X) = (d k/duk)/u= O jS(Lf+X(ui(k)/i!)Lgi )(Id)/x, k > 1 = e~SLf ~ ~ c(il ..... im)Eil(DSLf) ... Eim(D~SLf)(Id)/x, m_>l il,....im>l with ij >- 1 and Y. ij = k

(4-4)

For the first terms, we compute: G 1(x) = erLf E 1(D~SLf)(Id)/x G2(x) = eSLf{ {El (DSLf) }2 + E2(DrLf) ] (id)/x G3(x) = eSLf{ {E l(DSLf) }3 + 2E 1(D~SLf)E2(DSI_,f) + E2(D~iLf)E I (DSLf) + E3(DSLf) }(Id)/x

(4-5) (4-6)

with: E 1(D~Lf) = E 1(SLf)(SLg I) E2(DfiLf) = E2(SLf, SLg 1) (SLg 1) + E 1(SLf) (SLg 2) E3(DSLf) = E3(SLf, SLgl,SLgl)(SLgl) + 2E2(SLf, SLgl)(SLg2) + + E2(SLf, SLg2)(SLgl)+ El(SLf)(SLg3),

(4-7) (4-8) (4-9)

The elements Ei(SLf, SLgil ..... 5Lgij.1)(fLgij ) are defined by the expressions (3-4) and (3-16) : For the first terms, we compute : El(SLf)(SLgi) = l'e'DSLf = ~ ~(-1) D S kL t k ( S L g l ) , DSLf ~o

k _>0

(4-10)

-- 5Lg i - (52/2!)adLf(Lgi) +(~3/3 !)ad2Lf (Lgi) + 0(53) E2(SLf, SLgi)(SLgj) =

1-DfLf-e-DSLf DSLf 2 (c~

1)k+ 1 DSLfk(c0DSLgi)(SLgj) =~z~ (-(k+2)! k>O = _ (52/2[)adLgi(Lgj) + (53/3 0adLfadLgi(Lg j) + (53/3 !)adLgiadLf(Lgj) + 0(53)

(4-11)

797

E3 (SLf, SLgi,SLgj)(SLgk) =,

I - D 8 L f - + (DSLf2/2!) - e - D S L f (coD8Lgi)(coDSLgj)(8Lgk) DSLf3

= ~ ~ ( - l ) k + 2 DSLfk(o~DSLgi)(coDSLgj)(8Lgk) k>0 = (5313!)adLgiadLgj(Lgk) + O(83)

0-12)

Remarks; l~i)Defining, according to (3-29), the successive Lie ideals associated to the elements 8Lf and 8Lg i ,for i > 1, it follows that Ei(D8Lf) belongs to L01. (it) Considering now the reverse problem which consists of associating to any difference equation of the form (4-2), a differential equation of the form (4-1), in such a way that the solutions coincide at sampling time, some remarks can be deduced from the previous analysis. Let us first assume the sampling period equal to 1 and the existence of a vector field f, such that eLf (Id), is equal to F. According to the definition (1-4) of the reverse series, we see that the vector fields gi can be successively computed as follows: gl(x) = B l(Lf)(G1)(Id)/x g2(x) = B 1(Lf)(G2_G1G I_E2(Lf,Lgl)(Lg 1))(Id)/x g3(x) = BI(Lf)(G3+2G1GIGt-2G1G2-GIG2-2E2(Lf,Lgl)(Lg2)-E2(Lf,Lg2)(Lgl))(Id)/x

(iii) For practical purpose, approximated solutions only are computed, this corresponds to the truncation of the series solutions, with respect to the sampling period, at a fixed order which define the order of approximation of the sampled system. REFERENCES

[1] J.P. BARBOT, A computer aided design for sampling a nonlinear system in "Applied Algebra, Algebraic Algorithms and Error Correcting Codes, Lect. Notes in Comp. Sc. 357, T. Moraed. pp. 74-88, 1989. [2] P. CROUCH, F. LAMNABHI-LAGARRIGUE, Algebraic and multiple integral identities, Acta Applicanda Mathematicae ~ pp. 235-274, 1989. [3] R.GOODMAN, Lifting vector fields to nilpotent Lie groups, J. Math. Pures & Appli., 57, pp.7786, (1978). [4] M. FLIESS, Fonctionnelles causales non lindaires et ind6terminres non commutatives, Bull. Soc. Math. France, 109., pp. 3-40, 1981. [5] M. FLIESS and D. NORMAND-CYROT, Algrbres de Lie nilpotentes, formule de Baker-CampbellHaussdorff et int6grales itErEes de K.T. Chert, s~minaire de probabilit6s, 1980-1981, 16, Lect. Notes in Math., J'. Azema and M. Yor eds., pp. 257-267, Springer-Verlag, 1982. [6] P. LEROUX and X.G. VIENNOT, A combinatorial approach to nonlinear functional expansions: an introduction with an example, Proc 27-th IEEE-CDC, Austin Texas,USA, 1988. [7] S. MONACO and D. NORMAND-CYROT, On the sampling of a linear control system, Rap. 6-84, DIS, Rome, Proc. 24th IEEE CDC, Fort Lauderdale, USA, pp. 90-95, 1985. [8] S. MONACO and D. NORMAND-CYROT, Invariant distributions under sampling, Theory and Applications of Nonlinear Control Systems, ( C.I. Byrnes and A. Linquist eds.), North Holland, pp. 215-221, 1986. [9] S. MONACO and D. NORMAND-CYROT, Zero dynamics of sampled nonlinear systems, Syst. and Contr. Letters, 1._LL1pp. 229-234, 1989. [10] S. MONACO and D. NORMAND-CYROT, Functional expansions for nonlinear discrete time systems, Mathematical Systems Theory, 21, pp. 235-254 1989. [11] S. MONACO and D. NORMAND-CYROT, Combinatorial relations between discrete and continuous time nonlinear systems, submitted for publication, 1990. [12] R.REE, Lie elements and an algebra of shuffles, Ann. of Math., 68, pp.210-220, 1958.

N O N L I N E A R MRAS IN ROBOTS M O T I O N C O N T R O L Bernard BROGLIATO Laboratoire d'Automatique de Grenoble - ENSIEG-INPG-CNRS BP 46 38402 Saint-Martin d'H6res, France

abstract : Inttds paper, we provide a brief overview o f the principal adaptive control laws that have been developped in the field of rigid manipulators during the last ten years, that can be turned into a common MRAS framework and analysed using passivity arguments.

1) I N T R O D U C T I O N Robot manipulators control has been a major topic for an increasing number of researchers during the last decade [1]-[15].Lndeed, besides the fact that new schemes had to be developped in order to improve the performances of new manipulators (particularly those employing direct drive arms), making it mandatory for designers to take the inertial effects into account in their totality (such as Coriotis and centrifugal torques), it appeared that those systems belong to a very particular class of non-linear systems, and even of finite-dimensionnal physical systems (ie systems described by a finite set of generalized coordinates ).Many control schemes have emerged, a large proportion of which are explicitly or implicitly inspired from the Model Reference Adaptive Systems (MRAS) approach studied in the linear case (see eg [16]).Two main phases can be distinguished concerning robots control through a model-reference approach : i) the first one consists in adaptive control laws designed as parallel Adaptive -Model -Following -Control (AMFC) schemes.(figure 1) ii)the second one consists in adaptive control laws derived from the computed torque method, based on the linearization- wit- the- unknown- parameters property of manipulators, and using some physical properties of robots.They can be interpreted as series-parallel AMFC .(figure 2) The general problem of controlling rigid frictionless manipulators can be stated as follows : Find a control law that asymptotically (in the adaptive case) linearizes that non-linear plant, ie makes it asymptotically follow a prespecified linear decoupled reference model.It is in fact essentially this model which will make laws of type i) differ from laws of type li) above. The paper will be organised as follows: in section 2, we provide a brief recall of linear AMFC.In section 3, we present an overview of the "first generation" schemes studied in [1]-[7].In section 4

799

we present the second type of adaptive laws studied in [8]-[15].Section 5 is devoted to unify stability analysis of those schemes. Conclusions are given in section 6.

~_~

reference ] ym model u

1u

riga : parallel MRAS (trajectorytracking : r ~ 0 )

fig. 2 : series-parallelMRAS (state regulation : r = 0 )

2) L I N E A R A M F C 21) Generalities Model reference adaptive systems have been studied extensively in the literature .As noted in [17], designing MRAS requires, from a general point of view, two steps: i) an algebraic part : One has to find a class of reference models such that a bounded control signal u and a controiler C(0*) exist such that lim t~+,,o ]] Xp - Xm[I =0 ,where x m and Xp are the model and plant state (or output) vectors respectively, O* being the true unknown parameters of the plant. ii) an analytic part : it consists in determining the adaptive law for adjusting the parameters estimates 0, such that the overall system will be stable. Point i) obviously requires prior knowledge about the plant that has to be controlled. It is strongly related to minimum-phase properties of the plant.It is called the perfect model matching problem .It can be considered either as a transfert matrix matching, or by establishing algebraic conditions on the error equation.It leads in that case to the well-known Erzberger's conditions on the plant and the model [16] .Note that these are sufficient only and may or may not be verified[27].AMFC problem can then be seen as an MR.AS problem for which a solution of asymptotical model matching exists. 22) A s s u m p t i o n s on the model and the plant W e give here a list of the properties that are generally required in the parallel AMFC problem for SISO systems [17][18][19] and that allow one to solve points i) and ii) above.The plant and the

800

model are described as follows: (P) (RM)

:~p=Apxp+Bpu xm=Amxm+Bm r

; yp=Cpxp ; ym=Cmxm

where r is a bounded piecewise continuous signal. i) the plant is minimum-phase and stabilizable ii)an upper-bound on the plant order is known iii)the relative degree p of the plant is known, and p < Pm (Pm is the relative degree of the model) iv)the sign of the high-frequency gain (transfert function approach) is known. Note that i) is necessary for (P + controller) to be detectable[17,p.375], as the eventuals poles-zeros cancellation that may appear between the plant and the controller (and lead to unobservable modes) are stable.It is shown in [18] that iii) is the only necessary assumption.If the plant zeros are exactly known (which may not be the case in practical applications), then i) is not necessary, but the model has to contain the plant unstable zeros, in order to avoid unstable poles-zeros cancellations.One could ask here whether this still represents a reference-model problem, or simply pole-placement.ii) can be rendered unnecessary by relaxing the objectives on the error [18]. iv) is not a necessary condition : controllers can be found that achieve the MRAC problem although it is not verified.The most general approach for which this condition is unnecessary can be found in [28]. Several remarks arise : i) All that has been done is concerning SISO systems.It can be translated to the MIMO case [17] [19], taking into account the particular structure of multivariable systems, especially for those systems that admit a Hermite normal form .But this is far from trivial, particularly in the adaptive case, and is still being under investigations. ii) The case usually studied in the literature is the parallel MRAC problem, with an input / output (transfert function) approach.The approach which seems to be the most widely used [20]- [23] is based on an augmented state vector xT= [ XpT, wiT, w2T], where w I and w 2 are n-dimensionnal vectors obtained by filtering the input and the output of the plant.It is shown [20] that a non-minimal representation of the reference model can be constructed in terms of the plant true parameters, that allows to derive an appropriate error equation, and then conclude on the convergence of the state error vector. It presents the great advantage of using the plant output instead of the state vector in the control law. Schemes designed from a state space approach have been seldom studied.An approach proposed in [16, p. 218] is based on hyperstability's arguments for the stability analysis.The control law

801

consists in output and state feedbacks, the gains of which are adaptively adjusted by the so-called integral+proportionnal adaptation mechanism .The main drawbacks of this approach are that the input matrix Bp has to be known (or at least its structure) for stability conditions,and that the plant state vector has to be measurable.Though these assumptions could appear at first sight quite stringent, there exist particular cases (see next section) for which they are verified. However, it presents the advantage of being available for MIMO systems, as no assumption is done on the output and input dimensions. remarks a) In this approach, the relative degree problem related to the SPR condition is avoided, as one can consider the output to be such that Cp Bp # 0, ie the relative degree of that "equivalent" system is one. b) MR.AS are often presented as an example of direct control.As noted in [17], it is not a necessary condition, and MRAS can also be designed through an indirect approach .In the litterature, MRAS are almost always considered through a direct approach, as it can be shown that for minimum-phase systems, a reparametrization of the process is always possible so that the estimated parameters turn out to be those of the controller .The most commonly used algorithms are gradient type, as the proportionnal + integral adaptation law [16].This is due to some nice properties of such algofithrr~ such as passivity, which is not met in least-squares algofithrr~ for instance. 23) Robustness issues Adaptive laws lead in the "ideal" case to boundedness of all the signals in the closed-loop system.Actual implementations of theses schemes make it necessary to take the "non-ideal" (ie perturbed) case into account.Several types of unmodeUed dynamics can be considered : i) internal and external bounded disturbances on the state or the output of the plant. ii)Plant parameters variations in those schemes where the process to be controlled is tinae-varying. iii)Plant dynamic uncertainties. A general way to design adaptive schemes with unmodeIled dynamics is to relax performances objectives [ 17][24], ie such algorithms should at least insure BIBO stability, which constitutes in fact the main theoritical problem.A fast approach is to take "small" disturbances only into account [t7][40,p.16] : ~is is based on the total stability concept and is related to the uniform asymptotic stability of the unperturbed system.Considering general unmodelled dynamics, two approaches are available: either modifying the adaptive law (e 1 modification[22],cr modification[21], bounds on

110"11124],dead-zone[25]), or increasing the "richness" of the reference input [26].Note also

that when no disturbance is present, those algorithrr~should at least lead to finite parameters and output errors, and verify exact or asymptotical model matching conditions if possible.Application

802

of the first type of modifications to the series-parallel case should not cause any difficulty.However, persistent excitation arguments on the reference model output are to be reconsidered, as Ym depends on yp and therefore cannot be assumed bounded (which is a necessary condition of excitation). 3) NONLINEAR AMFC 31)Generalities The problem of compensating a nonlinear system in order to make it match a linear reference model has been invastigated (see eg [29],[19, chp 7] for references, [27]).Techniques described therein may not always be very useful for the manipulator's case which represents, as noted in the introduction, a narrow class of nonlinear plants, for which global results can be derived. (Indeed, the general techniques applied to smooth affine in the input nonlinear plants of the form ~ = f(x) + g(x) u often lead to local results only).[30] provides a comparison work on several dynamic linearizing feedbacks studied for rigid robots, in the non-adaptive case.Note that all those linearizing tools present the major drawback that if there is any uncertainty on f and g, then the linearizing properties vanish. Adaptive control has recently been studied.J31] solves the nonlinear AMFC problem for a particular class of time-varying perturbed plants, employing alternatively a discontinuous VSS approach, and a continuous control law derived from a work presented in [16] for the linear case. A more general approach can be found in [19], for continuous SISO and MIMO square systems, linearizable and decouplable by static state feedback, which can be seen as an extension of the work done in [32] for robots controLIt represents an attractive pioneer work, but its major drawback is that estimated terms appear on the denominator of the control law, and must therefore absolutly be kept away from zero.This problem is far from obvious, and has been recently solved for the linear case in [28]. Due to the difficulties encountered, the nonlinear AMFC problem has been much less studied than its linear counterpart.However, it seems that similar problems arise for the design of suitable schemes, especially concerning assumptions i) and iii) in paragraph 22) (SISO plants).The main distinction lies in the definitions of those properties, particularly for the MIMO case, where several definitions of the zero-dynamics can be given [ 19, p.305].Fortunatly, for systems decouplable by static state feedback (as rigid robots) they are equivalent.

32) manipulators dynamics Let q, ~t, and ~ denote the robot's joints position, velocity and acceleration n-vectors.Rigid frictionless manipulators dynamics are then given by the set of nonlinear differential equations : M(q) ~t'+ C(q, ~) ~ = u- g(q)

(1)

803

where :

M(q) is the nxn inertia matrix, C(q, ~t) (} contains the centripets and Coriolis torques,

g(q) stands for gravity torques, u is the input vector torque. The main properties of equation (I) are : i) M(q) is a symmetric strictly positive definite matrix, bounded in q, composed of smooth functions ofq. ii) M(q) - 2 C(q, ~t) is a skewsymmetric matrix. iii) M(q), C(q, ~1) and g(q) can be expressed as Y0, where Y is composed of known functions of q and q, while 0 stands for unknown physical parameters. iv) (1) is static state feedback linearizable and decouplable by the so-called "computed-torque" control law :

u = C(q, ~t) q + g(q) + M(q) a

, a being any output signal.

v) q and q are both physically accessible for measure. i) and ii) are fundamental properties used to derive stability proofs of many schemes .i) insures among other things that global solutions of (1) exist [17, p.55], using the smoothness property of M(q).ii) shows that any system with input v and ouput M(q)~ + C(q, ~t)v, v a n-dimensional vector, represents a passive system [33]. iii) and iv) have been extensively used in the literature, at least for a certain class of schemes [8]-[15]. v) means that the state vector xT = (qT, ~1T) is entirely measurable.This is worthnoting, as minimum-phase properties of systems (linear or not) are obviously related to their output.In the finear case, transfert zeros'dynamics are given by forced (u ;e 0) state trajectories that nullify the output. In the nonlinear case, techniques have been developped to caracterize t h e so-called zero-dynamics [19, chp 7].They are also related to the definition of the output y = h(x).So if the state is known, the designer can choose among several possible definitions of h.In the case of robots, it has been shown that choosing y = q leads to a minimum-phase plant [ 19, p.322].This fact has certainly played a key role in the design of robots AMFC in joint space coordinates.Note that as long as the jacobian matrix J(q) remains nonsingular, one can easily translate joint space design into task space design [10]. The next paragraphs are devoted to a tentative classification of several schemes studied in the field of MRAS applied to robots.We will not consider in the following pioneer works (see eg [34]) that were based on "large" approximations of equation (1), due to their lack of global stability. 33) Parallel A M F C 331)Schemes derived from the linear case Chosing xT = (qT, ~tT) as a state vector, (1) can be rewritten :

~=

dl - M-l(q) [C(q, c].) ~ - g(q)]

+

u

M-l(q)

(2)

804 or :

= A(x) + B(x) u

(3)

It is then attractive to consider the manipulator as a 2n-dimensional plant, and try to make it match a linear decoupled model : im = Am Xm + Bm r

Ama 2n x 2n

Hurwitz matrix

Such is the purpose of papers [ 1]-[5].These control laws were inspired by the work done in [16,p.218].The main distinction lies in the adaptation laws, that include specific terms to compensate for the nonlinearity of (3). Several solutions are proposed, some of which are based on discontinuous (or "unit vector") control laws [1][2][5 part 1], or on continuous control laws [3][4][5 part 2].Properties i) and v) have been used.Note that property iii) has been used in [3] to rewrite the error equation derived from (3) (see [3] for details). As noted in [1], these algorithms require less computations than computed torque techniques. However, they also lead to less precise results. A major drawback of the discontinuous (sliding modes) approach is the so-called chattering phenomenon. No robustness analysis is available in the above references.It has been shown that including a dead- zone in the update law can reduce these negative effects [35].

332) Adaptive computed torque A second type of parallel schemes has been developped in [6]-[7].They appear as the first application of computed torque techniques in an adaptive context. Unfortunatly, on the contrary of schemes [8]-[ 15], no use is done of properties ii) and iii). Unknown time-varying terms are then updated using adaptive algorithms similar to those proposed in [16], and the closed-loop stability is insured by adding a large enough feedback PD term in the control law (see references for details). The reference model is then given as in the exact linearization case by a n th order decoupled double integrator that the plant + controller will asymptotically track. The main drawback of this approach is that the stability proof (as in [4]) hinges on the assumption that M(q) and several other time-varying terms in (1) remain constant during the adaptation process. 4) SERIES-PARALLEL AMFC Another way of ~onsidering equation (1) is to simply write it as : ~1"= M-l(q) [ C(q, q) q + g(q) ] + M'l(q) u

(4)

Chosing the "state" vector as x = q, (4) reads as : x = f(x) + g(x) u

f and g being this time n-dimensional vectors.

(5)

8O5 A whole class of adaptive schemes [8]-[15] can be interpreted as laws achieving asymptotical tracking between (5) and a series-parallel reference model given by : xr = - A x + r , where r = A Cla +/:ia

(6)

One key point in the stability analysis is that the error vector is given by : v = x - x r = H(s)- 1r q , with H(s) = ( sI + A) -1 and qtw = q - qd (see [36] for details) These algorithms can be considered as the most accomplished ones to date. Properties i), through v) are used. Various control laws have been proposed, from the pioneer works on the subject, derived almost simultaneously in [8] and [9], which can be seen as a continuation of schemes developped in [6]-[7], to more elaborate schemes [10]-[15]. The basic idea remains the same from one to another. Modifications essentially concerns update laws, with the so-called composite [11] or repetitive [14] controllers, or update laws using only desired trajectories (instead of measured signals) in the regressor matrix [12]. Exponentially stable schemes have also been studied [13], [15]. 5) S T A B I L I T Y ANALYSIS An interesting point is that all those schemes [1]-[15] can be interpreted as a feedback connection of a strictly-passive block with a passive one (figure 3). Some of them have explicitly been designed using that tool [I]-[3],[5],[6],[7]. The others [4], [8]-[15] have been studied employing a Lyapunov approach. It can be shown [37] that they can be turned into such a passive interpretation.

~ L~

str'~y-passive

passive

1

figure 3

All those convergence proofs have been lead in the ideal case.Robustness analysis similar to those already done in the linear case have been rarely published.Pioneer works on that subject can be found in [38] and [39].[38] is devoted to the robustness analysis of the scheme developped in [32].The main drawback of the approach is that the size of the instability domain depends on M-l(q), where M(q) is the estimate of the inertia matrix. Therefore, M(q) must be kept nonsingular.In [39], the authors have shown that the ~-modification can be applied to the laws

806 [8] and [9], considering almost all the possible unmodeUed dynamics i) to iii) (section 23)).At this stage several renmrks arise : i) Parasitic noises on the output (especially on the velocity q measures) have not been considered. ii) It would be interesting to see if the other techniques developped in the linear case can also be applied to the robots case. iii) Those robustness tools have been developped for parallel schemes. Results obtained in [39 ] confirm the remark done at the end of section 2). 6) C O N C L U S I O N In this paper, we have provided a tentative summary of the most significant adaptive control laws for manipulators studied in the last ten years. The most important fact is that all those controlled systems can be interpreted through a common framework (ie MRAS ), and analysed using passivity tools, either they were explicitly designed like that, or through a Lyapunov approach. references [ 1] Nicosia S.,Tomei P.:"MRAC for industrial robots";Automatica,vol.20,n~ 1984 [2] Balestrino A.,DeMaria G.,Sciavicco L.:"An AMFC for robotic manipulators";ASME JDSM&C,vol. 105; 1983 [3] Nicolo F.,Katende J.:"A robust MRAC for industrial robots";2nd IASTED Symposium on Robotics and Automation, Lugano; 1983 [4] Lim YK.,Eslami M.:"Adaptive controllers designs for robots manipulators systems yielding reduced cartesian errors";IEEE TAC, vol.AC-32,n"2; 1987 [5] Balestrino A.,DeMaria G.,Sciavicco L.:"Hyperstable AMFC of nonlinear plants";Systems & Control Letters,vol.l,n~ 1982 [6] Tomizuka M.,Horowitz R.,Landau ID.:"On the use of MRAC techniques for mechanical manipulators";2nd IASTED Symposium Modelling Identification Control and Robotics,Davos,March;1982 [7] Horowitz R.,Tomizuka M.:'An adaptive control scheme for mechanical manipulators;compensation of nonlinearity and decoupling contror';ASME JDSM&C,vol. 108,n~ [8] Sadegh N.,Horowitz R.:"Stability analysis of an adaptive controller for robotic manipulators";IEEE International Conference on Robotics and Automation, Raleigh; 1987 [9] Slotine JJE.,Li W.:"Adaptive manipulator controha case study";IEEE TAC,vol.33,n~ 1; 1988 [10] Kelly R.,Carelli R.:"Unified approach to adaptive control of robotic manipulators";27th IEEE Conference on Decision and Control,Austin,Texas; 1988 [11] Slotine JJE.,Li W.:"Composite adaptive control of robot manipulator";Automatica,vol.25,n~ [ 13] Sadegh N.,Horowitz R.:"An exponentially stable adaptive control law for robot manipulators";submitted for publication [ 12] Sadegh N.,Horowitz R.:"Stability and robustness analysis of a class of adaptive controllers for robotis manipulators";To be published in the International Journal of Robotics research,1987 [14] Sadegh N.,Horowitz R.,Kao WW.,Tomizuka M.:"A unified approach to design of adaptive and repetitive controllers for robotic manipulators";presented at the USA-Japan symposium on flexible automation, Minneapolis ; 1988 [15] Bayard DS.,Wcn JT.:"New class of control laws for robotic manipulators: part 2: adaptive cam"; International Journal of Control,vol.47,n~ 1988 [16] Landau ID.:"Adaptive control: the model reference approach";Dekker,New-York;1979 [17] Narendra SK.,Annaswamy MA.:"Stable adaptive systems";Prentice Ha11;1989 [18] Miller DE.,Davison EJ.:"On necessary assumptions in continuous time MRAC";28th IEEE Conference on Decision and Control,Tampa,Florida,December;I989 [ 19] Sastry S.,Bodson M.:"Adaptive control: stability, convergence and robustness";Prentice Hall; 1989

807 [20] Narendra KS.,Valavani LS.:"Stable adaptive controller design-Direct control";IEEE TAC,vol.AC-23,n~ [21] loannou PA.,Tsakalis K.:"A robust direct adaptive controUer";IEEE TAC,vol AC-3 l,pp.1133-1143;1986 [22] Narendra SK.,Annaswamy MA.:"A new adaptive law for robust adaptation without persistent excitation";IEEE TAC,voI.AC-32,n~ 1987 [23] Sastry S.:'MRAC :S tability,parameter convergence and robustness";IMA Journal of Mathematical Control & In formation,vol. 1,pp.27-66;1984 [24] Kreisselmeir G.,Narendra KS.:"Stable adaptive MRAC in the presence of bounded disturbances";IEEE TAC,voI.AC-27,n~ 1982 [25] Peterson BB.,Narendra KS.:"Bounded error adaptive control";IEEE TAC,voI.AC-27,n~ [26] Narendm SK.,Armaswamy MA.:"Robust adaptive control in the presence of bounded disturbances";IEEE TAC,vol. AC-31,n~ 1986 [27] Marino R.,Nicosia S.:"Linear model following control and feedback equivalence to linear controllable systems";International Journal of Control,vol.39,n~ 1984 [28] Lozano-Leal R.,Collado S.,Mondie S.:'Model reference robust adaptive control without a-priori knowledge of the high frequency gain~;IEEE TAC, to appear;1990 [29] Di Benedetto M.:"A condition for the solvability of the model matching problem";Conferenee on nonlinear systems analysis,Nantes, France; 1988 [30] Kreutz K.:"On manipulator control by exact linearization";IEEE TAC,vol.34,n~ [31] Balestrino A.,DeMaria G.,Zinober ASl.:"Nonlinear AMFC";Automatica,vol.20,n~ [32] Craig JL:"Adaptive control of mechanical manipulators";Addison Wesley Publishing Company;1988 [33] Landau ID.,Horowitz R.:"Synthesis of adaptive controllers for robots manipulators usinf a passive feedback systems approach";International Journal of adaptive Control and Signal Precessing,vol.3,n~ [34] Dubowsky S.,DesForges DT.:"The application of MRAC to robotic manipulators";ASME JDS M&C,vol. 10 l,September; 1979 [35] Slofine JJE.:"On modelling and adaptation in robot control";IEEE Conference on Robotics and Automation,San Fransisco,CA; 1986 [36] Ortega R.,Spong MW.:"Adaptive motion control of rigid robots: a tutorial";27th IEEE Conference on Decision and Control, Austin,Texas;1988 [37] Brogliato B.,Landau ID.,Lozano-Leal R.:"Adaptive motion control of robot manipulators:a unified approach based on passivity"; American Control Conference, San Diego,CA; 1990 [38] Kang H.,Dawson D.,Lewis FL.:"A robust adaptive controller for rigid robots";28th IEEE Conference on Decision and Control,Tampa,Florida;1989 [39] Reed JS.,Ioannou PA.:"Instability analysis and robust adaptive control of robotic manipulators"; 27th IEEE Conference on Decision and Control,Austin, Texas; 1988 [40] Anderson BDO.,Bitmead RR.,Johnson CR.,Kokotovic PV.,Kosut RL.,Mareels IMY.,Praly L.,Riedle BD.:"Stability of adaptive systems : passivity and averaging analysis";MIT Press; 1986

Adaptive Control of Feedback Equivalent Systems J.-B. Pomet*

A b s t r a c t We address the problem of stabilizing a nonlinear system depending on some unknown parameters in suck a way that M1 the systems obtained by varying these parameters are equivalent to one system, supposed to be state-feedback stabilizable. The description of the adaptation iaws make use of passivity. We consider both the "general" case, and the case where some "matching assumptions" hold.

and

L. P r a l y t

on the p a r a m e t e r p; g is the matrix field defined by (3). One particular p* in Ill will be called the "true value of the p a r a m e t e r p", and our problem is to stabilize the system gp,, p* being unknown. This will be done by means of a dynamic controller, i.e. of a system with a certain state, to be determined, input x, and output u : State: x U ~

1

~.~p.

X

Introduction

We consider a family of nonlinear affine-inthe-control systems, indexed by a parameter vector p: P =

(Pl

...

Pl) T

e

RI

.

(1)

State: to determine

u

Adaptive Controlier

Ix

Fig. 1 : Closed-loop system

T h e state of the overall closed-loop system The system Sp corresponding to a given is composed of x and the dynamic variables value of p is described by : (or the state) of the controller. By "stabilize Sv." we then mean that both x and Sp : ~ = f ( p , x ) + g(p,x) u (2) the dynamic variables of the adaptive conm troller must be bounded, and x must tend f(p, x) + x (3) k=l to a certain point 0 of M " , for all the solutions of the closed loop system (global where the state x lives in an n-dimensioned properties), or only for some (local propC ~ manifold M " and is completely meaerties). By "p* being unknown", we m e a n sured, that the adaptive controller must not deu = ( u , , . . . , u,~) (4) pend on, or use the value of, p*. is in tZ"~, and f and the gk's are k n o w n This general problem, as well as the smooth vector fields smoothly depending distinction between global and local results, is extensively discussed in [10], [8J, *Univ. of Toronto, Department of Mathematics, or [11], where a complete bibliography m a y Toronto, Ontario, M5S IA1 Canada tEcole des Mines, CAI Section d'Automatique, be found. T h e present paper is specifically 35 rue St Honor4, 77305 Fontainebleau, France devoted to the case when all the systems

809

@ are equivalent to one another by state feedabck and diffeomorphism. In section 2, we state precisely our assumptions about the systems Sp. In section 3, we describe some general adaptation schemes. Section 4 presents some adaptive controllers designed for the general case where no more assumption is satisfied. Section 5 is devoted to the case where some "matching assumption" is satisfied and presents some modified controllers. Finally, in section 6 we give some geometric conditions allowing to construct a diffeomorphism meeting this "matching assumption".

is such that

rankg(p, x)Z(p, x) = r = k g ( p , x) (7) 2. There is a system

= i(=) +

independant of p such that~ for any p,

u

=

(9)

=

c~(p,x) q- fl(p,x) w (10)

transforms (2) into =

2

(s)

f(~) + O(~)w

(11)

Assumptions

Linear Parametrisation (LP) assumption: The fields f and g in (2) depend linearly in the parameter p : I

f(p,x)

-= a~

+ ~--~piai(x)

(5)

i=l 1

k(p, x) =

+

b (x) , (6)

Local Stabilizability of the Transformed System (STS(~2) ) assumpt i o n : There ezist k n o w n fltnctions Vnom and U, of class (at least) C I and C 2 respectively, from a neighborhood ~ of 0 in M ~ to R "~ and to l:t respectively, such that: 1. U(~) is n o n n v a t i v e , and zero if and only if ~ is zero, and for any I ( > O,

i=1

{ ~ / Y(~) < I ; } where the ai's and the b~ 's are smooth vector fields 5ndependant of p). Numerous practical examples, as position control of a DC-motor [6] or control of biochemical processes [4] satisfy this linear dependance on the parameters. All the existing schemes for adaptive nonlinear control require linear parametrization. Feedback and Diffeomorphism Equiv a l e n c e ( F D E ) a s s u m p t i o n : There exists three smooth maps, o~ from R z x M ~ to I t ' , t3 f r o m R ' • M ~ to Mm• and ~o from R l x M ~ to M ~ such that

(12)

is a bounded subset of M ' L 2. For all ~ in M ~, we have:

U~=~(() : L~U(() , x) with A a Hurwitz matrix). Then, the adaptive controller (26).(60)-(28)-(29) for feed[a'(x) + ~__lukb~(x)t40) back linearisable systems is similar to those described in [3], though the point of view Notice that q is the state of the small sysis different there. We have the following tem necessary to realize (27) using only results for these controllers (see [3] or [8]): available signals.

If FDE and LP are satisfied, and S T S is satisfied locally, i.e. STS(~) is satisfied for T h e o r e m 1 ([11]) some ~, then any solution of the closed- If assumptions LP and FDE hold, and asloop system (figure I) such that 4 0 ) and sumption STS holds locally, there exists an Ko) are closed enough to zero and p(O) is open neighborhood of (p*,O,O) such that

813 any solution (~,x,~) with initial condition (i5(0), x(0), 7/(0)) in this neighborhood, exists on [0, oo), remains in a compact set and its (x,~)-component tends to zero. In addition, the point (p*,O,O) is a (non asymptotically) stable equilibrium point. T h e o r e m 2 ( [ i i ] ) I[ assumptions LP and FDE hold, and assumption S T S holds globally, and if in addition there exists a C o function d on II such that for all (p, x) in II x M , with Z defined in (39),

IIZ(p,=)ll ~ ( p , x ) < d(p) (1 + V(p, x) ~) ,

meeting, Ior any (p, q,b, ~), x)v2(p,q,[o,x)+-fi-j(p,x)[o = 0

(43) We will use here/~ to compute ul with (see (15)) : Ul = Unom(P, X) ,

(44)

and the estimate ~ of q* to compute u2 by :

u~ = v2(p,~,/~,~) .

(45)

(41) We then have =

,~(~) -]- A ( p , x, u . . . . (]), x)) (p* -- p)

then all the solutions are defined on [0, (x)), + A2(15, x, v2(i5, ~,p, x)) (p* - ~X46) remain in a compact set and their (x,q)with A2 the row vector defined by component goes to zero. This algorithm therefore either gives only local results or requires the bound (41) on the growth of the different vector fields. This was also the case for the first algorithm (h = some coordinates), but, as stressed in [11] or [10], if the systems are such that $ is not Lipschitz in the coordinates h, we cannot guarantee global stability in the previous algorithm whereas we can here.

~ ( p , x) [g(ql, x) - g(q~, x)] ~ = A~(p, ~, u~) (q~ - q~) (47) We will again use the a d a p t a t i o n schemes proposed in section 3 to obtain (#, ~). We propose three different choices :

1. First, as in section 4, we m a y chose for h some coordinates on M " , using the filter (27) with r some positive constant. AgMn, we only mention the results obtalned if the systems are fully feedback linearizable and the coordinates h are these 5 Adaptive control with in which ~ is linear. Thee adaptive controller is given by (17)-(44)-(45)-(26)-(28)matching assumptions (29). The same controllers are described in We call matching assumptions the fact that [3], but without the reparametrisation (and F D E holds with the following restriction on consequently with # in place of ~), and unthe dependance on p of ~ : der an assumption different from FDEM2, see remark 2 below9 We have the following A s s u m p t i o n F D E M 2 : Assumption F D E result : is satisfied, and ~o has the property that If FDEM2 and LP are satisfied, and S T S there exists a smooth map v2 : is satisfied globally (resp. STS(~2) is satisfied for some ~), then any solution of (p,q,~,x) ~ v2(p,q,p,x) E R ~ (42) the closed-loop system (resp. any such that

814

z(0) a~d 7(0) a~e closed enoueh to ~e~o and /5(0) is close enough to p*) is bounded and such that x(t) goes to zero. Notice that, unlike in the situation of the preceeding section, we need no additionnal assumption like global Lipschitzness to get global stability. This would not be the case if the vector field h was not linear in the coordinates h (see [8]). This is one of the reasons to prefer the following choice of h.

(p*,p*, O, O) is a (non assymptoticaIly) stable equilibrium point of this dynamical system.

This vields the following controller : A d a p t i v e C o n t r o l l e r .AC~(U) :

We then define the following modified algorithm which is defined only for U(~) smaller than [To : A d a p t i v e C o n t r o l l e r .AC~'(U, [70) : The same as .AC'I(U), but U is replaced by

Here, we get a global result as soon as assumption STS is global, without a restriction like (41) (but under assumption FDEM2, which is stronger than FDE). Yet, STS being globally satisfied is very restrictive. If it is satisfied only locally, AC'I(U ) gives a good beheviour for only some so2. Another choice for h is h = U, the filter lutions : f has to start close to 0, which being given by (27) with r equal to 1. The is natural, but (/5, q) also has to start close expression of Z is then to (p*, q*), which is more unconvenient. We will modify ACI(U ) to get a better local reZ(/5, q, b, x) = ( Z I Y ) , (48) sult. Suppose that STS(f~) is satisfied for a certain f/, and Uo is a positive number with, A2 being defined by (47), such that Y(/5, x, v2(/5,p, q, x)) (49) 0 ~ , ^, u ( ~ ) < Uo ~ ~ ~ a . (55) = -~ztp x)A2(/5, z,v2@,/5,~,z)).

p

=

U.om(/5,~) + ,~(/5,/5,0,~)(50)

=

-- (~ Z(/5, X, Unoitl(/5 , X))

(51)

- eY(/5,~,,:(/5,b,O,~))"

(52)

OU~ ).~( s~ ) + -~;(

(53)

q = =

-~

uq

(s6)

in (53) and in the definition of Z and Y. We have the following result :

where e = U - U(~),

UoU uo - u

(54)

and Z and Y are given by (39)-(40) and (49)-(47). and the following result :

T h e o r e m 4 If FDEM2, LP and STS(~) hold, and Uo is chosen according to (55), the solutions @(t), ~(t),~(t), =(t)) of the

closed.loop

system S,.-.aCi(U, Uo) such

T h e o r e m 3 If FDEM2, LP and S T S that U(~(0)) < Uo are defined on [0, oz), (resp. STS(f~) for a certain neighborhood remain in a compact 3et and their (x,7])ft of(O, O) in R zx M '~) hold, then all the so- component goes to zero, U(~) remaining lutions (/5(t), ~(t), ~(t), x(t)) of the closed. smaller than Uo. In addition, (p*,p*, O, O) lOOp system Sp.-~Ctl(U ) (resp. the solu- is a (non assymptotically) stable equilibtions with initial conditions in a certain rium point of this dynamical system. neighborhood o f ( / , / , O, 0)) are defined on [0, oe), remain in a compact set and their S k e t c h o f p r o o f : We only state the proof (x, U)-component goes to zero9 In addition, of theorem 3; the proof of theorem 4 goes

815 the same way, replacing U by (56) and considering that (56) is infinite when U is Uo. Consider a solution of the closed-loop system. Lemma 1 (30) implies that i5 and e = 9 - U are bounded on [0, T), the right maximal interval of definition of the solution, and that the time-ruction e(t) is in L2([0, T)). In addition, from (53), (13) and (54), we have :

This controller is the same as these described in [5], except that the double parametrization is not used there (replace by/3 and (62)-(63) by/~ = - Z r - y r . The approach to the synthesi presented here is however different : the authors design the adaptation, and the control u2, to make the positive function - Oil + 1-Iq* =

y(~) + 89

i1 < - c q

+ (l+e) e ,

(64)

(57) decrease

ong t h e s o l u t i o n s

(in fact, U(4)

is a quadratic function of { and the third term is absent there). The authors encounter the problem about implicit definition of i3 which we mention in remark 2. We can, as for AC'I, modify it AC~ into iA d a p t i v e C o n t r o l l e r AC,,i.rr : ~ , Uo). : t The same as .AC:(U), but U is replaced 3. A third choice for h in the algorithm of by section 3 is UoU which, together with e being L 2, implies that q is bounded too. This implies that the solution itself is bounded on [0, T) and therefore that T = +oo. Then (57) and (30) applied for T = +oo give e and 7, and therefore U, going to zero. []

i

-

0

the f e e d b a c k law (which

c o i n c i d e s w i t h the law d e r i v e d in [7] w h e n k-l) u I - w I w 2- k ( W l - W 3) u 2 - -2w I w 2 w 3 - k ( w 2 + w 3 2) locally asymptotically

Example

2,4

stabilizes

Zero d y n a m i c s

and d i s t u r b a n c e

(2.10).

also p l a y a f u n d a m e n t a l

rejection.

- s(w)

the s y s t e m

role in p r o b l e m s

C o n s i d e r an e x o g e n e o u s

of a s y m p t o t i c

tracking

system

w E I R 2, s(0) - 0

producing a reference

trajectory

(2.11a)

to be a s y m p t o t i c a l l y

tracked

YR - r(w)

(2.11b)

and a d i s t u r b a n c e

signal d(w)

x - f(x) + g ( x ) u + p(x)

to be a s y m p t o t i c a l l y

d(w)

(2.11c)

Two m a i n p r o b l e m s are to d e s i g n a f e e d b a c k scheme, only error m e a s u r e m e n t , achieving asymptotic e(t) - h(x(t))

w h i c h stabilizes

tracking;

Under mild assumptions,

x

-

~(w)

u

w h i c h are s o l u t i o n s f(~(w))

-

(2.11a)

u s i n g e i t h e r full state m e a s u r e m e n t

- (2.11c) w h i l e at the same

as t ~

(2.12)

n e c e s s a r y and s u f f i c i e n t

in [2]-[3]

conditions

in terms of the e x i s t e n c e

for the s o l u t i o n of these

of f u n c t i o n s

c(w)

to the p a r t i a l d i f f e r e n t i a l

+ g(=(w))c(w)

or

time

i.e.

- r(w(t))~ 0

problems were derived

rejected

+ p(=(w))d(w)

- a~s(w) @w

equation (2.13a)

825 There is a good geometric existence c(w).

Moreover,

(2.13b)

theory for ~(w) in (2.13a) given any choice of

under certain regularity hypotheses

is automatically

satisfied.

Indeed,

c(w) may be chosen a priori so that

viewing e, as defined

in (2.12),

as the

output of the augmented system (2.11co)-(2.11a)

one sees that if w o- 0 then, for any Xo,

to say y(t) - 0 is achieved with a control u(t)

is to say e(t) - O.

dynamics

That is, if the zero

of both the augmented system and the system to be controlled

the zero d y n a m i c s of the a u g m e n t e d s y s t e m c o n t a i n s

sets V*a and V ~ are smooth,

in case both zero constraint

system.

Moreover,

(2.13)

is e q u i v a l e n t

(2.11c)

to the e x i s t e n c e of a smooch,

exist,

then

the zero d y n a m i c s of the control s o l v a b i l i t y of

to V ~ in V a~ (see

invariant complement

[2]-[3]).

In the discussion of feedback stabilization or smoothness

and output regulation,

certain regularity

e.g. we asked that relative degree * such as (2.2) were satisfied or that V itself be a smooth submanifold.

conditions

surprisingly, and Marino

conditions were assumed to hold,

such conditions

are not necessary

as the following example,

[8], shows.

Example 2,5

[8]

Consider the system

-x 2 (-x+2y 3)

x -

(2.14a)

y - u

(2.14b)

The feedback law hand,

u-x-y 3 renders the origin locally asymptotically

if a smooth real-valued

function h(x,y)

is any feedback law maintaining constraint theorem,

Not

due to Boothby

set h(x,y)-0

stable.

On the other

is such that (2.2) is satisfied and u(x,y)

the constraint h(x,y) - 0

which may be computed

then the flow induced on the

from (2.14a) and the implicit function

is unstable.

Remark 2,6

In particular,

for the system (2.14) one cannot find an output function

for which both the regularity condition

(2.2) and the asymptotic

resulting zero dynamics hold.

one does not need relative degree hypotheses for w to a smooth V , but in this case we can show that

stability of the

Of course,

the zero dynamics

algorithm to converge

the zero dynamics

algorithm cannot converge while yielding asymptotically

stable dynamics.

826 Section 3

V~abi~tu

Ke~els

add Existence Theorems for Zero Dynamics

There are two regularity issues in the construction of the zero dynamics:

the

existence of a smooth V* and the existence of extra piecewise smooth feedback laws making V

invariant and inducing a system evolving on V .

The next calculation, due to Cannarsa

and Frankowska [5], shows that both may fail in an interesting class of problems for which, nonetheless, computing both V* and the associated feedback laws is extremely important.

We begin with some preliminary general remarks about V

in the context of a

general approach to geometric nonlinear control.

Fix a time-interval (0, T] with F C IR n

T S ~

and a constraint set

is controlled invariant for (2.1) provided for each

U C IR m.

XoE F

admissible piecewise continuous control u(t), such that the solution for tE [O, T] and is contained in F.

Given the closed subset

A subset

there exists an x t of (2.1) exists

K - h'l(0) for a continuous

function h, the maximal closed controlled invariant subset of K, provided it exists, is denoted by V (K).

Example 3.1

[5]

Consider the optimal control problem for (2.1) in Meyer form over a

finite time interval [O,T), with T fixed.

That is, for some constraint set U c IR m we

seek to minimize the end point functional

g(x(T)) over all solutions to (2.1) with

-

fo

and

u(t)

a piecewise continuous admissible control.

x(o)

It is well-known that the

value function

V(to,X o) - inf (g(x(t)): x(to) - x, to ~ T)

is nondecreasing along trajectories of (2.1) and is constant along optimal trajectories. Therefore, solving the optimal control problem is equivalent to computing V*(K)

control laws renderinE V*(K)

x -

f(x)

and the

invariant for the following system and choice of K-h'l(0):

+ g(x)u

w - 0

x(0)

-

fo

w(0)

- v(0,f)

r - i h(x,w,r)

- w-v(r,x)

In general, the existence of V*(K) and a description of those controls which render V (K) invariant can be obtained using methods from set-valued analysis. we express (2.1) as a differential inclusion

More explicitly,

827 x E F(x),

F(x) - (f(x)+g(x)u: u E U)

(3.1a)

and, without loss of generality, we set

K - h'l(0)

(3.1b)

By a solution of (3.1a) with initial condition x ~ we mean some absolutely continuous trajectory x(t), tz0 satisfying (3.1a) and x(0)-x o.

In the language of nonsmooth

analysis, a closed subset K has the property local viability if for any x ~ E K, there exists a T>o

such that some solution x(t) to (3.12) exists and belongs to K for tE[0,T].

If, moreover,

T-~, then K enjoys the global viability property.

controlled invariant subset (for infinite time).

Denoting by

In particular, K is a

TK(X )

the contingent cone

of K at x (see e.g. [9], [I0]), Haddad has shown that K has the local viability property if and only if K is a viability domain, in the sense that

Vxek

F(x)

Tk(X)~ ~

(3.2)

Furthermore, if F is bounded on K then, if K is a viability domain, K enjoys the global viability property.

In this language, then, if it exists,

V*(K) is the maximal closed viability domain

contained in K, referred to as the viability kernel of K (see [9], [i0]).

Moreover, it is

known that V*(K) exists (although perhaps empty) provided that F is a Peano map; i.e.

(i)

Graph (F) is closed;

(ii)

F(x)~@ and convex;

(iii)

F(x) - c (i + II x If)B, for some c>0 and some ball B.

From this "viability kernel" theorem, we conclude an existence result for zero dynamics. For (2.1) condition (ii) is always satisfied if U is convex.

Theorem 3,2

Suppose (2.1)-(3.1) satisfies the Peano conditions (i)-(iii).

Then

(2.1) has a zero dynamics evolving on the maximal closed, controlled invariant subset Z -V (K).

Moreover the zero dynamics is the system defined by the differential inclusion

zeZ (z), where

zeZ*

Z(z) - F(z)

T ,(z) '

Furthermore, if 0E U

and f(x) has a nontrivial

Z invariant set in K, e.g. an equilibrium,

Remark 3,3

then Z ~@.

The system (3.3) is a classical autonomous dynamical system only if Z(z)

is a singleton, for each zEZ ~

This situation occurred in the earliest works

feedback stabilization of invertible, nonlinear minimum phase systems.

[4] on

Of course this is

828 not always the case as a differential controls,

evolving on Z .

minimum phase

z(t)EW

W

of

0

if f(0)-0,

then we say that (2.1)

and any control u(t) containing

z~

suc~ that for

on the

(or(3.1))

is

t~0, z(t)EZ*,

there

such

(3.4a) as t ~ .

Remark 3,4 autonomous.

(3.4b)

Even when Z(z)

is single-valued,

tl.e zero dynamics

(3.3) may not be

This will be the case only if a selection for the set-valued map

exists.

are, of course, (see e.g.

For example,

constraint

Z

z_ 1 (see B y m e s and Isidori [3] for a more general definition of the nonlinear minimumphase concept).

854 Without entering into too many details, we can see that the fact of being minimum-phase or non minimum-phase for (2) defines a partition o f the space (u, y). Once again, taking discontinuous values for u and/or some of its derivatives, while y remains smooth, might lead our system from a non minimum-phase domain to a minimum-phase one.

4. A P P L I C A T I O N S In all the SISO applications which we present here: - The control law is obtained by inversion. -

The set point trajectory c(t) is a step function joining two constant operating points of the system 9

For the two academic examples, the reference trajectory yr(t) is defined via a linear fast order differential equation

~+'~=0 where "~is the time constant and e = Yr c. For the last heat exchanger example, the reference -

trajectory is defined via a linear third order differential equation

E + 3"~ + 3~2e + 3"~3"e" = 0. 4.1

Academic examples 9

,

S I N G U L A R I T Y : Take the input-output system y + "g'y = u + # ' u e u , where 'g' and z" are positive constants. There is an obvious singularity at u = 0. We want to rally the previously defined set point trajectory from a constant operating point where u is negative, to another one where u is positive. The usual continuous control leads to the singularity at u = 0, where a conventional computer integration stops (see Fig. 2a). If we impose a discontinuity on u to avoid this particular zero value of u, the integration works and the possibly difficult problem of analyzing the singularity disappears (see Fig. 2b).

855 UNSTABILITY: Now consider the following system: y + "~'~,= u + "c"u u, where "c' and z" are positive constants. The tangent linearized system around the constant operating point (u, Y) Ay + x'A~' = Au + z"uAu is non minimum-phase for u < 0. So, if we want to rally the same set point trajectory, the usual continuous control is divergent (see Fig. 3a). On the contrary, the introduction of a discontinuity, which yields u positive, brings back to stability (see Fig. 3b). Notice that we have also avoided the singularity at u = 0. 4.2

Heat exchanger

A brief physical description of this process is given here. For more details about this description, for the complete modelization and for nonlinear continuous control applications of this process, see [4, 5]. This system, used for energy recovery in the petrol industry, is composed o f two countercurrent liquid-liquid parallel heat exchangers (see Fig. 4). The objective is to control the output temperature y, by means of the flow repartition rate u. The other variables are here supposed to be constant. The input-output behaviour is very nonhnear in the operating domain: - The static characteristic (Fig. 5a) presents a hill top, i.e., the sign of the static gain changes.

- The dynamics (Fig. 5b) changes from a well damped behaviour to an oscillatory one, with minimum-phase or not. The non minimum-phase domain of the linearized model can be located on the static characteristic. Physical laws give a simplified state variable representation of dimension three:

Xl = a l X l + b l u + c l

x2 = a2x2 + b2u + c2 = a3y + uxl + (1 - u)x2 where the state is composed of the intermediate temperatures xl, x2 and the output temperature y. The coefficients al, a2, a3, bb b2, Cl, c2 are constants.

856

It is possible to eliminate state variables xl, x2 for such a bilinear system [8, 11, 15]. It yields the following input-output equation: [ ( a 2 - al)(U - 1)u + u ] [ - " y + a3"y+ ( a l b l - a2b2)u 2 + 3(bl - b2)ux + + (alcl - a2c2 + a2b2)u + (2(cl - c2) + b2)u + a2c2] + + [al2u + 2alu + u][(1 - u ) y - ((a2 + a3)(1 - u) - L1)~r + + a3(a2(1 - u) - u)y - (1 - u)(bl

-

b2)u 2 + (b2+ C l - c2)u + c2)] +

+ [aE2(U - 1) + 2a2u + l J ] [ u y - ((al+ a3)u + u)y + + a3(alu + h)y -- u((bl - b2)u 2 + (b2+ Cl - c2)u + c2)] = 0 Singularities arise when the coefficient of u vanishes:

)' + ((al - a2)u - a2 - a3)y + a3((al - a2)u + a2y) - ((bl - b2)u 2 + (b2 + cl - c2)u + c2) = 0 W i t h o u t m a k i n g a complete analysis o f this last equation, we notice that a particular singularity is obtained from its static solution (), = y = 0).This static singularity is located in the non minimum-phase domain (see Fig. 5a). W e want to rally the previously defined set point trajectory, from a constant operating point which lives outside the non m i n i m u m - p h a s e domain, through this domain. The usual continuous control stops near a singularity which is located in the non m i n i m u m - p h a s e domain (see Fig. 6a). The introduction o f a discontinuity on u leads the system outside the non minimum-phase domain, and therefore stabilizes it (see Fig. 6b). In the same time, the problem o f the singularities is avoided. 5. C O N C L U S I O N W e d e m o n s t r a t e d that the introduction o f control variables discontinuities p e r m i t s a straightforward solution o f classic control problems, which seem to be very difficult in the setting o f smooth controls. Note that our analysis does not apply if the Jacobian ~

3F

related to (1) does not contain u

nor any o f its derivatives. This is also the case when the inverse system has no dynamics, that is ~x = 0 in equation (1):

857 ~

F(u, y, y ..... y(B)) = 0. In order to bypass these important particular cases, which are often encountered in practice, we can possibly reverse the preceding analysis by now introducing other types of discontinuities. The problem will be dealt with in future publications. REFERENCES [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11]

[12] [13]

[14] [15] [16]

S. Abu el Ata and M. Fliess: Nonlinear predictive control by inversion, Proc. IFAC Symposium on Nonlinear Control Systems Design, A. Isidori ed., Capri, Italy, June 1989. S. Abu el Ata, A. Co'fc and S. Chavy: Commande pr6dictive non lin6aire par inversion. Application hun probl~me de guidage d'avion. Actes Colloque SMAI "L'Automatique pour l'A6ronautique et l'Espace", Paris, Mars 1989. C.I. Byrnes and A. Isidori: Local stabilization of minimum-phase nonlinear systems, Systems Control Lett., 11, pp. 9-17, 1988. P. Chantre: Commande et optimisation de processus industriels non lin6aires. Mise en oeuvre sur un train d'6changeurs thermiques en parall61e, Th6se Doct. Ing., Universit6 Paris VI, Paris 1983. P. Chantre: Commande pr6dictive non lin6aire h l'aide d'un mod61e de Volterra, IASTED International Symposium, Grindelwald, Switzerland, February 1989. D.W. Clarke, C. Mohtadi and P.S. Tufts: Generalized Predictive Control - Parts I and II, Automatica, 23, pp. 137-148 and pp. 149-160, 1987. J.Z. Cypkin: Thforie des asservissements par plus-ou-moins (translated from Russian), Dunod, Paris, 1962. S. Diop: Elimination in control theory, Math. Control Signals Systems, to appear. M. Fliess: Automatique et corps diff6rentiels, Forum Math., 1_, pp. 227-238, 1989. M. Fliess and F. Messager: Vers une stabilisation non lin6aire discontinue, These Proceedings. M. Fliess and C. Reutenauer: Une application de l'alg6bre diffdrentielle aux syst6mes r6guliers (ou bilin6aires). In "Analysis and Optimization of Systems", A. Bensoussan and J.L. Lions eds., Lect. Notes Control Inform. Sci 44, pp. 99-107, Springer, Berlin, 1982. R.M. Hirschorn and J. Davis: Output tracking f0r nonlinear systems with singular points, SIAM J. Control Optimiz., 25, pp. 547-557, 1987. F. Lamnabhi-Lagarrigue, P.E. Crouch and I. Ighneiwa: Tracking through singularities. In "New Trends in Nonlinear Control Theory", J. Descusse, M. Fliess, A. Isidori and D. Leborgne eds., Lecture Notes Control Inform. Sci. 122, pp. 44-53, Springer, Berlin, 1989.' J. Richalet, S. Abu el Ata, C. Arber, H.B. Kuntze, A. Jacubash and W. Schill: Predictive Functional Control, Application to fas~ and accurate robots. Proc. 10th IFAC Congress, Munich, 1987. E.D. Sontag: Bilinear realizability is equivalent to existence of a singular affine differential i/o equation, System Control Lett., 11, pp. 181-187, 1988. V.I. Utkin: Sliding Modes and their Application in Variable Structure Systems (translated from Russian), Mir, Moscow, 1977.

858

Prt~s5

Pa~

out:put:

~h

J x2

2

ex_~.anger "

U Q, QI, Q2 T, TI, T2 y, xl, x2

~ ~ %

secondary fluid 2 Q2 T2

: f l o w r e p a r t i t i o n r a t e of the v a l v e (0 0 in a neighborhood of e = 0

S y s t e m s in which this situation occurs are said to have a singularly perturbed zero dynamics. The integer r is the r e l a t i v e degree of the perturbed systems, and the integer r + d is the relative degree of the unperturbed system. By definition, the change from r(e) = r to r(0) = r + d implies that: Lgh(x,e) . . . . . LgLfrqh(x~

LgLfr'Zh(x,e) = 0 # 0

LgLfrqh(x,O) . . . . . LgLfr+d'lh(x~

for all x near x ~ and all e

for all e > 0 LgLfr+d-2h(x,O) = 0

(2.2a) (2.2b)

for all x near x ~

# O.

(2.2c) (2.2d)

The f o l l o w i n g d e v e l o p m e n t s are based on an assumption about the e - f a m i l y of systems (2.1) which, as we shall see later on, allows the singularly perturbed zero d y n a m i c s to possess only two time scales. First of all we assume, in view of (2.2b) and the first on o f (2.2c), that: LgLfr'lh(x,e) = ema0(x,a) where a 0 ( x , e ) is a smooth function, nonzero at (x~

(2.3) and m is an integer. Without

loss o f generality, m can be set equal to d. Otherwise e can be replaced by e d/re. Then, our two-time-scale assumption is that, for all 0 < k < d,

863

LgLfr'l+kh(x,e) = ed'kak(X,e)

(2.4)

where each ak(x,e ) is a smooth function in a neighborhood of (x~ Note that for k = d in (2.4), the condition (2.2d) implies ad(X~ r 0. In particular, this assumption is e.g. satisfied when g(x,E) has the form: g(x,e) = go(x) + egl(x) and f(x,e), h(x,e) are independent of e (see [5]). 3.

The

analysis

of a

singularly

perturbed

zero

dynamics.

We proceed now to identify the zero dynamics of the peturbed system and to show how they are related to the zero dynamics of the unperturbed system. In doing so, we take the advantage of the fact that the zero dynamics of a system are not altered by addition of integrators to the input channel (see [8, page 389]). Because of this property, we can identify the zero dynamics of any system of the family (2.1) with those of the following family of e x t e n d e d systems: = f(x,e) + g(x,e)z 1 EZ 1 = Z 2

ez 2 = z 3

~Zd =

(3.1)

v

y = h(x,e). In order to identify the zero dynamics of (3.1), we set to zero the output y and its derivatives, until we arrive at a relation that can be solved for some v = v ( x , z , e ) . Then, we set v = v(x,z,e) on the system (3.1) and we consider the restriction of the systems thus obtained on the submanifold (of the state space) on which the output and its derivatives are vanishing. We first obtain: y = h(x,E) y(1) = Lfh(x,e) . . ,

y(r-1) = Lfr-lh(x,e) y(r) = Lrrh(x,e) + a0(x,e)eazl Because of the special structure of (3.1), the (r+l)-th time derivative of y has the form: y(r+l) = Lfr+lh(x,E) + al(X,E)Ed-lz1 + (~o(X,E)ed-lz2 +

864

+ (Lfao(X,s

+ (LgaO(X,s

2.

In a s i m i l a r way, we obtain: y(r+k) = Lfr+kh(x,E) + t~k(X,s

+ ... + Cto(X,e)Ed-kzk+1 +

+ ed'k+lpk(X,Z 1. . . . . Zk,s where Pk(X,Zl . . . . . Zk,e) is a p o l y n o m i a l in z 1. . . . . f u n c t i o n s of x and e. T h e last two iterations give:

z k, whose coefficients are smooth

y(r+d-1) = Lfr+d-lh(x,e) + ad.1(x,e)eZl + ... + ao(X,e)eZd + + e2pd.l(x,z 1. . . . .

Zd.l,s

y(r+d) = Lfr+dh(x,e) + CZd(X,e)Zl + ... + Oh(x,e)Zd+ C~0(x,e)v +

+ ePa(x,zt . . . . . Zd,e). R e c a l l that, by a s s u m p t i o n , o:0(x~

~ 0. Thus, in a n e i g h b o r h o o d of x = x ~ z 1. . . . .

z d = 0, e = 0, the e q u a t i o n y(r+d) = 0 can be solved for v, as a s m o o t h x,z 1. . . . . Zk,e. This function is a feedback law of the form: d v(x,z,e) = ~ ai(x,~)z i + b(x, e) + eQd(X,Z,e) i=l

f u n c t i o n of

(3.2)

where z = (z I . . . . . zk) and al(x~ = (cq(x~176 # 0. E q u a t i n g to zero y a n d its first r + d - i d e r i v a t i v e s p r o v i d e s a set o f r+d c o n s t r a i n t s d e l i n e a t i n g the s u b s e t of the state space on w h i c h the zero d y n a m i c s of (3.1) are d e f i n e d . These c o n s t r a i n t s are c o n v e n i e n t l y e x p r e s s e d in the form: 0 = H(x,e) + eR(x,z,e)

(3.3)

where: H(x,e) = col(h(x,e), Lfh(x,e) . . . . .

Lfr+d'lh(x,e))

(3.4)

a n d R(x,z,e) is an ( r + d ) - d i m e n s i o n a l v e c t o r of s m o o t h f u n c t i o n s satisfying R(x,0,e) = 0. O b s e r v e n o w that, at e = 0, the r i g h t - h a n d - s i d e of (3.3) reduces to H(x,0) and that, at x = x ~ the d i f f e r e n t i a l s of the r + d e n t r i e s o f H ( x , 0 ) are l i n e a r l y i n d e p e n d e n t b e c a u s e the s y s t e m (2.1) has r e l a t i v e degree e x a c t l y equal to r + d at

865 (x,a) = (x~ (see [8, page 148]). As a consequence, at (x,z,a) = (x~ matrix of (3.3):

the jacobian

[ ~-~(H(x,e) + eR(x,z,e)) e~ R(x,z,e)] has rank r+d. It follows that - for each e near 0 - the set of points (x,z) which s a t i s f y (3.3) n e a r (x~ is a s m o o t h ( r + d ) - d i m e n s i o n a l s u b m a n i f o l d . This s u b m a n i f o l d , noted Z*, is rendered i n v a r i a n t by the f e e d b a c k law (3.2) and the restriction o f the flow of the c l o s e d - l o o p system (3.1)-(3.2) to Z* describes the zero dynamics of (3.1), i.e. that of the family (2.1). In order to represent these zero d y n a m i c s e x p l i c i t l y , we note that since the differentials of the entries of (3.4) are linearly i n d e p e n d e n t at x ~ for each small e, the f u n c t i o n s : ~i(X,E) = L f i ' l h ( x , e )

1 _< i < r + d

can be used as a (partial) set of new local c o o r d i n a t e s for (2.1). The partial t r a n s f o r m a t i o n thus defined is smooth in x and e. If r+d is strictly less than n, then the c h o i c e of new coordinates m u s t be c o m p l e t e d by a set of additional smooth functions rli(x,e), 1 _< i ~ n-r-d. Since, for all (x,e) near (x~ we have: (span {g(x,e)})lq (span {dh(x,e) . . . . .

dLfr+d-lh(x,e) }).1. = {0}

we can always choose rl(x,e) (see [8, page 150]) in such a way that Lgrl(x,e) = 0 for all (x,e) near (x~ Without loss of generality, it is possible to choose rl(X,e ) so that rl(X~ = 0. In the new coordinates: = (~1 . . . . .

~r+d)

= (rll . . . . . "qn-r-d) the system (2.1) assumes the form:

= c(~,n,e) + d(~,n,e)u

=q(~,q,e). Recall that, by definition, the system: = q(0,n,0) r e p r e s e n t s the zero dynamics of the unperturbed system (2.1). In the new c o o r d i n a t e s , the e x t e n d e d s y s t e m (3.1) with f e e d b a c k (3.2) assumes the form:

control

866 = c(%, 11, e) + d(~, rl, e)Zl fi = q(~, 11, e) E7.1 = Z 2

(3.5)

ET.2 = Z 3

d I~Zd = E ~i(~,rl,E)zi + ~(~,TI,E) + E(~d(~'iq'Z'E)" i=l Note that, in the new coordinates (~, rl), the constraint (3.3) identifying the zero dynamics manifold becomes:

0 = ~ + el~(~,n,z,E). Solving this relation for

~

yields:

= eF(rl, z, e) where F0q, z, e) is a smooth function. We see, then, that in the (~,rl,z) the zero dynamics manifold Z* of the perturbed system (3.1) can be the graph of a smooth mapping; in particular, on this manifold the ~ are of order e. In order to render the zero d y n a m i c s of (3.1) explicit, is substitute (3.6) into (3.5) to obtain:

(3.6) coordinates, expressed as coordinates suffices

to

= q(eF(rl, z, e), rl, e) F~ 1 = Z 2 s

(3.7)

= Z3

d ezd = E ~i(eF(q, z, e),rl,e)z i + ~(FA::(rl, z, e),rl,e) + i=l + e(~d(eF(rl, z, e),rl, z, e) T h e s e equations describe the zero dynamics of the perturbed systems for each small nonzero e. They appear in a singularly perturbed form:

ri = f(n, z, e) e~. = g(B, z, e) with jacobian

matrix:

(3.8)

867

0

1

0

...

0

0

0

1

...

0

az(n,z,O) ) =

o.,

0z

0

0

0

(3.9)

...

1

...

~d(0,1],0)

which is n o n s i n g u l a r for all aq near 0, because

~1(0,0,0)

~ 1(0,1],0)

is nonzero

(recall that

A

~(x~ = 0, ~q(x~ = 0, and that ai({,rl,a) derives from ai(x,e) with x expressed as a function of (~,rl,e)). Hence, (3.8) is in the standard two-time-scale form [7]. F r o m the above analysis we see that, under the assumptions (2.2), (2.3) and (2.4) the zero dynamics of the family of perturbed systems (3.1) exhibit the twot i m e - s c a l e b e h a v i o r of a s i n g u l a r l y p e r t u r b e d s y s t e m . Its " r e d u c e d system", defind by setting e = 0 and eliminating the variable z, is exactly the system: fi

=

q(O,~,O)

(3.10)

which d e s c r i b e s the zero d y n a m i c s o f the u n p e r t u r b e d system. Its " b o u n d a r y l a y e r system", obtained by rescaling time as x = t/e and then setting e = 0, is that of a family of linear systems p a r a m e t r i z e d by a constant vector rl, n a m e l y : 0

1

0

...

0

0

0

1

...

0

0

0

0

...

1

dz

Z+

d~

~(O,n,o)

4.

... ~d(O,n,O)

0

(3.11)

(ore,o)

Remarks

Remark

1. There are cases p o l y n o m i a l o f (3.11) vanish, instance - when the system is d i s c u s s e d in [5]. In each of system are the roots of: xa _ ~(0,n,0)

in which all the c o e f f i c i e n t s of the c h a r a c t e r i s t i c e x c e p t for ~ l ( 0 , r l , 0 ) . This situation occurs for linear and g(x,e) = b0 + b l e or h(x,e) = co + cle, as these cases the e i g e n v a l u e s of the b o u n d a r y layer

= 0

and therefore the boundary layer system is unstable if d > 2.

868 R e m a r k 2. A more detailed inspection of the constraint (3.3) which identifies the

zero dynamics manifold Z* of the unperturbed R(x,z,e) can be rewritten as:

system shows

that the vector

R(x,z,e) = [ 0 .1 [S(x,z,e)] with: S(x,z,e) = A(x,e)z + eQ(x,z,r where: 9

a0(x,e)e

d-1

A(x,z) =

0 ao(X,e.)e,d'2

9 ..

9~

0

0

~

...

ao(X,e)

and Q(x,z,E) is a function which vanishes at z = 0 together with its first order derivatives with respect to z. Therefore, choosing ~' and ~" to denote:

~" = colG+x . . . . . ~r+d) the constraint (3.3), expressed in the (~,~,z) coordinates, has actually the form:

0=~' (4.1)

0 = ~" + ~(~',~",n,E)z + E2(~(~',~",n,z,~) A

where A(0,0,0,e) is nonsingular for small nonzero e. In the previous analysis, we have solved (4.1) for ~' and ~" in order to express Z* as a graph of a mapping. In A

view of the nonsingularity of A(0,0,0,e), one could have solved these equations for ~' and z, thus obtaining Z* in the form of a graph of a mapping:

~'=0 z = G(~",~,e) and express the zero dynamics obtained is not defined at e = 0.

accordingly.

However,

the transformation

thus

869 Remark 3. If the basic assumption (2.4) fails to hold, i.e. if any of the functions ai(x,e), 1 < i < d-1 has a singularity at e = 0, most of the previous analysis is still valid, and in particular the expression (3.7) of the zero dynamics of the perturbed system. However, the equations in question are not anymore in the standard twotime-scale form, and the singularly perturbed zero dynamics may exhibit more than two time scales.

Remark 4. If a system is modeled by an input-output differential relation of the form: F(y,y 0) .....y(n),u,uO) .....U (n-r)) = 0

then its zero dynamics, those induced by the constraint y(t) = 0, can be clearly identified as those of the autonomous system: F(O,O ..... O,u,u (1)..... u (nr)) = 0 This point of view suggests a simple interpretation of the calculations illustrated in the previous section and, in particular, of the e-scaling introduced in the extended system (3.1). To this end, note that the constraint y(t) = 0 necessarily induces, on the input u to the system (2.1), a constraint of the form: Lrrh(x,e) + a0(x,e)edu = 0 Differentiating this d times with respect to time, after simple manipulations one obtains a differential equation of the form: CCo(X,E)Edu(d) + ... + OCd_I(X,E)EU(1) + OCd(X,E)U+ eP(x,u,eu (1)..... 8d'lu(d'l),s + +Lfr+dh(x,e) = 0 where P(x,v 1. . . . . v d ,e) is a polynomial in v 1 ,Vd, with coefficients which are smooth functions of x,e. Replacing x by its expression as a function of the (~,~q) coordinates, adding the differential equation . . . .

= q(~, T1, e) and finally eliminating ~ (which can be expressed as a smooth function of rl,u,eu (D ..... ed'lu(d'D,e) yields an alternative description of the zero dynamics of the system (2.1). Clearly, the equations thus obtained can be given the form (3.8) derived in the previous section, after having set: z i = Ci'lu (i'D

1 ~ i ~ d.

870 5.

References

[1] W.A.Porter, Diagonalization and inverses for nonlinear system, Int. J. Control, 21 (1970). [2] A.J.Krener and A.Isidori, Nonlinear zero distributions, 19th IEEE Conf. Decision and Control, Albuquerque (1980).

[3] C.I.Byrnes and A.Isidori, A frequency domain philosophy for nonlinear systems with applications to stabilization and adaptive control, 23rd IEEE Conf. Decision and Control, Las Vegas (1984). [4] A.Isidori and C.H.Moog, On the nonlinear equivalent of the notion of transmission zeros, Modeling and Adaptive Control, Springer Verlag L.N. in Contr. and Info. Scie 105 (1988). [5] S.S.Sastry, J.Hauser and P.V.Kokotovic, Zero dynamics of regularly perturbed systems may be singularly perturbed, Syst. Contr. Lett., 13 (1989). [6] J. Hauser, S.S.Sastry and G.Meyer, Nonlinear controller design for flight control systems, IFAC Symposium on Nonlinear Control Systems Design, Capri (1989). [7] P.V.Kokotovic, H.K.Khalil, J.O'Reilly, Singular Perturbations in Control: Analysis and Design, Academic Press (1986).

[8] A.Isidori, Nonlinear Control Systems (2nd Ed.), Springer Verlag (1989).

glargissement des ohjectifs de robustesse des syst~mes de co,-n=nde

David Bensoussan D~partement de g~nie ~lectrique Ecole de technologie sup~rieure Universit~ du Quebec 4750. avenue Henri-Julien Montreal, Quebec H2T 2C8

Abstract

:

In this paper, we will review the theoretical background which led to the emergence of the Hoo theory. The advantages offered by the Hoo theory, particularly in the domain of robustness, are weighted against the Hoo theory, limitations which are the conservativeness of the results. It is proposed to use the Hoo theoretical framework, a) to reconsider the role of disturbances so as to use them beneficially through the feedforwarding of their estimates, b) to study variable structure systems, the variable structure being used as an extra degree of freedom. The combination of the variable structure with the feedforwarding of disturbance estimates improves performance objectives while aiming to achieve robustness objectives.

Introduction

Les deux classique bas4 descriptions par

Zames

traditions

connues

sur les

r4ponses en

de variable d'~tat. [i]

a

permis

de

de

la

th~orie

L'approche reformuler

gain en relation avec les non-lin~arit~s

pourraient Feintuch

illustrer

[6].

Feintuch [8]

un

niveau

sont

le contr61e

presenter le

sectorielles

plus

entree-sortie,

introduite

abstrait

th~or~me du petit

[2,3] puis ~ reformuler nombre d'alg~bre

d'op~rateurs

et que

[5], et de Saeks et

initi~es par Zames [7] et

resum~es dans

ont permis la d~finition plus compl~te du problAme de contr61e dans le

a ~t~

pour presenter d'importants

rer ces

servi ~

Les ~tudes de r~alisabilit4,

syst~mes entr~es-sorties

traduits dans

d~veloppements

de systemes

les travaux de Saeks [4], Tung et DeSantis

contexte d'une alg~bre d'op~rateurs

ont ~t4

contr61e

le probl~me du contr61e dans un contexte

d'alg~bre norm~e; cette approche a d'abord

probl~mes de contr61e ~

du

fr~quences et le contr61e moderne bas~ sur les

causaux ou

reprise par

r~sultats de stabilit~ et de sensibilit~

les espaces

de Hardy

dans le domaine du contr61e.

deux theories

et de reconsid~rer de

strictement causaux.

La th4orie des

Zames dans une publication c~l~bre [9] : ces r~sultats

[10], ouvrant la porte ~ des nouveaux Ii est bon ~ ce

stade-ci de reconsid4-

en mettant en valeur leurs points forts et leurs limitations nouvelles

questions

quant

aux

directions

d'une recherche

future dans le but d'~largir le domaine d'application de ces theories.

874 I. Commande moderne et commande H~

La th~orie

de contr61e

moderne bas4e sur les m4thodes d'analyse des variables

d'~tat a pris un essor extraordinaire

depuis les ann~es soixante.

Malgr~ les succ~s

ind~niables des theories de Wiener Hopf Kalman (WHI 0, ]/H(s) = 0(s') as Isl tends to i n f i n i t y . Therefore, in order to make sure that IT(jw) / H(j~)I (which is the gain of the transfer function from disturbance, measurement noise,

and command input

to

plant

input) remains bounded as ~ -->m,

one should choose W(s) = 0(s') as (s( --> ~ [7]. From these considerations, we deduce the following possible structure for W(s)

W(s)

:

(T.s+2)

'~w

(8)

where I/T, is the frequency from which the modeling uncertainty starts increasing significantly, and w is an integer ~ 0 which characterizes the rate of increase of the modeling uncertainty.

883 We can now define a robustness indicator with respect to m u l t i p l i c a t i v e plant uncertainties, IM, as the maximum static gain that can be tolerated For a bounding function F of the form K(T,s+I) (+" (where K is a constant), i.e. the maximum value of K such that all the plants of the class M(H,K(T,s+I) ~'') can be stabilized. Obviously, From (7) and theorem i , IM = I/~ T , where AT is computed with the weighting function (8). 4. SOLUTION OF THE MINIMAX PROBLEM

For the sake of generality, we shall consider a weighting function W(s) of the form

W(s) = n.(s) / dw(s) where nw(s) and dw(s) are real

(9) polynomials with degrees dnW and dd~ respectively.

Moreover, nw(s) and d~(s) are coprime, t h e i r roots are located in the open l e f t - h a l f plane, and dnW> dd~ + E where c is the pole excess of the plant. The computation of the minimax weighted complementary s e n s i t i v i t y Function is now studied. Plants with imaginary axis poles and/or zeros require a special treatment, and they w i l l be dealt with in a separate section. 4.1. Plant without pole or zero on the imaqinary axis (no(s)=do(s)=1) This is a particular case of the minimax mixed s e n s i t i v i t y problem presented in ~7]. The results are dual to those derived in section V of that paper. Theorem 2 ==========

Consider the factorization (4), define W(s)

(5) of the plant transfer function H(s) and l e t (9)

a) I f d+(s) = constant, AT • i n f (I W(j~) T(j~) If- = II ~ ( J ~ ) I f . = 0 GEZ(H)

b) I f d~(s) ~ constant, AT is the largest (in absolute value) real value of ~ such that the solution of the polynomial equation I - - - n~ n'+ q'T : d+ PT + dw n~ qT (I0) consists of two real polynomials, PT and qT, whose degrees are dn~ + dnW - I and dd§ -I respectively. Moreover the optimal value AT is reached with the proper c o n t r o l l e r d. d, q~ G~ . . . . . . . . . ~ - - - where qT and P~Tare the solution of (10) corresponding to ~T, n. PT and the resulting closed-loop system has a l l i t s poles in the open l e f t h a l f plane. Part (a) corresponds to the obvious fact that, i f the plant has no RHP pole, one can make IT(j~)I uniformly a r b i t r a r i l y close to zero by making the controller gain small enough. The solution AT, p~, ~T of (10) can be computed by solving a generalized eigenvalue problem, following the same p r i n c i p l e as [7]. 4.2. Plant with poles and/or zeros on the imaqinary axis For a plant with poles on the imaginary axis, the closed-loop system obtained with the c o n t r o l l e r GT computed using the theory of [7] is not asymptotically stable.

884 Moreover, i f the plant has zeros on the imaginary axis, these zeros should be introduced in dw(s) to assure s t a b i l i t y of the closed-loop system [7]. However, i f we perform this modification of dw(s), we shall loose the l i n k between W(s) and the uncertainty bound F(s). Thus another approach is needed for systems with poles and /or zeros on the imaginary axis. [g] gives such an alternative approach. The results presented here for the complementary sensitivity function are dual to those developed in [9] for the s e n s i t i v i t y function. Their proofs follow the same lines. Theorem 3 ==========

Consider the following factorization of H(s) H(s) = H2(s) Ho(s) (11) where Ho(s) has poles and zeros only on the imaginary axis, and H2(s) has poles and zeros only off i t . Let ~'denote the optimal weighted s e n s i t i v i t y function for the factor H2(s). Under the assumption that IW(J~)I ~ II Y" I I - at each imaginary axis pole of H,

(12)

i t is possible to build a sequence {Y'w}in RH~ such that lim k->|

IIY',II.

=

II Y" II.

(13)

and such that Y, satisfies the interpolation constraints needed to ensure s t a b i l i t y of the closed-loop system. Thus imaginary axis poles and zeros do not affect the optimal weighted complementary s e n s i t i v i t y subject to assumption (12). 5 . EXPLICIT FORMULA In this section, we derive an e x p l i c i t formula for the value of %Tcorresponding to a plant with a single RHP pole. Theorem 4 ========~=

We consider a proper plant H(s) with one RHP-pole, a, factorized according to (5). Moreover we assume that the weighting function W(s) is of the form (9). Then %T= i n f II T G(~(H)

Wil, =

IB,(a)" W(a)]

(14)

Using the weighting function (8), (14) can be written : ~T = I/IM =

]

(I + 7.a)"*

B,(a)" I

(15)

For a minimum phase system, B,(a) = I. In this particular case the study of the evolution of IM, the maximum admissible static gain uncertainty as a function of T,a yields the following conclusions.

If

the

gain of

the

modeling uncertainties

increases at

885 a rate of 40 dB/decade, or more at high frequencies (( + w Z 2), IM decreases at least quadratically when rwa increases. In this case, for a fixed value of a, a value of I/T, smaller than a yields an admissible static gain uncertainty smaller than 25 %. I f the modeling uncertainty only increases as j~ at high frequencies (E + w = 1), the previous number becomes 50 % (see table). Thus, the gain of the modeling uncertainties should not increase significantly in the frequency range [O,a] in order to be able to accommodate sufficient static gain uncertainties. This limitation is strengthened i f the plant has nonminimumphase zeros, because JB,(a)'l > I in this case. For a system with a single RHP pole and a single RHP zero, the situation where the RHP pole lies to the l e f t of the RHP zero in the s-plane is more favourable than the converse situation. This can be checked by applying (15) to two transfer functions with a single RHP pole and a single RHP zero, in which the location of the RHP pole and the RHP zero is interchanged. 6. NUMERICAL EXAMPLES i . H(s) : e (s-2) /

((s-5)

(s+6))

,

~ ( R

Fig. 2 gives the evolution of IM as a function of 7W. The type of results that can be derived from f i g . 2 is the following. I f the multiplicative plant uncertainties can be described by (I),

(2) with F(j~) = F,(j~) = 0.1 (0.1 s +I) there exists a linear

controller stabilizing the class of plant M(H,F,). However, i f the bounding function is F2(j~) = 0.3 (0.1 s+i), there is no linear controller stabilizing M(H,F2). Indeed, the largest static gain that can be accommodatedwithout loss of s t a b i l i t y for a bounding function with a zero at s = 10 is IM = 0.286. 2. Inverted pendulum We consider the inverted pendulummounted on a cart sketched in Fig.3. The dynamic equations of this system are derived in [11] and they are linearized around the state (Y,Y,8,8) :

(Y,O,O,O) f o r u : O. The f o l l o w i n g t r a n s f e r f u n c t i o n between the force u

a c t i n g on the c a r t and the p o s i t i o n o f the c a r t y w i t h respect to Y has been obtained

H(s) : K(s-b) (s+b) / (s2 (s-a) (s+a)) where K = the

I/M , b = / g / l

,

pendulum concentrated at

a: ]

(16)

(M+m)g / MI, w i t h M, mass o f the c a r t ,

the t i p ,

I,

length

of

the

pendulum,

m, mass o f

and g,

gravity

acceleration. For 1 = ] meter, M : m : 0.5 kg, g=10 ms2, we obtain a = vr-2-O-s' and b = ~ s " . In the sequel, those numerical values are always used in model (16). Since the system has poles a t the o r i g i n , we use the r e s u l t s o f section 4.2. where

H2(s) = K (s-b)(s+b)/((s-a) (s+a))

(17)

To be able to use theorem 3, we have to check condition (12).For the class of weighting functions (8), we obtain IW(o)J=1. Applying (15) to system (17) we obtain

886 ~T : II T II- ~ l(a+b)/(a-b)I which is larger than I. Hence a sequence such as defined in theorem 3 always exists in our case. In model (16), the transfer function of the device generating the force u is assumed to be unity.

Let us assume that this transfer function can actually be modelled by a

f i r s t order system whose time constant is approximately 10 times smaller than I/a. Thus a more accurate model for the linearized plant would be : HT : H(s) / (0.03 s + 1)

(18)

Since the nominal model H(s) has relative degree 2, and since the modelling uncertainty w i l l increase significantly for frequencies larger than 33.333 rad/s, we choose : W(s) : ( 0.03 s + I) 2 The corresponding value of IM, given by (15),

(19) is 0.133.

I t can be checked that

belongs to the class (I),(2) with F(j~) = 0.133 ( 0.03 j~ + I )2 .

HT

(20)

We can now analyse the achievable robustness with respect to parameter variations. Assume that the mass M varies in the range [0.4

0.6]. The transfer function which is

equivalent to (18), for M=O.4, is : H' T = 2.25 (s2 - 10) / [ s2 (s2 - 22.5) (0.03 s + I) ]

(21)

Fig. 4 gives the Bode plots of H(s), H'T(s) as well as the bounds corresponding to the class ( I ) , (2) for H(s) and F(s) given by (16) and (20) respectively.

Notice that the

factor I/s 2 has not been included in the Bode plots, in order to obtain a better i l l u s t r a t i o n of the differences between the curves. following conclusion.

From Fig. 4, we can draw the

There exists no controller based on the nominal model (16) and

on the unstructured uncertainty model ( I ) , ( 2 ) , with F(s) = K (0.03 s + I )2, that can stabilize the actual plant subject to a 20 % variation of the mass M. Besides, no satisfactory controller was obtained for system (16) in [11]. To obtain adequate performances for the control of the inverted pendulum, an additional measurement was performed and a cascade control structure was used. In this way, the transfer functions appearing in the modified control problem do not have any nonminimumphase zero [11]. However, no unstructured modeling uncertainties were introduced in this work. 7. CONCLUSIONS

We have presented an index which gives an upper bound on achievable robustness with respect to a class of multiplicative uncertainties, in LTI controller design, for systems with RHP poles. This index t e l l s the designer whether the required closed-loop system properties can possibly be reached so that tedious t r i a l and error procedures can be avoided. Its computation is particularly easy. I t amounts to an eigenvalue problem. Moreover, an e x p l i c i t formula exists for systems with a single RHP pole. A similar index has been derived to characterize robustness with respect to additive unstructured uncertainties in [4]. Similar indices can be defined for multivariable systems. However, their computation will

be more involved. I t does not amount to eigenvalue problems. An interesting

computational approach is given in {12].

887 Acknowledqement This research topic was suggested to the author by Professor GrahamC. Goodwin. Some helpful discussion with him and with Doctor Rick H. Middleton at an early stage of this project are gratefully acknowledged REFERENCES

[ I ] J. S. Freudenberg and D.P. Looze (1985). Right Half Plane Poles and Zeros and Design Tradeoffs in Feedback Systems. IEEE Trans. Automat. Control, AC-30,6, pp. 555-565. [2] K. Glover and D. McFarlane (1988). Robust Stabilization of Normalized Coprime Factors: an Explicit H. Solution. Proceedings of the 1988 American Control Conference, Atlanta, Georgia. [3] D. McFarlane and K. Glover (1988). An H. Design Procedure Using Robust Stabilization of Normalized Coprime Factors Proceedings of the 27th. IEEE Conference on Decision and Control , Austin, Texas, pp. 1343-1348. {4] M. Kinnaert (1989) Robustness LimitationsDue to Right Half Plane Poles and Zeros in LTI Controller Design. Internal Report. Laboratoire d'Automatique. Free University of Brussels. [5] J. C. Doyle and G. Stein (1981). Multivariable Feedback Design : Concepts for a Classical/Modern Synthesis. IEEE Trans. Automat. Control, AC-26, I, pp. 4-16. [6] M. Vidyasagar and H. Kimura (1986). Robust Controllers for Uncertain Linear Multivariable Systems. Automatica, 22, pp. 85-94. [7] H. Kwakernaak (1985). Minimax Frequency Domain Performance and Robustness Optimization of Linear Feedback Systems. IEEE Trans. Automat. Control, AC-30, 10, pp. 994-1004. [8] R.L. Kosut (1986). Adaptive Calibration : An Approach to Uncertainty Modeling and On-Line Robust Control Design. Proceedings of the 25th IEEE Conference on Decision and Control, Athens, pp. 455-461. [9] B.A. Francis and G. Zames (1984). On H" Optimal Sensitivity Theory for SISO Feedback Systems. IEEE Trans. Automat. Control, AC-29, i , pp. 9-16. [IO]H. Kwakernaak (1986). A Polynomial Approach to Minimax Frequency Domain Optimization of Multivariable Feedback Systems. Int. J. Control 44, pp.117-156. [11]R.H. Middleton and G.C. Goodwin. Digital Estimation and Control : A Unified Approach. Book to be released in 1990. [12]E.A. Jonckheere and J.C. Juang (1987). Fast Computation of Achievable Feedback Performance in Mixed Sensitivity H" Design, IEEE Trans. Automat. Control, AC-32, pp. 896-906.

888 TABLE T.a

IM

IM

E+w=l

E+w=2

0.5 0.4 0.33 0.286 0.25

I 1.5 2 2.6 3

0.25 0.16 0.11 0.081 0.063

Table 1 : Evolution of index IM as

IM (+w=3 0.125 0.064 0.037 0.023 0.016 function of r.a

FIGURES

Gain Nominal model t

Envelope

~ Locj ~,

IF(j~II

Log

Fig. 1 : Typical behavior of multiplicative

uncertainties

889 0.45

0.4 0.35 0.3 0.286 O,2fi 0.2 0.[5 0,1 0.05 ..................................

0 0.5

0 0 1

Fig. 2

:

l

1.5

I m h l mR

2.5

2

Plot of IM(rw) for H(s)=a(s-2)/((s-5)(s+6)) w=0 , --- w=l , ..~ w=2

,/?

m

Fig.3

: Inverted pendulum mounted on a cart

u (force)

5

///!

4.5 4 3.5 /; /,

3 // 2.5 _ =:~.v21

.....

2 1.5 1

0.5

-1

-0.5

0

{),5

1

1.5 log w

Fiq. 4 : Inverted

pendulum

__

IH(jm) l

....

IH'T(j~)I

....

IBoundaries

example

for the class of plants

)with H(s) and F'

(i),(2)

given by (16) and (20)

STABILIZING CONTROL OF A SINGULARLY P E R T U R B E D SYSTEM DRIVEN BY WIDE-BAND NOISES

Mohamed E1-Ansary Department of Mathematics California State University, Bakersfield 9001 Stockdale Highway, Bakersfield California 93311

ABSTRACT

A nonlinear singularly perturbed system is considered where the state equations as well as the output equations are corrupted by wide-band noises. The purpose of this paper is to spell out sufficient conditions under which an output feedback stabilizing control designed for a reduced-order model, via the use of stochastic observer, will stabilize the full-order system. Our results, which were established earlier on orderreduction and stability are applied to justify the use of the reduced-order models and the stability of the closed-loop systems which result from applying output feedback to the open-loop systems.

1. Introduction

Until recently, singular perturbation techniques [3] have primarily focused on state and output feedback design of linear systems. Advantages of these techniques, such as order reduction and seperation of time scales, are expected to have a more dramatic effect on feedback design of nonlinear systems. Stabilizing deterministic nonlinear singularly perturbed systems have been considered, for example, in [4-6]. Stabilizing nonlinear stochastic systems via the use of an asymptotically stable stochastic observer has been considered in [7], where the work has been a generalization of the Kalman filter structure [8]. In this paper, motivated by the work in [7], in which a stabilizing feedback control for a system represented by an It5 equation (a Markov model) has been designed using an observer, we designed an output feedback stabilizing control for a nonlinear full order singularly perturbed system driven by wide-band noise by designing a stabilizing control for the corresponding reduced-order open-loop Markov model. To assure the readability of this paper we will state in this section the basic ~ sumptions and the main theoren~ tha~ were developed in [11 and [2t. Section 2 consists of the main result of this paper, an output feedback control has been designed.

891 Reduced-Order Model and Convergence Theorem

In [1] and [2], we studied the nonlinear singularly perturbed system: = a l ( x ) + A z 2 ( x ) y + B l ( X ) V ' , x(O) = x o (1.1) /~) ----a21(x) § A2y § B2(x)v e, y(0) : Yo (1.2) where xeR", yeR m, v~eR r. v ~ is the wide-band noise and was defined as v~(t) = ~ v ( t / e ) . It is wide- band in the sense that thepower spectral density matrix S~(w) = S(w/e) will have a frequency band Wo/e when S(w), the spectral matrix of a stochastic process v, has a frequency band w0. # is a small positive parameter representing parasitic elements. The process v(t) satisfies: (A1) v(t) is a stationary, zero mean, right continuous, uniformly bounded on [0, co). The a-algebras induced by v(t) are assumed to have a mixing property with an exponential mixing rate [9], 81tpA,,t IP(A2/Az) - P(A2)I ~ g e - a t (1.3) for some a > 0, where Alea{v(a), a < t} and A2ecr{v(8), s > t + r}. The exponential mixing rate assumption was taken for convenience but can be replaced by a more general mixing rate as in [10-12]. The following conditions were also imposed: (A2) The coefficients az, a21, A12, B1 and B2 are continuous in x and have continuous partial derivatives up to the second order which are bounded uniformly in x. (A3) The constant matrix A2 is Hurwitz, i.e. ReA(A2) < O. (A4) The positive parameters e and/z satisfy e >/Z'~o where To > 0 is arbitrary but fixed. The reduced-order model can be represented by a diffusion process ~(t) and its operator L ~ , where ~(t) is the solution of the It8 equation: d~(t) = b(~c(t))dt § a(~(t))dw(t), ~.(0) = Xo (1.4) where b and a are given by (1.6) and (1.10) respectively, and the operator L ~, whose form was obtained from the proof of the convergence theorem, is given by:

L'~f(x) :

~bi(x)i:, (x)+ 2 i , i : , (1.5)

where

b(x) = ao(x) + hi(x) - A 1 2 ( x ) A 2 Z h 2 ( x ) + h3(x), A(x) = Bo(x)S(O)B~(x) = [aq(x)] 1, ao(x) = al(x) - A~2(x)A-~'a21(x), Bo(x) = B , ( x ) - A12(x)A-~lB2(x),

= S(to) is the spectral matrix of v,

1(') denotes transposition.

(1.6)

(1.7) (1.s) (1.o)

(1.1o)

892 ! hi, = tr[D~BoW' + D,Ax2A z- - 1 El, $ l ! h2i = tr[EiBoW + E i A 1 2 A 2 - 1 E ], h3i = tr[-F[BoW'Bt~(At2) -1 -- F[BoE'(At2) -1 + F[Ax2A;1P],

(i.Ii)

Di = [Vzr162162

(1.14)

Zi = [Vzr/ii!Vzr/i2i...iVzrlir]nxr; Fi = [V,-~iliVz~i2i...iVz~/m],,xm;

BI = [r

B2 = [rli.i]mxr, Ai2 = [~ii]r*xm,

(1.12) (1.13)

(1.15) (1.16)

oo

w = f R(~)a~,

(11z)

0

R is the correlation matrix of v, oo

r. -_ f

eA,'t,B2R'(r)dr, for some "7e["7o,oo), "70 > 0

(1.18)

0 oo

P = f eA2A(B2a' + aB~)ea'2~dA.

(1.19)

0

The following assumption was also required: (A5) b(x) and Bo(x) satisfy the growth and Lipschitz conditions:

[b(#l + IB00:)l __ K(1 + I~:l), Vx~n"

IbCx) - b(z)l + ] B o ( z ) - B o ( z ) l < K l z - z l , Vz, zeR" The probability space in which v(.) is defined and the notion of weak convergence were introduced in [1] and the following convergence theorem was proved: Theorem 1 Under the assumptions (A1)-(A5), the process x~m(.), defined by (1.1) and (1.2), converges weakly to the diffusion process ~(.), defined by (1.4)-(1.19), as e -+ 0, # --+ 0, and r --+ "7. The explicit form of the limiting diffusion model shows the interaction between two asymptotic phenomena through the dependence of the matix ~, defined by (1.18), on "7 = lira * (The implication of such dependence on the engineering practice of neglect9 ~/~--*0 ~ "

ing parasitic elements was discussed in details in [1]). The first phenomenon arises in singular pertubation analysis due to the existence of parasitic elements (/z --, 0) while the second phenomenon arises in asymptotic stochastic analysis of systems driven by wideband noise (r -+ 0). The Stability Result Most of the work that was done in the area of stochastic stability, was dealing with systems represented by R6 equations (Markov models). The stochastic Lyapunov method, that is analogous to the deterministic Lyapunov method, was effectively used [cf. 13, 14]. One of the first attempts to study stability properties of dynamical systems which are driven by wide-band noise, has been made in [12]. We utilized some of the techniques of [12] and extended them to the singularly perturbed system given by (1.1)

and (1.2).

893

In [2], we proved a Lyapunov like stability theorem about the slow variables which was based only on conditions uppon the approximating diffusion process. A Lyapunov function V(x), xeR ~ was considered with the following properties: (a) V(z) is real valued, positive definite, V(x) = 0 =:a x = O, V(=) -~ co as [x[--* co, and has continuous partial derivatives up to the third order. (b) For any vector or matrix valued function g(x,t) =- O(x) for te[O,T] and xeR '~, we have: ]V~(x)g(x,t)l < K V ( x ) , (1.20)

(v-c~)g(~,,)),~(~, t) l _< gv(~), ~OZIOZjOXk glgZ.g3 < KV(=~ 9

3

lc

--

x

(L21) (1.22)

~"

where i,j, k = 1, 2, , n and a.1. #2., and g,Z are components of vectors or matrices which are O(x). The constant K in (1.20)-(1.22) may not be the same and it is independent of T. Also the following assumption was made: (A6) The coefficients a~(z), a~2(z) and B~(z) vanish at x = 0 and for some M > 0 and 9 .,

,mr7

J

VxeR ~, [al(x) I + ]A12(=)] + ]BI(Z)] _~ M Ix[. Now we state the main theorem which is recalled from [2]. Theorem 2 Suppose that there exists a Lyapunov function V(x) defined on R ~ satisfying (a) and (b) and for some A > O,

L'W(x) ~h for "h > 0 arbitrary but fixed. Now we derive the diffusion operator corresponding to X with the aid of (1.5)-(1.19), where the assumptions that will be listed later, will validate this derivation. As in (1.6), the drift coefficient is b(X) = rio(X) + t~l(X) - ,~2(X)A;XI~2(X) + I;3(X) (2.17) The diffusion coeffiecient is A(X) = 5(X)g'(X) (2.18) where =

( ~ll (~12)

, \521 b~2 rio(X) = ri, (X) - i , ~ (X)A~Xah (X) = a0C~) + a0(~)~(~)

(2.19)

/x(~) + [F,(e) - KF~(~) - K~A~Xa~(~)]~(e) + K[~(~) - f~(~) - ~ : A ; ~ ( ~ ) ]

)

(2.20)

where

(2.21)

Go(x) = Gl(x) - A12(x)ATtGz(x),

go(X)=( B0()

0 ),

-KczA21B2(~,)

(2.22)

KB3(~.) (2.23)

5=

o)

(2.24)

where

(2.25) 0

From(2.15) and similar to (1.14) we have: ( f o r X =

o x,(:

oX,)o

(:))

fori----1,2,...,n

(2.26)

for i = n + 1 , . . . , 2 n

(2.27)

896 where /S,(x) = [V,,ail(x)i ... iVzaio(z)],xo ; KBs(x) = [a~j(x)],xo for i = n + 1,...,2n From (2.14), (2.16) and similar to (1.16) and (1.15) we have

~,(x) = P,(x) =

(?')

0~,•

,

i=

1,2,...,~

(2.29)

i = ~+ 1,...,2~

(2.30)

E,(X) = ( ( E , ) , • :) i = 1,2,...,m 0 Then similar to (1.11) and from (2.22)-(2.24), (2.26) and (2.27) we have: hxi = tr[Dibol4;" +/9~AI2A~'IE] = tr[D~BoW + D~At2ATXE] = hli for i = 1 , 2 , . . . , n From (2.27) we get: hil = 0 for i = n + 1,...,2n

(h,:,)

Hence

(2.28)

(2.31)

(2.32) (2.33) (2.34)

Similar to (1.12), we have: = h2i for i = 1,2,...,m where h2i is defined as in (1.12), and similar to (1.13) we have:

hs, = t r [ - ~ " D o ~ ' g ~ A ; = hzi

for

1 - F'[3o~'(A;1) ' +

(2.35)

~'L2A;'~I

i = 1,2,..., n

(2.36)

i=n+l,...,2n

(2.37)

and hsl = 0 for Hence ~sCX) =

(2.38)

where h3i is defined as in (1.13). Notice that, it can be verified that/5 = p. Now, since ,iCX) = &CX)SC0)B~(X) where "~(0)= ( S l i 0 )

(2.39)

$20)(0)

Then from (2.18), (2.19), (2.22), and (2.39), we get:

a(X) = ( B~ \-gc2A;1B2(x)~

0 ] KBs(x)~-C~J

(2.40)

Then the reduced-order closed-loop model corresponding to (2.10) and (2.11) takes the form:

d.~ = [b(2) + Go(.~)gC~:)ld,t + B o C x ) v / ~ e w x

(2.41)

897

d~ = [fl (x,) + (F, (k,) - KF~ ($z) - Kc2 - Kc2A~IG2(~z))g(~) + g { o ( ~ ) - f2(~) - c2A;la21(~) - c2A;lh2(2,)}]dt - K c 2 A ; 1 B 2 (~) S~/~)dwl + KB3(~-) V / ~ ( ~ d w 2

(2.42) Now we impose a consistency condition which is stated as follows: the reduced-order closed-loop model (2.41) and (2.42) has to be the same as the closed loop reduced-order model which is given by (2.7) and (2.8). Hence by comparing the coefficients in the two systems and insisting that they must be equal for all K and for all functions g(x), we have: fl i x) -= b(x) (2.43) F1 (x) = Go(x) (2.44) 0.1 (x) = Bo(x) ~ (2.45) f2 (x) = el i x) - c2A 2' a21 (x) - e2 A 2' h2 (x) (2.46) F2 (x) = - e 2 A ; 1G~ (x) (2.47) Hence, the open-loop reduced-order model (2.4) and (2.5) can be written in the form: d.~ = [b(~) + Go(~)u]dt + 0.1(~,)dwl i2.48) d2 -- leo (x) -t- Fo (x) u]dt + 0.2 (x,)dwl + a3 (2,)dw2 (2.49) where b(x) and Go(z) are defined by (1.6) and (2.21) respectively, and

C0(X) ---~C1(x)

-

-

c2A21a21C x)

-

-

c2A21h2Cx)

(2.50)

Fo(x) = -c2A~lG2(x)

(2.51) (2.52) (2.53)

0"1 (.T) = B 0 (x) ~

0.2 (x) = -c2A~ 1B2 (x) ~ o3 = B3( )

(2.54)

and the proposed observer (2.6) takes the form: d~ -- [b(}) + Go(~,)g(~,) + g{c0C~) - c0(}) + (F0(~) - Fo(k,))g(~)}]dt (2.55) +K0.2 (~,)dwl + K0.3 (~)dw2 The design problem, is to choose a function g(x) which is smooth enough and a constant matrix K such that the state x and the error e = ~ - $ will be stochastically asymptotically stable. Let us write the the It6 equations (2.48) and (2.49) in the form

,using (:), (2.56)

d)( = b()Odt + 5(2)dw where

b(X) = (

bi~') + G~

)

(2.57)

b(~,) + Go(~,)g(~) + K{eoi~) - coi~) + [F0(~) - F0(~:)]g(~)} and

0 )

(2.58)

K0.2 (5;) K0.3 Suppose we have found u = g(z) and the gain matrix K which stabilize (2.56), then the next step is to apply the same control law to the open-loop full-order system (2.1)(2.3) where u = g(~) and ~ is the reconstructed states and satisfies the equation of the following observer: ~ = b(~:) + Go(~:)g(~) + g[cl(x) - co(:~) - Fo(~)g(~)] + gc2y + gB3(~)v~ 2

(2.59)

898 Now we are ready to spell out all the conditions, under which the stability of (2.56) will imply that of (2.1), (2.2), and (2.59), when u = g(~) is applied to (2.1) and (2.2). This will be done with the aid of the stability result (Theorem 2). We require the following assumptions: (A) al,b, Bo,B3,A12, and cl to vanish at their respective origins. (B) b and 5 are required to satisfy: i

~(x) -~(Y)[ +]6(X)-5(Y)[ _ 0 g i v e n .

that

A1

in

CB. 219

stable.

CB. I S g , C B . QD-CB. II>

it

follows

that

fop

=EI m,

910

C B. 2~-D

--[o from

CB. IO9,

3

X4

where,

AN=

~

Define

and

Kl=-kI

=pT>o,

P a

CB. P-S)

P

a

m

, k>O e

A 4 = A4

C B. 243

+ K1

and

~n-mxn-m

such

that

a

pa XI + -T Pa =: -Qa O,

CB. ?.79

[o and

so,

APPENDIX

CB. 289

the

sufficiency

C

Proof

-

N e c e s s s ty: CA. 14)

and

CB. 3),

CB.:9

CA. 15).

PN CAN + TC~: N)

proved.

of T h e o r e m

Consider

and

is

the Then,

so (2.43

+ CAN

3.2

i i near from is

I

tr a n s f or m a t i on

the

proof

equivalent

+ TC~ZNDTP N

=

of to

theorem

T

in 3.1

C A. g) , we

have

]

911

=

[ ]

-

Now,

%

%

0_~

Q4

from

+

-Q

Sufficiency:

and

=

-~aP~ any

0) on W zd/M which

caricatures a circuit switched network with d y n a m i c r o u t i n g (the s t a t e m e n t s below are true for any d, but the situations d = 1, and d = 2 are likely to be of most interest). We use ~/ to denote a generic element of W zdlM, and call r/(z) the value at site x. Let M* denote ((2M+l)~-l) 2

The Markov process is described by the t r a n s i t i o n s

~(.)

, .(~)

-

1

at r a t e ~ ( ~ ) ,

T/(z) -----+r/(x)+ 1 at rate u if ~(x) # C ,

(~(z), ~(y), ,7(z))

, (~(x), q(y) + 1, ~(z) + 1) at rate . / M " if z , y , z are distinct sites with

~7(x) = C,~(y) < C,~(z) < C and y , z E x § [ - 1 , 1 ] ~ . There is no difficulty constructing such a Markov process even though the n u m b e r of sites is infinite. See Liggett, [9], C h a p t e r 1, Section 3, for details; T h e o r e m 3.9 of that section applies directly. We t h i n k of each site in the 'lattice as representing a link in our network, which consists of C circuits. We think of the value at a site as giving the n u m b e r of occupied circuits in the corresponding link. Occupied circuits become free at rate 1. At each link there is a Poisson process of calls with rate v. Each call occupies one circuit on its link if available; if the link is saturated the call r a n d o m l y picks two other links which are

942 in its [--1, 1]d n e i g h b o u r h o o d , and uses one circuit from each of these links if possible. Otherwise the call is blocked and rejected from the system. Note that because we have a compressed lattice, the i n t e r a c t i o n actually has range M on the scale of links. For x E Z a / M , let u M ( t , x , k ) denote P ( q M ( x ) = k), 0 - o o }

(ii)

963

A is also equal to the minimal mean weight of all the circuit c of G(A): )~ = rain w'(c)

o t'(~)

/

Going back to the initial event graph, this gives a proof of formula/il). The following states the connection between eigenvalues and asymptotic throughput: T h e o r e m 3 ( C o h e n e~ al.[1]) Le~ 2d be the least solution of the autonomous recurrenl

syslem (10), and ~ be ~he eigenvalue of *he irreducible matrix .4. Then, there exists E N* (~he period), and tt C N (*he length of~he transienl} such ~hal:

x(t + e) = ~ex(t) for t > t~ If X(t) represents the quantities produced, then )~ has the obvious meaning of rate of production, or periodic throughpul. In the conventional algebra: X(t + 8) = 0)~ + X(t) i.e. the vector quantity OA is produced during ~ units of time. 6.2 G e n e r a l case If the matrix A is not irreducible, we partition the graph of G(A) into k strongly connected components. By renumbering the nodes of G(A), we may assume that A has the block-triangular form A1,1 A2,1

9 A2, 2

. .. ~

An,1

A~,2

...

~" t

A= Ak,k

where A t , t , . . . , Ak,k are strongly connected. Let AA~,~ be the eigenvalue of the component Ai,~. Then, the spectrum ~r(A) (the set of the eigenvalues of A) is a subset of {AA~,~. . . . . AAk,~}. AA~,~is an eigenvalue ofA iff the upstream eigenvalues AA.... . . - , AAk_,.k_~ are greater than AA~,,. We shall only consider the smallest eigenvalue ~- = rain ~(A) = min{AA, . . . . . . AAk,k} which corresponds to the slowest part of the system. Then, ~ is also characterized by (11).

7

Symbolic

computation

of the maximal

eigenvalue

When A is a constant matrix, a well known algorithm (Karp [9]) allows computing A. In our case, A (more precisely A(q)) is a matrix function of the undeterminates q l , - . . , qk associated with the resources (cf. equation (7)). The classical analogous for (#-lA(q))* would be ( I - #-tA(q))-l. A symbolic computation of this rational matrix function of q and # would obviously be out of range. In the dioid case, we shall show that absorption properties make this computation simpler. We now recall the Gauss-Jordan algorithm for computing the * of a given matrix. 7.1

The Gauss-Jordan algorithm

Let A be a n • n matrix with entries ai,j in a complete dioid, and b a vector. ~" = A*b is the least solution of x = Ax @ b: xi = a l , l Z l (D a l , 2 X 2 . . . ~ a i , n x a ~ bl x2

=

a 2 , 1 x l $ a 2 , : x 2 . . . $ a 2 , n X n (~ b2

(12)

964

We can eliminate xl by substituting zl = a * l , l ( a l , : x 2 . . . 9 al,,~z~ (9 bl) in the n - 1 last equations. This means that the usual gaussian elimination algorithm allows the computation of A*b. There is a simple way of writing the algorithm by considering the matrix equation X = A X ~ A. This is equivalent to solving n systems of type (12) where b is replaced by a column of A. The least solution of X = A X (~ A is A * A . We have A * A = A A * , and A* = AA* (9 I d (this simply is the definition of A*). So, we can recover A* from A* A. It should now be clear that A* A can be computed by the following Gauss-Jordan algorithm (cf. Gondran & Minoux[6]): start with the matrix A (~ -- A define by induction A ('n+l) = ra (m+l)~/ by: al,(re+t) (,~) 71"mare, (ra)~ ~. l,t t = al,(m) t ~ tgd al~rn with 7rm = (a('~,))* Then, we get A (n) = A * A . It is essential to notice that A* is finite iff A * A is finite which is equivalent to all the ~'m being finite. 7.2

Rational closure of the dioid of polynomials

We need to perform the Gauss-Jordan algorithm with the initial matrix A (~ = l ~ - l A ( q ) , the entries of which are monomials in # - 1 , ql . . . . . qk. Our goal is to compute the set of constraints in problem (3). The aftlne constraint ,-~-5ql + . . + ,-~q~ +

>__~"

can be written as a monomial with rational exponents using the dioid algebra:

Therefore, we introduce: D e f i n i t i o n 4 A generalized polynomial f u n c t i o n o f q t , . . . , q ~ and t~- t is a finite sum: f ( q l . . . . . q k , # - l ) = ( ~ a a q ~ X . . . q ~ k # - - ~ k + l , with (~ = (cq . . . . ,~k+l) E (Q+)k+t and a,~ E R.

We denote by R < q t , - - . , qk, ~ - l > the dioid of polynomial functions with indeterminates and /~-1, endowed with the natural sum and product. We keep the usual

ql,...,qk

11

1

algebra for the exponents (e.g. 2q 3 | Oq~ = 2q~-). For instance, 0q~ is a monomial, which is the dioid version of "0 + ~-" a in the conventional algebra. We now study the * properties of polynomials. In a commutative dioid, we have

(o): (aeb)* =a'b* Then, we only have to consider the * of a monomial. We have already pointed out that in the scalar case, a* is 0 or - c o . This leads to defining the indicator function X by: X,(t)

f 0 L -co

if t < s if $ > s

965 Let P E R- < q l , . . . , qk, #-1 ~>, and Q, R E R < q t , . . . , qk >. The main algebraic features of indicator functions are:

(0: 9 ..q~ ) = X ~

~(~)

ql p ...qk p

(.i): Xq(t) r X~(t) = Xqe~(t) (i~): (,): [)/~(~)]* = X Q ( 0 (~): [P X,~(,~) X~(o)]* = P* X,~(,~) XR(o) We denote by ~" tile set of all the functions f of the form:

f(q,l ~) = P(q,#) Xq(q)(#) XR(q)(0)

(13)

for some P E R < ql . . . . , q k , # - i > and Q , R E R ' < q t , . . . , q k >. Then: 1 The dioid F is stable under the * operation.

Proposition

.T" is the rational closure of R < q l , . . . , q k , # -1 >, i.e., the least extension of R- < q t , . . - , q k , # -1 > which is stable under @,| and *. Thus, the entries of the matrix A ('~) defined in the Gauss-Jordan algorithm will have the form (13). As the entries of the matrix A (~ only have powers of #, we never use the (i) property. Moreover, formulas (o), (ii) and (iv) show that the* of a polynomial is an indicator function. Then, we have the stronger result: P r o p o s i t i o n 2 At the end of the Gauss-Jordan algorithm, we get: ~r,,(q,#) = for some P E R < q t , . . . , q k >

Xp(,r

T h e eigenvalue A is characterized by (11), thus it is the maximal solution of X p ( # ) > - c ~ , which gives A = P. Let us explicitly write the polynomial P: P ~- ( ~ , a ~ q ~ . 99qkak 9 Going back to the conventional algebra, we get: A = P(q) = min,~(a~ + atq~ ... + akq~) This is the symbolic expression for A(q) which we need for solving problem (2). 7.3

Complexity

estimation

This algorithm is useful when (i) the size n of the system (of the matrix A) is large, and (ii) there is only a few indeterminates (k o , which is called the faulty mode. The probability of switching from a standard to a faulty mode is known and given by 0.

For simplicity, this

paper will assume that o 1=0, corresponding to a perfectly reliable server and let the switching probability be constant, independent of the number of units produced. The switching process is therefore as that

969 shown in Figure 2. In such situations, unlike queueing models with breakdowns, the process is neither stopped, nor is it taking a vacation, but operates continuously, albeit producing, with some probability, a faulty output. In this sense, the manufacturing technology of the production process is represented by the parameters [o 1'02'0]" The smaller these parameters, the more effective (in a quality production sense) the manufacturing system. For quality conformance a CSP-1 Continuous Sampling method is applied (Dodge 1943, Duncan 1974). For such quality control, we can compute the various probabilities of producing defective units, detecting such units and of course the probability distributions of basic quality states of the queue-like production process. We summarise essential results. A production cycle is shown in Figure 3, during which N jobs are processed and the following is shown. Proposition 1: Let i be the number of defective units produced after the production system has switched to a defective mode and let k be the total number of units produced in the defective mode. Define by Pi(k) the probability that the ith defective unit is detected, then these probabilities are given by the following relationships, (1) Pi(k)= b(1-b)i'~NBD(i,k), k=i, i + l , i + 2 ..... i=1,2,3.. while the probability of producing i defectives in a production run is given by, oo

(2)

qi = ~ Pi (j) j=i

where NBD(i ,k) is given by the negative binomial distribution, (3) NBD(i,k)= ( k-l i-I(l'v ~ k-b '2i k = i , i + l , i + 2 , i + 3 . . . . . . Proof: (See Tapiero and Tsiotras 1990) The expected number of defectives production cycle is,

and

their

variance

in

a

oo

(4)

i = ~ iqt , i~i

i

i i-I

By the same token, the expected number of defective units which leaves A the production system undetected is y =E(i-1), while its variance is a 2y ' or ^

(5)

co

r

y = T. (i-1)qi; a 2= ~. (i-l-y)2qi i-I

Y i=l

If b/ is the total expected number of units produced in a cycle, then of course, the expected number of non-defective units produced in the cycle

970 ^

is

given by,

^

N-y.

In the values to these parameters.

following

proposition,

we

provide

explicit

Proposition 2: Let N be the number of units produced in a cycle. expected production in a cycle and its variance are given by, A (6) N = ( 1 - 0 ) ~ + 1/0 where Oo

=

P

~.

(8)

v a r ( N i)- 1 = k=i (1-0)~

X

kPi(k)=(l/ 02

b)o 2 A

+ P2 + 2nP]

+

(1-0)/02+(1-0)2nP 2

OO

GO

P2 =

the

O0

(7)

(9)

Then,

}:

~:

i--I

k~Pi(k)=(l+~176176176

k=i

Proof:

see (Tapiero and Tsiotras 1990)

Proposition 3: In the long run,^the probability of inspecting a unit is given by, (10) n = b + (1-b)nl/N while the probability of detecting

a

defective unit

and

shutting

down

the system is given by, A (11) J = I / N On

the

basis

of

these

probabilities,

the

production

system's

performance can be assessed. Consider first the amount of time required to perform a job. Let r be the amount of time required to produce a unit standartly, r I is the time required to inspect the unit and z2 the time to shut down the system and attend to repairs, then the production time of a job is given by the mixed distribution T below, r + z (12) The

T

w.p.

=

z + z I + z 2 w.p. r w.p. 1 - ~ 5 - ~ subsequent analysis will integrate

considerations

of

quality

and

quantity issues in the manufacturing process.

The Two Stations With Sampling And Limited Buffer

Here blocking of the first station occurs when upon completion of service in the first station attempts to move to station 2 when it is full are made.

In this case,

station 2 occurs. general and

mixture

repair.

blocking occurs until

a departure

from

In this system, the service time in station 2 is a distribution

Attendance

of

the

service

times,

can

be

time

computed

including without

inspection altering

the

971

basic structure of the queueing model.

Namely,

the mean service time

and its second~, moment are,, given by, (13) (14)

l//.t = z + (~r+~)z t + ~z"2 ET 2 = ( 1 - ~ - r 0 E r 2 + n E ( z + z l ) 2 + J E ( r

+rl+z2) 2

Since the manufacturing system has a limited buffer, under an assumption of heavy loading at the first station, the output rate of the first queue is /~ , (which equals the service rate of result, the second station behaves as if it were an Poisson server

arrival acts

as

rate

/t i

an

since

additional

during

space

the

that

this

station). As a MIGIIIR+2 queue with

blocking

belongs

to

period the

the

second

first queue.

Furthermore, due to the blocking period, the server in the first station does not process unit and remains therefore idle. That is, no more arrivals

occur

at

the

second

station.

Due

to

the

exponential

assumption, this is equivalent to stating that arrivals do occur at the second station at a rate of /~i but they are lost. In view of this observation, the second station becomes an MIGIIIR+2 queue. Such queues have been studied previously (e.g. see Makino 1964, Keilson 1966, Lavenberg 1975) and their operating characteristics are known. As result, using the adjusted service rate /t and E(T 2) we can obtain

a a

direct relationship between the buffer size R, the manufacturing process unreliabilities and the manufacturing system operating performance. These effects are two fold, first on the manufacturing capacity and second on the blocking probabilities for prior queues (causing thereby inventory accumulations and forced idleness). Let fl be the probability that the system is full, blocking thereby the first station. Then, for a buffer size R, (15)

fl = 1 - I / ~ + P

+2], with

p=l~lll.t,

R+I

PR+2= l / l ~

flj]

j=0

with (15)

1 ifk ~k

=0

all(I-at)

"~

if

{Efljak+l. j +

k--1

ak}l[l'a 1]

if k > 2

and (16) a k = f0{[1-FT(t)]e'/Zlt /Zk1 tk-l/(k_l)!} dt If we denote by F~(s) the Laplace Transform of Fr(t),

then using

the

well known transform relationship, (17) LT(tk" IFT(t)) = ('l)k" IF'T (k- 1) (S) where

F~(k'l)(S)

easily shown that,

is

the

(k-1)st

derivative

of

the

transform,

it

is

972

o~ = ~/(k-1)l][F(k)/,u~- (-1) k-I F TO ( k - 1 )(~ l ) / ~ l ] with F;(S) given by, (19) F ; ( s ) = F;(s){F;i(s)[~+SF;2(s)]+I-5-~} (18)

The expected time a job will spend in the second station is thus, R+I

(20)

W = (R+2)//,t - E JPj//t 1 j-I while the variance of the waiting time is, R+I

(21)

var(W) =(R+2)[ET2-1//~ 2] +

E

R'4-1

[j ( j + I ) P j ] / / ~

j=l

- [ T. jPj//~I] 2 j=1

( R + 2 ) (R+ 1)PR+2//~I/~ A Similarly, the expected number of jobs in the second station is L=/~IW, while the expected number in qeue, Em q is given by, L q = Emq= /Jl(1-fl) Wq - ft. Finally, the throughput of the system (at station 2) is d R, and given by, ~R = ]gl/[jO+PR+2] , p=/~t//~ and thcrcforc the loss of throughput ,4 due to limited buffcr in the station is given by, d = 5R(X+~).

These terms, combined with the production system's quality performance characteristics, can bc used to assess the overall performance of the production system. Throughout our applications, define the following cost parameters, C a = the cost per unit of blocking the first station, measuring the forced idle time imposed on the first queue. C = the inventory cost in the second station, per unit per unit time and expressing the cost of WIP in the second station Ct = the inspection cost per unit C d = the post process cost of a dcfcctuous unit, expressing the penalties associated to jobs processed in a faulty manner and that have left, undetected, the system. C = the cost due to the loss of output because of the inspection T

and defective units. C = the cost of adjusting the process, once a defective unit is F detected. These include f'Lxed stoppage, re-start costs as well as the repair or maintenance costs associated to the manufacturing system. Since our process is a renewal-reward process, the long run average cost is equal to the average cycle cost (Cinlar 1975) and therefore this cost, denote by 4 , is given by, (22) 9 = [ C

+Ci[bN+(1-b)nl]+

CdQ]/[~q/~]+

CILq+ CTZ1 + CBfl

973

where N//a is the expected cycle time, C fl is the cost of blocking per unit time of the second station, or the probability of the system being full. A A [bN+(1-b)nl] is the expected number of units inspected in a cycle which was computed earlier, CILq is the inventory cost per unit time since, L q is the average number of units in the queue, CTA is the average loss of output due to inspection and repairs/maintenance, stoppage and restart. Finally, CaQ is the expected cost of non-detected defective units which have exited the system. Since in a cycle, we produce qi defectives on the average, one of which is detected, the expected number of undetected faulty units in the cycle is given by, O0

(23)Q = ~ (i-1)qs with

qi

defined earlier

Numerical analyses were used and the following conclusion were reached. 1. When the buffer space is reduced, the production variability increases. Since such variability has negative effects on the smoothness of the production flow, it is possible to compensate such variability through an increase in automation (which reduces service time variability). We might conjecture therefore that JIT provides an incentive for automation. When we consider the quantity produced in a cycle, we note that while with increases in inspection the expected production cycle becomes smaller, its coefficient of variation increases. 2. When the buffer space R is reduced, the cost of process unreliability decreases. Further we note that A~/AO is largest at R = I , which means that it is most advantageous to invest in process quality when the buffer space equals one. When buffer space increases, the relative advantage in improving quality is reduced. In fact, when b=0.05, we see from that A ~ / A 8 is largest when R=0, at which time investment in process reliability improvement reduces costs the most. Close to full inspection (b=.8), an investment in process reliability improvement can at best be used to reduce inventory costs (explaining thereby the growth with respect to R). 3. In J r r systems, flexibility reduces the blocking probability. Thus the value of flexibility at a given station is given by the cost reduction effects such reduction in blocking has. Of course, the

974

probability of blocking can be reduced by increasing inventory, but this incurs at the same time several other costs. 4. The marginal cost of inspection

are

greater,

the

greater

R.

5. The probability of blocking decreases when the buffer space increases, but it declines faster when the buffer space is smaller. At the same time, while the loss of throughput is smaller when R increases, this loss occurs also at a declining rate. This ~improvement ~ in operating performance involves a cost reflected in the increased average buffer space (inventory cost). These costs increase when we increase inspection. Thus, if the process reliability is not increased, quality can be improved only with inspection which induce detrimental operation of the manufacturing process. For in- versus out of station repair we saw that the greater the cost of production due to process unreliability,

the

more

beneficial

it

will

be

to

inspect

in-station

rather than out. These conclusions, having a strategic implication to the design and the management of unreliable Just in Time Manufacturing Systems are in part

accepted

as

"common

knowledge".

This

paper

has

provided

a

justification based on the modeling of JIT as tandem queue like manufacturing systems, reinforcing both current experience and quantifying it to value JIT, quality control and the process of quality improvement REFERENCES Cinlar, E., Introduction to Stochastic Processes, Prentice Hail, Englewood Cliffs, N.J., 1975. Couway R, W.Maxwell, J.O. McClain and L.J. Thomas, The role of Work in Process Inventory in Serial Production Lines, Operations Research, 36, 1988, 229-241 Dodge H.F., A Sampling Inspection Plan for Continuous Production,

Annals of Mathematical Statistics, vol.14, 1943, 264-279 Duncan A.C., Irwin, Ill. 1974

Quality

Control and Industrial Statistics,

4th.ed.,

Hall R.W., Zero Inventories, Homewood, Ill. Dow Jones-Irwin, 1983 Hsu L.F. and C.S. Tapiero, A Bayes Approach to Quality Control of an M/GI1 Queue, Naval Research Logistics Quarterly, 35, 1988, 327-343 Hsu. L.F. and C. S. Tapiero, Quality Control of the M/G/1 Queue,

Euro. J. of Operations Research, Forth., 1989

975 Hsu L.F. and C. S. Tapiero, Quality Control of an Unreliable Flexible Manufacturing System: with Scrapping and Inf'mite Buffer Capacity, Int. J. of Flexible Manufacturing Systems, Forth. 1989 Keilson J., The er.godic queue length distribution for queueing systems with finite capacity, J. Royal Stat. Soc., Series B, 28, 1966, 190-201 Lavenberg S.S., The steady state queueing time distribution for the M/G/1 finite capacity queue, Management Science, 21, 1975, 501-506. Makino T., On the Mean Passage Time Concerning some Queueing Problems of the Tandem Type, Journal of the Oper. Res. Soc. Japan, 7, 1964, 17-47 McClain J.O. and L.J. Thomas, Operations Edition, Englewood Cliffs, N.J. Prentice Hall, 1985

Management,

Second

Monden Yashiro, (Ed.), Applying Just in Time, The American~Japanese Experience, Industrial Engineering and Management Press, Atlanta, 1986 Schonberger R.J., Japanese Manufacturing Techniques: Lessons in Simplicity, New York, Free Press, 1982 Tapiero C.S., Production Transactions, 19, 1987, 362-370

Learning

and

Quality

Nine Hidden Control,

liE

Tapiero C.S. and L.F. Hsu, Quality Control of an Unreliable Random FMS: with Bernoulli and CSP Sampling, Int. J. of Prod. Res., 26, 1988, 1125-1135 Tapiero C.S. and G. Tsiotras, JIT, and the Quality Improvement Process in Queue Like Production Systems, Working Paper, 1990

976 Figure 1: The Production Model

Limited Butler [][]

Saturated o ....

Re.~=esta.o.

R

M/G/1

~

~R~7;k

Unrell,lble Station

Figure 2: Production System Switching %Defectlvell Produced Failure Mode Probability of mwitch

:StandardMode # Produced

Figure 3: The Production Cycle Start I

b Probabllletlr Failure n

~Sample ~

D

~

Repair ProductionCycle

Re,tart

MODULOIDS AND PSEUDOMODULES 3. T h e L a t t i c e S t r u c t u r e P r o b l e m E. Wagneur* Abstract We determine here sufficient conditions for finite dimensioned modulo~'ds and pseudomodules to be lattices. In particular, we show that completeness of the dioM D of scalars is such a condition. The simplicity conditions for pseudomodules, which make the classification problem tractable, are also shown to be sufficient for the lattice structure. Since these conditions are clearly unrelated, both results show that neither one is necessary. A concluding example illustrates this remark.

1.

Introduction

In traditional dynamical systems discribed by differential or difference equations, the underlying time set is independent of the evolution of the system, and sudden changes in the state are associated to singularities or bifurcations. In contrast, the dynamics of deterministic discrete event systems (DEDS) is characterized by the occurrence of sudden changes in the state, appearing at instants which are determined by the dynamics of the system. For example the message flow in communication networks, or the material flow in automated production lines may be modelled as DEDS. The mathematical modelling of DEDS aims at : 1. Providing the analogue of the powerfull classical models for the analysis of the dynamics involved. 2. Giving a tool for the analysis of the relation design/performance of the underlying physical system. 3. Developing an instrument for observation and control of the system described. Although various models have been proposed in relation with one or more of these goals (e.g. 1, 3 in [7] and [11]), we will concentrate here on the model introduced in [3],[4], which has proven to be relatively efficient in connexion with objectives 1 and 2. In this approach, a DEDS is modelled as an event graph (a particular class of Petri nets), and its evolution discribed by a set of equations of the type : ;gi(~ ~- 1) = Max{ Max {aij + xj(k)}, bi + u(k)}, l<j_ ~o(x) _< ~o(y)). Examples 2.3 E l . (IN U { - o o } , max, + ) , is a dioid. E2. (IR U { - o o } , max, + ) , is a pseudoring. E3. D ~ is a moduloi'd (resp. pseudomodule), whenever D is dioid (resp. pseudoring). Since D is completely ordered, D " is a distributive lattice. E4. For D an arbitrary dioi'd or pseudoring, let xl = (1, 0, 0), x2 = (1, 1, 0), xa = (0, 1, 1) E D a 9 3

The pseudomodule M = {y~ Aixi I Ai E {0,1}} = {0, z l , x2, z3, xl + za} generated by the

xi's

i=l

over the pseudoring {0, 1} is a nonmodular lattice.

Eb. For D arbitrary, let xl = (,~, 1), x2 = (1, # ) , with 1 < # < A. The set M = {~lxl+2~2x2 I )~1, A2 E D} is a proper submoduloYd (subpseudomodule) of D 2 .

980 Although Xl and x2 are not comparable, we have xl < ,~x2, as well as z2 < ~xl. Note also that the semilattice X + = {xl, x2, x l + x 2 } is isomorphic to the semilattice ((1, 0), (0, 1), (1, 1)}. E6. Let O = ( q u { - o o } , m ~ , + ) c I3 = ( r t u { - o o } , m ~ , + ) , a n d ~ = 0 , 1 , 0 ) , ~ = (0,1,1), ~ = (0,-v~,0) E D ~ 9 Then X = {~1,~,~} generates a Z ) - p , e u d o m o a u l e M , which is not a lattice, since the set (x E M [ z _< z i , x _< x2} has no 1.u.b. in M. Again, as a poser, X is isomorphic to Y = {(0,1,0), (1,1,0), (0,1,1)} C D 3 , and Z + and Y + are isomorphic semilattices.

Let xi E M , i E I , where I is an arbitrary index set. Definition 2.4 We say that X = (zi)iE I generates M iffVx E M may be written as a finite linear combination 9, = ~ ~ i X i , with xi E X, ,ki E D, Ai = 0 except for a finite number of values i E I. iE1 Let Xz stand for the span of X , and consider the following properties, where I1,12 C I. P1.

VI1,I2, XI, f] XI2 = XltnI2.

P2. VI1,/2, /'1 N/2 = O ~

XI1 N XI~ = O.

P3. V{ E I, VIx, { f! 11 => X{i} f] Xx, = 0. P4. Vi E I, VI1, ,i ~ Ix :=> xi ~ X q . Clearly these properties are listed here from the strongest to the weakest. They are equivalent in a vector space, but not in a moduloi'd as simple examples show. W h e n M is a pseudomodule, then P4 r P3. In.[12], we show that P4 is equivalent to the condition that X is the set of irreducible (x = y + z => x = y or x = z) elements of the semilattice M . Also the existence theorems for bases of [12] imply that if M is finitely generated then it has a unique system of generators satisfying P4. T h u s weak independence is defined by P4, and independence by P3. A weakly independent system of generators is then called a weak basis and its cardinality defines the weak dimension of M . In case M is a pseudomodule, a system of independent generators is called a basis. It is unique up to a rescating map of the form zi ~ ;bz~, (LJ e I). Examples 2.5 E l . Let xl,a:2 as in E5 above, then X = {2.1,22} is independent and generates a twodimensional moduloYd, which is not isomorphic to D 2 , since morphisms are isotone. E2. For z l , z2 as above, let Yl = A - l a x , Y2 = x2. T h e n Y = {Yl,y2} is independent. Now Yx < Y2, hence, even as a poser, Y is not isomorphic to X . However, as a submoduloid of D 2 (i.e. with the relation Y2 < A#yl), the moduloid generated by Y coincides with that generated by X . E1 above suggests that the order structure on an independent system of generators is not sutficient for the characterization of the moduloid it generates. Nor is it necessary, from E2. It follows that the classification problem in the general case is a m a t t e r of non trivial combinatorial complexity.

981 Given z , y E M let L ( x , y ) = {k E D I x _< A y } . Also, any basis X generates a semilattice X + over the pseudoring {0,1}. Whenever these operations are defined, we will write AM (resp. A+) for the meet in M (resp in X + ) . Definition 2.6 We say a pseudomodule M is simple iff it has a basis X = (xi)iex such that : $1.

Vwi, x j E X , O ~

XiAMX, j = x E M

s2. vx, y E x + , Z(x, y) r

r

xiA+xj

=xEX

+ ,

~ 1 e L(x, y).

Remark 2.7 If M has a finite basis X , then X + = X + U {0} is a finite semilattice with universal lower bound, hence is a lattice. Examples 2.8 E l . D'* is a simple pseudomodule, with basis X . i = 1,...,n,.

(el)'], . . where . .ei

.

(~,

~n),i

E2. Let xl = (A, 0 , 1 , 0 ) , x2 = ( 0 , # , 0 1 ) , xa = (0, 1, A, 0,), x4 = ( 1 , 0 , 0 , # ) , with 1 < A < # . T h e n X = (xi) 4 g e n e r a t e s a s u b m o d u l o ~ d o f D 4. We l e a v e i t to the reader to check that it is not simple. Note that not all bases of a simple pseudomodule satisfy the simplicity conditions $1, $2 above. Hence when it does, such a basis is called a canonical basis. In [131 the following results are proved. Proposition 2.9 If X -- ( x l ) , e i , and Y = (Yj)jEJ are two canonical bases for some simple finite dimensional pseudomodule M ~ then there is a semilattice isomorphism ~0 : X + ---* Y + . Proposition 2.10 T h e semilattice X + generated by a canonical basis X of a simple finite dimensional pseudomodule M characterizes M up to a pseudomodule isomorphism. 3. S a t u r a t e d

linear combinations

The following concept of saturated combination will be used. The definition just requires the existence of a set of generators. Let M be a pseudomodule (or a modulo~d) generated by a set X = (zdiel. Definition 3.1 We say that x = ~ Ajxj (where J C t ) is a saturated combination of x, if the following two

jEJ

conditions hold : 1: Vj E J VA > Aj, x < x+ Axj,

2. Vi q~ J , V A E D*, x < x + Axi.

982 Proposition 3.2 In a finite dimensional modulo'id (pseudomodule) M over a complete dioid (pseudoring) D , saturated combinations always exist and are unique. Proof Let x E M , V x k E X , l e t F-k = {A E D I Axk ~ x } . We h a v e 0 E --k, h e n c e E k # (3. Since Ek is bounded and D is complete, 3Ak = Sup {~k} E --k. Let I ( = {i E I [ ;~i # {0}}. Then x = y~ Akxk is a saturated combination for x. Uniqueness is straightforward. kEK

When I is finite (say I = { 1 , . . . n}), a saturated combination is easily completed to a maximal combination involving all the x i ' s i = 1, . . . , n by setting Ai = 0 for i 6 K . 4. ModuloYds and t h e lattice s t r u c t u r e

Let M be a finite dimensional D-moduloid (rasp. a D-pseudomodule). Theorem 1 If saturated linear combinations exist and are unique, then M is a lattice. Proof Let x, y E M .

Just complete the saturated combinations for x, y given by Proposition 3.2

above into maximal combinations x = ~ Aixi, y = ~ i=1

Then z = ~

YiZi

=

for if

X A y,

u < x,

#ixi,

and let

Vi ~--- /~i

A/-zi ,i = 1 , . . . , n .

i=l

and u _< y for some u E

M,

then the (completed)

i=1

saturated combination u = ~

T]ix i

necessarily verifies

qi 0 B(r(i,j), j} = t(i, j) pij = 1 B ( r ( i , j ) , m+i) = t(i, ji , qlj = 1 c(j, r ( i , j ) ) = 0 , ~ij = 0 C(m+i, r(i,j)) = 0 , ~ij : 0

IV.

DYNAMIC

BEHAVIOR

OF

CLOSED-LOOP

(24 (25

= 0

(26 , (27 (28 (29 (30} (31)

SYSTEM

F o r a n x m FMS, if t i m e - c o n s u m i n g m a t r i x T, column sequence m a t r i x P, r o w sequence matrix Q and state variables distribution matrix K are given, then column directional matrix ~, row directional matrix Q and row permutation m a t r i x S c a n be d e r i v e d . The psuedo-linear system model can be established by using rules (24)-(31}, and its dynamic behavior can be determined from the eigenvalue a n a l y s i s of c l o s e d - l o o p system. L i t e r a t u r e s [ 2-s] s h o w that the eigenvalue is j u s t t h e s t e a d y - s t a t e running rhythm of closed-loop system, so w e p u t e m p h a s i s u p o n e i g e n v a l u e analysis of s y s t e m . N o w s o m e r e l a t i v e t h e o r e m s a r e g i v e n . Definition 6 [z] F o r a s q u a r e m a t r i x A, if t h e r e e x i s t a r e a l n u m b e r A a n d a v e c t o r V w h i c h e n t r i e s a r e n o t a l l e q u a l to 8, s u c h t h a t A V = A V, t h e n A is c a l l e d e i g e n v a l u e a n d V is c a l l e d eigenvector.

989 Theorem 1 [z] The eigenvalue of a n i r r e d u c i b l e square matrix j u s t is t h e a v e r a g e w e i g h t of c r i t i c a l circuit in g r a p h G ( A ) . The eigcnvalue of a n x n s q u a r e m a t r i x A in m a x - a l g e b r a sense c a n be c a l c u l a t e d as f o l l o w ; A = v-~ {32)

A

i=i

where

~' = ~ ~ ~z ~ ... 8 ~" 9 Definition 7 [2] . A s y s t e m is s t a b l c i f f t h e r e number A , such that for all initial conditions and i, it h o l d s t h a t l i m [XI (k)] *''k = A 9

(33) exists a real all subscript

Theorem 2 [=] . A s y s t e m is s t a b l e i f f t h e r e e x i s t a real n u m b e r A, i n t e g e r d a n d k0, s u c h t h a t f o r a l l i n i t i a l c o n d i t i o n s , it h o l d s t h a t X ( k + d ) = A d X(k) , ~ k ) k0 . The above definitions and theorems suggest a relation between the eigenvalue and the stead-state running rhythm of system. N o w w e t a k e s o m e s i m p l e F M S as t h e i l l u s t r a t i o n s . Example i. In a f o l l o w - s h o p FMS with three jobs processing on two machines, its p a r a m e t e r matrixs a r e g i v e n as f o l l o w :

T=

P= 6

5

O=

4

2

2

2

1

Assuming that the capacity of b u f f in m a c h i n e t h e n t h e m a t r i x Ao , Al., B, C, Ao* a n d M in 12) determined as f c l l o w b y u s i n g (16)-(23}

A0

=

~ 3 . . 0 . . . . . . 9 5 . 5 . 4.4

. . . . .~ ] ~q B = . . 3 . _|

.

.

At

=

.

.

!!!iliI

.

.

A0*=

.

0 .... 0 C . . . . . . 0 0 . . . . . 0

M

2

3

Mz is b~ a n d (14}

=

0, are

..... 0

12 5 0 5 0 6 . 0 125 . 5 0 1 6 9 4 9 4 0

2 2 . 0 9 9 ~ 7 14 14 12 12

=

8

8

14 18

14 18

6 12 16

where

12 16

t h e d o t in m a t r i x d e n o t e s 6 . L i t e r a t u r e [~] s h o w t h a t t h e e i g c n v a l u e of c l o s e d - l o o p system with infinite b u f f e r is A = 15. In this example, the eigenvalue of c l o s e d - l o o p system (14) is A = 16, which shows that blocking phenomenon leads to the i n c r e a s e in the steady-state running rhythm of system and the system model with finite buffer can given more suitable description for maunfactuing system 9 Example 2. In t h i s e x a m p l e , we consider a c a s e of 3 o b s h o p manufacturing s y s t e m . Its p a r a m e t e r m a t r i x T, P, Q, E, ~, Q a n d S a r e g i v e n as f o l l o w ;

T

=

p = 6

5

4

Q = 2

2

!

3

2

1

990

[: .o.-3] [2 2 o] R=

B'= 4

5

[2 3 o]

[~. 2 ~]

~=

6

0

0

s =

1

0

1

2

3

2

!

Assuming that the capacities of b u f f e r in m a c h i n e M~ a n d M~ bl = bz = 2, t h e n m a t r i x Ao, Ai , B, C, Ao* a n d M in ( 1 2 ) - ( 1 4 ) determined as f o l l o w b y u s i n g r u l e s ( 2 4 ) - ( 3 1 ) .

arc are

~

~

~0

~

.

r~

~

AI

Ao*

=

|i0

3

1

5

[:!,:

0

-' . 0

5 0

0

B

=

.

.

.

2 2 9 9 7 12 12 I0 20 20 18 14 14 12 4 4

0

2..2 7

. . . . .

C =

.

9

0

.

M

0

=

0 0

]

.

The

cigcnvalue of c l o s e d - l o o p s y s t e m is A = 20. w h e n bt < 2 (i = 1,2), Ao is n o t a n i l p o t e n t r e s u l t s in d e a d l o c k p h e n o m e n o n 9 E x a m p l e 3. In this example, m a t r i x Q, ~ a n d e x a m p l e is c h a n g e d i n t o

Q =

~_ : 3

2

=

0

1

=

3 in m a c h i n e C, A0, a n d (24)-(31)

Ao

2

1

Mi a n d M~ arc M in ( 1 2 ) - ( 1 4 )

* =

35 o 3 6 11 o ; : : 5

C

=

.

.

o 5

.

.

. :2 9 12

o..] . . . .

The

above

the

01j iii?ii I 50

~: :::~| A,

6;.. 6; "o''"

2.~:~ :] =

S in

2

.

B

which

s =

1

Assuming t h a t t h e c a p a c i t i e s of b u f f e r bl = bz = I, t h e n t h e m a t r i x Ao , A~ , B, arc dctermincd as f o l l o w b y u s i n g r u l e s

A0

matrix,

0

.]

o

0

.

0

. 0

M

=

12 7 12 . 18 15 18 12 9 12 4 4

.

eigenvalue of c l o s e d - l o o p s y s t e m is ~ = 15. W h e n b~ < 1 (i = 1,2), Ao is n o t a n i l p o t c n t m a t r i x , which results in deadlock phenomenon. In job-shop manufacturing system, cloumn sequence matrix P usually is g i v e n a c c o r d i n g to t h e t e c h n o l o g i c a l requirements of jobs, but w e c a n c h o s e the s e q u e n c e of j o b s o n e a c h m a c h i n e to reduce the steady-state running period. The difference of d y n a m i c

991 behavior of closed-loop system between different routings has b e e n s h o w n in the l a s t two e x a m p l e s 9 E x a m p l e 4. In t h i s e x a m p l e , t h e m o r e g e n e r a l c a s e of j o b s h o p is c o n s i d e d . Its p a r a m e t e r m a t r i x T, P, Q, K, P, Q a n d S a r e g i v e n as f o l l o w :

T=

3 [

4

a =

P--

22 3-1

3-1

4 7 -

~ =

3 0

3 0 0 -I

o _ - - 1 2 3 1 2-1

~ :

2 2

3 0 0 -i

s :

i 1

Assuming t h a t the c a p a c i t i e s of b u f f e r in m a c h i n e a r e bz = b3 = 0, t h e n t h e m a t r i x A0 , AI , B, C, A0* a n d (14) a r e d e t e r m i n e d as f o l l o w b y u s i n g r u l e s ( 2 4 ) - ( 3 1 )

AG

=

9

,

5

. . 0

2

. 2

9

.

9

1

9

o

0

.

.

--

.

.

.

.

5

.

.

.

.

.

.

I

9

1

.

.

.

.

.

.

.

.

.

.

3

.

.

.

c

4

0

83030 4. . 0 .J

5

.~

o

.

]

Ao*=

At

3

.

.

~

.

Nz a n d M3 M in (12)-

00

]

~ . 0

3 B

.

. . .

"2 2 -

. .

.

.

.

.

.

0

.

.

.

9

0

.

.

.

M

.

0 .

.

.

0

=

6

.

.

5

3

-

3 3 0 q I0 i0 9 9 . . Ii II 7 7 i ; i [ I0 I0 .

0

.

.

.

.

. 0

.

.

7 3 . 3 0

.

.

.

The eigenvalue of the c l o s e d - l o o p system is A = Ii and the period-order is d = I. Now, we consider the influence of the c a p a c i t y of b u f f e r o n t h e d y n a m i c b e h a v i o r of c l o s e d - l o o p s y s t e m . In t h i s e x a m p l e , w h e n the c a p a c i t y of b u f f e r in m a c h i n e M2 is i n c r e a s e d to bz = I, the eigenvalue of c l o s e d - l o o p s y s t e m is A = I0. A n d w h e n t h e c a p a c i t y of b u f f e r in m a c h i n e M3 is a l s o i n c r e a s e d to b3 = bz = i, then the eigenvalue a n d the p e r i o d - o r d e r of c l o s e d - l o o p system are c h a n g e d i n t o A = 9.5, d = 2, w h i c h is s a m e to the c a s e of s y s t e m w i t h i n f i n i t e b u f f e r s h o w n in l i t e r a t u r e [7]. T h u s , d e s i g n of t h e capacities of b u f f e r in e a c h m a c h i n e is a p r o b l e m of c o n s i d e r a b l e i n t e r e s t in f l e x i b l e m a n u f a c t u r i n g system.

V.

CONCLUSIONS

In this paper, Modelling o n n x m F M S is a n a l y s e d by using p a t h a l g e b r a 9 B a s e d o n the p s u e d o - l i n e a r system model with finite buffer, the d y n a m i c b e h a v i o r of c l o s e d - l o o p s y s t e m is analysed from the eigcnvalue of s y s t e m 9 The illustrative examples show that this new model can give a more suitable description for manufacturing system 9

992 REFERENCES

[I]

Ho, Y.C. and Cassandras, D i s c r e t e - E v e n t Systems, on D e c i s i o n and Control,

[2]

Cohen, G., Dubois, D., Quadrat, J.P. and Viot, M., A LinearS y s t e m - T h e o r e t i c View of D i s c r e t e - E v e n t Processes, Proc. of 22nd IEEE C o n f e r e n c e on D e c i s i o n and Control, San Antonio, Texas, 1983, pp. I039-I044.

[3]

Cohen, G., Moller, P., Quadrat, J.P. and Viot, M., Linear System T h e o r y for Discrete Event Systems, Proceedings of 23rd IEEE Conference on D e c i s i o n and Control, Las Vegas, Nevada, 1984, pp.539-544.

[4]

Cohcn, G., Dubois, D., Quadrat, J.P. and Viot, M., L i n e a r System-Theoretic V i e w of D i s c r e t e - E v e n t P r o c e s s e s and Its Use for P e r f o r m a n c e E v a l u a t i o n in Manufacturing, IEEE Trans. on A u t o m a t i c Control, AC-30, 1985, p p . 2 1 0 - 2 2 0

[5]

Xu Xinhe, Decision,

[5]

Xu Xinhe, Modelling and D y n a m i c Analysis for A class Discrete-Event Systems, J. of Northeast University Technology, No.3, 1987.

[7]

Dubois, D. and Stecke, K., U s i n g Petri Nets to R e p r e s e n t P r o d u c t i o n processes, P r o c e e d i n g s of 22nd IEEE C o n f e r e n c e on Decision and Control, San Antonio, Texas, 1983, p p . i 0 6 2 - ! 0 6 7

C., C o m p u t i n g C a s t a t e Variables for P r o c e e d i n g s of 19th IEEE C o n f e r e n c e Albuquerque, 1980, pp.697-700.

Linear Discrete Event Systems, No.3, No.4, 1987.

J. of Control and

of of

Analysis and Optimization of Systems: Proceedings of the 9th International Conference, Antibes, June 12-15, 1990 (Lecture Notes in Control and Information Sciences) (English and French Edition)

Recent Advances in Control and Optimization of Manufacturing Systems (Lecture Notes in Control and Information Sciences)

Singular Control Systems (Lecture Notes in Control and Information Sciences)

Networked Control Systems (Lecture Notes in Control and Information Sciences)

Analysis and Synthesis of Networked Control Systems (Lecture Notes in Control and Information Sciences)

Advances in Cooperative Control and Optimization: Proceedings of the 7th International Conference on Cooperative Control and Optimization (Lecture Notes in Control and Information Sciences)

Topics in Time Delay Systems: Analysis, Algorithms and Control (Lecture Notes in Control and Information Sciences)

ICAOS '96 12th International Conference on Analysis and Optimization of Systems: Images, Wavelets and PDE's, Paris, June 26-28, 1996. (Lecture Notes in Control and Information Sciences)

Analysis and Synthesis of Dynamical Systems with Time-Delays (Lecture Notes in Control and Information Sciences)

Modelling, Analysis and Design of Hybrid Systems (Lecture Notes in Control and Information Sciences)

Active Fault Tolerant Control Systems: Stochastic Analysis and Synthesis (Lecture Notes in Control and Information Sciences)

Control and Estimation of Systems with Input Output Delays (Lecture Notes in Control and Information Sciences)

Ensuring Control Accuracy (Lecture Notes in Control and Information Sciences)

Stability and Control of Time-delay Systems (Lecture Notes in Control and Information Sciences)

Advances in the Control of Nonlinear Systems (Lecture Notes in Control and Information Sciences)

Control of Indefinite Nonlinear Dynamic Systems: Induced Internal Feedback (Lecture Notes in Control and Information Sciences)

Resilient Control of Uncertain Dynamical Systems (Lecture Notes in Control and Information Sciences)

Control of Uncertain Systems with Bounded Inputs (Lecture Notes in Control and Information Sciences)

The Confluence of Vision and Control (Lecture Notes in Control and Information Sciences)

State-Space Models of Lumped and Distributed Systems (Lecture Notes in Control and Information Sciences)

Feedback Control, Nonlinear Systems, and Complexity (Lecture Notes in Control and Information Sciences)

Modeling and Identification of Linear Parameter-Varying Systems (Lecture Notes in Control and Information Sciences)

Variational and Hamiltonian Control Systems (Lecture Notes in Control and Information Sciences)

Optimal Control of Constrained Piecewise Affine Systems (Lecture Notes in Control and Information Sciences)

Approximate and Noisy Realization of Discrete-Time Dynamical Systems (Lecture Notes in Control and Information Sciences)

Modelling, State Observation and Diagnosis of Quantised Systems (Lecture Notes in Control and Information Sciences)

Fuzzy Control and Filter Design for Uncertain Fuzzy Systems (Lecture Notes in Control and Information Sciences)

Nonlinear Analysis and Synthesis Techniques for Aircraft Control (Lecture Notes in Control and Information Sciences)

Linear Time Varying Systems and Sampled-data Systems (Lecture Notes in Control and Information Sciences)

Assessment and Future Directions of Nonlinear Model Predictive Control (Lecture Notes in Control and Information Sciences)

Limited Data Rate in Control Systems with Networks (Lecture Notes in Control and Information Sciences)

Analysis and Optimization of Systems: Proceedings of the 9th International Conference, Antibes, June 12-15, 1990 (Lecture Notes in Control and Information Sciences) (English and French Edition)

Recent Advances in Control and Optimization of Manufacturing Systems (Lecture Notes in Control and Information Sciences)

Singular Control Systems (Lecture Notes in Control and Information Sciences)

Networked Control Systems (Lecture Notes in Control and Information Sciences)

Analysis and Synthesis of Networked Control Systems (Lecture Notes in Control and Information Sciences)

Advances in Cooperative Control and Optimization: Proceedings of the 7th International Conference on Cooperative Control and Optimization (Lecture Notes in Control and Information Sciences)

Topics in Time Delay Systems: Analysis, Algorithms and Control (Lecture Notes in Control and Information Sciences)

ICAOS '96 12th International Conference on Analysis and Optimization of Systems: Images, Wavelets and PDE's, Paris, June 26-28, 1996. (Lecture Notes in Control and Information Sciences)

Analysis and Synthesis of Dynamical Systems with Time-Delays (Lecture Notes in Control and Information Sciences)

Modelling, Analysis and Design of Hybrid Systems (Lecture Notes in Control and Information Sciences)

Active Fault Tolerant Control Systems: Stochastic Analysis and Synthesis (Lecture Notes in Control and Information Sciences)

Control and Estimation of Systems with Input Output Delays (Lecture Notes in Control and Information Sciences)

Ensuring Control Accuracy (Lecture Notes in Control and Information Sciences)

Stability and Control of Time-delay Systems (Lecture Notes in Control and Information Sciences)

Advances in the Control of Nonlinear Systems (Lecture Notes in Control and Information Sciences)

Control of Indefinite Nonlinear Dynamic Systems: Induced Internal Feedback (Lecture Notes in Control and Information Sciences)

Resilient Control of Uncertain Dynamical Systems (Lecture Notes in Control and Information Sciences)

Control of Uncertain Systems with Bounded Inputs (Lecture Notes in Control and Information Sciences)

The Confluence of Vision and Control (Lecture Notes in Control and Information Sciences)

State-Space Models of Lumped and Distributed Systems (Lecture Notes in Control and Information Sciences)

Feedback Control, Nonlinear Systems, and Complexity (Lecture Notes in Control and Information Sciences)

Modeling and Identification of Linear Parameter-Varying Systems (Lecture Notes in Control and Information Sciences)

Variational and Hamiltonian Control Systems (Lecture Notes in Control and Information Sciences)

Optimal Control of Constrained Piecewise Affine Systems (Lecture Notes in Control and Information Sciences)

Approximate and Noisy Realization of Discrete-Time Dynamical Systems (Lecture Notes in Control and Information Sciences)

Modelling, State Observation and Diagnosis of Quantised Systems (Lecture Notes in Control and Information Sciences)

Fuzzy Control and Filter Design for Uncertain Fuzzy Systems (Lecture Notes in Control and Information Sciences)

Nonlinear Analysis and Synthesis Techniques for Aircraft Control (Lecture Notes in Control and Information Sciences)

Linear Time Varying Systems and Sampled-data Systems (Lecture Notes in Control and Information Sciences)

Assessment and Future Directions of Nonlinear Model Predictive Control (Lecture Notes in Control and Information Sciences)

Limited Data Rate in Control Systems with Networks (Lecture Notes in Control and Information Sciences)

Recommend Documents