Lecture Notes in Control and Information Sciences Editors: M. Thoma · M. Morari
275
Springer Berlin Heidelberg NewYor...
22 downloads
830 Views
7MB Size
Report
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form
Lecture Notes in Control and Information Sciences Editors: M. Thoma · M. Morari
275
Springer Berlin Heidelberg NewYork Barcelona Hong Kong London Milan Paris Tokyo
Hideaki Ishii, Bruce A. Francis
Limited Data Rate in Control Systems with Networks With 80 Figures
13
Series Advisory Board
A. Bensoussan · P. Fleming · M.J. Grimble · P. Kokotovic · A.B. Kurzhanski · H. Kwakernaak · J.N. Tsitsiklis
Authors Dr. Hideaki Ishii University of Illinois Coordinated Science Laboratory 1308 West Main Street 61801 Urbana USA
Professor Bruce A. Francis University of Toronto Electrical and Computer Engineering M55 1A4 Toronto, On Canada
Cataloging-in-Publication Data applied for Die Deutsche Bibliothek – CIP-Einheitsaufnahme Limited data rate in control systems with networks / Hideaki Ishii ; Bruce A. Francis (ed.). – Berlin ; Heidelberg ; NewYork ; Barcelona ; Hong Kong ; London ; Milan ; Paris ; Tokyo : Springer, 2002 (Lecture notes in control and information sciences ; Vol. 275) isbn 3-540-43237-X
isbn 3-540-43237-X Springer-Verlag Berlin Heidelberg New York This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in other ways, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution act under German Copyright Law. Springer-Verlag Berlin Heidelberg New York a member of BertelsmannSpringer Science + Business Media GmbH http://www.springer.de © Springer-Verlag Berlin Heidelberg 2002 Printed in Germany The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Typesetting: Digital data supplied by author. Data-conversion by PTP-Berlin, Stefan Sossna e.K. Cover-Design: design & production GmbH, Heidelberg Printed on acid-free paper spin 10867690 62/3020uw - 5 4 3 2 1 0
Preface This book is an attempt to incorporate data rate issues that arise in control design for systems involving communication networks.
The general setup of
the problems considered here is that, given a plant, a communication channel with limited data rate, and control ob jectives, ÿnd a controller that uses the channel in the feedback loop such that the overall system achieves the control ob jectives.
The theoretical question of interest is to ÿnd the minimum data
rate necessary for the channel. The motivation for this study comes from the recent growth in communication technology. The use of networks has become common practice in many control applications, connecting sensors and actuators to controllers. One of the ob jectives of the book is therefore to provide some basics of the networks used in control systems. From the theory side, we have been stimulated by the increasing attention devoted to the ÿeld of hybrid systems. The systems studied in this book can be viewed as such systems:
The plant has continuous dynamics and, due to
the limited data rate in the channel, there is some discrete decision making in the controller on what information to transmit and/or when to send it. Of the issues related to data rate, our focus is on the use of networks in distributed systems and on quantization in messages sent over networks. In the ÿrst part of the book, we develop a time sequencing technique for a distributed control system where a network cable connects local controllers. A simple network model with some features of practical protocols is proposed, and stabilizability of such systems is addressed.
It is shown that the use
of a network can enlarge the class of plants to be stabilized.
Moreover, we
conÿrm the intuition that properly increasing the data transmission rate over the network can enhance the capability of the systems. The second part of the study deals with a control system that has a ÿnite data rate channel in the feedback loop. Messages are sent from the sensor side to the actuator side periodically and can take only a ÿnite number of bits; there is time delay associated to the rate as well. We propose controller design methods for a continuous-time, linear plant to achieve quadratic stability in the continuous-time domain.
As a ÿrst step to model such a channel, we
consider the sampled-data setup where simply a sampler and a hold are put into the loop. The problem is to ÿnd the largest sampling period for stability.
Preface
vi
The next step is to extend the results for a system that has a quantizer in the controller, in addition to a sampler and a hold.
The quantizer design allows
us to calculate the number of bits in the messages.
As the ÿnal step, we
give a simple analysis on the delay time that the system can tolerate without becoming unstable.
A fairly general treatment of quantizers is developed,
and this enables us to compare the data rate necessary for diþerent types of quantizers in an example. To further reduce the data rate, we also show several variations of the quantized sampled-data system. These systems involve more hybrid decision making, unlike the simple sampling scheme in the original problem. The book is based on the Ph.D. thesis of the ÿrst author.
Acknowledgements
:
We wish to thank Ted Davison, Raymond Kwong, Steve
Morse, and Murray Wonham, who served on the thesis committee of the ÿrst author, for their helpful comments and suggestions.
We also wish to thank
Mireille Broucke and Daniel Liberzon for stimulating discussions. The ÿrst author would like to thank Yutaka Yamamoto, who introduced him to the theory of digital control in the ÿrst place.
He is also grateful to
Sean Bourdon and Guangdi Hu for sharing ideas during the course of his study in Toronto. Finally, the ÿnancial support from the Natural Sciences and Engineering Research Council of Canada and in part the National Science Foundation through the University of Illinois at Urbana-Champaign is gratefully acknowledged.
Contents
Preface
v
Notation
1
2
Introduction
1
1.1 Limited data rate . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Outline and contributions . . . . . . . . . . . . . . . . . . . . .
2 3
Control networks
7
2.1 Control over networks . . . . . . . . 2.2 How control networks work . . . . . 2.2.1 Periodic pattern . . . . . . . 2.2.2 Medium access methods . . . 2.2.3 A protocol example . . . . . 2.3 Application examples . . . . . . . . . 2.3.1 Jacking systems of train cars 2.3.2 Networks on automobiles . . 2.3.3 Process control . . . . . . . . 3
4
xi
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
The switch box problem . . . . . . . . . Preliminaries . . . . . . . . . . . . . . . Stabilization using PTV local controllers Assignability measure analysis . . . . . . Multiple mobile robot example . . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
4.1 Dwell-time switched systems and their stability . 4.2 Quadratic stabilization of sampled-data systems . 4.2.1 Problem formulation . . . . . . . . . . . . 4.2.2 Control Lyapunov function approach . . . 4.2.3 Bounds on trajectories . . . . . . . . . . . 4.2.4 Solution to the sampled-data problem . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
. . . . . .
Distributed control over networks
3.1 3.2 3.3 3.4 3.5
. . . . . . . . .
Finite data rate control | single-input case
7 8 8 9 10 12 12 13 14 15
15 17 19 23 25
33
33 35 35 37 40 42
Contents
vii
4.3
4.4 4.5
Quantized sampled-data control . . . . . . . . . . . . . . . . . . Problem formulation . . . . . . . . . . . . . . . . . . . .
48
4.3.2
A suÆcient condition for stability . . . . . . . . . . . . .
49
4.3.3
Solution to the quantized sampled-data problem
Finite quantizers
. . . .
51
. . . . . . . . . . . . . . . . . . . . . . . . . .
60
Control over a ÿnite data rate channel . . . . . . . . . . . . . .
63
4.5.1
Data rate for control . . . . . . . . . . . . . . . . . . . .
63
4.5.2
Time delay analysis
64
4.5.3
Time delay and quantization
ÿ
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
68
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
70
4.6
Design of
4.7
Magnetic ball levitation example
. . . . . . . . . . . . . . . . .
5 Towards data rate reduction 5.1
47
4.3.1
72
83
Time-varying quantization . . . . . . . . . . . . . . . . . . . . .
83
5.1.1
83
Problem formulation . . . . . . . . . . . . . . . . . . . .
5.1.2
A switching law . . . . . . . . . . . . . . . . . . . . . . .
85
5.1.3
Magnetic ball levitation example continued
88
. . . . . . .
5.2
Nonuniform sampling
. . . . . . . . . . . . . . . . . . . . . . .
89
5.3
Dwell-time switching control . . . . . . . . . . . . . . . . . . . .
90
5.4
5.3.1
Dwell-time switched systems with logarithmic partitions
91
5.3.2
Problem formulation . . . . . . . . . . . . . . . . . . . .
93
5.3.3
Hybrid automata representation
5.3.4
Solution to the dwell-time switching problem
. . . . . . . . . . . . .
98
Finite partition dwell-time switching . . . . . . . . . . . . . . .
101
5.4.1
Dwell-time switched systems with ÿnite partitions
. . .
102
5.4.2
State feedback control . . . . . . . . . . . . . . . . . . .
103
5.4.3
State feedback under measurement noise . . . . . . . . .
104
5.4.4
Observer-based output feedback
. . . . . . . . . . . . .
106
5.4.5
Cart-pendulum system example . . . . . . . . . . . . . .
109
6 Extensions for the multiple input case 6.1
6.2
6.3
94
. . . . . .
117
Quadratic stabilization of sampled-data systems . . . . . . . . .
117
6.1.1
117
Problem formulation . . . . . . . . . . . . . . . . . . . .
6.1.2
Generalization of the setup
. . . . . . . . . . . . . . . .
119
6.1.3
Bounds on trajectories . . . . . . . . . . . . . . . . . . .
122
6.1.4
Solution to the sampled-data problem
. . . . . . . . . .
125
Quantized sampled-data control . . . . . . . . . . . . . . . . . .
128
6.2.1
Problem formulation . . . . . . . . . . . . . . . . . . . .
128
6.2.2
Solution to the quantized sampled-data problem
. . . .
129
Dwell-time switching control . . . . . . . . . . . . . . . . . . . .
140
6.3.1
Dwell-time switched systems with logarithmic partitions
140
6.3.2
Problem formulation . . . . . . . . . . . . . . . . . . . .
143
6.3.3
Solution to the dwell-time switching problem
144
. . . . . .
Contents 6.4
Design of 6.4.1
6.5
ix
S
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
A suÆcient condition for
S>I
. . . . . . . . . . . . . .
6.4.2
Extension of the design method for
6.4.3
A design method for
S>I
ÿ
. . . . . . . . . .
149
. . . . . . . . . . . . . . . .
150
Two cart-pendulum system example
. . . . . . . . . . . . . . .
7 Conclusion Bibliography
147 148
152
161 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
163
Symbol Index
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
170
Subject Index
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
172
Notation R R+ C Z Z+
set of real numbers nonnegative subset of set of integers nonnegative subset of
2C complement of X closure of X interior of X
Re z
X
real part of
c
X) X)
cl(
int( 2X
X B
Z
z
family of all subsets of
?
span
R
set of complex numbers
X
X
orthogonal complement of span of a subset
S
S
of a vector space
n
ball of radius
0
transpose of
A
ÿ
complex-conjugate transpose or adjoint of
rank A
rank of
Im A
image of
A
Ker A
kernel of
A
A
ÿ (A);
þ(A);
kAk kxk
fÿ fþ
g
k (A)
diag(a1 ; : : :
dxe bxc
g
k (A)
r >
0 centered at
2R
x (r )
A
set of singular values of
A
2C
mþn
max = ÿ1 ÿ ÿ2 ÿ þ þ þ ÿ ÿmin
ÿ
in nonincreasing order:
fm;ng
set of eigenvalues of A
Euclidean norm of
(A; B; C; D )
A
A
spectral norm of ; an )
x
x
=
ÿ
min
A
2R
n
diagonal matrix with diagonal entries
1
a ; : : : ; an
the smallest integer equal to or larger than
x
the greatest integer equal to or smaller than
x
linear time-invariant system with state equations
where
x
is the state,
u
x
=
Ax
+ Bu;
y
=
Cx
+ Du;
is the input, and
y
is the output.
Chapter 1 Introduction In control systems, communication arises when signals are sent from sensors to controllers and then from controllers to actuators. The systems involved in the communication are often digital. Thus, the signals take quantized values at discrete-time instants, and there is time delay inherent in transmission. A traditional assumption made on such communication in control is that there is no quantization and no time delay. the theoretical treatment of systems.
This can signiÿcantly simplify
Moreover, this assumption has been
justiÿed since, in practice, dedicated cables have been used to connect system components.
Data rate
is one of the basic characteristics of communication
systems and is the rate at which bits are transmitted over a channel in bit/sec or bps. Under the traditional view, the channel requires inÿnite data rate. Recently, as a result of the vast progress in communication technology, the use of digital, serial networks has become popular in large-scale control systems. Networks used for the connection of sensors/actuators and controllers are called
control networks or ÿeldbuses . Nowadays, many advanced protocols
are available for such networks.
While this change in hardware oþers new
potential for applications, it suggests that the ÿniteness of the data rate in communication may now be relevant. Since the channel is shared by multiple sources of data, the total amount of communication expected in the system must be known and must not exceed the physical bound on the data rate. Furthermore, in control systems, the real-time requirement is critical, and the time delay in communication has to be minimal. Consequently, it is important that the channel be shared in an eÆcient manner, or otherwise the system will not operate properly. On the other hand, the use of networks in control poses questions that diþer in nature from the traditional views. For example, (i) what is a better strategy to reduce the data rate, to increase the sampling period or to decrease the number of bits per message? (ii) In a large-scale system, network cables may oþer data exchange between system components where there was no cabling before, even at a very low transmission rate; how can such communication enhance the performance of the system?
H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 1−6, 2002. Springer-Verlag Berlin Heidelberg 2002
2
Chapter 1.
Introduction
Data rate issues in control are also related to systems in which communication capability is constrained due to limited sources of energy or stealth reasons.
Examples of such systems are planetary rovers, multi-agent mobile
robots, unmanned autonomous vehicles, and so on. The aim of this book is to develop theories for control systems under data rate limitation.
This study has been particularly motivated by the recent
popularity in control networks and hence incorporates some of the practical aspects of this technology. The underlying theoretical interest is, however, to investigate what the minimum data rate is in control to achieve given speciÿcations.
1.1
Limited data rate
In this section, we discuss problems that arise in control systems with limited data rate and give a survey on the literature related to this topic.
We focus
on issues such as time sequencing, quantization, time delay, and distributed control. In serial networks, messages are multiplexed in time.
Through time se-
quencing of messages, all system components can share the same network cable.
On the other hand, all messages have to be quantized, or be repre-
sented by a ÿnite number of bits. two techniques:
Clearly, there is a trade-oþ between these
Given a ÿxed rate, one can use fewer bits per message and
increase the number of messages per time, and vice versa. Brockett [11] initiated an interesting study concerning time sequencing and proposed a simple model of control networks and a design method of a central controller connected to sensors/actuators by a network.
Because of
the limitation on data rate, only one of the senor/actuator pairs is allowed to communicate with the controller at each discrete instant. Quantization has been studied in the context of digital control and signal processing. Typically, quantizers round oþ the input signals uniformly with a ÿxed step size. In many previous works, the error caused by such quantization is modeled as noise, often uniform and white [71].
On the other hand, it
is known that, even in a simple feedback setup, interesting and complicated behaviors such as limit cycle and chaotic tra jectories can be observed [17, 84]. Many of the recent deterministic treatments of quantizers in feedback systems follow the inýuential paper by Delchamps [24]. Elia and Mitter [27, 28] proposed a stabilization technique for linear systems; a nonuniform quantizer is designed as a result of an optimization problem. Brockett and Liberzon [12] studied the use of uniform but time-varying quantizers.
In [94], coding and
time delay inherent in channels of networked control systems are explicitly taken into account, and a new notion of stability is introduced. There are several types of time delay characteristic of serial networks. One is called the transmission time. This is the time it takes for a transmitter to send out data:
To send
N
bits of data over a
D
bps channel, it takes
N=D
seconds. Another one is a consequence of time sequencing and is the waiting
1.2.
3
Outline and contributions
time when a node has to send out a message, but cannot due to a busy channel; this type of delay is by nature time-varying and, thus, diÆcult to determine. It is known that delay in a feedback loop can result in instability of the system, and there is a fairly large body of research concerning time-delay systems; recent works relevant to control over networks can be found in [62, 70, 98]. One application example is bilateral control of telerobotic systems (e.g., [3]), which often operate over the Internet. To reduce the amount of traÆc over the network, control functions often have to be distributed over a system.
The controllers in such systems are
local in the sense that they can measure only a limited number of outputs and can control only certain actuators. Control of distributed systems has been studied in the area of decentralized control [55, 77]. It is known that this structure may impose a severe limitation on the performance of such systems. The idea of information exchange among local controllers has been long studied (e.g., [64, 90]). However, there are not many works that deal with realistic, practical models of networks incorporating bandwidth issues [99]. Other studies dealing with data rate issues include state estimation problems [23, 60, 93] and control of linear quadratic Gaussian systems [58, 81].
1.2
Outline and contributions
In this book, we study the eþect of data rate limitation in control systems using networks. Among the issues listed in the previous subsection, we are interested in the time sequencing and quantization aspects involved in networks. The outline of the book is given as follows. In Chapter 2, we highlight some practical issues related to control networks. We ÿrst describe the characteristics of the traÆc in control systems to show how the requirements for control networks are diþerent from those for data networks. Second, the advantages of using networks in application are given. Third, common features in control networks protocols are studied brieýy, so as to provide some relations of the studies in the following chapters to practical applications. Finally, some industrial examples are given. Chapter 3 deals with a distributed control problem, where a plant is controlled by two local controllers (see Fig. 1.1).
A network cable is used to
connect the controllers so that local data can be shared. The assumption due to the limited data rate is that only one controller can transmit a message at a time. We solve the problem of ÿnding the sequencing of messages over the network to stabilize the system. It is then shown that the capability of the system can be enhanced in some sense if the traÆc is properly increased. In Chapters 4, 5, and 6, we study control methods for a system that has a ÿnite data rate channel in the feedback loop.
We are motivated by the
problem of ÿnding the minimum data rate required to achieve speciÿed control objectives.
In particular, quadratic stabilization methods for a continuous-
time, linear time-invariant plant are developed.
4
Chapter 1.
Introduction
plant
controllers
interfaces network Figure 1.1: Distributed control system
In Chapter 4, we ÿrst give a class of hybrid systems, which provides a uniform stability criterion for the systems in the following sections and chapters.
Given a stabilizing controller
K
for the single-input, continuous-time
plant represented by (A; B ) in Fig. 1.2(a), as a ÿrst step to model the digital channel, we insert a sampler
T and a zeroth order hold
S
T in the loop as in
H
Fig. 1.2(b). The ÿrst problem is to quadratically stabilize the sampled-data system, and we derive a bound on the sampling period
T
which guarantees a
prespeciÿed decay rate of the state in the continuous-time domain. This problem serves as a basis for the second problem where in the channel a quantizer
Q
is added as in Fig. 1.2(c). We propose a method to design a
quantizer and to ÿnd a bound on
T
while maintaining the quadratic stability
of the closed-loop system. Further, we give more speciÿc results for uniform quantizers and other types of quantizers. Finally, to incorporate the time delay in the communication, we consider the system in Fig. 1.3. The delay is time-varying, but has an upper bound We give an analysis on how large the maximum delay time
ý
ý.
can be for the
closed-loop system to maintain its stability. With this result we can explicitly calculate the necessary data rate for the channel. A design example of a magnetic levitation ball system is given, and we compare the data rate necessary for stabilization using several quantizers. In Chapter 5, we propose some methods to reduce the data rate necessary for stabilization by extending the results in Chapter 4. The picture of the system is more general and is depicted in Fig. 1.4, where the sensor and the actuator are connected to a network through a coder and a decoder, respectively. The ÿrst direction is to use time-varying quantizers that can change their range and thus require almost arbitrarily small number of quantization levels. The other direction for reducing the data rate is to employ nonuniform sampling. We ÿrst give a fairly simple method and then a switching control method. One of the obstacles in switching control of a continuous-time system is that fast switching or chattering may occur. Our results on quantized sampled-data control lead us to a switching control method, called dwell-time switching control; the protocol is that a switching cannot occur until a speciÿed dwell time has elapsed from the last switching.
1.2.
5
Outline and contributions
u(t)
-
-
x(t)
(A; B )
?
x(t)
u(t)
(A; B )
K
K
ÿ
T
H
(a) Continuous-time system
u(t)
ÿ
T
S
ÿ
(b) Sampled-data system
-
x(t)
(A; B )
? K
T
H
ÿ
Q
ÿ
T
S
ÿ
(c) Quantized sampled-data system
Figure 1.2:
Chapter 6 is devoted to the extension of the problems in Chapters 4 and 5 for the multiple-input case. First we formulate the sampled-data problem and the quantized sampled-data problem in a manner parallel to the single-input case. Then, the dwell-time switching control scheme is generalized as well. In Chapter 7, we discuss some future research topics on networked control systems and ÿnite data rate issues. The contributions of the book can be summarized as follows. The time sequencing techniques of messages employed in networks are treated in a distributed control setup.
We propose a problem of designing
dynamic local controllers connected to each other by a network. Stability issues are characterized, and the advantage of the use of networks is clariÿed in the context of decentralized control. Quantization methods for a centralized control system with a ÿnite data rate communication channel in the feedback loop are developed in a systematic manner. In particular, we consider the stabilization of a continuous-time plant, and the stability criterion throughout this study is quadratic stability with a guaranteed decay rate in the continuous-time domain.
Such a treatment
6
Chapter 1.
u(t)
-
Introduction
x(t)
(A; B )
? K
T
H
ÿ
ý
delay
ÿ
ý
ÿ
Q
T
S
ÿ
Figure 1.3: Quantized sampled-data system with delay
u(t)
-
x(t)
(A; B )
T
D
ÿ
Decoder
T
C
i(t)
ÿ
Coder
Figure 1.4: Switching system
of sampled-data control systems with the use of control Lyapunov functions seems to be new. We deal with a fairly general class of quantizers. For systems with quantizers, it may not be possible to achieve asymptotic stability; instead, we employ a weaker notion of stability which requires the state to go only close to the origin. Also, by limiting the region of initial states in the state space to be bounded, the number of quantization levels can be ÿnite. We propose a design method for a sampling period and a quantizer to achieve such stability and give a simple deÿnition of data rate required for control. When a message with data is sent over a network, there is transmission time delay due to the limited data rate.
We analyze the delay time that
the quantized sampled-data system can allow while maintaining stability. The continuous-time quadratic stability of the original closed-loop system is the key to this analysis. This result combines the issues related to network control, namely sampling, quantization, and time delay, and gives us a control system that uses a strictly ÿnite data rate channel in the feedback loop. Extending the ideas underneath the quantized sampled-data problem, we further develop several methods to reduce the data rate for control. Although control systems that use quantized measurements are hybrid in nature, there have been few studies in hybrid dynamical systems from the data rate point of view.
Chapter 2
Control networks In this chapter, we give a brief review of control networks. First, we study the characteristics of communication in control and the advantages of using networks there. Then, we summarize more practical issues in control networks to provide some ideas on the current technology and to describe how the networks work.
2.1
Control over networks
In control systems, the traÆc between sensors/actuators and controllers has some distinct characteristics. First, the messages are small because they contain raw data from sensors or for actuators. Second, the transmission rate is constantly high and the interval between messages is short due to the real-time aspect of control; many sensors/actuators have ÿxed, small sampling periods. Third, it is important that the time delay in communication be minimal and that the delay time be known. Control networks have to be designed to handle such traÆc. It is clear that the requirements for this type of network are diþerent from the requirements for data networks, where large data messages are transmitted occasionally for short intervals of time at high data rates. The idea of using networks in control systems is certainly not new. For example, the nuclear science community developed a parallel bus protocol called CAMAC in the early 1970s, and another standard MIL-STD-1553 has been used in military avionic applications since the mid-1970s in western countries [41].
What is new is that networks are fast enough for the real-time
requirements in control and that network interfaces are now inexpensive and small enough that, even if used in a large number, their presence is not costly. In the past decade much eþort has been made for the standardization of protocols for control networks by groups of companies and professional organizations. These protocols include Controller Area Networks (CAN) [50], Factory Instrumentation Protocol (FIP) [82], Foundation ÿeldbus [73], Lon-
H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 7−14, 2002. Springer-Verlag Berlin Heidelberg 2002
8
Chapter 2.
Control networks
Works [68], and Process Fieldbus (Proÿbus) [8]. There are various advantages to use network cables in control systems. First of all, the overall cost for cabling, installation, and maintenance can be reduced drastically.
Moreover, detection of cable faults and system expan-
sion is much easier because of the modular nature of the network. In addition, networks can bring new functionalities that were not available previously: Networks with broadcast capability allow simultaneous availability of data to all devices connected to the network. Also, network interfaces have some processing capabilities that can be used for control purposes. As we discussed in Section 1.1, the main drawback for using a shared medium instead of dedicated wires is the limitation on data rate.
2.2
How control networks work
In this section, we summarize how control network protocols are designed in practice to meet the requirements described above.
Media access methods
make the diþerences among existing protocols and are brieýy explained. We also give some details on one of the standardized protocols.
2.2.1
Periodic pattern
In the previous section, we looked at the nature of communication in control systems. We can divide the traÆc typically involved in control into two classes. One class is of periodic signals; these signals usually have critical real-time requirements and are transmitted periodically from sensors or to actuators. The other class is of event-driven, aperiodic signals, which need not be transmitted periodically but only when necessary. These signals include emergency alarms, on/oþ type signals, and data from counters. Control networks are mostly serial and thus employ time division multiplexing. A common technique for such networks is to use a communication type with a
periodic pattern
in time [66]. The pattern consists of two phases, peri-
odic and aperiodic, corresponding to the two classes of signals, as in Fig. 2.1.
period
periodic phase
aperiodic phase
time
Figure 2.1: Periodic pattern of communication The periodic phase is deterministic in that the transmission order of messages is scheduled and ÿxed. Hence, no collision of messages occurs and the delay time in communication is known. Only during the aperiodic phase, the
2.2.
9
How control networks work
event-driven communication is allowed to take place, and thus there may be time delay.
It is important that the periodic pattern be designed to leave
enough time for the aperiodic phase so that, in any event, the delay time in aperiodic communication is suÆciently small and that the system does not run into a hazardous state. Because of the periodic and deterministic nature of communication and the critical requirement in time, it is preferred to have one or more
master
stations
in the system that poll all the nodes of sensors, actuators, and controllers in the system according to a periodic schedule. An additional advantage for such architecture is that it requires simple protocols and thus reduces the cost. The presence of a master on a network can be, however, controversial as it creates a single point of failure, that is, if the master fails the entire system comes to a halt. 2.2.2
Medium access methods
As to how to realize a collision-free transmission scheme that suits the requirements described above, there are many ways. Medium access methods are what determine how each node connected to a network gains access to the network. The choice of the method is a crucial step in deÿning network protocols because it determines the complexity and the cost of the communication system as well as the data rate. Here, we describe brieýy some of the common medium access methods and study how they can be combined for control network use. The simplest is the centralized access method, where a single master controls the network and polls all the nodes (i.e., asks nodes one by one if they have any data to transmit) according to the periodic schedule. Slave nodes can transmit a message only when they are given permission by the master. This method is deterministic and is eÆcient for periodic signals. Nevertheless, when the number of nodes is large, the polling of messages can easily lead to an unacceptably large time delay in the system. One of the popular decentralized access methods is the token passing bus technique. It is an extension of the token passing ring method, but does not require a ring topology; a token that allows message transmission is passed around among nodes in an order determined prior to the operation. This is another example of deterministic methods. Other decentralized methods are those in the class of the carrier sense multiple access with collision detection (CSMA/CD). Basically, each transmitting node ÿrst checks whether the channel is free. If not, the node waits for some time and retries.
If the channel is free, the node begins the transmission.
However, since two nodes may simultaneously start sending out messages, the nodes listen to the channel while transmitting to detect any collisions. There are several measures to handle the possible collisions, including the delayed retry (DR) method (as in the Ethernet), the collision resolution (CR) method, and the collision avoidance (CA) method [50]. Because of the event-driven nature of the media access, CSMA methods are strong on aperiodic signals
10
Chapter 2.
for control networks.
Control networks
The shortcoming is the cost due to the complexity in
protocols and interfaces. Each method above has its strong points and weak points.
In today's
advanced control network protocols, to handle eÆciently both periodic and aperiodic signals, combinations of several access methods are usually allowed. We give three examples in the following: (i) Multiple masters, each with polling capability, pass a token around in a deterministic manner. (ii) A master node is put into a CSMA type network for polling. (iii) In a centralized setup, the master polls all the nodes during the periodic phase; any nodes having aperiodic data to be transmitted ÿrst notify the master at this time and are then given permission for transmission later during the aperiodic phase. To keep the complexity low, many protocols employ variations of the centralized access method.
In such cases, of the seven layers of the Open Systems
Interconnection (OSI) reference model, only three layers are necessary, namely the physical layer, the data link layer, and the application layer.
2.2.3
A protocol example
Here we look into the speciÿcation of one of the typical ÿeldbus protocols. Some details on the rate required for the periodic real-time communication are given.
The goal of this book is to reduce the data rate necessary for
control, and one way to do so is to reduce the number of bits in quantization. To see the eþect of doing so, we calculate the transaction time required for sending each message. Factory Instrumentation Protocol (FIP) is a ÿeldbus protocol accepted internationally in the industry [22, 50, 82]. It was developed in France in the late 1980s and is standardized in Europe. FIP is currently provided by a nonproÿt organization called WorldFIP. Some parts of the protocol are shared with the Foundation ÿeldbus, which is based in North America. The maximum data rate for FIP is 2.5 M bps with twisted pair cables and 5 M bps with ÿber optics. The bus length depends on the data rate and can be up to 2000 m at the slow rate mode (31.75 k bps).
This can be, however,
extended by interconnecting up to 4 networks using repeaters. There may be about 60 stations per network. FIP employs a centralized media access method. A master, or a
trator,
bus arbi-
polls the slave stations periodically based on a predeÿned polling list.
There is a mechanism to handle event-driven messages as well, but here we take a closer look at the periodic communication, for which this protocol is mainly designed [95]. When the system is conÿgured, all signals and their periodicity are determined.
Each signal is associated with a unique identiÿer, and what the bus
2.2.
11
How control networks work
bytes
1
1
preamble
frame start delimeter
1
2
control
identiÿer
2
1
frame check
frame end
sequence
delimiter
(a) Question frame
bytes 1
preamble
1
1
frame start delimeter
control
1
1
0-128
data
data
type
length
2
1
frame check frame end
data
sequence
delimiter
(b) Response frame
Figure 2.2: Message formats of FIP
arbitrator has is a list of identiÿers. According to this list, the bus arbitrator
question frame, which contains an identiÿer. Only one station producer, and one or more stations recognize themselves consumers. The producer broadcasts a response frame with the data
broadcasts a
recognizes itself as the as the
of the signal corresponding to the identiÿer, and only the consumers on the bus accept it. After this the bus arbitrator can go on to the next identiÿer on the list. An acknowledgement message is optional; instead, if the response frame is slow (or lost), the consumers return to the status waiting for the next question frame. Note that because of the deterministic nature, none of the frames described here contains addresses of stations. The message formats for the question frame and the response frame are shown in Fig. 2.2. Both frames include the preamble, the frame start delimiter, the control, the frame check sequences, and the frame end delimiter.
The
question frame contains the identiÿer of 2 bytes, making the frame 8 bytes in total. The response frame has the data type/length bits in addition to the data; thus the total
overhead
is 8 bytes. The data size can be between 0 and
128 bytes. Now we calculate the total transaction time for sending one message with data. The protocol requires some
turnaround time
after each frame transmis-
sion (for physical layer transactions), a minimum of 4 mode. Thus, the transaction time to send transaction time =
n
64 bits (question) + (64 + 8
= 59:2 + 3:2
üs
at the 2.5 M bps
bytes of data is
ü
n)
bits (response)
2:5 M bps
ü
+2 n ü
s:
ü4
üs
(2.1)
12
Chapter 2.
For example, the transaction times are 62.4 219.2
üs
for 50 bytes, and 379.2
üs
Control networks
üs for 1 byte, 65.6 üs for 2 bytes,
for 100 bytes. We see that the eþect of
reducing the data size into half is very diþerent in the cases from 2 bytes to 1 and from 100 bytes to 50. The amount of time to send the question frame and the overhead and the turnaround time are indeed not negligible. In real-time control, the data size is typically small, and thus changing it can be less eþective in reducing data rate (compared to changing the sampling period), unless it becomes signiÿcantly smaller. Comparisons of other protocols can be found in, e.g., [51].
2.3
Application examples
As described above, there are many advantages in introducing networks into control systems, especially those requiring large amount of cabling and/or having many system components. Industrial applications include automobiles [15, 42, 50], robotic systems [16, 39], jacking systems for trains [5], automated manufacturing systems [8, 22, 65], and process control systems [69, 73]. In this section, we give some detailed examples to illustrate the changes that networks can bring into such systems.
2.3.1
Jacking systems of train cars
For maintenance, train cars are raised by jacking systems [5]. Four jacks are typically used for one car, and these are movable on the ýoor for ýexibility. Since the decoupling and coupling of the cars may be time-consuming, it is more common to lift multiple, coupled cars, up to 12 cars, at once. A level lift of the entire train is required for safety and to keep the the coupling devices and the jacks from any damage.
Hence, the control objective here is the
synchronized operation of the jacks. The conventional solution for the realization of the system has been a pointto-point, centrally controlled system as in the layout in Fig. 2.3(a), where every jack is connected to the central controller with a separate cable. However, the large amount of cabling can be costly and further causes installation and storage problems. It also requires a central controller with high computational capability for controlling a certain number of jacks. Recently, a new system has been introduced in a more distributed fashion as in Fig. 2.3(b). All jacks are connected to a single network cable, and each of them has a local controller with communication capability; depending on the information of the neighboring jacks, each jack adjusts its height. Clearly, it uses less cabling; the central controller is downsized by distributing its function to local controllers on the jacks and, as a result, merely has supervising functions. Another advantage is that expansion of the system can be done without much change in hardware. New jacks simply have to be connected to the network, and the rest can be taken care of by the software.
2.3.
Application examples
Central Control
13
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
Car 1
Car 2
þ þ
Car 12
þ þ:
jack
(a) Conventional centrally controlled type
þ þ CC
Car 1
þ þ
þ þ
þ þ
þ þ
Car 2
þ þ
þ þ
þ þ
þ þ
þ þ
þ þ
Car 12
þ þ:
jack
: local controller (b) Distributed type with a network
Figure 2.3: Jacking system
This system has employed the LonWorks networks, which uses a CSMA type access method; the jacks send out their height every time they have moved a certain amount. Thus, the system serves as an example of a real-time control system with event-driven communication.
2.3.2
Networks on automobiles
Modern cars are a good example of systems with extensive cabling in limited spaces. The cabling in a medium size car can be more than 2000 m in length and 100 kg in weight, and there can be more than 600 diþerent types of wires [50]. To ÿt these cables compactly into a car within a short period of time is a challenge for the industry. Since the early 1980s, automobile companies have been developing protocols and have implemented on production automobiles such as the 1986 Buick Riviera and the 1986 Pontiac 6000 STE [42]. In a car, there are several types of communications: One is for real-time critical control as in engines, cruise control, gear boxes, brakes, etc. Another type is for body control as in power windows control, door lock control, mirror positioning, seat positioning, lamps, and so on; the real-time requirement in these applications is much lower, but because of the large number of such systems in a car it is cost sensitive. As a result, two or more separate network cables are often used, one for real-time control, one for body control, and possibly more for diagnostics and
14
Chapter 2.
Control networks
supervisory station
controllers ÿeldbuses sensors/actuators
Figure 2.4: Hierarchical structure
other status information sharing.
However, these networks cannot operate
separately, and thus they are connected to each other through gateways. Consequently, protocols for networks on automobiles are designed to handle all of these types of communication. The Controller Area Networks (CAN) is an example of protocols designed by automobile companies. It was ÿrst implemented in Mercedes S-class cars in 1992. For the access method, it employs a CSMA/CD technique with collision resolution, which is event-driven but deterministic; see [50] for details. 2.3.3
Process control
In the area of process control or more generally in automated manufacturing, systems are often large in scale and physically distributed. In such systems, several networks may compose a hierarchical structure for the interconnection of systems at various levels; a simple system is shown in Fig. 2.4. This structure provides eÆciency and reliability in the communication [41, 65]. Of the hierarchy of networks, at the lowest level is the control networks. This connects sensors and actuators to control devices. Compared to the higher level communication, the data size of messages here is smaller and the realtime requirement is more critical. The controllers may also receive commands from supervisory systems located in a higher level. In [33], an installation of ÿeldbuses at the Monsanto Chocolate Bayou plant in Alvin, TX is reported. In their condensate recovery system, there are 14 ÿeld devices such as magnetic ýowmeters, pressure sensors, temperature sensors, control valves, and so on. Two ÿeldbus cables are used to connect them to two controllers separately and realize a distributed system. It is emphasized that devices provided from various companies work together over the networks. Fieldbuses are used in a much larger scale as well.
At the Shell Ham-
burg oil blending facility, the control system for batch and plant control was upgraded in 1997 using Proÿbus networks [83].
Thousands of temperature
sensors, solenoid valves, and level switches were connected. In some parts, a variation of Proÿbus, designed to prevent explosions caused by sparks from the communication media, is used.
Chapter 3
Distributed control over networks In this chapter, we study a distributed control system, where the controllers can communicate to each other over a limited data rate channel. We study methods for time sequencing of messages over the channel and characterize the stabilizability issues. Then we clarify the advantage of the use of networks in the context of decentralized control.
3.1
The switch box problem
In this section, we formulate our design problem for a distributed control system with two local controllers connected to a network cable through network interfaces as in Fig. 3.1. Signals not available locally for each controller can be communicated over the network, possibly at a rate slower than the sampling rate of the controllers. We give a simple model for the network, governed by the protocol described in the previous chapter, as a \switch box." Because of the sequential transmission over the network, at most one signal is allowed to be communicated through the switch box at each discrete-time instant. The behavior of the switch box is periodic and deterministic. Consider the discrete-time system in Fig. 3.2. The plant P is a linear, time-invariant (TI) system whose state equations are given by
ÿ
( + 1) = yi (k ) =
x k
( ) + Bp1 u1 (k ) + Bp2 u2 (k ); ( ) + Dpi1 u1 (k ) + Dpi2 u2 (k );
Ap x k
Cpi x k
i
= 1 ; 2;
(3.1)
where x(k ) 2 Rn is the state, and ui (k ) 2 Rmi and yi (k ) 2 Rpi are the local control input and the output of channel i = 1; 2, k 2 Z+, respectively. The local controllers Ki , i = 1; 2, are periodically time-varying (PTV) systems H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 15−31, 2002. Springer-Verlag Berlin Heidelberg 2002
16
Chapter 3.
Distributed control over networks
plant
controllers
interfaces network Figure 3.1: A distributed control system
with the same period
ÿ
K ÿ 1, and their state equations are
N
i (k + 1) = Aki (k)ûi (k) + Bki1 (k)yi (k) + Bki2 (k)vi (k); ui (k ) = Cki (k )ûi (k ) + Dki1 (k )yi (k ) + Dki2 (k )vi (k ); where ûi (k ) 2 Rn is the state and vi (k ) 2 Rp is the signal through the network for k 2 Z+. Here Aki (k ) = Aki (NK + k ) for all k 2 Z+ and likewise û
i
ki
for other system matrices. The switch box
ÿ
y1 P
S
is a PTV, memoryless system
u1 v1 K1 v2 u2 K2
S
y2
Figure 3.2: The switch box problem with a period
N,
where
þ
1 (k) v2 (k ) v
and
S (k )
ý
þ =
S (k )
1 (k) y2 (k ) y
ý ;
k
2 Z+;
is one of the three matrices
þ
0 :=
S
ý
0
0
0
0
þ ;
1 :=
S
0 0
p2
I
0
ý
þ ;
and
2 :=
S
0
p1
I
ý
0 0
:
3.2.
17
Preliminaries
The switch box provides each local controller with the signals not accessible lo-
switching pattern s 2 f0; 1; 2gN , whose elements correspond to the indices of matrices Si ; i = 0; 1; 2. For excally. Its periodic behavior is determined by a
s = [1 2 0 0] means S has a period N = 4 and S (4k ) = S1 ; S (4k + 1) = + 2) = S (4k + 3) = S0 for k 2 Z+. Thus, at k = 0, the controller 2 K1 receives y2 (0); at k = 1, K2 receives y1 (1); at k = 2; 3, no data is sent over
ample,
S ; S (4k
the network; and this repeats periodically.
Finally, in Fig. 3.2, ü denotes the generalized plant containing It is PTV with the period We deÿne our and detectable. controllers
P
and
S.
N.
switch box problem as follows:
Given a switching pattern
s
Assume that
2 f0; 1; 2gN ,
P
is stabilizable
ÿnd PTV local
1 and K2 such that the closed-loop system is exponentially stable.
K
A technique similar to the switch box is employed in the design problems
in [11, 35]. Their setup is that a controller is connected to pairs of sensors and actuators by a network and can communicate with only one pair at each sampling time, depending on a periodic pattern. Even though their model is also simple and linear, because of the constraint on the controller structure (it must be memoryless), the stabilization problem was shown to be an NP hard nonlinear optimization problem. We have extended their problems to a decentralized control setting. The plant with the switch box is PTV and is diÆcult to control with TI local controllers. Thus, we have enlarged the class of controllers to PTV ones, and as a result our solution is much simpler.
3.2
Preliminaries
In this section, we review some preliminary results on the stabilizability of decentralized control systems. Our focus is on the control of linear PTV systems using PTV local controllers [46]. Consider a
ú -channel,
linear, PTV system ü with a period
equations are
ÿ
x(k
i
y
+ 1)
=
A(k )x(k )
(k )
=
C
i
+
Pÿ Pjÿj=1=1
j (k)uj (k); ij (k)uj (k);
K:
N
ÿ
i (k + 1) ui (k )
û
whose state
B
(k )x(k ) +
D
At each channel i, we apply a local PTV controller period
N
i
i
= 1; 2; : : :
K ;i
; ú:
= 1; 2; : : :
; ú,
(3.2)
with a
ki (k)ûi (k) + Bki (k)yi (k); ki (k)ûi (k) + Dki (k)yi (k):
=
A
=
C
We now deÿne two problems:
û The PTV problem :
Find PTV local controllers
the closed-loop system is exponentially stable;
1
K ;::: ;K
ÿ
such that
18
Chapter 3.
û The TI problem dependent of
k,
:
Distributed control over networks
Suppose all the matrices in ü and
i.e.,
=
N
NK
K1 ; : : : = 1. Find TI local controllers
; Kÿ
are in-
K1 ; : : : ; Kÿ
such that the closed-loop system is stable. We follow the formulation in [46] and give some deÿnitions. A linear, TI system
G
with a state model (A; B; C; D ) is said to be complete if rank (M ((A; B; C; D ); z ))
where
n
is the dimension of
A
and
M ((A; B; C; D ); z )
disk, then the state model of
G,
z
for
n
þþ þ ú 2C
M(
If the rank condition holds for every
ÿ
z
2C
(3.3)
;
; ) is the system matrix deÿned by A
:=
zI
ý
B
C
D
:
in the complement of the open unit
or simply
G,
is said to be weakly complete .
Note that the rank condition (3.3) has to be checked only for eigenvalues of A
since otherwise the condition is automatically satisÿed. If the rank condition
fails for some
z
system.
2C
, then
z
is an uncontrollable and unobservable mode of the
For the system ü in (3.2), its complementary subsystems üi1 :::iÿ j1 :::jþ ÿÿ are
given by the state models
0 B@ ü A;
where
f
ik ;
1; 2; : : :
g
jl
;ú
2f
Ci
þþþ
1
Bj
2 32 û 6 .. 1 7 6 4.54 þ ÿÿ
Bj
1; 2; : : :
;ú
;
g
Ciÿ
,
ü
ú
k
. . .
þþþ ..
1 ÿÿ
Di jþ
. . .
.
1 þþþ ú f
Diÿ j
Diÿ jþ
31 75CA
;
ÿÿ
g
;ú 1, i1 ; : : : ; iþ ; j1 ; : : : ; jÿ ýþ = of the matrices is not displayed for sim-
= 1; 2; : : :
. The dependence on
plicity. There are 2ÿ
;
11
Di j
2 such subsystems for ü. For example, a 2-channel ü
has only two complementary subsystems: ü12 and ü21 . First, for the design problem of TI systems, the following lemma gives a necessary and suÆcient condition [91]. Lemma 3.2.1 The TI problem has a solution if and only if ü is stabiliz-
able and detectable, and all the 2ÿ complete.
ú
2 complementary subsystems are weakly
2
For a TI system ü, z0 C is called a decentralized ÿxed mode (DFM) if it is an uncontrollable or unobservable ÿxed mode, or if the rank condition in (3.3) fails for the state model of some complementary subsystem. With this notion, the lemma above can be stated that the TI problem is solvable if and only if ü has no unstable DFM. If ü is stabilizable by local controllers, there are a number of methods for decentralized control design, e.g., [19, 30, 77, 79, 91]. We next proceed to give the solution to the PTV case. Deÿne the
operator
LN
[47] by
LN
:
82 > < f ( )g 2Z+ 7! >:64 u k
u(kN )
. . .
k
u((k
+ 1)N
ú
39 > 75= > ;
1)
: k
2Z+
N -lifting
3.3.
19
Stabilization using PTV local controllers
Hereafter, we denote the N -lifting of a signal u by u ~ := LN u and that of a ~ := L GL 1 . Note that G ~ is TI if N is a multiple of the PTV system G by G N N
ý
period of
G.
The PTV problem is characterized by the next result [46]. Lemma 3.2.2 The PTV problem is solvable if and only if ü is stabilizable
and detectable (with a centralized controller), and for each complementary subsystem üi1 :::iÿ j1 :::jþ ÿÿ , if its input-output map is zero, then its ~ ü i1 :::iÿ j1 :::jþ ÿÿ is weakly complete.
N -lifting
If the conditions above are satisÿed, there exist stabilizing PTV local conû , where N û is deÿned as follows: For a TI NK ý N K K system G with a state model (A; B; C; D ), let T0 := D; Ti := C Ai 1 B; i ÿ 1, and deÿne
trollers of some period
ý
ú
ù (G)
:= i0 + 1 +
ù
n
;
rank (Ti0 )
where i0 := minfi; Ti 6= 0g and bac denotes the greatest integer equal to or smaller than a. Then û N K
:=
N
þ max
n ø ù
~ ü i1 :::iÿ j1 :::jþ ÿÿ
÷o
;
where the maximum is taken over all complementary subsystems of ü. For a linear system
G,
we write
G
= 0 if its output equals zero for every
input; likewise G 6= 0 means otherwise. ~ is equal to zero. N -lifting G
3.3 Even if
Clearly,
G
= 0 if and only if its
Stabilization using PTV local controllers P
in Fig. 3.2 does not satisfy the conditions in Lemma 3.2.1 or 3.2.2,
stabilization may be possible by making use of the switch box, that is, by exchange of local signals. We can see that the system ü is a PTV system, so Lemma 3.2.2 is applicable. In this section, a necessary and suÆcient condition for solvability of the switch box problem is given. First we introduce some notation for the systems in Fig. 3.2. Let be partitioned appropriately as
þ
P
=
respectively. Let their
LN Pij L
P12
P21
P22
N -lifting
~ := P ~ij := where P
P11
þ~
ý1, N
P11
~ P 21
ý
þ and
S
=
systems be
~ P 12
ý
~ P 22
and J~i :=
~ := and S
LN Ji L
ý1 N
0
J1
J2
0
þ
0 ~ J 2
for
i; j
P
and
S
ý
~ J 1 0
;
ý ;
= 1; 2. Here J~i are TI
memoryless and hence have corresponding matrix forms; in what follows, we
20
Chapter 3.
Distributed control over networks
i
i 2 f0; 1; 2gN
use the same notation J~ for these matrices. The matrices J~ ; i = 1; 2, can be constructed as follows. Decompose the switching pattern
i 2 f0; 1gN ; i = 1; 2, that satisfy s = s1 + 2s2 . ~ = diag(s ); i = 1; 2: J i i
into
s
For example, the switching pattern and
= [0 0 1].
s2
Now the
s
S
(3.4)
= [1 1 2] is decomposed to
s
of
Then we have
e
= [1 1 0]
s1
e
~ of ü is given by ü ~ := S ~ P ~ , where S ~ is a TI memoryless ü
N -lifting
system with a matrix form
3 7 2(p1 +p2 )N þ(p1 +p2 )N 17 2R : 05
2 I 6 ~ := 6 0 S e 4J~ 2
0 ~ J
0
I
e
~ has full column rank. Clearly, the matrix S For the sake of completeness, we give the state models of these systems. ~ be Let the state model of P
ø
p; p; p p
~ A
~ B
~ C
~ D
e ÷
÷
ö
:=
p;
~ A
ü~
p
~ B
B 1
û þC~p1 ý þD~ p11 p2 ; C~p2 ; D~ p21
p ~ D p22 ~ D 12
ýõ
:
~ =S ~ P ~ , so the state model of ü ~ can be expressed as By deÿnition, ü
ø
s
s
s D~ s
~ ; B ~ ; C ~ A
where
ö
:=
p
~ ; A
ü~
p
p
~ B 2
B 1
û þC~s1 ý þD~ s11 ; ; ~ ~ C D s2 s21
ýõ
2þ ý þ ý3 þ~ ý þ ~ ý 6 I ~0 7 þ ~ ý 0 Cs1 J1 ~ Cp1 = 6þ ý þ ý7 C~p1 := S e 4 ~ ~ ~ 0 5 C Cs2 Cp2 J 2 p2 0
and similarly
þ~
s ~ D s21 D 11
ij
~ The subsystems ü i; j
s ~ D s22 ~ D 12
= 1; 2.
s ~ D s22 ~ D 12
ý
(3.5)
(3.6)
I
þ~
e D~ p p21
~ := S
;
D 11
p ~ D p22 ~ D 12
ý (3.7)
:
p ; B~pj ; C~si ; D~ sij ),
~ are given by the state models (A ~ of ü
We now state our main result. It gives a necessary and suÆcient condition for the solvability of our switch box problem under mild conditions. Theorem 3.3.1 Suppose that P is stabilizable and detectable,
þ
P11 P21
Then
P
ý
6= 0;
þ
and
P12 P22
ý
6= 0:
(3.8)
is stabilizable using the network structure in Fig. 3.2 with PTV local
controllers if and only if
3.3.
21
Stabilization using PTV local controllers
(i) if
P12
= 0 and is not weakly complete, then
s
contains 1;
(ii) if
P21
= 0 and is not weakly complete, then
s
contains 2.
Proof
By Lemma 3.2.2,
P
is stabilizable with the switching and PTV local
controllers iþ (a) ü is stabilizable and detectable; ~ ~ (b) if ü 12 = 0, then ü12 is weakly complete; ~ ~ (c) if ü 21 = 0, then ü21 is weakly complete. We ÿrst show that (a) is equivalent to having
P stabilizable and detectable. ~ is staLifting preserves stabilizability and detectability of systems. Hence, P ~ ~ ~ ~ bilizable and detectable iþ P is. By deÿnition, ü = S P and S is memoryless.
e
e
~ is stabilizable iþ P is. Thus, ü ~ has column full rank, we have from (3.5) and (3.6) Since S
e
þ
zI
rank
for
z
2C
ú
s
~ C
s
~ A
ý
þ
I
= rank
0
0 ~ S
ýþ
e
zI
ú
p
~ C
p
~ A
ý
þ = rank
zI
ú
p
~ A
p
~ C
ý
~ and that of ü ~ are equivalent. Thus, . Hence, the detectability of P
(a) holds iþ
P
is stabilizable and detectable.
Next, we must show that (b) is equivalent to (i). First, observe that (i)
, ,
[If
s
does not contain 1, then
[If
P12
= 0 and
s
P12
6
= 0 or is weakly complete]
does not contain 1, then
P12
is weakly complete] :
In the following, we show that the last condition above is equivalent to (b). By hypothesis, we can see that
þ ~ ü 12 =
~ P 12
~P J 1 ~22
ý =0
,
[P12 = 0 and
J1
= 0]:
Thus,
h
(b)
, h ,
If
P12
= 0 and
J1
~ = 0, then ü 12 is weakly complete
If
P12
= 0 and
J1
~ is weakly complete = 0, then P 12
i
i
~ = 0 implies that the rank condition for weak completeness of ü 12 ~ only depends on P12 . Finally, the weak completeness property of the TI system P12 is invariant under lifting by Theorem 1 in [46]. Thus, (b) is equivalent to
because
J1
(i). Likewise, we can show that (c) is equivalent to (ii).
ÿ
If the conditions (i) and (ii) are satisÿed, then stabilizing controllers of û exist, based on Lemma 3.2.2. period at most N
K
22
Chapter 3.
Distributed control over networks
The main idea is simple. By Lemma 3.2.2, for stabilizability, we need to introduce output exchange through the switch box so that each complementary subsystem of ü is either not equal to zero, or equal to zero but not weakly complete. In the result above, under a weak hypothesis, the strategy is to ensure that no complementary subsystem is equal to zero by using the switch box. The eþect of exchange of the local output information to remove DFMs was considered in [64, 90] for stabilization of a TI plant using TI local controllers. However, a point-to-point connection between controllers for each exchange is required in the results, which may be costly and may result in complex wiring. We give some examples to illustrate our result. Example 3.3.2 For decentralized stabilization, we have seen that three choices
of controllers are available: TI controllers, PTV controllers, and PTV controllers with a switch box. Of the three, the more sophisticated the controllers are, the larger the class of plants that can be stabilized. The ÿrst example is a 2-channel plant
P
that is stabilizable using PTV
controllers but not with TI controllers and its state model is given by
021 0 0 3 21 03 þ 1 ý þ ý @40 ú1 ú35 40 15 01 10 00 00 00 A 0 0 ú2 0 1 We see that 12 = 6 0, but is not weakly complete; it has a DFM of 1. We next consider a 2-channel plant given by 021 0 0 3 21 03 þ 1 ý þ ý @40 ú1 0 5 41 05 01 10 0 00 00 A 0 0 ú2 0 1 If = 6 0, then is stabilizable only with PTV controllers with a switch box, ;
;
;
:
;
:
P
P
;
c
;
c
P
among the three control structures. In this case, weakly complete because of the DFM 1. pattern
s
6
P12 is equal to 0, but not By Theorem 3.3.1, the switching
must have 1, which gives ü12 = 0.
The case with
P12 is equal to 0 and 0 0 0 not weakly complete, we cannot apply Theorem 3.3.1 because [P12 P22 ] =
0.
c
= 0 is interesting. Although again
Nevertheless, the system is stabilizable with a switch box, and we can
check that the DFM 2 in 1.
P12 is removable by a switching pattern having As a result, ü12 has a zero input-output map and is weakly complete.
This implies that information exchange may result in a weakly complete, zero
5
complementary subsystem. As seen in the last example, even if (3.8) fails,
P
can still be stabilizable; in
general, even for the 2-channel plant case, the necessary and suÆcient condition is not known. This observation is suggestive for the case with a
ú -channel
plant. We can ÿnd a switching pattern to stabilize the system by following the idea in Theorem 3.3.1: If a complementary subsystem
Pi1 :::iÿ j1 :::jþ
ÿÿ
is
3.4.
23
Assignability measure analysis
equal to 0 and not weakly complete, then choose a pattern so that the corre~ sponding subsystem ü i1 :::iÿ j1 :::jþ ÿÿ is not equal to 0. However, the example ~ above implies that there may be a pattern that makes ü equal to i1 :::iÿ j1 :::jþ
ÿÿ
0 and weakly complete. So, the pattern choice for the general problem is not so straightforward.
3.4
Assignability measure analysis
Naturally, we expect that the use of the switch box enhances the capability of the system in some way. For a quantitative analysis of the system characteristics, we ÿrst deÿne a partial ordering in the switching patterns and then employ the assignability measure proposed in [87] for decentralized control systems. In this section, we show that our system in Fig. 3.2 can be improved in terms of assignability by allowing more signal transmission with respect to the partial ordering.
f0; 1; 2g
N
We ÿrst introduce a partial ordering on the set patterns.
Let
s1 ; s2
be obtained from s1
s1
= [1 1 2 2] and
dependence on
s
s2
2 f0; 1; 2g
of switching
s1
ÿ
s2
by setting some elements of
s1
to zero. For example, if
N
.
We deÿne
= [1 0 2 0], then
ÿ s2 .
s1
to mean that
s2
can
When necessary, we show the
of the system ü explicitly as in ü(s).
Assignable systems refer to systems not having any DFMs, that is, those that can be stabilized decentrally. The assignability measure gives the distance between a system and the set of systems having DFMs. This is an extension of the controllability measure for centralized control systems in [26]. We deÿne the assignability measure and give some preliminary results on the measure in the following. Let
Sÿ~
be the set of TI state models whose dimensions are the same as ~ that of ü as
Sÿ~ :=
ÿö
A;
ü
B1
B2
û
þ
C1
;
C2
ý þ ;
D11
D12
D21
D22
Ci
Then a metric
ýõ
2R
pi N
þ þ) on Sÿ~ is deÿned by
d( ;
d((A; B; C; D ); (A; B; C ; D ))
:=
:
þ
n
A
2R þ
; Dij
n
n
; Bj
2R
óþ ó AúA ó ó C úC
pi N
þ
2R þ
mj N
n
mj N
; i; j
;
ô
= 1; 2
:
ýó ú B óó ; D úD ó B
kþk denotes the maximum singular value. A system with a state model Sÿ~ having at least one DFM is called unassignable. Let Uÿ~ be the set of all
where in
such systems:
Uÿ~ := f(A; B; C; D) 2 Sÿ~ : 9z 2 C The
assignability measure
s.t. rank M ((A; B; C; D ); z )
for a TI state model (A; B; C; D )
Uÿ~ deÿned by d((A; B; C; D ); U ~ ) := ÿ
from the set
inf
2Uÿ
(Au ;Bu ;Cu ;Du )
d((A; B; C; D );
g:
< n
2 Sÿ~ is its distance
(Au ; Bu ; Cu ; Du )):
24
Chapter 3.
Let
Distributed control over networks
n (þ) denote the nth largest singular value of a matrix.
ÿ
The following
lemma gives a method to obtain this measure computationally [87]. Lemma 3.4.1 For a system with a state model
(A; B; C; D
ö ü )= A;
B1
B2
û þ 1ý þ
D11
D12
C2
D21
D22
;
C
;
ýõ
2 Sÿ~ ;
its assignability measure is given by d((A; B; C; D );
Uÿ~ ) = min z C minfÿn ([A ú zI 2
n (M ((A; B2 ; C1 ; D12 );
ÿ
B1 B2 ]); ÿ z )); ÿ
n ([A ú zI 0
0
0 0
C1 C2 ]
);
g
n (M ((A; B1 ; C2 ; D21 );
z )) :
Now we are ready to state our result on assignability and switching. Theorem 3.4.2 Let s1 ; s2
2 f0; 1; 2gN with s1 ÿ s2 .
Suppose
P
in Fig. 3.2 is
stabilizable using switching patterns s1 and s2 . Then the assignability measure ~ with s is greater than or equal to that for the system with s , that is, for ü 1 2
s s s (s1 ); D~ s (s1 )); Uÿ~ ) ÿ d((A~s ; B~s ; C~s (s2 ); D~ s (s2 )); Uÿ~ ):
~ ;B ~ ;C ~ d((A
~ in (3.5), we must show that for all From the state model of ü
Proof
(3.9) z
2C
n (M ((A~p ; B~p2 ; C~s1 (s1 ); D~ s12 (s1 )); z )) ÿ ÿn (M ((A~p ; B~p2 ; C~s1 (s2 ); D~ s12 (s2 )); z )); ~ ;B (ii) ÿn (M ((A p ~p1 ; C~s2 (s1 ); D~ s21 (s1 )); z )) ÿ ÿ (M ((A~p ; B~p1 ; C~s2 (s2 ); D~ s21 (s2 )); z )); 02A~ ú zI 31n 02A~ ú zI 31 p p ~ (s ) 5A ÿ ÿn @4 C ~ (s ) 5A : (iii) ÿn @4 C s1 1 s1 2 ~ (s ) ~ (s ) C C s2 1 s2 2 (i)
ÿ
Then, by Lemma 3.4.1, (3.9) follows. We ÿrst show (i). For
p p2 s1 (si
~ M ((A
~ ;B
~ ;C
~ ); D
i
= 1; 2, from (3.6) and (3.7), we have
þ~
ý
~ ú zI B s12 (si )); z ) = C~p (s ) D~ p(2s ) 2 A~s1 úi zI s12 B~i 3 p p2 ~ ~ =4 C D p1 p12 5 ~ (s )C ~ ~ (s )D ~ J J 2I 1 0i p20 31 2iA~ úp22zI p ~ 0 54 C = 40 I p1 ~ ~ 0 0 J1 (si ) C p2 A
3
p2 ~ D p12 5 : ~ D p22 ~ B
(3.10)
3.5.
25
Multiple mobile robot example
s1 ÿ s2 and (3.4), the system matrix in (3.10) for s2 can be obtained by deleting some rows in that for s1 . By Theorem 7.3.9 in [34], we have (i). In a similar manner, (ii) and (iii) can be shown.
Because
ÿ
Thus, the assignability measure for the system in Fig. 3.2 is a monotonically nondecreasing function of
s
with respect to the partial ordering on
f0; 1; 2gN .
This conÿrms our intuition that capability of a system improves with increased communication. The result itself may appear obvious from the partial ordering deÿnition. Comparison of systems can be, however, diÆcult if there is no ordering in their switching patterns. For example, systems with s1 = [1 1 2] and s2 = [2 2 1] use the same data rate over the channel, but cannot be compared analytically with respect to the assignability measure. Moreover, it is not clear how the measure may change by changing the patterns.
3.5
Multiple mobile robot example
In this section, we give a toy example of a multiple mobile robot system to illustrate our results. This example satisÿes the following conditions so that our theory is applicable: The original plant is open-loop unstable, stabilizable, and detectable, and the unstable poles are not assignable without a network structure. So, the system is stabilizable only when signal transmission over the network is allowed. We note, however, that systems with such properties rarely exist, as was shown in [21]; thus, we make assumptions that may appear artiÿcial. Consider the three cart system in Fig. 3.3.
Cart 1 receives reference
r
for the position and is the leader. Carts 2 and 3 follow it trying to keep the distance x0 between them; M is the mass of each cart; x2 + x0 is deÿned to be the distance between Carts 1 and 2 (the desired value of x2 being 0); likewise for ø. i
x3 + x0 . Cart 2 has an inverted pendulum of mass m, length l , and angle However, ø is not sensed. Each cart has a discrete-time local controller K ,
i ; xi ]
= 1; 2; 3, whose input is the measured signal [x
period
T;
0
its output goes through the 0th order hold
control input
u
i.
i
sampled at sampling H
and becomes the
Communication among the carts is to be kept low: signals x2 and x3 are measurable only to Carts 2 and 3, respectively; a communications channel is given only between Carts 1 and 3, and its behavior is determined by a switching pattern
s.
Suppose
K2 is constant (i.e., proportional-derivative control) and is designed so that Cart 2 follows Cart 1 with no consideration for the pendulum.
Thus, the pendulum cannot be stabilized locally by Cart 2. The control problem is to design s, K1 , and K3 so that the whole system, including the inverted pendulum on Cart 2, is stabilized, and x1 tracks r . The state ø is observable only to Cart 3 through signals x3 ; x3 . Hence, its stabilization is possible only by sending signals from Cart 3 to Cart 1 as
26
Chapter 3.
Distributed control over networks
x x +x
x m ÿ þ 2 l ÿ Cart x u x_ M u
+x
1
2
þ 1 ÿ Cart
x x_ M ÿ þ r ÿ 1
3
2
1
1
K
0
ÿ Cart þ 3
x x_ M
H
3
3
H
K
K
2
1
u
3
2
2
0
w
0
H
3
switch box Figure 3.3: Carts system
shown with a switch box in Fig. 3.3. Let
i be the velocity of Cart i for i = 1; 2; 3. Then v1 = x1 ; v2 = x1 ú x2 ; and v3 = x1 ú x2 ú x3 :
v
The dynamics of the linearized system are described as M v1
= 100u1 ;
M v2
= 100u2
M v3
= 100u3 ;
ú mgø;
= (M + m)gø M lø where
g
= 9:8 m=s2 ,
M
= 3 kg,
Assignability analysis:
m
= 0:1 kg,
ú 100u2; l
= 1 m, and
K2
= [0:0001 0:05].
We computed the assignability measures for sys-
tems with switching patterns
i þþþ
z }| {
i := [1
s
Here, period
N
1 0
þþþ
0]
2 f0; 1; 2g6;
i
= 0; 1; : : : ; 6:
= 6 was arbitrarily chosen. Note that
The result is shown in Table 1 for eþectively zero for
i
= 0, and for
i
i ÿ iý
s s 1 ; i = 1; 2; : : : ; 6. = 0:8 s. We see that the measure is = 0 this jumps up to about 2 10 5 .
T
6
ü ý
As shown in Theorem 3.4.2, the measure slightly increases as we allow more 1s in the switching pattern. Based on Lemma 3.2.2, the maximum periods û of controllers were also obtained. Note, however, that the measures were N
K
computed for
K
N
= 6, and it was conÿrmed that controllers of this period
could assign all poles of the plant.
3.5.
Multiple mobile robot example
27
i
In Fig. 3.4, the assignability measure is plotted versus the index i of s ; i = 0; 1; : : : ; 6 for T = 0:2; 0:4; 0:6; 0:8 s. We again observe that the assignability of the system improves by allowing more signals on the network.
Note that
the measure also decreases by reducing the sampling period. Nevertheless, the measure is relative to state-space realizations, and thus comparison between systems with diþerent sampling periods may not be appropriate.
Table 3.1: Results for T = 0:8 switching pattern s0 = [ 0 0 0 0 0 0 ] s1 = [ 1 0 0 0 0 0 ] s2 = [ 1 1 0 0 0 0 ] s3 = [ 1 1 1 0 0 0 ] s4 = [ 1 1 1 1 0 0 ] s5 = [ 1 1 1 1 1 0 ] s6 = [ 1 1 1 1 1 1 ]
assign. measure
ü 10ý13 ý5 1:86 ü 10 ý5 1:91 ü 10 ý5 1:94 ü 10 ý5 1:97 ü 10 ý5 2:00 ü 10 ý5 2:09 ü 10
3:07
K
N
ù
36 54 30 18 18 12
−5
2.5
x 10
assign measure
2
1.5 T=0.2 T=0.4 T=0.6 T=0.8
1
0.5
0 0
2
1
4
3
5
6
index i of s
i
Figure 3.4: Assignability measure for the example
H 1 design
:
Assignability is just one measure of achievable performance,
and we observed that in terms of assignability the diþerence is very small
6
among the systems with i = 0.
1
To compare achievable performance of the
systems from a diþerent viewpoint, a simple controller was designed using the H
N = N
method based on [31] as follows.
K = 6, and T
Let K2
= K3
= [0:0001 0:05],
= 0:2 s. Design a servo controller K1 so that x1 follows
r, where reference r is a step, and ø is small with an appropriate weight.
28
Chapter 3.
Distributed control over networks
For s2 , s4 and s6 , the local controller for Cart 1 was computed. As a result of the lifting technique, we need N integrators and N delays in the design,
K
K
and thus the order of the controller is 33.
Figures 3.5, 3.6, and 3.7 show step responses of positions of the carts, the
ÿ
angle
ø , and the control input u1 for r (t) = 10; t 0. Here, to emphasize the diþerence in performance, white noise w = [w1 w2 ]0 is added to the signal
[x3 ; x3 ]0 communicated over the network. Signals
w1 ; w2 are uniformly distributed and model quantization error, assuming that 19 bits can be used [71].
The system with s2 is sensitive to noise, showing a noticeable oscillation in cart positions, while the systems with s4 and s6 both exhibit almost perfect tracking. However, there is still a slight diþerence in the behavior of
ø;
system with
does not
s6 , ø
goes to zero slowly, but that in the system with
s4
in the
and keeps oscillating. The diþerence among the systems observed in these step response plots is
Te1 r : r 7! e1 , where e1 = r ú x1 , ýr : r 7! ø, and Týw1 : w1 7! ø shown in Figs. 3.8, 3.9, and 3.10. All systems have the same sensitivity of Te1 r and Týr . This explains that all systems have similar tracking abilities. On the other hand, the sensitivity of Týw1 improves
evident in the frequency responses of systems
T
as we increase the signal transmission rate over the network. Through this part
of the simulation, we conÿrm that the achievable performance of the system improves by increasing the data rate over the network; this may not have been clear in the previous part using assignability.
Positions
15 10 5
Cart 1 Cart 2 Cart 3
0 −5 0
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
Angle θ
0.4 0.2 0
Control Input u1
−0.2 0
5
0
−5 0
Time t
Figure 3.5: Step response for
s2
= [1 1 0 0 0 0]
3.5.
Multiple mobile robot example
29
15
Positions
10 5
Cart 1 Cart 2 Cart 3
0 −5 0
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
Angle θ
0.4 0.2 0 −0.2 0
Control Input u1
5
0
−5 0
Time t Figure 3.6: Step response for
s4
= [1 1 1 1 0 0]
15
Positions
10 Cart 1 Cart 2 Cart 3
5 0 −5 0
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
5
10
15
20
25
30
35
40
Angle θ
0.4 0.2 0
Control Input u1
−0.2 0 5
0
−5 0
Time t Figure 3.7: Step response for
s6
= [1 1 1 1 1 1]
30
Chapter 3.
Distributed control over networks
10 5 0
Gain [dB]
−5 −10 −15 −20 −25
s2 s 4 s6
−30 −35 −3 10
−2
10
−1
10
0
10
1
10
Frequency [rad/sec] Figure 3.8: Frequency response for
Te1 r
0
−10
Gain [dB]
−20
−30
−40 s2 s4 s
−50
6
−60 −3 10
−2
10
−1
10
0
10
Frequency [rad/sec] Figure 3.9: Frequency response for
Týr
1
10
Multiple mobile robot example
31
100 90 80
Gain [dB]
3.5.
70 60 50 40
s2 s 4 s6
30 20 −3 10
−2
10
−1
10
0
10
Frequency [rad/sec] Figure 3.10: Frequency response for
Týw1
1
10
Chapter 4 Finite data rate control | single-input case The goal of this chapter is to design a stabilizing controller for a single-input linear time-invariant plant that has a ÿnite data rate channel in the feedback loop.
This is done in two steps: First we propose a design method for a
sampled-data controller that stabilizes the plant in a quadratic manner. Then this result is extended to a design problem of a quantizer in a sampled-data controller.
To provide stability criteria applicable to the diþerent types of
systems, we give a general class of switched systems in the ÿrst section.
4.1
Dwell-time switched systems and their stability
In this section, we introduce a class of hybrid systems, called dwell-time switched systems , and develop Lyapunov-like stability results for such sys-
tems. The objective of this section is to give stability criteria for the later sections. Consider the nonlinear switching system x(t)
where
x(t)
2 Rn
is
s0
2S
fj
fi(t) (x(t));
(4.1)
is the state of the system and
switching logic with the index set
assumed that
=
S
i(t)
2S
is globally Lipschitz continuous for each
such that
0 (0)
fs
is the state of the
and the ÿxed dwell time
= 0. The index set
S
j
T
>
0. It is
2 S and that there
may be a discrete set, as in
most switching systems, but may be a continuum as well. The switching logic generates are switching times (ii) i(t) = jk0
jk
= s0 for
k
i
from
2
on [tk ; tk+1 ), and (iii) if 0
ÿ k.
x.
It is assumed that for every
ftk gk Z+ and fjk gk Z+ ø S 2
x(tk )
such that (i)
= 0 then i(t) =
s0
i
there
ÿ tk + T , for t ÿ tk , i.e.,
tk+1
H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 33−81, 2002. Springer-Verlag Berlin Heidelberg 2002
34
Chapter 4.
Finite data rate control | single-input case
Because of the rule (i), there is no chattering in the switching, and thus there is no need to introduce generalized solutions [85]. The trajectory the system is uniquely deÿned satisfying (4.1) for all times
=
t
tk , k
2Z
ÿ
t
+.
þ
x(
) of
0 except at switching
From the rule (ii), it follows that for the system (4.1) the origin is an
equilibrium point
in the sense that if
x(0)
= 0 then
x(t)
= 0 for
ÿ
t
natural to give stability deÿnitions for such systems as follows. Deÿnition 4.1.1 The origin of the system (4.1) is (i)
there exists
Æ >
(ii) (globally )
!
x(t)
0 as
t
0 such that
k k x(0)
asymptotically stable
!1
< Æ
implies
stable
k k x(t)
< ÷
0. It is
if, for every ÷ for all
if it is stable and if, for every
t
ÿ
x(0)
.
0,
>
0 and
2R
n
,
We also give another looser but more general deÿnition of stability. Here
B
x (r )
denotes the closed ball center
Deÿnition 4.1.2 Let
B
0 (r0 ) is
DøR D
n
attractive from
Note that if
D
=
Rn
x,
and
radius
r0
if, for every
ÿ
r.
0. For the system (4.1), the ball
x(0)
2D
,
x(t)
converges to
B
0 (r0 ).
and r0 = 0 then attractiveness of the ball is equivalent
to asymptotic stability. A simple extension of standard Lyapunov stability theory (e.g., [45], Theorem 3.1) gives the following suÆcient conditions for stability of the system (4.1). Proposition 4.1.3 Let P
R
2R
nþn
a positive-deÿnite function. Let
suppose that, for every trajectory
ö
then
Then the ball
B
@V @x
0 (r0 )
õ
fi(t)
be a positive-deÿnite matrix and
x(
(x(t))
(x) :=
þ
V
Given
DøR
n
) of the system (4.1), if
ýú D s
ö(x(t))
is attractive from
r0
0
x P x.
= r1
for
t
ÿ
x(t)
ö
and
:
Rn
2 D nB r1
ÿ
!
0,
0 (r1 )
0:
for the system (4.1), where
þmax (P ) þmin (P )
:
If the conditions in the proposition are satisÿed with a Lyapunov function V
R
(x) = nþn
from and
0
x Px
and a function
ö
of a quadratic form
is positive deÿnite, we say that the ball
D with respect to
r0
B
ö(x)
0 (r0 ) is
=
0
x J x,
where
(P; J ) for the system (4.1). In particular, if
= 0, we say the system is
quadratically stable with respect to
D
=
S
2
Rn
(P; J ).
Our main goal is to develop systems in which the communication of quires only a ÿnite data rate. That is,
J
quadratically attractive
i
re-
is ÿnite and there are ÿnite switchings
in ÿnite time. The convention of dwell time in our switching logic is to meet the property of ÿnite switchings in ÿnite time. Dwell-time switching was studied extensively in [59], but not in the context of ÿnite data rate. We see in the next section that a rather simple sampled-data system falls into this class of systems.
4.2.
35
Quadratic stabilization of sampled-data systems
4.2
Quadratic stabilization of sampled-data systems
In this section, we study a problem of stabilizing a continuous-time linear plant by a sampled-data controller. The stability to be achieved is quadratic stability deÿned in the previous section. 4.2.1
Problem formulation
Consider the continuous-time control system in Fig. 4.1(a). The pair (A; B ) represents a linear time-invariant plant given by x(t)
where A
6
2 Rn
x(t)
= 0, that
A
=
is the state and
Ax(t)
+ Bu(t);
2R
u(t)
(4.2)
is the control input. Assume that
is unstable, i.e., it has one or more eigenvalues with nonnegative
real parts, and that (A; B ) is stabilizable. The matrix
K
2 R1
n is chosen
þ
so that the closed-loop system is stable, i.e., all eigenvalues of
A
+
BK
have
negative real parts. Since
A
+ BK is stable, the system is quadratically stable with respect to
a positive-deÿnite
2
P
(A + BK ) Now let
V
of
:
V
of V
With
u
=
K x,
2 Rn
0
x P x; x
0
P
+ P (A + BK ) =
. For a trajectory
(x(t)) is determined by
Rn ü R ! R
decay rate V
(x) :=
V
2
J Rnþn , there exists n þn R that satisÿes the Lyapunov equation
some pair (P; J ). For example, given a positive-deÿnite
J
ú þ
(4.3)
J:
x(
) of the plant (4.2), the
in the following sense: The derivative
along the trajectories is deÿned by 0
(x; u) := (Ax + Bu)
we have
V
(x; K x) =
ú
Px
0
+x
P (Ax
+ Bu):
0
x J x.
Now suppose that the communication channel between the sensor and the actuator has a ÿnite data rate. The traditional way to communicate
u(t)
is to
sample and quantize. The ÿrst step to model such a setup is to add a sampler and a hold to the system as in Fig. 4.1(b). An sampling period T >
0 is deÿned by
T :õ
S
the
zeroth order hold HT
T : õd
H
and
0
u
2R
7!
v
:
7!
ideal, uniform sampler ST
d : õd (k ) := õ(kT );
õ
k
with
2 Z+
;
is given by v (t)
:=
d (k );
õ
t
2
[kT ; (k + 1)T );
k
2 Z+
;
. Observe that u(t)
=
0
u K x(kT )
for
t
2
[kT ; (k + 1)T ); k
2 Z+
:
(4.4)
36
Chapter 4.
Finite data rate control | single-input case
u(t)
-
-
x(t)
(A; B )
?
x(t)
u(t)
(A; B ) u0
ÿ
K
v (t)
K
6
ÿ
T
H
(a) Continuous-time system
ÿ
T
S
d (k )
õ
õ(t)
(b) Sampled-data system
Figure 4.1:
This system can be viewed as a dwell-time switched system (4.1) by taking f ( ) (x(t))
ut
=
Ax(t)
+
is always dwell time
T
Bu(t)
S
and
=
R.
Obviously
f0 (0)
= 0 and there
between switching times. Moreover, if
k ) = 0, it
x(t
2 [tk ; tk+1 ), from (4.4) and consequently x(tk+1 ) = 0; repeating this argument, we have u(t) = 0 for t ÿ tk . Thus, we can use the follows that
u(t)
= 0; t
stability deÿnitions in Section 4.1.
The question of interest here is, how large can
T
be while maintaining the
stability of the closed-loop system with respect to the same Lyapunov function at a decay rate similar to that of the original system? Here we allow a scalar multiplication on clear that
u0
K
and have
u0
in Fig. 4.1(b). In our treatment it becomes
= 1 is not always the best choice.
The problem of this section is stated as follows. a sampling period
T
and a scalar
u0
Given
÷
2
(0; 1), ÿnd
such that the closed-loop sampled-data
system in Fig. 4.1(b) is quadratically stable with respect to (P; ÷J ). We call this problem the sampled-data control problem . Notice that the choice of system. Larger
÷
÷
changes the decay rate of the sampled-data
guarantees faster decaying. We also note that
÷
being strictly
smaller than 1 is a key assumption; we elaborate more on this later. A similar but simpler problem is to ÿnd the sampling period for the discretized system. That is, discretize the plant to (Ad ; Bd ) via step-invariant transformation and ÿnd the sampling period for all eigenvalues of
d + Bd K
A
to have absolute values less than 1. In contrast with this simpler problem, we emphasize that in our problem the decay rate of trajectories is bounded and that the stability is in the continuous-time domain, i.e., the intersample behavior is taken into account. In our setup, since the full state is measured and there is no noise, one way to achieve better control is to send the sampled state the sampled control
K x(kT )
x(kT )
instead of
from the sensor side to the actuator side. Then,
on the actuator side, we can place an open-loop controller whose state is reset periodically. Such a scheme has an obvious advantage over ours and is consid-
4.2.
37
Quadratic stabilization of sampled-data systems
ered in [74]; however, this paper does not deal with quantization issues. We stress that the objective to study our sampled-data problem is to lay the basis for the problem with quantization in Section 4.3. 4.2.2
Control Lyapunov function approach
Our objective in the sampled-data control problem is to achieve quadratic stability for a ÿxed Lyapunov function
V
and a bound on the decay rate. In
other words, we look for a control law, or the switching logic on a continuum, so that the positive-deÿnite function
V
becomes a Lyapunov function for the
V control Lyapunov function (e.g., [48]). In the following, we limit the class of K to LQR optimal ones for technical reasons and analyze function V for such K . We note that this assumption may
sampled-data system to guarantee its stability. In this design methodology, is referred to as a
not be such a restriction since LQR optimal gains are known to have inÿnite gain margin and fairly large phase margin; in particular, 1+ ( )ý1
j K j!I úA B j ÿ ! 2 R [2, 43]. These robustness properties are suitable for our design. Let Q 2 Rn n be a positive-deÿnite matrix and R > 0. Denote by P the
1 for
þ
unique positive-deÿnite solution of the Riccati equation
A P + P A ú P BR 1 B P + Q = 0:
(4.5)
úR 1B P .
Now let the control
0
Deÿne the stabilizing feedback matrix
V (x) := x P x. u = Kx is V (x; Kx) = úx Jx, where 0
Lyapunov function be
ý
0
K
:=
ý
P,
For this
0
0
the decay rate of
V
when
J := ú(A + BK ) P ú P (A + BK ) = Q + P BR 1 B P > 0: 0
ý
By (4.5) and the deÿnition of
K, V
0
can be alternatively represented as
V (x; u) = x (A P + P A)x + 2x P Bu 1 = x (P BR B P ú Q)x + 2x P Bu = x K R(Kx ú 2u) ú x Qx: 0
0
0
ý
0
Fix
÷ 2 (0; 1).
For
0
0
0
0
0
u 2 R, let X (u) be the set of states for which u decreases
the control Lyapunov function:
n o X (u) := x 2 Rn : V (x; u) ý ú÷x Jx n = fx 2 R : x K R [(1 + ÷)Kx ú 2u] ý (1 ú ÷)x Qxg : 0
0
0
0
Q = 1=(1 ú ÷)I without loss of generality (we can x to give this Q). Now, let
In the following, we select always transform
M := (KerK )
?
Then
K:
= Im
0
e := K =kK k constitutes an orthonormal basis for M. 0
(4.6)
38
Chapter 4.
Finite data rate control | single-input case
Finally, letting := (1 + ÷)R
ÿ
X
we obtain a simpler expression for Lemma 4.2.1 For u
X
+y :
õe
there exist unique
õ
;
2X ,
+y
(u)
2R
õ
?
y
and 0
(õe + y )
, , k k 0
ú
(ÿ
;
õ
K
K
2M
?
y 0
K R[(1
Rn
0
õ(1
K R
2
õ
+y :
õ
=
õe
õ
K
?
y
, for any
K
2u]
2ÿ
2u
u
(1 + ÷)
K
B
2R 2M
?
y
+
õ
K
K
(ÿ
ú
y
:
(4.8)
2 Rn
,
0
(õe + y ) (õe + y )
(since
y
y
2+y y
?
0
K ; e)
0
õ
ÿ
:
0
u =(1 + ÷).
1)õ
ò
y
õ
and u ~0 =
;
0
2
e)
2 2 k k ý kk 2 k k ýk k
k k
;
+y
0
K
x
+ y . Then
ú ý
õ e e
ý
u
~ := (1 + ÷) For simplicity, let B
õe
M÷M
0
2u]
+ ÷)K
(1 + ÷)
1)õ
(4.8) can be expressed as
(1 + ÷)
ú ý 2ý k kú ý
(by deÿnition of
(ÿ
ñ
+ ÷)K (õe + y )
þ
, ú , ú 2ú
(u) =
x
k k ýk k
u
2ÿ
=
such that
[(1 + ÷)K õe
þ
õÿ
ú
2 1)õ
(by (4.6))
0
õe K R
X
(4.7)
;
(u).
Corresponding to the decomposition
Proof
õe
K
,
ÿ
(u) =
2R 2R 2M
k k2
2
ú
2ÿuõ
~ ), Then, for (A; B
ý k k2ô y
:
(4.9)
Figure 4.2 depicts a system equivalent to that in Fig. 4.1(b). Now the control input is
u(t)
=u ~0 e0 x(kT ) for
t
2
[kT ; (k + 1)T ); k
2 Z+
.
From (4.9), we see that the state space can be parameterized by
õ
and
kk y
for the purpose of stabilization. This is not diÆcult to picture because the continuous-time control x
in
M
u
=
Kx
depends only on the component of the state
. This observation motivates us to deÿne
þ
F (x)
Clearly,
F
kk õ y
;
where
õ
=
0
e x
and
: y
Rn
=
! R2 ú
x
by (4.10)
õe:
preserves norm. Then, from (4.9), we have the image of
F:
F
=
ý
F
X
ÿþ ý (u) =
õ ô
2
R2
:
ô
ÿ
0; (ÿ
ú
2 1)õ
ú
2ÿuõ
ý
2 ô
X
(u) under
ò :
(4.11)
4.2.
39
Quadratic stabilization of sampled-data systems
u(t)
-
x(t)
~) (A; B
?
u ~0
0
e
6 T
H
ÿ
ÿ
T
S
Figure 4.2: Equivalent sampled-data system
ÿ
ÿ=
p ýÿ (
þ ÿ 2ýuþ
1) 2
F X (u)
ÿ ÿÿ1 u
0
þ
Figure 4.3: Set
Figure 4.3 shows this set for particular the line
õ
=
ÿu=(ÿ
ú
2X
(u)
We refer to the image space of
F
X
(u)
1 and
2 Rn , 2 X
1). Clearly, for x
F
ÿ >
F (x)
as the
u >
0. It is symmetric about
,
x
F
õ-ô
(u):
plane
(4.12)
.
The following result gives a suÆcient condition for quadratic stability of the system in Fig. 4.2. We make use of of the
õ-ô
F,
so the result is expressed in terms
plane.
Lemma 4.2.2 If for every initial condition x(0) F (x(t))
2 X F
(~ u0 e
0
x(0))
for
2 Rn 2
t
the trajectory satisÿes
[0; T ];
(4.13)
then the closed-loop system in Fig. 4.2 is quadratically stable with respect to (P; ÷J ). Proof
By (4.12), the condition in (4.13) is equivalent to
for
[0; T ].
t
2
x(t)
2X
0 (~ u0 e x(0))
Since this holds for any initial condition, we have
x(t)
2
40
X
V
Chapter 4.
0 (~ u0 e x(kT )) for
2
t
Finite data rate control | single-input case
[kT ; (k + 1)T ); k
(x(t)) decreases at all times: V
(x(t); u ~0 e
0
2 Z+
. Thus, by the deÿnition of
ýú
x(0))
Xþ
( ),
0
÷x(t) J x(t):
ÿ
This implies quadratic stability with respect to (P; ÷J ).
The diÆculty encountered by the use of sampled-data controllers for quadratic stability is that during the sampling period the plant is controlled blindfold, but the control input has to be guaranteed to keep
V
decreasing during
that period. The result above states this intuition formally. It implies that, to achieve stabilization, the next step is to choose u ~0 so that, starting at the trajectory
x(t)
X
stays in
and, moreover, that this step can be done in the
4.2.3
x(0),
0 (~ u0 e x(0)) for the entire ÿrst sampling period
õ-ô
plane.
Bounds on trajectories
We give a preliminary result on the bound of the state trajectory during the sampling period, characterized in the We ÿrst give some notation. For
Observe that Suppose
k úk ! !
:= max
cT
2[0;T ]
t
cT ; dT
cT
cT >
0.
1:
õ
=õ ~.
:
ö
ú új jú j j
ÿ
f
ô
aT (õ
F~
û (u)
max
õ ~)
õ ~
dT cT
(4.15)
u
û
û
ò
g
+ ý f ~ (õ; u); f ~ (õ; u); 0
(4.16)
:
is the shaded area and is a cone symmetric
The following proposition says that if control
6
0
Aü
, deÿne the functions
and the set
T
ú
t
2R
f
taking
A
1. Let
aT
F~
0, let
T >
s
Now, for õ ~
plane.
cT
2 [0; T ].
t
0. We have
ó ó Z t ó At ó Aü ~ ó kx(t) ú x0 k = óe x0 + e B dý u ú x0 óó 0 óZ t ó ó At ó ó ó Aü ~ ó ó ó ý e ú I þ kx0 k + ó e B dý óó u
ý c kõe ~ +y ~k + d u ý c (kõe ~ k + ky ~k) + d = c (õ ~ + ky ~k) + d u T
Tu
T
t
2 [0; T ].
0
T
T
for
y ~
1, then F (x(t))
Proof
2 R,
T
(since
kek = 1)
This proves the ÿrst claim.
r := c (õ ~ + ky ~k) + d u. 2 B( ~ k ~k) (r) for all t 2 [0; T ]. Proof From the ÿrst claim x(t) 2 B 0 (r ) = B ~ +~ (r ) for t 2 [0; T ]. Here ñ ô 2 2 B ~ +~(r) = õe + y : k(õe + y) ú (õe ~ +y ~)k ý r ñ ô 2 2 2 = õe + y : k(õ ú õ ~ )ek + ky ú y ~k ý r (since e ? (y ú y ~)) ñ ô 2 2 2 = õe + y : jõ ú õ ~ j + ky ú y ~k ý r ô ñ 2 ø õe + y : jõ ú õ~j + jkyk ú ky~kj2 ý r2 :
The second claim is shown in two steps. Let F (x(t))
Step 1
x
ûe
y
T
û; y
ûe
y
T
42
Chapter 4.
The last inclusion holds since F (x(t))
2
ÿþ ý õ
:
ô
Finite data rate control | single-input case
jkyk ú ky~kj ý ky ú y~k.
jõ ú õ~j
2
+
jô ú ky~kj ý r 2
Thus, we have
ò
2
Now we must show that the ball in Step 1 is in Fig. 4.4). If
Proof
It suÆces to show that
cT
0 and õ ~ h(õ; u)
= 0, as a quadratic equation of
ÿu=(ÿ
õ,
ú 1).
Then
ýÿg
û~
f
with
u
iþ
has either a single real solution or
complex solutions.
ý Denote by õf the point that the line fû ~ crosses the õ-axis. Note2 that ý 2 2 (fû ~ ) and g are parabolas with vertices at (õf ; 0) and (ÿu=(ÿú1); ú(ÿu) =(ÿú 1)), respectively. By deÿnition, õf < õ ~ and moreover, by hypothesis, õf < ý2 2 ÿu=(ÿ ú 1). Hence, the vertex of (f ) is on the left of that of g . û~ ý ý 2 Now suppose fû ~ ÿ g with u. It then follows that (fû~ ) is above or tangent to g 2 for õ ý õf . From the position of the vertices of the parabolas, clearly ý2 ý2 2 (fû g for õ ÿ õf as well. Thus, (fû ~ ) is above ~ ) is above or tangent (at one 2 point) to g . This is equivalent to h(õ; u) = 0 not having two real solutions. Proof
ÿ
The converse can be shown similarly. Lemma 4.2.8 Suppose õ ~ > 0 and õ ~
1
p
T
(4.25)
a
ÿ
+1 +
a
T cT
d
ýú ú ÿ
2
1). So if
1 2
ÿ
and
þ a
T ú
a
This is equivalent to (4.24) and
ú
T
a
q
+1
ÿ
T
c
u
0:
ÿ
+1
0. Thus, the
(4.28)
0:
p
1=
ý
1
(4.27)
T =cT in (4.27) is negative. Moreover, by deÿnition, dT =cT
condition (4.27) holds iþ
ú
T
ÿ.
Thus clearly the bounds in (4.23)
and (4.24) imply the condition (4.28), which is equivalent to the condition (4.25).
4.3.
Quantized sampled-data control
47
ý ý
ú T ú ú
To show the converse, we need to prove only that the condition (4.28) implies (4.23). By d
(4.28)
)
T =cT
þ
q2
> 0 and the assumption õ ~
ú ú 2ú T 2 ) Tú , T p ÿu
0
1
a c
1
0, the negative case being symmetric. By (4.22), the condition (4.18) is equivalent to
Since both
Fû~
+
û~
f
0
ÿ
(u ~ õ) ~ and F
X
ý ÿ g with u = u~0 õ: ~
û~
g and f
ý0 û~
ÿ
(u ~ õ) ~ are symmetric about the line õ = õ ~ by
(4.19), we have to show only f
0
g with u = u ~ õ ~ for õ ~ > 0.
Lemma 4.2.8, this is equivalent to the bounds on c
T
T
and d
However, by in (4.20) and
ÿ
(4.21), respectively.
Fû~
We give some remarks as follows. due to the size of Proposition 4.2.3.
Conservativeness in the size of T
is
(u), which was shown to give bounds on tra jectories in
In the examples in Section 4.7, we compare this sampling
period with that necessary for the stability in the discrete-time sense.
X
Next we discuss the assumption that ÷ be strictly smaller than 1. Suppose ÷ = 1. Then
(u) in (4.6) has the form
X
(u) =
f 2 Rn x
0
:
0
2x K R (K x
ú ýg
which does not depend on Q explicitly anymore. can be expressed as
F
X
(u) =
ÿþ ý õ ô
2 R2
:
ô
ÿ X
Thus, it follows that when ÷ = 1 the set F
2
0; õ
u)
0
;
Now, from (4.11), F
úk k ý u
K
õ
ò 0
(u)
:
Fû~
(u) becomes simply a strip. Clearly,
the construction in Theorem 4.2.5 does not work because cones ÿt in strips.
4.3
X
(u) don't
Quantized sampled-data control
In this section, the sampled-data control problem is studied further, taking account of the eþect of quantization. We consider the design of quantizers by
48
Chapter 4.
Finite data rate control | single-input case
extending the results in the previous section, and thus the development is done in a similar manner. A few speciÿc quantizers are examined for comparison of their characteristics. 4.3.1
Problem formulation
A quantizer is usually deÿned as a function from
R to a ÿnite or countable set.
We give a fairly general treatment of quantizers with the following deÿnition in terms of partitioning. Deÿnition 4.3.1 A
countable
2 Q0 f j gj ø R
(ii) 0 q
fQj gj of R is a set of bounded intervals with a Z+ such that (i) Qi \Qj = ;, i 6= j , and [j Qj = R,
partition
S
index set
=
, and (iii) for
2S
with
0
q
6
2S
= 0, 0
j
= 0, a
2 Qj =
quantizer
Q(õ)
=
cl( Q
j if
q
õ
2S
R ! fqj gj
:
fQj gj
). Given a partition
2 Qj 2 S S ;j
Some remarks are given as follows. (1) We use
2S
2S
and
is deÿned by
(4.29)
:
=
Z+ here for deÿniteness,
but may use other countable sets elsewhere. (2) The property (iii) of partitions is due to technical reasons. Consequently, the origin is either in a bounded cell, e.g, [a; b] with
a
÷
0 such that, for some
R
2
(0; 1),
r0
ÿ
0,
n with respect to (P; ÷J ) for 0 (r0 ) is quadratically attractive from the closed-loop quantized sampled-data system in Fig. 4.7.
the ball
Note that the radius r0 of the attractive ball is not prespeciÿed in this problem, though as a design problem it is preferred to be so. In general, this turns out to be diÆcult. There are, however, examples of quantizers that allow us to choose r0 prior to the design. We also remark that if r0 = 0 the control objective is that the closed-loop system be asymptotically stable. 4.3.2
A suÆcient condition for stability
For the sampled-data control problem in the previous section, there is a suÆcient condition expressed in the
õ-ô
plane. In this section, the approach taken
there is generalized to the case of discrete-valued control inputs. We give a suÆcient condition for closed-loop stability, an extension of Lemma 4.2.2, that holds for any quantizer. For
e
and the quantizer
Q
:
R
the state space that is mapped to
X
j :=
= Such sets form a partition of
!f g qj
f 2R f 2R x
x
Rn
X 2S
j 2S , let
j, j
qj :
n
n
: :
0
Q(e x) 0
e x
= qj
2Q g j
, be the subset of
g
(4.32)
:
and are referred to as state partition cells . In
terms of state partition cells, the quantized sampled-data controller works as follows: If
x(kT )
2X
j then u(t) = uj for t
2
[kT ; (k + 1)T ),
k
2 Z+
.
The following lemma is used repeatedly in our development (see Fig. 4.8).
50
Chapter 4.
Finite data rate control | single-input case
rc2
E
rc1
Figure 4.8: An ellipsoid and balls
Lemma 4.3.2 Let
E
be an ellipsoid of the form
E = fx 2 Rn The largest ball contained in smallest ball containing Proof 0
x Px
ÿ
If
0
E
is
ý rc21 ,
then 2 þmin (P )rc2 = c. x x
E
0
:
is
x Px
B0 (rc1 )
ý cg ;
0
0:
pc1 =
with
B0 (rc2 ) with rc2 =
x Px
c >
r
p
c=þmax (P )
and the
c=þmin (P ).
ý þmax (P )rc21
=
c.
Similarly, if
0
x x
ý rc22 ,
ÿ
Now the idea used in Lemma 4.2.2 is extended to the state partition case in the following result. Lemma 4.3.3 Suppose the quantized sampled-data controller is designed to
have the following properties: 1. There exists r1 for t [0; T ].
2
2. For
j
ÿ 0 such that if x(0) 2 X0 then F (x(t)) 2 F X (0) [B0 (r1 )
2 S , j 6= 0, if x(0) 2 Xj
then
F (x(t))
2 F X (uj ) for t 2 [0; T ].
Then, for the closed-loop system in Fig. 4.7, the ball attractive from
Rn
with respect to (P; ÷J ), where
s
r0
Proof
for every
= r1
þmax (P ) þmin (P )
B0 (r0 ) is quadratically
(4.33)
:
It follows from the assumptions and the deÿnition of j
2S
if
x(0)
2 Xj
x(t)
2
then
(
X (0) [ B0 (r1 ); X (uj );
if
j
if
j
= 0;
6= 0;
F
in (4.10) that
4.3.
for
2 [0; T ].
t
which
x(kT )
B0 (r1 ) here is in Rn .
Note that j
2
(
2 [kT ; (k + 1)T ]; k 2 Z+.
t
Lyapunov function
B0 (r1 ).
is in
X (0) [ B0 (r1 ); X (ujk );
j
if
j
6 0; k=
This means that at each
The smallest level set containing
from Lemma 4.3.2.
E0 .
k = 0;
if
t
:
V
(x)
B0 (r1 ) is
ý þmax (P )r12 g
This is an invariant set, and hence
Finally, the smallest ball that contains
Lemma 4.3.2.
4.3.3
ÿ 0 either the control X (þ), or else x(t)
(x(t)) is decreasing, by the deÿnition of
V
E0 = fx 2 Rn stays in
Thus, denoting the cell index
k , we have
is in by
x(t)
for
51
Quantized sampled-data control
E0
x(t)
is
goes into and
B0 (r0 ),
again by
ÿ
Solution to the quantized sampled-data problem
The similarity of the quantized version of the sampled-data problem to the original problem is obvious now. This problem too is fully described in the õ-ô
plane. The diþerence in Lemmas 4.2.2 and 4.3.3 is how the initial states
in Lemma 4.2.2 and x(0) 2 Xj in Lemma 4.3.3. Fû~ (u), deÿned in (4.16), gives a bound on the trajectories starting on x(0) = õe ~ + y; y 2 M? with a constant control u 2 R. To continue the parallel discussion, deÿne the set FQj (u); j 2 S ; u 2 R, by
are classiÿed: õ ~ =
0
e x(0)
Recall that the cone
FQj (u) :=
[
Fû~ (u):
û~ 2Qj
This gives a bound on the trajectories starting in closure of
FQj (uj ) is another cone expressed as
FQj (uj )) =
ÿþ ý õ
cl(
ô
:
ô
ÿ max
n
ÿ
j
:=
oò
+ ý sup Qj (õ; uj ); finf Qj (õ; uj ); 0
f
and this set is symmetric about the line
q
Xj by Proposition 4.2.3.
õ
1
ö jsup Q j ú jinf Q j
2
a
j
j
T
=
q
ÿ j , where
+ sup
Qj + inf Qj
;
The
(4.34)
õ :
(4.35)
These are not diÆcult to check and we skip the proof; see Fig. 4.9. We can now reduce the suÆcient condition for stability to the following one. Proposition 4.3.4 Suppose cT < 1. If
(i) there exists
1 ÿ 0 such that FQ0 (0) ø F X (0) [ B0 (r1 ), and
r
Chapter 4.
52
Finite data rate control | single-input case
ÿ
ÿ finf Qj
+ fsup Qj
0
inf
Q
j
Q
j
sup
qþ
Q
þ
j
j
FQj (uj ))
Figure 4.9: Set cl(
(ii) for j
2 S , j 6= 0, FQj (uj ) ø F X (uj ),
then, for the closed-loop system in Fig. 4.7, the ball attractive from
Proof x(0)
Rn
By the deÿnition of
2 X0 ,
B0 (r0 )
is quadratically
with respect to (P; ÷J ), where r0 is as in (4.33).
FQ0 (0), the condition (i) implies that, for every 2 F X (0) [ B0 (r1 ).
the tra jectory of the system satisÿes F (x(t))
Similarly, it follows from the condition (ii) that, for every tra jectory starting
2 Xj , j = 6 0, F (x(t)) 2 F X (uj ) for t 2 [0; T ]. B0 (r0 ) is quadratically attractive. at x(0)
The symmetry of the two sets F
6
Thus, by Lemma 4.3.3,
ÿ
X (uj ) and FQj (uj ) plays an important X (uj ) is symmetric about the line
role here. We have seen that, for j = 0, F õ = ÿ=(ÿ
ú 1)uj = qj and FQj (uj ) about õ = qjÿ .
Figure 4.10 illustrates these
two sets. Thus, in our framework, to maximize the sampling period T , it is natural to take the quantizer's output values to be
(
qj =
0;
if j = 0;
qj ;
if j = 0:
ÿ
(4.36)
6
Recall that, by Deÿnition 4.3.1, q0 = 0. For this choice of
fqj gj2S , the suÆcient
condition in Proposition 4.3.4 for closed-loop stability can be further reduced to bounds on T in the following theorem. Let q j
2 S , j 6= 0.
j
:= min
fjsup Qj j; jinf Qj jg for
We are now ready to construct a stabilizing quantized sampled-data controller.
4.3.
53
Quantized sampled-data control
ÿ
FQ (uj ) j
F X (uj )
qj qjþ
0 Figure 4.10: Sets
u ~0
3.
j)
and
F
X (u
j)
for
j
6= 0
2 (0; 1). Select the matrix Q and R > 0 in (4.5) so that ÿ > 1. ú 1)=ÿ. Given fQ g 2S , set fq g 2S as in (4.36). For j 2 S ; j = 6 0, select T > 0 small enough that
1. Fix
2.
FQj (u
þ
÷
Set
= (ÿ
j
j
j
j
j
cTj
Tj >
0, let
r0
f
= max sup
Tj
+
q
0. Set
s
Q0 ; ú inf Q0 g
T
2
a
Tj
j
There always exists such 4. If
j
úÿ+1
(4.37)
)
(4.38)
:
= inf j 2S ;j 6=0 Tj .
þmax (P )
p
þmin (P ) aT
ÿ (aT
p + 1) : ú ÿú1
(4.39)
Clearly, we may have T = 0, in which case a controller does not exist for the given quantizer. We call all quantizers that yield
T >
0
stabilizing
quantizers.
For such quantizers, we have the main theorem. Theorem 4.3.5 With the quantized sampled-data controller just deÿned, for
the closed-loop system in Fig. 4.7, the ball from
R
n
with respect to (P; ÷J ).
In Proposition 4.3.4, the
j
B0 (r0 ) is quadratically attractive
= 0 case and the nonzero
j
case have diþerent
conditions and can be dealt with separately. The proof of the theorem is based on two lemmas in the following. We ÿrst give a lemma related to
j
= 0.
54
Chapter 4.
Finite data rate control | single-input case
ÿ
F
F[ÿ
r;r
X (0)
+ (þ; 0)
=
ÿ
fr
](0) ÿ
p
=
ý
ÿ1
ÿr1
r
0
þr1
þ
F[ý ](0) and F X (0)
Figure 4.11: Sets
r;r
pÿ 0 be large enough that Q0 ø [úr; r] and T
Lemma 4.3.6 Let r
enough that
cT
0 small
+ 1. Then
FQ0 (0) ø F X (0) [ B0 (r1 ); where
1=
p
(4.40)
ÿ (aT
r
aT
ú
p + 1) r: ÿ ú1
(4.41)
The condition (4.40) is equivalent to
Proof
FQ0 (0) \ (F X (0)) ø B0 (r1 ): c
Since
ÿþ ý
FQ0 (0) ø F[ý ](0) =
õ
r;r
ô
:
ô
ÿ max
ñ + ý (õ; 0); 0ô f (õ; 0); f
ý
r
r
ò ;
it is suÆcient to show that
F[ý ](0) \ (F X (0)) ø B0 (r1 ): c
r;r
Observe F
X (0) =
Figure 4.11 shows sets (F
X (0))
c
.
ÿþ ý õ ô
2 R2
F[ý ](0) r;r
:
ô
and
p
ÿ maxf F
X (0);
ÿ
(4.42)
ò
ú 1jõj; 0g
:
the dark area is
F[ý ](0) \ r;r
þ
4.3.
Quantized sampled-data control
55
p ú
p
0 r1 ] be the intersecting point of lines ô = ÿ 1 õ and ô = + + fr (õ; 0). This point exists since cT < 1= ÿ + 1 implies the slope aT of fr is
Let [õr1
larger than
ô
p ú ÿ
1. Then, by direct calculation,
óóþ ó p ú óp ú T ú
óóþ ýóó óó r1 óó = r1
(aT + 1)r
õ ô
a
1
ÿ
ýóó ó= 1 ó
1
ÿ
1
r :
ÿ
Hence, (4.42) and consequently (4.40) follow. We now have a result for nonzero
2S
Lemma 4.3.7 For j
enough that
T
c
p
X
j=
u
ÿ
Lemma 4.2.8, and the bounds on Proof of Theorem 4.3.5 c
u
=
cl(
) and
j (u
j ))
ø X F
(uj ).
j
u :
(uj ) are symmetric about the same line
ý
j
ú ÿ
1
q
Therefore, it follows from (4.43) that
Since
with
=
ý
inf Qj
f
ÿ
g
with
u
=
u
>
p
ÿ >
1.
and hence
q
From (4.30) and (4.36),
r; r ].
g
2 Qj
); the other case is
j. 0. By the bound on cT , we have aT
q
[
ÿ
ý
inf Qj
and
g
Consequently, it suÆces to show
Q0 ø ú
1 FQ
0). We show for (0;
(uj ) is a closed set, we must show that cl(
f
=
:
is a bounded interval and 0
1
By (4.34), this is equivalent to
õ
+1
ÿ
(u ).
F
is in either (0;
symmetric.
T
1
By Deÿnition 4.3.1,
Qj
Proof
hence
ÿ
j
.
;
((
+1
ÿc
T
Then
1 ÿ
d
with
2S
j
T and
(4.43)
j= u
j
ÿ
ú ÿ
1
ÿ
j
q :
ÿ ú (ÿ
1)=ÿq . We can now apply
r
j
ÿ f Q0 ú Q0g ÿ
ý T and dT imply fqj
c
Let
ÿ
j
q :
= max sup
;
g
inf
with
u
=
j.
u
0. Clearly,
T in (4.14) are nondecreasing functions of
d
ÿ
T,
56
c
T
Chapter 4.
ý cTj
and
d
T
ý dTj .
Finite data rate control | single-input case
Thus, by the bounds on
T in (4.37), we can apply
c j
Lemma 4.3.6 and obtain
FQ0 (0) ø F X (0) [ B0 (r1 ) (4.44) p p with r1 = ( ÿ (aT + 1))r=(aT ú ÿ ú 1). On the other hand, bounds on cTj in (4.37) and those on that
T in (4.38) allow us to apply Lemma 4.3.7, and it follows
d j
FQj (uj ) ø F X (uj ):
p
(4.45)
From (4.44) and (4.45), the stability of the system follows by Proposition 4.3.4; letting
=
þmax (P )=þmin (P ), we conclude that attractive for the closed-loop system. r0
r1
The size of
T
B0(r0 )
is quadratically
ÿ
Fû~ (u), as in the FQj (uj ) is conservative. An-
is conservative again because the size of
sampled-data controller, which aþects the size of
other source of conservativeness here is the size of
r0 . As can be seen in the proof, the radius of the actual attractive ball can be much smaller.
In the following, we examine two quantizers, one that uses cells of ÿxed length and the other with cells of varying length. Applying the above theorem, we see that the latter type can do better in terms of the sampling period and eÆciency in partitioning.
Uniform quantizer Here, we solve the quantized sampled-data control problem for the commonly used uniform quantizer.
Qÿ1 ÿ 3þ2
Q0 ÿ þ2
Q1
Q2
þ 2
0
Q3 5þ 2
3þ 2
þ
7þ 2
Figure 4.12: Partitioning of uniform quantizers For ù
>
0, the
uniform quantizer
the cells
Qj =
þ
(j
ú
1 2
(see Fig. 4.12), and the output values
ÿ j = qj =
q
Qþ
is deÿned by the index set
)ù; (j +
ö
q0
j
1 2
)ù
õ
= 0 and for
+
1 2aT
õ
j
6= 0
ù:
For this class of quantizers, we have a corollary of the theorem.
S
=
Z,
4.3.
57
Quantized sampled-data control
Corollary 4.3.8 Given r0 > 0, select T > 0 small enough that
T
1 and T > 0 small enough that
cT
R
in log-
arithmic quantizers. The price we pay for a coarser partition is the size of T:
The coarser the partition (i.e., the larger
seen in the bound on that, for
Æ
=
ÿ
qj
for
6
j1
in the logarithmic quantizer. From the design procedure, for each cTj
2
a
= inf j 2S
Tj
ú
Æ
2
j1
here and
0 such that
ÿ
+1
ò :
for
Tj
= 0, and this is
j
= [j1 ;
ö
j1
1],
as in the logarithmic quantizer, but the
sampled-data controller is not deÿned because of
Q fg
aTj
is a decreasing function of
0 as j1 not a stabilizing quantizer. T[j1 ;s1 ]
ú
j
=
= 0. Thus, the
j
diþerence here is the points at which the cells are partitioned: Æ
S
T
= 0. Thus, it follows that
is not a suÆcient condition for asymptotic stability.
60
Chapter 4.
Finite data rate control | single-input case
Q[ÿ1 ÿ1] Q[0 ÿ1] Q0 ;
Q[0 1]Q[1 1]
;
ÿÆ4 ÿÆ ÿ1
;
0
Q[2 1]
;
;
Æ4
1Æ
þ
Æ9
Figure 4.14: Partitioning of the modiÿed logarithmic quantizers
4.4
Finite quantizers
The stability criterion considered in the quantized sampled-data control problem is a global one. This, in turn, resulted in inÿnitely many partition cells in the quantizers since the cells are deÿned to be bounded intervals. In this section, we turn our attention to a more local stability criterion and use a
ÿnite
special class of quantizers, called
fQ g of R is said to be ÿnite if for every r > fQ k g =1 with N 2 N such that [úr; r] ø [ =1 Q k .
Deÿnition 4.4.1 A partition
0 there is a ÿnite subset
Finite quantizers
quantizers.
j
j 2S
N k
j
N k
j
are quantizers deÿned by ÿnite partitions.
For example, uniform quantizers are ÿnite, but logarithmic ones are not. The problem in this section is as follows: Given subset
D of R
n
, and
r0 >
loop system in Fig. 4.7, the ball respect to (P; ÷J ). trajectory
÷
2
(0; 1), a bounded
0, ÿnd a ÿnite quantizer such that, for the closed-
B
D
with 0 (r0 ) is quadratically attractive from Also, ÿnd a ÿnite subset N of such that, for every
S
þ) starting in D, x(t) 2 [ N X D, let E0 be a level set of V
x(
j 2S
for
j
For the given
E0 := fx 2 R
n
:
V
(x)
S
t
ÿ 0.
containing
ý c0 g ;
c0 >
D:
0;
DøE
that is,
c0 is chosen large enough that 0. The design of ÿnite quantizers is similar to that of general quantizers.
1. Given a ÿnite partition
fQ g j
j 2S ,
follow the design procedure of quan-
tizers in Subsection 4.3.3 and obtain 2. Select
S øS N
such that
E0 ø
fq g j
[X N
j 2S
and
fT g j
=0 .
j 2S ;j 6
(4.52)
j;
j 2S
where 3. Set
T
fX g is the state partition in (4.32). j
= minj 2SN ;j 6=0 Tj .
E 0 := fx 2 R : x P x ý r02 þmin (P )g (this is the largest level set inside B0 (r0 ) by Lemma 4.3.2).
4. Take
r0
as in (4.39) and
r
n
0
4.4.
61
Finite quantizers
In contrast to the general quantizer case,
is always positive here, and
T
hence ÿnite quantizers can stabilize the system under only a mild condition.
2
Theorem 4.4.2 Suppose c0 > r0 þmin (P ). Then, for the closed-loop system
in Fig. 4.7 with the sampled-data controller using the ÿnite quantizer
SN
Q
and
just deÿned,
B0 (r0 ) is quadratically attractive from D with respect to (P; ÷J ); (ii) more speciÿcally, E0 and Er0 are invariant sets, and every trajectory starting in E0 goes into Er0 in ÿnite time. Proof Let r = maxfsup Q0 ; ú inf Q0 g ÿ 0. Clearly, Q0 ø [úr; r ]. By bounds on cTj in (4.37) and on dTj in (4.38), we can apply Lemmas 4.3.6 and (i) the ball
4.3.7 and obtain
F 0 (0) ø F X (0) [ B0 (r1 ); F j (uj ) ø F X (uj ); j 2 SN ; j 6= 0; p p with r1 = r ( ÿ (aT + 1)) p =(aT ú ÿ ú 1). Q
(4.53)
Q
Notice that r0 = r1 þmax (P )=þmin (P ). Thus, the assumption implies c0 > By Lemma 4.3.2, it follows that 0 (r1 ) 0 . Moreover, from (4.52), 0 (r1 ) . 0 2SN
B
2
r1 þmax (P ).
B
ø E ø [j
Xj
øE
x(þ) starting at x(0) 2 2 X0 then F (x(t)) 2 F X (0) \ B0 (r0 ) for t 2 [0; T ], or else there exists j 2 SN such that x(0) 2 Xj and F (x(t)) 2 F X (uj ) for t 2 [0; T ]. Now, following an argument similar to Lemma 4.3.3, we have that B0 (r0 ) is quadratically attractive from D for the closed-loop system. Since E0 is a level set covered by [j N Xj , it is an invariant set. Also, it can be easily veriÿed that all trajectories starting in E0 go into any level sets containing B0 (r1 ) in ÿnite time and stay inside; by Lemma 4.3.2, n 2 the psmallest such set is fx 2 R : x P x ý r1 þmax (P )g. Because of r0 = þmax (P )=þmin (P ), this set is the same as Er0 . r1 ÿ
E0 ,
Thus, we have from (4.53) that, for every trajectory if
x(0)
2S
0
Note that, if the assumption has to take either
2
c0 > r0 þmin (P )
D larger or r0 smaller.
does not hold initially, one
On the other hand, the size of
N
may
be larger than necessary because the ÿnite partition covers a ball containing
E0 , while it needs to cover only E0 .
We give corollaries for uniform quantizers and for a ÿnite version of loga-
rithmic quantizers. Here, we take
D
B
= 0 (R0 ) with R0 > 0 so that comparison is easier later in examples. In this case, the level set 0 in (4.52) is given with 2 c0 = R0 þmax (P ) from Lemma 4.3.2. We start with the result for uniform quantizers.
E
Corollary 4.4.3 Given R0 > 0 and r0 > 0 with R0 > r0 , select T small
enough that inequalities in (4.46) hold and set ù
&
N
=
s
R0
þmax (P )
ù
þmin (P )
ú
>
1 2
'
0 as in (4.47). Let
;
(4.54)
62
Chapter 4.
and
S
f ö
1; : : :
ö g S N
. Then, for the closed-loop system in Fig. 4.7 with
the uniform quantizer and
0 (r0 ) is quadratically attractive from
N
= 0;
Finite data rate control | single-input case
;
N , the ball 0 (R0 ) with respect to (P; ÷J ).
B
Proof
Since
R0 > r0 , c0
only that the union of
[X ÿ
=
02 max (P ) > r02 þmin (P ).
X 2S j ,j
N,
and
j
2SN
=
õe
E0 ø B0 p 0 (
+y :
2 ú ú
õ
r
õ
N
covers the level set
(
N
1
)ù; (N +
2
1 2
, and the rest follows
)ù
õ ; y
2M
?
ò ;
min(P )) by Lemma 4.3.2, it suÆces to show
c =þ
j jý The least
þ
Thus we have to show
E0
R þ
from Corollary 4.3.8. Since
j
B
ö
) j jý
0 þmin (P ) c
õ
+
N
õ
1 2
ù:
ÿ
for this is (4.54).
Next we consider the ÿnite logarithmic quantizer. For this, we introduce
õ0 to give more freedom in the partition. 0 > 0, we deÿne the ÿnite logarithmic quantizer as follows. be = Z+ 1; 0; 1 , let the partition cells , j , be
a new parameter õ
S
ü fú
g
Q0 Q[0 0] ú Q[ ü1] ö 0 0 =
j1 ;
=
=(
;
[õ
Æ
j1
Q 2S
Given
Æ >
j
0 0 ); +1 ); Æ
õ ;õ
;õ
1 and
Let the index set
(4.55)
j1
(see Fig. 4.15) and let the output values be
ö ú ö
0 = q[0 0] = 0; õ0 Æ q[ 1 ü1] = 2 a q
;
j ;
1
+Æ+1
õ
(4.56) Æ
j1
(4.57)
:
T
For this class of quantizers, we have the next result.
Q[1 ÿ1] Q[0 ÿ1] ;
;
Q0
Q[0 1] Q[1 1] Q[2 1]
0
þ0 þ0 Æ
ÿþ0 Æ2 ÿþ0 Æ ÿþ0
;
;
Q[3 1]
;
þ0 Æ 2
;
þ0 Æ 3
þ
þ0 Æ 4
Figure 4.15: Partitioning of the ÿnite logarithmic quantizers
0 0
Corollary 4.4.4 Given Æ > 1 and R ; r
with
0
R
> r
enough that the inequalities in (4.51) hold and set
s
0
õ
=
r
p pú ú
min (P ) a 0 þ (P ) max þ
ÿ
T
ÿ (aT
1
+ 1)
0
>
0, select
T
small
4.5.
63
Control over a ÿnite data rate channel
and
ï s
& N
=
log
Æ
R0
þmax (P )
õ0
þmin (P )
!' (4.58)
:
Then, for the closed-loop system in Fig. 4.7 with the ÿnite logarithmic quantizer and
SN
=
attractive from
f0; 1; : : : ; N ú 1g ü fú1; 0; 1g, the ball B0 (r0 ) is quadratically B0 (R0 ) with respect to (P; ÷J ).
This can be shown similarly to Corollary 4.4.3, and hence we skip the proof.
4.5
Control over a ÿnite data rate channel
Signals communicated over a channel with ÿnite data rate must be sampled and quantized, but are also delayed.
When such a channel is used in the
feedback loops of control systems, these eþects on the signals have to be taken into account in the controller design and in determining the necessary data rate of the channel. In this section, we ÿrst characterize the data rate used by the quantized sampled-data systems dealt in the previous section.
Then analyses on the
robustness against time delay in the sampled-data systems are given; based on these analyses, one can determine the bandwidth necessary for the channel. 4.5.1
Data rate for control
With a stabilizing ÿnite quantizer and its index set
SN
rate used for the communication between the quantizer can be ÿnite: Messages are sent every
T
in Fig. 4.7, the data Q
and the hold
H
T
seconds and it is guaranteed that
there are only ÿnitely many diþerent messages to be communicated. Here we give a simple deÿnition of data rate that is necessary for this communication. For a ÿnite set
SN ,
denote the number of elements in the set by
Then, given a stabilizing ÿnite quantizer
Q
with an index set
the data rate required for the transmission of the control input data rate for control with
Q
:=
log2
jSN j
T
bps:
SN , u
jSN j.
we deÿne
by (4.59)
We note that the data rate in this deÿnition is only what is used by the data transmission for the control purpose out of the entire data rate available in the communication system. Clearly it excludes all overheads that may be necessary in practice. The data rate necessary for the communication system is another issue and is discussed further in the next subsection. To clarify the diþerence in the two data rate notions, we call the one deÿned above the data rate for control or in the sense of (4.59). For ÿnite uniform and logarithmic quantizers, the number of cells required
is
jSN j = 2N + 1.
However,
N
is a function of
T,
and it is rather diÆcult to
64
Chapter 4.
see what is the best choice of
T
Finite data rate control | single-input case
for a quantizer to minimize the data rate in
the sense deÿned above. In Section 4.7, we design the uniform and logarithmic quantizers for an example and ÿnd those that require the minimum rates.
4.5.2
Time delay analysis
Communication over a physical channel is always accompanied with a certain amount of delay. To begin with, there is transmission delay, which is determined by the size of the message: To send takes
N=D
N
bits over a
D
bps channel, it
seconds. This is a limitation due to the ÿniteness of the data rate
of the communication system. Moreover, networks are shared by many nodes and, depending on the media access method, time delay due to busy channel can exist; this type is not deterministic and varies over the time. In this subsection, we analyze the sampled-data system studied in Section 4.2 and ÿnd the time delay that the system can tolerate in the feedback loop while it maintains stability. The results will be extended to the case with quantization in the next subsection. Suppose the sampled-data controller in Fig. 4.2 is designed based on Theorem 4.2.5.
There we obtained a quadratically stable system with respect
to (P; ÷J ). This stability is in the continuous-time domain; though the measurement is not updated between the sampling times, the Lyapunov function V
(x) =
0
x Px
is guaranteed to decrease at a certain rate. This fact suggests
that, even if an update arrive late but no later than the next sampling time, the Lyapunov function will decrease from that point and, thus, that the system can maintain a certain level of control. Our goal for this subsection is to ÿnd how large the delay can be.
-
u(t)
x(t)
~) (A; B
?
u ~0
v (t)
0
e
6 He;T
ÿ
delay
vd (t)
ýý
ÿ
ST
õd (t)
ÿ
õ(t)
Figure 4.16: Sampled-data system with time delay Now consider the system in Fig. 4.16, where the channel has a varying time delay smaller than or equal to Let the sampled signal
õd
õd (t)
ý
2 [0; T ).
We call
ý
the
maximum delay time .
be expressed as
(
=
õ(kT )
if
0
otherwise;
t
=
kT , k
2 Z+;
4.5.
and let the delay that occurred while
2 Z+
. So the delayed signal
k
65
Control over a ÿnite data rate channel
(
vd (t)
extended
We deÿne the :
He;T
vd
=
7!
v
:
v (t)
õd (kT )
õd (kT )
if
0
otherwise:
t
=
zeroth order hold
(
0
=
vd (kT
is transmitted be
is
vd
+ ýk )
+ ýk ,
kT
t
if
t
2 2
2
[0; ý ],
2 Z+
;
by
He;T
if
k
ýk
[kT ; kT + ýk ),
k
2 Z+
;
[kT + ýk ; (k + 1)T ),
k
2 Z+
:
This hold is synchronized with the sampler and, at each sampling time, sets the output to zero until the next message arrives (see Fig. 4.17). Notice that if
ýk
= 0 it works the same with the original hold
HT .
þþ ý (t)
v(t)
d
T
0
2T
3T
4T
Figure 4.17: Signals The
time delay analysis problem
T
0
t
and
õd
2T
3T
4T
t
v
is to ÿnd the maximum delay time
ý
2
[0; T ) such that the closed-loop system in Fig. 4.16 is asymptotically stable. In particular, given trajectory k
2 Z+
þ
x(
.
ó
2
(0; 1), we would like to ÿnd
) of the closed-loop system,
V
such that, for every
ý
(x((k + 1)T ))
We begin with a lemma similar to Lemma 4.2.2. Let c :=
and let
ó
2
(0; 1).
Lemma 4.5.1 Suppose for every ý0
(i)
óP
ú
ý ( ý 0)e
e
c T
ü
0
A ü0
Aü0
Pe
(ii) for every initial condition u(t)
ÿ
0;
x(0)
(
2
2R
n
, if the control
0
u ~0 e x(0)
if
t
if
t
2 2
2 X F
(x(kT )) for
÷þmin (J )=þmax (P ),
0
(~ u0 e
x(0))
for
u
is
[0; ý0 ); [ý0 ; T ];
then the trajectory satisÿes F (x(t))
óV
[0; ý ]
0
=
ý
t
2
[ý0 ; T ]:
(4.60)
(4.61)
66
Chapter 4.
Finite data rate control | single-input case
Then the closed-loop system in Fig. 4.16 is asymptotically stable. In particular, V
(x(kT ))
Proof
ý
k
ó V
(x(0)) for
k
2 Z+
.
By (4.12) and the condition in (4.61), for every
[0; ý ] under the control in (4.60), the deÿnition of V
Xþ
x(t)
( ), 0
(x(t); u ~0 e
x(0))
2X
ýú
0
(~ u0 e
x(0))
0
÷x(t) J x(t)
for
for
t
2
x(0)
2
t
2 Rn
and
ý0
2
[ý0 ; T ]. Hence, by
[ý0 ; T ]:
So, by the comparison lemma (e.g., Lemma 2.5 in [45]), we have V
(x(t))
ý
We now show that for V
t
e
ýc(týü0) V (x(ý0 ))
2
(x(T ))
for
t
[ý0 ; T ] under (4.60)
ý
óV
(x(0)) for
x(0)
2
[ý0 ; T ]:
2 Rn
(4.62)
(4.63)
:
From (4.62), a suÆcient condition for this is
ýc(T ýü0) V (x(ý0 )) ý óV (x(0))
e
for
2 Rn
x(0)
(4.64)
:
Now (4.64)
,
ýc(T ýü0) V (eAü0 x(0)) ý óV (x(0)) n (since u(t) = 0 for t 2 [0; ý0 ]) for x(0) 2 R , eýc(T ýü0) x(0)eA ü0 P eAü0 x(0) ý óx(0) P x(0) for x(0) 2 Rn , e c(T ü0) eA ü0 P eAü0 ý óP: e
0
ý
ý
0
0
Thus, (i) yields (4.63). Therefore, in the closed-loop system, every
x(0)
2
Rn .
This implies
V
(x(t))
!
V
ý !1
(x(kT ))
0 as
t
k
ó V
(x(0)),
k
.
2Z
+,
for
ÿ
We have a few remarks on the condition (i) in the lemma. It is clear that if ó >
eýcT then there exists
ý >
0 such that the condition holds for
On the other hand, the constant
c
is proportional to
÷.
quadratic stability, the larger ÷, the faster the decrease in control is applied. This may imply that with a larger
ý0
2
[0; ý ].
By the deÿnition of ÷
V
when the correct
the period of time
to apply the correct control can be shorter to recover from the increase in V
during the delay time.
We will discuss more on this in the example in
Section 4.7. In the following theorem, we show that for any
ý
satisfying (i) the system is
asymptotically stable with an additional technical condition. In the construction of the original controller, we make use of the bounds on the trajectories in Proposition 4.2.3. Due to the conservativeness in these bounds, it is not diÆcult to show that the condition (ii) in the above lemma is satisÿed for every ý0
2
[0; T ].
4.5.
67
Control over a ÿnite data rate channel
Theorem 4.5.2 If ý > 0 is small enough that
ý
ý
and if for every ý0
þ
1
kAk
ln
öq
ÿcT dT (ÿ
ú 1)
õý
úÿ+1ú1
2
a
T
(4.65)
2 [0; ý ] óP
ú eý ( ý 0) e c T
0
A ü0
ü
Aü0
Pe
ÿ 0;
(4.66)
then the closed-loop system in Fig. 4.16 is asymptotically stable. We must show that, with the original controller and
Proof
(4.65), the condition (ii) in Lemma 4.5.1 holds for all stability of the system follows. Fix that
ý0
x(0)
2
[0; T ]. Given
starting from
x(0)
2
ý
satisfying
[0; T ] so that
2 R , there exist õ~ 2 R and y~ 2 M such 2 [ý0 ; T ], we have a bound on the trajectory n
x(0)
= õe ~ +y ~. Then, for
ý0
t
?
as
ó ó Z t ó At ó As ~ 0 ó kx(t) ú x(0)k = óe x(0) + e B ds u~0 e x(0) ú x(0)óó ü0 óZ t ó ó At ó ó ó As ~ ó ó ó ý e ú I þ kx(0)k + ó e B dsóó þ ju~0 e0 x(0)j ü0 ó ó ó ó Aü0 Z týü0 As 0 ~ dsó þ ju e B ~0 e x(0)j ý cT kx(0)k + óóe ó 0 óZ týü0 ó ó ó Aü0 As ~ 0 ó ý cT kx(0)k + ke k þ ó e B dsó ó þ ju~0 e x(0)j 0
ý c kx(0)k + e 0 d ju~0 e x(0)j ý c (jõ~ j + ky~k) + e 0 d ju~0 e x(0)j: kAkü
T
kAkü
T
F ~ (u) in (4.16),
In the deÿnition of by
û
F~
0
T
0
T
replace
dT
by ekAkü0 dT and denote this
û;ü0 (u). By Proposition 4.2.3, it follows from the bound above that the
trajectory satisÿes
F (x(t))
Note here that
cT
analysis problem is to ÿnd
0 is added to the original system of Fig. 4.7. The
B
such that for the closed-loop system the ball 0 (r0 ) maintains to be attractive (but not necessarily quadratically attractive) from ý
D.
The approach taken here is similar to the one in the previous subsection; the following lemma is analogous to Lemma 4.5.1. Let and let
ó
2 (0; 1).
c
:=
÷þmin (J )=þmax (P )
4.5.
69
Control over a ÿnite data rate channel
Lemma 4.5.3 Suppose the sampled-data system with a ÿnite quantizer has
the following properties:
ú eýc(T ýü0 ) eA ü0 P eAü0 ÿ 0 for every ý0 2 [0; ý ]; there exists r1 ÿ 0 such that if x(0) 2 X0 and if u(t) = 0 for t 2 [0; T ] then F (x(t)) 2 F X (0) [ B0 (r1 ) for t 2 [0; T ]; for j 2 SN , j = 6 0 and for ý0 2 [0; ý ], if x(0) 2 Xj and if the control is
(i)
0
óP
(ii)
(iii)
(
u(t)
then
F (x(t))
=
0
j
u
if
t
if
t
2 [0; ý0 ); 2 [ý0 ; T ];
(4.69)
2 F X (uj ) for t 2 [ý0 ; T ].
B0 (r0 ) is attractive from p D, where r0 = r1 þmax (P )=þmin(P ). In particular, if x(kT ) 2 Xj , j 2 SN , j = 6 0, then V (x((k + 1)T )) ý óV (x(kT )).
Then, for the closed-loop system in Fig. 4.18, the ball
Following a discussion similar to the proof of Lemma 4.5.1,
Sketch of Proof
we obtain from the conditions (i) and (iii) that for every under some
u(t)
j
in (4.69),
2 SN .
V
(x(T ))
ý
óV
x(0)
(x(0)). Here note that
2 Xj , j 6= 0, 2 Xj with
x(T )
x(0) 2 X0 þ) õ 0, V (x(t)) decreases or else x(t) is in B0 (r1 ) for t 2 [0; T ]. The ball B0 (r0 ) is the smallest ball containing a level set with B0 (r1 ) inside by Lemma 4.3.2. Clearly, B0 (r0 ) is an invariant set and moreover trajectories starting in D go in to it in ÿnite time. ÿ
The condition (ii), on the other hand, implies that, for every
under
u(
This lemma is applicable to any ÿnite quantizer. Note that the condition (ii) in the lemma holds automatically with the original design in Subsection 4.4 because when u(t)
= 0 for
t
r1
x(0)
that is obtained in the
2 X0
2 [0; T ] whether or not there is time delay.
the control will be
Thus, the size of the
attractive ball is not an issue to be worried about. In the following, we state our main theorems of the chapter for the uniform and logarithmic quantizers. The procedure of the analysis is to design
2 (0; 1).
a quantizer based on Corollary 4.4.3 or 4.4.4 and then to set
ó
theorems give the conditions on the maximum delay time
We begin with
ý.
The
the uniform quantizer case. Theorem 4.5.4 Given a stabilizing ÿnite uniform quantizer for which the ball
B0 (r0 ) is quadratically attractive from B0 (R0 ) with respect to (P; ÷J ), suppose that
ý >
ý
ý
þ
ö
0 is small enough that 1
kAk
ln
ÿc
T
T (ÿ ú 1)
d
and that, for every
ý0
2 [0; ý ],
q
õý
T (aT ú 1) ú a + a2 ú ÿ + 1 T T 2aT + 1 ýc(T ýü0 ) eA ü0 P eAü0 ÿ 0. Then, óP ú e
a
closed-loop system in Fig. 4.18, the ball
0
for the
B0 (r0 ) is attractive from B0 (R0 ).
70
Chapter 4.
Finite data rate control | single-input case
The proof for this result is straightforward following that for Theorem 4.5.2. We next state the result for the logarithmic quantizer case as a theorem as well. Theorem 4.5.5 Given a stabilizing ÿnite logarithmic quantizer with Æ > 1
for which the ball
B
0 (r0 ) is quadratically attractive from to (P; ÷J ), suppose that ý > 0 is small enough that
ý
ý
1
kAk
ln
þ
ö
ÿcT dT (ÿ
ú 1)
B0 (R0 ) with respect
õý
q ú 1) ú aT + a2T ú ÿ + 1 (aT + 1)Æ + aT ú 1 2aT (aT
2 [0; ý ], óP ú eýc(T ýü0 ) eA ü0 P eAü0 ÿ 0. Then, for the closed-loop system in Fig. 4.18, the ball B0 (r0 ) is attractive from B0 (R0 ). and that, for every
0
ý0
These theorems allow us to control a linear time-invariant plant over a channel with a strictly ÿnite data rate: Every
T
seconds, a message containing
jSN j) bits of data is sent, and the data rate of the channel has to be larger than log2 (jSN j)=ý bps. For the calculation of the actual necessary data rate
log2 (
for the channel, in addition, we have to take account of computation time,
coding/decoding time, the propagation delay, the bits in the overhead of each message, and so on.
4.6
Design of
We have seen that
ÿ
ÿ
plays an important role in the design of the sampled-data
controllers. In particular, large
ÿ
can result in a small sampling period, as
can be observed in the bounds on cT and dT in (4.20) and (4.21), respectively. However, in general, it is diÆcult to obtain a prespeciÿed value of ÿ by changing
Q
and
R.
On the other hand, we have made the critical assumption
In this section, we show that design
ÿ
ÿ
arbitrarily close to 1.
So far, we have also made the assumption that (0; 1). (1
Here we allow
ú ÷)Q.
Q
1.
ÿ >
is always greater than 1 and give a method to Q
= 1=(1
ú ÷)I
with
to be any positive-deÿnite matrix, and set
Qú
In view of (4.6), we introduce a change in coordinates using
÷
2
:=
1=2
Qú
and deÿne :=
Qú
û B
:=
Qú
û := K The correct deÿnition of
ÿ
ÿ (Q; R)
1=2
ý1=2 ;
AQú
~ B;
(4.70)
ý1=2: KQ ú
is therefore (see (4.7))
ÿ (Q; R)
showing its dependence on
1=2
û A
= (1 + ÷)R
and
óó ó
Q
:= (1 + ÷)R R
explicitly. Observe
ó
ý1=2óó2 =
K Qú
kKû k2 ;
1+÷ 1
ú÷
R
óó ó
ó
ý1=2 óó2 :
KQ
(4.71)
4.6.
71
Design of ÿ
For this general Q case, the results in the previous sections hold by replacing û and so on. matrix A with A
2 (0; 1), then
Theorem 4.6.1 If (A; B ) is stabilizable, A is unstable, and ÷
inf
Q >
0;
R >
1+÷
g=
0
1
ú÷
(4.72)
:
By (4.71), the minimization problem (4.72) is equivalent to
Proof
1+÷ 1 Fix
fÿ(Q; R) :
Q >
ú÷
0 and R
ÿ
inf
R >
óó ó
ò
ó
ó2
9Q > 0; R > 0) R óóK Qý1=2óó ý ò
: (
0. Let
ó
ò
ÿ RkK Qý1=2k2.
ý1=2óó2 = R(Rý1 B P Q 0
KQ
=
1=2
ý
þmax (Q
ò
:
By deÿnition,
1=2
ý
P BR
1=2
ý
)(Q
1
ý
P BR
0
1=2
ý
B PQ
ý ò:
1
ý
)
) (4.73)
This is equivalent to P BR
1
ý
0
B P
ý òQ = ò (úA P ú P A + P BR 0
1
ý
0
B P)
(4.74)
by the Riccati equation in (4.5). This in turn is equivalent to 0
+ PA
A P
Here
P
is positive deÿnite, and
inequality (4.75) holds only if
k
ú
ö
1
A
ò
ú
1
õ
P BR
ò
1
ý
ý 0:
0
B P
(4.75)
is an unstable matrix by assumption. Thus,
ÿ 1.
Next we show that for every ò > 1 there exist matrices Q and R such that ý1=2 2 ~ . Find the unique, ò . Take any ò > 1 and positive-deÿnite Q
R KQ
k ý
positive-deÿnite solution
P
of
0
+ PA
A P
ú P BB P + Q~ = 0: 0
(4.76)
Now set Q
~+ =Q
1
ò
ú1
0
P BB P;
R
ò
=
ú1 ò
:
Then 0=
0
A P
+ PA
ú P BB P + Q~ 0
ö
=
A P
+ PA
ú PB
=
A P
0
+ PA
ú P BR
0
1
ú
1
ý
1
õ
ò 0
B P
R
1
ý
+ Q:
0
B P
~ +Q
(4.77)
72
Chapter 4.
Thus,
P , Q,
using this
and R satisfy the Riccati equation (4.5), so R
P.
(4.73). Thus, the inÿmum
Finite data rate control | single-input case
kK Qý1=2k2 is deÿned
Now (4.77) implies (4.75), and this implies ÿrst (4.74) and then 1=2 2 R KQ ò . Since ò > 1 is arbitrary, we conclude that
ò
ý k ý
k
ÿ
equals 1.
ý
ú
In the proof, we saw that, given ò > 1, we can design ÿ ò (1 + ÷)=(1 ÷). ~ is a free parameter, and the control input is one dimensional; thus, P Here, Q can be any LQR solution by (4.76). This gives some freedom in the design, as shown in the numerical example in Section 4.7. The parameter
in
X (u)
÷
2 (0; 1) deÿnes the decay rate of the Lyapunov function
in (4.6): the larger
÷
the faster the decay to the origin. The above
theorem, however, says that faster decay rate can result in larger
ÿ,
which in
turn makes the sampling period smaller. This trade-oþ is discussed further in the numerical example as well.
4.7
Magnetic ball levitation example
We view our results in the previous sections as being theoretical in nature. Nevertheless, it is always interesting to see what practical issues arise when theory is applied to an example. Here, we apply the sampled-data controller to a magnetic ball levitation system depicted in Fig. 4.19. A steel ball of mass
M
is levitated in air by an electromagnetic force generated by the electromagnet. The control objective is to keep the position by controlling the voltage
v.
y
of the ball at an equilibrium
The current in the coil is denoted by i, and the
resistance and inductance of the magnet are R and L, respectively. The force produced by the coil is K i(t)2 =y (t)2 , where K > 0, and g is acceleration due to gravity. We take1 Nm2 /A2 .
M
= 0:068 kg,
R
= 10 ø,
= 0:41 H, and
L
K
= 3:27
ü 10ý5
The dynamics of the system is given by
where x0
g
My
=
Mg
v
=
L
ú
di dt
2
2 y
;
+ Ri;
= 9:8 m/s2 . We take the state as
of the system for the nominal voltage
Ki
x
v0
:= [y
0
y i]
. Then the equilibrium
= 10:0 V is
2 q 3 2 3 u0 K 7:00 ü 10 3 R Mg 6 7 5: x0 = 4 0 5=4 0 u0 1:00 R Now, letting ùx := x ú x0 and ùv := v ú v0 , we have the linearized system ý
ùx = 1 These
Aùx
are values from a real system [63].
+ B ùv;
4.7.
73
Magnetic ball levitation example
i(t)
+
R v (t)
L
ú
y (t)
M
Figure 4.19: Magnetic ball levitation system
where
2 0 q 6 Rg Mg A = 42 v0
1 0
K
0 The eigenvalues of First,
K
and
ÿ
A
are
=
v0
2 3 0 B = 405: 1 L
úR L
f52:9; ú52:9; ú24:2g.
were designed using the method in Section 4.6. We took = 8:18;
ò
and obtained
þ(A)
0
3 7 ú2 Rg 5 ; 0
2 Q = 4
~ Q
1:15 ü 107 2:17 ü 105
2:17 ü 105 4:11 ü 103
ú5:52 ü 104
R
= 0:878;
K
= 1:04 ü 104
ü
These matrices yield
= 0:01 diag(1; 1; 1)
ú1:04 ü 103 196
û
ú49:7
3
ú5:52 ü 104 ú1:04 ü 103 5 ; 265
(4.78)
:
ý1=2k2 = 8:18 as expected.
k
R KQ
These values were ar-
rived at by some trial-and-error with a view to a reasonable transient response. The eigenvalues of
A
+ BK are
þ(A
For
÷
+ BK ) =
= 0:100, we obtained
ÿ
fú24:2; ú52:9; ú67:6g:
= 10:0. We ÿx
K
and
÷
throughout most of
this section. In the following, we design controllers for the sampled-data and quantized sampled-data setups. Sampled-data control:
For the sampled-data controller in Section 4.2, we
calculated the maximum sampling period
Tmax
satisfying bounds on
cT
and
74
Chapter 4.
ô
Finite data rate control | single-input case
8 7 ô
=
6
ý ~0 õ ~) û (õ; u
f
ô
=
+ ~0 õ ~) û (õ; u
f
5 4 3 2
=
ô
0
g (õ; u ~ õ ~)
1 0 −2
4
3
2
1
0
−1
õ ~
Figure 4.20: Functions
(
max =
T
ý + û~ , fû~ , and
f
ú 1) 1: 3 76 ü 10ý3 3 57 ü 10ý3
~0 = (ÿ T for the two cases u
d
õ
g
for õ ~ =1
=ÿ;
: :
for u ~ 0 = (ÿ for u ~0 = 1:
There is a slight advantage in using u ~0 = (ÿ
ú 1)
ú 1)
=ÿ;
=ÿ ,
as expected.
For comparison, we also calculated the sampling period for the stability of
the system in the discrete-time domain. We discretized the plant (A; B ) via step-invariant transformation and obtained (Ad ; Bd ). The maximum sampling periods for all eigenvalues of
(
max =
T
Note here that
0
u
d + Bd u0 K to be less than 1 were
A
ü 10ý2 1 78 ü 10ý2 2:03 :
for u ~ 0 = (ÿ for u ~0 = 1:
ú 1)
=ÿ;
= (1 + ÷)~ u0 as in Subsection 4.2.2. These are roughly 5
times larger than the sampling period required for quadratic stability.
We
stress that the discrete-time stability does not have a guaranteed decay rate. From this point in the example, we ÿx u ~0 = (ÿ
controller with
T
= 3:76
ü 10ý3.
Functions
ú 1)
ý + û~ , fû~ ,
f
=ÿ .
and
g
We designed a for õ ~ = 1 are
graphed in Fig. 4.20. Note that the functions are tangent to each other. The time response plots of ùx, ùi, and ùv are given in Fig. 4.21 for the initial condition 0
ùx(0) = [ùy (0) ùy (0) ùi(0)] = [0:700
ü 10
3
ý
0
0 0]
:
(4.79)
This is typical [63]. The solid lines are for the sampled-data system and the dashed lines are for the full state feedback case (u =
K x).
We see that in
4.7.
75
Magnetic ball levitation example
−3
∆y
1
x 10
0.5
0 0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
∆i
0.2
0.1
0 0
∆v
10
5
0 0
time t Figure 4.21: Time responses for the sampled-data system
both cases
y
goes to the equilibrium in about 0:25 seconds. In the transient
response, the position for the sampled-data system decreases slightly faster than that for the continuous-time system.
Eþect of ÿ in the design of þ:
The parameter
÷
2
(0; 1) determines
the decay rate of the Lyapunov function for the sampled-data controllers and enters into the design in (4.6). The result in Section 4.6 says that the larger ÷,
the larger
ÿ;
in other words, a faster decay rate results in larger
further implies a smaller sampling period, by the bound on
T
c
ÿ,
which
as in (4.20).
We conÿrmed this observation using the current example as follows: The gain
K
was ÿxed as above, and we varied
maximum sampling period
Tmax .
÷
2 (0; 1) and calculated ÿ
and the
The results are shown in Fig. 4.22.
We have to mention, however, that it is not clear how the choice of aþect the decay rate.
In the simulations above, we used
÷
÷
might
= 0:1, which is
fairly small. Nevertheless, in the time response plots, the trajectory for the full state feedback case and that for the sampled-data control case in Fig. 4.21 look almost the same, and hence there is little diþerence in the decay rate.
The maximum delay time analysis:
In Subsection 4.5.2, an analysis
on the robustness of the sampled-data system against varying time delay was given. The maximum delay time ý was calculated for the case ÷ = 0:1 and ó = 0:999 for T (0; 3:76 10 3 ); this is shown in Fig. 4.23. Here the solid
ü ý
2
line is the bound on
ý
calculated from the condition in (4.66) in Theorem 4.5.2 10 4
and the dashed line is the one from (4.65). The largest ý is ý = 4:70 obtained at T = 3:76 10 3 , which is eþectively the maximum T .
ü ý
ü ý
76
Chapter 4.
Finite data rate control | single-input case
200
σ
150 100 50 0 0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
−3
5
x 10
4
T
max
3 2 1 0 0
ε
Figure 4.22:
ÿ
and the maximum
As discussed in Subsection 4.5.2, size of the
÷
ý.
Here the largest
= 0:1 case, these
that at
÷
ý
ý
÷
T
versus
÷
is a parameter that has an eþect on the
is plotted in Fig. 4.24 for several
÷
2 (0; 1); as in
were obtained at around the maximum
= 0:4 the maximum delay time is about 100
T.
We see
üs.
In Subsection 2.2.3, we showed the delay time, or the transaction time, for the communication of one data message using a practical control networks protocol. According to the formula in (2.1), with the delay time of 100
üs,
about 12 bytes of data can be transmitted periodically over a 2.5 M bps channel using the FIP protocol. Hence, we can conclude that the bounds obtained above are fairly realistic for this example. Quantized sampled-data control:
We design the quantized sampled-data
controllers in Section 4.3. Here, in particular, we compare the characteristics of the uniform and logarithmic quantizers. The maximum sampling periods for these quantizers are
(
Tmax
=
ü 10ý3 3:76 ü 10ý3 2:43
for the uniform quantizer; for the logarithmic quantizer:
We have seen that, in the design of the uniform quantizer T
Qþ ,
parameters
ý Tmax and r0 > 0 can be chosen independently; for a ÿxed T , ù in (4.47)
is a linear function of
r0 .
The controller with a logarithmic quantizer too
has two parameters,
and
T,
Æ
but, in contrast, there is a trade-oþ between
them as seen in (4.51). For each Fig. 4.25. As Tmax
= 3:76
Æ
Æ >
1, there is
approaches 1 from above, 10 3 as given above.
ü ý
T
Tmax , and this is shown in grows larger. Note that for Æ = 1,
4.7.
77
Magnetic ball levitation example
−5
5
x 10
maximum delay τ
4
3
2
1
0 0
0.5
1
1.5
2
2.5
3
3.5
4 −3
x 10
T
Figure 4.23: Time delay analysis:
versus
ý
Sampled-data control with ÿnite quantizers:
T
for
÷
= 0:1
As discussed in Sec-
tion 4.4, when ÿnite quantizers are used, the data size of the message sent from the sensor to the actuator is ÿnite. Moreover, the data rate in the sense of (4.59) for this communication is ÿnite. Here, we are interested in ÿnding the minimum data rate required for the uniform and logarithmic quantizers in our framework. Following the construction in Corollaries 4.4.3 and 4.4.4, we ÿx and
r0
= 3. First, for the uniform quantizer, we computed ù,
the data rate in the sense in (4.59) for
T < Tmax
are given in Fig. 4.26. We see that ù and T,
N
= 2:43
ü 10ý3.
Next, we computed Æ ,
N,
T
=
R0 = 10 and then
The plots
are almost linear functions of
and as a result the data rate decreases exponentially as
minimum data rate is about 7780 bps at
N,
T
increases. The
Tmax .
and the data rate for control using the logarithmic
quantizer in a similar fashion. The results are shown in Fig. 4.27. (The plot on the top is the same as Fig. 4.25 with opposite coordinates.) Here, the growth of
N
is exponential, but the rate is slow. Hence, for sampling periods not too
close to
Tmax , the data rate is a decreasing function of T , and there is a clear advantage over the uniform quantizer. The minimum data rate for this system is about 1810 bps at T = 3:07 10 3 .
ü
ý
We designed the two quantizers with the parameters to achieve the minimum data rate. For the uniform quantizer, we obtained ù = 0:364 and N = 2:38 105 and for the logarithmic quantizer õ0 = 1:86, Æ = 1:79, and
ü
N
= 24. We computed the time responses for these two systems with the
same initial condition ùx(0) as in (4.79). These are shown in Fig. 4.28 for the system with the uniform quantizer and in Fig. 4.29 for the one with the loga-
78
Chapter 4.
Finite data rate control | single-input case
−4
1
x 10
0.8
τ
0.6
0.4
0.2
0 0
0.1
0.2
0.3
0.4
0.5
0.6
0.8
0.7
0.9
1
ε
Figure 4.24: Time delay analysis: the largest
ý
versus
÷
rithmic quantizer. We have chosen
r0 = 3 in the design so that the switching in the control input ùv is visible in the steady state. As a result, in both plots, the position ùy only goes down to about 0:2 10 3 and seems to reach
ü
ý
a steady state there. To achieve a steady state closer to 0, we can redesign with a smaller r0 . The number of cells used in these simulations are 7 and 6 for the uniform case and the logarithmic case, respectively. The diþerence may seem small, given that the logarithmic quantizer uses a more eÆcient partition scheme. This is however because the initial state is not so far away from cell redesigning the controllers with a smaller
r0
X
0 . Again, will likely make the diþerence in
the number of cells to be used larger. The conservativeness in the design can be found as follows. The norm of initial condition x(0) in (4.79) is about 2:25 in the state space corresponding to û B û ). This means that the state is already in the prespeciÿed ball of radius (A; r0 = 3. In the steady state, this norm goes down to about 0.193 and 0.130 for the responses in Figs. 4.28 and 4.29, respectively. Thus, trajectories go
into a ball much smaller than the one prespeciÿed by r0 . This phenomenon is expected since r0 is a rather large estimate.
Finite data rate control:
Finally, we carried out the time delay analysis
on the sampled-data system with ÿnite quantizers based on the results in Subsection 4.5.3. We used
ó = 0:999. For the uniform quantizer that achieved the minimum data rate, the maximum delay time ý was 2:92 10 5, while for the logarithmic quantizer it was ý = 3:68 10 5 . So both are in the order of
10
üs.
ü ý
ü ý
These values were calculated using plots similar to Fig. 4.23.
4.7.
79
Magnetic ball levitation example
−3
4
x 10
3.5
Sampling period T
3
2.5
2
1.5
1
0.5
0 0
10
5
30
25
20
15
δ
Figure 4.25: Trade-oþ between
Æ
and
T
It was observed above in the analysis for the sampled-data systems without quantizers that
÷
serves as a parameter that has inýuence on
ý,
as seen in
Fig. 4.24. This is also true for the case with the quantizers. For example, for ÷
= 0:4, the sets of parameters that achieved the minimum data rate were for
the uniform quantizer T
= 1:88
ü 10ý3,
2:55 ü 10ý1 , N
ü
data rate for control = 1:01 104 bps, ù = 5 5 = 2:76 10 , and ý = 6:02 10
ü
ü
ý
and for the logarithmic quantizer T N
ü 10ý3, data rate for control = 2 33 ü 103 bps, = 24, and = 7 72 ü 10ý5 .
= 2:41
So clearly
:
ý
ý
Æ
= 1:76,
:
can be increased by changing ÷.
Now we look at the logarithmic quantizer case here. of cells necessary is
jSN j
Since the number
= 2N + 1 = 49, each message to be sent over the
channel contains less than 1 byte of data. In Subsection 2.2.3, we presented the expected delay time for the communication system realized by the FIP protocol with a 2.5 M bps channel. This is about 63
üs
for 1 byte of data,
according to the formula in (2.1). Therefore, it is clear that the maximum delay time calculated for the logarithmic quantizer above is suÆciently large using this communication system with a strictly ÿnite data rate.
Chapter 4.
80
Finite data rate control | single-input case
0.8
∆
0.6 0.4 0.2 2.5
0 5 x 10
0.5
0 4 x 10
0.5
0
0.5
1
1.5
2
2.5 x 10
3
N
2 1.5
Data rate [bps]
1 10
1
1.5
2
2.5 x 10
3
5
0
1
1.5
2
2.5 x 10
Sampling period T [s]
3
Figure 4.26: Data rate for control for the uniform quantizer
30
δ
20 10
0
0
0.5
1
1.5
2
2.5
3
3.5
4 x 10
150
3
N
100
50
Data rate [bps]
0
0
0.5
1
1.5
2
2.5
3
3.5
4 x 10
5000
3
4000 3000
2000 1000
0
0.5
1
1.5
2
2.5
Sampling period T [s]
3
3.5
4 x 10
3
Figure 4.27: Data rate for control using the logarithmic quantizer
Magnetic ball levitation example
81
−3
∆y
1
x 10
0.5
0 0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
∆i
0.2
0.1
0 0
8
∆v
6 4 2 0 −2 0
time t
Figure 4.28: Responses for the system with a uniform quantizer
−3
∆y
1
x 10
0.5
0 0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
∆i
0.2
0.1
0 0
8 6
∆v
4.7.
4 2 0 −2 0
time t Figure 4.29: Responses for the system with a logarithmic quantizer
Chapter 5
Towards data rate reduction In this chapter, we show a few variations of the quantized sampled-data system in Fig. 4.7 that can potentially decrease the data rate for control. The deÿnition of data rate in (4.59) suggests two ways: (i) by decreasing the number of bits, or the number of partition cells, and (ii) by increasing the time between samplings.
In the following sections, we propose methods for time-varying
quantization and nonuniform sampling in our system. Combinations of these methods are possible, but can be rather complicated and are not attempted.
5.1
Time-varying quantization
One way to reduce the data rate is to decrease the number of bits, that is, to use fewer cells in the quantizer. This is possible by introducing time-varying quantizers that use cells to cover only the necessary region in the state space at the time. Here, we employ the technique proposed by Brockett and Liberzon [12, 52]. We show that a ÿnite quantizer with a sampling period
T
can
be modiÿed to use only a few cells along with additional data of 1 bit for stabilization. Further, this method has the feature that the states go to the origin, not only into a small ball. The price to be paid is, however, the decay rate; it is not quadratic anymore. Although this approach is applicable to any ÿnite quantizers, we use the ÿnite logarithmic quantizer in the following construction.
5.1.1
Problem formulation
Consider the sampled-data system in Fig. 5.1 with the ÿnite logarithmic quantizer deÿned in Subsection 4.4. Recall that the parameters that determine the
H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 83−116, 2002. Springer-Verlag Berlin Heidelberg 2002
84
Chapter 5.
-
u(t)
Towards data rate reduction
x(t)
~) (A; B
?
u ~0
0
e
6 HT
ÿ
QÆ;û0 (k)
ÿ
ST
ÿ
Figure 5.1: Quantized sampled-data system
Q
Æ;þ0 (k1 )
Q
Æ;þ0 (k2 )
Figure 5.2: The zoom-in feature (k1
< k2 )
quantizer are Æ >
Given
õ0
and
N,
instants
t
=
T >
0;
õ0 >
0; and
N
2 N:
a ÿnite logarithmic partition is deÿned as in (4.55).
see that, for a ÿxed
Q
the union of
1;
j, j
kT , k
N,
the size of
õ0
2 S , covers. Here, we allow õ0 to change at sampling 2 Z+, and thus denote õ0 (k). We call õ0 (k) the state of N
the quantizer, and, moreover, we denote the quantizer by dependence on
We
determines the size of the region that
õ0 (k )
explicit.
QÆ;û0 (k) ,
making its
The key idea behind the time-varying quantizer lies in the zoom-in feature. For any ÿxed
õ0 ,
the closed-loop system with a ÿnite quantizer is stable with
some attractive ball. smaller
Once the trajectory is inside the ball, by choosing a
appropriately, it can go into a smaller ball and so on, approaching
õ0
the origin. In Fig. 5.2, we show how the logarithmic partition of the quantizer zooms in to have ÿner cells around the origin. The time-varying quantizer problem is stated as follows: Given a bounded set
DøR
n
, ÿnd a switching law for the state
nite quantizer starting in
QÆ;û0 (k)
þ)
õ0 (
of the time-varying ÿ-
such that, for the closed-loop system, every trajectory
D eventually goes to the origin.
Time-varying quantizers were ÿrst proposed in [12] for stabilization problems of both continuous-time and discrete-time systems. The continuous-time problem considered there is somewhat diþerent from ours. First, the quantized signal is a continuous-time one, and thus chattering is allowed to occur. This
5.1.
85
Time-varying quantization
setup necessitates solutions in the sense of Filippov [85]. From our viewpoint, this is problematic since a bound on the necessary data rate cannot be derived. Second, asymptotic stability can be achieved by having a zoom-out mode in the quantizer; if the initial state is not within the range of the quantizer, it enlarges the range until the state is captured and then goes into the zoom-in mode. In practice, a local stability result is suÆcient, and hence we leave this zoom-out feature out of the system. 5.1.2
A switching law
In this section, we show how the parameter õ0 of the ÿnite logarithmic quantizer (see (4.55)) changes the characteristics of the closed-loop system and then give a stabilizing switching law for
õ0 . We construct the controller based on the ÿnite logarithmic quantizer in
Corollary 4.4.4. Parameters
T
tizer is designed for a given
õ0 .
1. Fix
÷
2. Take
2 (0; 1). Æ >
T
c
Select matrix
1 and
1 ÿ
ÿ
+1
ÿcT T ý ÿú1
d
N
Q
and
2N
N,
Note that 4. Given
let
ü >
ÿ >
ò
q T T ú 1) ú a + a2 ú ÿ + 1 T T (aT + 1)Æ + aT ú 1 2a (a
:=
pT ú
a
p
ÿ (a
ÿ
T
1.
ú1
s
+ 1)
þmin (P ) þmax (P )
(5.1) :
:
large enough that N >
For this
0 in (4.5) so that
;
ü0
N
R >
0 small enough that
3. Let
Choose
are ÿrst determined and then the quan-
SN
=
log
1
Æ ü0 :
f0; 1; : : : ; N ú 1g ü fú1; 0; 1g and N ü := Æ ü0 :
(5.2)
(5.3)
1, from (5.2).
fQj gj2SN be the logarithmic partition in (4.55) and f j gj2SN in (4.57) Deÿne the level set E0 (þ) of V by E0 (õ0 ) := fx 2 Rn : V (x) ý c0 (õ0 )g; õ0 > 0;
õ0 > 0, let output values q
where c0 (õ0 ) := (õ0 Æ
N )2 þmin (P ).
86
Chapter 5.
T
Note that the choices of
and
N
Towards data rate reduction
do not depend on
õ0 .
Lemma 5.1.1 For the closed-loop system in Fig. 5.1 with the ÿnite logarith-
E0 (õ0 ) and E0 (õ0 =ü) are invariant sets, and every E0 (õ0 ) goes into E0 (õ0 =ü) in ÿnite time.
mic quantizer just deÿned, trajectory starting in Let
Proof
s r0 (õ0 )
as in (4.39).
=
õ0
Notice here that
þmax (P )
p
pT + 1) T ú ÿú1 ÿ (a
þmin (P ) a r0 (õ0 )
=
õ0 =ü0 .
To apply Theorem 4.4.2,
2 SN , j 6= 0, the bounds on cTj in (4.37) j and dTj in (4.38) are satisÿed; (ii) E0 (õ0 ) ø [j 2SN Xj (õ0 ); and (iii) c0 (õ0 ) > we must show (i) for
=
T
T, j
2
r0 (õ0 ) þmin (P ).
First (i). This is immediate since for a logarithmic partition, the bounds on
Tj
c
and
d
Tj
are all identical and are equal to those in (5.1).
Next (ii). By Lemma 4.3.2 and the deÿnition of c0 (õ0 )
E0 (õ0 ) ø B0
ïs
c0 (õ0 )
!
þmin (P )
=
B0 (õ0 ÆN ):
By the deÿnition of state space cells in (4.32), we also have
[
j2SN Thus, clearly,
Xj (õ0 ) =
ñ
õe
+y :
õ
2 (úõ0 ÆN ; õ0 ÆN ); y 2 M?
E
Xj
ô
:
2 SN
(õ0 ), j . 0 (õ0 ) is contained in the union of cells Finally we show (iii). Since ü > 1, c0 (õ0 ) > c0 (õ0 =ü). On the other hand,
ö
c0 (õ0 =ü)
=
õ0
N Æ
ö õ2
=
ü
õ0
õ2
þmin (P )
þmin (P )
ü0
2
= r0 (õ0 )
þmin (P )
(by deÿnition of c0 (õ0 ))
(by deÿnition of (since
r0 (õ0 )
=
ü
in (5.3))
õ0 =ü0 ).
D = E0 (õ0 ). Thus, for E0 (õ0 ) and E0 (õ0 =ü) are invariant sets, and moreover trajectories starting in E0 (õ0 ) go into E0 (õ0 =ü) in ÿnite time. ÿ Now, the conditions in Theorem 4.4.2 are satisÿed with the closed-loop system,
We now describe the quantizer. Given
switching law
for the state
D, take õ00 > 0 large enough that D ø E0 (õ00 ):
þ) of the time-varying
õ0 (
5.1.
Clearly, such
õ00
always exists. Take
(
õ0 (k
for
87
Time-varying quantization
ÿ
k
+ 1) =
õ0 (0)
x(kT )
and
2E
if
õ0 (k );
otherwise;
0 (õ0 (k )=ü);
(5.4)
1.
D
Theorem 5.1.2 Given
time-varying quantizer converges to the origin.
ÿ
Proof k
õ00
õ0 (k )=ü;
The stability result for the system using
for
=
D
stays in
Denote this time by
E
now follows.
, for the sampled-data system in Fig. 5.1 with the
QÆ;û0 ( )
Let k0 = 0. By (5.4), k0 . However, by
starting in
QÆ;û0 ( )
deÿned above, every trajectory starting in
õ0 (k )
DøE
0 (õ(k0 ))
=
õ0 (k0 )
as long as
2E
x(kT ) =
D
0 (õ0 (k0 )=ü)
0 (õ0 (k0 )) and Lemma 5.1.1, a trajectory
and goes into
E
0 (õ0 (k0 )=ü)
in ÿnite time.
ÿ
= t1 and the next sampling instant by t = k1 T t1 . Then, by (5.4), õ0 (k ) = õ0 (k1 ) = õ0 (k0 )=ü as long as x(kT ) = 0 (õ0 (k1 )=ü) for
ÿ
k
k1 .
t
Applying Lemma 5.1.1 again, the trajectory in
this set and enters
E
0 (õ0 (k1 )=ü) in ÿnite time.
E
2E
0 (õ0 (k1 ))
Repeating this argument, we obtain an increasing sequence that x(t)
Since
ü >
2E
0 (õ0 (kl ))
for
t
2
[kl T ; kl+1 T ),
l
f g 2Z+ kl
such
2Z
+:
=
õ0 (k0 ) l
ü
!
0 as
l
!1
:
ÿ
Therefore, the trajectory goes to the origin. The advantage of using this quantizer is that
D
l
1, õ0 (kl )
of
stays in
N
is independent of the size
and thus can be chosen to be small. Moreover, asymptotic stability can
be achieved with a ÿnite data rate. The drawback is that the stability is not quadratic any more and thus that the decay rate for the trajectory can be appreciably slower than that using the original (time-invariant) quantizer. In particular, if at some set
E
0 (õ0 (k )=ü),
t
2
[kT ; (k + 1)T ) the trajectory is in the smaller level
the Lyapunov function may not be decreasing for some time.
Moreover, if the trajectory is already in a much smaller level set for some
l
ÿ
the period of
1 at lT
t
=
kT ,
E
l
0 (õ0 (k )=ü
)
the Lyapunov function may continue to increase for
in the worst case. For this reason, using
ü
close to 1 may not
be eÆcient for the decay rate. How does the time-varying ÿnite quantizer change the necessary data rate for the closed-loop system? The data to be sent from the sensor side to the actuator side at each sampling instant are
û
the index of the cell in which
x(kT )
is,
88
Chapter 5.
û
one of the two states
{ zoom in : if x(kT ) 2 E0 (õ0 (k )=ü), { hold : otherwise.
The parameter case
Towards data rate reduction
N
N
must satisfy (5.2), but can be as small as
N
jS j = 2
N
= 1, in which
+ 1 = 3 and this requires 2 bits per message. The additional
data (zoom in/hold) requires only one bit. Therefore, data rate using
5.1.3
QÆ;û0 (k)
=
log2
jS j + 1 = log (2 2
N
N
T
+ 1) + 1
bps:
T
Magnetic ball levitation example continued
The use of a time-varying quantizer is expected to reduce the data rate. It is not clear from the theory how eþective this quantizer can be compared to the time-invariant one. For comparison, we computed the necessary data rate for the magnetic levitation system in Section 4.7. As in the design of other controllers there, we used the same
K
in (4.78).
The time-varying quantizer is based on the logarithmic quantizer, so the maximum sampling period is Tmax
The minimum
N
in (5.2),
ü
= 3:76
ü 10ý
3
:
in (5.3), and the data rate for
T
plotted in Fig. 5.3.
2 (0
; Tmax )
are
Minimum N
15 10
5 0
0
0.5
1
1.5
2
2.5
3
3.5
4 x 10
15
3
µ
10
5
Data rate [bps]
0
0
0.5
1
1.5
2
2.5
3
3.5
4 x 10
5000
3
4000
3000 2000
1000
0
0.5
1
1.5
2
2.5
3
3.5
4 x 10
Sampling period T [s]
3
Figure 5.3: Data rate for control using the time-varying quantizer We see that
N
= 1 is allowed for
T
ý 1 08 ü 10ý , and there is a local ' 2400 bps at = 1 08 ü 10ý .
minimum data rate (log2 (2N + 1) + 1)=T
:
3
T
:
3
5.2.
89
Nonuniform sampling
The global minimum is about 1460 bps and is achieved at with
N
T
= 2:86
ü 10ý3
= 4. Since the minimum data rate using the logarithmic quantizer
was about 1810 bps, we see that it is indeed advantageous to use the timevarying quantizer. We note that this reduction is possible at the expense of the decay rate of the trajectories; in particular,
ü
closer to 1 can result in a
slower decay rate. We see in the plot that local minima in the data rate are always accompanied by
ü
' 1.
Thus, there is a certain trade-oþ between the
data rate and the decay rate.
5.2
Nonuniform sampling
As described in Chapter 2, many control networks protocols employ periodic sampling for transmission of control signals. viewpoint, it may be worthwhile to look into
However, from the data rate
nonuniform
sampling (e.g., [75])
so that messages are transmitted over the network only when necessary. This can be advantageous on a shared network. If the maximum data rate required for the system is known, the data rate while the system is in operation varies over time and can be less than the bound. A ÿnite quantizer output values determined by
Q
f j gj2SN . q
T
= min
is deÿned by an index set
SN , cells fQj gj2SN , and
In the design procedure, the sampling time
j2SN Tj ,
where
j
T
(4.37) and (4.38). By Lemma 4.3.7, this implies that trajectories in
Xj at
t
= 0 remain in
X ( j ) for at least 2 [0 j ]. u
T
is
is chosen to satisfy the bounds in t
x(t)
starting
;T
STj as follows. k and that x(tk ) is in Xjk . Then the next sampling time is given by t = tk + Tjk . We can deÿne a nonuniform zeroth-order hold HTj , synchronized with STj . Replacing the uniform sampler
Thus, we are motivated to deÿne a nonuniform sampler
Suppose that a sampling takes place at
t
=
t
and hold with these, we obtain a system with the same stability characteristics that the original system has. As a result, the data rate changes over time and is at most equal to what the original system needs: Data rate using The use of
j
T
instead of
T
Tj
S
and
H
Tj
ý log2 jSN j T
:
in sampling can be eþective especially if
j
T
> T
for many j ; this is the case for the uniform quantizers as we see in an example shortly. For other quantizers, such as the logarithmic quantizers, where T = for all
j,
T
j
we propose another nonuniform sampling scheme called dwell-time
switching control in the next section.
Magnetic ball levitation example continued For the magnetic ball levitation system using the uniform quantizer, we calculated the sampling time
j
T
for the nonuniform sampler and hold.
The design of the uniform quantizer is based on Corollary 4.3.8. In its proof, the bounds on
Tj
c
and
Tj
d
are given in (4.48). The calculation is based
90
Chapter 5.
ÿ
on these bounds for j that T1 = 2:43
ü
ý3
Towards data rate reduction
1; the result is shown in Fig. 5.4.
It is observed
is the minimum and thus T = T1 and that Tj quickly
10
increases to about 3:75
ü
ý3.
10
Notice that the uniform quantizer has partition cells of the same size. However, with this type of quantizer, when the state is far from the origin, the sampling can be slower.
This characteristic can reduce the data rate during
the operation of the system. This is in contrast with the logarithmic quantizer case, where the sampler must be uniform but the data rate is saved through the nonuniform partitioning. −3
4
x 10
T
j
3.5
3
2.5
2 0
20
40
60
80
100
j Figure 5.4: Sampling times Tj , j
5.3
ÿ
1, for the uniform quantizer
Dwell-time switching control
Systems whose control is based on quantized measurements have a hybrid nature: They have continuous dynamics accompanied by discrete signals taking values in a countable set. This aspect is not well represented in the quantized sampled-data control problem, for the setup is more classical with a ÿxed sampling period; messages are transmitted from the sensor side to the actuator side periodically, whether or not the message contains a quantized value diþerent from the previous one.
This may be ineÆcient from the data rate viewpoint
because some messages are likely not to have any new information. In contrast, many switching control methodologies in hybrid systems employ schemes where switchings occur only when the tra jectory of the plant enters a new state partition cell. Therefore, there is no ÿxed sampling period. Instead, the controller, or the switching logic, attentively observes the track of the tra jectory.
5.3.
91
Dwell-time switching control
In this section, we are interested in extending the results of the quantized sampled-data control problem from a hybrid systems viewpoint and study a stabilization method for a continuous-time plant via switching control. One of the major obstacles in switching control of a continuous-time system is that chattering, or arbitrarily fast switching, may occur. This is not only physically harmful to the actuators and/or the plant, but also impossible to realize with a ÿnite data rate channel. In addition, it presents mathematical diÆculties concerning existence of solutions of diþerential equations. We employ the switching control strategy based on state partitioning. The state space is partitioned into cells, and each cell has a corresponding constant control input. Of the several methods to prevent chattering [59, 86], we here employ a dwell-time switching controller, the protocol being that switching cannot occur until a speciÿed dwell time has elapsed. Some papers treating continuous-time switching controllers, e.g., [12, 52, 78], handle the potential of chattering by permitting solutions in the sense of Filippov [85]. By contrast, our switching with dwell time prevents chattering, and control signals are piecewise constant. Although the development can be done based on any of the stabilizing quantizers, here we use the logarithmic quantizer, which has some optimal properties, as seen in the previous chapter. We note that the quantizer itself does not appear explicitly; it is rather the state-space partition induced by the quantizer that we use. For this reason, we call the state partition cells simply cells in this section.
5.3.1
Dwell-time switched systems with logarithmic partitions
In Section 4.1, we deÿned a class of hybrid systems called dwell-time switched systems. In this section, we give a subclass of such systems where switching logics are equipped with a logarithmic partition.
R . Let e 2 R be a M denote the 1-dimensional subspace generated by
We deÿne a logarithmic partition of the state space vector of unit length, let e,
and let
Æ >
n
n
1. The partition is
X00 := M ; X := fx 2 R ?
ü
n
j
The picture is Fig. 5.5 for
n
:
0
e x
2 ö[Æ
= 2. Thus,
j
;Æ
X00
j
+1 )g;
is an (n
j
2 Z:
ú 1)-dimensional sub-
space, so it has empty interior, while the other cells, being bounded by parallel hyperplanes, have nonempty interiors. For simplicity, we rename the cells to
X
j; j
2 S , in the following manner:
Let
X[0 0] := X00 ; X[ 1 1] := X 1 ;
S := Z ü fú1; 0; 1g and
;
ü
j ;ü
In particular, we denote
j
X0 := X[0 0]. ;
[j1 ;
ö1] 2 S :
92
Chapter 5.
Towards data rate reduction
x2
Xÿ1+ X0+ X1ÿ
M
X1+
X0ÿ Xÿ1ÿ x1
Figure 5.5: Logarithmic partition of
R2
Now consider the nonlinear switching system
where
x(t)
x(t)
2R
n
=
fi(t) (x(t));
2S
is the state of the system, and i(t)
switching logic with a ÿxed dwell time Lipschitz continuous for each The switching logic (how
times as follows: Set
t0
j
2S
x(t)
T >
(5.5) is the state of the
0. It is assumed that
and that
f0 (0)
fj
is globally
= 0.
generates i(t)) is deÿned in terms of switching
= 0 and let j0 denote the index of the cell
x(t0 )
is in.
The equation x(t)
has a unique solution, say
=
fj0 (x(t))
ÿ , on the interval [t ; 0
x
1
). If
x(t0 )
= 0, then no
switching will ever occur; this is the trivial case. Otherwise, the ÿrst switching time is t1
( =
þ
t0
+ T;
fÿ
min
t
t0
+T :
x
ÿ (t)
if j0 = 0 and
2 X g =
int(
j0 )
;
6
x(t0 )
if j0 = 0.
Letting cl( ) denote closure, we set i(t)
= j0 for
2
[t0 ; t1 );
8 > : ( ) 2 cl(X 1 ) and 1 6= t
;
j1
j
x t1
j
j1
j0 ;
if
x(t1 )
2X
otherwise:
0;
6
= 0;
5.3.
93
Dwell-time switching control
This rather complicated deÿnition is to avoid the situation where trajectory is on the hyperplane
X
at
x
= t0 + T and will leave it the next moment; the cell that the state is entering cannot be deÿned. 0
t
Proceeding, the equation x(t)
has a unique solution, say
=
fj1 (x(t))
ÿ , on the interval [t ; 1
x
1
). Again, if
x(t1 )
= 0, then
no further switching will occur. Otherwise, the second switching time t2 is
(
t1
=
t2
+ T;
fÿ
min
t
t1
+T
Set i(t)
= j1 for
2
if j1 = 0 and
2 X g
: xÿ (t)
=
int( j1 )
;
6
[t1 ; t2 );
;
j2
x t2
þ
Continuing, we obtain i( ) on [0; jectory
þ
x(
j2
1
if
x(t2 )
j1 ;
= 0 then
2X
0;
f k gk2Z 2 1 + ÿ
) and switching times
x(0)
= 0;
otherwise:
) is uniquely deÿned satisfying (5.5) for all
switching times. It is clear that if
6
if j1 = 0:
8 > : ( ) 2 cl(Xj2 ) and j2 6= t
x(t1 )
x(t)
t
t
. The tra-
[t0 ;
) except at the
= 0 for all
t
0.
We call this class of systems dwell-time switched systems with a logarithmic partition. This class of systems falls into the dwell-time switched systems, and
hence the stability deÿnitions in Section 4.1 can be applied. 5.3.2
Problem formulation
The problem in this section is to design a dwell-time switched system with a logarithmic partition that stabilizes a given linear time-invariant plant. Consider the feedback system in Fig. 5.6. The pair (A; B ) represents the plant
where
x(t)
2 Rn
x(t)
=
is the state and
Ax(t) u(t)
+ Bu(t);
2R
(5.6)
is the control input; single input
is assumed as in the previous chapter. To avoid triviality, it is assumed that A
6
= 0 and
Besides of
A
is an unstable matrix, but that (A; B ) is stabilizable.
The controller is termed a dwell-time controller with dwell time
Rn
T,
the controller is deÿned in terms of a logarithmic partition
and a corresponding set
U f j gj2S =
u
u(t)
=
it
u ( );
so the plant equation becomes x(t)
=
Ax(t)
T
>
of control values. The decoder
in Fig. 5.6 is governed by the law
+ Bui(t) :
0.
fXj gj2S T
D
94
Chapter 5.
u(t)
-
Towards data rate reduction
x(t)
(A; B )
T
D
ÿ
T
C
i(t)
Decoder
ÿ
Coder
Figure 5.6: Dwell-time switching control scheme
This has the form (5.5), and therefore the switching logic of the preceding section deÿnes the coder
T mapping
C
x
to i.
We can now state the dwell-time switching control problem with the logarithmic partition: Find a dwell time
T >
Rn , and corresponding control values U
0, a logarithmic partition =
fuj gj2S
fXj gj2S
of
such that the closed-loop
dwell-time switched system in Fig. 5.6 is asymptotically stable. One historical solution to this problem is time-optimal control [13]. One could specify a control bound, say
juj ý 1, and then compute the time-optimal
bang-bang control to steer from an initial state to the origin. The control would switch a ÿnite number of times, with the state reaching the origin in ÿnite time (after which, the control is set to
u
= 0). One objection to this is a practical
one: The switching surfaces can be complicated, requiring great computational eþort to determine when the control should switch. By contrast, the switching surfaces in our setup are hyperplanes. Our solution is based on the results for the quantized sampled-data control systems. The system in Fig. 5.6 can be converted to a system similar to a sampled-data one with a sampler, a hold, and a logarithmic quantizer. Then, a suÆcient condition for stabilization, expressed in the two-dimensional
õ-ô
space, follows. Since this condition is the same as that in Lemma 4.3.3, the remaining construction follows directly from the main result for the logarithmic quantizer. 5.3.3
Hybrid automata representation
The dwell-time switched system with a logarithmic partition, deÿned in Section 5.3.1, is a fairly complicated example of a hybrid system, having real-time, or continuous, dynamics and discrete events. The objective of this section is to place this class of systems into the more general context of hybrid dynamical systems, so as to relate the terminology in the ÿeld to our systems. The hybrid automata model is one of the reasonably general classes of hybrid systems [86]. It is deÿned as a combination of state-space models described by diþerential equations for the continuous dynamics and ÿnite automata for the discrete dynamics. In the following, we express the dwell-time switched system with a logarithmic partition as a hybrid automaton.
5.3.
95
Dwell-time switching control
The discrete states for the dwell-time switched system correspond to the state partition cells, is inÿnite.
X[0 0] ;
and
X[
1] , j
j;ü
2Z
, and thus the discrete state space
So, strictly speaking, this system does not ÿt into the class of
hybrid automata. We emphasize that the purpose of this section is merely to represent our switching system in a diþerent format. We also note that the origin, contained in the cell
X[0 0] ;
, is a special state
since once the state trajectory comes there, no switching will ever occur. Thus,
X[0 0] X[1 0] f g X[2 0] X[0 0] r f g
we deÿne two subcells in
as
;
;
:=
;
:=
0
;
0
;
:
As a hybrid automaton, the dwell-time switched system can be described
LX AEIF
as a six-tuple (
;
;
in the following. Its
ûL
;
;
;
), where the deÿnitions of the symbols are listed
state transition graph
L f 2L
In the graph, each
ûX
locations
is the set of discrete states or =
=
R
n
ü R+
is shown in Fig. 5.7.
[1; 0]; [2; 0]; [j;
ö
1] :
j
2 Zg
:
is a vertex.
l
is the space of the continuous state
is the state of the system in (5.5) and
symbols
is the set of
given by
xc
= [x0 ; xT ]0 , where
x
is the timer state, which times
xT
the dwell time and is thus governed by
ûA 2L ûE
and is deÿned by
= 1.
xT
A f g =
al
l2L ,
where each symbol
al
represents the event that the discrete state transitions to a new state l
and serves as a label of the edges in the graph.
events
is the set of
or transitions, which are the edges. Each edge is
G J 2 L( 2 A X üf g G X ü 1 6 ÿöþ ý þ ýõ J 2R
deÿned by (l; al0 ; l; l
ll0 ;
0
ll0
;
=
;l
0
), where
al0
;
cl(
l0 )
T
cl(
l0
)
[T ;
x
=
xT
;
When the continuous state cell and dwell time from state
l
to
l
0
is
T
x
0
xc
û
The mapping
:
is in
if
l
= [2; 0];
if
l
= [1; 0]; [2; 0];
n
x
guard
enabled. During jump set
I L! :
);
; xT
G
ll0 ,
ÿ
6 ò
T
l
=
0
l ;
:
that is, when
x
is in a new
has passed since the last transition, the transition
J
state x ^c speciÿed by the reset to 0.
;
the transition,
xc
as (xc ; x ^c )
; the timer state is
2J
jumps to a new
2X speciÿes the subset of the state space in
which the continuous state
xc
must stay while the discrete state is l , i.e.,
96
Chapter 5.
a[2;0]
G
xT
[2;0][2;0]
Towards data rate reduction
:= 0
[2; 0] x _
=
f[2;0] (x)
=0
x _T
=1 location invariants x(t) Rn
2 2 [0; T ]
xT (t)
G
xT
[2;0][1;0]
xT
:= 0
[2;0]l
xT
a[2;0] al
[2;0]l 0
:= 0
x _
:= 0
=
x _T
f[1;0] (x)
a[2;0]
G
al 0
[1; 0]
G
l [2;0]
xT
G
a[1;0]
G
:= 0
l
=0
xT
=1
0
[2;0]
:= 0
location invariants x(t) = 0 xT (t)
l x _
=
x _T
:= 0
xT
G
fl (x)
+
xT
:= 0
a[1;0]
G
l
location invariants
2 I (l)
l
a[1;0]
l [1;0]
=1
xc (t)
2R
xT
G
ll
0
:= 0
al
al 0
0
[1;0]
x _
=
x _T
0
fl 0 (x)
=1
location invariants
G
0
xc (t)
l l
xT
2 I (l ) 0
:= 0
Figure 5.7: The hybrid automaton of the dwell-time switched system (l; l 0 f[j; ö1]
:
j 2
Z
g)
2
5.3.
97
Dwell-time switching control
xc (t)
2I
(l (t)) at all
t
ÿ
8 >:R ü [0 (R ü [0
0. It is deÿned as
+;
n
l
The set
I
ûF
;T
(l ) is called the
to occur when
xc
S ]) (X
; T ];
n
[j;ü1]
üR
+ );
location invariant
if
l
= [1; 0];
if
l
= [2; 0];
if
l
= [j;
ö
1],
j
2Z
of l . A transition is
is the set of functions given by
þ
xc
(t)
F f g =
l2L .
Fl
At each location
öþ
=
Fl
x(t)
ýõ þ =
xT (t)
fl (x(t))
state
xc (t)
t
2
,
1
:
At each
, the local trajectory is given by (l; xc ; ý ), where the continuous
evolves for
2
t
[0; ý ] and satisÿes xc (t)
for all
2L
ý
The trajectory of this hybrid automaton is deÿned as follows.
2L
l
= [x0 ; xT ]0 satisÿes the diþerential equation
ý
x(t) xT
l
enforced
violates the condition given by the location invariant.
the continuous state
location
:
xc (t)
2I
=
(l );
Fl (xc (t))
[0; Æ ). Then, the trajectory of the hybrid automaton is given as a
sequence of local trajectories accompanied with events: (l0 ; xc0 ; ý0 ) Here the events
f g alk
t1
!
al
1
!2
al
(l1 ; xc1 ; ý1 )
(l2 ; xc2 ; ý2 )
Z+ occur at switching times
k2
= ý0 ;
t2
and, at each switching time
= ý0 + ý1 ; tj ; j
t3
!3 þþþ
al
:
given by
= ý0 + ý1 + ý2 ; : : :
;
= 1; 2; : : : , the continuous state satisÿes two
conditions speciÿed by the guard and the jump as xc(j ý1) (tj ) xcj (tj )
2 Gþ ÿ1 lj
=
lj ;
xj ý1 (tj )
0
ý :
The discrete state space is not ÿnite, as in the deÿnition of hybrid automata. Nevertheless, the dwell-time switched systems can be well represented in this framework. As an automaton, the dwell-time switched system has some characteristics as follows. The automaton is deterministic and events
alk
are internally induced in the
following sense. At each location, the local trajectory is unique. Each time the location invariant is violated, there is only one guard that allows (and hence forces) one transition to occur.
Therefore, for any given initial continuous
state, the trajectory is unique and is deÿned on [0;
1
). Moreover, because of
98
Chapter 5.
Towards data rate reduction
the presence of dwell time, local trajectories are always deÿned with Only one switching, or an event, can take place in time
T,
ý
ÿ T.
and multiple events
at one time instant can not occur. As seen in the graph in Fig. 5.7, all the discrete states are connected to each other by edges in both directions except between [1; 0] and the rest. The states [1; 0] and [2; 0] are special. There is no possible transition from [1; 0], while only [2; 0] has a selþoop, as a result of the rather complicated switching law. Also, all transitions starting in [2; 0] are forced at
5.3.4
t
=
T.
Solution to the dwell-time switching problem
The approach taken here to solve the dwell-time switching control problem is to reduce the problem to a quantized sampled-data control one. We ÿrst obtain a system equivalent to the one in Fig. 5.6. We discuss its similarity to a sampled-data system; the main diþerence in the two setups is the sampling time. We then give a suÆcient condition for the stability of this equivalent system, which is identical to the one we used for solving the quantized sampleddata control problem. Thus, the solution for the dwell-time switching control problem follows immediately. Let us recap how the system in Fig. 5.6 evolves: If of the cell in which
x(0)
lies, then the control is
u(t)
j0
=
for
t
= 0 up to
ÿ T . At the ÿrst switching time t = t1 , the trajectory is in or enters Xj1 with j1 6= j0 , and the control switches to u(t) = uj1 for t 2 [t1 ; t2 ).
= t1 the cell t
This is repeated for all switching times cell
denotes the index
uj0
x(t)
is in, we have
u(t)
=
u ( ),
it
k . Denoting by j (t) the index of the t 2 [tk ; tk+1 ).
t
where i(t) = j (tk ) for
With this switching scheme in mind, we can convert the dwell-time switching system in Fig. 5.6 to that in Fig. 5.8(a), where
fQj gj
fqj gj
2S
Æ is the logarithmic
Q
R and output values ø R. Its output takes the form qj(t) , where j (t) 2 S is the index of the
quantizer in Section 4.3.3, deÿned by cells
2S
in
T 0 is the system that is governed by the dwell-time switching j t to i(t). In the decoder DT , DT 0 is the nonuniform zeroth order hold synchronized with CT 0 and is deÿned by DT 0 (i(t)) = qi(t) ; u ~0 is a cell
x(t)
is in;
C
logic, mapping
q ( )
scalar. Thus, the control input is u(t)
=u ~0 qi(t) = u ~0 QÆ (e
0
k ));
x(t
t
2 [tk ; tk+1 ):
Æ , the cells Qj can be obtained as Qj = ý1 from (4.32) and the output values as qj = u ~0 uj , j 2 S , from (4.30). Note that, for the quantizer
Q
0
e
Xj
This system is very similar to the sampled-data control system in Fig. 4.7.
Q = QÆ commutes with the uniform sampler T , and thus the system in Fig. 4.7 is equivalent to the one in Fig. 5.8(b), where QÆ and ST are interchanged.
Note that there the quantizer S
As we compare the two systems in Figs. 5.8(a) and 5.8(b), the uniform
T is replaced by CT 0 , which has a nonuniform sampling time, and T is replaced by DT 0 . We note that the switching in CT 0 is intermittent and slower than the sampling in ST . The discrete-time signal taking discrete values between QÆ and HT in Fig. 4.7 is represented now by i(t). sampler
S
likewise
H
5.3.
99
Dwell-time switching control
-
u(t)
ÿ
u ~0
ÿ
T0
D
it
q ( )
T
x(t)
~) (A; B
T0
C i(t)
ÿ
Æ
Q
jt
q ( )
ÿ
0
e
õ(t)
ÿ
T
D
C
(a) Dwell-time switching system
-
u(t)
x(t)
~) (A; B
?
u ~0
0
e
6
ÿ
T
H
ÿ
T
S
Æ
Q
ÿ
(b) Sampled-data system
Figure 5.8: Comparison of systems
We follow the design methodology for the quantized sampled-data system as summarized in the following. Fix a positive-deÿnite matrix
2 Rn
(4.5) and obtain the solution K
=
úR
ý1
0
B P,
and set
e
=
÷
P. 0
2 (0; 1).
n and
þ
Q
We design
e
as usual: Given
0, solve the Riccati equation
2 R1
þn be the LQR optimal one, ~ . Replace B = (1 + ÷) K B with B .
Then let
k k
K = K
R >
K
k k
The problem is reduced to designing the dwell-time switching controller so that the closed-loop system is quadratically stable with respect to (P; ÷J ).
þ) of the system, the decay rate of V (x) := x P x ý ú÷x (t)J x(t), t ÿ 0, where J = Q + P BR 1 B P . Recall
That is, for any trajectory is
V
(x(t); u(t))
0
x(
0
ý
0
that in the continuous-time system with the full state feedback (i.e., in Fig. 4.1(a), the decay rate is given by
V
(x(t); K x(t)) =
u
=
úx (t)J x(t).
K x)
0
The following lemma gives a suÆcient condition for quadratic stability of the system in Fig. 5.8(a) with a dwell-time switching controller characterized by
T >
0,
fXj gj
2S
, and
U
=
fuj gj
2S
. We make use of operator
so the result is expressed in terms of the
õ-ô
F
in (4.10),
plane. This key lemma is parallel
to Lemma 4.3.3 for the quantized sampled-data control problem. Lemma 5.3.1 Suppose the dwell-time controller is designed to have the fol-
lowing property: For every t
2
j
2 S , if x(0) is in Xj , then F (x(t)) 2 F X (uj ) for
[0; T ]. Then the dwell-time switched system in Fig. 5.6 is quadratically
100
Chapter 5.
Towards data rate reduction
stable with respect to (P; ÷J ).
Proof
Let j0 denote the index of the cell in which x(0) lies. By assumption, x(0), the trajectory x( ) of the switched system satisÿes
þ
starting at
F (x(t))
2 F X (uj0 )
for
t
2 [0; T ]:
By (4.12), this is equivalent to x(t)
This implies that
X
øX
2 X (uj0 )
Xj0 ø X (uj0 )
for
2 [0; T ]:
t
and therefore, since
that cl( j0 ) (uj0 ). By the deÿnition of switching times, if t1 x(t)
2 cl(Xj0 ) ø X (uj0 )
Hence, whether the switching occurs at
t
> T,
for
=
T
t
or
X (uj0 )
is a closed set,
then
2 [T ; t1 ]: t > T , x(t)
is in
[0; t1 ].
X (uj0 ) on
Repeating this argument, we obtain
2 X (ujk )
x(t)
where
jk
for
t
2 [tk ; tk+1 ]; k ÿ 1;
is the index of the cell which
x(t)
implies that the control Lyapunov function the deÿnition of
fX (uj )gj V
2S
V
is in or enters at
t
=
(x(t)) decreases for all
tk . t
This
ÿ 0 by
in (4.6); in particular, we have
(x(t); u(t)) =
V
(x(t); ui(t) )
ý ú÷x (t)J x(t): 0
ÿ
Hence, by Proposition 4.1.3, the system is quadratically stable.
With the logarithmic quantizer in the sampled-data system, asymptotic stability can be achieved by Corollary 4.4.4. The construction of the quantized sampled-data controller is based on the suÆcient condition in Lemma 4.3.3. For the logarithmic quantizer, this condition holds with r0 = 0. This is equivalent to the suÆcient condition above in Lemma 5.3.1, and hence the dwell-time control problem can be solved by constructing the logarithmic quantizer and sampling period
T
QÆ
as in Corollary 4.4.4.
We are ready to construct the stabilizing dwell-time controller. 1. Fix u ~0
÷
2 (0; 1). Select matrix Q and R > 0 in (4.5) that give ÿ > 1. ú 1)=ÿ.
Set
= (ÿ
2. Design a logarithmic quantizer as in Corollary 4.4.4: Take obtain
T >
0, cells
fQj gj
2S
, and output values
fqj gj
2S
.
Æ >
1 and
5.4.
101
Finite partition dwell-time switching
3. Obtain the state partition
fX g
j j 2S
through (4.32):
X00 = M = Ker ñ X = 2 R : 2 ö[ K;
ü
j
n
x
0
e x
j
Æ ;Æ
j
+1 )ô ;
j
2 Z+
:
Their corresponding control inputs are given by (4.30)
00 = u~0 q0 = 0; u =u ~0 q[ 1] ;
u
ü
j
j;ü
j
2 Z+
:
We state the stability result for the dwell-time controller as a theorem. For the dwell-time switching controller just deÿned, the closed-loop system in Fig. 5.8(a) is quadratically stable with respect to (P; ÷J ). Theorem 5.3.2
The proof of this theorem follows from Corollary 4.3.9 of Theorem 4.3.5; the proof of the theorem makes use of the suÆcient condition in Lemma 4.3.3, which is equivalent to the suÆcient condition in Lemma 5.3.1. The motivation to extend the quantized sampled-data controller to the dwell-time switching controller was to reduce the data rate. In fact, this scheme guarantees to decrease the number of message transmissions over the channel since the time between switchings is larger than or equal to T . We note, however, that the switching times can be any real numbers and thus i(t) can not be transmitted perfectly with a ÿnite data rate. In this respect, we consider our results to be theoretical, and this method may not be directly applicable using real control networks. To get a ÿnite data rate, one could time-sample i(t) appropriately or use an event-driven type protocol with some measures to take care of the possible time delay.
5.4
Finite partition dwell-time switching
The dwell-time controller of Section 5.3.4 employs a partition of the state space with inÿnitely many cells: The stringent requirement of x(t) 0 demands ÿner and ÿner resolution near the origin in the state space. In this section, we want to ÿnd a dwell-time controller deÿned by ÿnitely many cells such that all trajectories starting in a given bounded subset of the state space eventually go into the ball 0 (r0 ) of a prespeciÿed (small) radius r0 . This notion is called \practical stability" in [28]. In Subsection 5.4.1, we deÿne a state partition with ÿnite cells, similar to the logarithmic one. The dwell-time controller for this new partition employs a switching logic simpler than the one for the original partition. For the sake of completeness, we describe this switching law ÿrst. In Subsection 5.4.2, we present a stabilization result for this ÿnite partition case. The dwell-time switching control up to these subsections deals only with the state feedback case with no noise in the measurement. In Subsection 5.4.3, we extend our results to measurement noise rejection and then in Subsection 5.4.4 to output feedback control.
!
B
D
102
Chapter 5.
5.4.1
Towards data rate reduction
Dwell-time switched systems with ÿnite partitions
Repeating Subsection 5.3.1, we deÿne a
ÿnite logarithmic partition
bounded region
n
õj
:=
õ0 Æ
j
,
in the state space
2Z
j
R
n
subspace in
D
+.
spanned by
ü
j
X
ý
j
,
:=
n
Let
kk
with
0 and
õ0 >
of a given 1, and let
Æ >
M
= 1 and denote by
e
The partition cells are now deÿned by
n
x
:=
n
x
0
:
e x
:
0
(
õ0 ; õ0 )
;
[õj ; õj +1 )
e x
;
j
1 covers
;N
0
=
[0;0]
j
=
[j1 ;ü1]
and set the index set
N
. We rename the cells to
+ j
, and
as
ÿ
=
=
j
ÿ
00 ,
the
(5.7)
+:
large enough so that the union of the cells
N
= 0; 1; : : :
j
e.
2R
e
.
Xÿ f 2 R 2ú g X f 2R 2ö g 2Z 2N X X ú D X X X X X X X ú S f ú gü fú g 00
Here, we take
Then take
R
00 ;
ü
=
j1
0; 1; : : :
;
j1
= 0; 1; : : : 1
;N
1;
;N
1; 0; 1 . Both notations are
used throughout the section.
Every cell in a ÿnite logarithmic partition has interior, and thus the dwelltime switched systems equipped with such a partition can be deÿned in a simpler manner as follows. Consider the switching system x(t)
where
x(t)
2R
switching logic
n
=
dwell time j
2S
N
The switching logic determines = 0 and let
t0
x(t)
0 (x(t))
=
fj
(5.8)
2S
is the state of the system and i(t)
with a ÿxed
Lipschitz continuous for each Set
fi(t) (x(t));
j0
and also that i(t)
from
x(t)
f0 (0)
ÿ
x
= min
2
fÿ t
t0
+T :
ÿ
x
(t)
fj
is in. The system
1
). The ÿrst switching
2 X0 g =
int(
j
)
:
Then set i(t) = j0 for t [t0 ; t1 ) and j1 to be the index of the cell x(t1 ) cl( j1 ) with j1 = j0 .
2 X
6
Repeating this procedure, we obtain
f g tk
Z+,
k2
and the trajectory
þ
x(
þ
i(
is globally
= 0.
x(t0 )
, on [t0 ;
time t1 is given by t1
is the state of the
in the following manner:
be the index of the cell that
has a unique solution, say
N
0. We assume that
T >
) on [0;
1
X1 j
, where
), the switching times
) satisfying (5.8) for all
t
ÿ
0 except at the
switching times. These are all uniquely determined, and the switching logic is causal. Note that if
x(0)
= 0, then
equilibrium.
x(t)
= 0 for
t
ÿ
0, so the origin is an
This class of systems belongs to the dwell-time switched systems in Section 4.1. However, because of the ÿnite partition scheme, it is impossible to deal with asymptotic stability. In the following construction, we are concerned with attractive balls for such systems.
5.4.
103
Finite partition dwell-time switching
u(t)
n(t)
-
- ?d
x(t)
(A; B )
x ^(t)
DT
ÿ
CT
i(t)
Decoder
ÿ
Coder
Figure 5.9: Dwell-time control system
5.4.2
State feedback control
In this section, a stabilization problem for a linear time-invariant plant using a dwell-time controller with a ÿnite logarithmic partition is studied. Consider the feedback system in Fig. 5.9. The pair (A; B ) represents the plant x(t)
where
x(t)
2R
n
is the state,
=
u(t)
Ax(t)
2R
+ Bu(t);
is the control input, and
n(t)
2R
n
, is
the measurement noise. We assume the plant to be stabilizable. The dwell-time controller, composed of the coder is deÿned by three components: a dwell time
T,
fX g 2SN , and a corresponding set U := fu g 2SN j
CT
j
maps
j
x
to
i
j
CT
and the decoder
DT ,
a ÿnite logarithmic partition of control values. The coder
and is governed by the switching logic deÿned in the previous
subsection. The decoder is memoryless and is given by Here we consider the case
þ) õ 0, and,
n(
given
÷
DT (i(t))
=
ui(t) .
2 (0; 1), R0 ; r0
>
0, the
problem is to ÿnd a dwell-time controller such that, for the closed-loop system in Fig. 5.9, the ball
B0 (r0 ) is attractive from B0 (R0 ) with respect to (P; ÷J ).
This is a modiÿed problem of the design of dwell-time controller with a (countable) logarithmic partition in the previous subsection.
The stability
result relies on the techniques developed for the sampled-data control systems with ÿnite quantizers. Given
R0 ; r0
with
R0 > r0 >
0, we start the design procedure by following
Steps 1 and 2 in Section 5.3.4;
Æ >
parameters to be designed are
N
1 and
T
are determined in these steps. The
2 N and õ0 > 0. Then, deÿne the partition ÿ , X ü , j 2 f0; 1; : : : ; N ú 1g, as in (5.7) and their corresponding control cells X00 j
inputs as
ÿ = 0; ü = öu~ u
u00 j
ü
0 q[j; 1] ;
j
2 f0; 1; : : : ; N ú 1g:
The theorem below gives the parameters for the dwell-time controller that ensures stability.
104
Chapter 5.
Towards data rate reduction
Deÿne the dwell-time controller using
Theorem 5.4.1
õ0
N
=
p
pú
a
&
ÿ (a
s
+ 1)
ï s
Æ
=
ú1
ÿ
log
þmin (P ) þmax (P )
R0
þmax (P )
õ0
þmin (P )
r0 ;
!'
Then, for the closed-loop system in Fig. 5.9 with partition using
SN ,
the ball
B0 (r0 )
:
þ) õ
n(
0 and the ÿnite
B0 (R0 )
is quadratically attractive from
with respect to (P; ÷J ). The proof of this theorem follows directly from Corollary 4.4.4, in a manner similar to the countable partition case in Theorem 5.3.2. 5.4.3
State feedback under measurement noise
So far in our theory, perfect measurement with no noise has been assumed on the sensor side. However, in practice this is diÆcult to expect since the measurement is often obtained by a digital device and hence gives only quantized values, at a ÿner level than that of the coder. As a result, the measured state, denoted by x ^(t), may indicate a cell in which the actual state
x(t)
is not
contained. Although, if the magnitude of the noise is small, it is unlikely that this error can destabilize the closed-loop system, quadratic stability may not be guaranteed in the strict sense. Moreover, the noise can make it diÆcult for the system to maintain the small ball prespeciÿed in the design to be attractive. This motivates us to extend the design of dwell-time controllers for the case where measurement noise is present. Consider again the system depicted in Fig. 5.9, where x ^
:=
x+n
n(t)
is the noise and
is the measured state. We assume that the noise is bounded, that is,
k
for some
ký
ÿ
2
n0 > 0, n(t) n0 for t 0. The problem here is, given ÷ (0; 1), with R0 > r0 > 0, and n0 > 0, to design a dwell-time controller with a ÿnite partition such that 0 (r0 ) is quadratically attractive from 0 (R0 ) with
R0 ; r0
B
B
respect to (P; ÷J ). We propose a design method based on the ÿnite partition dwell-time controller. The design parameters here are design we obtain
Æ >
1 and
2 Rn ,
e
2
õ0 > 0 and N N . From the original among others. Now we deÿne a new
partition as follows. Given
n0 >
0, take
õ0 >
0 such that õ0 >
and deÿne j
õ0 := = 1; 2; : : : by
õ0
ú n0
and
j := Æõjý1 ;
õ
õ0
j
õ
Æ Æ
:=
:=
õ
+1
ú1 õ0
(5.9)
n0
+
n0 .
j ú n0 ;
Then deÿne
õ
j := õj ú n0 :
j
õ ;
j
õ ;
j
õ
for
5.4.
105
Finite partition dwell-time switching
fõj gj Z+
The new partition is now deÿned by the new set
2
control inputs are given by
00 := 0;
öu~0 õj ;
ü
ÿ
j :=
u
u
= 1; 2; : : :
j
and (5.7). The
:
We note that the cells deÿned above is smaller than those of the original partition in the previous subsection. In addition, we deÿne another set of cells, larger than those deÿned above, by
ñ
X 00 := fx 2 Rn : X j := x 2 Rn : ÿ
ü
ô
2 (úõ0 ; õ0 )g ; e x 2 ö[õj ; õj +1 ); 0
e x 0
;
j
= 1; 2; : : :
:
The dwell-time switching controller deÿned using this partition works similarly to the original one; the switching hyperplanes are only located diþer-
X 00 ÿ
ently. It is clear that the cells
and
Xj ; ü
= 1; 2; : : : , overlap and hence
j
do not comprise a partition. However, these two sets of cells guarantee that if
2 Xj then x(t) 2 X j for t ÿ 0 and j = 1; 2; : : : , and likewise with X00 and X 00 . This is the key characteristic used by the controller for our result in ü
ü
x ^(t)
ÿ
ÿ
this section. In the following, we refer to these two sets of cells as the ÿnite
(logarithmic) partition with overlapping cells . The next lemma gives a closed form for sequence is increasing.
ö
Lemma 5.4.2 (i) For j = 0; 1; : : : ,
õj
(ii)
0 > 0 and õj
õ
Proof
=
Æ
1
> õj ý
j
+1
0 ú n0 Æ ú 1 Æ
õ
for
j
õj
=
õ
fõj gj Z+
and shows that this
2
+ n0
Æ Æ
+1
ú1
:
(5.10)
= 1; 2; : : : .
(i) By deÿnition,
ú n0 = Æõj 1 ú n0 = Æ (õj 1 ú n0 ) ú n0 = Æõj 1 ú n0 (1 + Æ ): õj
ý
ý
(5.11)
ý
So the closed form is õj
=
0 ú n0 (1 + Æ)
j
Æ õ
X1 jý
=0
l
= (ii) By deÿnition of
0
õ
j
Æ
l
ú1
j
0 ú n0 (1 + Æ) Æ ú 1 : Æ
Æ õ
and the assumption on +1
0
õ
in (5.9),
0 = õ0 ú n0 > Æ ú 1 n0 ú n0 ;
õ
Æ
106 but since j
Chapter 5.
Æ >
1 we have
õ0 >
Towards data rate reduction
0. It is also clear from (5.11) that
õj > õj
= 1; 2; : : : .
ý1 for
ÿ
We must show now that the new partition is suÆcient to make the ball
B0 (r0 ) attractive from B0 (R0 )
under the presence of suÆciently small noise.
The main concern is to ensure that starting in
r0
is large enough so that all trajectories
B0 (R0 ) eventually go into and stay in B0 (r0 ).
Theorem 5.4.3 Assume that n0 is small enough that
n0
0, for the system in B0(r0 ) is quadratically attractive from B0 (R0 ) with respect to some
time controller such that, given Fig. 5.10,
(P; ÷J ). Hence, we want to ÿnd the parameters for the ÿnite partition with
2
õ0 > 0, N N , and the size of the overlap n0 > 0. The diÆculty in the use of an observer is that the two states may grow
overlapping cells:
large during the transient response due to the error between them. Thus we have to ÿnd a partition that covers the region in the state space through which the estimated state may travel. Our (naive) control strategy is (i) to wait and not to apply any control until the error becomes small enough, and then (ii) to start the dwell-time control so that, from this point, the state will go into a small ball centered at the origin in a quadratic manner. We describe this strategy formally and show that it is indeed stabilizing. Take
n0 >
0 such that
n0
0 and
Towards data rate reduction
0 such that
þz >
kz (t)k ý a eý z kz (0)k for t ÿ 0: ù t
z
Now take
T0 >
0 such that R1
N
:=
ý
ùz T0
az e
1
kAk + þ
2 := 6 66log
R0
ý n0 , and then let
az R0
q
z
R1
kLC k(1 ú eý
ùz T0
ùmax (P ) ùmin (P )
Æ
3 77
);
ú n0 ý+11 7: Æ
Æ
ú n0 +1 ý1 Æ
õ0
(5.15)
Æ
(5.16)
(5.17)
Theorem 5.4.4 For the dwell-time switching controller using the ÿnite par-
tition with the overlapping cells that we just deÿned, if the control input given by
( u(t)
then
=
is
2 [0; T0 );
0;
if
ui(t) ;
otherwise,
t
u
B0(r0 ) is quadratically attractive from B0 (R0 ) with respect to (P; ÷J ).
Proof Ax ^(t)
Since
u(t)
= 0 for
2
t
[0; T0 ) and x ^(0) = 0, the observer x ^(t) =
+ LC z (t) has a solution on [0; T0 ) as
Z
x ^(t)
t
=
ý
A(t
e
ü)
LC z (ý ) dý :
0
Now let R2
We claim that R0 ,
R1
= max
kx^(t)k:
2[0;T0 ]
t
in (5.16) gives an upper bound on
R2 .
From (5.15),
and x ^(0) = 0, we have
kz (ý )k ý a eý ý a eý z z
and thus R2
ý ý =
Z max
2[0;T0 ]
t
1
kAk + þ
=
R1 :
R0 ;
e
kAk(týü ) kLC ka eýùz ü R
az R0 z
kx(0) ú x^(0)k
kAk(týü ) kLC kkz (ý )k dý
t
0
ùz ü
e 0
max
2[0;T0 ]
t
Z
t
ùz ü
0 dý
z
kLC k(1 ú eý
ùz T0
)
kx(0)k ý
5.4.
109
Finite partition dwell-time switching
Hence we must ÿnd a ÿnite partition with overlapping cells that covers
E0 of V containing B0 (R1 ) is ô ñ E0 = x 2 Rn : V (x) ý R12 þmax (P ) :
B0 (R1 ).
The smallest level set
Therefore, similarly to Theorem 5.4.3, the number of cells to cover by
N
E0 is given
in (5.17).
x(t) = x ^(t) + z (t) for t ÿ T0 with kz (t)k ý n0 , x(t) B0 (r0 ) eventually by construction, following Theorem 5.4.3. ÿ
We can see that since goes into
As described above, we have taken the viewpoint that some computation resources are available on the sensor side. This leads us to the use of an observer; to take account of the error between the actual state and the estimated one, our control strategy is to wait until the error becomes small and then to start control employing the partition with overlapping cells. This partition has been shown to handle small noise in the measurement. A stabilization problem for the quantized output feedback case is also considered in [28]. Since the design of an observer gain is a dual problem to that of the feedback gain, an observer incorporating a quantizer is proposed: One of its inputs is the quantized error between the actual output and the estimated one; the estimated state is then sent to the quantizer as in our method. A question that arises here is why we would need an extra quantizer in the observer. On the other hand, the output feedback stabilization in [12] using a time-varying quantizer relies on the existence of
K
such that
A
ú BK C
is
stable. However, the static output feedback problem has not been solved.
5.4.5
Cart-pendulum system example
In this section, we apply the dwell-time switching controller to the familiar cart-pendulum system in Fig. 5.11. Here, the cart is of mass x,
and the force
of mass
m,
u
length l , and angle
2 3 2 x 0 66x77 660 4 ø 5 = 40 ø
We set input
M
u
and position
0
= 2 kg,
m
1
ø.
The linearized state space model is
32 3 2 x 6 6 x7 07 7 6 7 +6 5 4 5 4 ø 1
0
0
0
úmg=M
0
0
0
(M + m)g=M l
= 0:4 kg, and
l
0
ø
3 7 7 5 u:
0 1=M 0
ú1=M l
= 0:5 m. The states
x
and
ø
and the
have units of m, radians, and N.
First,
was designed using the result in Section 4.6. We used
ÿ
÷
= 0:1;
ò
= 12:3;
and obtained
ÿ
in (4.51) on
T and dT over all
plot of
M
is applied by an actuator. On top of the cart is a pendulum
Æ >
= diag(50; 10; 0:1; 0:1)
= 15:0. For this setup, the maximum
c
1 versus maximum
parameters.
~ Q
Æ >
T.
T
meeting the bounds
1 was about 0:026. Figure 5.12 shows a
We can see the trade-oþ between these two
110
Chapter 5.
m
ü u
Towards data rate reduction
l
M x
Figure 5.11: A cart with an inverted pendulum
T
Ideally, chose
T
and
Æ
should both be as large as possible. As a compromise, we
= 0:021 and
Æ
= 1:97. For
0
õ
follows:
= 0:1, the resulting parameters are as
ú0:0644 ú 0:101 ú 0:971 ú 0:208] ; fõ0; õ1 ; õ2 : : : g = f0:1; 0:197; 0:386; : : : g; fu0 ; u1 ; u2 ; : : : g = fö0:146; ö0:287; ö0:564; : : : g; X00 = fx 2 R4 : e x 2 (úõ0 ; õ0 )g; Xj = fx 2 R4 : e x 2 ö[õj ; õj+1 )g; j 2 Z+: e
ü
ü
0
=[
ü
ÿ
0
ü
0
û B û ).) (Note that the state space is of (A; In the following, we show simulation results for three diþerent dwell-time switching systems: (A) the original system using the ÿnite partition as deÿned above, (B) the system using the partition with overlapping cells for noise rejection control in Section 5.4.3, and (C) the system with an observer from Section 5.4.4. A. Dwell-time control with ÿnite partition:
The time response plots
are given in Fig. 5.13 for the initial condition [x(0) We used
0
x(0) ø (0) ø (0)]
0
= [0 0 0:1 0]
:
0 = 0:1 so that the smallest control input u+ 0
õ
is visible in the plot.
Figure 5.13(a) shows the responses in solid lines. The number of cells needed for this initial condition and
0
õ
is
N
= 11, but only 9 of them were used in
these plots. The ideal response for full state feedback (u =
K x,
no switching,
no state partition) is in dashed lines. Although the transient responses in these plots look similar, the behaviors in the steady state diþer; while the state of the optimal system goes to the origin, that of the system with the dwell-time controller, as expected, only
5.4.
Finite partition dwell-time switching
111
0.03
Max dwell time T
0.025
0.02
0.015
0.01
0.005
0 0
10
5
30
25
20
15
δ Figure 5.12: Trade-oþ between Æ and maximum dwell time T
stays around the origin. We see a periodic change in the state and also in the control input due to the dwell-time switching. We can say from these plots that the proposed controller is eÆcient in partitioning the space of the signals transmitted from the sensor to the actuator without causing much distortion in performance.
Here, as mentioned, the
response required only 9 cells, and it is fairly close to the optimal one, which virtually requires signals of real numbers. To get the state closer to the origin, we could make õ0 smaller, though this might result in a larger N . We next calculated responses for a larger dwell time without changing the rest of the controller. The case for T = 0:126 is shown in Fig. 5.13(b) in solid lines.
The response is stable though not necessarily quadratically stable; we
don't know how conservative our result is for quadratic stability with respect to (P; ÷J ).
However, compared to the plots in Fig. 5.13(a), we may say that
the transient response is more oþ from the optimal one.
For larger T , the
system became unstable.
B. Noise rejection control:
We compare the diþerence in the performance
between the system using the original partition and the one using the partition with overlapping cells. To make the measurement noise as malicious as possible
ý1=2 e, where n
in putting x and x ^ in diþerent cells, we used n(t) = n1 (t)Qú
ún0 ; n0 ] and n0
uniformly distributed over [
= (Æ
1 (t)
is
ú 1)=(Æ + 1)õ0 ü 0:7 = 0:0228.
This noise is in the direction of e after a change in coordinate using some matrix Qú . This noise is bounded as
ý1=2 ek ô 0:045.
kn(t)k ý n0 kQú
Then, the
112
Chapter 5.
Towards data rate reduction
0.2
x
0.1 0 −0.1 0 0.1
2
4
6
8
10
12
2
4
6
8
10
12
2
4
6
8
10
12
θ
0.05 0
−0.05 0 10
u
5
0 −5 0
time t (a) For
T
= 0:021
0.2
x
0.1 0 −0.1 0 0.1
2
4
6
8
10
12
2
4
6
8
10
12
2
4
6
8
10
12
θ
0.05 0 −0.05 0 10
u
5 0 −5 0
time t (b) For
T
= 0:126
Figure 5.13: Time responses for dwell-time control
5.4.
Finite partition dwell-time switching
113
dwell-time controller using the partition with overlapping cells is deÿned using
fõ0 ; õ1 ; õ2 ; : : : g = f0:1; 0:129; 0:186; : : : g; fuü0 ; uü1 ; uü2 ; : : : g = fö0:113; ö0:155; ö0:238; : : : g: In the simulations, the dwell time of T = 0:105 was used (thus, no guarantee for quadratic stability) to make the diþerence between the two systems more apparent. Time responses for the two systems are given in Fig. 5.14, for the system with the original partition in (a) and for the system using the one with overlapping cells in (b).
In each ÿgure, the responses of position x, angle ø,
and control u are shown. The initial condition is
[x(0) x(0) ø(0) ø(0)] = [0 0 0:1 0]:
The numbers of cells used were 9 for Fig. 5.14(a) and 14 for Fig. 5.14(b). We can see that, after an initial transient, the system with the original partition exhibits a larger change in the cart position compared to the other system; the magnitude in the change of ø seems to be slightly larger as well. This is because, closer to the origin, the cell sizes are smaller, and thus with the original partition the noise can easily aþect the measurement to give a wrong cell index, whereas the proposed system can take account of such noise.
C. Observer-based control:
For the system in Fig. 5.10, we assumed that
the sensor measurement is available only for x and ø, i.e.,
þ
C =
ý
1
0
0
0
0
0
1
0
:
2 R4þ2 for the observer so that the eigenvalues of A ú LC were placed at ú0:2; ú0:15; ú0:4, and ú0:3, and obtained 2 3 0:575 0:00499 60:0736 ú1:96 7 L = 6 4 0:611 0:475 7 5: We designed L
0:149
23:6
Thus the time constant for the observer error is about 5 seconds. very large, but is chosen so that errors are visible in the plots.
This is
Moreover,
of course, we don't need to estimate x and ø since these are available in the measurement; however, for the sake of presentation, we used all the estimated states for control. We calculated time responses for three diþerent systems for the initial condition
[x(0) x(0) ø(0) ø(0)] = [0:01 0 0:1 0]:
(x(0) is diþerent from the previous simulations so that there is some error between x and x ^ initially.)
In Fig. 5.15, the continuous-time case with no
114
Chapter 5.
Towards data rate reduction
0.2
x
0.1 0 −0.1 0
5
10
15
20
5
10
15
20
5
10
15
20
0.1
θ
0.05 0 −0.05 0
15
u
10 5 0 −5 0
time t (a) The original partition
0.2
x
0.1 0 −0.1 0
5
10
15
20
5
10
15
20
10
15
20
0.1
θ
0.05 0 −0.05 0
15
u
10 5 0 −5 −10 0
5
time t (b) The partition with overlapping cells Figure 5.14: Time responses under measurement noise
5.4.
115
Finite partition dwell-time switching
1
x
0.5 0 −0.5 0
5
10
15
20
5
10
15
20
10
15
20
θ
0.2
0
−0.2 0
u
10
5 0 −5 0
5
time t Figure 5.15: Time responses for observer-based continuous-time control
quantization is shown as a reference. For
x and ø, the actual states are in solid
lines and the estimated ones in dashed lines. The error between them becomes small fairly quickly.
Figure 5.16(a) shows the responses for the dwell-time
control system using the original partition, and Fig. 5.16(b) shows those for the system using the one with overlapping cells. For these systems, we simply used
u(t) = ui(t)
small.
for all
t ÿ 0, i.e., T0 = 0, since the number of cells used was
Compared to the noise rejection plots, the diþerence between the responses of the two dwell-time control systems is much smaller. This is because, by the time the actual state and the estimated state come close to the origin, the error between them is not large enough to put them into diþerent cells. However, both plots are comparable to that of the continuous-time one, and this fact is supportive for our approach using an observer. The numbers of cells used are 9 for the original partition case and 12 for the case with overlapping cells. Because the responses are similar, in the latter case we had to pay the price for using the more expensive partition.
116
Chapter 5.
Towards data rate reduction
1
x
0.5 0 −0.5 0
5
10
15
20
5
10
15
20
10
15
20
θ
0.2
0
−0.2 0 10
u
5 0 −5 0
5
time t (a) Dwell-time control using the original partition
1
x
0.5 0 −0.5 0
5
10
15
20
5
10
15
20
10
15
20
θ
0.2
0
−0.2 0
10
u
5 0 −5 0
5
time t (b) Dwell-time control using the partition with overlapping cells
Figure 5.16: Time responses for observer-based dwell-time control
Chapter 6
Extensions for the multiple input case In this chapter, we extend the limited data rate control methods for the singleinput plant in Chapters 4 and 5 to the multi-input plant case. The development is in a similar order: ÿrst the sampled-data control, then the quantized sampled-data, and ÿnally the dwell-time switching control. Other problems dealt in the previous chapters can be extended as well, but are not studied here.
6.1
Quadratic stabilization of sampled-data systems
In this section, we design a sampled-data controller to stabilize a continuoustime plant with an
m-dimensional
input. The stability criterion is quadratic
stability in the continuous-time domain with a guaranteed decay rate.
6.1.1
Problem formulation
The continuous-time system in Fig. 6.1(a) is the
m-dimensional
input version
of the one in Fig. 4.1(a). The linear time-invariant plant (A; B ) is given by the state equation x(t)
where A
x(t)
2 Rn
is the state, and
Ax(t)
u(t)
+ Bu(t);
2 Rm
is the control input. Assume that
6= 0 and A is unstable, i.e., it has one or more eigenvalues with nonnegative
real parts, that rank B = K
=
2 Rmþn
m,
and that (A; B ) is stabilizable.
The matrix
is a stabilizing state feedback gain, so all eigenvalues of
A
+
BK
have negative real parts.
H. Ishii and B.A. Francis (Eds.): Limited Data Rate in Control Systems with Networks, LNCIS 275, pp. 117−160, 2002. Springer-Verlag Berlin Heidelberg 2002
118
Chapter 6.
-
Extensions for the multiple input case
x(t)
u(t)
(A; B )
K
ÿ
(a) Continuous-time system
u(t)
-
x(t)
(A; B )
?
0
U
v (t)
K
6
ÿ
T
H
T
S
d (k )
õ(t)
ÿ
õ
(b) Sampled-data system
Figure 6.1:
By the stability of
A + BK ,
the system is quadratically stable with respect
2
J Rnþn , n þ n there exists a positive-deÿnite solution P R of the Lyapunov equation 0 (A + BK )0 P + P (A + BK ) = J . Now let V (x) := x P x, x Rn . For a
to some pair (P; J ). For example, given a positive-deÿnite matrix
trajectory
þ
x(
:
V
V
u
=
ú
) of the system, the decay rate of
The derivative
With
2
K x,
Rn ü Rm
!R
of
V
(x; K x) =
2
(x(t)) is determined by
J:
along the trajectories is given by
V
(x; u) := (Ax + Bu)
we have
V
ú
0
Px
0
+x
P (Ax
+ Bu):
(6.1)
0
x J x.
Now, as in the single-input case, we turn our interest to the sampled-data system in Fig. 6.1(b). This represents a simple setup where the communication channel between the sensor and the actuator has a ÿnite data rate. Here,
T is the uniform sampler with a sampling period
S
T :õ
S
The
zeroth order hold
d : õd (k ) := õ(kT );
õ
T is given by
H
T : õd
H
7!
7!
v
:
v (t)
:=
d (k );
õ
t
2
k
2 Z+
T:
:
[kT ; (k + 1)T );
6.1.
Quadratic stabilization of sampled-data systems
and U0
2 Rmþm .
119
Observe that
u(t) = U0 K x(kT )
for t
2 [kT ; (k + 1)T ); k 2 Z+:
This sampled-data system is one type of dwell-time switched system in (4.1):
Let fu(t) (x(t)) = Ax(t) + Bu(t) and
S
=
Rm .
there is always dwell time T between switchings. then u(t) = 0 for t
ÿ tk ,
and thus x(t) = 0 for t
Clearly, f0 (0) = 0 and
Furthermore, if x(tk ) = 0
ÿ tk .
Hence, the stability
deÿnitions in Section 4.1 can be applied. We are interested in ÿnding the largest sampling period while the closedloop system maintains stability with respect to the same Lyapunov function 0
V (x) = x P x at a certain decay rate. The follows: Given ÷
sampled-data control problem
2 (0; 1), ÿnd a sampling period T
is as
and a matrix U0 such that
the closed-loop sampled-data system in Fig. 6.1(b) is quadratically stable with respect to (P; ÷J ). Since the problem for the single-input case is solved in Section 4.2, one might argue that Heymann's lemma [32] is applicable: be controllable and b
2 Im B, we can ÿnd a matrix K0
Assuming (A; B) to such that the single-
input plant (A + BK0 ; b) is controllable; then stabilization results should be immediate from those for the single-input case.
This is not possible under
our assumption that feedback loops can be closed only over a ÿnite data rate channel.
6.1.2
Generalization of the setup
In the single-input sampled-data control problem, we have taken the approach to use V
as a control Lyapunov function and look for a controller so that for
any tra jectory V decreases at all times. In this section, we generalize the setup given in Subsection 4.2.2 for the m-input case. Here as well, we limit the class of K to LQR optimal ones.
This is an
important technical assumption, but may not be such a restriction on K . If the matrix R is diagonal, the optimal gains are known to have robustness properties such as an inÿnite gain margin and a fairly large phase margin [2,43]. These are helpful in our design for stabilization with quantization and delay. Given positive-deÿnite matrices Q
2 Rn
n
þ
and R
2 Rm
m,
þ
let P
2 Rn
n
þ
be the unique positive-deÿnite solution of the Riccati equation 0
A P + PA
The LQR optimal gain is K := 0
ú P BR
úR
ý1
ý1
0
B P + Q = 0:
(6.2)
0
B P and let the control Lyapunov func-
tion be V (x) := x P x. The continuous-time system in Fig. 6.1(a) is quadratically stable with respect to (P; J ), where J := Q + P BR Fix ÷
2 (0; 1).
For u
2
Rm ,
u decreases V :
X (u) :=
n x
we denote by
2 Rn
:
ý1
0
B P.
X (u) the set of states for which
V (x; u)
ý ú÷x J x 0
o :
(6.3)
120
Chapter 6.
Extensions for the multiple input case
This can be expressed as
X
f 2 Rn
(u) =
x
0
:
0
x K R[(1
ú ý ú
+ ÷)K x
2u]
by (6.1) and (6.2). To simplify the problem, we select loss of generality. Let the subspace
M
Rn
of
M
Note that since rank B =
M
dim
basis
f1 2 Rn
M
. Let
was
=
E
e
=
þþþ k k
:= [e1
E
0
m]
e
.
K = K
?
=
e ;::: ;e
of
g ú
÷)x Qx
= 1=(1
Q
(6.4)
; ÷)I
m.
= Im K
m.
0
(6.5)
:
Hence, there is an orthonormal
m g ø Rn
þ
In the single-input case, the notation
The parameter ÿ in Subsection 4.2.2 is generalized to the matrix S deÿned by S
:= (1 + ÷)E
X
We now obtain the following form for
2 Rm
Lemma 6.1.1 For u
X
(u) =
ñ
Eõ
+y :
õ õ
, , , ,
+y
2X
0
0
0
0
0
K R[(1
õ E K R[(1 0
õ S 0
õ
ñ
?
; y
(S
(S
and
0
I )õ
2õ
y
2M
?
õ
ú
+ ÷)K (E õ + y )
+ ÷)K E õ
ú ú I )õ
ú ôý 1 ý
ý
[(1 + ÷)K E ] 0
2õ
S [(1
(6.6)
(u).
2 Rm 2 M ú ú
(u)
(E õ + y ) 0
õ
2 Rm
m
þ
K RK E :
;
S [(1
Corresponding to the decomposition
there exist unique Eõ
0
2 Rm
,
0
Proof
without
be
:= (KerK )
m,
0
(1
õ õ 0
2u
õ õ
1u
ý
+ ÷)K E ]
=
such that
2u]
0
2u]
Rn
ú ý +y
0
+y
0
x
=
Eõ
y
?
, for any
2:
y
2M
?
U
and
0
2 Rn
,
0
E E S)
=
I)
ÿ
with
0 := (K E ) 1 U0 K E =(1 + ÷):
~ U
x
(by (6.4))
(by the deÿnition of
~ := (1 + ÷)BK E and replace For simplicity, we use B
:
+ y . Then
0
(since
ýk k y
M÷M
ý k k2ô
(E õ + y ) (E õ + y )
y y
1u
ý
+ ÷)K E ]
ý
(6.7)
A system equivalent to the original one in Fig. 6.1(b) is shown in Fig. 6.2. In this setup, the control input becomes u(t)
~ E 0 x(kT ) for =U 0
t
2
[kT ; (k + 1)T ),
k
2 Z+
.
(6.8)
6.1.
121
Quadratic stabilization of sampled-data systems u(t)
-
x(t)
~) (A; B
?
0
~ U
E
6 T
H
ÿ
ÿ
T
S
0
õ(t)
Figure 6.2: Equivalent sampled-data system ~ ), In this system, for (A; B
X
(u) =
This set
X
ñ
Eõ
+y :
õ
X
(u) is
2 Rm 2 M ; y
?
0
; õ
ú ú
(S
I )õ
components in the state space. Hence, we deÿne
þ
F (x)
Notice that
X
Su
ý k k2 ô y
:
(u) plays a crucial role in our controller construction. We see in its
expression above that, for stabilization, parameters
F
0
2õ
õ
ÿþ ý
x
where
;
y
=
õ
0
E x
preserves norm. The image of
F
õ
(u) =
Clearly, for
ý
kk
=
2 Rn
:
ô
õ
x
2
2X
Rm ;
(u)
We refer to the image space of
F
ô
ÿ
,
0;
0
õ
and
: y
kk ! Rm+1 ú
and
Rn =
(S
ú ú I )õ
are the essential
y
by
E õ.
x
(u) under
F
F
0
2õ
is Su
ý
2 ô
ò :
(u):
space ;
õ-ô
õ
X
2 X
F (x)
as the
F
(6.9)
it is of (m + 1)-dimension.
A suÆcient condition for the stability of the system is given by the following lemma. Lemma 6.1.2 If, for any initial condition x(0) F (x(t))
2 X F
~ E (U 0
0
x(0))
2 Rn 2
for
t
, the trajectory
x
[0; T ];
satisÿes (6.10)
then the closed-loop system in Fig. 6.2 is quadratically stable with respect to (P; ÷J ).
2
By (6.9), the condition in (6.10) is equivalent to
Proof
for
X 0 Xþ t
[0; T ].
~ (U
E x(kT ))
( ),
V
0
x(t)
2X
~ E 0 x(0)) (U 0
Since this holds for any initial condition, we have for
t
2
[kT ; (k + 1)T ); k
(x(t)) decreases at all times: V
~ E 0 x(0)) (x(t); U 0
2 Z+
ýú
.
x(t)
2
Thus, by the deÿnition of
0
÷x(t) J x(t):
122
Chapter 6.
Extensions for the multiple input case
ÿ
This implies quadratic stability with respect to (P; ÷J ). A crucial assumption made in the 1-input case is clear later that
S > I
ÿ >
X (u) becomes what is known as a hyperboloid 2 S > I and u 2 R . Given ô ÿ 0, we call the set ÿ þ ý ò õ õ : 2 F X (u) ô
Then,
1. It will become
is an important condition we have to make as well.
of one sheet [76]. An
F
example plot is given in Fig. 6.3 for
the projection of
F
X (u)
onto the
õ-space
for
ô.
sets in the following as well. Note that for each ý1 ellipsoid with a center at õ = (S I) S u.
ú
We use this term for other ô
ÿ
0 the projection is an
2
β
1.5
1
0.5
1 0.5 0
0 −1
α
2
−0.5 −0.5
0
0.5
α
1
−1
1
Figure 6.3:
F
X (u):
a hyperboloid
For future use, we give its alternative form
ÿþ ý X (u) = õô : õ 2 Rm ; ô ÿ 0; î ð î 1 õ ú (S ú I ) Su (S ú I ) õ ú (S ú I )
F
ý
0
1
ý
ð
Su
ý u S (S ú I ) 0
1
ý
Su
+ô
2
o :
(6.11) 6.1.3
Bounds on trajectories
In this section, we present a result on bounds for the state trajectories during the sampling period.
6.1.
123
Quadratic stabilization of sampled-data systems
For
T >
0, let cT
Note that
:= max
2[0;T ]
t
!
cT ; dT
Suppose
cT
0.
1:
(6.13)
, let
m
T >
(6.12)
õ ~
1
ö
kk õ ~
aT
+ô+
dT cT
õò
kk u
:
(6.14) This is a circular cone with an axis
õ
=õ ~ , as shown in Fig. 6.4.
2
β
1.5
1
0.5
1 0.5 0
0 −1
−0.5 −0.5
0
0.5
α1
The
m-input
−1
1
Figure 6.4: A ball inside the cone
α2
F~
û (u)
counterpart of Proposition 4.2.3 is given in the following.
2
2M 2 2
þ
? . Let x( ) be the 0 = E õ~ + y~, õ~ R ; y~ trajectory starting at x(0) = x0 with a constant control u R . Then
Proposition 6.1.3 Suppose x
x(t)
Moreover, if
cT
x(t)
Eû ~ +~ y (r )
2B
F (x(t))
ø
ñ
Eõ
Eû ~ +~ y (r ),
2
ÿþ ý õ ô
:
+y :
kõ ú õ~k2 + jkyk ú ky~kj2 ý r2
then
kõ ú õ~k
2
+
jô ú ky~kj ý r 2
2
ò =
ô
:
B( ~
5
û;ky ~k) (r ):
To prove (6.16), we must show that the ball in Step 1 is contained in the cone
F ~ (u). û
Step 2
If
1,
cT