Constructive Aspects of Functional Analysis (C.I.M.E. Summer Schools, 57)

G. Geymonat ( E d.) Constructive Aspects of Functional Analysis Lectures given at the Centro Internazionale Matematico...

Author: Giuseppe Geymonat

19 downloads 644 Views 33MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

G. Geymonat ( E d.)

Constructive Aspects of Functional Analysis Lectures given at the Centro Internazionale Matematico Estivo (C.I.M.E.), held in Erice (Trapani), Italy, June 27-July 7, 1971

C.I.M.E. Foundation c/o Dipartimento di Matematica “U. Dini” Viale Morgagni n. 67/a 50134 Firenze Italy [email protected]

ISBN 978-3-642-10982-9 e-ISBN: 978-3-642-10984-3 DOI:10.1007/978-3-642-10984-3 Springer Heidelberg Dordrecht London New York

©Springer-Verlag Berlin Heidelberg 2011 st Reprint of the 1 ed. C.I.M.E., Ed. Cremonese, Roma, 1971 With kind permission of C.I.M.E.

Printed on acid-free paper

Springer.com

CENTRO INTERNAZIONALE MATEMATICO ESTIVO

(C.I. M. E . )

A. V. BALAKRISHNAN

: I

A CONSTRUCTIVE APPROACH T O OPTIMAL CONTROL

Corso tenuto

ad E r i c e d a l 27 giugno

a1 7 luglio 1973

A CONSTRUCTIVE APPROACH TO OPTIMAL CONTROL

0. Introduction. Except for linear problems, i t i s difficult, .if not impossible, to obtain explicit solutions for. optimal control problems.

The closest we

get to a general 'solution' i s the Maximum Principle of Pontrjagin.

But

important a s this result i s , i t only provides us with necessary conditions for a (any) postulated solution. Unfortunately, many control problems do not have an optimal solution.

1; =,u ;

Consider for instance this trivial example:

x(0) = 0

Minimize :

subject to the constraint that the control u(t) must be equal to

+ 1 o r -1

a.e.

The minimal value i s zero, but this i s attained for u ( t ) . 0~ and of course this i s impossible.

u (t) = n

On the other hand

Sin nnt

fTiGGl

provides us with a sequence.of admissible controls which approximate the infimum arbitrarily closely.

The sequence

1 1 un(t)

of course converge

A. V. .Balakrishnan in the weak sense in L ~ [ O , I] to zero, but unfortunately un(t)2 converges

to one,

and of Course there is no optimal control.

In his recent book [ I

1,

L. C . Young has pointed out the fallacy

in proving necessary conditions for a possibly non- existent solution. He eites a paradox of P e r r o n that this leads to: consider the problem of fbnding the largest positive integer. sohtion, say N,

If we assume there exists a

then clearly N 2 I; on the other hand, we must have

thp t

which combined with

shows that N = 1 ! To resolve this difficulty, Young introduces the notion of a 'relaxed' or 'generalized control' and proves the existence of an optimal control in this class, -and derives the maximum principle valid for such 'functions'.

In

A. V. Balakrishnan the present work we shall go one step further and show how to actually construct

-

-

'compute'

a sequence of approximating controls which

converge to an optimal 'generalized control' and which then satisfies the maximum principle.

The computational technique i s of more than

theoretical value; and in fact has proved to be.practically useful a s well..

Relaxed controls play an essential role in this approach. We begin with a simple exposition of the theory of relaxed controls [Young [I]

1,

because i t i s of some independent interest a s well.

1. Relaxed Controls Let U be a compact s e t i n Euclidean space E the L 2 -space of functions u(t), 0 < t < T < of measurable functions such that

u (t) E U n

-.

m ' Let H denote

Let u (t) be any sequence n

a.e.

Then we can find a subsequence (renumber i t un (.)) such that u n ( a ) converges weakly to u (.) say in H. 0

Em.

L e t p ( . ) be any polynomial over

Then

also contains a weakly convergent subsequence. What i s the limit? Unfortunately if i s not

'

A. V , Balakrishnan a s the example Sin nt u (t) = n Isin ntl

-

shows, taking p(u) = u

2

. At the slmplest level the 'generalized curves'

[we shall continue to use the t e r m generalized 'controls' because we shall need this notion only with the controls] may be regarded a s providing a means tc. straighten out this situation.

Consider now the product space Cl = I x U where I denotes the interval [0, T]

.

Then Cl i s compact metric and l e t C(Q)denote the

Banach space of continuous functions over Cl with range in Em. f(t,u) denote such a function.

Let

Then observe that for any Lebesgue

measurable function ~ ( t such ) that

we have that

SI

f(t3

u(t))dt

defines a continuous linear functional on C(Cl). We know that there must be a countably additive s e t function p (of finite variation) defined on the Lebesgue subsets of Cl such that

A. V. Balakrlshnan

and it i s clear that p i s an atomic m e a s u r e with a unit jump a t u(t) for each t.

That i s to say, on any product s e t of the f o r m A x B

p(A x B) = Lebesgue measure of the set [t

I u(t) e BI

F o r any polynomial p(.), we note that

i s Lebesgue measurable in t. A generalized control i s simply a measure on (the Lebesgue subsets of) I x U such that

and

SU p(u) dp(t;u) i s Lebesgue measurable i n t . Alternately, 'for our purposes i t i s more natural to define i t a s a 'family' of probability measures ('control measures') dk(t; u) over U such that

i s Lebesgue measurable in t.

Thus defined i t i s not difficult to show

that

Jnf (t;u) 'aP(t;u) defines a continuous linear functional on C(n). Moreover

JU f(t;u) dp(t;u)

i s Lebesgue measurable in t.

L e t now u (t) be the sequence we began with, un(t) converging n weakly in H to uo(t). Let f(t) be any m x m matrix function, continuous on I and p(.) be any polynomial with domain and range in Em. Then we can write

where dpn(t;u) i s the corresponding sequence of measures. Now by the weak compactness of measures we know that (independent of f ( -) and

A. V. Balakrishnan

p(.)) we can find a subsequence (renumber i t dpn(.) again) which converges to a measure d h ( t ; ~ ) : 0

Working with a further subsequence, we know that

where ~ ( t i)s Lebesgue ~ c e a s u r a b l eand since f(t) i s arbitrary, i t follows that

Thus if we agree to define

where the bar indicates use of 'generalized control', then we do indeed have that if

A. V . Balakrishnan

then p(un(t))

-

p[u0(t)l

Example L e t us illustrate this with a simple example for m = 1. Let Sin n nt u (t) = -n Isinn ntl

O 0.

To conclude the computational scheme, w e need only now to d e s c r i b e how the i n f i m u m in (3.33a) i s determined. only s e e k a local minimum.

H e r e again w e

F o r this purpose, l e t u s note that the

functional

i s now a function only of the coefficients function by h(&;n;u(.)), . n =

{%}

and l e t u s denote this

Then ure u s e the iteration:

cr

m1.1 =

-1

nm - Hm G m

where

H

m

i s an n X n m a t r i x with components:

aa.

3

This i s then a slight variation of the Newton-Raphson technique (in that no second derivati-ves of the integrand a r e u s e d ) .

The

convergence of the scheme i s proved by a minor modification on the usual proofs, a s given in [14]for examp1.e. Of c o u r s e other techniques can be used.

A. V . B a l a k r i s h n a h

F i n a l l y , l e t 6 > 0 be given.

Then, in t h e o r y , we c a n find

a n N 2nd c such that

F o r this we only need to f i r s t find

6

such that

Then s i n c e

we have that

g(O+) - h(e)

5

6/2

Next w e choose N l a r g e enough s o that

A. V . Balakrishnan

Computational A s p e c t s We shall now study some of the questions that a r i s e in examining computational a s p e c t s m o r e deeply, and a t the s a m e time indicate a p a r t i c u l a r s c h e m e f o r solving the epsilon p r o b l e m . In doing s o we shall need to m a k e s o m e additional assumptions which a r e n a t u r a l i n the p r a c t i c a l context.

We c a n a p p r e c i a t e in a general way that a s epsilon i s m a d e s m a l l e r and s m a l l e r we will r u n into computational accuracy p r o b l e m s while too l a r g e an epsilon will have no relation to the control p r o b l e m w e w i s h to solve.

This i s b e s t s e e n by examining a R i t e approximation, o r of : rersion

of i t f o r the p r o b l e m .

L e t C[O,T] denote the Banach s p a c e of contin(..,us

s t a t e functions under the sup n o r m . b a s i s functions i n C[O,T].

.

Let

1. 1 b (t)

denote a sequence of

F o r each n , l e t A n denote the s e t of

functions spanned by b k ( ), k = I ,

. ..n, such that

A. V. Balakrishnan

F o r each n, vJe now consider the epsilon p r o b l e m over the c l a s s of s t a t e functions (denoted S ) of the form: n

w h e r e the

1 ak} 1

m u s t satisfy ( 3 . l ) , corresponding to condition

(F'), and over controls u ( t ) a s before.

(The controls a r e not

approximated by b a s i s functions .) L e t u s denote the corresponding infimum by hn(c). Clearly any admissible s t a t e function can be approximated uniformly in t by functions i n S a s closely a s n d e s i r e d for l a r g e enough n, and of c o u r s e S

n

i s a l s o conditionally

compact.

Let

w h e r e the quantities a r e defined the s a m e way a s in section 2, the s u b s c r i p t n denoting r e s t r i c t i o n to Sn. It i s evident that

tin(€)

A. V. Balakrishnan

and g ( s ) a r e again monotone i n the s a m e fashion a s before. n L e t 6,(0) and gn(O+) denote the l i m i t s a s o goes to z e r o .

Since

h ( s ) (unlike h ( c ) ) h a s no given upper bound, 6,(0) need not b e n z e r o . -In f a c t we have:

s o that

lim 0

(hn(c)

-

6 , ( 0 ) / ~ ) = gn(O+)

€7)

Thus hn(s) eventually i n c r e a s e s without bound [0(1I s ) ] a s w e m a k e epsilon s m a l l e r .

Of c o u r s e

and h ( s ) = l i m h n ( c ) , f o r each

6

> 0.

L e t u s now indicate a method f o r obtaining h ( o ) . W e begin with any element of Sn, say with a l l

{ ak I

s e t to b e z e r o , i n the

a b s e n c e of any p r i o r i d e a s concerning the optimal s t a t e function.

A . - V . Balakrishnan

Call this xl(t)

.

We now make the following assumption (U1):

Min uou i s attained a t a unique point i n U, for each t, y , x and c . ul(t) denote the minimal point in U f o r y =

A 1( t ) , x

Let

= x ( t ) , so 1

that in o u r previous notation:

( T h e assumption (U1) c a n c l e a r l y be weakened to hold in a suitable neighbourhood.) We now choose x ( t ) s o that 2

(which i s attained in general by an element i n the c l o s u r e of Sn) i s attained by x 2 ( t ) . Note that the function m(o ; t ;

...) i s not

r e q u i r e d to be known. We next determine u 2 ( t ) so that ( 3 - 5 ) holds with x ( t ) replaced by x ( t ) , and continue on this way to 1 2 produce the sequence

x n ( t ) , un(t)

A . V. Balakrishnan

Now

and we have thus a monotone decreasing sequence.

F r o m any

subsequence we can choose a further subsequence such that x (t), n

4n (t) converge i n C[O,T] to x o ( t ) , Go(t) say, and now because

of condition ( U ), the corresponding u ( t ) m u s t converge to uo(t) 1 n where

A. V. Balakrishnan

Moreover we must have:

The l a s t equality together with (3.8) means that xo(t),uo(t) cannot be further improved by our procedure.

Next l e t us note that xo(t), uo(t) is a local minimum for the epsilon problem in the sense that

for any admissible control u(t) while the f i r s t variation of

vanishes at x(t) = xo(t). F o r this purpose we assume that for any

.

h( ) in

dc,

belongs to Sn f o r all sufficicntly s m a l l

1 t3 1 .

A: V . Balakrishnan Now because of (U1),

that we only need to show that

i s z e r o . Rut this follows f r o m the f a c t that this i s t r u e f o r x n ( t ) , u ( t ) and we c a n take l i m i t s with r e s p e c t to n. n

Clearly i n any

compuLationa1 s c h e m e we can only obtain a local minimum.

In

p r a c t i c e w.e a s s u m e that ( a t l e a s t f o r l a r g e enough o r d e r of approximation) t h e r e i s only one l o c a l minimum, o r , a t l e a s t o u r s e a r c h i s confined to a region w h e r e t h e r e is only one minimum artd i t i s the t r u e minimum. Note that under condition [ U

1

hh(c) =

-

1,

we have

bn(e)l e 2 f o r every

E

> 0.

To conclude the computational s c h e m e , we need only now to d e s c r i b e how the i n f i m u m in (3.6) is determined. only s e e k a l o c a l minimum.

H e r e again we

F o r this purpose, l e t u s note that the

functional

is now a function only of the coefficients (ak} and l e t u s denote this function by

A. V. Balakrishnan

Then w e u s e the iteration:

-

1 r n t l = a m - Hm G m

n

where

H

. is

m

an n X n m a t r i x with components: .

so

1 T L . ; [aai(~(t)-f(t;~(t);~(t))s

a

a-(G(t)-f(t;x(t);u(t))] a. dt

J

This i s then a slight variation of the Newton-Raphson technique (in that no second derivatives of the integrand a r e u s e d ) .

The

convergence of the scheme i s proved by a m i n o r modification on the usual p r o o f s . Of c o u r s e other techniques c a n be used.

A . V . Balakrishnan

F i n a l l y , l e t 6 > 0 b e given. a n N and

E

Then, i n theory, w e c a n find

such that

F o r this we only need to f i r s t find

E

s u c h that

Then s i n c e

~ ( E ) / c ( g(O+) - g ( ~ )

w e have that

g(O+) - h ( s )

5

6/2

Next w e choose N l a r g e enough s o t h a t

A. V . Balakrishnan

References

1

.,

L. C . Young: "Calculus of Variations and Control Theory", W . B . Saunders, 1969.

2.

A . V. Balakrishnan: "On a New Computing Technique in Optimal Control", S U M J o u r n a l on Control, S e p t e m b e r 1968.

3.

A . V. Balakrishnan: "A Computational Approach to the Maximum P r i n c i p l e t ' , Jourrial of Computer and S y s t e m Sciences, 1971.

4.

E . J . McShane: "Relaxed Controls and Variational P r o b l e m s " , SLAM J o u r n a l on ConLrol, 1967.

5.

H. G . Eggleston: "Convexity", CaLnbridgc University P r e s s , 1968.

A . V. Balakrishnan

LECTURE NOTES

A . V. Balakrishnan

Erice, July 1971

11

A. V. Balakrishnan

Stochastic Systems

1.

Inducing M e a s u r e s on C; the Wiener m e a s u r e -Given a stochastic p r o c e s s , o r equivalently, a consistent family

of finite dimensional distributions, we can always c o n s t r u c t a 'functionspace' p r o c e s s with X a s the sample space and a probability m e a s u r e

p ( . ) on the sigma-algebra

9' of subsets of, X (that ' a g r e e s ' with the

finite-dimensional distributions). two a r b i t a r y time-points

Throughout we a s s u m e that f o r any

t l , t2, denoting the corresponding 'variables'

by x ( t l ) , x ( t ), that w e have.: 2

where r

> 0,

k > 0, 6

of t l , tZ; an3

>0

1. I

a r e fixed constants independent denotes the Eticlidean n o r m .

F u r t h e r m o r e we shall a s s u m e that T i s a compact i n t e r v a l , F o r simplicity of notation, we s h a l l take i t to be the unit interval [0, 11 without l o s s of generality. L e t C (0, 1 ) denote the c l a s s of continuous functions (with range in E ) on the closed i n t e r v a l [0, 11. Endowing i t with the 'sup' norm, we know that i t becomes a ~ a n a c hspace:

A . V. Balakrishnan

where

llfll

= sup I f ( t ) l

I 'I

denotes the Euclidean norm.

,

0( t 2 1

~ g n a c hspace, and denote i t by

We note that i t i s a separable

%'. By the Borel s e t s of g we mean

the smallest sigma-algebra generated by all open s e t s .

Let B be an

Then for a r b i t r a r y t 1, t 2 s m-dimensional Bore1 s e t in E ( ~ ) . the s e t s in g

... t n s

defined by:

a r e called 'cylinder s e t s ' (and B i s then referred to a s the 'base').

Lemma

The c l a s s of Borel s e t s in Q coincides with the smallest

sigma-algebra generated by the c l a s s of cylinder sets,

Proof'

It i s c l e a r that cylinder s e t s a r e Borel s e t s . Conversely, since

B i s separable, any open s e t in $2 can be expressed a s the union of a countable number of closed spheres; and every closed sphere, say with center fo and radius d can be expressed:

where r

n

denotes the countable collection of rational numbers in

[o, 11. Hence open s e t s a r e contained in the smallest sigma-algebra


g e n e r a t e d by cylinder s e t s . Note i n p a r t i c u l a r that the B o r e l s e t s a r e g e n e r a t e d a s the s m a l l e s t s i g m a - a l g e b r a containing s e t s of the f o r m ,

[f(.)

I f(t) E

I

,

f ( . ) e g:

I a n i n t e r v a l i n E]

Given any c o n s i s t e n t s e t of d i s t r i b u t i o n s , we c a n induce a finitely -additive m e a s u r e o n the cylinder s e t s of m

for every v in E , o r

Or, we have the expansion (in the mean s q u a r e sense), for each t:

Actually, the convergence i s with probability one also, since

and by virtue of Kolmogorov's inequality this converges with probability one, since

Problem for any f ( . ) in L ~ [ O , 11, show that


Problem

Hint:

and so,

2,I!f 1

tk(s)ds

11 ' converges boundedly to

(nt)

, in L ~ [ O11. ,

0

Linear Stochastic Equations Let F ( s ) be an m-by-n matrix function, Lebesgue measurable on [0, 11 and

A. V. Balakrishnan such that

Let

f o r almost every w.

which we know defined f o r each t, i t i s not defined for every w,

However

and in particular, the exceptional s e t

of points w on which i t i s not defined may well depend on t. wish now to rectify this situation.

We

If F(s) i s absolutely continuous

with a square-integrable (on [0, 11) derivative we can do this very simply.

F o r then,

with probability one, and since the right side i s defined f o r every w, we may just define the left side to be that f o r every w . S(t;w) i s then continuous in t, separable process. Lemma

05 t

5

Note that

1, for every w, and hence a

Moreover, we have the following basic bound:

Suppose S(t;w) in (2.7) i s determined a s a continuous

function'of t f o r almost every

w

sup

[OlE 1

11

jO t

W.

Then:

11

F(S) d ~ ( s ; w )

1

>e

;T

] ~

jOI I F ( ~ ) I I 1

ds

A. V. Balakrishnan

Proof Let

Then by the assumed continuity of S(t;w) in t, measurable.

we observe that A i s

In f a c t if

then An(k) i s clearly monotone in n for each k

But the integral over non-overlapping intervals being independent, a simple application of the Kolmogorov inequality shows that

f r o m which the Lemma follows. Next l e t us recall t h t if F(.) i s any element in LZ(O,I ) , we can find a sequence F (.) of functions which a r e absolutely continuous n with derivative in L ( 0 , l ) such that 2


Using the lemma we obtain that

Let 0

, 0 < €I 0 and fixed.

A. . V.

Balakrishnan

Then f r o m Lemma 2, we have:

and hence

which, just a s i n the proof of the monotonicity of the sequence P (.) n in Lemma 3 , implies that

But

s t )=

[

((ttr) ( ( s ~ T ) - ~ ( B B t *~ ( s t T ) ~ * ~ ~ ( s t r ) ) l $ ( s t r ) : 8 - ' ) ( t t r ) * d s

and by an obvious change of variable in the integrand, this i s

1 ttr

=

( ( t t r ) ( ( s ) - '(BB:"

P(s)c*cP(s))((s)::-'~(ttr)*ds

A. V . Balakrishnan

Hence

P(t)

< P ( ~ + Ta s) r e q u i r e d .

Hence P(t) converges a s t goes to infinity to the unique solution of ( 2 . 1 0 1 ) .

Finally suppose

P(m)

i s singular.

Then by L e m m a 5, so i s

P ( t ) f o r every t, and a s we have seen, this i m p l i e s that ( A - B ) i s not controllable.

A. V . B a l a k r i s h n a n References 1.

A . N . K o l m o g o r o v : F o u n d a t i o n s of the T h e o r y of P r o b a b i l i t y , Chelsea, 1950.

2.

J . L . Doob: S t o c h a s t i c P r o c e s s e s , J o h n W i l e y a n d S o n s , 1953.

3.

L. I. G i k h m a n a n d A . V. S k o r o k h o d : I n t r o d u c t i o n t o t h e

4.

K. P a r t h a s a r a t h y : " P r o b a b i l i t y M e a s u r e s o n M e t r i c S p a c e s " , A c a d e m i c P r e s s , 1967.

5.

L . Shepp: R a d o n - N i k o d y m D e r i v a t i v e s of G a u s s i a n M e a s u r e s : A n n a l s of M a t h e m a t i c a l S t a t i s t i c s , 1966.

of R a n d o m P r o c e s s e s , W . B. S a u n d e r s , 1969.

heo or^

6. H. M c K e a n : S t o c h a s t i c I n t e g r a l s , A c a d e m i c P r e s s . 7.

W. M. Wonham: "Random Differential Equations,in Control T h e o r y ", i n P r o b a b i l i s t i c M e t h o d s i n A p p l i e d M a t h e m a t i c s . V o l u m e 2 , A c a d e m i c P r e s s , 1970.

8.

E . N e l s o n : I1Dynamical T h e o r i e s of B r o w n i a n Motion", P r i n c e t o n University P r e s s , 1967.

9.

W . M . W o n h a m : "On a M a t r i x R i c c a t i E q u a t i o n of S t o c h a s t i c C o n t r o l " , SLAM J o u r n a l o n C o n t r o l , V o l u m e 6 , N o . 4, 1 9 6 6 .

10.

M. C o e v e : P r o b a b i l i t y T h e o r y , V a n N o s t r a n d , 1954.

11.

I. G o h b e r g a n d M . G . K r e i n : V o l t e r r a O p e r a t o r s , Translations, 1970.

AMS

Example: L i n e a r Stochastic Control L e t u s next consider stochastic control problems f o r the l i n e a r system:

where we a s s u m e that a l l the coefficients a r e continuous on .[O, 11, and

and of c o u r s e W(s;w) i s a Wiener p r o c e s s a s before.

The control

problem i s that.of finding a n optimal control function u ( t ) , u ( t ) being measurable @ ( t ) , so a s to minimize:

Y

where Q ( s ) i s continuous in s and i s non-negative definite, and ),

i s a fixed positive constant.

Since u ( t ) i s now a l s o a random

proces&, l e t u s denote i t by u(t;w). It i s implicit that u(t;(") i s jointly m e a s u r a b l e in t and w .

A. V. Balakrishnan

Whatever the choice of u(t;w), l e t

A

I

x(t;cu) = E(x(t;w) By ( t ) )

Then we have the Kalman filter equations characterizing c(t;w):

A ~(t;., =

st

+

st

~ ( ~ ) $ ( ~ ; ~ ) d(P(s)c(s)* s t F(~)G(S)*)(G(S)G(S~*)-'~Z~(~;.)

0

0

+~~~(s)u(s;~)ds

(2.111)

0

where

z 0 ( s ; ~=) Y(s;w) -

sS

c(o)G(o;w)do

0

and i s a Wiener p r o c e s s with covariance G(s)G(s)*. (Cf equations (2.83), (2.66)).

The matrix P ( s ) i s determined by: (Cf. (2.84)):

'

+

( ~ ( t ) ~ ( t ) * () -~ ( t ) ~ ( ~t () t ) ~ ( t ) * ) and in particular does not depend on the control. Next in (2.110) w e

(2.113)

A. V. Balakrishnan

can write

A

E([~(s)x(s;ro),x(s;,)1)= T r . Q ( s ) C ( ( ~ ( S ; U + )e(s;w))(x(s;w)+ d(s;w))*)

= T r . Q ( s ) E ( G ( S ; ~2(~;(;1)*) ~) t Tr

. Q(s) P(s)

where e(s;w) = x(s;w)

- %s;w)

The point i n doing this i s that the problem i s thus reduced to '&at of choosing u(t;w) so a s to minim'lze:

w h e r e x(t;w) satisfies (2.111).

H e r e we shall exploit o u r knowledge of

deterministic control p r o b l e v s .

Thus, l e t u s fix the sample point w, and

consider the problem of minimizing:

for each u. F o r this purpose i t i s convenient to w r i t e

(2.114)

A. V. Balakrishnan

where $ ( t )i s a fundamental m a t r i x solution of

and finally:

Note that w(t;w) i s continuous in t ( a s we have seen). Since w i s fixed, we consider G e control functions a s functions of t alone f o r the moment.

L e t L (0, 1) denote the usual L space for control functions 2 2

~ ( t ) .Introduce the l i n e a r bounded operator on this space:

Then, we can r e w r i t e (2.116) a s :

where C! stands f o r the operator corresponding to multiplication by Q ( s ) , u stands f o r u ( t ) , w for w(t;u), and inner-products in two spaces, have been used.

Being a quadratic f o r m with h positive, i t i s obvious

by a routine f i r s t variation (gradient with r e s p e c t to u ) that the u.tique

A. V. Balakrishnan

minimum is given by

,)

uO

+ L*Q

Lu

0

=

-

L*Q w

where L* denotes the adjoint of L , and i s given by:

La: 0 s o that G i s non- singular):

where (in the steady state) P s i s determined from

In the c a s e where d4 = 0 (corresponding to high windgust) we note that

a s explained in section 2 and in Appendix I. notation letting

In fact in our current

,

A . V. Balakrishnan

where c1, c Z ,c 3 , c 4 are 1 x 5 matrices, and noting that

c4Fc = k # 0

we have

;(t) = ( A - F ~ ' L - c' 4 A) ~ ( t+) ( B - F ~ k - l c 4 B) ~ ( t )

Using the optimal feedback control, the actual value of the normal acceleration component

=

lim T->w

T S ~[ Q x(t), x(t)] dt

= TrQJ where in.the non-singular case J = Ps + Ja where Ja i s the solution of

A. V. Balakrishnan

This follows from the fact that in the steady state

E[(x(t)x(t)*l =

and ~

t =)

E[(x(~)-P( t))(x(t)-:(t)*)]+

B B*P P ') lt (A 0

J~GG*

,o

ds

In the singular c a s e ' (d4 = 0) we have

J = Ja

and Ja i s the solution of

h

G ( s ) ~ st PE*( c c * ) - l ( z O ( t ) )

where Z (t) i s a Wiener process with 0 E[z0(t) zo(t)*] =

A

~ [ x ( t ) x ( t ) * ) ]Ps =

+ Ja

. A.

V. Balakrishnan

Note that in the absence of feedback, the normal acceleration due to windgust i s given by J

A Jb

b

+ JbA* + FF*

the solution of

=0

The reduction in decibels i s given by

10 Log ( T r Q Ja) - 10 Log ( T r Q Jb)

and this remains the same even'if FF* increases by any multiplicative factor.

Optimization of Step Response Let u s next consider the problem of meeting the requirements on desired response to step input (that is,

bp(t) i s a step function). Here again we

can follow standard optimization theory.

Thus l e t the state dynamics be

(setting windgust noise to be zero):

we consider the 'deterministic' case where the state is'known completely. Our optimization criterion is: Minimize

A. V. Balakrishnan

where

i s a positive constant to be chosen appropriately later.

The corresponding theory i s new with this paper; and i s given in Appendix 111.

The optimal control uo(t) i s given by:

where

L being defined by:

n,(t) = L x(t)

Remembering that

Q = L*L

A. V. Balakrishnan

the optimal feedback control i s the same a s that obtained in the stochastic c a s e .

+

BT

h

(A*

Thus the pilot input 6 (t) i s shaped by the rule:

P

-

P B B*/I)-' C

C

C

L* 6 ( t ) P

and the feedback control remains the same a s in the stochastic case. F o r a unit step function, the corresponding average meansquare 'tracking-error' is

of course this average mean-square e r r o r i s only the square of the difference between the steady-state output corresponding to the unit step, and the unit step and is not very significant,

since the precise

steady state value i s not important and can be scaled up a s desired. However the criterion does yield the same feedback gain a s in the stochastic case. Stability is of course guaranteed.

A. V. Balakrishnan This brings up then the possibility of using a 'shaping filter' with memory.

Ideally the frequency transform of this shaping filter

should be the inverse of the-frequency transform of the feedback system: Denoting the f o r m e r by K(f), we must have, ideally,

and this involves differentiatorg. in theory.

The problem i s thus solved completely,

In practice a suitable approximation can be chosen.

It

must be noted that the input i s no longer a constant s o that .the considerations

.

of Appendix I11 for Z(t) now hold, strictly speaking, only asymptotica~ly

APPENDIX I

A. V. Balakrishnan

IDENTIFICATION THEORY

The identification problem can be formulated as: Given

where W0( * ) and W(*) a r e Wiener processes in the appropriate dimensions, perhaps correlated with each other, u(.) i s a given known input, and v ( * ) i s the observed output.

It i s desired to identify

some o r all of the parameters in the m a t r i c e s A, B, C ,D, F. matrix G is a known constant matrix.

The

F o r most of the analysis we

assume G is a non-singular square matrix; we t r e a t the c a s e where

G i s singular separately. Identification [41,

In spite of the considerable l i t e r a t u r e on

such a problem in this generality has not been

studied hitherto.

The f i r s t question tha.t we wish to answer i s , when can we identify such a s y s t e m ? Of course the answer must depend on the notion of identifiability used. While the literature on Identification abounds with many 'recipes', there i s often little by way of any measure of goodne.s

of the estimates obtainable.

There i s not in any

c a s e much agreement on what constitutes 'identifiability'.

In the

A . V. Balakrishnan

present paper we take the foUowing,approach which has a t l e a s t the virtue of being mathematically precise. F i r s t we assume there does exist a s e t of 'true' values f o r the unknown p a r a m e t e r s .

Let 8

denote the s e t of unknown parameters; 0 then takes its values in some finite-dimensional Euclidean space. value.

Let

denote the true

An estimate based on observed data f o r a time-interval T

will be denoted BT.

We shall say that a system i s identifiable if we

can find an estimate which is asymptotically unbiassed and i s consistent.

That is to say:

i)

ii)

limit T a m

E(BT) = go

QT converges with probability one to B0 a s T goes to infinity

.

Such a definition requires a precise formulation of the problem involving 'noise' processes, of course. Flight Coptrol problem -

Fortunately, the practical

shows that this i s not unrealistic.

In the

present paper we shall show that under certain conditions, which we t e r m 'iazntifiability conditions', it i s possible to find such a c l a s s of estimates f o r the problem under consideration.

Moreover, we can

develop a computational algorithm for generating such an estimate. Our estimate i s a maximun+ikelih~'od estimator; o r , more cor:.ectly (and i n o r d e r that the difference in approach can be emphasized) i t is a root.of the gradient of the likelihood functional.

Because we a r e

A. V. Balakrishnan dealing with 'continuous' data, the terml'likelihood functional' will have to b e clarified. We shall f i r s t consider the c a s e where G i s non-singular; without l o s s of generality, we can clearly take i t to be the Identity; which we do.

The main thing to note then is that measure

induced by the process v(.) f o r any finite time-interval [O, T] on the [ ~ a n a c h ]space C of continuous functions on [0, T] in the usual manner, i s absolutely continuous with respect to the Wiener m e a s u r e thereon.

This i s t r u e f o r any assumed value f o r 0.

By the 'likeli-

hood functional' we mean the corresponding Radon-Nikodyrn derivative. We shall show that under the 'identifiability conditions', there i s a non-zero neighborhood of

e0

i n which the gradient of the likelihood

functional has a root for a l l T bigger than some To. We shall take and show that it is asymptotically

such a root a s the estimate OT,

unbiassed and is consistent, provided the identifiability conditions a r e satisfied. We begin then with the calculation of the R-N derivative for the c a s e w h e r e G i s the identity matrix. the s y s t e m (1) c a n b e rewritten with:

where m ( t ) = D u(t) t C

St e A ( t ' a ' ~ u(s)ds 0

in the form: x(t)'

=

:(t)

=

A x(s)ds t F W. (t)

;/

0

C x(s)ds t W(t)

F o r this we note that

A . V. Balakrishnan Mainly f o r notational simplicity we shall a s s u m e W o ( . ) and W ( - ) are

The fact that G i s the identity matrix will imply that the

independent.

process v(6) i s absolutely continuous with respect to Wiene.r measure. However this can be explicitly demonstrated by ueing the well-known Kalman filtering equations f o r the system.(3). Thus l e t ,

Then we have: %(t)

where W

( 0 ) .

=

Jbt

A

P(B)d s

t

it P(s) 0

C*

dW(s)

!(4)

is the Wiener procee s and P(*) is the unique solution of:

~ ' ( t )= A P ( t )

+ P(t)A* - P ( t )C*CP(t) + FF*; P ( 0 ) = 0

, ~ u l t i p l ~the in~ differential f o r m of (5) by P(t)C*, we have:

and substitt~ting-for-thelastt e r m on the eight using (4), we have finally : A

x(t)

- f t (A-P(s)C*C) % ( s ) d s = St P(s)C* 0

Letting

d ?(s)

0

@ ( t )to be a fundamental matrix solution.of

A(6)

A. V. Balakrishnan

we have: 'P(t) = #(t)

S'@(S)-'P(S)C* d ;(a) 0

and hence

,.

''W(t) = ~ ( t ).- .

St0 ~ ( t ; s ) ?(a) d '

where: K(t;s) =

J;

C @(.I

d0 O (8)-

In the differential form, dW(t) = d ?(t)

P(.)C*

(8) becomes:

- J~L ( t ; s ) d ;(s)

dt; L(t, s) = C@(~)I#(S)-'P(s)c*'

0

Let y(t) =

- st

L(t;s)dv(s)

0

Theorem The Radon-Nikodym derivative of the measure induced by the process v ( - ) on C with respect to Wiener measure is given by:

where the second integral i s an Ito integral, and v(..) s C. Remark Note that the use of the Ito integral in (9) eliminates the determinant used i n [ Z ]

.

A. V. Balakrishnan

W e proceed next to the gradient equation.

F o r this, l e t 0

denote the vector of unknown p a r a m e t e r s . We shall show that f o r l a r g e T i t i s enough to seek a root of

where L a )

and

Virgdenotes

where Qi

m(s)ds

-

A

m(t) = m(t)

the gradient with r e s p e c t to 0 .

denotes the ith unknown parameter.

Let

Then the m a i n

'identifiability conditioni i s that the m a t h with components

be positive definite i n the l i m i t a s T goes to infinity.

Using this

m a t r i x a Newton-Raphson technique for finding the root can be readily developed.

F u r t h e r i t can be shown, f o r example, that for

our p a r t i c u l a r problem, the identifiability condition i s satisfied under a n appropriate almost-periodic c h a r a c t e r imposed on the input in open loop mode.

A. V. Balakrishnan

Singular Case:

Let us now consider the case (corresponding to

large windgust component)

and

with

where W 2 (t) i s a Wiener process.

Then'of course the measure induced

by the process vet) cannot be absolutely continuous with respect to Wiener measure. But:

A. V . Balakrishnan

Theorem: Suppose CIF is non-singular, that i s to sag:

Denote this (square) matrix by K.

'

Then

being the unique solution of

Thus x(t) is known without e r r o r

and the measure induced by

v (t), conditioned on x(t) being given, with respect to Wiener measure i s 2

given by :

where the second integral i s an Ito integral. The minimization can proceed a s before; indeed, the calculations a r e simpler.

APPENDIX I1

A. V. Balakrishnan

In this Appendix we indicate the iterative technique used to find the

solution of the steady state Riccatti equation

where Q i s non-negative definite and we a s s u m e A is stable. The iteration is:

This i s a linear equation for P

; m o r e o v e r l e t us a s s u m e that

nt 1

the s y s t e m i s observable s o that

i s actually non- singular, and denote the i n v e r s e by

/\,

s o that

Then choose

P =A 0

with this choice the approximating sequence Pn i s actually a monotone decreasing sequence of non-negative definite m a t r i c e s , the l i m i t being the solution sought.

F o r a proof s e e

[s].

................................

--COMMAND SIGNAL

-- -

I

/

,

:RESPONSE

i i

3 --

NONLINEAR CONTROL SERVO ACTUATOR

AIRCRAFT

GUST RESPONSE TRANSFER FUNCTION

GUST DISTURBANCECONTROL SURFACE

-I

-

+

:

RIGID BODY TRANSFER FUNCTION

;

:

STRUCTURAL RESPONSE TRANSFER FUNCTION

: RESPONSE :

...

-

RIGID BODY RESPONSE

+

: STRUCTURAL

...............................,

--

I

MEASURED SIGNALS

-FLIGHT

Figure A.

DYNAMIC RESPONSE VARIABLES

CONDITION VARIABLES

IDEAL MEASUREMENT SENSORS

4

+

;

AIR-DATA COMPUTER

I

Functional Block Diagram of the Aircraft, Control Servo Actuator and Measurement Dynamics.

1

--

-

A. V. 'Balakrishnan

NOMENCLATURE pilot command input force, lbs.

*s

acceleration due to gravity constant, ft/sec 2

g

2

Gw

angle-of-attnck calibration coefficient

Ka

M

Mach number

e

n

z

gust power spectral density (ftlsec) sec.

Dimensional pitch moment coefficients

normal acceleration, "g' s"-

9

dynamic pressure, lbs/ft

v

true velocity, ft/sec

w

g

2

incremental vertical.velocity due to air gusts, ft/eec

z

distance from c. g. to accelerometer, .ft.

=a

Dimensional normal force coefficients

'6

e

a

.angle-of-attack, rad

ir (sub) distance from c. g. to angle-of-attack vane, ft.

'e

elevator deflection angle, rad

A. V. Balakrishnan

NOMENCLATURE (Continued) commanded elevator deflection angle, rad pilot command input deflection, inches damping ratio of the jth bending mode pitch attitude, rad slope of the jth bendingmode at angle-of-attack $me, rad/ft slope of the jth bending mode at pitch attitude and rate sensor, rad / ft displacemenr of the jth bending mode at reference station, ft relative displacement of the jth bending mode at the accelerometer forcing function coefficient % r bending modes, ftfsec

2

characteristic frequency of gust power spectral density, rad /sec natural frequency of the jth bending mode, rad/sec

A. V. Balakrishnan

Taking the state equation as:

x = Ax

+ Bu

x(0) = 0

we wish to find u(t) so a s to minimize:

where 6 (t) i s a unit step function. By the usual analysis, the optimal P , u(t), denoted uo(t) i s given by:

[The main point to note i s the appearance of

m

as the upper limit in

the integral.] We can express this solution alternately a s

u (t) = D

B* -Y(t)

O0

3 choi-

sir convenablement . Ou a l e : (1) (2)

-

L1algorithme consider* e s t de type ARROW-HURWICZ [I] Si on s a i t que

u E W , W Hilbert inclu dans V avec densite et

injection continue ( dlob W c V cV1cWt), on peut p r e n d r e pour S un operateur de dualite

de W + \V1

R. Glowinski On cherche

P2

SOUS

l a forme:

.Pz= PC

PI=? Utilisant

P

(5. 12) dans ( 5 . l l ) , il vient :

Mais : ZC(1-P) si

p

0
0

( destine B tendre v e r s z e r o ) , on

definit successivement : (6. 5)

Rh = [ M . 1J

(6.6)

0.. 1J

I Mij E R2

=I 5 , xi

-

1 . q + J (resp. C.. 2 -1 ) 1 -

a

2

o x ( r e s p .oy).

1J

2

xi +

.

M.. = (xi. Y . ) x . ~=ih

J

1.l

ki

z [x]

= translate de

yj

h

- 5 ,

+ h- 2

y. +

J

. Y.J

=jh . i , j r Z I

[

de 4 . , parallelement 1J

R. Glowinski

4

e t on notera

qj, la valeur approchee

en M.. ; mdme rotation avec 1J

Pour dBfinir la variable duale

(du moins on ltesp&re) de u

f . . pour f. On posera: 1J

2 p ( E (L

2 (a) ) ),

il est commode d1in-

troduire un reseau Qh, de pas h Bgalement, decal6 par rapport R h de l a fason indiquee s u r la figure

6. 1 :

On utilisera systematiquement l e s relations 1

-1 j+

et

2 2 1 2 p 2 ( P = ( P , P 1) en

h

+

1 Z

et

2

Pi+

prendre

Mi+

, Pi + 1 . 1 pour les valeurs tfapproch6estl de p = ~ + ~

en compte

Mi+l j+ ; l e s points M ~ + L.+ 1 a 2 2 2 J Z sont ceux qui sont centre dlun c a r r e de cote

(v. figure 6. 1) dont un sommet au moins appartient B Roh

notera

!2 l h

-

1

llensemble de c e s points; on posera Bgalement :

et on

R. Glowinski Dans c e s conditions llalgorithme (6. 2) e s t

(p u

approch6If p a r :

donne n+ 1 i+ij

+

n+ 1 i-ij

u

,n+l

+

u.. ij+l

n+l 1

+

-

4un+1 -i j-

n g Dij Ph 'fij

(MijQaoh)

avec : Dij

1n Pi+-1 j+L 2 2

-

n

Ph

-

(6. 12)

+

1n

-

2n 1 1 P i+- j+ 2 2

-

1n

j+L

pi-L 2

2

+

P i+L 2

2h 2n 1 . 1 + pi+- J-2 2 2h

In

j-L2 -

2n pi-L 2

approchant

j+L

u

=

n+ 1 u. i+lj+l

-

2

AU

;S;;

n+l

'

au bSf

en Mi+L

2

J

j+L2

u n+l 2+lj 2h

9

+

n+l q+l

u..

2n

j+l- p . 1 . 1 2 1-5 J-2

approchant d i v p e n Mij,

2 Gi+L 2

pi+L j-- 1 2 +

- u .n+l . 13

R. Glowinski

(6. 14)

{

avec

n+ 1 Dans (6. 11, (6. 13), on convient de prendre u k l = 0 s i Mkl

4 nOh

On demontre dans CEA-GLOWINSKI [l] ,GLOWINSKI-LIONS-TREMOLIERES

[I] que (6. 11) , (6. 12), (6. 13), (6. 14) correspond 5 l1appli-

cation de llalgorithme

du

N. 4 (llalgorithme dlUZAWA)B 'la minimiJh (vh), convexe, non differentiable, en

sation dlune fonctionnelle

dimension finie ; J h Btant une approximation de l a fonctionelle problgme

du

(6. I ) , explicitee dans l e deux references ci-avant ; on y

demontre egalement la convergence de (6. 11) pour En ce qui concerne l a convergence de u h

-+

pn -

A)=

Min v,p

$(v,p;A)

Min [(Av.v) - 2 ( f , v ) + 2 A l l v l l I - ~ 2 ] v

d'od :

(1)

Ce

e s t B distinguer de celui de la r e m a r q u e

7. 1.

'R. Glowinski Min J (v) = Max v X I

Min

= Max

+

-

2 Al/v

Min [ ( A v , ~ )- 2(f.v) + 2(t,v)-

Max

= Max

(7. 17)

[(Av,~; ) 2(f,v)

v

1

2 Max ! A V , V ) - ~ ( ~ , V ) + ~ (h~ , V ) 4JlltIlrn5A

Min

t = Max

Min v

t

]

2

[(Av,v)-2(fJv)+2(t,v)-

1ti/

2

Mais

Min [(Av,v)- 2(f,v) + 2(t,v)] = -(A-'(f-t) , (f-t) ) v - 1 f l a formulation du probleme dual d'oG posant g = A (7. 18) Min t

[(A-' t , t )

-

2 (g, t ) +

la fonctionnelle J* (t) = (A-I t, t )

-

~(tll 2(g, t )

+

11 ti1

2

e s t non differen-

tiable et l'utilisation d'une m5thode de gradient s u r l e probl&me dual (7. 18) conduit A des difficult& lies A l a non differentiabilite de t+\l t/j

'

m

8. Un probleme de contrsle optimal ------ avec fonction cout s n differentiable Soit R un ouvert borne de R Q =

n X ] O , T [ ( T fi")

on s e donne

,

n

2

de fronti@re =

r

x ] 0, T

suffisemment r e g d i e r e et

C;

:

I

Y ( x , O ) = yo (x) dans R

G. Glowinski avec

I E L~ (Q) , y o E L~ (Q). 1 L e contr6le v &ant donne, (8. 2) admet une solution unique dans H (Q),

soit

y (v)

(8.3) avec o(

, et on peut definir l a fonction co3t

J(v) =

>

o

(QIy(v)

-jd12dxdt+dL

:

1vl2dxdt

zrhlvla

. /j > o e t a E L 2 (Q).

I1 r e s u l t e de

LIONS [I]

que l e probleme :

Min J ( v) vEUad adrnet une solution unique , soit Compte

u , qui e s t l e contrzle optimal.

tenu de LIONS , loc. c i t . , et des N. 2 e t 3 , on montrerait

facilement llexistence et lrunicite de p. p. s u r Q solution de At

f

=

+ u

sur

y (x,t) = o y (x,o) = y

P (x,t)

=

0

sur sur >

R

~

p. p . sur

Q I1

u 20

IXI Xu

x O] , T [

sur R

(x)

o

+PA+p

(du

~

s u r x

p(r,T)=o o(u

avec

:

'Y - b y (8. 5)

( y,p, u,X)

11

1

+PA+p ) u =

1.1

= 0

It 11

I

X(x t ) ( 1~

dt

R. Glowinski Reciproquement

si

(y,p, u,

1)

e s t solution de (8.5), (8. 6), (8.7) u

e s t contr8le optimal. L1utilisation de (8.5), (8.6), (8.7), associee B des algorithmes du type de ceuxdeveloppes

aux

N. 4 et 5 , a donne des resultats nume-

riques satisfaisants pour l e probleme (8.4) , mzme pour des valeurs assez petites de o(.

: 9.:

Une remarque s u r l e comportement des mulSiplicateurs de Lagrange. I1 a souvent Cte constate que, la fonctionnelle J

ficacite des methodes duales etait une fonction

I1

&ant donnee, l1ef-

(en particulier celles des N. 4 et 5 )

..

decroissantell

du convexe K, ceci. n l a rien de

t r 6 s surprenant puisqu1.5 la limite lorsque K s e reduit B un point l e problkme dloptimisation considCrC de Lagrange en gGn6ral.

n'admet pas de multiplicateurs

On. va mettre ce phenomeme en evidence s u r

un exemple t r e s simple :

V

Soit

=

R

, J definie par

On prend pour K E l e convexe &ant donnee par :

D1o~ le Lagrangien :

La solution optimale de

(9.4)

Min y(K

6

J (y)

[y

:

1 1 y 15€1 , llappartenance

K

E

R. Glowinski e s t donnee de fa$on 6vidente p a r :

(9.5)

ye

=

min

e t l e multiplicateur

( 1 , E )

de Lagrange

E

de : (9.6)

d'ofi :

Min J ( y ) YEKE

=Min&(y.a)

Y a

correspondant s'obtient & p a r t i r

-

R. Glowinski

B I B L I O G R A F H I E ARROW K. J. - HURWICZ L. [I]

CEA J.

CEA J.

- GLOWINSKI

-

R.

GLOWINSKI R. Li]

NEDELEC J. C. -

DUVAUT G.

-

[l]

-

LIONS J. L.

GLOWINSKI R.

: Dans Arrow-Hurwicz-Uzawa,

Studies i n l i n e a r and non l i n e a r programming. - Stanford Univers i t y P r e s s 1958. : MBthodes numeriques pour llecou-

lement l a m i n a i r e d t u n fluide rigide visco-plastique incompressible A b a r s t r e - (R~DDOI-t IRIl\ d i s ~ o n i b l e ) : Methodes duales pour l a minimisa-

tion de fonctionnelles non diffkrentiables - A p a r a i t r e dans l e s p r o ceedings d u Colloque d'analyse numCrique de DUNDEE- 197 1, SPRINGER VERLAG. : L e s inequations e n mecanique et en

physique

[I]

-

DUNOD 1971

: Methodes Numerique pour llCcoule-

ment stationnaire d l u n fluide rigide visco-plastique incompressible Proceedings of the 2. nd. Int. Conf. on Num. Methods i n Fluid DynamicsL e c t u r e Notes i n physics, 8 , Springer - V e r l a g 197 1 : Expose B cette reunion CIME c z -

s a c r e B llAnalyse NumCrique du ProblPme e l a s t o - ~ l a s t i a u e . GLOWINSKI- LIONSTREMOLIERES.

GOURSAT

5

C1l

: L i v r e s u r llAnalyse NumCrique des inequations variationnelles, 5 parai-

t r e an 1972 chez DUNOD : A.nalyse Numerique d e problemes

dlelasto-plasticit6 e t de visco-plasticttk. - T h e s e de 3 cycle PARIS-IRIA, 1971

R. Glowinski KY-FAN

[I]

LIONS J. L.

MOSCO U.

S u r un t h e o r e m e de Min Max C. R. A. S. - P a r i s , 259, 39253928, P a r i s .

:

[I]

[l]

Controle optimal de s y s t e m e s g o u ~ e r n e sp a r d e s equations aux derivees p a r t i c e l l e s - DUNODGAUTHIER-vILLARs 1968

:

Expose 5 cette reunion CIME

:

ROCKAFELLAR R. T.

[I]

:

Convex Analysis P r e s s 1970

-

Princeton Univ.

SION M.

C11

:

On g e n e r a l m i n max t h e o r e m s Pacific J. Math. 8,1958, 171- 176.

UZAWA -- H.

El]

:

Dans ARROW- HURWICZ-UZAWA. loc. cit.

C E N T R O INTERNAZIONALE MATEMATICO ESTIVO ( C . I. M . E . )

J. L . LIONS

Corso

tenuto

ad E r i c e

dal

21

g i u g n o a1 7 l u g l i o

1971

.

APPROXIMATION NU~ERIQUEDES INEQUATIONS D '~VOLUTION J.L. LIONS (Paris)

Introduction.-

On donne dans ce cours les methodes fondg

mentales pourla r6solution numerique des inequations d'Svolution intervenant en Mecanique et en Physique. ~exSexperiencesnumerique, faites

a 1'I.R.I.A.

(Paris),

i

seront present&

avec toues les details dans un livre de R.

Glowinski, R. TrbmoliSres et l'A., a paraitre chez Dunod.

Plan detaill6. CHAPITRE 1 . 1

Inequations d'evolutions parabolique. Type I.

. Exemples .

2. Formulation generale. 3. Solutions fortes et faibles. 4 . Generalitgs sur les methodes constructives d'approxi-

mation. 4.1

Reduction 5 un equation parabolique. Penalisation.

4.2

Reduction 5 un equation parabolique. REgularisation.

4.3

Reduction 5 un inequation elliptique. Regularisation elliptique.

a un inequation elliptique. Discr6tisation.

4.4

Reduction

4.5

InQquation d'6volution et points selles.

J.L. Lions

CHAPITRE -2.

-

Approximation par discretisation des inequations paraboliques de type I.

1. Approximation d'un couple d'espaces. Constante de stabilit6. 2. Schemas d 'approximation 'des inequations paraboliques de

type I. 3 . Analogue de la stabilite.

4. Etude de la,convergence.

CHAPITRE 3 . -

Inequations d'evolution paraboliques de type 11.

1. Exemples. 2. Formulation gQn6rale. 3.

Schemas d'approximation.

4 . Stabilite et convergence.

CHAPITRE 4. - Inequations d16volution du 2eme ordre en t. 1 . Exemples. 2. Formulation ggngrale. 3 . SchQmas d 'approximation.

4. Stabilite et convergence.

CHAPITRE 5.- Complgments et problSmes.

1. Ecoulement de fluides de Bingham. 2. ProblBmes ouverts.

BIBLIOGRAPHIE.

J.L.

Lions

CHAPITRE I.- I n e q u a t i o n s d ' e v o l u t i o n s p a r a b o l i q u 1.-

Exemples. Exemple 1 . 1 . -

La t h e o r i e d e l a d i f f u s i o n e n m i l i e u x p o r e u x

( c f . Duvaut-Lions Soit

n

[lj)

c o n d u i t 3 d e s problemes du t y p e s u i v a n t :

o u v e r t borne de R

(n=2

o u n=3 d a n s l e s a p p l i c a t i o n s ) d e fro; tiere

le 3

r

2

"r6guli&re8'. Soit

l a norma-

r dirigee vers l'exterieur de a.

On c h e r c h e une f o n c t i o n u = u ( x , t ) , xe 0, t>0, solution de (1.1)

aU at

[ , T>O f i n i q u e l c o n q u e ,

x ]O,T

n x ]0,T[),

(oii f e s t donnee d a n s (1.2)

n

au = f d a n s

avec l a c o n d i t i o n i n i t i a l e ~

~ ( ~ 1 =0 u) 0 ( x ) I

I

un donne d a n s

n

e t l e s c o n d i t i o n s aux l i m i t e s (uiO

sur

o:"

sur

u & = o

Z = P X ] O , T [ ,

z, surz

.

Remarque 1.1.Le probleme (1.1 )

,

(1.2)

,(1.3)

e s t non l i n e a i r e Z cause d e s

c o n d i t i o n s aux l i m i t e s ( 1 . 3 ) . Remarque 1.2.DtaprSs u = 0

sur

u

c0

c

an

z

= 0

sur Z,

on voit q u e

J.L. Lions

an

o

=

sur

Z-

zo.

Mais Zo n'est pas donne 3 priori. Orientation. Le but de ce premiGr chapitre est: a) de montrer brievement corn ment le probleme (1.1), (1.2) , (1.3) (et, a vrai dire, des problemes beaucoup plus gen6rauxJ. est bien pos6

(

1

1;

b) de donner des m6thodes d'approxirnation numerique de la solution du problsme. Donnons un 2eme exernple. Exemple 1.2. On cherche u satisfaisant 3 (1.1),(1.2) et aux conditions aux limites sue

I

(1.4)

u 32

+

glul =

Z

o

g constante sur z

>Or

.

an

Autrement dit:

On verra que ce probleme est encore bien pose.

1

( )

[I]

Pour une 6tude plus systgmatique de la theorie, cf. H B&ZIS J.L.

LIONS [lJ[2]

.

J.L. Lions

2. Formulation ggnerale. Nous donnous maintenant une formulation "abstraite" de pro blSmes dlinequat.ionsde type parabolique, puis nous montrons cog ment cette formulation contient, en particulier, les exemples du N.1. Soient V et X deux eispaces de Hilbert ( 1 ) (2.1)

VcH

,

sur E,avec

V dense dans H I l'injection de V dans H etant continue.

On designe par:

(2.2)

I

I I

la norme dans H,

(

,

)

le produit scalaire

correspondant dans :HI

II II

la norme dans V

D'aprGs (2.1), il existe une constante c>O telle que

On se donne ensuite: bilingaire continue sur V x V, coercive au sens: il existe X tel que Ivl

l2

, a>O, VV E V ,

et on se donne encore: (2.5)

K = ensemble.convexe ferme dans V;

(2.6)

j = fonction convexe continue de V +1R

.

On identifie H 3 son dual et l'on introduit l'espace V' dual de V

1

( )

On peut aller beaucoup plus loin, enprenant pour V un espace

de Banach reflexif. Cf. Lions [2]

et la bibliografie de ce livre.

J.L.. Lions

de s o r t e que (2.7) S i f E V'

, on

ddsigne p a r ( f , v ) son p r o d u i t s c a l a i r e avec v E V;

c e t t e n o t a t i o n e s t c o m p a t i b l e avec c e l l e du p r o d u i t s c a l a i r e dans H. Le problsme. 1 O n c h e r c h e une f o n c t i o n t + u ( t ) d e [o,T] + V ( ) t e l l e que (2.8)

9

u ( t ) E K,

Un a u t r e probleme est: On c h e r c h e u = u ( t ) d e [o,T] +V t e l l e que

'tlv E V r

avec ( 2 . 1 0 ) .

L ' i n d q u a t i o n (2.9) ou (2.11 ) e s t c e q u ' o n a p p e l l e r a i c i une inequation parabolique de type I Remarque 2.1

.

S i K=V ou s i j=O,

2

( )

1

( )

(2).

(2.9)

et

(2.11) s e r d d u i s a n t 3 l ' e q u a t i o n :

Cf. l e s i n d q u a t i o n s du t y p e I1 au Chap.3. Dont il f a u d r a p r e c i s s r l e s p r o p r i 5 t d s .

J.L. Lions

Remarque 2.2. Si la fonction v

+

j(v) est diffgrentiable sur V alors (2.11)

Bquiva~ita l'equation (en ggn6ral non lingaire): (2.13)

au(t) , v ) + a ( ~ ( t ) ~ v ) + ( j ~ ( ~ ( t ) ) , v ) = ( f ( t ) , v ) ,V V E V . ( at

Remarque 2.3. Introduisons A

E

$ (V;V' ) par

Alors (2.13) gquivaut 2

Remarque 2.4. Si l'on considere la fonction $k indicatrice de K 0

1

( ):

si v e K ,

JI (v)=

(2.16)

k

+msi

V#K

alors (2.8) (2.9) gquivaut 3 : (2.17)

au(t) ( , t , v-u(t) )+a(u(t) ,v-u(t) + ~ ~ ~ ( v ) - $ ~ ( u) (3 t )

Les ingquations (2.9) sont donc des cas particuliers de l8in&quation

1

( )

La fonction $k est convexe et semi continue infgrieqrement.

J.L. Lions

I

vvcv 1 09 y e s t une fonction convexe propre (cf. le cours de U. Mosco [I]).

Utilisant la notion de sous diffsrential, on voit que

(2.18) 6quivaut 3

equation parabolique multivoque. Exemple 2.1.Voyons comment 18enonc6 gen6ral recouvre le probleme de 18Exemple 1.1. On introduit (notations des cours de R. Glowinski et U. Mosco): 1

(2.20)

V = H (n),

(2.22)

a(u.v)=

2

H=L (n),

&.K dx i=

ax. ax. 1

1

Alors le probleme (2.81,(2.9) ,(2.10) 6quivaut au probleme (1.1),(1.2),(1.3). Exemple 2 . 2 . On prend V,H,a(u,v) c o m e en (2.20) ,(2.22) et 180n intro duit

Alors le probleme (2.11 ) , (2.10) Bquivaut au probleme (1.1 ) (1.2), (1.4).

,

J.L.

Lions

Origntation. On va m a i n t e n a n t p r g c i s 9 r 3 q u e l s e n s on e n t e n d l e s " s o l u t i o n s " ' d e s probl6mes p r g c g d e n t s .

3.-

Solutions f o r t e s e t faibles. Solutions fortes. Par " s o l u t i o n f o r t e " du problsrne ( 2 . 8 ) , ( 2 . 9 )

,(2.10)

on en-

t e n d r a une f o n c t i o n u t e l l e que

(3.3)

(3.4)

u(t)e K

I

p.p.

en t

(z pour

tout t E

[o,~])

s a u f p e u t S t r e pour t d a n s un ensemble Z c [o,T] de mesurenulle, ona:

e t naturellement (2.10) ( 2 ) :

Evidernment ( 3 . 3 )

3 impose ( )

( I ) L~ (0,T; X ) = e s p a c e d e s " f o n c t i o n s " t + u ( t ) d e T 2 mesurables e t telles que I I u ( t ) l l dt<m

jo

2

( )

[o,T] +X

qui sont

•

X

I1 r 6 s u l t e d e ( 3 . 1 ) e t ( 3 . 2 ) que t + u ( t ) e s t , a p r e s modifica-

t i o n 6 v e n t u e l l e s u r un ensemble d e mesure n u l l e , c o n t i n u e de [O,T] A l o r s u ( 0 ) a un s e n s . 3

( )

Tant que l ' o n t r a v a i l l e avec d e s s o l u t i o n s f o r t e s 06 ( 3 . 3 )

l i e u pour t o u t t .

a

+H.

J.L. Lions

I1 est important pour les applications d'lntroduire tion de solution faible

(cf. Lions-Stampacchia [I],

une no-

Brdzis [ z ] )

.

Pour simplifier l'expose nous prenons

On observe alors que si u est solution "forte1'de (3.4) , on a:.

[

I

,v-u)+a(u,v-u)-(f,v-u) dt50

(3. 8)

V V E L2 (0,T;V) tel

que =EL' at

(o,T,v') et v(t) 6, K p. p. et v(O)=O. '

Mais c o m e

n'intervient plus dans (3.8) on peut ddfinir u cop! at me solution faible si u satisfait 3 (3.1), (3.3) et (3.8). Remarque 3.1.On a 6videmment des notions analogues de solutions "fortes" et "faibles" relativement

3 11in6quation (2.11)

.

Remarque 3.2.Seuil de .c6gularitS. La solution u(t) des problemes prdcedents n'est pas une fonction "trGs r6guliSre8'de t, quelle que soit la rdgularit6 des donnes f et uo. Prenons en effet V=H= IR,

a (u,v)=O (qui v6rifie (2.4) lorsque V=H),

J.L. Lions

La solution est indiquee

sur

le graphe ci contre. On voit que, 2

en particulier,

at2

$

L' (0,T).

Resultats ggngraux. On demontre

1

( )

les resultats

suivants (cf. par ex. Lions [2]

et

L L 0

la bibliographie de ce travail):

-

>C

TheorSme 3.1.- On suppose f E L~(O,T;V'). On suppose que (2.4) a lieu. I1 existe alors une fonction u et un seule satisfaisant

a

(3.11,(3.31 ( 2 . 8 ) .

TheorSme 3.2.- On suppose que (2.4) a lieu et que

I1 existe alors un solution forte et une seule de (3.1). ..(3.5). Remarque 3.3.On a des enonci!s analogues pour l1ini!quation (2.11). Remarque 3.4. L'unicite des solutions fortes est immediate. Pour l'uniciti! des solutions faibles, si.ul et u2 sont deux solutions Bven-

tuelles, on introduit: 1

( )

Nous donnons ci aprPs guelques indications sur les methodes

'constructives possibles de dgmonstration et au Chap.2 nous donnons l'approximation numgrique de la solution (qui peut, d'ail-

J.L. Lions

1 W = -(u +u ) 2 1 2

et l'on prend v=w u1 et u2

,

puis w

e

solution de

dans chacun des inequations (3.8) relatives 2

E

.

On additionne et.on peut alors faire e

-+

0. Cf. H. Brezis

c21 4.- G6ngralit6s sur les m6thodes constructives d'approximation.

4.1.-

Reduction 2 une equation parabolique. penalisation.

Soit 6 un operateur de penalisation attach6 a K (cf. Lions [2], p.370 et les cours de R. Glowinski et U. Mosco). On

"approche"

(3.4) par 1 "equation penalisee (4.1)

au ( 7 ,v)+a(u

1

(t),v)+;(~(u~(t)),v)=(f(t),v)

VVGV,

E

oil

E>O est destine 2 tendre vers 0, avec

--

I1 s'agit d'un probleme parabolique non lin6aire monotone (car 6 est, par definition, monotone de V

-+

V') dont on sait qu'il

admet une solution unique. On montre (Lions [2)) Theoreme 3.1

,-u E

que, par ex. sous les conditions du 2

-+

u dans L ( 0 , T ; V )

faible lorsque

.

E-+Ooh u est

solution faible. On peut ensuite approcher ur par l'une des msthodes de reso

J.L. Lions

4.2.- RPduction,&un equation pbrabolique. R6gularisation. Dans le cas. des inequations (2.11) on peut introduire j (v), EI

approchant j ( v ) . Par ex.

fonctionnelle convexe differentiable

,

dr

o

on prendra

est convexe, differentiable, et par exemple

y

(X)=IXI E

si

\ A ( > E

On "approchet'alors (2.11) par llequation rggularisee %u (4.3)

,

(* v)+a(uE,v)+(j~(u

avec (4.2)

,v)=(C,v)

Vv E V I

E

.

11 s1agit 12 encore d'une gquation parabolique non lineaire monotone

2

et on verifie encore gue u

-+

u dans L (0,T;V) faible,

u solution faible.

4.3.- Reduction 2 un inequation elliptique. Regularisation elliptique. Pour reduire par ex. (3.8)

a un situation

d e j a connue, nous

nous somrnes rarnenks jusqtici 2 des equations dlBvolution. On peut essajer de se ramener 2 des inequations stationnaires. pour cela, on considere le probleme de nature elliptique: trou ver u

fi

oii

fGo =

{v

E E

(4.4)

I

2 av veL (O,T;V), % E L

2

(0,T;H),v(t) E K v(O)=O>

solution de

1

P.P. I

J.L. Lions

Cette inequation entre dans le cadre de inequations variationnelles-elliptiques etudiees dans le cours de R. Glowinski et U.

= .

MOSCO 7 ( 1 )

On montre (cf. Lions-Stampacchia [I] u

par ,ex.) 1 'existence de

solution de (4.5) et la convergence de u

vers la solution E

faible lorsque Remarque 4.1

E

+

0.

.-

On peut utiliser pour llapproximation de la solution u

€

de

(4.5) les methodes descours Glowinski et Mosco. I1 est possible (mais non encore verifie sur des exemples num6riques)que l'usage de (4.5) soit utile

pour des calculs sur de longs intervalles

de temps. Cf. Carasso [I] pour le cas .de equations. Remarque 4.2.On peut utilis6r simultanim_ep~le5 id6es de 4.1 on 4.2 et 4.3. On peut donc se ramener 3 des equations stationnaires.

4.4.- Reduction 3 une inequation elliptique. Discretisation. La methode peut Gtre la plus naturelle de reduction au cas

(

1

On notera que le probleme (4.5) est non symetrique meme si

l'on part dlune forme a(u,v) sym6trique.

J.L. Lions

elliptique est dlutilis&r la discretisation de la deride en t. En raison de l'importance essentielle de ce procede pour les applications numBriques, nous etudions cela en detail au Chap. 2..

4.5.- InBquations dl&volution et points selles (cf. Tremolieres C11 1

. On introduit

et 1 'ensemble

On vBrifie que si u est solution forte alors

autrement dit: {u,u} est point selle de L(v,w) .

sur

hxh.

A

RBciproquement, soit {u,u) point selle de L(u,w) sur & x X , Alors (4.9)

L(u,w)

L(u,Q) 4 L(v,Q)

vv,w ~

3 d

d l o a l1on deduit (en observant que ~ ( u , u=L ) (QIQ)=0) que L (u,Q)=O. Alors L ( v , ~>o )

et

L(w,u)=-L(U,W) a0

donc dlaprSs 11unicit6 dans le Theoreme 3.1 (ou 3.2) on a: U

= u.

J.L. Lions

Donc

u est solution, alors {u,u) est point selle de

L(v,w) sur

Kx k

et rgciproquement.

On peut dgduire de 13 une mgthode de demonstration de l'&stence de solutions. En effet si K est born6 dans V, alorsk est born& dans l'espace W des fonctions v 6 L~ (0,T;V) avec

E L ' (O,T;Vt)et l'existence at d'un point selle (ngcessairement de la forme {u,u}) est consGqueg

ce d'un resultat classique di Von Neumann. Si K n'est pas borne, on i n t r ~ d u i t ~ KR =

CV I v

E

Kt

1 lvl 1

6 R};

soit uR la solution de

Prenant dans (4.10 (4.11)

J

:

I luRI I 2dt

v=O on en d6duit que

lid inf

[$

laN-l

12+

1 N-1 7 a(u 1

-

1 !w'(o)

-'1

TI

a(w(o))

J.L. Lions

de sorte que

pour toute fonction v par exemple continue de [o,T]

+

K

-

et

par prolongement par continuite, t/v E L~ (0,T;V) avec v (t)E K P -P. On ddduit de I3 que w=u=solution du 'problSme, d'oO le TheorSme sous reserve de la verification de (4.29)

.

Mais

d'oO le resultat dlaprSs (4.5). Remarque 4.1. Les resultats pour le schema (3.9) sont tout 3 fait analogues aux precedents. Faisant v=O dans (3.9) on en deduit: (4.32)

(

6"-6"-1 At

,6n)+a(un,6n)~(fn,6n 1 .

Mais a(un ,6n )=a(un+l , 6n)-~ta(6") d'oii en portant dans (4.32) et en multipliant par At:

J.L. Lions

Par sommation on en deduit

d'oil l'on dgduit, &

1 6"1 '+a(~"+~)
0 , that for each function &f) the

relations hold

( A Y . 9 ) L f ( y . y ) >o, being different from

(1. 3)

0 and for the sake of simplicity i s defined

G . I. Marchuk by A

> 0.

The operator A is called positive semidefinite if there a r e

such non-trivial

0E

,elements which turn the inner product (A lp, rp )

into zero. For a l l the remaining elements there holds the inequality

Below we shall formally denote positive semi-definite operators byA > 0. Let u s introduce then an adjoint operator A*, satisfying the Lagrange identity (Ag, h ) = ( g , A* h ) .

4 and h E 4. The space 4*, generally coincide with 4 ,though the domain. D of definition

If i s essential to note that speaking, does not

(1.5)

g E

of basic and adjoint functions is the same. To clarify the fact we shall show that

in many problems of mathematical physics

longing to the Hilbert space

@

g - function be-

satisfy some homogeneous boundary

conditions. In the application of the Lagrange identity (1. >), a s a rule, alongside with the operator

* those

boundary conditions which a r e

A

satisfied by the adjoint functions

h

a r e defined.

Further, we shall

apply a more convenient notation for adjoint functions. Thus, if the elements of the space of functions

* @ are

adjoint space

denoted by (P , then those of the

suitably denoted by

In the case of A Then

(Z, a r e

= A*

q=@.

CP*

, the operator A is called self-adjoint.

Note an important consequence connected with the properties'of the adjoint operators. it follows that Fourier

-

A*

>

Thus, if A

>

0

, then from the Lagrange identity

0.

s e r i e s expansions by eigenfunctions of basic and adjoint

operators a r e of great importance f o r the analysis of algorithms.

G. I. Marchuk Consider the two following spectral problems for A

Assume that each of the homogeneous equations forms a complete set of orthogonal eigenfunctions {u

n

2

0:

(1. 6) , (1. 7) and {u*), which n

can be normalized a s follows:

and eigenvalues

n

belong to the interval

We shall call this complete set of eigenfunctions a biorthogonal basis. Then supposing completeness, any flinction f of

@

and f*of @*can be r e -

presented a s a Fourier s e r i e s

where

Later we shall consider, without stipulating, the spectrum of

A

>0

and

A

-> 0

operators to be real. Hence, it is not difficult to A A establish that in such a case 1 > 0 and 2 2 0. n n Of great value for the analysis of numerical algorithms a r e esti1

2

mations of norms of operators. The norm of the operator from :

A is definyd

G. I. Marehuk

(further f o r the sake of simplicity the restriction

#

(fl

0 will not be stated).

A

Taking into consideration the relation

tlie'squared norm of the operator A can be also written a s follows;

ll 2 The operator

* A A is

=

sup

('p , A * A ~

(rp.9)

cp€+

symmetric and positive sani-definite. Consider

a spectral problem

A*AR =

X~

*

(1. 13)

The problem defines a s e t of eigenfunctions

bJ

X A * ~ > 0. The set n any function yY of

@J and

eigenvalues

for symmetric operators is complete. Then

Q A Acan be

represented a s a F o u r i e r s e r i e s

where

cpn Substitute the s e r i e s tion

R

n

=

((P

.nn).

(1. 15)

(1. 14) into (1. 12) and u s e the condition f o r func-

orthonormalization. Then we shall have

where Q is a Hilbert space of F o u r i e r coefficients. It is not difficult to find that

-

11

1

;

11

XA'A

min

;

d

A*A '

G. I. Marchuk

where

A* A Amin

and

A*A max

a r e minimum and maximum eigenvalues

respectively from the totality hAtA of the spectral problem (1. 13). n A*A The value is usually called a spectral radius of PA*A = max the ,dperator A*A. li

9

In the case of a self-adjoint operator A consider

problem. Au = We have

IIAII

=

xA

u

a spectral

.

(1. 18)

FA-

If i s evident that for the self-adjoint operator

Let us consider certain properties of norms of operators

the problem

of eigenvalues. 1. 1. 1. Energy Norms. Later we shall always deal with Hilbert spaces of real functions with the norm (1.21) where C

> 0 . It

is easy to see that using the Lagrange identity we have

the equality

and consequently

C+C* 2

The operator C

i s symmetric and positive. It means that i f

> 0, then the norm of lp function can always

be presented a s any

inner product with a symmetric operator in the form of the weight function, i. e.

.

On the basis of Buniakowsky-Schwarz inequality

one can obtain the following important estimation :

where

dC+p and

-2

-

i s a maximum and minimum eigenvalue of

2

the symmetric operator

c + c" , 2

F o r simpler and more frequent cases one usually assumes C = E. Then we get

1. 1. 2.

Estimation of the Norm of a single operator

Let us consider a positive semi-definite operator A

2

0.

There

is the following relation :

II(E

+&A )-l11< 1

(1. 25)

for any parameter 6' > 0. This assumption can be proved by the formula

G. I. Marchuk

11

( E + @A)-

1

(1 =

( (E + ~ A )- l q ,(E + ~ A ) - l q

sup

26)

( Y W

Let u s take

y,

=

(E

+

G-~)-lq

a s a new t r i a l function of (1. 26). Then we get

11 ( E

As A

A

->

>0 ,

-

('t'>Y')

( ( E + C A ) (I, , ( E + C A )

0,

=

)

the estimation (1. 2 5 ) follows f r o m the l a s t relation. If

we have

1. 1. 3. If

-

+ W A ) - ' I ~ = sup

Kellogg's Lem= A > 0 and

>

0

, then

Let u s introduce the notation

T = ( E - r A ) ( E + #A)Consider t h e expression f o r

I( T I(

1

G . I. Marchuk = sup

( ( E - ~ A ) ( v (, E - W A ) ( Y )

-

y E \ Y ((E+O-A)Y, ( E + c A ) y ) = sup YIEY

(yJ, W )

-2

(A yr,

( y , y)+2 (Ay,

2

'Y )+a( A Y s A ' ? )
0

instead of (1. 28) we shall get

1. 1. 4. Estimation of the Norm of Operators A.s it was stated above

Since the squared norm of the operator A coincides with the spectral radius of the self-adjoint operator A*A, to define/3A,A

one can use the well-known

iterative Kelloggls process, i f A i s a normal operator, that ,is AA*= A*A,

where index k denotes the number of the sequential iteration of the following scheme :

The proof of the convergence of the iterative process (1.29)imme-

-

G. I. Marchuk

diately follows from the Fourier analysis. In.fact, let

where the

R a r e eigenvalues of problem (1. 13). Consequently n

Substituting the series into' (1.29) with large k we have

Am

where

=/$I*A

=

11 ~ 1 a n1 d 1~;-

is the eigenvalue preceeding '

its maximum of the operator A*A. 1. 1. 5. Cdculation of Spectrum Bounds .of a Positive Matrix Consider a problem of finding out maximum and minimum eigen~ a l u e sof the operator A, havifig a positive spectrum Au=

A

u.

F o r this purpose we use Lyusternik's method. We shall introduce an iterative process

where c

is a normalization factor, which i s conveniently chosen in n (n) the form c . Them n

=11~

9

= A

$111

and

/A

= lim

n+m

1,

y(n)

11

1

G. I. Marchuk Here the following norm i s used

where

a r e vector

selected to the order

y(n) components. The constant c

PA.

Then consider the matrix

B

n

i s usually

= / j A- ~ A

and the problem

It is evident that

B.2

0. Then consider again Lyusternikts iterative

process

and get

It i s easy to s e e that operators A and B have a common base and

Hence

=A-A.

Un this way not only maximum, but also minimum eigenvalues bf the matrix A a r e found. We assumed the matrix A to be of the form in which Lyusternikts method is applicable. 1. 1. 6. Examples Let us now go over the simplest examples which later will help illustrate methods i n numerical mathematics.

G. I. Marchuk 1. Let

where

A=

3 3;2

p 32

+

is the Laplace operator. The operator A

is defined on a set of r e a l functions

+,

whose elements satisfy the fol-

lowing requirements. B r s t ,

where 3 D is the boundary of the domain D. F o r the sake of simplicity it is assumed that the domain D i s a unit square Second , the functions

((P)

form a

{0 5r5

1, O

0.

At last, consider a spectral problem A

h

u

u = 0

=

Xu

in^,

for B D h .

The components of the eigenvectors corresponding t o (1. 56) u k1 = mP In (1. 57) the indices m

and

follows

sin

k m X h s i n lp7s h.

k, 1 specify the components

p a r e the numbers of eigenvalues :

are (1.'57)

of the solution, and

which can be ordered a s

G. I. Marchuk

uk1 mP

=

U

!

l , (i = 1.2 . . . . 1.

J

With the obvious relations

- A k (vksin

4

k r n ~ h =)

-a,ol s i n (

l

p h ~) =

sin

h2 4

h2

sin2

----mxh sin krnKh, 2 sin

i p h,~

2

the eigen-values will be = -4

mp Note that

Here the

m

and

1. a r e

p

mrch ---

( sin

h2

change from

ordered

mP

.

+

2

I to

pxh sin2 -1.

n-1.

As, usually ,

2

(1. 58)

Consequently,

*h 2

< 1 we can

write approximately sin2 7Ch --2

-

-- + n2h2 4

0

(h4)

hence , we get

Thus, we can write

The basis of eigen-vectors

(1. 57) can be used to present the vector

G. I. Marchuk a s a series

CQ kl

.

Then we get

where

The examples considered above give the necessary understanding of some operators and their properties.

-1.-2.

Approximation -----

Let us consider a certain problem of mathematical physics in the operator

where Here

form

cq

A is linear operator,

4

and

elements in

F

E

4

and

f E F.

a r e Hilbert spaces with the domains of definition of

D + 3 D and D respectively;

is

a linear operator of the

bountary condition, g e G , G is a Hilbert space of functions with the definition domain 3 D . Along with the equation (2. 1) , let us consider a s i m i l a r equation in a finite-dimensional euclidean space

a

'f

=

P,

=

gh

in

D~

, f o r aD,

G . I. Marchuk h where is a linear operator depending on the m ~ s h s i z e h , +h h f E F h , and. $h and F a r e euclidean spaces with the domain of h + a D h and Dh , respectively. Here Dh definition of elements D h

,y

is a set of inner

mesh-points of the D domain, and

3 D is a s e t of h the meshpoints, on which the boundary condition of the problem is aph proximated, ah is a linear ?perator on the grid g E G G is an h' h euclidean space of the vectors with the domain of definition a D h' Let us introduce the Hilbert norm of the vector i n grid spaces Fh,

Gh ,

.

Then we s h a l l denote by (

) the totality of values 7.h of any function of problem (2. '1) after projection on the grid domain

3 D o r Dh + aDh . Then the following definition is usually used.: h problem ( 2 . 2 ) approximates problem ( 2 . 1) with the o r d e r hn for the

Dh,

solution

where

cp

, if

M. a r e some constants different from

ca

.

F o r the cases where the solution of problem (2. 1) is

smooth

enough, e r r o r s in approximation a r e conveniently measured by the maximum norm peculiar t o the space of continuous and differentiable functions. To this end the Taylor-series functions participating in the formulation of the problem is used.

G. I. Marchuk L a t e r we shall assume that redtction of problem (2. 1) t o (2. 2) is made and moreover, the boundary condition of (2. 2) is used to eli-

minate the solution in boundary points of the

Dh

+ a D domain. As h

a result we have a n equivalent problem

h . is now the domain of definition of the solution D The soh' lution iph in boundary points is t o be found after solving equation

where

(2. 4) a s a result of a solution of

Eq. (2. 2)

with respect to the un-

knowns. Thus, in s o m e cases it is convenient to u s e form (2. 4) in writing an approximation problem, and in others form (2. 2) is

m o r e proper.

So, a s a result of the reduction applied with certain approximation, a problem with a continuous argument (2. 1) is reduced t o a problem in linear algebra (2. 4). F u r t h e r task is to solve a system of algebraic equations.

E x a m p 1 e.

Consider problems

The domain of definition D is assumed to be a square €0

5x 5

1 , 0

5

y

5

1 1 , and f

square D with a uniform grid along

-

a smooth function. Let u s cover

x and y

with m e s h s i z e h. The

mesh-points of the domain will be denoted by two indices (k, l ) , where the f o r m e r 3 0)

corresponding

to the spectral problem Au

=

Xu.

Introduce the following F o u r i e r s e r i e s :

where

u" a r e eigenfunctions of the adjoint spectral problem. Substitute (3.2) n into (3. 1) and multiply the result scalarly by u: . Then , if 7 > 0 we get expressions

Supposing that

f o r Fourier coefficients

G. I. Marchuk

we obtain the initial condition

The solution of ( 3 . 3 ) , ( 3 . 4) is obtained f r o m r e c u r r e n t elimination of the unknowns. A s a r e s u l t . we have

where r

n

=

1

-

7Xn.

Equality ( 3 . 5) is estimated

in modulus

We reinforce the l a t t e r substituting have

max J

I f h 1 f o r 1 fi- 1 I .

Then we

where

l fn l

rnax j

John Neumann introduced a so-called spectral critetion of stability.

It means that if f o r each harmonic

va

G. I. Marchuk

of the Fourier s e r i e s of ( 3 . 2 ) ,

t h e r e holds the relation

.1'

where C

C

2 -

a r e constants. independent of

j, then the difference

scheme' ( 3 . 1) is announced to be numerically stable. i ~ e u ts s e e what conditions should be applied to the p a r a m e t e r s of the difference scheme ( 2 . 12) to satisfy relation ( 3 . 8): Relation ( 3 . 7) analysis shows that sta-

bility criterion ( 3 . 8 ) is fulfilled if the condition

is imposed upon the parameter

r

.

n Suppose that the spectrum of the operstor

A

is situated in the

interval 0

0

.

Now let us come to a more general definition of the notion of numerical stability. ,For this purpose l e t us consider the problem

which i s approximated by the difference problem

G. I. Marchuk

We s a y that the difference scheme (3. 15) is stable , if with the fixed parameter

h , characterizing the difference approximation, t h e r e holds

the following relation for any

where the

h C1 , and

C:

j :

constants a r e independent of j.

The definition of numerical stability involves notion of the correctness of problems .with continuous argument. One can s a y that num e r i c a l stability establishes a continuous dependence of the solution on the input data for the problems of discrete argument. In fact , we choose, a s the input data of ( 3 . 15) f = f,,

and

g = g*

.

We get some solution of ( 3 . 15) and denote it by (4+. the input data

Then f o r the difference of the solutions

we have the following problem :

Then the stability condition becomes

Then we take a s

correspond

Hence it follows that small variations uf the solution t o those of the input data

f

and

g

.

It i s easy to s e e that the definition of stability a s it is given in

(3. 16) already relates the solution itself with a priori information

of the input of the problem. Such a definition i s more convenient for stability Bnalysis of many prqblems than the one due to Neumann. Let us consider stability of (2. 12) from this point of view., To this end we rewrite

the recurrent relation (3. 1) a s

where

The formal solution of (3. 17) i s

We shall estimate solution

(2. 28) by the norm using the Cauchy-Bunia-

kowski inequality and the triangle inequality.

We replace

11 f j - I 11

by the maximal value in all

Let

max j Then

Then we get

Il ll = I II fJ

j

.

If we put

then scheme

(2. 12) will be stable in the sense of the definition (3. 16).

It i s natural that condition (3. 22) is a sufficient condition of stability. One could get finer and weaker criteria through the norms of the powers i of the step operators 11 T )[ (i = 1,2,. ,j). However, such a weakin-

..

ing of the condition makes difficult a constructive procedure for establishing the stability criterion.

A s a rule , in practical calculations the sufficient

condition of

the form (3.22) i s usually applied.

A =A'>

Let us consider a case with the operator

0

and denote

Then

Let

'f=z

n

where

(u,)

~

~

n

~

n

'

is the base of the operator A .

Then

J

=

1

- 2 t X + -c2

2,

where

Let us find conditions to be satisfied by 7

, s o that J < - 1 ,

i. e.

then

H e n c e , if

P A =l l h ]

mar

=

An

=

,then 1.

We get following sdficient condition of stability :

Z

.

0 ,already independent of h, i. e. C l = max h

ch 1

and

C2 =

max h

Thus,with the constructive approach to the establishment of stability of some o r other difference scheme it is always preferrable to h h define C1 and C constants f o r h fixed. However, i n the c a s e s when 2 passages to the limit a r e studied with h + 0 , Z+ 0 , judgement of '

stability should be connected with either of

h

o r of

Z

.

C1 and C2

constants independent

It i s this view of stability that we shall use

in the investigation of the convergence of approximate problem solutions to accurate ones, with simultaneous tendency of

h and Z towards zero.

A s i m i l a r consideration r e f e r s to the definition of numerical stability in the f o r m (3. 3 1).

-- 1 . Let us investigate the numerical stability of differenExample ce schemes approximating the equation of heat conductivity. In the c a s e of an explicit scheme (2. 21) we had the norm of the step operator in the form (2. 28)

Now let

(1 T 1)

0 . Such schemes a r e usually called unconditionally o r absostable . Similarly, for the Crank-Nicolson scheme (2. 36) we have :

Thus this scheme is also absolutely stable. 1. 4.

The Convergence Theorem-

The present section will deal with one of the most important theorems of numerical mathematics, known a s the equivalence theorem, finally formulated by P . Lax. The point i s that f r o m approximation and stability of the difference scheme follows the convergence of the approximate problem to the solution of the exact one. Let u s consider an evolutionary problem

G. I. Marchuk

with t h e boundary conditions :

acp

= g foraD

x

T

and the initial data

tp =

lp

0

with

Let u s &rite the problem (4. 1)

t

-

=

0

(4. 3) a s follows ;

Let us consider a space of functions with the definition domain D x T and cover this domain with a grid. Then we project the solution of the problem (4. 4) upon the grid domain

-Dh x

TT, where

and consider an approximate problem approximating

6h

=

D +aDh

h (4. 4) in the form :

Suppose now that the following approximation is valid :

G. I. Marchus

Then a s s u m e a certain numerical stability of a difference problem

and C be independent of h and T . Then provided that we have 1 2 approximation (4. 6 ) , stability (4. 7 ) and .linearity L, 1, Lh*and lh there

Let C

is a convergence

Let us prove the theorem a s follows. Consider the identities

and taking (4. 4) and (4. 5) into account we write :

G. I. Marchuk

If triangle inequalities a r e taken f o r the norms :

Applying the conditions of approximation (4. 6 ) we get :

Now consider the equalities

where

A s the difference scheme (4. 1 2 ) is numerically stable according to the theorem condition, we get

and, considering (4. 13) and ( 4 . l l ) , we have

G . I. Ma rchuk

o r , finally,

Thus the convergence theorem is proved. The assumption of the theor e m included a r a t h e r rigid condition that of

h

and

C1,

and C

2

a r e independent

Z.

P a r t i c u l a r l y unpleasant i s the requirement of independence of these constants of h . As i t was already mentioned i n the previous h section, t h e values ch and C2 a r e defined with h fixed. Moreover, 1 with h -+ 0 t h e s e constants, i n many c a s e s , tend t o infinity a s follows:

wheke

m > 0. If this fact i s taken into account,

then the

approxi-

mate solution convergence to the exact one will b e evaluated in the following wa.y :

If k>m and t P< hm , then t h e r e is convergence. Naturally, the convergence t h e o r e m can also be formulated f o r the c a s e s when C depend both on h and on 7 .

1

and C

2

G . I. Marchuk Let us turn t o the case of convergence in stationary problems of mathematical physics. Let the problem be

A y

=

f'

in D

a 9

=

g

for 3 D .

Let problem (4. 16) be approximated by the following difference scheme:

Suppose, there i s the following approximation

Besides there is an estimate a priori of the problem solution

where C

1

and C

2

(4. 17)

a r e constants independent of h. Then , similar to

the above , we get a convergence

Thus, in the study of difference stationary problems of mathematical physics the role of stability condition i s performend by close to i t c o r r e c t n e s s condition using a p r i o r i estimations. Here l i e s a profound inner relationship of difference equations for stationary and evolutionar y problems. After establishing approximation and a priori estimates (of stability i n the case of evolutionary problems), both of the problems become principally equivalent i n their formulations and a r e investigated by s a m e methods.

G. I. Marchuk

C h a p t e r

2

METHODS O F SOLUTION O F NONSTATIONARY PROBLEMS

In this chapter we shall deal with methods of solution of nonstationary problems i n mathematical physics. We shall concentrate on a solution of complex problems and their reduction t o simple ones using the method of finite differences. Our main object will be a n evolution problem in mathematical physics

where A

> -

0 , and the solution of the problem

( X ), the functions f

and g possess a necessary smoothness in the domain of definition of the solution D x T. It will be assumed that at the boundary of dD the solution of the problem satisfies some boundary conditions. 2. 1 Approximation

-

Stability Relation

The problem of approximating difference equations by finite-difference ones and stability of difference equations a r e closely related. Indeed, let u s a s s u m e that it is required to- find an approximate solution to a problem in mathematical physics given imput data of the problem. The approach to approximation of stability formulated in the previous sections appears i n most instances too general judge of individual properties o r the algorithm being developed. One of the reasons is generality of assumptions made when one investigates properties of the solution of a n approximate problem.

Thus, when one judges of approxi-

G. I. Marchuk mation one usually uses estimates which a r e valid f o r a whole c l a s s of problems but not for an individual problem, a theoretical estimate of the operator's norm being given from the worst function of the class, i. e. the function which results in a maximum e r r o r . In practical cal-

culations, however , we have to do with specific functions defined by the input data of the pt.oblem. Therefore it is expected that investigation of a n approximate solution of a problem studied may allow u s to construct effective algorithms of an approximate calculation in a different aspect. This thesis is easily illustrated if we consider a n evolution problem

where A = A g

>

0

, and the operator A does not depend

belong to the Hilbert space Let u s approximate

L2 (1. 1) by

on t , +

and

.

Naturally, a necessary condition for approximation to (1. 1) by the difference problem (1. 2 ) is , in a sense, smallness of the expression T A ~ '.

This follows immediately from Tailor's expansion in a s e r i e s

of the initial problem (1. 1) and from substitution of the s e r i e s into (1. 2). Here , s u r e l y , i t is natural , when one uses this method, to make an assumption that the solution is sufficiently smooth.

Thus, let

G. I. Marchuk

Substituting (1. 3 ) into (1. 2 ) we get

Using (1. 1) the last equality is transformed and written a s

F r o m h e r e i t ' follows that the approx?mation condition may be chosen

a

so a s t o emphasize smallness of the remainder due t o the mutual compensation of the t e r m s in the left-hand side of (14 ) and, considering that for symmetric operators (A2 * j . 4 ' ) = (A

4', A @ j ) ,

we have

3 2

( A + j , ~ + j ) 0 and

11 (E +

%A')

-'1

L

'

I 8

Thus, considering (3. 4) and (3. 7), we transform the inequality (3. 5) a s follows :

(3. 8)

setting

1 yo11

=

1 g 11

and

1 f 11

= max

1 $ 11 .nd

using

j

the recurrence relation (3. 8-, we get

(3. 9)

where

C = J T

=

T

2,

F i r s t of a l l we can find out that a trivial extension of the above splitting-up methods to the case n = 2 i s generally impossible. Therefore our object will be t o extend $putting-up algorithms to this case making assumptions which allow such an extension. 2. 5. 1 -

The method of universal algorithm-

Under the assumption (5. 1) it can be presented a s

p.'

0

I g,

where fJ = f

(tj+l/2).

The scheme of the operational algorithm i s a s follows : $ = - A $ + f j .

3 j+l/n

(E + (E

z +2

A2)

= $,

j+l/n = $j+l/n

G. I. Marchuk (E+

Y'

;%) 5

j+l =

j

j+1

Il- 1

y j+ y-

;

- 7tj+l

(5. 3)

IV is not difficult to check that the universal splitting-up algorithm has second o r d e r accuracy in Z if a solution i s sufficiently smooth. Numerical stability 'will be achieved if the condition is fulfilled :

II T 11
0. There

")8-

is a question if it is reasonable to

"split" ,

f i r s t , the operator into A

dP

operators

A

into

A and then, in turn, the operators A d

? . I s it not easier to represent the operator A a s a set of

&

right away ? In this connection it should be remarked

that though these two approaches seem equivalent, in many cases it

is more convenient to convert, first, a complex problem in mathematical physics into simpler ones which further

can be independently

reduced t o difference problems (see supplement). Let us analyse any problem of (6. 3 ) and ,considering (6. 7), split it into even simpler ones

where

It i s not difficult to see that the system of the split equations (6. 8) approximates the initial-value problem (6. 1) to within the second order in Z .. The proof of such a statement i s based on the fact that, using (6.2) and (6. 6), one can change the ordering of the components of splitting-up , by writting

*dp and in this event we gave the problem

=

f A *=I

C

G. I. Marchuk which, a s was shown in 6 2 . 5, approximates the problem ( 6 . 1) accu-

.

rate to the second order in Z

The result is also true of the case

when A i s dependent on time. Then one should make approximation to

ou3

the operators

Ad.

=

to within the second order in 7 on each

A&

. If the a r e non-commutative, then, using interval t . < t 5 t . J J+1 the two-cicle procedure described in 8 2. 5, we obtain a difference scheme accurate to the second order for each interval

t j - l 5 t 2 tj+l. To summarize, we can assert the following. When an evolution problem of the form (6. 1) , under the condition Ad? 0 , is reduced to particul a r evolution problems ( 6 . 3) and these a r e regarded a s a set of new evolution problems, the approximation to the initial will be accurate to the first order in

z provided

-

value problem

that at least one of

the elementary problems i s reduced to difference schemes accurate to the first order. If every such problem has approximation of secondorder accuracy, then, using the two-cycle procedure with respect to O(.

a n d p , we again come t o approximation of second order accuracy

in Z . It should be noted that if the operators A

a r e non-commutative,

then without using the two-cycle procedure we derive approximation of ( 6 . 1) accurate to the first order.

Indeed, let us consider the case of non-commutative operators. Then the following is an initial-value problem : n

v =y j

if

t =

t

j '

(6. 10) is reduced t o the system

Let

Ad

=

E LAMP /j'l

. where Adpi

'

Adpj

"yi

(6. 11) is solved by the two-cycle method.

Then every problem of

The initial conditions for each of the systems (6. 12) a r e taken in the form

It i s easy to find that (6. 12) approximates, on the interval t .< t 32 any of the problem ( 6 . 11) to within T .

5

tj+l,

In order for the whole algorithm to lead to a solution of (6. 1) to within z4 it i s also necessary alternate the basic cycles. Thus,instead of (6. 11) on the interval t . 5 t J- 1

* at

+

= 0

and, on the next interval

5 t. 3

,

t. < t 3 -

we should have

(d=1,2,.

5

t. 3+1

. . ,n),

:

(6. 14)

G . I. Marchuk

It is assumed that each .problem of (6. 13) and (6. 14) is solved by thertwo-cycle method of the form (6. 12). Note that for the condition

& .2

C

the component-wise splitting-up method is absolutely sta-

ble.

2. 7 Hyperbolic ,Equations Hyperbolic equations hold a prominent place in applications. Numerical methods for such equations a r e studied to full advantage. Hyperbolic equations have the following characteristic features. F i r s t , the domain of dependence of a solution for such equations is bounded by a characteristic cone so that the region outside D x T space does not affect the solution in the point under consideration. Second, among the solutions of the initial-value problem there may be non- smooth solutions a s well and this should be kept in mind when one develops numerical schemes. Since a great amount of excellent investigations a r e devoted t o constructing difference schemes for hyperbolic problems we shall discuss only some of the methods most widely used in recent years. Let u s look at the problem

G. I. Marchuk

It will be assumed that the operator the functions

g

and

p

j- >

0 for a l l

#

is independent

of time and

allow sufficient smoothness of the solution

of the periodic problem. Let the exist

A

operator A be pdsitive, i. e. there

0, so that

By the way, we must note that for symmetric positive definite operators

)-= dA, , where

dA is a minimum eigenvalue of the operator

A spectrum. Let us consider difference approximation to the equation of (7. 1) in the form

It is easy to show that the difference scheme (7. 3) approximates the initial equation of (7. 1) to within quantities of the second-order of smallness with respect to 7 . We apply initial data t o (7. 3 ) . In order not to distort'the second o r d e r of approximat'ion we take, along with the condition,

the following relation :

(7. 5) is derived by expanding the problem (7. 1) into Taylor's f e r i e s in the vicinity of

t = 0 with a subsequent elimination of derivatives

G. I. Marchuk

using the equation and the known initial conditions in (7. 1). The problem of (7. 31, (7. 4), (7. 5)

is fully formulated. Our

object now is to analyse numerical stability spectral method. Let u

and u* n n values of spectral problems

Next, it will assumed that

of (7. 3) , using the

be eigenfunctions and

I gn)

n

> -

0 eigen-

f o r m s a basis. Then we seek a

solution to the equation in the form

where

Substituting the F o u r i e r s e r i e s result

by

expression

j

yn

(7. 7) into (7. 3) and multiplying the

we get for the F o u r i e r coefficients the following

A solution to (7. 8)

is sought in a form of the power

Note that in the left- hand side of (7. 9 ) j right

-

i s an index

function

while in the

hand side it is a power.

Substituting (7. 9 ) into

(7. 8)

we get the characteristic equation for

7' n'

G. I. Marchuk

It .is easily seen that i f

the roots of (7. 10) a r e complex conjugate and equal t o unity in modulus

i. e.

IqnI F r o m the condition

(7. 12 )

=

(7. 11)

Evidently, (7. 13) will be fulfilled for all

n

if

7 a r e taken such

that

where

/jA

is the upper bound of the o p e r a t o r

F o r symmetric operators

PA

=

11 A 11

Let u s proceed to implicit difference schemes

A

hence,

spectrum.

G. I. Marchuk

The scheme ( 7 . 16)

is accurate to the second o r d e r with respect to Z

and in combination with ( 7 . 4), ( 7 . 5)

it approximates ( 7 . 1) to within

the setond o r d e r . F o r ( 7 . 16) the characteristic equation is of the form.

hence, 7

F r o m h e r e i t follows that with any 1

Thus,

( 7 . 16)

( 7 . 19)

is a n unconditionally stable scheme.

Let

where

A

b(

> -

0 . F o r an approximate solution of ( 4 . 1) we make use of

difference appr~~xirnation of the form

G. I. Marchuk where

F r o m (7. 21) and (7. 22) it follows that (7. 21) approximates the initial equation of (7. 1) to within quantities of the second o r d e r with respect to 7

.

Since the equation

(7. 21) can be reduced to

f r o m the analysis made above t h e r e follows stability of (7. 21), (7. 22), provided that

In t h i s way the problem of choice of the p a r a m e t e r

Z satisfying the

stability condition reduces to calculating a maximum eigenvalue of the problem (under the assumption that all eigenvalues

/3B- 1 A

a r e positi-

ve ) : AU

=

XBU.

( 7 . 25)

This problem is solved by the iterative p r o c e s s

In this connection

The operational scheme of the difference s y s t e m corresponding to (7. 21)

G. I. Marchuk

i s written a s

This problem is solved successively with initial data

(7. 4 ) and

j = 2, 3

. . . . ,and

using the

(7. 5).

( 7 . 28) i s a splitting-up scheme. To conclude, we consider a wave equation

where

.2 a

is squared. velocity of propagation of wave perturbation

The problem (7. 2 9 ) will be called periodic in geometric variables. Using our notation

A

=

-a2,.

G. I. Marchuk

Let us assume that instead of the differential operator we consider its second-order difference approximation in all variable6 xo< Then

If

a

,X

= h

. the spectral problem

defines the upper bound of the difference operator A spectrum in the form

Thus, i n this case the explicit scheme (7. 3) requires fulfilment of the condition

(7. 14) o r

If we consider the scheme

(7.21), (7. 22) , where

then, applying the spectred analysis , we get

This means that the difference scheme of (7. 29) on the basis of the

G. I. Marchuk

splitting-up algorithm (7. 2 1 )

will be unconditionally stable.

The algorithm considered extends fairly simply to inhomogeneous hyperbolic equations.

G. I. Marchuk Contents INTRODUCTION

.. . . .. . .. . .. . . .. .. ...

Chapter 1. General information from the theory of difference schemes 1. 1. Basic and adjoint equations

1. 1.-1. Energy norms

1. 1. 2. Estimation of the norm of a single operator

;

1. 1. 3. Kellogg's leinma 1. 1.4. ~ s t i m a t i o nof the norm of the operators

1. 1. 5. .Calculation of the spectrum bounds of a positive matrix 1. 1. 6. Examples 1. 2

Approximation

1. 3

Numerical stability

1.4

The convergence theorem

Chapter 2. 2.

Methods of solution of nonstationary pr6blems

L Approximation-stability relation

2. 2. Difference schemes of second o r d e r accuracy with time-dependent operators 2. 3. Inhomogeneous evolution equations 2. 4. Splitting-up methods for nonstationary problems 2. 4. 1. The method of universal algorithm

2. 4.2. The predictor-corrector method 2. 4.3. Component-wise splitting-up method 2. 4.4. A solution of problems with time-dependent operators

2. 4. 5. An example 2. 5. Multi -component splitting of problems

G. I. Marchuk

2 , 5. 1. The method of universal algorithm 2,'5. 2, The predictor-corrector method 2. 5. 3. The method of successive splitting based on .elementary Crank-Nicolson difference schemes

2. 6 . A general approach to component-wise splitting 2. 7 . Hyperbolic equations References.

page

G. I. Marchuk References p e r s which a r e v e r y close with a theory symbol ( W ) indicates t h e p a-------------of the spletting-up method. --

-The ---

a -

1

- -Monographs ----

and text books.

Babuska I. P r h g e r M. , Vithsek E. Numerical p r o c e s s e s i n differential equations-Interscience ,1966 Bahvalov N. S. Foundations of numerical analysis Berezin I S. , Zhidkov N. P. Computing methods

-

-

(1970) Moscow (Russian)

Pergamon P r e s s

-

Oxford 1965,2' vols

Wasow W. ,Forsythe G. Finite difference methods f o r partial differential equations J . Wiley and Sons (1959).

-

Voevodin V. V. Numerical methods of algebra. Theory and algorithms. "Naukafl Moscow 1966 (Russian) Godunov S K. L e c t u r e s on difference methods f o r the solution of the equations of gas-dynamics Novosibirsk 1962 (Russian).

-

Godunov S. K. , Ryabenki V. S. The theory of difference schemes. An Introduction. North Holland .- A m s t e r d a m 1964. D'jakonov E. G . Iterative methods f o r t h e solution of d i s c r e t e analogues of ( ) boundary value problems f o r ellyptic equations (Intern Spring School on Numerical Math. , Kiev. 1966 (Russian) I. K. A. N. SSSR V. Z. Acad. Nauk. SSSR/Kiev 1970. Illin V. P.

Difference s c h e m e s f o r t h e solution of ellyptic equations Izd . NGU. Novosibirsk (1970) (Russian)

Kantorovich L. V. Functional analysis and applied mathematics Nauk 3,6, 89- 185 (1948) (Russian).

.

-

-

Uspehi Matem.

Kantorovich L. V. Krylov R. I. Approximate methods of higher al.alysis, FM. ,Moscow-Leningrad (1962).

G. I. Marchuk

Collatz L. The numerical treatment of differential equations, 3rd. ed. Springer Verlag, Berlin (1960).

-

Funktional analysis und Numerische Mathematik , SpringerVerlag, Berlin (1964).

Krasnosel'skii M. A. , Vainikko G. M. , Zabreiko P. P . , Rutickii J a . B. , Stecenko V J a . Approximate solutions of operator equations Izdat "Naukall Moscow (1969) (Russian) Courant R. P a r t i a l differential equations vol. 11, New York 1962. Lions J .

*

- llCourant-Hilbertv

( ) Resolution iterative d1in6quations variationnelles p e r decomposition et eclatement College d e F r a n c e (1967)

-

Marchuk G I. Computational methods f o r nuclear r e a c t o r s Atomizdat, Moscow 1961 (Russian)

-

Marchuk G. I. (+) Numerical methods f o r weather forecasting-Hidrometizdat

1967 (Russian)

-

( S ) Methods and P r o b l e m s of numerical analysis, (Russian) Int. Congress of Math. Nice (1970)

Marchuk G. I. , Lebedev V. I. (+) Numerical methods i n t r a n s p o r t theory Moscow (Russian)

- Atomizdat

(1971)

Mikhlin S. G. Variationsmethoden d e r Mathematischen Physik-Akademie Verlag-Berlin 1962 Richtmyer R. Difference methods f o r Initial-Value problems Intersciance Publ. Inc. New York (1957) Richtmyer R . , Morton K. Difference methods f o r initial-value problems, New York 1967 Rozdestvenski B. L. , Ianenko N. N. ( + ) Systems of quali-linear equations 1968 (Russian)

-

"NaukaN Moscow

G. I. Marchuk Ryabenki V. S. ,Philippov A. F. On stabJlity of difference equations Gozudarst Izdat. Tehn-Teor. Lit. Moscow 1956 (Russian)

-

Samarskii A,. A . (*) Lectures on difference schemes

- Moscow

1969 (Russian)

Saul1yev V. K. Integration of equations of parabolic type by the method of Pergamon P r e s s London 1964 nets

-

-

Smirnov V. K. Lehrgang d e r hBherer Mathematik Berlin 1956

- Deutscher Verlag -

Sobolev S. L. Lectures on the theory of cubature f o r m u l a s part I (1964) part I1 (1965) Novosibirsk (Russian), Izd NGU Tihonov A . N. , Samarskii A. A. Equations of mathematical physics (Russian)

- " ~ a u k a "Moscow

Wilkinson J H. The algebraic eigenvalue problem (1965)

- Oxford, Claredon P r e s s

Faddeev V. K. , Faddeeva V. N. Numerische Methoden d e r linearen A.lgebra Verlag-Berlin 61964)

1966

- Veb. Deutscher

For.sythe G ,and MBller F . B. Computer solution of linear algebraic systems ,Prentice-Hall, Inc. Engle wood Cliffs, N. Y. 1967 ,

Ianenko N N ( $ ) The method of fractional steps f o r solving multidimenllNaukall Novosisional problemd of mathematical physics. birsk 1967 (Russian)

-

- ( & ) Introduction to difference methods of mathematical phys i c s - part I and 2 Novosibirsk 1868 (Russian),Izd. NGU.

G. I. Marchuk 2.

Additional

literature

-

Babuska I. The finite element method- foe elliptic differential equations. I1 SYNNumerical solution of p a r t i a l differential equations 1970 A.cc. P r e s s New Y o r k , London , 69-106 (1967) SPADE

-

-

-

Belotserkovskii 0. M. , Chuskin P. I. A numerical method of integral relations Zh vych. mat. i mat. fi?. 2 , 5 (1962) 731-759 (Russian) Birkoff G . . Varga R.', Young D. Alternating direction implicit methods. Advantages in Comp. , Vol. 3, Academic P r e s s , New York-London 189-273 (1962) Birkoff G. , Schultz M. H . , Varga R. S. Piecewise Hermite !nterpolation i n one and two variables with applications t o p a r t i a l differential equations, Num. Math. 11 (1968) 232-256 Bryan K. A s c h e m e f o r numerical integration of t h e equations of motion on a n i r r e g u l a r grid f r e e of non l i n e a r instability Mon. Wea. Review, v 94, 1, 39 40 (1966)

-

-

Byleev N N. Numerical methods f o r the solution of two-and t h r e e sional diffusion equation Mat Sb. T 51,2,227-238 (1960) (Russian) Varga R. S. Matrix iterative a n a l y s i s

- Prentice-Hall-New

-

dimen-

J e r s e y (1962)

Wachspress E. L. ( j ~Extended ) application of alternating direction implicit iteration model problem t h e o r y 1016 (1963) SIAM. J. v . 11,3,994

-

Gunn J E .

( & ) The solution of elliptic difference equations by semiexplicit iterative techniques . SIAM. J Numer. Anal. v. 2,1,24-25 (1965)

G. I. Marchuk Godunov S. K , Prokopov G. P. variational approach to the solution of large systems of linear equations appearing in strongly elliptical problems. Preprint Inst. of appl. Math. Acad Nauk SSSR, Moscow 1968

-

Dorodnizin A. A. A computing method for solving some non-linear problems of aerohydrodynamics. Trudy 3-rd All Union Math. Symposium 5,447-453 (1958) A contribution to the problem of computing eigenvalues and eigenvectors of matrics. Dokl, Acad. Nauk. SSSR 126 (1959) 1170-1171 (Russian) Dtjakonov E. G. ( + ) Difference schemes with a '1 disintegrating "operator for multidimensional stationary equations. Zh. vych. mat. Mat. fiz. 2 , 4 (1962) 549-568 - ( ~ u s s i a n ) ( # ) The construction of iterative methods based on the use of spectrally equcvalent operators. Zh. vych. mat. mat. fiz. 6 , 1 (1966) 12- 34 (Russian)

-

Douglas J. . ~ a c h f o r dH. ( +) On the numerical solution of heat conduction problems in two and three space variables. Trans. mev. Math. Soc. v. 82, 2,421 (439) (1956) Kellogg

v)

-

( ~.not'heralternating direction 11,4,976-979 (1963) S1A.M J. V . -

- implicit

method

Konovalov A. N ( Y )Numerical methods for problems of. the theory of elasticity. - Izd. NGU Novosibirsk 1968 (Russian) Krasnoseltskii M. A.. , Krein S. G. An iteration process with minimal residuals 31 (73) 315-334 (1952) (Russian) --

- Mat.

Sb. (N. S)

Kreiss H. 0. Initial boundary value problem for partial differential and difference equation* in one space dimension? Numerical so1970 lution of partial differential equations. I1 SYNSPADE A.cademic P r e s s New York-London , 401-410 (1971)

-

-

Courant R. , Friedrichs K. , Lewy H. Uber di partiellen Differenzengleichungen der mathematischen Physik - Math Ann: T. 100, 32 (1928)

G. I. Marchuk Kurihara Y. , Holloway J. L. Numerical integration of a nine-level global primitive equaMon. Wea. Rev. tions model formulated by the .box method. v. 95,8,509-530 (1967)

-

Lhdyzenskaya 0 . A The mothod of finite differences in the theory of partial differential equations Uspehi Mat. Nauk (N. S. ) 2 (1957) 123-145 (Russian).

-

Lax P. D. , Wendroff W. On the stability of difference schemes with variable coeffiComm. P u r e Appl. Math. v. 15,4,363-371 (1962) cients

-

Lax P. D. , Richtmyer R. D. Survay of the stability of linear finite difference equationsComm. P u r e Appl. Math. v.,Q,2,267-293 (1956) Lebedev V. I. On the mesh method f o r a certain system of partial different i a l equations - Izv. Acad. Nauk' SSSR Ser. Mat. -22 (1958) 717 - 734 (Russian) Dirichlet and Neumann problems on triangular and hexagonal % (1961) 33-36 (Russian) grids. - Dokl. Acd. Nauk SSSR 1 Lyusternik L . A On difference approximations of the Laplace operator Mat. Nauk (N. S. ) 9_, 2. (1954) 3 - 66 (Russian)

- Uspehi

Marchuk G I.

-

( T )Numerical solution of Poincarb problem f o r the oceanic circulation - Dokl. Acad. Nauk SSSR T 185 -, 8 (1969) 1041-1044 (Russian) ( On the theory of the splitting-up method. Numerical solution of partial differential equations SYNSPADE - 1970 Academic P r e s s , New York-London 469-500 (1971)

w)

-

Marchuk G. I. , Kuznezov Ju. A. On the question of optimal iteration p r o c e s s e s -- (1968) 1331-1334 (Russian) Nauk. SSSR 181

- Dokl

A.cad.

Marchuk G. I. , Sultangazin U. M. (q)Convergence of a decoupling method f o r the radiation (1965) t r a n s f e r equation - Dokl. Acad. Nauk. SSSR 66-69 (Russian) (j(f) Solving the kinetic t r a n s f e r equation by the separation method - Dokl. Acad. Nauk. SSSR 163 -- -(1965) 857-860 (Russian)

G. I. Marchuk Marchuk G. I. On the foundation of the separation method for the equations of radiation transfer Zh. vych. mat. fiz. 5 , 5 (1965) 852-863 (Russian)

(x)

-

Marchuk G. I , Ianenko N N. ( * ) Application of the fractional steps method. to the solution of problems of mathematical physics Proc. A11 Union Conference of Num. Math ,Moscow February 1965 - Proc. Congress IFIP, New York May 1965 P r o c "Some questions of applied and numerical math. Wauka1I ~ o v o s i b i r s k5-22 (1966)

-

A.ubin J. P . ,Burchard H. G. . Some aspects of the method of hypercircle applied to elliptic variational problems. ,Numerical solution of partial differential equations. I1 SYNSPADE 1970. Academic P r e s s New York-London 1-67 (1971)

-

Oganesyan L A. , Rukhovets. L. A Variational-difference schemes for linear second-order elliptic equations in a two-dimensional region with piecewise smooth boundary. Zh. vych mat. fiz. 8. 1 (1968) 97-114 (Russian) Analysis of the r a t e of convergence of variational-difference schemes for second-order elliptic equations on a two-dimensional domain with smooth boundary. Zh vych. mat. fiz. 9,5 (1969) 1102-1 120 (Russian)

-

Rivkind V. J a . An approximate method of solving the Dirichlet problem and estimates of the r a t e of convergence of solutions of the difference equations to solutions of elliptic equations with dicontinnous coefficients - Vestik Leningrad Univ. , Sez. Mat. Meh 13 - (1964) 3,37-52 (Russian) Peaceman D W . , Rachford H. H. ( $ ) The numerical solution of parabolic and elliptic differential equations - SIAM J . v. 3, 1 (1955) 28-42 Raviart P. A Sur ltapproximation de certains equations dl€wolution lineaires et non lineaires - J. de Mathem. P u r e s et Appl. ,v. 46,1(1967) 11-107

G..I. Marchuk Richtmyer R. D. Nonlinear stability of difference schemes P r o c . "Some questions of applied and numerical mathematics" "Nauka" Novosibirsk 1966, (Russian) 54-59

-

Samarskii A. A. ( W ) An economical algorithm f o r the numerical solution of s y s t e m s of differential and algebraic equations Zh. vych. mat. mat. fiz. 4 , 3 (1964) 580-585 ( 8) Necessary and sufficient conditions for the stability of double layer difference schemes. Doll. Acad. Nauk SSSR 181 (1968) 808-811 (Russian)

-

Strang W.

Difference methods f o r mixed boundary problem Math. J. , v. 27,2 (1960) 221-232

Tihonov A N. ,Samarskii A. A. Homogeneous difference schemes (1961) 5-63 (Russian)

- Zh

- Duke

vych. mat. mat fiz. l , 1

Thomee V. On maximum-norm stable difference operators. Numerical solution of partial differential equations. P r o c . Intern. Symposium 1965, Academic P r e s s . 1 2 5 - 1 5 1 New York. Fedorenko R F. The r a t e of convergence of an itarative process mat. mat fiz. 4 , 3 (1964) 559-564 (Russian)

- Zh, vych.

Phillips N. A. A.n example of non-linear computational instability . The atmosphere and the s e a in motion- Scientific Contribution to the Rossby Memorial Volume The Rockfeller Inst.

-

Frankel S. P. Convergence r a t e s of iterative treatments of partial differenMath. Tables and Other Aids Comput. v. 4,30 t i a l equations (1950) 65-75.

-

Hubbard B. E.

(y)

Alternating direction schemes for the heat equation in a general domain.' SIAM J. Num Anal. -2 , 3 (1966) 448-463

-

Young D. M. Iterative methods f o r solving partial difference equations of 1 (1954) 93-111. elliptic type- T r a n s . Math Soc.

U. Mosco

CHAPTER 2 and existence Some typical problems -

theorems

1 : Examples

2 : Finite-dimensional and iterative existence theorems 3 : .Variational inequalities f o r bilinear f o r m s in Hilbert

spaces

4 : Direct existence theorem CHAPTER 3 Convergence ----of convex s e t s and of solutions

of variational

lities 1 : The Ritz-Galerkin discretization

2 : Convergence of convex s e t s and convex functions 3 : The "stability?heorem

4 : Further existence theorems 5 : Finite-dimensional approximation, I : the discrete

problem 6 : Finite-dimensional approximation, I1 : convergence of

the approximate solutions

7 : Dual variational inequalities and complementary systems 8 : An example

inequa-

U. Mosco

CHAPTER 1 Minimum ~ r o b l e m sand variational ineaualities: convexity, monotonicity and fixed-points 1 : The direct formulation 2 :' The weak formulation 3 : The linearized formulation

4 : The fixed-point formulation 5 : The epigraph formulation 6 : Minimum problems in normed spaces

7 : Monotone operators and variational inequalities : the linearization lemma

8 : Variational inequalities and fixed-points 9 : Minimization of non-differentiable functionals and

mixed variational inequalities

U . Mosco

CHAPTER I Minimum problems and variational inequalities : convexity, monotonicity and fixed-points The aim of this introductory chapter is to bring to light some simple geometric features underlying the theory of the s o called variational inequalities which a r e also inherent in the problem of minimizing a convex functional on a convex set. Most of the characterizations of the solutions of these problems, indeed ; give-r*

to inequalities involving the differential of the given

functional, which is a monotone map from the space where the functional is defined to its dual. F o r minimum problems in function spaces, these inequalities should be seen a s the analogue of the Euler condition of the calculus of variations, in the presence of unilateral constraints on the solution. The variational inequality approach consists in dealing directly wiih such inequalities, without asstiming "a priorill that the monotone operator involved is the differential of a convex functional. Moreover, e ven in this special case, they can often be used, a s the Euler equation is, to investigate the properties of the solution of the original minimum problem is

-

-

f o r example, the regularity, i. e . , how

smooth that solution

and they also occur in various approxi~lratemethods of solution, a l -

ternative to the direct Ritz approach. Therefore, it is of some interest that some basic general featur e s of convex optimization a r e shared by the monotone inequalities we a r e going to discuss in our lectures. Let u s recall some of the specific aspects of convexity.

U . Mosco F i r s t , any local minimum is actually a global minimum. The r e levance of this property is apparent: only a w l o c a l l linvestigation of the functional and the constraints is required, which id terms,for instance, of computer performances means l e s s information to be memorized. Another motivation for setting

(if possible

) an infinite dimen-

sional minimum problem in a convex framework 'is that "good"

topolo-

gies a r e then allowed. It is well known, in fact, that convex functionals keep their semicontinuity in topologies weak enough to make the set of constraints compact in a suitable function space, under reasonable boundedness assumptions most of the times intrinsic in the problem at hand : the existence of the minimum is then an immediate consequence of

We-

i e r s t r a s s theorem. Another specific feature of convexity is that some linearization is always possible. This i s , on the other hand, the underlying reason for the existence of good topologies. As we shall s e e later, a linearization of the problem is the basic tool in the existence theory f o r infinite dimensional variational inequalities. The equivalent formulations of a convex minimum problem that we shall discuss in the present chapter, can be tentatively named a s follows: I

the direct formulation

I1 the gradient o r

weak

formulation

I11 the linearized formulation IV the fixed-point and the iterative -fixed -point fo m u l a t i o n . V the epigraph formulation. Further methods that should be mentioned a r e the VI

duality and minimax methods and the methods based on

VII

penalization and regularization

U. Mosco

We should perhaps also say that many of these approaches a r e often adopted at the same time and can be variously combined with the finite-dimensional approximate methods of solution we shall

talk about

in the following chapters. We shall fi+st intuitively sketck I

.. .

V in Sections 1 . . . 5 below,

postponing the rigorous proof of their equivalence to the later sections., where we shall also discuss further properties of monotone variational inequalities. A partial account of the duality theory will be given in the Section . j

of Chapter 3, whereas the other subjects mentioned in VI

\

and VII above will not be treated at all in our lectures, thsugfT important they are. Let us mention that a &tailed discussion of the methods based on penalization and regularization devices with regard to variational inequalities, can be found in J. L. Lions [I] , while an account of the duality and minimax methods can be found in J. L. Lions, R. Glowinski and

.

R. Tremolieres [I] 1

.

Direct formulation

Let us consider the problem of minimizing a real valued convex function ce

F

on a convex subset

K

of the n-dimensional euclidean spa-

E~ , i. e . , the problem u

0 and all u, w t X we have

F(u + t w )

I1

F

is

DF

11

v, u

and then addi-

1.

it i s a consequence of the continuity of

:

hence also of

t

(DF(u + t w), w)

u , w t ? X ,

U . Mosco

which is a well known elementary property of differentiable convex functions on the r e a l line. In fact, we can replace the vector

that belongs to

K

which to the limit t H IDF(U

+

t w),

Remark 6 : rentiability of

v at

F ( u ) +

(iii)

DF

DF: X (i)

X

(DF(u), v

is monotone, -

-

u)

, with

~ , V E X

i. e.,

Proof : The implications (i) J (ii)

----\ (iii) have been shown

in the proof of Proposition 1. The proof of

(iii)

(ii) is based on the formula

that i s obtained by integrating the function of

t L

- F(u + dt

from indeed

0 to

1

t(v

-

u) = (&(u

+ t(v

-

t u)), v -. u)

and then applying the mean value theorem. We find

U. Mosco

F(v)

- F(u) = (DF(u), v

- u) + (DF(u + T(v - u)) - DF(u),

v

- u) ) (DF(u), v - u),

since (DF(u by the monotonicity of

+ T(v -

u))

-

DF(u),

(v

-

u))

)

v

G X, o < X < l .

0

DF.

Let

(ii) 3 (i) :

u =

Avl

+

(I

-

A)v2

,

v

1'

2

By (ii) we have

By multiplying the f i r s t of these inequalities by t e r one by (1

-

)

and adding up,we find

which is the convexity inequality .

1

and the l a t -

U. Mosco

-

7 . Monotone operators and variational inequalities :

--zation

lemma. Any problem such u € K :

11

with K

the lineari-

K

a s I1 of Proposition 1, that is

(Au, v

-

>

u)

'dv

0

a convex subset of the normed space

into

' X

X

L K , and

A

a map of

, is called a variational inequality.

As we said in the introduction and the discussion up to now should have shown, variational inequalities involving monotone mappings a r i s e naturally in connection with the minimization of a convex functional subject to convex constraints

and

s h a r e indeed many important proper-

ties with these problems, even if the map

A

is not the differential of

a convex functional. F o r instance, whatever the map i . e . , a vector

u

of

K

(Au, v

A

is, any local solution of

such that for some

-

u)

6

>

O

forall

) 0

11

v

f

where

is actually a global solution of

the vector

v

that belongs to

I1

.

h his

can be seen by replacing

in the inequality above with the vector K

for

t

>

0

s m a l l enough

u

+ E

{v

- u)

.J

Moreover, the solutions of variational inequalities involving any

U. Mosco

monotone mapping having sdme mild continuity property can still be characterized, like the solutions of convex minimum problems, by means of a s y s t e m of infinite linear Inequalities. In fact, if we inspect the proof of the equivalence

I1 ( ; 1 I11

of Proposition 1, we note that only the following properties of the map A = DF

have been used (see also Remark 6 ) :

is monotone, i. e. , (Au

- Av, u

-

v)

>0

u, v

C 'X

and hemicontinuous, i. e . ,

t c+

(A(u

+

tw), w)

is continuous at

o+.

U,

w

X

Therefore, we can state the following basic lemma, which is essentially due to G . J. Minty [ I ]

Let

(see also F. E. Browder

be 2

C8] )

and hemicontiof the normed space X to itsdual ' X . Then, -for any nuous map ---convek sbbset K of X , problems I1 and I11 -below a r e equivalent Linearization Lemma :

I11

u L K

:

(Av, v

A

-

U)

>

0

monotone

v

C K .

U. Mosco Corollary : The common set of a l l solutions and -

u

----

I11 --above is convex : it is also closed, provided

of problems K

I1

is closed. --

Perhaps the most important consequence of this "linearizationI1 of problem I1 is that the inequalities in I11 a r e stable under limits in

u

in the weak topology of

X, contrary to f h a t occurs for inequalities

u

11, where we can take weak limits in

only if

A

has some

compa-

cteness property. An essential use of this property of the equivalent linearized problem I11 will be made in Chapter 3, when we shall dead with the existence and stability of the solutions of variational inequalities such as I1

. Remark 7 :

example if

If the solution

is an interior point of

u

K , for

K = X , then I1 reduces to u

E

K

:

(Au, w) = 0

b'w

E X

,

which is the weak form of the equation

(cfr. 'the' Corollary of Proposition 1)

.

Even in this case, however, the linearization lemma is meaning

-

ful, sfor it states the eQlivalence of this(non-1inear)equation with the system of linear inequalities satisfied by the solution u :

U. Mosco

Remark 8 :

An interesting cesult due to

T. Kato

that a monotone hemicontinuous mag on a convex= space

X

[ 11

affirms

domain of a Banach

is always derhicontinuous , i. e . , continuoue from the strong

topology of

X

to the weak*

topology of

ps l e s s surprising if we think of the case

X

*

(this property is perha-

A = DF,

F

being a differe-

ntiable convex functional). Let u s remark, however, that what i s really required to have the equivalence I1 micontinuity of

A

I11 is the

only on the set

hemicontinuity on a convex subset

K

K

monotonicity and he-\

and not on the whole of of

X

X, and

is in general a weaker p r -

operty than demicontinuity. F o r the boundedness properties of monotone operators s e e also F. E . Browder Remark 9 : on

X,

If

A = DF,

then for any finite s e t

of vectors of

X,

we have

[3]

F

and

R. T . Rockafellar

[4J

a differentiable convex functional

.

U . Mosco

(see (2) of Section 6) and adding up a l l these.inequalities we find

Therefore, not only the monotonicity condition is satisfied by the map

A = DF, indeed a whole family of l l c y c l i c l l inequalities a r e also

satisfied,

each one corresponding t o a finite subset of

is cyclically monotone

.

X : the map

DF

As Rockafellar has shown, this property is the

basic one occurring in the characterization of the monotone mappings which a r e the differential of convex functionals, s e e R. T. Rockafellar

I

.a 8

.

Variational inequalities and fixed-points

Let u s come back now to the relation between variational inequalities and fixed-points. Even if the reduction ~f a general variational inequality to a fixed-point statement can be realized in any smoothly normed Banach space (see Remark 1 3 below), we shall a s s u m e f o r the sake of sirriplicity that our problem takes place in a Hilbert space framework. We shall make u s e of the following tools, both of which depend on the specific inner product of the given Hilbert space

V

and not m e -

rely on the topology induced by it :

J

(i)

the duality Riesz isomorphism

(ii)

the weak characterization of the Riesz projection

the convex s e t

K.

of

V

onto

V

PK

on

U. Mosco

The Riesz If

V

ng between

isomorphism

J :

is a (real) Hilbert space,

v C: V

and

v * V* ~ and

* ,v)

V* i t s dual*(v (u

I

v) the inner product in

then

is the map defined by the identity

the .map J

is an (isometric) isomorphism of

We can use the inverse of

to represent any given map

by

the map

J

the pairi-

V

onto

v*.

V,

U. Mosco

Remark 10 :

If

&

V, then

functional on

A = DF, = J-I A

The Riesz projection P~ : -.K i s a convex subset of V

If

F being a differentiable real valued i s the gradient

and

dl?

of

F

:

z 6 V , the vector

is defined to be, if it exists, the unique solution of the minimum pro-

blem

where

I w 11 It

is

(w

=

I

K

.

well known and it can be elementary proved by using the pa-

rallelogram identity in u = P

w) 112

z on K,

V, that any vector

provided

K

Lemma 2: Then, given

z

Let -

6 V

u

of

V

has a projection

is closed.

It is also clear that a vectnr problem above if and only if

z

u

is the solution of the minimum

is the s o l

of the pmblem

K ----be a convex subset of 5 Hilbert space V

, we have

-.

U. Mosco

if and --

only if u ( K Proof:

( u - z l v - n ) > O

:

The functional

V ,with

is differentiable on

(that is,

VF

= I

-

z,

I E identity of

V)

.

Therefore , (DF(u), v -u) = (J(u - z), v

- u)

= (u

-

1

V

-

U)

and the lemma follows a s a special c a s e of t h e equivalence I tj I1 Proposition 1

.I

The weak characterization of P K of l a t e r on :

ve that

of

P t u r n s out t o be useful t o p r o K does not i n c r e a s e distances, a property we shall make u s e

Corollary

of

Lemma 2 :

PK -is non-expansive,

i. e.,

U. Mosco Proof : Let u s write the weak characterizations of

,

and replace

u

1

= P z

K 1

and

u

2

= P z K 2'

v = u

in the f i r s t inequality, 2 Adding up,we then find

v = u

1

in the l a t t e r one.

that is

(ul

-

u2

I u1 -

u2) 6 (zl

-

z2

U1

-

u2)

hence, by Schwarz inequality,

Now we a r e ready to prove the fixed-point characterization of a variational inequality

sketched in Section 4

PROPOSITION 2 : Let K and -

A

-a -map -

of

K

into V * -

.

be a convex s e t of 5 Hilbert space V ----. Then, I1 and IV --below a r e equiva-

U . Mosco

lent : 11

where

I

u

c

is the v * m v .

K

:

-

(AU, v

identity -map of

u) 2

V

V V E K

o

, J

the canonical

Remark 11 : In t e r m s of the map

= J-'

isomorphism of

A,

I1

and

IV

above can be written respectiveiy as u E K :

(f2u I v - u ) > O

v v

EK

and

Proof of Proposition 2 : It suffices to write the

weak characte-

rization of u = P z K

with

,

u

-

pJ-'Au.

In fact, we find u 6 K

:

which i s to say, since

(U

-

(U

y o ,

- g

-

J-'A

U)

I

v

-

u), 3

o

VV c

K

U . Mosco

where

I

(J-'A~

v

- U) = IAU, v -

.

U)

E

Remark 12 : The fixed-point characterization IV of problem I1 is not intrinsic : if we change the inner product in

one, then the dual

V

fi

of

V, hence also

A

V

to an equivalent

and problem 11, does

and dt = J-'A will change. The not change, whereas P K of the inner product will also effect the range of values of make the map

PK(I

-

f

A

3

)

a contraction in

V

choice which

( s e e Remark 5).

We shall come back to this point in the following chapter, when we shall discuss the iterative methods of solution of problem I1

.

Remark 13 : The weak characterization of the Riesz projection above, Holds i n any Banach space.

PK , hence also Proposition 2

w b s e norm is Gateaux differentiable (outside the origin) we can take

where

7

J

t o be any duality mapping of

F(u) +

(DF(u),

v

-

u)

which is a consequence of the convexity of Remark 16 : zing

F

F

. 1

As the proof above shows., a vector

+ G always satisfies the inequality 11'

ble functional

G

is not convex

u

minimi-

even if the differentia-

.1

Another way of looking at the inequality 11' above is to regard it a s a unification of the direct and weak formulations of minimum problems. In fact, while IIt obviously reduces to the problem u E X when

F

:

G(u) 6 G(v)

b'v

E X

0 , on the other hand, it is easy to verify that

IIt is equi-

valent to the variational inequality

when

G

is taken t o be the indicator function of the s e t

K

.

Similarly, .we can generalize both the direct minimum problems and the variational inequalities discussed s o far, by introducing inequalities of the form

U . Mosco

(Au, v

11''

u 6 X :

with

A : X W X *

- u)

and

2 F(u)

F

-

F(v)

v

v

E X ,

: X H ( - m ,

It turns out, however, that such ttmixedw variational inequalities only apparently a r e m o r e general then the original ones. In fact, by making use of the epigraph formulation discussed ,in Section

5

, we can equivalently write problem 11" above a s a v a r i a -

tional inequality a s those considered so f a r . In fact, let us consider now the product space

and the inequality : rcl

I1

.

where

?i

"'w

(Au, v N

N

ly

K

F

w

X

of

I:V.~J

i s the epigraph of

V

"'

u) >, 0

A x 1

i s the map

A( and

-

=

into i t s dual

CAV,~~

$ p*

X

G

~

-

X x

-

*

,

R, i. e . ,

I"'"1

:

The following lemma holds : Lemma3:

Let

A : X H X*

,

F : X n (-m,

+ m J

,

U. Mosco

+

F

oo

above and --

G

.

Then, a vector F(u) =

of

fu,dl

=

every

r

If

> F(v)

u

$ = [u,

>F(u)

when

+

-

u)

+

e"

I

F(v)

- F(u) =

I.(

p

~ ( u] ) ,

7 = [v, p] t u, d l is

a,

11"

11.

solution of

a

- U) +

, v

u =

andforall

F(v) =

is

= (Au, v

Conversely, if O(

solution of problem

,

then

for

we have

0 6 (Au, v

where

and only --if the vector

is a --

g-_asolution of problem

X

Proof :

X

d E R , if

,

a(

A4

of -

u

CV,P~

:=

-

F(u)) =

w

, therefore I1 holds. a solution of

with

lu

11, then

J3 a F ( v )

the inequality in 11" is trivially satisfied

[notethat

]

,

we

have -'N

N

0 ,< (Au, v

-

"4

u) = (Au, v

Therefore, by taking cular, that

46

p

- u) +

v = u

= F(u), hence

and, moreover, f o r all

P

I"'

and

j 3 - d .

= F(u), we find, in p a r t i -

U . Mosco

0

,
, F(u) - F(v)

v €

F ( v ) ,< b

X,

o( = F(u).

We shall make u s e of this fact l a t e r .

Remark 18 : differentiable convex

A different approach to the minimization of non F

consists in making the subdifferential

F take the role of the differential

DF

the, in general multivalued, mapping of each of

F

u

of at

X

. X

Let us recall that

zx*,

into

3

aF

u* E X*

defines a support hyperplane of

F

such that at the point

tangent one). In other words, f o r each

u

of

v u

X ,

of is

which associates

with the s e t (possibly empty) of a l l subgradients

u, i . e . , of all

F

uY

F(u)+($ v

-

(not necessarily a

u)

U . Mosco

It should be noted, indeed, that all statements about the vector u = DF(u)

of

we have made s o far, involve

u

only a s a subgradient

F . This approach yields naturally to variational inequalities invol-

ving multivalued monotone mappings. We r e f e r t o F. E. Browder 1143 R. T . Rockafellar

.8

C6]

As general reference on the theory of convex functions, let us only mention h e r e R. T. Rockafellar

[ 11

J. J. Moreau

[I]

A. Ioffe

,

[77

- V.

, J. Stoer and C . Witzgall Tikhomirov [I]

.

Monotonicity properties of operators in Hilbert o r Banach spaces have been investigated R. I. KachurovskiL Ll]

, l2J

,[3]

M. M. Vainberg and R. I . Kachurovskii 1

111

l21 ,

E. H. Zarantonello [I]

, G . J. Minty

,F..E. . ~ r o w d e rs e e ref. quoted ih F.E.B., D51, T. Kato rl]

R . T. Rockafellar

3

]

and others. More references and a survey

of the theory and i t s applications can be found in R. I. Kachurovskii and

F . E. Browder

C3]

loc. cit.

Specific references to variational inequalities will be given in the following chapters.

U. Mosco

CHAPTER 2

Some typical problems and existence theorems 1 : Examples 2 : Finite-dimensional and iterative existence theorems 3 : Variational inequalities for bilinear forms in'Hilbert

spaces 4 : Direct existence theorem

CHAPTER

2

TQe existence results and the methods of approximation of the s o lutions of variational inequalities such a s

u E K

11

where

K

:

(AU,

V V E K,

V - U ) > O

is a convex subset of a noruled space

tone map of

K

into

X*

X

and

A

a mono-

, a r e essentially based on the characteriza-

tions of problem I1 we discussed in the preceding chapter. In fact, the three main ideas underlying this existence and approximation theory can be summarized a s followsi : 1

(ij Reduction of I1 to a fixed-point problem and application of a

fixed-point theorem such a s Brouwer's o r Schauder's theorem or, more constructively, the contraction principle which also

yields an iterative

algorithm for the approximation of the solution ; (ii) Reduction of I1 to a direct minimum problem (when the map

A

is the differential of some convex functional

F), what makes it pos-

sible to apply the classical existence theorems of the calculus of variations, ,The solution can be evaluated by combinkg a finite-dimensional approximation of Ritz type together with some method of finite-dimensional convex optimization ; (iii) Preliminary restriction of I1 to finite-dimensional subspaces

of

X

. where a solution can be found by applying any one of the fore-

going methods, and then linearization of the problem to get a solution in the whole space. The constructive aspect of this approach comes out a gain from a combination of a finite-dimensional discretization of Ritz-Galerkin

type and iterative algorithms o r methods of convex optimization

U. Mosco f o r the evaluation of the solution of the finite-dimensional approxim3te problem: In the present chapter we shall present some of the existence and approximation results that can be found along the lines mentioned in (i) and (ii). above, while a more general existence theorem, based on the Linearization Lemma of Chapter' 1, and a description of the tdi'adretization p r o ~ e d u r e s ~ w ibe l l postponed to the following Chapter 3

.

Moreover, a s a motivation to all problems discussed so f a r in an abstract framework, we shall describe below some typical examples of minimum problems and variational inequalities involving integral func t i o n a l ~and partial differential operators. Most of these problems a r i s e in the mathematical description of the equilibrium of a physical system subject to unilateral constraints. As everywhere else in these lectures,we shall confine ourselves to problems of elliptic stationary type and we refer to the lectures of J. L. Lions and R. Glowinski at this Course for all that riational inequalities of evolutive type.

concerns va-

U . Mosco 1

.

Some variational inequalities

-. .

Everywhere in this section we shall denote by pen subset of the n-dimensional euclidean space

R

E ~ by ,

a bounded o-

f

its boundary.

We shall begin our list of examples with a quite classical pqobelm, that does 'nt involve unilateral constraints. Example 1 : The Dirichlet problem :

-A

where

R, say,

is the Laplace operator and

f

is a given function on

f G ~ ~ (. 0 )

The variational

( o r weak) solution of (1) is the function

u

that minimizes the energy integral

H: (R) : this is. indeed, the classi-

over the appropriate Sobolev space cal Dirichlet principle

he space

.

1 H (L?) is made of a l l functions f C L ~ ( R ) whose 0 9 v. , i = 1, . . . ,n still belongs to L2(R). distribution derivatives

a x,

I

and m

e t r a c e on the boundary

r

of

R

vanishes

.

With the n o r m .

U. Mosco

Hb(R)

is a reflexiue Banach space, whose dual,

can be identified with a l l distributions

T

on

R

that can be (non uni-

quely) written a s

the duality pairing between

v E Hb(R) and

T

The functional

i s differentiable on

1

H (0) , with differential 0

H

-1

(0) being given

I

U. Mosco

given by

(-

nu,^)

=

d Fo dt

(U

f3u

+ tv) t=O

/3v

dx,

n

The identity above,indeed,provides the variational definition of the Laplace operator

and the bilinear form at the right hand

i s called the Dirichlet form

.

The variational solution of the Dirichlet problem can then be characterized as the solution of the problem

U. Mosco

a s i t follows, for instance, from the Corollary of Proposition 1 of Chapter 1

. As for the existence of

u, i t suffices to show that the energy in-

tegral above attains its minimum in the space

Hb(R) , what can be do-

ne, f o r instance, by applying the general theorem on the existence of minima we s h a l l give in Section 4 of the present Chapter. Remark 1 :

-

To replace the Laplace operator

, in

the

problem above, by any 2d o r d e r elliptic partial differential operator of the form

where the

ai,(x) a r e bounded measurable functions on

the ellipticity

satisfying

condition

amounts only to replace the Dirichlet form the form

R

a(u, v)

in problem ( 2 ) with

U. Mosco

However, this generalized form is

not

the differential of the fun-

ctional

unless it is symmetric, that is, the coefficients

a..

satisfies

l . 1

a..(x) = a..(x) 4

....n .,

a. e.

i,j = 1

J1

The proof of the existence of the solution of this

generalized Di-

richlet problem cannot be obtained, a s before, by using the direct variational formulation. We must appeal, indeed, to a well known theorem of Lax and Milgram that will be recalled later.

a

The existence theorems for variational inequalities we a r e going t o discuss in our lectures can be seen, and in fact so they were first obtained, as a generalization of the Lax-Milgram theorem to problems involving unilateral constraints

.

The simplest example of problems of this type is the following Example 2 : The capacity problem : Given a compact :subset

E

of

that .minimizes the Dirichlet integral 1 v 6 Hi)(0) , such that v > l

on

E

Fo(v) over: the

in the sense of

[ w e say that. v 2 the'limit in the norm of

0, we look for the function all

Hb(R) ,

E

in the sense of

1

v is 1 H 0 ( 0 ) of a sequence of smooth functions which '

1 on

cone of

u

Ho(S2) if

U. Mosco are

2 1 on E Therefore, our problem now i s

The solution

u

is called the equilibrium potential

of the minimum is the capacity of the s e t

E

in

R

and the value

.

By applying Proposition 1 of Chapter 1, now we find that the lution

u

so-

of (3) can be characterized by means of the variational inequa-

lity

where

a(u, v)

i s the Dirichlet form ( 2 )

Remark 2

.

.

The variational inequality (4) is the

formulation of this capacity problem whenever

a(u, v)

only

possible

i s the generali-

zed Dirichlet form associated wlth a non-symmetric elliptic operator a s in Example 1

.

L

In this case, ( 4 ) has to be taken a s the definition of

the equilibrium potential

u

on

E

relative to the operator

L

in

R.

The capacity theory f o r non-symmetric second o r d e r elliptic p. d. o. was started by G. Stampacchia

[I]

and his variational inequality appro-

ach to this problem was extended to other variational problems with unil a t e r a l constraints, both of stationary and evolutive type, by J. L. Lions

U. Mosco

and G. Stampacchia

E] . Many other problems. of this tm arising in phyaics

and engineering have been (investigated 8inc.e t h o ) .by maqy authors in the light o'f the theory of variational inequalities. The main reference in this r e gard

is the recent book of C.

Duvaut and J. L. Lions

rl]

.

Let us also mention that many unilateral problems of mechanics and hydrodynamics have been also investigated by J. J. Moreau [3]

, by using

the methods of convex analysis. @ Example 3 : The l.!ohataclefl problem. We now want to minimize the Dirichlet integral F0(v) over the cone 1 of a l l functions v of H (R) which a r e )a. e. of a preassigned function Y, 0 in R . The function Y/ will be called the obstacle and we assume that Y, i s such that the cone just defined i s not empty. F o r instance we may have

y

2

H1(n)

E L (R) and a l l distribution derivatives yx. of [that is, 2 1 y also belonging to L (0) the t r a c e of on being < 0 a. e.

E

Y/

1,

r

The minimizing u i s the solution of the variational inequality

It can be shown that if I i s the closed subset of S2 where, formally, u =

y

, then the solution u satisfies the conditions

u Z \ t / u =

y

a.e. on I

andand

u s 0

in

R ,

- Au=Oin R - I

Thus, u, i s super-harmonic all over R and harmonic outside the s e t I where it fltouchesllthe obstacle.

A similar problem which includes the capacity problem, consists in minimizing F

over all functions v which a r e ) only on a given compa0 ct subset E of R (and now v > U( on E has an analogue meaning than the conditicn v >, 1 on E in Example 2). If E is a (n-1)-dimensional manifold in 9 , the problem at hand may be called a "thin obstacle" problem.

Problems of this type have been f i r s t considered by J. L. Lions- and G. Stampacchia [I]

.

The regularity of the solution has been also investi-

gated by many authors. See H. Lewy and G. Stampacchia [I], L 2 ] , [3],

H.

Brezis and G . Stampacchia [I] , H. Lewy [I], c22] , H. ~ r e z i s L 5 1 ,G . Stampacchia L2J , [3] , [4] by C. Baiocchi [I]

. An application to hydraulics has been .

Example 4 : The I1boundarf obstaclet1 problem

recently given

.

The problem now consists in minimizing the functional n

F(v) =

11 i=

1312 R

where F i s a given function on R '1 v € H (R) such that v 2 h

a. e.

on

-

dx

0xi

1

f v d r

51

. say, f r

E L L (a), on the cone of all

,

where h is a preassigned function on

r .

It can be seen that the associated variational inequality satisfied by the solution u corresponds to a boundary value problem f o r the Laplacian

-A

, with unilateral constraints on the boundary

problem can be stated a s follows

r . Formally,

this

U . Mosco

A detailed discussion of this example can be found in C. Duvaut

and J . L . Lions

.

1

Let us only remark here, following J. L.

Lions , R. Glowinski and R. Tremoliers

(11

,

that the conditions

-

(6) can be interpreted a s describing the stationary equilibrium of a fluid in a region

R

surrounded by a membrane

to came in and prevents it to leave

R

that allows the fluid

.

If u(x) is the p r e s s u r e of the fluid inside f2 and h(x) is the external p r e s -

p,

s u r e applied on the boundary l e u = h on ver

p

fin > '0' whi; on the other hand, if u > 0, then the fluid is pushed out, howe-

P forbids the outcome,

when the fluid comes in,then

l = 0. The regularity of the solution u hence f-

nn

has been studied, in particular, by H. Brezik-G. Stampacchia [11 , H. Brezis

157, H.

de Veiga 117 , [2]

.

Example 5 : A problem in nonlinear elasticity r h e energy integral

now has t o be minimized on the convex set of a 1 v

HI (R) 0

such

that

1 grad

v

] 61

a. e. in

R

.

This is a problem bccurring in 'the elastic-plastic torsion of a

U. Mosco

bar and it has been studied by many authors, s e e in particular B. D. Annin [I]

, H. Lanchon and C. Duvaut [I]

cil

W. Ting

and C. Duvaut

- J.

, H. Lanchon

Cd

, T.

L. Lions, loc. cit., where more in-

formation can be found.The variational inequality characterizing the solution u formally interpreted a s follows : There is a "plasticity regionv R

can be Ro in

where

outside

no,

the function

moreover,

that is, in the region

u

u

R

-

R where 0

satisfies the equation

and its derivatives

ditions at the interface between

O u /b X i

Ro

and

satisfy certain matching con -

R - Ro

.

Like the obstacle

problem, this too is a free boundary problem. Let us also mention that the present problem can be eqyivalently stated in the form of a J1two-obstacles'l problem, that is, condition (17). above can be replaced by a condition of the type

where

and

y2

a r e two suitable functions. We refer to T. W.

U. Mascs

Ting, loc. cit. and H. Brezis-M. Sibony [ 2 1

for more details on this point

The regularity of the solution has been studied by H. Brezis-G. Stampacchia, loc. cit.; see also H. Brezis (51. F o r the numerical solution of this problem see R. Glowinski 3 , J . F . I?ourgat[1] , M. Nedelec ursat

/17, M.

Sibony [27

tl]

, M. Go-

.

-

Example 6 : A Bingham's

fluid

In its direct variational formulation, the ~ r o b l e mconsists in minimizing the

=

over the space

differentiable functional

Hb(n)

The funational above is the sum of the same energy integral F(v) occurring in Example 5, which is obviously differentiable on

Hb(R), and

the non differentiable term

G(v) = g

The minimizing

J

u

1 grad

n

v

I

dx

-

.

i s thus characterized by the mixed variational

inequality

u 6 Hb(R) where

: a(u, v

- u)

2 G(u)

- G(v)

\d v

.

Hb(n)

zl J

U . Mosco

F o r the numerical solution s e e R. Glowinski [I]

.u,

V) =

/i> X,

dx

/?x i

R

-

I

, M. Goursat

fvdx

El]

.

,

52

.

cfr. Proposition30f Chapter 1

The physical motivation of this problem a s well a discussion of the properties of the solution

u

can be found in C. Duvaut

-

J1 L.

Lions, loc. cit.. of elasticity ---with friction on the bbundary. Example 7 : A p r o b l e m Another problem involving a non .differentiable functional is the m i nimization- of

over the whole space

H1(R)

.

The non differentiable t e r m is the bounda-

r y integral

and the solution

u

riational inequality

where now we have

is characterized, a s in Example 6, by the mixed va-

U. Mosco

Problems of this type occur in the theory of elastic bodies subjected to unilateral boundary constraints. Once again we refer to C. Duvaut

-

J. L. Lions, loc. cit, o r

J. L . Lions, R. Glowinski and R. T r e -

moliers, loc. cit. Example 8 : the Laplace -

opeador

Inequalities involving non linear generalization

of

.

A generalization of all,problems considered so far, which is qui-

t e natural from a mathematical point of view thongbit of no direct physical interest, consists in replacing the Dirichlet integral with the functional

F o r every

p >, 2

this is indeed a differentiable convex functio-

nal on .th.e Sobolev sppce

and its differential is the monotone. operator

U. Mosco H ~ ' ~ ( R t)o its dual, associated with the form

from

r

~ have e in fact

Let u s remark that another natural family of convex functionals that generalize the Dirichlet integral is given by

FI.1

=

1 grad v 1

whose differential i s now the operator Av =

- div

(

[ grad v 1 p - 2 grad v)

dx

,

U. Mosco

-

Like \the opetarot. (18), this operator obviously reduces to when

p = 2

.

Therefore, they can both be considered a s natural nori li-

near generalization of the Laplace operator. ~ t ' s h o u l d be also noted that these operators a r e all duality mapH ~ ~ ~ (suitable Q ) normed, cfr. Section 8

pings of the spaces pter 1,

Remark 1 2

of Cha-

.

Example 9 : Inequalities involving m o r e general non linear second order

elliptic, partial differential operators. Let us consider the form

u

(ux , . . . ,ux ), 1 n

X

a r e measurable in a. e . in

R

x

where the functions

t

for fixed

and continuous in

r

for

x

fixed

.

If the functions

ai(x; f

)

a r e of polynomial growth at

oa

in

1,

that is

f o r some

p

with

for every function u we have

1 0,\j \( . being the norm " H1' '(0). 1, P f~~ taking t h e Sobolev imbedding theorem into account, t h e g r o w t h

holds, with

condition (19) could be obviously

weakened, s e e the ref. quoted

below

.I

Therefore, the identity

,

(AU, v) = a(u, v)

defines a map

A

from

Clearly, this map (Au

-

Av, u

- v)

provided the functions

H1"(n)

A

EH~,~(Q)

i t s dual, formally

is monotone, that is

= a(u, u

ai(x;

to

u, v

-

v)

-

a(v, u

1/) , satisfy a

-

v) 2 0 ,

weak ellipticity conditions

of type

Let u s notice t h l if t h e r e exists a function

5 (x;

)

such that

U. Mosco

then

A

i s the differential of the multiple integral

since then we have

a(u, v) =

d dt

Fl-*+ t v ) 1t.O = (DF(u),v)

.

The monotonicity condition (20) is then equivalent to the convexi-

$ (x;

ty of

5

)

in

1

(see also the Remark below)

.

Thus, variational inequalities involving differential ogerators as the operator

A

above arise,for instance,whenever we minimize

an

integral functional like (21) subject to a convex set of constraints. .There i s an extensive literature .onathe partial differential operators of type described above, and of highek order too, and on the related boundary value problems. We only r e f e r here to the papers of E . Browder

[l]

, [2]

, to J. Leray and J. L. Lions [I]

P. H. H'artmen and G. 'Stampacchia rl]

F.

and to

, where variational problems

wits unilateral constraints a r e studied in detail. Surveys of the theor; and its' Applications can be found in F . E. Browder [6]

,

[ 12 R. .I. Kachurovski [3]

[7]

J. L. Lions

.

Remark 3 : The application of the theory of monotone operators to the boundary value problems f o r partial differential operators of elliptic type has brought to a natural' genekalization of the theory.

In

fact,

it

is

U. Mosco more convenient in many applications concerning an operator such a s the a.(x; u, ux , . . . ,ux ) 1 1 n be monotone in the arguments corresponding to the highest derivatives

A

considered above, to require

-

only

,. . . u

u

der terms

1

A = DF, F

-

the

functions

in our example above - and handle the lower o r n u in the example - by using a compactness argument. When X -

being the integral functional ( 2 1 ) , this corresponds to r e s t r i c t

the convexity assumption on the highest o r d e r derivatives appearing in the integrand of

F , a s i t is indeed natural in many problems of calcu,-

lus of variations. '

The operators that a r i s e in this way can be described in the s i m plest c a s e by adding

a compact operator to a monotone one and in gene-

r a l by allowing a more sophisticated intertwining between the monotone and compact components. These so-called semimonotone

operators a r e

also discussed in the references quoted above. In this regard let u s also mention a m o r e general class of operators, the pseudo-monotone operators which has been introduced by H. Brezis

Cl]

, r2-J, E3J , s e e also J. L . Lions C17 .

Example

10 : Minimal surfaces with obstacles

Let u s consider the functional

that gives the area of the surface F

over the cone of all

compact subset

E

of

v

v = v(x), x E R . We can minimize

which a r e >, then a given function

y

on a

R ,

The variational inequality that characterizes the minimizing surface

u = u(x) involves

the Euler operator

U. Mosco

The natural Sobolev space h e r e is HI' '(n), that is, the space 1 1 C L ( 0 ) . However, of all v E H (f2) with all f i r s t derivatives v Xi this space is not reflexive. Therefore, the problem mentioned above cannot be handled with the standard methods diaoussed

in

a u r lectures ,and

ad hoc techniques have been indeed developed. F r o m the extensive literature concerning minimal surfaces, let us only quote the papers by J. C. C. Nitsche El] , M. Miranda [l] , H. Lewy and G. Stampacchia r3] , E. Giusti El] ,

R . Temam El] ,

that specifically concern the obstacle problem.

2 Finite-dimensional and iterative existence theorems

By taking into account the relation between variational inequalities and fixed-point problems (see Proposition 2 of Chapter 1) and making us e of ttie classical'Brouwerl's fixed-point theorem, we can easily profe the following finite-dimensional existence theorem, due to P . H. Hartman and G. Stampacchia Cl] Theorem tkiB euclidean -

,

1 : Let K

space

En,

be a non-empty closed convex subset of -n ,& -a continuous -map of K E . Let

u s suppose, furthermore, -that either -

K

is bounded -o r the -

following

coerciveness condition holds (c) --There exists a bounded --open convex subset v

0

(

K fl B, such that

--

B

of En and - -a vector

U . Mosco

'3

B

tion -

being the boundary of B u

. ------Then, there exists

of the problem

Proof : Let us f i r s t assume that

By Proposition 2 of Chapter 1, above if and only if

of

K

K u

is bounded, hence compact.

-.

is a solution of problem I1

is a fixed-point of the m a p

u

into itself, where PK is the Riesz projection on K. since n E I K is continuous, a s it follows from the Corollary 2 of

PK : Lemma 2 of Chapter 1, the map vided

at least one solu-

is such

p@

.

PK(I

-

)

too

The existence of a fixed-point

now a consequence of Brouwer's theorem

is continuous, prou

of this map is

.

Now let us replace the compacteness assumption with the coerciveness condition (c). By what we have just proved, there exists a solution ii

where

u (

of

5

the problem

= B U

/3 B

Since we a r e assuming that condition ( c ) holds,

cannot belong to the boundary .? B

8;/ v O - 5 ) < 0

of

B , for

we should have

in contradiction with the inequality above. This v>

means that

%

is a local solution of problem 11, hence it is a l s o a

U. Mosco

global solution of I1 (cfr. Section 1 of Chapter 1) Remark 4 : nal of

32

F o r a map

.

Bd

which is the gradient of a

functio-

F , the coerciveness condition ( c ) is related to the growth at F, s e e Lemma 2 of the following Section 4

RJ

.a

Theorem 1 could be extended t o infinite dimensional

Remark 5 :

spaces by making use of the Schauder o r Tychonoff fixed point theorems,

[ld [lq.

s e e F. E . Browder [8] needed by the map

&

However, the continuity assumption application

would then be too strong f o r direct

to problems of the type mentioned in the preceding section. Theorem 1 cannot be considered a constructive^^ existence theorem, because it relies on the deep though non constructive, Brower's theorem. However, we can easily. convect Theorem 1 into a w c o n s t r u c t i v e ~ ls with a constructitheorem, whenever we can replace R r o u ~ ~ e rtheorem ve fixed point theorem. The main example is obviously given. by

the

well known contraction principle. We haven in fact, the following iterative existence theorem : THEOREM 2 : space

V ,

3

-

a map of ---

K

ft

is 2 -

I - 5 %there

Let K be a closed convex subset of a ~ & e r t -

- - - - A

into

-

.

I

--

V , such'that

contraction for some

exists a unique solution

u

y>o .

of the problem --

.

_

U. Mosco

and u -

=

lim

u n

rative Scheme -

in -

V, where the sequence (un )- -is given

the it&--

Proof : Since P : V c--, K is non-expansive (see Coroll. 2 of K L e m m a 2 , Chapt. 1), the m a p PK(I & ) is a contraction provided

1

- 7 eft

is a contraction. which is, f o r a suitable

o u r assumption (

+

)

.

f > 0,

Therefore, there exists a unique fixed point

u

yielded by.the iterative:acheme (IV ), which is also a solution of problem n 11, again by Proposition 2 of Chapter 1 . b Theorem 2 above can be integrated by the following lemma that gives a simple sufficient condition f o r I

& & 2 -m-a p- ofsuch that --

Let LEMMA 1 : space

V (i)

into V, -

be a contraction.

a subset

lipschiztian, i. a . , t h e r e exists

--

(ff

]I &u (n)

- ?8

11

- &v

.\. L

fl i s s t r o n g l r monotone,

Then, -the map I tisfying the bound ---

-

@

-is -a

11 u - 17. I\

K

L

>

of -a-Hilbert -

0 -such that

u,v

,

t K

i . e. , t h e r e exists c > 0 -such that

contraction

&

V

f o r all --

p

"-

U. Mosco Proof. An elementary computation shows that

n (I - g S t , u =

IIu - v I I

2

-

(1 - p & , v

- 2 g r

112

=

dtu-av J U - V ) +
0, with F(vo) < + w and I\ vO (I < R , such that F(v)

f o r all v E K,

> F(vo)

Then, there exists ------

I

u E K :

with 11

at least one solution

N u ) ,< F ( v )

u

for all --

v

11

=

R

of the problem V E K

Proof : The proof i s based on the following two well known results of functional analysis

:

closed convex subsets of a normed space a r e

also closed in the weak topology of the space; bounded subsets of a r e flexive Banach space a r e relatively compact in that topology bounded closed convex subsets of Such a r e then, if

of

F

on

K

.

K

X

a r e weakly compact.

is bounded,

Thus, we have

all level s e t s

.

Therefore

U . Mosco which is to say, problem I above has a solution

u. If the boundedness

is replaced by the coerciveness condition ( c ), we 0 can still draw the s a m e conclusion a s before, provided we show that s o -

assumption of

K

m e non-empty level set of

F

on

K

is bounded.

and R a r e the vector and the constant, respecti0 vely, appearing in (c0), it is easy to verify that all vectors z 6 L (vo) Now, if

v

a r e bounded in norm by

R.

[1n fact, i f there exists

zl E L(vo) with

z2 6 L(vo) such that

it would also exist

would contradict the condition

>

F(z2)

1 z2

F(vO)

11 vl 11 > 11 = R , and

.1

-

-

this

@

Addendum 1 to Theorem 4 : The set of a l l solutions p

R, then

-

u

of pro-

blem I above, under the assumptions -of Theorem 4, is a bounded clop

---

sed convex subset of ----

K

-

.

-

- .-

-

In fact, this set is nothing e l s e than the set (33), which i s bounded closed and convex f o r some ( o r all) s e t s Addendum 2 to Theorem 4 unique, provided

In fact, if (ul

+

u )/2 rG K 2

F

u

:

The solution

is strictly convex on

1

and

L(v) a r e such u

of -

. l(l

problem I

K, i. e . ,

u2- were two distinct solutions of

we would have both

is

I, since

U. Mosco hence

t h a t . contradicts the s t r i c t convexity of

.

F

Let us now suppose that the functional X

and let u s ask how the properties of

F

F

is differentiable on

involved in the assumption

of Theorem 4 may be expressed in t e r m s of the differential We already know, from Section 6 vexity of

F

over, the differentiability of

of F :

of Chapter 1, that the con-

is equivalent to the monotonicity of the map

F

DF DF

.

More-

clearly a s s u r e s i t s lower semicontinui-

ty, a s it follows trivially f r o m the inequality

that implies

l i m inf F(v.) >, F(u) whenever

o r strongly) to a given

u

.

J

v j

converges (weakly

More interesting to investigate is the connection between the coercivity of

F

and the coercivity of

D F . and we shall do that in the

following four lemmas. We a r e now assuming that nal on a normed space and

X

0

K

and R -

> 0

is a differentiable convex functio-

DF : X W X*

an unbounded' convex subset of

K

Lemma 2 : Let DF v

,

F

X

,

the differential ;f

F

.'

satisfy the condition : (d ) There exists -0 -,with )I vo I\ < R , -such that

U. Mosco Then,

F

There exists v 0 g 11 vO 11 < R , -such that

(c,)

F(v)

Rt

v

E K

0

>R

belongs to

. Let

and

K

and -

K

> m0)

Proof : Let

fix

the condition

satisfies

[I

and

vo

R

d R

and

:

> 0, with F(v0) < +

v

z be a vector of K with

[I
0 because we a r e assuming that (do) holds

Now, for every' z

g K

with

I\

z

11

>

R

in the proof of Lemma 2, a vector

t 6 K,

and

with

-11

X,

v 11

=

R , such that

.

Thus,

we can find, a s

U. Mosco

z0

Therefore, if K

n

1 v 11

v :

= R

1 z 1 -+ ao

hence

is any vector that minimizes

F

on

, by taking (34) into account we find

implies

F(z)

+ +

w

. m'

: The lemmas above is false if

X

has infinite dimen-

sion, a s the following simple example shows : X

=

12, space of a l l

Remark 9 sequence

v

I .

( v ~ )such ~ that

hence

Then, we have

and

F(v)

whereas

>

0

v

#

0, according to Lemma 2 above,

U. Mosco

F(v(4)

=

1 -+ 2n

0

on the sequence

though

Thus, in order that

F

in an arbitrary normed space ger sense than (d ) 0

. We have

satisfy the coerciveness condition (cl) X ,

DF

must be coercive in a stron-

indeed the following

the condition Lemma 4 : Let DF satisfy (d )

1

There exists --

Then, ceding lemma Proof and let m> 0

R

>

-. 'Such

lion (dl)

.

F

vo E K

satisfies

--

, such that

the coerciveness

condition

(cl)

'

of the pre-

. . Let

be the vector that appears in condition (d ) 1 0 be such that 11 vo 11 < R and (34) holds for some vo

a consth

R now exists in consequence of our assump-

Thereafkr-', the proof i s the same:as that of Lemma 2

.1

The coerciveness condition (c ) is not stable under a correction of 1 as F(v) by an affine function (vgf v) + c , v * € x*, c C IEt 0 condition (d ) is not stable under the addition of a constant term 1 vo* to DF(v) ,. In other words, the coerciveness (c ) of the map 1 v DF(v) - v *.would depend on the given vector v 0 0

.

*

U. Mosco

'

Coerciveness conditions which a r e stable under the above mentio-

ned corrections a r e those given in the following Lemma 5

:& J

Then,

satisfies the c ~ n d i t i o n

F

DF

satisfy the condition

Proof : It is easy to verify that the lemma at hand is invariant under addition of affine functions to

F

.

Therefore, it suffices to prove

the lemma with regard t o the functional

yr/

F DF = DF

still is a differentiable convex function on

- DF(0)

and we have

Therefore, we have

where

X, with

U. Mosco

by the monotonicity of

5,

f o r a suitable

As

.

-21

0, with

vo

I\
0

Then, ----t h e r e exists at least -

one solution

u

of' the -

variational ine-

quality

u

11

C K

Proof of -If such an

A

(Au, v

:

-

Theorem 5

u) 3 0

' d v C K

under the --

adaitional assumption

i s the Gateaux differential of a function

F , a s we know from the discussion

semicontinuous and, in case

K

F

A = DF

.

X , then

on

above, i s convex, lower

is unbounded, it satisfies the coercive-

( c ) of Theorem 4 . Therefore, t h e r e exists a vector 0 that minimizes F on K . By Proposition 1 of Chapter 1,

ness condition u

of

K

any such minimizing

u

is a solution of problem

Remark- 10

If we want a solution

u 6 K

(Au, v - u ) ) ( f , v - u )

to exist

:

whatever

f

i s given in

x*,

u

I1

above

.

5

of the inequality

V V E

K

then we must assume in place

of the coerciveness condition

(d ) above that the stronger condition 0 holds. The l a t t e r condition i s indeed stable under the addition of a

(d2 ) constant vector to

A , a s we already remarked.

-

Theorem 5 in its' general f o r m i s due to Stampacchia

[I]

and

F. E . Browder

r5]

.

@ P . H. Hartman and G.

The proof of Theorem 5,

U. Mosco

under the additional assumption

that

A

i s bounded, will be given in

Chapter 3 and it will be based on the Linearization Lemma of Chapter 1 and a Ifstability theoremff f o r solutions of I1 convex set

under perturbation of the

that will be proved in that Chapter.

K

Since now, however, we can easily prove that t h e set of all solutions

of problem I1 above hag the same properties than the set of

u

a l l solutions of the minimum problem considered in Theorem 4

.

We have in fact the following Addendum 1

to Theorem

Proof

.

K

of

: Under the assumptions of the theo-

solutions -of the inequality I1

rem, ---the set of. a l l vex subset ---

5

closed con-

: We already proved in Chapter 1, a s a corollary of the

Linearization

Lemma of

Sec@on 7 (where the Benrhmtinuit'' of f was

used) that this set is closed and convex any solution

fi 5;bounded

u

of

I1

.

Moreover, if

K

i s bounded in nbrm by the. constant

i s unbounded, R

appea-

ring in condition (d ) of the theorem, a s i t can be shown easily by taking 0

the convexity of the set of a l l solutions into account

-

Addendum 2 to Theorem 5 provided

A

is strictly monotone

(Au

- Av,

Proof : .If u

1

u

- v) > 0

and

u

2

. Tile solution

. dlI u

of I1 i s unique,

K, i. e.,

u f v , u,v

a r e solutions

C K .

of I1 , then we have both

U. Mosco

hence (Aul that,

-

Au2, u

by the monotonicity

city of

A

-

u2)

of

0

X

. Let

.] v 0 C d i m s u p Mh

,

.)h. (Mh1

be such that for a l l j

Then, for any

v

we have,in oonsequence of (15),

, i. e.,

U. Mosco

and, by our assumption (13), dist (v,

3)

0.

Therefore, (V

0

that is, .

lim (vh j

vo E M

v) = 0

for a l l

v € M,

1 ,

.

Conversely, let us . suppose that (14) holds and let us prove that

for every (13)

I V) =

v

M we have dist (v, Mh) 4 0 , what clearly proves

of

. a vector

Again as a consequence of (15),we can find for every h vh 6

qL ,

with

such that

Let

(MhjIj

be an arbitrary subsequence of (Mh)h

dedness of the sequence which we still call

.

BJ the boun-

( v ~ ,) there ~ exists a subsequence of

(vh.) , that converges weakly to some vector

J j of

and this

V

fore we

(vhjlj ,

v

by our assumption (14), belongs to 0 ' have, since v M ,

It follows, by the arbitrariness of

(Mh )) , that j

M

1 .

vo

.. There-

U. Mosco

-M

Corollary : Let

= lim Mh

above hold. Then, we have

and (14) -

mean that both the inclusions ----

M = lim

%

(13)

if and only if

1 . = lim M~ .

M

We shall also see in Section 6 that if we actually define the convergence of a sequence of subspaces a s in the corollary above, then the requirement.

t r\ 1

"

satisfied : thkt is, P

z

for every

M

of

'

we stipulated at the beginning of our disuussion is

M = lim M h

does imply indedd that

z = strong lim P

z

Mh

V

'.

The discussion made up to now leads us to the following general definition Definition 1 : A sequence (\)

of convex subsets of a normed spa-

c e X converges to a (closed convex) ,subset

K of

X, and then we

wri-

te K = lim K

i n x

h

,

if both the following inclusions holds

w-limsup

K

h

C

K C s-liminf Kh

#

where the limits wre defined a s in (11) and (12) above. Remark 1 : If

Kh C K

quivalent to the single inclusion a

if

K

C

Kh

w-limsup

\

for every C

K

.

h , than

for every

h

, than K

= lim

M

is B

h K C s-lim inf Kh. On the other hand,

K = lim K h

reduces to the condition

U.. Mosco

It could be shown that if

X is a reflexive Banach space, then a

Hausdorff topology can be-intraduced in the space of a l l closed convex sub-

X

sets of

which reduces on sequences of sets to the convergence defi-

ned above. However, we shall not discuss this problem here, and we r e f e r to J. L.

Joly [I]

f o r a general investigation of the topologies f o r

convex s e t s and convex functions and the action of polarity on them. The basic property of the convergence we have just defined (and its

of the topology mentioned above) is indeed

stability under polarit&

a s we already showed in the special case of Lemma 1

.

To make this point more specific, let us consider the Young-Fenchel transform

which associates every 1. s. c. convex function f

:xH

(-a, +co1

(fF+oo)

with its polar function f*:

X*

(-

H

CO,

+ COT

,

given by (16)

f*(vq)=

sup V'G

L(v*,v) - f ( v ) j

f

++

f*

v

* e xr.

X

It can be shown that and

,

f

*

is again a

1. s . c. convex function

i s bijective and involutory, that is,

f

**

=

f

U. Mosco

Let us note, in particular, that when

, where

K

is the support function of is a closed subspace of

re

X)

.

X

. As and

is the.annihilator of

M

space of

(

f = M

, then f

* X

in

H

*

=

K = M

6MA

(hence, the orthogonal

, whe-

sub-

is identified with

i s a closed convex cone with vertex at ze-

SH)*= SH*, where

is-.the polar cone of

--

If

a special case of this, if

, if X is an Hilbert space and X*

M

More generally, if

ro, then

is the indicator

X ( s e e Section 9 of Chapter I),

function of a closed convex subset of then. f4=

SK

f =

(f ) h

H

.

is a sequence of 1.8. C. convex functions on

X

we now

define f = lim f

(1 9) where

f

h

in

X

,

is also a 1. s. c. convex function on

X

epi f = lim epi f

in

,

to be equivalent to

the limit

in the product space

X x

h

R , according

X x

R

to Definition 1 above. Let

U. Mosco us recall that for any 1. S . C .

is the epigraph of

epi g

convex fun'ction

3

.

g , that i s the closed convex subset

of the product space X x

R

.

It; is easy to verify that if Kh

g : X H ( - w, + w

LK>fh

f =

X , then

closed convex subsets of

=

f = lim f

Kh h

'

with

K,

if and only if

K = lim K

h ' Again, if

is a reflexive Banach space a Hausdorff topology

X

can be define on the space of - a l l 1. s. c. convex functions on ced from the analog

X , indu-

topology for closed convex sets we mentioned abo-

ve, which reduces on sequences to the convergence (19) , The stability of this topology under polarity can now be stated as follows THEOREM 1

The Young-Feuched --

tinuous

.In particular,

on the --

reflexive Banach space

only if

f=

lid ;f

bijection

f

H

f*

is bicon--

f o r any sequence (fh) of 1. s. c. convex functions X*

in -

.

X , we have

f = lim f

h

in -

X

if and --

This theorem generalizes Lemma 1 to which it reduces when

.

b

Another speckik case of the theorem above, which, we shall Mh need later, is obtained by taking f to be the indicator function of a clof

=

sed convex cone

H , with vertex at

COROLLARY with vertex ---

at 0

0 :

- (Hh) -be- a

: Let

sequence ---of closed convex cones,

, -in a refled~veBanach space

ce of the polar cones ------

in

X

* .-- --Then we

have

X , (H h

the sequen*) -

U. Mosco H = lim H h if and -

only if Hh .= lim

H being the polar cone of

----

i n x ,

H h H

.

This result could be also proved directly, along the lines of the proob of Lemma 1 F o r the proof of Theorem 1 s e e J . L. Joly, loc. cit. and U. Mosco [6]

. Let us now give a few examples of sequences of convex sets that

converge according to Definition 1. By X we shall always denote a r e flexive Banach space, even if for some of the results stated below the reflexivity of the space is not neede,d.

Let

(a)

be subspaces of -

(b) ' be -

c . . . c M h c

....

:X . Then ,

lirn where

M,CM

Mh = M

M = closure of

,x

Mh

&

Let M I D M ... sulypaces f X . Then, 3

X

3

lim Mh = lk ,

. Mh 3

....

U. Mosco

The Ritz approximation suggeststhe following example ..

2a

-K

(c) Let X

is not -such that --

closed convex subset of

empty, and

X , whose interior

0

K

(5)an increasing sequence of subspaces of

X = closure of

i X h

& X,

.Then*

More generally we have be as in (c) -above and -K -

(c') L e t subsets 0

of

that K n -

X S

converging

# 9

.

Then,

5

S

in

(Sh)

5 sequence of closed. convex

.

Let us assume, furthermore,

X

U. Mosco F o r the proof we r e f e r to U. Mosco [4]

, L e m m a 1.4. (+I

The l a s t example leads u s to the general problem of the continuity of the intersection operation : under which assumptions does K = l i m K

s

= lim S

s

K ll

imply

= um

K

fl S

h 0 we saw that if we drop the assumption K f

At the end of Section 1

?

9

the conclusion of (c) a -

bove may be false. Therefore, some condition on the sequences involved must be imposed. We r e f e r to the papaer of J. L. Joly already quoted, where the problem raised above is investigated also in the form it takes for convex functions

.

Then, it i s the pi-oblem of the continuity of the

so-called inf-convolution

f

l a r operation of the sum

V g f + g

.

which is, roughly speaking, the po-

. see J.

J . Moreau [I]

duces some notion of angle, called codistance convex s e t s quired in

K1

(c')

and

K

2

e (K1, K2)

.

Joly introbetween two

to find an alternative condition t o the one r e -

above, which i s of the type

That a condition of this kind m a y be required to draw the conclusion of ( c l ) can be also inferred from trivial examples such a s that sketched below :

As J. L. Joly kindly pointed out to the author, the additional hypo-

(+)

$ n

tesis u

0 tor

S f

9

was erroneously omitted in that lemma

0

E K that appears in the proof must be replaced,

u

0

EEn

s .

: the vector

indeed, by a vec-

h'

U. Mosco

(d) Let

operator

x

on

a Hilbert space,

X

(ph)

of simmetric

linear

--

v = strong l i m p . v h If K -is a -

is a bounded -------

5 sequence

such that

f o r every

bounded ---closed convex subset of

closed convex subset of

X

v t X

.

X

then

,

and

F o r the proof s e e the author's paper

p]

, Lemma 1.5.

E~hat

p K is closed f o r each given h can be seen a s follows : l e t h v = l i m p wj , W. 4 K ; since K is bounded and weakly closed j J t h e r e exists a subsequence of (w!) of (w.) that converges weakly 3 J to a vector wo G K ; thus, f o r every z E X we have

U. Mosco

therefore,

V

= Ph

W0.7

An important special case of (d) is the following (dl) Let X . be-a- Hilbert space,

(X ) h with

increasing sequence of f i -

Xh dense in X , and, for nite -dimensional subspaces of X .. each h , ph the orthogonal projection of X onto Xh . Then, the conclusion

of

(d) above holds

.

It can be shown by simple examples chat if

K

is unbounded

then the projected s e t s

p K may not converge to K , even if K is h a closed linear subspace of X ( s e e Remark 1 . 3 of Ref. quoted above).

A sufficient condition is then the inclusion h

.

Let us also r e m a r k that if

K

phK

C

K

f o r every

is unbounded then the projected

phK need not even be closed : think of the orthogonal p.ojection of the epigraph of the r e a l function y = l / x , x > 0 , on the x-axis set

of the euclidean plane

x, y

.

Further examples of converging sequences of convex s e t s in Sobolev spaces have been also considered in the author's papers

c53 ,

C4) and

in connection with some perturbed boundary value problems- f o r

partial differential operators. In this regard s e e also L. ~ o c c a r d ; [l]

In the following Section 6 sing in the theory ces

.

we shall consider some e x a m p k s a r i -

of internal o r external approximation of Sobolev spa-

. Finally,< l e t us also mention the applications to approximation

theory that have been given by J. L. Joly in his paper quoted above.

U. Mosco. 3. The lfstabilitylltheorem

Let u s now study how the solution involving a map

A

and a convex s e t

u

of a variational inequality,

K, depends on the s e t

is the problem of the continuity of the map

K

u

K : this

and we shall stu-

dy *it with respect to the topology f o r convex s e t s described in the p r e - . ceding section and the weak o r strong topology in the space We shall f i r s t assume that the map

A

X

.

is kept. fixed while

K

is allowed to vary and we shall mention l a t e r how joint perturbations of

A and

K

can be taken into account.

Let u s consider the "initialff problem

u E K :

(21)

(Au, v

-

u) >, 0

YVCK

and a family of I1perturbed" o r wapproximateftproblems of the form

We shall assume

be a sequence of sets. However, any (Kh)h directed family could be also allowed, with anly minor changes in what follows

. The main result concerning the problem considered above can

be stated a s follows

5 reflexive Banach space and

-X

THEOREM 2 : Let (i) A

a bounded monotone

(ii) (Kh)h

5

subset

of -

K

hemicontinuous. -map of -a domain .D(A)

sequence of subset of

D(A), which converges to -a-convex

&)(A) , according to Definition 1 -of Section 2

.

U. Mosco

Letus assume, furthermore, ----that there exists f o r every h 52 solution

uh of the perturbed problem (22) --and that the sequence (u ) hh is bounded & X . ----T--

if this --

Then the initial problem --solution, u , is unique, u

h

(21) has-at -least - one solution ; moreover, then

converges weakly to

5

u

X

and (Auh)

.

Proof

- Au, uh

-

U)

4

0

Let us prove f i r s t :

(a) Any weak limit

in X of a subsequence . ( u ~ . )of~ approximat6 J -

u

solutions is a solution of the initial problem. 7 -

To simplify our notation, l e t us call (u ) h hand. Thus, we have f o r every h

every for a l l

the subsequence at

By our assumption (ii), we have

w -1im sup K

On the other hand, we also have

K

v E K h

.

C

w = v

C

s-liminf K

is the strong limit of a sequence

We can put

h

K , hence

h' (vh), with

therefore vh

K h in the inequality above and make v

h appear in that inequality, by writing it as

e

U. Mosco

A

Now we use the monotonicity of (Av, v

-

uhj

> (Auh,

and then we go to the limit in

v

at the left hand to get

- vh)

h : since A is bounded, the sequen-

ce , ( A u ~ )i s~ bounded; fat. (uhlh is such; thus we find (Av, v

-

.

u) >, 0

Therefore we can conclude that

u

i s a solution of the lineari-

zed problem

and since we a r e under the assumption of the Linearization Lemma of Chapater 1, this also means that

u

is a solution of problem (21)

.

The second step in our proof is (b) --There exists a solution

u.

of problem -

(21) is unique then the whole sequence in -

(21) and -- if-the - solution of ( u ~ )converges ~ weakly t~ u

X.. Since X

is reflexive, the existence of a solution

u

follows

from (a) and our assu~nptionthat there exists a bounded sequence of approximate solutions. The weak convergence of an obvious consequence of the uniqueness if .into account.

(uh)

u- to u iS th'en h u, once we take (a) again

U. Mosco

The final step is the proof of

(Auh)

-

Au, uh

-

U)

0

Still in consequence of the hypotesis find for every

h

we can

z E Kh that converges strongly to

a vector

On the other hand, since all

K C s - L i m inf Kh,

u

h

u

,.

is a solution of (22), we have for

h (Allh'

-

Zh

.

uh) 3 0

We now write this inequality by making

and then we go to the limit in verges weakly to

h

. Since

u

to appear in it,

A

is bounded and

c

0

uh

con-

u , we find that L i m sup h

(Auh, uh

-

U)

-

-

u) u 0,

hence also

Lirn sup (Auh h

wdich, by the monotonicity of Remark 2

.

Au, u h

A , is clearly equivalent to

Many operators

A

(c)

above.

arising in the applications ha-

ve the property that the weak convergence of a sequence

u h

to a vec-

U. Mosco tor

u

of

together with the convergence of the form

X

(Auh

-

Au, uh

-

U)

to zero imply the strong convergence of of this kind a r e often said to

IS^

wder

.

to h type (S)

be of

u

This is the case, for instance, if

u

in

X

. Operators

s e e F. E. BroA

is strongly mono-

tone : c

Ilv - u

11 2

or, more generally, if

with at

+ 0 ,

f : R+ with

- Au,

4

(Av

A

satisfies

-

v

&

u)

condition such a s

R+ any continuous and strictly increasing function

I+

g(0) = 0

.[1f

the space X

is uniformly

convex, then

this condition can be somewhat weakened, s e e H. Brezis - M. Sibony

[ 133 L e t us also remark that the analogue of condition (c) f o r minimuin problems invalvinga functional gence of

\

to

u

convergence of

uh

for instance, if

F

and of to

u

F

is that the joint weak conver-

F(u ) to Ftu) should imply the strong h and i t is well known that this is the case,

is the norm of a uniformly convex Banach space

In this regard, s e e also the Corollary of Theorem 2 below Remark 3

.

If a simultaneous approximation of the map

.

A

must be also taken into account, them problem (22) should be replaced by the problem

.

U. Mosco where D(Ah)

A is a suitable perturbation of the given h .$ in X containing Kh E U I ~range in X

A , with domain

Indeed, if the maps

A I s a r e uniformly bounded,monotone and h hemicbntinuous and satisfy the convergence sondition

A C s -Lim inf graph

graph

(26)

in

Ah

X X X *

in the strong topology of this product space, then the analogue of Theorem 1, with (22) replaced by (25) above, can be proved along the same lines, see U. Mosco [4]

he ded subset

B

maps of

.

{Ah)

an uniformly bounded in X

X , there exists a bounded subset

if for any bounB' of

X

*

,

such that

A ~ ( Bn D~A,)) c B'

for all

2

h

The following result is included in Theorem 2, a s .it can be shown by using the epigraph formulation of minimum problems we discussed in Chapater 1 :

of

Corollary

Theorem 2

:

Let

1-

f .: X

w,

-1

w2

be con-

-

X H ( - w, + be such that f = lim f in X Eh ' h cording to the definition of Section 2 Let us suppose that there exists

vex and -. for each --

f

.

h

a vector

( u ~ ) ~ bounded reover, if u kly to

u

. Then there exists

4 f h(uh)

X x

minimizes

fh

a vector

and that the sequence u

minimizing

is the unique minimizing vector, then

Proof space

% -that

converges

Apply Theorem

R

into

X* x

2

R

f(u)

uh

f ;mo-

converges =a-

.

to the map

0 x 1 of the product

and to the convex subsets K = epi f

U. Mosco and

Kh = epi f

ter

1

of

X x

R

and use Lemma 3

of

Section 9 of Chap-

. @ Similarly, we can consider a mixed variational inequality such as u g X

: (Au, v

-

ill

)

f(u) - f(v)

Y v E X ,

and the perturbed inequality

o r even take

to depend on h too, and use the epigraph forh mulation together with Theorem 1, o r its generalization with A = Ah, A = A

to obtain a result on the convergence of the perturbed solution

u We h ' refer to. U. Mosco 1 4 1 for more details on this point. See also Theorem 4 below

.

In the following section we shall apply Theorem 2 in order to obtain further existence results for variational inequalities and related problems, In Section 6

we shall apply Theorem 2 to prove the convergen-

ce of certain schemes of finite-dimensional approximation of the solutions of variational inequalities o r minimum problems. Let us, also mention that Theorem 2 could be also useful to investigate the continuous dependence of the solution of boundary value problems on the, pofsibly unilateral, constraints imposed on the solution. Some applications of this typd can be'found in the authorrs pers [43 ,

[&

and in

L. Boccardo [l]

.

pa-

U. Mosco 4 : Further existence theorems We shall assume throughout this section that reflexive Banach space

X

is a separable

.

The reason for the separabilkty assumption, which is indeed unnacessary for the general validity of the existence. results we shall prove below, must. be found in the fact that we shall deduce our results from the "stabilityH theorem of Section 3 and in that theorem only sequences were allowed . To remove this assumption we Kh should rely, instead then on Theorem 2, on the analogue stability result

of perturbed sets

for directed families

(Kh)

.

Let us now prove the general existence theorem,namely Theorem 5 we stated in the last section of Chapter 2, In addition to the separability of the space, we shall now prove that theorem by assuming the map A

to be bounded. THEOREM 3

of -

xX

X

either that --

.

Let K

be 5 bgunded -

a closed convex subset of

K is o r that . A -bounded --

ness condition on -

'

X

.Letus

suppose

satisfies the -following coercive-

K

(d ) There exists 0 such that

-- v0 G -(AV, v

K

R

> 0, with 11 vo 11 < R ,

- vo) > o for a~ v .€ K

Then, there exists a solution ---

Proof

monotone hemicontinuous map

: By the separability of

u

,

JJ v II

=

R

of; the problem

X , we can find an increasing

U. Mosco sequence of finite-dimensional closed convex subsets vO E K

1'

Kh

of

K , with

such that K = closure of

We shall now apply the finite-dimensional existence theorem of Chapter 2 to prove the existence, f o r every

h

, of

a

h

,

solution

the problem

u

h

of

such that

ll Uh II

for all

6

R being the constant appearing in condition (d ) any positive constant l a r g e enough, if Let X,

n = n

h ' Let

h

be fixed and l e t that containes

Kh

? t h : En be an injective map with

jh ' be the transpose of

h

'

+-+

.

X*b+

Then

i s unbounded o r

.

be an n-dimensional subspace of

X

Nh( E ~ =) Xh , and l e t

*:

Th

X

K

if K 0 is bounded,

En

U. Mosco iw

*

r

V) =

I

(y

KhX'

v =

x).

y =

*

Xh

w 21:

The s e t

is a closed convex subset .of

n E ,

bounded if

K

Moreover, the map

i s continuous X,

A

[In fact,

is continuous from

(see Remark X

.

h X to

* with that topology to

En

(%xIx where

x

0 [I v 11 = R

- xo) 1

=

= (ATX ,

h

vo € Ch

.

for

'.

to

E~

X* endowed with the weak topology

n h X , in

.J Finally

bounded, then by the assumption

is such

is obviously continuous from

of Chapter 1) and

8

h

if

turn, i s continuous from

Kh, hence

Ch too, is not

(do) we have Xhx

-

all

x

ThxO) = (Av, v such that

v =

-

vO) > 0 ,

rhx6K

and

C such that 11 Rh x 11 = R . h We now a r e in position to apply Theorem 1 of Chapter 2 , with

, in particular, for all x

1

and B = x € En : K = C h exists a solution x € Ch n B

hence a solution

u = h

Rh x

€

11 Rh x (1 4

R

1

and we find that there

of

of problem ( 2 8 ) , with

.

U. Mosco

Thus, we have shown that there exists a bounded sequence

(uh)

of approximate solutions. The conclusion of the theorem is now a consequence of the stability theorem. Remark

4

: If

the map

A

is only defined on

K ,which is

not an open domain, then the hemicontinuity assumption must be strenghtened

. In fact,

in this case the demicontinuity of

A

r i l y follows from the hemicontinuity (cfr. Remark 8 even we can affirm that sections

K fI Xh

of

A

is continuous from the finite-dimensional

X

* . This latter

property, however, was

needed in the proof above to show the continuity of

Ah

.

Therefore, i t

"a p r i o r i w , in place of the hemicontinuity, when A

has not an open domain Remark 5 :

. of Chapter I), nor

(Xh being any finite-dimensional subspace of

K

X) to the weak topology of

must be assumed

does1nt necessa-

.1

We know f r o m Chapter 1 that the s e t of all solu-

tions of problem (27) is closed and convex. Under the assumption of Theorem 3 above, we also have that this s e t is bounded is obviously the case if

solution dition

u

is bounded. If

K

is bounded in norm by the constant

(do) of the theorem

11 'ii I( >

K

.

In fact, if

.

In fact this

is unbounded, then any

R

that appears in con-

ii is a solution of

R , since there exists at least one solution

u

with

(27) with

11 ull (

R

and the s e t of all solutions is convex, there would also exist a solution lu

u

with

11 11

= R ; hence

and this contradicts

(do)

.

-

U. Mosco

Remark 6

.

The coerciveness condition

(do) can be obviously

replaced by the stronger condition (dl)

There exists

v

K

€

0

, -such that

(Av, v - ~ ) - - + + a ,

as IvII-,oo, -

0

(cf r

v

E 'K

of Section 4 of Chapter 2). We shall now deduce from Theorem 3 an existence theorem for

inequalities of type (29)

u

X

:

(Au, v

-

u)

> F(u) -

F(v)

v v EX

,

by using the epigraph formulation of this problem we already discussed in Sections 5 and

9 of Chapter 1

THEOREM 4 map of

X

lues in (-w,

5 bounded monotone hemicontinuous

Let A -

into X r + WJ .

.

a 1. s. c. convex functional

F

k t us suppose that

lowing coerciveness condition There exists R -F(vo)
0

v

0

-

€ X, with

ll vO \I

< R and

w , such that

-

(Av, v f o r all --

vo) + F'(v)

v E X

wit@

-

F(vo)

IIv II

>

?

= R

Then, problem (29) --above has a solution Remark 7 : Condition stronger condition

:

u

.

(dl) can be obviously replaced by the 0

U; Mosco ( d1l )

There exists vo E -(Av, v

-

vo)

+

X ,

with

F(v ) < 0

+

F(v)

w

as

+

w

\I

, -such that V

11

+

-4

03

(cfr Remark 6 above). L e t u s also notice that i f the effective domain of then v

0

F

is bounded,

( d l ) is automatically satisfied, for it suffices then to choorz: any 0 with F(vo) < + oo and R l a r g e enough, s o that F(v) = t 2 whe-

n e e

\Iv

1= R .

Proof of Theorem 4

.

In t e r m s of the epigraph formulation of

Sections 5 and 9 of Chapter 1, problem (29) can be equivalently written a s follows

where

and N

K = epi

F

.

As w e know from Section 7 of Chapter 1, to find a solution of the inequality above, it suffices to find a vector a local

solution of (30)

K

u

which is

..

To find such a local solution auxiliary problem

ci( of

N

I-

u =

[u,d]

let us consider the

U . Mosco

(31)

#

.

rd

u ,g KR

u

-m

: (Au, v

- ); 3

0

where

i s the intersection of the epigraph

BR x I

is

.

0-L

K

of

F

with the ttcykindert4

Here

X

the closed ball of

a r s in condition

whose radius is the constant R

that appe-

(db), while

I =

[a,

b3

is a closed bounded interval of the r e a l line, which we assume to have been

closen iarge enough, a s we shall specify l a t e r Since the map

?i

.

i s obvioubly bounded, monotone and hemicontinuous

in X x LR and X is a bounded closed convex subset of X x

R, the existence of

a solution 5 = [ u , q of problem (31) is now a consequence of Theorem 3. To prove that

G

= cu,,$]

is also a

problem (30), it suffices now'to show that boundary of the "cylindert1

BR x I ,

local solution

u

of our initial

does not belong to the

which is to say, that we have

U . Mosco

1. 11

< R

and

a < - ( 0 and

.cl > 0

any 1. s. c. convex

,

a r e such that

can be bounded f r o m below in this way

F

3.

and

v

0

being the vector appearing in condition Let u s now interpret

n/

KR

a s the intersection

I%

KR = (epi FR) " X with the " s t r i p f f X x I by putting for

F

r +

cu

F (v ) = F(v ) < R 0 0

+

(db)

x I)

of the epigraph of the functional outside the ball w

.

BR

.

Note that

FR

Moreover, by our previous choice

B we now have,

a ,< inf F

FR

R'

actually

a < inf F

R *

obtained

+

co

,

(33) of

U.

Mosco

[1n fact, by (33), we have a< hence

- coR - cl 6 - c0 ll v ll - c1 6

a < inf

FR

F(v)

Ifv ll < R

f o r all

3

Thus, by Lemma 3 and Remark 1 7 of Section 9 of Chapter 1, since

$

=

[u,d3

is a solution of inequality (31), then

u

is a solution

of the mixed inequality

(35)

u € X

: (Au, v

-

u) 2 FR(u)

-

FR(v)

v'v

L X, F (v.)6 b

R

and

By putting now v = vo in (35), CnoJe that F (v ) = F(vO)-= b by (34)l R 0 we find that F (u) < oo, hence, F (u) = F(u) and then R R

+

By condition

(dl ), this implies 0

(u

11 < R, which is.theifirst

strict bound in (32) we had to prove. It rema'ins to prove the bounds on

4 . ve

Again by inequality (36) above and the monotonicity of

A, we ha-

U. Mosco

hence

o(

.

 inf

FR

>

.

a

@

Theorem 4 was deduced above f r o m Theorem 3

.

However, it clearly implies in turn both Theorem 3. (which is obtained by taking 4

F '- 0 ) and the direct existence theorem, Theorem

, of Chapter 2 (which is obtained by taking A

0 )

.a

Existence theorems f o r mixed variational inequalities such a s ( 2 9 ) have been given by C. L e s c a r r e t

111 ,

611

, J. Lions and G. Stampacchia,

, 193 , R. T. Rockafellar r 6 1 . Theorem 4

F. E. Browder C 8 1

i s essentially due to F. E . Browder, loc. cit. See also U . Mosco [ 2 ] where the proof given above is taken from

,

.

In the remaining of the present section we s h d l apply Theorem 3 to prove the existence of fixed-points of so-called inward non-expansive mappings

of a closed convex subset every

K

L e t us recall that

U

v C K

Uv

the vector

of a Hilbert space

z = Uv

into

V

.

is said to be an inward mapping if f o r

Clearly, any mapping of then we can take

V

and

belongs to some r a y

K

into

h= 1 .

K

i s an inward mapping, f o r

U. Mosco

.me following theorem, that generalizes Schauderls theorem, is F. E. Browder l111 :

due' to

.Let

THEOREM 5 a -

U

be an inward non-expansive mapping of

K

bounded closed convex subset

K f

fl .

U

Then,

fixed points ---

has a fixed -----

a Hilbert space

point in

a ---closed convex

U

of

of

K

.

V

,

Moreover, ---the s e t of all

subset of

K

.

Proof. We know from Section 4 of Chapter 1 that point of

into V -

u

i~ s fixed'

U, i. e . ,

if and only if

u

is a solution of the variational inequality

where

&

= I

- U ,

I = identity map of

V

By taking Lemma 2 below into account, the existence of a solution

u

of (37) and the properties of the s e t of all solutions of ( 3 7 ) co-

m e a s a consequence of Theorem 3 above and the Addendum 1 to The-

.@ . Let U

o r e m 5 of Chapter 1 LEMMA 2

$Z = map -

I

and continuous -

-

U

.

: K

is monotone -

V

be non-expansive.

and lipschiztian, -

Proof. We have f o r every

v, w

of

K

Then,the -

in particular, -

bounded

U. Mosco

(av2

I

&w 2

IIV - W I I

-

v

-

w) =

-

I~UV

IJV

uwv

-

W I

2

1lv -

-

I

(Uv-uw

w ~ r 2 (1

-

C)

v -w)

IIV - w 11

2

,

provided

Therefore, i f

U

is non-expansive

( c = I), then

&=

I - U

is monotone. Moreover,

,
/ 6

d

V,

t Kh

ive must conveniently choose a finite-dimensional skbspace

Xh

of

X

U. Mosco and a -

s

map

5

convex subset A

.

.

of Xfi ~ could e also take into account an approximation

Ah

that containes

being a map from Kh

.

X

to

* with X

Ah

a domain

of the

D(Ah)

Then the approximate problem becomes

.

Here too the pairing (. , ) is the duality pairing between

X

and

x* .I The further choice of a basis

in

%; n = nhbeing the

dimension of

Xh, allows us to associate with

the approximate problem (39) a discrete problem

in the euclidean space Here subset of

En

.

En

is a map of

En.

into

which a r e obtained from

show. Let

Th

:

En

-

be the injective map that brings the vector

En A

X

and and

Kh

h C

a convex

as we now

U. Mosco

of,

of

E~~ .into the function

X

. Clearly

Trh

hence

is a (norm) isomorphism of

E~

There is then a unique convex subset

obviously,

h

C

he

Kh

=

onto

ch

actual determination of

vh

,

lowing Remark 10

En

.

. such that

. can present in practice so-

C'

h

and the basis 3 , Xh 9 that has been done. In this respect see also the fol-

me difficulties, depending on the choice of hence of

of

Xh

,I

Moreover, the map

Th has a transpose

w h o s e kernel is the. annihilator of Y,

in

X

k

, and

U. Mosco

i s a map

of

En:

F o r all vectors

into

En'

vh

and

wh

of

En'

we have

were h Tfh v

,

Thus, a vector

u

v (x) = h

h

wh(x) = h

I (uq) 6 En

if and only if the function problem (41)--

is a --

flh wh

is a solution of the discrete h \(x) = nh u , &

solution of the approximate problem (39) CWe have in fact

were vh =

h Thv,

uh=

h

TThg

,

and

71h

is a 1-1 map

U. Mosco

ch

of

onto

5J

To write down the discrete problem (41) explicitely, let us

com-

pute the components

of the vector

Ah un of

E~

in the

canonical basis

We have

and since

we f a d

~ is expressed, in term of the components Therefore, ( A u')~ h ,uhn ) introduced*in (iiq)q of u , b y the same hnctions A:(U?. Section 1 : h

. ..

'U. Mosco

Thus, the discrete problem (41) is the s a m e a s the discrete p r o blem of that section, namely

We shall not discuss in o u r lectures which a r e the methods that can be used to solve numerically the discrete problem. h L e t u s only r e m a r k that if the map A i s strongly monotone, then the iterative methods of Chapter 2 could be applied.

A

On the other hand, if

is the gradient of a convex functional

and, therefore, the solution of (38) i s also the vector that optimizes a convex function on a convex subset of

E ~ , then the numerical solu-

tion of ( 3 8 ) could be c a r r i e d on by means of one of the s e v e r a l methods available in convex optimization, s e e f o r instance F o r a linear

A

, also

J. Cea [2]

.

pivoting methods a s those typical of line-

a r programming can be also tried. We shall s e e an example of that in the l a s t sectioq of the present Chapter. The role played by the euclidean m e t r i c and the c a -

Remark 10 nonical b a s i s

r

1

= (

$ql)q,

. ..

in associating

the discrete problem

( 4 1 ) with the approximate problem ( 3 9 ) can be taken by any inner product

. 1.

and any orthonormal basis with respect to that inner product. 'This

change will naturally affect the maps hence also the subset

ch

of

Rn

and i t s transpose h and the map of Rn

flhX into

r

U. Mosco

itself. Since the discrete problem (45) will .be modified in consequence, a suitable choice of the new m e t r i c and a new basis can facilitate the actuaJ numerical solution of (45). In this regard, l e t u s r e c a l l Remark 8 of Section 3 of Chapter 2

n

r ~ h map e brings the- vector

. &ill is the l i n e a r map of

of the new basis

Rn

into

X

that

chosen in Rn into h , f o r any the functions of the basis (40) of Xh ; hence , n h v h h a r e now the compov € R~ , is given again by (42), where (vs)s h nents of v , in the basis ( ) While the subset h IT;' Kh changes accordingly only to the change of Rh , the C =

rp: 5S

nh*

Ah =

A

qh

)

.

5

map

(

will be also affected by the change of the me-

tric, on which the transpose

'

nhr

depends. Let u s also notice that the di-

s c r e t e problem is still given by (45) above, where the functions Ah h a r e the s a m e a s before, provided the components (vqIq (uq)q of

.

vh

and

uh

a r e now taken with respect to the new basis in

Remark 11 When ctional

F

on

DF

R~

.]a

of a (convex) fun-

X , we know that the variational inequality (38) charac-

t e r i z e s a solution

(46)

is the differential

A

.

u e K

u

of the minimum problem :

minimizes

F

on

K

.

This suggests that we could directly solve this minimum problem, by taking an approximate problem (47)

%

E Kh

minimizes

and then, once the basis

(

F

h Ts)s

on

K

h

has been chosen in

X numerically h'

U. Mosco

solving the discrete euclidean problem

(48)

u

h

Here

ch

r

minimizes

h C =

-1 TIh

F

Kh

d

on and

h

C

,

IV

F = F e R h , that i s

It should be remarked, however, that the discrete variational inequality associated with the discrete minimum problem (48) above, that

is

i s the same a s the variational inequality (45), where

that we may find by discretizing the variatloniu inequality

associated with the initial' problem (46). In fact, we have

.

U. Mosco

h v F ( u ) in the canonical basis

and the components of

(es)

of

B"

a r e given by

for

T h es =

CPf:

h Rh u = uh =

and

h uq

p 4 . Thus,

Riesz-Ga-

lerkin discretization and weak characterization of minimum problems a r e trcommutingt operations 6. Finite-dimensional approximation, I1 : convergence of the ap-

proximate solutions

.

We shall now give conditions on the map

A

and the approxi-

mants convex sets Kh in order that the approximate solutions converge to the solution

uh(x)

u(x) of the initial problem.

As the stability theorem shows, the most natural convergence to b; expected from the sequence

(u' (x)] is the weak convergence

h X - where .A

u (x) to u(x) on the space h convergence to zero of the form

of

acts, together with the

.

Au, u - u) .As we said in h Section 3, for a whole class of maps, by definition : those of . . m e (S) , the convergence of uh (x) to u(x) in the nor& of the space X

(Auh

-

will then follows as a ronseqUence.

Let us notice, however, that we shall not be able to give estimates of the e r r o r

11 u

-

u I1 of the same type a s those which a r e h In this common in the analogue approximation of the equation Au = f 45 regard'see Remark below.

.

U . Mosco

By taking the finite-dimensional existence theorem of Chapter 2 into account, we can state the stability theorem of Section 3 in the form theorem, more suitable f o r our present of a ffconstructivelfexistence aims.

Let

,THEOREM 6 continuous -map of

X

A

&5

bounded, strictly monotone and hemi-

into X * . -

Let K b e a closed convex subset of ce of finite-dimensional closed convex subset in

K = lim K h according to Definition 1 f Letus -

Section 2

X , f $, of X

sequen-'

, such that

--

,

.

assume ,furthermore,that the following coerciveness condi--

tion holds -There exists v --

t O

n Kh -such that h

Then, t h e r e exists f o r -----

every

h

a unique solution

problem

unique solution and 5 -

u G K

X

2

(Kh)h

:

u

of. the -

problem

(Au,v-u)aO

W v E K .

u

h

of the -

U. Mosco

Moreover, (AuI.

-

u

converges -weakly to h Au, uh- u) converges to zero.

COROLLARY u

h

'

If, in addition, -

converges strongly

5

X

u

A

u

2 of

in X --and the form -

(S),

type

,

then

.

Proof : Theorem 6 reduces to a special c a s e of Theorem 2 once we have proved the existence of a bounded sequence of approximate s o lutions

F o r given h , the existence of uh can be proved by aph ' plying Theorem 3 to the map A and the s e t K Let us only note h ' that, if K is unbounded, then the coerciveness property (d ) requih 0 r e d in Theorem 3 is now an obvious consequence of our present assumption (uh)

u

(g1)

is a further consequence of

that appears in

for every ce

(uh)

.

(see Remark 6 above) )

(dl) : in fact, if

'

v

is the vector

0

, vo 6 Kh hence

h , and

this clearly implies, by

is bounded in

Remark 12

The boundedness of the sequence w

X

N

(dl),

that the sequen-

.

The assumption

particular, that the approximauts

w

(d ) .of Theorem 6 requires, in 1 Kh have a non-empty intersection.

A more general condition avoiding this hypotesis can be found, f o r instance, in U. Mosco Lb] , namely Theorem 3 lowing remark

. See

also the fol-

.

Remark 13

Condition

the following assumption on

ICI

(d ) 1

in Theorem 6 can be replaced by

A : --there exists a function

U. Mosco

continuous

(r)

and +

4

11 v -

strictly increasing a,

w 1

as

(

r

Il v

-

j

+

0

a

+, with

$ (0) =

Remark

(C)

and -

, such that

< (Av

q)

- Aw, v

-

w),

v, w E X

Let us also note, in particular, that a map tisfies the property

0

A

.

of this type s a -

mentioned in the corollary of Theorem 6 , s e e

of Section 3 @

2

Remark 14

The approximate solutions considered in Theorems 6

a r e required t o satisfy the initial inequality exactly. This corresponds, indeed, to the fact that the map

A

was left unchanged in the approxi-

mate problem. However, a simultaneous approximation of

A

could be

also taken into account and in this regard we recall Remark 3 of Section 3

.8 As we said at the beginning, the problem we a r e concerned with

in the present section is the convergence of the approximate solutions u

h

to the initial solution

u.

In this respect, the meaning of Theorem

6 above is that, for inequalities involving a map

A

of the type consi-

dered in that theorem, the proof of the weak o r strong convergence of uh

to

te sets

u

is reduced to the proof of the conv'ergence of the approxima-

Kh

of Section 2

to the convex set

.

The proof that

K

of the initial problem, in the sense

K = lim Kh

can be achieved, in some

cases, by using one of the general convergence results given in Section 2 o r otherwise by carrying i t out directly in the specific situation at.

hand. We shall now consider some examples. (a) Ritz-Galerkin approximation. An circumstance

mentioned

above

is

the

example

of the

first

U. Mosco

internal approximation of Ritz-Galerkin type of a convex s e t with a non-empty interior. In this case, given an initial convex set K in the space

X , the approximahis K

h

Are simply chosen to be the finite-di-

mensional sectiohs.

of

K , with respect to an increasing sequence of finite dimensional sub-

spaces

Kh of K

Xh . h Then we know f r o m (c) of Section 2, that we have convergence of of

X

such that

U

X = closure of

K , hence, under the assumption of Theorem 6 , convergence

to

u to u in the sense of that theorem, provided the interior of h is not empty If

0

K is empty, then a condition of type mentioned in (a) of

Section 2 could be cheked.

-

(b) Internal approximation of :Sobolov spaces.

A general scheme

of internal approximation, which is typical, for instance, of the finite-el e p e n t methods, can be described a s follows K

is a closed convex subset of

(vh) L~

X :

is a sequence of finite-dimensional spaces,,

is, for every h, a closed convex subset of

We assume that there exists f o r every

and a map

.

h

'h V

.

an injective map

U. Mosco

such that (i) and h (i) ph L

(ii) below hold : C

K

f o r every -

11 v - % rh v 11 + 0

(ii)

h f o r every -

v

fK

Under these assumption, if

we obviously have K = lim K

in

h

Differently then in example

X

.

( a ) , this scheme requires that the

convergence condition (ii) must be checked

directly in the specific si-

tuation at hand. We shall s e e now a classical example of this kind of approximation, namely the internal approximation of the Sobolev space

.

F o r this,as well a s f o r the example of external approximation H;)(R) 1 of H (R) we shall give in the subsection ( c ) below, we r e f e r to J. Cea 0 [ , J. P. Aubin [I-52 , F. Di Guglielmo Ll] .

11

1 Internal approximation H (52) . 0 1 X = Ho(R) , where R is a bounded open subset of

of

Example 1 . Here

~ ' ( 0 )the Sobolev space of a l l r e d functions

0

such that

2

v 4 L (R) and

vx

i

v(x) on

R ,

, distribution derivative of

v , also

R ~ ,

U. Mosco 2

.

.

belongs to L ( 0 ) , f o r every i = 1,. .,li 1 Ho(R) is a reflexive Banch space with the norm

Given a discretization parameter

.An)

h = (h1 8 . .

,

h.

1

>

0

f o r every

i

,

we shall now define an injective .map

where

vh

is an

and on the given R

n -dimensional vector space, h

-

o(*d (

depending on

:

) = the characteristic function of the real interval

5

h

.

Let us consider, indeed

- d(

nh

) = the one-dimensional I1tentl1function

1

[-p,

1

z]

U . Mosco

- for any multi-index

q = (ql,

. ..,qi, . . .

x), qi

E fN

i

'

.

let

i . e . , the .n-dimensional tent function whose support is the "n-cube"

with center at the point

Q=

(qlhl,

.. ., qihi, . . . .qnhnl

and "edge lenght"

2h = (2h1,.

. . ,2hi,. . .2hn)

U . Mosco

U . Mosco

h

-Q

= the set of

q = (ql,.

aU multi-indices u

p

,;

c

..,qn),

-

such that

,

Q

and

Note that h I (rq(x) 6 H,,(R)

- vh S Vh (0)=

for every

q E ph

;

the space of all real vectors

-the map

given by

that i s

v (x) = h

L ~

E

h v Q

X

[ ~

o(*d(-

1

hl

X

-

qJ.

..

d*

0.L (

n

hn

- q)

1

U . Mosco

-

h Xh = phV (Q) , the subspace

X = H J (0)

04

0

generated by the basis functions

?",XI

Since

ph

.

q E

Q~

is obviously injective,

vh

and

Xh a r e isomorphic

and in many arguments they can be indeed identified ;

-

the map

which associates the function

v(x) with the vector

whose q-component

is the mean-value of

v(x) on the region

h vh = ( v ~ ) ~ Qh

U. Mosco

The main approximation results can then be summarized a s follows : Lemma 3

F o r every --

Corollary 1 If

X = lim X

h

v o Hb(R) , ph

rhv

€ TtR)

1 h X = HO(R) and Xh = ph V ( 0 )

'

Corollary 2 If

and -

then K = lim K

h

in

1 HO(R)

,

n

then

U . Mosco

11 v - ph

For the estimate of the e r r o r the papers quoted above

rh

I(

v

we refer to

.

(c) External approximation of Sobolev spaces

.

The examples

mentioned up to now have the common feature that the set ximated by sets

Kh

which a r e contained in

K

is appro-

K , and it is to s t r e s s

this fact that we used the term internal approximation. However, it is easy to adapt the scheme (b) to the more general situation in which the approximant set ined in

K

is not required to be conta-

Kh

.

It suffices, indeed, to replace condition (i) with the condition

-

(ii) If v. J and -

6 ph Lhj , where j

v j

h

-is-a

(L j)j

converges weakly to

v

&

h subsequence of (L )h '

-v

X , then

In fact, this condition together with condition (j)

K

above is equi-

valent to the limit

Although improperly, we may call this kind

of. approximation an

external approximation. As it well known, the finite-difference methods for the approxi>

mation of partial differential operators caii be put in the external framework of approximation we have just mentioned. As an example of that, le't us briefly describe the exterrial approximation of the Sobolev space 1 HO(W 1 Example External approximation of HO(0)

.

.

Let us consider :

-

U. Mosco

-

for each

i = 1,

. . .,n,

the space

with the norm

-

the product space

of all vector functions

with the product topology ; -the closed subspace

with

Xo,

.

1 We shall identify the space HO(0) 1 that i s , we shall identify the function v(x) E Ho(0) with

contained in the diagonal of

the vector function

X

U. Mosco

-

the n-dimensional tent functions "smooth in the i-directionn : (XI=

i = 1,.

.. , n ,

-

X

1

-

X.

q...

hl

where

X

d + d ( 1h - q i ) . - - - d ( -n

q = (ql,.

..,R)

-

4.1.

hn

1

is a multi-index and

d(

t

)

is

the one-dimensional tent function considered in the previous. example :

The support of t e r at the point

Q = (qlhl,

(x) is the n-coordinate llcube" with cen-

. . .,q,hn),

but the i-direction and edge-leght 2h

i

- Qest

=

the s e t of all multi-indices

the t l c r o s s region"

edge-lenght

h

j

in all directions

in the i-direction :

q = (ql.

. .x),

such that

U . Mosco

with center at the point

Q = (qihi)

and "arm-lenght"

and

Clearly, for every

h

-

- 'est -

'est

(G?) = the space of

i = 1,

.. .,n

all real vectors

2hi

in every

U . Mosco

- the map ph d

~ : , t s~t

~

where, for every

i = 1,.

)

h est ~ (n)

x= i

. .,n,

the map

-z

h

i~'(')3~

i s given by h phi v

= vhix) =

- q = ph tst(S,2the ) subspace

of

hi

k

h . 'est

CH'(~)]

=

i

generated by the basis vector-functions

-f:

(x) = ( .

.., y:

(XI,.

(XI)

.:st

E~

Clearly,

is an isomorphism of

associates the vector function of

vhi(x) = phi v

where

h

X =

7CH

, i = 1,. . .,n,

Finally, we shall still denote by every function

v(x)

h Vest(SZ) 1 (S2)

7

phir

h

v

i = l,...,n,

h

of

h Ves,(fl)

rh the map with associates

L (SZ) with the vector

Let -

Let -

v

2

1 v E HOW)

~[H'(Q)I

Lemma 5

Xh , which

,

with the vector

whose q-component is given by the mean value of

Lemma 4

onto

. For ,each

v(x) on the region

i =I,.

. .,n

9

@

v

h

.=

h (v ) and suppose that. for every 4 q r Q ; ~~

.

U. Mosco

convekges weakly to a funcrion v

r

H'(R)~

f o r all

i = 1,.

in

i

Then, we have --wi(x) Z where

V(X)

v(x) belongs to

weakly to

Hb(0)

-

=

7

cp_ hg(x)

1 6

Corollary

Xo

--

Moreover,

in

x

1 Ho(R)

0

.

..,n

v (x) = p vh -h -h

.

converges

=

C H ~ ( R ~ as

&

Xh

ih/+

o

the subspace of

spanned by the basis vector-functions

CH'(~)J

,

.

y(x) = (v(x). . ... v ( r ) )

Corollary 1 : Let

x

3.e.

a s 1h 1 * -

Qhq

.

r?

: Let

Then

-K

identified --with a cone

1

v E Ho(S2) : v ) 0

=

K

X

a. e.

R

3 be

and

Then , K -

(d)

= l i m -h K

-

Projection methods

X

.

as

\hl+O

.

The approximation methods discussed

.

U. Mosco

s o f a r may be called of injective type : they a r e based on suitable injective mappings

ph

vh

of some fiAte-dimensional space

rying some convex subset that approximates the given

L~

of

K

vh

into

in a convex suset

K

.

h

X of

, carX

However, a conceptually different type of approximation is also possible, which is based instead nn some projection mappings X

onto a finite-dimensional subspace

map

ph

c a r r i e s the given s e t , K

Xh

of

X

of h : in this case, the

onto an approximate

Kh

p

in

Xh

.

The most natural setting for these methods .involves an Hilbert space of

X

, an increasing sequence of finite-dimensional subspace

U

X. with

Xh

dense in

X, and, f o r each

h

Xh

, the orthogonal

h

projection If

p h

of

X

onto

is a map of

convex subset of

X

h

X

'

into -

X

and

is a bounded closed

K

X, then the problem

may be approximated by the sequence of problems

v where

,is a map of

Xh

into

Xh

and

h

c

Kh

U. Mosco

(see (dl! of section 2) . Xh Let us remark that the approximate problem above is in the

i s a (bounded) closed convex subset of space

Xh. However, its solution

uh

is the same as. that of the pro-

blem

in the space

X

. In fact,

we have

hence

Therefore, the proof of the convergence of the approximate s o still can rely on Theorem 6, by taking now the convergenu h ce result stated in (el) of Section 2 into account. We refer to the au-

lutions

thor's paper C43

, e.g., Proposition 3.1, for more details on this point.

Projection methods for solving equations involving non-linear operators in Banch spaces have been extensively investigated by many authors, let us mention here F. E. Browder , [lo] n23 der and W. V. P e t r y s h s [I]

,

F. E. Brow-

and the review paper by R. I. Kachurov-

skii [3] , where further reference on the subject can be found. See also D. G. de Figuerido [I] Eemark 15

.

.

The usual estimates of the e r r o r [lu. - uh

the Ritz-Galerkin' approximation of an equation Au = f near map

A

a r e based on the inequality

!( in

involving a li

U. Mosco

u beh longs to. This estimate, however, is in general false f o r variational ine-

Xh

being the subspace of

X, which the approximate solution

convex cones. qualities, even if it r e f e r s to an internal approximation of -This can be seen with the following simple example, due to G .

--, (0, 0)

i s the vector that minimizes the distance functional 2 2 F(v) = - 11 v - z )I , v = (vl, v2) E E z = ( - 1 0 on the 2 0 0 E~ , while u = (0, -h) half plane v 3 0 of the euclidean space 1 h i s f o r a given h > 0 the vector that minimizes F ( v ) on the cone Kh Strang : u 1

described in the figure below

Then, h

> 0 small.

11 u -

uh

11

= h, whereas

dist (u, Kh)

/J

h

2

for

U. Mosco

7

Dual variational inequalities and complementarity systems

It i s well known that many minimum problems of the calculus of variations and optimization theory admit a "complementary" on "dual" formu

lation. On this matter, let u s only r e f e r here,for instance, t o A. M. Arthurs [I], J.Stoer-Witzgal [I], J . C e a 1 2 1 Robinson [I],

Moreau [2],

U.Dieter [i][2].

F o r variational inequalities too , many "dual" characterizat ions of the solutions can be given, which a r e based, essentially, on separation o r

.,

mini-max theorems. A discussion of some of these dual methods can be found in J. L. Lions, R. Glowinski, R. ~ r e m o l i G r e s ,loc. cit. As we shall see below, it i s always possible, at least in principle, t o associate a variational inequality in X*

-

the "dual" inequality

-

with any

given variational inequality in the space X - the "primal" inequality

-

in such a

way that a vector ,u of X i s the solution of the primal inequality if and only

' is a solution of the associated dual inequality. if the vector u* = -Au of X However, the explicit formulation of the dual inequality may eften be in practice a difficult problem in itself. . We shall first consider variational inequalit$ on convex cones. The dual scheme we have in mind becomes then particularly slrnple and both the primal and dual inequalities can be characterized more symmetrically by means of a so-called (generalized) complementarity sy$tem. Let indeed M be any map of X into

x*,

H a convex cone with u er

tex at 0 in X, z a solution of the variational inequality

(48' )

Z E H : ( M Z ;w - z ) 3 0 Then, the p a i r z, z* =

-

Sb

W E H

Mz i s a solution of the problem

U. Mosco

In fact, by putting w = z

+v

into (Cg'), with v a n a r b i t r a r y vector

of H, we find

which is t o say, z * H*. ~ Moreover, by lepladhg now the vector w in

(46")

once with 0 and then with 22, we find

(Mz,-z)3 0

and

(Mz,z) 3 0

respactetely, therefore (z*, z) = 0. Conversely, i f the p a i r z, z*= every

WQ

- Mz

satsfies (480 above, then for

H we have

since z * ~H*. Therefore, we have proved the following LEMMA 6 : Let M be any map of X into X:

H a convex cone

with vertex a t 0 in X. A vector z is a solution of the veriational inequality

i f %nd only i f the pair z, z*= - M z is a solution of the

U . Mosco

zg H

,

Z*d H*

,

(Z*, zz)= 0

Let u s suppose now that the map M i s 1-1 of X onto X* and let us define the map

Note that MI= M-'

if M i s linear. Let u s also assume that H i s a closed

convex cone with vertex at 0. Then, if we denote by H** the polar cone

* & X,

of H

i.e.

U. Mosco

a simple argument, basedon the separation theorems for convex sets, shows that

Since the relation

is clearly equivalent to

by applying Lemma 6 once t o the map M and the cone H and then to M 1 and H*, we obtain the following

*

THEOREM 7 : Let M be a 1-1 map of X onto X , H a closed convex cone with vertex at 0 in X. If MI is the.map of X* on X given by (49) and H* -

the polar cone of H in x*, then the following three problems a r e

equivalent (i)

z~H:(Mz,w-z)po

(iii)

z e H , Z*E:H*

VWEH

, (zX,z)=O ,

U. Mosco

prbvided z a n d z* a r e related by

Remark 16: If M = DF, F being a convex functional oli X, then MIZ*

= - ~ f ( - z * ) ,zkE x*, where

is the

conjugate functional of F (see

Section 2)Then, the dual problems (i)and (ii)of Theorem 7 characterize the minimum problems ( i ) z minimizes

F(w)

on

H

(ii) z* minimizes

F*(w*)

oh

-H

*

respectevely. Problems (i)and (ii) above a r e conjugate in the sense, for instance, of Fenchel's duality theorem, cfr. R. T. Rockafellar [2].

91

Remark 17 : Let vo be a given vector of X, A a given 1-1 map of X onto

x*, A'

defined a $ in (49). If we apply Theorem 7 to the map Mz

=

A(z+vo)

zEX

,

,

then we find that the following problems a r e equivalent (i)

(ii)

uEvo+H:(Au,v-u)>O 'u% H~

Y

v.v=H Y

: ( ~ l u * + vv~ , - u 13

o

Vv'g

H*

U. Mosco

(iii) provided

It mfficeb infactAa make the change of variable z When X = IR"

=

u

- v 0' @

" x3:and H 'H* is the non-negative

ortant of

IRn ,

then problems such a s (iii) above a r e known in the litarature a s complementarity systems: -

linear c. s . if M i s an affine map of I R ~into itself;=

linear c. s . in the general case. They a r i s e in many problems of optimization and game theory, a s well a s i n geometric o r physical applications, and have been investigated by many a u t h o r s see R. Cottle-I. Dantzig ClJ R. Cottle [I],

C. E. Lemke [lJ, S. Karamardian

Cl],

l1.3,

where further r e -

ference on these systems and their applications can be found. In these papers many algorithms for the numerical solution both of linear and non-linear complementarity systems have been given. These algorithms, which a r e based mainly on suitable pivoting techniques, can thus be also used to solve discrete variational inequalities on convex cones. We shall s e e an example in the following Section 8. In turn, the reduction of a complementarity system to a variational ineauality, hence to a fired-point problem, can be convenient in o r d e r to obtain more general existence results. M.oreover, this reduction is also fruitful from an algorithmic point of view, for it makes i t possible to use iterative methods of solution. F o r more details on the relation between variational inequalities and complementarity systems in finite-dimensional spaces we r e f e r to K a r h r d i a n , loc. cit. , I. Dolcetta

[I],

J. MorC

611.

U. Mosco

Variational inequalities in connection with convex programming have been a l s o investigated, from a computational point of viewtoo, by

0.G. Mancino-G. Stampacchia [I]. The duality for variational inequalities on convex cones considered above is a special case of a general dual scheme for variatibnal inequalities of type u r x : (Au, v

(50)

- u) 3 F(u) - F(v)

,

++VEX ,

where F i s a 1. s. c. convex functional on the normed space X, with values in ( - co,+w]. In this case, the dual variational inequality can be witten a s

JP

where At is defined a s in (49) above and F is the Young-Fenchel conjugate of F . see Sec@m 2. It can be proved that a vector u is a solution of (50) ~f and only if the vector (52)

u* = -Au

4

(i. e. , u = -A1u )

is a solution of (51). Moreover, both solutions u and u* a r e characterized by the Young-Fenchel identity

U. Mosco where u and u* a r e related a s in (52) above. The special c a s e of theorem 7 is obtained by taking F t o be the indicator function

6H of the convex cone

F* = (

is the indicator.function of the polar cone H

in

=

cH'

H in X ( s e e Section 2). hence

* of

x*. C ~ h dual e prolems of Remark 17 a r e given. insteed, by F=

hence

H

L-

.

* = &.,,(wl) + (w*,vo) 2 F o r more details we r e f e r to U. ~ o s c o r f l . F (c*) Remark 18: When A is the differential of a convex functional G, than

the dual problems (50) (51) characterise a p a i r of dual extrenum problems in the sense of Fenchells duality theorem, s e e R. T . Rockafellar 123. When

*

X i s a Hilbert space, X

X, and A = identity map of X, the0 problems

(50) (51) above and the equivalent system (53) a r e related t o the so-called proxi&y mappings introduced by J. J. Moreau C f l . An application of a dual scheme of this type t o prove the regularity of the solution has been given by H. Brezis

[d.

Remark 19: The explicit formulation of the dual variational inequality (51) r e q u i r e s the knowlbdge of the inverse map A-' 'and of the conjugate functional F'.

In particular, t h e calculation of F*-may be a difficult pro-

e blem even for "simple1' F. However, the dual scheme described in the p r sent section can be modified in concrete situations, hy making a sort of "change of variable" in the initial inequality before operating with the duality. In some c a s e s this leads t o a more feasible

inversion of A and

dualization of F. Dual extremum problems have been indeed investigated along these lines by R. Teman

[a, by relying on a generalized form of

F e n c h e l f s duality theorem given by R. T. Rockafellar

C72. or

a similar

U. Mosco approach to dual variational inequalities see

M. Matzeu El].

In the following section we shall apply the dual scheme described.above t o the "obstacle problemt' mentioned in Section 1 of Chapter 2 and we shall rely on it t o give a method f o r the numerical approximation of the solution. Let u s a l s o mention, in this respect, that a different approach to duality, based mainly on minimax techniques, h a s been a l s o applied t o the numerical solution of problems a s those mentioned in Section 1 of Chapter 2, by J. Cea-R. Glowinski

Cfl , Dl.J. Cea-R. Glowinski-Nedelec [d, M. Nedelec

[:11, J. F. Bourgat C13. See a l s o J. Cea 121. 8. An example Let u s consider the obstacle problem of Section 1 of Chapter 3 (Exam ple 3):

where a(u, v ) i s the Dirichlet form

and

y

i s a fiven function in HI

(a).

0

We a r e in the situation described in Remark of the proceding section, with

U. Mosco

Moreover,

A t = A-' = G

:

H-l(Jl)

Hi(*

is the Green operator for the

: for any measure

Dirichlet problem in

V(X)

H

= G 'E

T in

H-'(A),

the potential

(XI

i s given by

dn

Where g(x, y) is the Green function for the Dirichlet problem in

R.

The primal variational inequality now is

while the dual ineauality now is

Both these inequalities a r e equivalent t o the complementarity s i s t e m (see Remark 17 of the preceding section)

U. Mosco

The approximate solution of problem (50), o r of the equivalent direct minimum problem, has been studied by many authors, s e e R. Glowinski i2J, M. Goursat [I],

Sibony [2],

li-L. Guerra and G. Volpi

Marzulli [I],

G. Stampacchia

[a J. J. Moreau C73,

J. F. Durand

rg,

V. Comincio -

n].

We shall summarize below the method followed i n A. Fusciardi et al. r l 3 . The complementarity system (52) is approximated by a sequence of finite-dimensional complementarity s y s t e w what gives a direct simultaneous approximation of the function u and the measure

/( = d u .

the solution of (50)

and (51) respectevely, without any assumption of regularity. This discretization is obtained by realizing an internal approximation

*

of the cone of measures H , by meane-of unit m a s s e s supported by the (n-1)-dimensional meshes of a given coordinate subdivision of

fi .

We shall now describe this approximation, by taking, for sake of simplicity, n=2. Let u s consider a coordinate subdivision of IR

2

a s that given in Ex. 1

of Section 6 , h=(h ,h ) being the discretization parameter and Q = (q h q h ), 1 2 1 1 ' 2 2 q = (q., q,) G z2, the vertices of the subdivision. 1 L 2 F o r every q=(q q ) G Z w e shalldenote by sh and sh2the one-dimensional 1 1' 2 9 ( l = n - 1 ) meshes of the subdivision which have Q a s ? h e left end point and the lowest end point, respectevely: that i s ,

U. Mosco

h Let u s now consider the functional bh and d which associate q1 the mean values on shl aPld sh2,respectevely: 9 q

aith each function

p cc:(~).

It i s easy to show that

2 q1

and

h 2 satisy the estimates q

U. Mosco

h. a r e contained for every q such that both sh and S q h q1 by Q ' the set of a11 such q l s .

ins. We shall denote

[we hive in fact

for every ipC c:(a ), we can find x

0

2

I

such that

thus,

hence xll

j' x1 1

2 I'f(xl,x21\ d x l < ( d i a m a )

jz a

2

2 ~ x l , x 2 ) l dx, dx

2

Therefore,

Then.

cqhl and

6: 2,

,

q C Qh a r e both elements of the dual H-'(Q

)

U. Mosco of H

1

(h)and

0

provided

(p+

since they a r e ooviously non-negative ( i . e. 0, i = l , 2 ) . we can conclude that

tive m e a s u r e s belonging t o H-'(R).

c$

ql

and

h

Q-

i ( )+ ~0 q a r e non-nega-

We shall now denote by H the convex cone, with vertex at 0, geneh h , rated by the non-positive m e a s u r e s - cqi, Q ~ ii1,2:

Clearly

i=1,2

,

*

Hh C' H where, let u s recall it, H

*

for every h,

+ i s the cone of a l l non-positive measures in H-'(R).

The finite-dimensional cones H* approximate the cone H* in the sense h of the following lemma: LEMMA

K

H

= l i r n H W in H-'(JL). h -

In view of the Corollary of Theorem 1 of Section 2, the convergence just stated is equivalent t o the convergence of the polar cones: the polar cone 1 1 of H' in H is the cone H of a l l non-negative functions of H we 0

(a)

0

(a)

started with, while the polar cone H of H * i s the cone of a l l functions h h h whose t r a c e on each S q P Q ~ iz1.2, , has a non-negative mean vc qi ' value:

%(R)

U. Mosco

Now, it i s not difficult t o prove that lim H

h

=

A h

we r e f e r t o A. Fusciardi et al.

H~ = H

as l h 1 3 0

,

. loc. cit.

We can thus apply the approximation sheme described in Sections! 5 and 6 above. The finite-dimensional problems that approximate problem (51) can

: be obtained by replacing H* with H

(we take

yh =

for a l l h and we

also leave the operator G unchanged):

which i s equivalent t o the complementarity system

where

C ~ o t ethat whereas the cone H* i s finite-dimensional, i t s polar cone h i s ndt such. However, zh belongs t o the finite-dimensional cone -G(Hh)-\t'

H h thus the complementarity system above is essentially a finite-dimensional one.2 We now w r f ~ ethe approximate measure

rh

i i t h m s of the'basis

U. Mosco

h [- Cql .]

that generates H*. h'

If, similarly, we put

then the discrete variational inequality corresponding t o ( 5 4 ) , according to what we have seen in Section, i s given by

E b y writing that a vector of ElN is non-negative, we mean that a l l its components a r e non-negative

1,

where for each q C Qh, i = 1 , 2 h yql . =

' crqi.y

) = mean value of

Y

On

h sqi

U. Mosco

fi

of the potential in

of the measure [ ~ o t ethat

h h . carried by the mesh S TI rj

b

Gh =Gg qi; r j rj;qi

,

q , r e ,~ i ,~j = 1,2

because of the symmetry of the Green operator G.

2

is given by F o r example, if i = l , j=2, the matrix elemeb G ql, r 2

Gh

--

1

J (q2+l)h2 g(x10 q2h2;fZhl. ~ ~ ) d x ~ d

qzh2 The discrete problem ( 5 6 ) is in turn equivalent to the discrete complementarity system

which can be solved, f o r instance, by using pivoting thechniques ( s e e F. Scarpini,. A. Valdinoci [l) ) Once this system has been solved, we can write the approximate mea sure

U. Mosco

and the approximate function

By the convergence r e s u l t s of Section,G we know that u (x) converges h 1 strongly in H (R ) to the solution u(x) of problem (51) a s \hi + 0, while 0

the measure of (52).

/". converges strongly in the dual

H-'(& ) to the solution

r

Let u s also r e m a r k that the coefficients z

obtained by solqi+ yqis ving system (58), yield a direct approximation of the mean values of the so lution u(x). on each one-dimensional mesh of the subdivision used in the a p proximation. We r e f e r to A. Fusciardi efal. for more d e t d s on this point. [ ~ o t ethat the mean-values a r e well defined for a n a r b i t r a r y function uq

$(a), 0

(n=2) whereas the point-values of u(x) a r e not defined, unless

some regularity of the solution u(x), depending on the regularity of the obstacle

\Y

, is known

1

Remark20.The approximation of the one-dimensional obstacle problem h in an interval (a, b) is particularly simple. Then, the basis measures q at a point x = qh of a subdivision of lR. can be taken to be the unit m a s s q q The approximate measures a r e finite-combinations of such Dirac measures

6

and since the potential g(x) = G

6 q1 x 1 in

( a , b) is thdtriangle'function

U. Mosco

then the approximate functions

a r e piece-wise affine function in (a, b). The solution of the complementarity system in t h i s case is particularly simple, see F. Scarpini, A. Valdinoci

Ed. @

Remark 2 1.The approximation method described in t h i s section requices the knowledge of the Green function g(x, y) for the Dirichlet problem in

fi

It i s still possible, however, to combine the dual approach discussed above with a n approximation of the Laplace operabr of finite-difference type (by replacing point-values with suitable mean values). This requires a n external approximation of the m e a s u r e s in H-'(R),

in place of the internal one ddscribed Pbo

ve. We r e f e r to U. Mosco-F. Scarpini [I].

$

Existence and uniqueness of the solution of the elastic-plastic torsion problem for a cylindrical b a r of nval cross-section, 29 (1965). .Prikl. Mat. Meh. -

B.D.ANNIN,

[I]

A. M. ARTHURS,

[I] Complementary variational principles,

Clarendon P r e s s

Oxford, 1970. E. ASPLUND,

M

Positivity of duality mappings, Bull. Amer. Math. Soc. 73

(19671, 200-203. J. P. AUBIN,

Approximation of variational inequalities, in "~unctional Analysis and Optimizationt' (E.R. Caianiello Ed. ), A.cad. P r e s s

[I]

1966, 7-14. [2] Approximation des espaces de distributions et des operateurs differentiels, Bull. Soc. Math. France, MBmoire 12 (1967), 1-139. [3] Behavior of the e r r o r of the approximate solutions of boun-

bary value problems for linear elliptic operators by Galerkints and finite difference methods, Annali Scuola Norm. Sup. 21 (1967),

599-637.

I41 Evaluation des e r r e u r s de troncaturo des approximations des 21 (1968), 356-368. espaces de Sobolev, J. Math. Anal. Appli. -

[g

Approximation des problemes aux limites non homogenes 30 (1970), pour des op6rateurs non lin6aires, J. Math. Anal. Appl. -

510-521. C. BAIOCCHI,

[I] Su un problema di frontiera libera connesso a questioni di idraulica, to appear in Annali Mat. P u r a Appl.

A'. BEURLING and A. E. LNINGSTONE, [I] A theorem on duality mappings in 4 (1962), 405-411. Banach spaces, Ark. Mat. L. BOCCARDO,

M

Alcuni problemi a1 contorno con vincoli unilaterali dipendenti da un parametro, to appear.

J. F. BOURGAT, [I] Analys3 numerique du probleme de l a torsion elastoplastique, Thsse, I. R. I.A. Paris, 1971;

p. Mosco H. BREZIS.

[I] Une generalisation des operateurs monotones, Inequations d'6volution abstraites, C. R. Acad. Sci. Paris, t. 264 (1967), 683-686 and 732-735.

k]

Sur certains probl&mesnon-lineaires, Sdminaire Choquet n. 18 (1966-67), 1-18.

[g

Equations et inequations eon lineaires dans l e s espaces vectoriels en dualit6, Ann. Inst. Fourier (19681, 115-175.

18

Problemes unilateraux, to appear. H. BREZIS and M. SIBONY, [lJ M6thodes dlapproximation e t dliteration pour l e s operateurs monotones, Arch. Rat. Mech. Anal. 28 (1969), 59-82.

[23 Equivalence de deux inequations variationnelles et applications, Arch. Rat. Mech. Anal.. to appear. H. BREZIS and G. STAMPACCHLA, [I] Sur l a regularit6 de l a solution d1in696 (1968), 153-180. quations elliptiques, Bull. Soc. Math. France F. E. BROWDER, [I] Nonlinear elliptic boundary value problems, Bull. Amer. Math. Soc. 69 (1963), 862-874.

b]

wonlinear elliptic boundary value problems, 11, Trans. Amer. Math. Soc..-117 (1965), 530-550.

b]

Continuity properties of monotone nonlinear operators in Banach spaces, Bull. Amer. Math. Soc. (19641, 551 553.

Ef3

-

On a theorem of Beurling and Livingstone, Canad. J. Math. 367-372.

17 (1965), -

E]

Nonlinear monotone operator-s and convex sets in Banach spaces, Bull. Amer. Math. Soc: 71 (1965), 780-785.

[6] Existence and uniqueness theorems for solutions of nonlinear boundary value problems, Proc. Amer. Math. Soc. Symp. Appl. Math. XVII (1965), 24-49.

B]

Probl&mes non-lineaires, Les P r e s s e s de lfUniv. de Montreal, 1966, 1-48.

[q

Existence and approximation of solutions of nonlinear variational inequalities, Proc. Natl. Acad. Sci. U. S. 56 (1966), 1080- 1086.

[93 On the unification of the calculus of variations and the theory of monotone nonlinear operators in Banach spaces, Proc. Natl. Acad. Sci. U.S. 56, 419-425.

-

U. Mosco

[lo]

Non-linear accretive operators inBmach spaces, Bull. Amer. Math,.Soc. 73 (1967), 470-476 .. -

1 A new generalization of the Schauder fixed point theorem, Math. Annalen 174 (1967), 285-290. [12] Approximation - solvability of nonlinear functional equa26 ( 1967), tions in normed linear spaces, Arch. Rat. Mech. Anal. 33-42. b3] Non-expensive nonlinear operators in a Banach space,

54 (1965), 1041-1044 Proc. Nat. Acad. Sci. USA, -

&4] Non-linear variational inequalites and maximal monotone mappings i n ~ a & s p a c e s , Math. Annalen 175 (1968), 89-113 b5] Nonlinear operators and nonlinear equations of evolution in Banach spaces, P r o c . Amer. Math. Svmp. Nonlinear Functional Analysis,Chicago. 1968. The solution by iteration of nonF. E. BROWDER and W. V. PETRYSHYN, linear functional equations in ~ar$ch spaces, Bull. Amer. Math. 72 (19661, 571-575. SOC.Construction of fixed points of nonlinear mappings in Hilbert space, J . Math. Anal. Appl. , 2 0 (1967) , 197-228. -

[2]

J. CEA,

Approximation variationnelle d e s problemes aux limite s , Ann. Inst. F o u r i e r 14 ( 19641, 345-444.

@ Optimisation, thkorie et algorithmes, Dunod ed.,Paris, 1971, J. CEA and R. GLOWINSKI, @] Mi-tion bles, t o appear,.

d e s fonctionnelles non differentia-

Methodes numeriques pour 1'6conlement laminaire d'un fluide rigide viscoplastique incompressible, to appear.

b]

J . CEA, R. GLOWINSKI and J . L. NEDELEC, Methodes numeriques pour la torsion elasto-plastique d'une b a r r e cylindrique, t o a p p e a r ,

V. COMINCIOLI, L. GUERRA and G. VOLPI Analisi numerica di un problema di frontiera libera connesso col moto d i un fluido a t t r a v e r s o un mezzo poroso, Pubbl. n. 17 d e l G b . Anal.Numer., pavia, 1971. L

R. W. COTTLE,

n]

Nonlinear P r o g r a m s with positively bounded Jacobians, SIAM J. Appl. Math. 14 (1966), 147-157. - -

U. Mosco

R. W. COTTLE and G. B. DANTZIG, [lJ Positive (semi-)definite programming, Symp. Math. P r o g r . , H. W. Kuhn E d . . Princeton, 1970

-

] Complementary pivot theory of mathematical programming, i n LinemAlgebra and i t s Applications, vol. I (1968), 103-125* Mdthodes d'approximation pour certains probl&mes non lindaires non homogenes, to a p p e a r .

J. P . DIAS and M. SIBONY,

Optimierung soufgaben i n topologischen ~ e k t o r r z u m e nI: ~ualitgts:.theorie,Z. Wahrscheinlichkeitheorie verw. Geb. 5 (19661, 89-117.

U. DIETER,

Dual extremal problems ir, linear s p a c e s with examples and applications i n geme theory and statistics, P r o c . NATO Adv. Study Inst. on Theory and Appl. of Monotone Operators, Venezia 1968, Oderisi Ed. , 1969, 1 - 9 . F. DI GUGLIELMO,

[I] Construction dlapproximations d e s espaces d e Sobolev 6 (1969), 279-33 1 s u r d e s reseaux e n simplexes, Calcolo -

I. DOLCETTA,

Sistemi d i complementarith e disuguaglianze variazionali, Tesi, Universith d i Roma, 1972

J. F. DURAND,

.

a

Rdsolution numdrique de problbmes aux limites sousharmonique, These B' llUniv. d e Montpellier, 1968-6 9 .

C. DUNAUT and J. L. LJONS,

n]

MM6chanique e t fn6quations, Dunod. ed., P a r i s , 197t

D. G. de. FIGUERIDO, Topics i n non-linear functional analysis, Lecture S e r i e s N. 48, University of Maryland, 1967

-

A. FUSCIARDI, U. MOSCO, F. SCARPINI and A. SCMAFFINO, bl A dual method f o r the numerical solution of some variational inequalities, t o appear in J. Math.Anal.App1. 4 0 (4972) . E. GIUSTI;

Superfici minime cartesiane con ostacoli discontinui, Arch. Rat. Mech. Anal. 40 (1971) -

U. Mosco R. GLOWINSKI, [I] Mdthodes n u m e r i q u e s pour l t 6 c m l e m e n t stationn.aire d l u n fluide r i g i d e vi'sco-plastique incompressible,,to appear.

La methode d e relaxation. Applications B l a minimisation a v e c e t s a n s consts&ted d e fonctionnelles convexes, Q u a d e r n i d e i Rendiconti, 1st. Mat. Univ. Rorna, 1971.

@] Methodes n u m e r i q u e s pour la t o r s i o n elasto-plastique d t u n e

b a r r e cylindrique, dormulation variationnelle, Colloq. Anal. N u m e r Supper B e s s e s , 1970

.

W

R. GLOWINSKI, J. L. LIONS and R. TREMOLIERES, Methodes nume'riquep d e resolution d e s probl&mes d1in6quations variationnelles e n m e c a n-i que, eT e n phydique, Dunbd E d , . P a r i s , t o appear.

.

M. GOURSAT,

[I] Analyse numkrique d e p r o b l 6 m e s d t e l a s t o - p l a s t i c i t 6 e t d e visco-plasticitd, T h e s e , I. R. I. A. ,P a r i s , 1971

P. H. HARTMAN and G. STAMPACCHIA, [I) On s o m e non l i n e a r elliptic differ e n t i a l functional equations, A c t l Math. 115 (19663, 271-310 . A. I O F F E and V . TIKHOMIROV, [l] Duality of convex functions and e x t r e m u n problems, Uspekhi Mat. Nauk. 23,6 (19681, 51-116; R u s s i a n 23,6 (1968), 5 3 - 5 4 . Math. S u r v e y s J,L.JOLY,

Une f a m i l l e d e topologies e t d e convergences s u r l ' e n s e m b l e d e s fonctionnelles convexes, T h e s e B l a Facult6 d e s Sciences d e Grenoble, 1 9 7 0 .

R. I. KACHUROVSKII, [I] On monotone o p e r a t o r s a n d convex functionals, Uspe h i Mat. Nauk. 15, 94 (1960), 213-215. Monoton'e non-linear o p e r a t o r s i n Banach s p a c e s , Dokl. Akad. Nauk. SSSR 163 (19651, 559- 562. 133 Nonlinear monotone o p e r a t o r s i n Banach s p a c e s , Uspehi Mat. Nauk. 23 (19683, 121-168 R u s s i a n Math. Surveys 23,2 (1968), 117-165. S. KANIEL,

Construction of a fixed-point f o r contractions i n Banach space,. I s r k e l J. Math. , 9 1971, 535-540.

S. KARAMARDIAN, T h e nonlinear complementarity p r o b l e m with application, JOTA 4 (1969),87-98.

U. Mosco T. KATO,

[I] Demicontinuity, hemicontinuity and monotonicity, Bull. i d e m . , P a r t . 11, ibid. A m e r . Math. Soc. 70 (1964), 548-550; 73 (1967), 886-889. -

H. LANCHON,

Solution du p r o b l e m e d e t o r s i o n elasto-plastique d ' u n b a r r e cylindrique d e s e c t i o n quelconque, C. R. A d d . Sc. Paris, 269 (1969), 791-794

H. L. LANCHON and C. DUVAUT, 111 S u r la solution du p r o b l e a e d e l a t o r s i o n Clasto-plastique d'une b a r r e cylindrique d e section quelconque, C. R. Acad. Sci. P a r i s 264 (1967)

-

@]

C. E. LEMKE,

Recent r e s u l t s on complementarity p r o b l e m s , P r o c . P r i n c e t o n Symp. on Math. P r o g r a m m i n g , H. W. Kuhn ed. , P r i n c e t o n Univ. P r e s s , 1970, 349-384.

J . LERAY and J. LIONS, [I] Quelques r e s u l t a t s d e Visik s u r l e s p r o b l e m e s elliptiques nonlineaires p a r l e s mCthodes d e Minty- Browder, Bull. Soc. Math. F r a n c e 93 (1965), 97-107. C. LESCARRET,

H. LEWY,

[I] Ca s d'addition d e s applications monotones m a x i m a l e s d a n s un erpace d e Hilbert, C. R. Acad. Sc. P a r i s 261 (1965), 1160-1163.

03

O n a variational p r o b l e m with inequalities on t h e boundary, J. Math. Mech. 17 (1968), 861-884.

[a

O n a minimum p r o b l e m f o r ' s u p e r h a r m o n h functions, Int. Conf. on Functional Anal. ,Tokyo 196 9 ,

H. LEWY and G. STAMPACCHIA, 111 On t h e rejgularity of a solution of a v a r i a t i o n a l inequality, Comm. P u r e Appl. Math. 22 ( 1969), 153-188.

@] Onthe r e g u l a r i t y of c e r t a i n s u p e r h a r m o n i c functions, J. dlAnxlyse Math. 23 (1970), 227-236 O n existence and s m o o t h n e s s of solutions of s o m e non-coercive variational inequalities, t o a p p e a r

.

J. L. LIONS,

Quelques methodes d e resolution d e s problCmes aux l i m i t e s non l i n e a i r e s . Dunod e t Gauthier-Villars E d . , P a r i s , 1969.

J. L. LIONS and G. STAMPACCHIA, [I] Variational inequalities, Comm. P u r e Appl. Math. 20 (1967), 493-519.

W. LITTMAN, G. STAMPACCHIA and H. F. WE1 NBERGER, [I] R e g u l a r points f o r elliptic equations with discontinuous coefficients (19631, 45-79 Ann. Scuola N o r m a l e Sup. P i s a ,

17

0. G. MANCINO, G. STAMPACCHIA, [I] Convex p r o g r a m m i n g and variational inequalities, JOTA 9 (1972). 3-23. P. MARZULLI, [I] Risoluzione a l l e differenze d i equazinni a l l e d e r i v a t e p a r ziali d i tip0 ellittico oon condizioni s u u n contorno l i b e r o , Calcolo, Suppl. 1 , 5 (19681, 1-22. M. MATZEU,

[l] Dualitti nella t e o r i a della capacitg, Tesi,Ist. Mat. Univ.

d i Roma, 1972.

G. J . MINTY,

p]

Monotone (nonlinear) o p e r a t o r s i n Hilbert space, Duke Math. J. 29 (1962), 341-346.

(23 On a "monotonicity" method f o r t h e solution of non l i n e a r .us. equations i n Banach s p a c e s , P r o c . Natl. Acad. Sci; 5'0 (1963), 10384.041. [33 On t h e monotonicity of t h e gradient of a convex function, 14 (1964), 243-247. Pacific J. Math. -

@

On t h e solvability of nonlinear functional equations of rno14 (1964), 243-247. notonic type, P a c i f J. Math. -

[5) On the generalization of a d i r e c t method of t h e c a l c u l u s of variations, Bull. A m e r . Math. Soc. 73 (1967), 3 1 5 - 3 2 1 .

b1

On s o m e a s p e c t s of the t h e o r y of monotone o p e r a t o r s P r o c . NATO Adv. Study Inst. onl'heory and Appl. of Monotone O p e r a t o r s , Venezia 1968, O d e r i s i E d . , 1969, 6 7 - 8 2 .

.

M. MIRANDA,

[llF r o n t i e r e

J. J. MORE,

T h e application of variational inequalities t o complementarity p r o b l e m s end e x i s t e n c e theorem-s, T e c h . Rep. n. 71- 110, Dept. Computer Science, Cornell Univ., 1972.

m i n i m a l i con ostacoli, t o a p p e a r

J. J. MOREAU, @ Fonctionnelles convexes, S e m i n a i r e s u r l e s equations aux d e r i v e e s $artielles, College d e France., Paris,1966- 1067, miltigraph, 1- 108.

[2] P r o x i m i t e e t dualit6 d a n s un e s p a c e hilbertien, Bull. Soc. Math. F r a n c e 93 (1965),, 273-299.

U. Mosco [3] One-sided c o n s t r a i n t s i n h y d r o d h a m i c s , i n J. Abadie E d . , Nonlinear programming, Nort Holland Pub. , Amsterdam( l967), 257-279. P r i n c i p e s extremaux pour l e probleme d e la n a i s s a n c e de l a cavitation, J o u r n . d e Mkcanique, 5 (1966), 439-470.

[5) La notion d e sur-potentiel e t l e s l i a i s o n s u n i l a t e r a l e s e n elastotatique, C. R. Acad. Sci. P a r i s , 267 (1968), 954-957. [6] S u r l e s lois d e frottement, d e plasticit6 e t d e viscosit6, C. R. Acad. Sci. P a r i s , 271 (1970), 608-611. [7] Traitement numerique d l u n probleme a u x de'rive'es p a r t i e l l e s d e type unilateral, Publ. b.15, Depart. dlInformatique, Univ. d e MontrBal, 196 U. MOSCO,

D]

Approximation of t h e solutions of s o m e variational inequal i t i e s , Ann. Scuola N o r m a l e Sup. P i s a , 2 1 (1967), 373-394.

117

A r e m a r k oil a t h e o r e m of F. E. Browder, J. Math. Anal. Appl. 20 (1967), 90-93.

[3] convergence of solutions of variational inequalities, P r o c . NATO, Adv. Study Inst. o n l h e o r y and Appl. of Monotone Oper a t o r s , Venezia 1968, O d e r i s i Ed. , 1969, 231-247. Convergence of convex s e t s and of solution of variational 3 , 4 (1969), 510-585. inequalities, Adv. i n Math. -

4

[5] Perturbation of variational inequalities, P r o c . Amer. Math. Soc. Symp. P u r e Math. XVIII (1'970), 182-194 . [61 On t h e continuity of t h e Young-Fenchel t r a n s f o r m , J. Math. Anal. Appl. 35 (1971), 518-535. [73 Dual variational inequalities t o a p p e a r i n J'. Math. Anal. Appl. 3_9 (4472) .

U. MOSCO, F. SCARPINI, 01 On t h e approximation of s o m e complementarity s y s t e m s i n Sobolev s p a c e s , t o appear. M. NEDELEC,

Un algorithme dual pour l e problgme d e l a t o r s i o n elastoplastique d'une b a r r e , Colloq. d1Analyse N u m e r . , Supper B e s s e s , 1969-70.

U. Mosco

[I] Variational problemswith inequalities a s boundary conditions, Arch. Rat. Mech. Anal. 35 (1969), 83-113

J. C. C. NITSCHE,

Z. OPIAL,

.

Non expansive and monotone mappings in Banach spaces Lecture Notes Divis. Appl. Math. , Brown Univ. ,1967

-

[g Projection methods in non linear numerical functional analysis, J. Math. Mech. 17 (1967), 353-372

W. V. PETRYSHYN,

P. D. ROBINSON, El] Complementary Variational Frinciples, in Nonlinear Functional Analysis and applications,edited by L. B. Rall, Academic P r e s s , 1971 R. T . ROCKAFELLAR, [I] Characterization of the subdifferentials of convex 17 (1966), 497-510 functions, Pacific J. Math. -

.

[21 Extension of Fenchel's duality theorem for convex func3 (1966), 81-90. tions, Duke Math. J. 3 On the virtual convexity of the ,domain and range of a ncn linear maximal monotone operator, Math. Annalen

@] Local boundedness of nonlinear monotone operators Michigan Math. J.

b]

On the maximal monotonicity of subdifferential mappings, Michigan Math. J.

@

Convex functions, monotone operators and variational inequalities, Proc. NATO Study Inst. on Theory and Appl. of Monotone Operators, Venezia 1968, Oderisi Ed. , 1969, 231-247. Convex analysis, Princeton Univ. P r e s s . , Princeton,

. 1970

Su alcuni sistemi di complementaritP F. SCARPINI and T. VALDINOCI, connessi a disequazioni variazionali d i tip0 ellittico, to appear& C - c l ~ l O.

fi]

A. SCHIAFFINO, Su un problema di disequazioni variazionali p e r operat o r i differenziali ordinari, Boll. U. M. I. (1969), 25-35 * M. SIBONY,

@J Sur l'approximation d16quatios et inequations aux d6riv6es partielles non lineaires de type monotone, to appear in J. Math. Anal,. Appl.

@]

Methodes iteratives pour l e s dquations et inequations aux 7 dkrivkes partielles non lineaires de type monotone, Calcolo (1970), 65-183.

U. Mosco G. STAMPACCHIA, [I] F o r m e s b i l i n e a i r e s c o e r c i t i v e s s u r l e s e n s e m b l e s convexes. C. R. Acad. Sc. P a r i s 258. (19641, 4413-4416. [21 On t h e regularity of solutions of variational inequalities Int. Conf. on Functional Anal. , Tokyo, 1969. 131 Variational inequalities, P r o c . NATO Adv. Study Inst. Venezia 1968, O d e r i s i E d . , 1969, 101-191.

.

k]

Regularity of solutions of s o m e variational inequalities, P r o c . Amer. Math. Soc. Symp. P u r e Math. XVIII (1970), 271-281.

151 On a problem of n u m e r i c a l a n a l y s i s connected with the t h e o r y of variational inequalities, I. E . I. , CNR, P i s a , Nota interna ~ 7 2 1 5 ,1972

.

b]

J. STOER and C. WITZGALL, Convexity and optimization i n finite dimen s i o n s I , Springer Verlay, Berlin, 1970 R. TEMAM,

[l] Solutions g6nbralisees d16quations non 1inCaires non unifor mbment elliptiques, Publ. Math. d1Orsay, Univ. Paris XI, 1970-71.

T . W. TING,

[I] Elastic-plastic t o r s i o n P r o b l e m , Arch. Rat. Mech. Anal. 25 (19671, 342-366 1

h]

Variational methods f o r the study of nonlinear o p e r a t o r s , M. M. VAINBERG, Holden Day; San F r a n c i s c o C a l . , 1964

.

LE probl$me d e la minimisation d e s fonctionnelles non l i n e a i r e s , i n P r o b l e m s i n Non-linear Analysis, CIME Varenna 1970, C r e m o n e s e Ed., Roma, 1971. M. M. VAINBERG and R. I. KACHUROVSKI, [fJ .On t h e variational t h e o r y of non-linear o p e r a t o r s and equations, Dokl. Akad. Nank. SSSR 129 (1959), 1199-1202. H. d e VEIGA

111 Sulla holderianitP d e l l e soluzioni d i alcune disequaziopi

variazionali con condizioni unilaterali a 1 bordo, Annali Mat. 83 (1969), 73-112. P u r a Appl. -

[23 R6gulaPitB pour u n e c l e s s e d'inequations non linbaires, C. R. Acad. Sci. P a r i s 271 (1970), 23-25. R. A. WIJSMm. [I] Convergence of sequences of convex s e t s , cones and functions, Bull. A m e r . Math. Soc. 70 (1964), 186-188; idem, P a r t . 11, T r a n s . Amer. Math. Soc. ( 1 9 6 3 , 32-45

' U . Mosco

E. H. ZARANTONELLD, [d Solving functional equations by contractive averaging, Tech. Rep. N. 160, U. S. Army R e s e a r c h C e n t e r Madison, Wisconsin, 196 0.

CENTRO INTERNAZIONALE MATEMATICO ESTIVO ( C . I. M. E . )

4.

SINGER

BEST APPROXIMATION IN NORMED LINEAR SPACES

G o r s o tenuto ad Erice d a i 27

giugno a1 7 luglio

1971

Beat approximation in normed linear spaces

by Ivan Singer (University of Bucharest)

Contents -Introduction. 1. Characterizations of elements of best approximation.

1. 1. The first main theorem of characterization. 1 . 2 . The second main theorem of characterization.

1 . 3 . Differential characterizations. 1 . 4 . Other characterizations.

2. Existence of elements of best approximation. 2. 1. Characterizations of proximinal linear subspaces. 2.2. Some classes of proximinal linear subspaces. 2.3. Normed linear spaces in which all linear subspaces a r e proximinal. 2.4. Normed linear spaces which a r e proximinal in e: t r y superspaces.

2.5. Transitivity of proximinality. 2.6. Proximinality and quotient spaces. 2.7. Very non-proximinal linear subspaces. 3 . Uniqueness of elements of best approximation.

3.1. Characterizations of semi-zebygev and Eebygev subspaces. 3.2. Existence of semi-Eeby~evand Eebyt3ev subspaces. 3.3. Normed linear spaces in which all linear (respectively, all closed linear) subspaces a r e semi-Eebygev (respectively, Sebygev) subspaces.

I. Singer

3.4. semi-Eebygev and Eebygev sybspaces and quotient spaces. 3.5. Strongly unique elements of best approximation. Strongly zebyxev subspaces. Interpolating subspaces. 3.6. Almost Eebygev subspaces. k-semi-8ebygev and k-8eby;ev paces. pseudo-?ebygev

subs-

subspaces

3.7. Very non-Cebysev subspaces. 4. P r o p e r t i e s of m e t r i c projections. 4. 1. Definition and some properties of m e t r i c projections. 4 . 2 . Continuity of m e t r i c projections. 4. 3. Weak continuity of m e t r i c projections. 4.4. Lipschitzian m e t r i c projections. 4.5. Differentiability of m e t r i c projections. 4. 6. Linearity of metric projections. 4. 7. Semi-continuity and continuity of set-valued m e t r i c projections. 4. 8. Continuous selections and linear selections f o r setvalued metric projections

.

5. Best approximation 5.1. Best approximation

by elements of non-linear s e t s . by elements of convex s e t s .

5. 2. Best approximation by elements of N-parameter sets. 5. 3. Generalizations. 5.4. Best approximation by elements of a r b i t r a r y s e t s .

I. Singer

Introduction Here we want to present briefly some results, problems and directions of research in the modern theory of best approximation, i. e. in which the methods of functional analysis a r e applied in a consequent manner. In this theory the functions to be approximated and the approximating functions a r e

regarded a s elements of certain normed linear (or,

more generally, of certain metric) spaces of functions and best approximation amounts to finding "nearest pointsf1. The

advan$ages and a brief

history of this modern point of view have been described in the Introduction to the monograph

1821 and we shall not repeat them here; the m a -

terial which will be presented in the sequel will be convincing enough, we hope, to prove again that the theory of best approximation in normed linear spaces constitutes both a rigorous theoretical foundation for the existing classical and more recent results in various concrete spaces and a powerful1 tool for obtaining new results, solving the new problems which appear. Since June 1966, when the Romanian version of the monograph [82] has gone to print, the theory of best approximation in normed linear spaces has developed rapidly and the number of papers in this field is growing co~tinuously. However, except the expository paper up to 1967, by A. L. Garkavi

[23]

[31]

, which appeared in 1969, and the bibltography

compiled by F. Deutsch and J. Lambert in 1970, we know of no

other survey material on these new developments. One of the aims of our course is to fill this gap to a certain extent, by presenting much new material which appeared after the monograph

[82]

. In this

re-

spect the present course, though self-contained, may be regarded a s an up to date complement to the monograph

[82]

; however,

the

biblio-

graphy does not aim at begin complete, but wants merely to give useful

I. Singer

orientation to the reader

. Naturally,

since another aim of the course

is to introduce the non-specidits to this field , some overlapping with

the material of the monograph

[ ~ z Ji s

unavoidable; however, even this

part is presented here in a slightly improved way. We shall give here only a few simple proofs, but for all results we shall give references. Even with this I1economyt1in proofs, some important topics had to be omitted.. The reader i s assumed to know some elements of functional analysis and integration theory, but we shall

recall, whenever necessary, the

definition of the notions (especially those of the geometry of normed linear spaces) which will be used in the sequel. We acknowledge with pleasure that we benefited from attending s e minar lectures on metric projections by F. R. Deutsch (at Pennsylvania State University, in 1968) and G. Godini (at the Institute of Mathematics of the Academy, Bucharest, in 1970/71).

I. Singer

1

.

Characterizations of elements of best approximation.

The first main theorem of characterization

1. 1 .

Throughout the sequel, without any special mention, we shall denote by

p

the distance in a metric space

p

E i s a normed linear space,

E

and, in particular, if

will denote the distance in

E

indu-

ced by the norm, i. e.

Definition 1.1. x E E. An element of

x

Let

g € 0

E 'be a metric space,

denote by

go

a set in

E

and

G i s called an element of best approximation

(by the elements of the set

i. e., if

G

is "nearestIf to

x

G ) if we have

among the elements of

PG(x) the set of all such elements

G ; we shall

go, i. e.

It i s natural to consider f i r s t the problem of characterization of elements of best approximation, i. e. the probelm of finding necessary and sufficient conditions in order that

g

0

E FG(x), since these results-

will be applied to solve the other problems on best approximation (e. g.

-

those of existence and uniqueness of elements of best approximation, etc

I. Singer

Also, the characterization theorems in concrete spaces (see e. g. the llaltemation theoremv11.7 below) a r e convenient tools for verifying whether o r not a given

go

satisfies

G ( ~ ) ,since they a r e easier

go

to use 'than (1. 2). Since we have obviously

it will be sufficient to characterize the elements of best approximation of the elements

x E: E

5.

\

In order to exclude the case when such

elements do not exist, in the sequel we shall assume, without any special mention, that

5

f

E

.

Unless otherwise stated, the field of scalars for all (general o r concrete) normed linear spaces considered ib the sequel can be either the field of complex numbers o r the field of realenumbem. The first main theorem of characterizatio; of elements of best approximation by elements of linear subspaces in normed linear spaces is the following (see 1 8 2 3

, p. 18)

Theorem 1.1. Let E subspace of

E, x E E \

if and only if there exists an

:

be a normed linear space, and f E E*

go E G

. We

such that

have

G

a linear

go E

TG(X)

I. Singer

E

We recall that

*

denotes the conjugate space of

space of a l l continuous linear functionals on

E, i. e. the

E , endowed with the usual

vector opePations and with the norm

To p m e theorem 1.1, assume that

x E E \

G

, we have

go

IIx-- g

p ( x , G) =

0

E TG(x)

(1

3

a corollary of the Hahn-Banach theorem (see e.g. ma IZ), t h e r e exists an

II f o I I

=

0

.

Then, since

and hence, by

, p. 6 4 , l e m -

[25]

fO E E* such that

1

II.-goo

Then the functional .f =

fo(g) = 0 ( g E G)

11 x

- 1 7).

Conversely, if there is an

f o r any

g E G

we have

11.

- gO1l =

If(.

- so)l= If(.

- go\( fo f 6

-

gl

E

and E

* satisfies

E * satisfying

< llfll

IIx

f (x) = 1 0

.

(1.5)-

(1.5)-(1.7), then

- sll =

IIx

-

gll

I. Singer and'ehence

go E

pG(x) , which completes

the proof.

It is easy to s e e that theorem 1.1 admits the following geometrical interpretation :

We have

a closed hyperplane

H

such that

E

(i. e., a closed

dim B/H = 1) containing

p (HBS(x,Ilx -

go(( 1) =

o

?#

Any functional

f CZE

T G ( x ) if and only if there exists

go

G

n

Int S(X,

satisfying (1.5)

"maximal functional" of the element

H

, which supports the cell

H

and

linear suhspace

x

-

go

and

11

-

go 11 ) =

9).

(1. 7) is called a

(because we have

The usefuhes-6 of theorem. I. 1 for ap~lictiti0ns:inxdious concrete normeh linear spaces i s due to the fact that for these spaces the general form of maximal functionals of the elements of the space i s well _known and simple (see e. g.

[73J ,

[99I

)

.

Let- us give now some examples

of applications of theorem 1.1 in concrete spaces. We shall use the word llcompacttl in the sense of i; e.: bicompact Hausdorff. F o r mmpact space

C(Q), respectively

Q

N. Bourbaki,

we shall denote by

C (Q), the space of all complex o r real, respective-

R

l y of all r e d continuous functions on operations and with the norm

Q , endowed with the usual vector

I. Singer

Using the general form of maximal functionis of the elements of C (Q), from theorem 1 . 1 we obtain (see [82] , p. 33 ) : R Theorem 1 . 2 . Let E = CR(Q) (Q compact), G a linear subspa-

ce of

E

.

.

x EE \

and -

go.€ G

. We have

only if there exist two disjoint closed subsets a Radon measure

where

S ( p)

/" 2

&,

such that

Y+

go 6 PG(x) if and

go

denotes the c a r r i e r of the measure

, Ygo

of

-

Q

and

/L" .

One can also give a characterization theorem in the spaces E = C(Q) (see

[82]

, p. 2 9 )

.

Theorem 1 . 2 , which appeared in [79]

,

has constituted the first theorem' of characterization in- E = C (Q) (even R in E = CR( [a, b] )) of elements of best approximation by elements o? linear subspaces

G

-

of arbitrary dimehsion.

I. Singer F o r a positive measure space p - f i e l d of subsets of

T

(T, 3 ) (we shall not specify the

on which the measure

3

i s defined; this

will cause no confusion ) and for 1 $ p ,< co (respectively p = a) we shall denote by L P(T, 3 ) the space of all equivalence classes of functions with

3 -integrable

p-th power (respectively 'of

3 -essentially bounded functions on

and

3 -measurable

T) , endowed with the usual

vector operations and with the norm

(respectively,

11 x 11 =

ess sup t E T

I c(t

)I;

for simplicity, we use .here the same notation f o r a function and for its equivalence class in

L'.

Again the subscript

R

will mean, both here

and for the dpaces occurring in the sequel, that we restrict ourselves to real scalars. F o r a function

x1 on

T

we shall use the notation

Using the general form of maximal functionals of the elements of 1

L (T, p.

3

), we obtain from theorem 1 . 1 the following theorem (see [82]

46 ),

lin [51]

which was obtained initially by

T.

Riv-

with different (function-theoretic) methods and with the above func-

tional 'analytic method$ in 1811 Theorem 1 . 3 . measure bpace) , G go E G

B. 'R. Kripke and J.

,

. We

have

Let

:

1 E = L (T, 3 ) (where (T, 3 )

a linear subspace of

E,

go 6 FG(x) if and only if

x € E \

is a positive

5 'and

I. Singer

We recall that f o r a complex number -i a r g d

sign o( = e and that sign 0 = 0 For

% we

results ( s e e [82]

3 ) with

€

G

b) Let E = x E E \

where

(x, y)

.

< p < co and f o r an abstract inner

:

-E

and go -

i

obtain from theorem 1.1 the following well known

, pp. 56-57) 1

o(

Id1

Theorem 1.4. a) Let s u r e space and

< p

a subset of the unit circumference

Moreover, if

Q

is homeomorphic to the whole unit circumfe-

rence, then every r e a l Cebys'ev system on

Q

consists of an odd num-

b e r of elements. This result has been conjectured by S. Mazur and proved by J. C. Mairhuber under the assumption that

Q

is a subset of a finite

-

-dimensional euclidean space; f o r general compact spaces it has been proved by K. Siekluki and P. C. Curtis Jr. (see 1 8 2 1 , pp. 218-222, where a proof due to I.. J. Schoenberg and C. T. Yang is presented; f o r the l a s t statement of theorem 3. 7 s e e e. g. [6l] , p. 26 ). F o r the c a s e of complex s c a l a r s the problem is still open (only

I. Singer

partial results a r e known, s e e C827, p. 222)

.

F r o m theorems 3. 7 and 3 . 4 one can deduce the following result, d ~ to e R. R. Phelps (see [82] , p. 222) :

E = L w (T, 3 ), where (T, 3 ) R & -finite 'positive measure space such that dim E = oo ,: has no 6ebyCorollary 3.2. The space

3 2 (however, i t does have ?ebygev

gev subspace of finite dimension

subspaces of dimension 1, even in the case of complex scalars)

.

It id natural to consider the similar problem f o r subspaces of finite codimension, i. e. the problem of characterizing those compact spaces

Q

f o r which

C(Q)

contains a semi-6eby;ev

subspace of finite codimension t r i c compact -

spaces

Q

n >, 1

.

The problem is solved f o r

and r e a l scalars, by the

to A. L. Garkavi (see [82]

,

o r a Eebygev

following results,

pp. 314 and 325, footnote, and [33]

Theorem 3.8. a ) F o r every infinite compact metric space every integer

n

with

6ebygev subspaces

G

1 ,< n

, 2

I. Singer

(see

[a27 ,

.

ch. III)

One can also consider the analogous problem f o r the spaces

on positive measure spaces

(T, ), )

. For

real s c a l a r s we have the fol-

lowing results corresponding to theorems 3.4 and 3.8 b), which a r e due, in the case when (T, 3 )

is

W-finite, to A. L. Garkavi ( s e e [82],

p. 233 and p. 331) : Theorem 3.9. Let (T, 3 ) be a positive measure space such 1 The following statethat LR (T, Y )*" L: (T, ), ) and l e t n 2, 1 -

.

ments a r e equivalent : 1 1.' L (T, 3 ) has an n-dimensional z e b y ~ e vsubspace. R

1

2O. L (T, 3 ) R 3'.

(T, 3 )

has a t least

We recall that an atom of with

B

3 (A) > 0, such that if

then either

Z) (B) = 0

n

.

(T, 3 ) is a measurable s e t

A

C

is any measurable subset of

A ,

has a . ceby8ev subspace of codimension

or

n

.

atoms

3 (A \ B) =

0

.

T

F r o m theorem 3.9 it fol-

if (T, 3 ) lows, in particular, that -

has no atoms (e. g. if T = [0, 1 1 and 3 is the Lebesgue measure) L 1 (T, 3 ) has no Eebygev subR spaces of finite dimension o r codirnension. F o r Eeby:ev subspaces of

then

finite dimension this l a t t e r result is known to hold also f o r complex p. 230-232), but no extension of theorem 3.9 to (see [82], 1 complex L (T, I> ) spaces is known .

scalars

E. W. Cheney and D. E. Wulbert have proved ( 1 2 0 3 , theo1 Y r e m 34) that E = CR(Q, P ) (Q compact, S ( 3 ) = Q, contains a Cebygev subspace of codimension

n

if and only if

Q

has at least

n

I. Singer

lated points. We conclude with the following problem of A. L. Garkavi (see 83

, p. 2 and 31

, p. 9 6 ) :

Problem 3 . 2 . Does the space subspace

G

C ( 0, 1 ) possess a Cebysev R of infinite dimension and infinite codimension?

Recently D. E. Wulbert

Q

spaces

95

has shown that there exist compact

such that the analogue of problem 3 . 2 f o r

affirmative answer

. It

i s also known ( s e e

CR(Q) has an

, p. 332) that

82

L;(

0, 1

has Cebysev subspaces of infinite dimension and infinite codimension. 3 . 3 . Normed linear spaces in which all linear (respectively, all

closed linear) subspaces a r e semi-Cebygev (respectively, c e byzev) subspaces

.

We recall that a normed linear space

E

is said to be strictly

convex (or rotund) if the relations

imply the existence of a

c

> 0 such that

It is well known and easy to show that this property is equivaJ

lent to each of the following properties : a) :Fr SE = g ( S E ) ;

b ) ' ~ r SE

contains no segment ; c) each

f E E*

has at most one maximal element.

Using theorem 3. 1, one obtains (see [82 Theorem 3 . 1 0 .

1,

F o r a normed linear space

statements --. -a r e equivalent:

p. 110)' : E

the .following

)

I. Singer

.

1'

2

0

sion n -

v

E

a r e semi-Cebysev subspaces.

. All linear s u b s ~ a c e sof

E

of a certain fixed finite dimen-

, where

1 4 n

.

codimension

v

v

4 dim E - 1 , a r e semi-Cebysev (or, what is e -

quivalent, Eebyzev) subspaces 3O

"

All linear subspaces of

. E

All closed linear subspaces of 1 ,< m

of a certain fixed finite

4 dim E - 1 , a r e semi-Eebygev sub-

m

, where

E

is strictly convex

sDaces. 4'

.

82 ,

Combining this with

.

theorem 2.5, we obtain (see [82]

,

p. 111) : Theorem 3.11. F o r a Banach space

E

the following statements

a r e equivalent : 1'

. All

2'

.

codimension 3'

.

closed linear subspaces of

E

a r e <ebyzev subspaces.

All closed linear subspaces of

E

of a certain fixed finite

- 1,

a r e eebyzev subspaces.

, where

E

is reflexive and strictly convex

In particular, re

(T, 3 )

1 1,< m g dim E

m

since the spaces

.

E = L'(T,

3 ) (1

 0

i s the unique element

for every

g E G \

Ig0)

go). The following characterization of such

elemehts is due, essentially (namely) for

g = 0

and

11 x 11

= 1). to

D. E . Wulbert 1977 : be a linear subspace of a r e a l normed

Proposition 3 . 3 . Let G linear space

E

. -An

element

go E G

best approximation of an element a constant

r = r(x, G)

with 0 -

is a strongly unique element of

x E E \

-

G

if and only if

< r ,< 1 such that

there exists

I. Singer

f(g)

2 r IIg

11

(g E G )

>

where

Definition 3.3. A 6ebyzev s e t to be a strongly 8eby:ev

G

set if every

lement of best approximation in

G

x

in a metric space C

E

has a strongly unique e -

.

D. J. Newman and H. S. Shapiro ( s e e [lg], tively, D. E. Wulbert [97]

3)

80) and, respec-

p.

have proved

Theorem 3.13. In the spaces ((T,

is said

E

a positive measure space)

1 CR(Q) (Q compact) and LR(T, 3 ) every finite-dimensional Cebygev

subspace i s a strongly Eebygev subspace. has observed that

On the other hand, D. E . Wulbert [97] smooth normed linear space

E

a

no Eebygev subspace is strongly Eeby-

sev . V

We recall that there exists only one Definition 3.4. linear space

E

E

is said to be smooth

f = f

X

E E*

c

. ..,cn

11 f 11

G

E

x

f(x) =

= 1

An n-dimensional linear subspace

11 x (I.

of normed

is called an interpolating subspace if f o r any

n

line-

. ..,f n

n

num-

a r l y independent extremal points bers

such that

if f o r every

fl.

there exists exactly one

(3.23)

f.(g) = c

J

j

of

g E G

and

any

such that

( j = 1, ..., n)

In a r b i t r a r y normed linear spaces such considered in [76],

SE

.

subspaces have been f i r s t

where it was proved that they a r e Cebygev subspa-

I. Singer ces (indeed, -

this i s a consequence of theorem 3.3; s e e C82] , pp. 213-

-214) and that the converse need not hold

even if dim E < co

.

Recen-

tly D.A. Ault, F. R. Deutsch, P. D. Morris and J. E . Olson [ 3 I have studied best approximation by elements of interpolating subspaces, proving, among other results, the following :

G

Theorem 3.14. Every interpolating subspace linear space

E

of a normed

i s a strongly zebygev subspace.

F r o m th.e Haar-Kolmogorov theorem (theorem 3.4 above) it folG f E = C(Q)

lows that a finite-dimensional subspace

is a 6ebygev

subspace if and only if it is an interpolating subspace ; this, together with theorem 3. 14, implies again the f i r s t - part of theorem 3.13. On the other hand, fkom theorems 3 . 4 and the observation made after theorem 3.13 it follows that (in particular, the

1 1 the situat?on

and

ting subspace of any finite dimension

1'

rn

n

is quite opposite,

contain no proper interpola-

> 1

(J. H. Biggs, F. R. Deutsch,

R. E. Huff, P. D. Morris and J . E. Olson [4] ); on the other hand, it is c l e a r that the unit vector

{ 1 , 0 , 0 , . . .)

in the complex spaces

I. Singer

l1 o r

spans a one-dimensional interpolating subspace.

lm

3.6. Almost Cebygev subspaces. k-semi-eebygev and k-?ebygev subspaces. pseudo -6ebyZev subspaces. We shall consider now some generalizations of semi-Eebygev and E e b y ~ e vsubspaces.

A set

Definition 3.5.

G

in a metric space

most Gebys'ev s e t if the s e t of a l l

f o r which

x € E

is called an al-

E

pG(x)

does' not

consist of a single element forms a s e t at most of the f i r s t category in

E

. Almost Eebygev linear subspaces of normed linear spaces have

been introduced by A. L. Garkavi (see [82],

p. 116), since they ha-

ve the advantage that in every separable Bansch space E almost 6eby;ev space

there exist

However, the Banach

subspaces of any finite dimension.

E(1) of section 3 . 2 has no almost 6ebyzev subspace of infinite

dimension. F o r results on finite-dimensional almost 6eby;ev of

CR(Q) (Q compact) s e e [82

224-225.

A lin'ear subspace

Definition 3.6. E

1, p.

subspaces

G

of a normed linear space

is called a k-semi-Cebyzev subspace, respectively a k-Eebyzev sub-

space. (where

(3.24)

is an integer with

k

-1 ,< dim

50G (x),< k

0 ,< k

e of

3 . 2 and all infinite-dimensional closed

.

a r e very non-<eby:ev

subspaces.

the -space E(1)

of section

linear subspaces of

E = c

0

I. Singer

Properties -

4.

of m e t r i c projections.

4. 1 . Definition and..some properties of m e t r i c projections If G is a s e t in a m e t r i c space E, we shall deno-

Definition

4. 1. a)

t e by

the multi-valued mapping

?r

G

D(% ) G

+G

defined by

this mapping should be distinguished from the "set-valued m e t r i c projection"

pG defined

the function

y(t)

=

in section 4.. 1 ( the reader may compare %

6

on

LO,oo)

with

).

D(tCG) = E and

b) In the particular c a s e when lued (i.e. when G i s a Eebyiev s e t ), tion - of E

G

i s one-va-

IG is called the m e t r i c projec-

onto G .

Some properties of the mappings %

(and hence, in particular , G of m e t r i c projections) onto linear. subspaces of normed linear spaces a r e collected in Let E be a normed linear space and G a linear subTheorem 4. 1. space of E. Then a)

G

D( XG)

C

and

7tG is one-valued on G, namely , T G ( g ) =

= g f o r a l l g EG. Hence , ifxeD(TZ

have (4. 2 )

7C

2

i. e. the mapping b) We have

(XI G

=

nG (XI

is idempotent

G

), then TC

-

(X E

G

(x) E D (XG) and we

D(TG)),

I, Singer

z-

is continuous at every point g & G (i. e, x -+ g E G n plies that f o r any I (xn)E (xn) we have 7CG (xn) --,nG (g)=g) G c)

X

pG

d)

IfG

1

i s a linear subspace of G, we have

If, in addition, the mapping .Stl i s one-valued on D ( V ) (i&f G G i s a semi-eebygev subspace),

G

then

e ) If - x€D(n

G

)

and gciG, then - x + g c D ( XG) -

and we have

i s quasi-additive.

i. e. q -

f) if x E D ( X G )

d is an a r b i t r a r y s c a l a r ,

then d

x e D("iYG)

and we have (4.8)

(x&D(TG) ,d=scalar),

XG(O(x) =d'K (x) G

i. e. n G

is homogeneous.

g) If G is closed and x n e D(71 ) lim G 'n-a, x cD(TG)

and

TCG(x) = g ,

i.e.

TtG

x =x, lim n G ( x n ) = g , n n+oo 6

is closed.

The proofs a r e straightforward ; s e e

[82]

, pp. 140- 142 and 390.

Part. a ) shows that in the particular case when D (If ) = E and dG G i s one-valued , the metric projection I K is indeed a ( n ~ n l i n e a r )closed G projection of E onto G. Some ~ u t h o r suse for the m e t r i c projection 'K G

I. Singer the t e r m n o r m a l projection , o r best approximation operator, o r nearest point map, 4. 2.

o r cebygev map

.

Continuity of m e t r i c projections.

The main characterization of 6ebyzev subspaces G with continuous m e t r i c projection K '

G

is the following result, due t o R. B. Holmes ([38],

theorem 6; in the particular c a s e when E / G is reflexive, this result also follows from

[84]

, theorem 3):

Theorem 4. 2. F o r a Cebyiev subspace G of normed linear space

E the m e t r i c projection 7C is continuous if and only if the G tion w = w of the canonical mapping w . E + E / G G ' G ln-k(o) 1 s e t rrG (0) is a homeomorphism of %-A (0) g o E / G . -

restrict o the

Proof. Since G is a EebysVev subspace , by $ 3 , proposition 3. 1 is one-to-one. K r t h e r m o r e , w

w

is always conti.nuous and a mapping

onto E / G , since for any x + G E E / G we have x-IT (X)E%-'(O) and G uG(x - WG(x)) = X. + G. Thus, the condition that w = w G x - ~ ( o be ) a

I

-1 homeomorphism onto E / G i s equivalent to the continuity of w . - 1 i s continuous and let x x € . E , ,himrn Ilx - x Aseume now that w n' n Then by the preceding r e m a r k , x whence

and thus TC i s continuous. G

n

-

TC

G

11 = 0.

(x -)= w - l ( x +G), x - T (x)=w-'(x+G), n n G

I. Singer Conversely, assume now that 'IY x+GEE/G, lim (x the point w

-1

x +G, n E > 0. Since 7CG is continuous at

G

+G) = x+G and

is continuous and let

(x+G)ETt-' (0). there exisls a G

11 z - w - 1(xtG) 11
0

(,).n

G

(W

such

that

-1

(x+G)

11

N). n n c j n n Therefore x

n

have 'K (x ) G n z

n

G

z c ~ I ~ ~ z - w - l ( x + ~<min )ll

- z n € G and hence , by the quasi-additivity of XG, we = 71: (x - z + z )= x - z + RG(zn) . Consequently, since G

n

n

n

n

n

V ( O N ) , we obtain

and thus 71 i s continuous, which completes the proof of theorem 4 . 2 . G Note that induced by I

w - I i s nothing else than the mapping E / G + W ~ - ' ( O )

- KG,

where I denotes the identical mapping of E onto E.

Since for every

-1

xtXG

(0) we have

w

-1

(x+G)

=

x

- TLG (x)=x,

theorem 4. 2 can be also rephrased a s follows ; 9 ! is continuous if G and only if the relations x , x & % i l (0) , nvm f) (xn - x,G) = 0 n

I. Singer

imply

lim n-+w

11 x n

x

11

.=

0.

Corollary 4. 1. Let E be a normed linear space. Then a)

A

G ( E + ,E )

- closed 6,ebys'ev subspace

of E

admits a conti

nuous m e t r i c projectionTlri f and only if the (uniquely determined) extension mqp (Q E(

rL)*+

-f 1

f E E* with

b) If G i s a 6ebygev property

=

5

11 f 11 = Ilyll,

,

subspace of E , such that G

I

i s continuous. C

E*=

(U) and that the extension map ip€(GL )* $ f E** with -

H $ 11 = 11 (Q 11.

aGis

is continuous, & t

4 IG~=.(P,

continuous.

Indeed, this follows f r o m theorem 4. 2 by the.arguments of fj 3, proof of corollary 3 . 1. A. direct proof of corollary 4. 1 a ) has been given by J. Lindenstrauss

( [57J,

4 7),

but the proof sketched h e r e is

simplep Some other, m o r e elementary, characterizations of the continuity of llG. due t o E. W. Cheney-D. E . Wulbert

C203 and R. B. Holmes [38]

a r e collected in Proposition 4. 1.

F o r a 6ebygev subspace G of a normed linear

space E the following statements a r e equivalent : lf

The m e t r i c projection 'lX

20

rGcontinuous

G

is continuous.

at each point of

-1

T G (0).

The direct sum decomposition E = G 63 T t - ' (0) is topologiG cal ( i . , lirn x = x if and only if l i m 7ZG(xn) = xG(x) n-+w n n+w 3:

5

40 %G

IA. ( G)

is continuous , where

I. Singer The functional

5P

i s continuous. 69 The mapping

% :E

- 1 (0) n Fr SE defined by

\ G

TG

i s continuous. Proof. The implication not hold, say 4

x

x

2Ois obvious. Conversely; if lo does

lo+

x' , X G (xn) +xG(x),

-1

- n G ( x ) e n G (o), but reG(xn -

dicting 2O

Thus,

TC

G

then

xn

- TtG (XI

(XI)= K G (xn )-TGh)+o, contra-

1°*20

The equivalence

1 ° e 3 0 a n d the implication lo ==$ 5' - 1 (0) , t h e n Conversely, if we have 5' and x ---+ x e ?t n G

1G

x n

that

5'-

-

x

1

=

1

XG x

1

T

G1

a r e obvious.

= 0. which proves

2'

The implication 1 ° 3 6 0 i s also obvious. Furthermore, if we have 6'

and if

x . x E A (G), xn 4 x, then n 1

which proves that

6*'

Assume finally

4. 1

4O that we have 4Oand let

x -+x. Since by theorem n c) 'KG is always continuous at the points x EG, we may assume Xn €A (G) G, hence xn# G f o r n>N. Then 11 x,-

rrG(xn

I. Singer f o r n > Nand by theorem 4. 1 b) , formula (4. 3 1. IIxn

-A

-+ 11 x-.n,cx, II .

Therefore

x

n

11 xn- ~ ~ ( ~* ~ ) l l

- ZG(xn)11 + whence, by 4',

= nG(x) ;

thus, 4 ' 3 lo, which completes the proof of proposition 4. 1. No theerem is knoun in concrete spaces about characterization

6f Eebys'ev subspaces Gof arbitrary dimension with a continuous metric projention. Let us consider now, in arbitrary normed linear spaces, the problem of characterization of eebygev subspaces G with continuous mewhen we have restrictions on dim G o r codim G, t r i c projection 71: G ' o r restrictions on the quotient space E/G. Theorem 4. 3. F o r every finite-dimensional ?ebygev subspace G is continuous. G The proof is straightforward, using a compactness argument

of a normed linear space E, the metric projection 'IC (see

[82]

, pp.251 and 386-3.90).

or Zebygev subspaces Gof codikension 1 , we have even a

stronger property of TG (see [82],

pp. 142- 145):

Theorem 4.4. F o r ePery eebygev hyperplane G in a normed linear space E , the metric projection It

G

is linear, and hence (by theo-

rem 4. 1 c)) continuous. We have the following characterizations of Eebygev subspaces of finite codimension with. continuous metric projections, due to CheneyWulbert

C20]

and Holmes

[38]

respectively :

I. Singer Theorem 4. 5 .

For 'a 6ebygev subspace

G

of finite codimen-

sion of a normedHnear space E ,the following statements a r e equivalent -

:

lo. -It i s continuous. G 1 20.7tG , (0) is boundedly compact, that is, interexects

every cell

S (x, r) C E in a compact set. 30. n:.(o)

n F r sE

is compact.

Proof. Since dim E/G < Furthermore, by whence (4 12)

f 3.

-1

wG(XG (0)

w

a r e compact. , SEIG and F r S -1 E/G.

formula (3. 71, Tf. G(0) f l F r

nFr

SE = F r

SE = { x E E I u w ~ ( x ) A = ~ ) .

SEIG. ,

where w

i s the canonical map E + E / G . Now, i f 'KG is continuous, G then, by theorem 4 . 2 , w = w Ifill(;) is a homeomorphism and hence, by (4. 13), we have 2'

.

Finally, since the implication 2O33Ois obvious,

let us assume that we have 3'.

Then, by (4. 12) and by the remarks ma-

de at the beginning of the proof of theorem

4. 2,

~ ~ 1 % :( b ) n F r SE

-1 i s a one-to-one continuous mapping of the compact set 'TCG (0)

n F r $E

and hence a homeomorphism. Therefore, onto the compact set F r S 1 E/G since both 'TKG (0) and E / G a r e star-shaped (by 8 2, pro~osition2. 2) since w is homogeneous (i. e . , w ( d x ) = d w (x)), it follows G G G i s a homeomorphism of I f 1 (0) onto E / G and easily that wG G hence, by theorem 4. 2, is continuous, which completes the proof of G theorem 4. 5. and

1. Singer Naturally, one can also prove theorem 4. 5 directly , i . e , without using theorem 4. 2.

Moreover, with a direct argument one can prove

that the relations 3°&20410 spaces G

pemain valid for a r b i t r a r y zebyzev sub-

[20].

F o r reflexive strictly convex Banach spaces E we have also another useful characterization of 6ebys'ev subspaces G

of finite codimension

due to R. B. Holmes [38] , in t e r m s of the with continuous IT G' spherical image map v:E*-+E defined a s follows : for f €E*, v(f)= the (unique) element x Theorem 4. 6. subspace G o f i n i t e

E such that

f(x)

11 f 11 . 11 x 11 . 11 X 11

=

=

11 f 11

F o r a ?ebyzev (or, equivalently, closed linear) codimension of a reflexive strictly convex Banach

space E the metric projection It i s continuous if and only if the r e G - ~ l i contis striction of the spherical image map v:E*--+ E to !GI nuous. In connection with theorem 4. 6 , note that we have always v(G')= -1 -1 I = 'lt (0) and, i f E i s smooth , then also v G ( n G( o ) ) = G Furthermore, v Glis one-to-one i f and only if E / G is smooth. Using

I

that v is a

duality mapu and hence a llmaximal monotone operator"

in the sense of F:Browder proved that

(see e. g. [16]

),

R. B. Holmes [38]

has

the sufficieoky part of theorem 4. 6 remains valid if in-

stead of m d i m G

0 there is a

4 (& )

1, llx-yll > & imply [21])

>0

(Ixtyll

such that the

relations

if

11% 11 =

- 2 ( 1 - J ( E ) ) . It i s well known

is a strictly convex

space of dimension 2 , then, by theorem 4. 4, TC is uniformaly LipschitG zian on E. R. B. Holmes and B. R. Kripke [39] have constructed examples of non-Hilbert

spaces % of any finite dimension such that 'NG i,s

uniformly Lipschitzian on E.

In this connection we alSo have (see [82]

, pp. 247-249 and 350):

I. Singer If - E is normed linear space of (finite o r infinite)

Theorem 4. 14. dimension

2

3 with the property that for every linear ( not necessa-

rily ?ebygev) subspace G of a certain fixed finite dimension n,or codimension n, where 1 4 n ,< dim E- 1, the (generally multi-valued) map-

ping %G satisfies ) \ (x) ~ 11 < llx 11 for a l l x 6 D ( x ) (hence, i n particular,

if 7CG

G

is1! contractive

then E is -

-

" on D

G

($

-

) i. e. Lipschitzian with constant I ) ,

G linearly isometric t o a Hilbert space.

4. 5.

Differentiabilitv of m e t r i c ~ r o i e c t i o n s .

The notions end results of this section a r e due t o R. B. Holmes and B. R. Kripke

[39J.

Definition 4. 2

If G is a Cebygev subspace df a normed linear

space E and x, y E.E and if the limit 'nG(x+ty - q x ) (4.17)

XIG (x, y) = t l+i m o

t

exists, then %IG (x, y) is called the ~ s t e a u xderivative of TI

G

at x

-

2

the direction y . The following observations a r e 'immediate : if either side exists.

a ) ?tTG(x,cy) = c f C ' G ( ~Y), b) ntG(g,y) = 'rtG(y) f o r a l l gaG, c) where

If x E E \ G and either

yGis

y4E.

%IG (x, y) o r

71 I G

(

Y G ( x ) ,y) exist,

a s in proposition 4. 1, then both exist and a r e .equal. G

If for a <ebys'ev subspace G of a normed #near -1 (x,y) exists for a l l x & a G (0) h F r SE, y E F r S; and -

Theorem 4. 15. space E if -

G sup- 1

x E'It

G

Ilnb

( 0 ) n F r SE

(x, y) 11 < ca,

then

G is Lipschitzian.

If x € ~ \ { ~ f a n dif f o r any y, z E E the function N(s, t ) =

11 x+sy+tz 11

I. Singer is twice continuously differentiable in a neighbourhood of (0, O ) ,

then

one can define a functional on E X E by

fixed

y, z E E . Moreover, using this observation and theorem 4. 16, one

obtains 4. 17. F o r every finite-dimensional linear subspace G -@ Theorem

E = L:

(T, 9 ), where -- 2 < p < m,

T(

G

(x, y) exists for all

x E E \ G, y QE.

The assumption h e r e that dim G < m, is essential, since one can give an example of an infinite-dimensional closed linear (hence 6ebygev) subspace G of

A:,

where 1 < p < m , p

ferentiable and non-pointwise Lipschitzian.

2 , such that fl

G

i s non-dif-

I. Singer Definition 4. 3 . If G a Eebygev subspace of a normed linear spa1 ce E, Tf is said t o be Frbchet - C on the open s e t E \ G i f there G

exists a continuous mapping u: E \G

4 L(E, G)

(where L(E, G) denotes

the space of all continuous linear mapping of E into G, with the unif o r m norm), such that % & (x, y) exists and

Using theorem 4. 15 one can show that if dim E

E such that X G then -flG Lipschitzian. a Cebylev subspace of

< a, and if G

i s Frechet C

1

' onJ' E -

1%

\ G,

-

Some m o r e results on the existence of ' X I (x, y) whenever Ilx-x 11 < 5 G 0 (for some x € E \ G ) and on 71G satisfying a Lipschitz condition for 0

all x in a neighbourhood of x

0

and a l l

.

y e E , a r e given in p 9 ]

.

of m e t r i c projections. 4. 6 . Linearity --

"

'Y

By theorem 4. 1. f), for any semi-Cebysev subspace

G the lineari-

ty of 71' on D(Q ) is equivalent to i t s additivity on D ( n ) . G G G The main characterization of eebygev subspaces G with linear mec

t r i c projection 'KG is the following analogue of theorem 4. 2, due to R. B. Holmes

~387.

Theorem 4. 18. F o r a Eebygev subspace G of a normed linear space

E

the m e t r i c projection

nG

I

is linear if and only

if w = w , G T C..k ( 0 ) i s an iso'metric (i. e . , distance-preserving) mapping of ~ ' ( 0 ) onto - G -E/G. Note that, a s was observed in the proof of theorem 4. 2 and in $3, formula (3. 7)

. for any EebysVev subspace G,w- 1

is a one-to-

= w

one continuous norm-preserving mapping of llG(0) o s o E/G. One can also give a corollary of theorem 4. 18, similar to corollary 4. 1. Some other characterizations of the linearity of p. 144 and in C39] , theorem 3 , a r e collected in

wC,' given in -

[82],

I. Singer Proposition 4. 7. F o r a semi-6ebyZev subspace G of a normed linear space E the following statements a r e equivalent : i s one-valued and linear on D ( fC ). G G 1 % (0) i s a closed linear subspace of E. G '

lo. % . ' 2

(0) is convex.

3'. K:'

v

- in addition, G i s proximinal (and hence a ~ e b y s ' e vsubspace), If, these statements a r e equivalent t o the following : 4

0

-1 . nG (0)

contains a linear s u b s p a c e , F of E such -- that

E = G + F . 5'.

!( %,

(4.21)

(x+Y) 6P

There exists a constant

/I

11 < 'Tf

G

K

+ Il%(y)ll

G

such that

1 (x, y € E l

i s continuously Gateaux differentiable.

In theorem 4. 4 it was established that for every Zebygev hyperplane G i n a norm.ed linear space E , % i s linear. There we also obG served that whenever N onto a Eebygev subspace G is linear, it i s G also continuous and hence a bounded linear projection ; thus a necessar y condition in o r d e r that % be linear i s that G be complemented G in - E . Moreover, in this case, by theorem 4. 1 b) we have 1 - 2 of- E

= C (Q)

R

(Q c o m p a ~ t ) .then -

9G is not

upp*

.semi-

I. Singer b) F o r a pseudo-6eby8ev subspace G o f E = C (Q), i n o r d e r that

R

PGbe lower

semi-continuous it is necessary, and if

PGis upper 4

semi-continuous , also sufficient , that for every xgT( (0) the set. G -

I go

z(TG(x)i( q BQ

(4 25)

( q ) = 0 for all

goE

pG(x)

be open. P a r t a ) is a generalization of theorem 4. 7 a). Blatter, Morris and Wulbert [7J

have proved the following corollary of part b) :

Far a compact space Q the following continuous

Corollary 4. 3 . a r e equivalent. :

lo . Q is connected.

2O. Every pseudo-EebyHev subspace G

pGis

= C (Q), such that

lower continuous. is a Eebvgev subspace.

.R

Every one-dimensional subspace G o L E = CR (Q) ,

3'. G

of E

--

is lower semi-continuous , is a Eeby2ev subspace.

such

F o r some related results s e e also B. Brosowski, K. -H. Hoffmann, E. S c h a e r

and H. Weber

F o r the spaces

[147. 1 E = L ( T , 3 ) , A Lazar, D. E. Wulbert and P. D.

R

' M o r r i s [567 have proved Theorem 4. 24.

F o r an n-dimensional.linear subspace G h

where

(T, 3 ) is -a C-finite positive measure space,

continuous if and only if there do not e x i s t / j € ~ ' \

{o)

yGis

of E = LR1 (T,$)

lower semiand g g G with

the following three properties : o() The set

))(s(P)\

u

S(P)= { t E T it)

I1

p(t)1
0, c

=

GE

such that Ilx+z 11

c (x, z)

> 0 such that

5

IIxll,there

I. Singer It i s natural t o ask, which normed linear spaces E have ty (P). A . L. Brown and

latter- orris

Proposition 4. 10. has property

El71 has -Wulbert

roved

a ) and b) , and Blatter

proper[5J

[7] have proved the other statemeqts of

a ) Every strictly convex normed linear space

b

(P).

b) Every finite-dimensional normed linear space E , in which the -unit cell S i s a polyhedron --E-

(i.e . , the intersection of a finite number

of half-spaces o r , what i s equivalent, the convex hull of a finite num,i-c r of points)has property -

(P).

c ) C (Q compact ) has property R d) co has property (P)

e)

Lf

(P) i f and only i f Q i s finite.

( T , 3 ) i s a F - f i n i t e positive measure space such that T i s not

the union of a finite number of atoms, then

1

LR(T, 3 )does

not have

property (PI 4. 8. Continuous selections and linear selections for set-valued m e t r i c projections. We recall that if G and E a r e metric spaces, a ping

continuous map-

u : E --+ G i s said t o be a continuous selection for a set-valued

mapping

U : E + 2G i f

u(x) EU(x) f o r all x E E . If G is a linear sub-

space of a normed linear space E , one can. define a linear selection for

I* in a similar way. By theorem 4. 1 b) o r c ) , if G is proximinal , every linear selection for

pGi s

continuous.

In o r d e r to characterize the proximinal linear subspaces G of a norrned linear space E, for which

admits a continuous selection o r G a linear selection, we define a set-valued mapping

I. Singer It is easy to s e e that

n;'(o)

VG(x+G) E 2

and closed. Indeed, VG(x+G) f

, i. e . . , i s non-void

since G is proximinal.

Furthermo-

?I (x ) --, x E E , then since ~ - ' ( 0 )i s closed, we have G n G whence x = x -71 (x) E V (x+G), which proves that VG(x+G) G G is closed. Observe that if G is a Eebyiev subspace of E, then VG(x+G)

r e , if

x

-1 n xEXG (0),

-

{ w- '(x))

i s the one-point set

=

( x- T ~ ( x ) ).

where u = w

(0)

( s e e section 4. 2). Also , V is nothing e l s e than the set-valued mapG ping

induced by

E/G-+ 2

I-

pG,where

I is

the identical map-

ping of E onto itself. W e have the following generalization of theorem 4. 2 (which, in

the particular c a s e when E / G .is reflexive, was essentially proved in [84]

, theorem 3 )

:

Theorem 4 22 F o r proximinal linear subspace G of a normed liG E +2 admits n e a r space E , - a continuous selection if and only i f G: 1 E / G --+ 2 ( O ) defined by (4 21) admits a continuousthe --mapping V

9

G:

rG

selection Indeed, this can be proved either similarly t o the above proof of theorem 4. 2, o r using, in the necessity part, a theorem of Bartle and Graves (see

[63])

according to which the mapping

.w

G

-

E / G +2E

'

defined by

always admits a continuous selection w

G

and

PG

then putting

where x ( O ) is a continuous selection for . G F r o m theorem 4. 27 we obtain the following generalization of co-

I. Singer rollary 4. 1, the f i r s t part of which i s due t o J. Lindenstr'auss L57] and the second part to Corollary 4. 5.

[84]

:

Let E be a normed .. -linear space. a ) For ~ u ( E T E ) - Then --

closed (hence - proximina1)linear -- - subspace r o f E*,

: a selection ) +(f

b)

e ~ * l Ip; f

9,admits

a continuous

if and only if the (set-valued) extension map ( P c ( ~)*+

(9 , 11 f

11

=

*

l l ~ l & d m i t s a continuous selection.

that the extension If- G i s a. proximinal linear subspace of E , such = I/(P admits a eonti(P~(G')+{+ =

EE**(QI~I

y,

m

11)

1

nuous selection, then -y% 5dmits oontinuous selection, Let u s observe that one can a l s o obtain relations between the G : E -2 and semi-continuity properties of the set-valued mapping

pG

+

VG : E/G

2 % ~ '(O) ( a s well a s corollar$es of the above type).

The r e s u l t s on lower semi-continuity a r e particularly useful because of the following theorem Michael

( [63];

on continuous selections, due to E. A,.

theorem 3. 2") :

lower --semi-continuous

If -E , G afe Banach spaces, every G U : E -+ 2 such that U(x) is convex for each

if

x E E, admits a continuous selection. Hence, in particular, p r o ~ i m i n a llinear subspace of a Banach space E, such that semi-continuous then

53G admits acontinuous -

G i

-

~

a

i s lower

selection; the converse i s not

true, even if dim G = 1. F r o m this observation and from the of the preceding section there follow sufficient

results

conditions on a given

G in o r d e r that

admit a continuous selection and sufiicient congitions G on E oi o r d e r that for a l l subspaces G of E. admit a se-

pG

continuo?^

lection. Conversely, in some c a s e s the results on non-lower semi-continuity of for

P'G

pGcan

be sharpened t o non-existence of continuous selections

F o r example , A . L a z a r , D. E. Wulben and P. D. Morris have '

proved the following partial sharpening of corollary 4 . 4 r e m 1. 4) :

( [56]

, theo-

I. Singer Theorem 4. 28.. E = L'

R

If

G is any finite -dimensional subspace of

-

(T,3 ) , where (T,V) is a positive measure space having no

atoms, then

PG admits

no continuous selection.

We have the following characterizations of the one-dimensional l i n e a ~ subspace G of E =

-8;

and E = C (Q) f o r which

R

a continuous selection, due to A. Lazar

PG admits

, lemma 5. 2) and

( :55]

respectively A. Lazar, T, E Wulbert and P. D. M o r r i s ( [56]

, proposi-

tion 2. 6): Proposition 4. 11. a ) F o r the one-dimensional linear subspace G

4R

- E= of

spenned by an element

g =

{rn).eGadmits a continuous

selection i f and only if t h e r e do not exist

-of N

=

{

1,2,3,.

. . .)

two disjoint subsets N

1'

N

2

such that

b) F o r the one-dimensional linear subspace (Q compact), spanned by an element

g

G

=

Lg] of E =

CR(Q)

of norm 1,

tinuous selection if and only if

I g(q) 01 .

d)

card F r Zlg) < - 1,

/3 )

q e F r Z(g) implies that there exists a neighborhood of q

where Z(g)

=

{ q eQ

=

on which g is either non-positive o r non-negative. Let us also mention the &llowing recent results of A . L. Brown

( [18]

,theorems 2 . 8 and 3. 10) :

Theorem 4 . 2 9 . a )

If

G i s a

"

Z-subspace" of E

i. e . a closed linear subspaee such that Int Z(g) =

then either there i s no continuous selection for

=

9

CR(Q) (Q compact),

for all g 6 G \

(01,

o r there i s a unique G -

I. Singer one. -b) There exists a 5-dimensional Z-subspace G of E = C ( [-l,+l])

R

which contains the constants, i s non-eebygev and such that

f?G

a unique continuous selection.

imits -

The l a t t e r result (which disproves a claim of A . Lazar, D. E. Wulbert and P. D. Morris :

[56],

theorem 2. 1) shows that in the parti-

cular case when dim G < co ( and hence the implication

1°+200f

i s upper semi-continuous), f?G corollary 4. 3 cannot be sharpened s o a s to

assume only existence of a continuous selection f o r

PG instead

of the

lower semi-continuity of

G' Concerning linear selections f o r

we have ( s e e [82]

, p . 142):

Theorem 4. 30. F o r every proximinal hyperplane G in a normed linear space E,

admits a l i n e a r selection. PG By theorem 4. 1, i f f o r a proximinal linear subspace G of a nor-

PG

admits a linear selection %(0) then X ( 0 ) G ' i s a continuous linear projection of E onto G and hence G is complemed linear space E,

mented. Obviously, the converse is not valid. Finally, l e t u s mention t h a t one can give a characterization of proximinal linear subspaces G for which

PG admits

a linear selection,

genera1izing.theorem 4. 18 in a s i m i l a r way a s we generalized theorem 4. 2 by theorem 4. 27 and one can than prove also a corollary correspon-

ding t o corollary 4. 5. Also, one can define weakly continuous, Lipschitzian and differentiable selections f o r 9 and obtain f o r then- similar exG tensions of the preceding results.

I. Singer

5.

Best approximation by elements of non-linear s e t s

5. 1. Best approximation by .elements of convex sets. By a non-linear s e t i n a normed linear space E we mean any s e t G C E which i s not a form x+G

0

linear manifold ", i . e. which i s not of the

,where x e E and where G

0

i s a linear subspace of E. Since

best approximation by elements of linear

manifolds can be reduced,

by a simple translation , t o best approximation by elements of linear subspaces, we shall not consider here this problem, but r e f e r the r e a d e r to [82]

,p p . 135-140 and 242-246. We want to present here,

briefly, some directions of r e s e a r c h on best approximation by elements of non-linearsets. Note that the existing r e s u l t s in this field do not yet constitute a unified theory ( a s is ,the theory of best approximation by elements of linear subspaces) and the construction of such a theory in general normed linear spaces i s only at its beginning. The f i r s t natural step when passing from best approximation in normed l i n e a r spaces E by elements of linear s e t s G C E to non-linear s e t s is t o take a s G a convex s e t in E. The followi.ng extension of $1, theorem 1. 1, to this case has been given, f o r r e a l s c a l a r s , by G. Rubinstein [75]

and Ch. Roumieu

( [747 ,proposition 5) and for com-

plex s c a l a r s in r82], pp. 360- 361 and [22] Theorem 5. 1. and -

1 4

x EE

5.

, [37]

(independently) :

L A G be a convex s e t in a normed linear space E ,

\ 5,

%E G . We have g0 E

PG

(x) if and only if

e r i k t s an f EE* with the following properties :

there

I. Singer

This theorem admits the following geometric interpretation, observed by V. N. Burov (see [82]

, p . 362):

g0g c ( x ) if and only i f

there exists a r e a l hyperplane H which separates G f r o m S(x, Clearly , such a hyperplane H must pass through g cell S(x,

11 x-g 0 1)

convex cone,

0

11 x-g,ll).

and support the

). The particular c a s e of theorem 5. 1, when G i s a

was also considered by G.

S. Rubinstein

( s e e [75],

pp. 362-

363) ; another characterization theorem f o r best approximation by elements of convex cones has been given by G. Godini [35]. F o r bite-dimensional convex s e t s F . R. Deutsch and P. H. Maserick [22]

and , independently, S. Ia. Havinson [37J

lowing extension of

have proved the fol-

51, theorem 1 . 6 :

Theorem 5.2. L A G be an n-dimensional convex s e t i n a normed linear space E and let x E E \E, only if t h e r e exist

g E G. We have g o E p G (x) if and 0

h extremal points f l , . . . ,fh

5 h 5 2n+l if X1 . . . . . . A h > O ~ $

- n+l i f the s c a l a r s a r e r e a l and
s a 0-

I.. Singer (i. e. , -

Moreover

got

(5t18)

&&(go )

. in this case we have

PG(').

B) Some problems on existence of elements of, best approxi-

mation by elements of closed sets. One of the problems studied recently is that of finding the Banach spaces E with the property that for every closed set G c

is dense i n E

.

S. B. Stezkin and M. Edelstein [26]

C

E the s e t

have proved that

eve-

r y uniformly convex Banach space E has this property. This result has been slightly extended by D. E. Wulbert [96],

who has proved that every

Banach space E "with property (2R) " also has the above property. We recall ( s e e e . g. [21])

that a Banach space E is said to have property

(2R) if every sequence {x,)

C

E such that

lim bn+xml( = 2 is a

n;m+co

Cauchy sequence (and hence convergent); clearly, every space with property (2R) i s uniformly convex, but ~ n econverse is not true. S a c z ever y uniformly convex space (and hence every space with property (2R) i s strictly convex, it is natural to ask whether there exist non-strictly the above property (i.e. , s u c l ~that for every clo-

convex spaces.Ewith sed s e t G

C

E the set D (TCG) is dense in E.

D. E. Wulbert [96] has

given an affirmative answer, by proving that every uniformly smooth Banach space E

with property

(H) (see $4, section 4. 2) also has the

above property. We recall (see e. g. [2g) that E is smooth i f for every tion L961

x

-

1

f

called uniformly

7

> 0 there exists an & = E ( q ) such that the relaimplies 11 x 11 + 11 y 11 5 (I+ 7 ) 11 x + 11~; D. E. Wulbert

has shown that there exist uniformly smooth spaces with proper-

ty (H) which a r e not strictly convex.

I. Singer By the r e m a r k made at the end of 5 2 (on v e r y non-proximinal subspaces) a Banach space E with the above property must be reflexive. D. E: Wulbert

96

has raised the problem whether the converse is

true,. i. e. : Problem . 5 . 1 Does there exists a reflexive Banach space E containing a closed set G such that

D(?tG) is not dense in E ?

Some other problems related t o existence of elements of best approximation a r e concerned with v e r y non-proximinal s e t s (see definition 2. 2).

M. Edelstein [27]

has proved that in a separable co-

njugate space E* no closed bounded s e t He has also shown

[27]

$ 2,

r is

very non-proximinal.

that in the - separable space E = c

0

(which i s

not i s isomorphic t o any conjugate Banach space) t h e r e do exist bounded very non-proximinal sets.

V. Klee

( s e e [82]

terization of the c l a s s e s

, p. 371) h a s considered the problem of characN. (i = 0 , 1 , 2 , 3 , 4 ) of a l l normed linear spa1

ces E which contain a very non-proximinal set G having respectively the following properties : (0) no additional property ; (1) G is convex; (2) G is bounded and convex; ( 3 )

E \ G is convex; (4) I;: \ G is bounded

V. Klee has made the following r e m a r k s : a ) Nl is the

and convex.

class of a l l non-reflexive spaces ; b)

N3 3 N1 ;

C)

no Banach space

but N2 f 9 ; d) N4 f 9 ; e ) it i s possible that N (whence 4 2 ' also Ng , N ) coincides with the class of a l l normed linear. spaces.

is in N

0

C) Some problems on uniqueness of elements of best approximation. F o r an a r b i t r a r y set G i n a normed linear space E , S. B. Stezkin (see [82]

,p' 375 ) has studied the s e t

i. e. the s e t of a l l elements x € E

which have at most one element of

I. Singer best approximation i n G, and has obtained, among other results, the following 'tconstructive characterization" of strictly convex spaces : Theorem 5 . 6. A Banach space E has the property that for every set -

G C E the set -.-

U

G

is dense in E

i f and only i f

E

i s strictly

convex.

if E

Furthermore, S. E. stezkin has proved that

is a strictly con-

vex Banach space, then for every boundedly compact set G C E the set is of the second category in E. However , i t is not known whether G this also holds for every s e t G C E ;it is also unknown whether from

U

the fact that ror every compact G C E the set U

E o r of the s m n n d category in E it follows

G

is either dense in

that E is strictly convex.

Finally, we mention the following results of Stezkin : If - E is a locally E the set U i s of G the second category and if E 'is a uniformly convex Banach space, then uniformly convex Banach space, then for every G

f o r e v e r y closed set G

iz E

the set

category. However, it is not known

D(TCG)

C

n UG is of the second

whether the second result remains

valid also for locally uniformly convex spaces. We conclude -this section with a famous classical problem, namely, the problem of convexity of ?ebygev sets. We have seen in section 5. 1 that a Banach space E has the property that every closed convex s e t G C E is a ?ebygev s e t i f and only if

E is reflex-ive and strictly con-

vex. It is natural to ask whet a r e the Banach spaces E in which the conv e r s e property holds, i. e. in which every Cebygev s e t G C E is convex. This problem has been solved only for 3-dimensional spaces E(see [82], p. 364), namely, E has this property i f and only if every exposed point

-

of S (see $ 3 , section 3. 2) admits a unique maximal functional of norm 1 . E F o r Banach spaces E of finite dimension m -> 4 it is only known .that the smoothness of E is a sufficient but not necessary condition f o r the convexity of all ?ebygev s e t s G C E.

F o r infinite-dimensional Banach

I. Singer spaces E the problem i s considerably more difficult, even the anBwer to the following problem being Problem 5. 2

unknown :

In a Hilbert space

%,

i s every CebyEev s e t necessa-

rily convex ? V. Klee has conjectured that the answer is negative and has proved

(see [82], p. 370) that in every infinite-dimensional Hilbert space%t= exist non-convex closed semi-cebygev s e t s . On the other hand, much work has been done towards a positive answer. L. P. Vlasov has observed (see [82]

,.p. 366) that in a smooth normed linear space E every

- ( s e e section 5. 3) i s convex (the con?ebyEev s e t G which i s an &-sun v e r s e is immediate) and thus the problem reduces to prove that every 8ebygev s e t is an d -sun. With an ingenious application of SchaiLder1s fixed point theorem, L. P . Vlasov has proved (see [82],p.

365) that

in

an a r b i t r a r y Banach space E E r y boundedly compact <ebygev set G i s an o(-sun and hence, if E i s smooth, G is convex. The assumption of boundedly compactness of G i n this result was weakened by N. V. Efimov and S. B. ~ t e z k i nand others ( s e e [82]

,pp.

368-369), under additional restrictions on the space E (e, g. uniform convekity). An important step in this di'rection was the idea of V. Klee of imposing continuity conditions on the m e t r i c projection 7C onto G rather G than imposing conditions directly on the 6ebygev s e t G; i n this way, for all classes of 6eby8ev s e t s G f o r which 7C has the required continuity G properties, it follows that the s e t s G in those classes a r e convex! L. P. Vlasov [93]

has proved.

If E is a Banach space such that the conjugate space Theorem 5. 7. -

E * is strictly convex

(in particular, i f E is a smooth Banach space),

then every 8ebyEev s e t G C E with continuous m e t r i c projection % convex.

G

5

I. Singer F o r Hilbert spaces E. Asplund [21) has shown that it is sufficient here to assume that W is continuous from the norm tqpology to the G weak topology. Also, E. Asplund [2] has proved.

:

- G i s a <ebygev set in a Hilbert space %such that Theorem 5. 8. If every closed half-space intersects G in a proximinal set, then G

2

convex. These two theorems contain as particular cases the previously known results, since e. g, every boundedly compact Eeby;ev

set G sa-

tisfies the above hypotheses. Let us note that the arguments of E. Asplund [2]

lean heavily on the tools of the theory of convex functions;

some other uses of the theory of convex functions to problems of best approximation have been mentioned in section 5. 1. F o r the continuity of metric projections onto EebysVev sets

(see also D. E.~u1ber.t [943.).

We mention that the above problems can be generalized in several ways, e. g. some of the a b w e results remain valid i f we replace the aasirmption that G i s a Eebygev set by the weaker assumption that G i s a proximinal set such that for every x E E the set

PG(r) is

convex.

F o r these problems and for other related results we refer the reader to [84, pp. 364-371 , [83]

-

[W].

and to the recent papers of L. P. Vlasov [88]-

Finally, for some results and problems on best approximation

in metric (not necessarily normed linear) spaces we refer to [82], 377-391.

pp.

I. Singer References N. I. Ahiezer, E. Asplund,

Lectures on t h e theory- of approximation. Second edition, Moscow (1965) [ ~ u s s i a d . Cebygev s e t s i n Hilbert space. Trans. Amer. Math. . SOC. 144 (1969), 236-240.

D. A. Ault, F. R. Deutsch, PD. M o r r i s and J. E. Olson, Interpolating subspaces i n approximation theory. J. Approx. Theory 3 (1970), 164-182.

J. Blatter,

Zur Stetigkeit von mengenwertigen metrischen Projkktionen. Schriften JesRheimsch-Westf. Inst. fiir Instrum. Math. Univ.. ~ l o n n , , ~ k r . & , ; , .16[196,7), ~o. 1'72.38.

J. Blatter,

Approximation und Selection. Habilitationsschri'ft, Bonn (1969).

J. Blatter, P. D. M o r r i s and D. E. Wulbert, Continuity of the set-valued

metric projection. Math. Ann. 178 (1968), 12-24.

0. Brandt,

Geometrische Approximations theorie in normierten Vektorrgumen. Schriften des Rheinisch-Westf. Inst. f g r Instrum. Math. Univ. Bonn. s e r . A, No. 18 (1968). 1-36.

B. Brosowski,

Nicht-lineare Tschebyscheff-Approximation. Bibliogr. Inst. Hochschulskripten Bd. 808/808a, Mannheim (1968).

B. Brosowski,

Einige Bemerkungen zum verallgemeinerten Kolmogoroffschen Kriterium. Funktionalanalytische Methaden d e r numerischen Mathematik. ISNM 12, Birkhzuser Verlag (1969), 25.-34.

B. Brosowski,

Nichtlineare Approximation i'n normierten Vektorrgumen. Abstract spaces and approximation. ISNM 10, i a i r k h ~ u s e r - V e r l a g(1969), 140- 159.

B. Brosowski,

FixpunktsHtze in d e r Approximations theorie. Mathematica II(34) (1969), 195-220.

B. Brosowski, K. -H. Hoffmann, E. ~ c h z f e rund H. Weber, Stetigkeitssstze fiir Metrische Projektionen. Iterationsverfahren. Numerische Mathematik. Approximations-theorie. ISNM 15, Birkhzuser-Verlag (1970), 11-17.

I. Singer [14]

B. Brosowski, K. -H. Hoffmann, E. Sthzfer und H. Weber, Metrische Projektionen auf lineare ~ e i l r z u m evon Co [Q, H]. If&rationsverfahren. Numerische Mathematik. Approximationstheorie. ISNM 15, irksu user- Ver lag (1970), 19-27.

1151

B. Brosowski und R. Wegmann, Charakterisierung bester Approximationen i n normierten v e k t o r r ~ u m e nJ. . Approx. Theory 3 (1970), 369-397.

[l6]

F . Browder,

Multivalued monotone nonlinear mappings and duality mappings i n Banach spaces. Trans. Amer. Math. Soc. 118(1965),338-351.

[IT]

A. L. Brown,

Best n-dimensional approximation to s e t s of fuctions. Proc. London Math. Soc. 14(1964), 577-594.

h18]

A. L. Brown,

On continuous selections f o r metric projections in spaces of continuous functions (to appear).

[191

E. W. Cheney,

Introduction to approximation theory. Mc Graw Hill, New York (1966).

E207

E. W. Cheney and D. E. Wulbert, Existence and unicity of best approximations. Math. Scand. 24(1969), 113- 140.

[21]

M. M. Day,

[22]

F. R. Deutsch and P. H. Maserick. Applications of the Hann-Banach theorem in approximation theory. SIAM Rev. 9 (1967), 516-530.

[23]

F. R. Deutseh and J. Lambert, A bibliography on metric projections (mimeographed).

[24]

A. I. Dubovitskii and A. A. Miliutin, E x t r e m ~ mproblems i n the p r e sence of restrictions. Z. Vycisl. Mat. i Mat. Fiz.

Normed linear spaces. Springer-Verlag, BerlinGattingen-Heidelberg (1962).

5(1965), 395-453 [ ~ u s s i a n ] [25]'

.

N. Dunford and J. Schwartz, Linear operators. P a r t . I :General theory. Interscience Publ. ,New York (1958).

~ 2 6 j M. Edelstein,

~ e a r e s tpoints of s e t s in uniformly convex Banach spaces. J. London Math. Soc. 43(1968), 375- 377.

[27I

A note on nearest points. Quarterly J. Math. 21 (1970), 403-406.

M. Edelstein,

I. Singer L28]

N. V. Efimov and S. B. ~ t e k n Some , properties of EebysVev s e t s . Doklady Akad. Nauk SSSR 18 (1958), 17- 19[Russian].

E29]

G. Ewald,D. G. L a r m a n and C. A. Rogers, The directions of the line segments and of the n-dimensional balls on t h e ~ space boundary of a convex body i r Euclidean (to appear).

[30]

Ky F a n and I. Glicksberg, gome geometric p r o p e r t i e s of the s p h e r e s in. a normed linear space. Duke Math. J. 52(1958), 553-568.

[31]

A. L. Garkavi,

The theory of best approximation i n normed lin e a r spaces. Mathematical analysis 1967, MOSCOW (1969), 75-132 [Russian].

[32]

A. L. Garkavi,

The problem of Helly and best approximation i n the space of continuous functions. Izevestija Akad. Nauk SSSR 31 (1967), 641-656 [ ~ u s s i a n ] .

[33]

A. L. Garkavi,

Compact admitting Eebygev s y s t e m s of m e a s u r e s . Matem. Sbornik 74 (116) (1967), 209-217 [ ~ u s s i a n ] .

[34]

A. L. Garkavi.

Characterization of Eebygev subspaces of finite codimension L1. Matem. Zametki 7(1970), 155- 163 (1970) C ~ u s s i a n ].

[35]

G. Godini,

Best approximation i n normed l i n e a r spaces by elements of convex cones. Studfi si cekcet. mat. 21(1969), 931-936 (1969) C ~ o m a n i a n ] .

[36]

A. Haar,

Die Minkowskische Geometrie und die ~ n n s h e r u n g an stetige FunMionen. Math. Ann. 78(1918).294- 3 11.

[37]

S. Ia. Havinson,

On approximation by elements of convex s e t s . Doklady Akad. Nauk SSSR 172(1967), 294-297 [Russian].

[38]

R. B. Holmes,

On the continuity of best approximatior? operators. Symp. on infinite dimensionhl topology. 'princeton University P r e s s '(to appear).

[39]

R. B. Holmes and B.R. Kripke, Smoothness of approximation. Michigan Math. J. 15(1968), 225-248.

1401

Y. Ikebe,

A characterizarion of H a a r dubspaces i n C P r o c . Japan Acad. 44(1968), 219-220.

[41]

Y. Ikebe.,

A characterization of best Tchebycheff approximations i n fl. nction spaces. P r o c . Japan Acad. 44(1968) 485-488.

Lb, b].

I. Singer [42]

A . D. Ioffe and V. M. Tihomirov, Duality of convex functions and

C431

R. C. J a m e s ,

Characterizations of reflexivity. Studia Math. 23(1964), 205-216.

[44I

M. I. Kadec,

Topological equivalence of uniformly convex spaces. Uspehi Mat. Nauk. 10, 4(66) (1955), 137141 [~ussian].

[45]

V. m e e ,

The support property of a convex set i n a l i n e a r normed space. Duke Math. J. 15 (1948), 767-772.

[461

V. Klee,

R e s e a r c h problem no. 5, Bull. Amer. Math. Soc. 63 (1957), 419.

[47]

V. Klee,

Convexity of Chebyshev s e t s , Math. Ann. 142 (1961), 292-304.

[48]

V. KZee,

R e m a r k s on nearest points i n normed l i n e a r s p a c e s . P r o c . Coll. on convexity (Copenhagen, 1965), Univ. of Copenhagen (1966), 168- 176.

[49]

A. N. Kolmogorov, A r e m a r k on the polynomials 01 P. L. eeby&ev deviating t h e l e a s t f r o m a given function. Uspehi Mat. Nauk 3,1(23) (1948), 216-221 [ ~ u s s i a n ]

e x t r e m a l problems. Uspehi Mat. Nauk 23, 6(144) (1968), 51-116 [ ~ u s s i a n ] .

.

[50I

C. A. Kottman and Bor-Luh Lin, On the weak continuity of m e t r i c projections. Michigan Math. J. 17(1970), 401-404.

[51]

B R. Kripke and T. J. Rivlin, Approximation i n the m e t r i c of L ~ ( X P ) T r a n s . A.mer. Math. Soc. 115 (1965), 101- 122.

[52]

P. J. Laurent,

T h e o r e m s de c a r a c t e r i s a t i o n e n approximation convexe. Mathernatica 10 (33) (1968), 95- 111.

[53]

P. J. Laurent,

Conditions n e c e s s a i r e s pour une meilleure appromation non lineaire dans un espace norme. Compte rendus Acad. Sci. ( P a r i s ) Ser. A-B 269(1969), A 245-A 248.

154)

P. J. Laurent and Pham-Dinh-Tuan, Global approximation of a compact s e t by elements of a convex s e t i n a normed space. Numer. Math. 15 (1970), 137- 150.

[551

A. J. L a z a r ,

Spaces of affine continuous functions on simplexes. T r a n s . a m e r . Math. Soc. 134(1968,), 503-525.

I. Singer A. J. Lazar, D. E. Wulbert and P. D. Morris, Continuous selections for metric projections: J. Functional Anal. 3(1969), 193-216.

J. Lindenstrauss,

Extension of compact operators. Memolsa. . Amer. Math. Soc. 48 (1964).

J, Lindenstrauss,

On nonlinear projections i n Banach spaces. Michigan Math. J. 11(1964), 263-287.

J. Lindenstrauss,

On nonseparable reflexive Banach spaces. Bull. A.mer. Math. Soc. 72(1966), 967-970.

J. Lindenstrauss and L. Tzafriri, On the complemented subspace problem (to appear).

G. G . Lorentz,

Approximation of functions. Holt, Rinehart and Winston, New York (1966).

G. Meinardus,

Approximation of functions, theory and numerical methods. Springer-Verlag, Berlin-Heidelberg New J o r k (1967).

E. A. Michael,

Continuous selections. I. Ann. Math. 63(1956), 361-382.

J. J. Moreau,

Fonctionnelles convexes. Seminaire s u r l e s Bquaions aux derivees partielles. College de France, P a r i s (1966-67).

J. J. Moreau,

Distance un convexe dlun espace norme et caracterisation d e s points proximaux. S6minaire dlanalyse milaterale. Fac. Sci. Montpellier, 2(1969), expose no 6.

P. D. M o r r i s ,

Metric projections onto subspaces of finite codi mension. Duke Math. J. 35 (1968), 799- 808.

E. V. Ogman,

Continuity of metric projections and some geomet r i c properties of the unit sphere in a Banach s ace, Doklady Akad. Nauk SSSR (1969), 34- 36. {Russian].

R. R. Phelps,

Convex s e t s and nearest points. Proc. Amer. Math. SOC.8(1957), 790-797.

W. Pollul,

Reflexivitgt und ~ x i s t e n z - ~ e i l r g u mi ne der linearen Approximationstheorie. Schriften d e r Gas. fcr Math. und Datenverarbeitung, Bonn (to appear)

I. Singer C701

J. R. Rice,

The approximation of fbnctions. Vol. 1;Linear theory. Vol. 1I:Non-linear and multivariate theory. Addison-Wesley, Reading, Mass. -London-Don Mills,Qnt. (1964 and 1969).

[71]

J. R. Rice,

Nonlinear approximation. Approximation of functions (Ed. by H. L. Garabedian). Elsevier. AmsterdamLondon-New York (1965), 111- 133.

[7i]

J. R. Rice,

Non-linear approximation. 11. Curvature in Minkowski geometry and local uniquenness. Trans. ~'mer.,Math.Soc. 128(1967), 437-459.

[73]

W. W. Rogosinski,

Continuous linear functiohals on subspaces of x P a n d 'g. Proc. London Math. Soc. 6,22(1956), 175-190.

[74]

Ch. Roumieu,

Sur quelques problkmes d'approximation. S6mi.n. math. fac. sci. Montpellier -(1966).

[75]

G. 5. Rubinstein,

On gn extremal problem i n a linear normed space. Sibirak. Mat. 2. 6(1965), 71 1-714 [ ~ u s s i a n ]

C76]

1. Singer,

Properties of the surface of the unit cell and applications to the solution of the problem of uniqueness of the polynomial of best approximation i n a r b i t r a r y Banach spaces. Studil si cercet. mat. 7(1956), 95-145 [ ~ o m a n i a n ]

[77]

I. Singer,

Caractdrisation des elements de meilleure approximation dans un espace de Banach quelconque. Acta Sci. Math. 17(1956), 181- 189

C78]

I. Singer,

[I

On best approximation of continuous functions. Math. Ann. 140 (1960), 165- 168.

I: Singer,

On best approximation of continuous functions. 11. Rev. math. pures et Appl. 6(1661), 507-511.

[80]

I. Singer,

Some r e m a r k s on approximative compactness. Rev. rourri. math. pures et appl. 9(1964), 167-177.

C81]

I. Singer,

On the extension of continuous linear functionals .and best approximation in normed linear s.~~c€!s. Math. Ann. 159(1965), 344-355.

[Is21

I. Singer,

Best approximation i n normed linear & p a c e sby elements of linear subspaces. Publ. House Acad.

I. Singer Soc. Rep. Romania, Bucharest (1967) [~omanian]. English translation : Publ. House Acad. Soc. Rep. Romania, Bucharest and Springer-Verlag, BerlinHeidelberg-New York (1970). C83I

I. Singer,

Some open problems on best approximation in normed linear spaces. SCminaire Choquet, 6'annee. Universite de P a r i s :1966/67), expose m. 1 2 .

[a47

I. Singer,

On metric projections onto linear subspaces of normed linear spaces. Proc. Confer. on rrProjections and related topicsrrheld i n Clemson. Aug. 1967 Preliminary Edition (January 1968).

[85]

I.Singer,

Remark on a paper of Y. Ikebe. P r o c . Amer. Math. SOC. 21 (1969), 24-26.

[86]

I. Singer,

On normed linear spaces which a r e proximinal in every superspace. J. Approx. Theory, (to appear).

[87]

K. Tatarkiewicz,

Une theorie generalisee de l a meilleure approximation. Ann. Univ. Mariae Curie-Sklodowska 6 (1952), 31-46.

[88]

L. P. Vlasov,

On ?ebysev s e t s . Doklady Akad. Nauk SSSR 173 (1967), 491-494 [~ussian].

[89]

L. P . Vlasov,

Approximatively convex s e t s in uniformly smooth spaces. Mat. t a m e t k i 1(1967),443-449 [Ftussian] .

[go]

L. P. Vlasov,

On 6ebys'ev and approximatively convex eets. Mat. Zametki 2(1967), 191-200 [ ~ u s s i a n ] .

[91]

L. P. vsasov,

cebygev s e t s and some generalizations of them Mat. Zametki 3(1968), 59-69 [~ussian].

[92]

L. P. Vlasov,

Approximative properties of s e t s . i n Banach spaces. Mat. Zametki 7 (1970), 593-604 [ ~ u s s i a n ] .

1937

L. P. Vlasov,

Almost convex and z e b gelr s e t s . Mat. Zvmetki 8 (1970) 545-550 [Russianj.

643

D. E. Wulbert,

Continuity of m e t r i c projections. Trans. Amer. Math. SOC.134 (1968), 335-343.

[957

D. E. Wulbert,

Convergence of operators and Korovkin's theorem. J. Approx. Theory 1(1968), 8- 18.

[96]

D, E. Wulbert,

Differential theory for non-linear approximation. (preprint ) .,

I. Singer [97]

D. E. Wulbert,

Uniqueness and differential characterization of approximations from manifolds of functions. Amer. J. Math. (to appear).

[98]

D. E. Wulbert,

Nonlinear approximation with tangential characterization (to appear).

[99J

S. I. Zuhovitskii,

On minimal extensions of linear functionals in the space of continuous functions. Izvestija Akad. Nauk SSSR 21(1957), 409-422 [ ~ u s s i a n ] .

CENTRO INTERNAZIONALE MATEMATICO ESTIVO l(C. I. M. E. 1

STRANG

f3.

AND

G.

FIX

A FOURIER ANALYSIS O F T H E FINITE E L E M E N T VARIATIONAL METHOD

Corso tenuto ad Erice

dal

2 7 giugno a1 7 luglio

1971

These lectures were prepared f o r a CIME Advanced Summer Institute held in 1971. ?h ' e 'first author has been supported by the Office of Naval Research and the National Science Foundation (GP - 13778), and the second by AEC Contract 7158

-

2

.

9.Apologia .This paper has been taken from a preliminary d r a f t of our book "An Analysis of t h e F i n i t e Element Method", t o be published by Prentice-Hall about the end of 1972.

I n t h i s f i r s t d r a f t we developed the theory of

f i n i t e elements on a regular mesh, w i t h Fourier -analysis a s the principal tool, and we were able t o discuss the connections w i t h f i n i t e differerne equations and t o include a p a r t of the theory of splines.

lhis framework

we now c a l l the "abstract f i n i t e element method".

In our book, the emphasis will be s h i f t e d to the "nodal f i n i t e element methodn a s developed by s t r u c t u r a l engineers, i n which i r r e g u l a r elements a r e more the r u l e than the exception.

In t h i s case splines a r e muoh l e s s

convenient, and Fourier analysis i s impossible. remain valid

- we r e f e r t o a forthcoming paper of

However t h e basic theorems the f i r s t author i n

Numerische Mathematik on "Approximations in t h e F i n i t e Element Method". Furthermore it becomes

possible to examine t h e e r r o r s due to the presence

of curved boundaries and inhomogeneous boundary data. Ve hope that the book w i l l give a reasonably complete and r e a l i s t i c treatment of t h e e s s e n t i a l f i n i t e element theory f o r l i n e a r problems, and t h a t the reader w i l l accept the present paper a s an interim report.

821

G . Strang-G. Fix

in wj t u r n out t o

The c r u c i a l i d e a i n t h i s synthesis i s t o choose t h e b a s i s such a way t h a t t h e variati.ona1 eauations f o r t h e . v j be difference eouations.

The customary description of t h e r e s u l t i n g

method i s i n v a r i a t i o n a l terms, and we s h a l l introduce i t i n t h i s way below.

To think- of it a t t h e same time a s a difference scheme

w i l l require a c e r t a i n tolerance on t h e reader's p a r t , since a t f i r s t s i g h t t h e f i n i t e element method seems t o depart a t several

points from conventional difference equations.

I n f a c t it i s J u s t

these p o i n t s which represent f o r u s t h e main contrjbutions of the\ method t o f i n i t e difference theory; they a r e innovations which could have been devised independently, -

but never were.

Our goal w i l l be t o decide when t h e f i n i t e element method i s convergent and numerically s t a b l e , and t o estimate t h e e r r o r ,

Al-

though ne do not discuss a t t h i s point t h e i r r e g u l a r meshes and general boundary conditions which a r e met i n applications, we have t r i e d t o r e t a i n t h e mathematical e s s e n t i a l s of t h e method; t h e s e we .study i n some generality.

Of course t h e ultimate question i s whether

t h e f i n i t e element method i s more e f f e c t i v e than i t s competitors, namely those techniques which l i e ou$side t h e above i n t e r s e c t i o n . The evidence sugGests t h a t although difference scliemes can be constructed which require fewer operations f o r a given order of accuracy, nevertheless t h e v a r i a t i o n a l approach has an important coherence which derives from t h e f a c t t h a t , once t h e b a s i s i s chosen, t h e r e s t i s l a r g e l y automatic.

'Pj

This coherence seems

t o be r c f l e c t c d i n a more regular behavior both of t h e e r r o r and of t h e user, who has othcrv:ise t o make a separate choice of f i n i t e difference replacement f o r each term i n t h e d i f f e r e n t i a l equation

G. Strang-G. Fix

Thc evidence i n t h i s comparison i s s t i l l

and boundary cqnditions.

very l i m i t e d , however, and we s h a l l t r y t o remain n e u t r a l . The nenie we have adopted was o r i g i n a l l y chosen by engineers [ I ] , who decompose a continuous s t r u c t u r e , f o r numerical purposes, i n t o a s e t of " f i n i t e elements". mathematics i s l e s s c l e a r .

The h i s t o r y of t h e underlying

Both Courant [2] and p61ya [3] commented

on t h e merits, i n c e r t a i n v a r i a t i o n a l problems, of seeking ap-. proximate s o l u t i o n s which a r e l i n e a r within each ( t r i a n g u l a r ) element; accuracy i s . improved by i n c r e a s i n g t h e number of elements r a t h e r than t h e .complexity of t h e approximating functions. t h i s t r i a l space, t h e Laplace operator a c t i n g on

u

With

induces i t s

f a m i l i a r 5-point d i f f e r e n c e analogue, a c t i n g on t h e c o e f f i c i e n t vector

v

.

Such t r i a l f u n c t i o n s t h e r e f o r e make t h e Ritz method

esgeclally s i ~ p l et o execute, and it seems very l i k e l y t h a t t h i s

idea was proposed even e a r l i e r . The development of t h e method has l e d n a t u r a l l y from p i c e c u i s e l i n e a r functions t o s p i i n e s and o t h e r piecewise polynomials of f i x e d dcgree

p ; each i n c r e a s e i n

and t o t h e complexity of t h e method.

p

adds both t o t h e accuracy A s usual, t h e e x t r a accuracy

i s i n i t i a l l y .worth t h e p r i c e ; b u t j u s t a s Newton1 s method i s more

popular than i t s higher-order analogues, questions of convenience soon become paramount, cubic approximants point.

I n a p p l i c a t i o n s t o second-order equations,

(p = 3 ) . a r e apparently c l o s e t o t h e t u r n i n g

Thc e s s e n t i a l f e a t u r e s of t h e method a r e t h e subdivision

of t h e region i n t o f i n i t e elements, and t h e choice of a so-called local b a s i s f o r t h e space of approximating f u n c t i o n s b a s i s con~posedof functions which vanish over a l l b u t

- that tL

is, a

few elements.

G. Strang-G. Fix

We analyze i n t h i s paper t h e case of subdivision by a regular mesh, of width

h-30

, with

following systematic way.

We s t a r t with a fixed s e t of t r i a l

...,uN(~) ; N

functions

vl(~),

f o r each meshpoint. functions variable

a l o c a l b a s i s constructed i n t h e

mi

w i l l be t h e number of b a s i s functions

To ensure a l o c a l basis, we i n s i s t thak t h e s e

vanish f o r l a r g e

x by t h e mesh width

1x1 h

.

, and

r e s c a l e t h e independent

The eventual Rayleigh-Ritz

equations w i l l take t h e form of difference equations i f t h e b a s i s h functions vi, associated with each meshpoint ( jlh,. ,jnh)

..

a r e simply t r a n s l a t e s of these rescaled functions b a s i s i s thus composed, f o r each

h

, of

cDi(~/h)

.

The

t h e functions

Of course t h e r e have t o be modifications a t boundaries. I n t h e piecevrise l i n e a r case t h e r e i s a s i n g l e parameter f o r each meshpoint, namely t h e value of t h e function a t t h a t point; thus of

N =l

v1

.

The graph

i s a pyramid with vertex a t t h e o r i g i n and with base

formed from t h e neighboring-triangular elements.

G . Strang- G . Fix

It should be c l e a r t h a t t h i s

and 3.ts t r a n s l a t e s span t h e space

cpl

of a l l continuous functions i n t h e plane which a r e l i n e a r within each element.

I n some exemples our description (1.1) of t h e b a s i s

w i l l seem l e s s transparent than a description of t h e space i t s e l f ,

i n terms of t h e functions admitted within each element and t h e compatibility conditions across element boundaries.

Nevertheless,

both f o r p r a c t i c a l computations and f o r t h e general theory, t h e d e f i n i t i o n of the b a s i s i s crucial; it i s from combinations of these functions

oi,

t h a t t h e Rayleigh-Ritz-Gal-erkin p r i n c i p l e w i l l s e l e c t an approximation uh The fundamental questions f o r a numerical analyst a r e those of convergence and s t a b i l i t y : i) k a c c u r e t e i s t h e apnroxixftc s o l u t i o n

uh

, and

how well conditioned a r e the eauations from vrhich

ii)

uh

i s determined numcrica.lly?

The answers can only come from t h e connections between t h e given d i f f e r e n t i a l problem, t h e t r i a l functions

, and

the

norms i n which accuracy and condition number a r e measured.

We

tpl,...,qN

want t o study these questions f o r e l l i p t i c o ~ e r a t o r sof a r b i t r ~ r y order

2m

, for

with t h e spaces eh = uh

of order

-u

q u i t e general

xS

and

I

I I ~

, and

f o r t n e norms associated

These norms measure t h e e r r o r

and i t s d e r i v a t i v e s

In1 = xuj ( s

respectively:

.

qi

i n t h e mean-square and po5.ntwise senses

In Sobolevts notation the space 9fS is written :W

.

Our arguments make constant use of the Fourier"transfo?m, which operates at full strength only on problems which are either periodic, or defined on the whole of Euclidean space R"

.

We

are convinced that (as in the theory of elliptic differential operators) the investigation of these special problems is fundamental to the understanding of more general boundary conditions. Thus we regard the present work as a necessary first step in analyzing the wide variety of problems, with irregular meshes and boundaries, vihich are actually being solved.

Fortunately,

most of the second step in the analysis is already complete; J.-P. Aubin, follotring the work of ~ 6 and a others in the French

school, has successfully analyzed the solution of boundary problems by means of

splines

bution is to determine extend.

.

all trial

In his terms, our contrifunctions to which his theory can

(He mention, in addition to his forthcoming manuscript,

the reference [ 4 ] .)

The third step is the study of more general

meshes, particularly those formed by an arbitrary triangulation of the region. This is a major point in our forthcoming book.

G . Strang-G. Fix

For problems on t h e whole space runs over t h e s e t

2"

Rn

, the

of a l l m u l t i - i n t e g e r s

index (jl,'.

j

i n (1.1)

..,jn) .

We

adopt t h e d e f i n i t i o n

f o r t h e Fourier transform, where xlcl

+

... + xntn..

= (,...,E~)

and

xg

denotes

AS a f i r s t a p p l i c a t i o n of t h e Fourier transform,

P a r s e v a l t s formula can be used t o replace (1.2) by t h e equivalent and more convenient norm

We f i r s t describe t h e app1ication:of t h e Ritz-Galerkin method

.

Using t h e con-

a b s t r a c t l y , t o e l i n e a r e l l i p t i c problem on

R*

ventional i n n e r product

,.t h e problem begins

( f , g) = /f (x)E(x) dx

with a b i l i n e a r form

I f the coefficients

ever u

and

w

a r e bounded, t h e form

l i e i n t h e space

7j"

,

a

i s defined when-

t h a t is, whenever a l l

p a r t i a l d e r i v a t i v e s of order not exceeding m The fgrm i s c a l l e d ? ? - e l l i p t i c provided t h a t

.

lie in L ~ ( R ~ )

G.Strang-G. F i x

The most familiar example is the form associated with the Laplace equation, a(ualv) = $ au ai;

+

au a? ... + axn ax, dx .

As it stands this is not ??'-elliptic, corresponding to the fact that Laplace' s equation has non-zero solutions, e,g. u = constant.

To satisfy (1.4), and thereby eliminate this non-uniqueness of the solution, we need'to add some positive multiple of the zero-order term

(u,w)

.

An elliptic form induces the follovring variational problea: given f

, find

(1.5)

u in

a(u,w)

so that

=

(f,v,)

for all w

This problem has one and only one solution u

in

ip

.

, provided

the in-

homogeneous data f is such that the right side makes sense; since w ranges over %? , this places f . in the adJoint space IC-"

, so that

The elliptic problem (1.5) can equally well be put into the more familiar opcrationcl form

G. Strang-G. Fix

For t h i s we i n t e g r a t e t h e l e f t s i d e of (1.5) by p a r t s , s h i f t i n g a d e r i v a t i v e s from w onto q D u The r e s u l t i s ( L U , ~ )= ( f , ~ ) ,

.

which i s equivalent t o (1.6); L

ji)n t o u - ~

i s t h e map from

given by

Many applications l e a d a l s o t o problems of I n such c a s e s the form of

a(u,u)

- ( f , u ) - (u,f)

problem (1.5).

a

i s self-adjoint,

mininlization

.

and t h e minimization

l e a d s exactly t o t h e same v a r i a t i o n a l

I n f a c t t h e operational equation

is

LU = f

nothing but t h e x e r eauation from t h e c a l c u l u s of variations. The Ritz-Galerkin technique i s now simple and very familiar; t h e space

i s replaced i n t h e v a r i a t i o n a l statement (1.5) by

a sequence of closed subspaces

sh

.

Thus t h e approximating

problem, w r i t t e n v a r i a t i o n a l l y , i s t o f i n d a ( uh ,wh ) = (f,wh)

(1.7)

for a l l

uh

wh

in

sh

in

E l l i p t i c i t y implies the existence and uniqueness of concerned with i t s computation. expand

sh

.

uh ; we a r e

Therefore we put t h e approximate

problem a l s o i n t o operational form.

sh , we

so t h a t

Choosing a b a s i s

vh u

fdr

G . Strang-G. Fix

and compute t h e vector

vh of unlmovrn c o e f f i c i e n t s .

Substituting

i n t o (1.7))

Since t h i s holds f o r a l l c o e f f i c i e n t s wvh

Thus t h e v e c t o r . vh

,

s a t i s f i e s t h e d i s c r e t e operational equation

where t h e e n t r i e s i n t h e coefficient matrix and t h e inhomogeneous vector a r e given by

Since a l l t h e s e e n t r i e s have t o Be calculated, e i t h e r a n a l y t i c a l l y o r by numerical quadrature, one wants a s simple a b a s i s a s possible. The use of t h e c l a s s i c a l special functions, i n o t h e r words a r e t u r n t o stage one, i s by no means obsolete; both Urabe and Clenshaw have made successful application of Chebyshev polynomials. interested, however, i n t h e b a s i s functions for'problems on R" f ,u,

...

and ' t h a t

t h e index

a l l have period 0 ( jv < h-'

j

i,J

runs over

Z"

1 , we require t h a t

f o r cach component of

j

Me a r e

defined e a r l i e r ;

, whereas

if

h-I

be an i n t e g e r

.

I n t h i s periodic

sh

case

has f i n i t e dimension

The index

i

N h-n

assumes t h e values

.

1,. .,N

, so

~t is n a t u r a l

t o take t h e b a s i s functions i n groups

of

W

order

a t a time. 11

,a

This p a r t i t i o n s t h e matrix

ct

N

i n t o blocks of

t y p i c a l block being

Thus t h e f i 3 i t e element r e l a t i o n

of

Ah

d i s c r e t e equations.

Ah vh = f h

i s a coupled system

Normally ' t h i s system i s analogous t o

continuous one, i n which t h e o r i g i n a l d i f fe r c n t i a l equation

Lu = f

i s coupled t o some of i t s d i f f e r e n t i a t e d forms

I n t h e Hernite case

D(LU) = Df

t h e d i s c r e t e system can be formally

recombined t o y i e l d a s i n g l e s p l i n e - l i k e equation, J u s t a s t h e Hcrmite b a s i s functions, with small support, can be combined w i t h t h e i r t r a n s l a t e s t o yield the spline basis. .

We want now t o summarize our r e s u l t s .

A more extended summary

has already appeared [5] i n Studies i n Applied ~~lathematics,a r e i n c a r h a t i o n of M.I.T.fs

Journal of Mathematics and Physics.

We

hope t h a t our discussion there-, i n terms of t h e model problem -Au

-1-

u .= f

and i t s

5-

and

9- point d i f f e r e n c e analogues, w i l l

be a u s e f u l supplement t o t h e present pzper.

For t h e moment wc

set nsidc extensions t o eigenvnlue problems and parabolic equations,

and dcscribc our conclusions only f o r t h c problems of convergence and s t a b i l i t y s t a t e d above.

G. Strang-G. Fix ii)

The problem 6f s t a b i l i t y i s t h e simpler,

Hcre t h e

fundamental question i s whether o r not t h e independencc of t h e b a s i s h elements qi, i s uniform a s h e 0 ; i n L2 ,with our normalization (1.1) of t h e b a s i s elements, t h i s means

For t h i s uniform independence we f i n d t h e folloering necessary and s u f f i c i e n t condition: I' =

1

ci qi

t h e r e e x i s t s no n o n - t r i v i a l combination such t h a t t h e Fourier transform of

and r e a l

Y

satisfies

(1.14)

u(so

+

2sj) = 0

for a l l

j e

zn

.

The reader w i l l n o t i c e t h a t everything depends on t h e

qi ; only gross f e a t u r e s of t h e d i f f e r e n t i a l problem, i t s order and e l l i p t i c i t y , a r e r e l e v a n t t o t h e condition number.

This i s an

a t t r a c t i o n , a t l e a s t t o t h e a n a l y s t , which should not disappear

i n more general boundary problems. are

not

For d i f f e r e n c e equations which

derived v s r i a t i o n a l l y , Schacfferf s poererful work [6] has

shovm how deep t h e s t a b i l i t y problem a c t u a l l y i s , even i n comparison with t h e 'corresponding question i n t h e general theory of e l l i p t i c boundary problems.

I n ~homgelS terminology [ 7 ]

,.

(1.14) i s

ncccssary and s u f f i c i e n t f o r t h e d i f f crence equations (1.9) t o be e l l i p t i c . i)

Thc r a t c of -convergence of

"density" of t h e spaces

sh ; t h e r c f o r c

uh

depends on t h e

we begin with a discussion

G. Strang- G . Fix

of approximation theory.

Our main r e s u l t f o r t h e case

takes

N = l

t h e following form:

smooth functions can be approximated from

with error

O(hp+l-') i n l i ~ egs

--

no~nialsi n

xl,.

..,xn

cor;lb:inatlons of

Q

norm, -

,if

s(p

of degree ( p

and only i f a l l poly-

can be w r i t t e n a s l i n e a r

and i t s t r a n s l a t e s .

Fourier a n a l y s i s l e a d s ; i t must have zeros

t o an equivalent condition on t h e transform

$

p -1- 1 a t a l l t h e p o i n t s

,j #

of order

sh

4 = 2nj

(0,.

.,,0)

Here

we have assumed t h e conditions most commonly met i n p r a c t i c e r t h a t cp

is in

fJP

and s t a b i l i t y holds; more p r e c i s e r c s u l t s a r e proved

i n 52. With

N > 1 , the

qi

and l i n e a r combinations of t h e i r t r a n s -

l a t e s may have d i f f e r i n g degrees of smoothness.

I n fact thcre are

important c a s e s i n which t h i s i s bound t o happen. o.r'-.:,?

.?

thnt

sh

Suppose f o r

:is comyriscA of pccc&rise cubic functions

which have continuous f i r s t d e r i v a t i v e s a t each j o i n t .

(n=1)

Since t h i s

Hermite space contains t h e s p l i n e subspace,. whose elements have .continuous second d e r i v c t i v e s a s well, t h e s p l i n e b a s i s f u n c t i o n s must be combinations of t h e Hermite b a s i s ; t h e former i s i n and t h e l a t t e r only i n only t h a t t h e

mi

??

are i n

approximation of order

.

Thus, i n t h e general case, ure a s s w e

vq , q(p , and

hP'l-S

cpi

agbin we prove t h a t

i s p o s s i b l e i n ?lS ( f o r

if and only i f a l l polynomials of degree

from t h e

3

2(

p

s( q)

can be produced

and t h e i r t r a n s l a t e s .

These approximation r e s u l t s a r e of course a l r e a d y known f o r many s p c c i f i c choices of

sh

.

\re mention i n p i r t i c u l a r t h e e a r l y

estimatcs f o r s p l i n c s by Birl p -

i f and only i f each

The range of surnnztion i s understood t o bc

of i n t c c r a t i o n i s

, when

R"

Suppose

TiiEORFki I.

.

2 P,

a,

zn , and

t h e range

nothing i s s a i d t o t h e contrary.

is i n

m

?fz .

Then t h e following

conditions a r e equivalent: A

v ( ~# ) 0

(5.)

p+1

...,tn

for

h

5

p

27.2" :

-

, ),

ja cp(t-J)

is a polynomial i n

jczn

with l e a d i n g terrn

(iff) that as

la1

has zeros of order a t l e a s t

co

n . t t h e o t h e r p o i n t s of

(ii)

tl'

, but

f o r each -*

0

The c o n s t a n t s

u

in

eta , C # ?!p-kl

0

t h e r e a r e weights

,

cs

and

K

a r e independent of

u

wh S

such

.

Proof.

( i =

(

i

.

This equivalence, l i k e much of t h e a n a l y s i s

l a t e r i n t h i s paper, depends on t h e Poisson f o r r ~ u l a : t h e

G. Strang- G. Fix

vnlucs of a function

Y

on t h e l a t t i c e A

those of i t s Fouricr transform

If

zn

a r e connected with

on t h e l a t t i c e

y

2azn by

has compact support, t h e f i r s t swn involves only a f i n i t e

Y'

number of terms,. and t h e secolld i s absolutely convergent. a Applied t o t h e function Y(x) = x ~ ( t - x ) , this y i e l d s

Suppose f i r s t t h a t ( i ) holds. terms on t h e r i g h t s i d e with

j

Then f o r any

# 0 a l l vanish.

have only t o compute t h e contribution from

The leading term i s c l e a r l y

eta , with

la1 ( p

, the

Therefore we

j = 0 :

C = $(o)

# 0 as

required. NOVI we supposc t h a t ( i i ) ho1d.s.

Taking

a = 0

, this

means from (2.4) t h a t

i s a non-zero constant function. for

j

# 0

A Therefore ~ ( 0# ) 0

.

Next we considcr

a = (1,0,.

..,0) ; t h e

A ,m ( 2 ~ j )=

0

r l g h t s i d e of (2.4) i s

G . Strang-G. Fix

{lccording to (ii) this is a polynomial, and therefore this time we

have

A arn/asl

(21.j) = 0 for ' j # 0

way, in order of increasing a conditions in (2.1) (i) =

> (iii)

.

(2.2) and (2.3)

.

Proceeding in the seme

, we,establish the

remaining

.

Our first step is to convert the inequalities

, by

Parseval's formula, into inequalities for

Fourier transforms. We note first that

Therefore the transform of C wh vh is

J

J

We denote the function in braclrets by Wh ( 5 ) it has period C/11

2r/h

in each variable

denote t-hecube - ~ / h
1 (or N > 1) P ' no such common d i v i s o r e x i s t s , but condition (i)should make 1-b from convolution with t h e "B-spline"

possible t o i d e n t i f y a l l u s e f u l bases.

We have learned t h a t

t h i s condition vras known mcch e a r l i e r t o Schoenberg; it appears i n h i s fundamental paper [18] a s t h e condition f o r a smoothing formula t o map onto i t s e l f t h e space of polynomials of degree The Poisson formula, which i s t h e c r u c i a l connection between

zn

transforms an

and

R"

, also

f i g u r e s i n t h e valuable

recent work of Bramble and H i l b e r t 1191 Remark 2.

.

Theorem I d i f f e r s a l i t t l e from t h e r e s u l t

s t a t e d i n t h e introduction, where we assumed ' s t a b i l i t y but gave weaker forms f o r t h e conditions i n t h e theorem.

In t h e

t h i r d condition, f o r exemple, t h e weaker statement imposed no r e s t r i c t i o n (2.3) on t h e weigizts; given t h e s t a b i l i t y condition (1.14), however, t h i s r e s t r i c t i o n i s automatic.

In t h e second

condition, we assumed no p a r t i c u l a r form f o r t h e expansion of polynomials

-

only t h a t they could somehow be represented a s

combinations of

and i t s t r a n s l a t e s , e.g.

p

G . Strang-G. Fix

We, t h e r e f o r e want t o show t h a t s t a b i l i t y f o r c e s these weights

J

The r o l e of s t a b i l i t y i s t o make any r e -

- t o be equal.

presentation (2.14) unique; t h e only rcpresent.ation of tllc zero .function has a l l weights zero.

Novr .we use t r a n s l a t i o n

invariance, s h i f t i n g both s i d e s of (2.14) through the. u n i t vect4r

ev ' i n t h e

By uniqueness

p j-ev

direction:

t,,

= Pj

for a l l

j and

v

, and. t h e

weights

a r e equal

For an expension

o S-e, we add

t h e same argument gives s h i f t i n g through

el

Uniqueness now gives

a

j

=

uo

= oj

for

v = 2,. ..,n

; after

1 t o find

+

jlp

, .so t h a t

This means t h a t t h e l a s t sum i s a polynomial of t h e form required by ( i i ) t t h e induction i s obvious.

Thus t h e statement i n t h e

introduction may be rcgarded a s a corol.lary t o Theorem I Remnrk.3.

.

Thc condition G(0) # 0 i s i n general not necessary f o r clpproximation t o be possible i n ?Is , i f t h e

G . Stkang-G. Fix

r e s t r i c t i o n (2.3) on t h e weights i s removed. n = 1 , and

example t h a t origin.

tp

Suppose f o r

has a zero of o r d e r

Then i f (2.1) holds with

one can c o n s t r u c t ireights holds.

A

p

replaced by

of order

W~

h-'

p

A converse of t h i s r e s u l t is, a l s o poqsible.

all

2nd

, and

(1.14) i s s a t i s f i e d a t

+

p

A

so

= 0

(I

.

We vanishes Therefore

t h e associated Rite-Galerkin system w i l l be & e r i c a l l y s t a b l e , and such a choice of Remark 4.

y

,

such t h a t (2.2)

emphasize tha,t i n such a s i t u a t i o n t h e transform at

a t the

un-

i s t o be avoided.

Theorem I can be made much more p r e c i s e i n a (These refinements may be o f l i t t l e

number of d i r e c t i o n s .

i n t e r e s t t o t h e s e n s i b l e reader, who wants to. .get on with t h e p l o t ; he can s a f e l y disregard Thcorem It. ) show t h a t t h e exponent

pkl-s

b e s t p o s s i b l e f o r any u f 0

First, we can

in t h e e r r o r estimate (2.2) i s

, and

f i n d t h e infinum of constznts

c s f o r which t h i s estimate holds. Second, we note t h a t t h e A smoothness of y ' a n d t h e order of t h e d e r i v a t i v e s . o f tp

which vanish a t t h e p o i n t s 27rj were s p e c i f i e d i n Theorem I by t h e same i n t e g e r

p

.

This r e l a t i o n i s t h e most e f f i c i e n t

I n p r a c t i c e , anc! consequently t h e most common, but t h e r e i s n o . a n r i o r i reason why t h e two i n d i c e s must agree. folloiring we allovr rp where

q (p

>

, since

.q

p

.

t o be l e s s smooth, say 'cp

Insthe i n W:

There i s n o a s e i n permitting e x t r a smoothness,

t h e e s t i n ~ o t e sa r e n o t improved.

Third, we

strengthen t h e converse p a r t of t h e theorem by deducing t h i s smootlmess of

cp

r a t h e r than assunling it.

A f u r t h e r generalization,' which Ire s h a l l forego, i s t o

give estimates a l s o i n f r a c t i o n a l and negative norms.

In

f a c t t h e reader can v e r i f y t h a t such r e s u l t s follow d i r e c t l y from our proofs5 t h e norm (1.3)

, involving s

form,applics equally well t o a l l r e a l discussion of ' L ~ estimates f o r

t h e Fourier t r a n s -

.

We a l s o omit any

.

,m

p f 2

The more p r e c i s e version of Theorem I i s THEOREM 1'.

For any i n t e g e r s

p

0

q

, the

following

conditions a r e equivalent:

t

(I)

rp

(ii)

. cp

ja * ( t - j)

l i e s i n :(?

, {(o) #

lies in

, and

1~:

i s a polynomial Xn

eta , c + o .

term

, and

la1

, the

(p

..,tn

tl ,.

&th

function

leading

i s a d i s t r i b u t i o n with compact support, and

(iii) f o r each h+O

for

0

u

in

XP+l

t h e r e a r e weights

wh

5

such t h a t a s

J

The exponent every

u

p

0

,if

pi-1-s p

i s b c s t p o s s i b l e . f o r every

s

and

i s t h e l a r g c s t i n t e g e r f o r which ( i )

G. Stran.g:G. Fix

I n one dimension, t h e g r e a t e s t lower bound of possible

holds.

constants cs i s

>

With' n

1 t h e l a s t f a c t o r becomes

where " :a w

.

i s t h e d e r i v a t i v e of order

(This constant

Cs

cs = Cs

estimate (2.2) holds, with know only t h a t f o r every u h

1

, the situation be a tiny subset

seems to be essentially the samej there of u

for which the asymptotic constant exceeds Cs

namely

A

those with u supported away from the optimal direction w

.

(We hope to show elsewhere that this conjecture extends also to ripproximation in the ,maximumnorm.) 'Novr we consider the approximation problem when the space

.

...,*

is generated by several functions pl. h case there are N unknowns viJJ i lJ...JN S"

; :

In this

, to be

computed at each meshpoint from the finite element equations vh = fh

.

The merit of such an extension'is to make high

accuracy p possible.xvithreiatively.sLmple functions cpi

-

their support can be small, so that frequently the required inner products are easier to compute and boundary conditions simpler to match, and they can have additional interpolating propcrtics. each

vi

In the one-dimensional Hermitc case, for example,

is supported on the interval [-1,1]

, and the

1-1 st

is the only one of the first N - 1 derivatives to be non-zero

G. Strang-G. Fix.

which

at the origin. This means that the quantities vi,j

satisfy the finite element equation have physical significance in themselves, as the "displacement", "slope", "stress", etc. of the approximate solution at the meshpoint x

=

.

jh

This

has been found very attractive by users.

TIEOREM 11.

Suppose

", ...er", N

in ?lz

.

Then the

following conditions are equivalent: (i)

there are linear combinations Pa

of the q i which

satisfy A

A

(2.194

~~(= 0 la, ) y0(2nj)

0

= 0

for j #

o

for all j E z n , 1 s la1 L p

(ii) there are linear combinations Y,

of the cpi

which

satisfy

(iii) for each u that for s = D , l , .

in ?tP+l there,areweights w

. .q,

i, j

wch

G. Strang- G. Fix

Remark 5 .

Babuska has asked us whether t h e following

condition on t h e ' (iv) ql,.

..,%

vi

i s equivalent t o those i n Theorem 11:

t h e r e i s a f i n i t e l i n e a r combination and t h e i r t r a n s l a t e s

via

0

of

which s a t i s f i e s t h e

conditions imposed' i n t h e case I?.= 1 :

(2.23)

~ ~ f i ( 2 r =j )0 f o r "la1 ( p

W e s h a l l prove t h a t

,j #

( i ) => ( i v ) => ( i i i )

0

, so

.

t h a t Babuska's

i n s i g h t allolvs t h e reduction of many 'N-dimensional p r o b l e m t o t h e simpler case

N = 1

.

We note t h a t t h e t r a n s l a t e s admitted

i n . ( i v ) a r e t h e functions

It i s obvious t h a t ( i v ) => ( i i i ) : i f t h e function s a t i s f i e s t h c conditions of Theorem I, then combinations

n

G. strang-G. Fix

C w? J .

oh can be used t o J

To prove t h a t

( i ) => ( i f f )

polynomialof degree for

Jy-l 5 P

approximate

p

A

l a t e s of

Y,

e

l e t t, 191,

..., e

denote t h e unique such t h a t

,

Cl by giving i t s Fourier transform:

i s t h e trznsform of a f i n i t e combination of t r a n s -

, this

of t h e o r i g i n a l qi

n i s indeed a f i n i t e l i n e a r combination and W e i r t r a n s l a t e s .

By t h e p e r i o d i c i t y of t h e

which by (2.lga) equals one i f To v e r i f y (2.23) of a product:

, we

jointlyin

Then we define t h e required

Since t, Y,

u a s required i n (2.21-22).

, we

t,

,

j = 0

and zero otherwise.

use t h e Leibniz r u l e f o r . t h e d e r i v a t i v e

G..Strang-G. Fix

replacinp, a

by

y

-P .

property (2.19~)of t h c

This vanishes, by t h e fundamental h

Ya

, 2nd

condition (iv) i s verified.

We note t h a t (2.23) holds even f o r 1 ( In1 ( p

.

Tnus t h e

qa

, when

which were constructed i n t h e

~ i - o o fof Theoren I reduce -to boa u

j= 0

, when

by combinations of t h e functions

rve a r e approximating

". h

We t u r n now from t h e mean-square zpproximation. of a f u n c t i o n and i t s d c r i v a t i v c s t o t h e problem of p o i n t a i s e approximation.

r

The l a t t e r i s c r u c i a l nwncrically, even th0up.h t h e origfr:al d i f f e r e n t i a l problem ( e . ~ .t h a t of minimizing a quadratic f u n c t i o n a l ) l e s d s more n a t u r a l l y t o t h e former.

It i s a

funda!ncntal orovcrty of Lhe f i n i t e elernent method t h a t t h e tv:o p

+

GO

1

toqcthcr.

-

s

By t h i s vie mean not only t h a t t h e order

o r t h e b a s t approximetion i s t h e sane i n t h e t m

nonns, a s t h e next theorem shov~s, but a l s o t h a t t h e o r d e r

r

of t h e a c t u a l approxi?nntion by t h e Ritz-Galerkin function

uh

G. Strang-G. Fix

i s t h e same i n both norms. of

p+l-s

and

2(p+l-m)

r

This exponent

i s t h e smaller

.

The following estimate includes known r e s u l t s f o r spline approximation, i n t h e special case of equally spaced knots. Splines a r e t y p i c a l of many important choices of t h e t h a t t h e i r derivatives of some order

q

cpi

,in

have jump discontiunities;

.

( I n t h e spline we note t h a t t h i s leaves them s a f e l y i n W; case, q i s the. degree of t h e polynomial i n each interval, and coincides h%th t h e accuracy exponent THEOREM 111.

Suppose

Q1,...,vN

p .) s a t i s f y t h e conditions of

t h e previous theorem, and have bounded derivatives of order

q

.

Then i f

i t follows t h a t

Proof. part of

x/h

i n powers of

We write and

h

,

t

x=kh+ th

, where

k

l i e s i n t h e u n i t cube

i s the integral O l t v < l

.

Expanding

G . Strang-G. Fix

We know from the s t a r t that uI1 cannot be c l o s e r to u than the optimal h approximation from the space S

.

Therefore the e r r o r , measured i n H~

o, r:w

will be a t best of the o r d e r hptl-s determined i n Theorems I

and 11.

The question i s whether the approsimation produced by the finite Thcre i s no a p r i o r i reason

clement is actually of this optimal o r d e r .

why this should always be so; i n fact, if we fix

(3 i n

H~ and i n c r e a s e the It

o r d e r 2m of the equation, t h e r e i s every reason to think otherwise. s e e m s unlikely that the e r r o r in 14' would stay of o r d e r p

+ 1 until m

exceeds

p, a t which point cp would no longer lead to admissible t r i a l functions and the method would collapse.

Therefore we anticipate that tlie o r d e r of

accuracy will depend on m a s a c l l a s p and s . Thc c o r r e c t o r d e r can in fact he d e t e r ~ ~ l i n eover d the range 0

Q

s 4 m by an elegant variational argument which me lkarned from

Marlin Schultz. A special but still typical c a s e of this argument h a s been published by Nitsche ( 211, and Aubin has shown us all alternative route to the s a m e result. We begin with the fundamciltal result (cf. Varga 1221) that i n the case s

= m, the c r r o r uh

- u i s indeed of the optimal o r d e r hp t l -m

allowcd by Lhc approximation theorems.

Repeating the standard a r g u -

ment, we deduce from the v a r i a t i o ~ l a lequations (1. 5) and (1. 7) that a(uh

- u, wh ) = (f, wh ) -

h (f, w ) =

o

Therefore by ellipticity plluh

- u l l H m2

-

6 ~ e a ( u " u,u

h

-u)

11 . h for a l l w ~n s

(2.31)

G. Strang-G. Fix ior a l l w

h

.

h Cancelling the f i r s t i a c t o r on thc right, and choosing w

a s the optimal approximation to u,

p 1-1 IIerc we ticed u i n 13 i n o r d e r l o apply T h e o r e m s I and 11, and t h e r e If i is l e s s smooth, the e s t i -

fore we a s s u m e that f l i c s in Hp'1-2m.

m a t e s of this section a r e not difficult t o r e v i s e .

It i s t o the e s t i m a t e

(2.32)that we m a y apply o u r calculation i n Theorem I' of the m i n i m a l constant C

i n approximation.

m

Now we give Schultz's a r g u i l ~ c n tf o r s

< m. I t begins with t h e

adjoint problem LQv = g, which i s equivalent to the variatioilal equation 'm a(y, v) = (y, g) f o r a l l y i n N

(2.33)

Again the cllipticjty of a guarantees a unique solution v f o r g i n H - ~ , and i u r t h e r ~ n o r cthat

-

Taking y = ul' h (U h . i o r a l l w in

-

u, and recalling (5. I ) ,

U,

sh.

11 g) = a ( u

- u,

I)

= a(uh

- u, v - wh )

(2.35)

Therefore

I(u"-u,E)~

K ~ ~ u ' ' - u I I ~ ~ ~ ~ hv -llHmw

.

(2.36)

To e s t i m a t e the l a s t t e r m , we choose wh a s the b e s t Hm a p p r o x i m a ~ i o n t o v, and appeal to T h c o r c m s I and 11:

r iipf 1 4 2m-s (2.37) i f p f l > Z m - s I1

G. Strang-G. Fix

(In tLc second c a s e we reduced p t o 2m

- s r - 1 before applying the

approximation theorems; if t h e i r hypotheqes hold f o r a given p they certainly hold for a s m a l l e r on;. ) Substituting @.32)and@:37) into (2.36)

and using (2.34), we have

r = min(p

+ 1 - s, 2(p + 1 - in)) .

(2.38)

Now a s g r u n s over thc unit b a l l i n I - I - ~ , the s u p r e m u m of the left side is exactly the n o r m of uh

- u i n t h e d u d space H ~ .T h e r e f o r e the finai

e r r o r estimate i s

We notice that with s = m the f i r s t expression i n r is the s m a l l e r , and

r =p

+ 1 - rn in agreement with

(2.32).

G . Strang-G. Fix

C. C , Zienkiewicz, "The f i n i t e element method i n s t r u c t u r a l and continuum mechanics, " London: McGraw-Nil1 (1967)

.

R. Courant, "Variationel methods f o r the s o l u t i o n of problems of e u i l i b r i m znd vibrations," Bull. Amer. Math. Soc. 49, 1-23?1943)

.

G. Polya, "Sur une i n t e r p r e t a t i o n de l a mcthode cles differences

f i n i e s c j u i peut fournir des bornes superieures ou i n f e r i e u r e s , " Comptes Hendus 235, 995-997 (1952)

.

J. P. Aubin, "Behavior of t h e e r r o r of . t h e approximate

solutions of boundary value problems f o r l i n e a r e l l i p t i c operators by Galerkin's and f i n i t e difference methods," Rem. Sem. Mat. Pado-ra.

G. Fix and 6 . Strang, "Fourier analysis of t h e f i n i t e elernent method i n Ritz-Galerkin theory," Studies i n Appl. Math. 48, 265-273 (1969)

.

D. Schaeffer, "Approxination of e l l i p t i c boundary value problemllby difference equations; I. Factorization of t h e symbol, J. Functional Analysis 1970.

V. Thomee, " E l l i p t i c difference operators and D i r i c h l e t ' s

problem," Contr. Diff. Eqns. 3, 301-324 (1964).

J . P. Aubin, "Approxim9tion des espaces de d i s t r i b u t i o n s e t

.

dea operateurs d i f f e r e n t i a l s , I' Bull. Soc Math. Prance, Memoire 12 (1957).

G. Birkhoff, M. H. Schultz and R. S. Varga, "Hermite i n t e r polation i n one and more v a r i a b l e s with applications t o p a r t i a l d i f f e r e n t i a l equations, " Elmer. Math., 11, 232-256 (1968)

M. II. Schultz, "Rayleigh-Ritz-Galerkin methods f o r multi-

dimensional problems,

"

SUM Numer. Anal. 6, 523 -538 (1969)

.

F. DiGuglielmo, "Methode des elements f i n i s : une famille dlapproximations des espaces de Sobolev par l e s t r a n s l a t e s de p-fonctions," Manuscript, 1970.

I. Babuska,

"Approximation by H i l l functions,

''

t o appear.

J . J. Goel, "Construction of basic functions f o r numericAl u t i l i z a t i o n of R i t z l s method," Numer. Math. 12, 435-447 (1968). V. Thomee,"On t h e convergence of difference quotients i n

e l l i p t i c problems," Univ. of Maryland, Note BPI-537 -(1968).

Zlamal, "On the. f i n i t e el ernent method, " Numer. Math. 12, 394-!+09 (19.58)

)I.

.

.

G. Strang-G. Fix

[16] R. J. lierbold, M. H. Schultz, and R. S. Varga, "Quzdrature schemes for the numerical ,solution of boundary value problems by variational techniques, Aequationes Mathenaticae 3, 96-119 (1959). .

[1.7] R. Boas, :Entire functions," New York, Academic Press (1954). [18] I. 3. Schoenberg, "Contributions to the problem of approximation of eauiclistant dcta by analytic functions, Parts A and B.," Quart. Appl. ihth. 4, 45-99>112-141 (1946). [lg]

3. H. Bramble, and S. R. Hilbert, "Bounds for a class of

linear functionals with applications to Hermite interpolation." Numer Mathematik.

.

r2.01 G. Strang, "The finite element method and approximation theory,

Numerical Solution of Partial Differential Equations I1 (SYI'TSPADE), Academic Press, 1971. [21] J. Nitsche, "Ein Kriterium fur die auasi-optimilifat des Ritzschen verfahrens," Numer. Math. .11, 346-348 (1968). 1221

R. S. Varga, 'Hermite interpolation-type Ritz methods for two-point boundary value problems," in: "Numerical solution of partial differential equations," 3. H. Bramble, Ed., New York: Academic Press, 365-373 (1965).

CENTRO INTERNAZIONALE MATEMATICO ESTIVO

(C. I. IVI. E. )

$A.

ZERNER

CARACTERISTIQUES D'APPROXIMATION DES COMPACTS DANS L E S ESPACES FONCTIONNELS E T PROBLEMES AUX LIMITES ELLIPTIQUES

Corso

t e n u t o a E r i c e d a l 2 7 giugno a 1 7 l u g l i o

1971

,CARACTERISTIQUES DfA PPROXIMATION DES COMPACTS DANS L E S ESPACES FONCTIONNELS E T PROBLEMES AUX LIMITES ELLIPTIQUES

par M. Zerner ( Universite de Nice )

1.

Considerons l e probleme aux limites :

oh R est un ouvert borne suffisamment r6gulier.de R

n

.

A est un opera-

teur elliptique & coefficients e g d f o r d r e 2m, l e s B. des operateurs d f o r 3 des fonctions donndes dans des boules des wP d r e , les j k-m. -112. 3 Nous faisons expressement lfhypoth&se que ce problgme e s t bien pose et que si u = G ( (9 1' de Vk=

. . .ym )

wP k - m l - l / p ( 3 0)

sur

est l a solution, alors G est un isomorphisme

wPk

(R) flA-'(0) et cel& pour tout k assez

grand. Nous voulons des indications s u r l a faqon de discretiser ce problhme de f a ~ o nB garantir une precision de E au sens de donnee

tq

= (

IlyIIk_

Constructive Aspects of Functional Analysis (C.I.M.E. Summer Schools, 57)

Geometric Aspects of Functional Analysis

Geometrical Aspects of Functional Analysis

Geometric Aspects of Functional Analysis

Geometric aspects of functional analysis

Geometric Aspects of Functional Analysis

Geometric aspects of functional analysis

Techniques of Constructive Analysis

Constructive Analysis

Aspects of Mathematical Logic (C.I.M.E. Summer Schools, 48)

Some Aspects of Diffusion Theory (C.I.M.E. Summer Schools, 42)

Complex Analysis (C.I.M.E. Summer Schools, 62)

Complex Analysis (C.I.M.E. Summer Schools, 62)

Spectral Analysis (C.I.M.E. Summer Schools, 64)

Some Aspects of Ring Theory (C.I.M.E. Summer Schools, 37)

Techniques of Constructive Analysis (Universitext)

Techniques of Constructive Analysis (Universitext)

Numerical Analysis of Partial Differential Equations (C.I.M.E. Summer Schools, 44)

Fundamentals of functional analysis

Elements of Functional Analysis

Elements of Functional Analysis

Fundamentals of functional analysis

Elements of Functional Analysis

Geometric Aspects of Functional Analysis: Israel Seminar 2001-2002

Geometric aspects of functional analysis: Israel seminar 2004-2005

Geometric Aspects of Functional Analysis: Israel Seminar 2006-2010

Geometric aspects of functional analysis. Israel seminar 2002--2003

Principles of functional analysis

Principles of functional analysis

Constructive Aspects of Functional Analysis (C.I.M.E. Summer Schools, 57)

Geometric Aspects of Functional Analysis

Geometrical Aspects of Functional Analysis

Geometric Aspects of Functional Analysis

Geometric aspects of functional analysis

Geometric Aspects of Functional Analysis

Geometric aspects of functional analysis

Techniques of Constructive Analysis

Constructive Analysis

Aspects of Mathematical Logic (C.I.M.E. Summer Schools, 48)

Some Aspects of Diffusion Theory (C.I.M.E. Summer Schools, 42)

Complex Analysis (C.I.M.E. Summer Schools, 62)

Complex Analysis (C.I.M.E. Summer Schools, 62)

Spectral Analysis (C.I.M.E. Summer Schools, 64)

Some Aspects of Ring Theory (C.I.M.E. Summer Schools, 37)

Techniques of Constructive Analysis (Universitext)

Techniques of Constructive Analysis (Universitext)

Numerical Analysis of Partial Differential Equations (C.I.M.E. Summer Schools, 44)

Fundamentals of functional analysis

Elements of Functional Analysis

Elements of Functional Analysis

Fundamentals of functional analysis

Elements of Functional Analysis

Geometric Aspects of Functional Analysis: Israel Seminar 2001-2002

Geometric aspects of functional analysis: Israel seminar 2004-2005

Geometric Aspects of Functional Analysis: Israel Seminar 2006-2010

Geometric aspects of functional analysis. Israel seminar 2002--2003

Principles of functional analysis

Principles of functional analysis

Recommend Documents