Microeconomic Theory

d~~\, 'u < 5: --< ,~'" T f Microeconomic Theory Andreu Mas-Colell Michael D. Whinston and Jerry R. Green New Yo...

Author: Andreu Mas-Colell | Michael D. Whinston | Jerry R. Green

203 downloads 2287 Views 33MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

d~~\,

'u

pAR

T

ONE

Individual Decision Making

I

A distinctive feature of microeconomic theory is that it aims to model economic activity as an interaction of individual economic agents pursuing their private interests. It is therefore appropriate that we begin our study of microeconomic theory with an analysis of individual decision making. Chapter I is short and preliminary. It consists of an introduction to the theory of individual decision making considered in an abstract setting. It introduces the decision maker and her choice problem, and it describes two related approaches to modeling her decisions. One, the preference-based approach, assumes that the decision maker has a preference relation over her set of possible choices that satisfies certain rationality axioms. The other, the choice-based approach, focuses directly on the decision maker's choice behavior, imposing consistency restrictions that parallel the rationality axioms of the preference-based approach. The remaining chapters in Part One study individual decision making in explicitly economic contexts. It is common in microeconomics texts-and this text is no exception-to distinguish between two sets of agents in the economy: i/ldividual CO/lSUlllers and firms. Because individual consumers own and run firms and therefore ultimately determine a firm's actions, they are in a sense the more fundamental element of an economic model. Hence, we begin our review of the theory of economic decision making with an examination of the consumption side of the economy. Chapters 2 and 3 study the behavior of consumers in a market economy. Chapter 2 begins by describing the consumer's decision problem and then introduces the concept of the consumer's demand funcrion. We then proceed to investigate the implications for the demand function of several natural properties of consumer demand. This investigation constitutes an analysis of consumer behavior in the spirit of the choice-based approach introduced in Chapter I. In Chapter 3, we develop the classical preference-based approach to consumer demand. Topics such as utility maximization, expenditure minimization, duality, integrability, and the measurement of welfare changes are studied there. We also discuss the relation between this theory and the choice-based approach studied in Chapter 2. I n economic analysis, the aggregate behavior of consumers is often more important than the behavior of any single consumer. In Chapter 4, we analyze the 3

4

PART

I:

INDIVIDUAL

DECISION

C

MAKING

extent to which the properties of individual demand discussed in Chapters 2 and 3 also hold for aggregate consumer demand. In Chapter 5, we study the behavior of the firm. We begin by posing the firm's decision problem, introducing its technological constraints and the assumption of profit maximization. A rich theory, paralleling that for consumer demand, emerges. In an important sense, however, this analysis constitutes a first step because it takes the objective of profit maximization as a maintained hypothesis. In the last section of the chapter, we comment on the circumstances under which profit maximization can be derived as the desired objective of the firm's owners. Chapter 6 introduces risk and uncertainty into the theory of individual decision making. In most economic decision problems, an individual's or firm's choices do not result in perfectly certain outcomes. The theory of decision making under uncertainty developed in this chapter therefore has wide-ranging applications to economic problems, many of which we discuss later in the book.

Preference and Choice

HAP

T

E

R

1 ,",

l.A Introduction In this chapter, we begin our study of the theory of individual decision making by considering it in a completely abstract setting. The remaining chapters in Part I dcvelop the analysis in the context of explicitly economic decisions. The starting point for any individual decision problem is a sel of possible (mulually nell/sh'e) "/lallalives from which the individual must choose. In the discussion that follows, we denote this set of alternatives abstractly by X. For the moment, this set can be anything. For example, when an individual confronts a decision of what career path to follow, thc alternatives in X might be: {go to law school, go to graduate school and study economics, go to business school, ... , become a rock star}. In Chapters 2 and 3, when we consider the consumer's decision problem, the elements of the set X are the possible consumption choices. There are two distinct approaches to modeling individual choice behavior. The first, which we introduce in Section I.B, treats the decision maker's tastes, as summarized in her preferellce relalion, as the primitive characteristic of the individual. The theory is developed by first imposing rationality axioms on the decision maker's preferences and then analyzing the consequences of these preferences for her choice behavior (i.e., on decisions made). This preference-based approach is the more traditional of the two, and it is the one that we emphasize throughout the book. The second approach, which we develop in Section I.C, treats the individual's choice beha vi or as the primitive feature and proceeds by making assumptions directly concerning this behavior. A central assumption in this approach, the weak axiom of re"ealed preJ~r(,lIce, imposes an element of consistency on choice behavior, in a sense paralleling the rationality assumptions of the preference-based approach. This choice-based approach has several attractive features. It leaves room, in principle, for more general forms of individual behavior than is possible with the preferencehased approach. I t also makes assumptions about objects that are directly observable (choice bchavior), rather than about things that are not (preferences). Perhaps most importantly, it makes clear that the theory of individual decision making need not be based on a process of introspection but can be given an entirely behavioral foundation. 5

------------------

,..:....

.

6

CHAPTER

1:

PREFERENCE

AND

Understanding the relationship between these two different approaches to modeling individual behavior is of considerable interest. Section I.D investigates this question, examining first the implications of the preference-based approach for choice behavior and then the conditions under which choice behavior is compatible with the existence of underlying preferences. (This is an issue that also comes up in Chapters 2 and 3 for the more restricted setting of consumer demand.) For an in-depth, advanced treatment of the material of this chapter, see Richter (1971).

1.B Preference Relations In the preference-based approach, the objectives of the decision maker are summarized in a preference relation. which we denote by;::. Technically. ;:: is a binary relation on the set of alternatives X, allowing the comparison of pairs of alternatives x. Y EX. We read x;:: y as "x is at least as good as y." From ;::, we can derive two other important relations on X: (i) The strict preference relation, :>. defined by x :> y

SECTION

CHOICE

-=

x;:: y but not y ;:: x

and read "x is preferred to y.'" (ii) The intiijJerellce relation, -, defined by x - y

-=

x;:: y and y ;:: x

and read "x is indifferent to y." In much of microeconomic theory, individual preferences are assumed to be rational. The hypothesis of rationality is embodied in two basic assumptions about the preference relation ;::: completeness and transitivity.l DefinItion 1.B.1: The prelerence relation;:: is rational il it possesses the lollowing two properties: (i) Completeness: lor all x, y E X, we have that x;:: y or y;:: X (or both). (ii) Transitivity: For all x, y, Z E X, if x ;:: y and y;:: Z, then x;:: z. The assumption that;:: is complete says that the individual has a well-defined preference between any two possible alternatives. The strength of the completeness assumption should not be underestimated. Introspection quickly reveals how hard it is to e\'aluate alternatives that are far from the realm of common experience. It takes work and serious reflection to find out one's own preferences. The completeness axiom says that this task has taken place: our decision makers make only meditated choices. Transitivity is also a strong assumption. and it goes to the hcart of the conccpt of 1. The symbol .- is read as "if ilnd only ir." The literature sometimes speaks of x ;;: .r as "x is weakly preferred to .\' .. and.\" >- y as "x is strictly preferred to y:' We shall adhere to the terminology introduced above. 2. Note that there is no unified terminology in the literature; weak order and complete preordl!r are common alternatives to the term rational preference relation. Also. in some presentations. the assumption that;:: is rejlexire (defined as x ~ x for all x EX) is added to the completeness and tnmsitivilY assumptions. This property is, in fact, implied by completeness and so is redundant.

1.8:

PREFEAENCE

rationality. Transitivity implies that it is impossible to face the decision maker with a sequence of pairwise choices in which her preferences appear to cycle: for example, feeling that an apple is at least as good as a banana and that a banana is at least as good as an orange but then also preferring an orange over an apple. Like the completeness property, the transitivity assumption can be hard to satisfy when evaluating alternatives far from common experience. As compared to the completeness property, however, it is also more fundamental in the sense that substantial portions of economic theory would not survive if economic agents could not be assumed to have transitive preferences. The assumption that the preference relation;:: is complete and transitive has implications for the strict preference and indifference relations:> and -. These are summarized in Proposition I.B.I, whose proof we forgo. (After completing this section. try to establish these properties yourself in Exercises I.B.I and 1.B.2.) Proposition 1.B.1: 11 ;:: is rational then: (i) :> is both irreflexive (x:> x never holds) and transitive (il x :> y and y:> z, then x:> z). (ii) - is reflexive (x - x lor all x), transitive (if x - y and y - z, then x - z). and symmetric (il x - y, then y - x). (iii) if x:> Y ;:: z, then x:> z. The irreflexivity of :> and the reflexivity and symmetry of - are sensible properties for strict preference and indifference relations. A more important point in Proposition I.B.I is that rationality of ;:: implies that both:> and - are transitive. In addition, a transitive-like property also holds for:> when it is combined with an at-Ieast-asgood-as relation, ;::. An individual's preferences may fail to satisfy the transitivity property for a number of reasons. One difficuhy arises because of the problem of jusl /Wrcept;ble differellas. For example. if we ask an individual to choose between two very similar shades of gray for painting her room. she may be unable to tell the difference between the colors and will therefore be indifferent. Suppose now that we offer her a choice between the lighter of the two gray paints and a slightly lighter shade. She may again be unable to tell the difference. If we continue in this fashion. letting the paint colors get progressively lighter with each successive choice experiment. she may express indifference at each step. Yet. if we offer her a choice between the original (darkest) shade of gray and the final (almost white) color, she would be able to distinguish between the colors and is likely to prefer one of them. This, however. violates transitivity. Another potential problem arises when the manner in which alternatives are presented matters for choice. This is known as the 1;'''''';IIg problem. Consider the following example. paraphrased from Kahneman and Tversky (1984): Imagine that you arc about to purcha~e a stereo for 125 dollars and a calculator for 15 dollars. The salesman tells you that the calculator is on sale for 5 dollars less at the other branch of the store. located 10 minutes away. The stereo is the same price there. Would you make the trip to the other store? It turns Ollt that the fraction of respondents saying that they would travel to the other store

for the 5 dollar discount is much higher than the fraction who say they would travel when the question is changed so that the 5 dollar saving is on the stereo. This is so even though the ultimate saving obtained by incurring the inconvenience of travel is the same in both

RELATIONS

8

CHAPTER

1:

PREFERENCE

AND

SECTION

CHOICE

cases.' Indeed, we would expect indifference to be the response to the following question: Because of a stockout you must travel to the other store to get the two items, but you will receive S dollars oft' on either item as compensation. Do you care On which item this S dollar rebate is given?

If so, however, the individual violates transitivity. To see this, denote x = Travel to the other store and get a 5 dollar discount on the calculator. }' = Travel to the other store and get a 5 dollar discount on the stereo. z = Buy both items at the first store. The first two choices say that x >- z and z >- y, but the last choice reveals x - y. Many problems of framing arise when individuals are faced with choices between alternatives that have uncertain outcomes (the subject of Chapter 6). Kahneman and Tversky (1984) provide a number of other interesting examples. At the same time, it is often the COolse that apparently intransitive behavior can be explained fruitfully as the result of the interaction of several more primitive rational (and thus transitive) preferences. Consider the following two examples (i) A household formed by Mom (M), Dad (D), and Child (C) makes decisions by majority voting. The alternatives for Friday evening entertainment are attending an opera (0), a rock concert (R), or an ice.skating show (I). The three members of the household have the rational individual preferences: 0 >-,. R >-", I, 1 >-00 >-0 R, R >-c 1 >-eO, where >-,., >-0' >-e are the transitive individual strict preference relations. Now imagine three majority· rule votes: 0 versus R, R versus I, and 1 versus O. The result of these votes (0 will win the first, R the second, and 1 the third) will make the household's preferences ~ have the intransitive form: 0 >- R >- 1 >- O. (The intransitivity illustrated in this example is known as the Condoreet paradox, and it is a central difficulty for the theory of group decision making. For further discussion, see Chapter 21.) (ii) Intransitive decisions may also sometimes be viewed as a manifestation of a change of tastcs. For example, a potential cigarette smoker may prefer smoking one cigarette a day to not smoking and may prefer not smoking to smoking heavily. But once she is smoking one cigarette a day, her tastes may change, and she may wish to increase the amount that she smokes. Formally, letting y be abstinence, x be smoking one cigarette a day, and z be heavy smoking, her initial situation is y, and her preferences in that initial situation are x>- y >- z. But once x is chosen over y and z, and there is a change of the individual's current situation from y to x, her tastes change to z >- x >- y. Thus, we apparently have an intransitivity: z >- x >- z. This change·oj·tastes model has an important theoretical bearing on the analysis of addictive beha vior. It also raises interesting issues related to commitment in decision making [see Schelling (1979)]. A rational decision maker will anticipate the induced change of tastes and will therefore attempt to tie her hand to her initial decision (Ulysses had himself tied to the mast when approaching the island of the Sirens). It often happens that this change.of·tastes point of view gives us a well·structured way to think about nonrational decisions. See Elster (1979) for philosophical discussions of this and similar points.

Utility Functions In economics, we often describe preference relations by means of a utility lunction. A utility function u(x) assigns a numerical value to each element in X, ranking the 3. Kahneman and Tversky attribute this finding to individuals keeping "mental accounts" in which the savings are compared to the price of the item on which they are received.

1.C:

CHOICE

RULES

9

elements of X in accordance with the individual's preferences. This is stated more precisely in Definition 1.B.2. Definition 1.B.2: A function u: X relation;:: if, for all x. y E X.

-+

R is a utility function representing preference

x;:;y _

u(x)

~u(y).

Note that a utility function that represents a preference relation;:: is not unique. For any strictly increasing function I: R -+ R, v(x) = I(u(x)) is a new utility function representing the same preferences as u('); see Exercise I.B.3. It is only the ranking of alternatives that matters. Properties of utility functions that are invariant for any strictly increasing transformation are called ordinal. Cardinal properties are those not preserved under all such transformations. Thus, the preference relation associated with a utility function is an ordinal property. On the other hand, the numerical values associated with the alternatives in X, and hence the magnitude of any differences in the utility measure between alternatives, are cardinal properties. The ability to represent preferences by a utility function is closely linked to the assumption of rationality. In particular, we have the result shown in Proposition I.B.2.

•

Proposition 1.B.2: A preference relation;:: can be represented by a utility function only if it is rational. Proof: To prove this proposition, we show that if there is a utility function that represents preferences ;::, then;:; must be complete and transitive. COl1lplelelle.~s. Because u(·) is a real-valued function defined on X, it must be that for any x, Y E X, either u(x) ~ u(y) or u(y) ~ u(x). But because u(·) is a utility function representing ;::, this implies either that x;:; y or that y;:: x (recall Definition I.B.2). Hence, ;:: must be complete.

TrallSirivity. Suppose that x;:; y and y;:: z. Because u(·) represents ;::, we must have u(x) ~ u(y) and u(y) ~ u(z). Therefore, u(x) ~ u(z). Because u(·) represents ;:;, this implies x;:::. Thus, we have shown that x;:: y and y;:: z imply x;:: Z, and so transitivity is established. _

At the same time, one might wonder, can any rational preference relation ;:; be described by some utility function? It turns out that, in general, the answer is no. An example where it is not possible to do so will be discussed in Section 3.G. One case in which we can always represent a rational preference relation with a utility function arises when X is finite (see Exercise I.B.5). More interesting utility representation results (e.g., for sets of alternatives that are not finite) will be presented in later chapters.

I.C Choice Rules In the second approach to the theory of decision making, choice behavior itself is taken to be the primitive object of the theory. Formally, choice behavior is represented by means of a choice structure. A choice structure (&ii, C(.» consists of two ingredients:

,

10

CH APTER

1:

PREFERENCE

AND

CHOICE

(i) !fI is a family (a set) of nonempty subsets of X; that is, every element of !fI is a set Be X. By analogy with the consumer theory to be developed in Chapters 2 and 3, we call the elements BE!fI budget sets. The budget sets in !fI should be thought of as an exhaustive listing of all the choice experiments that the institutionally, physically, or otherwise restricted social situation can conceivably pose to the decision maker. It need not, however, include all possible subsets of X. Indeed, in the case of consumer demand studied in later chapters, it will not. (ii) C(.) is a choice rule (technically, it is a correspondence) that assigns a nonempty set of chosen elements C(B) c B for every budget set BE!fI. When C(B) contains a single element, that element is the individual's choice from among the alternatives in B. The set C(B) may, however, contain more than one element. When it does, the elements of C(B) are the alternatives in B that the decision maker mig/II choose; that is, they are her acceptable alternatives in B. In this case, the set C(B) can be thought of as containing those alternatives that we would actually see chosen if the decision maker were repeatedly to face the problem of choosing an alternative from set B. Example I.C.!: Suppose that X = {x,y.z} and!fl = {{x.y}, {x. y, z}}. One possible choice structure is (!fl. C,('», where the choice rule C,(-} is: C,({x,y}) = {x} and C,({x. y, z}) = {x}. In this case. we see x chosen no matter what budget the decision maker faces. Another possible choice structure is (!fI, C2 ('», where the choice rule C 2 (') is: C2 ({x. y}) = {x} and C2 ({x, y, z}) = {x, y}.ln this case, we see x chosen whenever the decision maker faces budget {x, y}, but we may see either x or y chosen when she faces budget {x, y. z}. _ When using choice structures to model individual behavior. we may want to impose some "reasonable" restrictions regarding an individual's choice behavior. An important assumption, the weak axiom of revealed preference [first suggested by Samuelson; see Chapter 5 in Samuelson (1947)], reflects the expectation that an individual's observed choices will display a certain amount of consistency. For example, if an individual chooses alternative x (and only that) when faced with a choice between x and y, we would be surprised to see her choose y when faced with a decision among x, y, and a third alterative z. The idea is that the choice of x when facing the alternatives {x. y} reveals a proclivity for choosing x over y that we should expect to see reflected in the individual's behavior when faced with the alternatives {x. y, z}.· The weak axiom is stated formally in Definition I.CI. Definition 1.C.1: The choice structure (!fl. C( .» satisfies the weak axiom of revealed preference if the following property holds: If for some BE!fI with x, Y E B we have x E ClB), then for any B' E ~ with x, Y E B' and Y E ClB'). we must also have x E ClB'). In words, the weak axiom says that if x is ever chosen when y is available, then there can be no budget set containing both alternatives for which y is chosen and x is not. 4. This proclivity might reflect some underlying "preference" for x over }' but might also arise in other ways, It could. for example, be the result of some evolutionary process.

SECTION

1.0:

RELATIONSHIP

8ETWEEN

PREFERENCE

RELATIONS

AND

Note how the assumption that choice behavior satisfies the weak axiom captures the consistency idea: If c({x. y}) = {x}, then the weak axiom says that we cannot have c({x. y. z}) = {y}.' A somewhat simpler statement of the weak axiom can be obtained by defining a revealed preference relmion ;::* from the observed choice behavior in C(.). Definition 1.C.2: Given a choice structure (!fI, ;::* is defined by

C(.»

the revealed preference relation

x ;::* Y "'" there is some BE!fI such that x, Y E B and x E C(B). We read x;::* y as "x is revealed at least as good as y." Note that the revealed preference relation ;::* need not be either complete or transitive. In particular, for any pair of alternatives x and y to be comparable. it is necessary that. for some BE!fI, we have x. y E B and either x E qB) or y E C(B). or both. We might also informally say that "x is revealed preferred to },H if there is some BE;il such that x, Y E B, x E C(B). and y f C(B), that is. if x is ever chosen over y when both are feasible. With this terminology. we can restate the weak axiom as follows: "If x is re,Jealed

al leasl as y()ml as }'.

1/1 II(Y) implies x ~ y, then 11(') is a utility function representing ;::. 1.8.s" Show that if X is finite and;:: is a rational preference relation on X, then there is a utility function II: X - Ii that represents ;::. [Hill/: Consider first the case in which the individual's ranking between any two elements of X is strict (i.e .. there is never any indifference), and construct ~, utility function representing these preferences; then extend your argument to the general case.)

I.e.." Consider the choice structure (~, C(·)) with.lit = ({.~, y}, lx, y, t}) and C({x. y}) = {xl. Show that if (.J!. C(·» satisfies the weak axiom. then we must have C({x,,I', :}) = {x}, = {::" or

={x,;l· 1.e.2· Show that the weak axiom (DerlOition 1.e.1) is equivalent to the following property holding: Suppose that /J. /J' E .J!. that x, y E B, and that x. must have {x. yl c e(B) and {x. y} c C(B').

rEB'.

Then if x

E C(B)

and Y E C(IJ'), we

1.e.3" Suppose that choice structure (.-4.!, C(·)) satisfies the weak axiom. Consider the following two possible revealed preferred relations. ::>* and >-•• ;

"

~.!'

x

>-•• y

-: x >- y\). This concept 01 ;ginates in Thurstone (1927). and it is of considerable econometric interest (indeed. it provides a theory for the error term in observable choice). (a) Show that the stochastic choice function Cnx. be rationalized by preferences. (b) Show that the stochastic choicefunction rationalizable by preferences.

yl> = City. z}) = q{:. x}) = (t il can

C(I x. y}) = C( {)'. :}) = q{:. x}) = OJ. Moreover, by homogeneity of degree zero, x(p, w) depends only on the budget set the consumer faces. Hence (:II" ,x(·)) is _ choice structure, as defined in Section I.e. Note that the choice structure (jJ" ,x(·)) docs not include all possible subsets of X (e.g., it does not include all two- and three-element subsets of X). This fact will be significant for the relationship between the choice-based and preference-based approaches to consumer demand.

Comparative Statics We are often interested in analyzing how the consumer's choice varies with changes in his wealth and in prices. The examination of a change in outcome in response to a change in underlying economic parameters is known as comparaeit'" seatics analysis.

Wealeh effeces For fixed prices p. the function of wealth x(p, \\') is called the consumer's £lIyei ./illlccioll. Its image in IR~, £" = :x(p, 11'): II' > OJ, is known as the wealeh expansioll paeh. Figure 2.E.1 depicts such an expansion path. At any (p, \\'), the derivative ,'X/(P, w)/"w is known as the wealeh effece for the (th good.' 6. We use normalizoHions extensively in Part IV. 7. It is also known as the itJ£'ume effect in the literature. Similarly. the wealth expansion path is sometimes referred to as an inn"'f(' ('xpans;oll Pdt".

2.E:

DEMAND

FUNCTIONS

AND

COMPARATIVE

STATICS

x, w" > w' > w

Bi .... ··

B; .•'

Flgur. 2.E.1

B, .•

x,

A commodity 1 is normal at (p, w) if iJx/(p, W)/OIV w. Since P'x(p, w) = w by Walras' law, this implies that

x,

(e)

(d)

P'[x(p', w') - x(p, w)]

>0

(2.FA)

Together, (2.F.2), (2.F.3) and (2.FA) yield the result. (ii) The weak axiom is implied by (2.F'/) holding for all compensated price changes, with strict illequality ifx(p, w) '" x(p', IV'), The argument for this direction of the proof Figure 2.F.l

Demand in panels (a) to (c) satisfies the (e)

x,

when prices change to p', we imagine that the consumer's wealth is adjusted to w' = P"x(p, IV). Thus, the wealth adjustment is ~w = ~P'x(p, w), where ~p = (p' _ pl. This kind of wealth adjustment is known as Slutsky wealth compensation. Figure 2.F.2 shows the change in the budget set when a reduction in the price of good I from p to " • 1 p, IS accompanIed by Slutsky wealth compensation. Geometrically, the restriction is that the budget hyperplane corresponding to (p', w') goes through the vector x(p, w). We refer to price changes that are accompanied by such compensating wealth changes as (Slutsky) compensated price changes.

uses the following fact: The weak axiom holds if and only if it holds for all compensated price changes. That is, the weak axiom holds if, for any two price-wealth pairs (p, w) and (p', w'), we have P"x(p, IV) > w' whenever p·x(p', IV')=W and x(p', w')",x(p, w).

weak axiom; demand

in panels (d) and (e) does not.

To prove the facl stated in the preceding paragraph, we argue that if the weak axiom is violated, then there must be a compensated price change for which it is violated. To see this, suppose that we have a violation of the weak axiom, that is, two price-wealth pairs (p', w') and (p", w") such that x(p', IV') '" x(p", 1\'"), p'·x(p·, w") :s; w', and p"'x(p', w') :s; w·. If one of these two weak inequalities holds with equality, then this is actually a compensated price change and we are done. So assume that, as shown in Figure 2.F.3, we have P"x(p", w") < w' and p"'x(p', w') < w". Figure 2.F.2

A compensated price

change from (p, (p',w').

In Proposition 2.F.I, we show that the weak axiom can be equivalently stated in terms of the demand response to compensated price changes. B,..•..

Proposition 2.F.1: Suppose that the Walrasian demand function x(p, w) is homogeneous of degree zero and satisfies Walras' law. Then x(p, w) satisfies the weak axiom if and only if the following property holds:

w)

to

32

CHAPTER

2:

CONSUMER

CHOICE

-------------------------------------------------

seCTION

2.F:

WEAK

AXIOM

OF

REVEALED

PREFERENCE

AND

THE

LAW

OF

DEMAND

Figure 2.F.4 (len)

B,·,w·

Demand must be

X, B,.w

non increasing in own

w' = p" x(p, w)

price [or a compensated price change.

Figure 2.F.3

Figure 2.F.5 (right)

The weak axiom holds if and only if it holds for all compensated price changes.

Demand [or good 1 can fall when its price

decreases for an x,

x,

Now choose the value of. E (O,l) for which {.p'

+ (I

- ,)p")'x{p', .,') = {.p'

and denote p = .p' + {I - ')p" and .. = {.p' illustrated in Figure 2.F.3. We then have '11"

+ (I

- ,)11'"

+ (I

+ (I -

(p, IV), a compensated decrease in the price of good I rotates the budget line through x(p, IV). The WA allows moves of demand only in the direction that increases the

- ,)p")'x{p", .,"),

,)p")'x{p', 11"). This construction is

> 'P"x{p', 11") + (I - ,)p"'x{p', 11") =

It'

= P'x(p,

11')

= 'P"x(p,lI') + (I - ,)p"'x(p, .,).

Therefore, either P"x(p,lI') < 11" or p"'x(p, w) < 11'". Suppose that the first possibility holds (the argument is identical if it is the second that holds). Then we have x(p, 11') #0 x(p', IV'), P'x{p', \1") = 11', and P"x{p, w) < IV', which constitutes a violation of the weak axiom for the compensated price change from (p', w') to (p, 11').

Once we know that in order to test for the weak axiom it suffices to consider only compensated price changes, the remaining reasoning is straightforward. If the we 0, we must have OXt(p, wi/ow < O. For later reference, we note that Proposition 2.F.2 does not imply, in general, that the matrix SIp, w) is symmetric. I I For L = 2, SIp, w) is necessarily symmetric (you are asked to show this in Exercise 2.F.II). When L> 2, however, SIp, IV) need not be symmetric under the assumptions made so far (homogeneity of degrec zero, Walras' law, and the weak axiom). See Exercises 2.F.1O and 2.F.15 for examples. In Chapter 3 (Section 3.H), we shall see that the symmetry of SIp, w) is intimately connected with the possibility of generating demand from the maximization of rational preferences. Exploiting further the properties of homogeneity of degree zero and Walras' law, we can say a bit more about the substitution matrix SIp, IV). II. A mailer of lerminology: 11 is common in the mathematical literature that "definite" matrices are assumed to be symmetric. Rigorously speaking, if no symmetry is implied, the matrix would be called "quasidefinite." To simplify terminology, we use "definite" without any supposilion about symmetry; ir a matrix is symmetric, we say so explicitly. (See Exercise 2.F.9.)

2.F:

WEAK

AXIOM

OF

REVEALED

PREFERENCE

AND

THE

LAW

PropOS Ilion 2.F.3: Suppose that the Walrasian demand function x(p, w) Is differentiable, homogeneous of degree zero, and satisfies Walras' law. Then p 'S(p, w) = 0 and SIp, w)p = 0 for any (p, wi. Exercise 2.F.7: Prove Proposition 2.F.3. [Hint: Use Propositions 2.E.l to 2.E.3.] It follows from Proposition 2.F.3 that the matrix SIp, w) is always singular (i.e., it has rank less than L), and so the negative semidefiniteness of SIp, w) established in Proposition 2.F.2 cannot be extended to negative definiteness (e.g., see Exercise 2.F.17). Proposition 2.F.2 establishes negative semidefiniteness of S(p, w) as a necessary implication of the weak axiom. One might wonder: Is this property sufficient to imply the WA [so that negative semidefiniteness of S(p, w) is actually equivalent to the W A]? That is, if we have a demand function x(p, w) that satisfies Walras' law, homogeneity of degree zero and has a negative semidefinite substitution matrix, must it satisfy the weak axiom? The answer is almost, bur 1I0t quite. Exercise 2.F.16 provides an example of a demand function with a negative semidefinite substitution matrix that violates the WA. The sufficient condition is that v'S(p, w)v < 0 whenever v ~ ap for any scalar a; that is, S(p, w) must be negative definite for all vectors other than those that are proportional to p. This result is due to Samuelson [see Samuelson (1947) or Kihlstrom, Mas-Colell, and Sonnenschein (1976) for an advanced treatment). The gap between the necessary and sufficient conditions is of the same nature as the gap between the necessary and the sufficient second-order conditions for the minimization of a function.

Finally, how would a theory of consumer demand that is based solely on the assumptions of homogeneity of degree zero, Walras' law, and the consistency requirement embodied in the weak action compare with one based on rational preference maximization? Based on Chapter I, you might hope that Proposition I.D.2 implies that the two are equivalent. But we cannot appeal to that proposition here because the family of Walrasian budgets does not include every possible budget; in particular, it does not include all the budgets formed by only two- or three-commodity bundles. In fact, the two theories are not equivalent. For Walrasian demand functions, the theory derived from the weak axiom is weaker than the theory derived from rational preferences, in the sense of implying fewer restrictions. This is shown formally in Chapter 3, where we demonstrate that if demand is generated from preferences, or is capable of being so generated, then it must have a symmetric Slutsky matrix at all (I', w). But for the moment, Example 2.F.I, due originally to Hicks (1956), may be persuasive enough. Example 2.F.!: In a three-commodity world, consider the three budget sets determined by the price vectors pi = (2, 1,2), p2 = (2,2, I), pl = (1,2,2) and wealth = 8 (the same for the three budgets). Suppose that the respective (unique) choices are x, = (1,2,2), x' = (2, 1,2), X3 = (2,2, I). In Exercise 2.F.2, you are asked to verify that any two pairs of choices satisfy the WA but that x, is revealed preferred to x', x' is revealed preferred to x', and x, is revealed preferred to x 3• This situation is incompatible with the existence of underlying rational preferences (transitivity would be violated).

OF

DEMANO

35

36

CHAPTER

2:

CONSUMER

E X ERe I S E S

CHOICE

37

-------------------------------------------------------------------------------------------for all ~ > 0] and satisfies Walras' law, then '(.(P. w) = I for every t. Interpret. Can you say something about D.x(p, w) and the form of the Engel functions and curves in this case?

The reason this example is only persuasive and does not quite settle the question is that demand has been defined only for the three given budgets, therefore, we cannot be sure that it satisfies the requirements of the W A for all possible competitive budgets. To clinch the matter we refer to Chapter 3. •

2.E.S 8 Suppose that x(p, w) is a demand function which is homogeneous of degree one with respect to wand satisfies Walras' law and homogeneity of degree zero. Suppose also that all the cross· price effects are zero. that is ox,(P. w)/oP. = 0 whenever k ~ t. Show that this implies that for every I. x,(P. w) = ~(w/p(o where ~( > 0 is a constant independent of (p, w).

In summary, there are three primary conclusions to be drawn from Section 2.F: (i) The consistency requirement embodied in the weak axiom (combined with the homogeneity of degree zero and Walras' law) is equivalent to the compensated law of demand. (ii) The compensated law of demand, in turn, implies negative semidefiniteness of the substitution matrix S(p, w). (iii) These assumptions do no/ imply symmetry of S(p, w), except in the case where

2.E.6A Verify that the conclusions of Propositions 2.E.I to 2.E.3 hold for the demand function given in Exercise 2.E.I when /1 = 1. 2.E.7 A A consumer in a two.good economy has a demand function x(P. w) that satisfies Walr"s' law. His demand function for the first good is xl(p, w) = ~W/PI' Derive his demand function for the second good. Is his demand function homogeneous of degree zero? 2.E.S" Show that the elasticity of demand for good ( with respect to price P., .,,(p, w~ can be written as > O.

(e) That good I is an inferior good (at some price) for this consumer? Assume that the weak axiom is satisfied.

(0) If X is the set depicted in Figure 2.C.3. would B,.w be convex?

(f) That good 2 is an inferior good (at some price) for this consumer? Assume that the weak axiom is satisfied.

(b) Show that if X is a convex set. then B,.w is as well. 2.0.4A Show that the budget set in Figure 2.0.4 is not convex.

2.F.4A Consider the consumption of a consumer in two different periods, period 0 and period I. Period I prices, wealth. and consumption are and x' = x(p', w,). respectively. It is often of applied interest to form an index measure of the quantity consumed by a COnsumer. The Laspqres quantity index computes the change in quantity using period 0 prices as weights: La = (po·xl)/(pO·xo). The Paasche quantity index instead uses period I prices as weights: Pa = (pl·XI)/(pl·XO). Finally, we could use the consumer's expenditure change: Ea = (pl·XI)/(po·xo). Show the following:

t, w,.

2.E.IA In text. 2.E.2B In text. 2.E.3 8 Use Propositions 2.E.1 to 2.E.3 to show that p' D,x(p, w) p = - w. Interpret. 2.E.4B Show that ifx(p. w) is homogeneous of degree one with respect to w [i.e.• x(p. ~w)=ax(p. w)

....

38

CHAPTER

2:

CONSUMER

CHOICE

EXERCISES

(a) If LQ < I, then the COnsumer has a revealed preference for x· over x'. (b) If PQ > I, then the consumer has a revealed preference for x, over

(c) Show that the generalized weak axiom and Walras' law implies the following generalized version of the compensated law of demand: Starting from any initial position (p, w) with demand x E x(P. w), for any compensated price change to new prices p' and wealth level w' = p" x, we have

x·.

(c) No revealed preference relationship is implied by either EQ > I or EQ < \. Note that at the aggregate level, EQ corresponds to the percentage change in gross national product.

(p' - p)'(x' - x):s 0

2.r.sc Suppose that x(p, w) is a differentiable demand function that satisfies the weak axiom, Walras' law, and homogeneity of degree zero. Show that if x(',·) is homogeneous of degree one with respeet to w [i.e., x(p, awl = ax(p, w) for all (p, w) and a > 0], then the law of demand holds even for uncompensated price changes. If this is easier, establish only the infinitesimal version of this conclusion; that is, dp' D,x(p, w) dp :s 0 for any dp.

for all x' E x(p'. w'), with strict inequality if x' E x(p, 1\1). (d) Show that if x(p, w) satisfies Walras' law and the generalized compensated law of demand defined in (c), then x(p, w) satisfies the generalized weak axiom. 2.r.14 A Show that if x(p, III) is a Walrasian demand function that satisfies the weak axiom, then x(P. \\') must be homogeneous of degree zero.

2.r.6A Suppose that x(p, w) is homogeneous of degree zero. Show that the weak axiom holds if and only if for some w > 0 and all p,p' we have p" x(p, w) > w whenever p' x(p', w) :s; wand

2.r.IS" Consider a setting with L = 3 and a consumer whose consumption set is R'. The consumcr's demand function x(p, w) satisfies homogeneity of degree zero, Walras' law and (fixing p, = I) has

,,(p', w) ¥- x(p, 1\1).

2.r.7" In text.

+ p,

. 0 for all k :S II, where Au is the submatrix of A obtained by deleting the last II - k rows and columns. ror semidefiniteness of the symmetric matrix A, we replace the strict inequalities by weak inequalities and require that the weak inequalities hold for all matrices formed by permuting the rows and columns of A (see Section M.D of the Mathematical Appendix for details). (a) Show that an arbitrary (possibly nonsymmetric) matrix A is negative definite (or semidefinite) if and only if A + AT is negative definite (or semidefinite). Show also that the above determinant condition (which can be shown to be necessary) is no longer sufficient in the nonsymmetric case.

2.r.16" Consider a setting where L = 3 and a consumer whose consumption set is RJ. Suppose that his demand function x(p, IV) is

(b) Show that for L = 2, the necessary and sufficient condition for the substitution matrix S(P. \\') of rank 1 to be negative semidefinite is that any diagonal entry (i.e., any own-price

x.(p, w)

substitution effect) be negative.

x,(p,w)

=~, p,

= _12, PJ

2.r.\O" Consider the demand function in Exercise 2.E.1 with (I = \. Assume that

W

= \.

\\'

x,(P .... ) =-. p,

(a) Compute the substitution matrix. Show that at p = (I, I, I), it is negative semidefinite but not symmetric.

(a) Show that .'(p, w) is homogeneous of degree zero in (p, w) and satisfies Walras' law.

(b) Show that this demand function does not satisfy the weak axiom. [Hin!: Consider the

(b) Show that "(p, w) violates the weak axiom.

price vector p = (I, I. t) and show that the substitution matrix is not negative semidefinite (for f. > 0 small).]

(c) Show that "'S(p, w) ..

2.r.11" Show that for L = 2, S(p, \II) is always symmetric. [/lint: Use Proposition 2.r.3.]

",(p,

2.r.12A Show that if the Walrasian demand function x(p. III) is generated by a rational preference relation. than it must satisfy the weak axiom.

w) = (_.

{p~)-

for k = I, ... , L.

(:=)

2.".13{' Suppose thm x(p, w) may be multivalued.

(a) Is this demand function homogeneous of degree zero in (p, w)'1

(a) rrom the definition of the weak axiom given in Section I.C, develop the generalization of Definition 2.F.1 for Walrasian demand correspondences. (b) Show that if x(p, w) satisfies this generalization of the weak axiom and Walras' law. then x(·) satisfies the following property:

(b) Does it satisfy Walras' law?

(0)

-

= 0 for all v E R'.

2.F.17" In an I.·commodity world, a consumer's Walrasian demand function is

(c) Does it satisfy the weak axiom? (d) Compute the Slutsky substitution matrix for this demand function. Is it negative semidefinite? Negative definite? Symmetric?

ror any x E x(p, w) and x' E x(p'. w'), if p'x' < w, then p'X > w.

b

39

C

Classical Demand Theory

HAP

T

E

R

SECTION

3.B:

PREFERENCE

RELATIONS:

BASIC

PROPERTIES

41

-------------------------------------------------

3

preference-based demand theory. Section 3.F may be skipped without loss of continuity in a first reading of the chapter. Nevertheless, we recommend the study of its material. Section 3.G continues our analysis of the utility maximization and expenditure minimization problems by establishing some of the most important results of demand theory. These results develop the fundamental connections between the demand and value functions of the two problems. In Section 3.H, we complete the study of the implications of the preference-based theory of consumer demand by asking how and when we can recover the consumer's underlying preferences from her demand behavior, an issue traditionally known as the integrability problem. In addition to their other uses, the results presented in this section tell us that the properties of consumer demand identified in Sections 3.D to 3.G as necessary implications of preference-maximizing behavior are also sufficient in the sense that any demand behavior satisfying these properties can be rationalized as preference-maximizing behavior. The results in Sections 3.0 to lH also allow us to compare the implications of the preference-based approach to consumer demand with the choice-based theory studied in Section 2.F. Although the differences turn out to be slight, the two approaches are not equivalent; the choice-based demand theory founded on the weak axiom of revealed preference imposes fewer restrictions on demand than does the preference-based theory studied in this chapter. The extra condition added by the assumption of rational preferences turns out to be the symmetry of the Slutsky matrix. As a result, we conclude that satisfaction of the weak axiom does not ensure the existence of a rationalizing preference relation for consumer demand. Although our analysis in Sections 3.B to 3.H focuses entirely on the positive (i.e., descriptive) implications of the preference-based approach, one of the most important benefits of the latter is that it provides a framework for normative, or welfare, analysis. In Section 3.1, we take a first look at this subject by studying the effects of a price change on the consumer's welfare. In this connection, we discuss the use of the traditional concept of Marshallian surplus as a measure of consumer welfare. We conclude in Section 3.1 by returning to the choice-based approach to consumer demand. We ask whether there is some strengthening of the weak axiom that leads to a choice-based theory of consumer demand equivalent to the preferencebased approach. As an answer, we introduce the strong axiom of revealed preference and show that it leads to demand behavior that is consistent with the existence of underlying preferences. Appendix A discusses some technical issues related to the continuity and differentiability of Walrasian demand. For further reading, see the thorough treatment of classical demand theory offered by Deaton and Muellbauer (1980).

3.A Introduction In this chapter, we study the classical, preference-based approach to consumer demand. We begin in Section 3.B by introducing the consumer's preference relation and some of its basic properties. We assume throughout that this preference relation is rational, offering a complete and transitive ranking of the consumer's possible consumption choices. We also discuss two properties, monotonicity (or its weaker version, local nonsatiation) and convexity, that are used extensively in the analysis that follows. Section 3.e considers a technical issue: the existence and continuity properties of utility functions that represent the consumer's preferences. We show that not all preference relations are representable by a utility function, and we then formulate an assumption on preferences, known as continuity, that is sufficient to guarantee the existence of a (continuous) utility function. In Section 3.0, we begin our study of the consumer's decision problem by assuming that there are L commodities whose prices she takes as fixed and independent of her actions (the price-raking assumption). The consumer's problem is framed as one of utility maximization subject to the constraints embodied in the Walrasian budget set. We focus our study on two objects of central interest: the consumer's optimal choice, embodied in the Walrasian (or market or ordinary) demand correspondence, and the consumer's optimal utility value, captured by the indirect utility function.

Section 3.E introduces the consumer's expendilllre minimization problem, which bears a close relation to the consumer's goal of utility maximization. In parallel to our study of the demand correspondence and value function of the utility maximiza. tion prOblem, we study the equivalent objects for expenditure minimization. They are known, respectively, as the Hicksian (or compensated) demand correspondence and the expenditure function. We also provide an initial formal examination of the relationship between the expenditure minimization and utility maximization problems. In Section 3.F, we pause for an introduction to the mathematical underpinnings of duality theory. This material offers important insights into the structure of

3. B Preference Relations: Basic Properties In the classical approach to consumer demand, the analysis of consumer behavior begins by specifying the consumer's preferences over the commodity bundles in the consumption set X c R'+.

40

..

42

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

------------------------------------------The consumer's preferences are captured by a preference relation ;t (an "at-leastas-good-as" relation) defined on X that we take to be rational in the sense introduced in Section I.B; that is, ;::; is complete and transitive. For convenience, we repeat the formal statement of this assumption from Definition I.B.1.1

-

SECTION

3.11:

PREFERENCE

RELATIONS:

BASIC

PROPERTIES

43

x,

Definition 3.B.1: The preference relation ;::; on X Is rational if it possesses the following two properties:

Figure 3.B.1

The test ror local nonsatiation.

(i) Completeness. For all x. y E X, we have x;t y or y;t x (or both). (ii) Transitivity. For all x. y, Z E X. if x;::; y and y;::; z, then x;t z. In the discussion that follows, we also use two other types of assumptions about preferences: desirability assumptions and convexity assumptions.

x,

(i) Desirability assumptions. It is often reasonable to assume that larger amounts of commodities are preferred to smaller ones. This feature of preferences is captured in the assumption of mono tonicity. For Definition 3.B.2, we assume that the consumption of larger amounts of goods is always feasible in principle; that is, if x E X and y ~ x, then y E X.

Ftgure 3.B.2

Definition 3.B.2: The preference relation ;::; on X is monotone If x E X and y» x implies y >- x. It is strongly monotone if y ~ x and y '" x imply that y >- x.

x,

x, (a)

The assumption that preferences are monotone is satisfied as long as commodities are "goods" rather than "bads". Even if some commodity is a bad, however, we may still be able to view preferences as monotone because it is often possible to redefine a consumption activity in a way that satisfies the assumption. For example, if one commodity is garbage, we can instead define the individual's consumption over the "absence of garbage".l Note that if;t is monotone, we may have indifference with respect to an increase in the amount of some but not all commodities. In contrast, strong montonicity says that if y is larger than x for some commodity and is no less for any other, then y is strictly preferred to x. For much of the theory, however, a weaker desirability assumption than monotonicity, known as local nonsatiation, actually suffices.

(b)

smail distance away from x, denoted by t > 0, there is another bundle y E R~ within this distance from x that is preferred to x. Note that the bundle y may even have less of every commodity than x, as shown in the figure. Nonetheless, when X = R~ local nonsatiation rules out the extreme situation in which all commodities are bads, since in that case no consumption at all (the point x = 0) would be a satiation point. Exercise 3.B.I: Show the following: (a) If;t is strongly monotone, then it is monotone. (b) If ;t is monotone, then it is locally nonsatiated. Given the preference relation;::; and a consumption bundle x, we can define three related sets of consumption bundles. The indifference set containing point x is the set of ail bundles that arc indifferent to x; formally, it is {y E X: y - xl· The upper colllolir set of bundle x is the set of all bundles that are at least as good as x: ()' EX: y ;t x}. The lower comOllr set of x is the set of all bundles that x is at least

Definition 3.B.3: The preference relation ;::; on X is locally nonsatiated if for every x E X and every t > 0, there is y E X such that II y - xII ~ t and y >- x. 3

as good as: {y EX: x ;t y}. One implication of local nonsatiation (and, hence, of monotonicity) is that it rules out "thick" indifference sets. The indifference set in Figure 3.B.2(a) cannot satisfy local nonsatiation because, if it did, there would be a better point than x within the circle drawn. In contrast, the indifference set in Figure 3.B.2(b) is compatible with local nonsatiation. Figure 3.B.2(b) also depicts the upper and lower contour sets of x.

The test for locally nonsatiated preferences is depicted in Figure 3.B.1 for the case in which X = R~. It says that for any consumption bundle x E R~ and any arbitrarily 1. See Seetion I.B ror a thorough discussion or these properties. 2. II is also sometimes convenient to view prererenoes as defined over the level or goods available ror consumption (the stocks or goods on hand), rather than over the consumption levels themselves. In this case, ir the consumer can rreely dispose of any unwanted commodities, her prererences over the level or commodities on hand are monotone as long as some good is always desirable. 3. IIx - yU is the Euclidean distance between points x and y; that is, IIx - yll =

(ii) Convexity assumptions. A second significant assumption, that of convexity of ;t, concerns the trade-offs that the consumer is willing to make among different goods .

[L~., (XI- YI)']'"'

...

(a) A thick indifference set violates local nonsatiation. (b) Prererences compatible with local nonsatiation.

44

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

X,

SECTION

3.B:

PREFERENCE

RELATIONS:

BASIC

PROPERTIES

X,

X,

{yeR~:y~x}

'y.

Flgur. 3.B.4 (leH)

\ \oy + \

\

(0)

(1 -

.

A convex, but nol strictly convex, preference relation.

.)z Flgur. 3.B.3

Z

X,

(b)

45

-------------------------------------------

X,

(a) Convex preferences, (b) Nonconvex

Flgur. 3.B.S (right) X,

X,

Homothetic preferences,

preferences.

Definition 3.B.4: The preference relation ;:: on X is convex if for every x E X, the upper contour set {y E X: y;:: xl is convex; that is, if y;:: x and z;:: x, then "'Y + (1 - IX)Z;:: x for any IX E [0, 1]. Figure 3.8.3(a) depicts a convex upper contour set; Figure 3.8.3(b) shows an upper contour set that is not convex. Convexity is a strong but central hypothesis in economics. It can be interpreted in terms of diminishing marginal rates of substitution: That is, with convex preferences, from any initial consumption situation x, and for any two commodities, it takes increasingly larger amounts of one commodity to compensate for successive unit losses of the other.' Convexity can also be viewed as the formal expression of a basic inclination of economic agents for diversification. Indeed, under convexity, if x is indifferent to y, then !x + !y, the half-half mixture of x and y, cannot be worse than either x or y. In Chapter 6, we shall give a diversification interpretation in terms of behavior under uncertainty. A taste for diversification is a realistic trait of economic life. Economic theory would be in serious difficulty if this postulated propensity for diversification did not have significant descriptive content. But there is no doubt that one can easily think of choice situations where it is violated. For example, you may like both milk and orange juice but get less pleasure from a mixture of the two. Definition 3.BA has been stated for a general consumplion set X. But de facto, the convexity assumption can hold only if X is convex. Thus, the hypothesis rules out commodities being consumable only in integer amounts or situations such as that presented in Figure 2.C3. Although the convexity assumption on preferences may seem strong, this appearance should be qualified in two respects: First, a good number (although not all) of the results of Ihis chapler extend without modification to the nonconvex case. Second, as we show in Appendix A of Chapter 4 and in Section 17.1, nonconvexities can often be incorporated into the theory by exploiting regularizing aggregation effects across consumers. We also make use at times of a strengthening of the convexity assumption. Definition 3.B.S: The preference relation;:: on X is strictly convex if for every x, we have that x, z;:: x, and y '" Z implies IXY + (1 - IX)Z >- x for all IX E (0,1).

y;::

4. More generally, convexity is equivalent to a diminishing marginal rate of substitution between any two goods, provided that we allow for "composite commodities" formed from linear combinations of the L basic commodities.

FIgure 3.B.6

Quasilinear preferences. Figure 3. 8.3(a) showed strictly convex preferences. In Figure 3.B.4, on the other hand, the preferences, although convex, are not strictly convex. In applications (particularly those of an econometric nature), it is common to focus on preferences for which it is possible to deduce the consumer's entire preference relation from a single indifference set. Two examples are the classes of homothetic and quasilinear preferences. Definition 3.B.6: A monotone preference relation ;:: on X = R\ is homothetic if ali indifference sets are related by proportional expansion along rays; that is, if x - Y, then IXX - IXY for any IX ;:: O. Figure 3.8.5 depicts a homothetic preference relation. Definition 3.B.7: The preference relation;:: on X = (-00,00) X R~-I is quasilinear with respect to commodity 1 (called, in this case, the numeraire commodity) if' (i) Ali the indifference sets are parallel displacements of each other along the axis of commodity 1. That is, if x - y, then (x + lXed - (y + lXe, ) for e, = (1,0, ... ,0) and any IX E R(ii) Good 1 is desirable; that is, x + lXe, >- x for all x and IX > O. Note that, in Definition 3.B.7, we assume that there is no lower bound on the possible consumption of the first commodity [the consumption set is ( -00, 00) x R\- ']. This assumption is convenient in the case of quasilinear preferences (Exercise 3.DA will illustate why). Figure 3.B.6 shows a quasilinear preference relation. 5. More generally, preferences can be quasilinear with respect to any commodity t.

~

CHAPTER

3:

CLASSICAL

DEMAND

-

THEORY

3.e Preference and Utility For analytical purposes, it is very helpful if we can summarize the consumer's preferences by means of a utility function because mathematical programming techniques can then be used to solve the consumer's problem. In this section, we study when this can be done. Unfortunately, with the assumptions made so far, a rational preference relation need not be representable by a utility function. We begin with an example illustrating this fact and then introduce a weak, economically natural assumption (called continuity) that guarantees the existence of a utility representation.

• E C T ION

S. C:

PRE F E A E N C E

AND

UTI LI T Y

An equivalent way to state this notion of continuity is to say that for all x, the upper contour set {y EX: Y it X} and the lower contour set {y EX: x it y} are both closed; that is, they include their boundaries. Definition 3.C.! implies that for any sequence of points {Y·}:'. I with x it Y· for all nand Y = lim.~" y., we have x it y Uust let x' = x for all n). Hence, continuity as defined in Definition 3.C.1 implies that the lower contour set is closed; the same is implied for the upper contour set. The reverse argument, that closed ness of the lower and upper contour sets implies that Definition 3.C.1 holds, is more advanced and is left as an exercise (Exercise

3.C.3).

Example J.C.I: The Lexicographic Preference Relalion. For simplicity, assume that X = R!. Define x it y if either "XI> YI" or "XI = YI and X, ~ y,," This is known as the lexicographic preference relation. The name derives from the way a dictionary is organized; that is, commodity I has the highest priority in determining the preference ordering, just as the first letter of a word does in the ordering of a dictionary. When the level of the first commodity in two commodity bundles is the same, the amount of the second commodity in the two bundles determines the consumer's preferences. In Exercise 3.C.I, you are asked to verify that the lexico· graphic ordering is complete, transitive, strongly monotone, and strictly convex. Nevertheless, it can be shown that no utility function exists that represents this preference ordering. This is intuitive. With this preference ordering, no two distinct bundles are indifferent; indifference sets are singletons. Therefore, we have two dimensions of distinct indifference sets. Yet, each of these indifference sets must be assigned, in an order·preserving way, a different utility number from the one· dimensional real line. In fact, a somewhat subtle argument is actually required to establish this claim rigorously. It is given, for the more advanced reader, in the following paragraph.

Example J.C.I continued: Lexicographic preferences are not continuous. To see this, consider the sequence of bundles x· = (lIn, 0) and y. = CO, I). For every n, we have x· ~ y'. But lim. ~ '" y' = CO, 1) ~ (0, 0) = lim, _'" x'. In words, as long as the first component of x is larger than that of y. x is preferred to yeven if y, is much larger than x,. But as soon as the first components become equal, only the second components are relevant, and so the preference ranking is reversed at the limit points of the sequence. _ It turns out that the continuity of it is sufficient for the existence of a utility function representation. In fact, it guarantees the existence of a continuous utility function.

Proposition 3.C.1: Suppose that the rational preference relation it on X is continuous. Then there is a continuous utility function u{x) that represents it. Proof: For the case of X = R~ and a monotone preference relation, there is a relatively simple and intuitive proof that we present here with the help of Figure

3.C.1. Denote the diagonal ray in R~ (the locus of vectors with all L components equal) by Z. It will be convenient to let e designate the L·vector whose elements are all equal to I. Then cre E Z for all nonnegative scalars a ~ O. Note that for every x E R~, monotonicity implies that x it O. Also note that for any eX such that eXe » x (as drawn in the figure), we have eXe it x. Monotonicity and continuity can then be shown to imply that there is a unique value a(x) E [0, eX] such that cr{x)e - x.

Suppose there is a utility function u(·). For every x" we can pick a rational number rex,) such that u(x" 2) > r(x,) > u(x" I). Note that because of the lexicographic character of preferences, x, > x', implies r(x,) > r(x',)[since r(x,) > u(x" I) > u(x'" 2) > r(x',)]. Therefore, r(') provides a one·to·one function from the set of real numbers (which is uncountable) to the set of rational numbers (which is countable). This is a mathematical impossibility. Therefore, we conclude that there can be no utility function representing these preferences.

-

z

Figure 3.C.1

Construction of a utility function.

The assumption that is needed to ensure the existence of a utility function is that the preference relation be continuous. DefinItion 3.C.1: The preference relation it on X is continuous if it is preserved under limits. That is, for any sequence of pairs {(x n, yn)};:,., with xn it yn for all n. x = limn~'" xn, and y = limn~'" yn. we have x it y. Continuity says that the consumer's preferences cannot exhibit "jumps," with, for example, the consumer preferring each element in sequence {x·} to the corresponding element in sequence {yO} but suddenly reversing her preference at the limiting points of these sequences x and y.

L

47

48

CHAPTEA

3:

CLASSICAL

DEMAND

-

THEOAY

Formally, this can be shown as follows: By continuity, the upper and lower contour sets of x are closed. Hence, the sets A + = (" E R.: IU - ",e whenever " > ",. Hence, there can be at most one scalar satisfying .e - x. This scalar is

SEC T ION

P A E F E A ENe E

AND

UTI L , T Y

49

Hence, for all such n, x ...·, - .(x"·')e >- de (where the latter relation follows from monotonicity). Because preferences arc continuous, this would imply that x u(y) if x » y. The property of convexity of preferences, on the other hand, implies that u(·) is quas;collcave [and, similarly, strict convexity of preferences implies strict quasiconcavity of u(·»). The utility function u(·) is quasiconcave if the set {y E R'.: u(y) 2: u(x)} is convex for all x or, equivalently, if u(acx + (\ - ac)y) 2: Min {u(x), u(y)} for

Flgur. 3.C.2

Proof that the constructed utility x,

3. C:

function is continuous.

Figure 3.C.3

Leontief preferences cannot be represented by a dilferentiable utility function.

lies in this compact set for all n > N. But any infinite sequence that lies in a compact set must have a convergent subsequence (see Section M.F of the Mathematical Appendix). What remains is to establish that all convergent subsequences of (.(x')}:'., converge to a(x). To see this, suppose otherwise: that there is some strictly increasing function m(·) that assigns to each positive integer n a positive integer men) and for which the subsequence (.(x"'·')}:'., converges to .' >F .(x). We first show that .' > ,,(x) leads to a contradiction. To begin, note that monotonicity would then imply that .'e >- .(x)e. Now, let d = Ha' + .(x»). The point ae is the midpoint on Z between .'e and ,,(x)e (see Figure 3.e2). By monotonicity, de >- .(x)e. Now, since .(x"") .....' > d, there exists an N such that for all n > N, .(x Me ,,) > ix.

x,

b.

50

CHAPTER

.:

CLASSICAL

DEMAND

THEORY

any x, y and all IX E [0, 1]. [If the inequality is strictfor all x "I y and IX E CO, 1) then u(') is strictly quasiconcave; for more on quasiconcavity and strict quasiconcavity see Section M.C of the Mathematical Appendix.] Note, however, that convexity of ;:: does not imply the stronger property that u(·) is concave [that U{IXX + (I - IX) y) ;:: IXI/(X) + (I - lX)u(y) for any x, y and all IX E [0, I)). In fact, although this is a somewhat fine point, there may not be any concave utility function representing a particular convex preference relation ;::. In Exercise 3.C.5, you are asked to prove two other results relating utility representations and underlying preference relations: (i) A continuous;:: on X = R~ is homothetic if and only if it admits a utility function u(x) that is homogeneous of degree one [i.e., such that U(IXX) = IXU(X) for all IX > OJ. (ii) A continuous;:: on (-00, (0) X R~-I is quasilinear with respect to the first commodity if and only if it admits a utility function u(x) of the form u(x) =

XI

+ 4l(x"

... ,xd·

It is important to realize that although monotonicity and convexity of ;:: imply that all utility functions representing;:: are increasing and quasiconcave, (i) and (ii) merely say that there is at least one utility function that has the specified form. Increasingness and quasiconcavity are ordinal properties of u(·); they are preserved for any arbitrary increasing transformation of the utility index. In contrast, the special forms of the utility representations in (i) and (ii) are not preserved; they are cardinal properties that are simply convenient choices for a utility representation."

3.D The Utility Maximization Problem We now turn to the study of the consumer's decision problem. We assume throughout that the consumer has a rational, continuous, and locally nonsatiated preference relation. and we take u(x) to be a continuous utility function representing these preferences. For the sake of concreteness, we also assume throughout the remainder of the chapter that the consumption set is X = R~. The consumer's problem of choosing her most preferred consumption bundle given prices p » 0 and wealth level w> 0 can now be stated as the following utility maximization problem (U M PI: Max

u(x)

s.t. p'X

~

-

SEC T ION

3.0:

THE

UTili T Y

M A X I M I Z A T ION

Proposition 3.0.1: If p» 0 and u(') is continuous, then the utility maximization problem has a solution. 6. Thus, in this sense. continuity is also a cardinal property of utiHty functions. See also the discussion of ordinal and cardinal properties of utility representations in Section LB.

51

x,

Figure 3.0.1

u< "(.'( p,")) XI

(a)

(h)

Proof: If p» 0, then the budget set Bp.•. = {x E R';: p'x:s; w} is a compact set because it is both bounded [for any I = I, ... ,L, we have XI :s; (wi PI) for all x e B, .• ] and closed. The result follows from the fact that a continuous function always has a maximum value on any compact set (set Section M.F. of the Mathematical Appendix). -

With this result, we now focus our attention on the properties of two objects that emerge from the U M P: the consumer's set of optimal consumption bundles (the solution set of the U M P) and the consumer's maximal utility value (the value function of the UMP).

The Wail'asian Demalld Correspolll!ence/ FUllction The rule that assigns the set of optimal consumption vectors in the UM P to each price-wealth situation (p, IV) » 0 is denoted by x(p, w) E \R~ and is known as the Wa/rasian (or ordinary or market) demand correspondence. An example for L = 2 is depicted in Figure 3.0.1(a), where the point x{p, IV) lies in the indifference set with the highest utility level of any point in Bp.w. Note that, as a general matter, for a given (p, w) » 0 the optimal set x(p, IV) may have more than one element, as shown in Figure 3.0.I(b). When x(p, IV) is single-valued for all (p, wI, we refer to it as the Walrasiwi (or ordinary or market) demand jUllclion. 7 The properties of x(p, 1\") stated in Proposition 3.0.2 follow from direct examination of the UMP.

w.

In the UM P, the consumer chooses a consumption bundle in the Walrasian budget set B,.•. = {x E R~: p' X ~ w} to maximize her utility level. We begin with the results stated in Proposition 3.D.1.

PRO B L E M

ProposItion 3.0.2: Suppose that u(·) is a continuous utility function representing a locally nonsatiated preference relation;:: defined on the consumption setX = R~. Then the Walrasian demand correspondence x(p, w) possesses the following properties: 7. This demand function has also been called the Marshallian demand function. However, this terminology can creale confusion. and so we do not use it here. In Marshallian partial equilibrium analysis (where wealth elTeclS are absent), alt Ihe dilTerent kinds of demand functions studied in this chapter coincide. and so it is not clear which of these demand functions would deserve the Marshall name in the more general selling.

The utility maximization problem (UMP). (al Single solution. (b) Multiple solutions.

52

CHAPTER

3:

CLASSICAL

DEMAND

SECTION

THEORY

3.0:

THE

UTILITY

MAXIMIZATION

PROBLEM

53

--------------------------------------------------------------------------

(i) Homogeneity of degree zero in (p, w): x(a.p, a.w) = x(p, w) for any p, w and scalar a. > O. (ii) Walras' law: p'x = w for all x E x(p, w). (iii) Convexity/uniqueness: If ;:: is convex, so that u(·) is quasiconcave, then x(p, w) is a convex set. Moreover, if ;:: is strictly convex, so that u(·) is strictly quasiconcave, then x(p, w) consists of a single element.

Flgur. 3.D.3

(a) Convexity of preferences implies convexity of x(p, w). (b) Strict convexity of preferences implies that x(p, w) is single·valued.

{x: u(x) = u'}

x'

Proof: We establish each of these properties in turn. (i) For homogeneity, note that for any scalar a. > 0, (b)

(a)

{x E R';: a.p·x::; a.w} = {x E R';: p'x::; wI; that is, the set of feasible consumption bundles in the UM P does not change when aU prices and wealth are multiplied by a constant a. > O. The set of utility-maximizing consumption bundles must therefore be the same in these two circumstances, and so x( p, w) = x(a.p, a.w). Note that this property does not require any assumptions on u(·). (ii) Walras'law foUows from local nonsatiation. If p' x < w for some x E x(p, w), then there must exist another consumption bundle y sufficiently close to x with both p' y < wand y >- x (see Figure 3.0.2). But this would contradict x being optimal in the UMP.

Slope =

x,

-!:.!.

\(""

~

\

Slope = -MRS,,(x')

\

\

\

\

\

\

\ Flgur. 3.0.4 X' E x(p,

Local nonsatiation implies Walras' law.

ou(x') ::; i,PI'

with equality if x1 > O.

(3.0.1)

OCI

(iii) Suppose that u(·) is quasiconcave and that there are two bundles x and x', with x # x', both of which are elements of x(p, w). To establish the result, we show that x" = (Xx + (I - a.)x' is an element of x(p, w) for any a. E [0,1]' To start, we know that u(x) = u(x'). Oenote this utility level by u·. By quasiconcavity, u(x") ;:>: u· [see Figure 3.0.3(a»). In addition, since p'X ::; wand P'x' ::; w, we also have

+ (1

(a) Interior solution. (b) Boundary solution.

If 11(') is continuously differentiable. an optimal consumption bundle x' E x(p, w) can be characterized in a very useful manner by means of first-order conditions. The Kullll- Tucker (necessary) conditions (see Section M.K of the Mathematical Appendix) say that if x· E x(p, w) is a solution to the UMP, then there exists a Lagrange llIultiplier i. ;:>: 0 such that for al1l = I, ... , L:8

Figure 3.0.2

p'x" = p·[a.x

x,

(b)

(a)

XI

w)

Equivalently, if we let Vu(x) = [ou(x)/ox I" •• ,ou(X)/OXL] denote the gradient vector of u(·) at x, we can write (3.0.1) in matrix notation as Vu(x') ::; i.p

(3.0.2)

X"[Vu(x*) - i.p] = O.

(3.0.3)

and

- IX)X'] ::; w.

Thus. if we are at an interior optimum (i.e., if x' » 0), we must have

Therefore, x" is a feasible choice in the UM P (put simply, x" is feasible because Bp. w is a convex set). Thus, since u(x") ;:>: u' and x" is feasible, we have x" E x(p, w). This establishes that x(p, w) is a convex set if u(·) is quasiconcave. Suppose now that u(·) is slricriy quasiconcave. Fol1owing the same argument but using strict quasiconcavity, we can establish that x" is a feasible choice and that u(x") > u' for al1 a. E (O,I). Because this contradicts the assumption that x and x' are elements of x(p, w), we conclude that there can be at most one element in x{p, w). Figure 3.0.3{b) illustrates this argument. Note the difference from Figure 3.0.3(a) arising from the strict quasiconcavity of u{x). •

Vu{x') = i.p.

(3.0.4)

Figure 3.0.4(a) depicts the first-order conditions for the case of an interior optimum when L = 2. Condition (3.0.4) tel1s us that at an interior optimum, the 8. To be rully rigorous, these Kuhn-Tucker necessary conditions are valid only ir the constraint qualification condition holds (see Section M.K of the Mathematical Appendix). In the UMP, this is always so. Whenever we use Kuhn-Tucker necessary conditions without mentioning the constraint qualification condition. this requirement is met.

h..

54

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

SEC T ION

P,

ou(x*)/ox,

p,


Example 3.0.1 illustrates the use of the first-order conditions in deriving the consumer's optimal consumption bundle. Example 3.0.1: The Demand Function Derived from the Cobb-Douglas Utility Fllnction. A Cobb-Douglas utility function for L = 2 is given by u(x" x 2 ) = kxix~-' for some IX E (0, I) and k > O. It is increasing at all (x., x 2 ) »0 and is homogeneous of degree one. For our analysis, it turns out to be easier to use the increasing transformation IX In x, + (I - il) In X2' a strictly concave function, as our utility function. With this choice, the UMP can be stated as

Figure 3.0.4(b) depicts the first-order conditions for the case of L = 2 when the consumer's optimal bundle x* lies on the boundary of the consumption set (we have x! = 0 there). In this case, the gradient vector need not be proportional to the price vector. In particular, the first-order conditions tell us that OUt(x*)/OXt ~ },P, for those ( with x, = 0 and iJul(X*)/iJX, = }.Pt for those ( with x7 > O. Thus, in the figure, we see that M RS 12 (x*) > P,/P2' In contrast with the case of an interior optimum, an inequality between the marginal rate of substitution and the price ratio can arise at a boundary optimum because the consumer is unable to reduce her consumption of good 2 (and correspondingly increase her consumption of good I) any further.

Max

IX

In x.

+ (I

-

IX)

In x,

(3.0.6)

s.t. p,x, + P2X, = w. [Note that since u(·) is increasing, the budget constraint will hold with strict equality at any solution.] Since In 0 = -00, the optimal choice (x,(p, w), x,(p, 11'» is strictly positive and must satisfy the first-order conditions (we write the consumption levels simply as x, and X 2 for notational convenience)

The Lagrange multiplier ;. in the first-order conditions (3.0.2) and (3.0.3) gives the marginal, or shadow, value of relaxing the constraint in the UMP (this is a general property of Lagrange multipliers; see Sections M.K and M.L of the Mathematical Appendix). It therefore equals the consumer's marginal utility value of wealth at the optimum. To see this directly, consider for simplicity the case where x(p, IV) is a differentiable function and x(p, 11') »0. By the chain rule, the change in utility from a marginal increase in II' is given by Vu(x(p, 11'». D •.x(p, 11'), where D •. x(p, IV) = [vx,(p, 11')/011', ... , OXL(P, W)/OIV]. Substituting for Vu(x(p, from condition (3.0.4), we get

-~- = x,

i.PI

(10.7)

and I-~

- - = i.p,

x2

(3.0.8)

for some i. ~ 0, and the budget constraint P'x(p, w) = \\'. Conditions (3.0.7) and (3.0.8) imply that

w»

w»' Dwx(p, 11') = i.p· Dwx(p, w) =

UTI LIT Y

We have seen that conditions (3.0.2) and (3.0.3) must necessarily be satisfied by any x' E x(p, 11'). When, on the other hand, does satisfaction of these first-order conditions by some bundle x imply that x is a solution to the UM P? That is, when are the first-order conditions slIjJiciellt to establish that x is a solution? If u(') is quasiconcave and monotone and has VII(X) #' 0 for all X E R ., then the Kuhn-Tucker first·order conditions are indeed sufficient ' (see Section M.K of the Mathematical Appendix). What if u(·) is not quasiconcave? In lhat case. if u(·) is locally quasiconcave at x', and if x' satisfies the first·order conditions, then x' is a local maximum. Local quasiconcavity can be verified by means of a determinant test on the hordered Hessian mdlrix of u(·) at x'. (For more on this, see Sections M.e and M.O of the Mathematical Appendix.)

(3.0.5)

The expression on the left of (3.0.5) is the marginal rate of substitution of good ( for good k at x*, M RSIl(x*); it tells us the amount of good k that the consumer must be given to compensate her for a one-unit marginal reduction in her consumption of good I." In the case where L = 2, the slope of the consumer's indifference set at x* is precisely -MRS 12 (x*). Condition (3.0.5) tells us that at an interior optimum, the consumer's marginal rate of substitution between any two goods must be equal to their price ratio, the marginal rate of exchange between them, as depicted in Figure 3.0.4(a). Were this not the case, the consumer could do better by marginally changing her consumption. For example, if [ou(X*)/OX,]/[OU(x*)/ox,] > (pdp,), then an increase in the consumption of good ( of dx combined with a decrease in good k's consumption equal to (PI/P,) dx would be "feasible and would yield a utility change " of [211(X*)/ox , ] dx, - [ou(x*)/iJx,](P,/P,) dx, > O.

Vu(x(p,

THE

a marginal increase in wealth-the consumer's marginal utility of wealth-is precisely )..'0

gradient vector of the consumer's utility function Vu(x*) must be proportional to the price vector p, as is shown in Figure 3.0.4(a). If Vu(x*) » 0, this is equivalent to the requirement that for any two goods ( and k, we have oU(X*)/OX,

3. 0:

).,

or, using the budget constraint,

where the last equality follows because p'x(p,lI') = II' holds for all II' (Walras' law) and therefore p' D•. x(p, 11') = I. Thus, the marginal change in utility arising from

p,x, =

."--- (w - p,x,).

I-~

10. Note that if monotonici'y of u(') is slrengthened slightly by requiring that Vu(x) ;;, 0 and

9. Note that ir utility is unchanged with differential changes in XI and Xl. dx, and dx ... then [ u(O), we must have x· ~ O. Hence, p·x· > O. Suppose that x· is not optimal in the UMP when wealth is p·x·. Then there exists an x' such that u(x') > u(x·) and p' x' :5'. p' x·. Consider a bundle x· = o:x' where 0: e (0,1) (x· is a "scaled-down" version of x'J. By continuity of u(·), if 0: is close enough to I, then we will have u(x·) > u(x·) and p·x· < p·x·. But this contradicts the optimality of x. in the EMP. Thus, x· must be optimal in the UMP when wealth is p·x·, and Ihe maximized utility level is therefore u(x·). In Proposition 3.E.3(ii), we will show that if x· solves the EMP when the required utility level is u, then u(x·) = u. • As with the UMP, when p» 0 a solution to the EMP exists under very general conditions. The constraint set merely needs to be nonempty; that is, u(·) must allain values at least as large as u for some x (see Exercise 3.E.3). From now on, we assume that this is so; for example, this condition will be satisfied for any u > u(O) if u(') is unbounded abovc. We now proceed to study the optimal consumption vector and the value function of the EM P. We consider the value function first.

(EMP)

P'x

SECTION

u.

Whereas the UM P computes the maximal level of utility that can be obtained given wealth IV, the EM P computes the minimal level of wealth required to reach utility level u. The EMP is the "dual" problem to the UMP. It captures the same aim of efficient use of the consumer's purchasing power while reversing the roles of objective function and constraint. 13 Throughout this section, we assume that u(·) is a continuous utility function representing a locally nonsatiated preference relation;:: defined on the consumption set R~. The EMP is illustrated in Figure 3.E.1. The optimal consumption bundle x· is the least costly bundle that still allows the consumer to achieve the utility level u. Geometrically, it is the point in the set {x e R~: u(x) ~ u} that lies on the lowest possible budget line associated with the price vector p. Proposition 3.E.1 describes the formal relationship between EMP and the UMP.

The Expenditure Function Given prices p » 0 and required utility level u > u(O), the value of the EM P is denoted e(p, u). The function e(p, u) is called the expenditure funclion. Its value for any (p, u) is simply p·x·, where x· is any solution to the EMP. The result in Proposition 3.E.2 describes the basic properties of the expenditure function. It parallels Proposition 3.D.3's characterization of the properties of the indirect utility function for the UM P. Proposition 3.E.2: Suppose that u(·) is a continuous utility function representing a locally nonsatiated preference relation;:: defined on the consumption set X = IR~. The expenditure function e(p, u) is

Proposition 3.E.1: Suppose that u(') is a continuous utility function representing a locally nonsatiated preference relation;:: defined on the consumption set X = IR~ and that the price vector is p » O. We have

(i) (ii) (iii) (iv)

(i) If x· is optimal in the UMP when wealth is w > 0, then x· is optimal in the EMP when the required utility level is u(x·). Moreover, the minimized expenditure level in this EMP is exactly w. (ii) If x· is optimal in the EMP when the required utility level is u > u(O), then x· is optimal in the UMP when wealth is p·x·. Moreover, the maximized utility level in this UMP is exactly u.

Homogeneous of degree one in p. Strictly increasing in u and nondecreasing in PI for any t. Concave in p. Continuous in p and u.

Proof: We prove only properties (i), (ii), and (iii). (i) The constraint set of the EM P is unchanged when prices change. Thus, for any scalar rx > 0, minimizing (rxp)' x on this set leads to the same optimal consumption bundles as minimizing p·x. Lelting x· be optimal in both circumstances, we have e(rxp, u)

Proof: (i) Suppose that x· is not optimal in the EMP with required utility level u(x·). Then there exists an x' such that u(x') ~ u(x·) and P'x' < p·x· :5 IV. By local nonsatiation, we can find an x" very close to x' such that u(x") > u(x') and P'x" < IV. But this implies that x" e Bp.w and u(x") > u(x·), contradicting the optimality of x· in the UMP. Thus, x· must be optimal in the EMP when the required utility level

= rxp·x· = o:e(p, u).

(ii) Suppose that e(p, u) were not strictly increasing in u, and let x' and x" denote optimal consumption bundles for required utility levels u' and u·, respectively, where u" > u' and P'x' ~ p·x· > O. Consider a bundle x = o:x·, where o:e(O, I). By continuity of u(·), there exists an ~ close enough to 1 such that u(x) > u' and p' x' > p' .x. But this contradicts x' being optimal in the EM P with required utility level u'. To show that e(p, u) is nondecreasing in PI' suppose that price vectors p" and p' have P, ~ p~ and p; = p~ for all k ~ f. Let x· be an optimizing vector in the EM P for prices p•. Then e(p·, u) = p•. x· ~ p" x· 2! e(p', u), where the lalter inequality follows from the definition of e(p', u).

13. The lerm "dual" is meanllo be suggeSlive. II is usualty applied 10 pairs of problems and conceplS Ihal are formalty similar excepllhallhe role of quanlities and prices, and/or maximizalion and minimization, and/or objective runction and constraint. have been reversed.

L

PROBLEM

59

60

CH"PTER

3:

CL"SSIC .. L

DE .... ND

THEORY

SECTION

3.E:

THE

EXPENDITURE

MINIMIZATION

PROBLEM

figure 3.E.3

The Hicksian (or compensated) demand function.

Figure 3.E.2

p,

p,

(a)

~ E

.p,

p,

(b)

XI

The concavity in p of the expenditure function.

single-valued. (The reason for the term "compensated demand" will be explained below.) Figure 3.E.3 depicts the solution set hlp, u) for two different price vectors p and p'. Three basic properties of Hicksian demand are given in Proposition 3.E.3, which parallels Proposition 3.0.2 for Walrasian demand.

(iii) For concavity, fix a required utility level il, and let p" = rxp + (I - rx)p' for [0, I]. Suppose that x' is an optimal bundle in the EM P when prices are pH. If so, e(p", ti) = p"' x"

+ (I + (I

=

(1.p'.~"

~

rxe(p, II)

~)p'·x·

Proposition 3.E.3: Suppose that u(·) is a continuous utility function representing a locally nonsatiated preference relation l:: defined on the consumption set X = RL+. Then for any p » 0, the Hicksian demand correspondence hlp. u) possesses the following properties:

- rx)e(p', ti),

where the last inequality follows because u(x") ~ il and the definition of the expenditure function imply that p' x" ~ e(p, il) and p" x' ~ e(p', il) . •

(i) Homogeneity of degree zero in p: h(rxp. u) = h(p. u) for any P. u and rx > O. (ii) No excess utility: For any xEh(p. u). u(x) = u. (iii) Convexity/uniqueness: II l:: is convex, then hlp. u) is a convex set; and if l:: is strictly convex, so that u(·) is strictly quasiconcave. then there is a unique element in hlp, u).

The concavity of e(p, ti) in p for given ti, which is a very important property, is actually fairly intuitive. Suppose that we initially have prices p and that x is an optimal consumption vector at these prices in the EM P. If prices change but we do not let the consumer change her consumption levels from x, then the resulting expenditure will be p' x, which is a linear expression in p. But when the consumer can adjust her consumption, as in the EM P, her minimized expenditure level can be no greater than this amount. Hence, as illustrated in Figure 3.E.2(a), where we keep PI fixed and vary P2' the graph of e(p, il) lies below the graph of the linear function p·.x at all p i' P and touches it at p. This amounts to concavity because a similar relation to a linear function must hold at each point of the graph of e(', u); see Figure 3.E.2(b). Proposition 3.E.1 allows us to make an important connection between the expenditure function and the indirect utility function developed in Section 3.0. In particular, for any p » 0, W > 0, and u > 11(0) we have

e(p, r(p, 11'»

= I\'

and

v(p, e(p,

II» = u.

Proof: (i) Homogeneity of degree zero in p follows because the optimal vector when minimizing p' x subject to u(x) :2: u is the same as that for minimizing rxp' x subject to this same constraint, for any scalar rx > O. (ii) This property follows from continuity of u(·). Suppose there exists an x E h(p, u) such that u(x) > u. Consider a bundle x' = rxx, where rx E (0, I). By continuity, for rx close enough to I, u(x'):2: u and p'x' < P'x, contradicting x being optimal in the EMP with required utility level u. (iii) The proof of property (iii) parallels that for property (iii) of Proposition 3.0.2 and is left as an exercise (Exercise 3.E.4). • As in the UM P, when u(·) is differentiable, the optimal consumption bundle in the EM P can be characterized using first-order conditions. As would be expected given Proposition 3.E.I, these first-order conditions bear a close similarity to those of the UMP. Exercise 3.E.I asks you to explore this relationship.

(3.E.I)

These conditions imply that for a fixed price vector p, e(p, .) and vIp, .) are inverses to one another (see Exercise 3. E.8). In fact, in Exercise 3.E.9, you are asked to show that by using the relations in (3.E.I), Proposition 3.E.2 can be directly derived from Proposition 3.0.3, and vice versa. That is, there is a direct correspondence between the properties of the expenditure function and the indirect utility function. They both capture the same underlying features of the consumer's choice problem.

Exercise 3.E.!: Assume that u(·) is differentiable. Show that the first-order conditions for the EM Pare (3.E.2)

P
SECTION

3.F:

DUALITY:

A

MAT~EMATICAL

Proof: For any p » 0, consumption bundle hlp, u) is optimal in the EMP, and so it achieves a lower expenditure at prices p than any other bundle that offers a utility level of at least u. Therefore, we have

p··h(p·, 1/)

:
Proof I: (Duality Theorem Argument). The result is an immediate consequence of the duality theorem (Proposition 3.F.I). Since the expenditure function is precisely the support function for the set K = (x E R~: u(x) ~ u}, and since the optimizing vector associated with this support function is hlp, u), Proposition 3.F.I implies that hlp, u) = V,e(p, u). Note that (3.G.I) helps us understand the use of the term ~dual" in this context. In particular, just as the derivatives of the utility function u(·) with respect to quantities have.a price interpretation (we have seen in Section 3.D that at an optimum they are equal to prices multiplied by a constant factor of proportionality), (lG.I) tells us that the derivatives of the expenditure function e(', u) with respect to prices have a quantity interpretation (they are equal to the Hicksian demands). _

The idea behind all three proofs is the same: If we are at an optimum in the EMP, the changes in demand caused by price changes have no first-order effect on the consumer's expenditure. This can be most clearly seen in Proof 2; condition (3.G.2) uses the chain rule to break the total effect of the price change into two effects: a direct effect on expenditure from the change in prices holding demand fixed (the first term) and an indirect effect on expenditure caused by the induced change in demand holding prices fixed (the second term). However, because we are at an expenditure minimizing bundle, the first-order conditions for the EMP imply that this latter effect is zero. Proposition 3.G.2 summarizes several properties of the price derivatives of the Hicksian demand function D,h(p, u) that are implied by Proposition 3.G.1 [properties (i) to (iii)]. It also records one additional fact a~out these derivatives [property (iv)].

Proof 2: (First-Order COllditions Argument). For this argument, we focus for sim· plicity on the case where hlp, u) » 0, and we assume that hlp, u) is differentiable at (p, u).

Using the chain rule, the change in expenditure can be written as

Proposition 3.G.2: Suppose that u(·) is a continuous utility function representing a locally nonsatiated and strictly convex preference relation ..liIUli"" mCltrix. Note, in particular, that S(p, 1\') is directly computable from knowledge of the (observable) Walrasian demand function x(p, 1\'). Because S(p, 1\') = D,11(p, u), Proposition 3.G.2 implies that when demand is generated from preference maximization, S(p, \\') must possess the following three properties: it must be lIegalive semidefinile, spllmetric, I/Ild salisfy S(p, I\')p = O. In Section 2.F, the Slutsky substitution matrix S(p, w) was shown to be the matrix of compensated demand derivatives arising from a different form of wealth compensation, the so-called SIUI.,k,r II'faled ciellJeII"}.

We can understand this result as follows: Suppose we have a utility functio,) u(·) and are at initial position (p. Ii-) with .i = x(p, Ii-) and ii = 1/(',,). As we change prices to p', we want to change wealth in order to compensate for the wealth effect arising from this price change. In principle, the compensation clln be done in two ways. By changing wealth by amount .6,\\'ShIlSly = p'·x(p. \t') - }t', we leave lhe consumer just able to afford her initial bundle .'(.

Alternatively, we can change wealth by amount unchanged. We ha\'e L\\\'Hi~b ~

al\'HI,k,

= CV(po, pI, w)(you should check that the same is true when p: > p?). This relation between the E V and the C V reverses when good I is inferior (see Exercise 3.1.3). However, if there is no wealth effect for good I (e.g., if the underlying preferences are quasilinear with respect to some good I # I), the CV and EV measures are the same because we then have

CV(po, p', w)

The equivalent EV(po, p', w),

since

EV(pO, pi,

w) _

EV(po, p', w) = e(po, .') - e(po, .').

Thus. Ihe E V measures EV(po, pi, w) and E V(p·, p', w) can be used nol only 10 compare Ihese Iwo price veclors wilh p. bUI also 10 delermine which of Ihem is beller for Ihe consumer. A comparison oflhe compensaling varialions CV(p·, pi, w) and CV(p·, p', w), however, will nol necessarily rank pi and p' correclly. The problem is Ihallhe CVmeasure uses Ihe new prices as Ihe base prices in Ihe money melric indirecl ulilily funclion, using pi 10 calculale CV(po. pi, w) and p' 10 calculale CV(po, p', w). So

h,(p~ + I" p~, . • ') = h (p', .')

x,

(b)

(a)

or course, we can

rank pi and pl correctly

by

Comparing two taxes

In summary, if we know the consumer's expenditure function, we can precisely measure the welfare impact of a price change; moreover, we can do it in a convenient way (in dollars). In principle, this might well be the end of the story because, as we saw in Section 3.H, we can recover the consumer's preferences and expenditure function from the observable Walrasian demand function x(p, w).2> Before concluding, however, we consider two further issues. We first ask whether we may be able to say anything about the welfare effect of a price change when we do nol have enough information to recover the consumer's expenditure function. We describe a test that provides a sufficient condition for the consumer's welfare to increase from the price change and that uses information only about the two price vectors pO, pi and the initial consumption bundle x(po, w). We then conclude by discussing in detail the extent to which the welfare change can be approximated by means of the area to the left of the market (Walrasian) demand curve, a topic of significant historical importance.

e(pl, uO) = e(pO, uO)

that raise revenue T.

(a) Tax on good I. (b) Tax on good 2.

+ (pi

- pO)'Vpe(po, uO)

+ o(lIp'

- pOll).

(3.1.7)

If (pi - po). Vpe(pO, ,,") < 0 and the second-order remainder term could be ignored, we would have e(pl, ,,0) < e(po, uo) = w, and so we could conclude that the consumer's welfare is greater after the price change. But the concavity of e(', uo) in p implies that the remainder term is nonpositive. Therefore, ignoring the remainder term leads to no error here; we do have ('(pI, uO) < w if (pI - pO)·V.e(po, uo) < O. Using Proposition 3.0.1 then tells us that (pI _pO)'Vpe(pO, uo) = (pi _ pO)'h(po, uo) = (pI _ pO). x O, and so we get exactly the test in Proposition 3.1.1.

seeing whether C V(pl, pl, w) is positive or

25. As a practical maller, in applications you should use whatever are the state-of-the-art techniques for performing this recovery.

negative.

L

E CON 0 M ICC HAN G E S

The test in Proposition 3.1.1 can be viewed as a first-order approximation to the true welfare change. To see this, take a first-order Taylor expansion of e(p, u) around the initial prices po:

is beller Ihan tax I, if and only if E V(po, pi, w) > E V(p·, p', w), I, is beller Ihan I, if and only if [( - n - EV(p·, pi, w)) < [( - T) - E V(p·, p', w)], Ihal is, if and only if Ihe deadweighlloss arising under lax I, is less than Ihat arising under tax I,.

24.

0 F

Proof: The result follows simply from revealed preference. Since pO. XO = w by Walras'law, if (pi - po). Xo < 0, then p' 'xo < w. But ifso, X Ois still affordable under prices pi and is, moreover, in the interior of budget set B.,.w' By local nonsatiation, there must therefore be a consumption bundle in B.,. w that the consumer strictly prefers to xO. •

Flgur. 3.1.6

hl(p~+I,.P~'.UI)

E V A L U A T ION

Proposition 3.1.1: Suppose that the consumer has a locally nonsatiated rational preference relation::::. If (p' - pO). XO < 0, then the consumer is strictly better off under price-wealth situation (p', w) than under (po, w).

Deadweighl Loss from Tax on Good 2

= h,(p', .')

W ELF ARE

In some circumstances, we may not be able to derive the consumer's expenditure function because we may have only limited information about her Walrasian demand function. Here we consider what can be said when the only information we possess is knowledge of the two price vectors pO, pi and the consumer's initial consumption bundle XO = x(po, w). We begin, in Proposition 3.1.1, by developing a simple sufficiency test for whether the consumer's welfare improves as a result of the price change.

which need nOI correclly rank pi and p' [see Exercise 3.1.4 and Chipman and Moore (1980)]. In olher words, fixing pO, E V(p·, " w) is a valid indirecl ulilily funclion (in facl, a money mel ric one), bUI CV(po, " w) is not." An inleresling example of Ihe comparison of several possible new price veclors arises when a governmenl is considering which goods 10 lax. Suppose, for example, Ihallwo differenllaxes are being considered Ihal could raise lax revenue of T: a tax on good I of I, (crealing new price vector pi) and a tax on good 2 of I, (creating new price vector p'). Note thai since they raise Ihe same tax revenue, we have I,X,(p', w) = I,X,(p', w) = T(seo Figure 3.1.6). Because lax I,

'I

3. I:

Welfare Allalysis with Partiallll/ormatioll

CV(po, pi, w) _ CV(p·, p', w) = e(p', •• ) - e(p', .0),

p~ +

SEC T ION

l

87

88

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

------------------------------------------h

h

JpeR!: ;(p:"0) > e( pO. "o)} ::,;:,'£~'r~:~~,~:f;:! ~

"

~

'~ {p e

Figure 3.1.7

/

R!: e(P. "0) ~ e(po• "oll

The welfare lest of Propositions 3.1.\ and

{p e R!: e(P. "0) ~ e(po. "o)}

P,

Proposition 3.1.2: Suppose that the consumer has a differentiable expenditure function. Then if (p' - po)·xO > 0, there is a sufficiently small a e (0.1) such that for all a < 2. we have e.«1 - a)po + ap'. u O) > w. and so the consumer is strictly better off under price-wealth situation (po. w) than under «1 - a)po + ap'. w). Figure 3.1.7 illustrates these results for the cases where pi is such that (pi _ pO)'x o < 0 [panel (a)) and (pi - pO)'x o > 0 [panel (b)]. In the figure the set of prices (p e R~: e(p, uO) ~ e(po, UO)} is drawn in price space. The concavity of e(', u) gives it the shape depicted. The initial price vector pO lies in this set. By Proposition 3.G.l, the gradient of the expenditure function at this point, V,e(po, UO), is equal to x O, the initial consumption bundle. The vector (pi - pO) is the vector connecting point po to the new price point p'. Figure 3.1. 7(a) shows a case where o (p' - pO)'xo < O. As can be seen there, pi lies outside of the set (p e R;: e(p, u ) ~ e(po, uo)}, and so we must have e(po, uo) > e(pl, uO). In Figure 3.1.7(b), on the other hand, we show a case where (p' - pO)·x o > O. Proposition 3.1.2 can be interpreted as asserting that in this case if (pi _ po) is small enough, then e(po, uo) < e(p', uO). This can be seen in Figure 3.1.7(b), because if (pi - po). XO > 0 and p' is close enough to pO [in the ray with direction p' - pO], then price vector p' lies in the set

e

R~: e(p, u O)

>

3.1:

WELFARE

EVALUATION

OF

ECONOMIC

however, it has been common practice in applied analyses to rely on approximations of the true welfare change. We have already seen in (3.1.3) and (3.1.4) that the welfare change induced by a change in the price of good 1 can be exactly computed by using the area to the left of an appropriate Hicksian demand curve. However, these measures present the problem of not being directly observable. A simpler procedure that has seen extensive use appeals to the Walrasian (market) demand curve instead. We call this estimate of welfare change the area variation measure (or A V):

3.1.2. P,

What if (pi - po)'xo > O? Can we then say anything about the direction of change in welfare? As a general matter. no. However. examination of the first-order Taylor expansion (3.1.7) tells us that we get a definite conclusion if the price change is. in an appropriate sense, small enough because the remainder term then becomes insignificant relative to the first-order term and can be neglected. This gives the result shown in Proposition 3.1.2.

{p

SECTtON

e(po, uO)}.

Using the Area to the Left of the Walrasian (Market) Demand Curve as an Approximate Welfare Measure Improvements in computational abilities have made the recovery of the consumer's preferences/expenditure function from observed demand behavior, along the lines discussed in Section 3.1, far easier than was previously the case. 26 Traditionally, 26. They have also made it much easier to estimate complicated demand systems that are explicitly derived from utility maximization and from which the parameters of the expenditure function can be derived directly.

CHANGES

89

---------------------------------------------------------------~~

(a) (pi - pO)'xo - y for every y # x(p, w) with y E Bp.w.

28. In effect, the property identified here amounts to saying that the Walrasian demand function provides a first-order approximation '0 the compensating variation. Indeed, note that the deriva,ives of CV(p', pO, w), EV(p', pO, w), and AV(p', pO, w) with respectto p: evaluated at p~ are all precisely x,(p~, p". .. wI. 29. Thus, for example, in the problem discussed above where we compare 'he deadweight losses induced by taxes on two different commodities that both raise revenue T. the area variation measure need not give the correct ranking even for small taxes.

30. For an informal account of revealed preference theory after Samuelson, see Mas-Colell (1982).

l

PRE FER E" C E

91

92

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

93

----------------------------------------------------------- ----------------------------------------------------------APPENDIX

A:

CONTINUITY

Proof: We follow Richter (1966). His proof is based on set theory and differs markedly from the differential equations techniques used originally by Houthakker.31 Define a relation >-' on commodity vectors by letting x >-' y whenever x '" y and we have x = -«p, w) and p' y :S w for some (p, wI. The relation >-' can be read as "directly revealed preferred to." From >-' define a new relation >-', to be read as "directly or indirectly revealed preferred to," by letting x >-' y whenever there is a chain x' >-' x' >-', ... , >-' x N with x' = x N and x = y. Observe that, by construction, >-' is transitive. According to the SA, >-' is also irreftexive (i.e., x >-' x is impossible). A certain axiom of set theory (known as Zorn's lemma) tells us the following: Every relation >-' that is transitive and irrejlexive (called a partial order) hus a total extension >-', an irreflexive and transitive relation such that, first, x >-' y implies x >-' y and, second, whenever x '" y, we have either x >-' y or y >-' x. Finally, we can define ~ by letti~~ x ~ y whenever x = y or x >-' y. It is not difficult now to verify that - y whenever p' y :S wand y '" x(p, wI . •

AND

DIFFERENTIABILITY

OF

WALRASIAN

Figure 3.AA.l

An upper hemicontinuous

Walrasian demand x,

correspondence.

I

I, I

Continuity The proof of Proposition 3.1.1 uses only the single·valuedness of x(p, w~ Provided choice is single·valued, the same result applies to the abstract theory of choice of Chapter I. The fact that the bUdgets are competitive is immaterial.

Because x(p, w) is, in general, a correspondence, we begin by introducing a generalization of the more familiar continuily property for functions, called upper

In Exercise 3.1.1, you are asked to show that the WA is equivalent to the SA when L = 2. Hence, by Proposition 3.1.1, when L = 2 and demand satisfies the WA we can always find a rationalizing preference relation, a result that we have alread; seen in Section 3.H. When L > 2, however, the SA is stronger than the W A. In fact, Proposition 3.1.1 tells us that a choice-based theory of demand founded on the strong axiom is essentially equivalent to the preference-based theory of demand presented in this chapter.

Definition 3.AA.1: The Walrasian demand correspondence x(p, w) is upper hemicontinuous at (p, w) if whenever (pn, w n) .... (p, w), xn e x(pn, w n) for all n, and x = Iimn~'" x n, we have x e x(p, w).32

hemicontinuit y.

In words, a demand correspondence is upper hemicontinuous at (p, w) if for any sequence of price-wealth pairs the limit of any sequence of optimal demand bundles is optimal (although not necessarily uniquely so) at the limiting price-wealth pair. If x(p, w) is single-valued at all (p, w) »0, this notion is equivalent to the usual continuity property for functions. Figure 3.AA.I depicts an upper hemicontinuous demand correspondence: When p" .... p, x(', w) exhibits a jump in demand behavior at the price vector p, being x" for all p" but suddenly becoming the interval of consumption bundles [x, x) at p. It is upper hemicontinuous because x (the limiting optimum for p" along the sequence) is an element of segment [x, x) (the set of optima at price vector pl. See Section M.H of the Mathematical Appendix for further details on upper hemicontinuity.

The strong axiom is therefore essentially equivalent both to the rational preference hypothesis and to the symmetry and negative semidefiniteness of the Slutsky matrix. We have seen that the weak axiom is essentially equivalent to the negative semidefiniteness orthe Slutsky matnx. It IS therefore natural to ask whether there is an assumption on preferences that is weaker than rationality and that leads to a theory of consumer demand equivalent to that based on the WA. Violations of the SA mean cycling choice, and violations of the symmetry of the Slutsky matrix generate path dependence in attempts to "integrate back" to preferences. Th,s suggests preferences that may violate the transitivity axiom. See the appendix with W. Shafer in Kihlstrom, Mas-Colell, and Sonnenschein (1976) for further discussion of this point.

Proposition 3.AA.l: Suppose that u(·) is a continuous utility function representing locally non satiated preferences ~ on the consumption set X = R~. Then the derived demand correspondence x(p, w) is upper hemicontinuous at all (p, w) » O. Moreover, if x(P. w) is a function [i.e., if x(p, w) has a single element for all (p, w)), then it is continuous at all (p, w) » O. Proof: To verify upper hemicontinuity, suppose that we had a sequence {(pO, w")l:'., _ (p, w) » 0 and a sequence {x" I:'., with x" E x(p", w') for all n, such that x' - x and x j x(p, w). Because p" x" :S w' for all n, taking limits as n - 00, we conclude that p. x :S W. Thus, x is a feasible consumption bundle when the budget set is B;.... However, since it is not optimal in this set, it must be that u(x) > u(x) for some X E B;....

APPENDIX A: CONTINUITY AND DIFFERENTIABILITY PROPERTIES OF WALRASIAN DEMAND

In Ihis appendix, we investigate the continuity and differentiability properties of the Walrasian demand correspondence x(p, wI. We assume that x» 0 for all (p, w) »0 and x e x(p, wI.

32. We use the notation z" - z as synonymous with z

= Iim" .... ox:> Z".

This definition of upper

hemicontinuily applies only to correspondences Ihat arc "locally bounded" (see Section M.H of Ihe Mathematical Appendix). Under our assumptions, the Walrasian demand correspondence satisfies Ihis property at all (p, w) »0.

31. Yel a third approach, based on linear programming lechniques, was provided by Afriat (1967).

=

DEMAND

L

94

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

---------------------------------------------------------------------------------~

x,

-

REFERENCES

Therefore, the implicil funclion Iheorem (see Section M.E of the Mathematical Appendix) tells us that the differentiability of the solution x(p, w) as a function of the parameters (p, w) of the system depends on the Jacobian matrix of this system having a nonzero determinant. The Jacobian matrix [i.e., the derivative matrix of the L + I component functions with respect to the L + I variables (x, ).)] is

[D2;;X) ~pl x,

Since Vu(x) = ),p and ). > 0, the determinant of this matrix is nonzero if and only if the determinant of the bordered Hessian of u(x) at x is nonzero:

D2U(X) I[VU(X)]T By the continuity of u(·), there is a y arbitrarily close to x such that p' y < wand > u(x). This bundle y is illustrated in Figure 3.AA.2. Note that if /I is large enough, we will have pO. y < w" [since (p", w") - (p, w)]. Hence, y is an element of the budget set B,... ..... , and we must have u(x") ~ u(y) because x" e x(p", w"). Taking limits as /I - 00, the continuity of u(') then implies that u(x) ~ u(y), which gives us a contradiction. We must therefore have x e x(p, w), establishing upper hemicontinuity of

u(y)

x(p, w).

The same argument also establishes continuity if x(p, w) is in fact a function. _ Suppose that the consumption set is an arbitrary closed set Xc R'+. Then the continuity (or upper hemicontinuity) property still follows at any (p, w) that passes the following (locally cheaper cOllsumption) test: "Suppose that x e X is affordable (i.e., p'x s: w). Then there is a ye X arbitrarily close to x and that costs less than w(i.e., p' Y < w).n For example, in Figure 3.AAJ, commodity 2 is available only in indivisible unit amounts. The locally cheaper test then fails at the price-wealth point (p, w) = (I, w, w), where a unit of good 2 becomes just affordable. You can easily verify by examining the figure [in which the dashed line indicates indifference between the points (0, I) and z] that demand will fail to be upper hemicontinuous when p, = 'v. In particular, for price-wealth points (p", w) such that p; = I and pi> w, x(p", ,i') involves only the consumption of good 1; whereas at (p, w) = (I, w, w), we have x(p, ,i') = (0, 1). Note that the proof of Proposition 3.AA.1 fails when the locally cheaper consumption condition does not hold because we cannot find a consumption bundle y with the properties described there.

Differentiability Proposition 3.AA.1 has established that if x(p, w) is a function, then it is continuous. Often it is convenient that it be differentiable as well. We now discuss when this is so. We assume for the remaining paragraphs that u(·) is strictly quasiconcave and twice continuously differentiable and that Vu(x) '" 0 for all x. As we have shown in Section 3.D, the first-order conditions for the UMP imply that x(p, w) »0 is, for some). > 0, the unique solution of the system of L + I equations in L + I unknowns: Vu(x) - ),p = 0

p·x-w=O.

Figure 3."''''.2 (left)

Finding a bundle y such that p' y < wand u(y)

>

u(x).

Flgur. 3,"''''.3 (right)

The locally cheaper test fails at price-wealth pair (p,w)= (I,w,w).

VU(X)I '" O. 0

This condition has a straightforward geometric interpretation. It means that the indifference set through x has a nonzero curvature at x; it is not (even infinitesimally) flat. This condition is a slight technical strengthening of strict quasiconcavity [just as the strictly concave function f(x) = -(x') has = 0, a strictly quasiconcave function could have a bordered Hessian determinant that is zero at a point]. We conclude, therefore, that x(p, w) is differentiable if and only if the determinant of the bordered Hessian of u(·) is nonzero at x(p, w). It is worth noting the following interesting fact (which we shall not prove here): If x(p, w) is differentiable at (p, w), then the Slutsky matrix S(p, w) has maximal possible rank; that is, the rank of S(p, w) equals L _ 1.3)

reO)

REFERENCES Afrial, S. (1967). The construction of utility functions from expenditure dala. Inurnational Economic Review 8: 67-77.

Antonelli, G. B. (1886). Sulla r.oria MarrmaticQ d.lla [conomia Politica. Pisa: Nella tipogrofia del Folchetto. [English translation: On the mathematical theory of political economy.] In Preferences, Urilil)' and Demand, edited by J. Chipman, L. Hurwicz. and H. Sonnenschein. New York: Harcourt Brace Jovanovich. 1971.] Chipman. 1., and 1. Moore. (1980). Compensating variation. consumer's surplus. and welfare. American Economic Revitw 70: 933-48. Deaton. A.• and J. Muellbaucr (1980). Economics and Consumer Behavior. Cambridge, U.K.: Cambridge University Press. Debreu, G. (1960). Topotogical methods in cardinal utitity.ln Matlt.matical M,,/tods in th. Social Studies,

1959. edited by K. Arrow, S. Karlin. and P. Suppes. Stanford. Calif.: Stanford University Press. Djewert. W. E. (1982). Duality approaches to microeconomic theory. Chap. 12 in Handbook of Mathrmarical Economic!f. Vol. 2. edited by K. Arrow and M. Intriligator. Amsterdam: North-

Holland. Green. J. R., and W. Heller. (1981). Mathematical analysis and convexity with applications to economics. Chap. I in Handbook of Mathematical Economic.'i, Vol. I, edited by K. Arrow and M. Intriligator.

Amsterdam. North-Holland. 33. This statement applies only to demand generated from a twice continuously differentiable utility function. It need not be true when this condition is not met. For example. the demand function x(p, w) = (w/(p, + p,), wl(p, + p,» is differentiable, and it is generated by the utility function u(x) = Min {x,. "'}' which is not twice continuously differentiable at all x. The substitution matrix

for this demand function has all its entries equal to zero and therefore has rank equal to zero.

96

96

CHAPTER

3:

CLASSICAL

DEMAND

THEORY

E X E R CIS E S

~ Hausman. J. (1981). Exact consumer surplus and deadweight loss. American Economic Review 71: 662-76.

Hicks, J. (1939). Value and Capilal. Oxford: Clarendon Pre... Houthakker. H. S. (1950). Revealed preference and the utility function. Economica 17: 159-14. Hurwicz, L.. and Uzawa. (1911). On the integrability of demand functions. Chap. 6 in Preferences. Ulilily arul Demand. edited by J. Chipman. L. Hurwicz, and H. Sonnenschein. New York: Harcourt Brace. Jovanovich.

Kihlstrom. R, A. Mas·Colell. and H. Sonnenschein. (1916). The demand theory of the weak axiom of

3.C.6 8 Suppose that in a two·commodity world. the consumer's utility function takes the form u(x) = [a,x~ + a,x~] I/p. This utility function is known as the COnstanl elasticity of substitution (or CES) utility function. (a) Show that when p = I. indifference curves become linear. (b) Show that as p - O. this utility function comes to represent the same preferences as the (generalized) Cobb-Douglas utility function u(x) = x~'x~'.

revealed preference. Econometrica 44: 971-78. McKenzie, l. (1956-57). Demand theory without a utility index. Review of Economic Studies 24: 185-89. Marshall. A. (1920). Principles of Economics. London: MacmilhlR. Mas-Colell, A. (1982). Revealed preference arter Samuelson, in Samuelson and Neoclassical Ecorlomjcli.

(e) Show that as p - -00. indifference curves become ~right angles"; that is. this utility function has in the limit the indifference map of the Leontief utility function u(x I' x,) =

edited by G. Feiwel. Boston: Kluwer.Nijholf. Richter. M. (1966). Revealed prererence theolY. E O.

(b) A continuous;::; on (-00. 00) )( R~-I is quasilinear with respect to the first commodity if and only if it admits a utility function u(x) of the form u(.(x,' .... xd. [Hinl: The existence of some continuous utility representation is guaranteed by Proposition 3.G.I.] After answering (a) and (b). argue that these properties of u(·) are cardinal.

(a) Compute the Walrasian demand and indirect utility functions for this utility function.

(c) Derive the Walrasian demand correspondence and indirect utility function for the case of linear utility and the case of Leontief utility (see Exercise 3.C.6). Show that the CES Walrasian demand and indirect utility funetions approach these as p approaches I and -00. respectively. (d) The elasticily of subSlilUlion between goods I and 2 is defined as O[x,(P. w)/x,(P. w)] _----'P,::I/-=.P.:.'.,--, a[PI/p,]

X,(P, w)/x,(P. w)'

Show that for the CES utility function. {,,(Po w) = 1/(1 - pl. thus justifying its name. What is {dp. w) for the linear. Leontief. and Cobb-Douglas utility functions?

97

98

CHAPTER.:

CLASSICAL

DEMAND

THEORY

--------------------------------------------------------------------------------3.0.6" Consider the three-good setting in which the consumer has utility function I/(x) = (x, - h,)'(x, - h,)I(x, - h,)'.

(a) Why can you assume that. of the problem.

+ p + y = 1 without

-

EXERCISES

3.E.9" Use the relations in (3.E.I) to show that the properties of the indirect utility function identified in Proposition 3.0.3 imply Proposition 3.E.2. Likewise, use the relations in (3.E.I) to prove that Proposition 3.E.2 implies Proposition 3.0.3.

loss of generality? Do so for the rest

(b) Write down the first·order conditions for the UMP, and derive the consumer's

Walrasian demand and indirect utility functions. This system of demands is known as the

lill"ar expellditure s},slem and is due to Stone (1954). (e) Verify that these demand functions satisfy the properties listed in Propositions 3.0.2 and 3.0.3.

3.0.7" There are two commodities. We are given two budget sets B,•. w" and B,_ .• _described, respectively, by pO = (I, I), 1V0 = 8 and p' = (1,4), IV' = 26. The observed choice at (po, wo) is -,0 = (4,4). At (p', w'), we have a choice x' such that p'x' = 11".

(a) Determine the region of permissible choices x' if the choices X o and x' are consistent with maximization of preferences.

3.E.IO" Use the relations in (3.E.I) and (3.E.4) and the properties of the indirect utility and expenditure functions to show that Proposition 3.0.2 implies Proposition 3.E.4. Then use these facts to prove that Proposition lE.3 implies Proposition 3.0.2. 3.r.l" Prove formally that a closed, convex set K c RL equals the intersection of the half-spaces that contain it (use the separating hyperplane theorem). 3.F.2 A Show by means of a graphic example that the separating hyperplane theorem does not hold for nonconvex sets. Then argue that if K is closed and not convex, there is always some x f K that cannot be separated from K.

3.G.I" Prove that Proposition 3.G.I is implied by Roy's identity (Proposition 3.G.4). 3.G.2" Verify for the case of a Cobb-Douglas utility function that all of the propositions in

(b) Determine the region of permissible choices Xl if the choices XO and Xl are consistent with maximization of preferences that are quasilinear with respect to the first good. (e) Determine the region of permissible choices x' if the choices X o and x' are consistent

Section 3.G hold.

with maximization of preferences that are quasilinear with respect to the second good.

(0) Derive the Hicksian demand and expenditure functions. Check the properties listed in Propositions 3.E.2 and 3.E.3.

(d) Determine the region of permissible choices x' if the choices x· and x' are consistent

with maximization of preferences for which both goods are normal. (e) Determine the region of permissible choices with maximization of homothetic preferences.

Xl

if the choices

XO

and

Xl

are consistent

3.G.3" Consider the (linear expenditure system) utility function given in Exercise 3.0.6.

(b) Show that the derivatives of the expenditure function are the Hicksian demand function you derived in (a). (e) Verify that the Slutsky equation holds.

3.0.S A Show that· for all (p, w), wov(p, w)/aw = -P''V,v(p, w).

(d) Verify that the own-substitution terms are negative and that compensated cross-price effects are symmetric. (e) Show that S(p, 11') is negative semidefinite and has rank 2.

3.E.I A In text.

3.G.4" A utility function u(x) is additively sepurahlt if it has the form u(x) = L,U,(X/).

3.E.2 A In text.

(0) Show that additive separability is a cardinal property that is preserved only under linear transformations of the utility function.

3.E.3" Prove that a solution to the EMP exists if p» 0 and there is some x E R'+ satisfying u(. 0 for all ( and k # t.

3.E.7" Show that if ;::; is quasilinear with respect to good I, the Hicksian demand functions for goods 2, ... , L do not depend on u. What is the form of the expenditure function in this case?

3.G.5C (liicksian compo.,ite commodities.) Suppose there are two groups of desirable com· modities, x and y, with corresponding prices p and q. The consumer's utility function is u(x, y), and her wealth is w > O. Suppose that prices for goods y always vary in proportion to one another, so that we can write q = Ilq •. For any number z ~ 0, define the function

3.E.SA For the Cobb-Douglas utility function, verify that the relationships in (3.E.I) and (3.E.4) hold. Note that the expenditure function can be derived by simply inverting the indirect

utility function, and vice versa.

u(x, z) = Max

u(x, y)

99

100

C HAP T E R

3:

C L .. S SIC" L

0 EM" NOT H E 0 R Y

(a) Show that ir we imagine that the goods in the economy are x and a single composite commodity Z, that u(x, z) is the consumer's utility runction, and that a is the price or the composite commodity, then the solution to Max •.• u(x, z) S.t. p'x + aZ:$; w will give the consumer's actual levels or x and Z = q.' y. (b) Show that properties or Walrasian demand runctions identified in Propositions 3.0.2 and 3.G.4 hold ror x(p, a, w) and zIp, a, wI.

- --

EXERCISES

3.G.II" Veriry that an indirect utility function in Gorman rorm exhibits linear wealth·

expansion curves. 3.G.n" What restrictions on the Gorman rorm correspond to the cases or homothetic and quasilinear preferences? 3.G.\3c Suppose that the indirect utility runction v(p, w) is a polynomial or degree n on w (with coefficients that may depend on pl. Show that any individual wealth·expansion path is contained in a linear subspace or at most dimension n + I. Interpret.

(e) Show that the properties in Propositions 3.E.3, and 3.G.1 to 3.G.3 hold ror the Hicksian demand runctions derived using u(x, z).

3.G.14 A The matrix below records the (Walrasian) demand substitution effects ror a consumer endowed with rational prererences and consuming three goods at the prices p, = I, p, = 2,

3.G.6" (F. M. Fisher) A consumer in a three.good economy (goods denoted x" x" and x,; prices denoted PI' p" p,) with wealth level w > 0 has demand runctions ror commodities I and 2 given by

and p, = 6:

x,=100-5tJ..t.+pl2+b~ p,

p,

p,

[

~ '~ ~4

:]

x,=a+fltJ..t.+yl2+b~ p,

p,

p,

Supply the missing numbers. Does the resulting matrix possess all the properties or a

where Greek letters are nonzero constants.

substitution matrix?

(a) Indicate how to calculate the demand ror good 3 (but do not actually do it).

3.G.IS" Consider the utility runction

(b) Are the demand runctions ror x, and .I log P, + (Q p~}}. P, ..... Ii, are

necessary ror this to be derivable rrom

utility maximization?

x'Vu(x)

(b) Find the indirect utility that corresponds to it.

Deduce rrom Proposition 3.GA that

(c) Veriry Roy's identity and the Slutsky equation. x(p) = _1_ Vv(p). P'Vv(p)

3.G.17" [From Hausman (1981)] Suppose L = 2. Consider a "local" indirect utility runction defined in some neighborhood or price-wealth pair (p. "'I by

Note that this is a completely symmetric expression. Thus, direct (Walrasian) demand is the normalized derivative or indirect utility, and indirect demand is the normalized derivative or direct utility.

+ ~(a~"" + ~ +

v(P.w) = -eXP(-bP,/Pl)["':

b

p,

p,

b

3.G.S" The indirect utility runction v(p,.-) is logarithmically homogeneous ir v(p, aw) = vIp, IV) + In 2 ror a > 0 [in other words. vIp, IV) = In (v"(p, w)), where v"(p, w) is homo· geneous or degree one]. Show that ir r(',') is logarithmically homogeneous. then x(p, I) = - V,v(p, I).

(a) Verify that the local demand function ror the first good is

3.G.9C Compute the Slutsky matrix rrom the indirect utility runction.

(b) Veriry that the local expenditure runction is

PI x,(P. w) = a p,

3.G.IO" For a runction or the Gorman rorm r(P. w) = alp) + b(p)w. which properties will the runctions a(') and b(·) have to satisry ror vIp. w) to qualiry as an indirect utility runction?

W

+ b - + c.

e(p, u) = - p,u exp (bp,/p,) -

l

c)].

p,

~ (a p , + ~ p, + CP').

101

102

C HAP T E R

3,

C LAS SIC A L

0 E MAN 0

THE 0 R Y

EX ERe I S E S

(c) Verify that the local Hicksian demand function for the first commodity is h,(p, u) = -ub exp (bp,/p,) -

a

3.1.7 8 There are three commodities (i.e .• L = 3). of which the third is a numeraire (let p, = I). The market demand function x(p, w) has

b'

C

3.G.IS Show that every good is related to every other good by a chain of (weak) substitutes; that is, for any goods t and k, either oh,(p, u)/op, ~ 0, or there exists a good r such that ilh,(p, u)/op,
x,(P. w) = a

+ bp, + cp,

x,(p, w) = d

+ ep, + gp,.

(a) Give the parameter restrictions implied by utility maximization. (b) Estimate the equivalent variation for a change of prices from (p" p,) = (1,1) to (PI> fi,) = (2,2). Verify that without appropriate symmetry, there is no path independence. Assume symmetry for the rest of the exercise. (c) Let EV" EV" and EV be the equivalent variations for a change of prices from (p" p,) = (I, I) to, respectively, (2, 1), (1,2), and (2,2). Compare EV with EV, + EV, as a function of the parameters of the problem. Interpret. (d) Suppose that the price increases in (c) are due to taxes. Denote the deadweight losses for each of the three experiments by D W" D W" and D W. Compare D W with D W, + D W, as a function of the parameters of the problem. (e) Suppose the initial tax situation has prices (p" p,) = (1, I). The government wants to raise a fixed (small) amount of revenue R through commodity taxes. Call t, and t, the tax rates for the two commodities. Determine the optimal tax rates as a function of the parameters of demand if the optimality criterion is the minimization of deadweight loss. 3.1.8 8 Suppose we are in a three·commodity market (i.e. L = 3). Letting p, = I, the demand functions for goods I and 2 are

3.H,SB Suppose you know the indirect utility function. How would you recover from it the expenditure function and the direct utility function?

x,(p, w) = a, x,(p, w) = a,

+ b,p, + c,p, + d,p,p, + b,p, + c,p, + d,p,p,.

t = I, ... ,L with Lta.t =

3.H.6 B Suppose that you observe the Walrasian demand functions x,(p, w) = a.,w/p, for all 1. Derive the expenditure function of this demand system. What is the consumer's utility function?

(a) Note that the demand for goods I and 2 does not depend on wealth. Write down the most general class of utility functions whose demand has this property.

3.H.7 8 Answer the following questions with reference to the demand function in Exercise 2.F.I7.

the values of the parameters cannot be arbitrary. Write down as exhaustive a list as you can of

the restrictions implied by utility maximization. Justify your answer.

(a) Let the utility associated with consumption bundle x = (I, I, ... , I) be 1. What is the expenditure function e(p, I) associated with utility level u = I? [Hint: Use the answer to (d) in Exercise 2.r.17.]

(c) Suppose that the conditions in (b) hold. The initial price situation is p = (p" p,), and we consider a change to p' = (p'" p',). Derive a measure of welfare change generated in going from p to p'.

(b) Argue that if the demand functions in (a) are generated from utility maximization, then

(b) What is the upper contour set of consumption bundle x = (I, I, ... , I)? 3.1.1 B In text. 3.1.2 8 In text. 3.I.3B Consider a price change from initial price vector po to new price vector p' :s; pO in which only the price of good t changes. Show that CV(pO, p', w) > EV(pO, pI, w) if good t is inferior. 3.I.4B Construct an example in which a comparison of CV(po, p', w) and CV(pO, p2, w) does not give the correct welfare ranking of pl versus pl,

3.1,SB Show that if u(x) is quasilinear with respect to the first good (and we fix P, = I), then CV(po, p', w) = EV(po, p', w) for any (po, p', wI. 3.1.6 A Suppose there are i = I, ... , I consumers with utility functions ",(x) and wealth w,. We consider a change from pO to p'. Show that if L C V,(po, p', w,) > 0 then we can find (w; H. I such that L. w; :s; L. w, and V,(p', wi) ~ v,(pO, w,) for all i. That is, it is in principle possible to compensate everybody for the change in prices.

(d) Let the values of the parameters be a, = a, = 3/2, b, = c, = I, c, = b, = 1/2, and d, = d, = O. Suppose the initial price situation is p = (1, I). Compute the equivalent variation for a move to p' for each of the following three cases: (i) p' = (2, I), (ii) p' = (I, 2), and (iii) p' = (2, 2). Denote the respective answers by EV;, EV" EV,. Under which condition will you have EV, = EV, + EV,? Discuss. 3.1.9

8

In a one-Consumer economy, the government is considering putting a tax of t per unit

on good ( and rebating the proceeds to the consumer (who nonetheless does not consider the effect of her purchases on the size of the rebate). Suppose that sNIp, w) < 0 for all (p, wI. Show that the optimal tax (in the sense of maximizing the consumer's utility) is zero. 8

3.1.10 Construct an example in which the area variation measure approach incorrectly ranks pO and pI [Hint: Let the change from pO to p' involve a change in the price of more than one good.]

3.1.11" Suppose that we know not only pO, p', and

X

O

but also x, = X(p', wI. Show that if

(p' - pO)·x ' > 0, then the consumer must be worse off at price-wealth situation (p', w) than at (po, w). Interpret this test as a first-order approximation to the expenditure runction at pI,

103

104

C H " PTE R

3:

C L " S SIC" L

CHAPTER

0 EM" NOT H E 0 R Y

Also show that an alternative way to write this test is pO·(x ' - xo) < 0, and depict the test for the case where L = 2 in (x" x,) space. [Hint: Locate the point XO on the set {x E R'+: "(x) = "O}.] 3.1.12B Extend the compensating and equivalent variation measures of welfare change to the case of changes in both prices and wealth, so that we change from (pO, wo) to (pI, w'). Also

Aggregate Demand

4

extend the "partial information" test developed in Section 3.1 to this case. 3.J.tC Show that when L = 2, x(p, w) satisfies the strong axiom if and only if it satisfies the weak axiom. 3.AA.t B Suppose that the consumption set is X = {x E R~: x, + x, ~ I} and the utility function is "(x) = x,. Represent graphically, and show (a) that the locally cheaper consumption test fails at (p, w) = (I, I, I) and (b) that market demand is not continuous at this point. Interpet economically. 3.AA.2c Under the conditions of Proposition 3.AA.I, show that h(p,") is upper hemicon-

tinuous and that e(p, u) is continuous (even if we replace minimum by infimum and allow 0). Also, assuming that h(p, u) is a function, give conditions for its differentiability.

p ~

4 A Introduction For most questions in economics, the aggregate behavior of consumers is more important than the behavior of any single consumer. In this chapter, we investigate the extent to which the theory presented in Chapters I to 3 can be applied to aggregate demand, a suitably defined sum of the demands arising from all the economy's consumers. There are, in fact, a number of different properties of individual demand that we might hope would also hold in the aggregate. Which ones we are interested in at any given moment depend on the particular application at hand. In this chapter, we ask three questions about aggregate demand: (i) Individual demand can be expressed as a function of prices and the individual's wealth level. When can aggregate demand be expressed as a

function of prices and aggregate wealth? (ii) Individual demand derived from rational preferences necessarily satisfies the

weak axiom of revealed preference. When does aggregate demand satisfy the weak axiom? More generally, when can we apply in the aggregate the demand theory developed in Chapter 2 (especially Section 2.F)? (iii) Individual demand has welfare significance; from it, we can derive measures of welfare change for the consumer, as discussed in Section 3.1. When does aggregate demand have welfare significance? In particular, when do the welfare measures discussed in Section 3.1 have meaning when they are computed from the aggregate demand function? These three questions could, with a grain of salt, be called the aggregation theories of, respectively, the econometrician, the positive theorist, and the welfare theorist. The econometrician is interested in the degree to which he can impose a simple structure on aggregate demand functions in estimation procedures. One aspect of these concerns, which we address here, is the extent to which aggregate demand can be accurately modeled as a function of only aggregate variables, such as aggregate (or, equivalently, average) consumer wealth. This question is important because the econometrician's data may be available only in an aggregate form. The positive (behavioral) theorist, on the other hand, is interested in the degree 105

-

,

106

CHAPTER

4:

AGGREGATE

DEMAND

SECTION

4.8:

AGGREGATE

Invariance of aggregate demand to redistribution of

weallh implies weahh expansion paths ,hat are straight and parallel across consumers.

dw, are equal; that is, aXII(p, W,) = axn(p, WJ)

Suppose that there are I consumers with rational preference relations 12:i and corresponding Walrasian demand functions xi(p, w,). In general, given prices P E JRL and wealth levels (w" ... , WI) for the I consumers, aggregate demand can be written as I

x,(p, Wi)'

Thus, aggregate demand depends not only on prices but also on the specific wealth levels of the various consumers. In this section, we ask when we are justified in writing aggregate demand in the simpler form x(p, LiW,), where aggregate demand depends only on aggregate wealth Li Wi' For this property to hold in all generality, aggregate demand must be identical for any two distributions of the same total amount of wealth across consumers. That is, for any (w" ... , WI) and (w~, ... , wI) such that LiWi = LiWi, we must have LiXi(P, Wi) = LiXi(P, wi)· To examine when this condition is satisfied, consider, starting from some initial distribution (w" . .. , WI)' a differential change in wealth (dw" ... ,dwl ) E JRI satisfying Li dW i = O. If aggregate demand can be written as a function of aggregate wealth, then assuming differentiability of the demand functions, we must have

I i

ax,,(p, w,) dw, = 0

aW

for every t.

j

This can be true for all redistributions (dw" ... ,dwl ) satisfying L dw, = 0 and from any initial wealth distribution (w" ... , WI) if and only if the coefficients of the different

.

I '0

•

:.

aWj

(4.B.l)

for every t, any two individuals i and j, and all (w" . .. , WI ).2 In short, for any fixed price vector p, and any commodity t, the wealth effect at P must be the same whatever consumer we look at and whatever his level of wealth. 3 It is indeed fairly intuitive that in this case, the individual demand changes arising from any wealth redistribution across consumers will cancel out. Geometrically, the condition is equivalent to the statement that all consumers' wealth expansion paths are parallel, straight lines. Figure 4.B.l depicts parallel, straight wealth expansion paths. One special case in which this property holds arises when all consumers have identical preferences that are homothetic. Another is when all consumers have preferences that are quasilinear with respect to the same good. Both cases are examples of a more general result shown in Proposition 4.B.1.

i"¥c

Proposition 4.B.1: A necessary and sufficient condition for the set of consumers to exhibit parallel, straight wealth expansion paths at any price vector p is that preferences admit indirect utility functions of the Gorman form with the ,. coefficients on Wi the same for every consumer i. That is:

I ~::.f t;:-~

~''';'''

~~

~

i i

v;(p,

Wi) =

a;(p)

+ b(p)w;.

Proof: You are asked to establish sufficiency in Exercise 4.B.l (this is not too difficult; use Roy's identity). Keep in mind that we are neglecting boundaries (alternatively, the significance of a result such as this is only local). You should not attempt to prove necessity. For a discussion of this result, see Deaton and Muellbauer (1980). •

~c

~l "3

w', in

4.C:

AGGREGATE

DEMAND

AND

THE

with strict inequality if x,(p', IX,W) 1= x,(p, ct,w'). Adding (4.C.2) over gives us precisely (4.C.1). Thus, we conclude that aggregate demand must satisfy the WA for any price-wealth change that is compensated for every consumer. The difficulty arises because a price-wealth change that is compensated in the aggregate, so that w' = p" x(p, w), need not be compensated for each individual; we may well have ct,w' 1= p" x,(p, ct,w) for some or all i. If so, the individual wealth effects [which, except for the condition p'Dw,x(p,ct,w) = I, are essentially unrestricted] can play havoc with the well-behaved but possibly small individual substitution effects. The result may be that (4.C.2) fails to hold for some i, thus making possible the failure of the similar expression (4.C.1) in the aggregate. Given that a property of individual demand as basic as the WA cannot be expected to hold generally for aggregate demand, we might wish to know whether there are any restrictions on individual preferences under which it must be satisfied. The preceding discussion suggests that it may be worth exploring the implications of assuming that the law of demand, expression (4.C.2), holds at the individual level for price changes that are left uncompensated. Suppose, indeed, that given an initial position (p, w,), we consider a price change p' that is not compensated, namely, we leave wi = Wi' If (4.C.2) nonetheless holds, then by addition so does (4.C.1). More formally, we begin with a definition. Deflnilion 4.C.2: The individual demand function Xi(p, Wi) satisfies the uncompensated law of demand (ULD) property if

(p' - p). [xi(p', Wi) - xi(p, Wi)] oS 0

(4.C.3)

for any p, p', and Wi' with strict inequality if xi(p', Wi) 1= Xi(p, Wi)' The analogous definition applies to the aggregate demand function x(p, w). In view of our discussion of the weak axiom in Section 2.F, the following differential version of the ULD property should come as no surprise (you are asked to prove it in Exercise 4.C.1): If x,(p, w,) satisfies the ULD property, then Dpx.(p, w,) is negative semidefinite; that is, dp' Dpx,(p, w,) dp oS 0 for all dp.

As with the weak axiom, there is a converse to this: If Dpx,(p, w.J is negative definite for all p, then x,(p, w,) satisfies the ULD property.

The analogous differential version holds for the aggregate demand function x(p, w). The great virtue of the ULD property is that, in contrast with the WA, it does, in fact, aggregate. Adding the individual condition (4.C.3) for w, = IX,W gives us (p' - p)·[x(p', w) - x(p, w)] oS 0, with strict inequality if x(p, w) 1= x'(p, w). This leads us to Proposition 4.C.1. Proposition 4.C.1: If every consumer's Walrasian demand function xi(p, w,) satisfies the uncompensated law of demand (ULD) property, so does the aggregate demand x(p, w) = Li xi(p, ctiW), As a consequence, the aggregate demand x(p, w)

satisfies the weak axiom.

WEAK

AXIOM

111

112

CHAPTER

.:

AGGREGATE

,.c:

SECTION

DEMAND

AGGREGATE

DEMAND

AND

THE

WEAK

AXIOM

113

..~-----------------------------------------------------------------------------------

Proof: Consider any (p, w), (p', w) with x(p, w) i' x(p', wI. We must have x,(p, a,w) i' x,(p', a,w)

for some i. Therefore, adding (4.C.3) over i, we get (p' - p)'[x(p, w) - x(p', wI] < O.

This holds for all p, p', and w. To verify the WA, take any (p, w), (p', w') with x(p, w) i' x(p', w') and P'x(p', w'):s w. 9 Define p" = (wjw')p'. By homogeneity of degree zero, we have x(p", w) = x(p', w'). From (p" - p)·[x(p", w) - x(p, wI] < 0, P'x(p", w):S w, and Walras' law, it follows that p'" x(p, w) > w. That is, p" x(p, w) > w' . • How restrictive is the ULD property as an axiom of individual behavior? It is clearly not implied by preference maximization (see Exercise 4.C.3). Propositions 4.C.2 and 4.C.3 provide sufficient conditions for individual demands to satisfy the ULD property.

The proof of Proposition 4.C.3 will not be given. The courageous reader can attempt it in Exercise 4.C.S. The condition in Proposition 4.C.3 is not an extremely stringent one. In particular, notice how amply the homothetic case fits into it (Exercise 4.C.6). So, to the question "How restrictive is the ULD property as an axiom of individual behavior?" perhaps we can answer: "restrictive, but not extremely so."tO Note, in addition, that for the ULD property to hold for aggregate demand, it is not necessary that the ULD be satisfied at the individual level. It may arise out of aggregation itself. The example in Proposition 4.C.4, due to Hildenbrand (1983), is not very realistic, but it is nonetheless highly suggestive. Proposition 4.C.4: Suppose that all consumers have identical preferences)::; defined on IR~ [with individual demand functions denoted x(p, wI] and that individual wealth is uniformly distributed on an interval [0, w] (strictly speaking, this requires a continuum of consumers). Then the aggregate (rigorously, the average) demand function x(p) =

Proposition 4.C.2: If)::;, is homothetic, then x,(p, Wi) satisfies the uncompensated law of demand (ULD) property.

tv

x(p, w) dw

satisfies the unrestricted law of demand (ULD) property. Proof: We consider the differentiable case [i.e., we assume that x,(p, w,) is differentiable and that )::;, is representable by a differentiable utility function]. The matrix Dpx,(p, w,) is

Proof: Consider the differentiable case. Take v # O. Then V' Dx(p)v

where S,(p, w;) is consumer i's Slutsky matrix. Because [dp' x,(p, W,)]2 > 0 except when dp' x,(p, w,) = 0 and dp' S,(p, w,) dp < 0 except when dp is proportional to p, we can conclude that Dpx,(p, w,j is negative definite, and so the ULD condition holds. • In Proposition 4.C.2, the conclusion is obtained with minimal help from the substitution effects. Those could all be arbitrarily small. The wealth effects by themselves turn out to be sufficiently well behaved. Unfortunately, the homothetic case is the only one in which this is so (see Exercise 4.C.4). More generally, for the ULD property to hold, the substitution effects (which are always well behaved) must be large enough to overcome possible "perversities" coming from the wealth effects. The intriguing result in Proposition 4.3.C [due to Mitiushin and Polterovich (1978) and Milleron (1974); see Mas-Colell (1991) for an account and discussion of this result] gives a concrete expression to this relative dominance of the substitution effects. Proposition 4.C.3: Suppose that )::;, is defined on the consumption set X = R~ and is representable by a twice continuously differentiable concave function u,~·). If x·' D 2 u.(x.)x.

,

,,' < 4

x,'Vu,(x i )

for all x· "

then x,(p, Wi) satisfies the unrestricted law of demand (ULD) property. 9. Strictly speaking, this proof is required because although we know that the WA is equivalent to the law of demand for compensated price changes, we are now dealing with uncompensated price changes.

=

t'

V'

D.x(p, w)v dw.

Also D.x(p, w) = SIp, w) - Dwx(p, w)x(p, wIT,

where SIp, w) is the Slutsky matrix ofthe individual demand function V·

Dx(p)v =

t'

v'S(p, w)v dw -

t'

xc-. .) at(p, wI. Hence,

(v' Dwx(p, w))(v'x(p, w)) dw.

The first term of this sum is negative, unless v is proportional to p. For the second, note that _ d(v'x(p, w»' 2(v' Dwx(p, w))(v'x(p, w)) = -----c'-----"dw

So -

f"

1 2

(v'Dwx(p, w))(v'x(p, w»dw= - -

o

f" 0

d(v'x(p w))' 1 ' dw= --(v'x(p,w))',sO, dw 2

where we have used x(p, 0) = O. Observe that the sign is negative when v is proportional to p. • Recall that the ULD property is additive across groups of consumers. Therefore, what we need in order to apply Proposition 4.C.4 is, not that preferences be identical, but that for every preference relation. the distribution of wealth conditional on that preference be uniform over 10. Not to misrepresent the import ofthis claim, we should emphasize that Proposition 4.C.l,

which asserts that the ULD property is preserved under addition. holds for the price. independent distribution rules that we are considering in this section. When the distribution

or real wealth may

depend on prices (as it typically will in the general equilibrium applications of Part IV). then aggregate demand may violate the WA even if individual demand satisfies the ULD property (see Exercise 4.C.I3). We discuss this point further in Section 17.F.

114

CHAPTER

4:

AGGREGATE

SEC T ION

DEMAND

some interval that includes the level 0 (in fact, a nonincreasing density fUDction is enough; see Exercise 4.C 7).

4. C:

A G G REG ATE

S(p, w) = Dpx(p, w)

+ Dwx(p, w) x(p, w)T

+ n:,,,,Dw,x,(p, ",w)] x(p, W)T

(4.C5)

Next, let S,(p, w,) denote the individual Slutsky matrices. Adding the individual Slutsky equations gives L,S,(p, o,w) = L,Dpx,(p, o,w)

Since Dpx(p, w) =

+ L,Dw,x,(p, o,w) x,(p, O,W)T

(4.C6)

L, Dpx,(p, o,w), we can substitute (4.C6) into (4.C5) to get

S(p, w) = L,S,(p, w,) - L,o,[Dw,x,(p, o,w) - Dwx(p,

W)][ ~ x,(p, o,w) - x(p, W)T.

(4.C7)

Note that because of wealth effects, the Slutsky matrix of aggregate demand is not the sum of the individual Slutsky matrices. The difference C(p, w) = L,S,(p, ow) - S(p, w) =

L,,,,[Dw,x,(p, o,w) - Dwx(p,

W){ ~ x,(p, ",w) -

~ss Individuals

x(p,

W)T

(4.C8)

is a covariance matrix between wealth effect vectors Dw,x,(p, ",w) and proportionately adjusted consumption vectors (l/",)x,(p, ",w). The former measures how the marginal dollar is spent across commodities; the latler measures the same thing for the average dollar [e.g., (I/o,w)x,,(p, ",w) is the per-unit-of-wealth consumption of good t by consumer i]. Every "observation" receives weight ",. Note also that, as it should be, we have

For an individual Slutsky matrix S,(', .) we always have dp'S,(p, .,w) dp < 0 for dp .. 0 not proportional to p. Hence, a sufficient condition for the Slutsky matrix of aggregate demand to have the desired property is that C(p, w) be positive semidefinite. Speaking loosely, this will be the case if, on average, there is a positive association across consumers between consumption (per unit of wealth) in one commodity and the wealth effect for that commodity. Figure 4.C2(a) depicts a case for L = 2 in which, assuming a uniform distribution of wealth across consumers, this association is positive: Consumers with higher-than-average

A X 10M

115

Figure 4.C.2

The relation across consumers between

COnsumers have the

(a)

(b)

consumption of one good spend a higher· than-average fraction of their last unit of wealth on that good. The association is negative in Figure 4.C2(b)."·1l From the preceding derivation, we can see that aggregate demand satisfies the WAin two cases of interest: (i) All the Dw,x,(p, .,w) are equal (there are equal wealth effects), and (ii) all the (I/.,)x, (p, ",w) are equal (there is proportional consumption). In both cases, we have C(p, w) = 0, and so dp'S(p, w) dp < 0 whenever dp" 0 is not proportional to p. Case (i) has important implications. In particular, if every consumer has indirect utility functions of the Gorman form ",(p, WI) = a,(p) + b(p)w, with the coefficient b(p) identical across consumers, then (as we saw in Section 4.8) the wealth effects arc the same for all consumers and we can therefore conclude that the WA is satisfied. We know from Section 4.8 that one is led to this family of indirect utility functions by the requirement that aggregate demand be invariant to redistribution of wealth. Thus, aggregate demand satisfying the weak axiom for a fixed distribution of wealth is a less demanding property than the invariance to redistribution property considered in Section 4.8. In particular, if the second property holds, then the first also holds, but aggregate demand (for a fixed distribution of wealth) may satisfy the weak axiom even though aggregate demand may not be invariant to redistribution of wealth (e.g., individual preferences may be homothetic but not identical). Having spent all this time investigating the weak axiom (W A), you might ask: "What about the strong axiom (SA)?" We have not focused on the Strong Axiom for three reasons. First, the WA is a robust property, whereas the SA (which, remember, yields the symmetry of the Slutsky matrix) is not; a priori, the chances of it being satisfied by a real economy are essentially zero. For example, if we start with a group of consumers with identical preferences and wealth, then aggregate demand obviously satisfies the SA. However, if we now perturb every preference slightly and independently across consumers, the negative semidefiniteness of the Slutsky matrices (and therefore the WA) may well be preserved but the symmetry (and therefore the SA) will almost certainly not be. 12. You may want to verify that the wealth expansion paths of Example 4.C.l must indeed look like Figure 4.C.2(b). 13. A priori, we cannot say which form is more likely_ Because the demand at zero wealth is zero, it is true that for a consumer, some dollar must be spent among the two goods according to shares similar to the shares of the average dollar. But if the levels of wealth are not close to zero, it does not follow that this is the case for the marginal dollar. It may even happen that hecause of incipient satiation, the shares of the marginal dollar display consumption propensities that are the

reverse of the ones exhibited by the average dollar. See Hildenbrand (1994) for an account of II. In the next few paragraphs, we follow Jerison (1982) and Freixas and Mas-Colell (1987).

W £ A K

expenditure per unit of wealth on a commodity and its wealth effect when all

(4.C.4)

Or, since x(p, w) = L,X,(p, ",w), S(p, w) = Dpx(p, w)

at p / / .~oss Individuals

~~;r~hsatP

/

THE

.~ ~;r:hS

Wealth Expansion One lesson of PropoSition 4.C.4 is that the properties of aggregate demand will depend on how preferences and wealth are distributed. We could therefore pose the problem quite generally and ask which distributional conditions on preferences and wealth will lead to satisfaction of the weak axiom by aggregate demand. II As mentioned in Section 2.F, a market demand function x(p, IV) can be shown to satisfy the WA if for all (p, w), tbe Slutsky matrix S(p, w) derived from the function x(p, w) satisfies dp' S(p, w) dp < 0 for every dp .. 0 not proportional to p. We now examine when this property might hold for the aggregate demand function. The Slutsky equation for the aggregate demand function is

AND

Wealth Expansion

x,

x,

DE MAN 0

empirical research on this matter.

same wealth. (a) Positive relation. (b) Negative relation.

116

CHAPTER

.:

AGGREGATE

DEMAND

Second, many oC the strong positive results oC general equilibrium (to be reviewed in Part IV, especially Chapters 15 and 17) to which one wishes to apply the aggregation theory discussed in this chapter depend on the weak axiom, not on the strong axiom, holding in the aggregate. Third, while one might initially think that the existence oC a preCerence relation explaining aggregate behavior (which is what we get Crom the SA) would be the condition required to use aggregate demand measures (such as aggregate consumer surplus) as welCare indicators, we will see in Section 4.D that. in Cact, more than this condition is required anyway.

4.D Aggregate Demand and the Existence of a Representative Consumer The aggregation question we pose in this section is: When can we compute meaningful measures of aggregate welfare using the aggregate demand function and the welfare measurement techniques discussed in Section 3.1 for individual consumers? More specifically, when can we treat the aggregate demand function as if it were generated by a fictional representative consumer whose preferences can be used as a measure of aggregate societal (or social) welfare? We take as our starting point a distribution rule (w,(p, w), ... , w,(p, wi) that to every level of aggregate wealth wE R assigns individual wealths. We assume that L. w.(p, w) = w for all (p, w) and that every w,(', .) is continuous and homogeneous of degree one. As discussed in Sections 4.B and 4.C, aggregate demand then takes the form of a conventional market demand function x(p, w) = L,X,(P, w,(p, wi). In particular, x (p, w) is continuous, is homogeneous of degree zero, and satisfies Walras' law. It is important to keep in mind that the aggregate demand function x(p, w) depends on the wealth distribution rule (except under the special conditions identified in Section 4.B). It is useful to begin by distinguishing two senses in which we could say that there is a representative consumer. The first is a positive, or behavioral, sense. Definition 4.0.1: A positive representative consumer exists if there is a rational preference relation;:: on R~ such that the aggregate demand function x(p, w) Is precisely the Walrasian demand function generated by this preference relation. That is, x(p, w) >- x whenever x .p x(p, w) and P' x ~ w. A positive representative consumer can thus be thOUght of as a fictional individual whose utility maximization problem when facing society's budget set {x E R'+: p'x ~ w} would generate the economy's aggregate demand function. For it to be correct to treat aggregate demand as we did individual demand functions in Section 3.1, there must be a positive representative consumer. ,4 However, although this is a necessary condition for the property of aggregate demand that we seek, it is not sufficient. We also need to be able to assign welfare significance to this Note that if there is a positive representative consumer, then aggregate demand satisfies the positive properties sought in Section 4.C. Indeed, not only will aggregate demand satisCy the weak axiom, but it will also satisCy the strong axiom. Thus, the aggregation property we are aCter in this 14.

section is stronger than the one discussed in Section 4.C.

SECTION

•. 0:

AGGREGATE

DEMAND

AND

REPRESENTATIVE

fictional individual's demand function. This will lead to the definition of a normative representative consumer. To do so, however, we first have to be more specific about what we mean by the term social welfare. We accomplish this by introducing the concept of a social welfare function, a function that provides a summary (social) utility index for any collection of individual utilities. Definition 4.0.2: A (Bergson-Samuelson) social welfare function is a function W: IR' - IR that assigns a utility value to each possible vector (u". ., u,) E IR' of utility levels for the I consumers in the economy. The idea behind a social welfare function W(u" ... , u,) is that it accurately expresses society's judgments on how individual utilities have to be compared to produce an ordering of possible social outcomes. (We do not discuss in this section the issue of where this social preference ranking comes from. Chapters 21 and 22 cover this point in much more detaiL) We also assume that social welfare functions are increasing, concave, and whenever convenient, differentiable. Let us now hypothesize that there is a process, a benevolent central authority perhaps, that, for any given prices p and aggregate wealth level w, redistributes wealth in order to maximize social welfare. That is, for any (p, wi, the wealth distribution (w, (p, w), ... , w,(p, wi) solves (4.0.1) WI •••• ,W,

s.t.I[ •• w, ~ w,

where v,(p, w) is consumer i's indirect utility function."'!· The optimum value of problem (4.0.1) defines a social indirect utility function v(p, wi. Proposition 4.0.1 shows that this indirect utility function provides a positive representative consumer for the aggregate demand function x(p, w) = L,X,(P, w,(p, w)). Proposition 4.0.1: Suppose that for each level of prices p and aggregate wealth w, the wealth distribution (w,(P, w), . .. ,w,(p, wI) solves problem (4.0.1). Then the value function v(p, w) of problem (4.0.1) is an indirect utility function of a positive representative consumer for the aggregate demand function x(p, w) = L,Xj(p, w;(p, w)).

Proof: In Exercise 4.0.2, you are asked to establish that v(p, w) does indeed have the properties of an indirect utility function. The argument for the proof then consists of using Roy's identity to derive a Walrasian demand function from v(p, w), which we denote by XR(P, w), and then establishing that it actually equals x(p, wi. We begin by recording the first-order conditions of problem (4.0.1) for a 15. We assume in this section that our direct utility functions UI(') arc concave. This is a weak hypothesis (once quasiconcavity has been assumed) which makes sure that in all the optimization problems to be considered, the first-order conditions are sufficient Cor the determination oC global optima. In particular, vj(p, .) is then a concave function of W" 16. In Exercise 4.D.1, you are asked to show that if so desired, problem (4.D.1) can be

equivalently Cormulated as one where social utility is maximized, not by distributing wealth, but by distributing bundles oC goods with aggregate value at prices p not larger than w. The fact that in optimally redistributing goods. we can also restrict ourselves to redistributing wealth is, in essence, a version of the second fundamental theorem of welfare economics, which will be covered extensively

in Chapter 16.

CONSUMERS

117

118

CHAPTER

4:

AGGREGATE

DEMAND

SECTION

given value of (p, w). Neglecting boundary solutions, these require that for some

A ~ 0, we have A = oW ~ OV, ow,

= ... = oW OV, OV, ow,

(4.0.2)

(For notational convenience, we have omitted the points at which the derivatives are evaluated.) Condition (4.0.2) simply says that at a socially optimal wealth distribution, the social utility of an extra unit of wealth is the same irrespective of who gets it. By Roy's identity, we have XR{P, w) = - [1/{ov(p, w)/ow)] V,v(p, w). Since vIp, w) is the value function of problem (4.0.l), we know that ov/ow = A. (See Section M.K of the Mathematical Appendix) In addition, for any commodity t, the chain rule and (4.0.2)-or, equivalently, the envelope theorem-give us

~

=

OPt

L 0 W OV, ,ov, OPt

+ A L ow, = L 0 W OV, , ,OPt

,OV, OPt

where the second equality follows because L, w,{p, w) = w for all (p, w) implies that L,(OW;/0Pt) = O. Hence, in matrix notation, we have V,v{p, w) = L,(oW/ov,) V,v,(p, w,(p, w)). Finally, using Roy's identity and the first-order condition (4.0.2), we get xR(p, w) =

-~

L,[_A_J

A

ovdow,

V,v,{p, w,(p, w))

= - L{ov';ow,] V,v,{p, w,(p, w)) = L, x,(p, w,{p, w)) = x(p, w),

as we wanted to show. _ Equipped with Proposition 4.0.1, we can now define a normative representative consumer. Definition 4.0.3: The positive representative consumer ~ for the aggregate demand x(p, w) = LiX;(P, Wi(P, w)) is a normative representative consumer relative to the social welfare function W(·) if for every (p, w), the distribution of wealth (w,(p, w), ... , w/(p, w)) solves problems (4.D.1) and, therefore, the value function of problem (4.D.1) is an indirect utility function for~. If there is a normative representative consumer, the preferences of this consumer have welfare significance and the aggregate demand function x (p, w) can be used to make welfare judgments by means of the techniques described in Section 3.I. In doing so, however, it should never be forgotten that a given wealth distribution rule [the one that solves (4.0.1) for the given social welfare function] is being adhered to and that the "level of wealth" should always be understood as the "optimally distributed level of wealth." For further discussion, see Samuelson (1956) and Chipman and Moore (1979).

Example 4.D.l: Suppose that consumers all have homothetic preferences represented by utility functions homogeneous of degree one. Consider now the social welfare function W(u" ... , u,) = L,a, In u, with a, > 0 and L,a, = 1. Then the optimal

4.D:

AGGREGATE

DEMAND

AND

REPRESENTAT'VE

wealth distribution function [for problem (4.0.1)] is the price-independent rule that we adopted in Section 4.C: w,(p, w) = a,w. (You are asked to demonstrate this fact in Exercise 4.0.6.) Therefore, in the homothetic case, the aggregate demand x(p, w) = L,X,(P, a,w) can be viewed as originating from the normative representative consumer generated by this social welfare function. _ Example 4.D.2: Suppose that all consumers' preferences have indirect utilities of the Gorman form v,(p, w,) = a,(p) + b(p)w,. Note that b(p) does not depend on i, and recall that this includes as a particular case the situation in which preferences are quasilinear with respect to a common numeraire. From Section 4.B, we also know that aggregate demand x(p, w) is independent of the distribution of wealth.'7 Consider now the utilitarian social welfare function L,U,. Then any wealth distribution rule (w,(p, w), ... , w,(p, w)) solves the optimization problem (4.0.1), and the indirect utility function that this problem generates is simply v(p, w) = L,a,(p) + b(p)w. (You are asked to show these facts in Exercise 4.0.7.) One conclusion is, therefore, that when indirect utility functions have the Gorman form [with common b(p)] and the social welfare function is utilitarian, then aggregate demand can always by viewed as being generated by a normative representative consumer. When consumers have Gorman-form indirect utility functions [with common b(p)], the theory of the normative representative consumer admits an important strengthening. In general, the preferences of the representative consumer depend on the form of the social welfare function. But not in this case. We now verify that if the indirect utility functions of the consumers have the Gorman form [with common b(p)], then the preferences of the representative consumer are independent of the particular social welfare function used. '8 In fact, we show that vip, w) = L,a,(p) + b(p)w is an admissible indirect utility function for the normative representative consumer relative to any social welfare function W(u" ... , u,). To verify this claim, consider a particular social welfare function W(·), and denote the value function of problem (4.0.1), relative to W(·), by v*(p, w). We must show that the ordering induced by v(·) and v* (-) is the same, that is, that for any pair (p, w) and (p', w') with vip, w) < vip', w'), we have v*(p, w) < v*(p', w'). Take the vectors of individual wealths (w" ... , WI) and (w'" ... , w~) reached as optima of (4.0.1), relative to W(·), for (p, w) and (p', w'), respectively. Denote u, = a,(p) + b(p)w" u; = a,(p) + b(p)w;, u = (u" ... , u/), and u' = (u'" ... , u~). Then v*(p, w) = W(u) and v*(p', w') = W(u'). Also vIp, w) = L,a,(p) + b(p)w = L,U" and similarly, vip', w') = L,U;. Therefore, vip, w) < vIp', w') implies L,U, < L,U;. We argue that VW(u')'(u - u') < 0, which, W(-) being concave, implies the desired result, namely W(u) < W(u').'· By expression (4.0.2), at an optimum we have (oW/ov,)(ov,/ow,) = A for all i. But in our case, ov,/ow, = b(p) for all i. Therefore, oW/ov, = oW/ov j > 0 for any i,j. Hence, L,U, < L,U; implies VW(u')'(u - u') < O. The previous point can perhaps be better understood if we observe that when 17. As usual, we neglect the non negativity constraints on consumption. 18. Bu~ of course, the optimal distribution rules will typically depend on the social welfare function. Only for the utilitarian social welfare function will it not matter how wealth is distributed. 19. Indeed, concavity of W(·) implies W(u') + VW(u')'(u - u')" W(u); see Section M.e of the Mathematical Appendix.

CONSUMERS

119

SEC T ION

120

c HAP T E R

.:

A G G REG ATE

preferences have the Gorman form [with common b(p)], then (p', w') is socially better than (p, w) for the utilitarian social welfare function 1:,u, if and only ifwhen compared with (p, w), (p', w') passes the following potential compensation test: For any distribution (Wi' ... ' w,) of w, there is a distribution (W'lo .. ·' w~) of w' such that v,(p', w;) > v,(p, w;) for all i. To verify this is straightforward. Suppose that

(L,a,(p')

+ b(p')w') -

A G G " EGA TED E MAN DAN D

REP R E 9 E N TAT I VEe 0 N SUM E R S

121

have been required to have the Gorman form [with common b(p)]. •

It is important to stress the distinction between the concepts of a positive and a normative representative consumer. It is not true that whenever aggregate demand can be generated by a positive representative consumer, this representative consumer's preferences have normative content. It may even be the case that a positive representative consumer exists but that there is no social welfare function that leads to a normative representative consumer. We expand on this point in the next few paragraphs [see also Dow and Werlang (1988) and Jerison (1994)]. We are given a distribution rule (WI(P, w), ... , w,(p, w» and assume that a positive representative consumer with utility function u(x) exists for the aggregate demand x(p, w) = L,XI(P, w,(p, w)). In principle, using the integrability techniques presented in Section 3.H, it should be possible to determine the preferences of the representative consumer from the knowledge of x(p, w). Now fix any (p, w), and let x = x(p, w). Relative to the aggregate consumption vector x, we can define an at-Ieast-as-good-as set for the representative consumer: B

A =

= {xe R~:u(x) ~ (x)} c

x, =

x,(p,

R~.

w,), and consider the set

{x = LIX,: XI~,XI for all i}

C

R~.

In words, A is the set of aggregate consumption vectors for which there is a distribution of commodities among consumers that makes every consumer as well off as under (XI' ... ,x,), The boundary of this set is sometimes called a ScilOvsky contour. Note that both set A and set B are supported by the price vector p at X (see Figure 4.D.I). If the given wealth distribution comes from the solution to a social welfare optimization problem of the type (4.D.I) (i.e., if the positive representative consumer is in fact a normative 20. We continue to neglect nonnegativity constraints on wealth.

Figure 4.0.1

X,

Comparing the at-Ieast·as-good-as set of the p()sitive representative consumer with the

+ b(p)w) = c > O. by a,(p') + b(p')w; = a,(p) + b(p)w, + c/I

Exercise 4.D.8). The two properties just presented-independence of the representative consumer's preferences from the social welfare function and the potential compensation criterion-will be discussed further in Sections IO.F and 22.C. For the moment, we simply emphasize that they are not general properties of normative representative consumers. By choosing the distribution rules that solve (4.D.l), we can generate a normative representative consumer for any set of individual utilities and any social welfare function. For the properties just reviewed to hold, the individual preferences

Next, let W, = w,(p, w) and

X,

(L,a,(p)

Then the wealth levels w; implicitly defined will be as desired.lO Once we know that (p', w') when compared with (p, w) passes the potential compensation test, it follows merely from the definition of the optimization problem (4.D.l) that (p', w') is better than (p, w) for any normative consumer, that is, for any social welfare function that we may wish to employ (see

.>

•. 0:

DE MAN D

~=

A

sum oflhe at-Ieast·as-good-as sets of the individual consumers.

A

B

(a) The positive representative

x,

XI

(a)

__ B

consumer could be a

(b)

normative representative consumer.

A = {x E 1:, XI: u,(x,) ~ u;C'i,) for all i} B = {XE R:: u(x) ~ u(.'i)}

(b) The positive

representative consumer), then this places an important restriction on how sets A and B relate to each other: Every element of set A must be an element of set B. This is so because the social welfare function underlying the normative representative consumer is increasing in the utility level of every consumer (and thus any aggregate consumption bundle that could be distributed in a manner that guarantees to every consumer a level of utility as high as the levels corresponding to the optimal distribution of X must receive a social utility higher than the latter; see Exercise 4.0.4). That is, a necessary condition for the existence of a normative representative consumer is that A c B. A case that satisfies this necessary condition is depicted in Figure 4.0.I(a). However, there is nothing to prevent the existence, in a particular setting, of a positive representative consumer with a utility function u(x) that fails to satisfy this condition, as in Figure 4.0.I(b). To provide some further understanding of this point, Exercise 4.0.9 asks you to show that A c B implies that LSI(P, WI) - S(p, w) is positive semidefinite, where S(p, w) and S,(p, Wi) are the Slutsky matrices of aggregate and individual demand, respectively. Informally, we could say that the substitution effects of aggregate demand must be larger in absolute value than the sum of individual substitution effects (geometrically, this corresponds to the boundary of B being flatter at x than the boundary of A). This observation allows us to generate in a simple manner examples in which aggregate demand can be rationalized by preferences but, nonetheless, there is no normative representative consumer. Suppose, for example, that the wealth distribution rule is of the form wl(p, w) = ~IW. Suppose also that S(p, w) happens to be symmetric for all (p, w); if L = 2, this is automatically satisfied. Then, from integrability theory (see Section 3.H), we know that a sufficient condition for the existence of underlying preferences is that, for all (p, w), we have dp'S(p, w) dp < 0 for all dp #' 0 not proportional to p (we abbreviate this as the n.d. property). On the other hand as we have just seen, a necessary condition for the existence of a normative representativ; consumer is that C(p, w) = L,S,(P, WI) - S(p, w) be positive semidefinite [this is the same matrix discussed in Section 4.C; see expression (4.C.8»). Thus, if S(p, w) has the n.d. property ~or all (p, w) but C(p, w) is not positive semidefinite [i.e., wealth effects are such that S(p, w) IS "less negative" than LSI(p, w)), then a positive representative consumer exists that

Hi

nonetheless, cannot be made normative for any social welfare function. (Exercise 4.0. provides an instance where this is indeed the case.) In any example of this nature we have moves in aggregate consumption that would pass a potential compensation test (each consumer's welfare could be made better off by an appropriate distribution of the move) but are regarded as socially inferior under the utility function that rationalizes aggregate demand. [In Figure 4.0.I(b), this could be the move from x to x'.] The moral of all this is clear: The existence of preferences that explain behavior is not

representative consumer cannot be a normative representative consumer.

122

C HAP T E R

.:

A G G REG ATE

D E MAN D

EXERC'SES

123

------------------------------------------------------------------------~ enough to attach to them any welfare significance. For the latter, it is also necessary that these preferences exist for the right reasons. _

Hence, the aggregate demand x,('), shown in Figure 4.AA.I(b), is a nice continuous function even though none of the individual demand correspondences are so. Note that with only a finite number of consumers, the distribution function G(') cannot quite be a continuous function; but if the consumers are many, then it can be nearly continuous. _

APPENDIX A: REGULARIZING EFFECTS OF AGGREGATION

The regularizing effects of aggregation are studied again in Section 17.1. We show there that in general (i.e., without dispersedness requirements). the aggregation of numerous individual demand correspondences will generate a (nearly) convex-valued average demand correspondence.

This appendix is devoted to making the point that although aggregation can be deleterious to the preservation of the good properties of individual demand, it can also have helpful regularizing effects. By regularizing, we mean that the average (per-consumer) demand will tend to be more continuous or smooth, as a function of prices, than the individual components of the sum. Recall that if preferences are strictly convex, individual demand functions are continuous. As we noted, aggregate demand will then be continuous as well. But average demand can be (nearly) continuous even when individual demands are not. The key requirement is one of dispersion of individual preferences.

REFERENCES Chipman, 1. S., and J. Moore. (1979). On social welfare functions and the aggregation of preferences. Journal of Economic Theor)! 21: 111-39. Deaton. A., and 1. Muellbauer. (1980). Economics and Consumer Behavior. Cambridge. UK: Cambridge University Press.

Example 4.AA.l: Suppose that there are two commodities. Consumers have quasilinear preferences with the second good as numeraire. The first good, on the other hand, is available only in integer amounts, and consumers have no wish for more than one unit of it. Thus, normalizing the utility of zero units of the first good to be zero, the preferences of consumer i are completely described by a number v". the utility in terms of numeraire of holding one unit of the first good. It is then clear that the demand for the first good by consumer i is given by the correspondence

if PI
128

CHAPTER

5:

PRODUCTION

Section 5.D analyzes in detail the geometry associated with cost and production relationships for the special but theoretically important case of a technology that produces a single output. Aggregation theory is studied in Section 5.E. We show that aggregation on the supply side is simpler and more powerful than the corresponding theory for demand covered in Chapter 4. Section 5.F constitutes an excursion into welfare economics. We define the concept of efficiellt production and study its relation to profit maximization. With some minor qualifications, we see that profit-maximizing production plans are efficient and that when suitable convexity properties hold, the converse is also true: An efficient plan is profit maximizing for an appropriately chosen vector of prices. This constitutes our first look at the important ideas of the fundamental theorems of welfare economics.

In Section 5.G, we point out that profit maximization does not have the same primitive status as preference maximization. Rigorously, it should be derived from the latter. We discuss this point and related issues. In Appendix A, we study in more detail a particular, important case of production technologies: Those describable by means of linear constraints. It is known as the linear activity model.

S.B Production Sets As in the previous chapters, we consider an economy with L commodities. A production vector (also known as an input-output, or netput. vector. or as a production plan) is a vector y = (y, • ...• YL) E RL that describes the (net) outputs of the L commodities from a production process. We adopt the convention that positive numbers denote outputs and negative numbers denote inputs. Some elements of a production vector may be zero; this just means that the process has no net output of that commodity. Example S.B.l: Suppose that L = 5. Then y = ( - 5. 2. - 6.3.0) means that 2 and 3 units of goods 2 and 4, respectively. are produced. while 5 and 6 units of goods 1 and 3. respectively, are used. Good 5 is neither produced nor used as an input in this production vector. _ To analyze the behavior of the firm. we need to start by identifying those production vectors that are technologically possible. The set of all production vectors that constitute feasible plans for the firm is known as the production set and is denoted by Y c RL. Any y E Y is possible; any y f; Y is not. The production set is taken as a primitive datum of the theory. The set of feasible production plans is limited first and foremost by technological constraints. However. in any particular model, legal restrictions or prior contractual commitments may also contribute to the determination of the production set. It is sometimes convenient to describe the production set Y using a function F(·). called the transformation function. The transformation function F(') has the property that Y = {y E IRL: F(y) ~ O} and F(y) = 0 if and only if y is an element of the boundary of Y. The set of boundary points of Y, {y E RL: F(y) = OJ, is known as the transformation frontier. Figure 5.B.l presents a two-good example.

SEC T ION

5. 8:

PRO D U C T ION

SET S

y, VF(j')

Slope

= - M RT12 (y) y, Transformation frontier Iy: F(y)

= O}

Figure 5.B.1

The production set

and transfonnation frontier.

If F(') is differentiable. and if the production vector y satisfies F(Y) = 0, then for any commodities t and k, the ratio M RT. (-) = of(y)/Oy( tI y of(y)/oy,

is called the marginal rate of transformation (M R T) of good t for good k at y.' The marginal rate of transformation is a measure of how much the (net) output of good k can increase if the firm decreases the (net) output of good ( by one marginal unit. Indeed, from F(Y) = 0, we get

+ of(Y) d

of(Y) d

oy,

y,

oYt

- 0

Yt -

•

and therefore the slope of the transformation frontier at y in Figure 5.8.1 is precisely -MRTdY)·

Technologies with Distinct Inputs and Outputs In many actual production processes. the set of goods that can be outputs is distinct from the set that can be inputs. In this case. it is sometimes convenient to notationally distinguish the firm's inputs and outputs. We could, for example. let q = (qh ... , qM) 2: 0 denote the production levels of the firm's M outputs and z = (z" ... , ZL-M) 2: 0 denote the amounts of the firm's L - M inputs. with the convention that the amount of input Z( used is now measured as a nonnegative number (as a matter of notation, we count all goods not actually used in the process as inputs). One of the most frequently encountered production models is that in which there is a single output. A single-output technology is commonly described by means of a production function f(z) that gives the maximum amount q of output that can be produced using input amounts (z, •...• ZL-,);::>: O. For example, if the output is good L, then (assuming that output can be disposed of at no cost) the production function f(·) gives rise to the production set: Y={(-z" ... , -zL-,.q):q-f(z, •... ,zL_,)~O

and (z" ... ,ZL_,)2:0}.

Holding the level of output fixed. we can define the marginal rate of technical I. As in Chapter 3, in computing ratios such as this, we always assume that aF(y)/ay, " O.

129

130

C HAP T E R

5:

SECTION

PROD U C T ION

substitution (MRTS) of input

t

y,

for input k at i as

5.B:

PROOUCTION

SETS

131

y,

MRTS (i) = Jf(i)/Jz t Ik J f(i)jJzk The number M RTS,,(i) measures the additional amount of input k that must be used to keep output at level q = f(i) when the amount of input t is decreased marginally. It is the production theory analog to the consumer's marginal rate of substitution. In consumer theory, we look at the trade-off between commodities that keeps utility constant, here, we examine the trade-off between inputs that keeps the amount of output constant. Note that M R TS" is simply a renaming of the marginal rate of transformation of input t for input k in the special case of a single-output, many-input technology.

Figure S.B.2

Y,

(a)

The no free lunch property. (a) Violates no free lunch. (b) Satisfies no free lunch.

Y,

(b)

Example S.B.2: The Cobb-Doug/as Production Function The Cobb-Douglas production function with two inputs is given by f(z"z,) = t;z1, where <X ~ 0 and p ~ O. The marginal rate of technical substitution between the two inputs at z = (z" z,) is MRTS 12 (z) = <xz,/pz, .•

Restricted Production Set

y,

Properties oj Production Sets We now introduce and discuss a fairly exhaustive list of commonly assumed properties of production sets. The appropriateness of each of these assumptions depends on the particular circumstances (indeed, some of them are mutually exclusive). ' (i) Y is nonempty. This assumption simply says that the firm has something it can plan to do. Otherwise, there is no need to study the behavior of the firm in question. (ii) Y is closed. The set Y includes its boundary. Thus, the limit of a sequence of technologically feasible input-output vectors is also feasible; in symbols, y' -+ y and y" E Y imply y E Y. This condition should be thought of as primarily technical. 3 (iii) No free lunch. Suppose that y E Yand y ~ 0, so that the vector y does not use any inputs. The no-free-Iunch property is satisfied if this production vector cannot produce output either. That is, whenever y E Y and y ~ 0, then y = 0; it is not possible to produce something from nothing. Geometrically, Y n R'i. c {O}. For L = 2, Figure 5.B.2(a) depicts a set that violates the no-free-lunch property, the set in Figure 5.B.2(b) satisfies it. (iv) Possibility of inaction This property says that 0 E Y: Complete shutdown is possible. Both sets in Figure 5.B.2, for example, satisfy this property. The point in time at which production possibilities are being analyzed is often important for the validity of this assumption. If we are contemplating a firm that could access a set of technological possibilities but that has not yet been organized, then inaction is clearly 2. For further discussion of these properties, see Koopmans (1957) and Chapter 3 of Debreu (1959). 3. Nonetheless, we show in Exercise 5.B.4 that there is an important case of economic interest when it raises difficulties.

L

Sunk Costs Figure S.B.3

Y,

y, Y, y,

(a)

(b)

possible. But if some production decisions have already been made, or if irrevocable contracts for the delivery of some inputs have been signed, inaction is not possible. In that case, we say that some costs are sunk. Figure 5.B.3 depicts two examples. The production set in Figure 5.B.3(a) represents the interim production possibilities units of good I arising when the firm is already committed to use at least (perhaps because it has already signed a contract for the purchase of this amount); that is, the set is a restricted production set that reflects the firm's remaining choices from some original production set Y like the ones in Figure 5.B.2. In Figure 5.B.3(b), we have a second example of sunk costs. For a case with one output (good 3) and two inputs (goods I and 2), the figure illustrates the restricted production set arising when the level of the second input has been irrevocably set at y, < 0 [here, in contrast with Figure 5.B.3(a), increases in the use of the input are impossible].

-.v,

(v) Free disposal. The property of free disposal holds if the absorption of any additional amounts of inputs without any reduction in output is always possible. That is, if y E Y and y' ::s; y (so that y' produces at most the same amount of outputs using at least the same amount of inputs), then y' E Y. More succinctly, Y - R'i. c Y (see Figure 5.B.4). The interpretation is that the extra amount of inputs (or outputs) can be disposed of or eliminated at no cost.

Two production sets with sunk costs. (a) A minimal level of expenditure committed. (b) One kind of input fixed.

132

CHAPTER

5,

PRODUCTION

SEC T ION

5. B:

PRO D U C T ION

5ETS

133

y,

.liz

y,

" y

"

. . . , . . . Sunk "Setup Costs,

YI YI

YI

(a) (a)

YI

(b)

(vi) Irreversibility. Suppose that y E Yand y ;" O. Then irreversiblity says that - y '" Y. In words, it is impossible to reverse a technologically possible production vector to transform an amount of output into the same amount of input that was used to generate it. If, for example, the description of a commodity includes the time of its availability, then irreversibility follows from the requirement that inputs be used before outputs emerge.

Figure 5.B.4 (left)

The free disposal property.

(vii) Nonincreasing returns to scale. The production technology Y exhibits nonincreasing returns to scale if for any y E Y, we have lXy E Yfor all scalars IX E [0,1]. In words, any feasible input-output vector can be scaled down (see Figure 5.B.5). Note that nonincreasing returns to scale imply that inaction is possible [property (iv)]. (viii) N ondecreasing returns to scale. In contrast with the previous case, the production process exhibits nondecreasing returns to scale if for any y E Y, we have lXy E Y for any scale IX ;;>: I. In words, any feasible input-output vector can be scaled up. Figure 5.B.6(a) presents a typical example; in the figure, units of output (good 2) can be produced at a constant cost of input (good 1) except that in order to produce at all, a fixed setup cost is required. It does not matter for the existence of nondecreasing returns if this fixed cost is sunk [as in Figure 5.8.6(b)] or not [as in Figure 5.8.6(a), where inaction is possible]. (ix) Constant returns to scale. This property is the conjunction of properties (vii) and (viii). The production set Yexhibits constant returns to scale if y E Y implies lXy E Y for any scalar IX ;;>: O. Geometrically, Y is a cone (see Figure 5.8.7).

(b)

Example 5,B.3: Returns to Scale with the Cobb-Douglas Production Function: For the Cobb-Douglas production function introduced in Example 5.B.2, f(2z" 2:,) = 2"+'z~z~ = 2"+'f(z"z,). Thus, when IX + P= 1, we have constant returns to scale; when IX + p < 1, we have decreasing returns to scale; and when IX + p > 1, we have increasing returns to scale. _

The noninereasing property. (a) Nonincreasing returns satisfied. (b) Nonincreasing returns violated

(x) Additivity (or free entry). Suppose that y E Y and y' E Y. The additivity property requires that y + y' E Y. More succinctly, Y + Y c Y. This implies, for example, that ky E Y for any positive integer k. In Figure 5.B.8, we see an example where Y is additive. Note that in this example, output is available only in integer amounts (perhaps because of indivisibilities). The economic interpretation of the additivity condition is that if y and y' are both possible, then one can set up two plants that do not interfere with each other and carry out production plans y and y' independently. The result is then the production vector y + y'. Additivity is also related to the idea of entry. If y E Y is being produced by a firm and another firm enters and produces y' E Y, then the net result is the vector y + y'. Hence, the aggregate production set (the production set describing feasible production plans for the economy as a whole) must satisfy additivity whenever unrestricted entry, or (as it is called in the literature) free entry, is possible,

returns to scale

property.

A technology satisfying the constant returns to scale

property.

(xi) Convexity. This is one of the fundamental assumptions of microeconomics.

It postulates that the production set Y is convex. That is, if y, y' E Yand IX E [0, 1], then lXy + (1 - IX)Y' E Y. For example, Y is convex in Figure 5.B.5(a) but is not convex in Figure 5.8.5(b).

Flgur. 5.B.8

y,

A production set satisfying the additivity property.

For single-output technologies, properties of the production set translate readily into properties of the production function f(·). Consider Exercise 5.B.2 and Example 5.8.3. Exercise 5.B.2: Suppose that f(·) is the production function associated with a single-output technology, and let Y be the production set of this technology. Show that Y satisfies constant returns to scale if and only if f( .) is homogeneous of degree

Flgur. S.B.6 (I.ft)

The nondecreasing

Flgur. 5.B.7 (right)

Flgur. 5.B.S (right)

returns to scale

Exercise 5.B.1: Draw two production sets: one that violates irreversibility and one that satisfies this property.

one.

V .1

YI

YI

134

CHAPTER

.:

PRODUCTION

SECTION

S.C:

PROfiT

MAXIMIZATION

AND

COST

MINIMIZATION

135

~---------------------------------~~~~=

The convexity assumption can be interpreted as incorporating two ideas about production possibilities. The first is nonincreasing returns. In particular, if inaction is possible (i.e., if 0 E y), then convexity implies that Y has nonincreasing returns to scale. To see this, note that for any <X E [0, 1], we can write <xy = <xy + (1 - <x)0. Hence, if y E Y and 0 E Y, convexity implies that <xy E Y. Second, convexity captures the idea that "unbalanced" input combinations are not more productive than balanced ones (or, symmetrically, that "unbalanced" output combinations are not least costly to produce than balanced ones). In particular, if production plans y and i produce exactly the same amount of output but use different input combinations, then a production vector that uses a level of each input that is the average of the levels used in these two plans can do at least as well as either y or y'. Exercise 5.B.3 illustrates these two ideas for the case of a single-output technology. Exercise 5.B.3: Show that for a single-output technology, Y is convex if and only if the production function f(z) is concave. (xii) Y is a convex cone. This is the conjunction of the convexity (xi) and constant returns to scale (ix) properties. Formally, Y is a convex cone if for any production vector y, y' E Yand constants <X ~ 0 and P~ 0, we have <xy + Pi E Y. The production set depicted in Figure 5.B.7 is a convex cone. An important fact is given in Proposition 5.B.1. Proposition 5.B.1: The production set Y is additive and satisfies the non increasing returns condition if and only if it is a convex cone. Proof: The definition of a convex cone directly implies the nonincreasing returns and additivity properties. Conversely, we want to show that if nonincreasing returns and additivity hold, then for any y, y' E Yand any <X > 0, and P> 0, we have <xy + Py' E Y. To this effect, let k be any integer such that k > Max {<X, Pl. By additivity, ky E Yand ky' E Y. Since (<x/k) < 1 and <xy = (<x/k)ky, the nonincreasing returns condition implies that <XYE Y. Similarly, Py'E Y. Finally, again by additivity, <xy+PY'E Y. • Proposition 5.B.l provides a justification for the convexity assumption in production. Informally, we could say that if feasible input-output combinations can always be scaled down, and if the simultaneous operation of several technologies without mutual interference is always possible, then, in particular, convexity obtains. (See Appendix A of Chapter 11 for several examples in which there is mutual interference and, as a consequence, convexity does not arise.) It is important not to lose sight of the fact that the production set describes technology, not limits on resources. It can be argued that if all inputs (including, say, entrepreneurial inputs) are explicitly accounted for, then it should always be possible to replicate production. After all, we are not saying that doubling output is actually feasible, only that in principle it would be possibte if all inputs (however esoteric, be they marketed or not) were doubled. In this view, which originated with Marshall and has been much emphasized by McKenzie (1959), decreasing returns must reflect the scarcity of an underlying, unlisted input of production. For this reason, some economists believe that among models with convex technologies the constant

returns model is the most fundamental. Proposition 5.B.2 makes this idea precise. Proposition S.B.2: For any convex production set Y c RL with 0 E Y, there is a constant returns, convex production set Y' c RL such that Y = {y E RL (y, -1) E Y').

+,

JJ ("Entrepreneurial --.--.--::~a.._-1_

input") )" Figure 5.8.9

A constant returns production set with an .. entrepreneurial

factor."

Proof: Simply let Y' 5.B.9) •

=

(y'

E RL+ ':

y' = a(y, -1) for some y E Y and a 2! 0). (See Figure

The additional input included in the extended production set (good L + 1) can be called the "entrepreneurial factor." (The justification for this can be seen in Exercise 5.C.12; in a competitive environment, the return to this entrepreneurial factor is precisely the firm's profit.) In essence, the implication of Proposition 5.B.2 is that in a competitive, convex setting, there may be little loss of conceptual generality in limiting ourselves to constant returns technologies.

5.C Profit Maximization and Cost Minimization In this section, we begin our study of the market behavior of the firm. In parallel to our study of consumer demand, we assume that there is a vector of prices quoted for the L goods, denoted by p = (PI' ... ,pLl » 0, and that these prices are independent of the production plans of the firm (the price-taking assumption). We assume throughout this chapter that the firm's objective is to maximize its profit. (It is quite legitimate to ask why this should be so, and we will offer a brief discussion of the issue in Section 5.G.) Moreover, we always assume that the firm's production set Y satisfies the properties of nonemptiness, c/osedness, and free disposal (see Section 5.B).

The Profit Maximization Problem Given a price vector p » 0 and a production vector y E RL , the profit generated by implementing y is p' y = L.~= I PI y(. By the sign convention, this is precisely the total revenue minus the total cost. Given the technological constraints represented by its production set Y, the firm's profit maximization problem (PMP) is then Max

P'Y

(PMP)

s.t. yE Y.

Using a transformation function to describe Y, F('), we can equivalently state the PMP as Max

p'y

s.t. F(y)

~

O.

SEC T ION

136

CHAPTER

5:

P A 0 FIT

M It X I M I Z It T ION

A. N 0

COS T

In words, the price vector p and the gradient VF(y·) are proportional (Figure S.C.I depicts this fact). Condition (S.C.I) also yields the following ratio equality: PI/Pt = MRTfk(Y·) for all t, k. For L = 2, this says that the slope of the transformation frontier at the profit-maximizing production plan must be equal to the negative of the price ratio, as shown in Figure S.c.!. Were this not so, a small change in the firm's production plan could be found that increases the firm's profits.

y,

6F(y(p))

Slope = -~ p,

y,

-
Pl' Exercise S.c.!: Prove that, in general, if the production set Yexhibits nondecreasing returns to scale, then either n(p) :S 0 or n(p) = +00. If the transformation function F(') is differentiable, then first-order conditions can be used to characterize the solution to the PMP. If y. E y(p), then, for some ,( 2': 0, y. must satisfy the first-order conditions =

Pt

5. C:

,. I N I M I Z A T ION

137

--------------------------------------------------------~~~~~~

PRODUCTION

I A

aF(y·)

for

t

= I, ... , L

aYt

or, equivalently, in matrix notation, p =;, VF(y·).

When Y corresponds to a single-output technology with differentiable production function f(:), we can view the firm's decision as simply a choice over its input levels Z. In Ihis special case, we shall let the scalar p > 0 denote the price of the firm's output and the vector w » 0 denote its input prices." The input vector z· maximizes profit given (p, w) if it solves Max pf(z) - W'Z. :~o

If z· is optimal, then the following first-order conditions must be satisfied for ( = I, ... ,L -I: af(z·) p :S w" with equality if z~ > 0,

-a-z(

or, in matrix notation, p VJ(z·) :S w

and

[p Vf(z·) - w]-z· = 0.'

(S.C.2)

Thus, the marginal product of every input ( actually used (i.e., with z7 > 0) must equal its price in terms of output, Wt/p. Note also that for any two inputs t and k with (z1, z:J» 0, condition (S.C.2) implies that MRTStt = wtlw,; that is, the marginal rate of technical substitution between the two inputs is equal to their price ratio, the economic rate of substitution between them. This ratio condition is merely a special case of the more general condition derived in (S.C.!). If the production set Y is convex, then the first-order conditions in (S.C.!) and (S.C.2) are not only necessary but also sufficient for the determination of a solution to the PMP. Proposition S.C.I, which lists the properties of the profit function and supply correspondence, can be established using methods similar to those we employed in Chapter 3 when studying consumer demand. Observe, for example, that mathematically the concept of the profit function should be familiar from the discussion of duality in Chapter 3. In fact, n(p) = -Jl-r(P), where Jl-r(P) = Min {p-(-Y):YE Y} is the support function of the set - Y. Thus, the list of important properties in Proposition S.C.I can be seen to follow from the general properties of support functions discussed in Section 3.F.

(S.C.I) 6. Up to now, we have always used the symbol p for an overall vector of prices; here we use it

4. We use the term supply correspondence to keep the parallel with the demand terminology of the consumption side. Recall however that y(p) is more properly thought or as the firm's net supply to the market. In particular, the negative entries of a supply vector should be interpreted as demand for inputs. 5. Rigorously, to allow ror the possibility that n(p) = +00 (as well as ror other cases where no profit-maximizing production plan exists), the profit runction should be defined by n(p) = Sup {p' y: Y E f}. We will be somewhat loose. however, and continue to use Max while allowing ror this possiblity.

only for the output price and we denote the vector of input prices by w. This notation is fairly standard. As a rule of thumb. unless we are in a context of explicit classification of commodities as inputs or outputs (as in the single-output case), we will continue to use p to denote an overall vector

or prices p = (p" ... , pd. 7. The concern over boundary conditions arises here. but not in condition (S.C.l), because the assumption of distinct inputs and outputs requires that z ~ 0, whereas the formulation leading to (5.C.1) allows the net output of every good to be either positive or negative. Nonetheless, when using the first-order conditions (5.C.2), we will typically assume that z· »0.

5 E C T ION

138

c HAP T E R

5:

PRO

Proposition S.C.1: Suppose that n(') is the profit function of the production set Y and that y(.) is the associated supply correspondence. Assume also that Y is closed and satisfies the free disposal property. Then (i) n(') is homogeneous of degree one. (ii) n(-) is convex. (iii) If Y is convex, then Y= (YE[RLp.y:; n(p) for all p»O}. (iv) y(.) is homogeneous of degree zero. (v) If Y is convex, then y(p) is a convex set for all p. Moreover, if Y is strictly convex, then y(p) is single-valued (if nonempty). (vi) (Hotelling's lemma) If y(P) consists of a single point, then n(') is differentiable at p and Vn(p) = y(p). (vii) If y(.) is a function differentiable atp, then Oy(p) = 02n(p) is a symmetric and positive semidefinite matrix with Oy(p)p = O. Properties (ii), (iii), (vi), and (vii) are the nontrivial ones. Exercise 5.C.2: Prove that n(') is a convex function [Property (ii) of Proposition 5.C.1]. [Hint: Suppose that y E y(exp + (I - ex)p'). Then n(exp

+ (I

- ex)p') = exp' y + (I - ex)p" y :; exn(p)

+ (I

- ex)n(p').]

Property (iii) tells us that if Y is closed, convex, and satisfies free disposal, then n(p) provides an alternative ("dual") description of the technology. As for the indirect

utility function's (or expenditure function's) representation of preferences (discussed in Chapter 3), it is a less primitive description than Y itself because it depends on the notions of prices and of price-taking behavior. But thanks to property (vi), it has the great virtue in applications of often allowing for an immediate computation of supply. Property (vi) relates supply behavior to the derivatives of the profit function. It is a direct consequence of the duality theorem (Proposition 3.F.!). As in Proposition 3.G.I, the fact that Vn(p) = y(p) can also be established by the related arguments of the envelope theorem and of first-order conditions. The positive semidefiniteness of the matrix Dy(p) in property (vii), which in view of property (vi) is a consequence of the convexity of n('), is the general mathematical expression of the law of supply: Quantities respond in the same direction as price changes. By the sign convention, this means that if the price of an output increases (all other prices remaining the same), then the supply of the output increases; and if the price of an input increases, then the demand for the input decreases. Note that the law of supply holds for any price change. Because, in contrast with demand theory, there is no budget constraint, there is no compensation requirement of any sort. In essence, we have no wealth effects here, only substitution effects. In nondifferentiable terms, the law of supply can be expressed as (5.C.3) (p - p,).(y - y') ~ 0 for all p, p', y E y(p), and y' E y(p'). In this form, it can also be established by a straightforward revealed preference argument. In particular, (p _ p').(y _ y') = (p'y _ p.y')

5. C:

PRO FIT

M A X 1M' Z A T ION

AND

COS T

MIN I M I Z A T ION

139

~---------------------------------------------------------~~~~

Due T ION

+ (p"y'

- p'.y) ~ 0,

where the inequality follows from the fact that y E y(p) and y' E y(p') (i.e., from the fact that y is profit maximizing given prices p and y' is profit maximizing for prices p'). Property (vii) of Proposition 5.C.1 implies that the matrix Dy(p), the supply substitution matrix, has properties that parallel (although with the reverse sign) those for the substitution matrix of demand theory. Thus, own-substitution effects are nonnegative as noted above [aYt(p)/aPr ~ 0 for all t], and substitution effects are symmetric [aYt(p)japk = ah(p)/aPr for all t, k]. The fact that Dy(p)p = 0 follows from the homogeneity of y(.) [property (iv)] in a manner similar to the parallel property of the demand substitution matrix discussed in Chapter 3.

Cost Minimization An important implication of the firm choosing a profit-maximizing production plan is that there is no way to produce the same amounts of outputs at a lower total input cost. Thus, cost minimization is a necessary condition for profit maximization. This observation motivates us to an independent study of the firm's cost minimization problem. The problem is of interest for several reasons. First, it leads us to a number of results and constructions that are technically very useful. Second, as we shall see in Chapter 12, when a firm is not a price taker in its output market, we can no longer use the profit function for analysis. Nevertheless, as long as the firm is a price taker in its input market, the results flowing from the cost minimization problem continue to be valid. Third, when the production set exhibits nondecreasing returns to scale, the value function and optimiZing vectors of the cost minimization problem, which keep the levels of outputs fixed, are better behaved than the profit function and supply correspondence of the PMP (e.g., recall from Exercise 5.C.! that the profit function can take only the values 0 and +00). To be concrete, we focus our analysis on the single-output case. As usual, we let z be a nonnegative vector of inputs, f(z) the production function, q the amounts of output, and w » 0 the vector of input prices. The cost minimization problem (CMP) can then be stated as follows (we assume free disposal of output): Min

w'z

,;,0

s.t. f(z)

~

q.

(CMP)

The optimized value of the CMP is given by the cost function c(w, q). The corresponding optimizing set of input (or factor) choices, denoted by z(w, q), is known as the conditional factor demand correspondence (or function if it is always singJevalued). The term conditional arises because these factor demands are conditional on the requirement that the output level q be produced. The solution to the CMP is depicted in Figure 5.C.2(a) for a case with two inputs. The shaded region represents the set of input vectors z that can produce at least the amount q of output. It is the projection (into the positive orthant of the input space) of the part of the production set Y than generates output of at least q, as shown in Figure 5.C.2(b). In Figure 5.C.2(a), the solution Z(lV, q) lies on the iso-cost line (a line in [R2 on which all input combinations generate equal cost) that intersects the set (z E IR~ : f(z) ~ q} closest to the origin. If z· is optimal in the CMP, and if the production function f(·) is differentiable,

140

C HAP T E R

5:

SEC T ION

PROD U C T ION

f(z)

Slope = _":'.t. WI

FIgure 5.C.2

:z:

W',

=

z,

C} with c > c(w, q)

(a)

(b)

then for some A ~ 0, the following first-order conditions must hold for every input

t

=

I, ... ,L - I:

w, > -

A af(z")

az,'

with equality if z," > 0,

or, in matrix notation, w ~ AVf(z") and [w - AVf(z")]'z" = 0. (5.C.4) As with the PMP, if the production set Y is convex [i.e., if f(-) is concave], then condition (5.CA) is not only necessary but also sufficient for z" to be an optimum in the CMp· Condition (5.C.4), like condition (5.C.2) of the PMP, implies that for any two inputs t and k with (Zto zo) » 0, we have M RTS(o = wtlwo. This correspondence is to be expected because, as we have noted, profit maximization implies that input choices are cost minimizing for the chosen output level q. For L = 2, condition (5.C.4) entails that the slope at z" of the isoquant associated with production level q is exactly equal to the negative of the ratio of the input prices - w,/w 2 , Figure 5.C.2(a) depicts this fact as well. As usual, the Lagrange multiplier A can be interpreted as the marginal value of relaxing the constraint f(z") ~ q. Thus, A equals ac(w, q)/aq, the marginal cost of production.

Note the close formal analogy with consumption theory here. Replace f(·) by u( . ), q by u, and z by x (i.e., interpret the production function as a utility function), and the CMP becomes the expenditure minimization problem (EMP) discussed in Section 3.E. Therefore, in Proposition 5.C.2, properties (i) to (vii) of the cost function and conditional factor demand correspondence follow from the analysis in Sections 3.E to 3.G by this reinterpretation. [You are asked to prove properties (viii) and (ix) in Exercise 5.C.3.] Proposition S.C.2: Suppose that c(w, q) is the cost function of a single-output technology Y with production function f(') and that z(w, q) is the associated 8. Note, however, that the first~order conditions are sufficient for a solution to the eMP as long as the set {z: f(z) 2: q} is convex. Thus, the key condition for the sufficiency of the first-order conditions of the CMP is the quasiconcavity of f(·). This is an important fact because the quasiconcavity of f(·) is compatible with increasing returns to scale (see Example S.C.I).

The cost minimization problem. (a) Two inputs. (b) The isoquant as a section of the production set.

.

}-

•. C:

PRO FIT


AND

COS T

conditional factor demand correspondence. Assume also that Y is closed and satisfies the free disposal property. Then (i) c(·) is homogeneous of degree one in wand nondecreasing in q. (ii) c(·) is a concave function of w. (iii) If the sets {z ~ 0: f(z) I corresponds to increasing returns. The conditional factor demand equations and cost function have exactly the same form, and are derived in exactly the same way, as the expenditure function in Section 3.E (see Example 3.E.1; the only difference in the computations is that we now do not impose (1 + P= I):

z,(w" w2 , q) =

q'"'+~I(exW2/PW.)~/(·+~I,

w2 , q) =

ql/('+~I(PW,/exW2)'/(O+~I,

Z2(W"

and c(w" w 2 , q) = q 1/(, + ~I[(a! P)~II'+ ~I

+ (ex! P) -'/(O+~I] w'i/('+ ~lw~IIO+ ~I .

This cost function has the form c(w" w2 , q) = q'/('+~IIJc/>(w" w2 ), where IJ = [(ex!P)~/(O+~1 + (I1./P)-·/(·+~I] is a constant and r!>(w" w2 ) = w~/('+~'wf(O+~' is a function that does not depend on the output level q. When we have constant returns, 1Jc/>(w" w2 ) is the per-unit cost of production. One way to derive the firm's supply function and profit function is to use this cost function and solve problem (5.C.5). Applying (5.C.6), the first-order condition for this problem is

p s; IJc/>(w"

W2

)C ~ p)q(1/(O+~II-',

with equality if q > 0

(5.C.7)

The first-order condition (5.C.7) is sufficient for a maximum when ex + Ps; \ because the firm's cost function is then convex in q. When fl + P< I, (5.C.7) can be solved for a unique optimal output level:

q(w" W2' p) = (ex

+ P)[p!IJc/>(w"

W2)J·+~I/(1-·-~I.

The factor demands can then be obtained through substitution,

zAw" w2, p) = zAw" w2 , q(w"

W2,

p»

for

t

= 1,2,

as can the profit function, ]t(w" w" p) = pq(w" w" p) -

W'

z(w" w" q(w" w2 , p».

When ex + P= I, the right-hand side of the first-order condition (5.C.7) becomes 1Jc/>(w" w,), the unit cost of production (which is independent of q). If 1Jc/>(w" w,) is greater than p, then q = 0 is optimal; if it is smaller than p, then no solution exists (again, unbounded profits can be obtained by increasing q); and when 1Jc/>(w" w,) = p, any non-negative output level is a solution to the PMP and generates zero profits. Finally, when 11. + P> I (so that we have increasing returns to scale), a quantity q satisfying the first-order condition (5.C.7) does not yield a profit-maximizing production. [Actually, in this case, the cost function is strictly concave in q, so that

i

L

5.0:

GEOMETRY

OF

COST

AND

SUPPLY

IN

SINGLE-OUTPUT

CASE

143

~---------------------------~~~~~ any solution to the first-order condition (5.C.7) yields a local minimum of profits, subject to output being always produced at minimum cost]. Indeed, since p > 0, a doubling of the output level starting from any q doubles the firm's revenue but increases input costs only by a factor of 21/(0+ ~I > 2. With enough doublings, the firm's profits can therefore be made arbitrarily large. Hence, with increasing returns to scale, there is no solution to the PMP . •

- 5.D The Geometry of Cost and Supply Single-Output Case

III

the

In this section, we continue our analysis of the relationships among a firm's technology, its cost function, and its supply behavior for the special but commonly used case in which there is a single output. A significant advantage of considering the single-output case is that it lends itself to extensive graphical illustration. Throughout, we denote the amount of output by q and hold the vector of factor prices constant at Iii» O. For notational convenience, we write the firm's cost function as C(q) = c(Iii, q). For q > 0, we can denote the firm's average cost by AC(q) = C(q)/q and assuming that the derivative exists, we denote its marginal cost by C'(q) = dC(q)/dq. Recall from expression (5.C.6) that for a given output price p, all profitmaximizing output levels q E q(p) must satisfy the first-order condition [assuming that C(q) exists]: p S; C'(q)

with equality if q > O.

(5.0.1)

If the production set Y is convex, C(.) is a convex function [see property (ix) of Proposition 5.C.2], and therefore marginal cost is nondecreasing. In this case, as we noted in Section 5.C, satisfaction of this first-order condition is also sufficient to establish that q is a profit-maximizing output level at price p. Two examples of convex production sets are given in Figures 5.0.1 and 5.0.2. In the figures, we assume that there is only one input, and we normalize its price to equal I (you can think of this input as the total expense of factor use).'o Figure 5.0.1 depicts the production set (a), cost function (b), and average and marginal cost functions (c) for a case with decreasing returns to scale. Observe that the cost function is obtained from the production set by a 90-degree rotation. The determination of average cost and marginal cost from the cost function is shown in Figure 5.D.1(b) (for an output level 4). Figure 5.0.2 depicts the same objects for a case with constant returns to scale. In Figures 5.D.I(c) and 5.D.2(c), we use a heavier trace to indicate the firm's profit-maximizing supply locus, the graph of q(-). (Note: In this and subsequent figures, the supply locus is always indicated by a heavier trace.) Because the technologies in these two examples are convex, the supply locus in each case coincides exactly with the (q, p) combinations that satisfy the first-order condition (5.0.1). If the technology is not convex, perhaps because of the presence of some underlying indivisibility, then satisfaction of the first-order necessary condition to. Thus, the single input can be thought of as a Hicksian composite commodity in a sense analogous to that in Exercise lG.S.

144

C HAP T E R

5,

SECTION

PROD U C T ION

5.0,

GEOMETRY

OF

COST

AN~

SUPPLY

IN

SINGLE·OUTPUT

CASE

C;(q) = C(q)

Heavy Trace is Supply Locus q( p)

C'(q)

Figure 5.0.1

A strictly convex

AC(q)

technology (strictly scale). (a) Production set. (h) Cost function. (c) Average cost, marginal cost, and supply.

-z (c)

(h)

(aJ

y

-z (a)

(h)

FTgur.5.1),4

Strictly convex variable costs with a nonsunk setup cost. (a) Production set. (b) Cost function. (c) Average cost, marginal cost, and supply.

decreasing returns to

y

(c)

~."(')

C(q) C(q) q(p)

/

y

AC(q)

= C(q)

-z (c)

(h)

(a)

Flgur. 5.0.2

FTgur. 5.1).5

A constant returns to

Constant returns variable costs with a nonsunk setup cost. (a) Production set. (b) COSI function. (c) Avera ge cost, marginal cost, and supply.

scale technology. (a) Production set. (b) Cost function. (c) Average cost, marginal cost, and supply.

Flgur. 5.0.3

/ /"'-

A nonconvex

I

I

q(p)

I

/ / Slope: = AC@ = C@ /

I

-z (a)

(h)

(c)

(5.0.1) no longer implies that q is profit maximizing. The supply locus will then be only a subset of the set of (q, p) combinations that satisfy (5.0.1). Figure 5.0.3 depicts a situation with a nonconvex technology. In the figure, we have an initial segment of increasing returns over which the average cost decreases and then a region of decreasing returns over which the average cost increases. The level (or levels) of production corresponding to the minimum average cost is called the efficient scale, which, if unique, we denote by ij. Looking at the cost functions in Figure 5.D.3(a) and (b), we see that at ij we have AC(ij) = C(ij). In Exercise 5.0.1, you are asked to establish this fact as a general result. Exercise 5.0.1: Show that AC(ij) = C(ij) at any ij satisfying AC(ij) ~ AC(q) for all q. Does this result depend on the differentiability of C(') everywhere? The supply locus for this nonconvex example is depicted by the heavy trace in

technology. (a) Production set. (b) Cost function. (c) Average cost, marginal cost, and supply.

P

---------C;(q)

y q(p)

-z (a)

(h)

(c)

Figure 5.D.3(c). When p > AC(ij), the firm maximizes its profit by producing at the unique level of q satisfying p = C'(q) > AC(q). [Note that the firm earns strictly positive profits doing so, exceeding the zero profits earned by setting q = 0, which in tum exceed the strictly negative profits earned by choosing any q > 0 with p = C'(q) < AC(q).] On the other hand, when p < AC(ij), any q > 0 earns strictly negative profits, and so the firm's optimal supply is q = 0 [note that q = 0 satisfies the necessary first-order condition (5.0.1) because p < C(O)]. When p = AC(ij), the profit-maximizing set of output levels is {O, ij}. The supply locus is therefore as shown in Figure 5.D.3(c). An important source of nonconvexities is fixed setup costs. These mayor may not be sunk. Figures 5.0.4 and 5.0.5 (which parallel 5.0.1 and 5.0.2) depict two cases with nonsunk fixed setup costs (so inaction is possible). In these figures, we consider a case in which the firm incurs a fixed cost K if and only if it produces a positive amount of output and otherwise has convex costs. In particular, total cost is of the form C(O) = 0, and C(q) = C,(q) + K for q > 0, where K > 0 and C,(q), the variable cost function, is convex [and has C,(O) = 0]. Figure 5.0.4 depicts the case in which C,(') is strictly convex, whereas C,(') is linear in Figure 5.0.5. The supply loci are indicated in the figures. In both illustrations, the firm will produce a positive amount of output only if its profit is sufficient to cover not only its variable costs but also the fixed cost K. You should read the supply locus in Figure 5.D.5(c) as saying that for p> p, the supply is "infinite," and that q = 0 is optimal for p ~ p. In Figure 5.0.6, we alter the case studied in Figure 5.0.4 by making the fixed costs sunk, so that C(O) > O. In particular, we now have C(q) = C,(q) + K for all q ~ 0; therefore, the firm must pay K whether or not it produces a positive quantity.

145

SECTION

146

CHAPTER

5:

C(q)

Figure 5

Strictly variabl~

sunk c{ (a) (b) (c)

----,-

Not in Y

(a)

Pro Co, Ave

margin;

supply.

(c)

(b)

Although inaction is not possible here, the firm's cost function is convex, and so we are back to the case in which the first-order condition (5.0.1) is sufficient. Because the firm must pay K regardless of whether it produces a positive output level, it will not shut down simply because profits are negative. Note that because C,(') is convex and C,(O) = 0, p = C;(q) implies that pq > C,(q); hence, the firm covers its variable costs when it sets output to satisfy its first-order condition. The firm's supply locus is therefore that depicted in Figure 5.D.6(c). Note that its supply behavior is exactly the same as if it did not have to pay the sunk cost K at all [compare with Figure 5.D.I(c)].

As we noted in Section 5.B, one source of sunk costs, at least in the short run, is input choices irrevocably set by prior decisions. Suppose, for example, that we have two inputs and a production function f(z " z,). Recall that we keep the prices of the two inputs fixed at (w" w,). In Figure 5.D.7(a), the cost function excluding any prior input commitments is depicted by C(.). We call it the long-run cost function. If one input, say z" is fixed at level Z2 in the short-run, then the short-run cost function of the firm becomes C(qIZ2) = w,z, + W2Z" where z, is chosen so that f(z"i 2) = q. Several such short-run cost functions corresponding to different levels of Z2 are illustrated in Figure 5.D.7(a). Because restrictions on the firm's input decisions can only increase its costs of production, C(qlz,) lies above C(q) at all q except the q for

C(q I

',I

\

C(q I :~)

/-

AC(ql z;)

I

C(q)

AC(q I ,,)

AC(q I z;)

I

\

which Z2 is the optimal long-run input level [i.e., the q such that z,(w, q) = i2l Thus, C(qIZ2(W,q» = C(q) for all q.lt follows from this and from the fact that C(q'IZ2(W, q» ~ C(q') for all q', that C(q) = C(qIZ2(W, q» for all q; that is, if the level of Z2 is at its long-run value, then the short-run marginal cost equals the long-run marginal cost. Geometrically, C(.) is the lower envelope of the family of short-run functions C(ql=,) generated by letting z 2 take all possible values. Observe finally that given the long-run and short-run cost functions, the long-run and short-run average cost functions and long-run and short-run supply functions of the firm can be derived in the manner discussed earlier in the section. The average-cost version of Figure 5.D.7(a) is given in Figure 5.D.7(b). (Exercise 5.0.3 asks you to investigate the short-run and long-run supply behavior of the firm in more detail.)

In this section, we study the theory of aggregate (net) supply. As we saw in Section 5.C, the absence of a budget constraint implies that individual supply is not subject to wealth effects. As prices change, there are only substitution effects along the production frontier. In contrast with the theory of aggregate demand, this fact makes for an aggregation theory that is simple and powerful." Suppose there are J production units (firms or, perhaps, plants) in the economy, each specified by a production set Y" ... , ~. We assume that each lj is nonempty, closed, and satisfies the free disposal property. Denote the profit function and supply correspondences of lj by nip) and Yip), respectively. The aggregate supply correspondence is the sum of the individual supply correspondences:

Exercise 5.0.2: Depict the supply locus for a case with partially sunk costs, that is, where C(q) = K + C,(q) if q > 0 and 0 < C(O) < K.

J

y(p) = L Yip) = (YEIRL: Y = LYj for some YjEYip),j= I,." ,J}. j"" 1

Figure!

Costs \ level is short r

Assume, for a moment, that every Yi') is a single-valued, differentiable function at a price vector p. From Proposition 5.C.1, we know that every DYj(p) is a symmetric, positive semidefinite matrix. Because these two properties are preserved under addition, we can conclude that the matrix Dy(p) is symmetric and positive semidefinite. As in the theory of individual production, the positive semidefiniteness of Dy(p) implies the law of supply in the aggregate: If a price increases, then so does the corresponding aggregate supply. As with the law of supply at the firm level, this property of aggregate supply holds for all price changes. We can also prove this aggregate law of supply directly because we know from (5.C.3) that (p - p')'[Yip) - Yip')] ~ 0 for every j; therefore, adding over j, we get (p - p')'[y(p) - y(p')]

vary in (a) LOI

short-r funetio (b)

LOI

short-r cost.

(a)

5.E:

PRODUCTION

q such that

q such that

z,(w,q) = "

z,(w,q) (b)

= z,

~

O.

The symmetry of Dy(p) suggests that underlying Y( p) there is a "representative producer." As we now show, this is true in a particularly strong manner. Given Y" ... , ~, we can define the aggregate production set by

Y = Y,

+ ... +

~ = (y E IRL: Y = Lj Yj for some Yj E ~,j = I, ... , J }.

11. A classical and very readable account for the material in this section and in Section 5.F is Koopmans (1957).

AGGREGATION

147

148

C HAP T E R

5:

PRO D U C T ION

SECTION

The aggregate production set Y describes the production vectors that are feasible in the aggregate if all the production sets are used together. Let 1[*(p) and y*(p) be the profit function and the supply correspondence of the aggregate production set Y. They are the profit function and supply correspondence that would arise if a single price-taking firm were to operate, under the same management so to speak, all the individual production sets. Proposition 5.E.l establishes a strong aggregation result for the supply side: Tile

5.F:

EFFICIENT PRODUCTION

)"

Figure S.E.1

aggregate profit obtained by each production unit maximizing profit separately taking prices as given is the same as that which would be obtained if they were to coordinate their actions (i.e., their yjs) in a joint profit maximizing decision.

Proposition S.E.1: For all p » 0, we have (i) 1[*(p) = Lj1[j(P) (ii) y*(p) = LjYj(P) (= {4Yj: YjE Yj (p) for every jl)·

Proof: (i) For the first equality, note that if we take any collection of production plans Yj E lj, j = I, ... , J, then LjYj E Y. Because 1[*(') is the profit function associated with Y, we therefore have 1[*(p) 2: P'(LjYj) = LjP·Yj. Hence, it follows that 1[*(p) 2: Lj1[j(P), In the other direction, consider any YE Y. By the definition of the set Y, there are Yj E lj,j= I, ... , J, such that LjYj= y. So P'Y=P'(LjYj)=Lj P' Yj~ Lj1[j(P) for all YE Y. Thus, 1[*(p) ~ Lj1[j(P), Together, these two inequalities imply that 1[*(p) = LI1[(P). (ii) For the second equality, we. must show that LjYj(P) c: y*(p) and that y*(p) c: LjY;\P). For the former relation, consider any set of individual production plans Yj E y;\p),j = I, ... , J. Then P'(LjYj) = Ljp' Yj = Lj1[;(P) = 1[*(p), where the last equality follows from part (i) of the proposition. Hence, Lj Yj E y*(p), and therefore, LjYj(P) c: y*(p). In the other direction, take any YE y*(p). Then Y = LjYj for some Yj E Yj, j = I, ... ,J. Since P'(LjYj) = 1[*(p) = Lj1[;(P) and, for every j, we have P' Yj ~ 1[j(p), it must be that P' Yj = 1[j(p) for every j. Thus, Yj E y;\p) for all j, and so YE LjY;(P)· Thus, we have shown that y*(p) c: LjYj(P), • The content of Proposition 5.E.l is illustrated in Figure 5.E.1. The proposition can be interpreted as a decentralization result: To find the solution of the aggregate profit maximization problem for given prices p, it is enough to add the solutions of the corresponding individual problems. Simple as this result may seem, it nevertheless has many important implications. Consider, for example, the single·output case. The result tells us that if firms are maximizing profit facing output price P and factor prices w, then their supply behavior maximizes aggregate profits. But this must mean that if q = Ljqj is the aggregate output produced by the firms, then the total cost of production is exactly equal to c(w, q), the value of the aggregate cost function (the cost function corresponding to the aggregate production set Y). Thus, the al/ocation of the production of output level q among the firms is cost minimizing. In addition, this allows us to relate the firms' aggregate supply function for output q(p) to the aggregate cost function in the same manner as done in Section 5.0 for an individual firm. (This fact will prove useful when we study partial equilibrium models of competitive markets in Chapter 10.)

loint profit maximization as a result of individual profit maximization.

In summary: If firms maximize profits taking prices as given, then the production side of the economy aggregates beautifully. As in the consumption case (see Appendix A of Chapter 4), aggregation can also have helpful regularizing effects in the production context. An interesting and important fact is that the existence of many firms or plants with technologies that are not too dissimilar can make the average production set almost convex, even if the individual production sets are not so. This is illustrated in Figure 5.E.2, where there are J firms with identical production sets equal to y,

y, Figure S.E.2

~(Y, + ... + Y,)

Y=Y,+"'+y,

J y, (a)

y, (b)

that displayed in 5.E.2(a). Defining the average production set as (lfJ)(>; + ... + Y,) = {yo y = (lIJ)(y, + ... + YJ) for some YI E }j, j = I, ... , J l, we see that for large J, this set is nearly convex, as depicted in Figure 5.E.2(b). 12

Efficient Production Because much of welfare economics focuses on efficiency (see, for example, Chapters 10 and 16), it is useful to have algebraic and geometric characterizations of productions plans that can unambiguously be regarded as nonwasteful. This motivates Definition 5.F.1. !2. Note that this production set is bounded above. This is important because it insures that the lO~ividual nonconvexity is of finite size. If the individual production set Was like that shown in, ~~y, FIgure 5.B.4, where neither the set nor the nonconvexity is bounded, then the average set would IS play a large nonconvexity (ror any J). In Figure 5.8.5, we have a case or an unbounded production set but with a bounded nonconvexity; as for Figure 5.E.2, the average set will in this case be almost convex.

An example of the convexifying effects of aggregation. (a) The individual production set. (b) The average production set.

149

SECTION

150

CHAPTER

5:

FIgure 5.F.l

y,

y,

An efficient production plan must be on the boundary of Y. but not all points on the boundary of Yare efficient. (a) An inefficient production plan in the interior of Y. (b) An inefficient production plan at the boundary of Y. (c) The set of efficient production plans.

Y is not efficient

Y is not efficient

Y

Y·

y

y

y,

)"

(a)

(b)

(c)

Dellnltlon S.F.1: A production vector Y E Y is efficient if there is no y' E Y such that y' :2: Y and y' # y.

efficient. We now show that the concept of efficiency is intimately related to that of supportability by profit maximization. This constitutes our first look at a topic that we explore in much more depth in Chapter 10 and especially in Chapter 16 Proposition 5.F.1 provides an elementary but important result. It is a version of the first Jundamental theorem oj welfare economics.

151

Figure S.F.3

The use of the separatIng hyperplane theorem to prov\! y,

A converse of Proposition 5.F.1 would assert that any efficient production vector is profit maximizing for some price system. However, a glance at the efficient production y' in Figure 5.F.2 shows that this cannot be true in general. Nevertheless, this converse does hold with the added assumption of convexity. Proposition 5.F.2, which is less elementary than Proposition 5.F.l, is a version of the so-called second Jundamental theorem oj welfare economics.

Proof: Suppose otherwise: That there is a y' E Y such that y' # y and y' :2: y. Because p» 0, this implies that p' y' > p' y, contradicting the assumption that y is profit maximizing. _

Proposition S.F.2: Suppose that Y is convex. Then every efficient production y E Y is a profit-maximizing production for some nonzero price vector p :2: 013

It is worth emphasizing that Proposition 5.F.l is valid even if the production set is nonconvex. This is illustrated in Figure 5.F.2. When combined with the aggregation results discussed in Section j.E, Proposition 5.F.I tells us that if a collection oj firms each independently maximizes profits with respect to the same fixed price vector p »0. then the aggregate production is FIgure 5.F.2

A profit-maximizing production plan (for p »0) is efficient.

y,

)'2

Exercise S.F.l: Give an example of ayE Y that is profit maximizing for some p :2: 0 with p # 0 but that is also inefficient (i.e. not efficient).

Proposition S.F.1: If y E Y is profit maximizing for some p» O. then y is efficient.

L

EFFICIENT PRODUCTION

socially efficient. That is, there is no other production plan for the economy as a whole that could produce more output using no additional inputs. This is in line with our conclusion in Section 5.E that, in the single-output case, the aggregate output level is produced at the lowest-possible cost when all firms maximize profits facing the same pnces. The need for strictly positive prices in Proposition 5.F.I is unpleasant, but it cannot be dispensed with, as Exercise 5.F.1 asks you to demonstrate.

In words, a production vector is efficient if there is no other feasible production vector that generates as much output as y using no additional inputs, and that actually produces more of some output or uses less of some input. As we see in Figure 5.F.I, every efficient y must be on the boundary of Y, but the converse is not necessarily the case: There may be boundary points of Y that are not

I

5.F:

PRODUCTION

Proof: This proof is an application of the separating hyperplane theorem for convex sets (see Section M.G of the Mathematical Appendix). Suppose that y E Y is efficient, and define the set Py = {y' E !;lL: y'» y}. The set Py is depicted in Figure 5.F.3. It is convex, and because y is efficient, we have Y" Py = 0. We can therefore invoke the separating hyperplane theorem to establish that there is some p # 0 such that p' y' :2: p' y" for every y' E Py and y" E Y (see Figure 5.F.3). Note, in particular, that this implies p' y' :2: p' y for every y' » y. Therefore, we must have p :2: 0 because if Pt < 0 for some t, then we would have p' y' < p' Y for some y' » y with Yt - Yt sufficiently large. Now take any y" E Y. Then p' y' :2: p' y" for every y' E Py • Because y' can be chosen to be arbitrarily close to y, we conclude that p' y :2: p' y" for any y" E Y; that is, y is profit maximizing for p. _ 13. As the proof makes clear. the result also applies to weakly efficient productions, that is, to productions such as y in Figure 5.F.l(b) where there is no y' E Y such that y'» y.

Proposinon 5.F.2: If Y is convex. every efficient Y E Y is profit maximizing for some p p' y. Hence. we conclude that if we maintain the assumption of price-taking behavior, all owners would agree, whatever their utility functions, to instruct the manager of the firm to maximize profits." I! is worth emphasizing three of the implicit assumptions in the previous reasoning: (i) prices are fixed and do not depend on the actions of the firm, (ii) profits are not uncertain. and (iii) managers can be controlled by owners. We comment on these assumptions very informally. (i) If prices may depend on the production of the firm, the objective of the owners may depend on their tastes as consumers. Suppose, for example, that each consumer has no wealth from sources other than the firm (w; = 0), that L = 2, and that the firm produces good I from good 2 with production function 1(-). Also, normalize the price of good 2 to be I, and suppose that the price of good I, in terms of good 2, is p(q) if output is q. If, for example, the preferences of the owners are such that they care only about the consumption of good 2, then they will unanimously want to solve Max,,,o p(f(z»I(z) - z. This maximizes the amount of good 2 that they get to consume. On the other hand, if they want to consume only good I, then they will wish to solve Max,,,o I(z) - [z/p(f(z»] because if they earn p(I(z))I(z) - = units of good 2, then end up with [p(f(z»I(z) - z]/p(f(z» units of good l. But these two problems have different solutions. (Check the first·order conditions.) Moreover, as this suggests, if the owners differ in their tastes as consumers, then they will not agree about what they want the firm to do (Exercise 5.G.I elaborates on this point.) (ii) If the output of the firm is random, then it is crucial to distinguish whether the output is sold before or after the uncertainty is resolved. If the output is sold after the uncertainty is resolved (as in the case of agricultural products sold in spot markets after harvesting), then the argument for a unanimous desire for profit maximization breaks down. Because profit, and therefore derived wealth, are now uncertain, the risk attitudes and expectations of owners will influence their preferences with regard to production plans. For example, strong risk averters will prefer relatively less risky production plans than moderate risk averters. On the other hand, if the output is sold before uncertainty is resolved (as in the case of agricultural products sold in futures markets before harvesting), then the risk is fully carried by the buyer. The profit of the firm is not uncertain, and the argument for unanimity in favor of profit maximization still holds. In effect, the firm can be thought of as producing a commodity that is sold before uncertainty is resolved in a market of the usual kind. (Further analysis of this issue would take us too far afield. We come back to it in Section 19.G after covering the foundations of decision theory under uncertainty in Chapter 6.) (iii) I! is plain that shareholders cannot usually exercise control directly. They need managers, who, naturally enough, have their own objectives. Especially if ownership is very diffuse, it is an important theoretical challenge to understand how and to what extent managers are, or can be, controlled by owners. Some relevant considerations are factors such as the degree of observability of managerial actions 14. In actuality, there are public firms and quasipublic organizations such as universities that do not have owners in the sense that private firms have shareholders. Their objectives may be different. and the current discussion does not apply to them.

153

154

CHAPTER

5:

APPENDIX

PRODUCTION

A:

THE

LINEAR

ACTIVITY

MODEL

155

~-------------------------------------------------------------------

and the stake of individual owners. [These issues will be touched on in Section 14.C (agency contracts as a mechanism of internal control) and in Section 19.G (stock markets as a mechanism of external contro!).]

APPENDIX A: THE LINEAR ACTIVITY MODEL

The saliency of the model of production with convexity and constant returns to scale technologies recommends that we examine it in some further detail. Given a constant returns to scale technology Y, the ray generated (or spanned) by a vector ji E Y is the set {y E Y: y = aji for some scalar a ~ OJ. We can think of a ray as representing a production activity that can be run at any scale of operation. That is, the production plan ji can be scaled up or down by any factor a ~ 0, generating, in this way, other possible production plans. We focus here on a particular case of constant returns to scale technologies that lends itself to explicit computation and is therefore very important in applications. We assume that we are given as a primitive of our theory a list of finitely many activities (say M), each of which can be run at any scale of operation and any number of which can be run simultaneously. Denote the M activities, to be called the elementary activities, by a, E RL, ... , aM E RL. Then, the production set is M

Y= lYE RL: y =

L a",a

m

Figure S.AA.l

A production set generated by two activities.

incurs a loss (if operated at all). Therefore, A(p) = {a,} and y(p) = {y: y = (X,a, for any scalar (x, ~ OJ, the ray spanned by activity a,. A significant result that we shall not prove is that for the linear activity model the converse of the efficiency Proposition 5.F.l holds exactly; that is, we can strengthen Proposition 5.F.2 to say: Every efficient y E Y is a profit-maximizing production for some p » O.

for some scalars (a" ... ,a M ) ~ OJ.

An important special case of the linear activity model is Leontief's input~output model. It is characterized by two additional features:

m=l

(i) There is one commodity, say the Lth, which is not produced by any activity. For this reason, we will call it the primary factor. In most applications of the Leontief model, the primary factor is labor. (ii) Every elementary activity has at most a single positive entry. This is called the assumption of no joint production. Thus, it is as if every good except the primary factor is produced from a certain type of constant returns production function using the other goods and the primary factor as inputs.

The scalar am is called the level of elementary activity m; it measures the scale of operation of the mth activity. Geometrically, Y is a polyhedral cone, a set generated as the convex hull of a finite number of rays. An activity of the form (0, ... ,0, -\,0, ... ,0), where -\ is in the tth place, is known as the disposal activity for good t. Henceforth, we shall always assume that, in addition to the M listed elementary activities, the L disposal activities are also available. Figure 5.AA.l illustrates a production set arising in the case where L = 2 and M = 2. Given a price vector p E R~, a profit-maximizing plan exists in Y if and only if P'a m ~ 0 for every m. To see this, note that if p'a", < 0, then the profit-maximizing level of activity m is a", = O. If p' a", = 0, then any level of activity m generates zero profits. Finally, if p' a", > 0 for some m, then by making a", arbitrarily large, we could generate arbitrarily large profits. Note that the presence of the disposal activities implies that we must have p E R~ for a profit-maximizing plan to exist. If Pt < 0, then the tth disposal activity would generate strictly positive (hence, arbitrarily large)

The Leontief Input-Output Model with No Substitution Possibilities

~,

profits. For any price vector p generating zero profits, let A(p) denote the set of activities that generate exactly zero profits: A(p) = {am: p'a", = OJ. If a", ~ A(p), then p'a", < 0, and so activity m is not used at prices p. The profit-maximizing supply set y(p) is therefore the convex cone generated by the activities in A(p); that is, yep) = (L_eAIP) (lma",: (X'" ~ OJ. The set y(p) is also illustrated in Figure 5.AA.1. In the figure, at price vector p, activity a, makes exactly zero profits, and activity a2

j

.

The simplest Leontief model is one in which each producible good is produced by only one activity. In this case, it is natural to label the activity that produces good t = 1, ... , L - \ as at = (a'b ... ,au) E RL. So the number of elementary activities M is equal to L - I. As an example, in Figure 5.AA.2, for a case where L = 3, we represent the unit production isoquant [the set {(Z2' Z3): f(z2' Z3) = I}] for the implied production function of good I. In the figure, the disposal activities for goods 2 and 3 are used to get rid of any excess of inputs. Because inputs must be used in fixed proportions (disposal aside), this special case is called a Leontief model with no substitution possibilities. If we normalize the activity vectors so that at( = 1 for all t = 1, ... , L - 1, then the vector (X = (a" ... , (XL-') E RL - ' of activity levels equals the vector of gross production of goods 1 through L - I. To determine the levels of net production, it is convenient to denote by A the (L - 1) x (L - 1) matrix in which the tth column is

156

CHAPTER

5:

PRODUCTION

APPENDIX

"

Unit isoquant }('" ZJ)

=I

production fur for good I in , Leontief mode

" the negative of the activity vector ar except that its last entry has been deleted and entry aIr has been replaced by a zero (recall that entries au with k #- tare nonpositive):

0

-a 12

-a 21

0

-aL-1,l

THE

LINEAR

ACTIVITY

Proof: We will show that if A is productive, then the inverse of the matrix (1 - A) exists and is nonnegative. This will give the result because we can then achieve net output levels c E R~-l by setting the (nonnegative) activity levels a = (1- A)~ 'c. FIgure 5.AA.2

A_[

A:

-a L -

-""J -a2.L-l

1• 2

0

The matrix A is known as the Leontief input-output matrix. Its ktth entry, k is needed to produce one unit of good t. We also denote by be RL - ' the vector of primary factor requirements, b = (-aLl>"" -a L •L -,). The vector (I - A)Cl then gives the net production levels of the L - I outputs when the activities are run at levels Cl = (Cl" ... , ClL-')' To see this, recall that the activities are normalized so that the gross production levels of the L - I produced goods are exactly Cl = (Cl" ... , "'L-')' On the other hand, ACl gives the amounts of each of these goods that are used as inputs for other produced goods. The difference, (I - A)Cl, is therefore the net production of goods I, ... , L - I. In addition, the scalar b'''' gives the total use of the primary factor. In summary, with this notation, we can write the set of technologically feasible production vectors (assuming free disposal) as - akt ~ 0, measures how much of good

Y = {Y: y;5; [I =bAJCl for some Cl E R'i,}. If (I - A)ii » 0 for some i ~ 0, the input-output matrix A is said to be productive. That is, the input-output matrix A is productive if there is some production plan that can produce positive net amounts of the L - I outputs, provided only that there is a sufficient amount of primary input available. A remarkable fact of Leontief input-output theory is the all-or-nothing property stated in Proposition 5.AA.1. Proposition S.AA.l: If A is productive, then for any nonnegative amounts of the L - 1

producible commodities C E R'i,-', there is a vector of activity levels Cl ~ 0 such that (I - A)Cl = c. That is, if A is productive, then it is possible to produce any nonnegative net amount of outputs (perhaps for purposes of final consumption), provided only that there is enough primary factor available.

no substitutiOl

To prove the claim, we begin by establishing a matrix-algebra facl. We show Ihat if A is productive. then the matrix r,:=o A", where A" is the nth power of A, approaches a limit as N - ,:0. Because A has only nonnegative entries, every entry ofL:=o A" is nondecreasing with N. Therefore, to establish that r,:=o A" has a limit, it suffices to show that there is an upper bound for its entries. Since A is productive, there is an a and c »0 such that c = (1 - A)a. If

we premultiply both sides of this equality by r,:=o A", we geqr,:=. A")c = (I - A N+ ')a (recall that A· = I). But (1- A N + ')a" ii because all elements of the matrix A N+' are nonnegative. Therefore. (r,:=o A")c " ii. With c» 0, this implies that no entry of r.:=o A" can exceed {Max {a" ... , at~ d/Min {c" ... , Ct_ d], and so we have established the desired upper bound. We conclude, therefore, that :L:'=o A" exists. The fact that r..ro=o A" exists must imply that Iim N _ ro AN =0. Thus, since (r.:=o A")(/ -A)= (/ - AN+t) and lim N _ ro (1- AN+t) = I, it must be that r,:'.o A" = (/ - A)~ '. (If A is a single number, this is precisely the high-school fonnula for adding up the tenns of a geometric series.) The conclusion is that (/ - A) ~, exists and that all its entries are nonnegative. This establishes the result. _ The focus on r,:=o A" in the proof of Proposition 5.AA.1 makes economic sense. Suppose we want to produce the vector of final consumptions e E R~~ '. How much total production will be needed? To produce final outputs e = AOe, we need to use as inputs the amounts A(Aoe) = Ae of produced goods. In tum, to produce these amounts requires that A(Ae) = A'e of additional produced goods be used, and so on ad infinitum. The total amounts of goods required to be produced is therefore the limit of (r,:.o A")e as N ~ 00. Thus, we can conclude that the vector e 2: 0 will be producible if and only if r,:,.o A" is well defined (i.e., all its entries are finite). Example S.AA,I: Suppose that L = 3, and let a, = (I, -I, - 2) and a2 = ( - p, I, -4) for some constant p ~ O. Activity levels Cl = (Cl,. Cl2) generate a positive net output of good 2 if "" > Cl,; they generate a positive net output of good I if Cl, - PCl 2 > O. The input-output matrix A and the matrix (1- A)-' are

A=[~ ~J

and

I [I

(I -A)-' = - I-P

I

Hence, matrix A is productive if and only if P < 1. Figure 5.AA.3(a) depicts a case where A is productive. The shaded region represents the vectors of net outputs that can be generated using the two activity vectors; note how the two activity vectors can span all of R~. In contrast, in Figure 5.AA.3(b), the matrix A is not productive: No strictly positive vector of net outputs can be achieved by running the two activities at nonnegative scales. [Again, the shaded region represents those vectors that can be generated using the two activity vectors. here a set whose only intersection with R~ is the point (0,0)]. Note also that the closer Pis to the value I, the larger the levels of activity required to produce any final vector of consumptions. _

The Leontief Model with Substitution Possibilities We now move to the consideration of the general Leontief model in which each good may have more than one activity capable of producing it. We shall see that the

MODEL

157

158

c HAP T E R 5:

,--__------------------------------------~A~P~P:E~N~D~I~X~:A~:~T~H::E~l~I~N~E~A~R~A~C~T~I~V~'~T~V~~M~O~D~E~l~~1~5~9

PRO D U C T ION

Z,

Y2

Y2

Slope

y,

y,

(b)

ial

Figure 5.AA.3

=2

Use az and d,

d,=(-I.O.O)} . d, = (0. -1.0) DISposal d, = (0. O. -1) ActIVIties ", = (1. -2. -1) "2 = (1. -1. -2)

Leontief model of Example 5.AA.1. (a) Productive (P < I). (b) Unproductive (p? I).

Slope =

i Figure 5.AA.4

/iz 2• Z,) = 1 properties of the nonsubstitution model remain very relevant for the more general case where substitution is possible. The first thing to observe is that the computation of the production function of a good. say good I. now becomes a linear programming problem (see Section M.M of the Mathematical Appendix). Indeed. suppose that a, E ilL •. ..• aM, E RL is a list of M, elementary activities capable of producing good I and that we are given initial levels of goods 2•...• L equal to Z2" ..• ZL' Then the maximal possible production of good 1 given these available inputs f(Z2" ..• ZL) is the solution to the problem

We also know from linear programming theory that the L - 1 dual variables (A2' ...• A ) of this problem (i.e .• the multipliers associated with the L - 1 constraints) L can be interpreted as the marginal productivities of the L - 1 inputs. More precisely. for any t=2 •...• L. we have (of/oz()+ s,A(s,(ofl oz()-. where (ofloz()+ and (iJf/{)z()- are. respectively. the left-hand and right-hand tth partial derivatives of f(-) at (Z2' ...• ZL)' Figure 5.AA.4 illustrates the unit isoquant for the case in which good 1 can be produced using two other goods (goods 2 and 3) as inputs with two possible activities a, = (1. _ 2. - I) and a2 = (1. - 1. - 2). If the ratio of inputs is either higher than 2 or lower than one of the disposal activities is used to eliminate any excess inputs. For any vector y E ilL. it will be convenient to write y = (y -L. YL). where y -L = (y, •. ..• YL-I)' We shall assume that our Leontief model is productive in the sense that there is a technologically feasible vector Y E Y such that y -L » O. A striking implication of the Leontief structure (constant returns. no joint products. single primary factor) is that we can associate with each good a single opcimalcechnique (which could be a mixture of several of the elementary techniques corresponding to that good). What this means is that optimal techniques (one for each output) supporting efficient production vectors can be chosen independently of the particular output vector that is being produced (as long as the net output of every producible good is positive). Thus. although substitution is possible in principle. efficient production requires no substitution of techniques as desired final consumption levels change. This is the content of the celebrated non-substitucion

t.

theorem (due to Samuelson [1951]).

Z2

Proposition S.AA.2: (The Nonsubstitution Theorem) Consider a productive Leontief Input-output model With L - 1 producible goods and M( ~ 1 elementary activilies for the producible good t = 1•... , L - 1. Then there exist L - 1 activities (a" . ..• aL -,) • .with a( possibly a nonnegative linear combination of the M, elementary acllvltles for producing good t. such that all efficient production vectors With y _ L » 0 can be generated with these L - 1 activities. Proof: Let y E Y be an efficient production vector with y _ L » O. As a general matter, the vector y must be generated by a collection of L - I activities (a, •... , a L -,) (some of these may be "mixtures" of the original activities) run at activity levels (a, •...,' a L- I ),» 0; that is, y = L7;; I a,a,. We show that any efficient production plan Y With Y-L» 0 ~an be achieved USIng the activities (a l , · · · , a L- I ). SInce Y E Y IS effiCient, there exists a p » 0 such that Y is profit maximizing with respect to p (thiS IS from Proposition 5.F.2. as strengthened for the linear activity model). From p'a( S, 0 for all t = I •... , L - I. (1.( > 0, and

0= p'y = p.(Li ' (1.tat) = Li ' t= 1

~tP·a(.

(='l

it follows that p'at = 0 for all t = I •... , L - I. 0 W e want to h Consider' now any other efficient production y' E Y with y'- L )'.",.. s ow that y can be generated from the activities (al> ...• a L- I ). Denote by A the Input-output matnx associated with (a l , . . . • aL_I)' Because y -L»O. it follows by defimtlOn that A IS productive. Therefore, by Proposition 5.AA.I. we know that there are actlVlty levels (a';, ...• (1.~_I) such that the production vector y" = ",L-I~" Q h " _ 'N' ~t; 1 I.O.t , as Y-L - Y-L' ote thatsIncep'at = Ofora lit = I, .. .• L -I. we must have P'Y" = 0 Thus, y" IS profit maximizing ,~o.r p » 0 (recall that the maximum profits for p ar~ zero). and so It follows that Y IS effiCient by Proposition 5.F.1. But then we have two productIOn vectors. y' and y", with y'-L = y'~L' and both are efficient. It must ther~fore. be that y~ = y/.. Hence. we conclude that y' can be produced using only the activities (a l • . . . , aL - I ), which IS the desired result. _ The nonsubstitution theorem depends critically on the presence of only one

Unit isoquant of production function of good 1. in the Leootid model with substitution.

160

C HAP T E R

5:

PROD U C T , 0 N

primary factor. This makes sense. With more than one primary factor, the optimal choice of techniques should depend on the relative prices of these factors. In turn, it is logical to expect that these relative prices will not be independent of the composition of final demand (e.g., if demand moves from land-intensive goods toward labour-intensive goods, we would expect the price of labor relative to the price of land to increase). Nonetheless, it is worth mentioning that the nonsubstitution result remains valid as long as the prices of the primary factors do not change. For further reading on the material discussed in this appendix see Gale (1960).

EXERC'SES

161

~---------------------------------------------~~~~~ (a) Describe the three-dimensional production set associated with these 'wo techniques. Assume free disposal. (b) Give sufficien' conditions on
S.c.g" Alpha Incorporated (AI) produces a single output q from two inputs z, and z,. You are assigned to determine AI's technology. You are given 100 monthly observations. Two of these monthly observations are shown in the following table: I nput prices

Input levels

Out put price

Output level

is closed. S.B.sc Show that if Y is closed and convex, and - R';. c Y, then free disposal holds. S.B.6" There are three goods. Goods 1 and 2 are inputs. The third, with amounts denoted by q, is an output. Output can be produced by two techniques that can be operated simultaneously or separately. The techniques are not necessarily linear. The first (respectively, the second) technique uses only the first (respectively, the second) input. Thus, the first (respectively, the second) technique is completely specified by 0 and y such that D(L) = PV(L) + y for every L E !f'. Proof: Begin by choosing two lotteries [. and L with the property that [.;:: L ;:: L for all L E!f'6 If [. - L. then every utility function is a constant and the result follows immediately. Therefore. we assume from now on that [. >- L. 6. These best and worst lotteries can be shown to exist. We could, for example. choose a maximizer and a minimizer of the linear, hence continuous, function U(-) on the simplex of probabilities, a compact set.

174

CHAPTER

8:

Note first that if U(·) is a v.N-M expected utility function and O(L) = fJU(L)

+ y,

then 0(1

SEC T ION

(

CHOICE UNDER UNCERTAINTY

a.[fJU(L,)

~

Exercise 6.B.2: Show that if the preference relation ;:: on fI' is represented by a utility function U(·) that has the expected utility form, then ;:: satisfies the independence axiom.

+y

+ y]

1= 1

The expected utility theorem, the central result of this section, tells us that the converse is also true.

K

L

=

a.O(L.).

lI;=l

The Expected Utility Theorem

Since 0(·) satisfies property (6.B.1), it has the expected utility form. For the reverse direction, we want to show that if both 0(·) and U(·) have the expected utility form, then constants fJ > and y exist such that O(L) = fJU(L) + y for all L E fI'. To do so, consider any lottery L E fI', and define AL E [0, I] by

The expected utility theorem says that if the decision maker's preferences over lotteries satisfy the continuity and independence axioms, then his preferences are representable by a utility function with the expected utility form. It is the most important result in the theory of choice under uncertainty, and the rest of the book bears witness to its useful ness. Before stating and proving the result formally, however, it may be helpful to attempt an intuitive understanding of why it is true. Consider the case where there are only three outcomes. As we have already observed, the continuity axiom insures that preferences on lotteries can be represented by some utility function. Suppose that we represent the indifference map in the simplex, as in Figure 6.8.5. Assume, for simplicity, that we have a conventional map with one-dimensional indifference curves. Because the expected utility form is linear in the probabilities, representability by the expected utility form is equivalent to these indifference curves being straight, parallel lines (you should check this). Figure 6.8.5(a) exhibits an indifference map satisfying these properties. We now argue that these properties are, in fact, consequences of the independence axiom. Indifference curves are straight lines if, for every pair of lotteries L, L', we have that L - L' implies aL + (1 - alL' - L for all ~ E [0,1]. Figure 6.B.5(b) depicts a situation where the indifference curve is not a straight line; we have L' - L but

°

U(L) = ALU(L)

+ (I

- AL)U(L)·

Thus

A _ U(L) - U(L)

(6.8.2)

L - U(L) - U(L)

I.

Since AL U(L) + (I - AL)U(L) = U()'LL + (I - AL)L) and U(·) repr.:sents the preferences ;::, it must be that L - ALL + (I - AL)L. But if so, then since UC) is also linear and represents these same preferences, we have O(L) = OPLL + (I - AL)L) = ALO(L)

+ (I

- AL)O(L)

= AL( O(L) - O(L)

+ O(L).

Substituting for AL from (6.8.2) and rearranging terms yields the conclusion that O(L) = fJU(L)

+ y, where fJ

175

If:

K

L

E X PEe TED UTI LIT Y THE 0 R Y

.'"

~,L') = fJ u ( 1 a,L.) + y = fJ[1 a,U(L,)]

•. B:

~----------------------Note that if a preference relation ::::: on fI' is representable by a utility function U(·) that has the expected utility form, then since a linear utility function is continuous, it follows that;:: is continuous on fI'. More importantly, the preference relation::::: must also satisfy the independence axiom. You are asked to show this in Exercise 6.8.2.

= O(L) -

O(L) U(L)- U(L)

and _ O(L) - O(L) y = U(L) - U(L) U(L) _ U(L)·

- L. This

SEC T ION

tL' + iL >- iL + iL.

(6.B.3)

But since L - C, the independence axiom implies that we must have !L' + iL 1L + !L (see Exercise 6.B.l). This contradicts (6.B.3), and so we must conclude that indifference curves are straight lines. Figure 6.B.5(c) depicts two straight but nonparallel indifference lines. A violation of the independence axiom can be constructed in this case, as indicated in the figure. There we have L;::: L' (in fact, L - L'), but !L + 1L";::: tL' + 1L" does not hold for the lottery L" shown in the figure. Thus, indifference curves must be parallel, straight lines if preferences satisfy the independence axiom. In Proposition 6.B.3, we formally state and prove the expected utility theorem. Proposition 6.B.3: (Expected Utility Theorem) Suppose that the rational preference relation;::: on the space of lotteries !l' satisfies the continuity and independence axioms. Then;::: admits a utility representation of the expected utility form. That is, we can assign a number un to each outcome n = I, ... ,N in such a manner that for any two lotteries L = (p" ... ,PN) and L' = (p;, ... ,piv), we have N

I

L

unP~'

If L >- L' and IX E (0, I), chen L

>- IXL + (I

- - al + (I - -

The existence of aL is established in a manner similar to that used in the proof of Proposition

Seep 4.

177

-

leI. Specifically, define the sets

>- L'.

IXL + (1 - IX)L>-IXL + (1- IX)L'>-- L, then he can conclude that L' >- L. Indeed, if L" >- L, then there is an indifference curve separating these two lotteries, as shown in the figure, and it follows from the fact that indifference curves are a family of paraliel straight lines that there is also an indifference curve separating L' and L, so that L' >- L. Note that this type of inference is not possible using only the general

. As a descriptive. theory, however, the expected utility theorem (and, by implication, Its central assumption, the mdependence axiom), is not without difficulties. Examples 6.B.2 and 6.B.3 are deSigned to test its plausibility. Example 6.8.2: The AI/ais Paradox. This example, known as the Allais paradox [from Allais (1953)], constitutes the oldest and most famous challenge to the expected utility theorem. It ts a thought experiment. There are three possible monetary prizes (so the number of outcomes is N = 3):

Ftgure 6.B.6 Expected utility as a guide to introspection.

First Prize

Second Prize

Third Prize

2 500 000 dollars

500000 dollars

o dollars

The decision maker is subjected to two choice tests. The first consists of a choice between the lotteries L I and L'I: LI = (0,1,0)

L;

= (.10, .89, .01).

The second consists of a choice between the lotteries L2 and L 2:

L2

= (0, .11, .89)

L2 = (.10,0, .90).

The f~ur lotteries involved are represented in the simplex diagram of Figure 6.8.7. It IS common for individuals to express the preferences L 1"''-- L'1 and L'2''-- L 2' 8 8. In our classroom experience, roughly half the students choose this way.

Flgur. 6.B.7

Depiction of the Allais paradox in the simplex.

180

C HAP T E R

I:

SECTION

C HOI C E U N D E RUN C E R T A I N T Y

The first choice means that one prefers the certainty of receiving 500000 dollars over a lottery offering a 1/10 probability of getting five times more but bringing with it a tiny risk of getting nothing. The second choice means that, all things considered, a 1/10 probability of getting 2500 000 dollars is preferred to getting only 500 000 dollars with the slightly better odds of 11/100. However, these choices are not consistent with expected utility. This can be seen in Figure 6.B.7: The straight lines connecting L, to L~ and L2 to L~ are parallel. Therefore, if an individual has a linear indifference curve that lies in such a way that L, is preferred to L~, then a parallel linear indifference curve must make L2 preferred to L~, and vice versa. Hence, choosing L, and L~ is inconsistent with preferences satisfying the assumptions of the expected utility theorem. More formally, suppose that there was a v.N-M expected utility function. Denote by u 2S , un"~ and U o the utility values of the three outcomes. Then the choice L, >- L~ implies un' > (.lO)U2S + (.89)u o, + (.Ol)u o· Adding (.89)u o - (.89)u o, to both sides, we get (.l1)u o,

+ (.89)u o >

(.lO)U2S

+ (.90)u o,

and therefore any individual with a v.N-M utility function must have L2

>- L~.

-

There are four common reactions to the Allais paradox. The first, propounded by J. Marshack and L. Savage, goes back to the normative interpretation of the theory. It argues that choosing under uncertainty is a reflective activity in which one should be ready to correct mistakes if they are proven inconsistent with the basic principles of choice embodied in the independence axiom (much as one corrects arithmetic mistakes). The second reaction maintains that the Allais paradox is of limited significance for economics as a whole because it involves payoffs that are out of the ordinary and probabilities close to 0 and I. A third reaction seeks to accommodate the paradox with a theory that defines preferences over somewhat larger and more complex objects than simply the ultimate lottery over outcomes. For example, the decision maker may value not only what he receives but also what he receives compared with what he might have received by choosing differently. This leads to regret theory. In the example, we could have L, >- L', because the expected regret caused by the possibility of getting zero in lottery L'" when choosing L, would have assured 500000 dollars, is too great. On the other hand, with the choice between L2 and L;, no such clear-cut regret potential exists; the decision maker was very likely to get nothing anyway. The fourth reaction is to stick with the original choice domain of lotteries but to give up the independence axiom in favor of something weaker. Exercise 6.B.5 develops this point further. Example 6.B.3: Machina's paradox. Consider the following three outcomes: "a trip to Venice," "watching an excellent movie about Venice," and "staying home." Suppose that you prefer the first to the second and the second to the third. Now you are given the opportunity to choose between two lotteries. The first lottery gives "a trip to Venice" with probability 99.9% and "watching an excellent movie about Venice" with probability 0.1%. The second lottery gives "a trip to

I.B:

EXPECTED UTILITY THEORY

Venice," again with probability 99.9% and "staying home" with probability 0.1%. The independence axiom forces you to prefer the first lottery to the second. Yet, it would be understandable if you did otherwise. Choosing the second lottery is the rational thing to do if you anticipate that in the event of not getting the trip to Venice, your tastes over the other two outcomes will change: You will be severely disappointed and will feel miserable watching a movie about Venice. The idea of disappointment has parallels with the idea of regret that we discussed in connection with the Allais paradox, but it is not quite the same. Both ideas refer to the influence of "what might have been" on the level of well-being experienced, and it is because of this that they are in conflict with the independence axiom. But disappointment is more directly concerned with what might have been if another outcome of a given lottery had come up, whereas regret should be thought of as regret over a choice not made. _ Because of the phenomena illustrated in the previous two examples, the search for a useful theory of choice under uncertainty that does not rely on the independence axiom has been an active area of research [see Machina (1987) and also Hey and Orme (1994)]. Nevertheless, the use of the expected utility theorem is pervasive in economics. An argument sometimes made against the practical significance of violations of the independence axiom is that individuals with such preferences would be weeded out of the marketplace because they would be open to the acceptance of so-called "Dutch books," that is, deals leading to a sure loss of money. Suppose, for example, that there are three lotteries such that L >- L' and L>- L" but, in violation of the independence axiom, .L' + (I - '>.)L" >- L for some'>. E (0, I). Then, when the decision maker is in the initial position of owning the right to lottery L, he would be willing to pay a small fee to trade L for a compound lottery yielding lottery L' with probability'>. and lottery L" with probability (I - .). But as soon as the first stage of this lottery is over, giving him either L' or L" we could get him to pay a fee to trade this lottery for L. Hence, at that point, he would have paid the two fees but would otherwise be back to his original position. This may well be a good argument for convexity of the not-better·than sets of
')l

u(· )

u(x

+ ,)

-------------

= u(x)

,) ,/,/

u(l)

"" "" ""

u(X - ,)

!

(b)

Figure 6.C.2

x-£ [~

c(F. u)

context of expected utility theory, we see that risk aversion is equivalent to the eoneaviry of u(') and that strict risk aversion is equivalent to the strict concavity of u(·). This makes sense. Strict concavity means that the marginal utility of money is decreasing. Hence, at any level of wealth x, the utility gain from an extra dollar is smaller than (the absolute value of) the utility loss of having a dollar less. It follows that a risk of gaining or losing a dollar with even probability is not worth taking. This is illustrated in Figure 6.C2(a); in the figure we consider a gamble involving the gain or loss of I dollar from an initial position of 2 dollars. The (v.N-M )utility of this gamble, lU(I) + lU(3), is strictly less than that of the initial certain position u(2). For a risk-neutral expected utility maximizer, (6.C.2) must hold with equality for all F( '). Hence, the decision maker is risk neutral if and only if the Bernoulli utility function of money u(·) is linear. Figure 6.C2(b) depicts the (v.N-M) utility associated with the previous gamble for a risk neutral individual. Here the individual is indifferent between the gambles that yield a mean wealth level of 2 dollars and a certain wealth of 2 dollars. Definition 6.C.2 introduces two useful concepts for the analysis of risk aversion. Definition 6.C.2: Given a Bernoulli utility function u(·) we define the following concepts: (i) The certainty equivalent of F(')' denoted e(F, u), is the amount of money for which the individual is indifferent between the gamble F(') and the certain amount e(F, u); that is, u(e(F, u)) =

f

u(x) dF(x).

(6.C.3)

(ii) For any fixed amount of money x and positive number B, the probability premium denoted by n(x, e, u), is the excess in winning probability over fair odds that makes the individual indifferent between the certain outcome x and a gamble between the two outcomes x + e and x-e. That is

+ n(x, B, u))u(x + e) + (~-

n(x, B, u))u(x - e).

(6.C.4)

These two concepts are illustrated in Figure 6.C3. In Figure 6.C3(a), we exhibit the geometric construction of e(F, u) for an even probability gamble between I and 3 dollars. Note that e(F, u) < 2, implying that some expected return is traded for certainty. The satisfaction of the inequality e(F, u) ~ f x dF(x) for all F(') is, in fact,

,

1/'

A

,, , ,,, , I

The certainty equivalent (al and the probability premium (bl.

equivalent to the decision maker being a risk averter. To see this, observe that since u( . 1 is nondecreasing, we have

u) ~ fx dF(x)

¢>

u(e(F, u»

~ u(f x dF(Xl)

¢>

f

u(x) dF(x)

~ u(f xdF(X»).

where the last ¢> follows from the definition of e(F, u). In Figure 6.C.3(b), we exhibit the geometric construction of n(x, B, u). We see that n(x, e, u) > 0; that is, better than fair odds must be given for the individual to accept the risk. In fact, the satisfaction of the inequality n(x, e, u) ;?: 0 for all x and e > 0 is also equivalent to risk aversion (see Exercise 6.C.3). These points are formally summarized in Proposition 6.Cl. Proposition 6.C.1: Suppose a decision maker is an expected utility maximizer with a Bernoulli utility function u(·) on amounts of money. Then the following properties are equivalent: (i) (ii) (iii) (iv)

1 ,

,, ,, ,, I

,, ,, i'

X+E

+ rr(x, '. ull(x + ,)

(b)

Figure 6.C.3

e(F,

X

",:

+ G- rr(x, '. ull(x - ,) = x + 2,,(x. '. ul

Risk aversion (a) and risk neutrality (b). (a)

u(x) = (~

-/-(/

,,

,

(a)

AVERSION

187

,------------------------------------------------------------~~~~~~~~

The decision maker is risk averse. u(-) is concave"6 e(F, u) ~ f x dF(x) for all F(·). n(x, e, u) ;?: 0 for all x, e.

Examples 6.CI to 6.C.3 illustrate the use of the risk aversion concept. Example 6.C.l: Insurance. Consider a strictly risk-averse decision maker who has an initial wealth of w but who runs a risk of a loss of D dollars. The probability of the loss is n. It is possible, however, for the decision maker to buy insurance. One unit of insurance costs q dollars and pays I dollar if the loss occurs. Thus, if ex units of insurance are bought, the wealth of the individual will be w - exq if there is no loss and w - exq - D + ex if the loss occurs. Note, for purposes of later discussion, that the decision maker's expected wealth is then w - nD + ex(n - q). The decision maker's problem is to choose the optimal level of ex. His utility maximization problem is 16. Recall that if u(· I is twice differentiable then concavity is equivalent to u"(x) ,; 0 for

all x.

I

.... -'-. r~-~ -

188

C HAP TEA

8,

CH0 , CE

UNO E A

SEC T , 0 N

U NeE A T A , N T Y

M0 NEY

LOT TEA' E SAN 0

A' S KAY E AS' 0 N

Cl" is optimal, it must satisfy the Kuhn-Tucker first-order conditions: l7

therefore Max (I - It)u(w - cxq) + Itu(w - aq - D + a).

cf>(a") =

112:0

If

I. C:

,,* is an optimum, it must satisfy the first-order condition: -q(1 - It)u'(w - a"q) + 1t(1 - q)u'(w - D + a"(1 - q)):S 0,

with equality if 7' > O. . Suppose now that the price q of one unit of insurance is actuarially fair 10 the sense of it being equal to the expected cost of insurance. That is, q = It. Then the first-order condition requires that

f

u'(w

+ Cl"[Z -

1])(: - I) dF(z)

{

:sO ~O

ifCl' < w, . ,fCl' > O.

Note that f z dF(z) > 1 implies q,(0) > O. Hence, ,,' = 0 cannot satisfy this first-order condition. We conclude that the optimal portfolio has a' > O. The general principle illustrated in this example, is that if a risk is actuariall.\' favorable, then a risk avercer will always accept at least a small amount of it. This same principle emerges in Example 6.C.1 if insurance is not actuarially fair. In Exercise 6.C.I, you are asked to show that if q > It, then the decision maker will not fully insure (i.e., will accept some risk). _

u'(w - D + a"(I - It)) - u'(w - a"lt) :S 0, with equality if ",* > O. Since u'(w - D) > u'(w), we must have a" > 0, and therefore

u'(w - D + ","(I - It» = u'(w - a"It). Because u'( .) is strictly decreasing, this implies w- D

+ ",*(1

- It) = w - cx"lt,

Example 6.C.3: General Asset Problem. In the previous example, we could define the utility U(a,fJ) of the portfolio (Cl,fJ) as U(Cl,fJ) = f U(ClZ + fJ) dF(z). Note that U(·) is then an increasing, continuous, and concave utility function. We now discuss an important generalization. We assume that we have N assets (one of which may be the safe asset) with asset n giving a return of z. per unit of money invested. These returns are jointly distributed according to a distribution function F(z" ... , ZN)' The utility of holding a portfolio of assets (Cl" ... , Cl N) is then

or, equivalently,

U(Cl" ... , aN) =

Cl* = D. Thus, if insurance is actuarially fair, the decision maker insures completely. The individual's final wealth is then w - ltD, regardless of the occurrence of the loss. This proof of the complete insurance result uses first-order conditions, which is instructive but not really necessary. Note that if q = It, then the decision maker's expected wealth is w - ltD for any iX. Since setting a = D allows him to reach w - ltD with certainty, the definition of risk aversion directly implies that this is the optimal level of "'. _ Example 6.C.2: Demand for a Risky Asset. An asset is a divisible claim to a financial return in the future. Suppose that there are two assets, a safe asset with a return of I dollar per dollar invested and a risky asset with a random return of z dollars per dollar invested. The random return z has a distribution function F(z) that we assume satisfies f z dF(z) > I; that is, its mean return exceeds that of the .s~fe asset. An individual has initial wealth w to invest, wh,ch can be dIVIded 10 any way between the two assets. Let a and fJ denote the amounts of wealth invested in the risky and the safe asset, respectively. Thus, for any realization z of the random return, the individual's portfolio (Cl,fJ) pays az + fJ. Of course, we must also have Cl + fJ = w. The question is how to choose", and fJ. The answer will depend on Fe), w, and the Bernoulli utility function u(·). The utility maximization problem of the individual is Max a.~:2::

0

f

U(ClZ

U(Cl,Z,

+ ... + ClNZN) dF(z" ... , ZN)'

This utility function for portfolios, defined on I\!~, is also increasing, continuous, and concave (see Exercise 6.C.4). This means that, formally, we can treat assets as the usual type of commodities and apply to them the demand theory developed in Chapters 2 and 3. Observe, in particular, how risk aversion leads to a convex indifference map for portfolios. _ Suppose that the lotteries pay in vectors of physical goods rather than in money. Formally, the space of outcomes is then the consumption set R~ (all the previous discussion can be viewed as the special case in which there is a single good). In this more general setting, the concept of risk aversion given by Definition 6.C.1 is perfectly well defined. Furthermore, if there is a Bernoulli utility function u: R~ ~ R, then risk aversion is still equivalent to 'he concavity of u(·). Hence, we have here another justification for the convexity assumption of Chapter 3: Under the assumptions of the expected utility theorem, the convexity of preferences for perfectly certain amounts of the physical commodities must hold if for any lottery with commodity payoffs the individual always prefers the certainty of the mean commodity bundle to the lottery itself. In Exercise 6.C.5, you are asked to show that if preferences over lotteries with commodity payoffs exhibit risk aversion, then, at given commodity prices, the induced preferences on money lotteries (where consumption decisions are made after the realization of wealth) are also risk averse. Thus, in principle, it is possible to build the theory of risk aversion on the more primitive notion of lotteries over the final consumption of goods.

+ fJ) dF(z)

+ fJ = w. f u(w + Cl(Z - I» S.I.Cl

Equivalently, we want to maximize

f

dF(z) subject to 0 :S Cl :S w. If

__ ~L__________________________

17. The objective runction is concave in 1))(z - 1)2 dF(x) S O.

f u"(w + o(z -

IX

because the concavity of u(·) implies that

189

5 E C T ION 100

CHAPTER

6:

CHOICE

UNDER

M 0 NEY

LOT T E R I E SAN D

RI SK

Note that, up to two integration constants, the utility function u(·) can be recovered from ~. (.) by integrating twice. The integration constants are irrelevant because the Bernoulli utility is identified only up to two constants (origin and units). Thus, the Arrow-Pratt risk aversion measure rA (.) fully characterizes behavior under uncertainty.

The Measurement of Risk Aversion

Now that we know what it means to be risk averse, we can try to measure the extent of risk aversion. We begin by defining one particularly useful measure and discussing some of its properties. Definition 6.C.3: Given a (twice-differentiable) Bernoulli utility function u(·) for money, the Arrow-Pratt coefficient of absolute risk aversion at x is defined as

Example 6.C4: Consider the utility function u(x) = _e- ax for a > O. Then u'(x) = ae- ax and u"(x) = -a'e-·'. Therefore, rA(x, u) = a for all x. It follows from the

rA(x) = -u"(x)/u'(x).

observation just made that the general form of a Bernoulli utility function with an Arrow- Pratt measure of absolute risk aversion equal to the constant a > 0 at all x is u(x) = -lIe-·' + {1 for some 11 > 0 and (1. •

The Arrow-Pratt measure can be motivated as follows: We know that risk neutrality is equivalent to the linearity of u(·), that is, to u"(x) = 0 for all x. Therefore, it seems logical that the degree of risk aversion be related to the curvature of u(·). In Figure 6.C4, for example, we represent two Bernoulli utility functions u l (·) and U2( .) normalized (by choice of origin and units) to have the same utility and marginal utility values at wealth level x. The certainty equivalent for a small risk with mean x is smaller for uk) than for uIC), suggesting that risk aversion increases with the curvature of the Bernoulli utility function at x. One possible measure of curvature of the Bernoulli utility function u(·) at x is u"(x). However, this is not an adequate measure because it is not invariant to positive linear transformations of the utility function. To make it invariant, the simplest modification is to use u"(x)/u'(x). If we change sign so as to have a positive number for an increasing and concave u(·), we

Once we are equipped with a measure of risk aversion, we can put it to use in comparative statics exercises. Two common situations are the comparisons of risk attitudes across individuals with different utility functions and the comparison of risk atlltudes for one individual at different levels of wealth. Comparisons across individuals

Given two Bernoulli utility functions u l (·) and u 2C), when can we say that uk) is unambIguously more risk averse than u l (')1 Several possible approaches to a definition seem plausible: (i) rA(x, u, ) ; c(F, Ul) for any F(·). (iv) n(x, e, u 2) ; 0, and using the first 19. In other words, any risk that u2 (') would accept starting from a position of certainty would

also be accepled by

U I ( • ).

A V E R S ION

191

192

c HAP TEA

I:

C HOI C E

UNO E A

SEC T ION

U N C EAT A I NT Y

I. C:

M0 NEY

LOT TEA IE SAN 0

AI SK

richer people "can afford to take a chance." Hence, we shall explore the implications of the condition stated in Definition 6.C.4.

expression, we get

Dellnltlon 6.C.4: The Bernoulli utility function u(·) for money exhibits decreasing

Thus, rA(x, u,) ~ rA(x, Ul) for all x if and only if !/t"(u l ) ~ 0 for all Ul in the range of u l (-). The more-risk-averse-than relation is a partial ordering of Bernoulli utility functions; it is transitive but far from complete. Typically, two Bernoulli utility functions Ul (.) and u,(· ) will not be comparable; that is, we will have rA(x, Ul) > rA(x, U,) at some x but rA(x', Ul) < rA(x', u,) at some other x' # x. Example 6.C.2 continued: We take up again the asset portfolio problem between a safe and a risky asset discussed in Example 6.C.2. Suppose that we now have two individuals with Bernoulli utility functions u l (·) and u,(·), and denote by at and a! their respective optimal investments in the risky asset. We will show that if u,(·) is more risk averse than u l ('), then et! < at; that is. the second decision maker invests less in the risky asset than the first. To repeat from our earlier discussion, the asset allocation problem for U 1(-) is Max 05.:t::S,:w

f

ul(w - a

(z - l)u;(w

+ etnZ - I]) dF(z) = O.

(6.C.S)

f

(z -

I)u~(w + etHZ -

I]) dF(z) = O.

(6.C.6)

As we know, the concavity of u2 (·) implies that "'2(') is decreasing. Therefore, if we show that < 0, it must follow that a! < "T, which is the result we want. Now, u,(x) = !/t(ul(x» allows us to write

"',(an

"'2(an =

f

(z - I)!/t'(ul(w

+ anz -

I]»u;(w

+ "nz

Proposition 6.C.3: The following properties are equivalent:

(i) The Bernoulli utility function (.) exhibits decreasing absolute risk aversion. < x" u 2 (z) = u(x2 + z) is a concave transformation of u,(z) =

(ii) Whenever x 2

(iii) For any risk F(z) , the certainty equivalent of the lottery formed by adding risk z to wealth level x. given by the amount C x at which u(c x) = Ju(x + z) dF(z). is such that (x - cx) is decreasing in x. That is, the higher x is, the less is the individual willing to pay to get rid of the risk. (iv) The probability premium 1t(x, e, u) is decreasing in x. (v) For any F(z), if u(x2 + z) dF(z) ~ u(x 2 ) and x2 < x,. then u(x, + z) dF(z) ~

J

J

u(x,).

The analogous expression for the utility function U2(') is "',(et!) =

Individuals whose preferences satisfy the decreasing absolute risk aversion property take more risk as they become wealthier. Consider two levels of initial wealth x I > x 2 • Denote the increments or decrements to wealth by z. Then the individual evaluates risk at Xl and X2 by, respectively, the induced Bernoulli utility functions ul(z) = U(XI + z) and u,(z) = U(X2 + z). Comparing an individual's attitudes toward risk as his level of wealth changes is like comparing the utility functions Ul(') and U2(-), a problem we have just studied. If u(-) displays decreasing absolute risk aversion, then rA(z, U2) ~ rA(z, Ul) for all z. This is condition (i) of Proposition 6.C.2. Hence, the result in Proposition 6.C.3 follows directly from Proposition 6.C.2.

u(x, +z).

+ az) dF(z).

Assuming an interior solution, the first-order condition is

f

absolute risk aversion if rA (x, u) is a decreasing function of x.

-

I]) dF(z) < O.

(6.C.7)

To understand the final inequality, note that the integrand of expression (6.C.7) is the same as that in (6.C.S) except that it is multiplied by !/t'('), a positive decreasing function of z [recall that uk) more risk averse than ul (·) means that the increasing function !/t(.) is concave; that is, !/t'(') is positive and decreasing]. Hence, the integral (6.C.7) underweights the positive values of (z - I)u;(w + anz - I]), which obtain for z> \, relative to the negative values, which obtain for z < I. Since, in (6.C.S), the integral of the positive and the negative parts of the integrand added to zero, they now must add to a negative number. This establishes the desired inequality. Comparisons across wealth levels

It is a common contention that wealthier people are willing to bear more risk than

poorer people. Although this might be due to differences in utility functions across people, it is more likely that the source of the difference lies in the possibility that

Exercise 6.C.8: Assume that the Bernoulli utility function u(·) exhibits decreasing absolute risk aversion. Show that for the asset demand model of Example 6.C.2 (and Example 6.C.2 continued), the optimal allocation between the safe and the risky assets places an increasing amount of wealth in the risky asset as w rises (i.e., the risky asset is a normal good). The assumption of decreasing absolute risk aversion yields many other economically reasonable results concerning risk-bearing behavior. However, in applications, it is often too weak and, because of its analytical convenience, it is sometimes complemented by a stronger assumption: nonincreasing relative risk aversion. To understand the concept of relative risk aversion, note that the concept of absolute risk aversion is suited to the comparison of attitudes toward risky projects whose outcomes are absolute gains or losses from current wealth. But it is also of interest to evaluate risky projects whose outcomes are percentage gains or losses of current wealth. The concept of relative risk aversion does just this. Let t > 0 stand for proportional increments or decrements of wealth. Then, an individual with Bernoulli utility function u(·) and initial wealth x can evaluate a random percentage risk by means of the utility function u(t) = u(tx). The initial wealth position corresponds to t = I. We already know that for a small risk around t = \, the degree of risk aversion is well captured by 17"(\)/17'(\). Noting that 17"(\)/17'(\) = xu"(x)/u'(x), we are led to the concept stated in Definition 6.C.S.

A V E A S ION

193

I

194

C HAP TEA

.:

C HOI C E

UNO E A

SEC T ION

U NeE A T A I N T Y

Definition 6.C.S: Given a Bernoulli utility function u('), the coefficient of relative risk aversion at x is rR(x, u) = -xu"(x)/u'(x). Consider now how this measure varies with wealth. The property of nonincreasing relative risk aversion says that the individual becomes less risk averse with regard to gambles that are proportional to his wealth as his wealth increases. This is a stronger assumption than decreasing absolute risk aversion: Since r/l(x, u) = xr,,(x, u), a risk-averse individual with decreasing relative risk aversion will exhibit decreasing absolute risk aversion, but the converse is not necessarily the case. As before, we can examine various implications of this concept. Proposition 6.C.4 is an abbreviated parallel to Proposition 6.C.3. Proposition 6.C.4: The following conditions for a Bernoulli utility function u(') on amounts of money are equivalent: (i) rR(x, u) is decreasing in x. (ii) Whenever x 2 < x" u2 (t) = u(tx 2 ) is a concave transformation of u,(t) = u(tx,).

(iii) Given any risk F(t) on t> 0, the certainty equivalent ex defined by u(e x) = f u(tx) dF(t) is such that x/ex is decreasing in x. Proof: Here we show only that (i) implies (iii). To this effect, fix a distribution F(t) on t > 0, and, for any x, define ux(t) = u(tx). Let c(x) be the usual certainty equivalent (from Definition 6.C.2): ux(c(x» = f u.(t) dF(t). Note that -u;(t)/u~(t) = -(I/t)tx[u"(tx)/u'(txlJ for any x. Hence if (i) holds, then ux '(-) is less risk averse than ux (') whenever x' > x. Therefore, by Proposition 6.C.2, c(x') > c(x) and we conclude that c(') is increasing. Now, by the definition of ux ('), ux(c(x» = u(xc(x». Also ux(c(x»

=

f

ux(t) dF(t)

=

f

u(tx) dF{t)

= u(cx)'

Hence, cx/x = c(x), and so x/cx is decreasing. This concludes the proof. Example 6.C.2 continued: In Exercise 6.C.1l, you are asked to show that if r/l(x, u) is decreasing in x, then the proportion of wealth invested in the risky asset y = rx/w is increasing with the individual's wealth level w. The opposite conclusion holds if r/l(x, u) is increasing in x. If r/l(x, u) is a constant independent of x, then the fraction of wealth invested in the risky asset is independent of w [see Exercise 6.C.12 for the specific analytical form that u(·) must have). Models with constant relative risk a version are encountered often in finance theory, where they lead to considerable analytical simplicity. Under this assumption, no matter how the wealth of the economy and its distribution across individuals evolves over time, the portfolio decisions of individuals in terms of budget shares do not vary (as long as the safe return and the distribution of random returns remain unchanged). -

6.D Comparison of Payoff Distributions in Terms of Return and Risk In this section, we continue our study of lotteries with monetary payoffs. In contrast with Section 6.C, where we compared utility functions, our aim here is to compare

•• D:

COli P A A ISO N

0 F

PAY 0 F F

payoff distributions. There are two natural ways that random outcomes can be compared: according to the level of returns and according to the dispersion of returns. We will therefore attempt to give meaning to two ideas: that of a distribution F(') yielding unambiguously higher returns than G(') and that of F(') being unambiguously less risky than G(·). These ideas are known, respectively, by the technical terms of first-order stochastic dominance and second-order stochastic dominance. 2o In all subsequent developments, we restrict ourselves to distributions F(') such that F(O) = 0 and F(x) = 1 for some x.

First-Order Stochastic Dominance We want to attach meaning to the expression: "The distribution F(') yields unambiguously higher returns than the distribution G(' )." At least two sensible criteria suggest themselves. First, we could test whether every expected utility maximizer who values more over less prefers F(') to G(·). Alternatively, we could verify whether, for every amount of money x, the probability of getting at least x is higher under F(') than under G(·). Fortunately, these two criteria lead to the same concept. Definition 6.0.1: The distribution F(') first-order stochastically dominates G(-) if, for every nondecreasing function u: R -+ R we have

f

u(x) dF(x)

~

f

u(x) dG(x).

Proposition 6.0.1: The distribution of monetary payoffs F( . ) first-order stochastically dominates the distribution G(') if and only if F(x) s; G(x) for every x. Proof: Given F(') and G(') denote H(x) = F(x) - G(x). Suppose that H(x) > 0 for some x. Then we can define a nondecreasing function u(· ) by u(x) = 1 for x > x and u(x) = 0 for x S; X. This function has the property that f u(x) dH(x) = - H(x) < 0, and so the" only if" part of the proposition follows. For the "if" part of the proposition we first put on record, without proof, that it suffices to establish the equivalence for differentiable utility functions u(·). Given F(') and G('), denote H(x) = F(x) - G(x). Integrating by parts, we have

f

u(x) dH(x) = [u(x)H(x)]" -

f

u'(x)H(x) dx.

Since H(O) = 0 and H(x) = 0 for large x, the first term of this expression is zero. It follows that f u(x) dH(x) ~ 0 [or, equivalently, f u(x) dF(x) - f u(x) dG(x) ~ 0] if and only if f u'(x)H(x) dx S; O. Thus, if H(x) S; 0 for all x and u(·) is increasing, then f u'(x)H(x) dx S; 0 and the "if" part of the proposition follows. _ In Exercise 6.0.1 you are asked to verify Proposition 6.0.1 for the case of lotteries over three possible outcomes. In Figure 6.0.1, we represent two distributions F(') and G(·). Distribution F(') first-order stochastically dominates G(') because the graph of F( .) is uniformly below the graph of G(·). Note two important points: First, first-order stochastic dominance does not imply that every possible return of the 20. They were introduced into economics in Rothschild and Stiglitz (1970).

-

------------

DIS T A I BUT ION

195

196

CHAPTER

8:

CHOICE

UNDER

UNCERTAINTY

SECTION

1.0:

COMPARISON

OF

PAYOFF

shifted up to an even probability between 2 and 3 dollars, and the outcome "4 dollars" is shifted up to 5 dollars with probability one. Figure 6.D.2(b) shows that F(x) :
f

G(')

Dollars

u(x) dG(x).

Flgur. 6.0.2

F(')

F(') first-order stochastically dominates G(·).

------------------ r---,-G(\')

- II I -I

.---....l...o-..L-_\J 1 r--J F(') -I

x (a)

(b)

u(x) dF(x)
f

u(x) dG(x).

Example 6.D.2 introduces an alternative way to characterize the second-order stochastic dominance relation.

So F( .) first-order stochastically dominates G(·). A specific example is illustrated in Figure 6.D.2. As Figure 6.D.2(a) shows, G(') is an even randomization between 1 and 4 dollars. The outcome" 1 dollar" is then

l

Second-Order Stochastic Dominance

Example 6.D.2: Mean-Preserving Spreads. Consider the following compound lottery: In the first stage, we have a lottery over x distributed according to F(·). In the second stage, we randomize each possible outcome x further so that the final payoff is x + z, where Z has a distribution function Hx(z) with a mean of zero [i.e., f Z dHx(z) = OJ. Thus, the mean of x + Z is x. Let the resulting reduced lottery be denoted by G(·). When lottery G(') can be obtained from lottery F(') in this manner for some distribution H.(·), we say that G(') is a mean-preserving spread of F(·). For example, F(') may be an even probability distribution between 2 and 3 dollars. In the second step we may spread the 2 dollars outcome to an even probability between 1 and 3 dollars, and the 3 dollars outcome to an even probability between 2 and 4 dollars. Then G(') is the distribution that assigns probability 1 to the four outcomes: 1, 2, 3, 4 dollars. These two distributions F(') and G(') are depicted in Figure 6.D.3. The type of two-stage operation just described keeps the mean of G(') equal to that of F(·). In addition, if u(·) is concave, we can conclude that

f

u(x) dG(x) = =

f(f f

u(x

+ z) dHx(Z») dF(x)

u(x) dF(x),

: 0 that it occurs. We abuse notation slightly by also denoting the total number of states by S. An uncertain alternative with (nonnegative) monetary returns can then be described as a function that maps realizations of the underlying state of nature into the set of possible money payoffs R+. Formally, such a function is known as a random variable.

Definition 6.E.1: A random variable is a function g: S monetary outcomes.23

-+

R+ that maps states into

Every random variable g(.) gives rise to a money lottery describable by the distribution function F(·) with F(x) = L{",(,) " xl", for all x. Note that there is a loss in information in going from the random variable representation of uncertainty to the lottery representation; we do not keep track of which states give rise to a given monetary outcome, and only the aggregate probability of every monetary outcome is retained. Because we take S to be finite, we can represent a random variable with monetary payoffs by the vector (XI"'" xs), where x, is the nonnegative monetary payoff in state s. The set of all nonnegative random variables is then R~. State-Dependent Preferences and the Extended Expected Utility Representation

(Certainty Line)

L',u.(x.) ~ L•. u.(.,.)}

.

.

Slope = _ '\u;(.> 0, we could define u,(·) = (1/7t,)u,(·), and we could then evaluate (Xl' ... ' Xs) by L. ",u,(x,). What is needed is some way to disentangle utilities from probabilities. Consider an example. Suppose that a gamble that gives one dollar in state 1 and none in state 2 is preferred to a gamble that gives one dollar in state 2 and none in state 1. Provided there is no reason to think that the labels of the states have any

26. To be specific, we consider monetary payoffs here. All the subsequent arguments, however. work with arbitrary sets of outcomes.

T~EOAY

205

206

CHAPTER

.:

CHOICE

UNDER

UNCERTAINTY

-------------------------------------------------------------particular influence on the value of money, it is then natural to conclude that the decision maker regards state 2 as less likely than state I. This example suggests an additional postulate. Preferences over money lotteries within state s should be the same as those within any other state s'; that is, risk attitudes towards money gambles should be the same across states. To formulate such a property, we define the state s preferences ~, on state s lotteries by F,

~,F;

if

f

u,(x,) dF,(x,) 2:

Definition 6.F.1: The state preferences

(~"

... ,

f

u,(x,) dF;(x,).

~s)

on state lotteries are state

uniform if ~s = ~" for any sand s'.

With state uniformity, u,(·) and u,,(·) can differ only by an increasing linear transformation, Therefore, there is u(·) such that, for all s = I,. , ., S,

u,U =

It,uU

+ p,

for some It, > 0 and p,. Moreover, because we still represent the same preferences if we divide all It, and p, by a common constant, we can normalize the It, so that L, It, = I. These It, are going to be our subjective probabilities. ProposlUon 6.F.1: (Subjective Expected Utility Theorem) Suppose that the preference relation ~ on !i' satisfies the continuity and extended independence axioms, Suppose, in addition, that the derived state preferences are state uniform. Then there are probabilities (It,, ' , , , lts) » 0 and a utility function u(·) on amounts of money such that for any (x" ' , , , xs) and (x;, ' , . ,xs) we have (x", ., ,xs) ~ (x;"., ,xs)

if and only if

L It,u(x,) 2: L It,u(x~).

Moreover, the probabilities are uniquely determined, and the utility function is unique up to origin and scale. Proof: Existence has already been proven. You are asked to establish uniqueness in Exercise 6.F.1. _ The practical advantages of the subjective expected utility representation are similar to those of the objective version, which we discussed in Section 6.B, and we will not repeat them here. A major virtue of the theory is that it gives a precise, quantifiable, and operational meaning to uncertainty. It is, indeed, most pleasant to be able to remain in the familiar realm of the probability calculus. But there are also problems. The plausibility of the axioms cannot be completely dissociated from the complexity of the choice situations. The more complex these become, the more strained even seemingly innocent axioms are, For example, is the completeness axiom reasonable for preferences defined on huge sets of random variables? Or consider the implicit axiom (often those are the most treacherous) that the situation can actually be formalized as indicated by the model. This posits the ability to list all conceivable states of the world (or, at least, a sufficiently disaggregated version of this list). In summary, every difficulty so far raised against our model of the rational consumer (i.e., to transitivity, to completeness, to independence) will apply with increased force to the current model. There are also difficulties specific to the nonobjective nature of probabilities. We devote Example 6,F.1 to this point.

REFERENCES

207

,--------------------------------------------------~~~~~~ Example 6.F,1: This example is a variation of the ElIsberg paradox.27 There are two urns, denoted Rand H. Each urn contains 100 balls. The balls are either white or black. Urn R contains 49 white balls and 51 black balls. Urn H contains an unspecified assortment of balls. A ball has been randomly picked from each urn. Call them the R-ball and the H-ball, respectively, The color of these balls has not been, disclosed. Now we consider two choice situations, In both experiments, the deCISIon maker must choose either the R-ball or the H-ball. After the choices have been made, the color will be disclosed. In the first choice situation, a prize of 1000 dollars is won if the chosen ball is black. In the second choice situation, the same prize is won if the ball is white. With the information given, most people will choose the R-ball in the first experiment. If the decision is made using subjective probabilities, thiS should mean that the subjective probability that the H-ball is white is larger than .49, Hence, most people should choose the H-ball in the second experiment. However, It tu~ns out that this does not happen overwhelmingly in actual experiments, The deCISion maker understands that by choosing the R-ball, he has only a 49% chance of winning, However, this chance is "safe" and well understood, The uncertainties incurred are much less clear if he chooses the H-ball. _ Knight (1921) proposed distinguishing between risk and uncertainty according to wh~ther the proba.bilities a~e given to us objectively or not. In a sense, the theory of subjective probability nullifies this distinction by reducing all uncertainty to risk through the use of beliefs expressible as probabilities. The Example 6.F.1 suggests that there may be something to the distinction. This is an active area of research [e.g., Bewley (1986) and Gilboa and Schmeidler (1989)]. 27, From Elisberg (1961),

REFERENCES Allais. M. (1953). Le comportement de l'homme rationneJ devant Ie risque, critique des pastulats et axiomes de j'ecole Americaine. Econometrica 21: 503-46. Anscombe, F., and R. Aumann. (1963). A definition of subjective probability. Annals of Mathematical Statistics 34: 199-205. Arrow, K. J. (1971). Essays in the Theory of Ri'ik Bearing. Chicago: Markham. Bewley. T. (1986). Knightian Decision Theory: Part 1. New Haven: Cowles Foundation Discussion Paper No,807, Dekel, . E. (1986). An .axiomatic characterization of preferences under uncertainty: Weakening the mdependence aXiom. Journal of Economic Theory 40: 304-18. Diamond, P., and M. Rothschild. (1978). Uncertainty in Economics: Readings and Exercises. New York: Academic Press. E1.lsberg, D. (1961). Risk, ambiguity, and the Savage axioms. Quarterly Journal of Economics 75: 643-69. Gilboa, I., and .D. Schmeidler. (1989). Maximin expected utility with a unique prior. Journal of Mathematical Economics 18: 141-53. Grether, D., and C. H. Plott. (1979). Economic theory of choice and the preference reversal phenomenon. American Economic Review 69: 623-38. Green. 1. (1987). 'Making book against oneself: the independence axiom, and nonlinear utility theory. Quarterly Journal of Economics 98: 785-96. Hey, J. D. and C. Orme~ (1994). Investigating generalizations of expected utility theory using experimental data. Econometrica 62: 1291-326.

208

CHAPTER

8:

CHOICE

UNOER

UNCERTAINTY

Knight. F. (1921). Risk, Uncertainty and Profit. Boston, Mass.: Houghton Mifflin. Reprint, London: London School of Economics 1946. Kreps, D. (1988). Notes on the Theory of Choice. Boulder, Colo.: Westview Press. Machina, M. (1987). Choice under uncertainty: Problems solved and unsolved. The Journal of Perspectives I: 121-54. Pratt, J. (1964), Risk aversion in the small and in the large. Econometrica 32: 122-36. Reprinted in Diamond

and Rothschild. Rothschild. M. and 1. Stiglitz. (1970). Increasing risk I: A definition. Journal of Economic Theory 2: 225-43. Reprinted in Diamond and Rothschild. Savage, L. (1954). The Foundations of Statistics. New York: Wiley. Von Neumann. 1., and O. Morgenstern. (1944). Theory of Games and Economic Behavior. Princeton, N.J.: Princeton University Press.

EXERCISES

betweenness axiom [see Oekel (1986)]:

For all L, L' and ,Ie (0,1), if L - L'. then AL

+ (I -

A)L' - L.

Suppose that there are three possible outcomes.

209

, I

1

(a) Show that a preference relation on lotteries satisfying the independence axiom also

satisfies the betweenness axiom. (b) Using a simplex representation for lotteries similar to the one in Figure 6.B.I(b), show that if the continuity and betweenness axioms are satisfied, then the indifference curves of a preference relation on lotteries are straight lines. Conversely, show that if the indifference curves are straight lines, then the betweenness axiom is satisfied. Do these straight lines need to be parallel? (c) Using (b), show that the betweenness axiom is weaker (less restrictive) than the independence axiom.

EXERCISES 6.B.IA In text. 6.B.2A In text. 6.B.3" Show that if the set of outcomes C is finite and the rational preference relation:::: on the set of lotteries ff satisfies the independence axiom. then there are best and worst lotteries in ff. That is. we can find lotteries [ and l, such that [:::: L :::: l, for all L e ff.

(d) Using Figure.6.B.7, show that the choices ofthe Allais paradox are compatible with the betweeness axiom by exhibiting an indifference map satisfying the betweenness axiom thai yields the choices of the Allais paradox. 6.B.6" Prove that the induced utility function U(·) defined in Ihe last paragraph of Section 6.B is convex. Give an example of a set of outcomes and a Bernoulli utility function for which the induced utility function is not linear. 6.B.7A Consider the following two lotteries:

6.B.4" The purpose of this exercise is to illustrate how expected utility theory allows us to make consistent decisions when dealing with extremely small probabilities by considering relatively large ones. Suppose that a safety agency is thinking of establishing a criterion under which an area prone to Hooding should be evacuated. The probability of flooding is 1%. There are four possible outcomes: (A) No evacuation is necessary, and none is performed. (B) An evacuation is performed that is unnecessary. (C) An evacuation is performed that is necessary. (0) No evacuation is performed, and a Hood causes a disaster. Suppose that the agency is indifferent between the sure outcome B and the lottery of A with probability p and 0 with probability 1 - p, and between the sure outcome C and the lottery of B with probability q and 0 with probability 1 - q. Suppose also that it prefers A to 0 and that p e (0. I) and q e (0, I). Assume that the conditions of the expected utility theorem are satisfied. (a) Construct a utility function of the expected utility form for the agency. (b) Consider two different policy criteria: Criterion 1: This criterion will result in an evacuation in 90% of the cases in which Hooding will occur and an unnecessary evacuation in 10% of the cases in which no flooding occurs.

Criterion 2: This criterion is more conservative. It results in an evacuation in 95% of the cases in which Hooding will occur and an unnecessary evacuation in 5% of the cases in which no Hooding occurs.

First, derive the probability distributions over the four outcomes under these two criteria. Then, by using the utility function in (a), decide which criterion the agency would prefer. 6.B.5" The purpose of this exercise is to show that the Allais paradox is compatible with a weaker version of the independence axiom. We consider the following axiom, known as the

L: { L': {

200 dollars with probability .7.

o dollars with probability .3. 1200 dollars with probability.!.

o dollars with probability .9.

and XL· be the sure amounts of money that an individual finds indifferent to Land L'. Show that if his preferences are transitive and monotone, the individual must prefer L to L' if and only if XL > XL •. [Note: In actual experiments, however. a preference reversal is often observed in which L is preferred to L' but XL < XL·. See Grether and Plott (1979) for details.] Let

XL

6.C.1" Consider the insurance problem studied in Example 6.C.1. Show that if insurance is not actuarially fair (so that q > It), then the individual will not insure completely. 6.C.2" (a) Show that if an individual has a Bernoulli utility function u( ) with the quadralic form u(x) =

PX' + yx,

then his utility from a distribution is determined by the mean and variance of the distribution and, in fact, by these moments alone. [Note: The number p should be taken to be negative in order to get the concavity of u(-). Since u(-) is then decreasing at X > -y/2P, u(-) is useful only when the distribution cannot take values larger than -y/2p.] (b) Suppose that a utility function U(-) over distributions is given by U(F) = (mean of F) - r(variance of F),

where r > O. Argue that unless the set of possible distributions is further restricted (see, e.g., Exercise 6.C.19), U(·) cannot be compatible with any Bernoulli utility function. Give an example of two lotteries Land L' over the same two amounts of money, say x' and x" > x', such that L gives a higher probability to x" than does L' and yet according to U(·), L' is preferred to L.

-

210

C HAP T E R

.:

C HOI C E

UNO E RUN C E R T A I N T Y

E

6.C.3 B Prove that the four conditions of Proposition 6.C.1 are equivalent. [Hint: The equivalence of (i), (ii), and (iii) has already been shown. As for (iv), prove that (i) implies (iv) and that (iv) implies u(lx + b) ;:: tu(x) + 1u(y) for any x and y, which is, in fact, sufficient for (ii).] 6.C4 B Suppose that there are N risky assets whose returns z, (n = I, ... , N) per dollar invested are jointly distributed according to the distribution function F(:" ... , ZN)' Assume also that all the returns are nonnegative with probability one. Consider an individual who has a continuous, increasing, and concave Bernoulli utility function u(·) over R •. Define the utility function U(·) of this investor over R~, the set of all nonnegative portfolios, by

U(~"

... , ~N) =

f

u(<x,z,

+ ... + ~NZN) dF(z"

... , ZN)'

Denote the solution to this problem by x'. (a) Show that if E[v'(xo

+ y)] >

v'(xo), then x' > Xo.

(b) Define the coefficient of absolute p,udence of a utility function v(·) at wealth level x to be - v"'(x)/v"(x). Show that if the coefficient of absolute prudence of a utility function v,(·) is not larger than the coefficient of absolute prudence of utility function v,(·) for all levels of wealth, then E[v',(xo + y)] > v',(x o) implies E[v;(xo + y)] > v;(x o). What are the implications of this fact in the context of part (a)'? (e) Show that if v"'(·) > 0, and E[y] = 0, then E[v'(x

+ y)] >

v'(x) for all values of x.

(d) Show that if the coefficient of absolute risk aversion of v(·) is decreasing with wealth, then -v"'(x)/v"(x) > -v"(x)/v'(x) for all x, and hence v"'(·) > 0. A

Prove that U(·) is (a) increasing, (b) concave, and (c) continuous (this is harder). 6.C.5· Consider a decision maker with utility function u(·) defined over R~, just as in Chapter 3. (a) Argue that concavity of u(·) can be interpreted as the decision maker exhibiting risk

aversion with respect to lotteries whose outcomes are bundles of the L commodities. (b) Suppose now that a Bernoulli utility function u(·) for wealth is derived from the maximization of a utility function defined over bundles of commodities for each given wealth level w, while prices for those commodities are fixed. Show that, if the utility function for the commodities exhibits risk aversion, then so does the derived Bernoulli utility function for wealth. Interpret. (e) Argue that the converse of part (b) does not need to hold: There arc nonconcave functions u: R~ -+ R such that for any price vector the derived Bernoulli utility function on

wealth exhibits risk aversion.

6.CIO Prove the equivalence of conditions (i) through (v) in Proposition 6.C.3. [Hint: By letting u,(z) = u(w, + z) and u,(z) = u(w, + z), show that each of the five conditions in Proposition 6.C.3 is equivalent to the counterpart in Proposition 6.C.2.] B 6.Cn For the model in Example 6.C.2, show that if ,.(x, u) is increasing in x then the proportion of wealth invested in the risky asset y = <x/x is decreasing with x. Similarly, if ,.(x, u) is decreasing in x, then y = 2/X is increasing in x. [Hint: Let u,(t) = u(tw,) and u,(t) = u(tw,), and use the fact, stated in the analysis of Example 6.C.2, that if one Bernoulli utility function is more risk averse than another, then the optimal level of investment in the

risky asset for the first function is smaller than that for the second function. You could also attempt a direct proof using first-order conditions.] 6.C12B Let u: R.

-+

R be a strictly increasing Bernoulli utility function. Show that

(a) u(·) exhibits constant relative risk aversion equal to p {ix' -, + y, where {i > and 'I E R

°

*I

if and only if u(x) =

(b) u(·) exhibits constant relative risk aversion equal to 1 if and only if u(x) = {i In x

6.C6 B For Proposition 6.C.2:

where {i >

(a) Prove the equivalence of conditions (ii) and (iii). (b) Prove the equivalence of conditions (iii) and (v).

6.C7· Prove that, in Proposition 6.C.2, condition (iii) implies condition (iv), and (iv) implies (i). 6.CS· In text. 6.C9B (M. Kimball) The purpose of this problem is to examine the implications of uncertainty and precaution in a simple consumption-savings decision problem. In a two-period economy, a consumer has first-period initial wealth w. The consumer's utility level is given by

where u(') and v(·) are concave functions and c, and c, denote consumption levels in the first and the second period, respectively. Denote by x the amount saved by the consumer in the first period (so that c, = w - x and c, = x), and let Xo be the optimal value of x in this problem. We now introduce uncertainty in this economy. If the consumer saves an amount x in the first period, his wealth in the second period is given by x + y, where y is distributed according to F(·). In what follows, E['] always denotes the expectation with respect to Fe). Assume that the Bernoulli utility function over realized wealth levels in the two periods (w" w,) is u( w,) + v( w,). Hence, the consumer now solves Max u(w - x)

+ E[v(x + y)].

°

+ y,

and y E R.

(e) lim,_, (x'-'/(1 - p)) = In x for all x> 0.

B 6.C.13 Assume that a firm is risk neutral with respect to profits and that if there is any uncertainty in prices, production decisions are made after the resolution of such uncertainty. Suppose that the firm faces a choice between two alternatives. In the first, prices are uncertain. In the second, prices are nonrandom and equal to the expected price vector in the first alternative. Show that a firm that maximizes expected profits will prefer the first alternative Over the second. B 6.C.14 Consider two risk-averse decision makers (i.e., two decision makers with concave Bernoulli utility functions) choosing among monetary 10lleries. Define the utility function u'(·) to be strongly more risk averse than u(·) if and only if there is a positive constant k and a nonincreasing and concave function v(·) such that u'(x) = ku(x) + v(x) for all x. The monetary amounts are restricted to lie in the interval [0,,]. (a) Show that if u,(·) is strongly more risk averse than u(·), then u'(·) is more risk averse than u(-) in the usual Arrow-Prall sense.

(b) Show that if u(·) is bounded, then there is no u'(·) other than u'(·) = ku(') + c, where e is a constant, that is strongly more risk averse than u(-) on the entire interval [0, +00]. [Him: in this part, disregard the assumption that the monetary amounts are restricted to lie in the interval [0, , ].] (e) Using (b), argue that the concept of a strongly more risk-averse utility function is stronger (i.e., more restrictive) than the Arrow-Pratt concept of a more risk-averse utility function.

x ERe I S E S

211

212

CHAPTER

.:

CHOICE

UNDER

UNCERTAINTY

EXERCISES

6.C.lSA Assume that, in a world with uncertainty, there are two assets. The first is a riskless asset that pays I dollar. The second pays amounts a and b with probabilities of" and I - ", respectively. Oenote the demand for the two assets by (x" x,). .. Suppose that a decision maker's preferences satisfy the axioms of expected utl~lty theory and that he is a risk averter. The decision maker's wealth is I, and so are the pnces of the assets. Therefore, the decision maker's budget constraint is given by

x, + x, = I, x"

X, E

[0, I].

risky assets. The return per unit invested on the riskless asset is r. The returns of the risky assets are jointly normally distributed random variables with means 11 = (11" ... ,11 .• ) and variance-covariance matrix V. Assume that there is no redundancy in the risky assets, so that V is of full rank. Oerive the demand function for these N + I assets. A

6.C.20 Consider a lottery over monetary outcomes that pays x + • with probability t and x - " with probability t. Compute the second derivative of this lottery's certainty equivalent with respect to E. Show that the limit of this derivative as" _ 0 is exactly -~,(x).

(a) Give a simple necessary condition (involving a and b only) for the demand for the

riskless asset to be strictly positive. (b) Give a simple necessary condition (involving a, b. and" only) for the demand for the risky asset to be strictly positive.

In the next three parts, assume that the conditions obtained in (a) and (b) are satisfied. (c) Write down the first-order conditions for utility maximization in this asset demand problem. (d) Assume that a

< I. Show by analyzing the first-order conditions that dx,/da :s; O.

(e) Which sign do you conjecture for dx,/d,,? Give an economic interpretation.

(f) Can you prove your conjecture in (e) by analyzing the first-order conditions?

6.C.16A An individual has Bernoulli utility function u(·) and initial wealth w. Let lottery L offer a payoff of G with probability p and a payoff of B with probability I - p.

6.0.1' The purpose of this exercise is to prove Proposition 6.0.1 in a two-dimensional probability simplex. Suppose that there are three monetary outcomes: I dollar, 2 dollars, and 3 dollars. Consider the probability simplex of Figure 6.B.I(b). (a) For a given lottery L over these outcomes, determine the region of the probability simplex in which lie the lotteries whose distributions first-order stochastically dominate the distribution of L. (b) Given a lottery L, determine the region of the probability simplex in which lie the lotteries L' such that F(x) :s; G(x) for every x, where F(') is the distribution of L' and G(') is the distribution of L. [Notice that we get the same region as in (a).]

6.0.2A Prove that if F(') first-order stochastically dominates G(·), then the mean of x under F('), Jx dF(x), exceeds that under G('), Jx dG(x). Also provide an example where Jx dF(x) > Jx dG(x) but F(·) does not first-order stochastically dominate G('). 6.0.3' Verify that if a distribution G(·) is an elementary increase in risk from a distribution F( .), then F(') second-order stochastically dominates G(·).

(a) If the individual owns the lottery, what is the minimum price he would sell it for? (b) If he does not own it, what is the maximum price he would be willing to pay for it? (e) Are buying and selling prices equal? Give an economic interpretation for your ~nswer. Find conditions on the parameters of the problem under which buying and selling pnces are equal.

6.D.4' The purpose of this exercise is to verify the equivalence of the three statements of Proposition 6.D.2 in a two-dimensional probability simplex. Suppose that there are three monetary outcomes: 1,2, and 3 dollars. Consider the probability simplex in Figure 6.B.I(b).

= Jx. Compute the buying and selling prices for

(a) If two lotteries have the same mean, what are their positions relative to each other in the probability simplex.

(d) Let G = 10, B = 5, w = 10, and u(x) this lottery and this utility function.

6.C.17 B Assume that an individual faces a two-period portfolio allocation problem. In period t = 0, I, his wealth w, is to be divided between a safe asset with return R and a risky asset with return x. The initial wealth at period 0 is WOo Wealth at period t = 1,2 depends on the portfolio cx, _, chosen at period t - I and on the return x, realized at period t, according to

w,

= «I - cx,_,)R

+ cx,_,x,)w,_,.

w,.

The objective of this individual is to maximize the expected utility of terminal wea~t~ Assume that x and x are independently and identically distributed. Prove that the mdlVldual optimally sets ~ -,,' if his utility function exhibits constant relative risk aversion. Show also that this fails toO hold' if his utility function exhibits constant absolute risk aversion. 6.C.lS B Suppose that an individual has a Bernoulli utility function u(x) =

Jx.

(a) Calculate the Arrow-Pratt coefficients of absolute and relative risk aversion at the level of wealth w = 5.

(b) Calculate the certainty equivalent and the probability premium for a gamble (16,4;!, i). (c) Calculate the certainty equivalent and the probability premium for a gamble (36, 16; t, t). Compare this result with the one in (b) and interpret 6.C.19c Suppose that an individual has a Bernoulli utility function u(x) = -e-= where a> O. His (nonstochastic) initial wealth is given by w. There is one riskless asset and there are N

(b) Given a lottery L, determine the region of the simplex in which lie the lotteries L' whose distributions are second-order stochastically dominated by the distribution of L. (c) Given a lottery L, determine the region of the simplex in which lie the lotteries L' whose distributions are mean preserving spreads of L. (d) Given a lottery L, determine the region of the simplex in which lie the lotteries L' for which condition (6.0.2) holds, where F(·) and G(·) are, respectively, the distributions of Land L'.

Notice that in (b), (c), and (d), you always have the same region. 6.E.I B The purpose of this exercise is to show that preferences may not be transitive in the presence of regret. Let there be S states of the world, indexed by s = I, ... , S. Assume that state s occurs with probability",. Oefine the expected regret associated with lottery x = (x" ... , xs) relative to lottery x' = (x;, ... , xs) by s L ",h(Max {O, x,n,

x; -

where h(') is a given increasing function. [We call h(·) the regret valuation function; it measures the regret the individual has after the state of nature is known.] We define x to be at least as good as x' in the presence of regret if and only if the expected regret associated with x relative to x' is not greater than the expected regret associated with x' relative to x.

213

214

C HAP T E R

6;

CH

a ICE

Suppose that S lotteries:

= 3,

UNO E RUN C E R T A I N T Y

IT,

= IT, = n] = t

and h(-,)

x=(0,-2,

= J~.

Consider the following three

I),

x'=(O,

2,-2),

= (2,

-3, -I).

x"

EXERCISES

(a) Prove that if P consists of only one belief. then Vw and V. are derived from a von Neumann-Morgenstern utility function and tbat Vw(R) > Vw(H) if and only if V.(R) < U.(H).

Show that the preference ordering over these three lotteries is not transitive.

6,E.ZA Assume that in a world with uncertainty there are two possible states of nature (s = 1,2) and a single consumption good. There is a single decision maker whose preferences over 101leries satisfy the axioms of expected utility theory and who is a risk averter. For simplicity, we assume that utility is state· independent. Two contingent commodities are available to the decision maker. The first (respectively, the second) pays one unit of the consumption good in state s = I (respectively s = 2) and zero otherwise. Denote the vector quantities of the two contingent commodities by (Xl' X2). (a) Show that the preference relation of the decision maker on (x" x,) is convex. (b) Argue that the decision maker is also a risk averter when choosing between lotteries whose outcomes are vectors (Xl' .'(2). (c) Show that the Walrasian demand functions for x, and x, are normal. 6,E.3 B Let g: S - R, be a random variable with mean E(g) = I. For" E (0, I), define a new random variable g': S _ R, by g'(s) = "g(s) + (I - ,). Note that E(g') = I. Denote by G(-) and G'(') the distribution functions of g(') and g'('), respectively, Show that G'(') second-order stochastically dominates G(·). Interpret. 6,F,1 B Prove that in the subjective expected utility theorem (Proposition 6.F.2), the obtained utility function u(,) on money is uniquely determined up to origin and scale. That is, if both u(·) and Ii(·) satisfy the condition of the theorem, then there exist {3 > G and Y E R such that !lex) = {3u(x) + y for all x. Prove also that the subjective probabilities are uniquely determined. 6,F,ZA The purpose of this exercise is to explain the outcomes of the experiments described in Example 6.F.I by means of the theory of nonunique prior beliefs of Gilboa and Schmeidler (1989). We consider a decision maker with a Bernoulli utility function u(,) defined on {G, lOoo}. We normalize u(·) so that u(O) = G and u(lOOO) = I. The probabilistic belief that the decision maker might have on the color of the H-ball being white is a number n E [0, I]. We assume that the decision maker has, not a single belief but a set of beliefs given by a subset P of [0, I]. The actions that he may take are denoted R or H with R meaning that he chooses the R-ball and H meaning that he chooses the H-ball. As in Example 6.F.I, the decision maker is faced with two different choice situations. In choice situation IV, he receives 1000 dollars if the ball chosen is white and dollars otherwise. In choice situation B, he receives 1000 dollars if the ball chosen is black and dollars otherwise. For each of the two choice situations. define his utility function over the actions Rand H in the following way:

°

For situation IV, V w : {R, H} - R is defined by Vw(R) =.49

and

Vw(H) = Min

{n:

IT E

Pl.

For situation B, V.: {R, H} - R is defined by V.(R) = .51

and

t:i ___________________________________________

V.(H) = Min {(I - n):" E

Pl.

Namely, his utility from choice R is the expected utility of 1000 dollars with the (objective) probability calculated from the number of white and black balls in urn R. However, his utility from choice H is the expected utility of 1000 dollars with the probability associated with the most pessimistic belief in P.

°

(b)

Find a seL P for which Uw(R) > Vw(H) and V.IR) > U.(H).

215

PAR T

TWO

Game Theory

In Part I, we analyzed individual decision making, both in abstract decision problems and in more specific economic settings. Our primary aim was to lay the groundwork for the study of how the simultaneous behavior of many self-interested individuals (including firms) generates economic outcomes in market economies. Most of the remainder of the book is devoted to this task. In Part II, however, we study in a more general way how multi person interactions can be modeled. A central feature of multiperson interaction is the potential for the presence of strategic interdependence. In our study of individual decision making in Part I, the decision maker faced situations in which her well-being depended only on the choices she made (possibly with some randomness). In contrast, in multiperson situations with strategic interdependence, each agent recognizes that the payoff she receives (in utility or profits) depends not only on her own actions but also on the actions of other individuals. The actions that are best for her to take may depend on actions these other individuals have already taken, on those she expects them to be taking at the same time, and even on future actions that they may take, or decide not to take, as a result of her current actions. The tool that we use for analyzing settings with strategic interdependence is noncooperative game theory. Although the term "game" may seem to undersell the theory's importance, it correctly highlights the theory's central feature: The agents under study are concerned with strategy and winning (in the general sense of utility or profit maximization) in much the same way that players of most parlor games are. M ultiperson economic situations vary greatly in the degree to which strategic interaction is present. In settings of monopoly (where a good is sold by only a single firm; see Section 12.B) or of perfect competition (where all agents act as price takers; see Chapter 10 and Part IV), the nature of strategic interaction is minimal enough that our analysis need not make any formal use of game theory.' In other settings, however, such as the analysis of oligopolistic markets (where there is more than one I. However. we could well do so in both cases; see, for example, the proof of existence of

competitive equilibrium in Chapler 17. Appendix B. Moreover, we shall stress how perfect competition can be viewed usefully as a limiting case of oligopolistic strategic interaction; see, for

example. Section 12.F. 217

218

PAR T

I I:

GAM E

C HAP

THE 0 R Y

but still not many sellers of a good; see Sections 12.C to 12.G), the central role of strategic interaction makes game theory indispensable for our analysis. Part II is divided into three chapters. Chapter 7 provides a short introduction to the basic elements of noncooperative game theory, including a discussion of exactly what a game is, some ways of representing games, and an introduction to a central concept of the theory, a player's strategy. Chapter 8 addresses how we can predict outcomes in the special class of games in which all the players move simultaneously, known as simultalleous-move games. This restricted focus helps us isolate some central issues while deferring a number of more difficult ones. Chapter 9 studies dynamic games in which players' moves may precede one another, and in which some of these more difficult (but also interesting) issues arise. Note that we have used the modifier noncooperative to describe the type of game theory we discuss in Part II. There is another branch of game theory, known as cooperative game theory, that we do not discuss here. In contrast with noncooperative game theory, the fundamental units of analysis in cooperative theory are groups and subgroups of individuals that are assumed, as a primitive of the theory, to be able to attain particular outcomes for themselves through binding cooperative agreements. Cooperative game theory has played an important role in general equilibrium theory, and we provide a brief introduction to it in Appendix A of Chapter 18. We should emphasize that the term noncooperative game theory does not mean that noncooperative theory is incapable of explaining cooperation within groups of individuals. Rather, it focuses on how cooperation may emerge as rational behavior in the absence of an ability to make binding agreements (e.g., see the discussion of repeated interaction among oligo po lists in Chapter 12). Some excellent recent references for further study of noncooperative game theory are Fudenberg and Tirole (1991), Myerson (1992), and Osborne and Rubinstein (1994), and at a more introductory level Gibbons (1992) and Binmore (1992). Kreps (1990) provides a very interesting discussion of some of the strengths and weaknesses of the theory. Von Neumann and Morgenstern (1944), Luce and Raiffa (1957), and Schelling (1960) remain classic references.

Basic Elements of

7

Noncooperative Games

Introduction In this chapter, we begin our study of noncooperative game theory by introducing some of its basic building blocks. This material serves as a prelude to our analysis of games in Chapters 8 and 9. Section 7.B begins with an informal introduction to the concept of a game. It describes the four basic elements of any setting of strategic interaction that we must know to specify a game. In Section 7.C, we show how a game can be described by means of what is called its extensive form representation. The extensive form representation provides a very rich description of a game, capturing who moves when, what they can do, what they know when it is their turn to move, and the outcomes associated with any collection of actions taken by the individuals playing the game. In Section 7.D, we introduce a central concept of game theory, a player's strategy. A player's strategy is a complete contingent plan describing the actions she will take in each conceivable evolution of the game. We then show how the notion of a strategy can be used to derive a much more compact representation of a game, known as its normal (or strategic) form representatioll. In Section 7.E, we consider the possibility that a player might randomize her choices. This gives rise to the notion of a mixed strategy.

REFERENCES Binmore. K. (1992). Fun and Games:A Text on Game Theory. Lexington, Mass.: D. C. Heath. Fudenberg, D., and 1. Tirole. (1991). Game Theory. Cambridge, Mass.: MIT Press. Gibbons, R. (1992). Game Theory for Applied Economists. Princeton, N.J.: Princeton University Press. Kreps. D. M. (1990). Game Theory and Economic Modeling. New York: Oxford University Press. Lucc, R. D., and H. Raiffa. (1957). Games and Decisions: Introduction and Critical Survey. New York: Wiley. Myerson, R. B. (1992). Game Theory: Analysis of Conflict. Cambridge, Mass.: Harvard University Press. Osborne, M. J., and A. Rubinstein. (1994). A Course in Game Theory, Cambridge, Mass.: MIT Press. Schelling. T. (1960). The Strafegy of Conflicc. Cambridge, Mass.: Harvard University Press. Von Neumann. 1., and O. Morgenstern. (1944). The Theory of Games and Economic Behavior. Princeton. N.J.: Princeton University Press.

T E R

j7.B What Is a Game? :'!',

A game is a formal representation of a situation in which a number of individuals interact in a setting of strategic interdependence. By that, we mean that each individual's welfare depends not only on her own actions but also on the actions of the other individuals. Moreover, the actions that are best for her to take may depend on what she expects the other players to do. To describe a situation of strategic interaction, we need to know four things: (i) The players: (ii) The rules:

Who is involved? Who moves when? What do they know when they move'> What can they do'! 219

220

CHAPTER

"

BASIC

ELEMENTS

OF

NONCOOPERATIVE

GAMES

SEC T ION

(iii) The aulcames: For each possible set of actions by the players, what is the outcome of the game? (iv) The payoffs: What are the players' preferences (i.e., utility functions) over the possible outcomes?

7. C,

THE

EXT ENS lYE

FOR M

REP RES E N T A TI 0 N

In later references to Matching Pennies and Tick-Tack-Toe, we assume that each player'S payoff is simply equal to the amount of money she gains or loses. Note that in both examples. the actions that maximize a player's payoff depend on what she expects her opponent to do. Examples 7.B.1 and 7.B.2 involve situations of pure conflict: What one player wins, the other player loses. Such games are called zero-sum games. But strategic interaction and game theory are not limited to situations of pure or even partial conflict. Consider the situation in Example 7.B.3.

We begin by considering items (i) to (iii). A simple example is provided by the school-yard game of Malching Pennies. Example 7.B.I: Malching Pennies. Items (i) to liii) are as follows: There are two players, denoted I and 2. Each player simultaneously pul S a penny down, either heads up or tails up. aU/comes: If the two pennies match (either both heads up or both tails up), player 1 pays 1 dollar to player 2; otherwise, player 2 pays 1 dollar to player I. _ Players: Rules:

Example 7.B.3: Meeling in New York. Items (i) to (iv) are as follows: Two players, Mr. Thomas and Mr. Schelling. The two players are separated and cannot communicate. They are supposed to meet in New York City at noon for lunch but have forgotten to specify where. Each must decide where to go (each can make only one choice). aUlcomes: If they meet each other, they get to enjoy each other's company at lunch. Otherwise, they must eat alone. Payoffs: They each attach a monetary value of 100 dollars to the other's company (their payoffs are each 100 dollars if they meet, 0 dollars if they do not). Players: Rules:

Consider another example, the game of Tick- Tack- Toe. Example 7.B.2: Tick-Tack-Toe. Items (i) to (iii) are as follows: There are two players, X and O. The players are faced with a board that consists of nine squares arrayed with three rows of three squares each stacked on one another (see Figure 7.B.1). The players take turns putting their marks (an X or an 0) into an as-yet-unmarked square. Player X moves first. Both players observe all choices previously made. aUlcomes: The first player to have three of her marks in a row (horizontally, vertically, or diagonally) wins and receives 1 dollar from the other player. If no one succeeds in doing so after all nine boxes are marked, the game is a tie and no payments are made or received by either player. _ Players: Rules:

In this example, the two players' interests are completely aligned. Their problem is simply one of coordination. Nevertheless, each player's payoff depends on what the other player does; and more importantly, each player's optimal action depends on what he thinks the other will do. Thus, even the task of coordination can have a strategic nature. _ Although the information given in items (i) to (iv) fully describe a game, it is useful for purposes of analysis to represent this information in particular ways. We examine one of these ways in Section 7.C.

To complete our description of these two games, we need to say what the players' preferences are over the possible outcomes [item (iv) in our list]. As a general matter, we describe a player's preferences by a utility function that assigns a utility level for each possible outcome. It is common to refer to the player's utility function as her payoff Junclian and the utility level as her payoff. Throughout, we assume that these utility functions take an expected utility form (see Chapter 6) so that when we consider situations in which outcomes are random, we can evaluate the random prospect by means of the player's expected utility.

The Extensive Form Representation of a Game If we know the items (i) to (iv) described in Section 7.B (the players, the rules, the

FIgure 7.B.l

A Tick-Tack-Toe board.

outcomes, and the payoffs), then we can formally represent the game in what is called its eXlensive Jorm. The extensive form captures who moves when, what actions each player can take, what players know when they move, what the outcome is as a function of the actions taken by the players, and the players' payoffs from each possible outcome. We begin by informally introducing the elements of the extensive form representation through a series of examples. After doing so, we then provide a formal specification of the extensive form (some readers may want to begin with this and then return to the examples). The extensive form relies on the conceptual apparatus known as a game tree. As our starting point, it is useful to begin with a very simple variation of Matching Pennies, which we call Matching Pennies Version B. Example 7.C.l: Matching Pennies Version B and Its Extensive Form. Matching Pennies Version B is identical to Matching Pennies (see Example 7.B.1) except

0 FAG A M E

221

222

CHAPTER

7:

BASIC

ELEMENTS

OF

NONCOOPERATIVE

GAMES

SECTION

7.e:

THE

EXTENSIVE

FORM

REPRESENTATION

OF

A

GA.Me

Player I

~Lower.right

/

Terminal nodes

Player I's paYOff) ~ (-I) ( Player 2's Payoff +I

that the two players move sequentially, rather than simultaneously. In particular, player I puts her penny down (heads up or tails up) first. Then, after seeing player I's choice, player 2 puts her penny down. (This is a very nice game for player 2!) The extensive form representation of this game is depicted in Figure 7.CI. The game starts at an initial decision node (represented by an open circle), where player I makes her move, deciding whether to place her penny heads up or tails up. Each of the two possible choices for player I is represented by a branch from this initial decision node. At the end of each branch is another decision node (represented by a solid dot), at which player 2 can choose between two actions, heads up or tails up, after seeing player I's choice. The initial decision node is referred to as player 1's decision node; the latter two as player 2's decision nodes. After player 2's move, we reach the end of the game, represented by terminal nodes. At each terminal node, we list the players' payoffs arising from the sequence of moves leading to that terminal node. Note the treelike structure of Figure 7.CI: Like an actual tree, it has a unique connected path of branches from the initial node (sometimes also called the root) to each point in the tree. This type of figure is known as a game tree. _ Example 7.C.2: The Extensive Form of Tick- Tack- Toe. The more elaborate game tree shown in Figure 7.C2 depicts the extensive form for Tick-Tack-Toe (to conserve space, many parts are omitted). Note that every path through the tree represents a unique sequence of moves by the players. In particular, when a given board position (such as the two left corners filled by X and the two right corners filled by 0) c!ln be reached through several different sequences of moves, each of these sequences is depicted separately in the game tree. Nodes represent not only the current position but also how it was reached. _ In both Matching Pennies Version B and Tick-Tack-Toe, when it is a player's turn to move, she is able to observe all her rival's previous moves. They are games of perfect information (we give a precise definition of this term in Definition 7.CI). The concept of an information set allows us to accommodate the possibility that this is not so. Formally, the elements of an information set are a subset of a particular player's decision nodes. The interpretation is that when play has reached one of the decision nodes in the information set and it is that player's turn to move, she does

Figure 7.C.1

Extensive form for Matching Pennies Version B.

Corner

...--Player X Middle Square Player 0 ~Lower-center

Square

Player X ~Lower-Ieft

-

Corner X gets three in a row (terminal node)

(~~) (~:: ~:;~:: ) not know which of these nodes she is actually at. The reason for this ignorance is that the player does not observe something about what has previously transpired in the game. A further variation of Matching Pennies, which we call Matching Pennies Version C, helps make this concept clearer. Example 7.C.3: Matching Pennies Version C and Its Extensive Form. This version of Matching Pennies is just like Matching Pennies Version B (in Example 7.CI) except that when player I puts her penny down, she keeps it covered with her hand. Hence, player 2 cannot see player I's choice until after player 2 has moved. The extensive form for this game is represented in Figure 7.C3. It is identical to Figure 7.C.1 except that we have drawn a circle around player 2's two decision nodes to indicate that these two nodes are in a single information set. The meaning of this information set is that when it is player 2's turn to move, she cannot tell which of these two nodes she is at because she has not observed player I's previous move. Note that player 2 has the same two possible actions at each of the two nodes in her information set. This must be the case if player 2 is unable to distinguish the two nodes; otherwise, she could figure out which move player I had taken simply by what her own possible actions are. In principle, we could also associate player I's decision node with an information set. Because player I knows that nothing has happened before it is her turn to move, this information set has only one member (player I knows exactly which node she is at when she moves). To be fully rigorous, we should therefore also draw an information set circle around player I's decision node in Figure 7.C.3. It is common, however, to

Figure 7.C.2

Part of the extensive form for

Tick·Tack·Toe.

223

224

C HAP T E R

7:

BAS I C E L E MEN T S

0 F

NON COO PER A T I V EGA M E S

SECTION

7.C:

THE

EXTENSIVE

FORM

REPRESENTATION

OF

A

GAME

225

Player I

Figure 7.C.3

rspayoff) ( 2's Payoff

~

(-I) + I

simplify the diagrammatic depiction of a game in extensive form by not drawing the information sets that contain a single node. Thus. any uncircled decision nodes are understood to be elements of singleton information sets. In Figures 7.CI and 7.C2. for example, every decision node belongs to a singleton information set. _ A listing of all of a player's information sets gives a listing, from the player's perspective, of all of the possible distinguishable "events" or "circumstances" in which she might be called upon to move. For example, in Example 7.C.l, from player 2's perspective there are two distinguishable events that might arise in which she would be called upon to move, each one corresponding to play having reached one of her two (singleton) information sets. By way of contrast, player 2 foresees only one possible circumstance in which she would need to move in Example 7.C3 (this circumstance is, however, certain to arise). In Example 7.C3, we noted a natural restriction on information sets: At every node within a given information set, a player must have the same set of possible actions. Another restriction we impose is that players possess what is known as perfect recall. Loosely speaking, perfect recall means that a player does not forget what she once knew, including her own actions. Figure 7.C4 depicts two games in which this condition is not met. In Figure 7.C4(a), as the game progresses, player 2 forgets a move by player I that she once knew (namely, whether player I chose t or r). In Figure 7.C.4(b), player I forgets her own previous move.' All the games we consider in this book satisfy the property of perfect recall. The use of information sets also allows us to capture play that is simultaneous rather than sequential. This is illustrated in Example 7.C4 for the game of (standard) Matching Pennies introduced in Example 7.B.1. L In terms of the formal specification of the extensive form given later in this section, if we denote the information set containing decision node x by H(x), a game is formally characterized as one of perfect recall if the following two conditions hold: (i) If H(x) = H(x'), x is neither a predecessor nor a successor of x'; and (ii) if x and x' are two decision nodes for player i with H(x) = H(x'), and if x" is a predecessor of x (not necessarily an immediate one) that is also in one of player i's information sets, with a" being the action at H(x") on the path to x. then there must be a predecessor node to x' that is an element of H(x") and the action at this predecessor node that is on the path to x' must also be a".

Extensive form for Matching Pennies Version C.

(a)

Figure 7.C.4 (b)

Example 7,C.4: The Extensive Form for Matching Pennies. Suppose now that the players put their pennies down simultaneously. For each player, this game is strategically equivalent to the Version C game. In Version C, player 1 was unable to observe player 2's choice because player I moved first, and player 2 was unable to observe player I's choice because player I kept it covered; here each player is unable to observe the other's choice because they move simultaneously. As long as they cannot observe each other's choices, the timing of moves is irrelevant. Thus, we can use the game tree in Figure 7.C3 to describe the game of (standard) Matching Pennies. Note that by this logic we can also describe this game with a game tree that reverses the decision nodes of players I and 2 in Figure 7.C3. _ We can now return to the notion of a game of perfect information and offer a formal definition.

Two games not satisfying perfect recall.

226

CHAPTER

7:

BASIC

ELEMENTS

OF

NONCOOPERATIVE

SECTION

GAMES

7.C:

THE

EXTENSIVE

FORM

REPRESENTATION

(i) A finite set of nodes !{, a finite set of possible actions d, and a finite set of players {I, ... , I}. (ii) A function p:!{ -+ {:f u 0} specifying a single immediate predecessor of each node x; pix) is nonempty for all x E !( but one, designated as the initial node Xa' The immediate successor nodes of x are then sex) = p- t(x), and the set of all predecessors and all successors of node x can be found by iterating pix) and six). To have a tree structure, we require that these sets be disjoint (a predecessor of node x cannot also be a successor to it). The set of terminal nodes is T = (x E:f: sex) = 0}. All other nodes :f\ T are known as decision nodes.

( Flgur.7.C.S

-+11)

Extensive form for Matching Pennies Version D.

Definition 7.C.1: A game is one of perfect information if each information set contains a single decision node. Otherwise, it is a game of imperfect information.

Up to this point, the outcome of a game has been a deterministic function of the players' choices. In many games, however, there is an element of chance. This, too, can be captured in the extensive form representation by including random moves of nature. We illustrate this point with still another variation, Matching Pennies Version D.

Example 7.C.5: Matching Pennies Version D and Its Extensive Form. Suppose that prior to playing Matching Pennies Version B, the two players flip a coin to see who will move first. Thus, with equal probability either player 1 will put her penny down first, or player 2 will. In Figure 7.C.S, this game is depicted as beginning with a move of nature at the initial node that has two branches, each with probability t. Note that this is drawn as if nature were an additional player who must play its two actions with fixed probabilities. (In the figure, H stands for "heads up" and T stands for "tails up".) • It is a basic postulate of game theory that all players know the structure of the game, know that their rivals know it, know that their rivals know that they know it, and so on. In theoretical parlance, we say that the structure of the game is common knowledge [see Aumann (1976) and Milgrom (1981) for discussions of this concept). In addition to being depicted graphically, the extensive form can be described mathematically. The basic components are fairly easily explained and can help you keep in mind the fundamental building blocks of a game. Formally, a game represented in extensive form consists of the following items:' 2. To be a bit more precise about terminology: A collection of items (i) to (vi) is formally known as an extensive game form; adding item (vii), the players' preferences over the outcomes, leads to a game represented in extensive form. We will not make anything of this distinction here. See Kuhn

(t953) or Section 2 of Kreps and Wilson (1982) for additional discussion of this and other points regarding the extensive form.

~-------------------------------------------

~

(1" paYOff) 2's Payoff

(iii) A function <X: !{\{xa } -+ d giving the action that leads to any noninitial node x from its immediate predecessor p(x) and satisfying the property that if x', x" E sex) and x' # x", then <x(x') # <x(x"). The set of choices available at decision node x is c(x) = (a E d: a = <x(x') for some x' E six)}. (iv) A collection of information sets..lf', and a function H::f -+ ..If' assigning each decision node x to an information set H(x) E..If'. Thus, the information sets in ..If' form a partition of '!C. We require that all decision nodes assigned to a single information set have the same choices available; formally, c(x) = c(x') if H(x) = H(x'). We can therefore write the choices available at information set H as C(H) = (a E d: a E c(x) for x E H}. (v) A function .:..If' -+ {O, I, ... , I} assigning each information set in ..If' to the player (or to nature: formally, player 0) who moves at the decision nodes in that set. We can denote the collection of player i's information sets by JI'i = {H E..If': i = .(H)}. (vi) A function p:.JI'(, x sri -+ [0, I] assigning probabilities to actions at information sets where nature moves and satisfying p(H, a) = 0 if a I/o C(H) and L..eC(Hlp(H, a) = 1 for all HE ..If'o. (vii) A collection of payoff functions u = (utC), ... , u,(·)} assigning utilities to the players for each terminal node that can be reached, Ui: T -+ R. As we noted in Section 7.B, because we want to allow for a random realization of outcomes we take each Ui(') to be a Bernoulli utility function. Thus, formally, a game in extensive form is specified by the collection (:f, d, I, p(.), <xC),..If', H(·),.(·), pC), u}.

r. =

We should note that there are three implicit types of finiteness hidden in the formulationjust presented. Because we will often encounter games not sharing these features in the economic

applications discussed in later chapters, we briefly identify them here, although without any formal treatment. The formal definition of an extensive form representation of a game can be extended to these infinite cases without much difficulty. although there can be important differences in the predicted outcomes of finite and infinite economic models. as we shall see

later (e.g., in Chapters 12 and 20). First, we have assumed that players have a finite number of actions available at each decision node. This would rule out a game in which, say, a player can choose any number from some interval [a, b] c R. In fact, allowing for an infinite set of actions requires that we allow for an infinite set of nodes as well. But with this change, items (i) to (vii) remain the basic elements of an extensive form representation (e.g., decision nodes and terminal nodes

are still associated with a unique path through the tree).

OF

A

GAME

227

228

CHAPTER

7:

BASIC

ELEMENTS

OF

NONCOOPERATIVE

GAMES

Second. we have described the extensive form of a game that must end after a finite number of moves (because the set of decision nodes is finite). Indeed. all the examples we have considered so far fall into this category. There are, however. other types of games. For example, suppose that two players with infinite life spans (perhaps two firms) play Matching Pennies repeatedly every January I. The players discount the money gained or lost at future dates with interest rate r and seek to maximize their discounted net gains. In this game, there are

no terminal nodes. Even so, we can still associate discounted payoffs for the two players with every (infinite) sequence of moves the players make. Of course. actually drawing a complete game tree would be impossible, but the basic elements of the extensive form can nonetheless be captured as before (with payoffs being associated with paths through the tree rather than with terminal nodes). Third, we may at times also imagine that there are an infinite number of players who take actions in a game. For example, models involving overlapping generations of players (as in various macroeconomic models) have this feature, as do models of entry in which we want to allow for an infinite number of potential firms. In the games of this type that we consider, this issue can be handled in a simple and natural manner. Note that all three of these extensions require that we relax the assumption that there is a finite set of nodes. Games with a finite number of nodes, such as those we have been considering. are known as finite games.

For pedagogical purposes, we restrict our attention in Part II to finite games except where specifically indicated otherwise. The extension of the formal concepts we discuss here to the economic games studied later in the book that do not share these finiteness properties is straightforward.

7,D Strategies and the Normal Form Representation of a Game A central concept of game theory is the notion of a player's strategy. A strategy is a complete contingent plan, or decision rule, that specifies how the player will act in every possible distinguishable circumstance in which she might be called upon to move. Recall that, from a player's perspective, the set of such circumstances is represented by her collection of information sets, with each information set representing a different distinguishable circumstance in which she may need to move (see Section 7.C). Thus, a player's strategy amounts to a specification of how she plans to move at each one of her information sets, should it be reached during play of the game. This is stated formally in Definition 7.0.1. Definition 7.0.1: Let Jfj denote the collection of player i's information sets, d the set of possible actions in the game, and C(H) c d the set of actions possible at information set H. A strategy for player i is a function Si: Jfj -+ d such that si(H) e C(H) for all He Jf;.

The fact that a strategy is a complete contingent plan cannot be overemphasized, and it is often a source of confusion to those new to game theory. When a player specifies her strategy, it is as if she had to write down an instruction book prior to play so that a representative could act on her behalf merely by consulting that book. As a complete contingent plan, a strategy often specifies actions for a player at information sets that may not be reached during the actual play of the game.

leTION

7.0:

STRATEGIES

AND

THE

NORMAL

FORM

REPRESENTATION

For example, in Tick-Tack-Toe, player O's strategy describes what she will do on her first move if player X starts the game by marking the center square. But in the actual play of the game, player X might not begin in the center: she may instead mark the lower-right corner first, making this part of player O's plan no longer relevant. In fact, there is an even subtler point: A player's strategy may include plans for actions that her own strategy makes irrelevant. For example, a complete contingent plan for player X in Tick-Tack-Toe includes a description of what she will do after she plays "center" and player 0 then plays "lower-right corner," even though her own strategy may call for her first move to be "upper-left corner." This probably seems strange; its importance will become apparent only when we talk about dynamic games in Chapter 9. Nevertheless, remember: A strategy is a complete contingent plan that says what a player will do at each of Iler information sets if she is called on to play there. It is worthwhile to consider what the players' possible strategies are for some of the simple Matching Pennies games. Example 7_D.l: Strategies in Matching Pennies Version B. In Matching Pennies Version B, a strategy for player I simply specifies her move at the game's initial node. She has two possible strategies: She can play heads (H) or tails (T). A strategy for player 2, on the other hand, specifies how she will play (H or T) at each of her two information sets, that is, how she will play if player 1 picks H and how she will play if player I picks T. Thus, player 2 has four possible strategies. Strategy Strategy Strategy Strategy

1 (5,): Play H if player I plays H; play H if player I plays T. 2 (52): Play H if player I plays H; play T if player I plays T. 3 (5,): Play T if player I plays H; play H if player I plays T. 4 (5.): Play T if player I plays H; play T if player I plays T. _

Example 7.D.2: Strategies in Matching Pennies Version C. In Matching Pennies Version C, player l's strategies are exactly the same as in Version B; but player 2 now only has two possible strategies, "play H" and "play T", because she now has only one information set. She can no longer condition her action on player I's previous action. _ We will often find it convenient to represent a profile of players' strategy choices in an I-player game by a vector 5 = (s" ... ,51)' where 5, is the strategy chosen by player i. We will also sometimes write the strategy profile 5 as (5" L,), where L, is the (1 - 1) vector of strategies for players other than i. The Normal Form Representation of a Game Every profile of strategies for the players s = (5" ... ,51) induces an outcome of the game: a sequence of moves actually taken and a probability distribution over the terminal nodes of the game. Thus, for any profile of strategies (5" ... , sIl, we can deduce the payoffs received by each player. We might think, therefore, of specifying the game directly in terms of strategies and their associated payoffs. This second way to represent a game is known as the normal (or strategic) form. It is, in essence, a condensed version of the extensive form.

OF

A

GAME

229 ~

230

CHAPTER

7:

BASIC

ELEMENTS

OF

NONCOOPERATIVE

GAMES

SECTION

s,

s,

H -I, +1 -I, +1 +1, -I +1, -I T +1, -I -I, +1 +1. -I -I, +1

Definition 7.0.2: For a game with I players, the normal form representation r N specifies for each player i a set of strategies S; (with 5; E S;) and a payoff function u;(s" ... ,5,) giving the von Neumann-Morgenstern utility levels associated with the (possibly random) outcome arising from strategies (5" . .. ,5,). Formally, we write r N = [1, IS;}. {u;(·)}].

In fact, when describing a game in its normal form, there is no need to keep track of the specific moves associated with each strategy. Instead, we can simply number the various possible strategies of a player, writing player i's strategy set as S, = {s", 5", . .. } and then referring to each strategy by its number. A concrete example of a game in normal form is presented in Example 7.0.3 for Matching Pennies Version B. Example 7.0.3: The Normal Form of Matching Pennies Version B. We have already described the strategy sets of the two players in Example 7.0.1. The payoff functions are

)= {

+I

if (5 1,S,) = (H, strategies 3 or 4) or (T, strategies I or 3),

- I if

(5 I' 52)

= (H, strategies I or

CHOices

231

s.

Player I

U 1(S\,5 2

RANDOMIZED

Player 1

Player 2

"

7.E:

2) or (T, strategies 2 or 4),

Figure 7.0.1

The normal form of Matching Pennies Version B.

Figure 7.0.2

An extensive form whose normal form is that depicted in Figure 7. D.l.

extensive form in Figure 7.D.2. In the latter game, players move simultaneously, player 1 choosing between two strategies, Land R, and player 2 choosing among four strategies: a, b, c, and d. In terms of their representations in a game box, the only difference between the normal forms for these games lies in the "labels" given to the rows and columns. Because the condensed representation of the game in the normal form generally omits

some of the details present in the extensive form, we may wonder whether this omission is important or whether the normal form summarizes all of the strategically relevant information (as the last paragraph in regular type seems to sugge,I). The question can be put a little differently: Is the scenario in which players simultaneously write down their strategies and submit them to a referee really equivalent to their playing the game over time as described in the extensive form? This question is currently a subject of some controversy among game theorists. The debate centers On issues arising in dynamic games such as those studied in

and U 2(SI' 52) = - UI(SI' 5,). A convenient way to summarize this information is in the "game box" depicted in Figure 7.0.1. The different rows correspond to the strategies of player I, and the columns to those of player 2. Within each cell, the payoffs of the two players are depicted as (U I(5 I, 52)' U,(5 1 , 5,)) . •

Chapter 9. For the simultaneous-move games that we study in Chapter 8, in which all players choose their actions at the same time, the normal form captures all the strategically

Exercise 7.D.2: Depict the normal forms for Matching Pennies Version C and the standard version of Matching Pennies.

relevant information. In simultaneous-move games, a player's strategy is a simple noncontingent choice of an action. In this case, players' simultaneous choice of strategies in the normal form is clearly equivalent to their simultaneous choice of actions in the extensive form

The idea behind using the normal form representation to study behavior in a game is that a player's decision problem can be thought of as one of choosing her strategy (her contingent plan of action) given the strategies that she thinks her rivals will be adopting. Because each player is faced with this problem, we can think of the players as simultaneously choosing their strategies from the sets is,}. It is as if the players each simultaneously write down their strategies on slips of paper and hand them to a referee, who then computes the outcome of the game from the players' submitted strategies. From the previous discussion, it is clear that for any extensive form representation of a game, there is a unique normal form representation (more precisely, it is unique up to any renaming or renumbering of the strategies). The converse is not true, however. Many different

extensive forms may be represented by the same normal form. For example, the normal form shown in Figure 7.0.1 represents not only the extensive form in Figure 7.C.1 but also the

,

~------------------------

(captured there by having players not observing each other's choices).

.E Randomized Choices Up to this point, we have assumed that players make their choices with certainty. However, there is no a priori reason to exclude the possibility that a player could randomize when faced with a choice. Indeed, we will see in Chapters 8 and 9 that in certain circumstances the possibility of randomization can play an important role in the analysis of games. As stated in Definition 7.0.1, a deterministic strategy for player i, which we now call a pure strategy, specifies a deterministic choice s,(H) at each of her information sets HE Jt",. Suppose that player i's (finite) set of pure strategies is S,. One way for

--

232

CHAPTER

7:

BASIC

ELEMENTS

OF

NONCOOPERATIVE

GAMES

EXERCISES

the player to randomize is to choose randomly one element of this set. This kind of randomization gives rise to what is called a mixed strategy. Definition 7.E.1: Given player i's (finite) pure strategy set

large set of pure strategies in S,' she could randomize separately over the possible actions at each of her information sets HE .7f',. This way of randomizing is called a behavior strategy.

S;, a mixed strategy

lor player i, a,: S; - [0, 1], assigns to each pure strategy a,(si) 2': 0 that it will be played, where :L,
Proof: For necessity, note that if either of conditions (i) or (ii) does not hold for some player i, then there are strategies s, E S" and s', E S, such that u,(s'" a -,) > u,(s" (J -,). If so, player i can strictly increase his payoff by playing strategy s; whenever he would have played strategy s,. 4. This approach actually dates to Courno!'s (1838) myopic adjustment procedure. A recent example can be found in Milgrom and Roberts (1990). Interestingly, this work explains the "ultrarational" Nash outcome by relaxing the assumption of rationality. It also can be used to try to identify the likelihood of various Nash equilibria arising when multiple Nash equilibria exist.

Figure 8.0.3

'.0:

NASH

For sufficiency, suppose that conditions (i) and (ii) hold but that (J is not a Nash equilibrium. Then there is some player i who has a strategy (J; with u,(a;, (J _,) > u,(a" a _,). But if so, then there must be some pure strategy s; that is played with positive probability under (J; for which u,(s;, (J _,) > u,(a" a _,). Since u,(a" (J _,) = u,(s" a _,) for all s, E S;, this contradicts conditions (i) and (ii) being satisfied. _

Matching Pennies. Hence, a necessary and sufficient condition for mixed strategy profile a to be a Nash equilibrium of game f" = [I, (illS,)}. {u,(')}J is that each player, given the distribution of strategies played by his opponents, is indifferent among all the pure strategies that he plays with positive probability and that these pure strategies are at least as good as any pure strategy he plays with zero probability. An implication of Proposition 8.D.I is that to test whether a strategy profile (J is a Nash equilibrium it suffices to consider only pure strategy deviations (i.e .• changes in a player's strategy a, to some pure strategy s;). As long as no player can improve his payoff by switching to any pure strategy, (J is a Nash equilibrium. We therefore get the comforting result given in Corollary 8.b.1. Corollary B.O.l: Pure strategy profile 5 = (5" ...• 51 ) is a Nash equilibrium of game fN = [I, (Si}, (u i (·)}] if and only if it is a (degenerate) mixed strategy Nash equilibrium of game fN = [I, {~(S,)}. (u i (·)}]. Corollary 8.D.I tells us that to identify the pure strategy equilibria of game f~ = [I. {~(S,)}, (u,(' )}]. it suffices to restrict attention to the game fN = [I, IS,}, (u,C)}] in which randomization is not permitted. Proposition 8.D.I can also be of great help in the computation of mixed strategy equilibria as Example 8.D.S illustrates. Example 8.0,5: Mixed Strategy Equilibria in the .\leeting in New York Game. Let us try to find a mixed strategy equilibrium in the variation of the Meeting in New York game where the payoffs of meeting at Grand Central are (!Ooo, !Ooo). By Proposition 8.D.!, if Mr. Thomas is going to randomize between Empire State and Grand Central, he must be indifferent between them. Suppose that Mr. Schelling plays Grand Central with probability a,. Then Mr. Thomas' expected payoff from playing Grand Central is lOOOa, + 0(1 - (J,), and his expected payoff from playing Empire State is 100(1 - (J,) + Oa,. These two expected payoffs are equal only when a, = 1/11. Now, for Mr. Schelling to set (J, = 1/11, he must also be indifferent between his two pure strategies. By a similar argument. we find that Mr. Thomas' probability of playing Grand Central must also be 1/11. We conclude that each player going to Grand Central with a probability of 1/11 is a Nash equilibrium. _ Note that in accordance with Proposition 8.D.I, the players in Example 8.D.S ha ve no real preference over the probabilities that they assign to the pure strategies they play with positive probability. What determines the probabilities that each player uses is an equilibrium consideration: the need to make the other player indifferent over his strategies. This fact has led some economists and game theorists to question the usefulness of mixed strategy Nash equilibria as predictions of play. They raise two concerns: First, if players always have a pure strategy that gives them the same expected payoff as their equilibrium mixed strategy. it is not clear why they will bother to randomize.

EQUtLIBRIUM

251

252

CHAPTER

.:

SIMULTANEOUS·MOVE

GAMES

One answer to this objection is that players may not actually randomize. Rather, they may make definite choices that are affected by seemingly inconsequential variables ("signals") that only they observe. For example, consider how a pitcher for a major league baseball team "mixes his pitches" to keep batters guessing. He may have a completely deterministic plan for what he will do, but it may depend on which side of the bed he woke up on that day or on the number of red traffic lights he came to on his drive to the stadium. As a result, batters view the behavior of the pitcher as random even though it is not. We touched briefly on this interpretation of mixed strategies as behavior contingent on realizations of a signal in Section 7.E, and we will examine it in more detail in Section 8.E. The second concern is that the stability of mixed strategy equilibria seems tenuous. Players must randomize with exactly the correct probabilities, but they have no positive incentive to do so. One's reaction to this problem may depend on why one expects a Nash equilibrium to arise in the first place. For example, the use of the correct probabilities may be unlikely to arise as a stable social convention, but may seem more plausible when the equilibrium arises as a self-enforcing agreement. Up to this point, we have assumed that players' randomizations are independent. In the Meeting in New York game in Example 8.0.5, for instance, we could describe a mixed strategy equilibrium as follows: Nature provides private and independently distributed signals (II" II,) E [0, I] x [0, I] to the two players, and each player i assigns decisions to the various possible realizations of his signal II,. However, suppose that there are also public signals available that both players observe. Let 0 E [0, I] be such a signal. Then many new possibilities arise. For example, the two players could both decide to go to Grand Central if II < ! and to Empire State if II ~ t. Each player's strategy choice is still random, but the coordination of their actions is now perfect and they always meet. More importantly, the decisions have an equilibrium character. If one player decides to follow this decision rule, then it is also optimal for the other player to do so. This is an example of a correlated equilibrium [due to Aumann (1974»). More generally, we could allow for correlated equilibria in which nature's signals are partly private and partly pUblic. Allowing for such correlation may be important because economic agents observe many public signals. Formally, a correlated equilibrium is a special case of a Bayesian Nash equilibrium, a concept that we introduce in Section 8.E; hence, we defer further discussion to the end of that section.

Existence of Nash Equilibria Does a Nash equilibrium necessarily exist in a game? Fortunately, the answer turns out to be "yes" under fairly broad circumstances. Here we describe two of the more important existence results; their proofs, based on mathematical fixed point theorems, are given in Appendix A of this chapter. (Proposition 9.B.l of Section 9.B provides another existence result.) Proposition 8.0.2: Every game rN = [I, (dIS,)}, (u,(·)}] in which the sets S" .. . , SI have a finite number of elements has a mixed strategy Nash equilibrium. Thus, for the class of games we have been considering, a Nash equilibrium always exists as long as we are willing to accept equilibria in which players randomize. (If you want to be convinced without going through the proof, try Exercise 8.0.6.) Allowing

---------------------~- ~---

".CTION

I.E:

GAMES

OF

INCOMPLETE

INFORMATION:

BAYESIAN

NASH

EQUILIBRIUM

253

for randomization is essential for this result. We have already seen in (standard) Matching Pennies. for example. that a pure strategy equilibrium may not exist in a game with a finite number of pure strategies. Up to this point, we have focused on games with finite strategy sets. However. in economic applications, we frequently encounter games in which players have strategies naturally modeled as continuous variables. This can be helpful for the existence of a pure strategy equilibrium. [n particular. we have the result given in Proposition 8.0.3. Proposition 8.0.3: A Nash equilibrium exists in game i= 1, ... , I,

rN =

[I, {Sj}, (U j(·)}] if for all

(i) Sj is a nonempty, convex, and compact subset of some Euclidean space RM (ii) Uj (5" . .. ,51) is continuous in (5" . .. ,51 ) and quasiconcave in Sj. Proposition 8.0.3 provides a significant resuJt whose requirements are satisfied in a wide range of economic applications. The convexity of strategy sets and the nature of the payoff functions help to smooth out the structure of the model, allowing us to achieve a pure strategy equilibrium.' Further existence results can also be established. [n situations where quasi· concavity of the payoff functions u;(·) fails but they are still continuous. existence of a mixed strategy equilibrium can still be demonstrated. In fact, even if continuity of the payoff functions fails to hold, a mixed strategy equilibrium can be shown to exist in a variety of cases [see Dasgupta and Maskin (1986)]. Of course, these results do not mean that we cannot have an equilibrium if the conditions of these existence results do not hold. Rather, we just cannot be assured that there is one.

E Games of Incomplete Information: Bayesian Nash Equilibrium Up to this point, we have assumed that players know all relevant information about each other, including the payoffs that each receives from the various outcomes of the game. Such games are known as games of complete information. A moment of thought, however, should convince you that this is a very strong assumption. Do two firms in an industry necessarily know each other's costs? Does a firm bargaining with a union necessarily know the disutility that union members will feel if they go out on strike for a month? Clearly, the answer is "no." Rather, in many circumstances, players have what is known as incomplete information. The presence of incomplete information raises the possibility that we may need to consider a player's beliefs about other players' preferences, his beliefs about their beliefs about his preferences, and so on, much in the spirit of rationalizability· 5. Note that a finite strategy set Sf cannot be convex. In fact, the use of mixed strategies in Proposition 8.D.2 helps us to obtain existence of equilibrium in much the same way that Proposition 8.0.3'5 assumptions assure existence of a pure strategy Nash equilibrium: It convexities

players' strategy sets and yields well· behaved payoff [unctions. (See Appendix A [or details.) 6. For more on this problem. see Mertens and Zamir (1985).

-

GAMES

254

CHAPTER

I:

SIMULTANEOUS-MOVE

OF

INCOMPLETE

INFORMA.TION:

BAYESIAN

NASH

GAMES

indicates what action he will take. Hence. prisoner 2 now has four possible pure strategies:

Fortunately, there is a widely used approach to this problem, originated by Harsanyi (\ 967-68), that makes this unnecessary. In this approach, one imagines that each player's preferences are determined by the realization of a random variable .. Although the random variable's actual realization is observed only by the player, Its ex ante probability distribution is assumed to be common knowledge among all the players. Through this formulation, the situation of incomplete information IS reinterpreted as a game of imperfect information: Nature makes the first move, choosing realizations of the random variables that determine each player's preference type, and each player observes the realization of only his own random variable. A game of this sort is known as a Bayesian game.

(confess if type I, confess if type II); (confess if type I, don't confess if type II); (don't confess if type I, confess if type II); (don't confess if type I, don't confess if type 11). Notice, however, that player I does not observe player 2's type, and so a pure strategy for player I in this game is simply a (noncontingent) choice of either "confess" or "don't confess." _

Example 8.E.I: Consider a modification of the DA's Brother game discussed in Example 8.B.3. With probability J.I, prisoner 2 has the preferences In Figure 8.B.4 (we call these type / preferences), while with probability (I - J.I), prisoner 2 hates to rat on his accomplice (this is type /I). In this case, he pays a psychic penalty equal to 6 years in prison for confessing. Prisoner I, on the other hand, alw~ys has the preferences depicted in Figure 8.B.4. The extensive form of this Bayesian game IS represented in Figure 8.E.I (in the figure, "C" and "DC" stand for "confess" and "don't confess" respectively). In this game, a pure strategy (a complete contingent plan) for player 2 can be viewed as a function that for each possible realization of his preference type

Formally, in a Bayesian game, each player i has a payoff function u,(s" L" 0,), where 0, E 0, is a random variable chosen by nature that is observed only by player i. The joint probability distribution of the O,'s is given by F(lJ l , •.• , 0,), which is assumed to be common knowledge among the players. Letting 0 = 0 1 X . . . x 0" a Bayesian game is summarized by the data [L IS,:, (u,(-)}, 0, F(-»). A pure strategy for player i in a Bayesian game is a function s,(II,), or decision rule, that gives the player's strategy choice for each realization of his type 11,. Player i's pure strategy set Yo is therefore the set of all such functions. Player i's expected payoff given a profile of pure strategies for the / players (Sl(·)"'. ,s,(·» is then given by (8.E.l) Figure 8.E.1

The DA's Brolher game with incomplete information.

We can now look for an ordinary (pure strategy) Nash equilibrium of this game of imperfect information, which is known in this context as a Bayesian Nash equilibrium. 7 Definition a.E.1: A (pure strategy) Bayesian Nash equilibrium for the Bayesian game [I, lSi}, (u i (·)},0,F(·)] is a profile of decision rules (s,(·), ... ,s,(·» that constitutes a Nash equilibrium of game rN = [I, (Yo), (u i (·»)]. That is, for every i = 1, ... ,/, Ui(s/(·). L/('» 2'0 u;(s,(·), L i ('))

for all si(')

(::) ~ (=!) (=:0) (=:0) (-~)

(=:1) (=:0)

I

Simultaneous-Move Game: Prisoner 2 C DC

Simultaneous-Move Game: Prisoner 2

DC Prisoner

I

DC C

C

0, -2

-10, -I

-I, -10

-5, -5

(=~O)

Prisoner

I

DC C

0, -2

-10, -7

-I, -10

-5, -11

C~)

E

Yj. where Ui(Si('), Li(')) is defined as in (S.E.1).

A very useful point to note is that in a (pure strategy) Bayesian Nash equilibrium each player must be playing a best response to the conditional distribution of his opponents' strategies for each type that he might end up having. Proposition 8.E.l provides a more formal statement of this point.

/

a.E.1: A profile of decision rules (s,(·) . .. " s,(·» is a Bayesian Nash equilibrium in BayeSian game [I, lSi}' (u/(· )}, 0, F(')] if and only if, for all i and

PrelDDsillian

7. We shall restrict our attention to pure strategies here; mixed strategies involve randomization over the strategies in .Y/. Note also that we have not been very explicit about whether the e,'s are finite sets. If they are, then the strategy sets 9i are finite; if they are not, then the sets ,Sf; include an infinite number of possible functions sk). Either way, however, the basic definition of a Bayesian Nash equilibrium is the same.

EQUILIBRIUM

255

56

CHAPTER

all

8:

SIMULTANEOUS·MOVE

O,E 0, occurring

T ION

GAMES

with positive probabilityB

E._Ju,(S,(O,), L,(O_,), 0,)18;]
e,

0, > [ -

C

I - Prob (s,(e,) = I)

l Jl '

(S.E.3)

8. The formulation given here (and the prooO is ror the case in which the sets 9 j are finite. When a player j has an infinite number of possible types, condition (S.E.2) must hold on a subset of 0 i that is of full measure (i.e., that occurs with probability equal to one). It is then said that (8.E.2) holds for almost every OJ E 9 j .

8. E:

GAM E S

0 F I N COM P LET E

I N FOR MAT ION:

BAY E S I ANN ASH

Note that for any given strategy of firm j, firm i's best response takes the form of a cu/Off rule: It optimally develops the Zigger for all above the value on the right-hand side of (S.E.3) and does not for all below it. [Note that if firm i existed

e,

e,

e, Jr.

in isolation, it would be indifferent about developing the Zigger when = But (S.E.3) tells uS that when firm i is part of the consortium, its cutoff is always (weakly) above this. This is true because each firm hopes to free·ride on the other firm's development effort: see Chapter II for more on this.] Suppose then that &1' 0, E (0, I) are the cutoff values for firms I and 2 respectively in a Bayesian Nash equilibrium (it can be shown that 0 < 0, < I for i = 1,2 in any Bayesian Nash equilibrium of this game). If so, then using the fact that Prob(sj(Oj) = I) = I -OJ' condition (S.E.3) applied first for i = I and then for i = 2 tells us that we must have

(oyo, =c and

(0,)'0 1 =

c.

(oyo,

Because = (0,)'9 1 implies that 91 = 9" we see that any Bayesian Nash equilibrium of this game involves an identical cutoff value for the two firms. 0" = (C)1/3. In this equilibrium, the probability that neither firm develops the Zigger is (e")', the probability that exactly one firm develops it is 2e"(1 - 0"), and the probability that both do is (I _0*)2 • The exercises at the end of this chapter consider several other examples of Bayesian Nash equilibria. Another important application arises in the theory of implementation with incomplete information, studied in Chapter 23. In Section S.D, we argued that mixed strategies could be interpreted as situations where players play deterministic strategies conditional on seemingly irrelevant signals (recall the baseball pitcher). We can now say a bit more about this. Suppose we start with a game of complete information that has a unique mixed strategy equilibrium in which players actually randomize. Now consider changing the game by introducing many different types (formally, a continuum) of each player, with the realizations of the various players' types being statistically independent of one another. Suppose, in addition, that all types of a player have identical preferences. A (pure strategy) Bayesian Nash equilibrium of this Bayesian game is then precisely equivalent to a mixed strategy Nash equilibrium of the original complete information game. Moreover, in many circumstances, one can show that there are also "nearby" Bayesian games in which preferences of the different types of a player differ only slightly from one another, the Bayesian Nash equilibria are close to the mixed strategy distribution, and each type has a strict preference for his strategy choice. Such results are known as purification theorems [see Harsanyi (1973)]. We can also return to the issue of correlated equilibria raised in Section 8.0. In particular, if we allow the realizations of the various players' types in the previous paragraph to be statistically correlated. then a (pure strategy) Bayesian Nash equilibrium of this Bayesian game is a correlated equilibrium of the original complete information game. The set of all correlated equilibria in game [/, {S;}, (u;(')}] is identified by considering all possible Bayesian games of this sort (i.e., we allow for all possible signals that the players might observe).

E QUI LIB A I U M

257

258

CHAPTER

S:

SIMULTANEOUS-MOVE

SECTION

GAMES

•. F:

1l,(S,) = { C L. Solve for the Bayesian Nash equilibrium.

(a) Show that ~, ~ !!" in any game.

IN=[/,{"'(S,),"'(S,)},{u,('),u,('j}], player i's expected utility

(b) Prove

that in any mixed strategy Nash equilibrium of the zero·sum game u~ satisfies u~=~,=!!". [Hint: Such an equilibrium must exist by Proposition 8.0.2.]

8.F.I C Prove Proposition 8.F.1.

(c) Show that if (0',,021 and (o;,oll are both Nash equilibria of the zero·sum game IN = [/, ("'(S,), "'(S,)}, {u,('), u,(·)}]. then so are (0',,0,) and (0',,021.

8.F.28 Consider the following three-player game [taken from van Damme (1983)], in which player 1 chooses rows (S, = {V, D}). player 2 chooses columns (S, = {L, Rj), and player 3 chooses boxes (SJ = {B" B,}):

8.0.8 c Consider a simultaneous·move game with normal form [/, {"'(S,)}, {u,(·)}]. Suppose that, for all i, S, is a convex set and u,(·) is strictly quasiconvex. Argue that any mixed strategy Nash equilibrium of this game must be degenerate, with each player playing a single pure strategy with probability I. 8.0,9 8 Consider the following game [based on an example from Kreps (1990)]: Player 2

v

LL

L

M

R

100,2

-100, I

0,0

-100, -100

Player I D

-100, -100

100, -49

1,0

100,2

B,

B,

L

R

V

(I, I, 1)

(1,0, 1)

D

(I, 1, 1)

(0,0,1)

L

R

V

(1, 1,0)

(0,0,0)

D

(0,1,0)

(1,0,0)

Each cell describes the payoffs to the three players (u" u" uJ) from that strategy combination. Both (D, L, B,) and (V, L, B,) are pure strategy Nash equilibria. Show that (D, L, B,) is not (normal form) trembling· hand perfect even though none of these three strategies is weakly dominated.

265

266

CHAPTER

I:

SIMULTANEOUS-MOVE

C HAP T E R

GAMES

8.F.3C Prove that every game lN = [1, (Il(S,)}, (u,(-)}] in which the S, are finite sets has a (normal form) trembling-hand perfect Nash equilibrium. [Hint: Show that every perturbed game has an equilibrium and that for any sequence of perturbed games converging to the original game lN and corresponding sequence of equilibria, there is a subsequence that converges to an equilibrium of IN']

Dynamic Games

9

Introduction In Chapter 8, we studied simultaneous-move games. Most economic situations, however, involve players choosing actions over time. l For example, a labor union and a firm might make repeated offers and counteroffers to each other in the course of negotiations over a new contract. Likewise, firms in a market may invest today in anticipation of the effects of these investments on their competitive interactions in the future. In this chapter, we therefore shift our focus to the study of dynamic games. One way to approach the problem of prediction in dynamic games is to simply derive their normal form representations and then apply the solution concepts studied in Chapter 8. However, an important new issue arises in dynamic games: the credibility of a player's strategy. This issue is the central concern of this chapter. Consider a vivid (although far-fetched) example: You walk into class tomorrow and your instructor, a sane but very enthusiastic game theorist, announces, "This is an important course, and I want exclusive dedication. Anyone who does not drop every other course will be barred from the final exam and will therefore flunk." After a moment of bewilderment and some mental computation, your first thought is, "Given that I indeed prefer this course to all others, I had better follow her instructions" (after all, you have studied Chapter 8 earefully and know what a best response is). But after some further reflection, you ask yourself, "Will she really bar me from the final exam if I do not obey? This is a serious institution, and she will surely lose her job if she carries out the threat." You conclude that the answer is "no" and refuse to drop the other courses, and indeed, she ultimately does not bar you from the exam. In this example, we would say that your instructor's announced strategy, "I will bar you from the exam if you do not drop every other course," is not credible. Such empty threats are what we want to rule out as equilibrium strategies in dynamic games. In Section 9.8, we demonstrate that the Nash equilibrium concept studied in Chapter 8 does not suffice to rule out noncredible strategies. We then introduce a stronger solution concept, known as sub game perfect Nash equilibrium, that helps

I. As do most parlor games.

267

268

CHAPTER

9:

DYNAMIC

GAMES

to do so. The central idea underlying this concept is the principle of sequential rationality: equilibrium strategies should specify optimal behavior from any point in the game onward, a principle that is intimately related to the procedure of backward induction. In Section 9.C, we show that the concept of subgame perfection is not strong enough to fully capture the idea of sequential rationality in games of imperfect information. We then introduce the notion of a weak perfect Bayesian equilibrium (also known as a weak sequential equilibriulII) to push the analysis further. The central feature of a weak perfect Bayesian equilibrium is its explicit introduction of a player's beliefs about what may have transpired prior to her move as a means of testing the sequential rationality of the player's strategy. The modifier weak refers to the fact that the weak perfect Bayesian equilibrium concept imposes a minimal set of consistency restrictions on players' beliefs. Because the weak perfect Bayesian equilibrium concept can be too weak, we also examine some related equilibrium notions that impose stronger consistency restrictions on beliefs, discussing briefly stronger notions of perfect Bayesian equilibrium and, in somewhat greater detail, the concept of sequential equilibrium. In Section 9.D, we go yet further by asking whether certain beliefs can be regarded as "unreasonable" in some situations, thereby allowing us to further refine our predictions. This leads us to consider the notion of forward induction. Appendix A studies finite and infinite horizon models of bilateral bargaining as an illustration of the use of subgame perfect Nash equilibrium in an important economic application. Appendix B extends the discussion in Section 9.C by examining the notion of an extensive form trembling-hand perfect Nash equilibrium. We should note that-following most of the literature on this subject-all the analysis in this chapter consists of attempts to "refine" the concept of Nash equilibrium; that is, we take the position that we want our prediction to be a Nash equilibrium, and we then propose additional conditions for such an equilibrium to be a "satisfactory" prediction. However, the issues that we discuss here are not confined to this approach. We might, for example, be concerned about noncredible strategies even if we were unwilling to impose the mutually correct expectations condition of Nash equilibrium and wanted to focus instead only on rationalizable outcomes. See Bernheim (1984) and, especially, Pearce (1984) for a discussion of nonequilibrium approaches to these issues.

9.B Sequential Rationality, Backward Induction, and Subgame Perfection We begin with an example to illustrate that in dynamic games the Nash equilibrium concept may not give sensible predictions. This observation leads us to develop a strengthening of the Nash equilibrium concept known as subgame perfect Nash equilibrium.

Example 9.B.l: Consider the following predation game. Firm E (for entrant) is considering entering a market that currently has a single incumbent (firm I). If it does so (playing "in "), the incumbent can respond in one of two ways: It can either accommodate the entrant, giving up some of its sales but causing no change in

SECTION

9.B:

BACKWARD

INDUCTION

AND

SUBGAME

PERFECTION

269

Firm E Firm I Fight Accommodate if Firm E if Firm E Plays "In"

Plays "In"

OUt88'2 0,2

Firm E

In

-3, -I

2,1

Figure 9.8.1

Extensive and normal forms for Example

9.8.1. The Nash eq~ilibri~m

(a" a,l = (out, fight if firm E plays "in ") involves a noncredible

threat.

the market price, or it can fight the entrant, engaging in a costly war of predation that dramatically lowers the market price. The extensive and normal form representations of this game are depicted in Figure 9.B.1. Examining the normal form, we see that this game has two pure strategy Nash equilibria: (u E , 0 (read as "the probability of reaching information set H is positive under strategies a"), we must have Prob (x I u)

I1(X) = Prob (H I a)

for all x E H.

It should be noted that the definition formally incorporates beliefs as part of an equilibrium by identifying a strategy-beliefs pair (U,I1) as a weak perfect Bayesian equilibrium. In the literature, however, it is not uncommon to see this treated a bit loosely: a set of strategies a will be referred to as an equilibrium with the meaning that there is at least one associated set of beliefs 11 such that (u. II) satisfies Definition 9.C.3. At times, however, it can be very useful to be more explicit about what these beliefs are, such as when testing them against some of the "reasonableness" criteria that we discuss in Section 9.0. A useful way to understand the relationship between the weak PBE concept and that of Nash equilibrium comes in the characterization of Nash equilibrium given in Proposition 9.C.l.

Prob (x I a) Prob(xl H,u) = - - - - - - - - . LX"H Prob (x'i a) 12. Equivalently, a completely mixed strategy can be thought of as a strategy that assigns a strictly positive probability 10 each of Ihe player's pure strategies in the normal form derived from extensive form game

BELIEFS

As a concrete example, suppose that in the game in Example 9.C.l, firm E is using the completely mixed strategy that assigns a probability of ! to "out," t to "in I'" and ±to "in2'" Then the probability of reaching firm J's information set given this strategy is l Using Bayes' rule, the probability of being at the left node of firm l's information set conditional on this information set having been reached is 5, and the conditional probability of being at the right node in the set is t. For firm l's beliefs following entry to be consistent wilh firm E's strategy, firm l's beliefs should assign exactly these probabilities. The more difficult issue arises when players are not using completely mixed strategies. In this case, some information sets may no longer be reached with positive probability, and so we cannot use Bayes' rule to compute conditional probabilities for the nodes in these information sets. At an intuitive level, this problem corresponds to the idea that even if players were to play the game repeatedly, the equilibrium play would generate no experience on which they could base their beliefs at these information sets. The weak perfect Bayesian equilibrium concept takes an agnostic view toward what players should believe if play were to reach these information sets unexpectedly. In particular, it allows us to assign any beliefs at these information sets. It is in this sense that the modifier weak is appropriately attached to this concept. We can now give a formal definition.

Definition 9.C.2: A strategy profile

E[U.(H)IH, 11, u.(H)'

i.C:

ProposltlQn 9.C.1: A strategy profile a is a Nash equilibrium of extensive form game fE if and only if there exists a system of beliefs 11 such that

rE .

13. Bayes' rule is a basic principle of statistical inference. See, for example, DeGroot (1970),

(i) The strategy profile a is sequentially rational given belief system 11 at all information sets H such that Prob (H I a) > O.

where il is referred 10 as Bayt>s' theorem.

.:,.

RATIONALITY

285

286

C HAP T E R

.:

0 Y N AM, eGA M E S

SEC T , ON

•. C:

• E L , E F SAN 0

• E QUE NT' A L

RAT' 0 N A L , T Y

(ii) The system of beliefs 11 is derived from strategy profile a through Bayes' rule whenever possible. Exercise 9.C.1 asks you to prove this result. The italicized portion of condition (i) is the only change from Definition 9.C.3: For a Nash equilibrium, we require sequential rationality only on the equilibrium path. Hence, a weak perfect Bayesian equilibrium of game r, is a Nash equilibrium, but not every Nash equilibrium is a weak PBE. We now illustrate the application of the weak PBE concept in several examples. We first consider how the concept performs in Example 9.C.1. Example 9.C.I Continued: Clearly, firm I must play "accommodate if entry occurs" in any weak perfect Bayesian equilibrium because that is firm I's optimal action starting at its information set for any system of beliefs. Thus, the Nash equilibrium strategies (out, fight if entry occurs) cannot be part of any weak PBE. What about the other pure strategy Nash equilibrium, (in" accommodate if entry occurs)? To show that this strategy profile is part of a weak PBE, we need to supplement these strategies with a system of beliefs that satisfy criterion (ii) of Definition 9.C.3 and that lead these strategies to be sequentially rational. Note first that to satisfy criterion (ii), the incumbent's beliefs must assign probability I to being at the left node in her information set because this information set is reached with positive probability given the strategies (in" accommodate if entry occurs) [a specification of beliefs at this information set fully describes a system of beliefs in this game because the only other information set is a singleton). Moreover, these strategies are, indeed, sequentially rational given this system of beliefs. In fact, this strategybeliefs pair is the unique weak PBE in this game (pure or mixed). _ Examples 9.C.2 and 9.C.3 provide further illustrations of the application of the weak PBE concept. Example 9.C.2: Consider the following "joint venture" entry game: Now there is a second potential entrant E2. The story is as follows: Firm EI has the essential capability to enter the market but lacks some important capability that firm E2 has. As a result, EI is considering proposing a joint venture with E2 in which E2 shares its capability with EI and the two firms split the profits from entry. Firm EI has three initial choices: enter directly on its own, propose a joint venture with E2, or stay out of the market. If it proposes a joint venture, firm E2 can either accept or decline. If E2 accepts, then E I enters with E2's assistance. If not, then E I must decide whether to enter on its own. The incumbent can observe whether EI has entered, but not whether it is with E2's assistance. Fighting is the best response for the incumbent if EI is unassisted (EI can then be wiped out quickly) but is not optimal for the incumbent if EI is assisted (EI is then a tougher competitor). Finally, if EI is unassisted, it wants to enter only if the incumbent accommodates; but if E I is assisted by E2, then because it will be such a strong competitor, its entry is profitable regardless of whether the incumbent fights. The extensive form of this game is depicted in Figure 9.C.2. To identify the weak PBE of this game note first that, in any weak PBE, firm E2 must accept the joint venture if firm EI proposes it because E2 is thereby assured o[ a positive payoff regardless of firm I's strategy. But if so, then in any weak PBE

l

Figure 9.C.2

Extensive form for Example 9.C.2.

firm EI must propose the joint venture since if firm E2 will accept its proposal, then firm EI does better proposing the joint venture than it does by either staying out or entering on its own, regardless of firm I's post-entry strategy. Next, these two conclusions imply that firm I's information set is reached with positive probability (in fact, with certainty) in any weak PBE. Applying Bayesian updating at this information set, we conclude that the beliefs at this information set must assign a probability of I to being at the middle node. Given this, in any weak PBE firm I's strategy must be "accommodate if entry occurs." Finally, if firm I is playing "accommodate if entry occurs," then firm EI must enter if it proposes a joint venture that firm E2 then rejects. We conclude that the unique weak PBE in this game is a strategy-beliefs pair with strategies of (a n, a ' h a I) = «propose joint venture, in if E2 declines), (accept), (accommodate if entry occurs)) and a belief system of 11 (middle node of incumbent's information set) = I. Note that this is not the only Nash equilibrium or, for that matter, the only SPNE. For example, (a£" a£2, all = «out, out if E2 declines), (decline). (fight if entry occurs)) is an SPNE in this game. _ Example 9.C.3: [n the games of Examples 9.C.1 and 9.C.2 the trick to identifying the weak PBEs consisted of seeing that some player had an optimal strategy that was independent of her beliefs and/or the future play of her opponents. In the game depicted in Figure 9.C.3, however, this is not so for either player. Firm I is now willing to fight if she thinks that firm E has played "in,," and the optimal strategy for firm E depends on firm I's behavior (note that y> -I). To solve this game, we look for a fixed point at which the behavior generated by beliefs is consistent with these beliefs. We restrict attention to the case where y > O. [Exercise 9.C.2 asks you to determine the set of weak PBEs when YE (-1,0).] Let a, be the probability that firm I fights after entry, let 11, be firm I's belief that

287

288

CHAPTER

g,

DYNAMIC

GAMES

SECTION

----------------------------------------------------------~

•• C,

BELIEFS

AND

SEQUENTIAL

RATIONALITY

289

,-------------------------------------------------------------

Firm E Out

(:~) - (~)

Figure 9.C.3

Figure 9.C.4

Extensive form for

i> -I

"in l " was E's entry strategy if entry has occurred, and let u o, u l, u, denole the probabilities with which firm E actually chooses "out," "in,," and "in,," respectively. Note, first, that firm I is willing to play "fight" with positive probability if and only if -I ~ -21', + 1(1 - 1'1)' or 1'1 ~ i. Suppose, first, that 1'1 > 1 in a weak PBE. Then firm I must be playing "fight" with probability I. But then firm E must be playing "in," with probability I (since y > 0), and the weak PBE concept would then require that 1'1 = 0, which is a contradiction. Suppose, instead, that 1', < i in a weak PBE. Then firm I must be playing "accommodate" with probability I. But, if so, then firm E must be playing "in," with probability I, and the weak PBE concept then requires that 1', = I, another contradiction. Hence, in any weak PBE of this game, we must have 1'1 = l If so, then firm E must be randomizing in the equilibrium with positive probabilities attached to both "in l " and "in," and with "in," twice as likely as "in,." This means that firm I's probability of playing "fight" must make firm E indifferent between "inl" and "in,." Hence, we must have -Iu y + 3(1 - UF) = yUy + 2(1 - Uy), or UF = I/(y + 2). Firm E's payoff from playing "in," or "in," is then (3y + 2)/(y + 2) > 0, and so firm E must play "out" with zero probability. Therefore, the unique weak PBE in this game when y > 0 has (u o, UI' u,) = (0, i, 1), Uy = I/(y + 2), and 1', = l -

Strellgthellillgs of the Weak Perfect Bayesiall Equilibrium COllcept We have referred to the concept defined in Definition 9.C.3 as a weak perfect Bayesian equilibrium because the consistency requirements that it puts on beliefs are very minimal: The only requirement for beliefs, other than that they specify nonnegative probabilities which add to 1 within each information set, is that they are consistent with the equilibrium strategies on the equilibrium path, in the sense of being derived from them through Bayes' rule. No restrictions at all are placed on beliefs off the equilibrium path (i.e., at information sets not reached with positive probability with play of the equilibrium strategies). In the literature, a number of strengthenings of this concept that put additional consistency restrictions on off-the-equilibrium-path

Extensive form for Example 9.C.4. Beliefs in a weak PBE may not be structurally

Example 9.C.3.

consistent.

beliefs arc used. Examples 9.C.4 and 9.C.S illustrate why a strengthening of the weak PBE concept is often needed. Example 9.C4: Consider the game shown in Figure 9.C.4. The pure strategies and beliefs depicted in the figure constitute a weak PBE (the strategies are indicated by arrows on the chosen branches at each information set, and beliefs are indicated by numbers in brackets at the nodes in the information sets). The beliefs satisfy criterion (ii) of Definition 9.C.3; only player l's information set is reached with positive probability, and player I's beliefs there do reflect the probabilities assigned by nature. But the beliefs specified for player 2 in this equilibrium are not very sensible; player 2's information set can be reached only if player I deviates by instead choosing action y with positive probability, a deviation that must be independent of nature's actual move, since player I is ignorant of it. Hence, player 2 could reasonably have only beliefs that assign an equal probability to the two nodes in her information set. Here we see that it is desirable to require that beliefs at least be "structurally consistent" off the equilibrium path in the sense that there is some subjective probability distribution over strategy profiles that could generate probabilities consistent with the beliefs. _ Example 9.CS: A second and more significant problem is that a weak perfect Bayesian equilibrium need not be subgame perfect. To see this, consider again the entry game in Example 9.B.3. One weak PBE of this game involves strategies of (u E , u t ) = «out, accommodate if in), (fight if firm E plays "in"» combined with beliefs for firm I that assign probability I to firm E having played "fight." This weak PBE is shown in Figure 9.C.5. But note that these strategies are not subgame perfect; they do not specify a Nash equilibrium in the post-entry subgame. The problem is that firm I's post-enlry belief about firm E's post-entry play is unrestricted by the weak PBE concept because firm I's information set is off the equilibrium path. _

290

CHAPTER

.:

DYNAMIC

SEC T ION

GAMES

--------------------------------------------------------------~

BELl E F SAN 0

SEQ U E N T I A L

As we now show, the sequential equilibrium concept strengthens the weak perfect Bayesian equilibrium concept in a manner that avoids the problems identified in Examples 9.C.4 and 9.C.5.

Firm E Out

51. C:

In

Example 9.C4 Continued: Consider again the game in Figure 9.C4. In this game, all beliefs that can be derived from any sequence of totally mixed strategies assign equal probability to the two nodes in player 2's information set. Given this fact, in any sequential equilibrium player 2 must play r and player I must therefore play y. In fact, strategies (y, r) and beliefs giving equal probability to the two nodes in both players' information sets constitute the unique sequential equilibrium of this game. _

Figure 9.C.5

Extensive rorm for Example 9.C.S. A weak PRE may not bt subgame perfect.

These two examples indicate that the weak PBE concept can be too weak. Thus, in applications in the literature, extra consistency restrictions on beliefs are often added to the weak PBE concept to avoid these problems, with the resulting solution concept referred to as a perfect Bayesian equilibrium. (As a simple example, restricting attention to equilibria that induce a weak PBE in every subgame insures subgame perfection.) We shall also do this when necessary later in the book; see, in particular, the discussion of signaling in Section 13.C For formal definitions and discussion of some notions of perfect Bayesian equilibrium, see Fudenberg and Tirole (199Ia) and (199Ib). An important closely related equilibrium notion that also strengthens the weak PBE concept by embodying additional consistency restrictions on beliefs is the sequential equilibrium concept developed by Kreps and Wilson (1982). In contrast to notions of perfect Bayesian equilibrium (such as the one we develop in Section 13.C), the sequential equilibrium concept introduces these consistency restrictions indirectly through the formalism of a limiting sequence of strategies. Definition 9.CA describes its requirements. Definition 9.C.4: A strategy profile and system of beliefs (11, Jl) is a sequential equilibrium of extensive form game r E if it has the following properties: (i) Strategy profile 11 is sequentially rational given belief system Jl. (ii) There exists a sequence of completely mixed strategies {l1k};~ ,. with lim*_ ~".* = 11. such that Jl = lim k _", Jlk. where Jlk denotes the beliefs derived from strategy profile 11k using Bayes' rule.

I

In essence, the sequential equilibrium notion requires that beliefs be justifiable as coming from some set of totally mixed strategies that are "close to" the equilibrium strategies". (i.e., a small perturbation of the equilibrium strategies). This can be viewed as requiring that players can (approximately) justify their beliefs by some story in which, with some small probability, players make mistakes in choosing their strategies. Note that every sequential eqUilibrium is a weak perfect Bayesian equilibrium because the limiting beliefs in Definition 9.CA exactly coincide with the beliefs derived from the equilibrium strategies 11 via Bayes' rule on the outcome path of strategy profile 11. But, in general, the reverse is not true.

A A T ION A LIT Y

291

~---------------------------------------------------------------

Example 9.C.S Continued: The unique sequential equilibrium strategies in the game in Example 9.C.5 (see Figure 9.C.5) are those of the unique SPNE: «in, accommodate if in), (accommodate if firm E plays "in To verify this point, consider any totally mixed strategy 6 and any node x in firm \'s information set, which we denote by H,. Letting z denote firm E's decision node following entry (the initial node of the associated with 6 at information set H, are subgame following entry), the beliefs equal to

"».

I'.

Prob (x I 6) Prob (x I t, ii) Prob (z I 6) I'·(x) = - - - - - - = - - - - - - - - - a

Prob (H, 16)

Prob (H, I z, 6) Prob (z I 6)'

where Prob (x I z, 6) is the probability of reaching node x under strategies 6 conditional on having reached node z. Canceling terms and noting that Prob (H, I z, 6) = I, we then have /,.(x) = Prob (x I z, 6). But this is exactly the probability that firm E plays the action that leads to node x in strategy ii. Thus, any sequence of totally mixed strategies {ii' that converge to 11 must generate limiting beliefs for firm I that coincide with the play at node z specified in firm E's actual strategy". E' It is then immediate that the strategies in any sequential equilibrium must specify Nash equilibrium behavior in this post-entry subgame and thus must constitute a subgame perfect Nash equilbrium. _

Jr.,

Proposition 9.C.2 gives a general result on the relation between sequential equilibria and subgame perfect Nash equilibria. Proposition 9.C.2: In every sequential equilibrium (11, I') of an extensive form game r E' the equilibrium strategy profile 11 constitutes a subgame perlect Nash equilibrium of rEo Thus, the sequential equilibrium concept strengthens both the SPNE and the weak PBE concepts; every sequential equilibrium is both a weak PBE and an SPNE. Although the concept of sequential equilibrium restricts beliefs that are off the equilibrium path enough to take care of the problems with the weak PRE concept illustrated in Examples 9.C.4 and 9.C.S, there are some ways in which Ihe requirements on off-equilibrium-path beliefs embodied in the notion of sequential equilibrium may be too strong. For example. they imply that any two players with the same information must have exactly the same beliefs regarding the deviations by other players that have caused play to reach a given part of the game tree. In Appendix B, we briefly describe another related (and still stronger) solution

292

c HAP T E R

.:

0 Y N A M I C

SEC T ION

GAM E S

-----------------------------------------------------------------

g. D:

REA SON A B L E

BEL I E F SAN 0

FOR WAR 0

I N 0 U C T ION

293

concept, an extellsive form trembling-hand perfect Nash equilibrium, first proposed by Selten (1975).14

9.D Reasonable Beliefs and Forward Induction In Section 9.C, we saw the importance of beliefs at unreached information sets for testing the sequential rationality of a strategy. Although the weak perfect Bayesian equilibrium concept and the related stronger concepts discussed in Section 9.C can help rule out noncredible threats, in many games we can nonetheless justify a large range of off·equilibrium-path behavior by picking off-equilibrium·path beliefs appro· priately (we shall see some examples shortly). This has led to a considerable amount of recent research aimed at specifying additional restrictions that" reasonable" beliefs should satisfy. In this section, we provide a brief introduction to these ideas. (We shall encounter them again when we study signaling models in Chapter 13, particularly in Appendix A of that chapter.) To start, consider the two games depicted in Figure 9.0.1. The first is a variant of the entry game of Figure 9.C.1 in which firm I would now find it worthwhile to fight if it knew that the entrant chose strategy" in, "; the second is a variant of the Niche Choice game of Example 9.B.4. in which firm E now targets a niche at the time of its entry. Also shown in each diagram is a weak perfect Bayesian equilibrium (arrows denote pure strategy choices, and the numbers in brackets in firm l's information set denote beliefs). One can argue that in neither game is the equilibrium depicted very sensible's Consider the game in Figure 9.D.I(a}. In the weak PBE depicted, if entry occurs, firm I plays "fight" because it believes that firm E has chosen "inl'" But "in," is strictly dominated for firm E by "inl." Hence. it seems reasonable to think that if firm E decided to enter. it must have used strategy "inl'" Indeed. as is commonly done in this literature. one can imagine firm E making the following speech upon entering: "I have entered, but notice that I would never have used 'in l' to do so because 'in2' is always a better entry strategy for me. Think about this carefully before you choose your strategy." A similar argument holds for the weak PBE depicted in Figure 9.D.I(b). Here "small niche" is strictly dominated for firm E. not by "large niche", but by "out." Once again, firm I could not reasonably hold the beliefs that are depicted. In this case. firm I should recognize that if firm E entered rather than playing "out," it must have chosen the large niche. Now you can imagine firm E saying: "Notice that the only way I could ever do better by entering than by choosing 'out' is by targeting the large niche."

14. Sellen actually gave it the name trembling·halld perfect Nash equilibrium; we add the modifier form to help distinguish it rrom the normal rorm concept introduced in Section 8.F. 15. For simplicity, we focus on weak perfect Bayesian equilibria here. The points to be made apply as well 10 the stronger related notions discussed in Seclion 9.C. In fact, all the weak perfect

fXlel1sille

Bayesian equilibria discussed here are also sequential equilibria; indeed. they are even extensive

form Irembling-hand perfect.

(b)

These arguments make use of what is known as forward inductioll reasoning [see Kohlbcrg (1989) and Kohlberg and Mertens (1986)]. In using backward induction, a playcr decides what is an OPtimal action for her at some point in the game tree based on her calculations of the actions that her opponents will rationally play at later points of the game. In contrast, in using forward induction, a player reasons about what could have rationally happened previously. For example, here firm I decides on its optimal post-entry action by assuming that firm E must have behaved rationally in its entry decision. This Iype of idea is sometimes extended to include arguments based on equilibriulIl dOllliuati,,,,. For example, suppose that we augment the game in Figure 9.D.I(b) by also giving firm I a mo'e after firm E plays "out," as depicted in Figure 9.D.2 (perhaps "out" really involves enlTY into some alternative market of firm I's in which firm E has only one potential entry slTategy). The figure depicts a weak PBE of this game in which firm E plays "out" and firm I believes that firm E has chosen "smail niche" whenever its post-entry information set is reached. In this game, "small niche" is no longer strictly dominated for firm E by "out," so our previous argument does not apply. Nevertheless, if firm E deviates from this equilibrium by entering, we can imagine firm I Ihinking that since firm E could have received a payoff of 0 by following its equilibrium strategy. it must be hoping to do better than that by entering. and so it must

Figure 9.0.1 Two weak PBEs with unreasonable beliefs.

294

CHAPTER

9:

DYNAMIC

---

GAMES

Firm E

Small Niche

OUI

Large Niche

Flgur. 9.0.2

(III.) ~ (-10) -10

Strategy "small nich,' is equilibrium dominated for firm E.

II,

The Post·entry Subg,me:

Firm I Small Niche

SmalI Niche Firm E

Large Niche

-6, -6

8

Large Niche

8 -3, -3

have chosen to target the large niche. In this case, we say that "small niche" is equilibrium domillated for firm E; that is, it is dominated if firm E treats its equilibrium payoff as something that il can achieve with certainty by following its equilibrium strategy. (This type of argu· ment is embodied in the intuirive criterion refinement that we discuss in Section 13.C and Appendix A of Chapter 13 in the context of signaling models.) Forward induction can be quite powerful. For example, reconsider the original Niche Choice game depicted in Figure 9.D.3. Recall that there are two (pure strategy) Nash equilibria in the post-entry subgame: (large niche, small niche) and (small niche, large niche). However, the force of the forward induction argument for the game in Figure 9.0.I(b) seems to apply equally well here: Strategy (in, small niche if in) is strictly dominated for firm E by playing "out." As a result, the incumbent should reason that if firm E has played "in," it intends to target the large niche in the

-

Figure 9.0.3

Forward induction selects equilibrium (large niche, small niche) in the post·entry subgame.

SECTION

'.0:

REASONABLE

BELIEFS

AND

FORWARD

post-entry game. If so, firm I is better ofT targeting the small niche. Thus, forward induction rules out one of the two Nash equilibria in the post-entry subgame. Although these arguments may seem very appealing, there are also some potential problems. For example, suppose that we are in a world where players make mistakes with some small probability. In such a world, are the forward induction arguments just given convincing? Perhaps not. To see why, suppose that firm E enters in the game shown in Figure 9.0.l(a) when it was supposed to play "out." Now firm I can explain the deviation to itself as being the result of a mistake on firm E's part, a mistake that might equally well have led firm E to pick "in," as "in2'" And firm E's speech may not fall on very sympathetic ears: "Of course, firm E is telling me this," reasons the incumbent, "it has made a mistake and now is trying to make the best of it by convincing me to accommodate." To see this in an even more striking manner, consider the game in Figure 9.0.3. Now, after firm E has entered and the two firms are about to play the simultaneousmove post-entry game, firm E makes its speech. But the incumbent retorts: "Forget it! I think you just made a mistake-and even if you did not, I'm going to target the large niche!" Clearly, the issues here, although interesting and important, are also tricky.

A noticeable feature of these forward induction arguments is how they use the normal form notion of dominance to restrict predicted play in dynamic games. This stands in sharp contrast with our discussion earlier in this chapter, which relied exclusively on the extensive form to determine how players should play in dynamic games. This raises a natural question: Can we somehow use the normal form representation to predict play in dynamic games? There are at least two reasons why we might think we can. First, as we discussed in Chapter 7, it seems appealing as a matter of logic to think that players simultaneously choosing their strategies in the normal form (e.g., submitting contingent plans to a referee) is equivalent to their actually playing out the game dynamically as represented in the extensive form. Second, in many circumstances, it seems that the notion of weak dominance can get at the idea of sequential rationality. For example, for finite games of perfect information in which no player has equal payoffs at any two terminal nodes, any strategy profile surviving a process of iterated deletion of weakly dominated strategies leads to the same predicted outcome as the SPNE concept (take a look at Example 9.8.1, and see Exercise 9.0.1). The argument for using the normal form is also bolstered by the fact that extensive form concepts such as weak PBE can be sensitive to what may seem like irrelevant changes in the extensive form. For example, by breaking up firm E's decision in the game in Figure 9.0.I(a) into an "out" or "in" decision followed by an "in," or "in," decision (just as we did in Figure 9.0.3 for the game in Figure 9.0.I(b)], the unique SPNE (and, hence, the unique sequential equilibrium) becomes firm E entering and playing "in," and firm I aocommodating. However, the reduced normal form associated with these two games (i.e., the normal form where we eliminate all but one of a player's strategies that have identical payoffs) is invariant to this change in the extensive form; therefore, any solution based on the (reduced) normal form would be unaffected by this change. These points have led to a renewed interest in the use of the normal form as a device for predicting play in dynamic games [see, in particular, Kohlberg and Mertens (1986)]. At the same time, this issue remains controversial. Many game theorists believe that there is a loss of some information of strategic importance in going from the extensive form to the more condensed normal form. For example, are the games in Figures 9.0.3 and 9.0.I(b) really the same? If you were firm I, would you be as likely to rely on the forward induction argument

INDUCTION

295

296

CHAPTER

9:

DYNAMIC

GAMES

A P PEN 0 I X

A:

FIN I TEA N DIN FIN I T E

H 0 RI Z 0 N

8 I L ATE R ALB A R G A I N I N

G

297

----------------------------------------------------------------- ---------------------------------------------------------------in the game in Figure 9.0.3 as in that in Figure 9.0.I(b)? Does it matter for your answer whether in the game in Figure 9.0.3 a minute or a month passes between firm E's two decisions? These issues remain to be sorted out.

APPENDIX A: FINITE AND INFINITE HORIZON BILATERAL BARGAINING

In this appendix we study two models of bilateral bargaining as an economically important example of the use of the subgame perfect Nash equilibrium concept. We begin by studying a finite horizon model of bargaining and then consider its infinite horizon counterpart. Example 9,AA.l: Finite Horizon Bilateral Bargaining. Two players, 1 and 2, bargain to determine the split of v dollars. The rules are as follows: The game begins in period I; in period I, player 1 makes an offer of a split (a real number between 0 and v) to player 2, which player 2 may then accept or reject. If she accepts, the proposed split is immediately implemented and the game ends. If she rejects, nothing happens until period 2. In period 2, the players' roles are reversed, with player 2 making an offer to player I and player I then being able to accept or reject it. Each player has a discount factor of 15 E (0, I), so that a dollar received in period t is worth 15'-1 in period I dollars. However, after some finite number of periods T, if an agreement has not yet been reached, the bargaining is terminated and the players each receive nothing. A portion of the extensive form of this game is depicted in Figure 9.AA.1 [this model is due to Stahl (1972)]. There is a unique subgame perfect Nash equilibrium (SPNE) in this game. To see this, suppose first that T is odd, so that player I makes the offer in period T if no previous agreement has been reached. Now, player 2 is willing to accept any offer in this period because she will get zero if she refuses and the game is terminated (she is indifferent about accepting an offer of zero). Given this fact, the unique SPNE in the subgame that begins in the final period when no agreement has been previously reached has player I offer player 2 zero and player 2 accept. I 6 Therefore, the payoffs from equilibrium play in this subgame are T - 'v, 0). Now consider play in the subgame starting in period T - I when no previous agreement has been reached. Player 2 makes the offer in this period. In any SPNE, player I will accept an offer in period T - 1 if and only if it provides her with a payoff of at least 15 T-'v, since otherwise she will do better rejecting it and waiting to make an offer in period T (she earns -'v by doing so). Given this fact, in any SPNE, player 2 must make an offer in period T - 1 that gives player I a payoff of exactly OT-'V, and player I accepts this offer (note that this is player 2's best offer

(c5

c5 T

16. Note that if player 2 is unwilling to accept an offer of zero, then player I has no optimal Slrategy; she wants to make a strictly positive offer ever closer to zero (since player I will accept any strictly positive offer). If the reliance on player I accepting an offer over which she is indifferent bothers you, you can convince yourself that the analysis of the game in which offers must be in small increments (pennies) yields exactly the same outcome as that identified in the text as the size of these increments goes to zero.

Figure 9.AA.1

The alternali ng-offer bilateral bargaining game.

among all those that would be accepted, and making an offer that will be rejected is worse for player 2 because it results in her receiving a payoff of zero). The payoffs arising if the game reaches period T-I must therefore be v - V-IV). Continuing in this fashion, we can determine that the unique SPNE when T is odd results in an agreement being reached in period I, a payoff for player I of

(c5 T-' p,c5 T-2

vT(T) = v[1 - 15

+ 15 2 _ ... + OT-I]

={(1-c5{' ~~:~t)+c5T-'l and a payoff to player 2 of v!(T) = v - vT(T). If T is instead even, then player I must earn v - c5vt(T - I) because in any SPNE, player 2 (who will be the first offerer in the odd-number-of-periods subgame that begins in period 2 if she rejects player 1's period I offer) will accept an offer in period I if and only if it gives her at least av!(T - I), and player I will offer her exactly this amount. Finally, note that as the number of periods grows large (T -+ OCJ), player I's payoff converges to "/( I + 15), and player 2's payoff converges to c5v/(1 + 0) . • In Example 9.AA.I, the application of the SPNE concept was relatively straightforward; we simply needed to start at the end of the game and work backward. We now consider the infinite horizon counterpart of this game. As we noted in Section

298

CHAPTER

9:

DYNAMIC

GAMES

---------------------------------------------------------------~

9.B, we can no longer solve for the SPNE in this simple manner when the game has an infinite horizon. Moreover, in many games, introduction of an infinite horizon allows a broad range of behavior to emerge as subgame perfect. Nevertheless, in the infinite horizon bargaining model, the SPNE concept is quite powerful. There is a unique SPNE in this game, and it turns out to be exactly the limiting outcome of the finite horizon model as the length of the horizon T approaches 00. Example 9.AA,2: Infinite Horizon Bilateral Bargaining. Consider an extension of the finite horizon bargaining game considered in Example 9.AA.I in which bargaining is no longer terminated after Trounds but, rather, can potentially go on forever. If this happens, the players both earn zero. This model is due to Rubinstein (1982). We claim that this game has a unique SPNE. In this equilibrium, the players reach an immediate agreement in period I, with player 1 earning v(1 + b) and player 2 earning bv((I + b). The method of analysis we use here, following Shaked and Sutton (1984), makes heavy use of the stationarity of the game (the subgame starting in period 2 looks exactly like that in period I, but with the players' roles reversed). To start, let VI denote the largest payoff that player 1 gets in any SPNE (i.e., there may, in principle, be multiple SPNEs in this model).'7 Given the stationarity of the model, this is also the largest amount that player 2 can expect in the subgame that begins in period 2 after her rejection of player I's period 1 offer, a subgame in which player 2 has the role of being the first player to make an offer. As a result, player I's payoff in any SPNE cannot be lower than the amount YI = V - bv, because, if it was, then player 1 could do better by making a period 1 offer that gives player 2 just slightly more than bVI' Player 2 is certain to accept any such offer because she will earn only bV I by rejecting it (note that we are using subgame perfection here, because we are requiring that the continuation of play after rejection is an SPNE in the continuation subgame and that player 2's response will be optimal given this fact). Next, we claim that, in any SPNE, V, cannot be larger than v - by,. To see this, note that in any SPNE, player 2 is certain to reject any offer in period 1 that gives her less than by, because she can earn at least b~, by rejecting it and waiting to make an offer in period 2. Thus, player 1 can do no better than v - b~, by making an offer that is accepted in period I. What about by making an offer that is rejected in period I? Since player 2 must earn at least b~1 if this happens, and since agreement cannot occur before period 2, player I can earn no more than bv - b~1 by doing this. Hence, we have VI ~ v - by,. Next, note that these derivations imply that VI ~ v - by, = (YI

+ bv,) - by"

so that vl(1 - b) S ~,(I - b).

Given the definitions of~, and V" this implies that ~, = V" and so player I's SPNE payoff is uniquely determined, Denote this payoff by v~, Since v~ = v - bv~, we find that player I must earn v~ = v/(I + b) and player 2 must earn v~ = v - v~ = bv/( 1 + b). In addition, recalling the argument in the previous paragraph, we see 17. This maximum can be shown to be well defined, but we will not do so here.

--

APPENDIX

B:

EXTENSIVE

FORM

TREMBLING-HAND

PERFECT

NASH

that an agreement will be reached in the first period (player 1 will find it worthwhile to make an offer that player 2 accepts). The SPNE strategies are as follows: A player who has just received an offer accepts it if and only if she is offered at least bv~, while a player whose turn it is to make an offer offers exactly bv; to the player receiving the offer. Note that the equilibrium strategies, outcome, and payoffs are precisely the limit of those in the finite game in Example 9.AA.I as T -+ 00 . • The coincidence of the infinite horizon equilibrium with the limit of the finite horizon equilibria in this model is not a general property of infinite horizon games. The discussion of infinitely repeated games in Chapter 12 provides an illustration of this point. We should also point out that the outcomes of game-theoretic models of bargaining can be quite sensitive to the precise specification of the bargaining process and players' preferences. Exercises 9.B.7 and 9.B.13 provide an illustration.

APPENDtX B: EXTENStVE FORM TREMBLING-HAND PERFECT NASH EQUtLlBRIUM

In this appendix we extend the analysis presented in Section 9.C by discussing another equilibrium notion that strengthens the consistency conditions on beliefs in the weak PBE concept: extensive form trembling-hand perfect Nash equilibrium [due to Selten (1975)]. In fact. this equilibrium concept is the strongest among those discussed in Section 9.C. The definition of an extensive form trembling-hand perfect Nash equilibrium parallels that for the normal form (sec Section 8.F) but has the trembles applied not to a player's mixed strategies, but rather to the player's choice at each of her information sets. A useful way to view this idea is with what Selten (1975) calls the agent normal form. This is the normal form that we would derive if we pretended that the player had a set of agents in charge of moving for her at each of her information sets (a different one for each), each acting independently to try to maximize the player's payoff. Definition 9.BB.1: Strategy profile 11 in extensive form game r E is an extensive form trembling-hand perfect Nash equifibrium if and only if it is a normal form trembling-hand perfect Nash equilibrium of the agent normal form derived from

rEo

To see why it is desirable to have the trembles occurring at each information set rather than over stratcgies as in the normal-form concept considered in Section 8.F. consider Figure 9.BB.I, which is taken from van Damme (1983). This game has a unique subgame perfect Nash equilibrium: (11 1, (1 2 ) = (NR, L), I). But you can check that «N R, L), f) is not the only normal form trembling-hand perfect Nash equilibrium: so are «R, L), r) and «R, M), r). The reason that these two strategy profiles are normal form trembling-hand perfect is that, in the normal form, the tremble to strategy (NR, M) by player I can be larger than that to (NR. L) despite the fact that the lattcr is a better choice for player I at her second decision node.

EQUILIBRIUM

299

300

CHAPTER

9:

DYNAMIC

GAMES

E X E R CIS E S

301

-------------------------------------------------------------- -------------------------------------------------------------Player I

NR

Firm E

R

G)

(:;) (~) Flgur. 9.BB.2

A sequential equilibrium need not

Figure 9.BB.1

Strategy profiles «R, L), r) and «R, M), r) are normal form trembling-hand perfect but are not subgame perfect.

With such a tremble, player 2's best response to player I's perturbed strategy is r. It is not difficult to sec, however, that the unique extensive form trembling-hand perfect Nash equilibrium of this game is «N R, L), t) because the agent who moves at player I's second decision node will put as high a probability as possible on L. When we compare Definitions 9.BB.I and 9.C.4, it is apparent that every extensive form trembling-hand perfect Nash equilibrium is a sequential equilibrium. In particular, even though the trembling-hand perfection criterion is not formulated in terms of beliefs, we can use the sequence of (strictly mixed) equilibrium strategies {u·} .... , in the perturbed games of the agent normal form as our strategy sequence for deriving sequential equilibrium beliefs. Because the limiting strategies u in the extensive form trembling-hand perfect equilibrium are best responses to every element of this sequence, they are also best responses to each other with these derived beliefs. (Every extensive form trembling-hand perfect Nash equilibrium is therefore also subgame perfect.) In essence, by introducing trembles, the extensive form trembling-hand perfect equilibrium notion makes every part of the tree be reached when strategies are perturbed, and because equilibrium strategies are required to be best responses to perturbed strategies, it insures that equilibrium strategies are sequentially rational. The primary difference between this notion and that of sequential equilibrium is that, like its normal form cousin, the extensive form trembling-hand perfect equilibrium concept can also eliminate some sequential equilibria in which weakly dominated strategies are played. Figure 9.BB.2 (a slight modification of the game in Figure 9.C.1) depicts a sequential equilibrium whose strategies are not extensive form tremblinghand perfect. [n general, however, the concepts are quite close [see Kreps and Wilson (1982) for a formal comparison]; and because it is much easier to check that strategies are best responses at the limiting beliefs than it is to check that they are best responses for a sequence of strategies, sequential equilibrium is much more commonly used. For an interesting further discussion of this concept, consult van Damme (1983).

be extensive form

trembling-hand perfec!. REFERENCES Bernheim. B. D. (I ,)M4). Rationalizable strategic behavior. Econometrica 52: 1007-28. DeGroot. M. H. (1970). Optimal Statistical Decisions. New York: McGraw.HiII. Fudcnbcrg. D .. and J. Tirolc. (l99Ia). Perfect Bayesian and sequential equilibrium. lOUT/wI of Ecofltmric 77.,(m'"

(ii)

1>,(m" m,) = 1>,(m'" m',).

or (d) Suppose that for any two pairs of moves (m" m,) and (m'" m~) such that '"' .. m', or III, .. 111" condition (ii) is violated (i.e., player 2 is never indifferent between pairs of moves). Suppose also that there exists a pure strategy Nash equilibrium in the game in (a) in which 1[, is player I's payoff. Show that in any SPNE of the game in (b), player I's payoff is at least" ,. Would this conclusion necessarily hold for any Nash equilibrium of the game in (b)? (e) Show by example that the conclusion in (d) may fail either if condition (ii) holds for some strategy pairs (m" m,), (m', , m2) with m', or or if we replace the phrase purr srrateg), Nash equilibrium with the phrase mixed strategy Nash equilibrium.

m, ..

m, .. m,

---

EXERCISES

9.B.10" Reconsider the game in Example 9.B.3, but now change the post-entry game so that when both players choose "accommodate", instead of receiving the payoffs (u" ",) = (3, I), the players now must play the following simultaneous-move game: Firm I {

Firm E

u~ D~

What arc the SPNEs of this game when x:2! O? When x < O? 9.B.1 I" Two firms. A and B, are in a market that is declining in size. The game starts in period 0, and the firms can compete in periods 0, 1,2,3, ... (i.e., indefinitely) if they so choose. Duopoly profits in period r for firm A are'equal to 105-IOt, and they are 10.5 - t for finn B. Monopoly profits (those if a firm is the only one left in the market) are 510 - 25t for finn A and 51 - 2t for firm 8. Suppose that at the start of each period, each firm must decide either to "stay in" or "exit" if it is still active (they do so simultaneously if both are still active). Once a finn exits, it is out of the market forever and earns zero in each period thereafter. Finns maximize their (undiscounted) sum of profits. What is this game's subgame perfect Nash equilibrium outcome (and what are the firms' strategies in the equilibrium)? 9.B.12' Consider the infinite horizon bilateral bargaining model of Appendix A (Example 9.AA.2). Suppose the discount factors a, and a, of the two players differ. Now what is the (unique) subgame perfcct Nash equilibrium?

9.B.6" Solve for the mixed strategy equilibrium involving actual randomization in the post-entry subgame of the Niche Choice game in Example 9.B.4. Is there an SPNE that induces this behavior in the post-entry subgame? What are the SPNE strategies?

9,B.l3" What are the subgame perfect Nash equilibria or the infinite horizon version of Exercise 9.8. 7?

9.B.78 Consider the finite horizon bilateral bargaining game in Appendix A (Example 9.AA.I); but instead of assuming that players discount future payoffs, assume that it costs c < v to make an offer. (Only the player making an offer incurs this cost, and players who have made offers incur this cost even if no agreement is ultimately reached.) What is the (unique) SPNE of this alternative model? What happens as T approaches oo?

9.B.14" At time 0, an incumbent firm (firm I) is already in the widget market, and a potential entrant (firm E) is considering entry. In order to enter, firm E must incur a cost of K > O. Firm E's only opportunity to enter is at time O. There are three production periods. In any period in which both firms are active in the market, the game in Figure 9.Ex.1 is played. Firm E moves first, deciding whether to stay in or exit the market. If it stays in, firm I decides whether to fight (the upper payoff is for firm E). Once firm E plays "out," it is out of

9.8.8 c Prove that every (finite) game equilibrium.

r,

has a mixed strategy subgame perfect Nash Figure 9.Ex.1

9.8.9" Consider a game in which the following simultaneous-move game is played twice:

b,

Player I

Player 2 b,

b,

10,10

2,12

0,13

a,

12,2

5,5

0,0

a,

13,0

0,0

1,1

The players observe the actions chosen in the first play of the game prior to the second play. What are the pure strategy subgame perfect Nash equilibria of this game?

303

-----------------------------------------------~~~~~

304

CHAPTER.:

DYNAMIC

GAMES

EXERCISES

305

---------------------------------------------------------------- ,--------------------------------------------------------------the market forever; firm E earns zero in any period during which it is out of the market, and firm I earns x. The discount factor for both firms is O. Assume that:

B

(A.I) x> z > y. (A.2) y + Ox > (I + O)z. (A.3) 1+,I>K. (a) What is the (unique) subgame perfect Nash equilibrium of this game? (b) Suppose now that firm E faces a financial constraint. In panicular, if firm I fights Vllee against firm E (in any period), firm E will be forced out of the market from that point on. Now what is the (unique) subgame perfect Nash equilibrium of this game? (If the answer depends on the values of parameters beyond the three assumptions, indicate how.) Figure 9.Ex.2

9.C.l" Prove Proposition 9.C.1. 9.C.2" What is the set of weak PBEs in the game in Example 9.C.3 when YE (-I, OJ? 9.C.3 c A buyer and a seller are bargaining. The seller owns an object for which the buyer has value v > (the seller's value is zero). This value is known to the buyer but not to the seller. The value's prior distribution is common knowledge. There are two periods of bargaining. The seller makes a take-it-or-Ieave·it offer (i.e., names a price) at the start of each period that the buyer may accept or reject. The game ends when an offer is accepted or after two periods, whichever comes first. Both players discount period 2 payoffs with a discount factor of 0 E (0, I). Assume throughout that the buyer always accepts the seller's offer whenever she is indifferent.

°

(0) Characterize the (pure strategy) weak perfect Bayesian equilibria for a case in which v can take two values "L and V H , with VH > vL > 0, and where .< = Prob (VN)'

(b) Do the same for the case in which v is uniformly distributed on [r, oJ. 9.C.4 c A plaintiff, Ms. P, files a suit against Ms. D (the defendant). If Ms. P wins, she will collect" dollars in damages from Ms. D. Ms. D knows the likelihood that Ms. P will win, i. E [0, I], but Ms. P does not (Ms. D might know if she was actually at fault). They both have strictly positive costs of going to trial of c, and c•. The prior distribution of Ahas density f(i.) (which is common knowledge). Suppose pretrial settlement negotiations work as follows: Ms. P makes a take-it-or-Ieave-it settlement offer (a dollar amount) to Ms. D. If Ms. D accepts, she pays Ms. P and the game is over. If she does not accept, they go to trial. (a) What are the (pure strategy) weak perfect Bayesian equilibria of this game? (b) What effects do changes in c" c., .nd " have? (e) Now allow Ms. D, after having her offer rejected, to decide not to go to court after all. What are the weak perfect Bayesian equilibria? What about the effects of the changes in (b)? 9.C.sc Reconsider Exercise 9.C.4. Now suppose it is Ms. P who knows ).. 9.C.6" What are the sequential equilibria in the games in Exercises 9.C.3 to 9.C.S? 9.C.7" (Based on work by K. Bagwell and developed as an exercise by E. Maskin) Consider the extensive form game depicted in Figure 9.Ex.2. (0) Find a subg.me perfect Nash equilibrium of this game. Is it unique? Are there any other Nash equilibria?

(b) Now suppose that player 2 cannot observe player I's move. Write down the new extensive form. What is the set of Nash equilibria?

(e) Now suppose that player 2 observes player I's move correctly with probability p E (0, I) and incorrectly with probability I - p (e.g., if player I plays T, player 2 observes T with probability p and observes B with probability I - pl. Suppose that player 2's propensity to observe incorrectly (i.e., given by the value of p) is common knOwledge to the two players. What is the extensive form now'! Show that there is a unique weak perfect Bayesian equilibrium. What is it? 9.D.I" Show that under the condition given in Proposition 9.8.2 for existence of a unique subgame perfect Nash equilibrium in a finite game of perfect information, there is an order of iterated removal of weakly dominated strategies for which all surviving strategy profiles lead to the same outcome (i.e., have the same equilibrium path and payoffs) as the subgame perfect Nash equilibrium. [In fact, allY order of deletion leads to this result; see Moulin (1981).]

pAR

T

T H R E E

Market Equilibrium and Market Failure

In Part I II, our focus shifts to the fundamental issue of economics: the organization of production and tlte allocation of the resulting commodities among consumers. This fundamental issue can be addressed from two perspectives, one positive and the other normative.

From a positive (or descriptive) perspective, we can investigate the determination of production and consumption under various institutional mechanisms. The institutional arrangement that is our central focus is that of a market (or private ownership) economy. In a market economy, individual consumers have ownership rights to various assets (such as their labor) and are free to trade these assets in the marketplace for other assets or goods. Likewise, firms, which are themselves owned by consumers, decide on their production plan and trade in the market to secure necessary inputs and sell the resulting outputs. Roughly speaking, we can identify a market equilibrium as an outcome of a market economy in which each agent in the economy (i.e., each consumer and firm) is doing as well as he can given the actions of all other agents. In contrast, from a normative (or prescriptive) perspective, we can ask what constitutes a socially optimal plan of production and consumption (of course, we will need to be more specific about what "socially optimal" means), and we can then examine the extent to which specific institutions, such as a market economy, perform well in this regard. [n Chapter 10, we study competitive (or perfectly competitive) market economies for the first time. These are market economies in which every relevant good is traded in a market at publicly known prices and all agents act as price takers (recall that much of the analysis of individual behavior in Part I was geared to this case). We begin by defining, in a general way, two key concepts: competitive (or Walrasian) equilibrium and Pareto optimality (or Pareto efficiency). The concept of competitive equilibrium provides us with an appropriate notion of market eqUilibrium for competitive market economies. The concept of Pareto optimality offers a minimal and uncontroversial test that any social optimal economic outcome should pass. An economic outcome is said to be Pareto optimal if it is impossible to make some individuals better off without making some other individuals worse off. This concept is a formalization of the idea that there is no waste in society, and it conveniently 307

308

PART

III,

MARKET

EQUILIBRIUM

AND

MARKET

FAILURE

separates the issue of economic efficiency from more controversial (and political) questions regarding the ideal distribution of well-being across individuals. Chapter 10 then explores these two concepts and the relationships between them in the special context of the partial equilibrium model. The partial equilibrium model, which forms the basis for our analysis throughout Part III, offers a considerable analytical simplification; in it, our analysis can be conducted by analyzing a single market (or a small group of related markets) at a time. In this special context, we establish two central results regarding the optimality properties of competitive equilibria, known as the fundamental theorems of welfare economics. These can be roughly paraphrased as follows: ' The First Fundamental Welfare Theorem. If every relevant good is traded in a market at publicly known prices (i.e., if there is a complete set of markets), and if households and firms act perfectly competitively (i.e., as price takers), then the market outcome is Pareto optimal. That is. when markets are complete, any competitive equilibrium is necessarily Pareto optimal. The Second Fundamental Welfare Theorem, If household preferences and firm production sets are convex, there is a complete set of markets with publicly known prices, and every agent acts as a price taker, then any Pareto optimal outcome can be achieved as a competitive equilibrium if appropriate lump-sum transfers of wealth are arranged.

The first welfare theorem provides a set of conditions under which we can be assured that a market economy will achieve a Pareto optimal result; it is, in a sense, the formal expression of Adam Smith's claim about the "invisible hand" of the market. The second welfare theorem goes even further. It states that under the same set of assumptions as the first welfare theorem plus convexity conditions, all Pareto optimal outcomes can in principle be implemented through the market mechanism. That is, a public authority who wishes to implement a particular Pareto optimal outcome (reflecting, say, some political consensus on proper distributional goals) may always do so by appropriately redistributing wealth and then "letting the market work." In an important sense, the first fundamental welfare theorem establishes the perfectly competitive case as a benchmark for thinking about outcomes in market economies. In particular, any inefficiencies that arise in a market economy, and hence any role for Pareto-improving market intervention, must be traceable to a violation of at least one of the assumptions of this theorem. The remainder of Part III, Chapters II to 14, can be viewed as a development of this theme. In these chapters, we study a number of ways in which actual markets may depart from this perfectly competitive ideal and where, as a result, market equilibria fail to be Pareto optimal, a situation known as market failure. In Chapter II, we study externalities and public goods. In both cases, the actions of one agent directly affect the utility functions or production sets of other agents in the economy. We see there that the presence of these nonmarketed "goods" or "bads" (which violates the complete markets assumption of the first welfare theorem) undermines the Pareto optimality of market equilibrium. In Chapter 12, we turn to the study of settings in which some agents in the economy have market power and, as a result, fail to act as price takers. Once again,

--

PART

III:

MARKET

EQUILIBRIUM

AND

MARKET

FAILURE

309

----------------------------------------------------------an assumption of the first fundamental welfare theorem fails to hold, and market equilibria fail to be Pareto optimal as a result. In Chapters 13 and 14, we consider situations in which an asymmetry of information exists among market participants. The complete markets assumption of the first welfare theorem implicitly requires that the characteristics of traded commodities be observable by all market participants because, without this observability, distinct markets cannot exist for commodities that have different characteristics. Chapter 13 focuses on the case in which asymmetric information exists between agents at the time of contracting. Our discussion highlights several phenomena-adverse selection, signaling, and screening-that can arise as a result of this informational imperfection, and the welfare loss that it causes. Chapter 14 in contrast, investigates the case of postcontractual asymmetric information, a problem that leads us to the study of the principal-agent model. Here, too, the presence of asymmetric information prevents trade of all relevant commodities and can lead market outcomes to be Pareto inefficient. We rely extensively in some places in Part 111 on the tools that we developed in Parts I and II. This is particularly true in Chapter 10, where we use material developed in Part I, and Chapters 12 and 13, where we use the game-theoretic tools developed in Part II. A much more complete and general study of competitive market economies and the fundamental welfare theorems is reserved for Part IV.

C HAP T E R

Competitive Markets

10

lO.A Introduction In this chapter, we consider, for the first time, an entire economy in which consumers and firms interact through markets. The chapter has two principal goals: first, to formally introduce and study two key concepts, the notions of Pareto optimality and competitive equilibrium, and second, to develop a somewhat special but analytically very tractable context for the study of market equilibrium, the partial equilibrium model. We begin in Section 10.B by presenting the notions of a Pareto optimal (or Pareto efficient) al/ocatiolJ and of a competitive (or WalrasialJ) equilibrium in a general setting. Starting in Section 10.C, we narro~' our focus to the partial equilibrium context. The partial equilibrium approach, which originated in Marshall (1920), envisions the market for a single good (or group of goods) for which each consumer's expenditure constitutes only a small portion of his overall budget. When this is so, it is reasonable to assume that changes in the market for this good wiII leave the prices of all other commodities approximately unaffected and that there will be, in addition, negligible wealth effects in the market under study. We capture these features in the simplest possible way by considering a two-good model in which the expenditure on all commodities other than that under consideration is treated as a single composite commodity (called the numeraire commodity), and in which consumers' utility functions take a quasilinear form with respect to this numeraire. Our study of the competitive equilibria of this simple model lends itself to extensive demand-andsupply graphical analysis. We also discuss how to determine the comparative statics effects that arise from exogenous changes in the market environment. As an illustration, we consider the effects on market equilibrium arising from the introduction of a distortionary commodity tax. In Section IO.D, we analyze the properties of Pareto optimal allocations in the partial equilibrium model. Most significantly, we establish for this special context the validity of the fundamelHal theorems of welfare economic.~: Competitive equilibrium allocations are necessarily Pareto optimal, and any Pareto optimal allocation can be achieved as a competitive equilibrium if appropriate lump-sum transfers are made. 311

312

CHAPTER

'0,

COMPETITIVE

SECTION

MARKETS

10.B:

PARETO

OPTIMALITY

AND

COMPETITIVE

EQUILIBRIA

313

-------------------------------------------------------------- ------------------------------------------------------------As we noted in the introduction to Part III, these results identify an important benchmark case in which market equilibria yield desirable economic outcomes. At the same time, they provide a framework for identifying situations of market failure, such as those we study in Chapters 11 to 14. In Section IO.E, we consider the measurement of welfare changes in the partial equilibrium context. We show that these can be represented by areas between properly defined demand and supply curves. As an application, we examine the deadweight loss of distortionary taxation. Section IO.F contemplates settings characterized by free entry, that is, settings in which all potential firms have access to the most efficient technology and may enter and exit markets in response to the profit opportunities they present. We define a notion of long-run competitive equilibrium and then usc it to distinguish between longrun and short-run comparative static effects in response to changes in market conditions. In Section IO.G, we provide a more extended discussion of the use of partial equilibrium analysis in economic modeling. The material covered in this chapter traces its roots far back in economic thought. An excellent source for further reading is Stigler (1987). We should emphasize that the analysis of competitive equilibrium and Pareto optimality presented here is very much a first pass. In Part IV we return to the topic for a more complete and general investigation; many additional references will be given there.

lO_B Pareto Optimality and Competitive Equilibria In this section, we introduce and discuss the concepts of Pareto optimality (or Pareto efficiency) and competitive (or Walrasian) equilibrium in a general setting. Consider an economy consisting of I consumers (indexed by i = I •...• I), J firms (indexed by j = I •... , J). and L goods (indexed by t = I•... , L). Consumer ts preferences over consumption bundles Xi = (XII' •••• XLi) in his consumption set Xi c RL are represented by the utility function ui(·). The total amount of each good { = I, ... , L initially available in the economy, called the total endowment of good I, is denoted by WI ~ 0 for { = I •...• L. It is also possible, using the production technologies of the firms, to transform some of the initial endowment of a good into additional amounts of other goods. Each firm j has available to it the production possibilities summarized by the production set lj c RL. An element of lj is a production vector Yj = (Y'i' ... , hJ) E RL. Thus, if (y" . .. , YJ) E RLJ are the production vectors of the J firms, the total (net) amount of good t available to the economy is WI + I j Ylj (recall that negative entries in a production vector denote input usage; see Section 5.B). We begin with Definition IO.B.I, which identifies the set of possible outcomes in this economy: Definition 10.B.l: An economic allocation (x" ... ,x" V"~ ... 'YJ) is a specification of a consumption vector Xi E Xi for each consumer i = 1, ... ,I and a production vector YjE If for each firm i = 1, ... ,J. The allocation (x" ... , x,, y" ... 'YJ) is feasible if J

I

L: j:: 1

Xli $ WI

+

L: j=l

Yli

for(= 1, ... ,L.

", Utilily Pairs Associated wilh ParelO Oplimal Allocations

u Figure 10.8.1

", Thus, an economic allocation is feasible if the total amount of each good consumed docs not exceed the total amount available from both the initial endowment and production. Pareto Optimality

It is often of interest to ask whether an economic system is producing an "optimal" economic outcome. An essential requirement for any optimal economic allocation is that it possess the property of Pareto optimality (or Pareto efficiency). Definition 10.B.2: A feasible allocation (x" ... , x" V"~ ... , YJ) is Pareto optimal (or Pareto efficient) if there is no other feasible allocation (x;, ... , x;, y;, ... , yj) such that ui(x;) ~ Ui(X i ) for all i = 1, ... , I and ui(xi) > U;(Xi) for some i. An allocation that is Pareto optimal uses society'S initial resources and technological possibilities efficiently in the sense that there is no alternative way to organize the production and distribution of goods that makes some consumer better off without making some other consumer worse off. Figure IO.B.I illustrates the concept of Pareto optimality. There we depict the set of attainable utility levels in a two-consumer economy. This set is known as a utility possibility set and is defined in this two-consumer case by U

{(I/" u,)

E 1R2: there exists a feasible allocation (x h X 2' y" ... ,J'J) such that U i $ ui(x,) for i = 1,2}. The set of Pareto optimal allocations corresponds to those allocations that generate utility pairs lying in the utility possibility set's northeast boundary. such as point (u l, ri2)' At any such point, it is impossible to make one consumer better off without making the other worse off. It is important to note that the criterion of Pareto optimality does not insure that an allocation is in any sense equitable. For example, using all of society's resources and technological capabilities to make a single consumer as well off as possible, subject to all other consumers receiving a subsistence level of utility, results in an allocation that is Pareto optimal but not in one that is very desirable on distributional grounds. Nevertheless, Pareto optimality serves as an important minimal test for the desirability of an allocation; it does, at the very least, say that there is no waste in the allocation of resources in society. =

A Ulility possibility set.

33~14~~C~H~A~P~T~E~R~~1:0~:~C~O~M::P~E~T~I~T~I~V~E~~M~A~R~K~E~T~S________________________________________________

-

----

Competitive Equilibria Throughout this chapter, we are concerned with the analysis of competitive mar.ket economies. In such an economy, society's initial endowments and technological. possibilities (i.e., the firms) are owned by consumers. We suppose that c~?sumer I initially owns W/i of good t, where LI w/i = w,. We denote consumer I s vector of endowments by WI = (w i /> ••• ,wLI).In addition, we suppose that .consumer I own.s a share Oij of firm j (where LI 0lj = I), giving him a claim to fracllon 0lj of firm J s profits. In a competitive economy, a market exists for each of the L good~, and .all consumers and producers act as price takers. The idea behind the pnce-taklng assumption is that if consumers and producers are small r~lative to ~he s:ze of the market, they will regard market prices as unaffected by theIr own actIons. Denote the vector of market prices for goods I, ... , L by P = (PI' ... ,pd· Definition 10.B.3 introduces the notion of a competitive (or Walrasian) equilibrium. Definition 10.B.3: The allocation (xr, ... , xr. Vr ... ·. V,) and price vectorp' E JRL constitute a competitive (or Walrasian) equilibrium if the following condlttons are satisfied: (i) Profit maximization: For each firm Max

i.

vi solves (10.B.1)

p"Vj'

ViEr,'

(ii) Utilitv maximization: For each consumer i. xt solves Max

(10.B.2)

u;(x;)

K,EX,

J

L 9;j(p··Vn·

s.t.p··x;~p··w;+

j=1

(iii) Market clearing: For each good

t = 1•...• L. J

I

L x7; = ;=1

w(

+L

V7j·

(10.B.3)

j=1

Definition 10.B.3 delineates three sorts of conditions that must be met for a competitive economy to be considered to be in equilibrium. Conditio~s (i) and (ii) reflect the underlying assumption, common to nearly all economIc mo.d~ls, that agents in the economy seek to do as well as they can for them~el~es. ~ondltlOn (i) states that each firm must choose a production plan that maxImIzes .'ts profits, taking as given the equilibrium vector of prices of i.ts outputs a~d Inputs (for the justification of the profit-maximization assu,:"ptlo.n, see SectIOn 5.G). We studied this competitive behavior of the firm extensIvely In Chapter 5. Condition (ii) requires that each consumer chooses a consumption bun~~e ~hat maximizes his utility given the budget constraint imposed by the equlhbnum prices and by his wealth. We studied this competitive be~avior of the consum~r extensively in Chapter 3. One difference here, however, IS that th~ cons~me~ s wealth is now a function of prices. This dependence of wealth on pnces anses In 1. Strictly speaking, it is equilibrium market prices tha~ they will regard as unaffected by their actions. For more on this point, see the small-type diSCUSSion later 10 thiS sectiOn.

--

SECTION

10.8:

PARETO

OPTIMALITY

AND

COMPETITIVE

EQUILIBRIA

315

two ways: First, prices determine the value of the consumer's initial endowments; for example. an individual who initially owns real estate is poorer if the price of real estate falls. Second, the equilibrium prices affect firms' profits and hence the value of the consumer's shareholdings. Condition (iii) is somewhat different. It requires that, at the equilibrium prices, the desired consumption and production levels identified in conditions (i) and (ii) are in fact mutually compatible; that is, the aggregate supply of each commodity (its total endowment plus its net production) equals the aggregate demand for it. If excess supply or demand existed for a good at the going prices, the economy could not be at a point of equilibrium. For example, if there is excess demand for a particular commodity at the existing prices, some consumer who is not receiving as much of the commodity as he desires could do better by offering to pay just slightly more than the going market price and thereby get sellers to offer the commodity to him first. Similarly, if there is excess supply, some seller will find it worthwhile to offer his product at a slight discount from the going market price.' Note that in justifying why an equilibrium must involve no excess demand or supply, we have actually made use of the fact that consumers and producers might not simply take market prices as given. How are we to reconcile this argument with the underlying price-taking assumption? An answer to this apparent paradox comes from recognizing that consumers and producers a/wa)'s have the ability to alter their offered prices (in Ihe absence of any institutional constraints preventing this). For the price-taking assumption to be appropriate, what we want is that they have no incentive to alter prices that, if taken as given, equate demand and supply (we have already seen that they do have an incentive to alter prices that do not equate demand and supply). Notice that as long as consumers can make their desired trades at the going market prices, they will not wish to offer more than the market price to entice sellers to sell to them first. Similarly, if producers are able to make their desired sales, they will have no incentive to undercut the market price. Thus. at a price that equates demand and supply, consumers do not wish to raise prices, and firms do not wish to lower them. More troublesome is the possibility that a buyer might try to lower the price he pays or that a seller might try to raise the price he charges. A seller, for example. may possess the ability to raise profitably prices of the goods he sells above their competitive level (see Chapter 12). In this case, there is no reason to believe that this market power will not be exercised. To rescue the price-taking assumption, one needs to argue that under appropriate (competitive) conditions such market power does not exist. This we do in Sections 12.F and IS.C, where we formalize the idea that if market participants' desired trades are small relative to the size of the market, then they will have little incentive to depart from market prices. Thus, in a suitably defined equilibrium, they will act approximately like price takers. Note from Definition 10.8.3 that if the allocation (x!, ... , xr, y!, ... , y1) and price vector p' » 0 constitute a competitive equilibrium, then so do the allocation 2. Strictly speaking, this second part of the argument requires Ihe price 10 be positive; indeed, if the price is zero (i.e., if the good is free), then excess supply should be permissible at equilibrium. In the remainder of this chapter. however, consumer prderences will be such as to preclude this possibility (goods will be assumed to be desirable). Hence, we neglect this possibility here.

~

316

CHAPTER

'0:

COMPETITIVE

MARKETS

x;,

(xf, ... , Yr, ... , y1) and price vector rxp' = (rxpr, ... , rxpt) for any scalar rx > 0 (see Exercise 10.B.2). As a result, we can normalize prices without loss of generality. In this chapter, we always normalize by setting one good's price equal to I. Lemma 10.B.I will also prove useful in identifying competitive equilibria. Lemma 10.B.l: If the allocation (x" ... , x,, Y" ... , YJ) and price vector P» 0 satisfy the market clearing condition (10.B.3) for all goods t "p k, and if every consumer"s budget constraint is satisfied with equality, so that p' x; = p·w; + Li O;iP' Yi for all i, then the market for good k also clears. Proof: Adding up the consumers' budget constraints over the rearranging terms, we get

L

I",..

consumers and

Pt(t Xt;-Wt- I Yt -p,(t x,,-w,- I j) =

i=l

J-l

i-I

hj)'

j=l

By market clearing in goods t "p k, the left·hand side of this equation is equal to zero. Thus, the right·hand side must be equal to zero as well. Because p, > 0, this implies that we have market clearing in good k . • In the models studied in this chapter, Lemma 10.B.I will allow us to identify competitive equilibria by checking for market clearing in only L - I markets. Lemma 10.B.I is really just a matter of double·entry accountancy. If consumers' budget constraints hold with equality, the dollar value of each consumer's planned purchases equals the dollar value of what he plans to sell plus the dollar value of his share (Oli) of the firms' (net) supply, and so the total value of planned purchases in the economy must equal the total value of planned sales. If those values are equal to each other in all markets but one, then equality must hold in the remaining market as well.

lO.C Partial Equilibrium Competitive Analysis Marshallian partial equilibrium analysis envisions the market for one good (or several goods, as discussed in Section 10.G) that constitutes a small part of the overall economy. The small size of the market facilitates two important simplifications for the analysis of market equilibrium:' First, as Marshall (1920) emphasized, when the expenditure on the good under study is a small portion of a consumer's total expenditure, only a small fraction of any additional dollar of wealth will be spent on this good; consequently, we can expect wealth effects for it to be small. Second, with similarly dispersed substitution effects, the small size of the market under study should lead the prices of other goods to be approximately unaffected by changes in this market.' Because of this fixity of other prices, we are justified in treating the expenditure on these other goods as a single composite commodity, which we call the lIul1Ieraire (see Exercise 3.G.5).

---

SECTION

'O.C:

PARTIAL

EQUILIBRIUM

COMPETITIVE

With this partial equilibrium interpretation as our motivation, we proceed to study a simple two· good quasilinear model. There are two commodities: good t and the numeraire. We let x, and 111, denote consumer i's consumption of good t and the numeraire, respectively. Each consumer i = I" .. , I has a utility function that takes the quasilinear form (sce Sections 3.B and 3.C): U,(I1I" x;) =

111,

+ c/J;(x,).

We let each consumer's consumption set be III x 1Il+, and so we assume for convenience that consumption of the numeraire commodity 111 can take negative values. This is to avoid dealing with boundary problems. We assume that '/1;(') is bounded above and twice differentiable, with 4>,(x,) > 0 and 4>i(x;) < 0 at all x, ~ O. We normalize 4>;(0) = O. In terms of our partial equilibrium interpretation, we think of good t as the good whose market is under study and of the numeraire as representing the composite of all other goods (111 stands for the total money expenditure on these other goods). Recall that with quasilinear utility functions, wealth effects for non·numeraire commodities are null. In the discussion that follows, we normalize the price of the numeraire to equal I, and we lei p denote the price of good t. Each firm j = I, ... , J in this two·good economy is able to produce good t from good 111. The amount of the numeraire required by firm j to produce qj ~ 0 units of good t is given by the cost function c/q) (recall that the price of the numeraire is I). Letting Zj denote firm j's use of good 111 as an input, its production set is therefore

lj = {( -Zj, qj): qj

~ 0 and zi ~ cJ{qj)}.

In what follows, we assume that cJ{·) is twice differentiable, with cj(ql) > 0 and cj(q) ~ 0 at all qj ~ O. [In terms of our partial equilibrium interpretation, we can think of Cj(qj) as actually arising from some multiple·input cost function cI(w, qj)' given the fixed vector of factor prices w.'] For simplicity, we shall assume that there is no initial endowment of good t, so that all amounts consumed must be produced by the firms. Consumer i's initial endowment of the numeraire is the scalar won' > 0, and we let Won = L, Won" We now proceed to identify the competitive equilibria for this two·good quasilinear model. Applying Definition IO.B.3, we consider first the implications of profit and utility maximization. Given the price p' for good t, firm j's equilibrium output level qj must solve Max

P'qj - ciqj),

qJ~O

which has the necessary and sufficient first·order condition

p' ::; c;{qj),

with equality if qj > O.

On the other hand, consumer i's equilibrium consumption vector (I1It, 3. The following poinls have been formalized by Vives (1987). (See Exercise IO.C.l for an illuslralion.) 4. This is not Ihe only possible justification for laking other goods' prices as being unaffected by the market under study; see Section 10.0.

xn must

5. Some of the exercises at the end of Ihe chapter investigate the effects of exogenous changes

in these factor prices.

ANALYSIS

317

3t8

CHAPTER

10:

COMPETITIVE

SEC T ION

MARKETS

----------------------------------------------------------~

10. C:

Pit. R T I It. L

E QUI L 18 R I U M

COM PET I T lYE

A N It. L Y SIS

319

,-------------------------------------------------------------

solve J

m,

S.t.

+ p'x, $ rom' +

L !i,}p'qj -

cl(qi)·

I-I

Figure lO.C.l

p ----In any solution to this problem. the budget constraint holds with equality. Substituting for nI, from this constraint. we can rewrite consumer i's problem solely in terms of choosing his optimal consumption of good t. Doing so. we see that x must solve

2

r

x(p) =

X,

(a)

which has the necessary and sufficient first-order condition

4>;(xn $ p'.

with equality if xr > O.

In what follows. it will be convenient to adopt the convention of identifying an eq uilibrium allocation by the levels of good t consumed and produced. (xT • ...• xT. qT •. ..• qj). with the understanding that consumer i's equilibrium consumption of the numeraire is then mt = [ro.., + L) !il)(p*qj - cI(q;JlJ - p'xt and that firm j's equilibrium usage of the numeraire as an input is z1 = Cj(qj). To complete the development of the equilibrium conditions for this model. recall that by Lemma 10.B.I. we need only check that the market for good t clears." Hence. we conclude that the allocation (xT •. ..• xT. qT •. ..• q1) and the price p* constitute a competitive equilibrium if and only if

p'

cj(qj).

with equality if q1 > 0

j = 1..... 1.

(IO.C.I)

4>;(xn $ p*.

with equality if xr > 0

i = 1, .... 1.

(to.C.2)

$

I

J

L xr = i"q L qj.

(10.C.3)

1=1

At any interior solution. condition (to.C.I) says that firm j's marginal benefit from selling an additional unit of good t. pO. exactly equals its marginal cost c;(qi). Condition (10.C.2) says that consumer i's marginal benefit from consuming an additional unit of good t. 4>;(xn, exactly equals its marginal cost pO. Condition (IO.C.3) is the market-clearing equation. Together. these 1 + 1 + I conditions characterize the (1 + 1 + I) equilibrium values (xf •.. ·• xT. qf •...• qj) and pO. Note that as long as Max, 4>1(0) > Min) cj(O). the aggregate consumption and production of good must be strictly positive in a competitive equilibrium [this follows from conditions (IO.C.I) and (10.C.2)]. For simplicity. we assume that this is the case in the discussion that follows. Conditions (IO.C.I) to (to.C.3) have a very important property: They do not in volve. in any manner. the endowments or the ownership shares of the consumers. As a result. we see that the equilibrium allocation and price are independent of the

r

6. Note that wc must have p. > 0 in any competitive equilibrium; otherwise, consumers would demand an infinite amounl or good t [recall that 4>;(') > 0).

L x,(p) i"'l

X,(P!

-',(P) xUj)

(b)

distribution of endowments and ownership shares. This important simplification arises from the quasilinear form of consumer preferences.' The competitive equilibrium of this model can be nicely represented using the traditional Marshallian graphical technique that identifies the equilibrium price as the point of intersection of aggregate demand and aggregate supply curves. We can derive the aggregate demand function for good t from condition (10.C.2). Because 4>:(') < 0 and 4>,(') is bounded. 4>1 O. and is strictly decreasing at any p < 4>1(0) [at any such P. we have x;(p) = IN:'(x,(p» < 0]. The aggregate demand function for good I is then the function x(p) = L, x.(p). which is continuous and nonincreasing at all p > O. and is strictly decreasing at any p < Max, 4>1(0). Its construction is depicted in Figure 10.C.I(b) for the case in which 1 = 2; it is simply the horizontal summation of the individual demand functions and is drawn in the figure with a heavy trace. Note that x(p) = 0 whenever p ;(0). The aggregate supply function can be similarly derived from condition (10.C.1).8 Suppose. first, that every c/·) is strictly convex and that cj(qJ) -+ 00 as qJ -+ 00. Then. for any p> 0, we can let q,{p) denote the unique level of q) that satisfies condition (IO.C.I). Note that for P $ c;{O). we have qJ{p) = O. Figure 10.C.2(a) illustrates this construction for a price;; > O. The function q,{') is firm j's supply function for good ( (see Sections S.C and 5.0). It is continuous and nondecreasing at all p > O. and is strictly increasing at any p > c;(O) [for any such p. qj(p) = l/cj(qJ{p)) > 0]. The aggregate (or industry) supply function for good ( is then the function q(p) = LI qip). which is continuous and nondecreasing at all p> O. and is strictly increasing at any p > Minj c; Min) cl(O), this equilibrium price is uniquely defined." The individual consumption and production levels of good ( in this equilibrium are then given by xf = xi(p·) for i = I, ... , J and qj = q)(p*) for j = I, ... , J. More generally, if some cj') is merely convex [e.g., if cj') is linear, as in the constant returns case], then qj') is a convex-valued correspondence rather than a function and it may be well defined only on a subset of prices. lo Nevertheless, the 9. Be warned, however, that the uniqueness of equilibrium is a property that need not hold in

more general seltings in which wealth effects are present. (See Chapler 17.) 10. For example, if firm j has c/{qj) = Cjqj for some scalar cJ > 0, then when p> Cj. we have As a result. if P > c),the aggregate supply is q(p) = L.)qj(p) = co; consequenUy q(') is not well defined for this p.

4j(P) = 0C0.

.._ - - - - - -

The equilibrium po", equates demand and supply.

p'

= c f-------=""'----- q( p)

C'«j) = ci(I/,) = c;(I/,) = q-'(q)

.t(p) .t(p') = q(p')

x,q

ii,

1/, 1/, + ii,

basic features of the analysis do not change. Figure IO.C.4 depicts the determination of Ihe equilibrium value of p in the case where, for all j, Cj(qj) = cqj for some scalar c > O. The only difference from the strictly convex case is that, when J > I, individual firms' equilibrium production levels are not uniquely determined. The inverses of the aggregate demand and supply functions also have interpretalions that are of interest. At any given level of aggregate output of good I, say ij, the inverse of the industry supply function, q-'(ii), gives the price that brings forth aggregate supply ij. That is, when each firm chooses ils optimal oUlput level facing the price p = q-'(ij), aggregate supply is exactly ij. Figure to.C.S iJlustrales Ihis point. Note that in selecting these output levels, all active firms set their marginal cost equal to q - •(ij). As a result, the marginal cost of producing an additional unit of good I at ij is precisely q-'(ij), regardless of which active firm produces it. Thus q -.(.), the inverse of the industry supply function, can be viewed as the industry margilJal cosr function, which we now denote by C'(') = q-'(' ).tt The derivation or C(·) just given accords fully with our discussion in Section 5.E. We saw there that the aggregate supply or the J firms, q(p), maximizes aggregate profits given p; thererore, we can relate q(') to the industry marginal cost function C(·) in exactly the same manner as we did in Section 5.0 for the case or a single firm's marginal cost runction and supply behavior. With convex technologies, the aggregate supply locus ror good ( thererore coincides with the graph of the industry marginal cost runction CU, and so q-'(-) = C(· )." Likewise, at any given level of aggregate demand x, the inverse demand function PIx) = x - '(.x) gives the price that results in aggregate demand of X. That is, when each consumer optimally chooses his demand for good t at this price, tolal demand exactly equals .x. Note that at these individual demand levels (assuming that they are positive), each consumer's marginal benefit in terms of the numeraire from an additional unit of good t, 0 maximizes aggregale surplus ir and only ir S'(x·) = O. Then veriry that S'(x) = P(x) - C'(x) at all x > O. 20. This problem is closely related to that studied in Example 3.1.1 (we could equally well motivate the analysis here by asking, as we did there. about the welfare cost of the distortionary tax relative to the use of a lump-sum tax that raises the same revenue; the measure of deadweight loss that emerges would be Ihe same as that developed here). The discussion that rollows amounts to an extension, in the quasilinear conlext, or the analysis or Example 3.1.1 to situations with many consumers and the presence or firms. For an approach that uses the theory or a normative representative consumer presented in Section 4.D, see the small-type discussion at the end of this

section.

(a) A differenlial change in Marshallian surplus. (b) The Marshallian surplus al aggregate consumption

level x.

332

CHAPTER

10:

COMPETITIVE

MARKETS

-------------------------------------------------------------------

SECTION

10.E:

WELFARE

ANALYSIS

IN

THE

PARTIAL

EQUILIBRIUM

Finally, the integral in (10.E.7) is equal to 22 eS(p) = p'(t)

+'

1'"

xes) ds.

(10.E.8)

Thus, because consumers face an effective price of pOet) + t when the tax is t, the change in consumer surplus from imposition of the tax is

Figure 10.E.3 The deadweight welfare loss from distortionary ta'"tion.

p'(l)

o

x, q

eS(p'(t)

+ t) - eS(p*(O)) =

'·c')+'

f

-

xes) ds.

(10.E.9)

,·(0)

In Figure 10.E.3, the reduction in consumer surplus is depicted by area (dbc!). The aggregate profit, or aggregate producer surplus, when firms face effective price p is J

n(p) =

the introduction of the tax as

Mp) -

I

c/qj(P».

j= I X .("

S'(t) - SOlO) =

f

[PIS) - C(S)) ds.

(10.E.6)

Again, using the optimality of the allocation of production across firms, we have 23

,,"-(O}

Expression (10.E.6) is negative because x'(t) < x'(O) (recall the analysis of Example 10.C.1) and PIx) ~ C(x) for all x::; x'(O), with strict inequality for x < x '(D). Hence, social welfare is optimized by setting t = O. The loss in welfare from t > 0 is known as the deadweight loss of distortio'IUry taxation and is equal to the area of the shaded region in Figure 10.E.3, called the deadweight loss triangle. Notice that since S"(t) = [P(x'(t)) - C(x'(t))]x"(t), we have S"(O) = O. That is, starting from a position without any tax, the first-order welfare effect of an infinitesimal tax is zero. Only as the tax rate increases above zero does the marginal effect become strictly negative. This is as it should be: if we start at an (interior) welfare maximum, then a small displacement from the optimum cannot have a first-order effect on welfare. It is sometimes of interest to distinguish between the various components of aggregate Marshallian surplus that accrue directly to consumers, firms, and the tax authority.2! The aggregate consumer surplus when consumers' effective price is p and therefore aggregate consumption is x(p) is defined as the gross consumer benefits from consumption of good ( minus the consumers' total expenditure on this good (the latter is the cost to consumers in terms of forgone consumption of the numeraire): I

eS(p) =

I

,-!

c'(O)

0

if p :;; c'(O).

The difficulty can be understood in a related way. As discussed in Exercise 5.B.4, the long·run aggregate production set in the situation just described is convex but not closed. This can be seen in Figure IO.F.3, where the industry marginal cost function with J firms, c'(Q)

c'(Q/3)

c'(Q/IO) c'(O)

------------------

Figure 10.F.3

The limiting behavior of industry marginal cost as J .... a:) with strictly convex Costs.

X,Q

if p = c if p < c. We move next to the case in which c(·) is increasing and strictly convex (i.e., the production technology of an individual firm displays strictly decreasing returns to scale). We assume also that x(c'(O» > O. With this type of cost function, 110 101lg-rllll competitive equilibrium can exist. To see why this is so, note that if p > c'(O), then rr(p) > 0 and therefore the long-run supply is infinite. On the other hand, if p :;; c'(O), then the long-run supply is zero while x(p) > O. The problem is illustrated in Figure IO.F.2, where the graph of the demand function x(·) has no intersection with the 27. In particular, if (p', q', J*) is along-run equilibrium. then condition (i) of Definition IO.F.I implies that q' e q(p') and condition (iii) implies that n(p') = O. Hence, by condition (ii). x(p') e Q(p'). In the other direction, if x(p') e Q(p'). then n(p') = 0 and there exists q' e q(p*) and J* with x(p*) = J*q'. Therefore,the three conditions of Definition IO.F.I are satisfied.

Nonexistence of long-run competitive equilibrium with strictly convex costs. (a) A firm's supply correspondence. (b) No intersection of long-run supply and demand.

c'(Q/J), is shown for various values of J (in particular, for J = I, J = 3, and J = 10). Note

that as J increases, this marginal COSt function approaches but never reaches the marginal COSt function corresponding to a constant marginal COSt of c'(O). Perhaps not surprisingly, to generate the existence of an equilibrium with a determinate number of firms, the long-run cost function must exhibit a strictly positive efficient scale; that is, there must exist a strictly positive output level ij at which a firm's average costs of production are minimized (see Section 5.0 for a further discussion of the efficient scale concept). Suppose, in particular, that c(·) has a unique efficient scale ij > 0, and let the minimized level of average cost be c = c(ij)/q. Assume, moreover, that x(C) > O. If at a long-run equilibrium (p*, q*, J*) we had p* > c, then p*ij > cij, and so we would have rr(p*) > O. Thus, at any long-run equilibrium we must have p* ~ In contrast, if p* < c, then x(p*) > 0; but since p*q - c(q) = p*q - (c(q)/q)q ~ (p* - c)q < 0

c.

338

CHAPTER

10:

COMPETITIVE

MARKETS

SECTION

c'(')

Flgur. 10.F.4

Long-run competitiv, equilibrium when a verage costs exhibil strictly pOsitive a efficient scale. (a) A firm's supply correspondence. (a)

"Q (b)

FREE-ENTRY

AND

LONG-RUN

COMPETITIVE

not intersect. 29 The nonexistence of competitive equilibrium can occur here for the same reason that we have already alluded to in small type in Section lO.e: The long-run production technologies we are considering exhibit nonconvexities. It seems plausible, however, that when the efficient scale of a firm is small relative to the size of the market, this" integer problem" should not be too much of a concern. In fact, when we study oligopolistic markets in Chapter 12, we shall see that when firms' efficient scales arc small in this sense, the oligopolistic equilibrium price is close to e, the equilibrium price we would derive if we simply ignored the integer constraint on the number of firms J *. Intuitively, when the efficient scale is small, we will have many firms in the industry and the equilibrium, although not strictly competitive, will involve a price close to C. Thus, if the efficient scale is small relative to the size of the market [as measured by x(e)], then ignoring the integer problem and treating firms as price takers gives approximately the correct answer. Third, when an equilibrium exists, as in Figure 10.F.4, the equilibrium outcome maximizes Marshallian aggregate surplus and therefore is Pareto optimal. To see this, note from Figure 10.F.4 that aggregate surplus at the considered equilibrium is equal to

C(q)

x,q

10.F:

EQUILIBRIA

339

------------------------------------------------------------------------------

------------------------------------------------------------------------~-

(b) Long-run equilibrium.

for all q > 0, a firm would earn strictly negative profits at any positive level of output. So p' < C also cannot be a long-run equilibrium price. Thus, at any long-run equilibrium we must have p' = c. Moreover, if p* = c, then each active firm's supply must be q' = q (this is the only strictly positive output level at which the firm earns nonnegative profits), and the equilibrium number of active firms is therefore J' = X(C)jq.2. In conclusion, the number of active firms is a well-determined quantity at long-run equilibrium. Figure 10.F.4 depicts such an equilibrium. The long-run aggregate supply correspondence is

Max .x~O

IX P(s) ds -

Ex,

0

the maximized value of aggregate surplus when firms' cost functions are cq. But because c(q) 2! i'q for all q, this must be the largest attainable value of aggregate surplus given the actual cost function c('); that is, Max

c if p = c if p < c. if p >

J:~O

IX Pis) ds 0

ex 2!

Ii P(s) ds -

Jc(,XfJ),

0

for all .X and 1. This fact provides an example of a point we raised at the end of Section 10.D (and will substantiate with considerable generality in Chapter 16): The first welfare theorem continues to be valid even in the absence of convexity of individual production sets.

Observe that the equilibrium price and aggregate output are exactly the same as if the firms had a constant returns to scale technology with unit cost c. Several points should be noted about the equilibrium depicted in Figure IO.F.4. First, if the efficient scale of operation is large relative to the size of market demand, it could well turn out that the equilibrium number of active firms is small. In these cases, we may reasonably question the appropriateness of the price-taking assumption (e.g., what if J* = 17). Indeed, we are then likely to be in the realm of the situations with market power studied in Chapter 12. Second, we have conveniently shown the demand at price C, x(c), to be an integer multiple of q. Were this not so, no long-run equilibrium would exist because the graphs of the demand function and the long-run supply correspondence would

Although firms may enter and exit the market in response to profit opportunities in the long run, these changes may take time. For example, factories may need to be shut down, the workforce reduced, and machinery sold when a firm exits an industry. It may even pay a firm to continue operating until a suitable buyer for its plant and equipment can be found. When examining the comparative statics effects of a shock to a market, it is therefore important to distinguish between long-run and short-run effects. Suppose, for example, that we are at a long-run equilibrium with J* active firms

28. Note Ihal when "0 is differentiable, condition (i) of Definition IO.F.1 implies Ihat c'(q' ) = p', while condition (iii) implies p' = c(q' )/q'. Thus, a necessary condition for an equilibrium is that c'(q') = c(q·)fq*. This is the condition for q' to be a critical point of average cosls [differentiate c(q)/q and see Exercise 5.0.1]. In the case where average cost c(q)/q is U-shaped

29. An intermediate case between constant relurns (where any scale is efficient) and the case of a unique efficient scale occurs when there is a range [q. of efficient scales (the average cost curve has a Hat boltom). In this case. the integer problem is mitigated. For a long-run competitive equilibrium to exist, we now onty need there to be some q E [q, oj] such that x(C)/q is an integer.

(i.e., with no critical point other than the global minimum, as shown in Figure IO.F.4), this implies that q' = 4. and so p' = c and l ' = x(c)/ij. Note, however, that the argument in the text does not

Of course, as the inlerval [4, oj] grows larger. not only are the chances of a long-run equilibrium existing greater, but so are the chances of indeterminacy of the equilibrium number of firms (Le., of multiple equilibria involving differing numbers of firms).

Short-Run and Long-Run Comparative Statics

in

require this assumption about the shape of average costs.

...

- - - - ~-------------------------

-----------------.

---

.--

340

CHAPTER

10:

COMPETITIVE

SECTION

MARKETS

~~------------------------------------------------------------

cost function

K + "'(q)

if q > 0

o

if q = 0,

(IO.F.I)

where ",(0) = 0, ""(q) > 0, and "'''(q) > O. But in the short run, it may be impossible for an active firm to recover its fixed costs if it exits and sets q = O. Hence, in the short run the firm has the cost function c,(q) = K

+ "'(q)

for all q

~

O.

CONCLUDING

REMARKS

ON

PARTIAL

EQUILIBRIUM

Now suppose that we have a shift to the demand function x(·, 0Criods in isolation from each other. This approach ignores. for example, the possibility of intert'-mporal substitution by consumers when tomorow's price is expected to differ from today's (intertemporal substitution might be particularly important for very short-run periods when the fact that many production decisions are fixed can make prices very sensitive to demand shocks). These weaknesses are not flaws in the competitive model per se, but rather only in the somewhat extreme methodological simplification adopted here. A fully satisfactory treatment of these issues requires an explicitly dynamic model that places expectations at center stage. In Chapter 20 we study dynamic models of competitive markets in greater depth. Nevertheless, this simple dichotomization into long-run and short-run periods of adjustment is often a useful starting point for analysis.

(10.F.2)

Another possibility is that c(q) might be the cost function of some multiple-input production process, and in the short run an active firm may be unable to vary its level of some inputs. (See the discussion in Section 5.B on this point and also Exercises 10.F.S and 10.F.6 for illustrations.) Whenever the distinction between short run and long run is significant, the shorl-rull comparative scatics effects of a demand shock may best be determined by solving for the competitive equilibrium given J * firms, each with cost function c,(·), and the new demand function. This is just the equilibrium notion studied in Section 10.C, where we take firms' cost functions to be c,(')' The long-run comparative statics effects can then be determined by solving for the long-run (i.e., free entry) equilibrium given the new demand function and long-run cost function c(·). Example IO.F.I: Short-Run and Long-Run Comparative Statics with Lumpy Fixed Costs that Are SUlik in the Short Run. Suppose that the long-run cost function c(') is given by (I O.F.l) but that in the short run the fixed cost K is sunk so that c,(-) is given by (I 0.F.2). The aggregate demand function is initially x(·, "'0)' and the industry is at a long-run equilibrium with Jo firms, each producing ii units of output [the efficient scale for cost function c('», and a price of p' = c = c(ii)/ii· This equilibrium

IO.G Concluding Remarks on Partial Equilibrium Analysis

position is depicted in Figure IO.F.S. Flgur. 10.F.5

Short-run and long.run comparati\"c

statics in Example 10.F.!.

ANALYSIS

341

--------------------------------------------------~~~~

In principle, the analysis of Pareto optimal outcomes and competitive equilibria requires the simultaneous consideration of the entire economy (a task we undertake in Part IV). Partial equilibrium analysis can be thought of as facilitating matters on two accounts. On the positive side, it allows us to determine the equilibrium outcome in the particular market under study in isolation from all other markets. On the normative side, it allows us to use Marshallian aggregate surplus as a welfare measure that, in many cases of interest, has a very convenient representation in terms of the area lying vertically between the aggregate demand and supply curves. In the model considered in Sections IO.C to 10.F, the validity of both of these simplifications rested, implicitly, on two premises: first, that the prices of all commodities other than the one under consideration remain fixed; second, that there are no wealth effects in the market under study. We devote this section to a few additional interpretative comments regarding these assumptions. (See also Section IS.E for an example illustrating the limits of partial equilibrium analysis.)

342

c HAP T E R

1 0:

COM PET I T I V E

MA AKETS

----------------------------------------------------------~

The assumption that the prices of goods other than the good under consideration (say, good t) remain fixed is essential for limiting our positive and normative analysis to a single market. In Section IO.B, we justified this assumption in terms ofthe market for good t being small and having a diffuse influence over the remaining markets. However, this is not its only possible justification. For example, the nonsubstitution theorem (see Appendix A of Chapter 5) implies that the prices of all other goods will remain fixed if the numeraire is the only primary (i.e., non produced) factor, all produced goods other than t are produced under conditions of constant returns using the numeraire and produced commodities other than t as inputs, and there is no joint production. 30 Even when we cannot assume that all other prices are fixed, however, a generalization of our single-market partial equilibrium analysis is sometimes possible. Often we are interested not in a single market but in a group of commodities that are strongly interrelated either in consumers' tastes (tea and coffee are the classic examples) or in firms' technologies. In this case, studying one market at a time while keeping other prices fixed is no longer a useful approach because what matters is the simultaneous determination of all prices in the group. However, if the prices of goods outside the group may be regarded as unaffected by changes within the markets for this group of commodities, and if there are no wealth effects for commodities in the group, then we can extend much of the analysis presented in Sections IO.C to IO.F. To this effect, suppose that the group is composed of M goods, and let Xi E RAt and qj E IRM be vectors of consumptions and productions for these M goods. Each consumer has a utility function of the form

REFERENCES

ignored." (Exercises IO.G.3 to IO.G.5 ask you to consider some issues related to this point.) The assumption of no wealth effects for good t, on the other hand, is critical for the validity of the style of welfare analysis that we have carried out in this chapter. Without it, as we shall see in Part IV, Pareto optimality cannot be determined independently from the particular distribution of welfare sought, and we already know from Section 3.1 that area measures calculated from Walrasian demand functions are not generally correct measures of compensating or equivalent variations (for which the Hicksian demand functions should be used). However, the assumption of no wealth effects is much less critical for positive analysis (determination of equilibrium, comparative statics effects, and so on). Even with wealth effects, the demand-and-supply apparatus can still be quite helpful for the positive part of the theory. The behavior of firms, for example, is not changed in any way. Consumers, on the other hand, have a demand function that, with prices of the other goods kept fixed, now depends only on the price for good t and wealth. If wealth is determined from initial endowments and shareholdings, then we can view wealth as itself a function of the price of good ( (recall that other prices are fixed), and so we can again express demand as a function of this good's price alone. Formally, the analysis reduces to that presented in Section IO.C: The equilibrium in market f can be identified as an intersection point of demand and supply curves. 32 31. A case in which the single-market analysis for good I is Slill fully justified is when ulility and cost functions have (he form

and

where mi is the consumption of the numeraire commodity (Le., the total expenditure on commodities outside the group). Firms' cost functions are c,(qJ). With this specification, many of the basic results of the previous sections go through unmodified (often it is just a matter of reinterpreting X, and q, as vectors). In particular, the results discussed in Section IO.C on the uniqueness of equilibrium and its independence from initial endowments still hold (see Exercise IO.G.I), as do the welfare theorems of Section 10.0. However, our ability to conduct welfare analysis using the areas lying vertically between demand and supply curves becomes much more limited. The cross-effects among markets with changing and interrelated prices cannot be

30. A simple example of Ihis resull arises when all produced goods other than ( are produced directly from the numeraire with constant returns to scale. In this case, the equilibrium price of each of these goods is equal to the amount of the numeraire that must be used as an input in its produclion per unit of output produced. More generally, prices for produced goods other than ( will remain fixed under the conditions of the nonsubstitution theorem because all efficient production vectors can be generated using a single set of techniques. In any equilibrium, the price of each produced good other than ( must be equal to the amount of the numeraire embodied in a unit of the good in the efficient production technique, either directly through the use of the numeraire as an input or indirectly through the use as inputs of produced goods other than (that are in turn produced using the numeraire (or using other produced goods that are themselves produced using

the numeraire, and so on).

343

--------------------------------------------------------~~~

where ;~ -/.i and q -1.1 are consumption and production vectors for goods in the group other than I. With this additive separabilily in good I, the markelS for goods in the group other than { do nol influence Ihe equilibrium price in market t. Good t is effeclively independent of Ihe group, and we can treat it in isolation, as we have done in the previous sections. (In point of fact, we do not ~ven need 10 assu~e Ihat Ihe remaini~g ~arkets in the group keep their prices fixed. What happens '" Ihem IS SImply ",elevanl for eqUIlibrium and welfare analysis in the market for good t.) See Exercise IO.G.2.

32. The presence of wealth effects can lead. however. to some interesting new phenomena on the consumer's side. One is the backward.bending demand curve, where demand for a good is increasing in its price over some range. This can happen if Consumers have endowments of good I, because then an increase in its price increases consumers' wealth and could lead to a net increase in their demands for good t, even ifit is a normal good.

REFERENCES Marshall. A. (1920). Principles of Economics. New York: Macmillan. Stigler, G. (1987). The n"ory of Price, 4th ed. New York: Macmillan. Vives, X. (1987). Small income effects: A Marshallian theory or consumer surplus and downward sloping demand. Review of Economic Studies 54: 87-103.

344

EXERCISES

CHAPTER

10:

COMPETITIVE

MARKETS

~~~~~----------------------------------------------------EXERCISES 10.B.I" The concept defined in Definition 10.B.2 is sometimes known as strong Pareto efficienc)'. An outcome is weakly Pareto efficient if there is no alternative feasible allocation (a) Argue that if an outcome is strongly Pareto efficient, then it is weakly Pareto efficient

as well. (b) Show that if all consumers' preferences are continuous and strongly monotone, then these two notions of Pareto efficiency are equivalent for any interior outcome (Le., an outcome in which each consumer's consumption lies in the interior of his consumption set). Assume for simplicity that X, = R~ for all i. (c) Construct an example where the two notions are not equivalent. Why is the strong monotonicity assumption important in (b)? What about interiority? 10.B.2' Show that if allocation (xT, ... , xT, yT, ... , y1) and price vector p. »0 constitute a competitive equilibrium, then allocation (xT, . .. , xT, yT, ... , yJ) and price vector ap· also constitute a competitive equilibrium for any scalar a > O. IO.C.I" Suppose that consumer i's preferences can be represented by the utility function Lt log (Xli) (these are Cobb-Douglas preferences).

"'(Xli' ... ' Xli) =

(a) Derive his demand for good

t.

10.CAB Consider a central authority who has x units of good t to allocate among I consumers, each of whom has a quasilinear utility function of the form 4>,(x,) + m" with ,pt.. ) a differentiable, increasing, and strictly concave function. The central authority allocates good t to maximize the sum of consumers' utilities L, "I' (a) Set up the central authority's problem and derive its first-order condition.

that makes all individuals strictly better off.

What is the wealth effect?

(b) Now consider a sequence of situations in which we proportionately increase both the number of goods and the consumer's wealth. What happens to the wealth effect in the limit? 10.C.2" Consider the two-good quasilinear model presented in Section 10.C with one consumer and one firm (so that I = I and J = I). The initial endowment of the numeraire is W > 0, and the initial endowment of good (is O. Let the consumer's quasilinear utility function be q,(x) + m, where q,(x) = a + (i In x for some (~, (i) » O. Also, let the firm's cost function be c(q) = aq for some scalar a > O. Assume that the consumer receives all the profits of the firm. Both the firm and the consumer act as price takers. Normalize the price of good m to equal 1, and denote the price of good ( by p.

(b) Let y(x) be the value function of the central authority's problem, and let P(x) = y'(x) be its derivative. Show that if (xT, ... , x1) is the optimal allocation of good t given available quantity X, then PIx) = q,;(xn for all i with xr > O. (e) Argue that if all consumers maximize utility facing a price for good t of P(x) (with the price of the numeraire equal to I), then the aggregate demand for good t is exactly x. Conclude that P(·) is, in fact, the inverse of the aggregate demand function x(·).

to.C.5" Derive the differential change in the equilibrium price in response to a differential change in the tax in Example 1O.c.1 by applying the implicit function theorem to the system of equations (IO.C.4) to (IO.C.6). to.C.6" A tax is to be levied on a commodity bought and sold in a competitive market. Two possible forms of tax may be used: In one case, a specific tax is levied, where an amount t is paid per unit bought or sold (this is the case considered in the text); in the other case, an ad valorem tax is levied, where the government collects a tax equal to r times the amount the seller receives from the buyer. Assume that a partial equilibrium approach is valid. (a) Show that, with a specific tax, the ultimate cost of the good to consumers and the amounts purchased are independent of whether the consumers or the producers pay the tax.

(b) Show that this is not generally true with an ad valorem tax. In this case, which collection method leads to a higher cost to consumers? Are there special cases in which the collection method is irrelevant with an ad valorem tax?

M

(a) Derive the consumer's and the firm's first-order conditions.

(b) Derive the competitive equilibrium price and output of good with

~,

t. How do these vary

p, and a?

1O.C.3" Consider a central authority who operates J firms with differentiable convex cost functions 0 and ,< 0, and aggregate supply curve q(p) = ap', where IX > 0 and "I > O. Calculate the percentage change in consumer cost and producer receipts per unit sold for a small ("marginal") tax. Denote I< = (I + f). Assume that a partial equilibrium approach is valid. Compute the elasticity of the equilibrium price with respect to 1 0 if and only if p> c'(O).

to.F.2" Consider a market with demand function x(p) = A - Bp in which every potential firm has cost function c(q) = K + aq + pq', where ~ > 0 and P> O. (a) Calculate the long-run competitive equilibrium price, output per firm, aggregate output, and number of firms. Ignore the integer constraint on the number of firms. How does each of these vary with A? (b) Now examine the short-run competitive equilibrium response to a change in A starting from the long-run equilibrium you identified in (a). How does the change in price depend on the level of A in the initial equilibrium? What happens as A --+ oo? What accounts for this effect of market size? 10.F.3" (0. Pearce) Consider a partial equilibrium selling in which each (potential) firm has a long-run cost function «'), where c(q) = K + (q) for q > 0 and c(0) = O. Assume that q,'(q) > 0 and "(q) < 0, and denote the firm's efficient scale by ij. Suppose that there is initially a long-run equilibrium with J. firms. The government considers imposing two different types

348

CHAPTER

10:

COMPETITIVE

MARKETS

of taxes: The first is an ad valorem tax of t (see Exercise 1O.C.6) on sales of the good. The second is a tax T that must be paid by any operating firm (where a firm is considered to be "operating" if it sells a positive amount). If the two taxes would raise an equal amount of revenue with the initial level of sales and number of firms, which will raise more after the industry adjusts to a new long-run equilibrium? (You should ignore the integer constraint on the number of firms.) IO.F.4H (J. Panzar) Assume that partial equilibrium analysis is valid. The single-output, many-input technology for producing good t has a differentiable cost function c(w, q), where W = (Wi' ••• ' w.) is a vector of factor input prices and q is the firm's output of good t. Given factor prices w, let q(w) denote the firm's efficient scale. Assume that q(w) > 0 for all w. Also let p'(w) denote the long-run equilibrium price of good t when factor prices are w. Show that the function p;(w) is nondecreasing, homogeneous of degree one, and concave. (You should ignore the integer constraint on the number of firms.) IO.F'sC Suppose that there are J firms that can produce good t from K factor inputs with differentiable cost function c(w, q). Assume that this function is strictly convex in q. The differentiable aggregate demand function for good t is x(p, ~), where iJx(p, ~)/iJp < 0 and ox(p, ~)/iJ~ > 0 (~ is an exogenous parameter affecting demand). However, although c(w, q) is the cost function when all factors can be freely adjusted, factor k cannot be adjusted in the short run. Suppose that we are initially at an equilibrium in which all inputs are optimally adjusted to the equilibrium level of output q* and factor prices w so that, letting z,(w, q) denote a firm's conditional factor demand for input k when all inputs can be adjusted, = z,(w, q*).

z:

(a) Show that a firm's equilibrium response to an increase in the price of good in the long run than in the short run.

t is larger

(b) Show that this implies that the long-run equilibrium response of p, to a marginal increase in ~ is smaller than the short-run response. Show that the reverse is true for the response of the equilibrium aggregate consumption of good t (hold the number of firms equal to J in both the short run and long run). IO.F.6 B Suppose that the technology for producing a good uses capital (z,) and labor (z,) and takes the Cobb-Douglas form f(z" z,) = z~zl-', where ~ e (0, I). In the long run, both factors can be adjusted; but in the short run, the use of capital is fixed. The industry demand function takes the form x(p) = a - bp. The vector of input prices is (w" w,). Find the long-run equilibrium price and aggregate quantity. Holding the number of firms and the level ofeapital fixed at their long·run equilibrium levels, what is the short-run industry supply function? IO.F.7B Consider a case where in the short run active firms can increase their use of a factor but cannot decrease it. Show that the short-run cost curve will exhibit a kink (i.e., be nondifferentiable) at the current (long-run) equilibrium. Analyze the implications of this fact for the relative variability of short-run prices and quantities. IO.G.IB Consider the case of an interrelated group of M commodities. Let consumer ;'s utility function take the form ui(x 1/, •.• ,xu,) = m. + 4>.(XI/" •• ,x",). Assume that .p,{-) is differen· tiable and strictly concave. Let firm j's cost function be the differentiable convex function cl(q,i"" ,qMI)'

Normalize the price of the numeraire to be 1. Derive (l + J + t) M equations characterizing the (l + J + I)M equilibrium quantities (xT..... , xt..) for i = t, ... ,I, (qTi' ... , qt.l ) for j = t, ... , J, and (pT, ... , Pt.). [Him: Derive consumers' and firms' first·order conditions and the M - I market-clearing conditions in parallel to our analysis of the single-market case.] Argue that the equilibrium prioes and quantities of these M goods arc independent of

---

EXERCISES

349

----------------------------------------------------~~ consumers' wealths, that equilibrium individual consumptions and aggregate production levels are unique, and that if the ck) functions arc strictly convex, then equilibrium individual prOduction levels are also unique. IO.G.Z" Consider the case in which the functions .pi(·) and cl (·) in Exercise IO.G.I are separable in good { (one of the goods in the group): .pi(·) = 4>,,(XII) + .p-f.• (X-f .• ) and cl(') = Cf/q,;) + C-f,;{q -f.I)' Argue that in this ease, the equilibrium price, consumption, and prOduction of good ( can be determined independently of other goods in the group. Also argue that under the same assumptions as in the single-market case studied in Section 10.E, changes in welfare caused by changes in the market for this good can be eaptured by the Marshallian aggregate surplus for this good, L. 4>II(X,,) - LI c(j(q'I)' which ean be represented in terms of the areas lying vertically between the demand and supply curves for good t. Note the implication of these results for the case in which we have separability of all goods: .pi(·) = LI 4>,,(x,,) and ci') = Lt cfj(q'I)' H IO.G.3 Consider a three-good economy (t = 1,2, 3) in which every Consumer has preferences that can be described by the utility function u(x) = x, + 4>(x" x,) and there is a single production process that produces goods 2 and 3 from good I having c(q" q,) = c,q, + c,q,. Suppose that we are considering a tax change in only a single market, say market 2. (a) Show that if the price in market 3 is undistorted (Le., if I, = 0), then the change in aggregate surplus caused by the tax change can be captured solely through the change in the area lying vertically between market 2's demand and supply curves holding the price of good 3 at its initial level. (b) Show that if market 3 is initially distorted because I, > 0, then by using only the single-market measure in (a), we would overstate the decrease in aggregate surplus if good 3 is a substitute for good 2 and would understate it if good 3 is a complement. Provide an intuitive explanation of this result. What is the correct measure of welfare change? IO.G.4 Consider a three-good economy (t = 1,2,3) in which every consumer has preferences that can be described by the utility function u(x) = x, + ",(x" x,) and there is a single production process that produces goods 2 and 3 from good I having c(q" q,) = c,q, + c,q,. Derive an expression for the welfare loss from an increase in the tax rates on both goods. B

IO.G,SH Consider a three-good economy (t = 1,2, 3) in which every consumer has preferences that can be described by the utility function u(x) = XI + ",(x" x,) and there is a single production process that produces goods 2 and 3 from good I having c(q2' q,) = C2(q2) + c,(q,), where c 2 (·) and c,(·) are strictly increasing and strictly convex. (a) If goods 2 and 3 are substitutes, what effect does an increase in the tax on good 2 have on the price paid by consumers for good 3? What if they are complements? (b) What is the bias from applying the formula for welfare loss you derived in part (b) of Exercise 10.G.3 using the price paid by consumers for good 3 prior to the tax change in both the case of substitutes and that of complements?

C HAP

Externalities and Public Goods

T

E

R

11

il.A Introduction In Chapter 10, we saw a close connection between competitive, price-taking equilibria and Pareto optimality (or, Pareto efficiency).' The first welfare theorem tells us that competitive equilibria are necessarily Pareto optimal. From the second welfare theorem, we know that under suitable convexity hypotheses, any Pareto optimal allocation can be achieved as a competitive allocation after an appropriate lump-sum redistribution of wealth. Under the assumptions of these theorems, the possibilities for welfare-enhancing intervention in the marketplace are strictly limited to the carrying out of wealth transfers for the purposes of achieving distributional aims. With this chapter, we begin our study of market failures: situations in which some of the assumptions of the welfare theorems do not hold and in which, as a consequence, market equilibria cannot be relied on to yield Pareto optimal outcomes. In this chapter, we study two types of market failure, known as externalities and public goods. In Chapter 10, we assumed that the preferences of a consumer were defined solely over the set of goods that she might herself decide to consume. Similarly, the production of a firm depended only on its own input choices. In reality, however, a consumer or firm may in some circumstances be directly affected by the actions of other agents in the economy; that is, there may be external effects from the activities of other consumers or firms. For example, the consumption by consumer i's neighbor of loud music at three in the morning may prevent her from sleeping. Likewise, a fishery'S catch may be impaired by the discharges of an upstream chemical plant. Incorporating these concerns into our preference and technology formalism is, in principle, a simple matter: We need only define an agent's preferences or production set over both her own actions and those of the agent creating the external effect. But the effect on market equilibrium is significant: In general, when external effects are present, competitive equilibria are not Pareto optimal. Public goods, as the name suggests, are commodities that have an inherently "public" character, in that consumption ofa unit of the good by one agent does not preclude its consumption by another. Examples abound: Roadways, national defense, 1. See also Chapter 16. 350

SEC T ION

1 1 . B:

A

S IMP L E

B' L ATE R ALE X T ERN A LIT Y

351

,----------------------------------------------------------flood-control projects, and knowledge all share this characteristic. The private provision of public goods generates a special type of externality: if one individual provides a unit of a public good, all individuals benefit. As a result, private provision of public goods is typically Pareto inefficient. We begin our investigation of externalities and public goods in Section I LB by considering the simplest possible externality: one that involves only two agents in the economy, where one of the agents engages in an activity that directly affects the other. In this setting, we illustrate the inefficiency of competitive equilibria when an externality is present. We then go on to consider three traditional solutions to this problem: quotas, taxes, and the fostering of decentralized bargaining over the extent of the externality. The last of these possibilities also suggests a connection between the presence of externalities and the nonexistence of certain commodity markets, a topic that we explore in some detail. In Section II.C, we study public goods. We first derive a condition that characterizes the optimal level of a public good and we then illustrate the inefficiency resulting from private provision. This Pareto inefficiency can be seen as arising from an externality among the consumers of the good, which in this context is known as the free-rider problem. We also discuss possible solutions to this free-rider problem. Both quantity-based intervention (here, direct governmental provision) and pricebased intervention (taxes and subsidies) can, in principle, correct it. In contrast, decentralized bargaining and competitive market-based solutions are unlikely to be viable in the context of public goods. In Section II.D, we return to the analysis of externalities. We study cases in which many agents both produce and are affected by the externality. Multilateral externalities can be classified according to whether the externality is depletable (or private or rivalrous) or nondepletable (or public or nonrivalrous). We argue that market solutions are likely to work well in the former set of cases but poorly in the latter, where the externality possesses the characteristics of a public good (or bad). Indeed, this may well explain why most externalities that are regarded as serious social problems (e.g., water pollution, acid rain, congestion) take the form of nondepletable multilateral externalities. In Section I I.E, we examine another problem that may arise in these settings: Individuals may have privately held information about the effects of externalities on their well-being. We see there that this type of informational asymmetry may confound both private and government efforts to achieve optimal outcomes. In Appendix A, we study the connection between externalities and the presence of technological nonconvexities, and we examine the implications of these nonconvexities for our analysis. The literature on externalities and public goods is voluminous. Useful introductions and further references to these subjects may be found in Baumol and Oates (1988) and Laffont (1988).

l1.B A Simple Bilateral Externality Surprisingly, perhaps, a fully satisfying definition of an externality has proved somewhat elusive. Nevertheless, informal Definition II.B.I provides a serviceable point of departure.

352

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

=:~~~~-----------------------------------------------Definition 11.B.1: An externality is present whenever the well-being of a consumer or the production possibilities of a firm are directly affected by the actions of another agent in the economy.

Simple as Definition II.B.I sounds, it contains a subtle point that has been a source of some confusion. When we say "directly," we mean to exclude any effects that are mediated by prices. That is, an externality is present if, say, a fishery's productivity is affected by the emissions from a nearby oil refinery, but not si~ply because the fishery's profitability is affected by the price of oil (which, in turn, IS to some degree affected by the oil refinery's output of oil). The latter type of effect [referred to as a pecuniary externality by Viner (1931)] is present in any competitive market but, as we saw in Chapter 10, creates no inefficiency. Indeed, with price-taking behavior, the market is precisely the mechanism that guarantees a Pareto opti,:"al outcome. This suggests that the presence of an externality is not merely a technological phenomenon but also a function of the set of markets in existence. We return to this point later in the section. In the remainder of this section, we explore the implications of external effects for competitive equilibria and public policy in the context of a very simple two-agent, partial equilibrium model. We consider two consumers, indexed by i = 1,2, who constitute a small part of the overall economy. In line with this interpretation, we suppose that the actions of these consumers do not affect the prices pERL of the L traded goods in the economy. At these prices, consumer i's wealth is w,. In contrast with the standard competitive model, however, we assume that each consumer has preferences not only over her consumption of the L traded goods (x", ... , XL') but also over some action h E R+ taken by consumer 1. Thus, consumer i's (differentiable) utility function takes the form u,(X", ... , XLI' h), and we assume that CU'(X'2' ... ' XL2' h)/oh ¥- O. Because consumer I's choice of h affects consu~er 2's well-being, it generates an externality. For example, the two consumers may bve next door to each other, and h may be a measure of how loudly consumer 1 plays music. Or the consumers may live on a river, with consumer 1 further upstream. In this case, h could represent the amount of pollution put into the river by consumer 1; more pollution lowers consumer 2's enjoyment of the river. We should hasten. to add that external effects need not be detrimental to those affected by them. ActIOn II could, for example, be consumer l's beautification of her property, which her 2 neighbor, consumer 2, also gets to enjoy. In what follows, it will be convenient to define for each consumer i a derived utility function over the level of h, assuming optimal commodity purchases by consumer i at prices pERL and wealth w,: v,(p,

w" h) = Max

u,(x"I1)

,I(,~O

s.t. p' x, !>

w,.

For expositional purposes, we shall also assume that the consumers' utility functions 2. An externality favorable to the recipient is usually called a positive externality, and conversely for a negative externality.

SECTION

I1.B:

A

SIMPLE

BILATERAL

EXTERNALITY

353

------------------------------------------------------------------take a quasi linear form with respect to a numeraire commodity (we comment below, in small type, on the simplifications afforded by this assumption). In this case, we can write the derived utility function v,(') as v,(p, w" h) = ",,(p, h) + W,.l Since prices of the L traded goods are assumed to be unaffected by any of the changes we are considering, we shall suppress the price vector p and simply write ",,(h). We assume that ",,(.) is twice differentiable with "'i 0,

with equality if h' > O.

(1I.B.I)

For an interior solution, we therefore have ",;(h') = O. In contrast, in any Pareto optimal allocation, the optimal level of h, h must maximize the joint surplus of the two consumers, and so must solve4 O

,

Max >,,0

"'I(h)

+ "'2(h).

This problcm gives us the necessary and sufficient first-order condition for hO of (1I.B.2) Hence, for an interior solution to the Pareto optimality problem, "';(hO) = -""ihO). When external effects are present, so that "'2(h) ¥- 0 at all h, the equilibrium level of h is not optimal unless hO = h' = O. Consider, for example, the case in which we have interior solutions, that is, where (h', hO) »0. If "'2(') < 0, so that h generates 3. Indeed. suppose that Uj(x/. h) = gi(X _II. h) + Xu. where x _ Ii is consumer i's consumption of Iraded goods other than good I. Then, the consumer's Walrasian demand function for these L - I traded goods. x _ ,,(.), is independent of her wealth, and .,(p, w" h) = g,(x _,,(p, h), h) p'., _ jj(P.Ir) + w,. Thus, denoting ,(p, h) = g,(x _,,(Po h), h) - P'x _,,(p, h), we have obtained the desired rorm. 4. Recall the reasoning of Sections 10.0 and 10.E, or note that at any Pareto optimal allocation in which Ir" is the level of hand w, is consumer i"s wealth level for i = 1,2, it must be impossible to change Ir and reallocate wealth so as to make one consumer better off without making the other Tsubjcct to ,(h) + + T ~ "" forsome worse off. Thus. (h", 0) must solve Max•. T . (h) + ",. Because the constraint holds with equality in any solution to this problem, substituting from the constraint ror T in the objective function shows that hO must maximize the joint surplus of the two consumers ,(h) + ,(h).

w. -

w,

354

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

-------------------------------------------------------------- -------------------------------------------------------------SECTION

11.B:

-4>;(h)

A

SIMPLE

BILATERAL

EXTERNALITY

-4>;(h)

/// .-

th

" = - cP;(hO)I/----;f-"----

Figure 11.B.1

Figure 11.B.2

The equilibrium (h') and optimal (hO) Ie,," of a negative externality.

a negative externality, then we have t/l; (hO) = - t/l;(hO) > 0; because t/l;(.) is decreasing and t/l',(h") = 0, this implies that h" > hO. In contrast, when t/l;(.) > 0, h represents a positive externality, and t/l',(hO) = -t/l;(hO) < 0 implies that h" < hO. Figure II.B.I depicts the solution for a case in which h constitutes a negative external effect, so that t/l;(h) < 0 at all h. In the figure, we graph t/l;(.) and - 4>;(-). The competitive equilibrium level of the externality h" occurs at the point where the graph of t/l; (.) crosses the horizontal axis. In contrast, the optimal externality level hO corresponds to the point of intersection between the graphs of the two functions. Note that optimality does not usually entail the complete elimination of a negative externality. Rather, the externality'S level is adjusted to the point where the marginal benefit to consumer I of an additional unit of the externality-generating activity, 4>;(hO), equals its marginal cost to consumer 2, -4>;(hO). In the current example, quasilinear utilities lead the optimal level of the externality to be independent of the consumers' wealth levels. In the absence of quasilinearity, however, wealth effects for the consumption of the externality make its optimal level depend on the consumers' wealths. See Exercise II.B.2 for an illustration. Note, however, that when the agents under consideration are firms, wealth effects are always absent.

Traditional Solutions to the Externality Problem Having identified the inefficiency of the competitive market outcome in the presence of an externality, we now consider three possible solutions to the problem. We first look at government-implemented quotas and taxes, and then analyze the possibility that an efficient outcome can be achieved in a much less intrusive manner by simply fostering bargaining between the consumers over the extent of the externality. Quotas and taxes

To fix ideas, suppose that h generates a negative external effect, so that hO < h". The most direct sort of government intervention to achieve efficiency is the direct control of the externality-generating activity itself. The government can simply mandate that h be no larger than hO, its optimal level. With this constraint, consumer I will indeed fix the level of the externality at hO. A second option is for the government to attempt to restore optimality by imposing a tax on the externality-generating activity. This solution is known as

The optimality-

h'

restoring Pigouvian

tax.

Pigouvian taxalion, after Pigou (1932). To this effect, suppose that consumer I is made to pay a tax of t, per unit of h. It is then not difficult to see that a tax of I,

= -t/l;(hO) > 0

will implement the optimal level of the externality. Indeed, consumer I will then choose the level of h that solves Max

,,,0

t/l,(h) - I,h,

(1I.B.3)

which has the necessary and sufficient first-order condition t/l;(II) ~ I., with equality if h > O. (11.8.4) Given I, = -t/l;(hO), h = h' satisfies condition (1I.B.4) [recall that hO is defined by the condition: t/l; (hO) ~ - t/l;W), with equality if hO > 0). Moreover, given t/l~(.) < 0, hO must be the unique solution to problem (11.8.3). Figure II.B.2 illustrates this solution for a case in which hO > O. Note that the optimality-restoring tax is exactly equal to the marginal externality at the optimal solution.' That is, it is exactly equal to the amount that consumer 2 would be willing to pay to reduce h slightly from its optimal level hO. When faced with this tax, consumer I is effectively led to carry out an individual cost-benefit computation that internalizes the externality that she imposes on consumer 2. The principles for the case of a positive externality are exactly the same, only now when we set I, = - t/l;(hO) < 0, I. takes the form of a per-unit subsidy (i.e., consumer I receives a payment for each unit of the externality she generates). Several additional points are worth noting about this Pigouvian solution. First, we can actually achieve optimality either by taxing the externality or by subsidizing its reduction. Consider, for example, the case of a negative externality. Suppose the government pays a subsidy of s, = - t/l;(hO) > 0 for every unit that consumer I's choice of /r is below /r", its level in the competitive equilibrium. If so, then consumer I will maximize t/l,(h) + s.(/r" - h) = t/l,(h) - I.h + t./r". But this is equivalent to a tax of I, per unit on h combined with a lump-sum payment of t,h". Hence, a subsidy for the reduction of the externality combined with a lump-sum transfer can exactly replicate the outcome of the tax. Second, a point implicit in the derivation above is that, in general, it is essential 5. In Ihe case where hO = 0, any lax greater Ihan - 4>;(0) also implemenlS the optimal outcome.

355

356

C HAP T E R

1 1:

EXT ERN A LIT I E SAN D

PUB L I C

GOODS

SECTION

-------------------------------------------------------------------~

to tax the externality-producing activity directly. For instance, suppose that, in the example of consumer 1 playing loud music, we tax purchases of music equipment instead of taxing the playing of loud music itself. In general, this will not restore optimality. Consumer I will be led to lower her consumption of music equipment (perhaps she will purchase only a CD player, rather than a CD player and a tape player) but may nevertheless play whatever equipment she does purchase too loudly. A common example of this sort arises when a firm pollutes in the process of producing output. A tax on its output leads the firm to reduce its output level but may not have any effect (or, more generally, may have too little effect) on its pollution emissions. Taxing output achieves optimality only in the special case in which emissions bear a fixed monotonic relationship to the level of output. In this special case, emissions can be measured by the level of output, and a tax on output is essentially equivalent to a tax on emissions. (See Exercise 11.B.5 for an illustration.) Third, note that the tax/subsidy and the quota approaches are equally effective in achieving an optimal outcome. However, the government must have a great deal of information about the benefits and costs of the externality for the two consumers to set the optimal levels of either the quota or the tax. In Section 1 I.E we will see that when the government does not possess this information the two approaches typically are not equivalent. Fostering bargaining over externalities: enforceable property rights Another approach to the externality problem aims at a less intrusive form of intervention, merely seeking to insure that conditions are met for the parties to themselves reach an optimal agreement on the level of the externality. Suppose that we establish enforceable property rights with regard to the externality-generating activity. Say, for example, that we assign the right to an "externality-free" environment to consumer 2. In this case, consumer 1 is unable to engage in the externality-producing activity without consumer 2's permission. For simplicity, imagine that the bargaining between the parties takes a form in which consumer 2 makes consumer 1 a take-it-or-Ieave-it offer, demanding a payment of T in return for permission to generate externality level h." Consumer 1 will agree to this demand if and only if she will be at least as well off as she would be by rejecting it, that is, if and only if O.

Comparing this expression with (1I.B.2), we see that II" equals the optimal level h'. The equilibrium price of the externality is = "';(h') = -"';W). Consumer I and 2's equilibrium utilities are then "'IW)- P: h' and t/l 2 (h') + P: h', respectively. The market therefore works as a particular bargaining procedure for splitting the gains from trade; for example, point g in Figure II.B.3 could represent the utilities in the competitive equilibrium. We see that if a competitive market exists for the externality, then optimality results. Thus, externalities can be seen as being inherently tied to the absence of certain competitive markets, a point originally noted by Meade (1952) and substantially extended byArrow (1969). Indeed, recall that our original definition of an externality, Definition 11.8.1, explicitly required that an action chosen by one agent must directly affect the well-being or production capabilities of another. Once a market exists for an externality, however, each consumer decides for herself how much of the externality to consume at the going prices. Unfortunately, the idea of a competitive market for the externality in the present example is rather unrealistic; in a market with only one seller and one buyer, price taking would be unlikely· However, most important externalities are produced and felt by many agents. Thus, we might hope that in these multilateral settings, price taking would be a more reasonable assumption and, as a result, that a competitive market for the externality would lead to an efficient outcome. In Section II.D, where we study multilateral externalities, we see that the correctness of this conclusion depends on whether the externality is priv8te" or "public" in nature. Before coming to this, however, we first study the nature of public goods.

P:

M

AiI:!:O

Il.e Public Goods

which has the first-order condition

",;(h.) s; P., with equality if hi > O.

(1I.B.6)

In deciding how many rights to sell, h2' consumer 2 will solve Max

"'2(h 2) + p.h 2,

In this section, we study commodities that, in contrast with those considered so far, have a feature of "publicness" to their consumption. These commodities are known as public goods. Definition 11.C.1: A public good is a commodity for Which use of a unit of the gOOd by one agent does not preclude its use by other agents.

Itl~O

which has the first-order condition

",;(h 2 ) s; - P.,

with equality if h2 > O.

(11.B.7)

7. Note, however, that this conclusion presumes that the owner of a firm has full control over all its functions. In more complicated (but realistic) settings in which this is not true, say because owners must hire managers whose actions cannot be perfectly controlled, the results of a merger and of an agreement over the level of the externality need not be the same. Chapters 14 and 23 provide an introduction to the topic of incentive design. See Holmstrom and Tirole (\989) for a discussion of these issues in the theory of the firm.

Put somewhat differently, public goods possess the feature that they are nondepletable: Consumption by one individual does not affect the supply available for other individuals. Knowledge provides a good illustration. The use of a piece of knowledge for one purpose does not preclude its use for others. In contrast, the commodities studied up to this point have been assumed to be of a private, or depletable, nature; . 8. For Ihat matter, the idea lhal the exlernalily righls are all sold at lhe same price lacks Justtfication here, because there is no natural unit of measurement for the externality.

I

J

360

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

~~~~~-------------------------------------------that is, for each additional unit consumed by individual i, there is one unit less available for individuals j '" i." A distinction can also be made according to whether exclusion of an individual from the benefits of a public good is possible. Every private good is automatically excludable, but public goods mayor may not be. The patent system, for example, is a mechanism for eXCluding individuals (although imperfectly) from the use of knowledge developed by others. On the other hand, it might be technologically impossible, or at the least very costly, to exclude some consumers from the benefits of national defense or of a project to improve air quality. For simplicity, our discussion here will focus primarily on the case in which exclusion is not possible. Note that a public "good" need not necessarily be desirable; that is, we may have public bads (e.g., foul air). In this case, we should read the phrase "does not preclude" in Definition 11.C.1 to mean "does not decrease."

Conditions for Pareto Optimality Consider a setting with I consumers and one public good, in addition to L traded goods of the usual, private, kind. We again adopt a partial equilibrium perspective by assuming that the quantity of the public good has no effect on the prices of the L traded goods and that each consumer's utility function is quasilinear with respect to the same numeraire, traded commodity. As in Section II. B, we can therefore define, for each consumer i, a derived utility function over the level of the public good. Letting x denote the quantity of the public good, we denote consumer i's utility from the public good by ¢;(x). We assume that this function is twice differentiable, with ¢;(x) < 0 at all x ~ O. Note that precisely because we are dealing with a public good, the argument x does not have an i subscript. The cost of supplying q units of the public good is c(q). We assume that c(.) is twice differentiable, with c"(q) > 0 at all q ~ O. To describe the case of a desirable public good whose production is costly, we take ¢i(') > 0 for all i and c'(·) > O. Except where otherwise noted, however, the analysis applies equally well to the case of a public bad whose reduction is costly, where ¢:;Cx*):s pO,

1-1

The necessary and sufficient first-order condition for the optimal quantity qO is then I

I

cI>;(qO) :s c'(qO),

with equality if qO > O.

(11.C.I)

i-I

Condition (11.C.I) is the classic optimality condition for a public good first derived by Samuelson (1954; 1955). (Here it is specialized to the partial equilibrium setting; 9. Inlermediale cases are also possible in which Ihe consumplion of Ihe good by one individual alTecls 10 some degree ils availabililY 10 olhers. A classic example is Ihe presence of congeslion effecls. For Ihis reason, goods for which Ihere is no deplelabilily whalsoever are somelimes referred 10 as pure pu blic goods.

PUB Lie

GOO D S

361

----------------------------------------------~~~~~

with equality if xi>

o.

(11.C.3)

The firm's supply q*, on the other hand, must solve Max ;'0 (p*q _ c(q» and therefore must satisfy the standard necessary and sufficient firs't-order condition

p* :S c'(q*),

with equality if q* > O.

(lI.CA)

At a competitive equilibrium, q* = x*. Thus, letting 0i = I if xi > 0 and O. = 0 if xi = 0, (11.C.3) and (11.C.4) tell us that LI o,[¢i(q*) - c'(q*)] = O. Recalli~g that 0 and c'(·) > 0, this implies that whenever I> I and q* > 0 (so that O. = I for some i) we have ' I

I

;=1

¢i(q*) > c'(q*).

(11.C.5)

362

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

-----------------------------------------------------------~-

Figure 11.C.l

q*

q"

Comparing (lI.C.5) with (II.C.I), we see that whenever qO > 0 and I> I, the level of the public good provided is too low; that is, q* < qo.IO The cause of this inefficiency can be understood in terms of our discussion of externalities in Section II.B. Here each consumer's purchase of the public good provides a direct benefit not only to the consumer herself but also to every other consumer. Hence, private provision creates a situation in which externalities are present. The failure of each consumer to consider the benefits for others of her public good provision is often referred to as the free-rider problem: Each consumer has an incentive to enjoy the benefits of the public good provided by others while providing it insufficiently herself. In fact, in the present model, the free-rider problem takes a very stark form. To see this most simply, suppose that we can order the consumers according to their marginal benefits, in the sense that 4>;(x) < ... < 4>i(x) at all x ~ O. Then condition (11.C.3) can hold with equality only for a single consumer and, moreover, this must be the consumer labeled I. Therefore, only the consumer who derives the largest (marginal) benefit from the public good will provide it; all others will set their purchases equal to zero in the equilibrium. The equilibrium level of the public good is then the level q. that satisfies 4>i(q·) = c'(q*). Figure II.C.1 depicts both this equilibrium and the Pareto optimal level. Note that the curve representing L 4>i,(q) for i = 1, ... ,1 (whereas in the case of a private good, the market demand curve is identified by adding the individual demand curves horizontally). The inefficiency of private provision is often remedied by governmental intervention in the provision of public goods. Just as with externalities, this can happen not only through quantity-based intervention (such as direct governmental provision) but also through "price-based" intervention in the form of taxes or subsidies. For example, suppose that there are two consumers with benefit functions 4> I (x I + x 2) and 4>2(X I + X2)' where x, is the amount of the public good purchased by consumer i, and that qO > O. By analogy with the analysis in Section II.B, a subsidy to each consumer i per unit purchased of 5, = 4>'-/(qO) [or, equivalently, a tax of -4>'-,(qO) per unit that consumer i's purchases of the public good fall below some specified 10. The conclusion follows immediately if q' = O. So suppose instead that q' > O. Then since L, ;(q' )-c'(q') > 0 and L, ;(')- c'(·) is decreasing, any solution to (ll.e I) must have a larger value than q'. Note that, in contrast, if we are dealing with a public bad, so that ;(') < 0 and c'(') < 0, then the inequalities reverse and qO < q •.

SECTION

PUBLIC

level] faces each consumer with the marginal external effect of her actions and so generates an optimal level of public good provision by consumer i. Formally, if (XI' x2 ) are the competitive equilibrium levels of the public good purchased by the two consumers given these subsidies, and if p is the equilbrium price, then consumer i's purchases of the public good, X" must solve Max.,;,o 4>,(x, + XI) + s,x l - px" and so Xi must satisfy the necessary and sufficient first-order condition

Private provision leads to an insufficientltve! of a desirable public

good.

II.C:

GOODS

363

,--------------------------------------------------~~~~~~::

4>;U,

+ x2) + 5i S p, with equality

of Xi> O.

Substituting for 5 i , and using both condition (1I.C.4) and the market-clearing condition that x, + x2 = ii, we conclude that ii is the total amount of the public good in the competitive equilibrium given these subsidies if and only if 4>M)

+ 4>'-i(qO) S c'(ii),

with equality for some i if ii > O. Recalling (II.C.I) we see that ii = qQ. (Exercise II.C.1 asks you to extend this argument to the case where 1 > 2; formally, we then have a multilateral externality of the sort studied in Section 11.0.) Note that both optimal direct public provision and this subsidy scheme require that the government know the benefits derived by consumers from the public good (i.e., their willingness to pay in terms of private goods). In Section I I.E, we study the case in which this is not so.

Lindahl Equilibria Although private provision of the sort studied above results in an inefficient level of the public good, there is in principle a market institution that can achieve optimality. Suppose that, for each consumer i, we have a market for the public good "as experienced by consumer i." That is, we think of each consumer's consumption of the public good as a distinct commodity with its own market. We denote the price of this personalized good by Pi' Note that P, may differ across consumers. Suppose also that, given the equilibrium price each consumer i sees herself as deciding on the total amoullt of the public good she will COllsume, x" so as to solve

p,o.,

4>,(x,) - p,.·x,.

Max Xj~O

Her equilibrium consumption level sufficient first-order condition

O.

(11.C.6)

The firm is now viewed as producing a bundle of 1 goods with a fixed-proportions technology (i.e., the level of production of each personalized good is necessarily the same). Thus, the firm solves Max q~O

(±

p:.q) - c(q).

iEt

The firm's equilibrium level of output q"

therefore satisfies the necessary and

364

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

SECTION

GOODS

~~~~~~~~~----------------------------------sufficient first-order condition I

L pro ~ c'(q**),

with equality if q** > O.

(11.C.?)

i='l

Together, (11.C.6), (ll.c.?), and the market-clearing condition that all i imply that

, L ¢,(q*') ~ c'(q*'),

j=

with equality if q" > O.

xr' = q"

for

(11.C.S)

1

Comparing (11.e.S) with (II.C.I), we see that the equilibrium level of the public good consumed by each consumer is exactly the efficient level: q" = q'. This type of equilibrium in personalized markets for the public good is known as a Lilldahl equilibrium, after Lindahl (1919). [See also Milleron (1972) for a further discussion.] To understand why we obtain efficiency, note that once we have defined personalized markets for the public good, each consumer, taking the pr~ce in her personalized market as given, fully determines her own level of consumption of the public good; externalities are eliminated. Yet, despite the attractive properties of Lindahl equilibria, their realism is questionable. Note, first, that the ability to exclude a consumer from use of the public good is essential if this equilibrium concept is to make sense; otherwise a consumer would have no reason to believe that in the absence of making any purchases of the public good she would get to consume none of it. II Moreover, even if exclusion is possible, these are markets with only a single agent on the demand side. As a result, price-taking behavior of the sort presumed is unlikely to occur. The idea that inefficiencies can in principle be corrected by introducing the right kind of markets, encountered here and in Section II.B, is a very general one. In particular cases, however, this "solution" mayor may not be a reali.s~ic ~ossibil~ty. We encounter this issue again in our study of multilateral externahtles m Section 11.0. As we shall see, these types of externalities often share many of the features of public goods.

11.0:

MULTILATERAL

piece of property, that much less is left to be dumped on others. 1l Depletable externalities therefore share the characteristics of our usual (private) sort of commodity. In contrast, air pollution is a nondepletable externality; the amount of air pollution experienced by one agent is not affected by the fact that others are also experiencing it. Nondepletable externalities therefore have the characteristics of public goods (or bads). In this section we argue that a decentralized market solution can be expected to work well for multilateral depletable externalities as long as well-defined and enforceable property rights can be created. In contrast, market-based solutions are unlikely to work in the nondepletable case, in parallel to our conclusions regarding public goods in Section 11.C. We shall assume throughout this section that the agents who generate externalities are distinct from those who experience them. This simplification is inessential but eases the exposition and facilitates comparison with the previous sections (Exercise 11.0.2 asks you to consider the general case). For ease of reference, we assume here that the generators of the externality are firms and that those experiencing the externality are consumers. We also focus on the special, but central, case in which the externality generated by the firms is homogeneous (i.e., consumers are indifferent to the source of the externality). {Exercise 11.0.4 asks you to consider the case in which the source matters.} We again adopt a partial equilibrium approach and assume that agents take as given the price vector p of L traded goods. There are J firms that generate the externality in the process of production. As discussed in Section 11.B, given price vector p, we can determine firm j's derived profit function over the level of the externality it generates,llj ;:: 0, which we denote by nJ(hJ). There are also I consumers, who have quasilinear utility functions with respect to a numeraire, traded commodity. Given price vector p, we denote by 4J,(h,) consumer i's derived utility function over the amount of the externality h, she experiences. We assume that nJ(') and 4J,{') are twice differentiable with nj(') < 0 and O.D

(11.0.1)

In contrast, any Pareto optimal allocation involves the levels (h~, ... , iiI' h;, ... , h~) 12. A distinction can also be made as to whether a depletable externality is allocable. For example, acid rain is depletable in the sense that the total amount of chemicals put into the air will fall somewhere. but it is not readily allocable because where it falls is determined by weather patterns. Throughout this section. we take depletable externalities to be allocable. The analytical implications of nonallocable depletable externalities parallel those of nondepletable ones. 13. The firms are indifferent about which consumer is affected by their externality. Therefore, the particular values of the individual ii/s are indeterminate. apart from the fact that L, hi = L;

h7.

365

366

CHAPTER

11:

AND

EXTERNALITIES

PUBLIC

SECTION

GOODS

that solve 14 I

J

I (Mii,) + I

Max I,.I ....• "JI~O

1=1

(li, ..... ~,)~o

J

S.t.

1!)(h)

):1

I i""l

(11.0.2)

/

h) =

Iii,. i""t

The constraint in (11.0.2) reflects the depletability of the externality: If ii, is increased by one unit, there is one unit less of the externality that needs to be experienced by others. Letting /1 be the multiplier on this constraint, the necessary and sufficient first-order conditions to problem (11.0.2) are

0.

= - L, ;(h') and each firm j using hj permtts and so yields an optimal allocation. The advantage of this scheme relative to a strict quota method arises when the government has limited information about the n (.) functIOns and cannot tell which particular firms can efficiently bear the burden of extern:lity reductIOn, although it has enough information, perhaps of a statistical sort, to allow the computation of the optimal aggregate level of the externality, h'.

p:

p:

15. Recallihat the single firm's cost function c(·) in Section I I.e could be viewed as the aggregate cost functIOn of J separate profit-maximizing firms. Were we to explicitly model these J firms in SectIon t I.e, the optimality conditions for public good production would take exactly the form in (II.D.6) with c;{hj) replacing -nj{hj). I~. The public nature of the externality leads to similar free-rider problems in any bargaining Soiutlon. (See ExerCIse 11.0.6 for an illustration.)

368

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

l1.E Private Information and Second-Best Solutions In practice, the degree to which an agent is affected by an externality or benefits from a public good will orten be known only to her. The presence of privately held (or asymmetrically held) information can confound both centralized (e.g., quotas and taxes) and decentralized (e.g., bargaining) allempts to achieve optimality. In this section, we provide an introduction to these issues, focusing for the sake of specificity on the case of a bilateral externality such as that studied in Section I LB. Following the convention adopted in Section I I.D, we shall assume here that the externalitygenerating agent is a firm and the affected agent is a consumer. (For a more general treatment of some of the topics covered in this section, see Chapter 23.) Suppose, then, that we can write the consumer's derived utility function from externality level h (see Section II.B for more on this construction) as (h, 'I), where 'I E R is a parameter, to be called the consumer's type, that affects the consumer's costs from the externality. Similarly, we let 7[(h, 8) denote the firm's derived profit given its type 8 E R. The actual values of 0 and 'I are privately ohsert'ell: Only the consumer knows her type 'I, and only the firm observes its type 0. The ex ante likelihoods (probability distributions) of various values of 8 and 'I are, however, publicly known. For convenience, we assume that 0 and 'I are independently distributed. As previously, we assume that 7[(h, 0) and (h, III are strictly concave in h for any given values of 0 and 'I.

Decell!ralized Bargaining Consider the decentralized approach to the externality problem first. In general, bargaining in the presence of bilateral asymmetric information will not lead to an efficient level of the externality. To see this, consider again the case in which the consumer has the right to an externality-free environment, and the simple bargaining process in which the consumer makes a take-it-or-Ieave-it offer to the firm. For simplicity, we assume that there are only two possible levels of the externality, 0 and > 0, and we focus on the case of a negative externality in which externality level [" relative to the level 0, is detrimental for the consumer and beneficial for the firm (the analysis is readily applied to the case of a positive externality). It is convenient to define b(O) = 7[(ii, 0) - 7[(0,0) > 0 as the measure of the firm's benefit from the externality-generating activity when its type is O. Similarly, we let c(11l = I/J(O, 'I) -I/J(ij, III > 0 give the consumer's cost from externality level ii. In this simplified selling, the only aspects of the consumer's and firm's types that matter are the values of band c that these types generate. Hence, we can focus directly on the various possible values of band c that the two agents might have. Denote by G(b) and F(c) the distribution functions of these two variables induced by the underlying probability distributions of 0 and 'I (note that, given the independence of 0 and 'I, b and C are independent). For simplicity, we assume that these distributions have associated density functions g(b) and ftc), with g(b) > 0 and ftc) > 0 for all b> 0 and (' > O. Since the consumer has the right to an externality-free environment, in the absence of any agreement with the firm she will always insist that the firm set h = 0 (recall that c > 0). However, in any arrangement that guarantees Pareto optimal outcomes for all values of hand c, the firm should be allowed to set h = ii whenever b > c.

I.

---

SECTION

11.E:

PRIVA.TE

INFORMATION

AND

SECOND-BEST

SOLUTIONS

369

---------------------------~~~~~~~~~~~~:: Now consider the amount that the consumer will demand from the firm when her cost is c in exchange for permission to engage in the externality-generating activity. Since the firm knows that the consumer will insist on h = 0 if there is no agreement, the firm will agree to pay the amount T if and only if b ;;" T. Hence, the consumer knows that if she demands a payment of T, the probability that the firm will accept her offer equals the probability that b ;;" T; that is, it is equal to I - G(T). Given her cost c > 0 (and assuming risk neutrality), the consumer optimally chooses the value of T she demands to solve Max

(I - G(T»(T - c).

(II.E.I)

T

The objective function of problem (I I.E.I) is the probability that the firm accepts the demand, multiplied by the net gain to the consumer when this happens (T - c). Under our assumptions, the objective function in (\I.E.I) is strictly positive for all T> c and equal to zero when T = c. Therefore, the solution, say P,', is such that P,' > c. But this implies that this bargaining process must result in a strictly positive probability of an inefficient outcome, since whenever the firm's benefit b satisfies c < b < P,', the firm will reject the consumer's offer, resulting in an externality level of zero, even though optimality requires that h = ii."·'·

Quotas alld Taxes Just as decentralized bargaining will involve inefficiencies in the presence of privately held information, so too will the use of quotas and taxes. Moreover, as originally noted by Weitzman (1974), the presence of asymmetrically held information causes these two policy instruments to no longer be perfect substitutes for one another, as they were in the model of Section 11.8.'· To begin, note that given 8 and 'I, the aggregate surplus resulting from externality level h (we return to a continuum of possible externality levels here) is I/J(h, ~) + 7[(h,O). Thus, the externality level that maximizes aggregate surplus depends in general on the realized values of (0, Ill. We denote this optimal value by the function hO(O, 'I). Figure II.E.I depicts this optimum value for two different pairs of parameters, (0', ~') and (0",

,n.

Suppose, first, that a quota level of h is fixed. The firm will then choose the level of the externality to solve Max

7[(h, (I)

';;'0

s.t.

h~

ft.

Denote its optimal choice by h«h, 8). The typical effect of the quota will he to make 17. No.e .he similarity between problem (II.E.I) and the monopolist's problem studied in Section 12.B. Here the consumer's inability to discriminate among firms of different types leads her optImal oITer to be one that yields an inefficient outcome.

18. We could, of course, also consider the outcomes from other, perhaps more elaborate, bargaining procedures. In Chapter 23, however, we shall study a result due to Myerson and Salterthwai.e (1983) that implies that no bargaining procedure can lead to an efficient outcome ror all values of band c in this setting. 19. T~e discussion that follows also has implications for the relative advantages of quantity-

versus pTlce-based control mechanisms in organizations.

370

C HAP T E R

1 1:

EXT ERN ALIT I E SAN D

PUB Lie

SEC T ION

GOO D S

1 1 . E:

P R I V ATE

I N FOR MAT ION

AND

SEC 0 N D - 8 EST

SOL UTI 0 N S

371

----------------------------------------------------------~ ~-----------------------------------------------------~~~ _ ilq,(h,

Loss in / Aggregate Surplus ,/ ilq,(h, ~') --il-hFigure II.E.l

The surplusmaximizing aggregale externality level for two different pairs or parameters, (0', ~') and

iJh

~)

ilq,(h,~)

----ah

,/ /""

"

iJq,(h, ilh

r,

c. In addition, if externality generation is allowed (i.e., if h = ii), the government will tax the firm an amount equal to c and will subsidize the consumer with a payment equal to 6. That is, if the firm wants to generate the externality (which it indicates by reporting a large value of b), it is asked to pay the externality's cost as declared by the consumer; and if the consumer allows the externality (by reporting a low value of c) she receives a payment equal to the externality's benefit as declared by the firm. In fact, under this scheme both the firm and the consumer will tell the truth, so that an optimal level of externality generation will, indeed, result for every possible (b, c) pair. To see this, consider the consumer's optimal announcement when her cost level is c. If the firm announces some 6 > c, then the consumer prefers to have the externality-generating activity allowed (she does 6 - c better than if it is prevented). Hence, her optimal announcement satisfies {: < 6; moreover, because any such announcement will give her the same payoff, she might as well announce the truth, that is, c = c < 6. On the other hand, if the firm announces 6 S c, the consumer prefers to have the externality level set to zero. Hence, she would like announce C~ 6; and again, because any of these announcements will give her the same payoff, she may as well announce the truth, that is, {: = c ~ 6. Thus, whatever the firm's announcement, truth-telling is an optimal strategy for the consumer. (Formally, telling the truth is a weakly dominant strategy for the consumer in the sense studied

o

A P PEN D I X

A:

NON CON V E X I TIE SAN D

THE

THE 0 R Y

0 F

EXT ERN A LIT I E S

375

---------------------------------------------------------------- ,---------------------------------------------------------------374

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

in Section 8.B. In fact, it is the consumer's only weakly dominant strategy; see Exercise 11.E.3.) A parallel analysis yields the same conclusion for the firm.

.,(h)

= ,h"" -"

Exercise 11.E.4: Show that in the tax-subsidy part of the mechanism above we could add, without affecting the mechanism's truth-telling or optimality properties, an additional payment to each agent that depends in an arbitrary way on the other agent's announcement. The scheme we have described here is an example of the Groves-Clarke mechanism [due to Groves (1973) and Clarke (1971); see also Section 23.C] and was originally proposed as a mechanism for deciding whether to carry out public good projects. Some examples for the public goods context are contained in the exercises at the end of lhe chapter. The Groves-Clarke mechanism has two very attractive features: it implements the optimal level of the externality for every (b, c) pair, and it induces truth-telling in a very strong (i.e., dominanl strategy) sense. But the mechanism has some unattractive features as well. In particular, it does not result in a balanced budget for the government: The government has a deficit equal to (b - c) whenever b > c. We could use the flexibility offered by Exercise II.E.4 to eliminale this deficit for all possible (b, c), but then we would necessarily create a budget surplus and therefore a Pareto inefficient outcome for some values of (b, c) (not all units of the numeraire will be left in the hands of the firm or the consumer). In fact, this problem is unavoidable with this type of mechanism: If we want to preserve the properties that, for every (b, c), truth-telling is a dominant strategy and the optimal level of exlernalily is implemented, then we generally cannot achieve budget balance for every (b, c). In Chapler 23 we discuss this issue in greater detail and also consider other mechanisms that can, under certain circumstances, get around the problem. (Sec also Exercise II.E.5 for an analysis in which budget balance is required only on average.)

APPENDIX A: NONCONVEXITIES AND THE THEORY OF EXTERNALITIES

Throughout this chapter, we have maintained the assumption that preferences and production sets are convex, leading the derived utility and profit functions we have considered to be concave. With these assumptions, all the decision problems we have studied have been well behaved; they had unique solutions (or, more generally, convex-valued solutions) that varied continously with the underlying parameters of the problems (e.g., the prices of the L traded commodities or the price of the externality if a market existed for it). Yet, this is not a completely innocent assumption. In this appendix, we present some simple examples designed to illustrate that externalities may themselves generate nonconvexities, and we comment on some of the implications of this fact. We consider here a bilatenll externality situation involving two firms. We suppose that firm I may engage in an externality-generating activity that affects firm 2's production. The level of externality generated by firm I is denoted by h, and firm j's profits conditional on the production of externality level hare ItJ(h) for j = 1,2. It is perfectly natural to assume that It l (·) is concave: The level h could, for example,

Flgur.".AA.' The derived profit function of firm 2 (the externality recipienl) in Example II.AA.I when a + fJ > I.

be equal to firm I's output. 24 As Examples II.AA.I and II.AA.2 illustrate, however, this may not be true of firm 2's profit function. Example 11.AA.1: Positive Externalities as a Source of Increasing Returns. Suppose that firm 2 produces an output whose price is I, using an input whose price, for simplicity, we also take to equal I. Firm 2's production function is q = h' z', where IX, PE [0, I). Thus, the externality is a positive one.2S Note that, for fixed h, the problem of firm 2 is concave and perfectly well behaved. Given a level of h, the maximized profits of firm 2 can be calculated to be 1t2(h) = yh"11 -", where y > 0 is a constant. In Figure II.AA.I, we represent 1t 2(h) for P> I - IX. We see there that firm 2's derived profit function is not concave in /1; in fact, it is convex. This reflects the fact that if we think of the externality h as an input to firm 2's production process, then firm 2's overall production function exhibits increasing returns to scale because IX + P> I. • Example 11.AA,2: Negative Externalities as a Source of Nonconvexities. In Example II.AA.I, the nonconvexity in firm 2's production set, and the resulting failure of concavity in its derived profit function, were caused by a positive externality. In this example the failure of concavity of firm 2's derived profit function is the result of a negative externality. Suppose, in particular, that Iti(h) $ 0 for all h, with strict inequality for some h, and that firm 2 has the option of shutting down when experiencing externality level h and receiving profits of zero. 26 In this case, the function 1t2(') can never be concave 24. Note also Ihal we may well have n,(Ir) < 0 for some levels h ~ 0 because n,(h) is firm I's maximized profit conditional on producing eXlernality level h (and so shulling down is not possible if h > 0). 25. More generally, we could think that there is an industry composed of many firms and that Ihe externalily is produced and felt by all firms in Ihe industry (e.g., h could be an index, correia led with outpul, of accumulated know-how in the industry). Externalities were first studied by Marshall (1920) in Ihis context. See also Chipman (1970) and Romer (1986). 26. In the more typical case of a multilaleral externality, the abilily of affected parties to shut down in this manner orten depends Oil whelher the externalilY is deplelable. In the case of a nondcplctable externality, such as air pollution. affected firms can always shut down and receive zero profits. In contrasl, in Ihe case of a depletable externality (such as garbage), where ,,)(h) reHects firmj's profits when it individually absorbs h units of the externality, the absorption of the externality may itself require the usc of some inpuls (e.g., land to absorb garbage). Indeed, were this not the case for a deplelable eXlernality, the exlernality could always be absorbed in a manner that creates no social costs by allocating all or the externality to a firm that shuts down.

----

376_ _ C_ HAP T E_ R _ 1': ERN A_ LIT E SAN 0 _PUB Lie _ __ _ _EXT __ __ _I _ ___ __ _ _GOODS __________________

",

A E F E A E N C ES

~

Flgur. 11 .AA.2

(\

concave Over he [0, 00].

over all Ii E [0, if_), a point originally noted by Starrett (1972). The reason can be seen in Figure II.AA.2: If nz I

-

h)2

'v--{('" "): " + "

earn zero profits for any level of the externality, then its derived profit function 1t,(h) cannot be

for h :s; I for h > 1.

" , __ (n.(O), niO») = (0,2)

"-,

If the recipient o[ a negative externality can shut down and

",(h)

377

----------------------------------------------------~~~~~~

{('" ",): "J:S "J(h) [or j = I, Hor some h
"''''''

=

",(0) + ",(O)} Flgur.ll.AA.3

(.,(1),

",(I»

",

= (1,0)

, ", The function 1t2(') is not concave, something that the two previous examples have shown us can easily happen with externalities. The profit levels for the two firms that are attainable for different levels of h are depicted in Figure II.AA.3 by the shaded set {(It" 1t2): Itj:S; ItJ(h) for j = 1,2 for some h > 0) (note that this definition allows for free disposal of profits). The social optimum has h = 0 (joint profits are then equal to 2), in which case firm 2 is able to operate in an environment free from the externality. This can be implemented by setting a tax rate on firm I of t > I per unit of the externality. But note that the outcome h = I (implemented by setting a tax rate on firm I of t = 0) is a local social optimum: As we decrease h, it is not until h < ! that we get an aggregate surplus level higher than that at h = I. Hence, this latter outcome satisfies both the first-order and second-order conditions for the maximization of aggregate surplus (e.g., at this point, the marginal benefits of the externality exactly equal its marginal costs), and it will be easy for a social planner to be misled into thinking that she is at a welfare maximum. _

REFERENCES Arrow, K. J. (1969). The organization of economic activity: Issues pertinent to the choice of market versus non-market allocation. In Collected Papers of K. J. Arrow, Vol. 2 Cambridge, Mass.: Harvard University Press, 1983.

Baumol, W. J. (1964). External economies and second-order optimality conditions. American Economic Review 54: 368-72. Baumol. W. J.. and W. E. Oa'es. (1988). The Theory of Environmental PolJcy, 2nd. ed. New York: Cambridge University Press. Chipman, J. S. (1970). External economies of scale and competitive equilibrium. Quarterly Journal of Economics 84: 347-85. aarke, E. H. (1971). Multipart pricing of public goods. PublJc Choice 11: 17-33. Coase, R. (t960). The problem of social cosl. Journal of Law and Economics 1: 1-44. Groves, T. (1973). Incentives in 'earns. Econometrica 41: 617-31. Holmstrom, B., and J. Tirole. (1989). The theory of the firm. In Handbook of Induslrial Organization, edtled by R. Schmalensee and R. D. Willig. Amsterdam: North-Holland. !.afron', J.-J. (1988). Fundamenlals of Public Economics. Cambridge. Mass.: MIT Press. Lindahl. E. (1919). Die Gerechtigkeit der BeSleuring. Lund: Gleerup. [English translation: Just 'axation-a positive solution. In Classics in the Theory of Public Finance, edited by R. A. Musgrave and A. T. Peacock. London: Macmillon, 1958.] Marshall, A. (1920). Principles of Economics. London: Macmillan.

The set of possible profit pairs (7[10 ~,) in Example II.AAJ exhibits mUltiple local maxima of aggregate surplus 1t,(h) + .,(h).

378

C HAP T E R

1 1:

EXT ERN .. LIT I E S

.. N D

PU8 L IC

GOO D S

-------------------------------------------------------------------~ Meade, J. (1952). External economies and diseconomies in a competitive situation. Economic )our"aI62: 54-67. Milleron. J..c. (1972). Theory of value with public goods: A survey article. Journal of Economic Theory 5: 419-77. Myerson, R.o and M. Satterthwaite. (1983). Efficient mechanisms for bilateral trading. Journal of Economic T/uory 29: 265-81. Pigou, A. C. (1932). nfe Economics of Welfare. London: Macmillan. Romer. P. (1986). Increasing returns and long-run growth. Journal of Political Economy 94: 1002-36. Samuelson, P. A. (1954). The pure theory of public expenditure. Review of Economics and Statistics 36: 387-89. Samuelson, P. A. (1955). Diagrammatic exposition of a pure theory of public expenditure. Review of £('onomics and Statistics 37: 350-56. Starrett. D. A. (1972). Fundamental non-convexities in the theory of externalities. Journal of Economic Theory 4: 180-99. Viner. J. (1931). Cost curves and supply curves. ZeilschriJt fur NaJionaliikonomie 111: 23-46. Weitzman, M. (1974). Prices vs. quantities. Review of Economic Studies 41: 477-91.

EXERCISES

II.B.1 8 (M. Weitzman) On Farmer Jones' farm, only honey is produced. There are two ways to make honey: with and without bees. A bucket full of artificial honey, absolutely indistin· guishable from the real thing, is made out of I gallon of maple syrup with one unit of labor. If the same honey is made the old·fashioned way (with bees), k total units of labor are required (including bee·keeping) and b bees are required per bucket. Either way, Farmer Jones has the capacity to produce up to H buckets of honey on his farm. The neighboring farm, belonging to Smith, produces apples. If bees are present, less labor is needed because bees pollinate the blossoms instead of workers doing it. For this reason, c bees replace one worker in the task of pollinating. Up to A bushels of apples can be grown on Smith's farm. Suppose that the market wage rate is w, bees cost P. per bee, and maple syrup costs Pm per gallon. If each farmer produces her maximal output at the cheapest cost to her (assume the output prices they face make maximal production efficient), is the resulting outcome efficient? How does the answer depend on k, b, c, w, p" and Pm? Give an intuitive explanation of your result. Up to how much would Smith be willing to bribe Jones to produce honey with bees? What would happen to efficiency if both farms belonged to the same owner? How could the government achieve efficient production through taxes? II.B.2C

Consider the two-consumer externality problem studied in Section II.B, but now assume that Consumer 2's derived utility function over the externality level h and her wealth available for commodity purchases w, takes the form ¢,(h, w,). Assume that ¢,(h, w,) is a twice-differentiable, strictly quasiconcave function with o¢,(h, w,)/ow, > 0 and, for simplicity, that we have a positive externality so that o¢,(h, w,)/oh > O. (a) Set up the Pareto optimality problem as one of choosing h and a wealth transfer T to maximize consumer I's welfare subject to giving consumer 2 a utility level of at least ii,. Derive the (necessary and sufficient) first-order condition characterizing the optimal levels of hand T, say h' and T'. (b) Imagine that consumer I could purchase h on an externality market. Let P. be the price per unit, and let h(p., w,) be consumer 2's demand function for h. Express the wealth effect iJh(p., w,)/ow, in terms of first-order and second-order partial derivatives of consumer 2's utility function.

EXERCISES

379

-------------------------------------------------------------(c) Derive the comparative statics change in the Pareto optimal level of the externality h' (for a given ii,) with respect to a differential increase dw, > 0 in consumer 2's wealth. Show that if consumer 2's demand for the externality, derived in (b), is normal at price fi, = [iJ¢,(h', w, - r)/oh]/ [o¢,(h', r)/ow,] and wealth level w, = 1" [i.e., if oh(fi., w,)/ow, > 0], then a marginal increase in consumer 2's wealth w, causes the Pareto optimal level of the externality II' to increase. (Similarly, in the case of a negative externality, if consumer 2's demand for reductions in the externality is a normal good, then when consumer 2 becomes wealthier, the Pareto optimal level of the externality declines.)

w, -

w, -

II.B.3" Consider the optimal Pigouvian tax identified in Section II.B for the two-consumer externality problem studied there. What happens if, given this tax, the two consumers are able to bargain with each other? Will the efficient level of the externality still result? What about with the optimal quota? II.B.4 8 Consider again the two-consumer externality problem studied in Section I LB. Suppose that consumer 2 can take some action, sayee R, that affects the degree to which she is affected by the externality, so that we now write her derived utility function as ¢,(h, e) + K',. To fix ideas,let h be a negative externality, and suppose that o'¢,(h, e)/ohoe > 0, so that increases in e reduce the negative effect of the externality on the margin. Suppose that both hand e can in principle be taxed or subsidized. Should e be taxed or subsidized in the optimal tax scheme? Why or why not? II.B.5" Suppose that at fixed input prices of wa firm produces output with the differentiable and strictly convex cost function c(q, h), where q ;;, 0 is its output level (whose price is P > 0) and h is the level of a negative externality generated by the firm. The externality affects a single consumer, whose derived utility function takes the form ¢(h) + w. The actions of the firm and consumer do not affect any market prices. (a) Derive the first-order condition for the firm's choice of q and h. (b) Derive the first-order conditions characterizing the Pareto optimal levels of q and h. (c) Suppose that the government taxes the firm's output level. Show that this cannot restore efficiency. Show that a direct tax on the externality can restore efficiency. (d) Show, however, that in the limiting case where h is necessarily produced in fixed proportions with q, so that h(q) = ~q for some ~ > 0, a tax on the firm's output can restore efficiency. What is the efficiency-restoring tax? 11.e.1 A Consider the model discussed in Section II.C, in which J consumers privately purchase a public good. Identify per-unit subsidies 5" .. . ,5" such that when each consumer i faces subsidy rate 5" the total level of the public good provided is optimal. 11.e.2A Consider the model discussed in Section II.C, in which J consumers privately purchase a public good. Show that a per-unit subsidy on the firm's output (paid to the firm) can also restore efficiency. II.C.3c Reconsider the Ramsey tax problem from Exercise 10.E.3, but now suppose that the government can also provide a public good Xo that can be produced from good I at cost c(xo). However, the government must still balance its budget (including any expenditures on the public good). Consumer i's utility function now takes the form Xli + LJ., ¢,,(Xlh xo). Derive and interpret the conditions characterizing the optimal commodity taxes and the optimal level of the public good. How do the two problems of Ramsey taxation and provision of the public good interact? 11.0.1 8 (M. Weitzman) First-year graduate students are a hard-working group. Consider a typical class of J students. Suppose that each student i puts in hi hours of work on her classes. This effort involves a dis utility of hfl2. Her benefits depend on how she performs relative to her peers and take the form ¢(h;/h) for all i, where h = (1/l)LI hi is the average number of hours put in by all students in the class and ¢(.) is a differentiable concave function, with

380

CHAPTER

11:

EXTERNALITIES

AND

PUBLIC

GOODS

=-~~~~~~~----------------------q,'(') > 0 and lim, _ 0 q,'(h) = 00. Characterize the symmetric (Nash) equilibrium. Compare it

with the Pareto optimal symmetric outcome. Interpret. II.D.2" Consider a setting with 1 consumers. Each consumer i chooses an action h, E R •. Consumer i's derived utility function over her choice of h and the choices of other consumers takes the form q"(h,, L, h,) + w" where I identical firms that act as price takers. The price of their output is p, and the prices of their inputs are unaffected by their acti~ns. Suppo~e that partial equilibrium analysis is valid and that the aggregate demand for their product IS given by the function x(p). The industry is characterized by "learning by doing." in that each firm's total cost of producing a given level of output is declining in the level of total Industry output; that is, each firm j has a twice-differentiable cost function of the form c(~J' Q.) f~r Q = Lj
EXERCISES

(b) Show that as 1 increases, the equilibrium level of 0 declines. Also show that 0 = O.

Iim,_~

t1.D.7 C Individuals can build their houses in one of two neighborhoods, A or B. It costs to build a house in neighborhood A and c. < C A to build in neighborhood B. Individuals care about the prestige of the people living in their neighborhood. Individuals have varying levels of prestige, denoted by the parameter O. Prestige varies between 0 and I and is uniformly distributed across the population. The prestige of neighborhood k (k = A, B) is a function of the average value of 0 in that neighborhood, denoted by 0,. If individual i has prestige parameter 0 and builds her house in neighborhood k, her derived utility net of building costs is (I + O)(! + 0,) - c,. Thus, individuals with more prestige value a prestigous neighborhood more. Assume that C A and c. are less than I and that fA - f. E(!, I).

CA

(8) Show that in any building-choice equilibrium (technically, the Nash equilibrium of the simultaneous-move game in which individuals simultaneously choose where to build their house) both neighborhoods must be oceupied.

(b) Show that in any equilibrium in which the prestige levels of the two neighborhoods differ, every resident of neighborhood A must have at least as high a prestige level as every resident of neighborhood B; that is, there is a cutoff level of 0, say 0, such that all types (/ (h, ~) for the consumer has a4>(h, ~)Iah E (0, - 00). Show, however, that a variable tax per unit in which the total tax collected from the firm is 4>(h,~) when the level of the externality is h will maximize aggregate surplus for all values of 0 for any derived utility function 4>(h, ~).

Market Power

HAP

T

E

R

12

12.A Introduction In the competitive model, all consumers and producers are assumed to act as price takers, in effect behaving as if the demand or supply functions that they face are infinitely elastic at going market prices. However, this assumption may not be a good one when there are only a few agents on one side of a market, for these agents will often possess market power-the ability to alter profitably prices away from competitive levels. The simplest example of market power arises when there is only a single seller, a monopolist, of some good. If this good's market demand is a continuous decreasing function of price, then the monopolist, recognizing that a small increase in its price above the competitive level leads to only a small reduction in its sales, will find it worthwhile to raise its price above the competitive level. Similar effects can occur when there is more than one agent, but still not many, on one side of a market. Most often, these agents with market power are firms, whose fewness arises from nonconvexities in production technologies (recall the discussion of entry in Section 10. F). In this chapter, we study the functioning of markets in which market power is present. We begin, in Section 12.B, by considering the case in which there is a monopolist seller of some good. We review the theory of monopoly pricing and identify the welfare loss that it creates. The remaining sections focus on situations of oligopoly, in which a number of firms compete in a market. In Sections 12.C and 12.0, we discuss several models of oligopolistic pricing. Each incorporates different assumptions about the underlying structure of the market and behavior of firms. The discussion highlights the implicatons of these differing assumptions for market outcomes. In Section I2.C, we focus on static models of oligopolistic pricing, where competition is viewed as a one-shot, simultaneous event. In contrast, in Section 12.0, we study how repeated interaction among firms may affect pricing in oligopolistic markets. This discussion constitutes an application of the theory of repeated games, a subject that we discuss in greater generality in Appendix A. The analysis in Sections 12.B to 12.0 treats the number of firms in the market as 383

384

CHAPTER

12:

MARKET

exogeneously given. In reality, however, the number of active firms in a market is likely to be affected by factors such as the size of market demand and the nature of competition within the market. Sections 12.E and 12.F consider issues that arise when the number of active firms in a market is determined endogenously. Section 12.E specifies a simple model of entry into an oligopolistic market and studies the determinants of the number of active firms. It offers an analysis that parallels that considered in Section 10.F for competitive markets. Section 12.F returns to a theme raised in Chapter 10. We illustrate how the competitive (price-taking) model can be viewed as a limiting case of oligopoly in which the size of the market, and hence the number of firms that can profitably operate in it, grows large. In the model we study, an active firm's market power diminishes as the market size expands; in the limit, the equilibrium market price comes to approximate the competitive level. In Section 12.G, we briefly consider how firms in oligopolistic markets can make strategic precommitments to affect the conditions of future competition in a manner favorable to themselves. This issue nicely illustrates the importance of credible commitments in strategic settings, an issue we studied extensively in Chapter 9. In Appendix B, we consider in greater detail a particularly striking example of strategic precommitment to affect future market conditions, the case of entry deterrence through capacity choice. If you have not done so already, you should review the game theory chapters in Part II before studying Sections 12.C to 12.G (in particular, review all of Chapter 7, Sections 8.A to 8.0, and Sections 9.A and 9.B). An excellent source for further study of the topics covered in this chapter is Tirole (1988).1

12.B Monopoly Pricing In this section, we study the pricing behavior of a profit-maximizing monopolist, a firm that is the only producer of a good. The demand for this good at price p is given by the function x(p), which we take to be continuous and strictly decreasing at all p such that x(p) > 0. 1 For convenience, we also assume that there exists a price ji < OC! such that x(p) = 0 for all p ~ p.3 Throughout, we suppose that the monopolist knows the demand function for its product and can produce output level q at a cost of c(q). The monopolist's decision problem consists of choosing its price p so as to maximize its profits (in terms of the numeraire), or formally, of solving Max

- --

_--------------------------------------------~S~E~C~T~I:O~N~~1:2~.8~:-=M~O::N~O~P~O::L~Y~P~R~I:C~I~N~G~~3~8~5

POWER

px(p) - c(x(p».

(12.B.I)

I. See also Ihe survey by Shapiro (1989) for the topics covered in Sections I2.C, 12.0,

and 12.G. 2. Throughout this chapter we take a partial equilibrium approach; see Chapter 10 for a discussion of this approach. 3. This assumption helps to insure that an optimal solution to the monopolist's problem exists. (Sec Exercise 12.B.2 for an example in which the failure of this condition leads to nonexistence.)

An equivalent formulation in terms of quantity choices can be derived by thinking instead of the monopolist as deciding on the level of output that it desires to sell, q ~ 0, lettmg the price at which it can sell this output be given by the inverse demand junction pC-) = x- I (. ).4 Using this inverse demand function, the monopolist's problem can then be stated as Max

p(q)q - c(q).

(12.B.2)

.>:0

We shall focus our analysis on this quantity formulation of the monopolist's problem [identical conclusions could equally well be developed from problem (12.B.I)]. We assume throughout that p(.) and c(') are continuous and twice differe~tiable at all q ~ 0, that p(O) > c'(O), and that there exists a unique output level q E (0, (0) such that p(qO) = c'(qO). Thus, qO is the unique socially optimal (competitive) output level in this market (see Chapter 10). Under these assumptions, a solution to problem (12.8.2) can be shown to exist.' Given the differentiability assumed, the monopolist's optimal quantity, which we denote by qm, must satisfy the first-order condition 6

p'(qm) qm

+ p(qm)

~

c'(qm), with equality if qm > O.

(12.B.3)

The left-hand side of (12.B.3) is the marginal revenue from a differential increase in q. at ~m, which is equal to the derivative of revenue d[p(q)q]jdq, while the right-hand Side tS the. corresponding marginal cost at q"'. Since p(O) > c'(O), condition (12.B.3) can be satisfied only at qm > O. Hence, under our assumptions, marginal revenue must equal marginal cost at the monopolist's optimal output level: (l2.B.4) For the typical case in which p'(q) < 0 at all q ~ 0, condition (12.B.4) implies that we must have p(q") > c'(qm), and so the price under monopoly exceeds marginal cos~. Correspond~ngly, the monopolist's optimal output qm must be below the socially opumal (compettt.lve) output level qQ. The cause of this quantity distortion is the monopolist's recognition that a reduction in the quantity it sells allows it to increase the price charged on its remaining sales, an increase whose effect on profits is captured by the term p'(q")qm in condition (12.B.4). The welfare loss from this quantity distortion, known as the deadweight loss oj mOllopoly, can be measured using the change in Marshallian aggregate surplus

4. More precisely, 10 lake account of the fact that x(p) = 0 for more than one value of p, we take p(q) = Min {p: x(p) = q} at all q 2: O. Thus, P(O) = p, the lowest price at which x(p) = O. 5. In par,ticular, it follows from condition (12.B.3) and from the facts that p'(q) sO for all q 2: 0 and f(q) < c (q) for all q ~ qO, that the monopolist's optimal choice must lie in the compact set [0: q ]. Becau~ the objective function In problem (12.B.2) is continuous, a solution must therefore eXist (see Section M.F of the Mathematical Appendix). 6. Satisfaction .of first-order condition (12.B.3) is sufficient for q" to be an optimal choice if the obJ:ctl.ve funct~on of problem (12.B.2) is concave on [0, q0]. Note, however, that concavity of thiS objective funchon depends not only on the technology of the firm, as in Ihe competitive model, but also on the shape of the Inverse demand function. In particular, even with a convex cost Cunction the ~onopolist's profit function can violate this concavity condition if demand is a convex functio~ of prtce.

386

CHAPTER

12:

MARKET

POWER

~~~~~~~~~~~----------------------------------[p(q)

+ p'(q)q] c'(q) Flgur. 12.B.1 (teft)

\----"""7f-'-- p(q) : p for all q

The monopoly solution and welfare loss when p'(') < O. Figure 12.B.2 (right)

q": q'

(sec Section 10.E),

r

The monopoly solution when p'(q) : 0 for all q

--

SECT tON

[pes) - c'(s)]

ds > 0,

where q" is the socially optimal (competitive) output level. Figure 12.B.1 illustrates the monopoly outcome in this case. The ~onopolist's quantity qm is determined by the intersection of the graphs of marginal revenue p'(q)q + p(q) and marginal cost c'(q). The monopoly. price p(qm) can then be determined from the inverse demand curve. The deadweight welfare loss IS equal to the area of the shaded region. Note from condition (12.B.4) that the monopoly quantity distortion is absent in the special case in which p'(q) = 0 for all q. In this case, wh~re p(q) eq~als so~e constant p at all q > 0, the monopolist sells the same quantity as a price-taking competitive firm because it perceives that any increase in i~s pri~ ab~ve the competitive price pcauses it to lose all its sales. 7 Figure 12.B.2 depicts thiS special case. Example 12.B.1: Monopoly Pricing with a Linear Inverse Dem~nd . Function and Constant Returns to Scale. Suppose that the inverse demand function In a monopolized market is p(q) = a - bq and that the monopolist's cost function is c(q) = cq, where (/ > c 2': 0 [so that p(O) > c'(O)) and b > O. In this case, the objective function of the monopolist's problem (12.B.2) is concave, and so condition (12.B.4) is b?th necessary and sufficient for a solution to the monopolist's pro~lem. From ,condition (12.B.4), we can calculate the monopolist's optimal quantity. and price t? be qm : (0 _ c)(2b and pm = (a + c)(2. In contrast, the socially optimal (competitive) output level and price are q' = (a - c)(b and po = p(qO) = c, • Although we do not discuss these issues here, we point out that the behavi~ral distortions arising under monopoly are not limited to pricing decisions. (ExerCises 12.B.9 and 12.B.10 ask you to investigate two examples.) The monopoly quantity distortion is fundamentally linked to the fact that if the monopolist wants to increase the quantity it sells, it must lower its price on all its existing sales. In fact, 7. This inverse demand function arises, for example. when each consumer i has quasilinear

preferences of the form u,(q,) + m, with u,(q,) = pq" where q, is consu,:,er j's ~onsumpti~n of the good under study and m, is his consumption of the numeraire commodity. [Stnctly speakong, wIth these preferences we now have a multi valued demand correspondence rather than a demand function, but p(') is nevertheless a function as before.]

L

STATIC

MODELS

OF

if the monopolist were able to perfeclly discriminate among its customers in the sense that it could make a distinct offer to each consumer, knowing the consumer's preferences for its product, then the monopoly quantity distortion would disappear. To see this formally, let each consumer i have a quasilinear utility function of the form u;(q;) + m; over the amount q, of the monopolist's good that he consumes and the amount m; that he Consumes of the numeraire good, and normalize .,(0) = O. Suppose that the monopolist makes a take-it-or-leave-it offer to each consumer i of the form (q;, 7;), where q; is the quantity offered to consumer i and 7; is the total payment that the consumer must make in return. Given offer (q;, 7;), consumer i will accept the monopolist's offer if and only ifu;(q;) - 7; 2': O. As a result, the monopolist can extract a payment of exactly u;(q;) from consumer i in return for q, units of its product, leaving the consumer with a surplus of exactly zero from consumption of the good. Given this fact, the monopolist will choose the quantities it sells to the I consumers (q" ... , q,) to solve

,

q

Jq~

12,C:

Max

L

14'10 ...••,12:0

'''''I

u,(q;) - c(Lq,)·

(12.8.5)

Note, however, that any solution to problem (12.8.5) maximizes the aggregate surplus in the market, and so the monopolist will sell each consumer exactly the socially optimal (competitive) quantity. Of course, the distributional properties of this outcome would not be terribly attractive in the absence of wealth redistribution: The monopolist would get all the aggregate surplus generated by its product, and each consumer i would receive a surplus of zero (i.e., each consumer i's welfare would be exactly equal to the level he would achieve if he consumed none afthe monopolist's product). But in principle, these distributional problems can be corrected through lump-sum redistribution of the numeraire. Thus, the welfare loss from monopoly pricing can be seen as arising from constraints that prevent the monopolist from charging fully discriminatory prices. In practice, however, these constraints can be significant. They may include the costs of assessing separate charges for different consumers, the monopolist's lack of information about consumer preferences, and the possibility of consumer resale. Exercise 12.B.5 explores some of these factors. It provides conditions under which the best the monopolist can do is to name a single per-unit price, as we assumed at the beginning of Ihis section.

12,C Static Models of Oligopoly We now turn to cases in which more than one, but still not many, firms compete in a market. These are known as situations of oligopoly. Competition among firms in an oligopolistic market is inherently a setting of strategic interaction. For this reason, the appropriate tool for its analysis is game theory. Because this discussion constitutes our first application of the theory of games, we focus on relatively simple static models of oligopoly, in which there is only one period of competitive interaction and firms take their actions simultaneously. We begin by studying a model of simultaneous price choices by firms with constant returns to scale technologies, known as the Bertrand model. This model displays a striking feature: With just two firms in a market, we obtain a perfectly competitive outcome. Motivated by this finding, we then consider three alterations of this model that weaken its strong and ollen implausible conclusion: a change in the firm's strategy from choosing its price to choosing its quantity of output

OLtGOPOLY

387

388

CHAPTER

12:

MARKET

POWER

SECTION

12.C:

STATIC

MODELS

OF

OLIGOPOLY

389

----------------------------------------------------------------------- ---------------------------------------------------------------------(the Cournot model); the introduction of capacity constraints (or, more generally, decreasing returns to scale); and the presence of product differentiation.· One lesson of this analysis is that a critical part of game-theoretic modeling goes into choosing the strategies and payoff functions of the players. In the context of oligopolistic markets, this choice requires that considerable thought be given both to the demand and technological features of the market and to the underlying processes of competition. Unless otherwise noted, we restrict our attention to pure strategy equilibria of the models we study.

Tile Bertrand Model of Price Competition We begin by considering the model of oligcipolistic competition proposed by Bertrand (1883). There are two profit-maximizing firms, firms I and 2 (a duopoly), in a market whose demand function is given by x(p). As in Section IO.B, we assume that x(·) is continuous and strictly decreasing at all P such that x(p) > 0 and that there exists a ;; < 00 such that x(p) = 0 for all P 0, per unit produced. We assume that x(c) E (0, 00), which implies that the socially optimal (competitive) output level in this market is strictly positive and finite (see Chapter 10). Competition takes place as follows: The two firms simultaneously name their prices P, and P2' Sales for firm j are then given by

Xj(Pj, P.) =

l

X(Pi)

ifpj P•.

The firms produce to order and so they incur production costs only for an output level equal to their actual sales. Given prices Pi and P., firm j's profits are therefore equal to (Pj - c)xj(Pj, P.)· The Bertrand model constitutes a well-defined simultaneous-move game to which we can apply the concepts developed in Chapter 8. In fact, the Nash equilibrium outcome of this model, presented in Proposition l2.C.I, is relatively simple to discern. Proposition 12.C.1: There is a unique Nash equilibrium (pr, pn in the Bertrand

duopoly model. In this equilibrium, both firms set their prices equal to cost:

pr = p~ = c.

incurs losses. But by raising its price above c, the worst it can do is earn zero. Thus, these price choices could not constitute a Nash equilibrium. Now suppose that one firm's price is equal to c and that the other's price is strictly greater than c: Pj = c, P. > c. In this case, firm j is selling to the entire market but making zero profits. By raising its price a little, say to Pj = C + (p, - c)/2, firm j would still make all the sales in the market, but at a strictly positive profit. Thus, these price choices also could not constitute an equilibrium. Finally, suppose that both price choices are strictly greater than c: Pj > c, p, > c. Without loss of generality, assume that Pj :s; P•. In this case, firm k can be earning at most !(Pj - c)x(Pj)' But by setting its price equal to Pj - 0 for 0> 0, that is, by undercutting firm j's price, firm k will get the entire market and earn (Pj - " - c)x(Pj - 0). Since (Pj - C - c)x(Pj - r.) > HpJ - c)x(Pj) for small-enough c > 0, firm k can strictly increase its profits by doing so. Thus, these price choices are also not an equilibrium. The three types of price configurations that we have just ruled out constitute all the possible price configurations other than PI = P2 = c, and so we are done. _ The striking implication of Proposition 12.C.1 is that with only two firms we get the perfectly competitive outcome. In effect, competition between the two firms makes each firm face an infinitely elastic demand curve at the price charged by its rival. The basic idea of Proposition 12.C.1 can also be readily extended to any number of firms greater than two. [In this case, if firm j names the lowest price in the market, say p, along with J - I other firms, it earns (1/J)x(p).] You are asked to show this in Exercise 12.C.1. Exercise l2.C.I: Show that in any Nash equilibrium of the Bertrand model with J > 2 firms, all sales take place at a price equal to cost. Thus, the Bertrand model predicts that the distortions arising from the exercise of market power are limited to the special case of monopoly. Notable as this result is, it also seems an unrealistic conclusion in many (although not all) settings. In the remainder of this section, we examine three changes in the Bertrand model that considerably weaken this strong conclusion: First, we make quantit), the firms' strategic variable. Second, we introduce capacity constraillCS (or, more generally, decreasing returns to scale). Third, we allow for product differentiation.

Qualltity Competitioll (The Cournot Model)

Proof: To begin, note that both firms setting their prices equal to C is indeed a Nash equilibrium. At these prices, both firms earn zero profits. Neither firm can gain by raising its price because it will then make no sales (thereby still earning zero); and by lowering its price below c a firm increases its sales but incurs losses. What remains is to show that there can be no other Nash equilibrium." Suppose, first, that the lower of the two prices named is less than c. In this case, the firm naming this price 8. Section 12.0 studies a rourth variation that involves repeated interaction among firms. 9. Recall that we restrict attention to pure strategy equilibria here. See Exercise l2.C.2 for a consideration of mixed strategy equilibria. There you are asked to show that under the conditions assumed here, Proposition l2.C.1 continues to hold: p! = = c is the unique Nash equilibrium, pure or mixed, of the Bertrand model.

P;

Suppose now that competition between the two firms takes a somewhat different form: The two firms simultaneously decide how much to produce, ql and q2' Given these quantity choices, price adjusts to the level that clears the market, p(ql + q2), where p(.) = x- I (-) is the inverse demand function. This model is known as the Coumot model, after Cournot (1838). You can imagine farmers deciding how much of a perishable crop to pick each morning and send to a market. Once they have done so, the price at the market ends up being the level at which all the crops that have been sent are sold. IO In this discussion, we assume that p(.) is differentiable 10. One scenario that will lead to this outcome arises when buyers bid ror the crops sent that day (very much like sellers in the Bertrand model; see Exercise 12.C.S).

SEC T ION

390

C HAP T E R

1 2:

MAR K E T

POW E R

~~~~~--------------------------------------------------------with p'(q) < 0 at al1 q ~ O. As before, both firms produce output at a cost of c > 0 per unit. We also assume that p(O) > c and that there exists a unique output level qO E (0, OC!) such that p(qO) = c [in terms of the demand function x(·), qO = x(c)]. Quantity qO is therefore the social1y optimal (competitive) output level in this market. To find a (pure strategy) Nash equilibrium of this model, consider firm j's maximization problem given an output level ii, of the other firm, k # j: Max

(12.C.1)

p(qj + ii,)qj - cqj.

4};

o.

(12.C.2)

For each ii.. we let b/ii,) denote firmj's set of optimal quantity choices; b j(') is firm j's best-response correspondence (or function if it is single-valued). A pair of quantity choices (q!, is a Nash equilibrium if and only if qj E bj(q:) for k # j and j = 1,2. Hence, if (q!, is a Nash equilibrium, these quantities

qn

qn

must satisfy"

p'(q!

+ q!lq! + p(q! + q~) !5: c,

with equality if q! > 0

(I2.C.3)

p'(q!

+ q!lq! + p(q! + q!) !5: c,

with equality if q! > O.

(12.C.4)

and

It can be shown that under our assumptions we must have (q!, q!) » 0, and so conditions (12.C.3) and (12.C.4) must both hold with equality in any Nash equilibrium.' 2 Adding these two equalities tel1s us that in any Nash equilibrium we

must have

p'(q!

+ qn(q! ; q!) + p(q! + qn

= c.

(12.C.S)

Condition (12.C.S) al10ws us to reach the conclusion presented in Proposition 12.C.2. Proposition 12.C.2: In any Nash equilibrium of the Cournot duopoly model with cost c > a per unit for the two firms and an inverse demand function p(.) satisfying p'(q) < a for all q ~ a and pta) > c, the market price is greater than c (the competitive price) and smaller than the monopoly price. II. Nole Ihal Ihis melhod of analysis, which relies on Ihe use of first-order condilions 10 calculale best responses, differs from the method used in the analysis of the Berlrand model. The reason is that in the Bertrand model each firm's objective runction is discontinuous in its decision

variable. so that differential optimization techniques cannot be used. Fortunately, the determination of Ihe Nash equilibrium in the Berlrand modellurned out, nevertheless, 10 be quite simple. 12. To see this, suppose Ihat q~ = O. Condition (12C.3) then implies Ihal P(qi) S c. By condilion (12.CA) and the fact that p'(') < 0, Ihis implies Ihat were q; > 0 we would have p'(q;)q; + p(q;) < c, and so q; = O. Bullhis means that p(O) s c, contradicting the assumption that p(O) > c. Hence, we must have q! > O. A similar argument shows that q; > O. Note, however, Ihal this conclusion depends on our assumption of equal costs for the two firms. For example, a firm might set its output equal to zero if it is much less efficient than its rival. Exercise 12.C.9 considers some of the issues that arise when firms have differing costs.

1 2 • C:

5 TAT I C

MOO E L S

0 F

0 L I GOP 0 L y

391

------------------------------------------~~~~~~~= Proof: That the equilibrium price is above c (the competitive price) follows immediately from condition (I2.C.S) and the facts that q! + q! > 0 and p'(q) < 0 at all q ~ O. We next argue that (q! + q!l > q"', that is, that the equilibrium duopoly price p(q! + q!) is strictly less than the monopoly price p(q"'). The argument is in two parts. First, we argue that (q! + q!) ~ q"'. To see this, suppose that q'" > (q! + q!), By increasing its quantity to qj = q"' - q:, firmj would (weakly) increase the joint profit of the two firms (the firms' joint profit then equals the monopoly profit level, its largest possible level). In addition, because aggregate quantity increases, price must fall, and so firm k is strictly worse off. This implies that firm j is strictly better off, and so firm j would have a profitable deviation if q"' > (q! + q!), We conclude that we must have (q! + ~ q"'. Second, condition (l2.C.S) implies that we cannot have (q! + = q"' because then

qn

qn

in violation of the monopoly first-order condition (12.B.4). Thus, we must in fact have (q! + q!) > q"' . • Proposition 12.C.2 tells us that the presence of two firms is not sufficient to obtain a competitive outcome in the Cournot model, in contrast with the prediction of the Bertrand model. The reason is straightforward. In this model, a firm no longer sees itself as facing an infinitely elastic demand. Rather, if the firm reduces its quantity by a (differential) unit, it increases the market price by - p'(ql + q2)' If the firms found themselves jointly producing the competitive quantity and consequently earning zero profits, either one could do strictly better by reducing its output slightly. At the same time, competition does lower the price below the monopoly level, the price that would maximize the firms' joint profit. This occurs because when each firm determines the profitability of selling an additional unit it fails to consider the reduction in its rival's profit that is caused by the ensuing decrease in the market price [note that in firm j's first-order condition (12.C.2), only qj multiplies the term p'('), whereas in the first-order condition for joint profit maximization (q, + q2) does]. Example 12.C.I: Cournot Duopoly wit It a Linear Inverse Demand Function and COOlstant Returns to Scale. Consider a Cournot duopoly in which the firms have a cost per unit produced of c and the inverse demand function is p(q) = a - bq, with a > c ~ 0 and b > O. Recall that the monopoly quantity and price are q"' = (a - c)/2b and p" = (il + c)/2 and that the socially optimal (competitive) output and price are qO = (a - c)/b and po = p(qO) = c. Using the first-order condition (12.C.2), we find that firm j's best-response function in this Cournot model is given by

bj(q,) = Max {O, (a - c - bq,)/2b}. Firm I's best-response function b,(q2) is depicted graphically in Figure 12.c'1. Since b,(O) = (a - c)/2b, its graph hits the q, axis at the monopoly output level (a - c)/2b. This makes sense: Firm I's best response to firm 2 producing no output IS to produce exactly its monopoly output level. Similarly, since b,(q2) = 0 for all

392

C HAP T E R

1 2:

MAR K E T

-

POW E R

a-c

+ q,)q, {(q,. q,): p(q, + q,)q,

------------ {(q,.q,): p(q,

- cq, = n') - ("q, = W); W>

n'

2b

SEC T ,

0N

12•

c:

S TAT I C

MOO E L S

Flgur. 12.C.3

Firm. 1'5 ~st-responsc funct,on 10 the Cournot duopoly model of Example I 2.C. I.

equilibrium in the Cournot model.

Symmelric Joinl Monopoly Poinl Figure 12.C.2

3b (a)

q,

(b)

q2 ~ (a - c)/b, the graph of firm I's best-response function hits the q2 axis at the socially optimal (competitive) output level (a - c)/b, Again, this makes sense: If firm 2 chooses an output level of at least (a - c)/b, any attempt by firm I to make sales results in a price below c. Two isoprofit loci of firm I are also drawn in the figure; these are sets of the form (q" q2): p(q, + q2)q, - cq, = n} for some profit level n. The profit levels associated with these loci increase as we move toward firm 1's monopoly point (q" q2) = «a - c)/2b, 0). Observe that firm I's isoprofit loci have a zero slope where they cross the graph of firm I's best-response function. This is because the best response b,(ih) identifies firm I's maximal profit point on the line q, = ii2 and must therefore correspond to a point of tangency between this line and an isoprofit locus. Firm 2's best-response function can be depicted similarly; given the symmetry of the firms, it is located symmetrically with respect to firm I's best-response function in (q" q2)-space [i.e., it hits the q2 axis at (a - c)/2b and hits the q, axis at (a - c)/b]. The Nash equilibrium, which in this example is unique, can be computed by finding the output pair (q!, qt) at which the graphs of the two bestresponse functions intersect, that is, at which q! = b,(qf) and q! = b,(q!). It is depicted in Figure 12.C.2(a) and corresponds to individual outputs of qT = q! = H{a - c)/b], total output of Wa - c)/b], and a market price of p(qT + q!) = !(a + 2c) E (c, pM). Also shown in Figure 12.C.2(b) is the symmetric joint monopoly point (qm/2, qm/2) = «a _ c)/4b, (a - c)/4b). It can be seen that this point, at which each

Nash equilibrium in the Cournot duopoly model of Example 12.C.1.

393

Nonexistence of (pure strategy) Nash

Exercise 12.C.6: Verify the computations and other claims in Example l2.C.1.

a-c

L Y

q,

firm produces half of Ihe monopoly output of (a - c)/2b, is each firm's most profitable point on the q, = q, ray. _

4b

0 L I GOP 0

Figure 12.C.1

q,

4b

0f

---------------------------~~~~~~~~~~~~

Up 10 Ihis poinl we have nol made any assumptions aboul the quasiconcavilY in q. of each firmj's objeclive funclion in problem (12.C.1). Withoul quasiconcavilY of Ihese funcli~lls. however. a pure slralegy Nash equilibrium of Ihis quantilY game may nol exisl. For example. as happens in Figure 12.C.3. Ihe besl-response funclion of a firm lacking a quasiconcavc objeclive fUllclion may "jump." leading 10 Ihe possibilily of nonexislence. (Striclly speaking, for a silualion like Ihe one depicled in Figure 12.C.310 arise.lhe Iwo firms must have differenl cosl funclions; see Exercise 12.C.8.) Wilh quasiconcavity. we can use Proposilion 8.0.3 10 show Ihal a pure slralegy Nash equilibrium necessarily ex iSIs. Suppose now that we have J > 2 identical firms facing the same cost and demand functions as above. Letting QJ be aggregate output at equilibrium, an argument parallel to that above leads to the following generalization of condition (l2.C.S):

P'(QJ)~! + p(Q1) =

c.

(l2.C.6)

At one exlreme, when J = I, condition (12.C.6) coincides with the monopoly first-order condition that we have seen in Section 12.B. At the other extreme, we must have p(Q1J .... c as J .... 00. To see this, note that since QJ is always less than the socially optimal (competitive) quantity qO, it must be the case that p'(QJ )(QJ /J) .... 0 as J .... oc;. Hence, condition (12.C.6) implies that price must approach marginal cost as the number of firms grows infinitely large. This provides us with our first taste of a "competitive limit" result, a topic we shall return to in Section 12.F. Exercise 12.C.7 asks you to verify these claims for the model of Example 12.C.!. Exercise 12.C.7: Derive the Nash equilibrium price and quantity levels in the Cournot model with J firms where each firm has a constant unit production cost of c and the inverse demand function in the market is p(q) = a - bq, with a > c ~ 0 and b > O. Verify that when J = I, we get the monopoly outcome; that output rises and price falls as J increases; and that as J .... DC the price and aggregate output in the market approach their competitive levels. In contrast with the Bertrand model, the Cournot model displays a gradual reduction in market power as the number of firms increases. Yet, the "farmer sending

394

CHAPTER

12:

MARKET

---

POWER

crops to market" scenario may not seem relevant to a wide class of situations. After all, most firms seem to choose their prices, not their quantities. For this reason, many economists have thought that the Cournot model gives the right answer for the wrong reason. Fortunately, the departure from the Bertrand model that we study next offers an alternative interpretation of the Cournot model. The basic idea is that we can think of the quantity choices in the Cournot model as long-run choices of capacity, with the determination of price from the inverse demand function being a proxy for the outcome of short-run price competition given these capacity choices.

In many settings, it is natural to suppose that firms operate under conditions of eventual decreasing returns to scale, at least in the short run when capital is fixed. One special case of decreasing returns occurs when a firm has a capacity constraint that prevents it from producing more than some maximal amount, say Ii. Here we consider, somewhat informally, how the introduction of capacity constraints affects the prediction of the Bertrand model. With capacity constraints (or, for that matter, costs that exhibit decreasing returns to scale in a smoother way), it is no longer sensible to assume that a price announcement represents a commitment to provide any demanded quantity, since the costs of an order larger than capacity are infinite. We therefore make a minimal adjustment to the rules of the Bertrand model by taking price announcements to be a commitment to supply demand only up to capacity. We also assume that capacities are commonly known among the firms. To see how capacity constraints can affect the outcome of the duopoly pricing game, suppose that each of the two firms has a constant marginal cost of c > 0 and a capacity constraint of ii = ~x(c). As before, the market demand function x(·) is continuous, is strictly decreasing at all p such that x(p) > 0, and has x(c) > O. In this case, the Bertrand outcome p~ = p! = c is no longer an equilibrium. To see this, note that because firm 2 cannot supply all demand at price p! = c, firm I can anticipate making a strictly positive level of sales if it raises PI slightly above c. As a result, it has an incentive to deviate from p~ = c. In fact, whenever the capacity level ii satisfies ii < x(c), each firm can assure itself of a strictly positive level of sales at a strictly positive profit margin by setting its price below p(ij) but above c. This is illustrated in Figure 12.CA. In the figure, we assume that the lower-priced firm 2 fills the highest-valuation demands. By charging

p, = c

MODELS

a price PI E (c, p(ii)), firm I sells to the remaining demand at price p" making sales of x(I'.J - ii > O. Hence, with capacity constraints, competition will not generally drive price down to cost, a point originally noted by Edgeworth (1897).

Product Differentiation Calculation of demand in the presence of capacity constraints when the low-priced firm satisfies high-valuation demands first.

p(ti)

STATIC

Up to this point in our discussion, we have taken a firm's capacity level as exogenous. Typically, however, we think of firms as c/lOosing their capacity levels. This raises a natural question: What is the outcome in a model in which firms first choose their capacity levels and then compete in prices? Kreps and Scheinkman (1983) address this question and show that under certain conditions (among these is the assumption that high-valuation demands get served first when demand for a low-priced firm outstrips its capacity), the unique subgame perfect Nash equilibrium in this two-stage model is the Cournol outcome. This result is natural: the computation of price from the inverse demand curve in the Cournot model can be thought of as a proxy for this second-stage price competition. Indeed, for a wide range of capacity choices (iiI' ii 2), the unique equilibrium of the pricing subgame involves both firms setting their prices equal to p(iil + ii,) (see Exercise 12.C.II). Thus, this two-stage model of capacity choice/price competition gives us the promised reinterpretation of the Cournot model: We can think of Cournot quantity competition as capturing long-run competition through capacity choice, with price competition occurring in the short run given the chosen levels of capacity.

Figure 12.C.4

Demand Satisfied

12.C,

Determining the equilibrium outcome in situations in which capacity constraints are presenl can be tricky because knowledge of prices is no longer enough to determine each firm's sales. When the prices quoted are such that the low-priced firm cannot supply all demand at its quoted price, the demand for the higher-priced firm will generally depend on precisely who manages to buy from the low-priced firm. The high'priced firm will typically have greater sales if consumers with low valuations buy from the low· priced firm (in contrast with the assumption made in Figure 12.C.4) than if high-valuation consumers do. Thus, to determine demand functions for the firms, we now need to state a rarioning rule specifying which consumers manage to buy from the low-priced firm when demand exceeds its capacity. In fact, the choice of a rationing rule can have important effects on equilibrium behavior. Exercise 12.C.11 asks you to explore some of the features of the equilibrium outcome when the highest valuation demands are served first, as in Figure 12.C.4. This is the rationing rule that tends to give the nicest results. Yct, it is neither more nor less plausible than other rules, such as a queue system or a random allocation of available units among possible buyers.

Capacity Constraints alld Decreasing Returns to Scale

~YFirm2

SECTION

In the Bertrand model, firms faced an infinitely elastic demand curve in equilibrium: With an arbitrarily small price differential, every consumer would prefer to buy from the lowest-priced firm. Often, however, consumers perceive differences among the products of different firms. When product differentiation exists, each firm will possess some market power as a result of the uniqueness of its product. Suppose, for example, that there are J > I firms. Each firm produces at a constant marginal cost of c > O. The demand for firm j's product is given by the continuous function xj(Pj' P_j), where P _ j is a vector of prices of firm j's rivals. \3 In a setting of simultaneous price

p(' ) ij

X(PI)

'---y-l

Firm I's Sales

x,q

OF

OLIGOPOLY

395

------------------------------------------------------------

13. Note the departure from the Bertrand model: In the Bertrand model, X/Pl' P_j) is discontinuous at Pj = Minl#i Pl.

5 E C T ION

396

C HAP T E R

1 2:

MAR K E T

POW E R

.~~~------------------------------------Firm 2 "..

/Firml

.

•

o

Flgur. 12.C.S

\

The linear city.

M Consumers Uniformly Distributed on Segment

choices, each firmj takes its rivals' price choices P-i as given and chooses Pi to solve

buy from firm I. At these locations, p, + IZ < p, + c(l - z} (purchasing from firm I is better than purchasing from firm 2), and v - p, - Iz > 0 (purchasing from firm I is better than not purchasing at all). At location z" a consumer is indifferent between purchasing from firm I and not purchasing at all; that is, z, satisfies v - p, - tz, = O. In Figure 12.C.6(a}, consumers in the interval (z" Z2) do not purchase from either firm, while those in the interval (Z" I] buy from firm 2. Figure 12.C.6(b), by contrast, depicts a case in which, given prices PI and p" all consumers can obtain a strictly positive surplus by purchasing the good from one of the firms. The location of the consumer who is indifferent between the two firms is the point such that

p, + Ii = P2 + 1(1 - i)

i = I

'----y-''----y----'~

Buy From Firm I (a)

p,

PI

(12.0)

.

In general, the analysis of this model is complicated by the fact that depending on the parameters (v, c, c), the equilibria may involve market areas for the firms that do not touch [as in Figure 12.C.6(a)], or may have the firms battling for consumers in the middle of the market [as in Figure 12.C.6(b}]. To keep things as simple as possible here, we shall assume that consumers' value from a widget is large relative to production and travel costs, or more precisely, that v > c + 3c. In this case, it can be shown that a firm never wants to set its price at a level that causes some consumers not to purchase from either firm (see Exercise 12.CI3). In what follows, we shall therefore ignore the possibility of non purchase. Given p, and P2' let be defined as in (12.C.7). Then firm I's demand, given a pair of prices (p" P2), equals Mi when i E [0, I], M when > I, and 0 when i < 0.14 Substituting for i from (12.C. 7), we have

z

z

if p, > P2 if p,

E

+c

[p, - C, P2

+ c]

(12.C.8)

if p, < P, - r. By the symmetry of the two firms, the demand function of firm 2, x 2(p" P2), is if P, > p, Consumer purchase decisions given PI and P" c) because it can assure itself of strictly positive profits by setting its price slightly above c. Thus, in the presence of product differentiation, equilibrium prices will be above the competitive level. As with quantity competition and capacity constraints, the presence of product differentiation softens the strongly competitive result of the Bertrand model. A number of models of product differentiation are popular in the applied literature. Example 12.C.2 describes one in some detail.

0"

1 2 . C;

E

+c

[PI - C, PI

+ c]

(12.C.9)

if P2 < PI - r. Note from (12.C.8) and (12.C.9) that each firm j, in searching for its best response to any price choice P_i by its rival. can restrict itself to prices in the interval [P-i - C, p-J + c]. Any price Pi > P-i + c yields the same profits as setting Pi = P-i + c (namely. zero), and any price Pi < P-i - t yields lower profits than setting Pi = P-i - r (all such prices result in sales of M units). Thus, firm j's best

~ '------y--'

Buy From Firm I

No Buy From Purchase Firm 2 (b)

Buy From Firm 2

0 F

0 l I GOP 0 L

Y

397

----------------------------------------------------------------

14. Recall that the M consumers are unirormly distributed on the line segment, so i is the

rraction who buy from firm I.

398

CHAPTER

12:

MARKET

POWER

~~~~~~---------------------------------------------------response to

P_j

solves

(Pj - C)(I

Max

+ P-j -

M

Pj)

s.t. PjE [p_j - I,P_j

---

SEC T , 0 N

C

I, firm j plays pm if in every previous period both firms have charged price pm and otherwise charges a price eq ual to cost. This type of strategy is called a Nash reversion stralegy: Firms cooperate until someone deviates, and any deviation triggers a permanent retaliation in which both firms thereafter set their prices equal to cost, the one-period Nash strategy. Note that if both firms follow the strategies in (12.0.1), then both firms will end up charging the monopoly price in every period. They start by charging pm, and therefore no deviation from pm will ever be triggered. For the strategies in (12.0.1), we have the result presented in Proposition 12.0.1. Proposition 12.0.1: The strategies described in (12.0.1) constitute a subgame perfect Nash equilibrium (SPNE) of the infinitely repeated Bertrand duopoly game if and only if c5 ~ ~. Proof: Recall that a set of strategies is an SPNE of an infinite horizon game if and only if it specifies Nash equilibrium play in every subgame (see Section 9.B). To start, note that although each subgame of this repeated game has a distinct history of play leading to it, all of these subgames have an identical structure; Each is an infinitely repeated Bertrand duopoly game exactly like the game as a whole. Thus, to establish that the strategies in (12.0.1) constitute an SPNE, we need to show that after any previous history of play, the strategies specified for the remainder of the game constitute a Nash equilibrium of an infinitely repeated Bertrand game.

IN T ERA C T ION

401

402

C HAP T E R

1 2:

MAR K E T

POW E R

In fact, given the form of the strategies in (12.0.1), we need to be concerned with only two types of previous histories: those in which there has been a previous deviation (a price not equal to pm) and those in which there has been no deviation. Consider, first, a subgame arising after a deviation has occurred. The strategies call for each firm to set its price equal to c in every future period regardless of its riva!"s behavior. This pair of strategies is a Nash equilibrium of an infinitely repeated Bertrand game because each firm j can earn at most zero when its opponent always sets its price equal to c, and it earns exactly this amount by itself setting its price equal to (' in every remaining period. Now consider a subgame starting in, say, period t after no previous deviation has occurred. Each firm j knows that its rival's strategy calls for it to charge pm until it encounters a deviation from pm and to charge c thereafter. Is it in firm j's interest to usc this strategy itself given that its rival does? That is, do these strategies constitute a Nash equilibrium in this subgame? Suppose that firm j contemplates deviating from price pm in period t 2: t of the subgame if no deviation has occurred prior to period r'9 From period t through period r - I, firm j will cam !(pm - c)x(pm) in each period, exactly as it does if it never deviates. Starting in period r, however, its payoffs will differ from those that would arise ifit does not deviate. In periods after it deviates (periods r + I, r + 2, ... ), firmj's rival charges a price of c regardless of the form offirmj's deviation in period r, and so firm j can earn at most zero in each of these periods. In period r, firm j optimally deviates in a manner that maximizes its payoff in that period (note that the payoffs firm j receives in later periods are the same for any deviation from pm that it makes). It will therefore charge pm - £ for some arbitrarily small £ > 0, make all sales in the market, and earn a one-period payoff of (pm - C - £)x(pm). Thus, its overall discounted payoff from period r onward as a result of following this deviation strategy, discounted to period r, can be made arbitrarily close to (pm - c)x(pm). On the other hand, if firm j never deviates, it earns a discounted payoff from period r onward, discounted to period r, of [l(pm - c)x(pm)]j(l - b). Hence, for any t and r 2: t, firm j will prefer no deviation to deviation in period r if and only if

--- ---

(12.0.2) Thus. the strategies in (12.0.1) constitute an SPNE if and only if .I 2:

! .•

19. From our previous argument, we know that once a deviation has occurred within this stlbgame. firm j can do no bener than to play c in every period given that its rival will do so. Hence, to check whether these strategies form a Nash equilibrium in this subgame, we need only check whether firm j will wish to deviate from I'M if no such deviation has yet occurred.

REPEATED

The discount factor need not be interpreted literally. For example. in a model in which market tiL-mand is growing at ratc~' [i.e .• .'(,(1') = ),'.\:(p)J, larger values of y make the model behave as if there is a larger discount factor because demand growth increases the size of any future losses caused by a current deviation. Alternatively, we can imagine that in each period there is a probability)' that the firms' interaction might end. The larger y is, the more firms

will effectively discount the future. (This interpretation makes clear that the infinitely repeated game framcwork can be relevant even when the firms may cease their interaction within some

finite amount of time: what is needed to fil Ihe analysis into the framework above is a strictly positive probability of continuing upon having reached any period.) Finally, the value of b can rellect how long il takes to deleCI a deviation. These interpretalions are developed in Exercise Il.D.!.

1.

Although the strategies in (12.0.1) constitute an SPNE when .I 2: they are nor the only SPNE of the repeated Bertrand model. In particular, we can obtain the result presented in Proposition 12.0.2. ProposItion 12.0.2: In the infinitely repeated Bertand duopoly game, when ,j 2: ~ repeated choice of any price p € [C, pm] can be supported as a subgame perfect Nash equilibrium outcome path using Nash reversion strategies. By contrast, when ,I < ~, any subgame perfect Nash equilibrium outcome path must have all sales occurring at a price equal to c in every period. Proof: For the first part of the result, we have already shown in Proposition 12.0.1 that repeated choice of price pm can be sustained as an SPNE outcome when rI 2: 1. The proof for any price p € [c, pm) follows exactly the same lines; simply change price I'm in the strategies of (12.0.1) to p€ [c, pm). The proof of the second part of the result is presented in small type. We now show that all sales must occur at a price equal to c when ~ 0 such that

adding (12.D.3) for { =

211: r ~ (l:1t

+

V2r)'

But (,." + r,,) S [1/(1 - ,liJn" and so this is impossible if b < j.

Tt f

> O. \Ve will derive a

11:, ~ Tt,

for all r. If so. thell

INTERACTION

403

404

C HAP T E R

1 2:

MAR K E T

POW E R

(ii) Suppose. instead. that no such period exists; that is. for any period I. there is a period n, > n,. Define t(l) for I n,U-II}' Note that, for all I, n, is bounded above by the monopoly profit level "m = (pM - c)x(p') and that the sequence {n,u,},";" is monotonically increasing. lien j, and so J* = j is the unique equilibrium number of firms 2 1.22 21. NOle, however, that although there is a unique number of entrants. there arc many equilibria, in each of which the particular firms choosing to enter differ. 22. Without the assumption that firms enter when indifferent, condition (12.1.:":.2) would be a weak inequality. This change in (12.E.2) matters for the idcntinc~ltIon ofthc equilihrium numher or firms only in the case in which (here is an integer number of firms J such that "Ttj = K (so that with J firms in the market each firm earns exactly zero net of its entry cost K). When this is so, this change allows both J and J - 1 to be equilibria. With minor adaptations but some loss of exrosilional simplicity, all the points made in this section can be extended to cover this case.

Elltry {(lid WeI/1m'

J

Consider now how the number of firms entering an oligopolistic market compares with the number that would maximize social welfare given the presence of oligopolistic competition in the market. We begin by considering this issue for the case of a homogeneous-good industry. Let 'I) be the symmetric equilibrium output per firm when there are J firms in the market. As usual, the inverse demand function is denoted by pC). Thus, p(Jq}) is the price when there are J active firms; and so tr} = p(Jq})q} - e(q}), where c(') is the cost function of a firm after entry. We assume that e(O) = O.

1 2 . E:

E N TRY

407

408

CHAPTER

12:

MARKET

POWER

We measure welfare here by means of Marshallian aggregate surplus (see Section IO.E). In this case, social welfare when there are J active firms is given by J .,

W(J) =

f

[>(s) ds - Je(qJ) - JK.

0

(12.E.5)

Example I2.E,3: Consider the Cournot model of Example 12.E.1. For the moment, ignore the requirement that the number of firms is an integer, and solve for the number of firms I at which w'(I) = O. This gives (J

+

(a - e)2 I) = J

-"K-

proposition 12.E,1: Suppose that conditions (Al) to (A3) are satisfied by the post-entry oligopoly game. that p'(') < 0, and that c ~ O. Then the equilibrium number of entrants. J*. is at least J O - 1. where J O is the socially optimal number of 23 entrants. H

_

+ 1)2

I) =

Q,

f

Q,

bK

(I +

pis) ds - l"c(qJ )

+ (J" -

I)c(qr-,) ~ K,

,

where we let QJ = JlJJ. We can rearrange this expression to yield Q,

nJ

-,

-

K ~ p(QJ -, )qJ -, -

f

Q,

pIs) ds

+ J"[r(qJ')

- r(qJ _,)].

,

Given 1"(') < 0 and condition (A I), this implies that 1tJ-,-I\.~p(QJ-I)[(JJ'-,+QJ-,-QJ]+r[c(qJ)-(qJ-,)].

(a _ 1')2

= - ---

the equilibrium number of firms is the largest integer less than or equal to (12.E.6) and (12.E.7), we see that

(J +

Proof: The result is trivial for J 0 = I, so suppose that r > I. Under the assumptions of the proposition, 1t J is decreasing in J (Exercise 12.E.2 asks you to show this). To establish the result, we therefore need only show that 1tJ '_I ~ K. To prove this, note first that by the definition of r we must have W(J°)W(}" - I) ~ 0, or

(12.E.6)

If I turns out to be an integer, then the socially optimal number of firms is r = 1. Otherwise, J '" is one of the two integers on either side of I [recall that W(·) is concave]. Now, recall from (12.E.4) that 1t J ~ (I/h)[(a - C)/(J + 1)]2. As noted in Example 12.E.I, if we let I be the real number such that (J

For markets satisfying these three conditions we have the result shown in Proposition 12.E.1.

(.)

The socially optimal number of active firms in this oligopolistic industry, which we denote by J , is any integer number that solves Max J W(J). Example 12.E.3 illustrates that in contrast with the conclusion arising in the case of a competitive market, the equilibrium number of firms here need not be socially optimal.

_

---

SECTION

(12.E.7)

1.

From

1)J!2.

Thus, when the demand and cost parameters arc such that the optimal number of firms is exactly two (F = I = 2), four firms actually enter this market (J* = 4, since J;; 4.2); when the social optimum is for exactly three firms to enter (J" = I = 3), seven firms actually do (J * = 7, since J = 7); when the social optimum is for exactly eight firms to enter (J' = I = 8), 26 actually enter (J * = 26, since J = 26) . • Can we say anything general about the nature of the entry bias? It turns out that we can as long as stage 2 competition satisfies three weak conditions [we follow Mankiw and Whinston (1986) here]: (A I) JqJ ~ J''1J' whenever J > J'; (A2) '1J $: '1J' whenever J > J'; (A3) p(J'lJ) - e'('1J) ~ 0 for all J. Conditions (A I) and (A3) arc straightforward: (A I) requires that aggregate output increases (price falls) when more firms enter the industry, and (A3) says that price is not below marginal cost regardless of the number of firms entering the industry. Condition (A2) is more interesting. It is the assumption of bl/siness stealing. It says that when an additional firm enters the market, the sales of existing firms fall (weakly). Hence, part of the new firm's sales come at the expense of existing firms. These conditions are satisfied by most, although not all, oligopoly models. [In the Bertrand model. for example, condition (A3) does not hold.]

12.E,

ENTRY

409

,---------------------------------------------------------------------

(12.E.8)

But since 1'''(-) ~ 0, we know that (,'(qJ _, )[qr - qJ _,] $: (qJ) - c(qJ _,). Using this inequality with (12.E.8) and the faet that q, _, + QJ _ I - Qr = J"(qr- , - qJ') yields 1t J -, - K ~ [p(QJ-,) - ('(q, -,)]J"(qr-, - qJ ).

Conditions (A2) and (A3) then imply that 1t J

_,

~ K.24 •

The idea behind the proof of Proposition 12.E.1 is illustrated in Figure 12.E.1 for the case where ('I) = 0 for all q. In the figure, the incremental welfare benefit of the rth firm, before taking its entry cost into account, is represented by the shaded area (ahcd). Since entry of this firm is socially efficient, this area must be at least K. But area (ahed) is less than area (aha), which equals p(QJ -,)(QJ" - QJ -,i. Moreover, business stealing implies that (Qr - QJ _,) = J'qJ - (J" - 1)'1, _, $: q, _I' and so we see that area (abce) $: p(QJ' -,)qr-I ~ 1CJ ,-, [the value of1CJ"_' is represented in Figure 12.E.1 by area (abfg)]. Hence 1tJ-' ~ K. The tendency for excess entry in the presence of market power is fundamentally driven by the business-stealing effect. When business stealing accompanies new entry and price exceeds marginal cost, part of a new entrant's profit comes at the expense of existing firms, creating an excess incentive for the new firm to enter. Of course, as Proposition 12.E.1 indicates, we may also see too few lirms in an industry. The classic example concerns a situation in which the socially optimal number of lirms is one. A single firm deciding whether to enter a market as a

1I J

21 Iflhere is more than one maximizer of W(J), say p;, .. .,J~}, then J* ~ Max{J; .... ,J~} - t. 24. Note that if (A I) holds with strict inequality. then this conclusion can be strengthened to O _ I > K [a strict inequality appears in (12.E.8»). In this case, J* ~ J I even if firms do not enter

when indifferent.

410

CHAPTER

12:

MARKET

SECTION

POWER

------------------------------------------------------------------- ,---

AC(q) = K

Ftgure 12.E.l

/

(left)

12.F:

THE

COMPETITIVE

+ cq q

Diagrammatic

p(Q,

I Q

12E.4. Figure 12.E.2 (right) An insufficient entn incentive.

monopolist compares its monopoly profit-the hatched area (abde) in Figure 12.E.2-with the entry cost K. However, the firm fails to capture, and therefore ignores. the increase in consumer surplus that its entry generates-the shaded area (file). As a result, the firm may find entry unprofitable even though it is socially desirable. Proposition 12.E.1 tells us, however, that if we have too little entry in a homogeneous-good market, this can be at most by a single firm. What happens when product differentiation is present? It turns out that we can then say very little of a general nature. The reason is that the sort of problem illustrated in Figure 12.E.2 can now happen for many products, leading to many "too few by one" conclusions. An additional issue is that. with product differentiation. the number of firms is not all that matters. We may also fail to have the right selection of prod ucts. 25 An alternative approach to the two-stage entry game models the actions of entry and quantity/price choice as simultaneous. In this one-stage tntr}' game, a firm incurs its setup cost

.

several firms at price p*, none of them could cover their cost).2b In this equilibrium, all lirms

make zero prolits. The equilibrium outcome

IS

depicted in Figure 12.E.3. Observe that it is

strictly superior in welfare terms to the outcome that arises in the two-stage entry process considered in Example 12.E.2, where there is also a single firm active but it quotes a monopoly pricc. 27 •

What is the critical ditTerence between the one-stage and two-stage entry processes" In the two-stage model an cntrant must sink its fixed costs prior to competing, whereas in the one-stage model it can compete for sales while retaining the option not to sink these costs if it does not make any sales. We can think of the two-stage case as a model of a firm incurring a once-and·for-all sunk entry cost that allows for many later periods of competitive interaction, whereas the one-stage case cHptures a setting in which ·'hit·and-run" entry is possible (i.e.,

entry for just one period while paying only the one-period rental price of capital). When a firm must incur a sunk cost in entering it must consider the reaction of other firms to its entry. In the Bertrand model with constant costs this reaction is severe: price falls to cost and the firm loses money by entering. In contrast, in the one-stage game the firm can enter and undercut active firms' prices without fearing their reactions. This makes entry more aggressive

and leads to a lower equilibrium price. This one-stage entry model with price competition providcs olle formalization of what Baumol. Panzar. and Willig (1982) call a co"lesllll>/e market.

only if it sells a positive amount. For example. the one-stage versions of Examples 12.E.1 and 12.E.2 are Cournot and Bertrand games. respectively. with cost functions C(q) = { :

+ c(q)

if q > 0 if q = 0

and an infinite (or very large) number of firms. For models of price competition. this change can have dramatic consequences. Consider the effect on the result of Example 12.E.2 that is illustrated in Example 12.E.4. Example 12_E.4: Tire Olle-Stage Elltry Model witlr Bertralld Competi/ioll. Suppose that + cx(p)J/x(p) for some p (the parameter c > 0 is the cost per unit); that is, suppose

12.F The Competitive Limit In Chapter 10. we introduced the idea that a competitive market might usefully be thought of as a limiting case of an oligopolistic market in which firms' market power grows increasingly small (see Section 10.B). We also noted that this view could provide a framework for reconciling cases in which competitive equilibria fail to exist in the presence of frcc entry and average costs that exhibit a strictly positive etlicient

p> [K

there is some price level at which a monopolist can earn strictly positive profits after paying its set up cost K. Assume that many firms simultaneously name prices and that a firm incurs the setup cost K only if it actually makes sales. Any equilibrium of this game has all sales occurring at price p' = Min{p: p '" [K + cx(plJjx(p)} (if price is above p', some firm could

gain by setting a price p' - r.; if price is below p', some firm must be making strictly negative prolits). and one firm satisfying all demand at this price (if the demand were split among 25. See Spence (1976). Dixit and Stiglitz (t977), Salop (1979). and Mankiw and Whinston (1986) for more on the case of product differentiation.

Figure 12.E.3

Equilibrium in the one-stage entry game discussed in Example

explanation of Proposition 12.E.1.

II----I-~

LIMIT

26. Note that we now allow consumer demand to be given entirely to one firm when several firms name the same price (before, we had taken the division of demand in this case to be exogenously given). This is the only division of demand that is compatible with equilibrium in this example. It can he formally juslificd as the limit of the equilibria thaI arise when prices must be quoted in di"crctc units as the size of these units grows small. 27. in fact. this equilibrium outcome is the solution to the problem faced by a welfaremaximizing planner who can control the outputs qj of the firms but must guarantee a nonnegative profit to all active firms. that is. who faces the constraint that p(L.1t qk)qj 2: cqj + K for every j with qj > o.

411

412

CHAPTER

12:

MARKET

POWER

----------------------------------------------------------scale (see Section to. F). In this situation, we argued, as long as many firms could fit into the market, the market outcome ought to be close to the competitive outcome that would arise if industry average costs were actually constant at the level of minimum average cost. In this section, we elaborate on these points and develop, in a setting of free entry, the theme that if the size of individual firms is small relative to the size of the market, then the equilibrium will be nearly competitive. We have already seen one example of this phenomenon in Example l2.E.1. Here we establish the point in a more general way. We now let market demand be x,(p) = rn(p), where x(p) is differentiable and x'(') < O. Increases in IX correspond to proportional increases in demand at all prices. Letting p(q) be the inverse demand function associated with x(p), the inverse demand function associated with x,(p) is then p,({j} = p(q/~). All potential firms have a strictly convex cost function c(q) and entry cost K > O. We denote the level of minimum average cost for a firm by 0], we say that s, is a strategic complement of s,; and if firm 2 becomes less aggressive in the face of more aggressive play by firm I [i.e., if db,(sf(k»/ds, < 0], s, is a strategic suhstifllte ofs,. [This terminology is derived from Bulow, Geanakoplos, and Klemperer (1985); see also Fudenberg and Tirole (1984) for a related taxonomy.] Figure 12.G.1 summarizes these two determinants offirm 2's response, ds!(k)/dk. Example 12.G.I: The Strategic Effects /rom IIIt:estmellf ill Marginal Cost Reduction. The importance for strategic behavior of the distinction between cases of strategic complements and strategic substitutes is nicely illustrated by examining the strategic effects of investments in marginal cost reduction for models of quantity versus price competition. Suppose that if firm I invests k then its (constant) per-unit production costs are c(k), where c'(k) < O. Consider, first, the case in which stage 2 competition takes the form of the Cournot model of Example 12.C.I, so that the stage 2 strategic variable is Sj = ql' firm j's quantity choice. In this model, we have a situation of strategic substitutes because firm 2's best-response function in stage 2 is downward sloping [db,(q,)/llq, < 0 at all 'I, such that b,(qtl > 0]. As shown in Figure 12.G.2(a), the lowering of firm I's marginal cost because of an increase in k from, say, k' to k" > k', shifts firm I's best-response function outward from b,(q" k') to b,(q" k"); with lower marginal costs, firm I will wish to produce more for any quantity choice of its rival

Strategic effects of a reduction in marginal cost from c(k') to elk") < elk'). (a) Quantity model. (b) Price model.

415

416

CHAPTER

12:

MARKET

APPENDIX

POWER

[and so, in terms of our earlier analysis, iJb,(q!(k), k)/iJk > 0). Thus, in this model, investment in cost reduction leads to a reduction in firm 2's output level, an effect that is beneficial for flrm I [sec Figure 12.G.2(a»). In contrast, suppose that stage 2 competition takes the form of the differentiated price competition model of Example 12.C.2. Here we take Sj = (I/p;) to conform with the interpretation of Sj as an "aggressive" variable [i.e., ,)1!,(s" 52' k)/DS 2 < 0]. In this model. we have a situation of strategic complements: an anticipated reduction in lirm 1'5 pri 0]. As depicted in Figure 12.G.2(b), a reduction in firm I's marginal cost because of an increase in k from k' to k" > k' once again makes flrm I more aggressive, leading it to choose a lower price given any price choice of its rival: its best-response function shirts to the right from h,(I/P2,k') to h,(I/P2,k") [hence, in terms of our earlier analysis. ,oh,(I/p!(k), k)/,'k > 0]. With strategic complements, the result of the reduction in firm I's marginal cost is therefore to lower flrm 2's equilibrium price, an elTect that is undesirable for flrm I. Thus, the strategic ciTects of a reduction in flrm I's marginal cost differ between the two models, being beneficial to firm I in the quantity model and detrimental in the price model.·'o Which model more accurately captures the nature of competitive intera O. Firm 2 will therefore choose "out" given firm l's stage I choice of k if its anticipated profit in stage 3, 1!2(s1'(k), s!(k», is less than F. Given this fact, the incumbent would, of course, like simply to announce that in response to any entry it will engage in predatory pricing (i.e., it will choose a very high level of s, in stage 3). The problem, however, is that this threat must be credible (recall the discussion in Chapter 9). Thus, what the incumbent needs to do to deter entry is choose a level of k that preeommits it to sufliciently aggressive behavior that flrm 2 chooses not to enter. In any particular problem, this mayor may not be possible, and it mayor may not be profitable. As a general matter, there arc many potential mechanisms (i.e., many types of variables k) by which such precolllmitments can be made. In Appendix B, we examine in some detail the classic mechanism of entry deterrence through capacity expansion first studied hy Spence (1977) and Dixit (1980).

APPENDtX A: tNFINITELY REPEATED GAMES AND THE FOLK THEOREM

In this appendix, we extend the discussion in Section 12.D of infinitely repeated games to a more general setting. Our primary aim is to develop a formal statement of a version of the jiJlk the()rem of infinitely repeated games. Infinitely repeated games have a very rich theoretical structure and we shall only touch on a limited number of their properties. Fudenberg and Tirolc (1992) and Osborne and Rubinstein (1994) provide Illore extended discussions.

The Model

The IIrst term on the right-hand side of (12.G.2) is the direct effect on firm l's proflts from changing k: the second term is the strateyic effect that arises because of flrm 2's equilibrium response to the change in k. Since e", (sf(k), s! 0 for i = 1,2. The differential change in firm i's profits from this change is

1(*

{ Tr.

*)_on,(q;,qil .+t1n,(q,',qn - - - . - - d q, - - . - - dq, cq, aqj

q• . qj

cn,(q;, qn d ..

cqj

qj.

(t2.AA.4)

since q't is a best response to qj. Thus, dn,(q~,

qn > O.

(t2.AA.5)

420

C HAP T E R

1 2:

MAR K E T

A P PEN 0 I X

POW E R

A:

I NFl NIT ELY

REP EAT E 0

GAM E 5

AND

THE

F0 LK

THE 0 REM

421

-------------------------------------------------------------------------- ,-----------------------------------------------------------------------"" (I - b)v,

On the other hand, the envelope theorem (see Section M.L of the Mathematical Appendix) tells us that at any 4j • (;rr,(h,(4j), 4) tirr,(qj)

= .... -:-_._.

,,, ,,

J4"

cqj

where 11,(') is player i's best response to 'II in the stage game. Hence,

.

•

tirr,(4j)

tn, ('I: . qj)

= .

....

~

.,(q') J4 r

(12.AA.6)

----+-------,

,

"irq')

Ftgur.12.AA.1 The Nash reversion folk theorem.

rr,,(1 - b)v,

+ 6.41,42 + lltlz) is sustainable as players from deviating from any given outcome path. In general, Nash reversion is not the most severe credible punishment that is possible. Just as players can be induced to cooperate through the use of threatened punishments, they can also be induced to punish each other. To consider this issue, it is useful to let ,!, = Min., [Max., ",(q;, qj)) denote player i's minimax po)'uj):'.1 Payoff ,!; is the lowest payoff that player i's rival can hold him to in the stage game if player i anticipates the action that his rival will play. Note, first, that player i's payoff in the stage game Nash equilibrium q. = ('Ii, cannot be below'!,. More importantly, regardless of the strategies played by his rival, player i's average payoff in the infinitely repeated game or in any subgame within it cannot be below'!,. Thus, no punishment following a deviation can give player i an average payolT below'!,. Payoffs that strictly exceed ,!, for each player i are known as illdivitillol/y rational pu)'offs. Note that for a punishment to be credible we must be sure that after an initial deviation occurs and the punishment is called for, no player wants to deviate from the prescribed punishment path. This means that a punishment is credible if and only if it itself constitutes an SPNE outcome path. Proposition 12.AA.4 tells us that as long as " > 0 and conditions similar to those in Proposition 12.AA.1 hold, SPNEs that yield more severe punishments than Nash reversion can be constructed whenever each player i's stage game Nash equilibrium payoff strictly exceeds ,!;. (You are asked to prove this result in Exercise 12.AA.2.)

the outcome path of an S I'N E using Nash reversion strategies and. by (12.AA.5), yields strictly higher discounted payolfs to the two players than does inHnite repetition of 4· = ('If. 'In. • Proposition 12.AA.I tells us that with continuous strategy sets and differentiable payolT functions, as long as there is some possibility for a joint improvement in payolTs around the stage game Nash equilibrium, some cooperation can be sustained. Going further, examination of condition (12.AA.2) tells us that cooperation becomes easier as t) grows.

qn

Proposition 12.AA.2: Suppose that outcome path Q can be sustained as an SPNE outcome path using Nash reversion when the discount rate is ,5. Then it can be so sustained for any (\' ;::: ii. In fact, as ,\ gets very large. a great number of outcomes become sustainable. The result presented in Proposition 12.AA.3, a version of the Nash reversioll/olk rheorem [originally due to Friedman (1971)], shows that allY stationary outcome path that gives «Ich player a discounted payoff that exceeds that arising from infinite repetition of the stage game Nash equilibrium q. = ('Ii, q!) can be sustained as an SPNE if ,\ is sulficiently close to I. Proposition 12.AA.3: For any pair of actions q = (q" q2) such that ";(q" q2) > ,,;(qj, qn for i = 1,2, there exists a ~ < 1 such that, for all c5 > Q, infinite repetition of q = (q" q2) is the outcome path of an SPNE using Nash reversion strategies. The proof of Proposition 12.AA.3 follows immediately from condition (12.AA.3) letting J -+ I. In fact, with a more sophisticated argument, the logic of Proposition 12.AAJ can be extended to nonstationary outcome paths. By doing so, it is possible to convcxify the set of possible payoffs identified in Proposition 12.AA.3 by alternating between various action pairs (If" 'I,). In this way, we can support any payolTs in the shaded region of Figure 12.AA.I as the average payoffs of an SPN E. .12

Proposition 12.AA.4: Consider an infinitely repeated game with [) > 0 and S; c il for i = 1, 2. Suppose also that ,,;(q) is differentiable at q. = (qf, qn. with 01!;(qj, qtJ/ciq/ i' 0 for i i' i and i = 1,2, and that ,,;(qf, q~) > ,!; for i = 1,2. Then there is some SPNE with discounted payoffs to the two players of (v;, v,,) such that (1 - il) vi < ,,;(qj, q~) for i = 1,2. Under the conditions of Proposition 12.AA.4, for any" E (0, I), more severe punishments than Nash reversion can credibly be threatened. We should therefore expect that more cooperative outcomes can be sustained than those sustainable through the threat of Nash reversion whenever a fully cooperative outcome is not already achievable using Nash reversion strategies.

Exercise I2.AA.I: Argue that no pair of actions 'I such that ",(If" 'I,) < 1!,(qj, '1j) for some i can be sustained as a stationary SPNE outcome path using Nash reversion.

More SCI'ere PUllishmellts alld the Folk Theorem It is intuitively clear that, for a given level of b < I, the more severe the punishments that can be credibly threatened in response to a deviation, the easier it is to prevent 32. See Fudcnbcrg and Maskin (1991) for details.

Possible Payoffs

,,

,,

rq, Together. (12.AA.4) and (12.AA.6) imply that. to first order, the value of the left-hand side of condition (12.AA.3) is unalfected by this change. However, (12.AA.5) implies that the right-hand side of (12.AA.3). to lirst order, increases. Hence, for a small enough change (/\t/1' Ll'!.!) in din:ction (tit/I' dt/~). infinite repetition of (tit

Supportable as SPNE Average Payoffs as b ~ I with Nash Reversion

33. In general. a player's minimax payoff will be lower if mixed strategies are allowed. In this case, the statement of the folk theorem given in Proposition 12.AA.S remains unchanged. but with

these (potentially) lower levels of ~,.

1

422

C HAP T E R

1 2:

MAR K E T

POWE

R

AP PEN 0 I X

8:

5 T RAT E G I C E N TRY

0 E T ERR ENe E

AND

Ace 0 M MOO A T ION

423

---------------------------------------------------------------,-------------------------------------------------------------6)v, .,,(1 -

,,

,,, ,,

n,(q')

!l.,

Supportable as SPNE Average Payoffs as J ~ I

--t-.,

,I· ,I --+-+---------, , !l.. ,,(q')

(i) Both finns play quantity ii in period I followed by the monopoly quantity '1 m in every period I > I as long as no one deviates, where quantity ij satisfies ",(1 - 8)1,

For arbitrary 0 < I, constructing the full set ofSPNEs is a delicate process. Each SPNE, whether collusive or punishing, uses other SPNEs as threatened punishments. For details on how this is done, see the original contributions by Abreu (1986) and (1988) and the presentation in Fudenberg and Tirole (1992). As with SPNEs using Nash reversion strategies, the full set of SPNEs grows as {) increases, making possible both more cooperation and more severe punishments. In fact, the result presented in Proposition 12.AA.S, known as the folk theorem, tells us that lilly feasible individually rational payoffs can be supported as the average payoffs in an SPNE as long as players discount the future to a sufficiently small degree. 34 (Feasibility simply means that there is some outcome path Q that generates these average payoffs.) Proposition 12.AA,5: (The Folk Theorem) For any feasible pair of individually rational payoffs (n" n 2 ) » (,!" '!2)' there exists a ~ < 1 such thaI, for all /j > §, (n" n 2 ) are the average payoffs arising in an SPNE. In comparison with Proposition 12.AA.3, Proposition 12.AA.S tells us that as I we can support any average payoffs that exceed each player's minimax payoff. 35 This limiting set of SPNE average payoffs is shown in Figure 12.AA.2. Example 12.AA.I gives some idea of how this can be done. c5

chooses quantity 'I as n(q).36 Note that '!J = 0 for j = 1,2 here; if firm j's rival chooses a quantity at least as large as the competitive quantity q, satisfying p(q,) = c, then the best firm j can do is to produce nothing and earn zero, and firm j can never be forced to a payoff worse than zero. Consider strategies for the players that take the following form:

--+

Example 12,AA,\: Sustaining an Al'erage Payoff of Zero in the Infinitely Repealed Game. In this example, we construct an SPNE in which both firms earn an average payoff of zero in an infinitely repeated Cournot game. In particular, let the stage game be a symmetric Cournot duopoly game with cost function c(q) = cq, where c > 0, and a continuous inverse demand function p(.) such that p(x) --+ 0 as x --+ Yo. It will be convenient to write a firm's profit when both firms choose quantity 'I as n(q) = [p(2q) - e]q and, as before, a firm's best-response profits when its rival

Ftgure 12.AA.2 The folk theorem.

n«i)

o

+1-"':-3 n(qm) = o.

(I2.AA.7)

(ii) If anyone deviates when ij is meant to be played, the outcome path described in (i) is restarted. (iii) If anyone deviates when '1 m is meant to be played, Nash reversion OCCurs. Note that the outcome path described in (i), if followed by both players, gives both players an average payoff of zero by construction [recall (12.AA.7)). By Proposition 12.AA.3. we know that for some ~ < I we ClIO sustain infinite m repetition of '1 through Nash reversion for all c5 > ,j. Thus, for ,) > ,), neither firm will deviate from the above strategies when '1 m -is supposed to b~ played. Will they deviate when ,j is supposed to be played" Consider firm j's payoff from deviating from ,j in a single period and conforming with the prescribed strategy thereaf1er. Firm j earns n(,7) + (0)(0) because it plays a best response when deviating, and then the original path is restarted. Thus, this deviation does not improve firm j's payoff if n(ij) = 0 (it cannot be less than zero because ,!, = 0). This is so if ij;:>: 'I,. But examining condition (12.AA.7), we see that as ,) approaches I, 11('7) must get increasingly negative for (12.AA.7) to hold and, in particular, that there exists a D, < I such that ,j will exceed q, for all /j > 0,. Thus, for ,j > Max {J" ~}, these strategies constitute an SPNE that gives both firms an average payoff of OJ7 •

COUf/lOl

34. The theorem's name refers to the fact that some version of the result was known in game theory "folk wisdom" well before its formal appearance in the literature. See Fudenberg and Maskin

APPENDIX B: STRATEGIC ENTRY DETERRENCE AND ACCOMMODATION

In this appendix, we discuss an important example of credible precommitments to affect future market conditions in which an incumbent firm engages in pre-entry capacity expansion to gain a strategic advantage over a potential entrant and possibly to deter this firm's entry altogether [the original analyses of this issue are due to Spence (1977) and Dixit (1980)). In what follows, we study the following three-stage game that is adapted from Dixit (1980)

(1986) and t1991) for a proof of the result. When there are more than two players, the result requires that the set of feasible payoffs satisfy an additional "dimensionality" condition. The original appearances of the result in the litemture actually analyzed infinitely repeated games wirluml

discounting [see, for example, Rubinstein (1979)]. 35. We may also be able in some cases to give each player exactly his minimax payoff. This is the case, for ex.ample, in the repeated Bertrand game, where (he stage game's Nash equilibrium yields the minimax payofTs. In Example 12.AA.1. we show that we can also do this for large enough i) in the repeated Cournot duopoly game.

36. We can make the strategy sets compact by noting that in no period will any firm ever

choose a quantity larger than the level q such that "(4) + [,,/(1 - ,')](Max, n(q)) = 0, because it would do better setting its quantity equal to zero forcvcr. Then, Without loss, we can let each firm choose its output from the compact set [0, ii 1

37. We have not considered any multiperiod deviations, but it can be shown that if no single-period deviation followed by conformity with the strategies is worthwhile, then neither is any multiperiod deviation (this is a general principle of dynamic programming).

424

C HAP T E R

1 2;

MAR K E T

POW E R

---------------------------------------------------------------Stage I: An incumbent, firm I, chooses the capacity level of its plant, denoted by k/. Capacity costs r per unit. Stage 2: A potential entrant, firm E, decides whether to enter the market. If it does, it pays an entry cost of F. Slaye 3: If firm E enters, the two firms choose their output levels, q/ and q" simultaneously. The resulting price is p(q/ + q£). For firm E, output costs (IV + r) per unit: for each unit of output produced, firm E incurs both a capacity cost of r and a labor cost of w. For firm I, production must not exceed its previously chosen capacity level. Its production cost, however, is only w per unit because it has already built its capacity. If, on the other hand, firm E does not enter, then firm I acts as a monopolist who can produce up to k/ units of output at cost IV per unit.

APPENDIX

B:

STRATEGIC

ENTRY

DETERRENCE

AND

ACCOMMODATION

425

~-------------------------------------------------------------

h(4 ... 1 .. + r) /

Flgur. 12.BB.2 (leH)

h,(q, I k,)

Nash Equilibrium b(q,lw) b(q,1 w t r) /'

k,

4,

4,

Firm I's stage 3 best-response function after entry.

Flgur. 12.BB.3 (right)

Stage 3 Nash equilibrium after entry.

'II.

To determine the subgame perfect Nash equilibrium (SPNE) of this game, we begin by analyzing behavior in the stage 3 sub games and then work backward. Slllfje 3: QlIalltit)' Competitiol1 Nash

The subgames in stage 3 are distinguished by two previous events: whether firm E has entered and the previous capacity choice of firm I. We first consider the outcome of stage 3 competition following entry and then discuss firm l's behavior in stage 3 if entry does not occur. For simplicity, we assume throughout that firms' profit functions are strictly concave in own quantity; a sufficient condition for this is for p(') to be concave. The concavity of p(.) also implies that firms' best-response functions are downward sloping.

,/b(q,I.·tr) 4,

Staye 3 competition after ell try. Figure 12.BB.l depicts firm E's best-response function in stage 3, which we denote by b(qlw + r) to emphasize that it is the best-response function for a firm with marginal cost w + r. Firm E's stage 3 profits decline as we move along this curve to the right (involving higher levels of q/) and, at some point, denoted Z in the figure, they fall below the entry cost F. Now consider firm l's optimal behavior. The key difference between firm I and firm E is that firm I has already built its capacity. Hence, firm I's expenditure on this capacity is sunk (it cannot recover it by reducing its capacity), its capacity level is fixed, and its marginal cost is only w. Suppose we let b(ql IV) denote the best-response function of a firm with marginal cost IV. Then firm f's best-response function in stage 3 is b,(qElk/) = Min{b('/rlll'),k,}. Figure 12.66.1

'If

Firm Ts SI:!I.!C: J Profits = F

/

-

4,

Firm E's stage J best-response function after entry.

That is, firm f's best response to an output choice of qE by firm E is the same as that for a firm with marginal cost level was long as this output level does not exceed its previously chosen capacity. Figure 12.BB.2 illustrates firm I's best-response function. We can now put together the best-response functions for the two firms to determine the equilibrium in stage 3 following firm E's decision to enter, for any given level of k,. This equilibrium is shown in Figure 12.BB.3. In Figure 12.B8.3, point A is the outcome that would arise if there were no first-mover advantage for firm I, that is, if the two firms chose both their capacity and output levels simultaneously. However, when firm I is able to choose its capacity level first, by choosing an appropriate level of k" it can get the post-entry equilibrium to lie anywhere on firm E's best-response function up to point B. Firm I is able to induce points to the right of point A because its ability to incur its capacity costs prior to stage 3 competition allows it to have a marginal cost in stage 3 of only w, rather than IV + r. Note, however, that firm I cannot induce a point on firm 2's best-response function beyond point B, even though it might want to; if it built a capacity greater than level k., it would not have an incentive to actually use all of it. Figure 12.BB.4 depicts this situation. A threat to produce up to capacity following entry would in this case not be credible. Staye 3 olltcomes if firm E does not elller. If firm E decides not to enter, then firm I will be a monopolist in stage 3. Its optimal monopoly output is then the point where its best-response function hits the q£ = 0 axis, b,(Olk,).

------_.-----------------

Flgur. 12.BB.4

A stage 3 equilibrium in which firm I does not use all of its capacity.

426

C HAP T E R

1 2:

MAR K E T

POW E R

REF ERE NeE s

--------------------------------------------------------------------- ,---

Flgur. 12.88.7 (left)

b(q, I w + r)

Entry deterrence is possible but not

b(q, I w)

Figure 12.88.5 (left)

Blockaded entry. h(q, I II'

h(OI'\'

+ r)

+ r)

inevitable.

Figure 12.88.6 (right)

b(q,I.' + r)

Strategic entry accom,m?dation When q,

q,

Sraye 2: Firm E's Encr), Decision Firm E's entry decision is straightforward: Given the level of capacity k, chosen by firm I in stage I, firm E will enter if it expects nonnegative profits net of its entry cost F. This means that firm E will enter when it expects that the postentry equilibrium will lie to the left of point Z on its best-response function in Figure 12.BB.1.

Stayl' 1: Firm /'5 Stage 1 Capacity Illvestment Now consider firm I's optimal capacity choice in stage I. There are three situations in which firm I could find itself: Entry could be blockaded, entry could be inevitable, or entry deterrence could be possible but not inevitable. Let us consider each in turn.

Elllry is blockaded. One possibility is that the entry cost F is large enough that firm E does not find it worthwhile to enter even if firm I ignores the possibility of entry and simply builds the same capacity that it would if it were an uncontested monopolist, h(OI \I" + r). This situation, in which we say that entry is blockaded, is shown in Figure 12.BB.5. [n this case, firm [ achieves its best possible outcome: it builds a capacity of h(Olw + r), no entry occurs, and then it sells b(Olw + r) units of output. £111ry delerrellce is impossible: strategic entry accommodalion. Suppose that point Z is to the right of point B. [n this case, entry deterrencc is impossible; firm E will find it profitable to enter regardless of k,. What is firm I's optimal choice of k, in this casc') In Figure 12.BB.6, we have drawn isoprofit curves for firm [; note that because these include the cost of capacity, they are the isoprofit curves corresponding to those of a firm with marginal cost (w + r). Now recall that firm [ can induce any point on firm E's best-response function up to point B through an appropriate choice of capacity. It will choose the point that maximizes its profit. In Figure 12.BB.6, this point, which involves a tangency between firm E's best-response function and firm I's isoprofit curves, is denoted as point S. This outcome corresponds to exactly the outcome that would emerge in a model of sequential quantity choice, known as a Sracklebery leadership model (see Exercise 12.C.IS). Note that firm I's first-mover advantage allows it to earn higher profits than the otherwise identical firm E. The point of tangency, S, could also lie to the right of point B. [n this case, the optimal capacity choice will be k, = k., and the outcome will not be as desirable for firm I as the Stackleberg point. Here firm I is unable to credibly

entry

IS

q,

lOevitable.

b(q,l.'

+ r)

q,

commit to produce the output associated with point S, even if it builds sufficient capacity in stage I.

Emr.\" cit'lerrl!llCc is possihll! bIll nOI ineL'ilable. Suppose now that point Z lies to the left of point B but not so far that entry is blockaded, as shown in Figure 12.BB.7. Firm I can deter firm E's entry by picking a capacity level at least as large as point kl in the figure. The only question is whether this will be optimal for firm I, or whether firm [ is better off accommodating firm E's entry. To judge this, firm [ will compare its profits at point (kz,O) to those at point S (or at point B if point S lies to the right of B). This can be done by comparing the capacity level k, in Figure 12.BB.S, the output level under monopoly that gives the same profit as the optimal accommodation point S, with k z . If k. > k z , then firm I prefers to deter entry because its profits are higher in this case; but if k, < kl' then it will prefer accommodation. Note that if deterrence is optimal, then even though entry does not occur its threat nevertheless has an effect on the market outcome, raising the level of output and welfare relative to a situation in which no entry is possible. Exercise 12.IIB.I: Show that when entry deterrence is possible but not inevitable, if point S lies to the right of point Z, then entry deterrence is better than entry accommodation.

REFERENCES Abrell, D. (19S6). Extremal equilibria 191-225.

or

oligopolistic supl!rgamcs. J(lurtla/ of Economic Theory 39:

Abreu, D. (198H)' On the theory of infinitely repeated games with discounting. Econometrica 56: 383-96.

Abreu. D .. D. Pearce. and E. Stachctti. (1990). Toward a theory of discounted repeated games with imperfect monitoring. ECrHlOmt'trica 58; 1041-64. Baumol, W .• J. Panzar. and R. Willig. {I982}. Contestahft> Markers mid the Theory of Industry Structure. San Diego: Harcourt. Brace. Jovanovich. Bertrafld, 1. (tS~U). Thcorie mathematique de la nchesse sociale. Journal des Sal'Qllts 67: 499-508. Bulow. J., J. Geanakoplos. and P. Klcmpcrer. (1985). Mullimarket oligopoly: strategic substitutes and complements. Journal of Political Economy ~3: 488-511. Chamberlin. E. (1933). The Thear,\' oj Monopolistic Competition. Cambridge. Mass.: Harvard University Press.

Figure 12.88.8 (right)

Entry deterrence versus entry accommodation.

427

428

C HAP T E R

1 2:

MAR K E T

POW E R

Cournot, A. (IS)!S). R"dlt'rcill':i .\ur It's Prindpf!.\ Mml!i'mmlqul's df! la J"IIi'orit' dt's Riches.H's. [English edition. Rt':it'tlrdll's illto llll' Mar/lt'fPlulicu/ Pri/ldpies of Ihe Tllt'orr of Weulth. edited by N. Bacon London: Macmillan, 1X97.J Di.\it. A. (19S0). The role of investment in entry dctt:rrence. £("(lIIomic" Jounwl90: 95-106. Diut. A .. and J. E. Stiglitz. (1977). Monopolistic competition and optimal product diversity. AIII..,inllJ Economic Rt't"it'\I' 67; 297-308. Edg.('worth. F. (1~97). Me tc.:oria pura del nlOnoroitO. Gion",h- clt'f/Ii L/"{}//omi\(i 40. 13-31. lEnglish tr.ln:.l:ilillll: Th(' pure theory of monopoly. In PtI/'l'r.\ ReI,""I!/ '" 1'"lakul t:C/lllllfllj', Vol I. edited by F. Edgl.'worth. London: Mal.'millan. 1925.J I-"ricdman. J. (1971). A non-cooj1\:rativc equilibrium fllr supcrgamcs. Rt,!·it,\\· oj" EClJtlomic Slucllt'J 28: 1-12. Fudcnlxrg. D .. and E. Maskin. (19X6). The folk theorem in ~cpcated games with discounting or with il1l:ompkh! information. £collomt'trim 52; 533-54. FudL·n~rg. J), O. (a) Show that if r. :5 I, then the monopolist's optimal price is not well defined. (b) Assume that r. > I. Derive the monopolist's optimal price, quantily, and price-cost margin (p. - t")/pm. Calculate Ihe resuiting deadweighl welfare loss.

(c) (Harder) Consider a sequence of demand functions thai differ in their levels of r. and ~ but that all involve the same compelitive quantity x(e) [i.e., for each level of 1:, a is adjusted to keep x(c) the same]. How does the deadweighl loss vary with F.? (If you eannol derive an analytic answer. try calculating some values on a computer.) 12.8.3" Suppose Ihat we consider a monopolist facing demand function x(p,O) with cost fUIH.:tion c«(/. lfJ). where () and
o. Assume throughout that the monopolist's objective runction is concave in q and I. (a) Derive the first·order conditions for the monopolist's choices.

C

12.C.8 Consider a homogeneous-good J·firm Cournot model in which the demand function x(p) is downward sloping but otherwise arbitrary. The firms all have an identical Cost function c(q) that is increasing in q and convex. Denote by Q the aggregate output of the J firms, and let Q-, = L. •• i q•. (a) Show that firm j's best response can be written as b(Q _ j).

(b) Compare the monopolist's choices with those of a benevolent social planner who can control both q and 1 (a "first-best" comparison).

(b) Show that h(Q _ j) need not be unique (i.e., that it is in general a correspondence, not a function).

(e) Compare the monopolist's choices with those of a benevolent social planner who can cantrall but not q (a "second·best" comparison). Suppose that the planner chooses 1 and then the monopolist chooses q.

(e) Show that if Q_j > Q- i , q, E h(Q_j), and qjE b(Q -i)' then (4 j + Q-i) ~ (qj + Q_j). Deduce from this that h(') can jump only upward and that b'(Q _i) ~ -I whenever this derivative is defined.

12.B.10· Consider a monopolist that can choose both its product's price p and its quality q. The demand for its product is given by x(p, q), which is increasing in q and decreasing in p. Given the price chosen by the monopolist, does the monopolist choose the socially efficient quality level?

(d) Use you result in (e) to prove that a symmetric pure strategy Nash equilibrium exists in this model. (e) Show that multiple equilibria are possible. (f) Give sufficient conditions (they are very weak) for the symmetric equilibrium to be the only equilibrium in pure strategies.

12.C.IA In text. 12.C.2C Extend the argument of Proposition 12.C.1 to show that under the assumptions made in the text [in particular, the assumption that there is a price p < co such that x(p) = 0 for all p ~ p], both firms setting their price equal to c with certainty is the unique Nash equilibrium of the Rertrand duopoly model even when we allow for mixed strategies.

(a) Derive the Nash equilibrium of this model. Under what conditions does it involve only one firm producing? Which will this be' (b) When the equilibrium involves both firms producing. how do equilibrium outputs and profits vary when firm I's cost changes?

(a) Show that both firms naming prices equal to the smallest multiple of A that is strictly greater than c is a pure strategy equilibrium of this game. Argue that it does not involve either firm playing a weakly dominated strategy.

(e) Now consider the general case of J firms. Show that the ratio of industry profits divided by industry revenue in any (pure strategy) Nash equilibrium is exactly H 1£, where £ is the elasticity of the market demand curve at the equilibrium price and H, the Herfindahl index of concelltration, is equal to the sum of the firms' squared market shares L.i(q;lQ*)'. (Note: This result depends on the assumption of constant returns to scale.)

12.C.4" Consider altering the Bertrand duopoly model to a case in which each firm j's cost per unit is ('j and C 1 < C2' (a) What are the pure strategy Nash equilibria of this game?

-

12.C.9" Consider a two·firm Cournot model with constant returns to scale but in which firms' costs may differ. Let c j denote firm j's cost per unit of output produced, and assume that c, > c,. Assume also that the inverse demand function is p(q) = a - bq, with a > c,.

12.C.3" Note that the unique Nash equilibrium of the Rertrand duopoly model has each firm playing a weakly dominated strategy. Consider an alteration of the model in which prices must be named in some discrete unit of account (e.g., pennies) of size A.

(b) Argue that as A _ 0, this equilibrium converges to both firms charging prices equal to c.

431

,-------------------------------------------------------------~

12.C.10· Consider a J·firm Cournot model in which firms' costs differ. Let Cj(qi) = ajc(qj) denote firm j's cost function, and asSUme that c(·) is strictly increasing and convex. Assume that 17 1 >'" > (XJ.

432

CHAPTER

12:

MARKET

POWER

~~~--------------------------------------------------------(a) Show that if more than one firm is making positive sales in a Nash equilibrium of this model, then we cannot have productive efficiency; that is, the equilibrium aggregate output Q* is produced inefticiently.

--

(c) Provide an example in which wclfare decreases when a firm becomes more productive (i.e .. when ':X J falls for somej). [Him: Consider an improvement in cost for IIrm I in the model of Exercise 12.C.9.] Why can this happen?

12_C.16" Derive the Nash equilibrium prices and profits in the circular city model with J firms when travel costs are quadratic, as in Exercise 12.C.IS. Restrict attention to the case in which v is large enough that the possibility of non purchase can be ignored. What happens as J grows large! As I falls?

12.C.11 (' Consider a capacity-constrained duopoly pricing game. Firm j's capacity is 'Ij for j = 1,2, and it has a constant cost per unit of output of (' 0 and that there exists a price p such that x(f» = 'I, + 'I,. Suppose also that x(p) is concave. Let p(.) = x - '(.) denote the inverse demand function. Given a pair of prices charged. sales are dt:termined as follows: consumers try to buy at the low·priccd firm first. If demand exceeds this firm's capacity, consumers are servcd in order of their valuations. starting with high· valuation consumers. If prices are the same, demand is split evenly unless one firm's demand exceeds its capacity, in which case the extra demand spills over to the other firm. Formally, the firms' sales arc given by the functions x,(p" p,) and x,(p"p,) satisfying [x i(·) gives the amount firm i sells taking account of its capacity limitation in fullilling demand]

12,C.17" Consider the linear city model in which the two firms may have different constant unit production costs c, > 0 and c, > O. Without loss of generality, take c, OS; c, and suppose that I' is large enough that non purchase can be ignored. Determine the Nash equilibrium prices and sales levcls for equilibria in which both firms make strictly positive sales. How do local changes in c, affect the equilibrium prices and profits of firms I and 2? For what values of c, and c, docs the equilibrium involve one firm making no sales? 12.C.IH" (The SI"ckl"herli leaderxhip model) There are two firms in a market. Firm I is the -'leader" and picks its quantity lirst. Firm 2, the "follower," observes firm I's choi I' + 31 in the linear city model discussed in Example 12.C.2, a Jirm j's best response to any price of its rival p _i always results in all consumers purchasing

(e) It takes K periods to respond to a deviation.

frol11 one of the two firms. 12.C.l4

C

12.0.2" In text.

Consider the linear city model discussed in Example 11.C.2.

(a) Derivc the bcst·rcsponsc functions when equilibrium in this case is = p! = c + l.

l' E

I'r

(b) Repeat (a) for the case in which v E (c

+

(c

+

2f, c

I' -

+ 1')/2

(a) Under what conditions can the symmetric joint monopoly outputs (q" q,) = (q~/2, q~/2)

l/, c + 21).

l' < f + t. the unique Nash equilibrium involves prices of and some consumers not purchasing from either firm.

(e) Show that when (v

12.D.3" Consider an infinitely repeated Coumot duopoly with discount factor b < I, unit costs of c > 0, and inverse demand function p(q) = a - bq, with a > c and b > O.

+ 31). Show that the unique Nash

(d) Show that when v E (c + I, C + 11), the unique symmetric equilibrium is 112. Are there asymmetric equilibria in this case?

be sustained with strategies that call for (q~12, q~ 12) to be played if no one has yet deviated

pi = pi =

and for the single-period Cournot (Nash) equilibrium to be played otherwise?

pt

(b) Derive the minimal level of o. In stage 2. the demand function for firm j as a function of the price vector p = (p, •. .. PJ) of the J active firms is Xj(p) = '[i' -/I(Jp/LI P.)]. Analyze the welfare properties as the size (:;l) and the substitution (fl) parameters change. 12.G.l" Consider the linear inverse demand Cournot duopoly model and the linear city dilTerentiated-price duopoly model with differing unit costs that you examined in Exercises 12.C9 and 12.CI7. Find the derivative, with respect to a change in firm J's unit cost, affirm 2's equilibrium quantity in the Cournot model and equilibrium price in the linear city model. In which model is this change in firm 2's behavior beneficial to firm I? 12.AA.IA In text.

What happens to this loss as K ~ 0" 12.E,4" Consider a two-stage model of entry in which all potential entrants have a cost per unit of (" (in additional to an entry cost of K) and in which, whatever number of firms enter, a perfect cartel is formed. What is the socially optimal number of firms for a planner who cannot control this cartel behavior? What are the welfare consequences if the planner cannot

12.AA.2C Prove Proposition 12.AAA. [I/illl: Consider a strategy profile of the following form: the players arc to play an outcome path involving some pair (qt' q,) in period I and «Ii, qj) in every period thereafter. If either player deviates, this outcome path is restarted.] 12.BB.1A In text.

control entry? I2.E.SC Consider a two-stage entry model with a market that looks like the market in Exercise 12.CI6. The entry cost is K. Compare the equilibrium number of firms to the number that a planner would pick who can control (a) entry and pricing and (b) only entry. I2.E.6" Compare a one-stage and a two-stage model of entry with Cournot competition [all potential entrants arc identical and production costs arc c(q) = cq]. Argue that any (SPNE) equilibrium outcome of the two-stage game is also an outcome of the one-stage game. Show by example thal the reverse is not true. Argue that we cannot, however, have more firms active in the one-stage game than in the two-stage game. 12,E.7" Consider a one-stage entry model in which firms announce prices and all potential firms have average costs of AC('l) (including their. fixed setup costs) with a minimum average

=

12.1IB,2" Show that if the incumbent in the entry deterrence model discussed in Appendix II is indifTercnt between deterring entry and accommodating it, social wclfare is strictly greater if he chooses deterrence. Discuss generally why we might not be too surprised if entry deterrence could in somc cases raise social welfare. I2,BII.3 C Consider the linear city model of Exercise 12.C2 with v> (. + 31. Suppose that firm I enters the market first and can choose to set up either one plant at onc end of the city or two plants, one at each end. Each plant costs F. Then firm E decides whether to enter (for simplicity, restrict it to building one plant) and at which end it wants to locate its plant. Determine the equilibrium of this model. How is it affected by the underlying parameter values? Compare the welfare of this outcome with the welfare if there Were no entrant. Compare with the case where there is an entrant but firm I is allowed to build only one plant.

435

C

Adverse Selection, Signaling,

HAP

T

E

R

SEC T ION

13

13.A Introduction One of the implicit assumptions of the fundamental welfare theorems is that the characteristics of all commodities are observable to all market participants. Without this condition, distinct markets cannot exist for goods having differing characteristics, and so the complete markets assumption cannot hold. In reality, however, this kind of information is orten asymmetrically held by market participants. Consider the following three examples: (i) When a firm hires a worker, the firm may know less than the worker does about the worker's innate ability. (ii) When an automobile insurance company insures an individual, the individual may know more than the company about her inherent driving skill and hence about her probability of having an accident. (iii) In the used-car market, the seller of a car may have much better information about her car's quality than a prospective buyer does.

436

I N FOR MAT ION A l A S Y M MET R I E SAN 0

A 0 V E R S ESE LEe T ION

will be low. Moreover, this fact may even further exacerbate the adverse selection problem: If the price that can be received by selling a used car is very low, only sellers with really bad cars will ofTer them for sale. As a result, we may see little trade in markets in which adverse selection is present, even if a great deal of trade would occur were information symmetrically held by all market participants. We also introduce and study in Section 13.B an important concept for the analysis of market intervention in settings of asymmetric information: the notion of a coIISrmilled Pareto oprimal allocarioll. These are allocations that cannot be Pareto improved upon by a central authority who, like market participants, cannot observe individuals' privately held information. A Pareto-improving market intervention can be achieved by such an authority only when the equilibrium allocation fails to be a constrained Pareto optimum. In general, the central authority's inability to observe individuals' privately held information leads to a more stringent test for Paretoimproving market intervention. I n Sections 13.C and 13.0, we study how market behavior may adapt in response to these informational asymmetries. In Section \3.C, we consider the possibility that informed individuals may find ways to sigllill information about their unobservable knowledge through observable actions. For example, a seller of a used car could ofTer to allow a prospective buyer to take the car to a mechanic. Because sellers who have good cars are more likely to be willing to take such an action, this offer can serve as a signal of quality. In Section 13.0, we consider the possibility that uninformed parties may develop mechanisms to distinguish, or screen, informed individuals who have dilTcring information. For example, an insurance company may ofTer two policies: one with no deductible at a high premium and another with a significant deductible at a much lower premium. Potential insureds then self-selecr, with high-ability drivers choosing the policy with a deductible and low-ability drivers choosing the no-deductible policy. In both sections, we consider the welfare characteristics of the resulting market equilibria and the potential for Pareto-improving market intervention. For expositional purposes, we present all the analysis that follows in terms of the labor market example (i). We should nevertheless emphasize the wide range of settings and fields within economics in which these issues arise. Some of these examples are developed in the exercises at the end of the chapter.

and Screening

A number of questions immediately arise about these settings of asymmetric illjomlllrioll: How do we characterize market equilibria in the presence of asymmetric information? What are the properties of these equilibria? Are there possibilities for welfare-improving market intervention? In this chapter, we study these questions, which have been among the most active areas of research in microeconomic theory during the last twenty years. We begin, in Section 13.B, by introducing asymmetric information into a simple competitive market model. We see that in the presence of asymmetric information, market equilibria often fail to be Pareto optimal. The tendency for inefficiency in these settings can be strikingly exacerbated by the phenomenon known as adverse selecrioll. Adverse selection arises when an informed individual's trading decisions depend on her privately held information in a manner that adversely affects uninformed market participants. In the used-car market, for example, an individual is more likely to decide to sell her car when she knows that it is not very good. When adverse selection is present, uninformed traders will be wary of any informed trader who wishes to trade with them, and their willingness to pay for the product offered

1 3 • B:

437

----------------------------------------------------~----~~~

!

13.B Informational Asymmetries and Adverse Selection Consider the following simple labor market model adapted from Akerlof's (1970) pioneering work: I there arc many identical potential firms that can hire workers. Each produces the same output using an identical constant returns to scale teChnology in which labor is the only input. The firms arc risk neutral, seek to maximize their expccted prolits, and act as price takers. For simplicity, we take the price of the firms' output to equal I (in units of a numeraire good). Workers difTer in the number of units or output they produce if hired by a firm, I. Akcrlof (1970) used the example of a used-car market in which only the setler of a used car knows if the car is a "lemon." for this reason, (his type: of model is sometimes referred to as a It'mon.~ model.

I

1

438

C HAP T E R

1 3:

A 0 V E R 5 ESE L E C T ION

I

S I G N A LIN G,

AND

S CREE NI N G

SECTION

---------------------------------------------------------------which we denote by 0.' We let [Q, 0] c R denote the set of possible worker productivity levels, where 0 ~ Q< Ii < 00. The proportion of workers with productivity of 0 or less is given by the distribution function F(O), and we assume that F(') is nondegenerate, so that there are at least two types of workers. The total number (or, more precisely, measure) of workers is N. Workers seek to maximize the amount that they earn from their labor (in units of the numeraire good). A worker can choose to work either at a firm or at home, and we suppose that a worker of type 0 can earn r(O) on her own through home production. Thus, r(O) is the opportunity cost to a worker of type 0 of accepting employment; she will accept employment at a firm if and only if she receives a wage of at least r(O) (for convenience, we assume that she accepts if she is indifferen!).' As a point of comparison, consider first the competitive equilibrium arising in this model when workers' productivity levels are publicly observable. Because the labor of each different type of worker is a distinct good, there is a distinct equilibrium wage 11"(0) for each type O. Given the competitive, constant returns nature of the firms, in a competitive equilibrium we have IV'(O) = 0 for all 0 (recall that the price of their output is I), and the set of workers accepting employment in a firm is

13.8:

INFORMATIONAL

ASYMMETRIES

e(l\') =

~

{II: r(O)

Consider, next, the demand for labor as a function of IV. If a firm believes that the average productivity of workers who accept employment is fl, its demand for labor is given by if" < if I'

I\'

= I\'

if I' >

(13.B.3)

1\'.

Now, if worker types in set e' are accepting employment offers in a competitive equilibrium, and if firms' beliefs about the productivity of potential employees correctly renect the actual average productivity of the workers hired in this equilibrium, then we must have I' = £[0 I 0 E eo]. Hence, (I3.B.3) implies that the demand for labor can equal its supply in an equilibrium with a positive level of employment if and only if I\' = £[0 10 E eo]. This leads to the notion of a competitive equilibrium presented in Definition 13.B.1.

(I3.B.I)

(This is simply the total revenue generated by the workers'labor.)5 Aggregate surplus is therefore maximized by setting 1(0) = I for those () with r(O) ~ 0 and 1(0) = 0 otherwise (we again resolve indifference in favor of working at a firm). Put simply,

Definition 13.B.1: In the competitive labor market model with unobservable worker productivity levels, a competitive equilibrium is a wage rate w' and a set e' of worker types who accept employment such that

2. A worker's productivity could be random without requiring any change in the analysis th r and some with 0 < r. In this setting, the Pareto optimal allocation of labor has workers with 0 ~ r accepting employment at a firm and those with 0 < r not doing so. Now consider the competitive equilibrium. When r(O) = r for all 0, the set of workers who are willing to accept employment at a .given wage, e(w), is either [Q, 0] (if w ~ r) or 0 (if w < rl. Thus, £[010 E e(wl] = £[0] for all wand so by (I3.B.5) the equilibrium wage rate must be w' = £[0]. If £[0] ~ r, then al/ workers accept employment at a firm; if £[/1] < r, then none do. Which type of equilibrium arises depends on the relative fractions of good and bad workers. For example, if there is a high fraction of low-productivity workers then, because firms cannot distinguish good workers from bad, they will be unwilling to hire any workers at a wage rate that is sunicicnt to have them accept employment (i.e., a wage of at least r). On the other hand, if there arc very few low-productivity workers, then the average productivity of the workforce will be above r, and so the firms will be willing to hire workers at a wage that they arc willing to accept. In one case, too many workers arc employed relative to the Pareto optimal allocation, and in the other too few. The cause of this failure of the competitive allocation to be Pareto optimal is simple to see: because firms are unable to distinguish among workers of differing productivities, the market is unable to allocate workers efficiently between firms and home production"

--

SECTION

13.8:

INfORMATIONAL

ASYMMETRIES

AND

ADVERSE

SELECTION

441

I. . - - - - - - - - - - 45'

!

I I I I

£[/1]

~ !

£[Olr(O):5"]

I

Figure 13.B.1

I I I I

r@

A compelitive equilibrium wilh adverse selection.

'I'

not so; indeed, the market may fail completely despite the fact that every worker type should work at a lirm. To see the power of adverse selection, suppose that r(O)!> 0 for all 0 E [Q, 0] and that r(') is a strictly increasing function. The first of these assumptions implies that the Pareto optimal labor allocation has every worker type employed by a firm. The second assumption says that workers who are more productive at a firm arc also more productive at home. It is this assumption that generates adverse selection: Because the payolT of home production is greater for more capable workers, only less capable workers accept employment at any given wage "' [i.e., those with r(O) !> "l The expected value of worker productivity in condition (13.B.5) now depends on the wage rate. As the wage rate increases, more productive workers become willing to accept employment at a firm, and the average productivity of those workers accepting employment rises. For simplicity, from this point on, we assume that F(') has an associated density function [(.), with [(0) > for all 0 E [Q, 0]. This insures that the average productivity of those workers willing to accept employment, E[O 1 1'(0) !> w], varies continuously with the wage rate on the set"' E [r@,oo]. To determine the equilibrium wage, we use conditions (13.B.4) and (11B.5). Together they imply that the competitive equilibrium wage IV' must satisfy

°

Adverse Seieclioll and Markel Ullravelillg

A particularly striking breakdown in efficiency can arise when r(O) varies with O. In this case, the average productivity of those workers who arc willing to accept employment in a firm depends on the wage, and a phenomenon known as adverse select;oll may arise. Adverse selection is said to occur when an informed individual's trading decision depends on her unobservable characteristics in a manner that adversely alTects the uninformed agents in the market. In the present context, adverse selection arises when only relatively less capable workers are willing to accept a firm's employment offer at any given wage. Adverse selection can have a striking effect on market equilibrium. For example, it may seem from our discussion of the case in which r(O) = r for all 0 that problems arise for the Pareto optimality of competitive equilibrium in the presence of asymmetric information only if there are some workers who should work for a firm and some who should not (since when either 0 < r or Q> r the competitive equilibrium outcome is Pareto optimal). In fact, because of adverse selection, this is

w' = £[01 r(O) !> 11"].

(13.8.6)

We can use Figure I3.B.1 to study the determination of the equilibrium wage w·. There we graph the values of £[0 1r(O) !> w] as a function of w. This function gives the expected value of Ii for workers who would choose to work for a firm when the prevailing wage is II'. It is increasing in the level IV for wages between r(O) and r(ii), has a minimum value of Q when II' = r(q), and attains a maximum value ~f £[0] for IV ~ 1'(1)).7 The competitive equilibrium wage 11" is found by locating the wage rate at which this function crosses the 45-degree line; at this point, condition (13.B.6) is satisfied. The set of workers accepting employment at a firm is then e' = {o: r(O)!> w £[0] = ... marginally reduces her supply of tabor 10 a firm here, Ihe firm is made worse olT, in contrast with the situation in a competitive market with perfect information, where the wage exactly equals a worker's marg.inal productivity.

7. The figure does nol depici Ihis funclion for wages below r@. Because £[0] > r(q) in this model. no wage below r(q) can be an equilibrium wage under our assumption that £[/118(",) = 0] = £[0]. 8. For another diagrammatic determination of equilibrium, see Exercise 118.1.

J

442

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING.

AND

SECTION

SCREENING

13.B:

INFORMATIONAL

ASYMMETRIES

AND

ADVERSE

SELECTION

443

--------------------------------------------------------------------- ,--------------------------------------------------------------------45" 45'

I\"*

£[0]

,, ,,

£[/1]

:----r, , ,, ,,

= 0

£[Olr(O),; .. ]

£[Ulr(O),; w]

Complete market failure.

,, ,, ,,

r(lI) = ~

r(li)

,,

.

r(Q)

t

.

"'~ w~ w~ .'

We can see immediately from Figure I3.B.1 that the market equilibrium need not be ellicient. The problem is that to get the best workers to accept employment at a firm, we need the wage to be at least r(O). But in the case depicted, firms cannot break evcn at this wage because their inability to distinguish among different types of workers leaves them receiving only an expected output of £[0] < r(O) from each worker that they hire. The presence of enough low-productivity workers therefore forces thc wage down below r(ih, which in turn drives the best workers out of the market. But once the best workers are driven out of the market, the average productivity of the workforce falls, thereby further lowering the wage that firms are willing to pay. As a result, once the best workers are driven out of the market, the next-best may follow; the good may then be driven out by the mediocre. How far can this process go? Potentially very far. To see this, consider the case depicted in Figure 13.B.2, where we have r@ = and r(O) < 0 for all other O. There the equilibrium wage rate is w' = ~, and only type workers accept employment in the equilibrium. Because of adverse selection, essentially no workers are hired by firms (more precisely, a set of measure zero) even though the social optimum calls for all

q

q

to be hired,9 Example 13.8.1: To see an explicit example in which the market completely unravels let r(O) = 0:0, where 2 < 1, and let 0 be distributed uniformly on [0, 2]. Thus, r(q) = q (since (J = 0), and r(O) < 0 for 0 > 0. In this case, £[0 \ r(O) ,;; w] = (w/20:). For 0: > !, £[0 \ r(-O) ,;; 0] = 0 and £[0 \ r(O) ,;; w] < w for all w > 0, as in Figure 13.B.2.10 The competitive equilibrium defined in Definition 13.8.1 need not be unique. Figure 13.B.3, for example, depicts a case in whieh there are three equilibria with strictly positive employment levels. Multiple competitive equilibria can arise because there is virtually no restriction on the slope of the function £[0\ r(O) ,;; w). At any wage II'. this slope depends on the density of workers who are just indifferent about accepting employment and so it can vary greatly if this density varies.

9. In this equilibrium, every agent receives the same payoff as if the market were abolished: every firm earns zero and a worker of type 0 earns rIO) for all 0 (including 0 = q). 10. This example is essentially the one developed in Akeriof (1970). His example corresponds to the case): = ~.

Flgur. 13.B.2 (left)

Figure 13.B.3 (right)

Multiple eompetitiv, equilibria .

Note that the equilibria in Figure I3.B.3 can be Pareto ranked. Firms earn zero profits in any equilibrium, and workers are better off if the wage rate is higher (those workers who do not accept employment are indifferent; all other workers are strictly better off). Thus, the equilibrium with the highest wage Pareto dominates all the others. The low-wage, Pareto-dominated equilibria arise because of a coordination failure: the wage is too low because firms expect that the productivity of workers accepting employment is poor and, at the same time, only bad workers accept employment precisely because the wage is low.

A

GllI1lC- Theoret ic

Approach

The notion of competitive equilibrium that we have employed above is that used by Akerlof (1970). We might ask whether these competitive equilibria can be viewed as the outcome of a richer model in which firms could change their offered wages but choose not to in equilibrium. The situation depicted in Figure 13.B.3 might give you some concern in this regard. For example, consider the equilibrium with wage rate \\'!. In this equilibrium, a firm that experimented with small changes in its wage offer would find that a small increase in its wage, say to the level w' > \I'! depicted in the figure, would raise its profits because it would then attract workers with an average productivity of £[0\ r(O) S \\"] > 11". Hence, it seems unlikely that a model in which firms could change their offered wages would ever lead to this equilibrium outcome. Similarly, at the equilibrium involving wage II'f, a firm that understood the structure of the market would realize that it could earn a strictly positive profit by raising its offered wage to ",'. To be more formal about this idea, consider the following game-theoretic model: The underlying structure of the market [e.g., the distribution of worker productivities F(') and the reservation wage function r(')] is assumed to be common knowledge. Market behavior is captured in the following two-stage game: In stage 1, two firms simultaneously announce their wage offers (the restriction to two firms is without loss of generality). Then, in stage 2, workers decide whether to work for a firm and, if so, which one. (We suppose that if they are indifferent among some set of firms, then thcy randomize among them with equal probabilities.)" Proposition 13.B.1 characterizes the subgame perfect Nash equilibria (SPNEs) of this game for the adverse selection model in which r(') is strictly increasing with 1'(0) S () for all 0 E [Q, 0] and F(') has an associated density f(·) with f(O) > 0 for all () E [Q, 0]. Proposition 13.B.1: Let W' denote the set of competitive equilibrium wages for the adverse selection labor market model, and let w· = Max {w: WE W·}. (i) If w· > r(Q) and there is an r. > 0 such that E[O \ r(O) ,;; w'] > w' for all w' E (w* - r., w*), then there is a unique pure strategy SPNE of the two-stage game-theoretic mode\. In this SPNE, employed workers receive It. Note that if there is a single type of worker with productivity 0, this model is simply the labor market version of the Bertrand model of Section 12.C and has an equilibrium wage equal to 0, the competitive wage.

444

CHAPTER

13:

ADVERSE

SELECTION.

SIGNALING,

AND

SCREENING

a wage of w', and workers with types in the set 0(w') = (II; r(lI) :s w'} accept employment in firms. (ii) If w' = rIO), then there are multiple pure strategy SPNEs. However, in every pur~ strategy SPNE each agent's payoff exactly equals her payoff in the highest-wage competitive equilibrium. Proof: To begin, note that in any SPNE a worker of type II must follow the strategy of accepting employment only at one of the highest-wage firms, and of doing so if and only if its wage is at least r(O)." Using this fact, we can determine the equilibrium behavior of the firms. We do so for each of the two cases in turn. (i) 11" > rIO); Note, first, that in any SPNE both firms must earn exactly zero. To see this, supp;se that there is an SPNE in which a total of M workers arc hired at a wage IV and in which the aggregate profits of the two firms are

n=

M(£[Olr(O):s IV] - IV) > O.

Note that n > 0 implies that M > 0, which in turn implies that IV ~ r@. In this case, the (weakly) less-profitable firm, say firm j, must be earning no more than n/2. But firm j can earn profits of at least M(£[O I r(O) :s IV + IX] - IV - IX) by instead offering wage Ii' + ex for IX> O. Sinoe £[Olr(O):s 11'] is continuous in 11', these profits can be made arbitrarily close to by choosing IX small enough. Thus, firm j would be better off deviating, which yields a contradiction; we must therefore have :s O. Because neither firm can have strictly negative profits in an SPNE (a firm can always offer a wage of zero), we conclude that both firms must be earning exactly zero in any SPNE. From this fact, we know that if IV is the highest wage rate offered by either of the two firms in an SPNE, then either IV E W' (i.e., it must be a competitive equilibrium wage rate) or Ii' < r(O) (it must be so low that no workers accept employment). But suppose that IV < w,-= Max {II'; WE W'}. Then either firm can earn strictly positive expected profits by deviating and offering any wage rate 11" E (II" - e, 11"). We conclude that the highest wage rate offered must equal IV' in any SPNE. Finally, we argue that both firms naming 11" as their wage, plus the strategies for workers described above, constitute an SPNE. With these strategies, both firms earn zero. Neither firm can earn a positive profit by unilaterally lowering its wage because it gets no workers if it does so. To complete the argument, we show that £[0 I rIO) :s IV] < IV at every IV > 11", so that no unilateral deviation to a higher wage can yield a firm positive profits either. By hypothesis, IV" is the highest competitive wage. Hence, there is no IV> w' at which £[0 I rIO) :s 11'] = IV. Therefore, because £[0 I rIO) :s 11'] is continuous in IV, £[0 I riO) :s 11'] - IV must have the same sign for all 1\' > IV'. But we cannot have £[0 I rIO) :s IV] > II' for all II' > 11'" because, as W .... 00, £[(11 rIO) :s 11'] .... £[0], which, under our assumptions, is finite. We must therefore have £[0 I rIO) :s 11'] < II' at all IV> 11". This completes the argument for case (i). The assumption that there exists an e > 0 such that £[111 r(lI) :s 11"] > 11" for all IV' E (IV' - t, IV') rules out pathological cases such as that depicted in Figure 13.BA.

n

n

(ii) IV' = rIO); In this case, £[Olr(O):s IV] < '" for all IV> IV', so that any firm attracting workers at a wage in excess of "," incurs losses. Moreover, a firm must 12. Recall that we assume that a worker accepts employment whenever she is indilTerent.

---

SECTION

13.8:

INFORMATIONAL

ASYMMETRIES

AND

ADVERSE

SELECTION

445

,-----------------------------------------------------------, i

E[O)

~!

i, E[Olr(O) ~ .. ) ,,i!~ ,, ,,, ,, ,,

r@

r(ii)

earn exaclly zero by announcing any IV S; 11'". Hence, the set of wage offers (w" "'2) that can arise in an SPNE is {(w" w2 ): Wj:S 11'" for j = I, 2}. In everyone of these SPNEs, all agents earn exactly what they earn at the competitive equilibrium involving wage rate IV"; both firms earn zero, and a worker of type 0 earns rIO) for all liE [q, /1]. • One difference between this game-theoretical model and the notion of competitive equilibrium specified in Definition I3.B.1 involves the level of firms' sophistication. In the competitive equilibria of Definition I3.B.I, firms can be fairly unsophisticated. They need know only the average productivity level of the workers who acoept employment at the going equilibrium wage; they need not have any idea of the underlying market mechanism. In contrast, in the game-theoretic model, firms understand the entire structure of the market, including the full relationship that exists bet ween the wage rate and the quality of employed workers. The game-theoretic model tells us that if sophisticated firms have the ability to make wage offers, then we break the coordination problem described above. If the wage is too low, some firm will find it in its interest to offer a higher wage and attract better workers; the highest-wage competitive outcome must then arise.')

COllstrained Pareto Optima alld Market intervention We have seen that the presence of asymmetric information often results in market equilibria that fail to be Pareto optimal. As a consequence, a central authority who knows all agents' private information (e.g., worker types in the models above), and can engage in lump-sum transfers among agents in the economy, can achieve a Pareto improvement over these outcomes. In practice, however, a central authority may be no more able to observe agents' private information than are market participants. Without this information, the authority will face additional constraints in trying to achieve a Pareto improvement. For example, arranging lump-sum transfers among workers of different types will be impossible because the authority cannot observe workers' types directly. For Pareto-improving market intervention to be possible in this case, a more stringent test must therefore be passed. An allocation that cannot be Pareto improved by an J 3. See Exercise 13.8.6, however, for an example of a model of adverse selection in which, for some parameter values, the highest-wage competitive equilibrium is not an SPNE of our gametheoretic model.

Figure 13.8.4

A pathologicat exampte.

446

CHAPTER

'3:

ADVERSE

SELECTION.

SIGNALING.

AND

SCREENING

5 E C T ION

, 3 • B:

I N FOR MAT ION A L A . V M MET A IE.

AND

A D V E A S ESE l E C T ION

447

---------------------------------------------------------------------- ------------------------------------------------------------------authority who is unable to observe agents' private information is known as a constrained (or second-best) Pareto optimum. Because it is more difficult to generate a Pareto improvement in the absence of an ability to observe agents' types, a constrained Pareto optimal allocation need not be (fully) Pareto optimal [however, a (full) Pareto optimum is necessarily a constrained Pareto optimum]. Here, as an example, we shall study whether Pareto-improving market intervention is possible in the context of our adverse selection model (where r(') is strictly increasing with r(O) S 0 for all 0 E [Q, Ii] and F(') has an associated density f(·) with f(O) > 0 for all 0 E [~, 0]) when the central authority cannot observe worker types. That is, we study whether the competitive equilibria of this adverse selection model are constrained Pareto optima. In general, the formal analysis of this problem uses tools that we develop in Section 14.C in our study of principal-agent models with hidden information (see, in particular, the discussion of monopolistic screening). As these techniques have yet to be introduced, we shall not analyze this problem fully here. (Once you have studied Section 14.C, however, refer back to the discussion in small type at the end of this section.) Nevertheless, we can convey much of the analysis here. By way of motivation, note first that in examining whether a Pareto improvement relative to a market equilibrium is possible, we might as well simply think of intervention schemes in which the authority runs the firms herself and tries to achieve a Pareto improvement for the workers (the firms' owners will then earn exactly what they were earning in the equilibrium, namely zero profits). Second, because the authority cannot distinguish directly among different types of workers, any differences in lump-sum transfers to or from a worker can depend only on whether the worker is employed (the workers otherwise appear identical). Thus, intuitively, there should be no loss of generality in restricting attention to interventions in which the authority runs the firms herself, offers a wage of w. to those accepting employment, an unemployment benefit of w. to those who do not [these workers also receive r(O)]. leaves the workers free to choose whether to accept employment in a firm, and balances her budget. (In the small-type discussion at the end of this section, we show formally that this is the case.) Given this background, can the competitive equilibria of our adverse selection model be Pareto-improved upon in this way? Consider, first, dominated competitive equilibria, that is, competitive equilibria that are Pareto dominated by some other competitive equilibrium (e.g., the equilibrium with wage rate wf shown in Figure 13.B.3). A central authority who is unable to observe worker types can always implement the best (highest-wage) competitive equilibrium outcome. She need only set w, = w', the highest competitive equilibrium wage, and w. = O. All workers in set 0( w') then accept employment in a firm and, since w' = E[O 1r(/I) S w'], the authority exactly balances her budget." Thus, the outcome in such an equilibrium is not a constrained Pareto optimum. In this case, the planner is essentially able to step in and solve the coordination failure that is keeping the market at the low-wage equilibrium. 14. An equivalent but less heavy·handed intervention would have the authority simply require any operating firm to pay a wage rate equal to w*. Firms will be willing to remain

operational because they break even at this wage rate, and a Pareto improvement results.

What about the highest-wage competitive eqUilibrium (i.e., the SPNE outcome in the game-theoretic model of Proposition 13.B.1)? As Proposition 13.B.2 shows, any such equilibrium is a constrained Pareto optimum in this model. proposition 13.B.2: In the adverse selection labor market model (where r(') is strictly increasing with rIO) S /I for all /I E [0, Ii] and F(') has an associated density (.) with (0) > 0 for all 0 E [Q.O]). the- highest-wage competitive equilibrium is a constrained Pareto optimum. Proof: If all workers are employed in the highest wage competitive equilibrium then the outcome is fully (and, hence, constrained) Pareto optimal. So suppose some are not employed. Note, first, that for any wage w. and unemployment benefit w. offered by the central authority the set of worker types accepting employment has the form [Q, 0] for some 0 [it is {O: .... + r(O) S IV.}]. Suppose, then,that the authority attempts to implement an outcome in which worker types 0 s ~ for bE [Q, Ii] accept employment. To do so, she must choose .... and w. so that

"'. + r(O) = w•. In addition, to balance her budget, IV, and IV. must also satisfy" IV,F(O) + IV.(I - F(O» =

f

Of(O) dO.

(13.B.7)

(I3.B.8)

Substituting into (13.B.7) from (13.B.8), we find that. given the choice of 0, the values of w .. and WI' must be ",.(0) =

and

"',(0) =

r r

Of(O) dO - r(O)F(O)

(l3.B.9)

Of(O) dO + r(O)(1 - F(O)),

(13.B.10)

or, equivalently, IV.(O) = F(O)(E[OIO

s

/I] -

r(9»

w,(O) = F(O)(E[O lOs 9] - riO»~

(13.B.11)

+ r(O).

(l3.B.12) Now, let 0* denote the highest worker type who accepts employment in the highest-wage competitive equilibrium. We know that r(O') = E[/il 0 SO']. Hence, from conditions (13.B.II) and (I3.B.12), we see that w.(O') = 0 and w.(O') = r(O'). Thus, the outcome when the authority sets 0 = 0' is exactly the same as in the highest-wage competitive equilibrium. We now examine whether a Pareto improvement can be achieved by setting 6 ¥ 0*. Note that for any 0 E [Q, 0] with 6 ¥ 0', type Q workers are worse off than in the equilibrium if .... (0) < r(O') [r(O') is their wage in the equilibrium] and type workers are worse off if "',(Ii) < O. Consider 0 < 0* first. Since r(O*) > r(O), condition (l3.B.lO) implies that

o

w.(O)

s

r

Of(O) dO

+ r(O')(1

- F({J)),

15. The authority will never wish to run a budget surplus. If "". and w .. lead to a budget surplus, then setting w = "' .. +,; and wf' = WI' + (; for some I: > 0 is budget feasible and is Pareto superior. (Note that the set of workers accepting employment would be unchanged.) lI

448

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SCREENING

--------------------------------------------------------------------------and so

11',(0) - ,(0*) :s; F(O)(E[OI 0 :s; 0] - ,(0*») = F(O)(E[OIO:s;

0] -

E[OIO:s; 0*])

< O. Thus, type 0 workers must be made worse off by any such intervention. Now cO;lsider 0 > 0*. We know that E[O I,(0) :s; 11'] < II' for all II' > 11'* (see the proof of Proposition 13.8.1). Thus, since ,(0*) = 11'* and ,(-) is strictly increasing, we have £[0 I ,(0) :s; ,(0)] < ,(ti) for all Ii> 0*. Moreover,

E[OI'(O):s; ,(0)]

= E[OIO:s; 0],

and so £[0 I 0 :s; 0] - ,(0) < 0 for all 0> 0*. But condition (l3.B.II) then implies that 11',(6) < 0 for all 0 > 0*, and so type ii workers are made worse off by any such intervention. _ Hence, when a central authority cannot observe worker types, her options may be severely limited. Indeed, in the adverse selection model just considered, the authority is unable to create a Pareto improvement as long as the highestwage competitive equilibrium (the SPNE outcome of the game-theoretic model of Proposition 13.B.I) is the market outcome. " More generally, whether Paretoimproving market intervention is possible in situations of asymmetric information depends on the specifics of the market under study (and as we have already seen, possibly on which equilibria result). Exercises 13.B.8 and 13.B.9 provide two examples of models in which the highest-wage competitive equilibrium may fail to be a constrained Pareto optimum. Although it is impossible to Pareto improve a constrained Pareto optimal allocation, market inlcrvcntion could still be justified in the pursuit of distributional aims. For example, if social welfare is given by the sum of weighted worker utilities

r

[1(0)0

SECTION

13.8:

INFORMATIONAL

ASYMMETRIES

- I(O»,(O)]i.(O) dF(O),

(13.B.13)

where i.(0) > 0 for all 0, then social welfare may be increased even though some worker types end up worse off. In the applied literature, for example, it is common to see aggregate surplus used as the social welfare function, which is equivalent to the choice of i.(O) = N for all 0." When society has this social welfare function, social welfare can be raised relative to the competitive equilibrium in Figure 13.B.1 (which, by Proposition 13.B.2, is a constrained Pareto optimum) simply by mandating that all workers must work for a firm and that all firms must

16. Proposition 13.8.2 Can also be readily generalized to allow r(O) > 0 for some O. (See Exercise I3.B.IO.) 17. Note that when types cannot be observed. aggregate surplus is no longer a valid weHare measure for any social welfare function because, unlike the case of perfect information, lump~sum

transfers across worker types are infeasible. (See Section 1O.E for a discussion of the need for lump-sum transfers to justify aggregate surplus as a welfare measure for any social welfare function,)

ADVERSE

pay workers a wage of £(0). Although workers of type ii are made worse off by this intervention, welfare as measured by aggregate surplus increases.'· An interesting interpretation of the choice of aggregate surplus as a social welfare function is in tcrms of an unborn worker's ex ante expected utility. In particular, imagine that each worker originally has a probability /(0) of ending up a type 0 worker. If this unborn worker is risk neutral, then her ex ante expected utility is exactly equal to expression (13.B.13) with i.(O) = I for all O. Thus, maximization of aggregate surplus is equivalent to maximization of this unborn worker's expected utility. We might then say that an allocation is an ex ante COII.wrailJeti Pareto optimum in this model ir, in the absence of an ability to observe worker types, it is impossible to devise a market intervention that raises aggregate surplus. We see, therefore. thai whether an allocation is a constrained optimum (and. thus, whether a planned intervention leads to a Pareto improvement) can depend on the point at which the welrare evaluation is conducted (i.e., before the workers know their types, or after)'· Lei us now use the techniques of Section 14.C 10 show formally that we can restrict attention in searching for a Pareto improvement to interventions of the type considered above. We shall look for a Pareto improvement for the workers keeping the profits of the firms' owners nonnegative. For notational simplicity, we shall treat the firms as a single aggregate firm. By the revelation principle (see Section 14.C), we know that we can restrict attention to direct rcvcl;.ttion mechanisms in which every worker type tells the truth, Here a direct revelation mechanism assigns, for each worker type () E [q, 0], a payment from the authority to the worker of ",(Ii) E R, a lax t(O) paid by the firm to the authority, and an employment decision /(Ii) E 10, I:. The sci of feasible mechanisms here are those that satisfy the illdividual ratiollality c0I1s1raillf for the firm.

r

[/(0)0 - t(O)] dF(O)
(13.B.14)

the hwly£'[ hulcmce i'olldilion for the central authority,

f

[t(O) - ...(0)] dF(O)
(13.B.15)

and the lrur/Hellillf} (or ;'Icentivt! compatibilil)', or seltse/(-ftiml) constraints that say that for

all 0 and

iJ 1\'(0) + (I - 1(0»,.(0)
(I3.B.l7)

I R. Moreover. bec 0, c.. (e, 0) > 0, c.(e,O) < 0 for all e > 0, and c.. (e, 0) < 0 (subscripts denote partial derivatives). Thus, both the cost and the marginal cost of education are assumed to be lower for high-ability workers; for example, the work required to obtain a degree might be easier for a high-ability individual. Letting U(IV, e 10) denote the utility of a type 0 worker who chooses education level e and receives wage IV, we take U(IV, e I0) to equal her wage less any educational costs incurred: U(IV, e 10) = w - c(e, 0). As in Section 13.B, a worker of type 0 can earn r(O) by working at home.

Random mOve of

Flgure13.C.l

worker Iype.

The ex.tensive form of the education signaling

,.A,,""------------ nature determines

game. , ; : : . - - - - - - - - - - - - - - - " " : I l.....I ' - - - - Worker chooses

education level contingent on her Iype (really a continuous choice),

Conditional on seeing a level of t. say".

firms make wage offers simultaneously (really a continuous choice),

Worker decides which olTer 10 accept. if any. }

J

452

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SECTION

SCREENING

--------------------------------------------------------------------

13.C:

SIGNALING

453

~---------------------------------------------------------------

Note that, in contrast with the model of Section 13.B, here we explicitly model only a single worker of unknown type; the model with many workers can be thought of as simply having many of these single-worker games going on simultaneously, with the fraction of high-ability workers in the market being).. In discussing the equilibria of this game, we often speak of the "high-ability workers" and "low-ability workers," having the many-workers case in mind. The equilibrium concept we employ is that of a weak perfect Bayesian equilibrium (see Definition 9.C.3), but with an added condition. Put formally, we require that, in the game tree depicted in Figure 13.C.l, the firms' beliefs have the property that, for each possible choice of e, there exists a number /lie) E [0, 1] such that: (i) firm I's belief that the worker is of type 011 after seeing her choose e is /lie) and (ii) after the worker has chosen e, firm 2's belief that the worker is of type 011 and that firm I has chosen wage offer w is precisely /l(e)ur(w I e), where ur(w I e) is firm I's equilibrium probability of choosing wage offer IV after observing education level e. This extra condition adds an element of commonality to the firms' beliefs about the type of worker who has chosen e, and requires that the firms' beliefs about each others' wage offers following e are consistent with the equilibrium strategies both on and off the equilibrium path. We refer to a weak perfect Bayesian equilibrium satisfying this extra condition on beliefs as a perfect Bayesian equilibrium (PBE). Fortunately, this PBE notion can more easily, and equivalently, be stated as follows: A set of strategies and a belief function /,(e) E [0, 1] giving the firms' common probability assessment that the worker is of high ability after observing education level e is a PBE if

011 ------------------------

Flgur. 13.C.2 (left)

Indifference curves for high- and low·ability workers: the single-crossing

-----------------------

()I.

property. Flgur. 13.C.3 (rlghl)

o Knowing this fact, we turn to the issue of the worker's equilibrium strategy, her choice of an education level contingent on her type. As a first step in this analysis, it is useful to examine the worker's preferences over (wage rate, education level) pairs. Figure IlC.2 depicts an indifference curve for each of the two types of workers (with wages measured on the vertical axis and education levels measured on the horizontal axis). Note that these indilTerence curves cross only once and that, where they do, the indifference curve of the high-ability worker has a smaller slope. This property of preferences, known as the single-crossing property, plays an important role in the analysis of signaling models and in models of asymmetric information more generally. It arises here because the worker's marginal rate of substitution between wages and education at any given (IV, e) pair is (dIVide)" = e.(e, 0), which is decreasing in 0 because (" ... (e, II) < O. We can also graph a function giving the equilibrium wage offer that results for each education level, which we denote by wee). Note that since in any PBE wee) = /,(e)1I1I + (I - /,(e»Ol for the equilibrium belief function /lie), the equilibrium wage offer resulting from any choice of e must lie in the interval [0,.,0,,]. A possible wage offer function w(e) is shown in Figure 13.C.3. We are now ready to determine the equilibrium education choices for the two types of workers. It is useful to consider separately two different types of equilibria that might arise: separating equilibria, in which the two types of workers choose different education levels, and pooling equilibria, in which the two types choose the same education level.

(i) The worker's strategy is optimal given the firm's strategies. (ii) The belief function /lie) is derived from the worker's strategy using Bayes' rule where possible. (iii) The firms' wage offers following each choice e constitute a Nash equilibrium of the simultaneous-move wage offer game in which the probability that the worker is of high ability is /l(e).20 In the context of the model studied here, this notion of a PBE is equivalent to the sequential equilibrium concept discussed in Section 9.C. We also restrict our attention throughout to pure strategy equilibria. We begin our analysis at the end of the game. Suppose that after seeing some education level e, the firms attach a probability of /,(e) that the worker is type 0Il' lf so, the expected productivity of the worker is /l(e)OI/ + (1 - /l(e»Ol' In a simultaneous-move wage offer game, the firms' (pure strategy) Nash equilibrium wage offers equal the worker's expected productivity (this game is very much like the Bertrand pricing game discussed in Section 12.C). Thus, in any (pure strategy) PBE, we must have both firms offering a wage exactly equal to the worker's expected productivity, /l(e)OIl + (I - /,(e»Ol'

Separating Equilihria To analyze separating equilibria, let e*(tI) be the worker's equilibrium education choice as a function of her type, and let 1V*(e) be the firms' equilibrium wage offer as a function of the worker's education level. We first establish two useful lemmas. Lemma 13.C.1: In any separating perfect Bayesian equilibrium, w*(e*(OH» = 0H and w*(e*(OLl) = 0L: that is, each worker type receives a wage equal to her productivity level. Proof: In any PBE, beliefs on the equilibrium path must be correctly derived from the equilibrium strategies using Bayes' rule. Here this implies that upon seeing education level e*(Ol)' rlrms must assign probability one to the worker being type 0,.. Likewise, upon seeing education level e*(OIl)' firms must assign probability one

20. Thus, the extra condition we add imposes equilibrium-like play in parts of the tree off the equilibrium path. See Section 9.C for a discussion of the need to augment the weak perfect Bayesian equilibrium concept to achieve this end.

J

A wage schedule.

454

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

--- --

SCREENING

Type 0" II'

Type Ot

qJ ,// i

,, , ,,

. I

/

0" ""(to)

.--

I

(/,.

._._._._._.--;-----------

I

I

I I

I

,,

I

e

11

11

e'(O,.)

e'(O,,)

(h

and 0",

Figure 13.C.4 (left)

Low-ability worker's outcome in a

Lemma 13.C.2: In any separating perfect Bayesian equilibrium, e'(Otl = 0; that is, a low-ability worker chooses to get no education. Proof: Suppose not, that i~, that when the worker is type OL' she chooses some strictly positive education level e ::. O. According to Lemma 13.C.I, by doing so, the worker receives a wage equal to 0L' However, she would receive a wage of at least 0,_ if she instead chose e = O. Since choosing e = 0 would have save her the cost of education, she would be strictly better off by doing so, which is a contradiction to the assumption that > 0 is her equilibrium education level. _

e

Lemma 13.C.2 implies that, in any separating equilibrium, type O,.'s indifference curve through her equilibrium level of education and wage must look as depicted in Figure 13.CA. Using Figure 13.CA, we can construct a separating equilibrium as follows: Let c'(O,,) = ii, let e'(Od = 0, and let the schedule ",'(e) be as drawn in Figure 13.C.5. The firms' equilibrium beliefs following education choice e are JI'(e) = (w'(e) - Od/(Oll - Od. Note that they satisfy I,'(e) E [0, I] for all e;::o; 0, since \\,'(e) E [OL' 0,,]. To verify that this is indeed a PBE, note that we are completely free to let firms have any beliefs when e is neither 0 nor On the other hand, we must have JI(O) = 0 and Il(e) = 1. The wage offers drawn, which have ",'(0) = 0,- and ""(e) = 0", renect exactly these beliefs. What about the worker's strategy? It is not hard to see that, given the wage function IV'(e), the worker is maximizing her utility by choosing e = 0 when she is type 0L and by choosing e = when she is type 0". This can be seen in Figure 13.C.S by noting that, for each type that she may be, the worker's indifference curve is at its highest-possible level along the schedule ""(e). Thus, strategies [e'(O), ""(e)] and the associated beliefs JI(e) of the firms do in fact constitute a PBE. Note that this is not the only PBE involving these education choices by the two types of workers. Because we have so much freedom to choose the firms' beliefs off the equilibrium path, many wage schedules can arise that support these education

e.

0,.

/~r'\" I

"'(e)

L/ I --·---r : -~:'~------i--------t----I I I I I

I I , , ,

I

o

to the worker being type 0". The resulting wages are then exactly respectively. _

/

'-.

----------.----------,

"'(e'(O,.)) = 01.

,,

e

A separating equilibrium with the

"'(e)

------------~----------

separating equilibrium.

Figure 13.C.S (right)

A separating equilibrium: Type is inferred from education level.

SIGNALING

Figure 13.C.6 (Iell)

(/"

. . . . . ·*'

I

13.C:

Type O.

'I"~---------

8" ------------ ,----------

,, ,, ,, ,

SECTION

"'(0,.)

11

"

r'(O,,)

.'«(/,,)

11

choices. Figure 13.C.6 depicts another one; in this PBE, firms believe that the worker is certain to be of high quality if c ;::0; i' and is certain to be of low quality if e < e. The resulting wage schedule has ""(e) = 0" if e ;::0; ii and ""(e) = OL if e < ii. In these separating equilibria, high-ability workers arc willing to get otherwise useless education simply because it allows them to distinguish themselves from low-ability workers and receive higher wages. The fundamental reason that education can serve as a signal here is that the marginal cost of education depends on a worker's type. Because the marginal cost of education is higher for a low-ability worker [since c,.• (e, 0) < 0], a type 0" worker may find it worthwhile to get some positive level of education e' > 0 to raise her wage by some amount "'"' > 0, whereas a type OL worker may be unwilling to get this same level of education in return for the same wage increase. As a result, firms can reasonably come to regard education level as a signal of worker quality. The education level for the high-ability type observed above is not the only one that can arise in a separating equilibrium in this model. Indeed, many education levels for the high-ability type arc possible. In particular, any education level between i! and el in Figure 13.C.7 can be the equilibrium education level of the high-ability workers. A wage schedule that supports education level e'(O,,) = e, is depicted in the figure. Note that the education level of the high-ability worker cannot be below in a separating equilibrium because, if it were, the low-ability worker would deviate and pretend to be of high ability by choosing the high-ability education level. On the other hand, the education level of the high-ability worker cannot be above €, because, ifit were, the high-ability worker would prefer to get no education, even if this resulted in her being thought to be of low ability. Note that these various separating equilibria can be Pareto ranked. In all of them, firms cam zero profits. and a low-ability worker's utility is 0,.. However, a high-ability worker does strictly better in equilibria in which she gets a lower level of education. Thus, separating equilibria in which the high-ability worker gets education level e (e.g., the equilibria depicted in Figures 13.C.S and 13.C.6) Pareto dominate all the others. The Pareto-dominated equilibria are sustained because of the high-ability worker's fear that if she chooses a lower level of education than that prescribed in the equilibrium firms will believe that she is not a 'high-ability worker. These beliefs can be maintained because in equilibrium they are never disconfirmed.

e

same education choices as in Figure 13.C.S but different

off-equilibriumpath beliefs. Figure 13.C.7 (right)

A separating equilibrium with an

education choice

> e by high-ability workers.

e"(OH)

455

456

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SCREENING

.

w

Figure 13.C.8

~TypeO.

E[O) --- --------------------

0, ------------------------

(a)

SEC T ION

1 3 • C:

S I G N A LIN G

457

----- ,------------------------------------------------------------------------

0, -------------------------

(b)

\I'

Separating equilibria may be Pareto dominated by the no·signaling OUlcom (a) A. separating e equlhbnum that is nOI Pareto domInated by the no,s'gnaling

w

Type 0.

0" ------ ------~~~~:-~~

£[11]

,,--.~

I I I

/:

I

V~

outcome.

-----~--------------- I

I

(b) A separating equilibrium that is Pareto dominated by the nO'signaling

0,

.-.,,/

.. '(e)

\

------r---------------

I

I I

t"

I I I

e'

"

outcome.

"~(Oil; j

Figure 13.C,10 (right)

= L, H

The only remaining issue therefore concerns what levels of education can arise in a pooling equilibrium, It turns out that any education level between 0 and the level e' depicted in Figure 13.C.9 can be sustained. Figure 13.C.1O shows an equilibrium supporting education level e', Given the wage schedule depicted, each type of worker maximizes her payoff by choosing education level e', This wage schedule is consistent with Bayesian updating on the equilibrium path because it gives a wage offer of £[0] when education level e' is observed, Education levels between 0 and e' can be supported in a similar manner. Education levels greater than e' cannot be sustained because a low-ability worker would rather set e = 0 than e > e' even if this results in a wage payment of OL' Note that a pooling equilibrium in which both types of worker get no education Pareto dominates any pooling equilibrium with a positive education leveL Once again, the Pareto-dominated pooling equilibria are sustained by the worker's fear that a deviation will lead firms to have an unfavorable impression of her ability. Note also that a pooling equilibrium in which both types of worker obtain no education results in exactly the same outcome as that which arises in the absence of an ability to signal. Thus, pooling equilibria are (weakly) Pareto dominated by the no-signaling outcome,

It is of interest to compare welfare in these equilibria with that arising when worker types arc unobservable but no opportunity for signaling is available. When education is not available as a signal (so workers also incur no education costs), we arc back in the situation studied in Section I3.B. In both cases, firms earn expected profits of zero. However, low-ability workers are striclly worse off when signaling is possible. I n both cases they incur no education costs, but when signaling is possible they receive a wage of OL rather than £(0). What about high-ability workers~ The somewhat surprising answer is that high-ability workers may be either better or worse off when signaling is possible. In Figure 13.C.8(a), the high-ability workers are better off because of the increase in their wages arising through signaling. However, in Figure I3.C.8(b), even though high-ability workers seek to take advantage of the signaling mechanism to distinguish themselves, they are worse off than when signaling is impossible! Although this may seem paradoxical (if high-ability workers choose to signal, how can they be worse olP), its cause lies in the fact that in a separating signaling equilibrium firms' expectations are such that the wage-education outcome from the no-signaling situation, (w, e) = (£[0],0), is no longer available to the high-ability workers; if they get no education in the separating signaling equilibrium, they are thought to be of low ability and offered a wage of 0L' Thus, they can be worse off when signaling is possible, even though they are choosing to signal. Note that because the set of separating equilibria is completely unaffected by the fraction i. of high-ability workers, as this fraction grows it becomes more likely that the high-ability workers are made worse off by the possibility of signaling [compare Figures 13.C.8(a) and 13.C.8(b)]. In fact, as this fraction gets close to I, nearly every worker is getting costly education just to avoid being thought to be one of the handful of bad workers!

Multiple Equilibria alld Equilibrium Refinemelll The multiplicity of equilibria observed here is somewhat disconcerting, As we have seen, we can have separating equilibria in which firms learn the worker's type, but we can also have pooling equilibria where they do not; and within each type of equilibrium, many different equilibrium levels of education can arise, In large part, this multiplicity stems from the great freedom that we have to choose beliefs off the equilibrium path, Recently, a great deal of research has investigated the implications of pulling "reasonable" restrictions on such beliefs along the lines we discussed in Section 9.D, To see a simple example of this kind of reasoning, consider the separating equilibrium depicted in Figure 13,C.7, To sustain e l as the equilibrium education level of high-ability workers, firms must believe that any worker with an education level below e I has a positive probability of being of type OL' But consider any education level eE (e, el)' A type OL worker could never be made better off choosing such an education level than she is getting education level e = 0 regardless of what

Poolillg Equilibria Consider now pooling equilibria, in which the two types of workers choose the same level of education, e'(OL) = e'(O/l) = eO. Since the firms' beliefs must be correctly derived from the equilibrium strategies and Bayes' rule when possible, their beliefs when they see education level e' must assign probability). to the worker being type 0/1' Thus, in any pooling equilibrium, we must have w'(e') = ).0/1 + (I - i.)OL = £[0]. I

J

(Ienl The highest-possible education level in a pooling equilibrium.

Figure 13.C.9

A pooling equilibrium.

458

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SCREENING

firms believe about her as a result. Hence, any belief by firms upon seeing education level e > e other than !l(e) = I seems unreasonable, But if this is so, then we must have w(e) = 9", and so the high-ability worker would deviate to e. In fact, by this logic, the only education level that can be chosen by type 0" workers in a separating

---

SECTtON

"'II'

wage of If so, low-ability workers would choose e = 0 and high-ability workers would choose e = ell' This alternative outcome involves firms incurring losses on low-ability workers and making profits on high-ability workers. However, as long as the firms break even on average, they are no worse off than before and a Pareto improvement has been achieved. The key to this Pareto improvement is that the central authority introduces cross-subsidization, where high-ability workers are paid less than their productivity level while low-ability workers are paid more than theirs, an outcome that cannot occur in a separating signaling equilibrium. (Note that the outcome when signaling is banned is an extreme case of cross-subsidization.)

equilibrium involving reasonable beliefs is e. In Appendix A we discuss in greater detail the use of these types of reasonablebeliefs refinements. One refinement proposed by Cho and Kreps (1987), known as the illluirive criterion, extends the idea discussed in the previous paragraph to rule out not only the dominated separating equilibria but also all pooling equilibria. Thus, If we accept the Cho and Kreps (1987) argument, we predict a unique outcome to this two-type signaling game: the best separating equilibrium outcome, which is shown in Figures 13.C.S and 13.C.6.

Exercise l3,C.3: In the signaling model discussed in Section I3.C with r(O,,) = r(O,,) = 0, construct an example in which a central authority who does not observe worker types can achieve a Pareto improvement over the best separating equilibrium through a policy that involves cross-subsidization, but cannot achieve a Pareto improvement by simply banning the signaling activity. [Hint: Consider first a case with linear inditTerence curves.J

Secolld-Best Market Intervention I n contrast with the market outcome predicted by the game-theoretic model studied in Section I3.B (the highest-wage competitive equilibrium), in the presence of signaling a central authority who cannot observe worker types may be able to achieve a Pareto improvement relative to the market outcome. To see this in the simplest manner, suppose that the Cho and Kreps (1987) argument predicting the best separating equilibrium outcome is correct. We have already seen that the best separating equilibrium can be Pareto dominated by the outcome that arises when signaling is impossible. When it is, a Pareto improvement can be achieved simply by banning the signaling activity, In fact, it may be possible to achieve a Pareto improvement even when the no-signaling outcome does not Pareto dominate the best separating equilibrium. To see how, consider Figure l3.C.1 I. In the figure, the best separating equilibrium has low-ability workers at point (OL' 0) and high-ability workers at point (01/, e). Note that the high-ability workers would be worse off if signaling were banned, since the point (£[OJ, O) gives them less than their equilibrium level of utility. Nevertheless, note that if we gave the low- and high-ability workers outcomes of (IVL,O) and ("'", ell), respectively, both types would be better off. The central authority can achieve this outcome by mandating that workers with education levels below ell receive a wage of"'L and that workers with education levels of at least ell receive a

,, ,I

13.C:

The case with r(OIl) = r(O"l = 0 studied above, in which the market outcome in the absence of signaling is Pareto optimal. illustrates how the use of costly signaling can reduce welfare. Yet, when the market outcome in the absence of signaling is not efficient, signaling's ability to reveal information about worker types may instead create a Pareto improvement by leading to a more efficient allocation of labor. To see this point, suppose that we have r = ,«(lL) = '(011)' with 0L < r < 0" and £[0] O. Contract (wI. + e, I,J will attract all type 01. workers. and contract (IV II + e, 11/) will attract all type 01/ workers. [Note that since type 0, initially prefers contract (w" I,) to (WI' tl)' we have w, - C(I" 0,) ~ WI - C(II' 0,), and so (Wi + e) - C(ti' Oil ~ (wj + e) - c(tjo 0,).] Since e can be chosen to be arbitrarily small. this deviation yields this firm profits arbitrarily close to n, and so the firm has a profitable deviation. Thus. we must have n :5 O. Because no firm can incur a loss in any equilibrium (it could always earn zero by oITering no contracts), both firms must in fact earn a profit of zero. _

w

Type (I, :..,....-- Indifference Curve

13,0:

~

til

lemma 13,0.2: No pooling equilibria exist.

for type 0, with perfttt observability.

Proof: Suppose that there is a pooling equilibrium contract (WP.I P). By Lemma 13.0.1, it lies on the pooled break-even line, as shown in Figure 13.0.3. Suppose that firm j is oITering contract (w p • I P ). Then firm k ¥ j has a deviation that yields it a strictly positive profit: It ofTers a single contract (1\', i) that lies somewhere in the shaded region in Figure IlD.3 and has Ii· < 0". This contract attracts all the type 0" workers and none of the type {lL workers, who prefer (w p• I P) over (IV, i). Moreover, since II- < 0", firm k makes strictly positive profits from this contract when the high-ability workers accept it. _

Flgur. 13.0.2 (right)

Break-even lines.

We now consider the possibilities for separating equilibria. Lemma 13.0.3 shows that all contracts accepted in a separating equilibrium must yield zero profits. Lemma 13.0.3: If (WL' ttl and (WH' tH ) are the contracts signed by the low- and high-ability workers in a separating equilibrium. then both contracts yield zero profits; that is, w L = OL and w H = 0H' Proof: Suppose first that 11'1. < 0,.. Then either firm could earn strictly positive profits by instead oITering only contract (w,., rd, where 01. > wL > IVI.' All low-ability workers would accept this contract: moreover, the deviating firm earns strictly positive profits from any worker (of low or high ability) who accepts it. Since Lemma 13.0.1 implies that no such deviation can exist in an equilibrium, we must have II"L ~ OL in any separating equilibrium. Suppose, instead, that 11'11 < 011' as in Figure 13.0.4. If we have a separating Flgur. 13.0.3 (left)

Type OL

No pooling equilibria Type 0"

exist. Figure 13.0.4 (right)

Lemma 13,0.1: In any equilibrium, whether pooling or separating, both firms must earn zero profits.

The high·ability contract in a

Proof: Let (wI.' t,J and (IV II • til) be the contracts chosen by the low- and high-ability workers, respectively (these could be the same contract), and suppose that the two firms' aggregate profits are n > O. Then one firm must be making no more than n/2. Consider a deviation by this firm in which it alTers contracts (WI. + e, tl.) and

°L ----------------------

1

separating equilibrium cannot have wli < 0Il'

464

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SCREENING

equilibrium, then the type Ii, contract (w"ILl must lie in the hatched region of the figure (by Lemma 13.0.1, it must also have 11', > lid. To see this, note that since type 011 workers choose contract (11'1/, 11/), contract (11'" Id must lie on or below the type 011 indifference curve through (11'1/, I,,), and since type 0, workers choose (11'" I,) ovcr (11'", I,,), contract (w" Id must lie on or above the type 0, indifference curve through (11'11' III)' Suppose that firm j is offering the low-ability contract (11',,1 1.1. Then firm k # j could earn strictly positive profits by deviating and offering only a contract lying in the shaded region of the figure with a wage strictly less than 011' sllch as (Ii', i). This contract, which has 11'/1 < 011' will be accepted by all the type 011 workers and by none of the type 0, workers [since firm j will still be offering contract (11'1., I,J]. SO we must have 11'11 ~ 0" in any separating equilibrium. Since, by Lemma 13.0.1, firms break even in any equilibrium, we must in fact have 11'1. = III. and 11'11 = 011' •

---- ...--Oc (~,;,,;)------------------

Proposition 13.0.2 summarizes the discussion so far. Proposition 13.0.2: In any subgame perfect Nash equilibrium of the screening game, low-ability workers accept contract (OL' 0), and high-ability workers accept contract (OH' iH), where iH satisfies OH - e(iH. Ot! = OL - e(O, Ot!. Proposition 13.0.2 does not complete our analysis, however. Although we have established what any equilibrium must look like, we have not established that one exists. In fact, we now show that one may nOI exist. Suppose that both firms are offering the two contracts identified in Proposition 13.0.2 and illustrated in Figure 13.0.7(a). Does either firm have an incenlive to deviate? No firm can earn strictly positive profits by deviating in a manner that attracts either only high-ability or only low-ability workers (just try to find such a deviation). But what about a deviation that attracts 0/1 workers? Consider a deviation in which the deviating firm attracts all workers 10 a single pooling contract. In Figure 13.0.7(a), a contract can attract both types of workers if and only if it lies in the shaded region. There is no profitable deviation of this type if, as depicted in the figure, Ihis shaded area lies completely above the pooled break-even line. However, when some of the shaded area lies strictly below the pooled break-even line, as in Figure 13.D.7(b), a profitable deviation to a pooling contract such as (I", i) exists. In this case, 110 eql/i1ibrium exists. Even when no single pooling contract breaks the separating equilibrium, it is possible that a profitable deviation involving a pair of contracts may do so. For example, a firm can attract both types of workers by offering the contracts (IV" Id and ("'1/,1/1) depicted in Figure 13.0.8. When it does so, type 0, workers accept contract (1"/., I,) and type 0/1 workers accept (WI/,1I/)' If this pair of contracts yields the firm a positive profit, then this deviation breaks the separating contracts identified

Proof: Consider Figure 13.0.6. By Lemmas 13.0.3 and 13.0.4, we know that = (0,,0) and that 11'11 = 01/' In addition, if the type 0, workers are willing to acccpt contract (0/.,0), III must be at least as large as the level ill depicted in the

(11'/., I,J

Figure 13.0.5 (leH)

The low-ability workers must recci\c

contract (Oc,O) in an! separating equilibrium (Ii',i)

I I I

I

I I I

I

o

ill

(b)

(0/1, i/l)' •

Wc can now derive the high-ability workers' contract.

Of. (;~.t~)-----r------------

o

(al

posilive profit by also offering, in addition to its current contracts, a contract lying in the shaded region of the figure with "'/I < 0/1, such as (w, I). This contract attracts all the high-ability workers and does not change the choice of the low-ability workers. Thus, in any separating equilibrium, the high-ability contract must be

Lemma 13.0.5: In any separating equilibrium, the high-ability workers accept contract (OH' i H ), where iH satisfies OH - c(iH , Ii L ) = OL - c(O, Ot!.

I

o

figure. Note that low-ability workers are indifferent between contracts (OL' 0) and (0/1' i/l), and so 0/1 - e(il/' Od = 0, - e(O, 0,). Suppose, then, that the high-ability contract (0/1' III) has 1/1 > il/' as in the figure. Then either firm can earn a strictly

I'roof: By Lemma 13.0.3,11'1. = 0, in any separating equilibrium. Suppose that the low-ahility workers' contract is instead some point (0/., li.1 with Ii. > 0, as in Figure 13.0.5. (Although it is not important for the proof, the high-ability contract must then lie on the segment of the high-ability break-even line lying in the hatched region of the figure, as shown.) If so, then a firm can make strictly positive profits by offering only a contract lying in the shaded region of the figure, such as (w, I). All low-ability workers accept this contract, and the contract yields the firm strictly positive profits from any worker (of low or high ability) who accepts it. •

I

SCREENING

465

Figure 13.0.7

Lemma 13.0.4: In any separating equilibrium, the low-ability workers accept contract (OL'O); that is, they receive the same contract as when no informational imperfections are present in the market.

Typ< Ii,.

13.0:

£[0] -----

Lemma 13.0.4 identifics the contract that must be accepted by low-ability workers in any separating equilibrium.

\I'

SECTION

Flgur. 13.0.6 (rig hi)

The high·ability workers must receivc

contract (Oil' ill) in any separating equilibrium.

J

An equilibrium may not exist. (a) No pooling contract breaks the separating equilibrium. (b) The pooling contracl (Ii>, i) breaks the separating equilibrium.

466

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

--- --

SCREENING

Type 0.

W

Flgur. 13.0.8 ~----------------------

(wL.td

in Proposition I3.D.2 and no equilibrium exists. More generally, an equilibrium exists only ir there is no such profitable deviation.

Welfare Properties of Screening Equilibria Restricting attention to cases in which an equilibrium does exist, the screening equilibrium has welrare properties parallel to those or the signaling model's best separating equilibrium [with r(Od = r(O.) = 0). First, as in the earlier model, asymmetric inrormation leads to Pareto inefficient outcomes. Here high-ability workers end up signing contracts that make them engage in completely unproductive and disutility-producing tasks merely to distinguish themselves rrom their less able counterparts. As in the signaling model, the low-ability workers are always worse ofT here when screening is possible than when it is not. One difference rrom the signaling model, however, is that in cases where an equilibrium exists, screening must make the high-ability workers better off; it is precisely in those cases where it would not that a move to a pooling contract breaks the separating equilibrium [see Figure 13.D.7(b)]. Indeed, when an equilibrium does exist, it is a constrained Pareto optimal outcome; ir no firm has a deviation that can attract both types or workers and yield it a positive profit, then a central authority who is unable to observe worker types cannot achieve a Pareto improvement either."

A profitable deviation uSlOg a pair of

contracts may eXist that breaks the separating equilibrium.

A P PEN 0 I X

A:

A E A SON A B L E • BEL I E F 8

A E FIN E MEN T SIN

S I G N A LIN 0

money. But if (w', t') is withdrawn as a result, then low·ability workers will accept (w, i) and this deviation ends up being unprofitable. Hellwig (1986) examines sequential equilibria and their refinements in a game that explicitly allows for such withdrawals. By introducing such reactions, these papers establish the existence of pure strategy equilibria. Introducing reactions of this sort does not simply eliminate the nonexistence problem. however, but also yields somewhat different predictions regarding the character· istics of market equilibria and their welfare properties. For example, when firms can make multiple offers as we have allowed here, cross·subsidization can arise in Wilson equilibria. Indeed, Miyazaki (1977) shows that in the case in which multiple offers are possible, a Wilson equilibrium always exists and is necessarily a constrained Pareto optimum. In the screening model examined above. we took the view that the uninformed firms made employment offers to the informed workers. Vet we could equally well imagine a model in which informed workers instead make contract offers to the firms. For example, each worker might propose a task level at which she is willing to work, and firms might then offer a wage for that task level. Note, however, that this alternative model exactly parallels the signaling model in Section 13.C and. as we have seen, yields quite different predictions. For example. the signaling model has numerous equilibria, but here we have at most a single equilibrium. This is somewhat disturbing. Given that our models are inevitably simplifications of actllal market processes, if market outcomes are really very sensitive to issues such as this our models may provide us with little predictive ability. One approach to this problem is offered by Maskin and Tirole (1992). They note that contracts like those we have allowed firms to offer in the screening model discussed in this section are still somewhat restricted. In particular, we could imagine a firm offering a worker a contract that involved an ex post (after signing) choice among a set of wage-task pairs (you will sec more about contracts of this type in Section 14.C). Similarly, in considering the counterpart model in which workers make offers. we could allow a worker to propose such a contract. Maskin and Tirole (1992) show that with this enrichment or the allowed contracts (and a weak additional assumption) the sets of sequential equilibria of the two models coincide (there may be mUltiple equilibria in both cases).

APPENDIX A: REASONABLE·BELIEFS REFINEMENTS IN SIGNALING GAMES

What can be said about the potential nonexistence of equilibrium in this model? Two paths have been followed in the literature. One approach is to establish existence of equilibria in the larger strategy space that allows for mixed strategies; on this, see Dasgupta and Maskin (1986). The other is to take the position that the lack of equilibria indicates that, in some important way. the model is incompletely specified. The aspect the literature has emphasized in this regard is the lack of any dynamic reactions to new contract offers [see Wilson (1977), Riley (1979), and Hellwig (1986»). Wilson (1977), for example, uses a definition of equilibrium that captures the idea that firms are able to withdraw unprofitable contracts from the market. A set of contracts is a Wilson equilibrium if no firm has a profitable deviation that remains profitable once existing contracts that lose money after the deviation are withdrawn. This extra requirement may make deviations less attractive. In the deviation considered in Figure 13.0.3, for example, once contract (w, i) is introduced, the original contract (w', t') loses 25. Actually, there is a small gap: An equilibrium may exist when there is another pair of Contracts that would give higher utility to both types of workers and that would yield the firm deviating to it exactly zero profits. In this case, the equilibrium is not a constrained Pareto optimum.

In this appendix, we describe several commonly used reasonable-beliefs refinements or the perrect Bayesian and sequential equilibrium concepts for signaling games, and we apply them to the education signaling model discussed in Section 13.c. Excellent sources for rurther details and discussion are Cho and Kreps (1987) and Fudenberg and Tirole (1992). Consider the rollowing class or signaling games: There are I players plus nature. The first move or the game is nature's, who picks a "type" for player I, E 0 = {O" ... , ON}' The probability ortype is [(0), and this is common knowledge among the players. However, only player I observes O. The second move is player I's, who picks an action a rrom set A aner observing O. Then, aner seeing player I's action choice (but not her type), each player i = 2, ... , I simultaneously chooses an action s, rrom set S,. We define S = S2 X . . . x St. Ir player I is of type 0, her utility rrom choosing action a and having players 2, ... , I choose s = (S2" .. ,St) is ",(a, s, 0). Player ii'I receives payoff ",(a, s, 0) in this event. A perrect Bayesian

e

a

0 A M ES

467

468

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

ANO

SCREENING

equilibrium (PBE) in the sense used in Section 13.C is a profile of strategies

(a(O), s2(a), ... ,s,(a)), combined with a common belief function 1'(0 Ia) for players 2, ... ,I that assigns a probability 1'(0 I a) to type 0 of player I conditional on

---- --

observing action a E A, such that (i) Player l's strategy is optimal given the strategies of players 2, ... , I. (ii) The belief function }leO I a) is derived from player l's strategy using Bayes' rule where possible. (iii) The strategies of players 2, ... , 1 specify actions following each choice a E A that constitute a Nash equilibrium of the simultaneous-move game in which the probability that player I is of type II is 11(11 I a) for all II E 0. In the context of the model under study here, this notion of a PBE is equivalent to the sequential equilibrium notion. The education signaling model in Section 13.C falls into this category of signaling games if we do not explicitly model the worker's choice between the firms' offers and instead simply incorporate into the payoff functions the implications of her optimal choice (she chooses from among the firms offering the highest wage if this wage is positive and refuses both firms' offers otherwise). In that model, 1 = 3,0 = {II,., II/I}, the set A = :r: r;e, O} contains the possible education choices of the worker, and the set Si = : II': \\' E IR) contains the possible wage offers by firm i.

Domillatioll-Based Refillemellts of Beliefs The simplest reasonable-belief refinement of the PBE notion arises from the idea (discussed in Section 9.0) that reasonable beliefs should not assign positive probability to a player taking an action that is strictly dominated for her. In a signaling game, this problem can arise when players 2, ... ,I (the firms in the education signaling model) assign a probability }leO I a) > 0 to player I (the worker) being of type 0 after observing action a, even though action a is a strictly dominated choice for player I when she is of type O. Formally, we say that action a E A is a strictly dominated choice for type 0 if there is an action a' E A such that Min IIt(a', s', 0) > Max " t (a, s, 0). 2"

REASONABLE.BELIEFS

REFINEMENTS

IN

SIGNALING

Unfortunately, in the education signaling model discussed in Section l3.C, this refinement does not narrow down our predictions at all. The set 0(e) equals {OL' Ou} for all education levels e because either worker type will find e to be her optimal choice if the wage offered in response to e is sufficiently in excess of the wage offered at other education levels. Thus, no beliefs are ruled out, and all PBEs of the signaling game pass this test. If we want to narrow down our predictions for this model, we need to go beyond the use of refinements based only on notions of strict dominance. 28 Recall the argument we made in Section 13.C for eliminating all separating equilibria but the best one. We argued that since, in Figure 13.C.7, a worker of type 0,. would be better off choosing e = 0 than she would choosing an education level above i' for any beliefs and resulling equilibrium wage Ihal mighl follow Ihese Iwo edllcalion lel'e/s, no reasonable belief should assign a positive probability to a worker of type II,. choosing any e > e. This is close to an argument that education levels e > e are dominated choices for a type OL worker, but with the critical difference reflected in the italicized phrase: Only equilibrium responses of the firms are considered, rather than all conceivable responses. That is, we take a backwardinduction-like view that the worker should only concern herself with possible equilibrium reactions to her education choices. To be more formal about this idea, for any nonempty set 0 c 0, let S*(0, a) c S, x ... x S, denote the set of possible equilibrium responses that can arise after action a is observed for some beliefs satisfying the property that 1'(11 I a) > 0 only if II E 0. The set S*(0, a) contains the set of equilibrium responses by players 2, ... ,I that can follow action choice a for some beliefs that assign positive probability only to types in 0. When 0 = 0, the set of all conceivable types of player I, this construction allows for all possible beliefs. 29 We can now say that action a E A is strictly dominated for type 0 in this stronger sense if there exists an action a' with Min

u,(a', s', 0)

> Max

u,(a, s, 0).

(13.AA.2)

nS·(9.4I'

Using this stronger notion of dominance, we can define the set 0*(a) = {O: there is no a'

E

A satisfying (13.AA.2)j,

containing those types of player I for whom action a is not strictly dominated in the sense of (\ 3.AA.2). We can now say that a PBE has reasonable beliefs if for all a E A with 0*(a) i' 0, I/(a, II) > 0 only if 0 E 0*(a). Using this reasonable-beliefs refinement significantly reduces the set of possible outcomes in the educational signaling model, sometimes even to a unique prediction. In that model, S*(0, e) = [OL' 011] for all education choices e because, for any belief I' E [0, I], the resulting Nash equilibrium wage must lie between OL and 0/1' As a

(13.AA.I)

For each action a E A, it is useful to define the set 0(a) = [0: there is no a' E A satisfying (13.AA.I»).

This is the set of types of player I for whom action a is not a strictly dominated choice. We can then say that a PBE has reasonable beliefs if, for all a E A with 0(a) i' 0, onlyif

A:

s'£S·(9.o',

Sf,S

II(Ola»O

APPENDIX

2S. We could, in principle, go further with this identification of strictly dominated strategies for player I by also eliminating any strictly dominated strategies for players 2, ... ,I, then looking to see whether we have any more strictly dominated actions for any of player I's types, and so on. However, in the educational signaling model, this does not help us because the firms have no strictly dominated strategies. 29. Note that when there is only one player responding (so I = 2), the set S'(0, a) is exactly the set of responses that are not strictly dominated for player 2 conditional on following action a. Note also that in this case a strategy s,ta) is weakly dominated for player 2 if, for any a e A, it involves play of some Sf S'(0, a).

OE0(a)

and we consider a PBE to be a sensible prediction only if it has reasonable beliefs."

26. Note that a strategy a(O) is strictly dominated for player I if and only if it involves play of a strictly dominated action for some type O. 27. Doing this is equivalent to first eliminating each type (J's dominated actions from the game and then identifying the PBEs of this simplified game.

J

GAMES

469

470

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING.

ANO

SCREENING

~~~---------------------------------------------------Type

(I,.

e can survive because, as we argued in Section D.C. the high-ability worker will do better by deviating to an education level slightly in excess of e. Furthermore, we can also eliminate any pooling equilibrium in which the equilibrium outcome is worse for a high-ability worker than outcome (Ii", e), such as in the equilibrium depicted in Figure 13.AA.I, since any such equilibrium must involvc unreasonable beliefs: If 1,(0" 1e) = I for all e > thcn a type Ii" worker could do better deviating to an education level just above where she would receive a wage of 0". In fact, when the high-ability worker prefers outcome (0", e) to (£[0],0), this argument rules out all pooling equilibria, and so we get the unique prediction of the best separating equilibrium.

e, e

E(/UiliiJl'iulIl Domination and the Intuitive Criterion We now consider a further strengthening of the notion of dominance, known as 0 for 0 > 0 and r(0) < 0 for 0 < O. Let the densilY of workers of type 0 be f(O), with f(O) > 0 for all 0 E [Q, 0]. Show that a competitive equilibrium with unobservable worker types necessarily involves a Pareto inefficient outcome.

Although the use of either equilibrium domination or the intuitive criterion yields a unique prediction in the education signaling model when there are two types of workers, they do not accomplish this when there are three or more possible worker types (see Exercise 13.AA.l). 'Stronger refinements such as Banks and Sobel's (1987) notions of divilliry and universal diviniry, Cho and Kreps' (1987) related notion called D I, and Kohlberg and Mertens' (1986) srabiliry do yield the unique prediction of the best separating equilibrium in these games with many worker types. See Cho and Kreps (1987) and Fudenberg and Tirole (1992) for further details.

13.B.3" Consider a positive selection version of the model discussed in Section 13.B in which r(') is a continuous, strictly decreasing function of O. Let the density of workers of type Ii be f(O), wilh f(O) > 0 for all 0 E [Q, 0].

(a) Show that the more capable workers are the ones choosing to work at any given wage. (b) Show that if reO) > 0 for all 0, then the resulting competitive equilibrium is Pareto efficient.

(e) Suppose that there exists a (; such Ihat reO) < 0 for 0 > () and reO) > 0 for 0 < 0. Show that any competitive equilibrium with strictly positive employment necessarily involves too

ml/eh employment relative to the Pareto optimal allocation of workers. 13.B.4" Suppose two individuals, I and 2, are considering a trade at price p of an asset that they bOlh use only as a store of wealth. Ms. I is currently the owner. Each individual i has a privately observed signal of the asset's worth YI' In addition, each cares only about the expected value of the asset one year from now. Assume that a trade at price p takes place only if both parties think they are being made strictly better off. Prove that the probability of trade occurring is zero. [Hilll: Study the following trading game: The two individuals simullaneously say either "trade" or "no trade," and a trade at price p takes place only if they bOlh say" trade."]

REFERENCES Akcrlof. G. (1970). The market for lemons: Quality uncertainly and the market mechanism. Quarterly JOU"'QI of Economics 89: 488-500. Banks. J.. and J. Sobel. (1987). Equilibrium selection in signaling games. £wnomt'lr;ca 55: 647-62. Cho. I·K .. and D. M. Kreps. (1987). Signaling games and stable equilibria. Quarurly Journal of Economics 102: 179-221. Dasgupta. P., and E. Maskin. (1986). The existence of equilibrium in discontinuous economic games. Rt'lliew of Economic Studies 46: 1-41. fudcnbcrg. D .. and 1. Tirole. (1992). Game Thear),. Cambridge, Mass.: M IT Press. Hellwig, M. (1986). Some recent developments in the theory of competition in markets with adverse selection. (University or Bonn. mimeographed). Holmstrom, B., and R. B. Myerson. (1983). Efficient and durable decision rules with incomplete information. Econometrica 51: 1799-819. Kohlberg. E.. and J.-F. Mertens. (1986). On the strategic stability of equilibria. ECotlOmetrica 54: 1003-38. Maskin, E., and J. Tirole. (1992). The principal.agcnt relationship with an informed principal,ll: Common values. Econometrica 60: 1-42. Miyazaki. H. (1977). The rat race and internal labor markets. Bell JOUr/wi of Economics 8: 394-418. Riley, 1. (1979). Informational equilibrium. Econometrica 47: 331-59. Rothschild, M .. and 1. E. Stiglitz. (1976). Equilibrium in competitive insurance mark.ets: An essay in the economics of imperfect information. Quarterly Journal of Economics 80: 629-49. Spence. A. M. (1973). Job mark.l signaling. Quarl.,ly Journal of Economics 87: 355-74. Spence, A. M. (1974). Markel Signaling. Cambridge. Mass.: Harvard University Press. Wilson, C. (1977). A model of insurance markets with incomplete information. Journal of Economic Theory 16: 167-207. Wilson, C. (1980). The nature or equilibrium in markets with adverse selection. Bell Journal of Economics 11: 108-30.

I3.B.5" Reconsider the case where reO) = r for all 0, but now assume thai when the wage is such thai no workers are accepting employment firms believe that any worker who might accept would be of the lowest quality, that is, £[0 Ie = 0] = Q. Maintain the assumption that all workers accept employment when indifferent. (a) Argue that when £[0] :2: r > Q, there are now two competitive equilibria: one with e" = [Q,O] and one with w" = Q and e" = 0. Also show that when Q :2: r Ihe unique competitive equilibrium is w" = £[0] and e" = [Q,O], and when r > £[0] the unique competitive equilibrium is w" = Q and e" = 0.

1\""

= £[0] and

(b) Show that when £[0] > r and there are two equilibria, the full-employment equilibrium Pareto dominates the no-employment one. (e) Argue that when £[0] :2: r the unique SPNE of the game-theoretic model in which two firms simultaneously make wage offers is the competitive equilibrium when this equilibrium is unique, and is the full-employment (highest-wage) competitive equilibrium when the competitive equilibrium is not unique and £[0] > r. What happens when £[0] = r? What about the case where £[0] < r? (d) Argue that the highest-wage competitive equilibrium is a constrained Pareto optimum.

J

473

474

CHAPTER

13:

ADVERSE

SELECTION,

SIGNALING,

AND

SCREENING

13.B.6C [Based On Wilson (1980)] Consider the following change in the adverse selection model of Section 13.B. Now there are N firms. each of which wants to hire at most I worker. The N firms differ in their productivity: In a firm of type y a worker of type 0 produces yO units of output. The parameter y is distributed with density function g(') on [0,00], and Y(i') > 0 for all y E [0, 00].

---

E X E R CIS E S

13.C2C Reconsider the two-type signaling model with r(Od = r(8 H ) = 0, assuming a worker's productivity is 0(1 + I'e) with I' > O. Identify the separating and pooling perfect Bayesian equilibria, and relate them to the perfect information competitive outcome. 13.C3" In text.

(a) Let :(11', II) denote the aggregate demand for labor when the wage is II' and the average productivity of workers accepting employment at that wage is II. Derive an expression for this function in terms of the density function g(').

I3.C4" Reconsider the signaling model discussed in Section I3.C, now assuming that worker types are drawn from the interval [Q,O] with a density function f(O) that is strictly positive everywhere on this interval. Let the cost function be c(e, 0) = (e' /0). Derive the (unique) perfect

(b) Let 11(\\') = £[Olr(O),;; w], and define the aggregate demalld fUlletiall for labor by z*(\\') = :(\\',11(\\')). Show that :*(11') is strictly increasing in II' at wage IV if and only if the

Bayesian equilibrium.

elasticity of It with respect to w exceeds 1 at wage \\. (assume that all relevant functions arc uifferentiable).

13.CS" Assume a single firm and a single consumer. The firm's product may be either high or low qualily and is of high quality with probability i .. The Consumer cannot observe quality before purchase and is risk neutral. The consumer's valuation of a high-quality product is v,,; her valuation of a low-quality product is VL' The costs of production for high (H) and low (L) quality are "'*. Can you use these facts to give a simple proof of Proposition 13.8.2?

13.C6' Consider a market for loans to finance investment projects. All investment projects require an outlay of I dollar. There are two types of projects: good and bad. A good project > 0 and a probability (I - Po) of yielding has a probability of Po of yielding profits of profits of zero. For a bad project. the relative probabilities are P. and (I - P.), respectively, where Po > P" The fraction of projects that are good is i, E (0, I). Entrepreneurs go to banks to borrow the cash to make the initial outlay (assume for now that they borrow the entire amount). A loan contract specifies an amount R that is supposed to be repaid to the bank. Entrepreneurs know the type of project they have, but the banks do not. In the event that a project yields profits of zero, the entrepreneur defaults on her loan contract. and the bank receives nothing. Banks are competitive and risk neutral. The risk-free rate of interest (the rate the banks pay to borrow funds) is r. Assume that

n

I3.B.S" Consider the following alteration to the adverse selection model in Section I3.B. Imagine that when workers engage in home production, they use product x. Suppose that the amount consumed is related to a worker's type, with the relation given by the increasing function x(O). Show that if a central authority can observe purchases of good x but not worker types, then there is a market intervention that results in a Pareto improvement even if the market is at the highest·wage competitive equilibrium. 13.8.9" Consider a model of positive seleetioll in which r(') is strictly decreasing and there are two types of workers, 0" and 01., with 00 > 0" > 01• > O. Let i. = Prob (0 = 0,,) E (0,1). Assume that r(O,,) < 0" and that r(Od> 0L' Show that the highest-wage competitive equilibrium need not be a constrained Pareto optimum. [Hint: Consider introducing a small unemployment benefit for a case in which £[0] = r(Od. Can you use the result in Exercise 13.8.7 to give an exact condition for when a competitive equilibrium involving full employment is a constrained Pareto optimum?]

Pc; n

on

-

(I + r) > 0 > p. n

-

(I

+ r).

(a) Find the equilibrium level of R and the set of projects financed. How does this depend i., n, and r'~

{'G' PH'

(b) Now suppose that the entrepreneur can offer to contribute some fraction x of the I dollar initial outlay from her own funds (x E [0, I]). The entrepreneur is liquidity constrained, howe,.r. so that the effective cost of doing so is (I + I')x, where I' > r. (i) What is an entrepreneur's payoff as a function of her project type. her loan-repayment amount R, and her contribution x? (ii) Describe the best (from a welfare perspective) separating perfect Bayesian equifibrium of a game in which the entrepreneur first makes an offer that specifies the level of x she is willing to put into a project, banks then respond by making offers specifying the fevel of R they would require, and finally the entrepreneur accepts a bank's offer or decides not to go ahead with the project. How does the amount contributed by entrepreneurs with good projects change with small changes in P" PG, A, n, and r?

13.B.10" Show that Proposition 13.8.2 continues to hold when r(O) > 0 for some O. l3.CI" Consider a game in which, first, nature draws a worker's type from some continuous distribution on [Q, 0]. Once the worker observes her type, she can choose whether to submit to a cost less test that reveals her ability perfectly. Finally, after observing whether the worker has taken the test and its outcome if she has, two firms bid for the worker's services. Prove that in any subgame perfect Nash equilibrium of this model all worker types submit to the test, and firms offer a wage no greater than Q to any worker not doing so.

J

475

476

C HAP T E R

1 3:

A 0 V E R S ESe LEe T ION.

5 I G N A LIN G.

AND

C HAP T E R

S eRe e N I N G

----------------------------------------------------------------------(iii) How do the two types of entrepreneurs do in the separating equilibrium of (b)(ii) compared with the equilibrium in (o)?

The Principal-Agent Problem

\3.0.1" Extend the screening model to a case in which tasks are productive. Assume that a type 0 worker produces O( I + 1'1) units of output when her task level is r where II > O. Identify the subgame perfect Nash equilibria of this model.

14

I3.D.2" Consider the following model of the insurance market. There are two types of individuals: high risk and low risk. Each starts with initial wealth W but has a chance that "n accident (c.g., a firc) will reduce her wealth by L. The probability of this happening is PL for low· risk types and PH for high-risk types, where PH > Pl.. Both types are expected utility m"ximizers with a Bernoulli utility function over wealth of u(w), with u'(w) > 0 and u"(I\') < 0 at all

1\'.

There arc two risk-neutral insurance companies. An insurance policy consists of a

premium payment M made by the insured individual to her insurance firm and a paymcnt R from the insurance company to the insured individual in the event of a loss. (0) Suppose that individuals are prohibited from buying more than one insurancc policy.

Arguc that a policy can be thought of as specifying the wcalth levels of the insured individual in the two states "no loss'" and "loss."

i

I 14,A Introduction In Chapter 13, we considered situations in which asymmetries of information exist between individuals at the time of contracting. In this chapter, we shift our attention to asymmetries of information that develop subsequem to the signing of a contract. Even when informational asymmetries do not exist at the time of contracting, the parties to a contract often anticipate that asymmetries will develop sometime after the contract is signed. For example, after an owner of a firm hires a manager, the owner may be unable to observe how much effort the manager puts into the job. Similarly, the manager will often end up having better information than the owner about the opportunities available to the firm. Anticipating the development of such informational asymmetries. the contracting parties seek to design a contract that mitigates the difficulties they cause. These problems arc endemic to situations in which one individual hires another to take some action for him as his "agent." For this reason. this contract design problem has come to be known as the principal-agent problem. The literature has traditionally distinguished between two types of informational problems that can arise in these settings: those resulting from hidden actions and those resulting from hidden in/ormation. The hidden action case. also known as moral hazard. is illustrated by the owner's inability to observe how hard his manager is working; the manager's coming to possess superior information about the firm's opportunities, on the other hand. is an example of hidden information.' Although many economic situations (and some of the literature) contain elements of both types of problems. it is useful to begin by studying each in isolation. In Section 14.B, we introduce and study a model of hidden actions. Section 14.C analyzes

(b) Assumc that the insuranc~ companies simultaneously olTer policies; as in Section 13.0, they can each olTer any finitc number of policics. What are the subgamc perfcct Nash equilibrium outcomes of the model'! Docs an equilibrium necessarily exist'!

I3.D.3 c Consider the following extension of the model you developed in Exercise 13.0.1. Suppose that therc is a fixed task level T that all workers face. The monetary equivalent cost of accepting cmployment at this task level is c > 0, which is independent of worker type. However, now a worker's actual output is observable and verifiable. and so contracts can base I;ompensation on the worker's ex post observed output level. (0) What is the subgame perfect Nash equilibrium outcome of this model?

(b) Now suppose that the output realization is random. It can be either good (qG) or bad ('I.). Thc probability th"t it is good is PH for a high-ability worker and PI. for a low-ability worker (Pu > Pt.>. If workers are risk-neutral expected utility maximizers with a Bernoulli utility fUllction over wealth of u(w)

= "',

what is the subgame perfect Nash equilibrium

outcomc? (\:) \Vltat ir workers are strictly risk averse with II"(\\,) < 0 at all w?

13.\).4" Reconsider the scrcening model in Section 13.0, but assume that (i) there is an infinite nllmber of firms that could potentially enter the industry and (ii) firms can each offer at most one contract. [The implication of (i) is that, in any SPNE, no firm can have a profitable entry opportunity.] Characterize the equilibria for this casc. I3.AA.l c Consider the extension of the signaling model discussed in Section \3.C to thc OiISC of three types. Assumc all thrce types have rIO) = O. Provide an example in which more than olle perfect Bayesian equilihrium satisfies the intuitive critcrion.

1. The literature's use of the term moral hazard is not entirely uniform. The term originates in the insurance literature. which first focused attention on two types of informational imperfections: the "moral hazard" that arises when an insurance company cannot observe whether the insured exerts effort to prevent a loss and the "adverse selection" (see Section 13. B) that occurs when the

insured knows more than the company at the time he purchases a policy about his likelihood of an accident. Some authors use moral hazard to refer to either of the hidden action or hidden

information variants of the principal-agent problem [see, for example, Hart and Holmstrom (1987)]. Here, however, we use the term in the original sense.

j

477

478

C HAP T E R

1.:

THE

P R INC I PAL. AGE N T

PRO B L E M

---------------------------------------------------------------------a hidden information model. Then, in Section 14.0, we provide a brief discussion of hybrid models that contain both of these features. We shall see that the presence of postcontractual asymmetric information often leads to welfare losses for the contracting parties relative to what would be achievable in the absence of these informational imperfections. It is important to emphasize the broad range of economic relationships that fit into the general framework of the principal-agent problem. The owner-manager relationship is only one example; others include insurance companies and insured individuals (the insurance company cannot observe how much care is exercised by the insured), manufacturers and their distributors (the manufacturer may not be able to observe the market conditions faced by the distributor), a firm and its workforce (the firm may have more information than its workers about the true state of demand for its products and therefore about the value of the workers' product), and banks and borrowers (the bank may have difficulty observing whether the borrower uses the loaned funds for the purpose for which the loan was granted). As would be expected given this diversity of examples, the principal-agent framework has found application in a broad range 'of applied fields in economics. Our discussion will focus on the owner-manager problem. The analysis in this chapter, particularly that in Section 14.C, is closely related to that in two other chapters. First, the techniques developed in Section 14.C can be applicd to the analysis of screening problems in which, in contrast with the case studied in Section 13.0, only one uninformed party screens informed individuals. We discuss the analysis of this monopolistic screening problem in small type at the end of Section 14.C. Second, the principal-agent problem is actually a special case of "mechanism design," the topic of Chapter 23. Thus, the material here constitutes a first pass at this more general issue. Mastery of the fundamentals of the principalagent problem, particularly the material in Section 14.C, will be helpful when you study Chapter 23. A good source for further reading on topics of this chapter is Hart and Holmstrom

--

(1987).

14.B Hidden Actions (Moral Hazard) Imagine that the owner of a firm (the principal) wishes to hire a manager (the agent) for a one-time project. The project's profits are affected, at least in part, by the manager's actions. If these actions were observable, the contracting problem between the owner and the manager would be relatively straightforward; the contract would simply specify the exact actions to be taken by the manager and the compensation (wage payment) that the owner is to provide in return. 2 When the manager's actions are not observable, however, the contract can no longer specify them in an effective manner, because there is simply no way to verify whether the manager has fulfilled his obligations. In this circumstance, the owner must design the manager's compensation scheme in a way that indirectly gives him the incentive to take the correct

SEC T ION

1 4 • 8:

HID DEN

ACT ION S

t

M 0 RA L

actions (those that would be contracted for if his actions were observable). In this section, we study this contract design problem. To be more specific,let It denote the project's (observable) profits, and let e denote the manager's action choice. The set of possible actions is denoted by E. We interpret e as measuring managerial effort. In the simplest case that is widely studied in the literature, e is a one-dimensional measure of how "hard" the manager works, and so E c R More generally, however, managerial effort can have many dimensionshow hard the manager works to reduce costs, how much time he spends soliciting customers, and so on-and so e could be a vector with each of its elements measuring managerial effort in a distinct activity. In this case, E c AM for some M.l In our discussion, we shall refer to e as the manager's effort choice or effort level. For the nonobservability of managerial effort to have any consequence, the manager's effort must not be perfectly deducible from observation of It. Hence, to make things interesting (and realistic), we assume that although the project's profits are affected bye, they are not fully determined by it. In particular, we assume that the firm's profit can take values in ['.!' x] and that it is stochastically related to e in a manner described by the conditional density function f(lt I e), with f(lt Ie) > 0 for all e E E and all 11 E ['.!, xl Thus, any potential realization of 11 can arise following any given effort choice by the manager. In the discussion that follows, we restrict our attention to the case in which the manager has only two possible effort choices, ell and eL (see Appendix A for a discussion of the case in which the manager has many possible actions), and we make assumptions implying that ell is a "high-effort" choice that leads to a higher profit level for the firm than eL but entails greater difficulty for the manager. This fact will mean that there is a conflict between the interests of the owner and those of the manager. More specifically, we assume that the distribution of It conditional on ell first-order stochastically dominates the distribution conditional on e/.; that is, the distribution functions F(lt Ied and F(1I I ell) satisfy F(lt I ell) ::; F(lt I ed at alllt E ['.!, x], with strict inequality on some open set n c ['.!, x] (see Section 6.0). This implies that the level of expected profits when the manager chooses ell is larger than that from eL: J1If(1I I ell) d1l > J1If(1I I eLl d1l. The manager is an expected utility maximizer with a Bernoulli utility function U(IV, e) over his wage IV and effort level e. This function satisfies u.. (w, e) > 0 and u.... (w, e) ~ 0 at all (w, e) (subscripts here denote partial derivatives) and u(\\', ell) < II(IV, "L) at all w; that is, the manager prefers more income to less, is weakly risk averse over income lotteries, and dislikes a high level of effort! In what follows, we focus on a special case of this utility function that has attracted much of the

1 In fact, more general interpretations are possible. For example. e could include non-effortrelated managerial decisions such as what kind of inputs are purchased or the strategies that are adopted for appealing to buyers. We stick to the effort interpretation largely because it helps wilh intuition. 4. Note that in the multidimensional-effort case, it need not be that eH has higher effort in every dimension; the only important thing for our analysis is that it leads to higher profits and entails a larger managerial disutility than does iL.

2. Note that this requires not only that the manager's actions be observable to the owner but also that they be observable to any court that might be called upon to enforce the contract.

1

HA Z AR DI

479

480

CHAPTER

14:

THE

PRINCIPAL-AGENT

PROBLEM

attention in the literature: u(w, e) = v(w) - g(e).' For this case, our assumptions on u(w, e) imply that v'(w) > 0, v"(w) ~ 0, and g(eH) > geed. The owner receives the project's profits less any wage payments made to the manager. We assume that the owner is risk neutral and therefore that his objective is to maximize his expected return. The idea behind this simplifying assumption is that the owner may hold a well-diversified portfolio that allows him to diversify away the risk from this project. (Exercise 14.B.2 asks you to consider the case of a risk-averse owner.)

--- --

f

(11 - W(lI» [(111 e) dll

S.t.

f

f

v(w(n»[(nl e) dn -gee)

-[(lIle)

+ yv'(w{lI))[(nle)

,,(w:> -

f

481

= 0,

(14.B.3)

gee) = ii.

(14.B.4)

n[(l1l e) dll - v-'(ii

+ gee)).

(14.B.5)

The first term in (14.8.5) represents the gross profit when the manager puts forth effort e; the second term represents the wages that must be paid to compensate the manager for this effort [derived from condition (14.B.4)]. Whether ell or eL is optimal depends on the incremental increase in expected profits from ell over eL compared with the monetary cost of the incremental disutility it causes the manager. This is summarized in Proposition 14.B.1.

to offer the manager? Second, what is the best choice of e? Given that the contract specifies effort level e, choosing w(n) to maximize S(n - w(nll[(nl e) dn = (S lI[(nl e) dn) - (S w(n)[(nle) dn) is equivalent to minimizing the expected value of the owner's compensation costs, S w(n)[(l1l e) dll, so (14.B.l) tells us that the optimal compensation scheme in this case solves

S.t.

I

or

~ ii.

Proposition 14.B.1: In the principal-agent model with observable managerial effort, an optimal contract specifies that the manager choose the effort e* that maximizes n f (nl e) dn - v- '(D + gee»~] and pays the manager a fixed wage w* = v- '(D + g(e*)). This is the uniquely optimal contract if v"(w) < 0 at all w.

(14.B.2)

w(n)[(nle)dll

H A Z A R0

Note that since Ilk,,) > g(e,,), the manager's wage will be higher if the contract calls for effort ell than if it calls for eL . On the other hand, when the manager is risk neutral, say with v(w) = w, condition (14.B.3) is necessarily satisfied for any compensation function. In this case, because there is no need for insurance, a fixed wage scheme is merely one of many possible optimal compensation schemes. Any compensation function w(n) that gives the manager an expected wage payment equal to ii + gee) [the level derived from condition (14.B.4) when v(\\') = IV] is also optimal. Now consider the optimal choice of e. The owner optimally specifies the effort level e E :e,., ell} that maximizes his expected profits less wage payments,

It is convenient to think of this problem in two stages. First, for each choice of

f

(M 0 R A L

w:

e that might be specified in the contract, what is the best compensation scheme w(n)

Min

ACT ION S

If the manager is strictly risk averse [so that v'(w) is strictly decreasing in w], the implication of condition (14.B.3) is that the optimal compensation scheme W{lI) is a constant; that is, the owner should provide the manager with a fixed wage payment. This finding is just a risk-sharing result: Given that the contract explicitly dictates the manager's effort choice and that there is no problem with providing incentives, the risk-neutral owner should fully insure the risk-averse manager against any risk in his income stream (in a manner similar to that in Example 6.C.1). Hence, such given the contract's specification of e, the owner offers a fixed wage payment that the manager receives exactly his reservation utility level:

(l4.B.I)

V(W(lI)) [(111 e) dn - I/(e)

HID 0 E N

condition 6

I

The Optimal Contract when EjJort is Observable

Max

1 4. B:

V'(W{lI» = y.

It is uscful to begin our analysis by looking at the optimal contracting problem when effort is observable. Suppose that the owner chooses a contract to offer the manager that the manager can then either accept or rcject. A contract here specifies the manager's effort e E {e,., ell} and his wage payment as a function of observed profits W(lI). We assume that a competitive market.for managers dictates that the owner must provide the manager with an expected utility level of at least ii if he is to accept the owner's contract offer ('I is the manager's reser"",ioll lIIility le,'e/). If the manager rejects the owner's contract offer, the owner receives a payoff of zero. We assume throughout that the owner finds it worthwhile to make the manager an offer that he will accept. The optimal contract for the owner then solves the following problem (for notational simplicity, we suppress the lower and upper limits of integration ~ and x):

,'EI"/.t'"I. ".. _I

SEC T ION

0

~ 'I.

6. The first-order condition ror w{tt) is derived by taking the derivative with respect to the manager's wage at each level or 1t separately. To see this point, consider a discrete version or the model in which there is a finite number or possible profit levels (7[1 •...• 1t N ) and associated wage levels (Wi •...• wN ). The first-order condition (14.8.3) is analogous to the condition onc gels in the discrete model by examining the first-order conditions ror each W II , n = I..... N (note that we allow the wage payment to be negative). To be rigorous. we should add that when we have a continuum of possible levels or 7[, an optimal compensation scheme need only satisry condition

The constraint in (14.B.2) always binds at a solution to this problem; otherwise, the owner could lower the manager's wages while still getting him to accept the contract. Letting y denote the multiplier on this constraint, at a solution to problem (14.B.2) the manager's wage W(lI) at each level of n E [~, x] must satisfy the first-order

(14.B.3) aL a seL of profiL levels LhaL is of full measure.

5. Exercise 14.B.1 considers one implication or relaxing this assumption.

.1

------"

482

CHAPTER

14:

THE

PRINCIPAL·AGENT

PROBLEM

-------------------------------------------------------------------------The Optimal Colltract whell Effort is Not Observable

-

The optimal contract described in Proposition 14.B.1 accomplishes two goals: it specifics an efficient effort choice by the manager, and it fully insures him against income risk. When effort is not observable, however, these two goals often come into conflict because the only way to get the manager to work hard is to relate his pay to the realization of profits, which is random. When these goals come into conflict, the nonobservability of effort leads to inefficiencies. To highlight this point, we first study the case in which the manager is risk neutral. We show that in this case, where the risk-bearing concern is absent, the owner can still achievc the same outcome as when effort is observable. We then study the optimal contract when the manager is risk averse. In this case, whenever the first-best (full ohscrvability) contract would involve the high-efTort level, eflicient risk bearing and ellieient incentive provision come into conflict, and the presence of nonobservable actions leads to a welfare loss.

14.B:

HIDDEN

ACTIONS

(MORAL

The manager is willing to accept this contract as long as it gives him an expected utility of at least ii, that is, as long as

f

rrf(rr Ie') drr - ex - g(e') ;;:: ii.

(14.B.8)

Let (I.' be the level of ex at which (14.B.8) holds with equality. Note that the owner's payoff if the compensation scheme is w(rr) = rr - IX' is exactly 0:' (the manager gets all of" except for the fixed payment (I.'). Rearranging (14.B.8), we see that 0:' = rrf(rr Ie') drr - g(e') - U. Hence, with compensation scheme "~rr) = rr - (I.', both the owner and the manager get exactly the same payoff as when effort is observable. _

J

The basic idea behind Proposition 14.B.2 is straightforward. If the manager is risk neutral, the problem of risk sharing disappears. Efficient incentives can be provided without incurring any risk-bearing losses by having the manager receive the full marginal returns from his effort.

A ri ....{-II('lIt,.al HllIIUlfWr

A risk-averse manager

Surpose that 1'(\\') = \\'. Applying Proposition 14.B.I, the optimal efTort level (" when elTort is observable solves Max ('eh',.(',d

f

l£f(l£ I e) dl£ - y(e) - U.

When the manager is strictly risk averse over income lotteries, matters become more complicated. Now incentives for high efTort can be provided only at the cost of having the manager face risk. To characterize the optimal contract in these circumstances, we again consider the contract design problem in two steps: first, we characterize the optimal incentive scheme for each efTort level that the owner might want the manager to select; second, we consider which efTort level the owner should induce. The optimal incentive scheme for implementing a specific effort level e minimizes the owner's expected wage payment subject to two constraints. As before, the manager must receive an expected utility of at least II if he is to accept the contract. When the manager's effort is unobservable, however, the owner also faces a second constraint: The manager must actually desire to choose effort e when facing the incentive scheme. Formally, the optimal incentive scheme for implementing e must therefore solve

(14.B.6)

The owner's profit in this case is the value of expression (l4.B.6), and the manager receives an expected utility of exactly II. Now consider the owner's payoff when the manager's efTort is not observable. In Proposition 14.B.2, we establish that the owner can still achieve his full-information payofT. Proposition 14.B.2: In the principal-agent model with unobservable managerial effort and a risk-neutral manager, an optimal contract generates the same effort choice and expected utilities for the manager and the owner as when effort is observable. Proof: We show explicitly that there is a contract the owner can ofTer that gives him the same payoff that he receives under full information. This contract must therefore be an optimal contract for the owner because the owner can never do better when effort is not observable than when it is (when efTort is observable, the owner is always free to offer the optimal nonobservability contract and simply leave the choice of an effort level up to the manager). Suppose that the owner offers a compensation schedule of the form w(l£) = 1£ - (I., where 1 is some constant. This compensation schedule can be interpreted as "selling the project to the manager" because it gives the manager the full return rr except for the fixed payment ~ (the "sales price"). If the manager accepts this contract, he chooses e to maximize his expected utility,

f

SECTION

I\'(rr) I(rr I e) I at alinE Ii. But ify = O,condition (14.B.IO) then implies that "'(II'(n» :$ 0 at any such n (recall that JJ : O. On the other hand, if I' = 0 in the solution to problem (14.B.9) then, by condition (14.B.10), the optimal compensation schedule gives a fixed wage payment for every profit realization. But we know that this would lead the manager to choose eL rather than ell' violating constraint (iil/) of problem (14.B.9). Hence, J1 > O. • 7. Although problem (l4.B.9) may not appear to be a convcx programming problem. a simple

transformation of the problem shows lhat (14.B.1O) is both a necessary and a sufficient condition for a solution. To see this. reformulate (I4.B.9) as a problem of choosing the manager's level of utility for each profit outcome n. say ii(n). Lelling 4>(') = .. -1(.), the objective function becomes

reformulated problem (see Section M.K of the Mathematical Appendix). The first-order condition

for this problem is

"~(wen»~ = v(n).

[(nle..) < I f(nlel/)

'v

if

f(nle,J --~~-~ > I. I(nlel/)

X. A more direct argument for constmint (i) being binding goes as follows: Suppose that w(n) is a solution to (14.B.9) in which constraint (i) is not binding. Consider a change in the compensation function that lowers the wage paid at each level of 7t in such a way that the resulting decrease in utility is equal at all n. that is, to a new function "~'In) with [v(w(n)) - v(,,'(n))] = t.v > 0 at all 7r E [~. ill This change does not affect the satisfaction of the incentive constraint (ii H ) since if the manager wa~ willing to pick ell when faced with w(n), he will do so when faced with \v(n). Furthermore, because constraint (i) is not binding, the manager will still accept this new contract if j,r is small enough. Lastly. the owner's expected wage payments will be lower than under w(n). This yields a contradiction.

which is convex in ii(n). and the constraints are then all linear in ti(n).

Thus. (Kuhn-Tucker) first·order conditions are both necessary and sufficient for a maximum of this

Defining wen) by

if

This relationship is fairly intuitive. The optimal compensation scheme pays more than IV for outcomes that are statistically relatively more likely to occur under ell than under (',. in the sense of having a likelihood ratio [f(nl e,J/f(n I el/)] less than I. Similarly, it ofTers less compensation for outcomes that are relatively more likely when "" is chosen. We should stress, however, that while this condition evokes a statistical interpretation, there is no actual statistical inference going on here; the owner kllows whal level of effort will be chosen given the compensation schedule he ofTers. Rather, the compensation package has this form because of its illcenril'e effects. That is, by structuring compensation in this way, it provides the manager with an incentive for choosing el/ instead of "t. This point leads to what may at first seem a somewhat surprising implication: in an optimal incentive scheme, compensation is not necessarily monotonically increasing in profits. As is clear from examination of condition (14.8.10), for the optimal compensation scheme to be monotonically increasing, it must be that the likelihood ratio [/(nle..)/I(nle,,)] is decreasing in n; that is, as n increases, the likelihood of getting profit levelrr if effort is ell relative to the likelihood if effort is eL must increase. This property, known as the monotolle likelihood ratio property [see Milgrom (1981)], is lIot implied by first-order stochastic dominance. Figures 14.B.l(a) and (b), for example, depict a case in which the distribution of n conditional on ell stochastically dominates the distribution of n conditional on e L but the monotone likelihood ratio property does not hold. In Ihis example, increases in effort serve to convert low profit realizations into intermediate ones but have no effect on the likelihood of very high profit realizations. Condition (14.8.10) tells us that in this case. we should have higher wages at intermediate levels of profit than at very high ones because it is the likelihood of intermediate profit levels that is sensitive to increases in effort. The optimal compensation function for this example is shown in Figure 14.8.I(c).

or

S4>(l'(nli/(n I",,) dn.

w(n) > ,(,

wIn} 01. and Prob (0,,) = J. E (0, I). (Exercise 14.CI asks you to consider the case of an arbitrary finite number of states.) A contract must try to accomplish two objectives here: first, as in Section 14.B, the risk-neutral owner should insure the manager against fluctuations in his income: second, although there is no problem here in insuring that the manager puts in elTort (because the contract can explicitly state the elTort level required), a contract that maximizes the surplus available in the relationship (and hence, the owner's payoff) must make the level of managerial effort responsive to the disutility incurred by the manager, that is, to the state O. To fix ideas, we first illustrate how these goals are accomplished when is observable: we then turn to an analysis of the problems that arise when II is observed only by the manager.

--- --

1 ... C:

HID 0 E N

IN FOR MAT ION

(A NOM 0 N 0 POL 1ST I

These conditions indicate how the two objectives of insuring the manager and making elTort scnsitive to the state are handled. First, rearranging and combining conditions (14.C2) and (14.C3), we sec that

rr'(en = (I,.(e1, 0,)

for i = L, H.

(14.C7)

This condition says that the optimal level of elTort in state 01 equates the marginal benefit of elTort in terms of increased profit with its marginal disutility cost. The pair (11'1. is illustrated in Figure 14.CI (note that the wage is depicted on the vertical axis and the elTort level on the horizontal axis). As shown, the manager is beller off as we move to the northwest (higher wages and less elTort), and the owner is beller 01T as we move toward the southeast. Because the manager receives utility level II in state 0" the owner seeks to find the most profitable point on the manager's state 0, indilTerence curve with utility level U. This is a point of tangency between the manager's indilTerence curve and one of the owner's isoprofit curves. At this point, the marginal benefit to additional effort in terms of increased profit is exactly equal to the marginal cost borne by the manager. The owner's profit level in state 0, is n1 = rr(e1) - 1'-'(11) - l/(e1, 0,). As shown in Figure 14.C1. this profit is exactly equal to the distance from the origin to the point at which the owner's isoprofit curve through point (\\'1, hits the vertical

en

°

(14.CI)

Max w/ •• '/2!O

>\·/I.""~O

i. v(w" - g(e", 0,,»

+ (I

- i.)v(w l •

-

g(e l., Otl);:> II.

en

In any solution [(11';, en. (wr" er,)] to problem (14.CI) the reservation utility constraint must bind; otherwise, the owner could lower the level of wages olTered and still have the manager accept the contract. In addition, letting y ;:> denote the multiplier on this constraint, the solution must satisfy the following first-order conditions: (14.C2) -i. + }·i.v'(wr, - g(er" 0,,)) = o.

°

. '( ell.)

I.n

-

. '(. yJ.V W II -

491

°°

If () is observable, a contract can directly specify the elTort level and remuneration of the manager contingent on each realization of (note that these variables fully determine the economic outcomes for the two parties). Thus, a complete information contract consists of two wage-elTort pairs: (w", ell) E JR x JR+ for state 0" and (11"/., erJ E JR x JR+ for state 01.' The owner optimally chooses these pairs to solve the following problem:

+ y( I

R E E N I N G)

so the manager's marginal utility of income is equalized across states. This is the usual condition for a risk-neutral party optimally insuring a risk-averse individual. Condition (14.C6) implies that ",r, - y(er" 0Il) = wt - y(et, 01,), which in turn implies that 1'(II'r, - y(er" 011)) = 1'(11'; - y(et, 01.)); that is, the manager's utility is equalized across states. Given the rescrvation utility constraint in (14.CI), the managcr therefore has utility level Ii in each state. Now consider the optimal elTort Icvels in the two states. Since 1/,.(0, 0) = and rr'(O) > 0, conditions (14.C4) and (14.CS) must hold with equality and e1 > for i = 1,2. Combining condition (14.C2) with (14.C4), and condition (14.C3) with (14.CS), we see that the optimallevcl of elTort in state 0" e1, satisfies

TIle State (] is Observable

-( I - i.)

esc

(14.C6)

°

s.t.

SEC T ION

- i.) D'(W! -

gee!, 0tl) = 0 .

~ 0, 9 (* ell. 0)) II ge (* ell' 0II ){ = 0

(I - i.)rr'(erJ - y(1 - i.)v'(II'! - gee!, 0tl)g,(e!,

OIJ{ ~~'

Figure 14.C.1

The optimal wage-effort pair for state OJ when states are observable. n(e) - w

(14.C3)

if

eil > 0.

if et > O.

(14.CA)

(14.CS)

Profits of { 0 Owner in Slalc 0,

m:)

12. As with the case of hidden actions studied in Section 14.B, nonobservabililY causes no welfare loss In the case of managerial risk neutralily. As there. a "sellout" contract that races the manager with the full marginal returns from his actions can generate the first-best outcome. (See

Exercise 14.C.2.)

J

= n7':

492

CHAPTER

14:

THE

PRINCIPAL·AGENT

v(.' - y(e, Od)

PROBLEM

="

SECTION

v( .. - y(r,

0.» = U

.(e) - w =

.( 0 (i,e" the owner lowers the wage payments in both states by c), This new contract still satisfies constr'!int (i) as long as c is chosen small enough, In addition, the incentive compatibility constraints are still satisfied because this change just subtracts a constant, c, from each side of these constraints, But if this new contract satisfies all the constraints, the original contract could not have been optimal because the owner now has higher profits, which is a contradiction. _ Lemma 14.C.3: In any optimal contract: (i) e L :5 et; that is, the manager's effort level in state OL is no more than the level that would arise if 0 were observable, (ii) eH = e;,; that is, the manager's effort level in state OH is exactly equal to the level that would arise if 0 were observable,

Stale { ()

"

/,

I;~olil

and, as is evident in Figure I 4.C.S, the truth-telling constraints are still satisfied, Thus, > e;' cannot be optimal. a contract with Now consider part (ii), Given any wage -elTort pair (Ii'/" "/,) with ,"- :5 such as that shown in Figure 14,C.6. the owner's problem is to find the location for (1\'11'"11) in the shaded region that maximizes his profit in state 11 11 , The solution occurs at a point of tangency between the manager's state 1111 indifTerence curve through point (Ii',.. ell) and an isoprofit curve for the owner. This tangency occurs at point (lVII' cr,) in the figure, and necessarily involves efTort level er, because all points of tangency between the manager's state 0" indifTerence curves and the owner's isoprofit curves occur at efTort level er, [they are characterized by condition (14.C7) for i = If]. Note that this point of tangency occurs strictly to the right of efTort level e,. because

"I.

Ftgur. 14.C.4 (teft)

I n a feasible contract alTering (wL' eel for state OL' the pair ("'H,eH) must lie in the shaded region, Figure 14.C,5 (right)

An optimal contract has eL. :s; ei.

el. :s ei.

g(l". 0,,)
(iv) WL - g(IL'

ad

~ WI/ - g(I",

ad·

This problem has exactly the same structure as (14.C.8) but with the principal's (here the firm's) profit being a function of the state. As noted above, the analysis of this problem follows exactly the same lines as our analysis of problem (14.C.8). This class of models has seen wide application in the literature (although often with a continuum of types assumed). Maskin and Riley (1984b), for example, apply this model to the study of monopolistic price discrimination. In their model, a consumer of type 0 has utility t~x. 0) - T when he consumes x units of a monopolist'S good and makes a total payment of T to the monopolist, and can earn a reservation utility level of V(0. 0) = 0 by not purchasing from the monopolist. The monopolist has a constant unit cost of production equal to c > 0 21. The model studied in Section 13.D with ltj(t) = 0, corresponds to the limiting case where

I' -0.

eC

MOD E L 5

501

,---------------------------------------------------------------------

Exercises 14.C.7 to 14.C.9 ask you to study some examples of monopolistic screening models.

14,D Hidden Actions and Hidden Information: Hybrid Models Although the hidden action - hidden information dichotomization serves as a useful starting point for understanding principal-agent models, many real-world situations (and some of the literature as well) involve elements of both problems. To consider an example of such a model, suppose that we augment the simple hidden information model considered in Section 14.C in the following manner: let the level of efTort e now be unobservable, and let profits be a stochastic function of efTort, described by conditional density function f(n I e). I n essence, what we now have is a hidden action model, but one in which the owner also docs not know something about the disutility of the manager (which is captured in the state variable 0). Formal analysis of this model is beyond the scope of this chapter. but the basic thrust of the revelation principle extends to the analysis of these types of hybrid problems. In particular, as Myerson (1982) shows, the owner can now restrict attention to contracts of the following form: (i) After the state (/ is realized, the manager announces which state has occurred. (ii) The contract specifies, for each possible announcement 0 E e, the efTort level dO) that the manager should take and a compensation scheme 1\'(11 I 0). (iii) In every state II, the manager is willing to be both Irulirful in stage (i) and obedient following stage (ii) [i.e., he finds it optimal to choose effort level e(O) in state OJ, This contract can be thought of as a revelation game, but one in which the outcome of the manager's announcement about the state is a hidden action-style contract, that is, a compensation scheme and a "recommended action." The requirement of "obedience" amounts to an incentive constraint that is like that in the hidden action

22. The regulator's objective function can be generalized to allow a weighted average of consumer and producer surplus, with greater weight on consumers. In this case, the runction 7r i (·) will depend on 0;.

502

APPENDIX

A:

MULTIPLE

EFFORT

LEVELS

IN

THE

HIDDEN

ACTION

MODEL

503

-------------------------------------------------------------------------- ,---------------------------------------------------------------CHAPTER

14:

THE

PRINCIPAL.AGENT

PROBLEM

model considered in Section 14.B; the "truthfulness" constraints are generalizations of those considered in our hidden information model. See Myerson (1982) for details. One special case of this hybrid model deserves particular mention because its analysis reduces to that of the pure hidden information model considered in Section 14.C. In particular, suppose that effort is unobservable but that the relationship between effort and profits is determillistic, given by the function n(e). In that case, for any particular announcement 8, it is possible to induce any wage-effort pair that is desired, say (w, e), by use of a simple "forcing" compensation scheme: Just reward the manager with a wage payment of w if profits are n(e), and give him a wage payment of - OCJ otherwise. Thus, the combination of the observability of n and the one-to-one relationship between nand e effectively allows the contract to specify e. The analysis of this model is therefore identical to that of the hidden information model considered in Section 14.C, where wage-effort pairs could be specified directly as functions of the manager's announcement. To see this point in a slightly different way, note first that because of the ability to write forcing contracts, in this model an optimal contract can be thought of as specifying, for each announcement 6, a wage-profit pair (w(O), n(O)). Now, for any required profit level n, the effort level necessary to achieve a profit of n is e such that nee) = n. Let the function e(n) describe this etTort level. We can now think of the manager as having a disutility function defined directly over the profit level which is given by y(n, 0) = g(e(n), II). But this model looks just like a model with observable etTort where the effort variable is n, the disutility function over this etTort is y(n, II), and the profit function is n(7I) = 71. Thus, the analysis of this model is identical to that in a pure hidden information model. A similar point applies to a closely related hybrid model in which, instead of the manager's disutility of effort, it is the relation between profit and effort that depends on the state. In particular, suppose that the disutility of effort is given by the function g(e) and profits are given by the function 7I(e, Ii), where 71.(') > 0, 71 .. (') < 0, 71,(') > 0, and n.. (·) > 0. Effort is not observable, but profits are. The idea is that the manager knows more than the owner does about the true profit opportunities facing the firm (e.g., the marginal productivity of effort). Again, we can think of a contract as specifying, for each announcement by the manager, a wage-profit pair (implicitly using forcing contracts). In this context, the effort needed to achieve any given level of profit 71 in state II is given by some function 1'(71, Ii), and the disutility associated with this effort is then g(n, 0) = g(';(7I, II». But this model is also equivalent to our basic hidden information model with observable etTort: just let the etTort variable be n, the disutility of this etTort be g(n, 0), and the profit function be n(n) = 71. Again, our results from Section 14.C apply.

Figure 14.AA.1

Density functions for E = {eL , eM' ell}: effort choice eAt may not be

implementable.

Profit Realization

to the more general specification initially introduced in Section 14.B in which E is the feasible set of effort choices. As in Section 14.B, we can break up the principal's (the owner's) problem into several parts: (a) What are the effort levels e that it is possible to induce'! (b) What is the optimal contract for inducing each specific etTort level e E E'! (c) Which etTort level e E E is optimal'! In a multiple-action setting, each of these three parts becomes somewhat more complicated. For example, with just two actions, part (a) was trivial: eL could be induced with a fixed wage contract, and ell could always be induced by giving incentives that were sufficiently high at outcomes that were more likely to arise when ell is chosen. With more than two actions, however, this may not be so. For example, consider the three-action case in which E = {e L, eM' ell} and the conditional density functions are those depicted in Figure 14.AA. I. As is suggested by the figure, it may be impossible to design incentives such that eM is chosen because for any w(n) the agent may prefer either eL or ell to eM' (Exercise 14.B.4 provides an example along these lines.) Part (b) also becomes more involved. The optimal contract for implementing effort choice e solves Min w(lt)

f

(14.AA.I)

\I'(n)f(nle)dn

s.t. (i)

f

v(w(n)) f(n I e) dn - g(e)

(ii) e solves

~.~~

f

~ ii

v(w(n»I(n\e) dn - g(e).

Ifwe have K possible actions in set E, the incentive constraints in problem (14.AA.I) [constraints (ii)] consist of (K - I) constraints that must be satisfied. In this case, with a change of variables in which we maximize over the level of utility that the manager gets conditional on n, say v(7I), we have a problem with K linear constraints and a convex objective function [see Grossman and Hart (1983) and footnote 7 for more on this]. However, if E is a continuous set of possible actions, say E = [0, e] c IR, then we have an infinity of incentive constraints. One trick sometimes used in this case to

APPENDIX A: MULTIPLE EFFORT LEVELS IN THE HIDDEN ACTION MODEL

In this appendix, we discuss additional issues that arise when the effort choice in the hidden action (moral hazard) model discussed in Section 14.B is more complex than the simple two-effort-choice specification e E {e L, ell} analyzed there. Here, we return

d

504

CHAPTER

1.:

PRINCIPAl.AGENT

THE

PROBLEM

----------------------------------------------------------------------------------simplify problem (14.AA.l) is to replace constraint (ii) with a ./irst-order condition (this is sometimes called the ./irst·order approach). For example, if e is a one. dimensional measure of effort, then the manager's first-order condition is

f

~:P:P:E:N~O~':X~B~'~S~O~"~U~T~'~O~N__~O_'

,--

Using Lemma 14.C.1 we can restate problem (14.C.8) as Max ..... //.1'/1

v(w(n» !e(n I e) dn - g'(e) = 0,

OBLEM WITH HIDDEN INFORMATION ___ 505 __T_H_E__P_R_'_H_C_'_P_A_l_._A_G_E_N__T __P_A________________________________

~

).[n(el/) - wl/]

(14.AA.2)

1

• I.

+ I'

[!.(n1e)] f(n Ie)

The condition that ratio Ue(n I 11

+ 'l -

1/

= O.

(14.IlB.2)

+ , = o.

(14.BB.3)

- 1.

(14.1313.4)

(I 4. 1lB.5)

Step 4: Steps 1 to 3 imply that 4>, = O. Suppose not: i.e., that ,. > O. Then constraint (iv) must be binding. We shall now derive a contradiction. First, substitute for 4'" in conditions (14.BB.4) and (14.BB.5) using the fact that 4>11 = 4>,. + i.from condition (14.BB.2). Then, using the fact that (e,., ell)>> O. we can write cond,t,ons (14.1l1l.4) and (14.BB.5) as

+ 4>1.[y,.(e ll . 0,.)

-II,.(eIl' 0Il)] = 0

and

+ (I + d[g,(e/.. 011) -

g,(e" 0,,)] =

o.

But 0 and Ye(O. 0,) = a for i = L. H. Similarly for condition (14.B8.5) and e,.

~ v·'(ii)

lVI/ -

g(el/,

Step 2: Adding conditions (14.BB.2) and (14.BB.3) implies that i' = I. Hence. constmint (i) must bind at an optimal solution.

w,J

(iii)

w, - gte"~ 0Il) "'II -

Step I: Condition (14.BB.2) implies that 11 > O. Thus, constraint (iii) must bind (hold with equality) at an optimal solution.

(I - i.)[n'(e,.) - 1I,(e/.. 011)] WI. -

(14.B8.I)

along with the complementary slackness conditions for constraints (i), (iii). and (iv) [conditions (M.K.7)]. Let us break up the analysis of these conditions into several steps.

0

s.t. (i)

~

if 1', > 0

i.[rr'(ell) - r/,(ell. 0Il)]

+ (I

OLl

if 1'" > 0

Recall problem (14.C.8): ""/1. ell

WI/ -

(iv)

-(I - i.)

APPENDIX B: A FORMAL SOLUTION OF THE PRINCIPAL-AGENT PROBLEM WITH HIDDEN INFORMATION

i.[n(el/) - w,,]

~ V-'(ol)

(iii)

-i.

Finally, to answer part (c), we need to compute the optimal contract from part (b) for each action that part (a) reveals is implementable and then compare their relative profits for the principal. With more than two effort choices, two features of the two-effort-choice case fail to generalize. First, nonobservability can lead to an upward distortion in effort. (Exercise 14.B.4 provides an example.) Second, at the optimal contract under nonobservability we can get boch an inefficient effort choice and inefficiencies resulting from managerial risk bearing.

Max

OLl

IV,]

Letting (i', 4>", 4>d ~ 0 be the multipliers on constraints (i), (iii), and (iv), respectively, the Kuhn- Tucker conditions for this problem can be written (see Section M.K of the Mathematical Appendix)

(14.AA.3)

.

).)[n(ed -

s.t. (i) w, - g(e"

where f.(n I e) = af(n I e)IDe. If we replace constraint (ii) with (14.AA.2) and solve the resulting problem, we can derive a condition for w(n) that parallels condition (14.8.10):

v'(w(;i) =

+ (I -

0, ""',.t'/ ~ 0

rr'(ed - y,.(e" 011) >

a > n'(ell ) -

g,(e ll , 011),

which implies 1'1/ > e, since n(e) - gte, 011) is concave in e. But if el/ > e, and constraint (iii) binds (which it does from Step I), then constraint (iv) must be slack

....

506

CHAPTER

14;

THE

PRINCIPAL-AGENT

PROBLEM

-------------------------------------------------------------------------------------------because we then have

l

EX E R CIS E S

~----------

Maskin. E.. and J. Riley. (l984b). Monopoly with incomplete information. Rand Journal of Economics IS:

("'" - wel =

=
,. = 0, we know from (14.BB.2) that 4>11 two values into conditions (14.88.4) and (14.B8.5) we have

= J..

Substituting these

1['(e,,) - Y,(e", 0,,) = 0

(14.B8.6)

EXERCISES

(14.B8.7)

14.B.I" Consider the two-elTort-level hidden aelion model discussed in Section 14.B with the gene«11 utility function u(w, e) for Ihe agent Must the reservalion utility constraint be binding

and

[1['(e,.) - y •. (e L• 0,.>]

i.

+ l·~-;: [g.(eL' 0,,) -

y.(e L • 0,.>] =

o.

in an optim uk.).

14.C.6C Reconsider the labor market screening model in Exercise 13.0.1, but now suppose that there is a single employer. Characterize the solution to this firm's screening problem (assume that both types of workers have a reservation utility level of 0). Compare the task levels in this solution with those in the equilibrium of the competitive screening model (assuming an equilibrium exists) that you derived in Exercise 13.0.1.

(a) Suppose that the owner wants to implement effort choice e" and that both Rand C are observable. Derive the first-order condition for the optimal compensation scheme I\'(R. C). How does it depend on Rand C?

14.C.7" (1. Tirole) Assume that there are two types of consumers for a firm's product. OH and II,.. The proportion of type 0,. consumers is i.. A type Us utility when consuming amount x of the good and paying a total of T for it is u(x, T) = Ov(x) - T. where

(b) How would your answer to (a) change if the manager could always unobservably reduce the revenues of the firm (in a way that is of no direct benefit to him)?

1_(1 -x)' vex) = ----2---. The firm is the sole producer of this good, and its cost of production per unit is c > O.

(e) What if, in addition, costs are now unobservable by a court (so that compensation can be made contingent only on revenues)?

(a) Consider a nondiscriminating monopolist. Derive his optimal pricing policy. Show that he serves both classes of consumers if either OL or i. is "large enough."

14.B.7C Consider a two-period model that involves two repetitions of the two-effort-level hidden action model studied in Section 14.8. There is no discounting by either the firm or the manager. The manager's expected utility over the two periods is the sum of his two single-period expected utilities £[v(w) - g(e)), where v'(·) > 0 and v"(·) < O. Suppose that a contract can be signed ex ante that gives payoffs in each period as a function of performance up until then. Will period 2 wages depend on period I profits in the optimal contract?

(b) Consider a monopolist who can distinguish the two types (by some characteristic) but can only charge a simple price p; to each type 0,. Characterize his optimal prices.

(e) Suppose the monopolist cannot distinguish the types. Derive the optimal two-part tariff (a pricing policy consisting of a lump-sum charge F plus a linear price per unit purchased of p) under the assumption that the monopolist serves both types. Interpret. When will the monopolist serve both types? (d) Compute the fully optimal nonlinear tariff. How do the quantities purchased by the two types compare with the levels in (a) to (e)?

14.B.SC Amend the two-effort-choice hidden action model discussed in Section 14.B as follows: Suppose the principal can, for a cost of c, observe an extra signal y of the agent's effort. Profits n and the signal )' have a joint distribution fen, y Ie) conditional on e. The decision to investigate the value of )' can be made after observing It. A contract now specifies a wage schedule wen) in the event of no investigation, a wage schedule I1'(It, }') if an investigation occurs, and a probability pen) of investigation conditional

14.C.S" Air Shangri-la is the only airline allowed to fly between the islands of Shangri-la and Nirvana. There are two types of passengers, tourist and business. Business travelers are willing to pay more than tourists. The airline, however, cannot tell directly whether a ticket purchaser is a tourist or a business traveler. The two types do differ, though, in how much they are willing to pay to avoid having to purchase their tickets in advance. (Passengers do not like to commit themselves in advance to traveling at a particular time.) More specifically, the utility levels of each of the two types net of the price of the ticket, P, for any given amount of time W prior to the flight that the ticket is purchased are given by

on n. Characterize the optimal contract for implementing effort level ell'

14.C.I C Analyze the extension of the hidden information model discussed in Section 14.C where there are an arbitrary finite number of states (0" ... , ON) where 0;+, > 0; for all i. 14.C.2" Consider the hidden information model in Section 14.C, but now let the manager be risk neutral with utility function v(w) = IV. Show that the owner can do as well when 0 is unobservable as when it is observable. In particular, show that he can accomplish this with a contract that offers the manager a compensation scheme of the form w(It) = It - a and allows him to choose any effort level he wants. Graph this function and the manager's choices in (w, e)-space. What revelation mechanism would give this same outcome?

BIISilless: Tourisc:

v - O.p - W, v - OfP - W,

where 0 < O. < 0,.. (Note that for any given level of W, the business traveler is willing to pay more for his ticket. Also, the business traveler is willing to pay more for any given reduction in W.)

..

509

510

CHAPTER

t.:

THE

PRINCIPAL.AGENT

PROBLEM

The proporlion of travelers who are tourists is .I.. Assume that the cost of transporting a passenger is e. Assume in (a) to (d) that Air Shangri·la wants to carry both types of passengers.

-

PAR

T

F

0

U

R

General Equilibrium

(0) Draw the indifference curves of the two types in (P, W)-space. Draw the airline's

isoprofit curves. Now formulate the optimal (profit·maximizing) price discrimination problem mathematically that Air Shangri·la would want to solve. [Hinr: Impose nonnegativity of prices as a constraint since, if it charged a negative price, it would sell an infinite number of tickets at this price.] (b) Show that in the optimal solution, tourists are indifferent between buying a ticket and not going at all. (c) Show that in the optimal solution. business travelers never buy their ticket prior to the night and are just indifferent between doing this and buying when tourists buy. (d) Describe fully the optimal price discrimination scheme under the assumption that they sell to both types. How does it depend on the underlying parameters i., 0•• Or, and ,,? (0) Under what circumstances will Air Shangri·la choose to serve only business travelers?

P"rt IV is devoted to an examination of competitive market economies from a

gelleral equilibrium perspective. Our use of the term "general equilibrium" refers both

14.C.9" Consider a risk·averse individual who is an expected utility maximizer with a Bernoulli utility function over wealth u(·). The individual has initial wealth Wand faces a probability IJ of suffering a loss of size L, where W> L > O. An insurance contract may be described by a pair (c" e,), where e, is the amOunt of wealth the individual has in the event of no loss and c, is the amount the individual has if a loss is suffered. That is, in the event no loss occurs the individual pays the insurance company an amount (W - e,), whereas if a loss occurs the individual receives a payment [e, - (W - L)] from the company.

to a methodological point of view and to a substantive theory. Methodologically, the general equilibrium approach has two central features. First, it views the economy as a closed and interrelated system in which we must simultaneously determine the equilibrium values of all variables of interest. Thus, when we evaluate the effects of a perturbation in the economic environment, the equilibrium levels of the entire set of endogenous variables in the economy needs to be recomputed. This stands in contrast to the partial equilibrium approach, where the impact on endogenous variables not directly related to the problem at hand is explicitly or implicitly disregarded. A second central feature of the general equilibrium approach is that it aims at reducing the set of variables taken as exogenous to a small number of physical realities (e.g., the set of economic agents, the available technologies, the preferences and physical endowments of goods of various agents). From a substantive viewpoint, general equilibrium theory has a more specific meaning: It is a theory or the determination of equilibrium prices and quantities in a system of perfectly competitive markets. This theory is often referred to as the Walrasian theory of markets [from L. Walras (1874)], and it is the object of our study in Part IV. The Walrasian theory of markets is very ambitious. It attempts no less than to predict the complete vector of final consumptions and productions using only the fundamentals of the economy (the list of commodities, the state of technology, prererences and endowments), the institutional assumption that a price is quoted for every commodity (including those that will not be traded at equilibrium), and the behavioral assumption of price taking by consumers and firms. Strictly speaking, we introduced a particular case of the general equilibrium model in Chapter 10. There, we carried out an equilibrium and welfare analysis of perfectly competitive markets under the assumption that consumers had quasilinear preferences. In that setting, consumer demand functions do not display wealth effects (except for a single commodity, called the numeraire); as a consequence, the analysis of a single market (or small group of markets) could be pursued in a manner understandable as traditional partial equilibrium analysis. A good deal of what we do in Part IV

(0) Suppose that the individual's only source of insurance is a risk·neutral monopolist (i.e., the monopolist seeks to maximize its expected profits). Characterize the contract the monopolist will offer the individual in the case in which the individual's probability of loss, IJ. is observable.

(b) Suppose, instead, that 0 is not observable by the insurance company (the individual knows 0). The parameter 0 can take one of two values {OL,OH}' where 0" > OL > 0 and Prob (Od = i.. Characterize the optimal contract offers of the monopolist. Can one speak of one type of insured individual being "rationed" in his purchases of insurance (i.e., he would want to purchase more insurance if allowed to at fair odds)? Intuitively. why does this rationing occur? [Him: It might be helpful to draw a picture in (c" e,)·space. To do so. start by locating the individual's endowment point. that is, what he gets ifhe does not purchase any insurance.] (c) Compare your solution in (b) with your answer to Exercise I3.D.2. 14.AA.I" Show that [f,(lt Ie)ll(lt Ie)] is increasing in It for all e E [a, b] c R if and only if for any e', e" E [a. b]. with e" > e', [f(ltle")II(ltle')] is increasing in It.

"L.

l4.AA.2" Consider a hidden action model with e E [0, i] and two outcomes It,, and with "L' The probability of ltH given effort level e is I(lt" Ie). Give sufficient conditions for the first·order approach to be valid. Characterize the optimal contract when these conditions arc satisfied.

"If >

l4.B8.I" Try solving problem (14.B8.I) by first solving it while ignoring constraint (iv) and then arguing that the solution you derive to this "relaxed" problem is actually the solution to problem (14.B8.I).

511

...

512

PART

IV:

GENERAL

PART

EQUILIBRIUM

IV:

GENERAL

EQUILIBRIUM

513

--------------------------------------------------------------------------- --------------------------------------------------------------------------------can be viewed as an attempt to extend the ideas of Chapter 10 to a world in which wealth effects are significant. The primary motivation for this is the increase in realism it brings. To make practical use of equilibrium analysis for studying the performance of an entire economy, or for evaluating policy interventions that affect large numbers of markets simultaneously, wealth effects, a primary source of linkages across markets, cannot be neglected, and therefore the general equilibrium approach is essential. Although knowledge of the material discussed in Chapter 10 is not a strict prerequisite for Part IV, we nonetheless strongly recommend that you study it, especially Sections 10.B to 10.0. It constitutes an introduction to the main issues and provides a simple and analytically very useful example. We will see in the different chapters of Part IV that quite a number of the important results established in Chaptcr 10 for the quasilinear situation carryover to the case of general preferences. But many others do not. To understand why this may be so, recall from Chapters 4 and 10 that a group of consumers with quasilinear preferences (with respect to the same numeraire) admits the existence of a (normative) representative consumer. This is a powerful restriction on the behavior of aggregate demand that will not be available to us in the more general settings that we study here. It is important to note that, relative to the analysis carried out in Part III, we incur a cost for accomplishing the task that general equilibrium sets itself to do: the assumptions of price-taking behavior and universal price quoting-that is, the existence of markets for every relevant commodity (with the implication of symmetric information)-are present in nearly all the theory studied in Part IV. Thus, in many respects, we are not going as deep as we did in Part III in the microanalysis of markets. of market failure, and of the strategic interdependence of market actors. The trade-off in conceptual structure between Parts III and IV reflects, in a sense, the current state of the frontier of microeconomic research. The content of Part IV is organized into six chapters. Chapter 15 presents a preliminary discussion. Its main purpose is to illustrate the issues that concern general equilibrium theory by means of three simple examples: the tlVo-consumer Edgeworth box economy; the one-consumer, one-firm economy, and the sm O. In short, only the relative prices pUp! are determined in an equilibrium.

xr,

Flgur. 15.B.4 (top right)

Optimal consumption for consumer I at prices p.

Flgur. 15.B.5 (bottom)

xr

Consumer I's offer curve. This implies that the consumer's offer curve lies within the upper contour set of w, and that, if inditTerence curves are smooth, the offer curve must be tangent to the consumer's indifference curve at the endowment point. Figure 15.B.6 represents the demanded bundles of the two consumers at some arbitrary price vector p. Note that the demands expressed by the two consumers are not compatible. The total demand for good 2 exceeds its total supply in the economy ';'" whereas the total demand for good I is strictly less than its endowment w,. Put somewhat ditTerently, consumer I is a net demander of good 2 in the sense that he wants to consume more than his endowment of that commodity. Although consumer 2 is willing to be a net supplier of that good (he wants to consume less than his endowment), he is not willing to supply enough to satisfy consumer I's needs. Good 2 is therefore in excess demand in the situation depicted in the figure. In contrast, good I is in excess supply. At a market equilibrium where consumers take prices as given, markets should clear. That is, the consumers should be able to fulfill their desired purchases and

Example \S.B.I: Suppose that each consumer i has the Cobb-Douglas utility function

=

=

II,(X,;, Xli) xiix~i-'. In addition, endowments are WI = (I, 2) and w, (2, I). At prices p = (PI' p,), consumer I's wealth is (p, + 2p,) and therefore his demands lie

on the offer curve (recall the derivation in Example 3.0.1): OC,(p) = (Il(PI

+ 2p , ), (I PI

d

- 1l)(PI p,

,»).

+ 2P

figure 15.B.6

A price vector with excess demand for good 2 and excess supply for good I.

520

Ci1A":',"ER

15

GENERAL

EQUILIBRIUM

xi

2

~~~______r-__________~'02

THEORY:

_______

EXAMPLES

SEC 1 ION

Indifference Curves Ihrough w ~-+~~ -+'O~C~,~________~02

~~

1 5 . B:

PUR E

______-r___

E X C HAN C. E:

T H £

E 0 GE W 0 A TH

BOX

~02

__

x_~ ~:_. >. ,-

,',1,----------,-I

SOME

____ :'__-I_ __ } ,', p'

Figure 15.B.9

Multiple Walrasian equilibria.

0,

o,,-,_~.~_

x~ I (b)

(a)

~-I----------~~

The Edgeworth box, simple as it is, is remarkably powerful. There are virtually no phenomena or properties of general equilibrium exchange economies that cannot be depicted in it. Consider, for example, the issue of the uniqueness of Walrasian equilibrium. In Chapter 10, we saw that if there is a numeraire commodity relalive to which preferences admit a quasilinear representation, then (with strict convexity of preferences) the equilibrium consumption allocation and relative prices are unique. In Figure 15.B.7, we also have uniqueness (see Exercise 15.B.2 for a more explicit discussion). Yet, as the Edgeworth box in Figure 15.B.9 shows, this property does not generalize. In that figure, preferences (which are entirely nonpathological) are such that the offer curves change curvature and interlace several times. In particular, they intersect for prices such that p,/p, is equal to t, I, and 2. For the sake of completeness, we present an analytical example with the features of the figure.

Figure 15.B.7 ('op)

(a) A Walrasian equilibrium. (b) The consumer's offer curves intersect at the Walrasian equilibrium allocation.

" + x". Note that the utility functions are quasilinear (which, in particular, facilitates the computation of demand), but with respect to different numeraires. The endowments are w, = (2, r) and (1), = (r, 2), where r is chosen to guarantee that the equilibrium prices turn out to be round numbers. Precisely, r = 21/9 > O.ln Exercise 15.B.5, you are asked to compute the offer curves of the two consumers. They are:

Observe that the demands for the first and the second good are, respectively, decreasing and increasing with p,. This is how we have drawn OC, in Figure 15.B.7(b). Similarly, OC,(p) = (a(2p, + p,)/p" (I - 0()(2p, + p,)/p,). To determine the Walrasian equilibrium prices, note that at these prices the total amount of good I consumed by the two consumers must equal 3 (= W II + (1) ,,). Thus, a(pi

+ 2p!l + 0(2pi + p!) = pi

2"10 -

3. OC,(p" p,)

pi

Solving this equation yields

!1=_a_ p!

I - a

_

= (2 + r(~) (~)"IO, (~) -1/ )>> 0 9

and

(15.B.2) OC,(p" p,) =

Observe that at any prices (pi, p!) satisfying condition (15.B.2), the market for good 2 clears as well (you should verify this). This is a general feature of an Edgeworth box economy: To determine equilibrium prices we need only determine prices at which one of the markets clears; the other market will necessarily clear at these prices. This point can be seen graphically in the Edgeworth box: Because both consumers' demanded bundles lie on the same budget line, if the amounts of commodity I demanded are compatible, then so must be those for commodity 2. (See also Exercise 15.B.1.) •

(( P,p,)

-I/O

,2 + r

(p,) p; - (p,)"IO) p, »0.

NOle that, as illustrated in Figure 15.B.9, and in contrast with Example 15.B.I, consumer I's demand for good I (and symmetrically for consumer 2) may be increasing in p ,. To compute the equilibria it is sufficient to solve the equation that equates the total demand of the second good to its total supply, or

(p )"10 = 2 + r. ~ - ~ (~p)-'/9 + 2 + r (p)

d

521

522

CHAPTER

15:

GENERAL

EQUILIBRIUM

THEORY:

SOME

SECTION

EXAMPLES

PURE

EXCHANGE:

THE

EDGEWORTH

BOX

523

Ahr-----~~~------_.O,

THE

conditions hold:

these goods.'o Output is sold in world markets. Factors. on the other hand. are immobile and must be used for production within the country. The central question for our analysis concerns the equilibrium in the factor markets; that is. we wish to determine the equilibrium factor prices W = (w, •...• wd and the allocation of the economy's factor endowments among the J firms." Given output prices p = (p, •...• PJ) and input prices w = (w, •...• wL ). a profitmaximizing production plan for firm j solves Max

15.0:

I ..... L.

I

pj~(z;l

(15.0.5)

j

s.t.

LZj = i.

zn

How does the equilibrium factor allocation (zt .. ... compare with what this planner does? Recall from Section 5.E that whenever we have a collection of J price-taking firms, their profit-maximizing behavior is compatible with the behavior we would observe if the firms were to maximize their profits jointly taking the prices of outputs and factors as given. That is. the factor demands (zt •. .. , z1) solve

Because of the concavity of firms' production functions. first-order conditions are both necessary and sufficient for the characterization of optimal factor demands. Therefore. the L(J + I) variables formed by the factor allocation (=r ..... =1) E R~J and the factor prices w' = (IVt •...• wl) constitute an equilibrium if and only if they satisfy the following L(J + 1) equations (we assume an interior solution here):

Max

(15.0.6)

(:I •..•• :J)~O

for j = I •...• J and { = I ....• L

(15.0.1)

Since Lj zj = i (by the equilibrium property of market clearing). the factor demands must also solve problem (15.0.6) subject to the further constraint that Lj Zj = i. But this implies that the factor demands (zt ....• z1) in fact solve problem (15.0.5): if we must have Lj Zj = i. then the total cost w' -(LI z) is given. and so the joint profit-maximizing problem (15.0.6) reduces to the revenue-maximizing problem (15.0.5).

(zt.···. z1)

and for ( = I•...• L.

(15.0.2)

The equilibrium output levels are then qj = ~(zj) for every j. Equilibrium conditions for outputs and factor prices can alternatively be stated using the firms' cost functions cl(w. qj) for j = I•...• J. Output levels (qt • ...• qj) » 0 and factor prices 11'* »0 constitute an equilibrium if and only if the following

One benefit of the property just established is that it can be used to obtain the equilibrium factor allocation without a previous explicit computation of the equilibrium factor prices; we simply need to solve problem (l5.D.5) directly. It also provides a useful way of viewing the equilibrium factor prices. To sce this. consider again the joint profit-maximization problem (15.D.6). Wc can approach this problem in an equivalent manner by first deriving an aggregate

10. See Exen:isc 15.D.4 ror an endogenous determination (up to a scalar multiple) or the pri~cs I' ~ (PI"

.. PJ)'

II. Note that once the factor prices and allocations are determined. each consumer's demands G.ln be readily determined from his demand function given the exogenou5 prices (PI" .. • PJ) and

12. Note thai maximization or economy-wide revenue rrom production would be the goal of any planner who wanted to maximize consumer welrare: it allows ror the maximal purchases of consumption goods. at the fixed world prices.

Ihe wealth derived from factor input sales and profit distribulions. Recall that the currenl model is completed by assuming that this demand is met in the world markets.

rl

MODEL

531

5:.:..::

~HAPTEK

i!i:

GENERAL

EQUILIBRIUM

THEORY:

SOME

.:)tCTION

EXAMPLES

15.0.

tHE

", -.~

f(:",:,,)

=I

FIgure 15.0.1

2)(2

PRODUCTION

__________________

MODEL

533

~o,

0,

(a) A unit isoquant. (b) The unit cost (b)

(a)

function. (a) Figure 15.0.2

production function for dollars: f(:) =

Max

p.!.(Z.)

S.t.

+ ... + pJlJ(ZJ)

(h) (a)

An inefficient factor allocation.

(b)

The Pareto set of factor allocations.

represent the possible allocations of the factor endowments between the two firms in an Edgeworth box of size i, by The factors used by firm I are measured from the southwest corner; those used by firm 2 are measured from the northeast corner. We also represent the isoquants of the two firms in this Edgeworth box. Figure 15.D.2(a) depicts an inemcient allocation z of the inputs between the two firms: Any allocation in the interior of the hatched region generates more output of hOlh goods than docs :. Figure 15.D.2(b), on the other hand, depicts the Pareto set of factor allocations, that is, the set of factor allocations at which it is not possible, with the given total factor endowments, to produce more of one good without producing less of the other. The Pareto set (endpoints excluded) must lie all above or all below or be coincident with the diagonal of the Edgeworth box. If it ever cuts the diagonal then because of constant returns, the isoquants of the two firms must in fact be tangent all along the diagonal, and so the diagonal must be the Pareto set (see also Exercise 15.B.7). Moreover, you should convince yourself of the correctness of the following claims.

=,.

Liz)=:,

The aggregate factor demands must then solve Max.~o({(z) - w'z). For every I, the first-order condition for this problem is IV, = Df(z)/elz, . Moreover, at an equilibrium, the aggregate usage of factor ( must be exactly :/ Hence, the equilibrium factor price of factor ( must be W, = (if(:)/ill, ; that is, rhe prke of factor { mu.W be exactly equal to its aggregate margi/wl productivit}' (in rerms of revenue). Since f(·) is concave, this observation by itself generates some interesting comparative statics. For example, a change in the endowment of a single input must change the equilibrium price of the input in the opposite direction. Let us now be more specific and take J = L = 2, so that the economy under study produces two outputs from two primary factors, We also assume that the production functions I,(z", ZIt), 1'(Z12' ZIt) are homogeneous of degree one (so the technologies exhibit constant returns to scale; see Section 5.B). This model is known as the 2 x 2 production model. In applications, factor I is often thought of as labor and factor 2 as capital. For every vector of factor prices W = (w" w,), we denote by cj(w) the minimum cost of producing one unit of good j and by OJ(w) = (o,iw), o'l(w» the input combination (assumed unique) at which this minimum cost is reached. Recall again from Proposition 5.C.2 that Vcj(w) = (olj(w), O'j(w», Figure IS.D.I(a) depicts the unit isoquant of firm j,

Exercise 15.0.1: Suppose that the Pareto set of the 2 x 2 production model does not coincide with the diagonal of the Edgeworth box. (a) Show that in this case, the factor intensity (the ratio of a firm's use of factor I relative to factor 2) of one of the firms exceeds that of the other at every point along the Pareto set. (b) Show that in this case, any ray from the origin of either of the firms can intersect the Pareto set at most once. Conclude that the factor intensities of the two firms and the supporting relative factor prices change monotonically as we move along the Pareto set from one origin to the other.

{(zlj' Z,j) E R~: Jj(z'j' z,;l = I}, along with the cost-minimizing input combination (o,j(w), o,iw». In Figure IS.D.I(b), we draw a level curve of the unit cost function, {(WI' w,): ciw" w,) = c}. This curve is downward sloping because as w, increases, w, must fall in order to keep the minimized costs of producing one unit of good j unchanged. Moreover, the set {(WI' w,): Cj(w" WI) 2: c} is convex because of the concavity of the cost function Cj(w) in IV. Note that the vector VCj(w), which is normal to the level curve at W = (w" w,), is exactly (o,j(w), O'j(w», As we move along the curve toward higher w, and lower

In Figure 15.0.3, we depict the set of nonnegative output pairs (q"q,) that can be produced using the economy's available factor inputs. This set is known as the production possihilil)' SCI. Output pairs on the frontier of this set arise from factor allocations lying in the Pareto set of Figure IS.D.2(b). (Exercise IS.o.2 asks you to prove that the production possibility set is convex, as shown in Figure 15.0.3.)

w" the ratio O'j(w)/o'j(w) falls. Consider, first, the efficient factor allocations for this model. In Figure IS.D.2, we

g

J

534

CHAPTER

'5:

GENERAL

EQUILIBRIUM

THEORY:

SOME

EXAMPLES

SECTION

15.0:

THE

2 x 2

PRODUCTION

MODEL

535

q, Flgur. 15.0.3 (left)

The production possibility set. Flgur. 15.0.4 (right)

The equilibrium [actor q,

prices and factor

Figure 15.D.5

intensities in an interior equilibrium.

The equilibrium factor allocation.

factor intensity condition, there is at most a single pair of factor prices that can arise liS the equilibrium factor prices of an inlerior equilibrium. t s Once Ihe equilibrium factor prices 11'* are known, Ihe equilibrium output levels can be found graphically by delermining the unique point (z!, z;! in the Edgeworth box of factor allocations at which both firms have the factor inlensities associated wilh faclor prices "", that is,

With the purpose of examining more closely the determinants of the equilibrium factor allocation (z!. z;! and the corresponding equilibrium factor prices 11'* = ("'t. w!). we now assume that the fuctor intensities of the two firms bear a systematic relation to one another. In particular. we assume that in the production of good I. there is. relative to good 2. a greater need for the first factor. In Definition IS.D.I we make precise the meaning of" greater need ". is relatively more intensive in factor 1

Definition 15.0.1: The production of good than is the production of good 2 if

and

a22 (w)

at all factor prices w = (w,. w 2). To determine the equilibrium factor prices. suppose that we have an illterior equilibrium in which the production levels of the two goods are strictly positive (otherwise. we say that the equilibrium is specialized). Given our constant returns assumption, a necessary condition for (IV!. "';! to be the factor prices in an interior equilibrium is that it satisfies the system of equations and

C,(W,. 11',) = p,.

a,,(IV')

The construction is depicted in Figure 15.0.5. An important consequence of this discussion is that in the 2 x 2 production model, if Ihe factor intensity condition holds, then as long as the economy does not specialize in Ihe production of a single good [and Iherefore (15.0.7) holds], the equilibrium factor prices depend only on the technologies of the two firms and on the oil/put prices p. Thus. the levels of the endowments matter only to the extent that Ihey delermine whether the economy specializes. This result is known in the international trade literature as the faclOr price equalizatioll theorem. The theorem provides conditions (which include the presence of tradable consumption goods, idenlical produclion technologies in each country. and price-taking behavior) under which the prices of nontradable factors are equalized across nonspecialized countries.

a,,(w) > a'2(w)

a2 ,(w)

zI, = ~~(IV') z!,

(15.0.7)

That is, at an interior equilibrium, prices must be equal to unit cost. This gives us two equations for the two unknown factor prices "'I and "".'3 Figure 15.0.4 depicts the two unit cost functions in (15.0.7). By expression (15.D.7). a necessary condition for (w,. 'v,) to be the factor prices of an interior equilibrium is that these curves cross at (w" w,). Moreover. the factor intensity assumption implies that whenever the two curves cross. the curve for firm 2 must be Oatter (less negatively sloped) than that for firm I [recall that VCj(w) = (a,j(w), a'j(w))], From this, it follows that the two curves can cross at most once. I. Hence. under the

We now present two comparative statics exercises. We first ask: How does a change in the price of one of the outputs, say Pt. affect the equilibrium factor prices and factor allocations? Figure 15.D.6(a), which depicts the induced change in Figure IS.D.4. identifies the change in factor prices. The increase in p, shifts firm I's curve

15. Note, however. that although ('\',. \\ 0 and (1\""2 < 0, as we wanted. -

du'

PropositIon 15.0.1: (Stolper-Samuelson Theorem) In the 2 x 2 production model with the factor intensity assumption. if Pi increases. then the equilibrium price of the factor more intensively used in the production of good i increases, while the price of the other factor decreases (assuming interior equilibria both before and after the price change).'·

wr

We have just seen that if p, increases, then /"'! increases. Therefore, both firms must move to a less intensive use of factor I. Figure 15.D.6(b) depicts the resulting change in thc equilibrium allocation of factors. As can be seen, the factor allocation moves to a new point in the Pareto set at which the output of good I has risen and that of good 2 has fallen. For the second comparative statics exercise, suppose that the total availability of factor I increases from i , to i'" What is the effect of this on equilibrium factor prices and output levels? Because neither the output prices nor the technologies have changed, the factor input prices remain unaltered -, xr for some i. By (16.C2), we must have P' x, ~ w, for all i, and by (16.CI) P' x, > w, for some i. Hence,

L, P'x, > L,

w, = p'w

+ L p.y;, j

2. The terminology" Xi is maximal for ;::i in set B" means that x, is a prderence-maximizing choice for consumer; in the set B; that is, ."1 E B and i ;2:/ for all E B.

x x;

(16.CI)

That is, anything that is strictly preferred by consumer i to xr must be unaffordable to her. The significance of the local nonsatiation condition for the purpose at hand is that with it (16.CI) implies an additional property:

Lxi = W + L yii

Wi'

x;

Morcover, because yj is profit maximizing for firm j at price vector p,

=

E CON 0 M I C S

549

550

CHAPTER

11:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

SECTION

PROPERTIES

16.0:

THE

SECOND

FUNDAMENTAL

THEOREM

OF

WELFARE

16.D The Second Fundamental Theorem of Welfare Economics The second fundamental welfare theorem gives conditions under which a Pareto optimum allocation can be supported as a price equilibrium with transfers. It is a converse of the first welfare theorem in the sense that it tells us that, under its assumptions, we can achieve any desired Pareto optimal allocation as a market-based equilibrium using an appropriate lump-sum wealth distribution scheme. The second welf~re theorem is more delicate than the first, and its validity requires additional assumptions. To see this, reconsider some of the examples discussed in Ch-'X,~, then p'x, ~ w:'). '

Figure 16.C.1 A price equilibrium with transrers that is not a Pareto optimum.

we have p'w

+ Lj P'yj

~ p'w

+ Lj P·Yj· Thus,

L P'X

i

> p'w +

L P·Yj·

(16.C.3)

But then (x, y) cannot be feasible. Indeed, Li Xi = W + Lj Yj implies Li P' Xi = P'W + P' J'j' which contradicts (16.C.3). We conclude that the equilibrium allocation (x*, y*) must be Pareto optimal. _

L

The central idea in the proof of Proposition 16.C.1 can be put as follows: At any feasible allocation (x, Y), the total cost of the consumption bundles (x I' ... ,x,), evaluated at prices P, must be equal to the social wealth at those prices, P' W + L} P' Yj' Moreover, because preferences are locally nonsatiated, if (x, Y) Pareto dominates (x*, yO) then the total cost of consumption bundles (Xl' ... ' x,) at prices p, and therefore the social wealth at those prices, must exceed the total cost of the equilibrium consumption allocation p'(L x1j = p'w + Lj p' yj. But by the profitmaximization of Definition 16.B.4, there are no technologically feasible production levels that attain a value of social wealth at prices P in excess of P' W + Lj P' yj. The importance of the nonsatiation assumption for the result can be seen in Figure 16.C.I, which depicts an Edgeworth box where local nonsatiation fails for consumer I (note that consumer I's indifference "curve" is thick) and where the allocation x>, a price equilibrium for the price vector P = (PI' pz) (you should verify this), is not Pareto optimal. Consumer I is indifferent about a move to allocation x, and consumer 2, having strongly monotone preferences, is strictly better otT. (See Exercise 16.C.3 for a first welfare theorem compatible with satiation.) Two points about Proposition 16.C.1 should be noted. First, although the result may appear to follow from very weak hypotheses, our theoretical structure already incorporates two strong assumptions: ulliversal price quolillg of commodities (market completeness) and price taking by economic agents. In Part III, we studied a number of circumstances (externalities, market power, and asymmetric information) in which these conditions are not satisfied and market equilibria fail to be Pareto optimal. Second, the first welfare theorem is entirely silent about the desirability of the equilibrium allocation from a distributional standpoint. In Section 16.0, we study the second fundamental theorem of welfare economics. That result, a partial converse to the first welfare theorem, gives us conditions under which any desired distributional aims can be achieved through the use of competitive (price-taking) markets.

Definition 16.~.1~ Given an economy specified by ({(Xi' ;::i)}!-1' P~-lf-1' W) an allocalion (~ • V ). and a price vector p = (P" ... ,PL) # 0 constitute a price quasieqUlhbnum with transfers if there is an assignment of wealth levels (W,' ... , w,) with LiWi = p·w + LiP'Vi" such that

m

ECONOMICS

551

552

CHAPTER

11:

EQUILIBRIUM

(i) For every

i.

AND

ITS

WELFARE

PROPERTIES

SECTION

yt maximizes profits in If; that is. P'Yj ~ p·yt for all Yj Elf·

(ii) For every i. if x;>-;xi then p'x; ~ (iii)

BASIC

is,

always make. we must have p' x; =

Xi

2= p'

=~

Y=

THEOREM

OF

WELFARE

x~,

ECONOMICS

553

that

v. = {~x, E RL

X,

E V" ... ,

x, v,} E

Wi

~ lj = {~YjER":)"

E

Y" .. . ,J'JE YJ}'

Thus, V is the set of aggregate consumption bundles that could be split into I individual consumptions, each preferred by its corresponding consumer to x,*. The set l' is simply the aggregate production scI. Note that the set Y + {(v}, which geometrically is the aggregate production set with its origin shifted to (V, is the set of aggregate bundles producible with the given technology and endowments and usable, in principle, for consumption. St('l'l: Ev('ry s('t v; iscOlw('.t. Suppose that x, >-, x: and x; >-, x:' Take 0 $ (% !> I. We want to prove that <Xx; + (I - Cl)X;~, Because preferences are complete, we can assume without loss of generality that Therefore, by convexity of preferences, we have ax, + (I - a),,; ~,x;, which by transitivity yields the desired conclusion: ax, + (I - (X)x; >-, x,* [recall part (iii) of Proposition I.B.I).

x:.

Jor ever)' i. This means that we could just as well not

mention the w,'s explicitly and replace part (ii) of Definition 16.D.1 by

xr then p'

lUNOAMt.hTAl

and

j

Note also that when consumers' preferences are locally nonsatiated, part (ii) of Definition 16.D.1 implies p'x! ~ w, for every i.l In addition, from part (iii), we get L,P'x; = P'w + Lj r' y; = L, II',. Therefore, Imder Ihe assumption of focally /J(J/Isaliat.d preferences, which we

If ."(/ >i

,;,fCONO

We begin by defining, for every i, the set V. of consumptions prererred to = {x, E X,: x, >-, x~} c RL. Then define V

Part (ii) of Definition 16.0.1 is implied by the preference maximization condition of the definition of a price equilibrium with transfers [part (ii) of Definition 16.B.4]: If xi is prererence maximizing in the set {Xi E X,: P' X, ~ w,j, then no x, >-, xi with p' Xi < Wi can exist. Hence, any price equilibrium with transfers is a price quasiequilibrium with transfers. However, as we discuss later in this section, the converse is not true.

(ii')

THL

V.

Wi'

'LA = w + L yr ;

16.0:

x;.

That is, allocation (x·, y.) and price vector p constitute a price quasiequilibrium with transfers if and only if conditions (i), (ii'), and (iii) hold' Moreover, with locally nonsatiated is expenditure minimizing on the set preferences, condition (ii') is equivalent to saying that {x, E X: x, ;::,xn (see Exercise 16.D.I). Thus, our discussion later in Ihis section of the conditions under which a price quasiequilibrium with transfers is a price equilibrium with transfers can be interpreted in the locally non satiated case as providing conditions under which expenditure minimization on the set {x, e X,: x,;::, xi I implies preference maximization on the set {x,eX,: P'x,~p'xn = {x,eX,: P'x,!> w,l . •

x,;::, x;.

SI~p 2: Tire sels Valid Y + {w} are cO/wex. This is just a general, and easyto-prove, mathematical fact: The sum of any two (and therefore any number of) convex sets is convex.

xr

Slep 3: V f"\ (Y + {w}) = 0. This is a consequence of the Pareto optimality of (x·, y.). If there were a vector both in Vand in Y + {w}, then this would mean that with the given endowments and technologies it would be possible to produce an aggregate vector that could be used to give every consumer i a consumption bundle that is preferred to

x:.

Proposition 16.0.1 states a version of the second fundamental welfare theorem.

St~p

Tlrer~ is P = (PI' ... ' 1',) '" 0 alld a lIumber r Stlclr Ilral p' z ;>: r for every p'Z S r for every Z E Y + {w}. This follows directly from the separating hyperplane theorem (see Section M. G. the Mathematical Appendix). It is illustrated in Figure 16.0.1.

Proposition 16.0.1: (Second Fundamental Theorem of Welfare Economics) Consider an economy specified by ({(X;, ~;)li-" {Y;}t-" iii). and suppose that every Y; is convex and every preference relation ~; is convex [i.e., the set {X; E X;: X; ~j Xj} is convex for every x; E X;] and locally nonsatiated. Then, for every Pareto optimal allocation (x*, y*). there is a price vector p = (p" ... ,pd '" 0 such that (x·, y*. p) is a price quasiequilibrium with transfers.

4:

=E V IIl1d

Figure 16.0.1

The separation argument in the proof of the second welfare

Proof: In its essence, the proof is just an application of the separating hyperplane theorem for convex sets (see Section M.G. of the Mathematical Appendix). To facilitate comprehension, we organize the proof into a number of small steps.

theorem.

3. To see this, observe that if preferences are locally nonsalialed and p' x7 < Wi) then close to Xi with Xi >-ix7 and p' Xi < Wj. contradicting condition (ii) of Definition 16.D.1. 4. A similar observation applies, incidentally, 10 the definition of price equilibrium with transfers

x7 there is an

(Definition 16.B.4). If preferences are locally nonsatiated, we get an equivalent definition by not referring explicitly to the w:s and replacing part (ii) of the definition by (ii"): If x, >-, x~ then P'x j > P'x7- Thus. in this locally nonsatiated case, condition (ii") says that x~ is preference maximizing on {XI E Xi: p·.'t l S p' x7}.

x,

=

554

CHAPTER

16:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

PROPERTIES

SECTION

16.0:

THE

SECOND

FUNDAMENTAL THEOREM

OF

WELFARE

Budget

Seep 5: If Xi ?::" Xi for every j then p' (L, x,) ;:: r. Suppose that x, :::, xi for every

A price quasi-equilibrium that

x,

is not a price equilibrium.

Step 6: p' (Li xi) = P' (w + Lj yj) = r. Because of step 5, we have P' O::i xi) ~ r. On the other hand, Li xi = Lj yj + W E Y + (wi, and therefore P'(Li xi) s: r. Thus, p' (Li = r. Since Li xi = w + Lj yj, we also have P' (w + Lj yj) = r.

}-IX{. Beca use of steps 5 and 6, we ha ve P{Xi +

'~i xi) ~

I'

= p.(X!

+

'~I xi).

Proof: The idea of the proof is indicated in Figure 16.0.3 (where we take p'xi = Wi only because this is the leading case; the fact plays no role in the proof). Suppose that, contrary to the assertion of the proposition, there is an Xi >- I xi with p' Xi = Wi' By the cheaper consumption assumption, there exists an x; E XI such that P' x; < lVi. Then for all IX E [0, I), we have rzx i + (I - rz)x; E Xi and P·(tXXl + (1 - IX)X;) < WI" But if rt is close enough to I, the continuity of ?::'i implies that lXX, + (I - IX)X; >- i xi, which constitutes a contradiction because we have then found a consumption bundle that is preferred to xi and costs less than lVi. • Note that in the example of Figure 16.0.2, we have WI = 0 in the price quasicquilibrium supporting allocation x·, and so there is no cheaper consumption for (p, w.)." As a consequence of Proposition 16.0.2, we have Proposition 16.0.3.

Step 9: The lI'ealth levels IVI = p'xi for i = I, ... , I support (x·, y., p) as a price 411asic(luilihrium lVith transfers. Conditions (i) and (ii) of Definition 16.0.1 follow from steps 7 and 8; condition (iii) follows from the feasibility of the Pareto optimal allocation (x·, y.) . • In Exercise 16.0.2, you are asked to show that the local nonsatiation condition is required in Proposition 16.0.1. When will a price quasiequilibrium with transfers be a price equilibrium with transfers? The example in Figure 15.B.IO(a), reproduced in Figure 16.0.2, indicates that there is indeed a problem. Figure 16.0.2 depicts the quasiequilibrium associated with the Pareto optimal allocation labeled The unique price vector (normalizing PI = I) that supports x· as a quasiequilibrium allocation is p = (1,0); the associated wealth levels are WI = p·xt = (I,O)'(O,x!,) = 0 and W2 = P·x!. However, although the consumption bundle xt satisfies part (ii) of Definition 16.0.1 (indeed, p' x I 2: 0 = WI for any XI ~ 0), it is not consumer I's preference-maximizing bundle in her budget set {(x",x21)ER~:(1,0)'(x",x21)S:0} = {(XII'X21)ER~:

x·.

Xli

=

555

Figure 16.0.2 (lett)

Line

i. By local nonsatiation, for each consumer j there is a consumption bundle Xi arbitrarily close to Xi such that >-, Xi' and therefore Xi E V;. Hence, Li Xi E V, and so p'(L :li) 2: r, which, taking the limit as Xi -+ Xi' gives P'(L, Xi) ~ r. s

ECONOMICS

Proposition 16.0.3: Suppose that for every i, Xi is convex, 0 E Xi' and :::i is continuous. Then any price quasiequilibrium with transfers that has (w" ... , wd »0 is a price equilibrium with transfers.

O}.

An important feature of the example just discussed, however, is that consumer I's wealth level at the quasiequilibrium is zero. As we shall see, this is key to the failure of the quasiequilibrium to be an equilibrium. Our next result provides a sufficient condition under which the condition MXi>-IXi implies P'X i ~ lV i" is equivalent to the preference maximization condition" Xi >-i xi implies p' Xi > Wi'"

6. If, as in all our applications. ~j is locally nonsatiated and Wj = p·xi. then Proposition 16.0.2 olTers 5ufHcienl conditions for the eq uivalence of the statements" x7 minimizes expenditure relative to p in the s~l {''(j E Xj: X,~, x;}" and" xi is maximal for 2:, in the budget set lXiE X,: P·X . ~ p·xj}." 7. A similar argument can be used to show that if X, is convex and the Walrasian demand function xj(p, \\'j} is well defined, then there is a cheaper consumption for (P. w,) if and only if there is an x; arbitrarily close to xl(p. Wi) with p' x; < lVi . In the Appendix A, of Chapter 3 the latter concept was called the locall.v cheaper consumption condition. 8. Note also that Proposition 16.0.2 generalizes the result in Proposition 3.E.t(ii), which assumed local nonsatiation. Wi = p'x7 > O. and Xj = R';.

5. Geometrically. what we have done here is show that the set 1:1 {Xi E Xj: Xi;::j x7} is contained in the closure of V (see Section M.F of the Mathernalieal Appendix for this concepl), which, in turn. is contained in the half-space (v E ilL: p' v ~ r).

m

Ftgure 16.0.3 (right)

Suppose there exists a "cheaper consumption" (an x; E Xi such that p'x; < WI)' Then if the preferred set does intersect the budget set (p' XI :S WI for some xi>jx:), it follows that the preferred set does intersect the interior of the budget set (p·x; < \Vi for some

Xi >-IXn.

CHAPTER

11:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

PROPERTIES

Consider the implications of Proposition 16.0.3 for a pure exchange economy in which W »0 and every consumer has X, = R~ and continuous, locally nonsatiated preferences. In such an economy, by free disposal and profit maximization, we must have p ~ 0 and P ~ 0 at any price quasiequilibrium 9 Thus, under these assumptions, any price quasiequilibrium with transfers in which x~ »0 for all i is a price equilibrium with transfers (since then = p' x,' > 0 for all iJ. But there is more. Suppose that, in addition, preferences are strongly monotone. Then we must have p » 0 in any price quasiequilibrium with transfers. To see this, note that p ~ 0, p ~ 0, and w » 0 imply that L, w, = p' 'v > 0 and therefore that w, > 0 for some i. But by Proposition 16.0.2, this consumer must then be maximizing her preferences in her budget set {x, E R~: P' x, :0;; w,}, which, by strong monotonicity of preferences, cannot occur if prices arc not strictly positive. Once we know that we must have p »0, we can conclude that u,(x i ) for some i. But (u; •. . "ui) E U only if there is a feasible allocation (x', y') such that ui(xi) ;;:: ui for all i. It follows then that (x'. y) Pareto dominates (x. y). Conversely. if (x. y) is not a Pareto optimum. then it is Pareto dominated by some feasible (x', y'). which means that Ui{Xi) ~ u,(x i ) for all i and ui(xi) > "i(Xi) for some i. Hence. (u,(x,), ...• u,(x,» f UP . • We also note that if every Xi and every lj is convex. and if the utility functions IIi ( .) arc concave, then the utility possibility set U is convex (see Exercise 16.E.2).1l One such utility possibility set is represented in Figure 16,E.2. Suppose now that society's distributional principles can be summarized in a social welfare /ullctioll W(u" ... , u,) assigning social utility values to the various possible vectors of utilities for the J consumers. We concentrate here on a particularly simple class of social welfare functions: those that take the linear form W(u, •...• u,) =

such that

L i.;Ui

ui(x,) for i = I•... , J}. 12. However, nol every poinl in the boundary must be Pareto optimal. Go back, for example, to Figure 16.C.1: The ulility values associated with x· belong to the boundary of the utility possibility set because it is impossible to make both consumers better off. Yet. x· is not a Pareto optimum. 13. II can be shown that under a mild technical strengthening of the strict convexity assumption on preferences (essentially the same condition used to guarantee differentiability of the Walrasian demand function in Appendix A of Chapter 3), there are in the family of utility functions ",(.) that represent ;::j some utility functions that are not only quasiconcave but also concave.

II. Two faclS eSiablished in Chapter 17 lend plausibility to this claim. First, in Section 17.1, WI! show that convexity is not required for the (approximate) existence or a Walrasian equilibrium In a l'lrgc economy. Second. in Section I7.C, we argue that the second welfare theorem can be rcphnlscd as an assertion of the existence or a Walrasian eqUilibrium ror economies in which endowments are distributed in a particular manner, and it can therefore be seen as implied by the conditions guaranteeing the general existence of Walrasian equilibria.

I

Figure 16.E.2 (right)

A convex utility possibilily sel.

560

CHAPTER

16:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

SEC T ION

PROPERTIES

",

1 I. F!

FIR S T - 0 R DE R

CON 0 I T ION S

FOR

P A A E TOO P TIM A LIT y

561

x,

Figure 16.E.4 Ftgur. 16.E.3

Maximizing the utili! y or a representative consumer.

Maximizing a linear social welfare function.

for some constants i. = p., ..... i.,).'4 Lelling 1/ = (u, •...• U,). we can also write W(I/) = i. '1/. Because social welfare should be nondecreasing in the consumer's utility levels. we assume that i. :2: O. Armed with a linear social welfare function, we can select points in the utility possibility set U that maximize our measure of social welfare by solving Max w.u

i.·I/.

(16.E.I)

Figure 16.E.3 depicts the solution to problem (16. E.I). As the figure suggests. we ha ve the result presented in Proposition 16.E.2. Proposition 16.E.2: If u· = (u1, . ..• u7) is a solution to the social welfare maximization problem (16.E.1) with i.» O. then u· E UP; that is, u· is the utility vector of a Pareto optimal allocation. Moreover. if the utility possibility set U is convex. then for any ii = (ii, • ...• ii,) E UP, there is a vector of welfare weights i. = (i." ... , i.,):2: O. i. #< 0, such that i.·ii:2: ).·u for all UE U. that is. such that ii is a solution to the social welfare maximization problem (16.E.1). Proof: The first part is immediate: if u· were not Pareto optimal, then there would exist a /I E U with u:2: u' and u #< u*; and so because ).» 0, we would have i.'u>i.'fl*, For the second part, note that if ii E UP. then ii is in the boundary of U. By the supporting hyperplane theorem (see Section M.G of the Mathematical Appendix), there exists a i. #< 0 such that i.. ii :2: i.' u for all u E U. Moreover, since the set U has been constructed so that U - R'+ c U, we must have ). :2: 0 (indeed, if )" < O. then by choosing a u E U with u, < 0 large enough in absolute value, we would have i.·/1 > i.' ,i). • Proposition 16.E.2 tells us that for economies with convex utility possibility sets, there is a close relation between Pareto optima and linear social welfare optima: Every linear social welfare optimum with weights i. » 0 is Pareto optimal, and every Pareto optimal allocation (and hence. every Walrasian equilibrium) is a social welfare optimum for some welfare weights V." ... , i.,) :2: 0.15 14. See Chapter 22 for a discussion of more general types of social welfare functions. 15. The necessity of allowing for some i' l to equal zero in the second part of this statement parallels the similar feature encountered in the characterization of efficient production vectors in

Proposition 5.F.2.

As usual. in the absence of convexity of the set U. we cannot be assured that a Pareto optimum can be supported as a maximum of a linear social welfare function. The point ,i in Figure 16.E.1 provides an example where it cannot. fly using the social welfare weights associated with a particular Pareto optimal allocation (perhaps a Walrasian equilibrium), we can view the latter as the welrare optimum in a certain single-consumer. single-firm economy. To sec this, let (x·, y.) be a P.ucto optimal allocation and suppose that i. = 0.1 .... ' i.,)>> 0 is a vector of welfare weights supporting U at (ul(xn .... ,II,(xi)). Define then a utility runction u).(.X) on aggregate consumption vectors in

X=

L,

X;

C

R " by u,(.> 0 for all YjE !;IL. The meaning of the last condition is that if fit)~ = O. so that Yj is in the transformation frontier of Y;, then any attempt to produce more of some output or use less of some input makes the value of Fj (') positive and pushes us out of Y; (in other words, YJ is production ellicient. in the sense discussed in Section 5.F. in the production set y;).17 Note that. for the moment, no convexity assumptions have been made on preferences or production sets. The problem of identifying the Pareto optimal allocations for this economy can be reduced to the selection of allocations

Max

".F:

Xli:

iJu 0, --- 1', { i

OXfi

Problem (16.F.I) states the Pareto optimality problem as one of trying to maximize the well-being of consumer I subject to meeting certain required utility levels for the other consumers in the economy [constraints (I)] and the resource and technological limitations on what is feasible [constraints (2) and (3). respectively]. By solving problem (16.F.I) for varying required levels of utility for these other consumers ('1, •...• iii)' we can identify all the Pareto optimal allocations for this economy. Indeed. you should pause to convince yourself of this by solving Exercise 16.F.1.

oF.

:0;

0

= 0

Pt - Yj ---L = 0 oYtj

.

If Xli > 0

for all i,

t,

(16.F.2)

for all j,

t.

(16.F.3)

As is well known from Kuhn-Tucker theory (see Section M.K of the Mathematical Appendix), the value of the multiplier Pt at an optimal solution is exactly equal to the increase in consumer J's utility derived from a relaxation of the corresponding constraint, that is. from a marginal increase in the available social endowment wt of good t. Thus. the multiplier Pt can be interpreted as the marginal value or "shadow price" (in terms of consumer I's utility) of good t. The multiplier Ii" on the other hand, equals the marginal change in consumer J's utility if we decrease the utility requirement Ui that must be met for consumer i"f. I. Condition (16.F.2) therefore says that, at an optimal interior allocation, the increase in the utility of any consumer i from receiving an additional unit of good t. weighted (if i "f. I) by the amount that relaxing consumer i's utility constraint is worth in terms of raising consumer I's utility, should be equal to the marginal value PI of good t. Similarly, the multiplier Yj can be interpreted as the marginal benefit from relaxing the jth production constraint or. equivalently, the marginal cost from tightening it.

Exercise 16.F_I: Show that any allocation that is a solution to problem (16.F.I) is Pareto optimal and that any Pareto optimal allocation for this economy must be a solution to problem (16.F.I) for some choice of utility levels (ii, •...• iii)' [Him: Use the fact that preferences are strongly monotone.] Because utility functions are normalized to take nonnegative values. from now on we consider only required utility levels that satisfy U, ~ 0 for all i. The point of Exercise 16.F.I can be seen by examining the utility possibility set U in Figure 16.F.1. If we fix a required nonnegative utility level for consumer 2. we can locate a point on the frontier of the utility possibility set U by maximizing

18. Recall that for expositional ease we are not imposing any boundary constraints on the vectors h We note also that the assumption of strictly positive gradients of the functions uc( .} and fj( .} implies that the constraint qualification for the necessity of the Kuhn-Tucker conditions is satisfied. (See Section M.K of the Mathematical Appendix for the specifics of first-order conditions for optimization problems under constraints.)

17. For expositional convenience. we have taken every FJ(') to be defined on the entire R'·. A consequence of this (and the assumption that VF;(y/)» 0 for all YJ} is that every commodity is both an input and an output of the production process. Because this is unrealistic, we emphasize that no more than expositional ease is involved here.

i

563

564

CHAPTER

11:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

Hence. yPJFj/vYt;) is the marginal cost of increasing Ytl and thereby effectively tightening the constraint on the net outputs of the other goods. Condition (16.F.3) says. then. that at an optimum this marginal cost is equated. for every j. to the marginal benefit Jlt of good t. If we suppose that we have an interior solution (i.e .• x,» 0 for all i). then conditions (16.F.2) and (16.F.3) imply that three types of ratio conditions must hold (see Exercise 16.F.3):

(Ju;/Dx" iJu;/iJx n

for all i. i'. f. I'.

(16.FA)

t.1'.

(16.F.5)

for all i.j.t.I'.

(16.F.6)

for allj./.

cJFj"/iJY{"j"

iJll;/(:X~, =J!I/~Ytj

cJU;/(lx{",

vFI/vYf"1

Condition (16.F.4) says that in any Pareto optimal allocation. all consumers marginal rates of substitution between every pair of goods must be equalized [sec Figures 15.B.II(b) and 15.B.12 for an illustration in the two-good. two-consumer case]; condition (16.F.5) says that all firms' marginal rates of transformation between every pair of goods must be equalized [see Figure IS.D.2(b) for an illustration in the two-good. two-firm case]; and condition (16.0.6) says that every consumer's marginal rate of substitution must equal every firm's marginal rate of transformation for all pairs of goods [see Figure 15.C.2 for an illustration in the case of the one-consumer. one-firm model with two goods]. Conditions (l6.F.4) to (16.F.6) correspond to three types of efficiency embodied in a Pareto optimal allocation (see Exercise 16.F.4).

(16.F.7) •.•

XLi) ;;,

(2) L,xl/:!;;x,

u,

Fltri

PARETO

(16.F.9)

s.t. }', 5 fCY, •...• y,).

To explore the relationship of the first-order conditions (16.F.2) and (16.F.3) to the first and second welfare theorems. we make the further. and substantive. assumption that every 11,(') is a quasiconcave function (hence. preferences are convex) and that every fj(') is a Convex function (hence. production sets are convex). The virtue of this assumption is that with it we do not have to worry about second-order conditions; in all thc maximization problems to be considered. the first-order nceessary conditions arc automatically sufficient. In this differentiable. convex framework. conditions (16.F.2) and (16.FJ) can be uscd to establish a version of the two welfare theorems. To see this. note first that (x*, .1'*, p) is a price equilibrium with transfers (with associated wealth levels 1\', = p' for i = I •...• I) if and only if the first-order conditions for the budgetconstrained utility maximization problems

xr

and the profit maximization problems Max

(=

s.t. Fj(Yj)

#~"

I, ...• L.

,"Xli

(16.F.8) (n ..... .,))

{=

2•...• L

j = I •... • J.

The first-order conditions for this problem lead to condition (16.F.5).

P'Yj :!;;

0

are satisficd. Denoting by 7, and PI the respective multipliers for the constraints of these problems. the first-order conditions [evaluated at (x'. Y')] can be written as follows:

i = 2, ...• 1

(ii) Elficienr productioll across tec/tllologies. The aggregate production vector should be elficienr in the sense discussed in Section 5.F. That is, it should be impossible to reassign production plans across individual production sets so as to produce, in the aggregate, more of a particular output (or use less of it as an input) without producing less of another. Focusing. in particular. on the first good. this means that given required total productions (Yl' ... ,yc! of the other goods. we want to solve

(2) Fj(y):!;; 0

u«O, + y, ..... '0,. + h)

Max

_~,Pt

The first-order conditions for this problem lead to condition (I6.F.4).

s.t. (I) Lj Y'j;;' }',

CONDITIONS

(iii) Optimal agf/regllle production levels. We also must have picked aggregate production levels that generate a desirable assortment of commodities available for consumption. Keeping the utility requirements (", •...• ti,) fixed. let u(.i, •...• xc! and J(Y, .. ' .. h) denote. respectively. the value functions for problems (I6.F.7) and (16.F.8). Then we want to solve

(i) Oprimal al/oeation oj available goods aeros., consumers. Given some aggregate amounts (.i" ...• xc> of goods available for consumption purposes, we want to distribute them to maximize consumer I's well-being while meeting the utility requirements (u ...• ",) for " consumers 2•... , I. That is. we want to solve

s.t. (I) u,(x Ii"

fIRST·ORDER

16.f:

The first-order conditions of this problem lead to condition (16.F.6).

vu,./cJx". ou,.fJx{",.

~!"I/VYt j = J!j"~~Y{j: cJFj/ 0

: 0 is feasible if

Example 16.G.2: Occupational Choice Suppose that every individual could, in principle, work either as a classics scholar or as an economics professor. But not all individuals are equally good at both things. A way to capture the different comparative advantage is to assume that for every individual i, there is an Il i ;:>: 0 measuring how many "effective hours of economics professorial services" it takes to produce "an effective hour of classical scholarship." A relatively low Ili indicates comparative advantage in classical scholarship. Suppose also that every individual i has an amount of professorial hours that she can supply; we assume that I professorial hour can produce I effective hour of economics professorial services or 1/':1. i effective hours of classical scholarship by individual i. There is a single consumption good on which the individual i can spend her earnings. It is important to be able to imbed this problem in our formal structure because we certainly want to be able to analyze how, for example, competitive labor markets will perform when individuals have occupational choices as well as choices about how mllcil labor to supply. This is how it can be done (it is not the only possible way): suppose we list consumption and effective hours supplied as a three-dimensional vector (c i , t d , t'i)' where CI is individual j's consumption and t" ~ 0 and t" ~ 0 are the effective hours spent working as a classics scholar and as an economics professor, respectively. Because the latter two quantities are supplies-that is, services offered by the individual to the market-we follow the convention of measuring them as negative numbers. We can then define the consumption set of individual j as XI = (c" t", 1.1 ):

II.G:

q ~ f(:),

,

LXI'

+:

=

w.,

and

q=

x,.

«x;., ...

z'»

It is Pareto optimal if there is no other feasible allocation ,. O.

(16.G.l)

(ii) For any i, xi is maximal for I '-'

574

CHAPTER

16:

EQUILIBRIUM

AND

ITS

BASIC

WELFARE

EX ERe,s E S

PROPERTIES

x,

Koopmans. T. (1957). Three Essays on the Stale of Economic Science. New York: McGraw-Hili. Lange, O. (1942). The foundation or welfare economics. Econometrica 10: 21S-228. Quinzii, M. (1992). Increasing Returns and Efficiency. New York: Oltrord University Press. Samuelson. P. (1947). FOUildarions of Economic AnalysiS. Cambridge. Mass.: Harvard University Press.

/(Y+ {w}) n R~

EXERCISES Figure 16.AA.l The set or reasible

16.C.t" Show that ir a consumption set Xi c RL is nonempty. closed. and bounded and the prererence relation 0 for some i and then argue that p » 0.]

REFERENCES

16.0.4C Consider a two·good exchange economy with r identical consumers. The consumption set is R!. the individual endowments are co e R! •• and the prererences are continuous and strongly mono lone but not necessarily convex. Argue that the symmetric allocation in which

Albis. M. (1953). Traile d'ecOIlOmie pure. Paris: Publications du CNRS. Arrow, K., and F. Hahn. (1971). Genrral Competitive Analysis. San Francisco: Holdcn·Day. Dcbrcu. G. (1959). Tllt'orr IIf Value, New York: Wiley.

L

>-, xr then

xr" is equivalent to the condition: "x~ is expenditure minimizing for the price vector Xj: xr }." p in the set

p' Xi ~ P'

Proof: Note that U = U' - IR~ where E

xr

p'X; ~ Wi'"

The case with several production sets is more delicate. and it is here that the irreversibility assumption comes to the rescue. Very informally. we can dcrive. as in the preceding paragraph. the boundedness of feasible aggregate productions and feasible individual consumptions. Now. the only way that unboundedness would be possible at the individual production level while remaining bounded in the aggregate is if. so to speak. the unboundedness in one individual production plan was to be canceled by the unboundedness of another. However, this would imply that the collection of all technologies in the economy (i.e.• the aggregate production set) allows the reversal of some technologies (see Exercise 16.AA.3 for more details). Incidentally, it can also be shown that irreversibility, with the other assumptions. yields the closed ness of Y, so we do not actually need to assume this separately. _

U' = {(u,(x,), ... , U/(X/)): (x. Y)

xr

16.C.2' Suppose that the prererence relation 0 for some i and then apply Proposition 16.D.2.]

Now suppose there inc two consumers and that their preferences are identical to those above. One owns all of the land and the other owns all of the labor. In this society, arbitrary lump-sum taxes arc not possible. It is the law that any deficit incurred by a public enterprise must he covered by a tax on the value of land. (d) In appropriate notation, write the transfer from the landowner as a function of the government's planned production of education.

(0) Find a marginal cost price equilibrium for this economy where transfers have to be compatible with the transfer function specified in (d). Is it Pareto optimal?

16.E.2K Show that the utility possibility set U of an economy with convex production and consumption sets and with concave utility functions is convex.

16.AA.1 A Show that if every Xi and every lj is closed, then the set A of feasible allocations is closed.

16.F.l" In text. 16.F.2A Derive the first-order conditions (16.F.2) and (I6.FJ) of the maximization problem (16.F.I).

16.AA.2K Show that( Y + (w}) ('\ R~ is compact if the following four assumptions are satisfied: (i) Y is closed, (ii) l' is convex, (iii) 0 E Y, and (iv) if v E Y ('\ R~ then v = O. Exhibit graphically four examples showing that each of the four assumptions is indispensable.

16.F.3 A Derive conditions (16.F.4), (16.F.5), and (I6.F.6) from the first-order conditions (16.F.2) and (16.F.3).

16.AA.3" Suppose that Y = Y, + Y, c R'; satisfies the assumptions given in Exercise 16.AA.2 and that 0 E 1'" 0 E l',. Argue that if the irreversibility assumption holds for Y then (.I', E Y,: y, + y, + w ~ 0 for some y, E Y,} is bounded.

16.F.4 A Derive the first-order conditions (16.F.4), (16.F.5), and (16.F.6) from problems (16.F.7), (16.F.8), and (16.F.9), respectively. 16.G.1A Prove Proposition 16.G.1 using the first-order conditions (16.F.2) and (16.F.3). I6.G.2A In text. 16.G.3" Exhibit graphically a one-consumer, one-firm economy with two inputs and one output where at the (unique) marginal cost price equilibrium, cost is 1101 minimized. [/lillt: Choose the production function to violate quasi concavity.] 16.G.4" Show that under the general conditions of Section 16.G if there is a single consumer (perhaps a normative representative consumer) with convex preferences, then there exists at least one marginal cost price equilibrium that is an optimum. 16.G.S" In a certain economy there are two commodities. education (e) and food (f), produced by using labor (L) and land (D according to the production functions e = (Min (L, T})'

and

j=(LD'/'

.....

577

C

The Positive Theory

HAP

T

E

R

SEC T ION

17

'7 • B:

E QUI LIB" I U M:

D E FIN I T ION SAN DBA SIC

the three sections is the role of two sufficient conditions: the weak axiom of revealed preferellce ill Ille aggregale (a way of saying that wealth effects do not cancel in the aggregate the positive influence of the substitution effects), and the property of gruss slIhslilllliulI (a way of saying that there are not strong complementarities among the goods in the economy). In Section 17.1, we return to the role of convexity in guaranteeing the existence of Walrasian equilibrium. We qualify this role by showing that nonconvexities that are "small" relative to the aggregate economy (e.g., the indivisibility represented by a car) are not an obstacle to the (near) existence of equilibria, even if they are "large" from the standpoint of an individual agent. This chapter is of interest from both methodological and substantive points of view. From a substantive standpoint, it deals with an important theory: that of Walrasian equilibrium. Methodologically, the qucstions that we ask (e.g., does an equilibrium exis!"! Are the equilibria typically isolated? Is the equilibrium unique? Is it stable? What arc the effects of shocks') and the techniques that we use are questions and techniques that arc of relevance to any theory of equilibrium.

of Equilibrium

17.A Introduction III this chapter. we study the theoretical predictive power of the Walrasian equilibrium model. Thus. in contrast with Chapter 16. our outlook here is positive rather than

17.B Equilibrium: Definitions and Basic Equations

Ilormative.

The concept of a private ownership economy was described in Section 16.B. In such an economy. there are I consumers and J firms. Every consumer i is specified by a consumption set Xi c R/.• a preference relation ,.illlll presented in Section 16.B. We then introduce the notion of an ayyregllle ncess 0 such that Z/(P) > -5 for every commodity If pn _ p, where p # 0 and PI = 0 for some t, then

Max {z,(pn) .... , zdpn)} _

DEFINITIONS

AND

BASIC

t and all

defined for some p» [because we may have 11j(p) = 00 for some j). Nevertheless. an equilibrium price vector is still characterized by i(p) = 0. When production sets are not strictly convex, matters become more complicated because the correspondences Yj(p) may no longer be single-valued. Indeed, a production situation of considerable theoretical and practical importance-and one lhat we certainly do not want to rule out by assumption-is the case of constant returns to scale. With constant returns, however, production sets are neither strictly convex nor bounded above (except for the trivial case in which no positive amount of any good can be produced). In principle, we could still view the equilibria as the zeros of a "production inclusive excess demand correspolldellce," defined as in (17.B.3) for a subset of strictly positive prices.· Correspondences, however, do not make good equational systems (e.g., they cannot be differentiated). It is therefore usually much more convenient in such cases to capture the equilibria as the solutions of an extended system of equations involving the production and the consumption sides of the economy. We illustrate this idea in the small type discussion that follows.

p.

00.

Proof: With the exception of property (v), all these properties are direct conseq uenees of the definition and the parallel properties of demand functions.' The bound in (iv) follows from the nonnegativity of demand (i.e., the fact that X, = R~), which implies that a consumer's total net supply to the market of any good t can be no greater than his initial endowment. You are asked to prove property (v) in Exercise 17.B.2. The intuition for it is this: As some prices go to zero, a consumer whose wealth tends to a strictly positive limit [note that, because P-(L, w.l > 0, there must be at least one such consumer] and with strongly monotone preferences will demand an increasingly large amount of some of the commodities whose prices go to zero (but perhaps not of all such commodities: relative prices still matter). _

To sc.::c how an extended system of equations can be constructed, consider the case in which

production is of the linear activity type (this case is reviewed in Appendix A of Chapter 5). Say that, in addition to the dispos,,1 technologies, we have J basic activities ii" ••. , ilJ E R /·.

°

Finally, note that because of Walras' law, to verify that a price vector I' » clears all markets [i.e., has :/(1') = for all t] it suflices to check that it clears all markecs hll/ 0" 0 for all f}.

We shall proceed in five steps. In the first two, we construct a certain correspondence /(.) from 6 to 6. In the third, we argue that any fixed point of /(.), that is, any 1'* with 1'* E /(1'*), has z(p*) = O. The fourth step proves that /(.) is convex valued and upper hemicontinuous (or, equivalently, that it has a closed graph). Finally, the fifth step applies Kakutani's fixed-point theorem to show that a 1'* with 1'* E f( 1'*) necessarily exists. For notational clarity, in defining the sct f(p) c 6, we denote the vectors that are clements of /(1') by the symbol q.

~

:(p).q'

for all q'E6}.

where s is the bound in excess supply given by condition (iv).'o In summary, for p' c10sc cnough to Boundary 6, thc maximal demand corresponds to some of the commoditics whose price is close to zero. Therefore, we conclude that, for large n, any q' E /(p') will put nonzero weight only on commodities whose prices approach zero. But this guarantees p'q = 0, and so q E f( ,,).

= (q E 6: q, = 0 if z,(p) < Max {z,(,,), ... , zdp)}}.

5/ OJ.

Because 1'1 = 0 for some f, we have f(p) '" 0. Note also that with this construction, no price from Boundary 6 can be a fixed point; that is, I' E Boundary 6 and p E f(p) cannot occur because P'P > 0 while P'q = 0 for all q E f(p).

9. Noto also Ihat ror 'my p E /'J., the set I( p) is always a race or the simplex /'J.; that is, it is One or the subsets or /'J. spanned by a finite subset or unit coordinates. For P E Boundary /'J., I(p) is the race or /'J. spanned by Ihe zero coordinates or p. For p E Interior /'J., I(p) is the race spanned by the coordinates corresponding to commodities with maximal excess demand. 10. In words. the last chain of inequalities says that the expenditure on commodity {is bounded because it has to be financed by. and thererore cannot be larger than, the bounded value of excess supplies.

51 o. Proof: Because of homogeneity of degree zero we can restrict our search for an equilibrium to the unit simplex 11 = {p E IR\: 2:1 PI = I}. Define on 11 the function z+(·) by zt(p) = Max {ZI(p),O}. Note that z+(·) is continuous and that z+(p)'z(p) = 0 implies z(p):;; O. Denote alp) = 2:1 [PI + z;(p)]. We have alp) ~ I for all p. Define a continuous function f(·) from the closed, convex set 11 into itself by

f(p) = [l/a(p)](p

E

+ z+(p)). The second welfare theorem of Section 16.0 can be seen as a particular case of the current existence result. To see this, suppose that ., = (x" ... , x,,) is a Pareto optimal allocation of a pure exchange economy satisfying the assumptions leading to Proposition 17.C.1. Then, by Proposition 17.C.t, a Walrasian equilibrium price vector" and allocation .i = (. 0 such that if p' "" P. pi = PL = 1. and lip' - pil < c, then z(p') "" O. Moreover, if the economy is regular. then the number of normalized equilibrium price vectors is finite.

15. For any number" '" 0, sign" =

... v .... A,

5 I:. C T

16. This result was first shown by Dierker (1972). 17. For advanced treatments on the topic of this section, refer to Balasko (1988) or Mas-Colell (InS).

-I according to whether" > 0 or " < O.

1.

o!:tJ

594

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

SECTION

LOCAL

UNIQUENESS

AND

THE

INDEX

f(·; q) if q' is close to q. Hence, the notion that the regularity of a system f(·; q) = 0 is typical, or generic, could be captured by demanding that for almost every q, f(·; q) = 0 be regular; in other words, that nonregular systems have probability zero of occurring (with respect to say, a nondegenerate normal distribution on RS).'9 It stands to reason that some condition will be required on the dependence of f(·; q) on q for this to hold. At the very least, f(·; q) has to actually depend on q. The important mathematical theorem to be presented next tells us that little beyond this is needed. 20

f'(u): Perlurbed S~stem

I(d

17.D:

Figure 17.0.4 The regular case is typical.

Proposition 17.0.3: (The Transversa/iCy Theorem) If the M x (N + S) matrix Of(v; q) has rank M whenever f(v; q) = 0 then for almost every q, the M x N matrix OJ(v; q) has rank M whenever f(v; q) = O.

description of the solution set. In particular, if M > N, the system should be overdetermilled and have no solution; if M = N, the system should be exactly determined with the solutions locally isolated; and if M < N, the system should be underdetermilled and the solutions not locally isolated. Clearly, all these statements are not always true (you can see this just by considering examples with linear equations). So, what does it mean to be in the "normal case"'? The implicit function theorem provides an answer: one needs the equations (which we assume are differentiable) to be independent (that is, truly distinct) at the solutions. Definition 17.0.3 captures this notion.

Heuristically, the assumption of the transversality theorem requires that there be enough variation in our universe. If Df(v; q) has rank M whenever f(v; q) = 0, then from any solution it is always possible to (differentially) alter the values of the function f in any prescribed direction by adjusting the v and q variables. The conclusion of the theorem is that, if this can always be done, then whenever we are initially at a nonregular situation an arbitrary random displacement in q breaks us away from nonregularity. In fanciful language, if our universe is nondegenerate, then so will be almost every world in it. Note one of the strengths of the theorem: the matrix Df(v; q) has M rows and N + S columns. Hence, if S is large, so that there are many perturbation parameters, then the assumption of the theorem is likely to be satisfied; after all, we only need to find M linearly independent columns. On the other hand, D../(v; q) has M rows but only N columns. It is thus harder to guarantee in advance that at a solution D.f(v; q) has M linearly independent columns. But the theorem tells us that this is so for almost every q. Observe that if M > N (more equations than unknowns), then the M x N matrix D.f(v; q) cannot have rank M. Hence, the theorem tells us that in this case, generically (i.e., for almost every q). f(v; q) = 0 has no solution.

Definition 17.0.3: The system of M equations in N unknowns f(v) = 0 is regular if rank Of(v) = M whenever f(v) = O. For a regular system, the implicit function theorem (see Section M.E of the Mathematical Appendix) yields the existence of the right number of degrees of freedom. Ir M < N, we can choose M variables corresponding to M linearly independent columns of Df(v) and we can express the values oC these M variables that solve the M equations f(v) = 0 as a Cunction oC the N - M remaining variables (see Exercise 17.0.2). Ir M = N, equilibria must be locally isolated for the same reasons as discussed earlier in this section for the system zIp) = O. And iC M > N, then rank Df(v):s; N < M Cor all v; in this case, Definition 17.0.3 simply says that, as a matter of definition, the equation system f(v) = 0 is regular if and only if the system admits no solution. It remains to be argued that the regular case is the "normal" one. Figure 17.0.4 suggests how this can be approached. In the figure, the one-i:Quation, one-unknown system f(v) = 0 is not regular [because of the tangency point of the graph of f(·) and the horizontal axis]. But clearly this phenomenon is not robust: if we slightly perturb the equation in an arbitrary manner [say that the shocked system is /,(. )], we get a regular system. On the other hand, the regularity of a system that is already regular is preserved for any small perturbation.'· This intuitive idea of a perturbation can be formalized as follows. Suppose there are some parameters q = (q" ... , qs) such that, for every q, we have a system of equations f(v; q) = 0, as above. The set of possible parameter values is RS (or an open region of RS ). We can then justifiably say that f(·; q') is a perturbation of

Let us now specialize our discussion to the case of a system of L - I excess demand equations in L - I unknowns, i(p) = O. We have seen by example that nonregular economies are possible. We wish to argue that they are not typical. To 19. More formally. we could say that in a system defined by finitely many parameters (taking values in. say, an open set) a property is generiC in the first sense if it holds for a set of parameters of full measure (i.e.• the complement of the set for which it holds has measure zero). The property is gelwrk ;11 the .~econd .~ense if it holds in an open set of full measure. A full measure set is dense but it need not be open. Hence. the second sense is stronger than the first. Yet in many applications (all of ours in fact). the property under consideration holds in an open set, and so genericity in the first sense automatically yields genericity in the second sense. In some applications there is no finite number of parameters and no notion of measure to appeal to. In those cases we could say that a property is generic in the third sense if the property holds in an open and dense set. When no measure is available, this still provides a sensible way to capture the idea that the property is typical; bUI it should be nOled that with finitely many parameters a set may be open, dense and have arbitrarily small (positive) measure. In this entire section we deal with genericity in the first sense, and we simply call it genericity. 20. for this theorem, we assume that f(v; q) is as many times differentiable in its two arguments as is necessary.

18. The perturbation should control the values and Ihe derivatives of the funclion. In technical language. it should be a C' perturbation.

1

THEOREM

595

596

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EOUILIBRIUM

SEC T ION

do so, we could resort to a wide variety of perturbation parameters influencing preferences or endowments (or, in a more general setting, technologies). A natural set of parameters are the initial endowments themselves:

L 0 CAL

U N I QUE N E S SAN 0

THE

I N 0 EXT H E 0 REM

Because of the index theorem, this picture, in which the number of equilibria changes discontinuously from 3 to 1 at some points in the space of endowments is typical of the multiple-equilibrium case. A very extensive analysis of this equilibrium set has been carried out by Balasko (1988).

We can write the dependence of the economy's excess demand function on endowments explicitly as zIp; w). We then have Proposition 17.0.4.

We conclude the discussion of genericity with two observations: First, the generic local determinateness of the theory extends to cases with externalities, taxes, or other "imperfections"

Proposition 17.0.4: For any p and w, rank D.,i(p; w) = L - 1.

leading to Ihe failure or the first welfare theorem. (See Exercise 17.0.6.) This should be clear from the generality of the malhematical techniques which. in essence. rely only on the ability 10 express the equilibria of the theory as the zeros of a natural system of equations with Ihe

Proof: It suffices to consider the endowments of a single consumer, say consumer I, and to show that the (L - 1) x L matrix D."z(p; w) has rank L - 1 [this implies that rank D.,i(p; w) = L - 1]. To show this, we can either compute D.. ,z(p; w) explicitly (Exercise 17.0.3) or simply note that any perturbation of w,' say dw" that leaves the wealth of consumer 1 at prices p unaltered will not change demand and therefore will change excess demand by exactly -dw,. Specifically, if p'dw , = 0 then, denoting dw , = (dw ll , .•• , dWL_I.,), we have D.. ,i(p; w) dWI = D..,z,(p; w) dw , = -d such that zO(ji) = 0 and (ii) signIDio(ji)1 = (_I)c-I. For example, zO(p) could be generaled from a single-consumer Cobb-Douglas economy (Exercise 17.0.8). The idea is that fO( p) is both simple and familiar to us and that, as a consequence, we can use it to learn aboul the properties of the unfamiliar t( pl. Consider the following one· parameter family (in technical language, a homotopy) of excess

See Exercises 17.0.4 to 17.0.6 for variations on the theme of Proposition 17.0.4. In Figure 17.0.5, we represent the equilibrium set E = {(w" w 2 , PI): Z(PI' 1; w) = O} of an Edgeworth box economy with total endowment w = w, + w 2 • The set E is the graph of the correspondence that assigns equilibrium prices to economies w = (w" W2)'

demand functions:

P,

zip, I) = I:(p)

___""--;-----'7 0,

Figure 17.0.5 The equilibrium scI.

+ (I

- I)io(p)

for 0 :5 I :5 I.

Thc syslcm i(p, I) = 0 has L - I equations and L unknowns: (P,'··.' PC-I,I). Typically, Iherefore, the solution set £ = ((p,I): z(p, I) = OJ has one and only one degree of freedom al any of its points (that is, it looks locally like a segment). Moreover, since this solution set cannot escape to infinite or zero prices (because of the boundary conditions on excess demand) and is closed [because or the continuity of t(p, I)], it follows that the general situation is well represented in Figure 17.0.6. In Figure 17.0.6, we depict £ as formed, so to speak, by a finite number of circle·like and segmenl-like components, with the endpoints of the segments at Ihe I = 0 and I = I boundaries. Since Ihere arc two endpoints per segment, there is an even number of such endpoints. By construclion, (> is the only endpoint at the I = 0 boundary." Therefore, there must be an odd number of endpoints at the I = 1 boundary; that is, there is an odd number of solutions to :(p) = z(P. I) = O. Suppose now that we follow a segment from end to end. Whal

21. To be quite explicit, this means that the set of endowments that yield nonregular economies

is a subset or RLI that has (LI-dimensional) Lebesgue measure zero, or, equivalently. probability zero for. say, a nondegenerate LI·dimensional normal distribution.

22. More generally, if z(p: I) is an .rbilrary homolopy. then the typical situalion is well reprcscnled by any or the Figures 17.0.I(a), (b), or (c).

597

598

CHAPTER

17:

THE

POSITIVE

Index: +

THEORY

OF

EQUILIBRIUM

+)

I= I

FIgure 17.0.6

The equilibrium set under a homotopy.

is Iho rdalion belween Ihe indices al the two ends'? A moment's reflection (keeping the il11plicit function theorem in mind) reveals that as long as we move in a given

diroction relative to I (i.e., forward or backward), the index, (_I)L -, sign IDpi(p, 1)1, does not change. and that the index changes sign precisely when we reverse direction.2) Now, a segment that begins ~tnd ends at the same boundary must reverse direction an odd number or times; heneo. tho indices at the two endpoints have opposite signs. You can verify this in Figure 17.0.6. Therefore, the sum of the indices at I = I equals the index of the lone equilibrium of i(') connected by a segment to the equilibrium p oUo(.) at the boundary I = O. It is represented by in Figure 17.0.6. The segment that connects;; to p. in £ reverses directions an even number of times (possibly none); therefore, we conclude that the index of this equilibrium at I = I equals the index of p for to(.), which, by construction, is + I. Hence, the sum of the indices al I = I is + I, as Proposition 17.D.2 asserts to be true in complete generality.

"oz,(p) L...

,

,,*

ANYTHING

aI',

_ 0 1',-

GOES:

THE

SONNENSCHEIN-MANTEL-DEBREU

for all ( and I' [or Dz(p)p

" "z,(p) L... 1', - - = -Zt(p) , apt

= 0]

(17.E.I)

(17.E.2)

for all ( and p [or p'Dz(p) = -z(p)]

These arc the excess demand counterparts of expressions (2.E.I) and (2.E.4) for demand functions. They follow, respectively, from the homogeneity of degree zero and the Walras' law properties of excess demand. More interestingly, from z(p) = L; (x;(p, P'w;) - w;) we also get

17.E Anything Goes: The Sonnenschein-Mantel-Debreu Theorem

(17.E.3)

We have seen that under a number of general assumptions (of which the most substantial concerns convexity), an equilibrium exists and the number of equilibria is typically finite. Those are important properties, but we would like to know if we could say more, especially for predictive or comparative-statics purposes (see Section 17.G). We may well suspect by now (especially if the message of Chapter 4 on the difficulties of demand aggregation has been well understood) that the answer is likely to be negative; that is, that, in general, we will not be able to impose further reslrictions on excess demand than those in Proposition 17.8.2, and therefore that no further general restrictions on the nature of Walrasian equilibria than those already studied can be hoped for. Special assumptions will have to be made to derive stronger implications (such as uniqueness; see Section 17.F). I n this section, we confirm this and bring home the negative message in a particularly strong manner. The theme, culminating in Propositions 17.E.3 and 17.E.4, is: AII)'llrillg satisjying tire jew properties tlrat we have already shown must Iwld,

call

17.E:

The analysis that follows develops the logic of this conclusion through a series of intermediate results that have independent interest. Some readers may wish, in a first reading of this section, to skip these results and examine directly the statements of Propositions 17.E.3 and 17.E.4 and the accompanying discussion of their interpretations. To be specific, we concentrate the analysis, as usual, on exchange economies formalizcd by means of excess demand equations. Focusing on exchange economies makes sense because, as we know from Chapter 5, aggregation effects are unproblematic in production. The source of the aggregation problem rests squarely with the wealth effects of the consumption side. We begin by posing a relatively simple but nonetheless quite important question: To what extent can we derive restrictions on the behavior of excess demand at a given price p. In particular, we ask for possible restrictions on the L x L matrix of price effects DZ(p).24 Suppose that z(p) is a differentiable aggregate excess demand function. In Exercise 17.E.I, you are asked to show that

Five + Equilibria Pall = I

~----..

+ 1=0

SECTION

where, as usual, S;(p, P 'w;) is the substitution matrix (see Exercise I7.E.2). Expression (I7.E.3) is very instructive. It tells us that if it were not for the wealth effects, Dz( 1') would inherit the negative semidefiniteness (n.s.d.) property of the substitution matrices. How much havoc can the wealth effects cause? Notice that the matrix

i!!~, p'w,l z,,(p) ow,

D., ',(I', p·w.)Z,(p)T

=

...

iJxI,(p, P'w,) ZU> 0 and then choose a utility function that has OJ + z(p) as the

23. To see Ihis, think of the case where L = 2. Applying the implicit function Iheorem to i,(p,. t) = O. verify Ihen that a reversal of direction occurs precisely where OZ,(p"I)/iJp, = O.

demanded point

1

THEOREM

599

600

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

SECTION

direction of price change." Thus, we should expect that if I < L then there are some negative semidefiniteness restrictions left on Dz( pl. That this is the case is formalized in Proposition 17.E.1.

:L z,(p) =

0, at most I of the I

+

THE

= L (lip, )e'(p,a')

,

SONNENSCHEIN-MANTEL-DE8REU

THEOREM

601

= L, e' a' = A.

and so we would have accomplished our objective. Can we find these L consumers? The answer is "yes." Begin by choosing a collection of

I vectors,

endowments (w\ ... . , w,) yielding strictly positive consumptions when excess demands arc =,(p) = _p,(a')T; that is, x, = w,- p,(a')T» 0 for every i. Observe then that, for cvery i = I. ... , L, thc candidate individual excess demand satisfies Wal",s' law

can be linearly independent. Since I < L, it follows that we can find a nonzero vector dp E RL such that p·dp = 0 and z,(p)·dp = 0 for all i.ln words: dp is a nonproportional price change that is compensated (i.e., there is no change in real wealth) for every consumer. But then from (17.EJ) we obtain

L dp'S,(p, P'w,) dp 5: O.

GOES:

Dz(p) = - L D.. x,(p, P'w,)Z,(p)T

(P,ZI(P), ... ,z,(p)} c IRL

dp'Dz(p) dp =

ANYTHING

and

Proposition 17.E.1: Suppose that 1< L. Then for any equilibrium price vector p there is some direction of price change dp ¥ 0 such that p'dp = 0 (hence, dp is not proportional to p) and dp'Dz(p) dp 5: O. Proof: Because z( p) =

17.E:

P'z,(p)

= -p,p'a' = 0

(because Ap = 0),

and. also. that the candidate wealth effect vector satisfies the necessary condition of Proposition 2.E.3 p·D..,x,(p, P'w,)

•

=(llp,)p'e' = I.

Figure 17.E.1 should thcn be persuasive enough in convincing us that wc can assign prdacnccs to i = I, ...• L in such a way that the chosen consumption at p is Xi. the wealth efTect vector at p is proportional to f' (and therefore must equal (1Ip,Je')." and the indifference map has a kink at x,. The figure illustrates the complete construction for the case /. = 2" In Exercise 17.E.3, you are asked to write an explicit utility function . •

Parallel reasoning should make us expect that if I ;e: L (i.e., if there are at least as many consumers as commodities), then there may not be any restriction left on Dz(p) beyond (17.E.I) and (17.E.2). After all, thc direction of an individual wealth effect vector at a given price is quitc arbitrary (and can be chosen independently of the substitution effects of the corresponding individual); and with I ;e: L wealth effect vectors to be specified, there is considerable room to maneuver. Proposition 17.E.2 confirms this suspicion.

\"li

Figure 17.E.1

Decomposition of excess demand and

price effects at a price vector p (for L = 2).

Proposition 17.E.2: Given a price vector p, let z E RL be an arbitrary vector and A an arbitrary L x L matrix satisfying p'z = 0, Ap = 0 and p'A = -z. Then there is a collection of L consumers generating an aggregate excess demand function z(·) such that zIp) = z and Dz(p) = A. Proof: To keep the argument simple, we restrict ourselves to a search for consumers that at their demanded vectors have a null substitution matrix, SlIp, P'w,) = 0, that is, whose indifference sets exhibit a vertex at the chosen poin!." We can always formally rewrite the given L x L matrix A as A = Le'a',

,

where e' is the (th unit column vector (i.e., all the entries of e' are 0 except the (th entry, which equals I) and a' is the Ith row of A [i.e., at = (an, .. . , a,d)' Suppose now that we could specify L consumers, i = I, ... , L, with the property that, for every i, consumer i has, at the price vector p, an excess demand vector z,(p) = _p,(a')T, a wealth effect vector Dw,x,(p, P'w,) = (1/p,)e', and a substitution matrix S,(p, P'w,) = 0 (where 1 L l L 0 , ••. ,a and e , •.. ,e are as defined above). Then we would have both zIp)

= LZ,(P) = -LP,(a')T = _ATp = , ,

-p'A

27. Indeed. if Dxj(p, P'w j ) = (t/e i , then 1 = p'Dxj(p, P'w i ) = "iP'e l = rlfpj. Hence. (t, = lip;. 28. At no extra cost, we could actually accomplish a bit more. We could also require the substitution matrices of the consumers i = I•...• L to be any arbitrary collection of L x L matrices S, satisfying the properties: Sj is symmetric. negative semidefinite, p,Sj = 0, and SiP = O. The spccil1cation of consumers generating excess demand z( p) and excess demand effects D:(p) at p would proceed in a manner similar (0 the proof just given except that the argument would now be applied to A - L S,. By using matrices S, of maximal rank (i.e.• of rank L - I), we could insure that the resulting L consumers display smooth indifference sets at their chosen consumptions.

=z

25. For example. it cannot hurt in any direction of price change that is orthogonal to the weallh effects vector D..,xj(p. P"W j ) or to the excess demand vector Zj(p). A more precise argument is given

in Proposition 17.E.1. 26. The term "vertex" refers to what is usually called a "kink" in the case L

= 2.

i

602

CHAPTER

17:

THE

POSITIVE

THEORY

OF

SECTION

EQUILIBRIUM

17.E:

ANYTHING

GOES:

THE

SONNENSCHEIN-MANTEL-OEBREU

Up to now, we have studied the possibility of restrictions on the behavior of excess demand at a single price vector. Although the results of Propositions 17.E.! and !7.E.2 are already quite useful, we can go further. The essence of the negative point being made is, unfortunately, much more general. Consider an arbitrary function z(p), and let us for the moment sidestep boundary issues by having zIp) be defined on a domain where relative prices are bounded away from zero; that is, for a small constant £ > 0, we consider only price vectors p with pt!Pt. ;:: £ for every ( and t'. We could then ask: "Can z(·) coincide with the excess demand function of an economy for every p in its domai~?" Of course, in its domain, z(·) must fulfill three obvious necessary conditions: it must be continuous, it must be homogeneous of degrec zero, and it must satisfy Walras' law. But for any z{·) satisfying these three conditions, it turns out that the answer is, again, .. yes.""

{p

E R~:

ptfPt· ;::

£

Ftgure 17.E.2 preferences (in the case L = 2) for the offer curve of an excess

demand function :J') such that ZI( p) = 0 h", no solution with 1/& < PI/P, < 1:.

for every ( and t'} U!.

and with values in RL. Assume that, in addition, z(·) is homogeneous of degree zero and satisfies Walras' law. Then there is an economy of L consumers whose aggregate excess demand function cOincides with zIp) in the domain p'.'O

= tZI(P)

+r

the initial endowment point. We then see in the figure that no matter how complicated the

Strictly speaking, Proposition 17.E.3 docs not yet settle our original question, "0111 we assert ill1ything more about the equilibria of an economy than what we have derived in Sections 17.e and 17.D?" The problem is that Proposition 17.E.3 characterizes the behavior or excess demand away rrom the boundary, whereas it is the power or the boundary conditions that yields some or the restrictions we have already established: existence, (generic) finiteness, oddness. the index rormula.)( To argue that we cannot hope ror more restrictions than these on the equilibrium set, we need to guarantee that ir a candidate equilibrium set satisfies them, then the construction or the "explaining" economy will not add new equilibria. The result presented in Proposition 17.E.4, whose proor we omit, provides therefore the final answer to our question."

[accordingly, zHp) = -(PI/P,)Z:(p)] [accordingly, z~(p)

p.

olT(:r curve! m~ly otherwise be, we can always fit an indifference map so that for any p E ~ we generale precisely the demands flJ j + :i(p). •

and :;(p) = !ZI(P) - r

+ :I( p) is the intersection point of the offer curve with the budget line perpendicular to

The olTer curve is continuous and. because Zi(p) = 0 has no solution in p,. it does not touch

Proof: AI the end of this section, we olTer (in small·type) a brief discussion of the general proof of this result. Here, we limit ourselves to the comparatively simple ease where L = 2. Suppose then Ihat L = 2 and that an & > 0 and a function z(·) satisfying the assumption of the proposition are given to us. The continuity and homogeneity of degree zero of z(·) imply the existence of a number r> 0 such that IZI(p)1 < r for every peP,. We now specify two functions Zl(.) and z'(·) with domain P, and values in A', which are also continuous and homogeneous of degree zero, and satisfy Walras' law. In particular, we let z:(p)

603

Construction of

Proposition 17.E.3: Suppose that z(·) is a continuous function defined on

p. =

THEOREM

= -(PI/P,)Z;(p)].

Note that zIp) = Zl(p) + z'(p) for every peP,. We shall show that for i = 1,2 the function Zl(.) coincides in the domain P, with the excess demand function of a consumer. To this elTect, we usc the following properties of Zl(.): continuity, homogeneity of degree zero, satisfaction of Walras law, and the fact that there is no peP, such that Zl(p) = O. In Exercise 17.E.4, you are asked to show by example that this last requirement is needed. Choose a WI » 0 such that WI + Zl(p) » 0 for every pep'. In Figure 17.E.2, we represent the offer curve OCI associated with Zl(.) in the domain p,. In the figure, for every pep',

PropOSition 17.E.4: For any N ;:: 1, suppose that we assign to each n = 1, ... ,N a price vector pn, normalized to IIpnll = 1, and an L x L matrix An of rank L - 1, satisfying Anpn = 0 and pn'A n = O. Suppose thaI. in addition, the index formula Ln (_1)L -1 sign IAnl = + 1 holds.)) If L = 2, assume also that positive and negative index equilibria alternate. Then there is an economy with L consumers such that the aggregate excess demand z( .) has the properties:

29. The question was posed by Sonnenschein (1973). He conjectured that the answer was that. indeed, On the domain where PI ~ t for all It the three properties were not only necessary but also suil1cicnl; that is. we could always find such an economy. He also proved that this is so for the

(i) zIp) = 0 for Ilpll = 1 if and only if p = pn for some n. (ii) Dz(pn) = An for every n.

two·eommodilY case. The problem was then solved by Mantel (1974) for any number of commodities. Mantel made use of 2L consumers. Shortly afterwards, Debreu (1974) gave a different and very simple proof requiring the indispensable minimum of L consumers. This was topped by Mantel (1976), who refined his earlier proof to show that L homothetic consumers (with no

31. Note, for example, that although a candidate function z(·) defined on

p, may

not have any

solution. we can still successfully generate it from an economy. What happens. of course, is that the equilibria of the economy (which must exist) are all outside of p.:.

restrictions in their initial endowments) would do.

30. NOle. in particular, that this result implies that for any / ~ L, there is an economy of I consumers Ihat generates z(·) on p,. We need only add to the L eonsumers identified by the

32. For this and more general results, see Mas-Colell (1977). 33. Here. A" is the L - I x L - 1 matrix obtained by deleting one row and corresponding column from A.

proposition J - L consumers who have no endowments (or. alternatively. whose most preferred consumption bundle at all price vectors in p.: is their endowment vector).

1

604

CHAPTER

17:

THE

POSITIVE

THEORY

OF

-

EQUILIBRIUM

Proposition 17.E.4 tells us that for any finite collection of price vectors {pl •... , pN} and matrices of price effects {DZ(pl), ... , DZ(pN)}, we can find an economy with L consumers for which these price vectors are equilibrium price vectors and {DZ(pl), ... , DZ(pN)}. are the corresponding price effects at these equilibria. The result implies that to derive further restrictions on Walrasian equilibria we will need to make additional (and, as we shall see, strong) assumptions. This is the subject of the next three sections. An excellent survey for further reading on the topic of this section is Shafer and Sonnenschein (1982).

SECTION

11.E:

ANYTHING

uOES:

THE

SONNEHSCHEIN-MANTEL-OEBAEU

p Revealed Preferred to p' Weak Axiom Satisfied

Figure 17 .E.4

We should point out that the initial endowments of the consumers obtained by means of Propositions 17.E.2, 17.E.3 or 17.E.4 are not a priori limited in any way. If there are constraints on permissible initial endowments. the nonnegativity conditions on consumption come into play and there may, in fact, be other restrictions on the function z(·). For example, you are asked in Exercise 17.E.5 to verify that the excess demand vectors z( p) and z( p') represented in Figure 17.E.3 cannot be decomposed into individual excess demand functions generated by rational preferences if the amount of any commodity that any consumer may possess as an initial endowment is prescribed to be at most I and if consumptions must be nonnegative.

Revealed preference for excess demand.

z'(p) #< z'(p')

and

P'z'(p')!> 0

(17.E.4)

(see Figure 17.E.4). We say that p is indirectly re""altd preferred 10 p' if there is a finite chain p' ..... p' such that p' = p, p" = p', and p' is directly revealed preferred to p" I for all II !> N - I. The SA then says:

=,

For "1'"'.1' p "lid p'. if p is (direc·tly or indirectly) revealed preferred preJ O.

(17.E.4')

Suppose that .,(.) is an arbitrary real-valued function of p such that .,(p) > 0 for all p € P,. The hasic observation of the proof is then the following: if z'(·) is a proportionally one-to-on" l'xC('ss clt'mand /utlcliml that 5alisjies the SA, then lhe same propert;~s are trut of the function ,,(. )z'(·). Indeed, for any p and p' the revealed preference inequalities (17.E.4') hold for z'(·) if 1II1 O. This is precisely what we will now do. For every normalized p € P,. denote T, = {z € RL: p'Z = O} and for every i = I, ... , L, let :'( p) € Tp be the point that minimizes the Euclidean distance liz - e'li (or, equivalently, maximizes the concave "utility function" -liz - ill) for z € Tp , where e' is the ith unit vector (the column vector whose ith entry is I with zeros elsewhere). Geometrically, z'( p) is the perpendicular projection of e' on the budget hyperplane Tp; that is, z'(p) = e' - p, P. where p, is the ith component of the vector p (recall that i :s; L). Then z'(·) is proportionally one-to·one (see Exercise 17.E.6) and satisfies the SA (since it is derived from utility maximization; see also Exercise 17.E.7). Now let r > 0 be a large-enough number for us to have z(p) + rp » 0 for every normalized p € P, [such an r exists by the continuity of z( . ) and the fact that the set of normalized price vectors in P, is compact and includes only strictly positive price vectors]. For every i = I •...• L and every normalized p € P" define o,(p) = z,(p) + rp, > O. where z,(p) is the ith component

Proof of Proposition '7.E.3 continued: Although a complete proof of the proposition for the case of any number of commodities would take us too far afield. the essentials of the proof by Debreu (1974) are actually not too difficult to convey. We shall attempt to do so. We note that. when carefully examined, the proof can be seen as a generalization of the argument for the L = 2 case presented earlier. In Section 3.1. we saw that the strong axiom of revealed preference (SA) for demand functions is equivalent to the existence of rationalizing preferences. The same is true for excess demand functions: If an excess demand function z'(·) satisfies the SA (we will give a precise definition in a moment), then z'(-) can be generated from rational preferences." It is thus reasonable to redefine our problem as: Given a function z(') that. on the domain P" is continuous. homogeneous of degree zero, and satisfies Walras' law (for short, we refer to these functions as excess demand functions). can we find L excess demand functions z'(·). each satisfying the SA. such that L, z'(p) = z(p} for every p E P,? Before proceeding, let us define the SA for an excess demand function z'(·). The definition is just a natural adaptation of the definition for demand functions. We say that p is directly reveald preferred to p' if

34. We refer to the proof of Proposition 3.1.1 for the justification of this claim.

.L

605

,

606

CHAPTER

17:

THE

POSITIVE

THEORY

OF

SECTION

EQUILIBRIUM

p,

.~

/ -- O.

36. The sel Y can be thought as an aggregate produclion sel. The restriction that Y be of constant returns is made merely for convenience of exposition. It allows us. for example. not to worry about the distribution of profits to consumers (since profits are zero in any equilibrium). Note also that the constant returns model includes pure exchange as a special case (where y= -R';).

35. Reviews for this topic are Kehoe (1985) and (1991). and Mas-Colell (1991).

L

EQUILIBRIA

607

608

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

SECTION

17.F:

UNIQUENESS

OF

EOUILIBRIA

609

Proposition 17.F.2: Suppose that the excess demand function z(·) is such that, for any constant returns convex technology Y, the economy formed by z(·) and Y has a unique (normalized) equilibrium price vector. Then z(·) satisfies the weak axiom. Conversely, if z(·) satisfies the weak axiom then, lor any constant returns convex technology Y, the set 01 equilibrium price vectors is convex (and so, il the set of normalized price equilibria is finite, there can be at most one normalized price equilibrium). Figure 17.F.l

A violation of the weak axiom implies

multiplicily of equilibria for some Y.

In words, the definition says that if p is revealed preferred to p', then p' cannot be revealed preferred to p [i.e., z( p) cannot be affordable under p']. It is the same definition used in Sections I.G and 2.F, but now applied to excess demand functions.'" The axiom is always satisfied by the excess demand function of a single individual, but it is a strong condition for lI!JqreYlIre excess demand (see Section 4.C for a discussion of this point). We lirst note that, given:('), the WA is a necessary condition for us to be assured of a unique equilibrium for every possible convex, constant returns technology Y that :(.) is coupled with. To see this, suppose that the WA was violated; that is, suppose that for some p and p' we have z(p) # z(p'), p'z(p') ,.:; 0, and p'·z(p),.:; o. Then we claim that both p and p' are equilibrium prices for the convex, constant returns production set given by

Y'

I

= lYE R .: P'y,.:;

0 and P"y,.:; O}.

Figure 17.F.1 depicts this production set for the case L = 2. Note that we have :( p) E Y' and p' Y ,.:; 0 for every Y E Y·. Thus, by Proposition 17.F.I, p is an

Proof: The first part has already been shown. To verify the convexity of the set of equilibrium prices, suppose that p and p' are equilibrium price vectors for the constant returns convex technology Y; that is, z(p) E Y, z(p') E Y, and, for any )'E Y, P'y,.:; 0 and p"Y ,.:; O. Let p" = I1.p + (I - alp' for 11. E [0, I). Note, first, that p" •.\" = ap' Y + (I - a)p"J''':; 0 for any)'E r. To show that p" is an equilibrium, we therefore need only establish that :(1''') E Y. Because 0 = p",z(p") = I1.p,z(p") + (I - l1.)p'·z(p"), we have that either p·z(p"),.:; 0 or P"z(p")":; O. Suppose that the first possibility holds, so that P':(p")":; 0 [a parallel argument applies if, instead, p'·z(p"),.:; 0]. Since :(1') E Y we have p"·z(p),.:; O. But with p"':(p),,:; 0 and p·z(p")":; O. a contradiction to the WA can be avoided only if :(p") = :(p). Hence z(p") E y.lO •

We arc therefore led to focus attention on conditions on preferences and endowments of the I consumers guaranteeing that the aggregate excess demand function z( p) fulfills the WA. To begin with a relatively simple case, suppose that all the endowment vectors w, arc proportional among themselves; that is, that w, = (1,(ii, where w is the vector of total endowments and (1, ~ 0 are shares with (1, = I. I n such an economy, the distribution of wealth across consumers is independent of prices. Normalizing prices to P'w = I, the wealth of consumer I is (x, and =,(p) = x,(p, 11.,) - w,. The aggregate demand behavior ofa population of consumers with fixed wealth levels was studied in Section 4.C. We repeat our qualitative conclusion from there: if individual wealth levels remain fixed, the satisfaction of the WA by aggregate demand (or excess demand), although restrictive, is not implausible.'o

:L

equilibrium price vector. The same is true for p'. Since z(p) # z(p'), we conclude that the equilibrium is not unique for the economy formed by z(·) and the production set yo. What about sufficiency? The weak axiom is not quite a sufficient condition for uniqueness, but Proposition 17.F.2 shows that it does guarantee that for any convex, constant returns Y, rile set of equilibrium price vecrors ;s convex. Although this convexity property is certainly not the same as uniqueness, it has an immediate uniqueness implication: if an economy has only a finite number of (normalized) price equilibria (a generic situation according to Section 17.0),38 the equilibrium must be unique.

A proportionality assumption on initial endowments is not very tenable in a general equilibrium context. It is important, therefore. to ask which new effects are at work (relative to those studied in Section 4.C) when the distribution of endowments docs not satisfy this hypothesis. Unfortunately, it turns out that nonproportionality of endowments can reduce the likelihood of satisfaction of the weak axiom by aggregate excess demand. To see this, consider the relatively simple situation in which preferences arc homothetic. Recall from Sections 4.C and 4.0 that, when endowments arc proportional, this case is extremely well behaved; not only is the WA satisfied, but the model even admits a representative consumer. Yet, as we proceed to discuss

37. A formal, and inessential, difference is that we now define the revealed preference relation on the budget sets (i.e., on price vectors) directly rather than on the choices (i.e., on commodity

39. Observe thai we have cSlablished thai eilher :(p") = z(p) or :(p") = z(p'). Since this is true for any" E [0, I]. and since Ihe function z(') is conlinuous, this implies Ihal z(p) = z(p') for any

vectors).

two equilibrium price vectors p and p'; that is, if the WA holds for =('), then every Walrasian equilibrium for the given endowments must have the same aggregate consumption vector and. hence. the same Llggrcgate production vector.

38. Although our discussion in Section 17.D focused on the case of exchange economies, its conclusions regarding generic local uniqueness and finiteness of the equilibrium set can be extended to the present production context.

40. On this point. consull also Ihe references given in Chapter 4, especially Hildenbrand (t994).

-

610

C HAP T E R

1 7:

THE

PO SIT lYE

THE 0 R Y

0 F

E QUI LIB R I U M

SECTION 17,F:

UNtQUENESS

OF

EQUtLtBRIA

611

-------------------------------------------------------------------------------------------------below (in small type), even with homothetic preferences, the WA can easily be violated when endowments are not proportional.·'

Example 17.F.l: This is an example of a failure of the WA compatible with homotheticity and even with the property of gross substitution, which we will discuss shortly. Consider a

In Section 2.F we olfered a dilferential version of the WA for the case of demand functions. 1n a parallel fashion we can also do so for excess demand functions. It can be shown that a sufficient dilferential condition for the WA is

for only the first two goods; that is. he has an excess demand function Z,(P) = Z,(P" p,) that does not depend on p, and P. and. further. is such that z,,(p) = z.. (p) = 0 for all p. Similarly. consumer 2 has preferences and endowments for only the last two goods." We claim that if there is a price vector p' at which the excess demand of the two consumers is nonzero [i.e., z,(p') '" 0 and z,(p') '" 0]. then the aggregate excess demand cannot satisfy the WA. To sec this, choose (p" p,) and (p,. p.) arbitrarily, except that P,ZII(P') + P,Zll(P') < 0 and P,Zll(P') + P.z.,(p') < O. For a> O. take q = (p'" Pl' ap,. afi.) and q' = (ap" ap" Pl' p~). Then if a> 0 is sufticiently large. we have q,z(q') < 0 and q'·z(q) < 0 (Exercise 17.F.2). •

four·commodity economy with two consumers. Consumer I has preferences and endowments

dp'Dz(p) dp < 0 whenever dp'z(p) = 0 (i.e., whenever the price change is compensated) and dp is not proportional /0 p (i.e .• relative prices change).

(17.F.I)

Allowing for the first inequality to be weak, expression (l7.F.I) constitutes also a necessary condition. 42

Under the homotheticity assumption, we have

See Exercise 17.F.3 for yet another example.

I Dw-',(p. P'w,) = - - x,(p, p·w,). p.Wj

I

Denoting S,

= S,(p, P'w,), x, = . PI and Pk = Pk for k "f. t, we have Zk(P') > Zk(P) for k oF t.

41. To reinforce this point. it is also worth mentioning that. in fact. jf we are free to choose initial endowments, then the class of homothetic preferences imposes no restrictions on aggregate

If, as is the case here, we are dealing with the aggregate excess demand of an economy. then the fact that z(·) is also homogeneous of degree zero has the consequence that with gross substitution we also have z{(p') < zAp) whenever p' and pare related as in Definition 17.F.2. To see this, let p = a.p, where a. = PI/p{, Note that PI = PI and p, > p; for k "f. t. Then the homogeneity of degree zero of z(·)

demand. Indeed. as we noted in Section 17.E, the basic conclusion of Proposition 17.E.2 can still be obtained with the further restriction that preferences be homothetic. See Mantel (1976) and the su rvey of Shafer and Sonnenschein (1982). 42. Suppose that dp'z(p) = (p' - p),z(p) = O. Definition I7.F.1 implies then that dp'dz = (p' - p)'(z(p') - zIp)) s; O. Going to the dilferentiallimitand using the chain rule, it follows that dp·Oz(p)dp s; 0 whenever dp'z(p) = O. 41 But this cannot happen if the ,'(,(P, p·w . ) are collinear among themselves or if the collinear among themselves. See Exercise l7.F.1.

WI

are

44. Thus, this example can also be seen as a case of positive association between endowments and demands.

1.

01:'"

Ci:AP1ER

17.

THE

POSITIVE

THEORY

Of

~;

EQUILIBRIUM

z,

E (. T ION

( will respond positively to an increase in P•. But if response.'6 •

The offer Curve of a

"

Wli

UNIOUENESS

bl~

= 0, there will be no

Proof: It suffices that we show that z(p) = z( p') cannot occur whenever P and p' are two price vectors that are not collinear. By homogeneity of degree zero, we can assume that p' ~ p and p, = PI for some I. Now consider altering the price vector p' to obtain the price vector p in L - I steps, lowering (or keeping unaltered) the price of every commodity k '" I one at a time. By gross substitution, the excess demand of good I cannot decrease in any step, and, because P '" p', it will actually increase in at least one step. Hence, =/(P) > ZI(P') . • One might hope 10 establish uniqueness in economics with production by applying the as property to the production inclusive excess demand :(.). However, the direct use of the GS property in a production context is limited. Imagine. for example. a situation in which inputs

slIhslirllliolt. 45

Figure 17.F.2 represents the offer curve of a gross substitute excess demand function L = 2. As the relative price of good I increases, the excess demand for good I decreases and the excess demand for good 2 increases. An important characteristic ofthe gross substitute property, which follows directly from its definition, is that it is additive across excess demand Junctions. In particular, if the individual excess demand functions satisfy it, then the aggregate function does also.

and Olltputs ure distinct goods. If the price of an input increases, the demand for every other

input may decrease. not increase as the as property would require, simply because the optimal level of output decreases. Indirectly. though. the gross substitute concept may still be quite helpful. Recall. in particular. that at th. end of Section 17.B. we argu.d that it is always possible to

reduce a production economy to an exchange economy in which, in effect. consumers

c,change factor inputs and then engage in horne production using a freely availabl. constant returns technology. The aggregate excess demand in this derived exchange economy for factor inputs combines elements of both consumplion and production and may well satisfy the as property."

Example 17.1'.2: Consider a utility function of the form u,(x,) = L' UI/(x,,), [f -[xIiUi,(XIi)/ui,(xl/)] < I for all ( and Xli' then the resulting excess demand function z,(p) has the gross substitute property for any initial endowments (Exercise 17.F.5). This condition is satisfied by U,(X,) = (L, ~IiXt,)lJp for 0 < p < I (Exercise l7.F.5). The limits of these preferences as p -+ I and p .... 0 are preferences representable, respectively, by linear functions and by Cobb-Douglas utility functions (recall Exercise 3.C.6). As far as the gross substitution property is concerned, Cobb-Douglas preferences constitute a borderline case. Indeed, the excess demand function for good ( is then ZIi(P) = rt.1i(P·W,)/PI - wli . [f W" > 0, the excess demand for good

What is the relationship between gross substitution and Ihe w.ak axiom? Clearly, the \V A docs not imply the as property (the latter can be violated ev.n in quasilinear, one·consumer economies). The converse relationship is not as obvious. but it is nevertheless true that the GS property does not imply the WA. [n fact, Example 17.F.I. which viola led the \VA. could perfectly well satisfy as" There is. however. one connection that is important. The gross substitute property implies that If z(p) = 0 and ;(p') ¢' 0, then p,z(p') > O.

(17.F.3)

We shall not prove condition (17.FJ) here. For the case in which L = 2, you are asked for a proof in Exercise 17.F.7. To understand (17.F.3). note that if p is the price vector of an

45. It is worth mentioning that functions satisfying the GS property arise naturally in many economic contexts. For example. if A is an (L - I) x (L - I) input~output matrix and (.' E IR'.-'.

then (. - (I - A). satisfies the (w.ak) GS property as a function of a E R'.-' (see Appendix A of Chapter 5 for the interpretation of th.s. concepts). More generally, the equation system g(a) - a associated with the fixed-point problem [i.e., find a such that y(') = aJ of an ;naeas;nrl function r/: R~ - R~ [i. •.• g(.) ~ y(.') whenever a ~ a'J satisfies it (perhaps. again. in its weak version).

46. See also Grandmont (1992) for an interesting result wh.r. a Cobb-Douglas positive representalive consumer. and therefore GS excess demand, is derived from a requirement that at any given price, the choice behavior is widely dispersed (in a certain precise sense) across consumers. Grandmonl's is an example of a model in which the individual excess demand functions may not

Note that in these cases there is no homogeneity of degree zero or Walras' law~-conditions specific to general equilibrium applications-to complement the GS property. This is significant because exploration of the implications of the OS property without homogeneity of degree zero or Walras'

satisfy the gross substitute property but th. aggr.gat. function does. 47. See Mas-Colell (1991) and Exercis. 17.F.6 for further elaborations on this point. 48. Therefore, in view of Proposition 17.F.I. we know that in a constant returns economy the fulfillment of the GS property by the excess demand of the consumers does not imply the uniqueness

law.

of equilibrium.

as property. See

EQUILIBRIA

Proposition 17.F.3: An aggregate excess demand function z(') that satisfies the gross substitute property has at most one exchange equilibrium; that is, z(p) = 0 has at most one (normalized) solution.

gross substitute excess demand function.

tells us that 0 = zl(ii) - Z/(P) = zl(ii) - ZI(P') + Z/(P') - Z/(P), However, gross substitution implies that z/(ii) - z,(p') > 0 (change sequentially each price P; for k '" I to Pl' applying the GS property at each step), and so ZI(P') - Z/(P) < O. The differential version of gross substitution is clear enough: At every p, it must be that iJz.(p)/c1PI > 0 for k '" I; that is, the L x L matrix Dz(p) has positive off-diagonal entries. In addition, when z(·) is an aggregate excess demand function, homogeneity of degree zero implies that Dz(p)p = 0, and so c:z,(p)/c1p, < 0 for all I = I, ... , L: the diagonal entries of Dz(p) are all negative. If in these definitions the inequalities arc weak, one speaks of \\'(,lIk IJ'OSS

these conditions add substantially to the power of the

OF

III tile special case oj excllange economies if the gross substitute property holds for aggregate excess demand then equilibrium is unique.

Figure 17.F.2

'(pI

;::

Exercise 17.F.16 for an

1.

-

614

CHAPTER

17:

THE

POSITIVE

THEORY

OF

SECTION

EQUILIBRIUM

:,

'7.F:

UNIQUENESS

OF

price vector p for each consumer i, we have x, ;t,w, for all i. However, by the assumption of the proposition and the first welfare theorem, (w" ...• wc) is a Pareto optimal allocation and so we must have x, -,wc for all i. But then we can conclude that x, = w, for all i, because otherwise, by the strict convexity of preferences. the allocation Ox, + !w" ... , !x, + !w,) would be Pareto superior to (w" . ..• w,) . •

Il1dex Al1alysis and Ul1iqueness ( ... and NOl1ul1iqueness) =1 - -__ :(p)

Figure 17.F.3

The index theorem (Proposition 17.D.2) provides a device to test for uniqueness in any given model. The idea is that if merely from the general maintained assumptions of the model we can attach a definite sign to the determinant of the Jacobian matrix of the equilibrium equations at any solution point, then the equilibrium must be unique. After all, the index theorem implies that sign uniformity across equilibria is impossible if there is multiplicity. As a matter of fact, we could have proceeded by means of this index methodology for many of our previous uniqueness results. Take, for example, an exchange economy. In both the WA and the GS cases, whenever z(p) = 0, the matrix Dz(p) is necessarily negative semidefinite [see the small-type discussion of expression (17.F.I) and Proposition 17.F.4]. Moreover, if an equilibrium is regular (i.e., if rank Dz(p) = L - I), the negative scmidefiniteness of Dz(p) can be shown to imply that the index of the equilibrium is necessarily + I (see Exercise 17.F.II). Hence, we can conclude that in both the WA and GS cases, any regular economy must have a unique (normalized) equilibrium price vector. Although the index methodology provides a good research tool. it is often the case that, as here, uniqueness conditions lend themselves to direct proofs. It is a notable fact that some of the more subtle uses of index analysis are not to establish uniqueness but rather to establish nonuniqueness [the first usage of this type was made by Varian (1977)]. This is illustrated in Example 17.F.3.

The revealed preference property of gross substitution.

(e,change) equilibrium and p' is not, then, sinoe :(p); 0, we have p',z(p); 0, and therefore any nonequilibrium p' is revealed preferred to p. Henoe, the requirement in (17.F.3) that ".:( p') > 0 amounts to a restricted version of the WA asserting that no equilibrium price veclor p C'In be revealed preferred to a nonequilibrium prioe vector p'. Geometrically, it says that the range of the excess demand function, {:(p'): p' »O} c RL (Le., the offer curve), lies entirely above the hyperplane through the origin with normal vector p (see Figure 17.F.3). In par;dlcl to Proposition 17.F.2, condition (17.F.3) implies the convexity of the equilibrium price set of Ihe exchange economy, that is, of (pe R'++: z(p); O} c R" (in Exercise 17.F.B, you are asked 10 show this). Interestingly, condition (17.F.3) is satisfied not only in the WA and the GS cases but also in the no-trade case, to be reviewed shortly. In Ihe differentiable case, there is a parallel way to explore the connection between the WA and gross substitution. Let z(p) = O. The sufficient differential condition (17.F.I) for the WA tells us that dp'D:(p)dp < 0 for any dp not proportional to p. Suppose now that instead of the WA, we require that Dz(p) has the gross substitute sign pattern. Because z(p) = 0, we have p' D:(p) = 0 and Dz(p)p = 0 [recall (17.E.I) and (17.E.2)]. Using these two properties it can then be shown that again we obtain dp' Dz(p) dp < 0 for any dp not proportional to p (sec Section M.D of the Mathematical Appendix). Henoe, we can conclude that aC all exc/ulIIgc "'{lIjlibrjllll/ prke veccor, the GS property yields every local restriction implied by the WA. This is summarized in Proposition 12.F.4.

Example 17,F_3: Suppose we have two one-consumer countries. i = 1,2. Countries are symmetrically positioned relative to the home (H) and the foreign (F) good. To be specific, let each country have one unit of the home good as an endowment and none of the foreign good, and utility functions u,(xu;, xF/) = Xu; - X:, for -I < p < O. Merely from symmetry considerations, it follows that there is a symmetric equilibrium p = (1.1). But we may be interested in knowing whether there are asymmetric equilibria. One way to proceed is as follows: compute the index of the symmetric equilibrium; a sufficicnt (but not necessary) condition for the existence of an asymmetric equilibrium is that this index be negative (i.e., _1).49 If we carry out the computation for the present example (you are asked to do so in Exercise 17.F.I3), we see that the index is negative if at prices p = (I, I) the wealth effects in each country are so biased toward the home good that an increase in the price of the good of country I. say, actually increases the demand for this good in country I by more than it decreases the demand from country 2. •

Proposition 17.F.4: If z(') is an aggregate excess demand function, zIp) = 0, and Oz(p) has the gross substitute sign pattern, then we also have dp-Oz(p) dp < 0 whenever dp ,",0 is not proportional to p.

Uniijlleness as an Implication of Pareto Optimality We now present a result that is not of great significance in itself but that is nonetheless interesting because it highlights a uniqueness implication of Pareto optimality. For simplicity, we restrict ourselves again to an exchange economy (see Exercise 17.F.9 for a generalization allowing for production). Proposition 17.F.S: Suppose that the initial endowment allocation (w" .. . , wc) constitutes a Walrasian equilibrium allocation for an exchange economy with strictly convex and strongly monotone consumer preferences (Le., no-trade is an equilibrium). Then this is the unique equilibrium allocation.

49. In this, as typically in any example, the excess demand function fails to be differentiable at

prices at which demand just "hits" the boundary. Typically (we could say "generically"), these prioes will not be equilibrium prioes and the validity of the index theorem is not alTected by these

Proof: Let an allocation x = (XI' ... , xc) and price vector p constitute a Walrasian cquilibrium when consumers' endowments are (w" . .. , w,). Since w, is affordable at

nondilTerentiabilities.

L

EQUILIBRIA

615

616

CHAPTER

'7:

THE

POSITIVE

THEORY

OF

~ [ CliO N

EOUILIBRIUM

17.G Comparative Statics Analysis

Proof: Let Ihe firsl consumer have endowmenls wilh Ihe prescribed amounlS of Ihe firsl L - I commodilies, and give 10 Ihis consumer arbitrary preferences, with the single reslriclion Ihal 0"" t ,(p; ,;,,) be nonsingular (it suffices for Ihis thallhe demand function of ~onsumer I salis?es a slricl normalilY condilion; again see Exercise 17.G.I). Since 0 ..,1(;;; WI) = 0~,1,(;;; w,), expression (17.G.I) lells us Ihal we are looking for an addilional collection of L consumers such Ihal Ihe resulling (L + I)-consumer economy has z(p; ';',) = 0 and

z(p; q) = (ZI(P; q), ... , Z,._,(P; q)).

Here, q E IRN is a vector of N parameters influencing preferences or endowments (or both). Throughout, we normalize P,. = I. Suppose the value of the parameters is given initially by the vector ii and that p is an equilibrium price vector for ii; that is, z(p; ii) = O. We wish to analyze the effect of a shock in the exogenous parameters q on the endogenous variable p solving the system. A first difficulty for doing so is the possibility of multiplicity of equilibrium: the system of L - I equations in L - I unknowns i('; q) = 0 may have more than one solution for the relevant values of q, and thus we may need to decide which equilibrium to single out after a shock. If the change in the values of the parameters from ii is small, then a familiar approach to this problem is available. It consists of focusing on the local effects on p, that is, on the solutions that remain near p. Assuming the differentiability of i(p; q), we may determine those eITects by applying the implicit function theorem (see Section M.E of the Mathematical Appendix). Indeed, if the system z(·; ii) = 0 is regular at the solution p, that is, if the (L - I) x (L - I) matrix Dpz(p; ij) has rank L - l,so then for a neighborhood of (p; ij) we can express the equilibrium price vector as a function p(q) = (p,(q), ... , PL-,(q» whose (L - I) x N derivative matrix at ii is

ii)] - , D.i( p; ii).

(I7.G.2) NOlc Ihal Ihe (L - I) x (L - I) matrix defined in (17.G,2) is nonsingular. Thus, we have reduced our problem 10 Ihe following: can we find L consumers whose aggregale excess demand al p is -:,(;;; ,;',) and whose aggregate (L - I) x (L - I) matrix of price effects is A = - D,o,: ,(P;';', )8-' - O,Z,(;;; ';',)1 II follows from Proposition 17.E.2 Ihal the answer 10 Ihis queslion is "yes" (nole Ihallhe reslriclions Ihal Proposilion 17.E.2 imposes on Ihe L x L matrix A place no restriclion on Ihe malrix obtained by deleling one row and one column of A) . • Proposilion 17.G.1 shows that any firsl-order effecl is possible. As in Section 17.E (recall Figurc 17.E.3), il is also the case here thaI if there are prior reslriclions on initial endowments and if consumplion musl be nonnegalive, Ihen Ihere are again comparative statics restrictions of a global character. [See Brown and Matzkin (1993) for a recent investigation of this point]

(17.G.I)

There are a number of comparative static effects that, ideally. we would like to have and that seem economically intuitive: For example, that if the endowment of one good increases, then its equilibrium price decreases. Nevertheless, strong conditions are required for them to hold. By now this should not surprise us: We already know that wealth effects and/or the lack of sufficient substitutability can undermine intuitive comparative static effects. The latest instance we have seen of this occurring has been precisely Proposition 17.G.1. The analysis of uniqueness in Section 17.E may lead us to suspect that good comparative statics effects can hold if aggregate excess demand satisfies either weak-axiom-like conditions (recall Definition 17.F.I) or gross substitution properties (sec Definition 17.F.2). This is in fact so. We consider first the implications of a weak-axiom-like restriction on aggregate excess demand.

What can we say about the first-order eITects Dp(ij)? Expression (17.G.I) and Proposition 17.E.2 [which told us that the matrix of price effects Dpz(p; ij) is unrestricted when I 2: L] strongly suggest that, without further assumptions, the "anything goes" principle applies to the comparative statics of equilibrium in the same manner that in Section 17.E it applied to the closely related issue of the effects of price changes on excess demand. We now elaborate on this point in the context of a specific example. = (w,', ... , WL_I.') Let the list of parameters under consideration be the vector of initial endowments of the first consumer for the first L - I commodities. All of the remaining endowments are kept fixed. As before we assume that i('; dJ,) = 0 is regular at the solution p. It can be shown (see Exercise 17.G.I) that if the demand function of the first consumer satisfies a strict normality condition, then rank Dp(6J,) = L - I, where p(.) is the locally defined solution function with p(J,,) = p. Proposition 17.G.I tells us that if there are enough consumers then this is all that we can say.

w,

50. In a slighl abuse of nOlalion. we leI D,z(i;; ii) stand for Ihe matrix oblained from D,z(p; Ii) by deleling Ihe lasl row and column.

COM PAR A T I v EST A TIC S

Proposition 17.G.1: Given any price vector p, endowments for the first consumer of the first L - 1 commodities J" = (w", ... , wL -",), and a (L - I) x (L - 1) nonsingular matrix 8, there is an exchange economy formed by L + 1 consumers in which the first consumer has the prescribed endowments of the first L - 1 commodities, i(ft; J,,) = 0, i(', J,,) = 0 is regular at p and Dp(J,,) = 8.

Comparative statics is the analytical methodology that concerns itself with the study of how the equilibria ofa system are affected by changes (often described as "shocks") in various environmental parameters. In this section, we examine the comparative static properties of Walrasian equilibria. To be concrete, we consider an exchange economy formalized by a system of aggregate excess demand equations for the first L - I commodities:

Dp@ = - [Dpi( p;

11. 0:

Proposition 17.G.2: Suppose that i(p; ij) = 0, where i(') is differentiable. If Dqi(p; ij) is negative definite," then (Dqi(p; ij) dq)'(Dp(ij) dq) 2: 0 for any dq,

i

1

(17.G.3)

51. This condition is independent or which particular commodity has been labeled as L (see

Section M.D of Ihe Malhemalical Appendix).

.

A HAL Y SIS

617

618

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EOUILIBRIUM

SECTION

COMPARATIVE

STATICS

first L - I goods (relative to the price of the Lth good) decrease. 53 In particular. suppose again that consumer I's initial endowment of some good decreases. By labelling commodities appropriately. we can let this good be commodity LUnder the assumption of normal demand for consumer I. a decrease in WLl. at the fixed price vector P. will decrease the excess demand for the first L - I goods. Therefore. the prices of the first L - I goods decrease and so we now reach the conclusion that we could not obtain by means of Proposition 17.G.2: if the endowments of a single good decrease then its price (relative to the price of any other good) increases. This suggests. incidentally. that the assumptions of Proposition 17.G.3 are strictly stronger than those of Proposition 17.G.2. Indeed. as we saw in Proposition 17.F.4. if z(p; ij) = 0 and the L x L matrix D,z(p; ij) satisfies the gross substitute property. then dp'D,z(p; ij) dp < 0 whenever dp '" 0 is not proportional to p. In particular. by letting drc = 0 we have that the matrix D,i(p; q) is negative definite.

Proof: The inverse of a negative definite matrix is negative definite. Therefore [D,i(p; q)] -, is negative definite (see Section M.D of the Mathematical Appendix). Hence. by (17.G.I) we have (D,z(p; ij) dq) . (Dp(ij) dq)

17.G:

= - D.i(p; ij) dq· [D,z(p; ijl] -, D.i(p; q) dq
which is precisely (17.GJ). _ The weak axiom implies the negative semidefiniteness of D,i(P; q) whenever 0 [see expression (17.F.I) and the remark following it]. Therefore. the assumption of Proposition 17.G.2 amounts to a small strengthening of this implication. Its conclusion says that for any infinitesimal shock dq in q. the induced shock to excess demand at prices fixed at P. D.i(p; q) dq. and the induced shock in equilibrium prices. D.p(ij) dq. move "in the same direction" (more precisely. as vectors in R / -' they form an acute angle). For example. a shock that at fixed prices alTects only the aggregate excess demand of the first good. 52 say by decreasing it. will necessarily decrease the equilibrium price of this good. Note that this docs 110/ say that if (I)" increases then the equilibrium priee of good I decreases. Under an assumption of normal demand. this change in w" does indeed decrease the excess demand for good I at ;; but it also alTects the excess demand for all other goods (sec Exercise 17.G.2). We next consider in Proposition 17.G.3 the implications of gross substitution (or. more precisely. of gross substitution holding locally at (p; q)).

:( p; ti) =

Expression (17.G.I) allows us to explicitly compute the effects of an infinitesimal shock. In fact. it also offers a practical computational method to estimate the local effects of small (but pcrh"ps not infinitesimal) shocks. Suppose that the value of the vector of parameters after the shock is ci and, for IE [0, I]. consider a continuous function i(' ,I) that, as I ranges from I ; 0 to I = I. distorts :(.; ti) into i('; iiI. An example of such a function, called a homotopy. is :('./) = (I - I):(';q)

+ If(·;ii).

Denote the solution set by E = {(I, pI: f(p, t) = O}. Then we may attempt to determine p(q) by following a segment in the solution set that starts at (0, p)." If ij is close to q. and the initial situation (1 is regular, then we are in the simple case of Figure 17.G.I(a): there is a unique segment that connects (0. p) to some (I. p)." Naturally. we then put P(ii) - p. If q is not close to ij but nevertheless i(' ,I) is a regular excess demand function for every I [this will be the case if. for example, z(· ./) satisfies. for every I. any of the uniqueness conditions covered in Section 17.F], then this procedure will still succeed in going from I = 0 to I = I and. therefore, in determining an equilibrium for ii.'· Unfortunately. if the shock is large, we can easily find ourselves in situations such as Figures 17.G.I(b) and 17.G.I(c), where at some t' the economy i(·. n is not regular and at (I'. Po') there is no natural

Proposition 17.G.3: Suppose that i(;;; ij) = 0, where i(·; .) is differentiable. If the L x L matrix Dpz(;;; ij) has negative diagonal entries and positive off-diagonal entries, then [Dpi(;;; ij»)" 1 has all its entries negative. Proof: Because of the homogeneity of degree zero of excess demand (recall Exercise 17.E.I). we have D,z(p;q)p=O. and so D,z(p;q)p«O. where p=(p, ..... p.-,). Denote by I the (L - I) x (L - I) identity matrix and take an r > 0 large enough for the matrix A = (l/r)D,z(p; q) + I to have all its entries positive. Then D,i(p; q) = -r[l- A]. and therefore D,l(p; q)p« 0 yields (I - A)p» 0; that is. the positive matrix A. viewed formally as an input-output matrix. is productive (see Appendix A of Chapter 5; the fact that the diagonal entries of A are not zero is inessential). Hence, as we showed in the proof of Proposition 5.AA.I. the matrix [I - A)"' exists and has all its entries positive. From [D,i(p; q)J"' = -(I/r)[1 - A)"' we have our conclusion. _

53. This conclusion holds for nonlocal shocks as well. To see this let Dz(p; q) have the gross subs,itu'e sign pattern throughout its domain and suppose that l(p; ij)« l(p: ij) for all p. For IE [0.1]. define zIp; I) = Ii(p; ii) + (I - I)i(p; ij). Denote by P(/) the solution to l(p; I) = O. Note 'hat D.i( P(/); I) dl = 1(P(/); ii) - 2(P(/); ij)« 0 for all I and therefore. by Proposition 17.G.3. Dp(/) dl « 0 for all I. But then. for any ( = I, .... L - I. we have

PI (q-) - PI (-) q = f.'[ilPI(t)]d dI I < 0 . 0

[t follows from Proposition 17.GJ and expression (17.G.I) that, given gross subSlitution. if D,z(p; q) dq «0. that is. if the excess demand for all of the first L - I goods decreases as a consequence of the shock (and therefore the excess demand for the Llh good increases), then Dp(q) dq «0. That is. the equilibrium prices of the

In Exercise 17.G.3 you can find a more direct approach to the global theory. See also Milgrom and Shannon (1994) for much more on the latter approach. 54. In practice. "following" a segment involves the application of appropriate numerical techniques; see Garcia-Zangwill (1981), Kehoe (1991). and references therein. 55. Moreover, if the shock is sufficiently small. the p so obtained is independent of the particular homotopy used. 56. However. if there are multiple equilibria at ii. then which equilibrium we find may now depend on the homotopy.

52. What this means is 'hat the excess demand of good 2 to L - I is not changed. By Walras' law. the excess demand of good L must change.

L

ANALYSIS

619

620

CHAPTER

17:

THE

POSITIVE

THEORY

o o (b)

(a)

Figure 17.G.l

OF

EOUILIBRIUM

however, that those are just two examples. Indeed, one of the difficulties in this area is the plethora of plausible disequilibrium models. Although there is a single way to be in equilibrium, there are many different ways to be in disequilibrium.

Price Ttitollllemenl

,.

We consider an exchange economy formalized by means of an excess demand function

o

z(·). Suppose that we have an initial p that is not an equilibrium price vector, so

(e)

that z(p) '" O. For example, the economy may have undergone a shock and p may be the pres hock equilibrium price vector. Then the demand-and-supply principle suggests that prices will adjust upward for goods in excess demand and downward for those in excess supply. This is what was proposed by Walras; in a ditTerential equation version put forward by Samuelson (1947), it takes the specific form

Comparative statics in the large: the general case.

conlinuation of the palh as I increases." To obtain an equilibrium i> for ij there is then no real alternative but to appeal to general algorithms for the solution of the system of equations :(.: ij) = O. It is a sobering thought that which solution we come up with at ij may be dictated more by our numerical technology than by our initial position (p; ij). This is most unsatisfactory, 'lnd it is a manifestation of a serious shortcoming-the lack of a theory of equilibrium

dp,

- = c,z,(p) de

for every (,

(17.H.I)

where dp,/til is the rate of change of the price for the fth good and c, > 0 is a constant atTecting the speed of adjustment. Simple as (17.H.I) is, its interpretation is fraught with difficulties. Which economic agent is in charge of prices'! For that malter, why must the "law of one price" hold out of equilibrium (i.e., why must identical goods have identical prices out of equilibrium)"! What sort of time does "I" represent? It cannot possibly be realtime hecause, as the model stands, a disequilibrium p is not compatible with feasibility (i.e., not all consumption plans can be simultaneously realized). Perhaps the most sensible answer to all these questions is that (17.H.I) is best thought of not as modeling the actual evolution of a demand-and-supply driven economy, but rather as a tentative trial-and-error process taking place in fictional time and run by an abstract market agent bent on finding the equilibrium level of prices (or, more modestly, bent on restoring equilibrium after a disturbance).'· The hope is that, in spite of its idealized nature, the analysis of (l7.H.I) will provide further insights into the properties of equilibria. Even perhaps some help in distinguishing good from poorly behaved equilibria. The analysis is at its most suggestive in the two-commodity case. For this case, Figure 17.H.l represents the excess demand of the first good as a function of the relative price pdp,. The actual dynamic trajectory of relative prices depends both on the initial levels of absolute prices and on the differential price changes prescribed by (17.H.I).60 But note that, whatever the initial levels of absolute prices, p,(I)/p,(t) increases at e if and only if z,(p,(t)!p,(I), I) > O. In Figure I7.H.I we see the following two features of the adjustment equations (l7.H.I).

selection.

17.H Tatonnement Stability We have, so far, carried out an extensive analysis of equilibrium equations. A characteristic feature that distinguishes economics from other scientific fields is that, for us, the equations of equilibrium constitute the center of our discipline. Other sciences, such as physics or even ecology, put comparatively more emphasis on the determination of dynamic laws of change. In contrast, up to now, we have hardly mentioned dynamics. The reason, informally speaking, is that economists are good (or so we hope) at recognizing a state of equilibrium but are poor at predicting precisely how an economy in disequilibrium will evolve. Certainly there are intuitive dynamic principles: if demand is larger than supply then the price will increase, if price is larger than marginal cost then production will expand, if industry profits are positive and there are no barriers to entry, then new firms will enter, and so on. The difficulty is in translating these informal principles into precise dynamic laws. 58 The most famous attempt at this translation was made by Walras (1874), and the modern version of his ideas have come to be known as the theory of ealonnemenl slabililY. In this section, we review two tatonnement-style models, one of pure price adjustment and the other of pure quantity adjustment. We should emphasize,

(a) Call an equilibrium (p" p,) locally slable if, whenever the initial price vector is sufficiently close to it, the dynamic trajectory causes relative prices to converge to the equilibrium relative prices p,/p, (the equilibrium is locally tolally unstable if any

57. Note that by reversing the direction of change of I we can continue to move along the segments in these two figures (this is actually quite a general facl). If p is the only solution at I = 0, as in 17.G.I(b), then the segment necessarily ends with a (I, p). Thus, in some sense we have succeeded in finding an equilibrium for ii that is associated with our initial p. But the association is very weak: it may depend on the particular homotopy and it requires the parameter-reversal procedure. If, as in Figure I7.G.I(c~ ;Hs not the only equilibrium at I = 0, then the procedure may simply not work: the segment that starts at (0, p) goes back to I = O. 58. Refer to Hahn (1982) for a general review.

59. This is, in essence, the idea of Walras (li'onnement means "groping" in French), who took inspiration from the functioning of the auctioneer·directed markets of the Paris stock exchange.

The idea was made completely explicit by Barone (t908) and by Lange (1938), who went so far as to propose the tatonnement procedure as an actual computing device for a centrally planned economy.

L

622

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

SECTION

=,

17.H:

TATONNEMENT

STABILITY

623

L=3 I: Locally Stable. Index +1 2: Locally TOlally Unslable. Index +1 3: Saddle. Index -I p,/p,

Flgur.

=, (P,/Pl. I)

17.H.l

Tatonnement

Irajeclories for L = 2.

disturbance leads the relative prices to diverge from pdp,). Then a (regular) equilibrium PI/P, is locally stable or locally totally unstable according to tlae sign of tlae slope of excess demand at Ihe equilibrium, that is, according to the index of the equilibrium (recall Definition 17.0.2). If excess demand slopes downward at p,/p, (as in Figure 17.H.l), then a slight displacement of PI/P, above pdp, will generate excess supply for good I (and excess demand for good 2), and therefore the relative price will move back toward the equilibrium level p,/p,. The effect is the reverse if excess demand slopes upward at pdp,. (b) There is sySlem stability, that is, for any initial position (PI(O), p,(O)), tlae

Flgur. 17.H.2 An example of tatonnement

trajeclories for L = 3.

price of a good goes to zero the excess demand for the good becomes positive (thus, in particular, the trajectories point inward near the boundary). However, properties (a) and (b) are both violated: There are (regular) equilibria that are neither locally stable nor locally totally unstable (they are "saddle points," such as the equilibrium labeled 3 in the figure), and from some initial positions prices may not converge to any equilibrium."' In a more positive spirit, we now argue that for the cases where we have succeeded in proving the uniqueness of Walrasian equilibrium, we are also able to establish the convergence of any price trajectory to this equilibrium (this property is called global stability).62 The next proposition covers, in particular, the weak axiom, the gross substitute, and the no-trade cases studied in Section 17.F.63 These three cases have in common that they satisfy the weak axiom when we restrict ourselves to comparisons between equilibrium and nonequilibrium prices [see the discussion of condition (17.F.3) in Section 17.F]. That is, for the unique (normalized) equilibrium price vector p* arising in these cases we have: "If z(p*) = 0 then pO. z(p) > 0 for any p not proportional to p*."

corresponding trajectory of relalive prices PI(t)/P,(t) converges to some equilibrium arbitrarily closely as t -+ ct:). For regular, two-commodity, economies, properties (a) and (b) give a complete picture of the dynamics. It is very satisfactory picture that accounts for the persistency of tatonnement stability analysis: a theory yielding properties (a) and (b) must be saying something with economic content. Unfortunately, as soon as L> 2 neither the local conclusions (a) nor the global conclusions (b) of the two-commodity case generalize. This should not surprise us, since the price dynamics in (17.H.l) are entirely driven by the excess demand function, and we know (Propositions 17.E.2 and 17.E.3) that the latter is not restricted in any way (beyond the boundary conditions). Consider an example for L = 3 and c, = c, = c, = I. In Figure I7.H.2 we represent the normalized set of prices S = {p » 0: (p,)' + (p,)' + (P3)' = I}. This normalization has the virtue that, for any excess demand function z(p), the dynamic flow p(/) generated by the differential equation dptfdt = z(p), ( = 1,2,3, remains in S [i.e., if P(O) E S then p(t) E S for all t]. This is a consequence of Walras' law:

Proposition 17.H.l: Suppose that z(p*) = 0 and p*,z(p) > 0 for every p not proportional to p*. Then the relative prices of any solution trajectory of the differential equation (17.H.l) converge to the relative prices of pO.

d(p,(t)' + p,(t)' + pit)') dt = 2p,(t)z,(p(t» + 2p,(t)z,(p(t)) + 2p,(t)Z3(P(t» = O.

Proof: Consider the (Euclidean) distance function f(p) = Lt (l/c{)(Pt - pi)'. For any trajectory p(t) let us then focus on the distance f(p(t)} at points I along the trajectory. We have

Thus, the dynamics of p can be represented by trajectories in S, the direction vector of the trajectory at any P(/) being the direction of the excess demand vector z( p(t». We conclude, therefore, that the only restrictions on the trajectories imposed by the general theory are those derived from the boundary behavior of excess demand. In Figure 17.H.2 we represent a possible field of trajectories. In the figure, when the

61. We should warn againsl deriving any comfort when prices converge 10 a limil cycle. Recall that this price tatonnement is not happening in real time. The dynamic analysis has a hope of telling us something significant only ir it converges. 62. Warning: uniqueness by itself does nol imply slabilily-excepl ror L = 2. You should Iry

60. NOle Ihal allhough Ihe change in Pf all prescribed by (I7.H.I) depends only on Ihe relative prices p,/p, for { = I, 2, Ihe change in Ihe price ralio p,/p, al I depends bolh on Ihe curren I price

10

ratio and on the current absolute levels of PI and Pl"

1

draw a counlerexample in Ihe slyle of Figure 17.H.2. 63. For a proof specific 10 Ihe gross Subslilule case. see Exercise 17.H.1.

624

C HAP T E R

1 7:

THE

P 0 5 I T I VET H E 0 R Y

0 F

E 0 U IL

I8

~ E. C T I

R I lJ ;.,

<J

h

• 1 • H:

TAT 0 NNE MEN T

S T It. B I l i T Y

6"::~

succeeds in restoring equilibrium afler a small dislurbance. Thus we see the contrast: for lalonnemenl stability. we impose few informalional reslriclions on the adjustment process [to determine Ihe change in p we only need 10 know f(p); in particular. no knowledge of Ihe derivalives of 1(') is required]. bUI convergence is guaranleed only in special circumslances. For Ihe NeWlon method. local convergence always oblains. bul to delermine Ihe direclions of price change al any p we need 10 know alllhe excess demands f(p) and all the price effecls D:(p). See Smale (1976) and Saari and Simon (1978) for classic conlributions to Ihis Iype of

- p" z(p(r» !> 0,

Newton price dynamics.

where the last inequality is strict if and only if p(r) is not proportional to p'. We conclude that the price vector p(r) monotonically approaches the price vector p' [in fact, since the same argument applies to (1.p', p(r) must be monotonically approaching any (1.p']. This does not mean that p(r) reaches a vicinity of p'. Typically it will not: the rate of approach of p(r) to p' will go to zero before p(r) gets near p'. But the rate of approach can go to zero only if p(r) becomes nearly proportional to p' as r ..... 00, in which case the relative prices do converge.· 4 •

Qlllllltity Tatollllemelll In Ihe analysis so far. prices could be out of equilibrium but quantities. that is to say Ihe amounls demanded and supplied. are always at their equilibrium (i.e.• utility and profit·maximizing) values. We now briefly consider a model in which quantities rather Ihan prices may be in disequilibrium.· 7 This is besl done in a production context. To be very concrele. suppose that there is a single production set y.6. At any moment of lime. we assume that there is given a single. fixed production vector Y E Y. Prices. however. are always in equilibrium in the sense that the general equilibrium syslem of the economy. conditional on y. generales some equilibrium price system p 0, or. if the relevant inverse exists,

as {liven:

Definillon 17.H.1: We say that the differentiable trajectory y(t) E Y is admissible if p(y(t))·(dy(t)/dt) ~ 0 for every t. with equality only if y(t) is profit maximizing for p(y(t)) (in which case we could say that we are at a long·run equilibrium). A difference belween the price and the quantity tatonnement approaches that adds appeal to the second is Ihal feasibility is now insured at any I and that, as a result. we can interpret the dynamics as happening in real time. 69 •7o Will an admissible Irajectory necessarily take us to long. run equilibrium? We cannot really explore this matter here in any detail. As usual, the answer is "only

~·I:·

I

~.

~

Ii

'!!!. =

-;.[Di(pJr'i(p)

(17.H.2)

67. We could also look al Ihe general case where both could be in disequilibrium; sec. for example. Mas·Colell (1986). 68. There is no difficulty in considering several. Also. Y can be interpreted as an individual or

dt

This adjuslmenl equalion is known as Newlon's method and is a slandard lechnique of numerical analysis. If Df(p') is nonsingular. so Ihal [Dt(p·)r' exisls. then (l7.H.2) always

as an aggregate production set.

69. Nonelheless. it is importanl 10 realize thaI. even then. Ihis is not a fully dynamic model: The optimization problems of the consumers remain static and free of expectational feedbacks and firms follow naive. short· run rules of adjustment (in a more positive spirit one might call this adap,ive. rather than naive, behavior). For an extensive analysis of market adjustment procedures in real

64. Conlinuous rcal·valued functions Ihallake decreasing values along any dynamic trajectory and the value zero only at stationary points are known as Lyapunov functions.

time. see Fisher (1983). 70. The quantity dynamics of Definition 17.H.1 are reminiscenl of Marshall (1920) and arc

65. How could we prelend to know much about speeds of adjustments? 66. NOle Ihat Ihis fils nicely wilh Proposilion 17.H.1 because Ihe revealed.preference·like property poslulaled Ihere implies the negalive (.. mijdefiniteness of Dt(p) al Ihe equilibrium price vector p•.

ortcn referred to as Marshallian dynamics, especially in a partial equilibrium context. ]n contrast. the price dynamics are frequently called Walrasian dynamics.

l

.....

626

CHAPTER

17:

THE

POSITIVE

THEORY

Of

EQUILIBRIUM

SEC T ION

Proof: Consider u(y(l) + w) where u(·) and ware respectively the utility function and the endowments of the consumer. The unique equilibrium production vector is the single production vector y that. maximizes u(y + w) on Y; recall the oneconsumer, one-firm example of Section IS.C. The argument is much simpler if we assume that u(·) is differentiable. We claim that utility must then be increasing along any admissible trajectory. Indeed,

+ w)

V ( () uyr

+w

) dy(l) '-d-I-

dyer) = !I(I)p(y(r»'d/

NON CON V E X I TIE S

627

As we have mentioned repeatedly, especially in Chapters 10 and 12, a central justification of the price-taking hypothesis is the assumption that every economic agent constitutes an insignificant part of the whole economy. Literally speaking, however, this cannot be satisfied in the model of this chapter because, formally, we allow for no more than a finite number I of consumers. (This is particularly true of our examples, where we typically have I = 2.) A straightforward reinterpretation is possible, however. We illustrate it for the case of a pure exchange economy. Suppose we consider economies whose consumers have characteristics (preferences and endowments) that fall into I given types, with r consumers of each type (a generalization to unequal numbers per type is possible; see Exercise 17.1.1). That is, the set of consumers is formed by r replicas of a basic reference set of consumers. Furthermore, an allocation denoted by (Xl' ... ' x,) is understood now to specify that each consumer of type j consumes XI (so the totality of consumers of type j consume rx i ). We observe then that the analysis and results presented up to this point are not modified by this reinterpretation; they simply do not depend in any way on the parameter r. In this way, we can conclude informally that the theory so far covers cases with an arbitrarily large number, even an infinity, of consumers; in particular, we see that any equilibrium of our earlier model is an equilibrium of the r-replica economy (for any integer r ~ I). There is, however, an important qualification. The ability to interpret the model and results in a manner that is fully independent of the number of consumers depends crucially on the convexity assumption on preferences. Without this assumption, it is not justified to neglect allocations that assign different consumption bundles to different consumers of the same type. Consider, for example, the Edgeworth box of Figure 17.1.1. If there is only one consumer of each type, then no equilibrium exists; but if we have two of each type, then there is an equilibrium. To see this, give W2 to the convex consumers, let one of the two nonconvex consumers receive the bundle XI' and let the other receive the different bundle Thus, in the nonconvex case, the

Proposition 17.H.2: If there is a single strictly convex consumer, then any admissible trajectory converges to the (unique) equilibrium.

--d-I-- =

L A R GEE CON 0 M I E SAN D

17.I Large Economies and Nonconvexities

under special circumstances." A limited, but important example (it covers the shortrun/long-run model of Section IO.F) is described in Proposition 17.H.2.

du(y(l)

1 7 • I:

> 0,

with equality only at equilibrium. Here we have used the fact that at a short-run (interior) equilibrium, the price vector p(y(I» weighted by the marginal utility of wealth !I(t) must be equal to the vector of marginal utilities of the consumer. Now, since utility is increasing, we must necessarily reach the production vector y at which utility is maximized in the feasible production set (i.e., the equilibrium). This is illustrated in Figure 17.H.3. (We are sidestepping minor technicalities: to proceed completely rigorously, we should argue that the dynamics cannot be so sluggish that we never reach the equilibrium. To do so we would need, strictly speaking, to strengthen slightly the concept of an admissible trajectory). •

x;.

Note that the single consumer of Proposition 17.H.2 could be a (positive) representative consumer standing for a population of consumers. Figure 17.1.1

Good 2 1,(I)p(r(l))

= Vu(r(t) + w)

Figure 17.H.3

Equilibrium with

An example of

nonconvex preferences in economies of changing size.

quantity tatonnemcnt.

Indifference Curves of the (Representative) } Consumer

o,L---------\---t-----"-+-__

y" R~+ Good I i

1

THE

POSIT .... £

THEORY

OF

EQUILIBRIUM

SEC T ION

I r

I = r

(the sum has r terms)

+ ... + Z,,: =" E z,(p), ... , z"

A Ii

t:.

u

E C 0 HOM i E SA'" 0

,,0,.. l.. V

J"oj

~ 1. ), I TIE S

IJL:;;1

In the previous reasoning, the convexification or aggregate excess demand, with its existence implication, depends on our ability to prescribe very carefully which of several indifferent consumptions each consumer has to choose. Only in this way can we make sure that the "ggreg"tc consumption will be precisely right. Whatever we may think about the possible processes that may lead consumers to select among indifferent optimal choices in the right proportions, there can be little doubt that it would be better if we did not have to worry about this; that is, ir, given any price, practically every consumer had a single optimal choice. It is therefore or interest to point out that, while not a necessity, this is a most plausible occurrence if the number or consumers is large. Indeed, if the dist,ihution of individual prefere/lces ;s di.'ipas('d across the populCltion (so that, in particular. no two consumers arc exactly identical B), r!Jeli ('f('1f if til I,there need not be any inclusion relationship between E(,") and E(,') (except ir ," = m,' ror some integer m > I, in which case E(,') c E(,")). 72. See Starr (t969) ror a classic contribution to this topic. 73. See the comment after the proof of Proposition t 7.C.2 regarding demand correspondences, and also Exercise t 7.C.1.

l

630

CHAPTER

11:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

APPENDIX

We comment briefly on economies with production. Suppose that the consumption side of the economy is generated, as before, as the r-replica of a basic reference set of (possibly nonconvex) consumers. There are also J production sets l). Each lJ is closed, contains the origin, and satisfies free disposal (these are all standard assumptions). In addition, we assume that there is an upper bound (a capacity bound perhaps) on every lJ; that is, there is a number s such that )'/J ~ s for all ( and 11 e lJ. The production sets may be nonconvex. It is then possible to argue that the economy will possess a near equilibrium if r is large relative to the bound s (i.e., if the size of the consumption side of the economy is large relative to the maximal size of a single firm). On the average, the production side of the economy is also being convexified, so to speak (see the small-type discussion of Section S.E for a related point)." Note that the bounded ness property of the production sets is important. Suppose, for example, that every firm has the technology represented in Figure IS.C.3. Then no matter how many consumers there are, the potential profits of every firm are infinite (as long as p, > 0). Thus, there is no reasonable sense in which a near equilibrium exists. For the averaging·out effect to work the nonconvexity in production has to be of bounded size (see Exercise 17.1.2).

A:

CHARACTERIZING

EQUILIBRIUM

THROUGH

WELFARE

EQUATIONS

",

o,~--------------~~------~~~--

I

i

I

P(,)'(w, - "(s)) = g,(s) p,(s) . p,(s)

proportional to SEA (see Exercise I7.AA.I). In words, SEA stands for the values of utility distribution parameters and the determined allocation distributes "welfare" in accordance with the "shares" s = (s" ... ,sIlo Figure I7,AA.1 illustrates the construction. An arbitrary SEA will typically not correspond to an equilibrium. How can we recognize those sEA that do? To answer this question we can resort to the second welfare theorem. From Propositions 16.D.1 (and the discussion in small type following Proposition 16.D.3), we know that, under our assumptions, associated with x(,,) there is a price vector pes) E RL that supports the allocation in the sense that, for every i, x; >-; x,(s) implies pes)' x; > p(s) , x;(s). Therefore, (x(s'), p(s'» constitutes a Walrasian equilibrium if and only if S· E A solves the system of equations

APPENDIX A: CHARACTERIZING EQUILIBRIUM THROUGH WElFARE EQUATIONS

We have seen, beginning in Section 17.B, that if our economy satisfies sufficiently nicc properties (e.g., strict convexity of preferences) then we can resort, for the purposes of the analysis, to formalizing our theory by means of highly reduced systems of equilibrium equations. In the text of this chapter, we have focused on excess demand equations, But this is not the only possibility. In this appendix, we briefly illustrate a second approach that builds on the welfare properties of equilibria. We again concentrate on a pure exchange economy in which each consumer i = I, ... ,J has the consumption set R~ and continuous, strongly monotone, and strictly convex preferences. We also assume that WI ~ 0 for all i and 1:, w, » o. We know from Chapter 16 that a Walrasian equilibrium of this economy is a Pareto optimum (Proposition 16.C.l). Therefore, to identify an equilibrium, we can as well restrict ourselves to Pareto optimal allocations. To this effect, suppose we fix continuous utility functions u,(·) for the J consumers with u,(O) = O. Then to every vector s = (s I' ... , Sl) in the simplex A = {s' E R~ : 1:, s; = I} we can associate a unique Pareto optimal allocation xes) E R~I such that (UI(XI(S», ... , UI(XI(S))) is

g;(S')

= p(s')·[w; -

x,(s')]

=0

for every i = I, ... , J.

(I7.AA.I)

The Edgeworth box example of Figure 17.AA.2 explains the point that we are currently making. This Pareto-based equation system was first put forward by Negishi (1960), and was the approach taken by Arrow and Hahn (1971) in their proof of existence of equilibrium. It can be quite useful when the number of consumers (say, the number of countries in an international trade model) is small relative to the number of commodities. In contrast, if the number of consumers is large relative to the number of commodities, then an approach via excess demand functions will be superior. A limitation of the Negishi approach is that it is very dependent on the fact that an equilibrium must be a Pareto optimum. The excess demand approach is more easily adaptable to situations where this is not so (for example, because of tax distortions; see Exercise 17.C.3).7.

77. Observe that the average is with respect to , (the size of the economy in terms of the number of consumers). not with respect to J. Ir. as , increases. J is made to vary and is kept in some approximate fix.ed proportion with r, then rrom the qualitative point or view it does not matter how we measure size (this is a possible way to interpret, in the current context, the discussion or Section 5.E). But for the validity of the convexifying etrecl there is no need 10 vary J with r. The number J may be kept fixed and, thus, J could well be small relative to r (in which case the "averaged" economy is praclically one of pure exchange) or it could be large; il could even be Ihat J = co. The last case corresponds to a model with free entry, where the equilibrium-or the near equilibrium-

determines. endogenously, Ihe set of active firms. Typically, with free entry the sel of the aClive firms increases as the number of consumers, measured by r, grows (this point has also been discussed

78. The syslems of equations (17.B.2) and (17.AA.I) can be formally conlrasted as follows. In both of Ihem. at any poinl of Ihe domain of the equal ions, consumers and firms salisfy the utility maximizalion conditions for some prices and distribution of weallh. In (l7.B.2) this dislribution of weallh is always Ihe one induced by the inilial endowments, bUI feasibility (i.e., the equalilY of demand and supply) is insured only at the solulion. In (l7.AA.I) it is the other way around: feasibililY is always satisfied, but Ihe agreement of the wealth distribution with that induced by Ihe

in Section IO.F in a partial equilibrium context; there is not much more to add here).

initial endowments is insured only at the solution.

1

(ten) Construction of the welfare-theoretic equation system: first step.

Figure 17.AA.l

Figur. 17.AA.2 (rig hi)

Construction of the welfare-theoretic equation syslem: second step.

631

632

A P PEN 0 I X

Definition 17.BB.3: An allocation (x', y') and a price system P disposal quasiequilibrium if

if Xi

then (x·, p) cannot be a Walrasian quasiequilibrium because consuming nothing costs zero and is

preferred by the first consumer to any other consumption. 8!. Because lj is convex and closed, - R~ C lj implies lj -

R~ c lj (Exercise 5.8.5). X, for every i. With this assumption.

Proposition 17.BB.2 yields the existence of a true equilibrium. not just of a quasiequilibrium. Wj ~ .i, can be interpreted (keeping in mind the possibility of free disposal) simply as saying that consumer; could survive Economically, however, the latter assumption is considerably stronger:

without entering the markets of the economy, while the market a strictly positive amount of every good.

Wj

»X, says thac the consumer can supply to

">-; Xi

then p'x; ~ P'w;

+ L 0i/P' Y,'. /

(iii') Li xi ~ L; W;

have p »0. By profit maximization (using the free-disposal technology) and the possibility of inaction, we have p'(x1 + x! - W, - w,) ~ O. Since p'x! $ p'W" this yields p'x1 ~ p'W, > O. But

Wi»

0 constitute a free-

(i) for every i, P'Y, ~ P'Y,' for all Y,E If· (ii') For every i, P'x!' ~p'w;+ LjOijP'Y,', and

79. Recall, however, the important qualification of Section 17.1, and see also the discussion at the end of this appendix. 80. In Figure 17.8B.2, the second consumer has conventional strongly monotone preferences; but for the first consumer both commodities are bads and. thereCore. he is satiated at the origin. Also (I), »0 and w 2 » O. Suppose that x· = (xT. xf) and price vector p#-O constitute a Walrasian quasiequilibrium. Because the preferences of the second consumer 3fC strongly monotone, we must

82. A stronger condition would require that

~

I

I

1

+ L/ Y,'

and p' (Li xi - Li Wi - L/ lj') = O.

Thus. all we have done is replace in Definition 17.BB.I of a quasiequilibrium the exact feasibility condition 'Ti xi = L Wi + Lj yt" by (iii') above. That is. we allow the excess supply of some goods provided that they are free. In Exercise 17.BB.4 you arc asked to show that if one production set. say 1I. satisfies the free-disposal property and if (xf' .... x7. yf ....• yr. p) is a free-disposal quasiequilibrium. then there is l'* ~ yf such that (xf" ... x7. l,·. y! ..... yJ. p) is a Walrasian quasiequilibrium. Therefore. to establish Proposition 17.B8.2, it is enough for us to show that a free-diposal quasiequilibrium exists. We proceed to formalize the free-disposal quasiequilibrium notion as a kind of noncooperative equilibrium for a certain game among I + J + I players. The I and J players are the consumers and the firms. respectively. and their strategies are demand-supply vectors. The extra player is a fictitious market agent (a "grand coordinator") having as his strategy the prices of the L different goods. Since the set A of feasible allocations is bounded, there is r > 0 such that whenever (xl •. ··.x,.y, •...• yJ)E A we have Ix,d < rand IY'jl < r for all i,j, and t. Because we need to have compactness of strategy sets to establish existence, we begin by

635

636

CHAPTER

17:

THE

POSITIVE

THEORY

OF

APPENDIX

EOUILIBRIUM

B:

GENERAL

APPR"" .. "

Tu

IHE

E.XISTENCl

(d,

WAlRASIAN

EQUILloHlljJol

-------------------------------------------------------------------------------------Denote by x,(x, Y, p) c X, the set of consumption bundles xi so defined. Firm j: Chooses productions yj € ~ that are profit maximizing for P on ~. (Firm j's payoff function is simply its profit.) Denote by )~(x, Y, p) c ~ the set of production plans yj so defined. Market Agent: Chooses prices q €!:J. so as to solve

replacing every X, and every lj by a truncated version:

X, = {x, € X,: IXl;!

:5 r for all t}.

~ = {YJ€ lj: IYol :5 r for all

tl·

t.

Note that A c X, x ... X X, x y, x ... x Because (.£" ... ,.£" ... ,0, ... ,0) € A, it follows that .£, € X, for every i, and 0 € ~ for every j. In particular, all the strategy sets are nonempty. Lemma 17.BB.1 shows that in our search for a free-disposal quasiequilibrium we can limit ourselves to the truncated economy.

Max

Lemma 17.BB.1: If all X; and If are convex and (x", y", p) is a free-disposal quasiequilibrium in the truncated economy, that is, if (x", y", p) satisfies Definition 17.BB.3 of free-disposal quasiequilibrium with the consumption and production sets replaced by their truncated versions, then (x", y", p) is also a free-disposal quasiequilibrium for the original untruncated economy.

ilEA

Lemma 17.BB.2: Suppose that (x·, Y", p) is such that xi € x;(x", Y·, p) for all i, v," € Yj(x·, y", p) for all j, and p € p(x·, Y", pl. Then (x·, y., p) is a free-disposal quasiequilibrium for the truncated economy. Proof of Lemma 17.BB.2: We note first that P' yj ;::: 0 for every j (because 0 € 5'). By the definition of x,(·) and jij('), conditions (i) and (ii') of Definition 17.B8.3 are then automatically satisfied. Hence, the only property that remains to be established is (iii'), that is,

Lxi - LllI, - Lyt:5 0

~ O/jP' yj}'

We have P'x~:5 w/(p,Y") = P'w, The strategy sets are:

P'(LX~ i

E

~/.:

PI ;::: 0 for all t and LI Pr

= I}.

Given a strategy profile (x, Y, 1') = (X,' ... ' X" Y,' ... ' YJ' p), the payoff functions and best-responses of the different agents are: Consumer i:

Chooses consumption vectors xi € X, such that (I) p'xi :5 w,(p, y) and (2) xi ?:;,x7 for all xi' E X, satisfying P' XI < w/(p, y). (Consumer i's payoff function can be thought of as giving a payoff 1 if he chooses a consumption vector satisfying this condition, and 0 otherwise.)

(17.BB.l)

J

Only the behavior of the market agent needs comment. Given the total excess demand vector, the market agent chooses prices so as to maximize the value of this vector. Hence, he puts the whole weight of prices (which, recall, have been normalized to lie in the unit simplex) into the commodities with maximal excess demand. As we have already observed when doing the same thing in the proof of Proposition 17.C.1, this is in accord with economic logic: if the objective is to eliminate the excess demand of some commodities, try raising their prices as much as possible. Lemma 17.BB.2 says that an equilibrium of this noncooperative game yields a frec-disposal quasiequilibrium for the truncated economy.

We are now ready to set up a simultaneous-move noncooperative game. To do so we need to specify the players' strategy sets and payoff functions. To simplify notation we assign to every consumer i, price vector P and production profile Y = (YI'···' YJ), a limited liability amount of wealth

For consumer i: X, For firm j: ~ For the market agent: !:J. = {p

i

i

Denote by p(x, y, p) the set of price vectors q so defined.

Proof of Lemma 17.BB.l: Consider a consumer i (the reasoning is similar for a firm). Because (x·, y.) € A, we have Ix1,1 < r for all t; that is, the consumption bundle of consumer i is interior to the truncation bound. Suppose now that x~ fails to satisfy condition (ii') of Definition 17.BB.3 in the nontruncated economy, that is, that there is an x, € X, such that x, >-,x~, and P' x, < P'W, + LJ O/jP' yt- Denote x7 = (I - (I/n»xr + (l/n)x,. For all n we have P' xi < P'W, + Lj O'JP' yt and, by the convexity of preferences, xi ?:;,x7- Also, we can choose an n large enough to have Ixi,l < r for all t. By local nonsatiation there must then be an xi € X, such that xi >-,xi and p'xi < P'W, + Lj O'JP' yt- But then xi € X, and xi >-,x7 ?:;,X~, and so in the truncated economy xr fails to satisfy condition (ii') of Definition 17.BB.3. Thus, (x·, y., p) must not be a free-disposal quasiequilibrium in the truncated economy. This contradiction establishes the result. _

w,(p, y) = P'w, + Max{o,

(LX,-LW,-LYJ)·q.

I !

I

L

and

+ L J8/jp·yt

for all i and therefore

t):5 o.

LW' - L Y j

j

This implies L, xi - L, w, - LJ yt :5 0 because otherwise the value of the solution to problem (17.BB.l) would be positive and so P (which as we have just seen has "'(L, xi - L, w, - LJ ytl:5 0) could not be a maximizing solution vector, that is, a member of p(x", y", pl. It follows that (x·, Y·) € A and so, X?, < r for all i and t. From this we get that the budget equations are satisfied with equality (i.e., ". xi = p'W, + Lj OIjP' yt for all i) because otherwise local nonsatiation yields that for some consumer i there is a preferred consumption strictly interior to consumer i's budget set in the truncated economy, implying x~ ¢ X,(X·, y., pl. We therefore conclude that we also have P'(L, xr - LI W, - LJ ytl = O. This completes the proof. _

638

CHAPTER

17:

THE

POSITIVE

THEORY

OF

APPENDIX

EQUILIBRIUM

N OW, as we discussed in Appendix A to Chapter 8 (see the proof of Proposition

8.0.3 presented there), under appropriate conditions on the best-response correspondences, this noncooperative game has an equilibrium. Lemma 17.88.3: Suppose that the correspondences x;!"), ~(.), and p(.) are nonempty, convex valued, and upper hemicontinuous. Then there is (x', y', p) such that x7 E x,(x', yO, p) for all i, Yj' E ~(x', yO, p) for all j, and p E p(x', yO, pl. Proof of Lemma 17.BB.3: We are simply looking for a fixed point of the correspondence '1'(.) from X, x ... X X, X YI X •• '. x >J x A to itself defined by 'I'(x.y,p) = x,(x,y,p)

x··· x x,(x,y,p) x y,(x, y, p) x··· x yAx,y,p) x p(x, y, p).

The correspondence '1'(.) is nonempty, convex valued, and upper hemicontinuous. The existence of a fixed point follows directly from Kakutani's fixed point theorem (sec Section M.I of the Mathematical Appendix). _ Lemmas 17.BB.4 to 17.BB.6 verify that the best-response correspondence of this noncooperative game is nonempty, convex valued, and upper hemicontinuous· ' Lemma 17.88.4: For all strategy profiles (x, y, p), the sets xi(x, y, p), pIx, y, p) are nonempty.

~(x,

y, p), and

Proof of Lemma 17.BB.4: For Yj(x, y, p) and pIx, y, p) the claim is clear enough since we arc maximizing a continuous (in fact, linear) function on, respectively, the noncmpty, compact sets ~ and A. For Xi(X, y, p), recall that the continuity of ;:::i implies the existence of a continuous utility representation "i(') for ;:::, .• 4 Let be a maximizer of the continuous function "i(X,} on the nonempty compact budget set (Xi E Xi: p' Xi::;; Wi(p, yO)}. Then x; E Xi (X, y, pl. The budget set is nonempty because Xi E Xi and ,Xi::;; Wi' With P
x;

Lemma 17.88.5: For all strategy profiles the sets xi(x, y, p), Yj(x, y, p), and pIx, y, p) are convex. Proof of Lemma 17.BB.5: We establish the claim for Xi(X, y, pl. You are asked to complete the proof in Exercise 17.BB.6. Suppose that Xi' x; E Xi (x, y, p) and consider Xi. = aXi + (I - a)x;, for any a E [0,1]. Note first that P·Xi. ::;; W,(P, y). In addition, by the convexity of preferences we cannot have Xi :>iXi. and x; :>, x,. (Exercise 17.BB.5). So suppose that Xi. ;:::iXi' Consider now any xi e X, with P'x, < wi(p, y). Then since x, E Xi(X, y, p) we have Xi;::: i xi, and so Xi. ;:::ixi. We conclude that Xi. e Xi(X, y, pl. A similar conclusion follows if Xi. ;:::i x;. Hence, xi(x, y, p) is a convex set. _ Lemma 17.88.6: The correspondences

xi ('), ~(.), and p(.) are upper hemicontinuous.

Proof of Lemma 17.BB.6: Again, we limit ourselves to Xi(·)' Exercise 17.BB.7 asks you to complete the proof for M') and p('). 83. For the firms and the market game this result is covered by Proposition 8.DJ, but for the consumers we need a special argument (as defined, the payoff runctions of the consumers arc not continuous).

84. This was proved in Proposition lC.1 for monotone preferences on Rt As we pointed out there. however. the conclusion actually depends only on the continuity of the preference relation.

B:

GENERAL

APPRQACH

TQ

THE

EXISTENCE

OF

WALRASIAN

x;

Let p" - t p, y" - t y, x" - t x, and x;" -+ as n - t 00, and suppose that x;" E x,(x", y", pO). We need to show that x; e XI(P, x, y). From p"'x;" ::;; W,(P", yA) we get p'x; ::;; W,(pA, yA). Consider now any xi e with x;' :>iX;. Then, by the continuity of preferences, xi :>, x;" for n large enough. Hence, p". xi :?: wi(p", yO). Going to the limit we get p' x7
Xl

x;

closed-graph property that we have replaced preference maximization by the weaker objective of expenditure minimization in the definition of the objectives of the consumer. _

The combination of Lemmas I7.BB.4 to 17.BB.6 establishes that the given best-response correspondences satisfy the properties required in lemma 17.BB.3 for the existence of a fixed-point, which completes the proof of Proposition 17.BB.2. _ The assumptions on preferences and technologies can be weakened in an important respect. Our existence argument requires only that the best·response correspondence .i,(x, y, p) and rj(x, y, p) be convex valued and upper hemicontinuous. Beyond this, the proof imposes no restrictions whatsoever on the dependence of consumers' and firms' choices on the ~state" variables (x. y. pl. Thus we could allow consumers' tastes, or firms' technologies, to depend on prices (money illusion?), on the choices of other consumers or firms (a form of externalities), or even on own consumption (e.g., tastes could depend on a current reference point-a source

I).·"··

of incompleteness or non transitivity of preferences already illustrated in Chapter The following is an example of the sort of generality that can be accommodated: Suppose that consumer preferences are given to us by means of utility functions -,('; x, y, p) defined on X, but dependent, in principle, on the state of the economy. If for every (x, y, p) the conditions of Proposition 17.BB.2 are satisfied, and the parametric dependence on (x, y, p) is continuous, then a Walrasian quasiequilibrium still exists. The proof does not need any change. We can make a similar point with respect to the possibility that firms' technologies depend on external effects, with, then, an added theoretical payoff. It allows us to see that equilibrium exists if the technology of the firm is convex: il does nol mailer iflhe ~aggregale"lechnology oflhe economy is convex. See Exercise 17.B8.8 for more on this. The existence proof we have given in this appendix is an example of a "large space" proof. The fixed'point argument (in our case phrased as a Nash equilibrium existence argument) has been developed in a disaggregated domain where all the equilibrating variables have been listed separately. The advantage of proceeding this way is that the argument remains very flexible and allows us to incorporate the weakest possible conditions without extra effort (as the last paragraph has illustrated). The disadvantage, of course, is that the fixed point may be 85. Suppose, for example, that the utility function of a consumer is given to us in the form

that is, the evaluation of possible consumptions depends on the current consumption. Without loss of generality we can normalize u,(x,: x,) = 0 for every x,. Define the induced weak and strict preference relations '=/ and >-1 on Xi by, respectively, .. xj ~,x/ if ",(xi; x,) ~ 0'" and .. xi >-iXi if uj(x/; x,) > 0." Then the relations ~, and >-, contain all the relevant information for equilibrium analysis. Note, however, that it is perfectly possible for It, not to be complete and for neither It; nor r; to be transitive. See Shafer (1974) and Gale and Mas-Colell (1975) for more u;(·: x;);

on this.

86. Another example of dependence on the overall consumption vector of the economy arises if, for example. we are considering equilibrium at a given point in time. Then current consumptions in the economy (e.g., purchases of physical or financial assets) will typically affect future prices; these, in turn, will innuence current preferences via expectations.

EQUILIBRtUM

639

640

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EXERCISl!:i

EOUILIBRIUM

hard to compute and cumbersome to analyze. Usually, as we have seen in Section 17.C and in Appendix A of this chapter, it is possible to work with more aggregated, reduced systems. In fact, the general point duly made, it is worthwhile to observe that this is so even under the assumptions of Proposition 17.BB.2.17 We elaborate briefly on this. We can prove Proposition 17.BB.2 by selling up a two-player game instead of an I + J + I one'· The first player is an aggregate consumer-firm that has L, X, - IL' w,} - LI ~ as its strategy set; the second is, as before, a market agent having 6. as its strategy set. Given p E 6., the first agent responds with the set of vectors z expressible as z = 1:. x, - L, w, - LI YI' where Jj is profit maximizing in ~ for every j, and x, E X, is such that (I) p' x, :5: P'W, + LI 0'1 p' Yj and (2) x, :::,x; whenever p'x; < p'W, + LI 0'1 p' YI' As before, the market agent responds with the set of q E 6. that maximize Z'q on 6.. Once this two-person game has been set up, the proof proceeds as for Proposition 17.BB.2. You should check this in Exercise 17.BB.9. If for any p E 6. the preference-maximizing choices of consumers, x,( p), and the profitmaximizing choices of firms, )j(p), were single valued, we could go one step further and consider a game with a single player (the market agent). Given p, we would then let the best response of the market agent be the set of price VectOrsqE 6. that maximizes [1:, x,{p) - L, w, - LI y)(p»)-q on Ii. In essence, this is what we did in the proof of Proposition I7.CI.

REFERENCES Arrow. K., and G. Debrcu. (1954). Existence of equilibrium for a competitive economy. f:('(mmnerricu 22: 265-90. Arrow, K.. and F. Hahn. (1971). General Competitive Analys;s. San Francisco: Holden-Day. Arrow, K, and M. Intriligalor, eds. (1982). Handbook of Mathematical Ewnomics. vol. II. Amsterdam:

North-Holland. Barone. E. (1908). II ministro della produzione nello stato colletti vista. Giornali degli economisti. [Reprinted as: The ministry or production in the collectivist state. in Collectivist Economic Planning. edited by F. H. Hayek. London: Routledge, 1935.] Balasko, Y. (1988). Foundations of the Tlteory of General Equifibrium. Orlando: Academic Press. Becker, G. (1962). Irrational behavior and economic (heory. Journal of Political E"onomy 70: 1-13. Brown. D., and R. Matzkin. (1993). Walrasian comparative statics. Mimeograph. Northwestern University. Chipman, J. (1970). External economies of scale and competitive equilibrium. Quarterly Journal of Economics 84: 347-85.

Debreu, G. (1959~ Tlteoryof Value. New York: Wiley. Debreu, G. (1970). Eeonomies with a finite sel of equilibria. Econometrica 38: 387-92. Debreu, G. (1974). Excess demand functions. Journal of Mathematical Economics I: 15-21. Dierker, E. (1972). Two remarks on the number of equilibria of an economy. Econom.trica 40: 951-53. Fisher. F. (1983). Disequilibrium Foundations of Equilibrium Economics. Cambridge. U.K.: Cambridge University Press. Gale. 0 .. and A. Mas~Colell. (1975). An equilibrium existence theorem for a general model without ordered preferences. Journal of Mathematical Economics 2: 9-15. [For some corrections see Journal of Mathematica/Economics 6: 297-98, 1979.] Garcia, C. B.. and W. I. Zangwill. (1981). Pathways to Solutions, Fixed Points and Equilibria. Englewood

Cliffs. N.J.: Prentice-Hall. Grandmont, J. M. (1992). Transformations of Ihe commodity space. behavioral heterogeneity. and the

Hildenbrand. W. (1994). Market Demand: Theory and Empirical Evidenu. Princeton. NJ.: Princeton University Press. Hildenbrand. W.o and H. Sonnenschein, cds. (1991). Handbook of Mathematical Economics. vol. IV. Amsterdam: North-Holland. Kehoe. T. (1985). Multiplicity of equilibrium and comparative statics, Quarterly Journal of Economics 100: 119-48. Kehoc. T. (1991). Compulation and mulliplicily of equilibria. Chap. 38 in Handbook of Mathematical Economics. vol. IV, edited by W. Hildenbrand, and H. Sonnenschein. Amsterdam: North-Holland. Lange. O. (1938). On the economic theory of socialism. In On the Economic Theory of Socialism. edited by It Lippincott. Minneapolis: University of Minnesota Press. McKenzie, L. (1959). On the cltistence of general equilibrium ror a competitive market. E('(mometrica 27: 54-71. Mantd, R. (1974). On the characterization or aggregate excess demand. Journal of Economic Theory 7: 34M-53. Mantel. R. (1976). itomothctic preferences and community CltCCSS demand functions. Joumat (if Eronomk n",o,)" 12: 197-201. Marshall. A. (1920). Princip/t'.'i oj £t'mlomics. 8th ed. London: Macmillan. Mils-Colell, A. (1977). On the equilibrium price set of an exchange economy. JOllrnal of Math('",otical £conomks 4: 117-26. Ma~-Colcll, A. (1985). nil! Theor}' of Gt'nt'fal £wnomic Eqllilihrj,~m: A Dij]er('ntiahlt' Approach. Camhridge. U.K.: Cambridge University Press. Ma~-(,ulcll. A. (1986). Notes. on price and quantity latonnement. In Aloc/d.'i (~r Economic Dytlamin, edited by fL Sonnenschein. Lecture Notes in Economics and Mathematical Systems No. 264. Berlin: Springcr- Vcrl;:tg. ~b~-Coldl. A. ( 1991 ). On the uniqueness of equilihrium once again. Chap. 12 in Eqllilihrillm Tht·ory and A,'plicatirm.'i. edit .......t by W. Barnell, B. Cornel, C. O'Asprcmont, J. Gabszewicz and A. Mas-Colell. Cambridge, U.K_: Cambridge Univtrsily Press. Milgrom. P.. and C. Shannon. (1994). Monotone comparative statics. Econometrica 62: 157-180. Ncgishi, T. (1960)_ Welfare economics and existence or an equilibrium for a competitive economy. Mc.'twe(·onomiC"a 12: 92-97_ Rader. T. (1972). Thenr), lif General Economic Equilihrium. New York: Academic Press. Saari. D., and C. Simon. (1978). Effective price mechanisms. Econometrica 46: 1097-125. Samuelson. P. (1947). Foundations of Economic Anat}'sis. Cambridge. Mass.: Harvard University Press. Scarf. H. (1973). The ComplllOlioll of Economic Equilihria (in collaboralion with T. Hansen). New Haven: Yalc University Press. Sh;lrcr, W. (1974). The non-transitive consumer. Econometrica 42: 913-19_ Sharer, W .• and H. Sonnenschein. (1982). Market demand and ucess demand functions. Chap. 14 in 110m/hook of Mathematical E(·onomir.~ vol. II. edited by K. Arrow and M. Intriligator. Amsterdam:

North·Holland. Shoven. 1.. and J. Whalley. (1992). Applring Gtn"al £quilibrium. New York: Cambridge University Press. Sonnenschein, H. (1973)_ Do Walras' identity and continuity characterize the class of community excess demand runctions'! Journal of Economic Theory 6: 345-54. Smale. S. (1976). A convergent process of price adjustment and global Newton methods. Journal of MllIlrematical Economics 3: 107-20. Starr. R. (1969). Quasi-equilibria in markets with non--convex preferences. Econometrica 37: 25-38. V.uian, H. (1977). Non-Walrasian equilibria. Econometrica 45: 573-90. Walras, L. (1874). Elements d'Economie Politique Pure. Lausanne: Corbaz. [Translated as: EI('mems of Pure E(·(tn(tmic_~. Homewood, III.: Irwin, 1954.]

aggregation problem. Journal of Economic Theory 57: 1-35.

Hahn, F. (1982). Stability. Chap. 16 in Handbook of Mathematical Economics, vol. II. edited by K. Arrow. and M. Intriligator. Amsterdam: North-Holland. 87. But it is not so for the generalizations described in the previous paragraph. 88. This was the approach taken in Debreu (1959).

EXERCISES

17.B.IA Show that for a pure exchange economy with J = 1 and Y, = -R~, "J'j:5: 0, p'rt = 0, and p ~ 0" ir and only ir "yj E YI and p' yj ~ p' Yt for all Yt E YI ."

041

642

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

EXERCISES

643

------------------------------------------------------------------------------------------------17.0.2" Prove property (v) of Proposition 17.0.2. The proof of Proposition 17.B.2 in the text contains a hint. Recall also the following technical fact: any bounded sequence in RL has a convergent subsequence. 17.B.3" Suppose that z(·) is an aggregate excess demand function satisfying conditions (i) to (v) of Proposition 17.8.2. Let p. - P with some, but not all, of the components of P being zero.

17,C.3" Consider an exchange economy in which every consumer I has continuous, strongly monotone, strictly convex preferences, and w, » O. The peculiarity of the equilibrium problem to be considered is that the consumer will now pay a type of tax on his gross consumption; moreover, this tax can differ across commodities and consumers. We will also assume that total tax receipts are rebated equally across consumers and in a lump-sum fashion. Specifically, for every i there is a vector of given tax rates Ii = (t 11_ .•. tu) ~ 0 and for every price vector P » 0 the budget set of consumer i is 1

(a) Show that as n becomes large, the maximal excess demand is always obtained for some commodity whose price goes to zero. (b) Argue (if possible by example) that a commodity whose price goes to zero may actually remain in excess supply for all n. [Hint: Relative prices mailer.] 17.B.4" Suppose that there are J firms whose production sets Y ... , >J C RL are closed, " strictly convex, and bounded above. Suppose also that a strictly positive consumption bundle is producible using the initial endowments and the economy's aggregate production set Y = Li Ij (i.e., there is an x » 0 such that x e {LI w,} + Y). Show that the production inclusive aggregate excess demand function i(p) in (l7.B.3) satisfies properties (i) to (v) of Proposition 17.B.2. 17.B.S' Suppose that there are J firms. Each firm produces a single output under conditions of constant returns. The unit cost function of firm j is cJ(p), which we assume to be dilTerentiable. The consumption side of the economy is expressed by an aggregate excess demand function z(p). Write down an equation system similar to (l7.B.4)-(I7.B.5) for the equilibria of this economy. 17.8.6 C [Rader (1972)] Suppose that there is a single production set Yand that Y is a closed, convex cone satisfying free disposal. Consider the following exchange equilibrium problem. Given prices P = (P,' ... , pd, every consumer i chooses a vector V, e RL so as to maximize ?:;:, on the set {x,e Xi: P'V,:S; P'w" and x, ~ V, + y for some ye Y}. The price vector p and the choices v· = (vr, ... , vr> are in equilibrium ifL' v~ = L, w,. Show that, under the standard assumptions on preferences and consumption sets, the price vector and the individual consumptions constitute a Walrasian equilibrium for the economy with production. Interpret. 17.C.I' Verify that the correspondence f(·) introduced in the proof of Proposition l7.C.1 is convex· valued.

17.C.2C Show that a convex-valued correspondence z( -) defined on R~ + and satisfying the conditions (i) to (v) listed below (parallel to the corresponding conditions in Proposition 17.C.I) admits a solution; that is, there is a p with Oe z(p). (i) (ii) (iii) (iv) (v)

z(·) is upper-hemicontinuous.

{z;, ... , zi.} -

00.

[Hilll: If you try to replicate exactly the proof of Proposition l7.C.l you will run into difficulties with the upper-hemicontinuity condition. A possible three-step approach goes as follows: (I) Show that fo 0 small enough the solutions must be contained in 11, {p e 11: PI ~ dar all (2) argue then that for r > 0 large enough, one has z(p) c [-r, r]L for every p E 11,; finally, (3) carry out a fixed-point argument in the domain 11, x [-r, rlL. For an easier result, you could limit yourself to prove the convex-valued parallel to Proposition l7.e2. The suggested domain for the fixed-point argument is then 11 x [-r, rl'.

n;

R~: ~ (I + IIi)PIXIi :s; Wi}'

An equilibrium wilh laxes is then a price vector p »0 and an allocation (xr, ...• xl) with L. Wi such that every i maximizes preferences in Bi(p, P'W, + (II/XL" I"PIX"»'

LX~ =

(a) Illustrate the notion of an equilibrium with taxes in an Edgeworth box. Verify that an equilibrium with taxes need not be a Pareto optimum. (b) Apply Proposition 17.C.l to show that an equilibrium with taxes exists. (c) As formulated here, the taxes are on gross consumptions. If they were imposed instead on net consumptions, that is, on amounts purchased or sold, then (assuming the same rate for buying or selling) the budget set would be B,(p,

r,)

=

{x;eR~:p.(X'-W;) + ~l/lpli(x" -w,,)I:s; r,},

where the r, are the lump-sum rebates. In what way does this budget set differ from that described previously for the case of taxes on gross consumptions? Represent graphically. Notice the kinks. (d) Write down a budget set for the situation similar to (c) except that the tax rates for amounts bought or sold may be different. (c) (More advanced) How would you approach the existence issue for the modification described in (c)? 17.C.4 A Consider a pure exchange economy. The only novelty is that a progressive tax system is instituted according to the following rule: individual wealth is no longer p'W,; instead, anyone with wealth above the mean of the population must contribute half of the excess over the mean into a fund, and those below the mean receive a contribution from the fund in proportion to their deficiency below the mean. (a) For a two-consumer society with endowments w, = (1,2) and after-tax wealths of the two consumers as a function of prices.

Wz

= (2, I), write the

(b) If the consumer preferences are continuous, strictly convex, and strongly monotone, will the excess demand functions satisfy the conditions required for existence in Proposition I7.CI given that wealth is being redistributed in this way?

z(·) is homogeneous of degree zero.

For every p and z e z(p) we have p' z = 0 (Walras' law). There is 5 E R such that Zl> -5 for any z e z(p) and p. If p. _ p ",. 0, z· e z(p·) and PI = 0 for some t, then Max

Bi(p, Wi) = {Xi E

=

17.C.S" Consider a population of / consumers. Every consumer i has consumption set R~ and continuous, strictly convex preferences ?:;:i' Suppose, in addition, that every i has a household technology>; c RL satisfying 0 E t;. We can then define the induced preferences ?:;:~ on by Xi ?:;:~ xi if and only ifror any Yi e t; with xi + Yi ~ 0 there is y, e lj with x, + y, ~ 0 and Xi + Yi ?:;:,x; + yi (i.e., whatever can be done from xi, something at least as good can be obtained from Xi)'

R'.

(a) Show that induced preferences are rational, that is, complete and transitive. (b) Show that if t; is convex then induced preferences

?:;:~

are convex.

644

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

(e) Suppose that goods are of two kinds: marketed goods and nonmarketed household goods. Initial preferences ;::;, care only about household goods, and initial endowments W, have nonzero entries only for marketed goods. Use the concept of induoed preferences to set up the equilibrium problem as one that is formally a problem of pure exchange among marketed goods. Discuss.

-

E X ERe I 5 E 5

17.0.S' Show by explicit computation that the index of the equilibrium of a one-consumer Cobb- Douglas pure exchange economy is + I. 17.E.I' Derive expressions (17.E.I) and (I7.E.2). 17.E.2' Derive expression (17.E.3).

17.C.6B Let L = 2. Consider conditions (il, (iii), and (iv) of Proposition 17.B.2 Exhibit four examples such that in each of the examples only one condition fails and yet the system of equations z(p) = 0 has no solution. Why is condition (ii) not included in the list?

17.E.3" Provide explicit utility functions rationalizing at a given price vector p the individual excess demands Z,( p) and matrices of price effects Dz,(p) constructed in the proof of Proposition 17.E.2.

17.0.IB Consider an exchange economy with two commodities and two consumers. Both consumers have homothetic preferences of the constant elasticity variety. Moreover, the elasticity of substitution is the same for both consumers and is small (i.e., goods are close to perfect complements). Specifically,

17.E.4" Consider the two-commodity case. Give an example of a function z(p) defined on = {(PI' P2)>> 0: E < (p,lp2) < (lIE)}, and with values in R2, that is continuous, is homogeneous of degree zero, satisfies Walras' law, and cannot be generated from a rational preference relation. Represent graphically the offer curve associated with this function. NOle that it goes through the initial endowment point and compare with the construction used in Figure 17.E.2.

u,(x", x,,) = (2x~,

+ X~,)"P

and

u,(x", x,,) = (Xf2

+ 2x~2)''''

r,

and p = -4. The endowments are w, = (1,0) and 002 = (0,1). Compute the excess demand function of this economy and verify that there are multiple equilibria.

17.E.SA Show that the choices represented in Figure 17.E.3 cannot be generated from consumers wilh endowment vectors bounded above by (1,1) and nonnegative consumption.

17.0.2' Apply the implicit function theorem to show that if J(v) = 0 is a system of M equations in N unknowns and if at jj we haye J(V) = 0 and rank DJ(jj) = M, then in a neighborhood of jj the solution set of J(') = 0 can be parameterized by means of N - M parameters.

I7.E.6 A Show that the excess demand function Z,(P) = e' - P,P, defined for IIpll = I. is proportionally one-to-one in the sense used in the general proof of Proposition 17.E.3 (at the end of Section 17.E).

17.0.3' Carry out explicitly the computations for Proposition 17.0.4.

17.E.7" Show directly that Ihe excess demand function z,(p) = e' - P,P used in the general proof of Proposition 17.E.3 salisfies the strong axiom of revealed preference.

17.0.4c Consider a two-commodity, two-consumer exchange economy satisfying the appropriate differentiability conditions on utility and demand functions. There is a total endowment vector 6i» O. Show that for almost eyery W,« 6i the economy defined by the initial endowments w, and W2 = 6i - w, has a finite number of equilibria. This differs from the situation in Proposition 17.0.2 in that total endowments are kept fixed. [Hint: You should use the properties of the Slutsky matrix.] 17.0.5" Consider a two-commodity, two-consumer exchange economy satisfying the appropriate differentiability conditions on utility and demand functions. Set the equilibrium problem as an equation system in the consumption variables x, e R~ and X2 e R~, the price variables p e R~, and the reciprocals of the marginal utilities of wealth )., e Rand ).2 e R (neglect the possibility of boundary equilibria). The parameters of the system are the initial endowments (00" (2) e R". Prove without further aggregation that (after deleting one equation and one unknown) the system satisfies the full rank condition of the transyersality theorem. 17.0.68 The setup is identical to Exercise 17.D.S except that an externality is allowed: The (differentiable) utility function of consumer I may depend on the consumption of COnsumer 2; that is, it has the form u,(x"x2) where X, is consumer i's consumption bundle [but we still have U2(X 2 )]. Equilibrium is defined as usual, with the proviso that consumer I takes consumer 2's consumption as given. Show that, generically on initial endowments (00" ( 2 ) E R", the number of equilibria is finite. 17.0.7 8 Suppose the agents of an overall exchange economy are distributed across N islands with no communication among them. Each island economy has three equilibria. (a) Argue that the number of equilibria in the overall economy is 3N • (b) Suppose now that the islands' economies are identical and that there is a possibility of communication across the islands: free and costless transportation of commodities. Show that then the number of equilibria is 3.

17.F.lc Show that expression (I7.F.2) gives rise to a negative semidefinite matrix of price effects. D:( pl. if initial endowments are proportional among themselves or if consumptions are proportional among themselves. 17.F.2' Complete the requested verification of Example 17.F.1. 17.F.3" There are four goods and two consumers. The endowments of the consumers are "'I = (W,I.W21'0.0) and 002 = (W , 2. Wu. 0,0). Consumer I spends all his wealth on good 3 while consumer 2 does the same on good 4. Specify some values of w, and W 2 for which the corresponding excess demand of this economy does not satisfy the weak axiom of revealed preference.

17.F.4A Suppose that there are L goods but that for every consumer there is a good such that at any price the consumer spends all his wealth on that good (perhaps goods are distinguished by their location). Show that the aggregate excess demand will satisfy the (weak) gross substitute property. 17.F.Sc Complete the missing steps of Example 17.F.2. 17.F.6 c Consider a two-consumption-good, two-factor model with constant returns and no joint production. In fact. suppose that the production functions for the two consumption goods are Cobb-Douglas. Consumers have holdings of factors and have preferences only for the two consumption goods. The economy is a closed economy (at equilibrium. consumption must equal production). Suppose that the two goods are normal and gross substitutes in the demand JUII(·tion of the consumers. Define an induced exchange economy for factors of production by assuming that at any vector of factor prices the two goods are priced at average cost and the final demand for them is met. Show that the resulting aggregate excess demand for factors of production has the gross substitute property and, consequently. that there is a unique equilibrium for the overall economy.

64:1

646

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EX E R CIS E S

EOUILIBRIUM

------------------------------------------------------------------------------------------------example, be the system of excess demands corresponding to a subgroup of markets with the prices of commodities outside the group kept fixed.

17.F.7A Prove expression (17.F.3) for L = 2. 17.F.SA Show that expression (J 7.F.3) implies that the set of solutions to z(p) = 0 is convex. 17.F.9 S Consider an economy with a single constant returns production set Y. Preferences are continuous, strictly convex, and strongly monotone. Suppose that the feasible consumptions (Xl' ... ' x,) are associated with a Walrasian equilibrium. Assume, moreover, that no trade is required to allain these consumptions if Yis freely available to all consumers; that is XI - W, E Y for all i. Show then that those are the only possible equilibrium consumptions. 17.F.10A Show that expression (17.F.3) implies that Dz(p) is negative semidefinite at an equilibrium p.

=

=

17.F.J1 B Show that if z(p) 0, rank Dz(p) L - I, and Dz(p) is negative semidefinite, then, for any (, the (L - I) x (L - I) matrix obtained from Dz(p) by deleting the (th row and column has a determinant of sign (_I)L-'. [Hint: From Section M.D ofthe Mathematical Appendix you know that rank Dz(p) = L - I implies that the (L - I) x (L - I) matrix under study is nonsingular. Consider then Dz(p) - d.] 17.F.12" Show that if z(p) = 0 and Dz(p) has the gross substitute sign pallern, then the (L - I) x (L - I) matrix obtained from Dz(p) by deleting the Ith row and column has a

lIegalive dominanl diagonal (see Section M.D of the Mathematical Appendix for this concept)

and is therefore negative definite. 17.F.13 A Provide the missing computation for Example 17.F.3. B

17.F.14 Consider a firm that produces good lout of goods 1= 2, ... , L by means of a production function /(v" .. .• vd. Assume that /(.) is concave, increasing. and twice continuously differentiable. We say that I and I' are complements at the input combination v = (v" . .. , vd if 0' /(v)/ov, ov,. > O. (a) Verify that for the Cobb-Douglas production function f(v" ... , VL) = vi' x ... x ~, + ... + ~L !> I, any two inputs are complements at any •.

.r',

(b) Suppose that f(') is of the constant returns type. Show that at any. and for any there is an (' that is a complement to t at v.

t

(c) Suppose now that f(·) is strictly concave and that any two inputs are complements at any •. Let ",(p" ... , pel be the input demand functions. Show thaI, for any t, a.tlap, > 0, < 0, and < 0 for t' # I.

av,/ap,

a.tlaPr

(d) Discuss the implications of (a) to (c) for uniqueness theorems that rely on the gross substitute property.

(a) We say that g(.) satisfies the strong gross subslilule properly (SGS) if for some" > 0 every coordinate of the function "g(p) + p is strictly increasing in p and (IXU(P) + p) E [0. rJ" for every p E [0, r]H. Show that if g(p) has the SGS property then it also has the GS property. (b) Show by example that the GS property does not imply the SGS property. Establish, however, that if g(') is continuously differentiable and the GS property is satisfied then the SGS property holds.

From now on we assume that g(') satisfies the SGS property. (e) Show that there is an equilibrium, that is, a p with g(p) = O. Illustrate graphically for the case N = I. [Hinl: Quote the Tarski fixed point theorem from Section M.I of the Mathematical Appendix, or, if you prefer, assume continuity and apply Brouwer's fixed point theorem.] (d) Give an example for N = 2 where the equilibrium is not unique.

=

=

(e) Suppose that g(p) g(p') O. Show that there must be an equilibrium p' such that p. :2: P and p' :2: p'. Similarly, there is an equilibrium p- such that p- :S P and p- :S p'. [Hint: Apply the argument in (e) to the domain [Max {p" p;}. r] x ... x [Max {PH' pj,}, r].]

(f) Argue (you can assume continuity here) that the equilibrium set satisfies a strong and very special property, namely, that it has a maximal and a minimal equilibrium. That is. there are pm.. and pm;. such that g(pm .. ) = g(pm;") = 0 and pm;" S p!> p-' whenever g(p) = O. (g) Assume now that g(.) is also differentiable. Suppose that we know that at equilibrium, that is, whenever g(p) = 0, the matrix Dg(p) has a negative dominanl diagonal; that is, Dg(p)v« 0 for a .» O. Argue (perhaps non rigorously) that the equilibrium must then be

unique. (h) Suppose that g(') is the usual excess demand system for the first N goods of an economy with N + I goods in which the last price has been fixed to equal I and the overall (N + I)-good excess demand system satisfies the gross substitute property. Apply (g) to show that the equilibrium is unique. 17.F.17A [Becker (1962), Grandmont (1992)] Suppose that L = 2 and you have a continuum of consumers. All consumers have the same initial endowments; they arc not rational, however. Given a budget set, they choose at random from consumption bundles on the budget line using a uniform distribution among the nonnegative consumptions. Let z(p) be the average excess demand (= expected value of a single consumer's choice). Show that z(·) can be generated from preference maximization of a Cobb-Douglas utility function (thus the economy admits a positive representative consumer in the sense of Section 4.0).

17.F.lS" Consider a one-consumer economy with prOduction and strictly convex preferences. There is a system of ad valorem taxes I = (I" ... , Id creating a wedge between consumer and producer prices; that is, PI = (I + I,)q, where P, and q, are, respectively, the consumer and producer price for good t. Tax receipts arc turned back in lump-sum fashion. Write the definition of (distorted) equilibrium. Show that the equilibrium is unique if the production sector is of the Leontief type (a single primary factor, no joint production, constant returns) and all goods are normal in consumption. can you argue by example the nondispensability of the last normality condition? If this is simpler, you can limit your discussion to the case of two commodities (one input and one output).

17.G.l" Suppose that in an exchange economy (and with the normalization PL = I) we are given equilibrium prices p(w,) as a differentiable function defined as an open domain of the endowments of the first L - I goods of the first consumer, W, = (w", ... , w L -,.,). All the remaining endowments are kept fixed. Suppose that the demand function of the first consumer is strictly normal in the sense that Dw,x,(p, w,) » 0 through the relevant domain of (p, w,). Show then that for any';', and;; = P(';',), we have rank D.. i,(;;;';',) = L - 1 and rank Dp(';',) = L - I, where il(p; w,) is the excess demand function ~f the first consumer for the first L - 1 goods.

17.F.16C Suppose that g(p) = (g,(p), ...• gH(P» is defined in the domain [0, r]H and that g(O, . .. ,0) » (0, ... ,0), g(r, ... , r) « (0. ...• 0). Note that we do not assume Walras' law, homogeneity of degree zero, or, for that maller, continuity. The function g(') could, for

it;;;

17.G.2" The setting is as in Exercise 17.G.1 or as in Proposition 17.G.2. Suppose that cD,) = O. Show that there are economies with D,£(;;; cD,) an (L - I) x (L - I) negative definite matrix but where op,(,;")/aw,, > O. [Hint: Use Proposition 17.G.I and the arguments employed in its proof.]

647

648

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

EXERC'SES

649

------------------------------------------------------------------------------------------------I7.G.3C The setting is a. in Exercise 17.F.16. except that now we have two functions g(p) ERN and (J(p) eRN. Each of these functions satisfies the conditions of Exercise 17.F.16 (in particular the SDS property). In addition. we assume that b(-) is an upward shift of g('); that i•• (J(p) ~ g(p) for every pE [O.r]N. Prove that if (pml •• p .... ) and (~ml •• P"''') are the minimal and maximal equilibrium price vectors (see Exercise 17.F.16) for g(') and b('), respectively. then pmln ~ pmln and pm.. ~ pm... [You can assume that g(') and !I(') are continuous; if this makes things simpler. assume also that both functions have a unique solution.] Represent graphically for the case N = I. 17.H.l c Suppose that the system of excess demand functions z(p) satisfies the gross substitute property. Consider the tatonnement price dynamics dp, = Z/(P) dt

For any price vector p let o/t(p)

for every

t.

(0)

= Max {z,(p)/p, •...• zL(p)/pd.

(I) Argue that if pCt) is a solution for the above tatonnement dynamics (i.e .•

dp,(t)/dt - ZI(pCt» for every t and t) and z(pCO» ¢ 0 then o/t(pCt» should be decreasing through time. [Him: If ZI(pCt»/PI(t) = o/t(pCr)) then PI(t)/PI'(t) cannot decrease at t for any t'. Hence. Z/(pCt» cannot increase. whereas P, surely increases.]

(b) Argue that p(t) converges to an equilibrium price as t dynamics (0) Walras' law implies that LI pJ(t) = constant.]

00 •

[Hint: Recall that for the

17.H.2" There is an output good and a numeraire. The price of the output good is p. The data of our problem are given by two functions: The consumption side of the economy provides an excess demand function z(p) for the output good. and the production side an increasing inverse output supply function pCz). Both functions are differentiable. In addition. their graphs cross at (1.1). which is the equilibrium we will concentrate on in this exercise. Given this selling we can define two one-variable dynamics: (i) In Walras price dynamics we assume that at p the price increases or decreases according to the sign of the difference between excess demand and (direct) supply at p. (ii) In Marshall quantity dynamics we assume that at z production increases or decreases according to the sign of the difference between the demand price (i.e .• the inverse excess demand) and the supply price (i.e.• p(z» at z. (a) Write the above formally and interpret economically. (b) Suppose that the technology is nearly of the constant returns type. Show then that around the equilibrium (1.1) the system is always Walrasian stable but that Marshallian stability depends on the slope of the excess demand function (in what way?). (c) Write general price and quantity dynamics where prices move II la Walras and quantities la Marshall. Draw a (P. z) phase diagram and argue that in the typical case dynamic trajectories will spiral around the equilibrium.

a

(d) Go back to the technology specification of (b). Show that the system in (c) is locally stable if and only if the equilibrium is Marshallian stable.

17.1.1A Argue that the replica procedure described at the beginning of Section 11.1 does effectively include the case where the numbers of consumers of different types are not the same (a"ume, for simplicity. that the proportions of the different types are rational numbers). [Hint: Redefine the size of the original economy.]

v'.

17.1.2A Consider for a one-input. one-output problem the production function q = where " is thc amount of input. Show that the corresponding production set Y is additive but that the smallest cone containing it. yo. is not closed. Discuss in what sense the nonconvexity in Y is large. Argue that, whatever the number of consumers, there is no useful sense in which an equilibrium (nearly) exists. 17.1.3" There arc three commodities: the first is a high-quality good. the second is a low·quality good, and the third is labor. The first and second goods can be produced from labor according to the production functions f,(v) = Min {v. I} and fiv) = Min {v', I} for 0 < /1 < I. The economy has one unit of labor in the aggregate. Labor has no utility value. There are two equally sizcd classes of agents, with a very large number of each. "Rich" and "poor" have identical endowments, but the rich own all the shares in the firms of the economy. The rich spend all their wealth on the high-quality good; the poor must buy either one quality or the other-they cannot buy both. The utility function of the poor is U(XI' x,) = x, + lx" defined for (x,. x,) not both positive. (a) Which standard hypothesis of the general model docs this economy fail to satisfy? (h) Show that there can be no equilibria other than one in which both qualities of product arc produced. (e) Show that an equilibrium exists. 17.AA.IA Consider an exchange economy in which the preferences of consumers are monotone, strictly convex, and represented by the utility functions (u,(·) •... , u,(·». Show that for any (-" ......,) »0 there can be at most one Pareto optimal allocation x = (x" ... , x,) such that (III (x;) • ... , u,(x,)) is proportional to (5, •. . .• 5,). 17.AA.2" Consider the welfare-theoretic approach to the equilibrium equations described in Appendix A (the Negishi approach). The existence of a solution to the system of equations q(s) = 0 defined there follows from a fixed-point argument similar to the one carried out in Proposition 17.C.2. Assume that you are in an exchange economy with continuous, strictly convex and strongly monotone preferences, and that w, »0 for every i. Assume also that yes) turns out to be a function rather than a correspondence (a sufficient condition for this is that preferences be representable by differentiable utility functions and that at every Pareto optimal allocation at least one consumer gets a strictly positive consumption of every good). (a) Show that yes) is continuous. (b) Show that yes) satisfies a sort of Walras' law:

"L' y,(s) = 0, for every 5."

(e) Show that if s, = 0 then g,(s) > O. [Hint: If 5, = 0 then u,(x,(s» = 0 and so pCs)' x,es) = 0.] (d) Complete the existence proof. (Note that g(s) is also defined for 5 with zero components. This makes mailers simpler.)

(e) Consider the simplest price and quantity dynamics in the limit case where there are constant returns and excess demand is also a constant function. Draw the phase diagram. Suppose now that the quantity dynamics is modified by making the quantity responses depend not only on price and cost but also on the "expectation of sales. that is. on the excess demand. Will this have a stabilizing or a destabilizing effect?

17.AA.3" Suppose that. in an exchange economy, consumption sets are R~ and preferences are representable by concave. increasing utility functions u,(·). Let f1 = p. E R~: L, i., = I} be a simplex of utility weights. Suggest an equation system for Walrasian equilibrium that proceeds by associating with every i. a linear social welfare function.

17.H.3A For L = 3 draw an example similar to Figure 17.H.2 but in which there is a single equilibrium that. moreover. is locally totally unstable. Could you make it a saddle?

17_BB.1A Give a graphical example (for L = 2) of a Walrasian quasiequilibrium with strictly positive prices that is not an equilibrium for an economy in which:

fl

650

CHAPTER

17:

THE

POSITIVE

THEORY

OF

EQUILIBRIUM

-------------------------------------------------------------------------------------------------(i) (ii) (iii) (iv)

For For For For

every j, Ij = -R~. every i, XI is nonempty, closed, convex and satisfies XI + R~ C XI' every i, preferences arc continuous, convex, and strongly monotone. every i, WI e XI'

Why does this example not contradict any result given in the text (see the small-type discussion after the proof of Proposition 17.BB.I)? 17.BB.2" Consider an economy in which every consumer desires only a subset of goods and has holdings of only some goods. For the commodities desired, however, the preferences of the consumer are strongly monotone (they' are also continuous) on the corresponding nonnegative orthant. Suppose in addition that Li WI » 0 and that the economy satisfies the following jlldemmposabililY condition: It is not possible to divide consumers into two (nonempty) groups so that the consumers of one of the groups do not desire any of the commodities owned by the consumers of the other group.

Show then that any Walrasian quasiequilibrium is an equilibrium. 17.BB.3c Consider an Edgeworth box where preferences are continuous, strictly convex and locally nonsatiated (but not necessarily monotone). Suppose also that frcc disposal of commodities is not possible. Argue that, nonetheless, the offer curves must cross and, therefore, Ihat an eq uilibrium exists. Show that at equilibrium the two prices cannot be negative. In fact, at least one price must be positive (this is harder to show). 17.BB.4A Prove that if (x', y', p) is a free-disposal quasiequilibrium and Y, satisfies free disposal, then we can get a true quasiequilibrium by changing only the production of firm I. 17.BB.5 A Provide the missing step in the proof of Lemma 17.BB.5 (that is, show that the convexity of preferences implies that XI >-1 X .. and xj >-,x .. cannot both occur for Xi.

=

IXXI

+ (I

-

IX)X;).

17.BB_6 A Complete the proof of Lemma 17.BB.S by verifying the convexity of YJ(x, y, p) and of p(", y, pl. 17.BB.7A Complete the proof of Lemma 17.BB.6 by verifying the upper hemicontinuity of the correspondences jiJ(') and ;;(.). 17.BB.S" [Existence with production externalities; see Chipman (1970) for more on this topic.] There are L goods. Good L is labor and it is the single factor of production. Consumers have consumption set R~, continuous, strongly monotone, and strictly convex preferences, and endowments only of labor. Good t = I, ... , L - I is produced in sector t, which is composed of )1 identical firms. The production function of a firm in sector ( is it("t) = IXt"~' for 0 < PI :s; I. The peculiarity of the model is that the productivity coefficient IXt will not be a constant but will depend on the aggregate use of labor in sector t. Precisely, IXt

=

Yt(t "n)",

Yt> 0 and Pt

~ O.

(a) Define the notion of Walrasian equilibrium. Assume in doing so that individual firms neglect the effect on IXt of their use of labor. To save on notation, suppose also that profit shares are equal across consumers. (b) Prove the existence of a Walrasian equilibrium for the current model (make the standard additional assumptions that you find necessary). [Him: The general proof of Appendix B needs very few adaptations.]

EX ERe I S E S

651

-------------------------------------------------------------------------------------------(c) Derive and represent the aggregate production set of each sector. Which conditions on the parameters (Jt, Yt, Pt guarantee that the aggregate production set of sector t exhibits increasing. constant, or decreasing returns to scale? (d) Note that the existence conditions of (b) may be satisfied while the aggregate production set is not convex. What would happen if the externality of sector ( were internalized by putting all the firms of the sector under joint management? (e) Suppose that L = 2, (JI = 1 and individual preferences are quasilinear in labor; that is, they admit a utility function ",(x,,) + Xli' Discuss, both analytically and graphically, the bias of the equilibrium level of production relative to the social optimum.

17.88.9" Carry out the existence argument for the two-player-game approach described at the end of Appendix B.

C

Some Foundations for

HAP

T

E

R

18

SECTION

18.A Introduction

AND

Definition 18.B.1: A coalition Sc:I improves upon, or blocks, the feasible allocation x· = (xf, ... , xn e R\I if for every i E S we can find a consumption Xi ~ 0 with the properties: (i)

Xi">-i Xi

for every i e S.

(ii) Los Xi e Y + {LoS W;}. Definition \S.B.\ says that a coalition S can improve upon a feasible allocation

x' if there is some way that, by using only their endowments L i d WI and the publicly available technology Y, the coalition can produce an aggregate commodity bundle that can then be distributed to the members of S so as to make each of them better off. I. The constant returns assumption is important. With general production sels the difficulty is Ihat we cannot avoid being explicit aboul ownership shares. However, these have been defined to be profll shares, which makes our conceptual apparatus dependent on the very notion of prices whose emergence we are currently trying to explain. Thus we stick here to the case of constant returns. This is not a serious restriction: recall from Section S.B (Proposition S.B.2) that it is always possible to reduce generailechnologies to the constant returns case by reinterpreting the ownership shares as endowments of an additional "managerial" input.

18.B Core and Equilibria The theory to be reviewed in this section was proposed by Edgeworth (IS81). His aim was to explain how the presence of many interacting competitors would lead to 652

CORE

the emergence of a system of prices taken as given by economic agents, and consequently to a Walrasian equilibrium outcome. Edgeworth's work had no immediate impact. The modern versions of his theory follow the rediscovery of his solution concept (known now as the core) in the theory of cooperative games. Appendix A contains a brief introduction to the theory of cooperative games; this section, however, is self-contained. For further, and very accessible, reading on the material of this section, we refer to Hildenbrand and Kirman (19SS). The theory of the core is distinguished by its parsimony. Its conceptual apparatus does not appeal to any specific trading mechanism nor does it assume any particular institutional setup. Informally, the notion of competition that the theory explores is one in which traders are well informed of the characteristics (endowments and preferences) of other traders, and in which the members of any group of traders can bind themselves to any mutually advantageous agreement. The simplest example is a buyer and a seller exchanging a good for money, but we can also have more complex arrangements involving many individuals and goods. Formally, we consider an economy with I consumers. Every consumer i has consumption set R\, and endowment vector WI ~ 0, and a continuous, strictly convex, strongly monotone preference relation ;::" There is also a publicly available constant returns convex technology Y c: RL.' For example, we could have Y = - R\, that is, a pure exchange economy. All of these assumptions are maintained for the rest of the section. As usual, we say that an allocation x = (x" ... , XI) E R\I is feasible if:L XI = Y + LIW; for some ye Y. With a slight abuse of notation, we let the symbol I stand for both the number of consumers and the set of consumers. Any nonempty subset of consumers Sc:I is then called a coalition. Central to the concept of the core is the identification of circumstances under which a coalition of consumers can reach an agreement that makes every member of the coalition better off. Definition IS.B.! provides a formal statement of these circumstances.

Competitive Equilibria

Up to this point of Part IV, the existence of markets in which prices arc quoted and taken as given by economic agents has been assumed. In this chapter, we discuss four topics that, in essence, have two features in common: The first is that they all try to single out and characterize the Walrasian allocations from considerations more basic than those stated in its definition. The second is that they all emphasize the role of a large number of traders in accomplishing this task. In Section IS.B we introduce the concept of the core, which can be viewed as embodying a notion of unrestricted competition. We then present the important core equivalence theorem. Section IS.C examines a more restricted concept of competition: that taking place through well·specified trading mechanisms. The analysis of this section amounts to a reexamination in the general equilibrium context of the models of noncooperative competition that were presented in Section 12.F. The motivation of the remaining two sections is more normative. In Section IS.D we show how informational limitations on the part of a policy authority (constrained to use policy tools relying on self-selection, or envy freeness) may make the Walrasian allocations the only implementable Pareto optimal allocations. In Section IS.E the objective is to characterize the Walrasian allocations, among the Pareto optimal ones, in terms of their distributional properties. In particular, we ask to what extent it can be asserted that at the Walrasian allocation everyone gets her "marginal contribution" to the collective economic well-being of society. A number of the ideas of this chapter (especially those related to the core, but also some in Section IS.E) have come to economics from the cooperative theory of games. This therefore seems a good place to present a brief introduction to this theory; we do it in Appendix A.

II.B:

EOUILIBRIA

653

--------------------------------------------------------------------------------------------

1

654

CHAPTER

18:

SOME

FOUNDATIONS

FOR

COMPETITIVE

SECTION

EQUILIBRIA

II.B:

CORE

AND

EQUILIBRIA

655

------------------------------------------------- ------------------------------------------------x"

~-+--------------~------------~~

Flgur. ".8.2 ~:---+-2:;,

Figure 18.B.l

~2

Definition 18.B.2: We say that the feasible allocation x' = (x! ..... xi) E R~I has the core property if there is no coalition 01 consumers ScI that can improve upon x*. The core is the set 01 allocations that have the core property. We can see in the Edgeworth box of Figure 18.B.1 that for the case of two consumers the core coincides with the contracl curve. With two consumers there are only three possible coalitions: {I, 2}, {I}, and {2}. Any allocation that is not a Pareto optimum will be blocked by coalition {I, 2}.2 Any allocation in the Pareto set that is not in the contract curve will be blocked by either {I} or {2}. With more than two consumers there are other potential blocking coalitions, but the fact that the coalition of the whole is always one of them means that all allocations in lite core are ParelO optimal. We also observe in Figure 18.B.I that the Walrasian equilibrium allocations, which belong to the contract curve, have the core property. Proposition 18.B.1 tells us that this is true with complete generality. The proposition amounts to an extension of the first welfare theorem. Indeed, in the current terminology, the first welfare theorem simply says that a Walrasian equilibrium cannot be blocked by the coalition of the whole.' The following result, Proposition 18.B.I, shows that it also cannot be blocked by any other coalition. Proposition 18.B.1: Any Walrasian equilibrium allocation has the core property. Proof: We simply duplicate the proof of the first welfare theorem (Proposition J6.C.I). We present it for the exchange case. See Exercise 18.B.1 for the case of a general constant returns technology. Let x' = (x!, ... , be a Walrasian allocation with corresponding equilibrium

xn

2. With continuity and strong monotonicity of preferences. if a feasible allocation is Pareto dominated. then it is Pareto dominated by • reasible allocation that strictly improves the utility of ('t'('ry consumer. To accomplish this we simply transfer a very sman amount of any good from the consumer that is made better off to every other consumer. If the amount transrerred is sufficiently small then. by the continuity or prererences, the transrerring consumer is still beller olT. while. by strong monotonicity. every other consumer is made strictly better otT. 3. Keep in mind the point made in rootnote 2.

O,~--------------------------~~r_~

The core equals the contract curve in the two-consumer casco

price vector p ~ O. Consider an arbitrary coalition ScI and suppose that the consumptions (x.} •• s are such that >-. for every i E S. Then p'X, > p·w. for every i E S and therefore P·(L •• s x.) > P·(L •• s w;). But then L •• s Xi :s: L.d w, cannot hold and so condition (ii) of Definition 18.B.1 is not satisfied (recall that we are in the pure exchange case). Hence coalition S cannot block the allocation x' . •

x. x:

The converse of Proposition 18.B.1 is, of course, not true. In the two-consumer economy or Figure 18.B.1 every allocation in the contract curve is in the core, but only one is a Walrasian allocation. The core equivalence theorem, of which we will soon give a version, argues that the converse does hold (approximately) if consumers are numerous. Quite remarkably, it turns out that as we increase the size of the economy the non-Walrasian allocations gradually drop from the core until, in the limit, only the Walrasian allocations are left. The basic intuition for this result can perhaps be grasped by examining the Edgeworth box in Figure 18.B.2. Take an alloca tion such as x where consumer I receives a very desirable consumption within the contract curve. Consumer 2 cannot do anything about this: She could not end up better by going alone. But suppose now that the preferences and endowments in the figure represent not individual consumers but types of consumers and that the economy is actually composed of four consumers, two of each type. Consider again the allocation x, interpreted now as a symmetric allocation, that is, with each consumer of type 1 receiving x I and each consumer of type 2 receiving x 2 • Then matters are quite different because a new possibility arises: The two members of type 2 can form a coalition with one member of type I. In Figure 18.B.2, we see that the allocation x can indeed be blocked by giving x; to the one consumer of type 1 in the coalition and x; to the two consumers of type 2 [note that - 2(x; - W2) = (x; - W I )].4 4. Observe that all this has the flavor or Bertrand competition, as reviewed in Section 12C. Indeed. we can look at what happens with this three-member coalition as the rollowing: One or the consumers or type 1 bids away the transactions or the COnsumers or type 2 with the other consumer or type I. Although this is a topic we shall not get into, we remark that, in fact, there are strong parallels between Bertrand price competition and core competition. Note, in particular. that core competition is as shortsighted as Bertrand competition. By undercutting the other consumer or her type. the consumer or type I is only initiating a process or blocking and counterblocking (mutual underbidding in the Bertrand selling) that eventually leads to a result (perhaps th. Walrasian allocation) where she will be worse off than at the initial position.

An allocation in the contract curve that can be blocked with two replicas.

656

CHAPTER

18:

SOME

FOUNDATIONS

FOR

COMPETITIVE

EQUILIBRIA

--------------------------------------------------------------------------------------------The ability to do this depends. of course. on the way we have drawn the indifference curves. Nonetheless. as we will see. we are always able to form a blocking coalition of this sort if we have sufficiently many consumers of each type. The version of the core equivalence theorem that we will present is in essence the original of Edgeworth. as generalized by Debreu and Scarf (1963). It builds on the intuition we have just discussed. To begin, let the set H = {I, ... , H} stand for a set of types of consumers, with each type 10 having preferences ;::, and endowments w,. For every integer n > 0, we then define the N-repliCll economy as an economy composed of N consumers of each type. for a total number of consumers IN = N H. We refer to the allocations in which consumers of the same type get the same consumption bundles as eqlwl-treatment "l1ocClliolls. Proposition 18.B.2 shows that any allocation in the core must be an equal-treatment allocation. (We hasten to add that this is true for the current replica structure, where there arc equal numbers of consumers of each type. It does not hold in general; see Exercise IS.B.2.) Proposition 18.B.2: Denoting by hn the nth individual of type h, suppose that the allocation x· = (XT1' ... • xrnl' .. • xtN" .. . xj." • ... xl.tn • ... . X;"N) E R~HN I

belongs to the core of the N-replica economy. Then x' has the equal-treatment property. that is, all consumers of the same type get the same consumption bundle: for all 1 :!> m. n:!> Nand 1 :!> h:!> H. Proof: Suppose that the feasible allocation x = (X" •...• X"N) E R/;."N does not have the equal-treatment property because. say. x, .. 1- x .. for some III 1- n. We show that x does not have the core property. In particular, we claim that x can be improved upon by any coalition of H members formed by choosing from every type a worst-treated individual among the consumers of that type. Suppose without loss of generality that, for every h. consumer h I is one such worse-off individual. that is. x" for all hand n. Define now the average consumption for each type: .", = (liN) L. x,•. By the strict convexity of preferences we have (recall that consumers of type I are not treated identically)

x,. ;::.

.x, ;::, x"

for all h

and

X,>-, X " ,

(IS.B.I)

We claim that the coalition S={II •...• hl, ...• HI}. formed by H members, can attain by itself the consumptions (x""" .x,,) E R~H. Therefore. by (IS.B.l). the original nonequal-treatment allocation can be blocked by S.' To check the feasibility of (.x I' •.•• XII) E R/;." for S, note that, because of the feasibility of x = (x", ...• x/IN) E 1R/+"N. there is Y E Y such that L. Ln x,. = y + N(Lh w,), and therefore

5. Recall that preferences are strongly monotone and continuous, so that if S can achieve an allocation that does strictly better than x· for some of its members. and at least as well as x· for all of them, then it can also achieve an allocation that does strictly better for all of its members.

S l ~ ; I 0 H

1

a . 8:

C 0 A E

AND

E

u U I LIB

RI A

657

-------------------------------------------------------------------------------------------But by the constant returns assumption on Y. (I/N)y E Yand so we conclude that (x l' ...• XII) E R';" is feasible for coalition S. • Proposition IS.8.2 allows us to regard the core allocations as vectors of fixed size LH. irrespective of the replica that we are concerned with. As a matter of terminology, we call a vector (x" ...• XII) E R~" a Iype allocation and, for any replica N. interpret it as the equal-treatment allocation to consumers where each consumer of type h gets x,. A type allocation (x l ' . . . , XII) E IR~" is feasible if L, x. = Y + L. W, for some )' E Y. Note that for any replica N the corresponding equal-treatment allocation is feasible because

and Ny E Y by the constant returns assumption on Y. By Proposition IS.8.2 the core allocations of a replica economy can be viewed as feasible type allocations. Define by eN C R~H the set of feasible type allocations for which the equal-treatment allocations induced in the N-replica have the core property. Note that eN docs depend on N. Nonetheless, we always have eN. I C eN because a type allocation blocked in the N-replica will be blocked also in the (N + I )-replica by a coalition having exactly the same composition as the one that blocked in the N-replica. Thus. as a subset of RLII the core can only get smaller when N - 00. At the same time, we know from Proposition IS.B.I that the core cannot vanish because the Walrasian equilibrium allocations belong to eN for all N. More precisely, the set of Walrasian type allocations is independent of N (see Exercise IS.8.3) and contained in all eN' The core equivalence theorem (which. in the current replica context, is the formal term for the combination of Propositions 18.B.I, IS.8.2 and the forthcoming Proposition IS.B.3) asserts that the Walrasian equilibrium allocations are the only surviving allocations in the core when N - 00.

Ri

Proposition 18.B.3: If the feasible type allocation x· = (x~, ...• xii) E H has the core property for all N = 1,2, ... , that is, x· E eN for all N. then x· is a Walrasian equilibrium allocation. Proof: To make the proof as intuitive as possible we restrict ourselves to a special case: a pure exchange economy in which. for every h. ;::, admits a continuously differentiable utility representation u,(') [with Vu,(x,)>> 0 for all x,]. In addition. the initial endowments vector w, is preferred to any consumption x, that is not strictly positive. This guarantees that any core allocation is interior. We emphasize that these simplifying assumptions are not required for the validity of the result. Suppose that x = (X, •... ' XII)E RLII is a feasible type allocation that is not a Walrasian equilibrium allocation. Our aim is to show that if N is large enough then x can be blocked. We may as well assume that X is Pareto optimal (otherwise the coalition of the whole blocks and we are done) and that x, » 0 (otherwise a consumer of type /0 alone could block). Because of Pareto optimality we can apply the second welfare theorem (Proposition 16.D.I) and conclude that X is a price equilibrium with transfers with respect to some p = (P,' ...• pLl.lf X is not Walrasian then there must be some 10, say /0 = I. with P'(X, - w,) > O. Informally. type 1 receives a positive net transfer from the rest of the economy and is thus relatively favored (interpretatively. think

658

CHAPTER

1':

SOME

FOUNDATIONS

FOR

COMPETITIVE

EQUILIBRIA

SECT,ON

---------------------------------------------------------------------------------------~-

of type I as the most favored). We shall show that, as long as N is large enough, it would pay for the members of all the other types in the economy to form a coalition with N - I consumers of type I (i.e., to throw out one consumer of type I). More precisely, if a member of type I is eliminated then to attain feasibility the rest of the economy must absorb her net trade x, That, of course, presents no difficulty for the positive entries (those commodities for which the rest of the economy is a net contributor to this consumer of type I), but it is not so simple for the negative ones (the commodities where the rest of the economy is the net beneficiary). The most straightforward methodology is to simply distribute the gains and losses equally. In summary, our coalition is formed by (N - I) + N(H - I) members and, for every type I,. every member of type h gets

+ N(H

AND

We saw in Proposition 18.B.I that the half of the core equivalence theorem that asserts that Walrasian allocations have the core property generalizes the first welfare theorem. In its essence. the half asserting that. provided the economy is large, core allocations are Walrasian constitutes a version of the second welfare theorem. To understand this it may be useful to go back to the general (non replica) setup and formulate the property of a core allocation being Walrasian in terms of the existence of a price support for a certain set. For simplicity. we restrict ourselves to the pure exchange case. Given a core allocation x = (xt, ... , 4) e R~' then. in analogy with the construction used in the proof of the second welfare theorem (Proposition 16.0.1) we can define the setsy

I

_ I)

CORE

and, therefore, will also be individually favorable (recall Section 3.1 for similar arguments)" •

W,.

x; = x, + (N

1I.B:

V. =

_ I) (x, - w,).

{XI: XI >-IXrj U {WI} c: RL

Note that (N - I)x',

+ Nx~ + ... + Nx;' =

+ Nx, + ... + NXII + (x, x, + x, - W, I)w, + Nw, + ... + Nw, .

(N - I)x,

- w,)

We have L, WI E V. But there is more: Ihe CO" property for x· implies Ihal LI WI belongs Ihe houlldary of V. To see this, note that ifL' w, is in the interior of Vthen there is: e V such that:« L, w,; that is, there is x' = (xi •.. ·, X,) with x; E V. for every i and LI x; LI w,. Hence. x' is feasible, x' ". (w, •. .. ,w,), and, for every i, either x; >-1 xr or x; = WI' It follows that the set of consumers S = Ii: x;". W,} is nonempty. that x; >-,xr for every i E S, and that

= Nw, + ... + NWII = (N -

10

Hence, the proposed consumptions are feasible for the proposed coalition. Note also that the consumptions are nonnegative if N is large enough. For every h. every consumer of type h in the coalition moves from .x, to x;. Is this an improvement or a loss"! The answer is that if N is large enough then it is an unambiguous gain. To sec this. observe that p'(x, - w,) > 0 implies Vu,(x.)·(x, - w,) > 0 for every h because p and Vu,(x.) are proportional. As we can then see in Figure 18.B.3 (or. analytically, from Taylor's formula; see Exercise 18.B.4) there is Ii> 0 with the property that, for every h, u.(x. + IX(X, > u,(x,) whenever 0 < IX < Ii. Hence, for any N with (I/[(N - I) + N(H - I)]) < Ii the coalition will actually be blocking. Intuitively, we have done the following. The coalition needs to absorb x, - W,. Evaluated at the marginal shadow prices of the economy, this is a favorable "project" for the coalition since p'(x, - w,) > O. If the coalition is numerous then we can make sure that every member will have to absorb only a very small piece of the project. Hence the individual portions of the project will all be ~at the margin"

x, + (x, - "',)

I~«I~-I~=I~-I~=I~

to p'x;

change of a consumer

of type h in the blocking coalilion.

x; ~ .', +

I

(r -1)+ r(H -I)

".

(x, - w,)

ifS

hil

,_s

i.S

xr.

Figure 18.B.3

for p, > 0

I.'

Thus S is a blocking coalition. The next claim is that if P = (P""" pd ". 0 supports Vat LI w" that is, p': :2: P'(LI Wi) for all Z E V, then P must be a Walrasian price vector for x· = (xt, ... , xf). To verify this, note first that, for every i. we have xi >i xf for some xj arbitrarily close to Therefore, xi + L. .. w. E Vand so p-(x; + L •• I co,):2: P'(WI + L ... w.). Going to the limit (i.e., letting x; ~ xn this yields p'xr ~ P'W, for all i. Because LI xr s L.WI, we must therefore have p' xr = P'W, for all i.ln addition. whenever x;>-.xr we have P'(x; + L •• IW.):2: P'(w, + L •• I w.) and so p' x; :2: p·w,. If we exploit the continuity and strong monotonicity of preferences as we did in Section 16.0 (or in Appendix B of Chapter 17). we can strengthen the last conclusion

The consumption

VU.(x,) ~ ~,P

=:«

i~S

w,»

EQUIL,BR'A

659

--------------------------------------------------------------------------------------------

>

p·W;.

The key difference from the case of the second welfare theorem (studied in Section 16.0) is that V c: R" does nol need 10 be convex and that therefore a nonzero peRI;. supporting V at L, w, may not exist. The reason for the lack of convexity is that the individual sets V, c: RL need not be convex: V. is the union of the preferred set at xr, which is convex, and the initial endowment vector which will typically be outside this preferred set and therefore disconnected from it. However, if Ihe (possibly nonconvex) sets V. c: RL being added are Ilumerous. then Ihe sum L. V. c: RL is "almost" convex. Thus, the existence of (almost) supporting prices for core allocations can be seen as yet another instance of the convexifying effects of aggregation. We end by mentioning an elegant approach to core theory pioneered by Aumann (1964) and Vind (1964). It consists of looking at a model where there is an actual continuum of consumers and where we replace all the summations by integrals. The beauty of the approach is that all the approximate results then hold exactly. The core equivalence theorem, for example,

w,.

6. See Anderson (1978) for a different line of proof 'hat makes minimal assumptions on the economy.

·· ... PTER

15

SOME

FOUNDATIONS

f")R

COMPETI1IVl;,

EQUIL,BRIA

--------------------------------------------------------------------------------------------takes the form: An allocation belongs to the core if and only if it is a Walrasian equilibrium allocation.

Definition lS.C.l: The profile of actions a* = (aT, ... , ail EA, x ... equilibrium if. for every i.

0 but the limit of pte, e) as e goes to zero remains bounded away from zero.'3 • Example IS.C.3: Tradillg Poses. This example belongs to a family proposed by Shapley and Shubik (1977). It is not particularly realistic but it has at least three 12. When every firm produces ,b'/J,the profils or one firm are ,b'/)). > I/y. Bul I/y is an upper bound ror Ihe profils or any firm Ihal deviates rrom Ihe suggeSled produclion by producing more. Hence an output level or rb'll ror every firm constitutes an equilibrium. 13. The complemenlarily makes it impossible ror .p to be continuously dillerenliable al Ihe origin. Thererore, p(' ) rails 10 be continuous. This is Ihe crucial aspecl ror Ihe example. NOle Ihal discontinuity at the origin is a natural occurrence: it will arise. ror example. whenever the indifference map or.p(·) is homothetic (bul not linear). See Harl (1980) ror more on Ihis issue.

663

ob~

... HAPTER

lei:

SOME

..

uv .... OAYlu ...

t-Gft

COMPETITIVE

fQUILIBRIA

S E to T I 0

~

\

a . LI.

T 11 E l l Mil &

l O R E 0 1ST R I 8 UTI 0 N

ootl

-------------------------------------------------- -------------------------------------------------------------------------.>

Figure lB.C.4

o

An effective budget set for the trading post Example IS.C.3.

x"

virtues: it constitutes a complete general equilibrium model. all of the participants interact strategically (in the two previous examples. consumers adjust passively). and it is analytically simple to manipulate. There are L goods and J consumers. Consumer i has endowment w, € R~. The Lth commodity. to be called "money." is treated asymmetrically. For each of the first L - I goods there is a trading post exchanging money for the good. At each trading post ( :5 L - I. each consumer i can place nonnegative bids a" = (ai" a;,) € IR~. The interpretation is that an amount ai i of good I is placed at the offer side of the trading post to be exchanged for money. Similarly. an amount a; i of money is placed in the demand side to be exchanged for good I. Accordingly. the bids are also constrained by aii :5 Wt; and LIS L-' (Iii :5 WLi' Given the bids of consumer i in the trading posts (:5 L - I and prices (p, •...• PL-,.I) the mechanism is completed by the trading rule: gt(a 1h

···,

ai i

aL-l,i; PI"'" Pl.-I' I) = -

-

, QIi

PI

for all t < L - I. The trade for the money good is derived from the budget constraint of the consumer. Given a vector a = (a" •...• aL-I."" .• a" •. ..• aL-I.,) of bids for all consumers. the clearing prices in terms of money are determined as the ratio of the amount of money offered to the amount of good offered:

ria;i

PI(a)=-~.

Lian

t

= I •...• L-I.

(IS.C.I)

Note that PI(a) is well defined and continuous except when there are no offers at the trading post I [i.e .• except when aii = 0 for all i]'" A typical effective budget set for agent i is convex and. provided that L .. i (Ii. "# 0 and L ... "# 0 for alii:::; L - I. it has an upper boundary containing no straight segments (you are asked to formally verify this in Exercise IS.C.I). This reRects the fact that as a consumer increases her bid in one side of a market the terms of trade turn against her. Figure IS.C.4 gives an illustration for the case L = 2.

a;.

t4. For the special. but important. case in which there is a single trading post (i.e .• L = 2). we can go a bit farther. When L, ail > 0 and L. a,. = O. the relative price of money is still well defined: it is zero. The essential difficulty in defining relative prices arises when L, aii = 0 and L.a,. = O.

It follows from expression (IS.C.I) that approximate price taking will prevail in any trading post that is thick in the sense that the aggregate positions taken on the two sides of the market are large relative to the size of the initial endowments of any consumer. A necessary condition for thickness is that there be many consumers. But this is not sufficient: it is possible even in a large economy to have equilibrium where some market is thin and. as a consequence. a trading equilibrium may be far from a Walrasian equilibrium. In fact. any trading equilibrium for a model where a trading post I is closed (i.e .• the trading post does not exist) will remain an equilibrium if the trading post is open but stays inactive. That is. if we put a" = (a;'" ail) = 0 for all i. Economically. this is related to Example 18.C.2: it takes at least two agents (here a buyer and a seller) to activate a market. Mathematically. the difficulty is again the impossibility of assigning prices continuously when at; = 0 for all i. Up to now. in this and previous examples. all of the instances of trading equilibria not approaching a Walrasian outcome when individual competitors are small have been related to failures of continuity of market equilibrium prices. But the current example also lends itself to illustration of the individual spanning problem. Indeed. even if markets are thick and therefore prices. from the individual point of view. are almost fixed. it remains true that the trading post structure imposes the restriction that goods can only he exchanged for money on hand (in macroeconomics this restriction is called the cash-in-advance. or the Clower. constraint). Money obtained by selling goods cannot be applied to buy goods. Therefore. for a given individual the Walrasian budget set will be (almost) attainable only if the initial endowments of money are sufficient. that is. only if at the solution of the individual optimization problem the constraint Lf " L-' WL' is not binding. But there is no general reason why this should be so. Suppose. to take an extreme case. that WL' = O. Then consumer i simply cannot buy goods at all. _

ar, : :;

18.D The Limits to Redistribution In Section 16.0 we saw that. under appropriate convexity conditions and provided that wealth can be transferred in a lump-sum manner. Pareto optimal allocations can be supported by means of prices. However, as we also pointed out there. a necessary condition for lump-sum payments to be possible is the ability of the policy authority to tell who is who-that is. to be able to precisely identify the characteristics (preferences and endowments) of every consumer in the economy. In this section, we shall explore the implications of assuming that this cannot be done to any extent; that is. we shall postulate that individual characteristics are private and become public only if revealed by economic agents through their choices. We will then see that under very general conditions the second welfare theorem fails dramatically: the only Pareto optimal allocations that can be supported involve no transfers. that is. they are precisely the Walrasian allocations. Thus. if no personal information of any sort is available to the policy authority. then there may be a real conAict between equity and efficiency: if transfers have to be implemented we must give up Pareto optimality. The nature of this trade-off is further explored in Sections 22.B and 22.C. We place ourselves in an exchange economy with J consumers. Each consumer i has the consumption set R~, the endowment vector w, ~ O. and the continuous. monotone. and strictly quasiconcave utility function u,(·).

666

CHAPTER

11:

SOME

FOUNDATIONS

FOR

COMPETITIVE

EQUILIBRIA

SECTION

-------------------------------------------------------------------------------------Good 2

Good 2

11.0:

- '"')' w,

X~?:IX;.

o

(a)

(b)

LIMITS

TO

REDIITRI8UTION

Good I B

Figure 18.0.1

~

0 for all (

~

L - I and h.

If we have a sequence of finite economies (Ii, ... ,I';,) such that I' = :[.1. -+ 00 and (1/1')/. -+~. for every h, then we can properly regard (Il., ... , ~II) as the continuum limit of the sequence of increasingly large finite economies. Exercise IB.E.I: Show that the function v('): of degree one.

R~ -+

IR is concave and homogeneous

The function v(·) is a sort of production function whose output is social utility and whose inputs are the individual consumers themselves. Further, in the limit, every individual of type h becomes an input of infinitesimal size. For the time being, we concentrate our discussion on the continuum limit. We assume also that v(') is differentiable. 2 • Definition lB.E.l: Given a continuum population ~ = (~1" .. '~H) E R': a feasible allocation 22 (xf, ... , xk) is a marginal product, or no·surplus, allocation if

•

Uh(X h )

av(~)

=-a~h

for all h.

(18.E.3)

In words: at a no-surplus allocation everyone is getting exactly what she contributes at the margin. Proposition lB.E.l: For any continuum population ii = (ii, .. . ,iiH) »0 a feasible allocation (xf, ... , xk) » 0 is a marginal product allocation if and only if it is a Walrasian equilibrium allocation. Proof: If x· = (xt, . .. , x~) is a marginal product allocation then, using Euler's formula (see Section M.B of the Mathematical Appendix), we have

( _) v~

,,_ av(ii)

= 7:~'

a~,

,,_

= 7: ~,u,

(.)

x, .

Hence, x· solves problem (IB.E.2) for ~ = ii. Suppose now that x· = (xt, ... , x~) is a feasible allocation that gives rise to social utility vIii); that is, it constitutes a solution to problem (IB.E.2) for Il = ii. Denote by Pt, ( = I, ... ,L, the values of the multipliers of the first-order conditions associated with the constraints:[, ii,(xt. - (010) ~ 0, ( = I, ... , L, in the optimization problem (IB.E.2); see Section M.K of the Mathematical Appendix. By the quasilinear form of u,(') we have PL

for all (

:s L

=I

and

PI

= Vlt/I,(xt"

... , xl-I .• )

(IB.E.4)

- I and all I ~ h ~ H.

21. This could be derived from more primitive assumptions. 20. Because utility functions are concave the maximum utility can be reached while treating consumers of the same type equally.

P A INC I P L E

671

--------------------------------------------------------

22. We assume thai consumers of the samelype are treated equally. Feasibilily means therefore that

r,,, Jl"xt

:s;

LII P.W ...

672

CHAPTER

,.:

SOME

FOUNDATIONS

FOR

COMPETITIVE

EQUILIBRIA

--------------------------------------------------------------------------------------------It follows from (IS.E.4) that the vector of multipliers p = (P .. ... ,pd is the vector of Walrasian equilibrium prices of this quasilinear economy (recall the analysis of Section 10.0). In addition, by the envelope theorem (see Section M.L of the Mathematical Appendix), applied to problem (IS.E.2). we have (Exercise IS.E.2): ov(il) - = u. (*) x. 0/1.

+ p. (w. -

*) x •.

(IS.E.5)

Therefore. we conclude that x* is Walrasian if and only if x* solves problem (18.E.2) for /1 = /' and (18.E.3) is satisfied. that is. if and only if x* is a marginal product allocation. _ Expression (18.E.5) IS mtUltlve. The lert-hand side measures how much the maximum sum of utilities increases if we add onc cxtra individual of type II. The right-hand side tells us that there are two effects. On the one hand. the extra consumer of type It receives from the rest of the economy the consumption bundle and so she directly adds her utility u.(x:) to the social utility sum. On the other. while she contributes her endowment vector w •. Hence the net change for the receiving How much is this worth to the rest of the economy'! rest of the economy is w. The vector of social shadow prices is precisely p = (p, • ...• pd. and so the total change for the rest of the economy comes to p·(w. - x:>, Note that the Walrasian allocations are thus characterized by this second effect being null: the utility of the consumer equals her entire marginal contribution to social utility. In Exercise IS.E.4 you are asked to verify that the smoothness assumption on utility functions is essential to the validity of Proposition IS.E.!. Let us now consider a finite economy (I", ..• Ill) » O. We can define the marginal contribution of an individual of type h as

x:.

x:.

x:.

A.v(I ...... Ill) = v(I, •...• I ••...• Ill) - v(l, •...• I. - I •...• I,,).

Typically. there does not exist a feasible allocation (xT •...• x7,) with u.(x:l = A.v(/", .. , Ill) for all It. To see this. note that by the concavity of v(.) we have A.v ~ ov/O/1. [both expressions evaluated at (/, •... ,III)]. Except for degenerate cases. this inequality will be strict. Moreover. L. I.(ov/o/1.) = v(/, •...• Ill) by Euler's formula (see Section M.B of the Mathematical Appendix). and thus we conclude that L1I.(&.v) > v(I, •...• Ill); that is. it is impossible to give to each consumer the full extent of her marginal contribution while maintaining feasibility. [n contrast with the continuum case, individuals are not now of negligible size: their whole contribution is not entirely at the margin. [n particular. you should note that in a finite economy the Walrasian allocation is typically not a marginal product allocation. [t follows from expression (IS.E.5) that an allocation (xT •...• x7t) that solves problem (18.E.2) for (/1 , •...• /11l) = (I ...... 1/1) is a Walrasian equilibrium allocation if and only if

illl u.(x:) = - - (/, •...• I,,).

a/I.

But we have just argued that normally &.V(/" ...• Ill) > ov(l" . .. , IIl)/il/I•. [n words: At the Walrasian equilibrium consumers are compensated according to prices determined by the marginal unit of their endowments. But they lose the extra social surplus provided by the inframarginal units. This is yet another indication that the concept of Walrasian equilibrium stands on firmer ground in large economics.

- - - - ..... _ - - -

APPENDIX

A:

COOPERATIVE

GAME

THEORY

O/,.}

-----------------------------------------------------------------------------We have just seen that in the context of economies with finitely many consumers it is not possible to feasibly distribute the gains of trade while adhering literally to the marginal productivity principle. The cooperative theory of games provides a possibility for a sort of reconciliation between feasibility and the marginal productivity principle. It is known as the Shapley value. In Appendix A. devoted to cooperative game theory. we offer a detailed presentation of this solution concept. For an economy with profile (I, •... • Ill) the Shapley value is a certain utility vector (Sit, •...• Shll) E R"that satisfies L,l,Sh, = v(/, •...• I H ). For every type h.the utility Sh, can be viewed as an al'eraye oj marginal utilities d,v(/; •. ..• I~). The average is taken over profiles (I',' ...• /~) S (I" ...• (11 ), where the probability weight given to (I; •...• /~) equals 1//, interpreted as the probability assigned to sample size I; + ... + I~, times the probability of gelling thc profile (I; •...• 1;,) when independently sampling I; + ... + I~ consumers out of the original population with I consumers and profile (I, •...• I H ). See Appendix A for more on this formula. An allocation that yields the Shapley value (let us call it a Shapley alloration) is not related in any particular way to the Walrasian equilibrium allocation (or for that mailer to the core). Except by chance. they will be different allocations. Yet. remarkably. we also have a convergence of these concepts in economies with many consumers: the Walrasian and the Shapley allocations are then close to each other. This result is known as the value equivalence llieorelli. A rigorous proof of this theorem is too advanced to be given here [see Aumann (1975) and his references]. but the basic intuition is relatively straightforward. There are two key facts. First. if the entries of (I'" ...• /~) are large. then subtracting a consumer of type Ii amounts to very lillIe. and so

d,v(I', •...• I~) '" ov(/;, ...• I~)/o/". Second, if the entries of (I, •. ..• I H ) are large then, by the law of large numbers. most profiles (I; •.... I~) constitute a good sample of (I, •... • Ill) and are therefore almost proportional to (I, ... .• I,,). Using the homogeneity of degree one of v(') (hence the homogeneity of degree zero of "1'/ v(1) for any partition of I into two coalitions S. TJ.

680

';HAPTER

11:

SOME

FOUNDATIONS

FOR

COMPETITIVE

A P P E '" v I X

EQUILIBRIA

A:

COO PER A T I V EGA MET H E 0 R Y

players i, h E S, utility differences are preserved in a manner similar to the two-player case: ShieS, v) - S/,,(S\{h}. v) = Sh.(S, v) - Sh.(S\{i}, v)

",

for all ScI, i, h E S,

L ShieS, v) = v(S)

(IS.AA.2)

for all ScI,

iES

1~{2H-I----4'=-.J.._-~.----

Figure 1B.AA.8

o

u,

{(u,. u,):", + u, =

Egalitarian division for two-player games.

'111. 2})}

Expressions (IS.AA.2) determine the numbers ShieS, v), i E S, uniquely. This is clear for Sh,({i}, v). From here we can then proceed inductively. Suppose that we have dcfined S/,,(S, v) for all ScI, S 'I' I, i E S. We show that there is one and only one way to define Sh,(f, v), i E I. To this effect, note that (tS.AA.2) allows us to express every Sh,(I, v) as a function of Sh,(f, v) and of already determined numbers:

Shi(f, v) = Sh,(f, v) + Sh,(f\{ I}, v) - Sh,(1\{i}, v) Then to determine Sh,(f, ") use :[;,' Sh i(1. v)

a reasonable, or "fair," way to divide the gains from cooperation, taking as a given the strategic realities captured by the characteristic form.29 We study only the TV case, for which the theory is particularly simple and well established. The central concept is then a certain solution called the Shapley value. 30 Suppose that individual utilities are measured in dollars and that, so to speak, society has decided that dollars of utility of different participants are of comparable social worth. The criterion offairness to which value theory adheres is egalitarianism: the aim is to distribute the gains from trade equally. To see what the egalitarian principle could mean in the current TV context let us begin with a two-player game (I, v) = ({ I, 2}, v). Then the gains (or losses, if superadditivity fails) from cooperation are

v(l) - v({I}) - v({2}). Therefore, the obvious egalitarian solution, which we denote (Sh,(I, v), Sh 2 (1, v», is (see Figure IS.AA.S)

Sh;(I, v) = v({i}) + !
REFERENCES

18.B.4' Use Taylor's formula to complete Ihe proof of Proposition IS.B.3.

Anderson. R. (1978). An elementary core equivalence theorem. Econometrica 46: 83-87. Aumann. R. (1964). Markets with a continuum of traders. Econometrica 32: 39-50. Aumann. R. (1975). Values of markets with a continuum of traders. Econometrica 43: 611-46. Champsaur. P., and G. laroque. (1981). Fair allocations in large economies. Journal of Economic Theory

IX-B.S" Consider an economy composed of 21 + I consumerS. or these, I each own One right shoe and I + I each own a left shoe. Shoes are indivisible. Everyone has the same utility function. which is Min {R. L}. where Rand L are. respectively. the quantities of right and lerl shoes consumed.

25: 269-82. Debreu. G .• and H. Scarf. (1963). A limit theorem on the Core of an economy. inlr'nOlionat Economic Review ~: 235-46. Edgeworth. F. Y. (1881). Mathematical Psychics. London: Kegan Paul. Foley. D. (1967). Resource allocation and the public sector. Yale Economic Essays 7: 45-98.

Gabszewicz. J. J.• and J. P. Vial. (1972). Oligopoly·. la Cournot· in a general equilibrium analysis. Journal of EClmomic Theory 4: 381-400. Hart, O. (1980). Perfect competicion and optimal product differentiation. Journal of Economic Theory 11: 165-99. Hildenbrand. W. and A. Kirman. (1988). Equilibrium Anal),sis. New York: Norlh·Holland. Mas·Colell, A. (1982). The Cournotian roundations or Walrasian equilibrium: an exposition or recent theory. Chap. 7 in Advanct's in Economic Theory, edited by W. Hildenbrand. New York: Cambridge University Press. Moulin, H. (1988). Ax/o,.., of Cooperative Game Theory. New York: Cambridge University Press. Myerson, R. (1991). Game Theory: Analysis of Conflict. Cambridge, Mass.: Harvard UniversilY Press. Novshek, W.. and H. Sonnenschein. (1978). Cournot and Walras equilibrium. Journal of Economic Theory

19: 223-66. Roberts. K. (1980). The limit points of monopolislic competition. Journal of Economic Theory 22: 256-278. Osborne, M. and A. Rubinstein. (1994). A Course in Game Theory. Cambridge, Mass.: MIT Press. Ostray, J. (1980). The no-surplus condition as a characterization of perfectly competitive equilibrium. Journal of Economic Theory 22: 65-9\. Owen. G. (1982). Game Theory, 2nd ed. New York: Academic Press. Schmeidler, D. and K. Vind. (1972). Fair net trades. Econometrica 40: 637-47. Shapley. L.. and M. Shubik. (1977). Trade using a commodity as a means of payment. Journal of Political Economy 85: 937-68. Shubik, M. (1959). Edgeworth's market games. In Contributions to the Theory of Games. IV. edited by R. D. Luce. and A. W. Tucker. Princeton, NJ.: Princeton University Press. Shubik, M. (1984). Game Theory in the Social Sciences. Cambridge, Mass.: MIT Press. Thomson, W., and H. Varian. (1985). Theories of justice based on symmetry. Chap. 4 in Social Goals and Social Organizations, edited by L. Hurwicz, D. Schmeidler, and H. Sonnenschein. New Vorlc: Oxford University Press. Varian. H. (1976). Two problems in the theory of fairness. Journal of Public Economics S: 249-60. Vind, K. (1964). Edgeworth allocations in an exchange economy with many traders./nlt'rnalional Economic Review 5: 165-77.

(a) Show that any allocalion of shoes that is matched (i.e .• every individual consumes the same nllmhcr of shoes of each kind) is a Parelo optimum, and conversely. (b) Which Parelo oplima are in the COre of this economy? (This time, in Ihe definition of the COre allow for weak dominance in blocking.) (e) Let P. and Pi. be the respective prices of the two kinds of shocs. Find the Walrasian equilibria of this economy.

(d) Comment on Ihe relationship between the core and the Walrasian equilibria in this economy. IS.C.I'" ESlablish the properties of effective budget sets claimed in the discussion of Example I X.C.3. You can restrict yourself to the case L = 2. IS.O.I" Consider an Edgeworth box wilh continuous, strictly COnvex and monotone preferences. Show that every feasible allocalion where both consumers are at least as well off as al their initial endowments is self.selective. IS.E.I" In texl. \S.E.2' Use the envelope theorem (see Section M.L of the Mathemalical Appendix) to derive expression (IS.E.5). IS.E.3" By considering an example with L-shaped preferences for two non.numeraire goods (hence, the utility function cannol be differentiable), argue that it is possible that at a Walrasian allocalion with a continuum of traders every trader gets less than her marginal conlribution. 18.AA.I" A collection of coalitions S" ... , SH C I is a generalized partition if we can assign a weIght b, E [0. I] to every I S n S N such that, for every player i E I. we have LI" i.S.1 b, = 1. ExhIbit examples of generalized partitions, with the corresponding weights. We say thai a TU-game (I, v) is balanced if for every generalized partition we have L. I\, ..(S.) S ..(I). where b, are the corresponding partition weights. Show that the game has a nonempty Core if and only if it is balanced. [Hint: Appeal to the duality theorem of linear programming (see Section M.M of the Mathematical Appendix).] IS.AA.2' In texl. A

EXERCISES

I8.AA.3 Show that the proportional allocation of Example IS.AA.6 is the only allocation in the core if average product is constant.

18.B.IA Show that Walrasian allocations are in the core for the model with a constant returns technology described in Section 18.B.

18.AA.4C Show that if the Shapley value is defined by formula (lS.AA.4)-or, equivalently. by (18.AA.3 )-then the preservation of differences expression (IS.AA.2) is satisfied.

685

686

CHAPTER

18:

SOME

FOUNDATIONS

FOR

COMPETITIVE

EOUILIBRIA

18.AA.5" We say that a game (I, v) is a unanimity game if there is a nonempty Sci such that v(T) = v(S) if SeT and v(T) = 0 otherwise. Show then that under the efficiency, symmetry, and dummy axioms we arc led to distribute v(S) equally across the members of S. 18.AA.6" Show that any TU-game (I, v) can be expressed as a linear combination of unanimity games. Then use the Exercise IS.AA.5 and the linearity axiom to show that there is a unique solution satisfying the efficiency, symmetry, dummy, and linearity axioms. Connect your discussion wilh the Shapley value.

CHAPTER

General Equilibrium Under

19

Uncertainty

18.AA.7c Show that the production game described in Example IS.AA.S is convex. 18.AA.8" In the context of the production example of Example IS.AA.S, give an example of a two-input production function that is convex (as a function) but for which, nonetheless, the core is empty (thus. the induced game cannot be convex). 18.AA.9" Consider the game with four players defined by v({i}) = 0, v({12}) = v({34}) = 0, = v({ 14}) = v({23}) = v({24}) = I, v({ijk}) = I for all three-player coalitions {ijk}, and ,,({I234}) = 2.

L{{ 13})

(a) Show that this is the game that you would get from the utility production technology Min {=" =,l, where z, and z, are the amounts of two factors, if the factor endowments of the four consumers are w, = w, - (1,0) and wJ = W. = (0,1). ,

(b) Show that the core of this game contains all points of the form (a. a, I - a, I E [0, I].

~)

for

(e) Show that if v({ 134}) is increased to 2, holding all other coalition values constant. there is then only one point in the core. Compare the welfare of player I at this point to what she would get at all the points in the core before the increase in v({I34}). (d) Compute the Shapley value of the game [before the modification in (e)) without using the brUle-force enumeration technique. [Hint: Use symmetry considerations and other axiomatically based simplifications to go part of the way to the answer.] (e) How does the Shapley value change under the modification of part (e)? Discuss the difference between the changes in the Shapley value and in the core. 18.AA.IO" Consider a firm constituted by two divisions. The firm must provide overhead in Ihe form of space. (x" x,), to each of them. The cost of aggregate amounts of space is given by C(x, + x,) = (x, + x,)', 0 < y < I. (a) Suppose that, whatever the usage of space (x" x,), the total cost must be exactly allocated between the two divisions. Propose a cost allocation system based on the Shapley value to accomplish this. (b) Compute the marginal cost imposed on each of the two divisions [according to the cost allocalion system identified in (a)) whenever a division increases its usage of space. (c) Suppose now that the profits accruing to the two divisions arc Il,X, and Il,X" respectively (we assume that 11, > 0 and 11, > 0), and that each division uses space to the point where marginal profits equal own marginal costs [as determined in (b)). Will this lead to an efficienl (that is, profit-maximizing) choice of overhead? (d) Is there any distribution rule ""(x,, x,), ""(x,, x,), with ""(x,, x,) + ""(x,, x,) = C(x, + .' ..• Ps) e RLS at I = I. every consumer i formulates a consumption. or trading. plan (z 1/ ••••• ZSi) e RS for contingent commodities at I = O. as well as a set of spot market consumption plans (x 1/ •••• , x,,) E RLS for the different states that may occur at t = I. Of course, these plans must satisfy a budget constraint. Let U,(') be a utility function for ;:::,. Then the problem of consumer i can be expressed formally as Max

(19.D.1)

(xl •• ··· • .xs.)ER~.s

(:u •. ·· ••s.)ERS

s.t.

(i)

L, q'Z,i

~

O.

(ii) p, x" ~ p,W"

+ p"Z"

for every s.

Restriction (i) is the budget constraint corresponding to trade at I = O. The family of restrictions (ii) are the budget constraints for the different spot markets. Note that the value of wealth at a state s is composed of two parts: the market value of the initial endowments. p, ·W". and the market value of the amounts z,' of good 1 bought or sold forward at t = O. Observe that we are not imposing any restriction on the sign or the magnitude of z". If z" < - W"i then one says that at t = 0 consumer i is selling good 1 shorl. This is because he is selling at t = 0, contingent on state s occurring. more than he has at I = I if s occurs. Hence, if s occurs he will actually have to buy in the spot market the extra amount of the first good required for the fulfillment of his commitments. The possibility of selling short is. however. indirectly 6. In principle, expectations could differ across consumers. but under the assumption of correct expeclations (soon 10 be introduced) they wiU nol.

TRADE

695

696

CHAPTER

11:

GENERAL

EQUILIBRIUM

UNDER

UNCERTAINTY

SEC TID N

limited by the fact that consumption, and therefore ex post wealth, must be nonnegative for every s.' To define an appropriate notion of sequential trade we shall impose a key condition: Consumers' expectations must be self-fuljilled, or rational; that is, we require that consumers' expectations of the prices that will clear the spot markets for the different states s do actually clear them once date t = I has arrived and a state s is revealed. Definition 19.0.1: A collection formed by a price vector q = (q, •.. . ,qs) E contingent first good commodities at t = O. a spot price vector

W for

for every s. and. for every consumer i. consumption plans zi = (zT;•...• z$;) E RS at t = 0 and xi = (xT; . .... x$;) E IRLS at t = 1 constitutes a Radner equilibrium [see Radner (1982)] if:

1 9 • 0:

SEQ U E N T I • L

Proposition 19.0.1: We have: (i) If the allocation x· E R lSI and the contingent commodities price vector (P" ...• PsI E R~S+ constitute an Arrow-Oebreu equilibrium. then there are prices q E R~ + for contingent first good commodities and consumption plans for these commodities z· = (zT ..... zrl E RSI such that the consumptions plans x'. z·. the prices q. and the spot prices (p, • ...• PsI constitute a Radner equilibrium. (ii) Conversely. if the consumption plans x· E nlSI. z· E R SI and prices q E A~ +. (p, •...• PsI E n~s+ constitute a Radner equilibrium. then there are multipliers (II, •. ..• lIS) E n~ + such that the allocation x' and the contingent commodities price vector (II,P, •. .•• jlsPs) E R~s+ constitute an Arrow-Oebreu equilibrium. (The multiplier is interpreted as the value. at t = O. of a dollar at t = 1 and state s.)

I'.

Proof: (i) It is natural to let q, = P" for every s. With this we claim that. for every consumer i. the budget set of the Arrow-Oebreu problem. BtO = {(Xli' ... ' xS/) E R,!:

L, p,'(x"

- w,,) SO}.

is identical to the budget set of the Radner problem, (i) For every i. the consumption plans

zi. xi solve problem (19.0.1).

Br =

{(x 1/ ••••• x.,,)

E

IR'! : there are (z li • . . . • zs.> such that L. q,z" S 0 and P,(X,,- w,,) S P"Z" for every s}.

see this. suppose that X, = (Xli •...• X5') E Bto. For every s. denote (I/PI')p,·(x" - w,,). Then L,q,z" = L,PI'Z" = L,P,'(X" - w,,) s 0 and P.,·(x" - W,.> = P"Z" for every s. Hence, X, E Bf. Conversely. suppose that x,=(x" •...• xs,)EBr; that is. for some (z" •...• zs,) we have L,q,z" SO and p,(x" - w,,) S P"Z" for every s. Summing over s gives L,P,·(X" - w,,) S LJPtsZ:wi = Lsqszsi S O. Hence, Xi e Bto. We conclude that our Arrow-Oebreu equilibrium allocation is also a Radner equilibrium allocation supported by q = (Pi' •...• PIS) E RS• the spot prices (P, ....• PsI, and the contingent trades (zr, ..... zt,) E RS defined by =.;. = (I/p,,)p,·(x:' - w,,). Note that the contingent markets clear since. for every s. To

At a Radner equilibrium, trade takes place through time and, in contrast to the Arrow-Oebreu setting. economic agents face a sequence of budget sets, one at each date-state (more generally, at every date-event). We can see from an examination of problem (19.0.1) that all the budget constraints are homogeneous of degree zero with respect to prices. This means that the budget sets remain unaltered if the price of one physical commodity in each date-state (that is, one price for every budget set) is arbitrarily normalized to equal I. It is natural to choose the first commodity and to put p" = I for every s, so that a unit of the s contingent commodity then pays off I dollar in state S.8 Note that this still leaves one degree of freedom, that corresponding to the forward trades at date 0 (so we could put q, = I. or perhaps L, q, = I). In Proposition 19.0.1, which is the key result of this section, we show that for this model the set of Arrow-Oebreu equilibrium allocations (induced by the arrangement of one-shot trade in LS contingent commodities) and the set of Radner equilibrium allocations (induced by contingent trade in only one commodity, sequentially followed by spot trade) are identical. 7. Observe also that we have taken the wealth at I = 0 to be zero (that is. there are no initial endowments or the contingent commodities). This is simply a convention. Suppose. ror example. that we regard

Wlli~

the amount of good t available at t

:a=

1 in state s. as the amount of the s

contingent commodity that i owns at I = 0 (to avoid double counting. the initial endowment or commodity I in the spot market' at I = I should ,imultaneously be put to zero). The budget constraints are then: (i) L,q,(:~ - "'I") ~ 0 and (ii) p,·x,. ~ LI~I P"",, + Ph:;' ror every s. But letting

Z:i = tli +

W bl •

we see that these are exactly the constraints of (l9.D.1).

8. It rollows rrom the possibility or making this normalization that. without loss or generality. we could as well suppose that our contingent commodity pays directly in dollars (see Exercise 19.D.1 ror more on this).

TRA0 E

697

---------------------~~~~~~~~-=

Z" =

L.=!

=

(l/p,,)P,·[L,(X:' -

w,,» SO.

(ii) Choose I', so that jl,P" = q,. Then we can rewrite the Radner budget set of every consumer i as

Bf

=

{(x " •...• xs.> E ALS : there are (z" •...• zs,) such that L,q,z" S 0 and jl,p,(x" - W,.> s q,z" for every s}.

But from this we can proceed as we did in part (i) and rewrite the constraints. and therefore the budget set. in the Arrow-Oebreu form:

Bf = BtO =

{(XI/ •...• Xs,)E

ALS: L,jl,P,·(x" - w,,) SO}.

x:

Hence, the consumption plan is also preference maximizing in the budget set Bt"· Since this is true for every consumer i. we conclude that the price vector LS (II. P,.···. lISPS) E A clears the markets for the LS contingent commodities. _ Example 19.0.1: Consider a two-good. two-state. two-consumer pure exchange economy. Suppose that the two states are equally likely and that every consumer has the same. state-independent. Bernoulli utility function u(x,,). The consumers differ only in their initial endowments. The aggregate endowment vectors in the two states

698

CHAPTER

11:

GENERAL

EQUILIBRIUM

UNDER

UNCERTAINTY

SECTION

Good 2

Spot Prices in the Two States

p 0, Initial Endowment in State 2

Good I Good 2

are the same; however, endowments are distributed so that consumer I gets everything in state I and consumer 2 gets everything in state 2. (See Figure 19.0.1.) By the symmetry of the problem, at an Arrow-Debreu equilibrium each consumer gets, in each state, half of the total endowment of each good. In Figure 19.0.1, we indicate how these consumptions will be reached by means of contingent trade in the first commodity and spot markets. The spot prices will be the same in the two states. The first consumer will sell an amount ex of the first good contingent on the occurrence of the first state and will in exchange buy an amount {J of the same good contingent on the second state. (You are asked to provide the details in Exercise

19.0.2.) • It is important to emphasize that, although the concept of Radner equilibrium cuts down the number of contingent commodities required to attain optimality (from LS to S), this reduction is not obtained free of charge. With the smaller number of forward contracts, the correct anticipation of future spot prices becomes crucial. Up to this point we have discussed the sequential implementation of an Arrow-Debreu equilibrium when there are two dates,9that is, for the date-eventtree of Figure 19.B.1. Except for notational complications, the same ideas carryover to a tree such as that in Figure 19.B.2 where there are T + I periods and information is released gradually. (See the small· type discussion at the end of Section \9.B for basic concepts and notation.) We would then have spot markets at every admissible date-event pair IE (i.e., those IE where E e.9';, the information partition at r). With H the set of basic physical commodities, we denote the spot prices by P,. e RH. At every rE we could also have trade for the contingent delivery of physical good I at each of the sucoessor date-events to tE. Denote by q,.(t + I, E') the price at tE of one unit of good I delivered at r + I if event E' is revealed (of course, we require E' e .9';., and E' c: E). The problem of the consumer consists of forming utility-maximizing plans by choosing, at every admissible rE, a vector of consumption of goods X,E' e R~ and, for every sucoessor (r + I, E'), a contingent trade Z,E,(I + I, E') of good I deliverable at (I + I, E'). Overall, the budget constraint to be satisfied at tE is

+

ASSET

One can then proceed to define a corresponding concept of Radner equilibrium and to show that the Arrow-Debreu equilibrium allocations for the model with H(T + I)S contingent commodity markets'· at I = 0 are the same as the Radner equilibrium allocations obtained from a model with sequential trade in which, at each date-event, consumers trade only current goods and contingent claims for delivery of good I at successor nodes. Exercises 19.D.3 and 19. D.4 discuss this topic further.

Good I

P,.· x,"

1I.E:

I

q,.(r

+ I, E')z,.,(1 + I, E')

!> P,E'W'E;

+ PUEZ,- ..... ,(t, E)

1£'.,Y',.,:E'cEI

where E - is the event at the date

I -

I predecessor to event E at

I.

9. To be as simple as possible, we have also assumed that there is no consumption at 1=0.

Figure 19.0,1 Reaching the Arrow-Debreu equilibrium by means of contingent trade in the first good only.

19.E Asset Markets Thc S contingent commodities studied in the previous section serve the purpose of transfcrring wealth across the states of the world that will be revealed in the future. Thcy arc. however, only theoretical constructs that rarely have exact counterparts in reality. Nevertheless, in reality there are assels, or securities, that to some extent perform the wealth-transferring role that we have assigned to the contingent commodities. It is therefore important 10 develop a theoretical structure that allows us to study the functioning of these asset markets. We accomplish the task in this section by extending the formal notion of a contingent commodity and then generalizing the theory of Radner equilibrium to the extended environment.'1 We begin again with the simplest situation, in which we have two dates, I = 0 and t = I, and all the information is revealed at t = I. Further, for notational simplicity we assume that consumption takes place only at t = 1. We view an asset, or, more precisely, a unit of an asset, as a title to receive either physical goods or dollars at t = I in amounts that may depend on which state occurs. '2 The payoffs of an asset are known as its returns. If the returns are in physical goods, the asset is called real (a durable piece of machinery or a futures contract for the delivery of copper would be examples). If they are in paper money, they are called financial (a government bond, for example). Mixed cases are also possible, Here we deal only with the real case and, moreover, to save on notation we assume that the returns of assets are only in amounts of physical good 1.13 It is then convenient to normalize the spot price of that good to be I in every state, so that, in effect, we are using it as numeraire. Definition 19.E,1: A unit of an asset, or security, is a title to receive an amount f. of good 1 at date t = 1 if state s occurs. An asset is therefore characterized by its feturn vectof f = (f" . .. ,fS) € R S.

10. A contingent commodity is a promise to deliver a unit of physical commodity h at date t if state" occurs. Recall from Section 19.B that the consumption sets have to be defined imbedding in them the inrormation measurability restrictions. thai is, making sure that at date t no consumption is dependent on inrormation not yet available. II. See Radner (1982) and Kreps (1979) [complemented by Marimon (1987)] for treatmenls in the spirit of this section. 12. As usual "title to receive" means "duty to deliver" ir the amount is negative. Although negative returns present no particular difficulty. we will avoid them. 11 This assumption also has an important simplifying reature: At any given slate the returns of all assets are in units of the same physical good. Therefore, the relative spot prices of the various physical goods in any given state do not affect the relative returns or the different assets in that state.

MARKETS

699

700

CHAPTER

11:

QENERAL

EQUILIBRIUM

UNDER

SECTION

II.E:

AISET

MARKETS

70t

-------------------------~~~~~~~~~

UNCERTAINTY

Example 19.E.l: Examples of assets include the following: (i) r = (1, ... , I). This asset promises the future noncontingent delivery of one unit of good 1. Its real-world counterparts are the markets for commodity futures. In the special case where there is a single consumption good (i.e., L = I). we call this asset the safe (or riskless) asset. It is important to realize that with more than one physical good a futures contract is not riskless: its return in terms of purchasing power depends on the spot prices of all the goods l 4

Dellnltlon 19.E.2: A collection formed by a price vector q = (q, • ... ,qK) E RK for assets traded at t = O. a spot price vector P. = (p, •• ...• PL.) E RL for every s. and. for every consumer i. portfolio plans zi = (zr;..... z.tl E RK at t = 0 and consumption plans xi = (Xri . ...• x~;) E R LS at t = 1 constit~tes a Radne, equilibrium if: (i) For every i. the consumption plans Max

zi. xi solve the problem

U,{X'i' .... XSi)

(Xli • ... JI's,)ER~S

(ii) r = (0•...• 0, I. 0•...• 0). This asset pays one unit of good I if and only if a certain state occurs. These were the assets considered in Section 19.0. In the current theoretical setting they are often called Arrow securities. (iii) r = (1,2. 1.2•...• 1.2). This asset pays one unit unconditionally and. in addition. another unit in even-labeled states. Example 19.E.2: Options. This is an example of a so-called derivative asset. that is. of an asset whose returns are somehow derived from the returns of another asset. Suppose there is a primary asset with return vector r E RS• Then a (European) call option on the primary asset at the strike price c E R is itself an asset. A unit of this asset gives the option to buy. after the state is revealed (but beforc the rcturns are paid). a unit of the primary asset at price c (the price c is in units of the "numeraire." that is. of good I). What is the return vector rIc) of the option? In a given state s. the option will be exercised if and only if r, > c (we neglect the case r, = c). Hence

(Z" •...• z~,)ER ..

s.t.

(a)

L. q.·Z.i ~ 0

(b) p, X'i !> P,W'i + L. P,.z./,.

for every s.

(ii) LiZki ~ 0 and LiX:i ~ L;W.; for every k and s. In the budget set of Definition 19.E.2, the wealth of consumer i at state s is the sum of the spot value of his initial endowment and the spot value of the return of his portfolio. Note that. without loss of generality. we can put Ph = I for all s. From now on we will do so. It is convenient at this point to introduce the concept the retllrn matrix R. This is an S x K matrix whose kth column is the return vector of the kth asset. Hence. its generic sk entry is r... the return of asset k in state s. With this notation. the budget constraint of consumer i becomes

B,j p, q,

B).1 "R~

, fo,"om' portfolio ,,' R'

~ ho~

q",

rIc) = (Max {O. rl - c} •... , Max {O, rs - c}).

Pt'(XII:- WII») [rtt ...... ruj_

For a primary asset with returns r = (4,3,2. I) specific examples are r(3.5) =

(.5. 0 • 0

(

p,'(xs, -

0).

r(2.5) = (1.5.

0.5. 0

r( 1.5) = (2.5.

1.5. 0.5, 0)._

0).

We proceed to extend the analysis of Section 19.0 by assuming that there is a given set of assets. known as an asset structure, and that these assets can be freely traded at date t = O. We postpone to the next section a discussion of the important issue of the origin of the particular set of assets. Each asset k is characterized by a vector of returns r. E RS• The number of assets is K. As before. we assume that there are no initial endowments of assets and that short sales are possible. The price vector for the assets traded at t = 0 is denoted q = (ql •... , qK)' A vector of trades in these assets. denoted by z = (z I' ••• , ZK) E R K, is called a portfolio. The next step is to generalize the definition of a Radner equilibrium to the current environment. In Definition 19.E.2. U;(·) is a utility function for the preferences ~; of consumer i over consumption plans xs;) R~s.

(XII .... '

~

. WS,)

'.

0 for every k. Also, without loss of generality. we assume thai no row of the relurn malrix R has all of its entries equal to zero," Given an arbitrage-free assel price veclor q E RK. consider Ihe convex set V= (UE

R':. = Rz for some ZE R" wilh

q'Z =

·w.,

r.. z:,.

for every k = I •...• K.

Ol·

That is, the vector of expected marginal utilities of the K assets must be proportional to the vector of asset prices. 18 With this we have attained our result. since by taking

The arbitrage freeness of q implies Ihal V () (R~ \(Oll = 0. Since both V and R~ \(Ol are convex sets and the origin belongs 10 V. we can apply the separating hyperplane theorem (see Section M.G of the Mathematical Appendix) to oblain a nonzero vector II' = (11', •...• lis) such that 11". $ 0 for any UE V and 1"'. ~ 0 lor any .E Note Ihal il must be Ihalll' ~ O. Moreover,because.E V implies -UE V. it follows Ihatll'·. = ofor any uE V. Figure 19.E.I(a) depicts this construction for Ihe Iwo-slale case.

w..

Figure 19.E.l

Vo

V = : 1': I' = R:. q': = O. :

E R I: Then we cannot summarize the individual decision problem by means of an indirect utility of the asset portfolio. The relative prices expected in the second period 2 ' also matter. This substantially complicates the formulation of a notion of constrained Pareto optimality. Be that as it may, there appears not to be a useful generalization of the "constrained Pareto optimal" concept in which we could assert the constrained Pareto optimality of Radner equilibrium allocations. Example 19.F.2, due to Hart (1975), makes the point.ln it we have an economy with several Radner equilibria where two of them are Pareto ordered. That is, we have a Radner equilibrium that is Pareto dominated by another Radner equilibrium. To the extent that it seems natural to allow a welfare authority, at the very least, to select equilibria, it follows that the first equilibrium is not constrained Pareto optimal.)O Example 19.F.2: Pareto Ordered Equilibria. Let 1= 2, L = 2, and S = 2. There are no assets (K = 0). The two consumers have, as endowments. one unit of every good in every state. The utility functions are of the form 7lIlUi(Xlli' X21i) + 7l2iUi(X12i, x 2,,). Note that although the probability assessments arc different for the two consumers (these probabilities will be specified in a moment), the spot economies are identical in the two states. Suppose that this spot economy has several distinct equilibria (e.g., it could be the exchange economy in Figure 15.B.9). Let p', p" E R2 be the Walrasian prices for two of these equilibria and let Vi(P) be the spot market utility associated with Ui(', .) and the spot price vector P E R2. Suppose that v,(p') > v,(p"). By Pareto optimality in the spot market, V2(P') < V2(P·). We now define two Radner equilibria. The first has equilibrium prices (p" P2) = (p', p") E R4 and the second has (p" P2) = (p., p') E R4. Because there is no possibility of transferring wealth across states, these are indeed Radner equilibrium prices and, moreover, they are so for any probability estimates 7li' However, the expected utility of these Radner equilibria for the different consumers depends on the 7l i . We can sec now that if consumer I believes that the first state is more likely than the second, that is, he has It" > }, then he will prefer the first equilibrium to the second. Indeed, 7t1l > 1 and v,(p') > v,(p") imply 7lIlV,(P') + 7l"v,(p") > 7tIlV,(P") + 7l 2l V,(p'). Similarly, if the second consumer believes that the second state is more likely than the first, that is, he has 7122 > 1, then he will also prefer the first equilibrium to the second: 7l'2 >! and v,(p') < v2(p") imply 1t'2V2(P') + 7lnv2(P") > 7l"V2(P") + 7tnv,(p'). Thus, the Radner equilibrium with prices (p', pH) Pareto dominates the one with prices (p", p') . • The consensus emerging in the literature seems to be that failures of restricted Pareto optimality (for natural meanings of this concept) are not only possible but even typical [Geanakoplos and Polemarchakis (1986)]. In Exercise 19.F.3 you are asked to develop a related optimality paradox: it is possible for the set of assets to expand and for everybody to be worse off at the new equilibrium! We shan not pursue the constrained optimality analysis

sec T ION

1 8 • G:

FIR M

B E H A V lOR

UNO E RUN C e R T A , N T Y

in any greater depth. At some point the analysis runs into the difficulty that it is hard to proceed sensibly without tackling the difficult problem of the determination of the asset structure.

We could also analyze the positive issues studied in Chapter 17 within an incomplete market sellmg. For eXIStence, there is a new set of complexities related to the fact that unbounded short sales are possible. In some contexts this may lead to existence failures (see Exercise 19.F.4).'1 New subtleties also arise for the issue of the determinacy of equilibria (i.e., the number and local uniqueness of equilibria). As we have seen in Section 17.0, with a complete asset structure we have generic finiteness. But with incomplete markets the nature

of the assets (e.g., whether real or financial) mailers, as may the size of S.

19.G Firm Behavior in General Equilibrium Models under Uncertainty In the previous sections we have concentrated on the study of exchange economies. For once, this has not been just for simplicity. The consideration of production and firms is genuinely more difficult in a context of possibly incomplete markets. The rca son relates to the issue of the objectives of the firm.12 As before, we consider a setting with two periods, t = 0 and t = I, and S possible statcs at I = I. There are L physical commodities traded in the spot markets of period t = I and K assets traded at t = O. There is no consumption at I = O. The returns of the assets are in physical amounts of the good I (which we call the numeraire). The S x K return matrix is denoted R. We introduce into our model a firm that produces a random amount ofnumeraire at date t = I (perhaps by means of inputs used at time t = O. but we do not formalize this part explicitly). We let (a ... , as) denote the state-contingent levels of produc" tion of the firm. There are also shares ()i ~ 0, with Li (), = I, giving the proportion of the firm that belongs to consumer i. We take, for the rest of this section (except in the small-type paragraphs at the end) the natural point of view that the firm is an asset with return vector a = (a" ... , as) whose shares are tradeable in the financial markets at t = 0.)) Suppose now that the firm can actually choose, within a range, its (random) productIOn plan. Say, therefore, that there is a set A c RS of possible choices of return 31. Unbounded short sales are at the origin of a discontinuity in the dependence on asset returns of the space of attainable wealth transfers across states. No matter how close asset returns (in dollar terms) may be t~ displaying a linear dependence. consumers can plan to attain. by using trades of very large magnatude, any wealth transfer in the subspace spanned by the asset returns. But when :et~rns bcco~e exactly linearly dependent, this attainable subspace suddenly drops in dimension. As 1O~lcated. thl~ can lead to an existence failure in some COntexts. The model we have analyzed in t~IS chapter IS not. however. one of those. If. as here, in every state all assets have returns in a Single good. whic~. moreover. is the same across assets, then the discontinuity does not arise.

32. The ciass,c paper on Ihis topic is Diamond (1967). For a more recent survey see Merton 29. Or the relative prices of goods between the second and third period. if we are considering more than two dates. 30. That is. the first equilibrium is not Pareto optimal relative to any set of constrained feasible allocations that includes all Radner equilibrium allocations.

(1982).

33. A minor difference with the setting so far is that the firm does really produce the vector (a l • . . . • "s), and. therer~re the total endowment of this asset is not zero. In fact, by putting L, 0, = 1 we have normahzed thiS total endowment to be I.

713

'I,I

714

CHAPTER

19:

GENERAL

EQUILIBRIUM

UNDER

UNCERTAINTY

SECTION

-------------------------------------------------------------------------------------------Bo; =

II.G:

FIRM

BEHAVIOR

UNDER

{(xu,···, x s;) E R~: there is a portfolio Z,E RK such that p,(x" - rob) S L Phr.. z" for every 50 and q-z, S O,v(a. q)},

,

(19.G.1)

Figure 19.G.l

An example of possible production choices of the firO).

vectors (a" ... , as) E A of the firm. See Figure 19.0.1 for the case where S = 2. We assume that the return vector a E A is chosen before the financial markets of period I = 0 open. Thus, the decision is made by the initial shareholders (since shares may be sold in period I = 0, the shareholders at the end of period I = 0 may be a different set). Which production plan should these initial owners choose? It turns out that the answer is very simple if A can be spanned by the existing assets and is very dillicult if it cannot. Definition 19,G.1: A set A c R S of random variables is spanned by a given asset structure if every a E A is in the range of the return matrix R of the asset structure, that is, if every a E A can be expressed as a linear combination of the available asset returns.

If we assume, first, that A is spanned by R and, second, that we are dealing with a small project (i.e., all the possible productions a E A are small relative to the size of t he economy; e.g., a,flIL; ro,;!1 is small for all s), then we are (almost) justified in taking the equilibrium spot prices P = (PI'" . ,Ps) E RLS and asset prices q = (ql' ... ' qd E R" as constants independent of the particular production plan chosen by the firm." For the asset priccs q E R" the markel value v(a, q) of any production plan a E A can be computed by arbitrage: if a = L, ex,r, then v(a, q) = L, 'J.,q,. In Exercise 19.0.1 you are asked to show that if the firm is added as a new asset to the given list of assets, and each production plan a E A is priced at its arbitrage value v(a), then any budget-feasible consumption plan of any consumer can actually be reached without purchasing any shares of the firm (the fact can be deduced from Proposition 19.E.3). Thus, for fixed asset prices q E R" and spot prices p = (p I' .•. , Ps) E R LS, the budget constraint of consumer i is"

It follows from the form of this budget constraint that at constant prices every consumer-owner (i.e., any i with 0, > 0) faced with the choice between two production plans a, a' E A, will prefer the one with higher market value, Indeed, if v(a, q) ~ v(a', q) then B•. ; c Bo" Thus, the objective of market value maximization will be the ullanimous desire of the firm's initial owners.'" If A is not spannable by the given asset structure we run into at least two serious difficulties. The first difficulty has to do with price quoting and is common to any commodity innovation problem. Without spanning, the value of a production plan a E A cannot be computed from current asset prices simply by arbitrage, The value IS not, so to speak, implicitly quoted in the economy. Therefore, it would need to be anticipated by the agents of the economy from their understanding of the workings of the overall economy-no mean task. The second difficulty, more specific to the financial context, has to do with price laklllY. Due to the possibility of unlimited short sales there is a discontinuity in the

plausibility of the price-taking assumption. With spanning we can argue, as we did that if the project is small then the effect of production decisions on asset prices, on spot prices at I = I, is also small. But if a new asset a E A, no matter how small, IS not generated by the current asset structure, then its availability increases the span of avatlable wealth transfers by one whole extra dimension. The impact is therefore substantial, and may well have a dramatic effect on prices. 31 There is then no reason for owne~s' preferences over different production plans to be dictated merely by the tocrease to wealth at the prices prior to the introduction of the firm (see Exercise 19.0.2). These two difficulties, to repeat, are serious. There is no easy way out.

0;

A variation of the above model entirely eliminates the asset role of the firm at t = O. Let us assume that the firm's shares cannot be traded at I = 0.>8 If owners at t = 0 choose " E A, this simply means that their endowments at I = I are modified by the random variable Ilia that, recall, pays in good I (i.e., the new endowment of consumer i becomes (w" + (O,a.. 0, ... , 0» E RL for every state sl. If a E A can be spanned, then we are as in the previous model. It does not matter whether shares of the firm can be sold or not at r = O. In either case consumers can take positions in the asset markets that will guarantee that the resulting final consumptions at t = I are the same (Exercise 19.G.3). If a E A cannot be spanned, matters are different. The good news is that, because no new tradeable asset is created at r = 0, the price-taking discontinuity problem disappears. The bad news IS that there IS now another difficulty: Because there is no market for the shares at I = 0, the value of the asset cannot be computed as a deterministic amount at I = O. It is rather a

34. Both assumptions are important for this conclusion. Suppose for a moment that there are

zero 10lal endowments of the asset. Then, since the asset is redundant, Proposition 19.E.3 (see also Exercise 19.E.4) implies that at the Radner equilibrium the new asset is absorbed without any change in prices. What we are now assuming is that this remains approximately true if the total endowment

of the asset is small (i.e., if the project is small). 35. Note that the value of the initial endowments at t = 0 is the value or the shares of the firm 0,1"(
It is a good exercise to verify that (20.B.I) satisfies the stationarity property and that the property can be violated by utility functions of the form V(e) = L, a;u(e,), that is, with a time-dependent discount factor (Exercise 20.B.2). The property of stationarity should nol be confused with the statement asserting that if the consumption streams c and e' coincide in the first T - I periods and a consumer chooses one of these streams at t = 0, then she will not change her mind at T. This "property" is tautologically true: at both dates we are comparing Vee) and V(e').· The stationarity experiment compares V(e) and V(e') at I = 0, but at period T it compares the utility values of the future streams shifted to t = 0, that is, V(e T ) and V(C'T). Thus, stationarity says that in the context of the form (20.B.2), the preferences over the future are independent of the age of the decision maker. Time stationarity is not essential to the analysis of this chapter (except for Sections 20.E and 20.F on dynamics), but it saves substantially on the use of subindices.

(3) Additive separability. Two implications of the additive form of the utility function are that at any date T we have, first, that the induced ordering on consumption streams that begin at T + I is independent of the consumption stream followed from 0 to T, and, second, that the ordering on consumption streams from o to T is independent of whatever (fixed) consumption expectation we may have from T + I onward (see Exercise 20.B.3). In turn, these two separability properties imply additivity; that is, if the preference ordering over consumption streams satisfies these separability properties, then it can be represented by a utility function of the form Vee) = L, u,(c,) [this is not easy to prove, see Blackorby, Primont and Russell (1978)]. How restrictive is the assumption of additive separability? We can make two arguments in its favor: the first is technical convenience; the second is a vague sense that what happens far in the future or in the past should be irrelevant to the relative welfare appreciation of current consumption alternatives. Against it we have obvious counter-examples: Past consumption creates habits and addictions, the appreciation of a particularly wonderful dish may depend on how many times it has been consumed in the last week, and so on. There is, however, a very natural way to accommodate these phenomena within an additively separable framework. We could, for example, allow for the form V(e) = L. U,(C'_I' c,). Here the utility at period t depends not only on consumption at date I but also on consumption at date t - I (or, more generally, on consumption at several past dates). We can formulate this in a slightly different way. Define a vector Z, of "habit" variables and a household producrioll recilllology that uses an input vector C'-I at t - I to jointly produce an output vector C _ I of consumption goods at I - I and a vector z, = C._I of "habit" ' variables at I. Then, formally, u. depends only on time t variables and total utility is L. u,(z" c,). In summary: additive separability is less restrictive than it appears if we allow for household production and a suitable number (typically larger than I) of current variables. (4) Lellglh of period. The plausibility of the separability assumption, which makes the enjoyment of current consumption independent of the consumption in other periods, depends on the length of the period. Because even the most perishable consumption goods have elements of durability in them (in the form, for example, of a flow of "services" after the act of consumption), the assumption is quite strained if the length of the elementary period is very short. What determines the length of the period? To the extent that our model is geared to competitive theory, this period is institutionally determined: it should be an interval of time for which prices can be taken as constant. On a related point, note that the value of fJ also depends, implicitly, on the length of the period. The shorter the period, the closer should be to 1.

a

(5) Recursive uriliry. With the form (20.B.I) for the utility function, we have = u(c o) + aV(c ' ) for any consumption stream C = (co, C I , ••• ,e... ..). If we think of u = u(co) as current utility and of V = V(e ' ) as future utility, we see that the marginal rate of substitution of current for future utility equals and is therefore independent of the levels of current and future utility. The recursive utility model [due to Koopmans (1960)] is a useful generalization of (20.B.I) that combines two features: it allows this rate to be variable but, as in the additively separable case, it has the property that the ordering of future consumption streams is independent of the consumption stream followed in the past. Vee)

2. Hence, the completeness of the preference relation on consumption streams is guaranteed. 3. Ramsey (t928) called the assumption a 'weakness of the imagination." 4. This property is often called lime consistency. Time inconsistency is possible if tastes change through time (recall the example of Ulysses and the Sirens in Section 1.8!), but, as we have just argued, it must necessarily hold if the preference ordering over consumption streams (co . ...• c" . .. ) does not change as lime passes. In line with the entire treatment of Part IV, we maintain the assumption of unchanging tastes throughout the chapter.

a

U TtL I T Y

735

736

CHAPTER

20:

EQUILIBRIUM

AND

TIME

The recursive model goes as follows. Denote current utility by II ~ 0 and future utility by V ~ O. Then we are given a current utility function II(C,) and an aggregator function G(II, V) that combines current and future utility into overall utility. For example, in the separable additive case we have G(II, V) = II + cS V. More generally we could also have, for example, G(u, V) = II' + cSV', 0 < " S I. In this case, the indifference curves in the (II, V) plane arc not straight lines. The utility of a consumption stream c = (co, ... , c" . .. ) could then be computed recursively from (20.8.3) V(c) = G(u(c o), V(e')) = G(u(co), G(u(e,), Vee'))) = .... For (20.B.3) to make sense we must be able to argue that the influence of V(e T ) on V(c) will become negligible as T ~ 00 [so that Vee) can be approximately determined by taking a large T and letting V(CT) have an arbitrary value]. This amounts to an assumption of time impatience. In applications, it will typically not be necessary to compute V(c) explicitly. See Exercise 20.B.4 for more on recursive utility.

(6) Altruism. The expression V(c) = u(co) + b V(c') suggests a multigeneration interpretation of the single-consumer problem (20.B.I). Indeed, if generations live a single period and we think of generation 0 as enjoying her consumption according to u(co), but caring also about the utility V(c') of the next generation according to f, V(c'), then V(c) = u(co) + bV(c') is her overall utility. If every generation is similarly altruistic, then we conclude, by recursive substitution, that the objective function of generation 0 is precisely (20.B.I). The entire "dynasty" behaves as a single individual. With this we also have another justification for b < I. The inequality means then that the members of the current generation care for their children, but not quite as much as for themselves. See Barro (1989) for more on these points.

20.C Intertemporal Production and Efficiency Assume that there is an infinite sequence of dates t = 0, I, .... In each period t, there are L commodities. If it facilitates reading, you can take L = 2 and interpret the commodities as labor services and a generalized consumption-investment good (see Example 20.C.1). One of the great advantages of vector notation, however, is that in some cases-and this is one-there is no novelty involved in the general case. Thus, while you think you are understanding the simple problem, you are at the same time understanding the most general one. We shall adopt the convention that goods are nondurables. This is a convention because, in order to make a good durable, it suffices to specify a storage technology whose role is, so to speak, to transport the commodity through time. lfwe were exogenously endowed with some amount of resources (e.g., some initial capital and some amount of labor every period), we would ask what we could do with them. To give an answer, we need to specify the production technology. We already know from Chapter 5 how to do this formally by means of the concept of a production set (or a production transformation function, or a production function). With minimal loss of generality, we will restrict our technologies to be of the following form: the production possibilities at time t are entirely determined by the production decisions at the most recent past, that is, at time t - I. If we keep in mind that we can always define new intermediate goods (such as different vintages of a machine),

SEC T 10"

2 0 • C:

I N T E R T E M P 0 R ALP ROD U C T I 0"

A" 0

E F Fie lEN C Y

737

------------------------------------------------------and also that we can always define periods to be very long, we see that the restriction is minor. Thus, the technological possibilities at t will be formally specified by a production set Y c R2L whose generic entries, or production plans, are written y = (Y., y.). The indices b and a are mnemonic for "before" and "after." The interpretation is that the production plans in Y cover two periods (the "initial" and the "last" period) with y. E IRL and y. E IRL being, respectively, the production plans for the initial and the last periods. As usual, negative entries represent inputs and positive entries represent outputs. We impose some assumptions on Y that are familiar from Section 5.B: (i) Y is dosed and convex. (ii) Y n IR~/ = {O} (no free lunch). (iii) Y - IR~L C Y (free disposal). An assumption specific to the temporal setting is the requirement that inputs not be used later than outputs are produced (i.e., production takes time). This is captured by (iv) If)" = (Yo,Y.)E Y then (y.,O)E Y (possibility oJtrllllcation). In words, (iv) says that, whatever the production plans for the initial period, not producing in the last period is a possibility. A simple case is when y.. ~ 0 for every r E Y, that is, when all inputs are used in the initial period. Then (iv) is implied by the free-disposal property (iii). Example 20.C.I: Ramsey-Solow Model.' Assume that there are only two commodities: A consumption-investment good and labor. It will be convenient to describe the technology by a production function F(k, I). To any amounts of capital investment k ;:>: 0 and of labor input I ~ 0, applied in the initial period, the production function assigns the total amount F(k, I) of consumption-investment good available at the last period. Then

Y = {( -k, -I, x, 0): k

~ 0,

I

~ 0, x S;

F(k, I)} -

R~.

Note that labor is a primary factor; that is, it cannot be produced. _ Example 20.C.2: Cost-oJ-Adjustmem Model. Suppose that there are three goods: capacity, a consumption good, and labor. With the amounts k and I of invested capacity and labor at the initial period, one gets F(k, I) units of consumption good output at the last period. This output can be transformed into invested capacity at the last period at a cost of k' + y(k' - k) units of consumption good for k' units of capacity, where y(.) is a convex function satisfying y(k' - k) = 0 for k' < k and i'(k' - k) > 0 for k' > k. The term y(k' - k) represents the cost of adjusting capacity upward in a given period relative to the previous period. (Note the marginal cost of doing so increases with invested capacity of the period.) Formally, the production set Y is Y = {( - k, 0, -I, k', x, 0): k : 0, k' ;:>: 0, x

:$;

F(k, I) - k' - y(k' - k)} -

I\l~. _

5. See Ramsey (1928) and Solow (1956). The same model was atso inlroduced in Swan (1956).

738

CHAPTER

20:

EQUILIBRIUM

AND

TIME

--------------------------------------------------------------------------------Example 20.C.3: Two-Sec/or Model. We could make a more general distinction between an investment and a consumption good than the one embodied in Examples 20.C.1 and 20.C.2. Indeed. we could let the production set be 1

I 1

y

= {( -k, 0, -I, k', x, 0): k?; O,I?; 0, k'?; 0, x $

G(k, I, k')} - R~,

where k, k' are, respectively, the investments in the initial and the last periods. Note that the investment and the consumption good need not be perfectly substitutable [they are produced in two separate sectors, so to speak; see Uzawa (\964)]. If they are [i.e., if the transformation function .G(k, I, k') has the form F(k, I) - k'] then this example is equivalent to the Ramsey-Solow model of Example 20.C.t. If it has the form G(k.I, k') = F(k, I) - k' - y(k' - k) then we have the cost·of-adjustment model of Example 20.C.2. • Example 20.C.4: (N + I)-Sec/or Model. As in Example 20.C.3, we have a consumption good and labor, but we now interpret k and k' as N-dimensional vectors. For simplicity of exposition, in Example 20.C.3 we have taken G(k,I, k') to be defined for any k ?; 0, k' ?; O. In general, however, this could lead to the production of negative amounts of consumption good. To avoid this it is convenient to complete the specification by means of an admissible domain A of (k,I, k') combinations. Then

y=

I( -k, 0,

-I, k', x, 0): (k,I, k') E A and x $ G(k, I, k')} -

R~(H+2J.

•

Once we have specified our technology, we can define what constitutes a path of production plans. Definition 20.C.1: The list (Yo' y" ... , y" ... ) is a production path, or trajectory, or program, il y, EYe RIL lor every t. Note that along a production path (Yo,' .• , y" •• . ) there is overlap in the time indices over which the production plans y,_, and y, are defined. Indeed, both Y•. ,_' E RL and Y.. E RL represent plans, made respectively at dates / - I and /, for input use or output production at date /. Thus, we have. at every /, a net input-output vector equal to Y•. ,_, + Y., E RL (at / = 0, we put Y•. _I = 0; this convention is kept throughout the chapter)." The negative entries or'this vector stand for amounts of inputs that have to be injected from the outside at period / if the path is to be realized, that is, amounts of input required at period / for the operation of y,_1 and y, in excess of the amounts provided as outputs by the operation of y,_1 and y,. Similarly, the positive entries represent the amounts of goods left over after input use and thus available for final consumption at time /. The situation is entirely analogous to the description of the production side of an economy in Chapter 5. If we think of the technology at every / as being run by a distinct firm (or as an aggregate of distinct firms) and of p, as an infinite sequence with nonzero entries (equal to y,) only in the / and / + I places, then L, p, is the aggregate production path; and it is also precisely the sequence that assigns the net input-output vector Y•. ,_, + Yo, e RL to period t. If we had a finite horizon, the current setting would thus be a particular case of the description of production in 6. A minor point of notation: when there is any possibility of confusion or ambiguity in the reading of indices. we insert commas; for example. we write Y•. ,_I instead of Y.'_I"

SEC.TION

20.C:

'NTERTEMPORAL

PRODUCT'ON

AND

EFFICIENCY

739

-----------------------------------------------------------------------------Chapter 5. With an infinite horizon there is a difference: we now have a countable infinity of commodities and of firms instead of only a finite number. As we shall see. this is not a minor difference. It will. however. be most helpful to arrange our discussion around the exploration of the analogy with the finite horizon case by asking the same questions we posed in Section 5.F regarding the relationship between efficient production plans and price equilibria. Definition 20.C.2: The production path (Yo, ...• y" ... ) is efficient il there is no other production path (Yo, ... , y;, ... ) such that Y•. '-1

+ Yb' $

y~.'-I

+ Yb'

lor all t,

and equality does not hold lor at least one t. I n words: the path (Yo,"" y.. ... ) is efficient if there is no way that we can produce at least as much final consumption in every period using at most the same amount of inputs in every period (with at least one inequality strict). The definition is exactly parallel to Definition 5.F. I. What constitutes a price vector in the current intertemporal context? It is natural to define it as a sequence (Po, p" . .. ,p" . .. ), where p, e RL. For the moment we shall not ask where this sequence comes from. We assume that it is somehow given and that it is available to any possible production unit. The prices should be thought of as present-value prices. We shall discuss further the nature of these prices in the nex t section. Given a path (Yo, ... ,y" ... ) and a price sequence (Po, . .. ,p" ... ), the profit level associated with the production plan at / is

P'·Ybl

+ P,+l"Ys,'

We now pursue the implications of profit maximization on the production plans made period by period. Definition 20.C.3: The production path (Yo" .. , y" ... ) is myopically, or short-run, profit maximizing for the price sequence (Po' ... ,p" ... ) il lor every t we have P,'Yb, + P'+I·Y.,?; P,'Yb, + P'+I'Y~,

lor all y;e Y.

Prices (Po,' .. ,p" . .. j capable of sustaining a path (y" . ..• y ... .. ) as myopically profit-maximizing are often called Malinvaud prices for the path [because of Malinvaud (1953»).' Does the first welfare theorem hold for myopic profit maximization? That is, if (Yo .... ,Y.. ... ) is myopically profit maximizing with respect to strictly positive prices, does it follow that (Yo, ... , y ..... ) is efficient? In a finite-horizon economy this conclusion holds true because of Proposition 5.F.I, but a little thought reveals that in the infinite-horizon context it need not. The intuition for a negative answer rests on the phenomenon of capical overaccumulation. Suppose that prices increase through 7. Observe that we do not require that L. P,"(Y•. I_I + Ybl) < 00. In principle. a production path may have an infinite present value. We saw in Sections S.E and S.F. where we had a finite number of commodities and firms that individual. decentralized profit maximization and overall profit maximization amounted to the same thing. Because of the possibility of an infinite present value. the existence of a countable number of commodities and production sets makes this a more delicate matter in the current context. See Exercises 20.C.2 to 20.C.S for a discussion.

740

C HAP T E R

20:

E QUI LIB R I U MAN D

TIM E

SECTION

---------------------------------------------------------------------------------------time fast enough. Then it may very well happen that at every single period it always pays to invest everything at hand. Along such a path, consumption never takes place-hardly an efficient outcome. Example 20.C.5: With L = I, let Y = ( -k, k'): k ~ 0, k' $ k} C R2. This is just a trivial storage technology. Consider the path where Y, = (- 1,1) for all t; that is, we always carry forward one unit of good. Then Y•. _I + Y.o = - I and Y•. ,_I + Y., = 0 for alit> O. This is not efficient; just consider the path Y; = (0,0) for alit, which has Y~.'_I + = 0 for all t ~ O. But for the stationary price sequence where p, = I for all t, (Yo, ... , y" ... ) is myopically profit maximizing. _

Proposition 20.C,1: Suppose that the production path (Yo" .. , Yr" .. ) is myopically profit maximizing with respect to the price sequence (Po' ... , Pr , ••. ) » O. Suppose also that the production path and the price sequence satisfy the transversality condition Pr+,'Y., -+ O. Then the path (Yo.·'" yr' ... ) is efficient. Proof: Suppose that the path (y~ •...• y;, ... ) is such that Y•. ,_I + Y., $ Y;.'_I + Yb' for all t, with equality not holding for at least one t. Then there is e > 0 such that if we take a T sufficiently large for some strict inequality to correspond to a date previous to T, we must have T

I

.=0

T

p"(Y~.'-1 + Yb') >

I

p,'(Y•. '-1

+ Y.,) + e.

1=0

In fact, if T is very large then PT+ I • Y.T is very small (because of the transversality condition) and therefore T

I

I

T

T-l

PT' YOT

+

I (p,+ .=0

I'

y~,

+ P"Yb') >

I (p,+ .=0

I ' Y.,

+ p,' Yo,).

We must thus have either p, + 1 'y~, + p,' Y" > p,+ I 'Y., + p,' Y., for some t ::; T - I or PT' YOT > PT+ 1 'Y.T + PT' Y.T' In either case we obtain a violation of the myopic profit-maximization assumption [recall that by the possibility of truncation we have (YbT' 0) E Y]. Therefore, no such path (y~, ... , y;, ... ) can exist. Note that the essence of the argument is very simple. The key fact is that if the transversality condition holds, then for T large enough we can approximate the overall profits of the truncated path (Yo, ... , YT) by the sum of the net values of period-by-period input-output realizations (up to period T). It does not matter whether we match the inputs and the outputs per period or per firm (that is, "per production plan "). If the horizon is far enough away, either method will come down to Profits = Total Revenue - Total Cost. _

AND

EFFICIENCY

741

(ii) If the answer to (i) is yes, can we conclude that the pair (Yo, ... , y" ... ). (Po, ... , p" ... ) satisfies the transversality condition? Ti,e allswer to (ii) is "not necessarily." In Section 20.E we will see, by means of an example, that the transversality condition is definitely not a necessary property of Malinvaud prices. The answer to (i) is .. Essentially yes." We illustrate the matter by means of two examples and then conclude this section by a small-type discussion of the general situation.

Example 20,e.6: Ramsey-Solow Model Continued. In this model, we can summarize a path by the sequence (k" I" c,) of total capital usage, labor usage, and amount available for consumption. From now on we assume that k,+ 1 + c,+ I = F(k" I,) and that the sequence I, of labor inputs is exogenously given. Then it is enough to specify the capital path (k o, .•. , k" .• .). Denoting by (q" w,) the prices of the two commodities at t, we have that profits at tare q,+,F(k"I,) - q,k, - w,l, and, therefore, the necessary and sufficient conditions for short-run profit maximization at ( are

~

q,+

p"(Y'.'-1 +y.,).

By rearranging terms-a standard trick in dynamic economics-this can be rewritten as (recall the convention Y•. _I = Y~._I = 0)

PRODUCTION

(i) Is there a system of Malinvaud prices (Po, ... , p" .. .) for (Yo, ... , y" . .. ), that is, a sequence (Po,""p" ... ) with respect to which (Yo, ... ,Y" ... ) is myopically profit maximizing?

T

p,'(Y;.'-1 +yb,»PT+I'Y.T+

INTERTEMPORAL

Proposition 20.e. I tells us that a modified version of the first welfare theorem holds in the dynamic production setting. Let us now ask about the second welfare theorem: Given an efficient path (Yo, ... , y" ... ), can it be price supported? In Proposition 5.F.2 we gave a positive answer to this question which applies to the finite-horizon case. In the current infinite-horizon situation we could decompose the question into two parts:

Y',

Efficiency will obtain if, in addition to myopic profit maximization, the (present) value of the production path becomes insignificant as t ... 00. Precisely, efficiency obtains if the (present) value of the period t production plan for period t + I goes to zero, that is, if p,+ I' Y., ... 0 as t -+ 00. This is the so-called transversality condition. Note that the condition is violated in the storage illustration of Example 20.C.5.

20.C:

--------------------------------------------------------------------------------------

= V,F(k"I,)

and

I

~ = V2 F(k" I,).

q,.l

Notc that, up to a normalization (we could put qo = I), these first-order con· ditions determine supporting prices for any feasible capital path (see Exercise 20.e.6). The transversality condition says that q,+ ,F(k" I,) -+ O. If the sequence of productions F(k" I,) is bounded, then it suffices that q, -+ O. In view of Proposition 2a.e.I. we can conclude that a set of sufficient conditions for efficiency of a feasible and bounded capital path (k o, ..• , k" ... ) is that there exist a sequence of output prices (qo, ... , q" ... ) such that

~ = V,F(k" I,)

for alit

(20.C.I)

q,+1

and

q,

-+

a

(equivalently, Ijq,

-+ 00).

(20.C.2)

Because of the possibility of capital overaccumulation, (2a.e.I). which is necessary, is not alone sufficient for efficiency. On the other hand, (20.e.2) is not necessary (see Section 20.E). Cass (1972) obtained a weakened version of (20.e.2) that, with (2a.c.I),

-

742

CHAPTER

20:

EQUILIBRIUM

AND

SECTIQN

TIME

--------------------------------------------------------------------------------------

20.0:

EQUILIBRIUM:

THE

ONE-CQNSUMER

CASE

743

---------------------------------------------------------------------------------------

is both necessary and sufficient.· The condition is

"" I

L-= ,-0 q,

(20.C.2')

00.

-

Example 20.C.7: Cost of Adjustment Model continued. In the cost or adjustment model. a production plan at time t - I involves the variables k,_ .. 1,_1' k,. c,. We associate with these variables the prices q,_I' q,. s,. Profits are then

w,_"

s,(F(k,.,.I,.,) - k, -y(k,...,. k,_,))

+ q,k, -

Production Possibility Set . Y () ({Y•. ,_,) X RL)

q,_,k,., - w,.,I,.,.

(Y •. Y•. ,_I + YIN,

thus contradicting efficiency.

(a)

(b)

We construct the desired price sequence (Po •...• P, •.. ·) by induction. Put Po = q", (i.e.• the relative prices at r = 0 are the M RTs between goods at the initial part of the production plan Yo EYe R'L). Suppose now that the prices (Po •...• PT) have already been determined, and that every y, up to I = T - 1 is myopically profit maximizing for these prices. Because of the first-order conditions for profit maximization at T - I. we have that PT = aq•. T ' I for some a > O. We know that q•. T ' I = {Jq .. for some {J> O. Then PT = a{Jq'T' Therefore. if we put p,.. 1 = a/iq.", we have that (PT' PT.,) = (a/iq'T' a{Jq.T) is proportional to qT = (q'T' q.T). which means that YT is profit maximizing for (PT. PT. I)' Hence we have extended our sequence to (Po • ...• PT. I) and we can keep going. Note that, as in Examples 20.C.6 and 20.C.7. the construction of the supporting short-run prices does not make full use of the efficiency. What is used is that the production path is "short-run efficient" (that is. the production path cannot be shown inefficient by changes in the production plans at a finite number of dates). The above observations can be made into a perfectly rigorous argument for the existence of Malinvaud prices in the smooth case. The proof for the nonsmooth case is more complex. It must combine an appeal to the separating hyperplane theorem (to get prices for truncated horizons) with a limit operation as the horizon goes to infinity. With a minor technical condition (call IlOnlighrness in the literature). this limit operation can be carried out.

20.D Equilibrium: The One-Consumer Case In this section. we bring the consumption and the production sides together and begin the study or equilibrium in the intertemporal setting. We shall start with the one-consumer case. As we will see in Section 20.G. the relevance or this case goes beyond the domain or applicability or the representative consumer theory or Chapter 4. An economy is specified by a short-term production technology Y c R2L, a utility Junct ion u( . ) defined on R~. a discount factor lJ < I, and, finally, a (bounded) sequence or initial endowments (W o," .• w" .. .), w, e R~. We assume that Y satisfies hypotheses (i) to (iv) or Section 20.C and that u(·) is strictly concave. differentiable. and has strictly positive marginal utilities throughout its

domain. 8. Some additional, very minor, regularity conditions on the production runction F(') are required ror the validity or this equivalence.

Prices are given to us as sequences (Po •. ..• p" . .. ) with p, e R~. As in Chapter 19 we can interpret these prices either as the prices or a complete system of rorward

Figure 2O.C.l (ten)

Smooth production scI. Flgur. 20.C.2 (rtgh')

A production path that is inefficient at T.

744

CHAPTER

20:

EQUILIBRIUM

AND

markets occurring simultaneously at t = 0 or as the correctly anticipated (present value) prices of a sequence of spot markets. We will consider only bounded price sequences. In fact, most of the time we will have IIp,lI _ 0. 9 Given a production path (Yo,"" y.. ... ), y, E Y, the induced stream of consumptions (co, ... , c,' ... ) is given by C,

+ P'+I-Y.'

Fixing T and rearranging the terms of we get

L

(It,+p,'w,)-

L, '" T p,'C, = L, '" T p'(Y•. ,_, + Y., + w,)

L

p,·C,=PT+'·Y.T

(20.0.1)

20. D:

E QUI L' • R , U M:

THE

0 N E - CON SUM E RCA S E

there is a forward market for every commodity at every date, or, in another, that assets (e.g., money) are available that are capable of transferring purchasing power Ihrough time (see Exercise 20.0.1 for more on this). Secondly, observe that the strict monotonicity of u(·) implies that if we have reached utility maximization then, a fortiori, total wealth (denoted w) must be finite; that is,

Lit, + LP,'w, < 00.

w=

for every t.

T , ~"

Moreover, at the equilibrium consumptions the budget constraint of (20.0.4) must hold with equality. An important consequence of the last observation is that at equilibrium the transversality condition is satisfied. Formally, we have Proposition 20.D.1. Proposition 20.0.1: Suppose that the (bounded) production path (yt,···, vi, . .. ) and the (bounded) price sequence (Po, ... ,p" ... ) constitute a Walrasian equilibrium. Then the transversality condition P,+,' 0 holds.

Y:, -

Proof: Denote

c: = Y:.,_, + y:' + w,. By expression (20.0.1) we have L (It,+p,'w,)- L p,·C,=PT+'·Y.T· 1$

T

IS T

Since cach of the sums in the left-hand side converges to w < conclude that PH' • Y:T -+ O. •

00

as T

-+ 00.

we

IS T

IS T

Expression (20.0.1) is an important identity. It tells us that the transversality condition is equivalent to the overall value of consumption not being strictly inferior to wealth (i.e., there is no escape of purchasing power at infinity). The definition of a Walrasian equilibrium is now as in the previous chapters. One only has to make sure that a few infinite sums make sense. Definition 20.0.1: The (bounded) production path (y~, ... , y~" .. ), y~ E Y, and the (bounded) price sequence p = (Po' ... ,p" ... ) constitute a Walrasian (or competitive) equilibrium if:

Y:.,_, Y6, + w, ~ 0

(i) c~ = + (ii) For every t,

1t,

for all t.

(20.0.2)

= P,'Yb, + p,+,'V:, ~ P,'Yb + p,+,·Y.

(20.0.3)

for all Y = (Vb' Y.) E Y. (iii) The consumption sequence (ct, ... ,cr . .. ) ~ 0 solves the problem Max

L, o'u(c,) s.t.

(20.0.4)

L,P,'c, ~ L,1t, + L,p,·w,.

Condition (i) is the feasibility requirement. Condition (ii) is the short-run, or myopic, profit-maximization condition already considered in Section 20.C (Definition 20.C.3). The form of the budget constraint in part (iii) deserves comment. Note first that there is a single budget constraint. As in Chapter 19, this amounts to an assumption of completeness, which means, in one interpretation, that at time t = 0 9. Keep in mind that prices are to be thought

or as

measured in current-value terms.

745

---------------------------------------------------------------------------------------

+ Y'u + 00"

= Y.... _I

If c, ~ 0 for every t, then we say that the production path (Yo, ... ,y" . .. ) is feasible: Given the initial endowment stream the production path is capable of sustaining nonnegative consumptions at every period. To keep the exposition manageable from now on we restrict all our production paths and consumption streams to be bounded. Delicate points come up in the general case, which are better avoided in a first approach. Alternatively, we could simply assume that our technology is such that any feasible production path is bounded. Given a production path (Yo, . .. ,Y... .. ) and a price sequence (Po, ... ,P.. · .. ), the induced stream of profits (Ito, ... , It" ... ) is given by

n, = P,·Y'n

s [C

TIME

-----------------------------------------------------------------------------------

Another implication of Definition 20.0.1 by

w

T. Because /J < I, there is f. > 0 such that if T is large enough then there is an improvement of utility of more than 2e in going from (1'0' ... , "" ••• J to (co, ... , c;, ... J. Since w < 00, the amount I:t> riP,' (c, - c;)1 can be made arbitrarily small. Hence, for large T the stream (c;;' . .. , c;, . ..) is almost budget feasible. It follows that it can be made budget feasible by a small sacrifice of consumption in the first period resulting in a utility loss not larger than e. Overall, it still results in an improvement. But this yields a contradiction because only the consumption in a finite number of periods has been altered in the process. _

20.0:

EOUILtBRIUM:

THE

ONE-CONSUMER

(ii) Suppose instead that Wo = I and w, = 0 for t > O. There is, however, a linear production technology transforming every unit of input at t into IX > 0 units of output at I + I. Because of the boundary behavior of the utility function, consumption will be positive in every period, and therefore the technology will be in operation at every period. The linearity of the technologies then has the important implication that the equilibrium price sequence is completely determined by the technology. Putting Po = I, we must have P, = I/a'. Wealth is IV = PoWo = I, and therefore the equilibrium consumptions must be c: = [.5'(1 - t5)]/p, = (exb)'(1 - b). Note that, as long as I ~ a < I/b, both the price and the consumption sequences are bounded. Observe also the interesting fact that for this example we have been able to compute the equilibrium without explicitly solving for the sequence of capital investments. (iii) We are as in (ii) except that we now have a general technology F(k) transforming every unit k" of investment at t into F(k,) units of output at r + I. This output can then be used indistinctly for consumption or investment purposes at I + I. That is, c,., = F(k,) - k,. I' The logarithmic form of the utility function allows for a shortcut to the computation of equilibrium prices. Indeed, say that (Po," . 'PI' ... ) are equilibrium prices and (c~, ... ,c~, ... ), (k~, ... ,k:, . .. ) equilibrium paths of consumption and capital investment. Then we know that at any T a constant fraction 0 of remaining wealth is invested. That is,

PT.,k~., = b( L

p,c:)

= t5PT+,F(k:J.

'2: T+ 1

Example 20.0.1: In this example we illustrate the use of conditions (20.0.6) for the computation of equilibrium prices. Suppose that we are in a one-commodity world with utility function L, /J' In c,. Given a price sequence (Po,' .. ,p" ... ) and wealth IV, the first-order conditions for utility maximization (20.0.6) are

b'

).p, = c,

for all I,

and

L p,c, = IV. ,

Hence, W = L, p,C, = (1/).) L, b' = (1/).)[1/(1 -.5)] and so p,c, = .5'/i. = .5'(1 - b)1V for all r. Note that this implies a conslant rale of savings because PTcdeL, " T p,c,) = I - 6, for all T (Exercise 20.0.4).'0 We now discuss three possible production scenarios. (i) The economy is of the exchange type; that is, there is no possibility of production and we are given an initial endowment sequence (wo. ... , WI' ... ) » O. Then the equilibrium must involve = w, for every I, and therefore, normalizing to L, p,W, = I, the equilibrium prices should be

c:

0'(1 - b) P,=----

w,

for every t.

10. Logarithmic utility functions facilitate computation and are very important in applications. However, they are not continuous at the boundary (In 1:, _ - 00 as c, - 0) and therefore violate one of our maintained assumptions. This does not affect the current analysis but should be kept in mind.

CASE

747

----------------------------------------------------------------------------------

Therefore, we must have k:., = of(k:J for every I. With ko = Wo = I given, this allows us to iteratively compute the sequence of equilibrium capital investments. The sequence of prices is then obtained from the profitmaximization conditions P,. ,F'(k:J- p, = o.• Since a Walrasian equilibrium is myopically profit maximizing and satisfies the transversality condition (Proposition 20.0.1), we know from Proposition 20.C.1 that it is production efficient (assuming p, » 0 for all I). Can we strengthen this to the claim that the full first welfare theorem holds? We will now verify that we can. In the current one-consumer problem, Pareto optimality simply means that the equilibrium solves the utility-maximization problem under the technological and endowment constraints: Max

L b'u(c,), S.t. (", =

Yu,l-l

(20.0.7)

+ Ybt + w,;,:: 0

y,

and

Proposition 20.0.3: Any Walrasian equilibrium path planning problem (20.D.7).

(V~,

E

Y for all I.

... , V?, . .. ) solves the

Proof: Oenote by B the budget set determined by the Walrasian equilibrium price sequence (Po,· .. , p" ... ) and wealth IV = L,1!, + L, p,·w" where 1[,

= p,'Y:'

+ P,+l'Y:,,+l

748

CHAPTER

EQUILIIIRIUM

20:

AND

TIME

SEC T ION

--------------------------------------------------------------------------------------------for all I. That is.

have

B = {(co•...• c; •.. .): c; ~ 0 for all t and

L, p,'c; !> w}.

By the definition of Walrasian equilibrium. the utility of the stream (c~ •...• c: •... ) defined by = Y:.,_1 + + £0, is maximal in this budget set. It suffices. therefore, to show that any feasible path (y~•. ..• y~, ... ), that is, any path for which y~ E Y and c~ = Y;.,_1 + Y;' + co, ~ 0 for all I, must yield a consumption stream in B. To see this note that, for any T.

c:

Y:,

L

p,'c;=

1ST

L (P,'Yb,+P,+I'Y;,)+Pr'Ybr+ L rST-l

c; »

E QUI L III R I U M:

THE

0 N E • CON. U MER

CAS E

749

0 for all t and. moreover, for it to be legitimate to determine the sign of

L, b'(u(C;) -

u(cn) = ~T(U(CT) - u(cm + b T' '(u(c~+ ,) - u(4+ ,))

by signing the first·order term b T Vu(cf)'(cr - cf) + bT+' Vu(cf+ ,)'(CT" - cf, ,) = PT'(Y;T - rtT) + PT' ,'(Y~r - Y:T) = PTe Y;T

+ PT+ I" Y~T - PTe Y:T - PT+ I" Y:T > O.

But this conclusion contradicts the assumption that

p,·W,.

(y~,

... ,

Y: .... )solves (20.0.7). •

1sT

By the possibility of truncation of production plans, we have (Ybr, 0) E Y. Therefore, by short-run profit maximization, p,' Y,r :$; 1[r and p,' + P, + I • :$; 1[, for all t:$; T- I. Hence,

y"

L

L

p,'c;:$;

1ST

which implies

20. D:

-----------------------------------------------------------------------------

L, p,'c;

1ST

1[,+

L

p,'w,:$;w

y:,

for all T,

1ST

:$; w. •

Let us now ask for the converse of Proposition 20.0.3 (i.e .• for the second welfare theorem question; see chapter 16): Is any solution (Yo, ... , y" ... ) to the planning problem (20.0.7) a Walrasian equilibrium? In essence. the answer is "yes," but the precise theorems are somewhat technical because. to obtain a well-behaved price system (i.e .• a price system as we understand it: a sequence of nonzero prices). one needs some regularity condition on the path. We give an example of one such result." Proposition 20.0.4: Suppose that the (bounded) path (Y6,"" vi, ... ) solves the planning problem (20.0.7) and that it yields strictly positive consumption (in the sense that, for some t > O. en = Yi•. '-l + Yib' + COft > t for all t and t). Then the path is a Walrasian equilibrium with respect to some price sequence (Po.···.P" ... ). Proof: We provide only a sketch of the proof. A possible candidate for an equilibrium price system is suggested by expression (20.0.6): p, = b'VU(Cn

for all t,

y:.,_,

where c: = + Y:' + w,. Because (c~, ...• c: •... ) is bounded above and bounded away from the boundary (uniformly in t) we have L, IIp,1I < 00, which implies the transversality condition. In turn. by expression (20.0.1) this yields L, Poc: = L, (n, + p,'w,) = w < 00. Therefore. by Proposition 20.0.2, the utility·maximization condition holds. I! remains to establish that short·run profit maximization also holds. To that effect suppose that this is not so. that is. that for some T there is y' E Y with PT·Y~

+ PT+t 'y~ > PT'Y:T + PT+ I'Y:T =

nT'

Let (y; •. ..• y;, ... ) be the path with Yr = y' and y; = Y: for any t >F T. Let (co •... , c; •. .. ) be the associated consumption stream. Because of the convexity of Y and the strict positivity for us to property of (c~ •. ..• c: •. .. ) we can assume that YT = Y' is sufficiently close to

yr·

II. A general treatment would involve, as in Sections IS.C or 16.D,lhe application ora suitable

version (here infinite·dimensional) of the separating hyperplane theorem. The next result gets around this by exploiting the differentiability of It(·). I! is thus parallel to the discussion in Section 16.F.

The close connection between the solutions of the equilibrium and the planning problem (20.0.7) has three important implications for. respectively. the existence. uniqueness. and computation of equilibria. The first implication is that it reduces the question of the exiSlence of an equilibrium to the possibility of solving a single optimization problem. albeit an infinite-dimensional one. Proposition 20.0.5: Suppose that there is a uniform bound on the consumption streams generated by all the feasible paths. Then the planning problem (20.0.7) attains a maximum; that is, there is a feasible path that yields utility at least as large as the utility corresponding to any other feasible paths. The proof. which is purely technical and which we skip. involves simply establishing that, in a suitable infinite-dimensional sense. the objective function of problem (20.0.7) is continuous and the constraint set is compact. The second implication is that it allows us to assert the uniqueness of equilibrium. Proposition 20.0.6: The planning problem (20.0.7) has at most one consumption stream solution. Proof: The proof consists of the usual argument showing that the maximum of a strictly concave function in a convex set is unique. Suppose that (yo •...• y, •. .. ) and (yo •...• y; •... ) are feasible paths with L, .5'u(c,) = L, .5'u(c;) = Y. where (co •...• c, •. .. ) and (co •. ..• c; •.. .) are the consumption streams associated with the two production paths. Consider y~ = lY, + !y;. Then the path (y~, ...• y~ •. .. ) is feasible and at every t the consumption level is c~ = !c, + !c;. Hence. L, .5'u(c~) ~ Y. with the inequality strict if c, l' c; for some t. Thus. if c, '# c; for some t. the paths (yo, ... , y, •.. .). (y~ •...• y; •... ) could not both solve (20.0.7). • The third implication is that Proposition 20.0.3 provides a workable approach to the computation of the equilibrium. We devote the rest of this section to elaborating on this point.

The Computation of Equilibrium and Euler Equations It will be convenient to pursue the discussion of computational issues in the slightly restricted setting of Example 20.C.4, the (N + I)-sector model. To recall. we have N capital goods, labor, and a consumption good. We fix the endowments of labor to a constant level through time. A function G(k. k'). gives the total amount of consumption good obtainable at any t if the investment in capital goods at t - I is

-

750

c HAP T E R

2 0:

E QUI LIB R I U"

AND

T

I .. E

SEC T ION

20.

D: E QUI LIB R I

U .. :

THE

0

NE· CON. U .. E RCA. E

751

-------------------------------------------------------------------------------------------given by the vector k E RH. the investment at t is required to be k' E R~. and the labor usage at t - I and t is fixed at the level exogenously given by the initial endowments. We denote by A c RH X RH the region of pairs (k. k') E R2H compatible For notational with nonnegative consumption [i.e .• A = Ilk. k') E R2H: G(k. k') ~ convenience. we write u(G(k. k'» as u(k. k'). We assume that A is convex and that 11(', .) is strictly concave. Also. at t = 0 there is some already installed capital invcstment ko and this is the only initial endowment of capital in the economy. In this economy the planning problem (20.0.7) becomes'2

On

(20.0.8) s.t. (k,_" k,) E A for every t. and ko =

ko ·

From now on we assume that (20.0.8) has a (bounded) solution. Because of the strict concavity of u(· • . ) this solution is unique. For every / ~ I the vector of variables k, E RH enters the objective function of (20.0.8) only through the two-term sum o'u(k,_,. k,) + 0'+ 'u(k,. k, + d. Therefore, diffcrentiating with respect to these N variables. we obtain the following necessary conditions for an interior path (k o•...• k, •. .. ) to be a solution of the problem (20.0.8): 13 for every n :> Nand /

~

I.

In vector notation. for every t

~

I.

(20.0.9)

Conditions (20.0.9) are known as the Euler equations of the problem (20.0.8). Example 20.0.2: Consider the Ramsey-Solow technology of Example 20.C.1 (with I, = I for all t). Then. u(k. k') = u(F(k) - k') and A = Ilk. k'): k' $; F(k)}. Therefore, the Euler equations take the form -II'(F(k,_d- k,)

+ ou'(F(k,) -

k,+,)F'(k,) = O.

for all/2'o I

or

~=F'(k,)

for all t

~

I.

ou (c,+ ,)

In words: the marginal utilities of consuming at t or of investing and postponing consumption one period are the same. _ Example 20.0.3: Consider the cost-of-adjustment technology of Example 20.C.2 (except that as in Example 20.0.2 we fix " = I for all t and drop labor as an explicitly considered commodity) and suppose we have an overall firm that tries to maximize the infinite discounted sum of profits by means of a suitable investment policy in capacity. Output can be sold at a constant unitary price that. with a constant rate 12. By convention we put u(k_ 1• ko) == O. 13. The expression "interior path" means that (kit k,. I) is in the interior of A for all t. For the interpretation of the expression to come, recall also that k. and k~ stand, respectively. for the nth and Ihe (N + 1I)lh argumenl or u(k, k').

••

of interest. gives a present value price of 0'. Thus the problem becomes that of maximizing L, o'[F(k'_I) - k, - y(k, - k,_,)]. The Euler equations are then -I-y'(k,- k,_,)

+ o[F'(k,) +y'(k,+,

-

k,n =0

for all t ~ 1.

In words: the marginal cost of a unit of investment in capacity at t equals the discounted value of the marginal product of capacity at t plus the marginal saving in the cost of capacity expansion at t + I. Note that. iterating from t = I. we get I

+ y'(k,

- k o) =

L li'(F'(k,) -

,,,I

I).

In words: At the optimum. the cost of investing in an extra unit of capacity at t = I equals the discounted sum of the marginal products of a maintained increase of a unit of capacity. '4 See Exercise 20.0.5 for more detail.'· _ Suppose that a path (k o•..•• k" ... ) satisfies the Euler necessary equations (20.0.9). From their own definition, and the concavity of u(· •. ). it follows that the Euler equations are also sufficient to guarantee that the trajectory cannot be improved upon by a trajectory involving changes in a single k,. In fact, the same is true if the changes are limited to any finite number of periods (see Exercise 20.0.6). Thus. we can say that the Euler equations are necessary and sufficient for short-run optimization. The question is then: Do the Euler equations (or. equivalently. short-run optimization) imply long· run optimization? We shall see that. under a regularity property on the path (related. in a manner we shall not make explicit. to the transversality condition 16). they do. We say that the path (k o•. ..• k, •.. .) is strictly interior if it stays strictly away from the boundary of the admissible region A. [More precisely. the path is strictly interior if there is £ > 0 such that for every t there is an £ neighborhood of (k,. k,+ I) entirely contained in A.] Proposition 20.0.7: Suppose that the path (*0' ... ' k" ... ) is bounded, is strictly interior, and satisfies the Euler equations (20.0.9). Then it solves the optimization problem (20.0.8). Proof: The basic argument is familiar. Ir 0'0'" .. k, •. .. ) does not solve (20.0.8). then there is a feasible trajectory (/ ko• then k, is unbounded. The only value of k, generating a bounded k, is k, = ko' Therefore. t/I(k o) = ko for any ko' It is instructive to see what happens if we try k, ~ ko. Then, the path induced by the ditTerence equation is feasible and. in fact, we have a constant level of consumption C, = 2k,_, - k, = 2ko - k,. Thus, for k, > k o• we have here an example of a path that is compatible with the Euler equations but that is not optimal. because at k, = ko we get a higher level of constant consumption." _

It may be helpful at this stage to introduce the concept of the value function V(k) and the policy fUllctioll t/I(k). Given an initial condition ko = k. the maximum value attained by (20.D.8) is denoted V(k). and if (k o• k l ••••• k, •... ) is the (unique) trajectory solving (20.D.8) with ko = k. then we put t/I(k) = k,. That is. t/I(k) e IRN is the vector of optimal levels of investment. hence of capital. at t = I when the levels of capital at t = 0 are given by k. What accounts for the importance of the policy function is the observation that if the path (k o•...• k" ...) solves (20.D.8) for ko = ko then. for any T, the path (k T • ••• ,k T +, •. ..) solves (20.D.8) for ko = k,.. Thus, if (k o, ...• k" . .. ) solves (20.D.8) we must have

k,+, = t/I(k,) for every t.

2 0 • D:

CA. E

753

---------------------------------------------------------

The dynamic programming approach exploits the recursivity of the optimum problem (20.D.8). namely. the fact that

V(k) =

(20.D.IO)

Max

u(k. k')

+ 0 V(k'),

(20.D.11)

,'""Uh(i,l')EA

and we see that the optimal path can be computed from knowledge of ko and the policy function t/I(.). But how do we determine t/I(')1 We now describe two approaches to the computation of t/I(.). The first exploits the Euler equations; the second rests on the method of dynamic programming. The Euler equations (20.D.9) suggest an iterative procedure for the computation of t/I(k). Fix ko = k and consider the equations corresponding to k,. With ko given, we have N equations in the 2N unknowns k, e IRN and k2 eRN. There are therefore N degrees of freedom. Suppose that we try to fix k, arbitrarily [equivalently. we try to fix - V2u(ko. k,), the marginal costs of investment at t = I] and then use the N Euler equations at t = 1 to solve for the remaining k2 unknowns [equivalently, we adjust the commitments for investment at t = 2 so that the discounted marginal payofT of investment at t = I. bV,u(k" k 2 ). equals the preestablished marginal cost of investment at t = I, i.e. - V2u(ko, k,)]. Suppose that such a solution k2 is found [by the strict concavity of u(·), if there is one solution then it has to be unique]. We can then repeat the process. The N Euler equations for period 2 are now exactly determined: Both kl and k2 are given. but we still have the N variables k) corresponding to t = 3 with which we can try to satisfy the N equations of period 2. Suppose that we reiterate in this fashion. There are three possibilities. The first is that the process breaks down somewhere, that is, that given k, _, and k, there is no solution k,+, [or, more precisely, no solution with (k,. k,+ ,) e A]; the second is that we generate a sequence that is unbounded (or nonstrictly interior); the third is that we generate a bounded (and strictly interior) sequence (k o, k" ... ,k" ... ). In the third case, by Proposition 20.D.7 we have obtained an optimum, and since by Proposition 20.D.6 the optimum is unique, we can conclude that given ko, tile third possibility (the trajectory startillg at ko and kl is strictly interior alld bounded) can occur for at most one value of k l . If it occurs, tltis value of kl is precisely t/I(k o). Thus, the computational method is: Solve the ditTerence equation induced by the Euler

and obtains t/I(k) as the vector k' that solves (20.D.II). This, of course, only transforms the problem into one of computing the value function V(·). However, it turns out that. first, under some general conditions [e.g.• if V(·) is bounded] the value function is the only function that solves (20.D.11) when viewed as a functional equation, that is, V(·) is the only function for which (20.D.lI) is true for every k. and, second, that there are some well-known and quite effective algorithms for solving equations such as (20.D.II) for the unknown function V(·). (Sec Section M.M. of the Mathematical Appendix.) We end this section by pointing out two implications of the definition of the value function (sec Exercise 20.D.8): (i) The value fUllction V(k) is concave. (ii) For every perturbacion parameter z e IRN with (k

V(k + z)

~

+ z, t/I(k» e A we have

u(k + z, t/I(k» + oV(t/I(k)).

(20.D.12)

Suppose that N = I and (k, t/I(k)) is interior to A. For later reference we point out that from (i), (ii), and V(k) = u(k. t/I(k)) + 0 V(t/I(k» we obtain

V'(k) and, if V(·) is twice-differentiable,

= V,u(k. t/I(k»

VH(k) ~ V:,u(k, t/I(k)). (See Figure 20.0.1 and Exercise 20.D.9. 18 ) 17. Hence. when k, > k., the Euler equations lead to capital overaccumulation. We note. without further elaboration, that given a path satisfying the Euler equal ions we could use the equations themselves to determine a myopically supporting price sequence. However, jf k I > ko Ihis sequence will violate the transvcrsality condition. 18. The expression V;~f(') denotes the ij second partial derivative of the real-value function f(·).

1

754

C HAP T E R

2 0:

E QUI LIB R I U MAN 0

5 EC T,0 N

TIM E

----------------------------------------------------------------------~

V(k +:)

Figure 20.0.1

Along an optimal P"h the value function is majorized by the

utilities of singleperiod adjustments.

20.E Stationary Paths, Interest Rates, and Golden Rules In this section, we concentrate on the study of steady states. This study constitutes a first step towards the analysis of the dynamics of equilibrium paths. We refer to Bliss (1975), Gale (1973), or Weizsiicker (1971) for further analysis of steady-state theory. We begin with a production set Y c I1\llL satisfying the properties considered in Section 20.C. Recall that a production path is a sequence (yo, ... , y" ... ) with y, E Y for every r. DefinItion 20.E.1: A production path (yo, ... , y" ... ) is stationary, or a steady state, if there is a production plan y = (Yb' Y.) E Y such that y, = I' for all t> O. Abusing terminology slightly, we refer to the "stationary path (y, ... , y, .. .)" as simply the "stationary path y." The first important observation is that stationary paths rhat are also efficient are supportable by proportional prices. '9 This is shown in Proposition 20.E.!. ProposItion 20.E.1: Suppose that Ii E Y defines a stationary and efficient path. Then, there is a price vector Po E I1\lL and an ex > 0 such that the path is myopically profit maximizing for the price sequence (Po' expo' ... , ex'PD' ... ). Proof: A complete proof is too delicate an affair, but the basic intuition may be grasped from the case in which production sets have smooth boundaries. For this case we can, in fact, show that every (myopically) supporting price sequence must be proportional. By the efficiency of the path (y, . .. , )" ... ), the vector y must lie at the boundary of Y. Let q = (qo, q,) be the unique (up to normalization) vector perpendicular to Y at y. Also, by the small type discussion at the end of Section 20.C, there exists a price sequence (Po, ... , P..... ) that myopically supports this efficient path. Because }' E Y is short-run profit maximizing at every r we must have (P .. P.. ,) = ).,(qo, q ,) for some ;., > O. Therefore, p, = ).,qo and p,+, = ;.,q, for all r. In particular, p, = ;., _, q, and p, = ;., qo. Combining, we obtain p, = (}.,/;., _ ,) p, and

+,

+,

+,

t 9. To prevent possible misunderstanding, we warn that establishing the inefficiency of a given stationary path will typically require the consideration of nonstationary paths.

I·

'I

S TAT' 0 N A A Y

PAT H S,

'N TEA EST

A ATE S ,

AND

+,

= ()., + ,/;., ) p,. From this we get ).,/)., _, = )., + ,/)., for all t 2: 1. Hence, denoting this quotient by ex, we have p,+, = exp, = ex 2 p,_, = ... = ex'+'po' The factor ex has a simple interpretation. Indeed, r = (I - ex)/ex [so that p, = (I + r)p,+ ,] can be viewed as a rate oj interest implicit in the price sequence (see Exercise 20.E.I). Proposition 20.E.I is a sort of second welfare theorem result for stationary paths. We could also pose the parallel first welfare theorem question. Namely, suppose that (Y, ... , }', ... ) is a stationary path myopically supported by a proportional price sequence with rate of interest r. If r> 0, then P, = (1/(1 + r»'po -+ 0 and therefore the transversality condition p, y. -+ 0 is satisfied. We conclude from Proposition 20.C.1 that the path is efficient. If r ~ 0, the transversality condition is not satisfied (p, docs not go to zero), but this does not automatically imply inefficiency because the transversality condition is sufficient but not necessary for efficiency. Suppose that r < 0 and, to make things simple, let us be in the smooth case again. Consider the stationary candidate paths defined by the constant production plan y, = (Y. + r.e,),. - u), where e = (1, ...• 1) E RL. This candidate path uses fewer inputs (or produces more outputs) at t = 0 and generates exactly the same net input-output vector at every other t. Therefore, if for some £ > 0, the candidate path is in fact a feasible path; that is. if y, E Y, then the stationary path y is not efficient (it overaccumulates). But if Y has a smooth boundary at y, the feasibility of y, for some r. > 0 can be tested by checking whether y, - y = £(e, - e) lies below the hyperplane determined by the supporting prices (Po, [1/(1 + r)]po). Evaluating. we have £(1 - 1/(1 + r»po·e < 0, because r < O. Conclusion: For'£ small enough, the stationary path)' is dominated by the stationary path y,. We record these facts for later reference in Proposition 20.E.2. p,

__~=---u(k +:, ",(k)) + 6V(I{t(k))

2 0 . E:

Q 0 L DEN

AULES

755

-----------------------------------------------------------------------

Proposition 20.E.2: Suppose that the stationary path (y, ... , y . .. ), YE Y, is myopically supported by proportional prices with rate of interest r, then the path is efficient if r > 0 and inefficient if r < O. We have not yet dealt with the case r = 0, which as we shall see, is very important. 20 We will later verify in a more specific setup that efficiency obtains in this case. Let us now bring in the consumption side of the economy and consider stationary equilibrium pat Its. Assuming differentiability and interiority, a stationary path (y, ... , y, ... ) that is also an equilibrium can be supported only (up to a normalization) by the price sequence p, = IJ' Vu(c). where c = Yb + y.; recall Proposition 20.D.4 and expression (20.D.6). That is. a slationar.v equilibrium is supported by a price sequence clIlbodyillf} (I proportiollalil,l' Jacror equal 10 rite discount Jactor .5, or, equivalently, with rate of interest r = (I - .5)/.5. Definition 20.E.2: A stationary production path that is myopically supported by proportional prices p, = rx'Po with (X = .5 is called a modified golden rule path. A stationary production path myopically supported by constant prices p, = Po is called a golden rule parh. 20. No'e that 0 is 'he ra'e of growth implicit in the pa,h (Y, ... , y, .. .). In a more general treatment we could allow for a constant returns technology and for the production path to be proportional (but not necessarily stationary). Then Proposition 20.E.2 remains valid with 0 replaced by the corresponding rate of growth.

756

c

H .. PTE R

20,

E

Q

U , L' B R , U II

.. NOT' II E

SEC T ION

2 0 • E:

S TAT ION A R Y

P It. T H S.

I N T ERE S T

RAT E S.

AND

G 0 L DEN

RULES

757

----------------------------------------------------------------------------------- ---------------------------------------------------------------------Depending on the technology and on the discount factor b, there may be a single or there may be several modified golden rule paths (see the small-type discussion at the end of this section). But in any case we have just seen that a statiollary equilibrium path is lIecessarily a modified goldell rule path. Thus, we have the important implication that the calldidates for statiollary eqllilibrium paths (y, . .. ,y, ... ) are

Qu'put

completely determined by the technology and the discount faclOr and are independent <J{ the lIIility fllnctioll u(·).

To pursue the analysis it will be much more convenient to reduce the level of ahstraction. Consider an extremely simple case, the Ramsey-Solow model technology of Example 20.C.1. We study trajectories with I, = I for all t (imagine that there is available one unit of labor at every point in time). We can then identify a production path with the sequence of capital investments (k o, ... , k" ... ). Given (k o, ... , k" .. .), denote r, = V,F(k" 1) - I. Thus, r, is the lIel (i.e., after rcplacing capital) marginal productivity of capital. Suppose that k, > 0 and that the sequence of output prices (qo, ... ,q" .. .) and wages (w o, ... , H'" ... ) myopically price supports the given path. Then, by the first-order condition for profit maximization, we have q,+,(1 + r,) - q, = O. Hence r, is the output rate of interest at time t implicit in the output price sequence (qo, ... ,q" ... ). Let us now focus on the stationary paths of this example. Any k ;,: 0 fixed through time constitutes a sleady stale. With any such steady state we can associate a constant surplus level c(k) = F(k, 1) - k and a rate of interest r(k) = V,F(k, I) - I, also constant through time. 1 ' Therefore, the supporting price-wage sequence is with H'o

V1F(k,l)

q. = V,F(k, 1>"

Denote by w(k) the real wage wo/qo so determined. It is instructive to analyze how the steady-state levels of consumption e(k), the rate of interest r(k), and the real wage w(k) depend on k. Let k be the level of capital at which the steady-state consumption level is maximized [i.e., k solves Max F(k, 1) - k). Note that k is characterized by r(k) = V, F(k, 1) - 1 = O. Thus k is precisely the goldell rule steady state. The construction is illustrated in Figure 20.E.I, where we also represent the modified golden rule k. [characterized by r(kd ) = V,F(k., 1) - 1 = (I - 0)/0). Observe that if k < k then r(k) > O. As we saw in Proposition 20.E.2, r(k) > 0 implies that the steady state k is etl1cicnt (thus, in particular, the modified golden rule is etl1cient: it gives less consumption than the golden rule but it also uses less capital). Similarly, if k > k then r(k) < 0 and we have inefficiency of the steady state k. What about k?" We now argue that the golden rule steady state k is efficient. A graphic proof will be quickest. Suppose we try to dominate the constant path k by starting with ko < k, so that consumption at I = 0 is raised. Since the surplus at t = 1 must be at least

21. Thus. c(k) is the amount of good constantly available through time and usable as a flow

for consumption purposes. 22. Recall that the associated price sequence is constant and that the transversality condition is therefore violated.

Figure 2O.E.l

I"pu,

-k I

k.

t

The production technology of the Ramsey-Solow model and the golden rule.

I

Golden Mod,fied _ Rule Golden Rule J k,

='0 -

( O. We will not state or demonstrate this theorem precisely, but the main idea of its proof is quite accessible. We devote the next few paragraphs to it. Suppose for a moment that for a given u(', .) our candidate 1/1(') is such that oJ!(k) solves, for every k, the following "complete impatience" problem: Max

u(k, k').

(20.F.2)

t':
This would be the problem of a decision maker who did not care about the future. While this is not quite the problem that we want to solve, it approximates it if we take" > 0 to be very low. Then the decision maker cares very little about the future and therefore its optimal action k' will, by continuity, be very close to I/I(k). Hence, in an approximate sense, we are done if we can find a u(', .) such that oJ!(k) solves (20.F.2) for every k. In order for a oJ!(k) > 0 to solve (20.F.2), u(k, .) cannot be everywhere decreasing in its second argument (the optimal decision would then be k' = 0). [n the simplest version of the Ramsey-Solow model (Example 20.C.I), the returns of k', the investment in the current period, accrue only in the next period, and therefore the utility function u(k, k') is decreasing in k'. But in the current, more general, two-sector model there is no reason that forces this conclusion. Suppose, for example, that there are two consumption goods. The first is the usual consumption-i~vestment good, while the second is a pure consumption good not perfectly substitutable with the first. Say that with an amount k of investment at time t - lone gets, jointly, k units of the consumption-investment good at time t and k units of the second consumption good at time t - I. Accordingly, with k' units of the consumption-investment good invested at lone gets, jointly, k' units of the consumption-investment good at t + 1 and k' units of the second consumption good at t. Thus, if k and k' are the amounts of investment at 1 - I and I, respectively, then the bundle of consumption goods available at 1 is (k - k', k'). Hence, the utility function u(',') has the form u(k, k') = u(k - k', k'), where a(·, .) is a utility function for bundles of the two consumption goods. Therefore, our problem is reduced to the following: Given oJ!(k) can we find a(·, .) such that oJ!(k) solves Max,. u(k - k', k') for all k in some range? The problem is represented in Figure 20T5. 24 We see from the figure that the problem has formally become one of finding a concave utility function with a prespecified Engel curve at some given prices (in our case, the two prices are equal). Such a utility function can always be obtained. It is a well-known, and most plausible fact that the concavity of a(·) imposes no restrictions on the shape that a single Engel curve may exhibit (see Exercise 20.F.I). The news is not uniformly bad, however. In principle, as we have seen, everything may be possible; yet there are interesting and useful sufficient conditions implying a

24. We also assume that of;(k) < k for all k.

762

C HAP T E R

, 0:

E QUI LIB R I U MAN D

8 ECTI Q N

TIM E

, 0 • F:

DYNA M I CS

763

---------------------------------------------------------------------------------=-~~~~--------------------------------Second Hence, if the discount factor t5 is close to 1, it is a plausible conclusion that 1!/I'(k)1 < 1 Consumption Good

for all k. In technical language: !/I(') is a contraction, and this implies global convergcnce to a unique steady state.'· In Exercise 20.F.2 you are invited to draw the policy functions and the arrOw diagrams for this case. A particular instance of a contraction is exhibited in Figure 20.F.1. ~_ _

(k - y,(k). y,(k))

Figure 20.F.S

Construction of an

arbitrary policy function in the completely impatient casco

45 First Consumption Good

well-behaved dynamic behavior. We discuss two types of conditions: a low discollllr of time and ('ross derivatives of uniform posit;l'£, sign.

Low Discoullr of Time One of the most general results of dynamic economics is the turnpike rheorem, which, informally, asserts that if rhe one-period utility function is strictly concave and tile decision maker is very patient, then rhere is a single modified golden rule sready scare

rhar, moreover, attracts the optimal trajectories from any initial position. I n the context of the two-sector model studied in this section, we can give some intuition for the turnpike theorem. Suppose that the value function V(k), which is concave, is twice-differentiable." At the end of Section 20.D, we saw that since by definition,

V(k

Cross Derivative of Uniform PositilJe Sign

+ z) 2: u(k + z, !/I(k» + W(t/I(k»

for all z and k (with equality for z = 0), we must have

V'(k) = V,u(k, t/I(k»

and

V"(k) 2: V?,u(k, !/I(k»

for all k.

Also for all k, t/I(k) solves the first-order condition

V2 u(k, t/I(k»

+ t5 V'(!/I(k)) = O.

(20.F.3)

Differentiating this first-order condition, we have (all the derivatives are evaluated at k, !/I(k) and assumed to be nonzero)

V 2 u(') t/I'(')

Because

= - Vi,"(';'+ t5V"(')

Vl 11l(') :;:; 0 and SV;'Il(') :;:; b V"(-) :;:; 0, it follows that 1\11'(')1 :;:; \Vi,u(

The lurnpike Iheorem is valid for any number of goods. The precise slatement and the proof of the Iheorem are subtle and technical [see McKenzie (1987) for a brief survey], but the main logic is simply conveyed. Consider the extreme case where there is complete patience, that is, "only the long-run matters." A difficulty is that it is not clear what this means for arbitrary paths; but at least for paths that are not too "wild," say for those that from some time become cyclical, it is natural to assume that it means that the paths are evaluated by taking the average utility over the cycle. Observe now that for any cyclical nonconstant path. the strict concavity of the utility function implies that the constant path equal to the mean level of capital over the cycle yields a higher utility. It may take some time 10 carry out a transition from the cycle to the constanl path (e.g., it may be necessary to build up capital) bul, as long as this can be done in a finite number of periods, the cost of the transition will not show up in the long run. Hence the cyclical nonconslant path cannot be optimal for a completely patienl optimizer. By continuity, all this remains valid if 0 is very close 10 I. We can conclude. therefore, that if a path tends to a nonconstant cycle then we can always implement a finite Iransition to a suitable constant "long-run average," for a relatively large long-run gain of utility and a relatively low short-run cost. In fact, this conclusion remains valid whenever a path does not stabilize in the long-run. It follows that the optimal path must be asymptotically almost constant, which can only be the case if the path reaches and remains in a neighborhood of a modified golden rule steady state (recall from Section 20.E that those are the only constant paths that can be equilibria, and therefore optimal)."

~i~u~~~ u(. )\. I

We shall concern ourselves here with the particular case of the two-sector model studied so far where V,u(k, k') > 0 and V,u(k, k') < 0 for all (k, k'). By a cross derivative of uniform positive sign we mean that V12 u(k, k') > 0, again at all points of tlte domain. In words: An increase in investment requirements at one date leads 10 a situation of increased productivity (in terms of current utility) of the capital installed the previous date. Examples are the classical Ramsey-Solow model u(F(k) - k') and the cost-of-adjustment model u(F(k) - k' -y(k' - k)) (see Exercise 20.F.3). We shall argue that under tltis cross derivative condition tIle policy function is increasing (as in Figures 20.F.1 or 20.F.2), and therefore the optimal path converges to a stationary path. To prove the claim, it is useful to express \II(k) as the k' solution to Max

u(k, k')

+ bV

(20.FA)

(II',VI

s.t. V:;:; V(k'),

By the concavity of 11(') we have (see Sections M.e and M.D of the Mathematical Appendix)

26. We nole thai y,(.) need not be monotone and Ihe convergence may be cyclical. although the cycles will dampen through lime.

25. For a (very advanced) discussion of this assumption. see Santos (1991).

i

~L

27. Also. with ~ close 10 1, the modified golden rule willlypically retain the uniqueness property of the golden rule.

764

C HAP T E R

2 0:

E

a

U I liB R I U MAN 0

TIM E

765 -------------------------------------------------------------------- ----------------------------------------------------------------------------Indifference Curve for Ihe Ulilily + W, k fixed

u(k, k')

v

SEC T ION

-",-",//

k:I-_ _ _. ....L... /~=------ !/I,(')

>k

k;'

r-A

r----~~/-"..."...------!/I(. ) / / / ~ Transilory Shock //

_.",&----V(k')

,,/

"" ",,""

!/I(k)

S EVE R A l e 0 N SUM E R S

Permanenl Shock / /

J /

I'(o/I(k))

E QUI l I • R I U M:

k'

Indifference Curve for Ihe Ulilily , u(i. k') + J V, k fixed

i:

J. . . . . . . .

2 0 • G:

/

/"

,, I, ,,I

~5'

I

f

Figure 2O.F.6

k'

where V(·) is the value function. For fixed k, problem (20.F.4) is represented in Figure 20.F.6. The marginal rate of substitution (M RS) between current investment k' and future utility V at s = (I/I(k), V(I/I(k))) is O/b)V2u(k. I/I(k» < O. Suppose now that we take k > k. Then the indifference map in Figure 20.F.6 changes. Because V'211(k. I/I(k)) > O. the MRS at s is altered in the manner displayed in the figure. that is. the indifference curve becomes flatter. But we can see then that necessarily I/I(k) > I/I(k), as we wanted to show. The cross derivative condition does not. by itself. imply the existence of a single modified golden rule. Thus, we could be in Figure 20.F.2 rather than in Figure 20.F.!. Note, however. that in many cases of interest it may be possible to show directly that the modified golden rule is unique. Thus. in both the classical Ramsey-Solow model of Example 20.C.1 and in the cost-of-adjustment model [with )1'(0) = 0] of Example 20.C.2. the modified golden rule is characterized by F'(k) = lib. Hence it is unique and. because the policy function is increasing. we conclude that every optimal path converges to it. We also point out that if the cross derivative is of uniform negative sign. then. by the same arguments. 1/1(') is decreasing. While this allows for cycles. the dynamics are still relatively simple. In particular. the non monotonic shape associated with the possibility of chaotic paths (Figure 20.F.4) cannot rise. See Deneckere and Pelikan (1986) for more on these points. Figure 20.F.6 is also helpful in illuminaling Ihe dislinclion bel ween Irallsilory and permanelll shocks. One of Ihe importanl uses of dynamic analysis in general. and of global convergence turnpike results in particular. is in the examination of how an economy at long-run rest reacts 10 a perlurbalion of the dala al lime I = I. In an exlremely crude c1assificalion, Ihese perturbalions can be of Iwo Iypes: (i) Transitory shocks affeci the environment of Ihe economy only al I = I; Ihal is, Ihey aller ko or. more generally. u(ko• . ). Ihe ulilily function at I = I. Then Figure 20.F.6 allows us to see how the equilibrium path will be displaced. The (k'. V) indifference curve of u(k o• k') + {, V changes, bul the constraint function V(k') remains unaltered. Hence. afler the (transilory) shock, Ihe new k', corresponds 10 the solulion of the optimum problem depicled in Figure

With the uniform positive sign cross derivative condition, the policy funclion is increasing.

20.F.6 bUI with Ihe new indifference map. From

t =

2 on we simply follow Ihe old policy

function.

(ii) Permanelll shocks move the economy to a new utility function u(k. k') constant over time. Then Ihe entire policy funclion changes to a new ';('). In terms of Figure 20.F.6 Ihere would be a change in both the indifference curves and the constraint. The new kf is now harder 10 determinc and to compare with the preshock k, or. for the same shock at period I. wilh k;'; bul il can oftcn be done. We pursue Ihe matter through Example 20.F.1. Example 20.F.I: Consider the separable utilily u(k. k') = y(k) + h(k'). This could be the inveslmenl problem of a firm: g(k) is Ihe maximal revenue obtainable wilh k. and -h(k') is Ihe cost of investment. Then Vl,u(k. k') = 0 at all (k. k'). Our previous analysis of Figure 20.F.6, lells us Ihal in this case o/t ... ,y" ... ), and (COl' ••• , C,I' •.• )
+ V1,u(·) + oVi,u(·) + Vf,u('))

o(Vi,u(')

(20.G.6)

If there are no externalities [i.e., if V1,u(') = Vf,u(') = 0] then the cO'lcavity of u(·, ,) implies that expression (20.G.6) is larger than I in absolute value (you should verify this in Exercise 20.G.5). Thus, in agreement with the discussion of Section 20.0, we are not then able to find a non·steady-state solution of the Euler equations. But if the externality effects are significant enough, inspection of expression (20.G.6) tells us immediately that dk1+ ,/dk, can perfectly well be less than 1in absolute value. The same is true for dk1+ ,/ok,_" and therefore we can conclude that robust examples with a continuum of equilibria are possible.

2Q,H Overlapping Generations In the previous sections we have studied economies that, formally, have an overlapping structure of firms but only one (or, in Section 20.G, several), infinitely long-lived, consumer. We pointed out in Section 20.B that in the presence of suitable forms of altruism it may be possible to interpret an infinitely long-lived agent as a dynasty. We will now describe a model where this cannot be done, and where, as a consequence, the consumption side of the economy consists of an infinite succession of consumers in an essential manner. To make things interesting, these consumers, to be called generations, will overlap, so that intergenerational trade is possible. The model originates in Allais (1947) and Samuelson (1958) and has become a workhorse of macroeconomics, monetary theory, and public finance. The literature on it is very extensive; see Geanakoplos (1987) or Woodford (1984) for an overview, Here we will limit ourselves to discussing a simple case with the purpose of highlighting, first, the extent to which the model can be analyzed with the Walrasian equilibrium

770

c HAP T E

R

2 0:

E QUI l I 8 R I U MAN 0

TIM E

SEC TIC. N

-------------------------------------------------------------------------------------methodology and, second, the departures from the broad lessons of the previous sections. We shall classify these departures into two categories: issues relating to optimality and issues relating to the multiplicity of equilibria. We begin by describing an economy that, except for the infinity of generations, is as simple as possible. We have an infinite succession of dates t = 0, I, ... and in every period a single consumption good. For every I there is a generation born at time I, living for two periods, and having utility function u(c." COl) where c., and c.. are, respectively, the consumption of the Ith generation when young (i.e., in period C), and its consumption when old (i.e., in period 1 + I); the indices b and a are mnemonic symbols for "before" and "after." Note that the utility functions of the dilTerent generations over consumption in their lifespan are identical. We assume that 11(', .) is quasiconcave, differentiable and strictly increasing. Every generation 1 is endowed when young with a unit of a primary factor (e.g., labor). This primary factor does not enter the utility function and can be used to produce consumption goods contemporaneously by means of some production function J(:)." Say that J(I) = I. Under the competitive price-taking assumption, total profits at C, in terms of period-I good, will be f. = I - 1'(1) and, correspondingly, labor payments will be I - t. Thus, we may as well directly assume that the initial endowments of generation 1 0 units of consumption good. Now let (Po, ... , p" ... ) be an infinite sequence of (anticipated) prices. We do not require that it be bounded. For the budget constraint of the different generations we take P,C.,

+ P.. ,Ca, $; (I

- t)p,

for t > 0

(20.H.I)

and POC.O

+ P'CaO

$;

(1- t)po

+ t(~

p,) + M.

(20.H.2)

These budget constraints deserve comment. For 1 > 0, (20.H.I) is easy to interpret. The value of the initial endowments, available at I, is (I - t)p,. Part of this amount is spent at time 1 and the rest, (I - t)p, - P,c." is saved for consumption at t + I. The saving instrument could be the title to the technology, which would thus be bought from the old by the young at 1 and then sold at 1 + I to the new young (after collecting the period t + I return). The price paid for the asset is the amount saved, that is, (I - 0) p, - P,c.,. The direct return at 1 + I is tp, + I and so, if the asset market is to be in equilibrium, the selling price at t + I should be (I - t)p, - P,C., - cp,+ ,. In summary, in agreement with the budget constraint (20.H.l) this leaves (I - ")p, - P,C., to be spent at 1 + I. The constraint (20.H.2) for 1 = 0 is more interesting. Its right-hand side is the value of the asset to generation O. Note that asset market equilibrium requires that

33. The assumption thal production is contemporary with input usage fits well with the lenglh

or I he period being long.

20. H:

0 Y E R lAP PIN G

G ENE RAT ION.

771

----------------------------------------------------------------------------this value should be at least the Jundamental value, that is, t(l:, p,).)' Indeed, the value of the asset at t = 0 equals the profit return EPO plus the price paid by the young of generation I. At any T, the price paid by the young of generation Tshould not be inferior to the direct return tPT+ I' In turn, at T - 1 it should not be inferior to the direct return plus the value at T; that is, it should be at least t(PT + PT+ I)' Iterating, we get the lower bound t(p, + ... + PT+ ,) for the price paid by generation I, which, going to the limit and adding tpo, gives tel:, p,) as a lower bound for the value to generation O. Thus, in terms of expression (20.H.2) a necessary condition for equilibrium is M 0). We did not do so in Sections 20.D or 20.G because with a finite number of consumers, bubbles are impossible at equilibrium. The equality of demand and supply implies that the (finite) value of total endowments plus total profits equals the value of total consumption, and therefore at equilibrium no individual value of consumption can be larger than the corresponding individual value of endowments and profit wealth (you should verify this in Exercise 20.H.I). We will see shortly that under some circumstances bubbles can occur at equilibrium with infinitely many consumers. It would therefore not be legitimate to eliminate them by definition. The definition of a Walrasial1 equilibrium is now the natural one presented in Definition 20.H.1. Defin1tion 20.H.1: A sequence of prices (Po • ...• P, • ... ). an M 0, there is a single price sequence (with Po = I) that can be continued indefinitely, and therefore a single equilibrium path.

34. Strictly speaking, we are saying that if the consumption good prices are given by (Po.···. P,.···) and the asset prices present no arbitrage opportunity, then the price of the asset should be at least as large as its fundamental value.

772

SEC T ION

CHAPTER

20:

EOUILIBRIUM

AND

TIME

~~~~~~~----------------------

Suppose first that s > O. We say then that the asset is real (it has "real" returns). At an equilibrium the wealth of generation 0, (1 - s)Po + seL p,) + M, must be finite (how could this generation be in equilibrium otherwise?). Therefore, if s > 0, it follows that L, p, < OOB An important implication of this is that the aggregate (Le., added over all generations) wealth oj society, which is precisely L, P.. isjinite. In Proposition 20.H.1 we now show that, as a consequence, the first welfare theorem applies for the model with, > O.

Offer Curve

Proposition 20.H.1: Any Walrasian equilibrium (Po,." ,p" .. .), {{c~,. c:,l},=o, with L, Pt < 00 is a Pareto optimum; that is, there are no other feasible consumptions {{Cb" ca,)};':,o such that u{cb" Ca,) :? U{Cb,. c:,) for all t :? 0, with strict inequality for some t.

"'' -:)1-,

f',', , \'" "",_-+, '{'" I ___

Proof: We repeat the standard argument. Suppose that {(c... c.,)}~o Pareto dominates {(e:.. c:,J},":.o· From feasibility, we have c:, + C:.'_I = I and c., + C•. '_I :; 1 for every I. Therefore, L, p,(c:' + C:.,_I) = L, p, and L, p,(c., + C•. '_I) :; L, p,. Because L, p, < 00, we can rearrange terms and get

Figure 20.H.l

~""l--

C. 2

L'--C-'.-,---c...L,,~l"'-'--t+-~-C-o-'n~s-umplion in

first Period of Life

Overlapping generations: construction of

the equilibrium (case t > 0).

L (Pt c., + p,+ I C.,) :; L (p,c:' + p,+ , C:,) = L, p, < 00. Because the utility function is increasing and (c:" e:') maximizes utility in the p,c:, + p, + IC:, for every t, with at budget set we conclude that P,c., + PH I least one strict inequality. Therefore, L, (p" C.I + p,+ IC.,) > L, (p,c:, + p,+ IC:')Contradiction. _

C., ;::

Consumption in Second Period of Life 45' =

0 V E R LAP PIN G

Pareto Optimality

Consumption in Second Period of Life

c.,

2 0 _ H:

~

•

No-Trade Sleady-Stale COl

C.o

Consumption in first Period of Life

G ENE RAT ION S

773

------------------------------------------------------------------------------

Proposition 20.H.1 is important but it is not the end of the story. Suppose now that the asset is purely nominal (i.e., , = 0; for example, the asset could be fiat money, or ownership of a constant returns technology). Then it is possible to have equilibria that are /lot optimal. I n fact, it is easy to see that we can sustain autarchy (i.e., no trade) as an equilibrium_ Just put M = 0 (no bubble, worthless fiat money) and choose (Po, ... , p". _.) so that, for every t, the relative prices p,/p, + I equal the marginal rate of substitution of u(', .) at (I, 0), denoted by p. This no-trade stationary state (also called tlte nonmonetary steady state) where every generation consumes (1,0) is represented in Figure 20.H.2. As it is drawn (with p < I), we can also see that the no-trade outcome is strictly Pareto dominated by the steady state (y, 1 - y) [or, more precisely, by the consumption path in which generation 0 consumes (I, 1 -)') and every other generation consumes (y, 1 - y)]. What is going on is simple: in this example the open-ended ness of the horizon makes it possible for the members of every generation I to pass an extra amount of good to the older generation at t and, at the same time, be more then compensated by the amount passed to them at 1+ 1 by the next generation_ Note that, in agreement with Proposition 20.H.I, the lack of optimality of this no-trade equilibrium entails p,/p, + I = P< I for all t; that is, prices increase through time. It is also possible in the purely nominal case for an equilibrium with M > 0 not to be Pareto optimal. Note first if {(e:,. c:,)},"';,o, (Po"", P..... J and M constitute an

Figure 20_H.2

Overlapping generations: construction of

equilibria (case t = 0)

It corresponds to the stationary consumptions (y, 1 - y) and the price sequence p, = 0:', where 0: = (I - s - y)/(l - y) < L Note that the iterates that ~gin at a ~alue c.o '" 1 - y unavoidably "leave the picture," that is, bec~me unfe~s~~le. In Flg~re 20.H.2, where s = 0, there is a continuum of equlhbna: any tnltlal conditIOn

C.o :; 1 - y can be continued indefinitely. .. . It is plausible from Figures 20.H.1 and 20.H.2 that the existence of an eqUlllbnum can be guaranteed under general conditions- This is indeed the case [see Wilson

35. You can also verify this graphicaJly by examining Figure 20.H.l.

(1981)].

1

774

CHAPTER

20:

EQUILIBRIUM

AND

TIME

equilibrium, then we have (recall that cto = I) p,+ ,c:,

= p,(1

- ct,)

= p,c:,,_, = ... = p,e:o = M

for every t.

-

c.,

/

Monetary Sleady State

1-",

Consumption in First Period of Life

0 V E R LAP PIN G

ProposItion 20.H.2: Suppose that at an equilibrium we have LtP t
Y we have a nonstationary equilibrium trajectory with trade (hence M > 0) which is also strictly Pareto dominated by the steady state (y, I - y). Nonetheless, it is still true that for any equilibrium with COl> Y we have Mfp, -+ 0; that is, in real terms the value of the asset becomes vanishingly small with time. For CO, = y, matters are quite different. We have a steady-state equilibrium (called the monetary steady state) in which the price sequence p, is constant and therefore the real value of money remains constant and positive. This monetary steady state is the analog of the golden rule of Section 20.E and, as was the case there, we have that. in spite of L, p, < 00 being violated. the monetary steady state is Pareto optimal. We will not give a rigorous proof of this. The basic argument is contained in Figure 20.H.3. There we represent the indifference curve through (y, I - y) and check that any attempt at increasing the utility of generation 0 by putting c.. < y leads to an unfeasible chain of compensations; that is, it cannot be done. The discussion just carried out of the examples in Figures 20.H.2 and 20.H.3 suggests and confirms the following claim, which we leave without proof: In the purely nominal case, of all equilibrium paths the Pareto optimal ones are those, and only those, that exhibit a bubble whose real value is bounded away from zero throughout time. It is certainly interesting that a bubble can serve the function of guaranteeing the optimality of the equilibria of an economy, but one should keep in mind that this happens only because an asset is needed to transfer wealth through time. If a real asset exists then this asset can do the job. If one does not exist then the economy, so to speak, needs to invent an asset. To close the circle, we point out that if there is a real asset then not only is a bubble not needed but, in fact, it cannot occur.

c.,

SEC T ION

G ENE RAT ION S

775

-------------------------------------------~~~~~~~~

We have already seen, in Figure 20.H.2, a model with a purely nominal asset (i.e., c = 0) and very nicely shaped preferences (the offer curve is of the gross substitute type) for which there is a continuum of equilibria. Of those, one is the Pareto optimal monetary steady state and the rest are nonoptimal equilibria where the real value of money goes to zero asymptotically. The existence of this sort of indeterminacy is clearly related to the ability to fix with some arbitrariness the real value of money (the "bubble") at t = 0, that is Mfpo. It cannot occur if bubbles are impossible, as, for example, in the model with a real asset (i.e., £ > 0) where, in addition, we know that the equilibrium is Pareto optimal. One may be led by the above observation to suspect that the failure of Pareto optimality is a precondition for the presence of a robust indeterminacy (i.e., of a continuum of equilibria not associated with any obvious coincidence in the basic data of the economy). This suspicion may be reinforced by the discussion of Section 20.G, .where we sa~ that the Pareto optimality of equilibria was key to our ability to claim the generic determinacy of equilibria in models with a finite number of consumers. Unfortunately, with overlapping generations the number of consumers is infinite in a fundamental way,'" and this complicates matters. Whereas with a rear asse~ the Pareto optimality of equilibria is guaranteed and the type of indeterminacy of Figure 20.H.2 disappears, it is nevertheless possible to construct nonpathological examples with a continuum of equilibria. The simplest example is illustrated in Figure 20.H.4. The figure describes a real-asset model with the steady state (y, I - }'). Suppose that, in a procedure we have resorted to repeatedly, we tried to construct an equilibrium with c. o slightly different from I - y. Then, normalizing to Po = I, we would need to use p, to clear the market of period 0, P2 to do the same for period I, and so on. In the leading case of Figure 20.H.I, we have seen that this eventually becomes unfeasible. A change in p, that takes care of a disequilibrium at t - I creates an even larger disequilibrium at t, which then has to be compensated by a change of a larger magnitude in p, + I In an explOSive process that finally becomes impossible. But in Figure 20.H.4, the utility function is such that, at the relative prices of the steady state, a change in the pnce of the second-period good has a larger impact on the demand for the first-period good than on the demand for the second-period good. Hence. the successive adjustments necessitated by an initial disturbance from c = 1 _ " dampen with each iteration and can be pursued indefinitely. We conclud~ that a~ equilibrium exists with the new initial condition. As a matter of terminology, the

, 36. By this v k, and by g(k' - k) = 0 for k' S k. Say as much as you can about the policy. In particular, determine the steady·state trajectory of investment. 20.D.6 8 Verify the claim made in the proof of Proposition 20.0.7 that the Euler equations (20.0.9) are the first-order necessary and sufficient conditions for short-run optimization. In other words: they are necessary and sufficient for the nonexistence of an improving trajectory differing from the given one at only a finite number of dates. 20.D.7 A With reference to Example 20.0.4, show that, for the functional forms given, the Euler equations are as indicated in the example: k,., = 3k, - 2k,., for every I. Also verify that the solution to this difference equation given in the text is indeed a solution, that is, that it satisfies the equation. 20.D.8 A Verify that the value function V(k) does satisfy the properties (i) and (ii) claimed for it at the end of Section 20.0. 20.D.9 A Argue that the properties (i) and (ii) of the value function referred to in Exercise 20.0.8 yield the two consequences, concerning V'(k) and V"(k), claimed at the end of Section 20.0. 20.E.IA Discuss in what sense the term r defined after the proof of Proposition 20.E.1 can be interpreted as the rate of interest implicit in the proportional price sequence. 20.E.2" Suppose that the production set Y c RL is of the constant return type and consider production paths that are proportiollal (but not necessarily stationary), that is, paths (Yo .. ..• .1'..... ) that satisfy .1', = (I + II)Y,., for aliI and some II. (a) Argue that the conclusion of Proposition 20.E.1 remains valid for proportional paths. (b) State and prove the result parallel to Proposition 20.E.2 for proportional paths. 20.E.3" Suppose that in the Ramsey-Solow model k solves Max (F(k, I) - k) (see Figure 20.E.2). Show that if k, S k - • for aliI, then the path determined by (k o,' .. , k" ... ) is efficient. [Hilll: Compute prices and verify the transversality condition.] 20.EAA Prove the three neoclassical properties stated at the end of the regular type part of Section 20.E. 20.E.S A Carry out the requested verification of expression (20.E. I).

E X E R CIS E S

785

------------------------------------------------------------------20.E.6

A

Carry out the verification requested in the discussion of Figure 20.E.3. A 20.E.7 In the Ramsey-Solow model. two dilTerent steady states are associated with different rates of interest. This is not so in the example illustrated in Figure 20.E.3, at first sight very similar. The key difference is that in the Ramsey-Solow model the consumption and investment goods are perfect substitutes in production. Clarify this by proving, in the context of the example underlying Figure 20.E.3, that if the two goods are perfect substitutes then r(k) # r(k) whenever k # i:. [Hilll: Their being perfect substitutes means that G(k, k' + a) = G(k, k') - a for any a < F(k. k'j.] A 20.E.8 Consider the proportional production paths with rate of growth equal to II > 0 (recall Exercise 20.E.2) in the context of a Ramsey-Solow technology of constant returns. Show that among these paths the one that maximizes surplus (at I = I, or, equivalently, normalized surplus or surplus" per capita") is characterized by having the rate of interest equal to II. This path is also called the lIoldell rule sleady slale palh. A 20.E.9 Argue that, for the one-Consumer model of Section 20.0, the golden rule path cannot arise as part of a competitive equilibrium. [Hint: The key fact is that J < I.] c 20.F.l Consider two arbitrary functions y,(w) and y,(w) that are defined for w > 0, take nonnegalive values, and satisfy y,(w) + y,(w) = w for all w. Suppose also that they are twice continuously ditTerentiable. Show thaI for any a > 0 there is a utilily function for lwo commodities. u(x" x,), that is increasing and concave on the domain {(x"x,): x, + x, S a} and is such that (y,(w), y,(w)) coincides wilh lhe Engel curve functions for prices p, = I, p, = I and wealth w < a. [Hinl: Let u(x" x,) ~ (x, + x,)'/2 - £[(x, - y,(x, + x,))' + (x, - y,(x, + x,))'] and take £ to be small enough. Verify then that Vu(x" x,) is strictly positive and D'u(x" x,) is negative definite for any (x,. x,) such that 0 < x, + x, SiX. and that the Engel Curve is as required.] 20.F.2A Suppose lhat. for k E R., the policy function ' .. , "'I) E R ,

where "'i takes the value 1,0, or -I according to whether agent i prefers alternative x to alternative y, is indifferent between them, or prefers alternative y to alternative x, respectively.' Definition 21.B.1: A social welfare functional (or social welfare aggregator) is a rule F(~" ... , "',) that assigns a social preference, that is,F(a., ... , "',) E {-1, O. 1}. to every possible profile of individual preferences (IX ••...• a,) E { -1. O. 1}'. All the social welfare functionals to be considered respect individual preferences in the weak sense of Definition 2I.B.2. Definition 21.B.2: The social welfare functional F(", ••.. .• "',) is Paretian, or has the Pareto property. if it respects unanimity of strict preference on the part of the agents. that is. if F(l • ...• 1) = 1 and F( -1 •...• -1) = - I. Example 2I.B.I: Paretian social welfare functionals between two alternatives abound. Let (fJ, •...• fJ,) E IR'+ be a vector of nonnegative numbers, not all zero. Then we

2 •• 8:

ASP Eel A l e A S E:

SOC I ALP REF ERE NeE SOY E R

TWO

could define F(IX" ... , ",,)

= sign 2., Pi(X',

where. recall, for any a E R, sign a equals I, 0, or -I according to whether a > 0,

a = 0, or a < 0, respectively. An important particular case is majority voting, where we take p, = I for every i. Then F(IX, •... , ",,) = I if and only if the number of agents that prefer alternative x to alternative y is larger than the number of agents that prefer y to x. Similarly, F(a, •...• a,) = -I if and only if those that prefer y to x are more numerous than those that prefer x to y. Finally, in case of equality of these two numbers, we have F(IX" ...• IX,) = 0, that is. social indifference. a Example 2I.B.2: DictatorslJip. We say that a social welfare functional is dictatorial if there is an agent h, called a dictator, such that, for any profile ("'I" .. ,"',), "'. = I implies F(cx" ... , "',j = I and, similarly, "'. = -I implies F("" , •.• , "',j = -I. That is. the strict preference of the dictator prevails as the social preference. A dictatorial social welfare functional is Paretian in the sense of Definition 2I.B.2. For the social welfare functionals of Example 21.B.1, we have dictatorship whenever "'. > 0 for some agent It and "'i = for i ¥ It. since then F(a" ... , a,) = "' •. a

°

The majority voting social welfare functional plays a leading benchmark role in social choice theory. In addition to being Paretian it has three important properties, which we proceed to state formally. The first (symmetry among agents) says that the social welfare functional treats all agents on the same footing. The second (neutrality between alternatives) says that, similarly, the social welfare functional does not a priori distinguish either of the two alternatives. The third (positive responsiveness) says, more strongly than the Paretian property of Definition 2I.B.2, that the social welfare functional is sensitive to individual preferences. Definition 21.B.3: The social welfare functional F("' ...... "',) is symmetric among agents (or anonymous) if the names of the agents do not matter. that is. if a permutation of preferences across agents does not alter the social preference. Precisely. let 1I:{1, ... . /} -+ {1 ....• 1} be an onto function (i.e., a function with the property that for any i there is h such that 1I(h) = i). Then for any profile (a, •...• IX,) we have F(cx •. ... ,,,,,) = F(IX'I')'" . ,a,I'I)' Definition 21.B.4: The social welfare functional F("', •.... :x,) is neutral between alternatives if F(a, •...• a,) = -F(-IX, •... , -:x,) for every profile (:x, •...• a,). that is. if the social preference is reversed when we reverse the preferences of all agents.

rormally the principle involved. Note. in particular. that this specification precludes the usc of any

Definition 21.B.S: The social welfare functional F(~, •. ..• :x,) is positively responsive if. whenever (IX, •...• 7,) ~ (ex; .... • ~;). ("', ....• IX,) ¥ ("'; .... , IX;). and F(",; • ...• cx;) ~ O. we have F(a, • ...• (1.,) = + 1. That is. if x is socially preferred or indifferent to y and some agents raise their consideration of x. then x becomes socially preferred.

"cardinal" or "intensity" information between the two alternatives because this intensity can only be calibrated (perhaps using lotteries) by appealing to some third alternative. A rortiori, the specification also precludes the comparison of feelings of pleasure or pain across individuals. In Chapter 22, we discuss in some detail matters pertaining to the issue of interpersonal comparability of utilities.

It is simple to verify that majority voting satisfies the three properties of symmetry among agents, neutrality between alternatives, and positive responsiveness (sec Exercise 21.B.I). As it turns out, these properties entirely characterize majority voting. The result given in Proposition 21.B.1 is due to May (1952).

I. In the whole of this chapter we make the restriction that only the agents' rankings between the two alternatives matter for the social decision between them. In Section 21.C we will state

A L T ERN AT, V E S

791

792

CHAPTER

.,:

SOCIAL

CHOICE

THEORY

Proposition 21.B.1: (May's Theorem) A social welfare functional F(IX" ... ,lXd is a majority voting social welfare functional if and only If it is symmetric among agents, neutral between alternatives, and positive responsive. Proof: We have already argued that majority voting satisfies the three properties. To establish sufficiency note first that the symmetry property among agents means that the social preference depends only on the total number of agents that prefer alternative x to y, the total number that are indifferent, and the total number that prefer y to x. Given (IX., ... ,IX,), denote n+(IX., ... ,IX,) = #(i:IX,= I),andn-(IX., ... ,IX,)= #(i:IX,= _1).2 Then symmetry among agents allows us to express F(IX., ... , IX,) in the form F(IX., ... , IX,) = G(n+(IX., ... , IX,), n-(IX., ... , IX,». Now suppose that (IX., ... ,IX,) is such that n+(IX ...... IX,) = n-(IX., ... ,IX,). Then 11+( -IX I, ... , -IX,) = n-(IX., ... , IX,) = n+(IX., ... , IX,) = n-( -IX ..... , -IX,), and so

--

SECTION

".C:

THE

GENERAL

CASE:

ARROW'S

IMPOSSIBILITY

there are I agents, indexed by i = I, ...• J. Every agent i has a rational preference relation ::::, defined on X. The strict preference and the indifference relation derived from ::::, are denoted by >-, and -" respectively.3 In addition, it will often be convenient to assume that no two distinct alternatives are indifferent in an individual preference relation ::::,. It is therefore important, for clarity of exposition, to have a symbol for the set of all possible rational preference relations on X and for the set of all possi ble preference relations on X having the property that no two distinct alternatives are indifferent. We denote these sets, respectively, by fit and 9. Observe that 9 c: .'11." In parallel to Section 21.B, we can define a social welfare functional as a rule that assigns social preferences to profiles of individual preferences (::::" ... , ::::,) E fit'. Definition 21.C.1 below generalizes Definition 21.B.I in two respects: it allows for any number of alternatives and it permits the aggregation problem to be limited to some given domain .O=IX;+ •. Therefore, by the positive responsiveness property, we must have F(IX., ... ,IX,) = I. In turn, if n-(IX., ... ,IX,) > n+(IX ..... ,IX,) then n+(-IX ..... ,-IX,» n-( -(X., ... , -IX,) and so F( -IX ..... , -IX,) = I. Therefore, by neutrality among alternatives: F(IX., ... ,IX,)= -F(-IX., ... , -IX,) = -I. We conclude that F(IX., ... , IX,) is indeed a majority voting social welfare functional. _ In Exercise 2I.B.2, you are asked to find examples dilTerent from majority voting that satisfy any two of the three properties of Proposition 21.8.1.

2LC The General Case: Arrow's Impossibility Theorem We now proceed to study the problem of aggregating individual preferences over any number of alternatives. We denote the set of alternatives by X, and assume that

Definition 21.C.1: A social welfare functional (or social welfare aggregator) defined on a given subset .j is irrcnexive (x >jX cannot occur) and transitive (x >j'y and y >jX implies x >iX), Similarly, -j is fI:nexive (x -jX for all x EX). transitive (x - j y and)' -jZ implies x -,x) and symmetric (x - j y implies Y - j x). 4. Formally, the preference relation ~j belongs to ~ if it is reflexive (x ~I x for every x EX). transitive (x ;::i y and )' ~j z implies x ~I x) and IOtal (if x "# y then either x ~i y or y ~i x, but not

bOlh). Such preference relations are often referred to as slricl preferences (although Slricl-lolal preferem'('''' would be less ambiguous) or even as linear orders, because these are the properties of the usual "larger than or equal to" order in the real line. 5. In particular, there are no individual utility levels and, therefore. there is no meaningful sense

in which any conceivable information on individual utility levels could be compared and matched up. We refer again to Chapter 22 (especially Section 22.D) for an analysis of the problem that 2. Recall the notation # A = cardinality of the set A = number of clemen Is in the set A.

focuses on the information used in the aggregation process.

THEOREM

793

794

C HAP T E R

2,:

• 0 CI AL

C HOI C E

THE 0 R Y

----------------------------------------------------------------------~

Definition 21.C.2: The social welfare functional F:sI -+ fJl is Paretian if. for any pair of alternatives {x. y} c X and any preference profile (1::;, •...• 1::;,) e sI. we have that x is socially preferred to y. that Is. x Fp(I::;, •...• 1::;,) y. whenever x >-1 y for every i. In Example 2l.C.l we describe an interesting class of Paretian social welfare functionals. Example 21.C.l: The Borda Count. Suppose that the number of alternatives is finite. Given a preference relation 1::;, e fJl we assign a number of points c,(x) to every alternative x e X as follows. Suppose for a moment that in the preference relation 1::;, no two alternatives are indifferent. Then we put c,(x) = n if x is the nth ranked alternative in the ordering of 1::;,. If indifference is possible in 1::;, then c,(x) is the average rank of the alternatives indifferent to x. 6 Finally. for any profile (1::;"" .• 1::;,) E fJl' we determine a social ordering by adding up points. That is. we let F(I::;, •...• I::;,)efJl be the preference relation defined by xF(I::; ...... I::;,)y if L, c,(x) ~ L, c,(y). This preference relation is complete and transitive [it is represented by the utility function -c(x) = - L,C,(X)]. Moreover. it is Paretian since if x >-, y for every i then c,(x) < c,(y) for every i. and so L, c,(x) < L, c,(y) . • We next state an important restriction on social welfare functionals first suggested by Arrow (1963). The restriction says that the social preferences between any two alternatives depend only on the individual preferences between the same two alternatives. There are three possible lines of justification for this assumption. The first is strictly normative and has considerable appeal: it argues that in settling on a social ranking between x and y. the presence or absence of alternatives other than x and y should not matter. They arc irrelevant to the issue at hand. The second is one of practicality. The assumption enormously facilitates the task of making social decisions because it helps to separate problems. The determination of the social ranking on a subset of alternatives does not need any information on individual preferences over alternatives outside this subset. The third relates to incentives and belongs to the subject matter of Chapter 23 (see also Proposition 2I.E.2). Pairwise independence is intimately connected with the issue of providing the right inducements for the truthful revelation of individual preferences. Definition 21.C.3: The social welfare functional F:sI .... tJt defined on the domain sI satiSfies the pairwise independence condition (or the independence of irrelevant alternatives condition) if the social preference between any two alternatives {x. y} c X depends only on the profile of individual preferences over the same alternatives. Formally'. for any pair of alternatives {x. y} eX. and for any pair of preference profiles (1::;, ....• 1::;,) E d and (1::;., •...• 1::;;) E d with the property that. for every i. Xl::;iY xl::;;y and yl::;iX yl::;;x. 6. Thus if X = {x. y. z} and x~, y -, z then e,(x) = 1. and e,(y) = e,(z) = 2.5. 7. The expressions Ihat follow are a bil cumbersome. We emphasize Iherefore thaI Ihey do nothing more than to capture formally the statement just made. An equivalent formulation would be: for any {x.y} c: X. if ~,!{x.y} = ~;!{x.y} for all i. then F(~, ..... ~r)!{x.y} = F(;:'" ... , ;:;)1 {x. y}. Here;: !{x, y} 51ands for Ihe restriction of Ihe preference ordering;: 10 Ibe set {x, y}.

SECT'ON

'1.C:

THE

GENERAL

CASE:

ARROW'S

'MPOSSIB'L'TY

THEOREM

795

--------------------------------------------------------------------------we have that xF(I::;, •...• 1::;,) y

xF(I::;; ..... 1::;;) y

yF(~, •...• ~,)

yF(~·, •...• I::;;)x.

and

x

Example 21.C.l: conrinued. Alas. the Borda count does not satisfy the pairwise independence condition. The reason is simple: the rank of an alternative depends on the placement of every other alternative. Suppose. for example. that there are two agents and three alternatives {x. y. z}. For the preferences x >-, z >-, y.

y>-,x>-,z we have that x is socially preferred to y [indeed. e(x) = 3 and ely) = 4]. But for the preferences

x >-; y >-', z. y>-,z>-,x we have that .v is socially preferred to x [indeed. now c(x) = 4 and ely) = 3]. Yet the relative ordering of x and y has not changed for either of the two agents. For another illustration, this time with three agents and four alternatives {x. y. t. w}. consider

z >-, X

>-, J' >-, lV.

z >-,x >-,y>-, w. y

>- , z >-, w >- J x.

Here. y is socially preferred to x [c(x) = 8 and ely) = 7]. But suppose now that alternatives z and w move to the bottom for all agents (which because of the Pareto property is a way of saying that the two alternatives are eliminated from the alternative set): x >-', y >-', z >-', IV.

>-2 y >-, z >-', w. y >-3X >-3 z >-', IV. x

(21.C.l)

Then x is socially preferred to y [e(x) = 4. c(y) = 5]. Thus the presence or absence of alternatives z and w matters to the social preference between x and y. Another modification would take alternative x to the bottom for agent 3:

x >-;}' >-; z >-';

lV,

>-:;y >-:;z >-:; lI', }' >-; z >-; IV >-; x. X

Now .r is socially preferred to x [which. relative to the outcome with (21.C.l). is a nice result from the point of view of agent 3]. • The previous discussion of Example 21.C.l teaches us that the pairwise independence condition is a substantial restriction. However. there is a way to proceed that will automatically guarantee that it is satisfied. It consists of determining the social prefcrence between any given two alternatives by applying an aggregation rule that uses only the information about the ordering of rhese two alternatives in

796

CHAPTER

21:

SOCIAL

CHOICE

THEORY

individual preferences. We saw in Section 21.B that, for any pair of alternatives, there are many such rules. Can we proceed in this pairwise fashion and still end up with social preferences that are rational. that is. complete and transitive? Example 21.C.2 shows that this turns out to be a real difficulty. Example 21.C.2: The Condorcet Paradox." Suppose that we were to try majority voting among any two alternatives (see Section 21.B for an analysis of majority voting). Does this determine a social welfare functional? We shall see in the next section that the answer is positive in some restricted domains d c !Jtl • But in general we run into the following problem. known as the Condorcet paradox. Let us have three alternatives {x. y. z} and three agents. The preferences of the three agents are

x >-. y >-1 z.

----

--

SECTION

21.C:

THE

GENERAL

CASE:

ARROW'S

IMPOSSIBILITY

Definition 21_C.4: Given F('), we say that a subset of agents ScI is: (i) Decisive for x over y if whenever every agent in S prefers x to y and every agent not in S prefers y to x, x is socially preferred to y. (ii) Decisive if, for any pair {x, y} eX, S is decisive for x over y. (iii) Completely decisive for x over y if whenever every agent in S prefers x to y, x is socially preferred to y. The proof will proceed by a detailed investigation of the structure of the family of decisive sets. We do this in a number of small steps. Steps 1 to 3 show that if a subset of agents is decisive for some pair of alternatives then it is decisive for all pairs. Steps 4 to 6 establish some algebraic properties of the family of decisive sets. Steps 7 and 8 use these to show that there is a smallest decisive set formed by a single agent. Steps 9 and 10 prove that this agent is a dictator.

Z>-2X>-2Y. Y >- 3Z>- 3x.

Then pairwise majority voting tells us that x must be socially preferred to Y (since x has a majority against y and. a fortiori. y does not have a majority against x). Similarly. y must be socially preferred to z (two voters prefer y to z) and z must be socially preferred to x (two voters prefer z to x). But this cyclic pattern violates the transitivity requirement on social preferences. _ The next proposition is Arrow's impossibility theorem. the central result of this chapter. It essentially tells us that the Condorcet paradox is not due to any of the strong properties of majority voting (which. we may recall from Proposition 2I.B.l. are symmetry among agents, neutrality between alternatives, and positive responsiveness). The paradox goes to the heart of the matter: with pairwise independence there is no social welfare functonal defined on !Jtl that satisfies a minimal form of symmetry among agents (no dictatorship) and a minimal form of positive responsiveness (the Pareto property). Proposition 21.C.1: (Arrow's Impossibility Theorem) Suppose that the number of alternatives is at least three and that the domain of admissible individual profiles. denoted d, is either d = 91 1 or d = iJl l • Then every social welfare functional F:.flI -+ !Jt that is Paretian and satisfies the pairwise independence condition is dictatorial in the following sense: There is an agent h such that, for any {x, y} eX and any profile (;::; I' . . . , ;::;/) ed, we have that x is socially preferred to y. that is, x Fp (;::;, •••. , ;::;d y, whenever x >-hY' Proof: We present here the classical proof of this result. For another approach to the demonstration we refer to Section 22.D. It is convenient from now on to view I not only as the number but also as the set of agents. For the entire proof we refer to a fixed social welfare functional F: d -+ !Jt satisfying the Pareto and the pairwise independence conditions. We begin with some definitions. In what follows, when we refer to pairs of alternatives we always mean distinct alternatives.

8. This example was already discussed in Section 1.8.

Step I: If for some {x. y} c X. ScI is decisive for x over y. then. for any alternative z # x. S is decisive for x over z. Simiiarly,Jor any z # y. S is decisive for z over y. We show that if S is decisive for x over y then it is decisive for x over any z # x. The reasoning for Z over y is identical (you are asked to carry it out in Exercise 21.C.l). If z = y there is nothing to prove. So we assume that z # y. Consider a profile of preferences (;::; ••...• ;::;1) e d where

for every i e S and for every i e I \S. Then. because S is decisive for x over y. we have that x is socially preferred to y, that is. xF,(;::; ••...• ;::;/)Y' In addition. since y;::;/z for every iel. and F(') satisfies the Pareto property it follov.s that y F,(;::; ...... ;::;1) z. Therefore. by the transitivity of the social preference relation. we conclude that x F,- x (i.e., such Ihal y ~ x bUI nol x ~ y). Thus, for any integer M we can find a

chain

x' >- x' >-. ,,>- x". where x· E X' for every m =

I .... , M. If M is larger than the number

of alternatives in X', then there must be some repetition in this chain. Say that x·' = x'" for m > m', By quasi transitivity. x"" >- x'" = x"", which is impossible because >- is irrenexive by definition. Hence, ;:: must be acyclic. An example or an acyclic but not quasitransitive relation will be given

in Example 2I.D.2. The relation >- derived from a rational prererence relation ~ is transitive (Proposition I.B.J). An example of a quasitransitive. but not rational. prererence relation is given

in Example 2t.D.t.

2 1 • D:

S0 ME

P D S SIB I LIT Y

RES U L T S:

RES T RIC TED

{v, w} c {x, y, z} we have that v is socially at least as good as w if either V;::I w, or v = y, w = x and v >- 2 w. In Exercise 21.0.3 you should verify that the social preferences so defined are acyclic but not necessarily quasitransitive. _

Single-Peaked Preferences We proceed now to present the most important class of restricted domain conditions: single-peakedness. We will then see that, in this restricted domain, nondictatorial aggregation is possible. In fact, with a small qualification, we will see that on this domain pairwise majority voting gives rise on this domain to a social welfare functional. Definition 21.0.2: A binary relation
x

y

then

z >- y

and If Y

> z - y.

In words: There is an alternative x that represents a peak of satisfaction and, moreover, satisfaction increases as we approach this peak (so that, in particular, there cannot be any other peak of satisfaction). Example 21.0.4: Suppose that X = [a,b] c R and ~ is the "greater than or equal to" ordering of the real numbers. Then a continuous preference relation;:: on X is single peaked with respect to ~ if and only if it is strictly convex, that is, if and only if, for every WE X, we have ay + (I - a)z >- W whenever y;:: W, Z ;:: w, y '" z, and a E (0,1). (Recall Oefinition 3.B.5 and also that, as a matter of definition, preference relations generated from strictly quasiconcave utility functions are strictly convex.) This fact accounts to a large extent for the importance of single-peakedness in economic applications. The sufficiency of strict convexity is actually quite simple to verify. (You are asked to prove necessity in Exercise 21.0.4.) Indeed, suppose that x is a maximal element for;::, and that, say, x > z > y. Then x ;:: y, y ;:: y, x '" y, and Z = ax + (I - a)y for some a E (0, I). Thus, z >- y by strict convexity. In Figures 2I.D.I and 21.0.2, we depict utility functions for two preference relations on X = [0, I]. The preference relation in Figure 21.0.1 is single peaked with respect to ;::" but that in Figure 21.D.2 is nol. _ Definition 21.0.4: Given a linear order : c 9t the collection of all rational preference relations that are single peaked with respect to
D 0 M A INS

801

802

CHAPTER

21:

SOCIAL

CHOICE

THEORY

---------------------------------------------------------------Ftgure 21.D.1 (left)

Utility

Utility

Preferences are sing). peaked with resP e ar~ and any pair {x, y} c X, we put x i(::::" ... , ::::, )y, to be read as "x is socially at least as good as y", if the number of agents that strictly prefer x to y is larger or equal to the number of agents that strictly prefer y to x, that is, if #(ie/:x>-IY};;:: #{ie/:y>-,x}. Note that, from the definition, it follows that for any pair {x, y} we must have either x i(::::" ... , ::::,) y or y i(::::" ... , ::::,) x. Thus, pairwise majority voting induces a complete social preference relation (this holds on any possible domain of preferences). In Exercise 21.0.5 you are asked to show in a direct manner that the preferences of the Condorcet paradox (Example 2I.C.2) are not single peaked with respect to any possible linear order on the alternatives. In fact, they cannot be because, as we now show, with single-peaked preferences we are always assured that the social preferences induced by pairwise majority voting have maximal elements, that is, that there are alternatives that cannot be defeated by any other alternatives under majority voting. Let (::::,' ... , ::::,) e ar~ be a fixed profile of preferences. For every i e I we denote by XI e X the maximal alternative for ::::, (we will say that Xi is "i's peak"). Definition 21.0.5: Agent h e I is a median agent for the profile (::::1' ... , ::::,) e ar~ if #{ie/:x;;;::xh};;::2I

and

I # {'le/:xh;;::xi } ~2'

A median agent always exists. The determination of a median agent is illustrated in Figure 2I.D.3. If there are no ties in peaks and # I is odd, then Definition 21.D.5 simply says that a number (I - 1)/2 of the agents have peaks strictly smaller than x. and another number (I - I )/2 strictly larger. In this case the median agent is unique. Proposition 21.0.1: Suppose that ~ is a linear order on X and consider a profile of preferences (::::1"" , ::::,) where, for eve!y i, ::::; is single peaked with respect.to ;;::. Let h e I be a median agent. Then Xh F(:::: l' ••• , ::::,) y for every y e X. That IS, the peak xh of the median agent cannot be defeated by majority voting by any other alternative. Any alternative having this property is called a Condorcet winner. Therefore, a Condorcet winner exists whenever the preferences of all agents are singlepeaked with respect to the same linear order.

Flgur. 21.0.3

Agent 5 is the Median Agent 3 VOlers

3 VOlers

Proof: Take any y e X and suppose that x. > y (the argument is the same for y > x.). We need to show that y does not defeat x, that is, that #{ie/:x.>-,y};;:: #(ie/:y>-,x.}.

Consider the set of agents Sci that have peaks larger than or equal to x., that is, S = {i e I: XI ;;:: x.}. Then x, ~ x. > y for every i e S. Hence, by single-peakedness of :::: I with respect to ~, we get x. >-, y for every j e S. On the other hand, because agent h is a median agent we have that #S ~ 1/2 and so # {i e I: y >-,x.} ~ #(/\S) ~ 1/2~ #S:;, #{ie/:x.>-,y}._ Proposition 21.0.1 guarantees that the preference relation i(::::" ... , ::::,) is acyclic. It may, however, not be transitive. In Exercise 21.0.6 you are asked to find an example of nontransitivity. Transitivity obtains in the special case where I is odd and, for every i, the preference relation ::::, belongs to the class 9"~ c ar~ formed by the rational preference relations:::: that are single peaked with respect to ~ and have the property that no two distinct alternatives are indifferent for ::::. Note that, if I is odd and preferences are in this class, then, for any pair of alternatives, there is always a strict majority for one of them against the other. Hence, in this case, a Condorcet winner necessarily defeats any other alternative. Proposition 21.0.2: Suppose that I is odd and that ;;:: Is a linear order on X. Then pairwise majority voting generates a well-defined social welfare functional F: .'Ji'~ -+ .'11. That is, on the domain of preferences that are single-peaked with respect to ;;:: and, moreover, have the property that no two distinct alternatives are indifferent, we can conclude that the social relation F(::::1' ... , ::::,) generated by pairwise majority voting is complete and transitive. Proof: We already know that i(::::" ... , ::::,) is complete. It remains to show that it is transitive. For this purpose, suppose that x i(::::" ... , ::::,)y and y i(::::" ... , ::::,) z. Under our assumptions (recall that I is odd and that no individual indifference is allowed) this means that x defeats y and y defeats z. Consider the set X' = {x, y, z}. If preferences are restricted to this set then, relative to X', preferences still belong to the class .'Ji'~, and therefore there is an alternative in X' that is not defeated by any

Determination of a median for a single-peaked family.

803

804

CHAPTER

21:

SOCIAL

CHOICE

SECTION

THEORY

-------------------------------------------------------------~

21.0:

SOME

POSSIBILITY

RESULTS:

RESTRICTED

DOMAINS

605

---------------------------------------------------------------x = R'

Figure 21.0.4

Indifference curves for the preferences of Example 21.0.5.

other alternative in X'. This alternative can be neither y (defeated by x) nor z (defeated by y). Hence, it has to be x and we conclude that x transitivity. _

FC??:." . .. , ~,) z, as

required by

In applications, the linear order on alternatives arises typically as the natural order, as real numbers, of the values of a one-dimensional parameter. Then, as we have seen, singlepeakedness follows from the strict quasiconcavity of utility functions, a restriction quite often satisfied in economics. It is an unfortunate fact that the power of quasiconcavity is confined to one-dimensional problems. We illustrate the issues involved in more general cases by discussing two examples. Example 21.0.5: Suppose that the space of alternatives is the unit square, that is, X = [0, I]'. The generic entries of X arc denoted x = (x" x,). There are three agents I = {t, 2, 3}. The preferences of the agents are expressed by the utility functions on X:

o example is that the cone spanned by the nonnegative combinations of the gradient vectors of the three utility functions equals the entire R' (see Figure 21.0.4). Exercises 21.0.7 and 21.0.8 provide further elaboration on this issue. The reason why in two (or more) dimensions, quasiconcavity does not particularly help is that, in contrast with the one-dimensional case, there is no sensible way to assign a "median" to a set of points in the plane. This will become clear in the next, classical, Example 21.0.6 which we now describe. Example 21.0.6: Euclidean Preferences. Suppose that the set of alternatives is R'. Agents have preferenees 0, is preferred by agents I and 2 to x. You should verify the claims made in (i), (ii), and (iii). _ The situation illustrated in Example 21.0.5 is not a peculiarity. The key property of the

10. The preferences

or this example are not strictly convex. This is immaterial. Without changing

the nature of the example we could modify them slightly so as to make the indifference curve map strictly convex.

A(y,z)

= {XE R": IIx -

yll < IIx-zlI}.

See Figure 21.0.6 for a representation. Geometrically, the boundary of A(y, z) is the hyperplane perpendicular to the segment connecting y and z and passing through its midpoint. We will consider the idealized limit situation where there is a continuum of agents with Euclidean preferences and the population is described by a density function g(x) defined on R', the set of possible peaks. Then given two distinct alternatives y, z E R", the fraction of the total population that prefers y to z, denoted m,(y, z), is simply the integral of g(') over the region A(y, z) c R'. When will there exist a Condorcet winner? Suppose there is an x· E R' with the property that any hyperplane through x· divides R' into two half-spaces each having a total mass of ! according to the density g('). This point could be called a median for the density g('); it coincides with the usual coneept of a median in the case n = I. A median in this sense is a Condorcet winner. It cannot be defeated by any other alternative because if y ¥- x· then A(x·, y) is larger than a half-space through x· and, therefore, m,(x·, y) ~ t. Conversely, if x·

11. For an example in the same spirit where the two roles arc kept separate, see Grandmont (1978) and Exercise 21.0.9.

Flgur. 21.0.5 (left)

Euclidean preferences in R'. Flgur. 21.D.6 (right)

The region of Euclidean preferences that prefer y to z.

806

CHAPTER

21:

SOCIAL

CHOICE

SECTtON

THEORY

-------------------------------------------------------------------~ A(x"

+ "I,x")

=>

m•. ix"

+ "I, X") > i Figure 21.D.7

If x" is not a median then it is not a Condorcet winner.

Figure 21.D.8

---

-------~-------(a)

----

--B(b)

is not a median then there is a direction q E R' such that the mass of the half-space {z E R': q'Z > q·x·J is larger than 1. Thus, by continuity, if < > 0 is small then the mass of the translated half-space A(x" + f.q, x") is larger than t. Hence x· + -,w. then the preference relatIOn >-, defi~~ by y>-iw >-iz >-ix takes {y.w} to the top from
",

As a first step in the study of policy decision problems, this section is concerned with the description of the set of options available to a policy maker. The following section will consider the objectives of the policy maker. I The starting point of the analysis is a nonempty set of alternatives X and a collection of I agents. In contrast with Chapter 21, where we used preference relations, we will now assume that agents' tastes are given to us in the form of utility functions II,: X -+ R. One may wonder what is the exact meaning of the utility values II.eX): DO they have cardinal or ordinal significance? Are they comparable across individuals? These questions will be considered in Section 22.0. For current purposes there is no need to answer them. It is a traditional, and firm, principle of welfare economics that policy making should not be paternalistic. At a minimum, this means that alternatives that cannot be distinguished from the standpoint of agents' tastes should not be distinguished by the policy maker either. We are therefore led to the idea that only the agents' utility values for the different alternative should matter and therefore that the relevant constraint set for the policy maker is the utility possibility set [introduced by Samuelson (1947)], which we now define.

U = {(u" ... , ud

SEC T ION

BARGAINING

Figure 22.B.1 (leH)

A utility possibility set.

u

Flgur. 22.B.2 (right)

",

",

Example 22.B.I corresponds to a first-best situation. A first-best problem is one in which the constraints defining X are only those imposed by technology and resources. The policy maker cannot produce from a void and, therefore, must respect these constraints, but otherwise she can appeal to any conceivable policy instrument. If, as is often the case, there are other restrictions on the usable instruments, we say that we have a second-best problem. The restrictions can be of many sorts: legal, institutional, or, more fundamentally, informational. The last type were amply illustrated in Chapters 13 and 14 (and will be seen again in Chapter 23). We should warn, however, that the conceptual distinction between first-best and second-best problems is not sharp. In a sense, adverse selection or agency restrictions are as primitive as technologies and endowments.

Example 22.8.2: Ramsey Taxation. Consider a quasilinear economy with three goods, of which the third is the numeraire. The numeraire good can be freely transferred across consumers (more formally, one of the policy instruments available to the policy maker is the lump-sum redistribution of wealth). The first two goods are produced from the numeraire at a constant marginal cost equal to I. Consumers face market prices that are eq·.al to marginal cost plus a commodity tax whose level is fixed by the policy maker. Tax proceeds are returned to the economy in lump-sum form. Finally, the amounts consumed are those determined by the demand functions of the different consumers. We know from the second welfare theorem (Section 16.0) that any utility vector in the first-best UPS can be reached with the above instruments (it suffices to set the tax rates at a zero level and distribute wealth appropriately). But suppose that we now have an unavoidable distortion-the policy maker is constrained to raise a total amount R of tax receipts. This has then become a second-best problem. To determine the corresponding second-best UPS, note first that, since the numeraire is freely transferable across consumers, the boundary of this set is still linear, as in the first-best case (i.e., as in Figure 22.8.2). Hence, to place this boundary it suffices to find the level of prices P" P2 that maximizes V(P" Pi), the indirect utility function of a representative consumer (which, up to an increasing transformation, equals the

A utility possibility set: transferable utility.

820

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

aggregate consumer surplus; see Section 4.D and Chapter 10 for these concepts). J Denote by XI(PI' p,) and X,(PI' p,) the aggregate demand functions. Then we must solve the problem Max

--- --

BARGAININQ

v( PI' p,) (PI - I)x,(p" p,)

S.t.

+ (p,

(PI - 1) oxI(p" p,) OP1

_ dxl(p,) .),(PI - 1) - - = (1 -1.)X,(p.) dp,

and

Denoting by tl = (PI - I)lpl the tax rate on good I, we can write this condition in elasticity form as 0:

t1 = - -

for some

0:

> O.

PO S SIB I LIT Y

+ (p,

_ 1) ox,(p" P1) = O. OP1

(22.B.2)

Example 22.B.4: Few Policy Instruments. In Examples 22.B.2 and 22.B.3 we have assumed that the unrestricted transfer of numeraire across consumers is one of the instruments available to the policy maker. Because of this, in those two examples the UPS had a "full" frontier, that is, a frontier that is an (1- I)-dimensional surface. In addition, quasilinearity insured that this surface was flat (and therefore that the UPS was convex). We now explore the implications of limiting the extent to which the numeraire is transferable. We assume that we have two goods and that the utility functions of I consumers are quasilinear with respect to the first good (which is untaxed). Arbitrary transfers of numeraire are not permitted, however. The policy maker now has a single instrument: a commodity tax (or subsidy) on the second good. Again, this good can be produced at unit marginal cost. The policy maker's surplus (or deficit) is given back to the consumers according to some fixed rule (hence, no arbitrary transfers of numeraire are permitted). Say, to be specific, that this rule is that the surplus-deficit is absorbed by the first consumer. Then the (second-best) UPS is [denoting by Vi(P1) the indirect utility f.. nction of consumer i]

.( p, - - 1) dx,(p,) - - - = (1 - j.') X, (-) p, . dp,

j.

and

UTI L , T Y

Note that except in the separable case, where ox,(p" p,)lop, = 0, we have p, i' 1; that is, even if the initial distortion involves only the first market, second-best efficiency requires creating a compensatory distortion in the second market [this point was emphasized by Lipsey and Lancaster (1956)]. This is an intuitive result: suppose that we were to put P1 = 1; then the last (infinitesimal) unit demanded of the second good makes a contribution P1 - 1 = 0 to the total surplus (recall that p, will equal the marginal utility for good 2). Therefore, a small tax on good 2 is desirable because its effect is to divert some demand toward good I, where the contribution to total surplus of the last unit demanded is PI - 1 > O. •

There is i. < 0, such that

0:

22. B:

assume that PI is fixed at some level PI > I. s The policy instruments are any transfer of numeraire across agents and the level of a commodity tax on the second good. The net revenue in the two markets is given back to consumers in a lump-sum form. The solution P1 of the surplus-maximization problem is then characterized by the first-order conditions (see Exercise 22.B.3)

- I)x,(p" p,) ;e: R.

Suppose, to take the simplest case, that the utility functions of the different consumers are additively separable. This means that the two demand functions can be written as x ,(PI) and x,(p,). Then the first-order conditions satisfied by a solution (PI' p,) of the maximization problem are (carry out the calculation in Exercise 22.B.2):

tl =-£I(PI)

SEC T ION

(22.B.I)

£1(P1)

Expression (22.8.1) is known as the Ramsey taxation formula [because of Ramsey (1927)]. An implication of it is that if the demand for good 1 is uniformly less elastic than that for good 2, then the optimal tax rate for good I is higher. This makes sense: For example, if the demand for good I is totally inelastic then there is no deadweight loss from taxation of this good (see Section IO.C) and therefore we could reach the first-best optimum by taxing only this good.' • Example 22.B.3: Compensatory Distortion. The basic economy is as in Example 22.8.2, except that we do not necessarily assume that the utility functions of the consumers are additively separable. The distortion is now of a different type. We

U = {u

E

R': u :S (V,(P1)

+ (p,

- 1) Li Xi(P1), v1(p,), ... , V,(P1» for some P1 > O}.

Two points are worth observing. The first is that U does not need to be convex (you should show this in Exercise 22.B.4; recall from Proposition 3.D.3 that the indirect utility functions are quasi-convex. An example is represented in Figure 22.B.3. The second is that U is defined by means of a single parameter, P2' and therefore its Pareto frontier (which, naturally,lies in R') is one-dimensional. See Figure 22.B.4 for a case with I = 3. This feature is entirely typical. As long as the instruments available to the policymaker are fewer than I - 1 in number, the frontier of the UPS cannot be (I - 1)-dimensional. Note that when there is free transferability of numeraire across

3. Because total surplus equals consumer surplus plus the fixed amount of tax revenues R. by maximizing consumer surplus we maximize total surplus. We nole also thai the assumption that the amount R must be raised through commodity taxation is somewhat artificial in a context where lump-sum redistribution is possible. We make the assumption, in this and the next example, merely 10 be pedagogical. Alternalively, we could rule oul Ihe possibilily of lump-sum Iransfers. In this case the exercise carried out in this example (and the next) determines the first-order conditions ror Ihe problem of maximizing the sum of individual utilities (the "purely utilitarian social welfare function" in the terminology of Section 22.C) 4. We should warn that the formulas in (22.B.I) constitute only first-order conditions. As we shall see in Ihe forthcoming examples, second-best problems are frequently nonconvex and therefore the satisraction of first-order conditions does not guarantee that we have determined a true maximum.

5. More generally. we could think of the market for good I as being beyond the control of the policy maker and giving rise, perhaps because of a monopolistic structure. to a price higher than marginal cost.

1

SET S

821

822

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

BAROAININO

SECTION

~

22 •• :

UTILITY

POSSIBIL'TY

SETS

823

---------------------------------------------------------------------------------",

Figure 22.B.3 (left)

",

A nonconvex second-best utility possibility set (Example 22.B.4).

Flgur. 22.B.6

Figure 22.B.4 (rlghl)

",

",

"J

A second-best utility possibility set for a case with few instruments: low-dimensional Pareto frontier (Example 22.B.4).

the I consumers, this automatically gives us the necessary minimum of I - I instruments. _

",

A nonconvex utility possibility set for a first·best problem wilh externalities (Example 22.8.5).

Chapter 6). then the (expected, or ex ante) UPS is convex since it is just the set of convex combinations of the utility vectors in the UPS associated with deterministic policies. There is no general theoretical reason to prevent the policy making from randomizing. On the other hand, the practical admissibility of stochastic policies cannot be decided on a priori grounds either. We conclude this section with a final example [borrowed from Atkinson (1973)] that highlights the contrast between first-best and second-best problems. Example 22.B.6: Unproductive Taxation. Suppose that there are two commodities and two consumers. We call the first commodity "labor", or leisure, and the second the "consumption good." There is a total of one unit of labor which is entirely owned by the first consumer. The consumption good can be produced by the first consumer from labor at a constant marginal cost of I (there is also free disposal). The first consumer has a utility function u,(x, I> X2') and the second has "2(X22)' In Figure 22.B.7 we illustrate the construction of the first-best Pareto frontier for this model. Suppose that u l is given. Then, subject to attaining the level of utility ", for consumer I, we want to give to consumer 2 as much utility as possible. If consumer I gets (x 11' X2') then the labor supply is I - XII and the amount of consumption good available for consumer 2 is I - XI' - X2" Thus, we should first determine (X'I' X2') by minimizing x,, + X2' subject to u,(x, I> X2') ~ "" and then let U2 = u2(1 - XII - .x2d· We now study the second-best problem where consumer I cannot be forced to supply labor. The only available policy instrument for providing consumption good

Example 22.B.5: First-best N onconvexities. In Example 22.B.4 the possible nonconvexity of the UPS is due to the second-best nature of this set.lflump-sum transfers of numeraire were allowed, then the corresponding first-best UPS would be convex. Yet a first-best UPS may also be nonconvex. Two familiar sources of nonconvexities in first-best problems are indivisibilities and externalities. As for the first, suppose that there are two locations and two agents with identical locational tastes (in particular, they both prefer the same location). There are only two possible assignments of individuals to locations and therefore the UPS will be as in Figure 22.B.5. As for externalities, suppose that there is a single good and that the utility functions of two consumers are u,(x,) = x, and u,(x .. x,) = x,/x,. Then the UPS is as in Figure 22.B.6 (see Appendix A of Chapter II for more on nonconvexities due to externalities). _ Examples 22.B.4 and 22.B.5 have provided instances where the UPS is nonconvex. There is a procedure that permits one, in principle, to convexify the UPS. It consists of allowing the policy maker to randomize over her set of feasible policies. If random outcomes are evaluated by the different agents according to their expected utility (see

Figure 22.B.7

",

Construction of the first·best Pareto frontier for Example 22.8.6.

Figure 22.B.5

A nonconvex utility possibility set for a first-best locational problem (Example

22.B.5).

u

",

"

x" Labor Supply

824

C HAP T E R

2 2:

E L E MEN T 8

0 F

W ELF ARE

E CON 0 M I C 8

AND

A X 10M A TIC

• A RQ AI NINQ

---........

.,

Flgur. 22.B.8 (le"1

Construction of Ih, second-best Pareto frontier for Example 22.B.6. Flgur. 22.B.8 (rlghll

First-best and second-best utility possibilily sels for the unproductive taxalio n example (Example 22.B.6).

x"

Labor Supply

to consumer 2 is a linear tax t(1 - XI') on whatever amount of labour the first consumer decides to supply given the tax rate. The construction of the secondbest frontier is illustrated in Figure 22.B.8. For t ~ O. consumer I will choose X'I so as to maximize u,(x". (I - t)(I - XII»' Observe that this is as if she had chosen the point in her offer curve corresponding to the price vector (I. 1/(1 - t». Denote this point by xl(t) = (xl,(t). X21 (t». The utility of consumer 2 is then u 2 (t(1 - XII(t)))· The first-best and second-best UPS are displayed in Figure 22.B.9. 6 In the second-best case the figure also depicts the locus of utility pairs Q c RI obtained as t ranges from 0 to I. that is.

Q = {(UI(XI(t)). uit(1 - XI I (t)))) e

R2: O:S; I:S;

I}.

Note that Q does not coincide with the Pareto set of the second-best UPS because it exhibits a characteristic nonmonotonicity. The economic intuition underlying it is clear: if t is low. consumer 2 will get very little of the consumption good; but if I is very high. the situation is not much better. Consumer 2 will now get a large fraction of the labor supplied by consumer I. but for precisely this reason not much labor will be supplied by consumer I. • We can distill yet another lesson from Example 22.B.6. We see in Figure 22.B.9 that it is quite possible for the first-best and second-best Pareto frontiers to have some points in common; that is. there may well be second-best Pareto optima that are first-best Pareto optima. Yet Figure 22.B.9 tells us that it would be quite silly to select a point in the second-best Pareto frontier merely according to the criterion of proximity to the first-best frontier. The resulting selection may be distributionally 7 very biased. The investigation of more sensible selection criteria will be the purpose of Section 22.C.

6. Again, the second-best frontier mayor may not be convex.

7. We may add Ihal il may also be uninteresting from Ihe point of view of policy: in Figure 22.B.9 the only second-best policy that yields a first-best result is t = 0, that is, no policy al all!

--

SECTION

22.C:

SOCIAL

WELFI.RE

FUNCTIONS

AND

SOCIAL

22,C Social Welfare Functions and Social Optima In Section 22.8 we described the constraint set of the policy maker. or social planner. The next question is which particular policy is to be selected. The application of the Pareto principle eliminates any policy that leads to utility vectors not in the Pareto frontier. Yet this still leaves considerable room for choice." which. by necessity. must now involve trading off the utility of some agent against that of others. In this section we assume that the policy maker has an explicit and consistent criterion to carry off this task. Specifically. we assume that this criterion is given by a social welfare function W(u) = W(u ...... u,) that aggregates individuals' utilities into social utilities. We can imagine that W(u) reflects the distributional value judgments underlying the decisions of the policy maker." In Section 22.E (and subsequent ones) we will discuss a somewhat different approach. one that puts more emphasis on the bargaining. or arbitration. aspects of the determination of the final policy selection. In the current section, we refrain from questioning the assumption of interpersonal comparability of Ittility, which is implicit in our use of levels of individual utility as arguments in the aggregator function W(u l ••••• u,). Section 22.0. which links with the analysis of Chapter 21, is devoted to investigating this matter. Thus, for a given social welfare function W(·) and utility possibility set U c R', the policy maker's problem is Max

W(u l ••••• u,)

(22.C.1)

s.t.(ul.···.u,)eU. A vector of utilities. or the underlying policies. solving problem (22.C.1) is called a social optimum. If the problem has a second-best nature. and we want to emphasize this fact. then we may refer to a constrained social optimum. We now present and discuss some of the interesting properties that a social welfare function (SWF) may. or may not. satisfy. (i) N onpaternalism. This first property is already implicit in the concept itself of a SWF. It prescribes that in the expression of social preferences only the individual utilities matter: Two alternatives that are considered indifferent by every agent should also be socially indifferent. The planner does not have direct preferences on the final alternatives. (ii) Paretiall property. Granted the previous property. the Paretian property is an uncontroversial complement to it. It simply says that W(·) is increasing; that is, if It; W(u). We also say that W(· ) is strictly Paretian if it is strictly increasing; that is. if u; W(u). If W(·) is strictly Paretian then a solution to (22.C.I) is necessarily a Pareto optimum. 8. Only exceptionally will the Pareto frontier consist of a single point. Recall also that, as we saw in Example 22.B.3, in second-best situations with few instruments, the requirement of Pareto optimality may not succeed in ruling out many policies.

9. This approach 10 welfare economics was firsllaken by Bergson (1938) and Samuelson (1947).

OPTIMA

825

826

CHAPTER

•• :

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

---

BARGAINING

Figure 22.C.l (left)

K /

/

A symmetric social welfare function.

/

/

/

".C:

",

",

SOCIAL

WELFARE

FUNCTIONS

AND

SOCIAL

/~.'~

The optimum of a symmetric, strictly concave social welfare function on a

Invarianl When Reflecled on Diagonal

",

Symmetric and Convex

",

(iii) Symmetry. The symmetry property asserts that in evaluating social welfare all agents are on the same footing. Formally, W(·) is symmetric if W(u) = W(u') whenever the entries of the vector u [e.g., u = (2,4,5)] constitute a permutation of the entries of the vector u' [e.g., u' = (4, 5, 2)]. In other words, the names of the agents are of no consequence, only the frequencies of the different utility values matter. The indifference curves of a symmetric W(·) are represented in Figure 22.C.1 for a two-agent case. Geometrically, each indifference curve is symmetric with respect to the diagonal. Note also that, because of this, if the indifference surfaces are smooth then the marginal rates of substitution at any u = (Ul> •.. ,u,) with identical coordinates are all equal to I. (iv) Concavity. Finally, a most important property is the concavity of W(·). We saw in Chapter 6 that, in the context of uncertainty, the (strict) concavity of a utility function implies an aversion to risk. Similarly, in the current welfare-theoretic context it can be interpreted as an aversion to inequality condition. A straightforward way to see this is to simply note that if W(·) is concave and W(u) = W(u'), then W(tu + tu') ~ W(u) [with the inequality strict if u ". u' and W(·) is strictly concave]. Another is to observe that if the UPS is convex and symmetric, then the utility vector that assigns the same utility value to every agent is a social optimum of any symmetric and concave SWF (see Figure 22.C.2 and Exercise 22.C.l).10 Thus, with convex UPSs and concave, symmetric SWFs some inequality is called for only if, as will typically be the case, the UPS is not symmetric. It is to be emphasized that in general, and especially for second-best problems, the UPS may not be convex. This means that even if W(·) is concave the identification of social optima is not an easy task. A utility vector that satisfies the first-order conditions of problem (22.C.1) may not satisfy the second-order conditions or, if it does, it still may not constitute a global maximum. We can gain further insights by discussing some important instances of social welfare functions. 10. The set U c R' is symmetric if" e U implies "' e U for any"' e RL that differs from" only by a permutation of its entries. The interpretation of the symmetry property of a UPS is that there is no bias in the ability to produce utility for different agents. In other words, from the point of view of their possible contributions to social welfare. all agents arc identical.

symmetric and con"x utility possibility set ~ egalitarian.

(a)

"I

(b)

",

(c)

OPTIMA

Figure 22.C.3

Figure 22.C.2 (right)

/'

/

/

--

SECTION

",

Example 22.C.I: UriliIarian. A SWF W(II) is pllrely utilitarian if it has the form W(II) = L; U; [or, in the nonsymmetric situation, W(u) = LI PiU,]. In this case, the indifference hypersurfaces of W(·) are hyperplanes. They are represented in Figure 22.C.3(a). Note that W(·) is strictly Paretian. In the purely utilitarian case, increases or decreases in individual utilities translate into identical changes in social utility. The use of the purely utilitarian principle goes back to the very birth of economics as a theoretical discipline. In Exercise 22.C.2 you are asked to develop an interpretation of the purely utilitarian SWF as the expected utility of a single individual "behind the veil of ignorance." Another line of defense, based also on expected utility theory, has been offered by Harsanyi (1955); see Exercise 22.C.3. Because only the total amount of utility matters, the purely utilitarian SWF is neutral towards the inequality in the distribution of utility. It is important not to read into this statement more than it says. In particular, it does not say "distribution of wealth." For example, if there is a fixed amount of wealth to be distributed among individuals and these have strictly concave utility functions for wealth, then the purely utilitarian social optimum will be unique and distribute wealth so as to equalize the marginal utility of wealth across consumers. If, say, the utility functions are identical across individuals then this will choose as the unique social optimum the vector in the Pareto frontier that assigns the same utility to every agent (see Exercise 22.C.1 for generalizations). _ Example 22.C.2: Maximin. A SWF is of maximin or Rawlsian type [because of Rawls (1971)) if it has the form W(u) = Min {u l , ... , u,} [or, in the nonsymmetric case, W(u) = Min {P,u" ... , p,u,}]. In other words, social utility equals the utility value of the worst-off individual. It follows that the social planning problem becomes one of maximizing thl utility of the worst-off individual." The (L-shaped) indifference curves of the maximin SWF are represented in Figure 22.C.3(b). II. One could refine this criterion by adopting a lexical, or serial, maximin decision rule. First maximize the utility of the worst-ofT, then choose among the solutions of this first problem by maximizing the utility of the next worst-off, and so on. With this. the objectives of the policy maker can still be expressed by a le:dm;n social welfare ordering of utility vectors, but the ordering is not conlinuous and cannot be represenled by a SWF (compare with Example 3.CI). Even so, the refinement is natural and important. For example. we are then guaranteed that the social optimum is a Pareto optimum. You 3re asked to show all this in Exercise 22.C.4. Note that the maximin SWF is Paretian but not strictly Paretian. This makes for some difficulties. In Figure 22.C.4 the

Social welfare functions. (a) Purely utilitarian. (b) Maximin or Rawlsian. (c) Generalized utilitarian.

827

828

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

",

",

/ /

45'

/

FIgure 22.C.4

// / /

--- --

/

FIgure 22.C.S (rIght)

Maximin 0plimum Ulililarian 0plimum

Q

Q~------~~--~

"I

. O' . I FII"·be,, pllmum POlo

~

"I

It is reasonably intuitive that this concave SWF will have strong egalitarian implications. In fact, the preference for equality is quite extreme. Suppose, in effect, that U E RI is an arbitrary UPS and that u E U has all its coordinates equal. Then u fails to be the Rawlsian social optimum only if u is not Pareto optimal. Hence, if there is a u = (u., ... , u / ) in the Pareto frontier of U with all its coordinates equal, then u is a maximin optimum. Note, in contrast, that for a purely utilitarian SWF we reached the social optimum at complete equality only in the case where U is convex and symmetric. In Figure 22.C.4, which continues the analysis of Example 22.6.6, we depict a situation where maximin optimization leads to the selection of a policy (a tax level) that does not yield complete equality. Nonetheless, even in this case, the purely utilitarian social optimum is significantly more unequal than the maximin optimum. _

Example 22.C.3: Generalized Utilitarian. A SWF is generalized utilitarian if it has the form W(u) = LI g(u,) [or, in the nonsymmetric case, W(u) = LI g,(u / )), where g(.) is an increasing, concave function. The generalized utilitarian SWF is strictly Paretian and could be regarded as an instance of the purely utilitarian case where the individual utility functions u,(·) have been replaced by g(u,(·». This is not, however, a conceptually useful point of view. The point is precisely that, given the individual utility functions, there is a deliberate social decision to attach decreasing social weight to successive units of individual utility. The social indifference curves for this case are represented in Figure 22.C.3(c). We can also verify in Figure 22.C.4 and 22.C.5 that the equality implications of the generalized utilitarian SWF are intermediate between those of the purely utilitarian and of the maximin SWFs. _ Example 22.C.4: COnstalll Elasticity. An instance of generalized utilitarian functions that is very useful in applications is provided by the family defined by social utility functions g(.) whose marginal utilities have constant elasticity. This is a family in which attitudes towards inequality can be adjusted by means of a single parameter p '2! O. point at the boundary of U with equal coordinates is a maximin optimum but not a Pareto optimum. In the figure we have selected as "maximin optimum'" the leximin optimum (which. by definition, is a maximin optimum itself).

Range of generali~ utilitarian optima for Example 22.B.6 and the constant elasticity SWF of Example 22.C.4 (p E [0, <Xl]).

22.C:

SOCIAL

WELFARE

FUNCTIONS

AND

SOCIAL

For the rest of the example, individual utility values are restricted to be nonnegative. Then, for any p '2! 0, we let

(t.~)

A maximin oplimum for Example 22.8.6.

/

SECTION

BARGAINING

gp(u,) = (I - p)u! -p

and

if p'# I, if p = I.

Note that, as claimed, the elasticity of g~(u,> is constant because we have u,g7(u,>/g~(u,) = -p for all values u,. Taking into account that, for p '# I, h(W) = [1/(1 - p)] WI/(I-P) is an increasing transformation of W, we can represent the generalized utilitarian social preferences in a particularly convenient manner as w,,(u) =

(L, u! -0)11'-0

for p'# I,

w,,(u) =

L, In u,

for p = I.

and

Thus, we obtain the CES functions that are well known from demand and production theories (see Exercises 3.C.6 and 5.C.IO, respectively). Note that for p = 0 we get Wo(u) = L, u" the purely utilitarian case, and as p -+ 00 we get w,,(u) -+ Min {u l , . · . , U/}, the maximin case. (See Exercise 22.C.5.) In Figure 22.C.5 we depict the range of solutions to Example 22.B.6 as we vary p. We see that as the aversion to inequality increases (that is, as p -+ 00) the optimal tax rate increases. Note, however, that even for very high p we do not approach complete equality. On the other hand, none of these second-best solutions corresponds to the point in the Pareto frontier that is also Pareto optimal for the first-best problem. The latter distributes utility so unequally that the equity considerations underlying any symmetric and concave SWF leads us to sacrifice some first-best efficiency for an equity gain. _

The Compensation Principle We could ask ourselves to what an extent we can do welfare economics without social welfare functions. If the purpose of the SWF is the determination of optimal points in a given Pareto frontier, then resorting to them seems indispensable. This is the usage of social welfare functions that we have emphasized up to now; but in practice this is not the only usage. Often, the policy problem is given to us as one of choosing among several different utility possibility sets; these may correspond, for example, to the UPS associated with different levels of a basic policy variable. 12 If we have a social welfare function W(·), then the choice among two utility possibility sets U and U' should be determined by comparing the social utility of the optimum in U with that of the optimum in U'. However, even if there is no explicit social welfare function one may attempt to say something meaningful about this problem using revealed preference-like ide \s. This is the approach underlying the compensation principle (already encountered in Sections 4.0 and to.E). Let us first take the simplest case: that in which we have two utility possibility sets such that U c U'. Then one is very tempted to conclude that U' should be preferred to U. This would certainly be the case if the points that would be chosen 12. Formally. we can reduce this problem to the previous one by considering the overall UPS formed by the union of the UPSs over which we have to choose. But this may not be the most convenient thing to do because it loses the sequential presentation of the problem (first choose among UPS. then choose the utility vector).

OPTIMA

829

,",vV

(,;HAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

BARGAININQ

---......

---

SECTION

22.0:

IN VARIANCE

PROPERTIES

OF

SOCIAL

WELFARE

FUNCTIONS

831

Figure 22.C.7 Flgur. 22.C.&

",

U' passes the weak compensation test over (U, u).

",

within each of V and V' were the optima of a social welfare function. But even if no social welfare function is available the set V' might still be considered superior to V according to the following strong compensation test: For any possible U E V there is a u' E V' such that uj ;,: u, for every i. That is, wherever we are in V it is possible to move to V' and compensate agents in a manner that insures that every agent is made (weakly) better off by the change to V'. If the compensation is actually made, so that every agent will indeed be made better off by a switch from V to V', there is no doubt that the switch should be recommended. But if compensation will not occur, matters are not so clear: By choosing V' over V based only on a potential compensation we are neglecting quite drastically any distributional implication of the policy change. In fact, it is even possible that the change leads to a purely egalitarian worsening (see Exercise 22.C.6). Recall from Section 10.0 that in the quasilinear case we always have V c V' or V' c V. This is because the boundaries of these sets are hyperplanes determined by the unit vector (hence parallel). In addition, this property also guarantees that the strong compensation criterion (which in Sections 3.0 and 10.E we called simply the compensation criterion) coincides with the choice we would make using a purely utilitarian social welfare function. In this quasilinear case, therefore, the strong compensation criterion does not neglect distributional issues to a larger extent than do purely egalitarian social welfare functions. Matters are more delicate when we compare two utility possibility sets V and V' which are such that one is not included in the other, that is, whose frontiers cross (see Figure 22.C.6). Suppose that we know that the outcome with utility possibility set V is the vector u E V, and that we are considering a move to V'.'3 If u E V', and we were to allocate utility optimally in V' according to a social welfare function, then the move to V' would be advisable. More generally, whenever u E V', the move from (V, u) to V' passes the following weak compensation test: There is au' E V' such that u; ;:: u, for every i. That is, given that we know that the outcome at V is u, we could move to V' and compensate every agent in a manner that makes every agent (weakly) better off. In Figure 22.C.6, V' passes the test with respect to (V, u) but not with respect to (V, u). Again, if the compensation is actually paid, then the weak compensation criterion 13. For e~ample. the original U could correspond to some underlying economy and u could be Ihe ulility values of a market equilibrium.

carries weight. If it is not paid, then it is subject to two serious criticisms. The first is the same as before (it disregards distributional consequences). The second is that it may lead to paradoxes. As in Figure 22.C.7, it is possible to have two utility possibility sets U and U', with respective outcomes u E U and u' E U', such that U' passes the weak compensation test over (U, u) and V passes the weak compensation test over (U', u').ln Exercise 22.C.7 you are asked to provide a more explicit example of this possibility in an economic context. Further elaborations are contained in Exercise 22.C.8.

22.D Invariance Properties of Social Welfare Functions In this section, we probe deeper into the meaning of the comparisons of individual utilities implicit in the definition of a social welfare function. The significance of the matter derives from the fact that whereas a policy maker may be able to identify individual cardinal utility functions (from revealed risk behavior, say), it may actually do so but only up to a choice of origins and units. Fixing these parameters unavoidably involves making value judgments about the social weight of the different agents. It is therefore worth examining the extent to which such judgments may be avoided. Thus, following an approach to the problem taken by d'Aspremont and Gevers (1977), Roberts (1980), and Sen (1977), we explore such questions as: What are the implications for social decisions of requiring that social preferences be independent of the units, or the origins, of individual utility functions?" To answer these types of questions, we need to contemplate the dependence of social preferences on profiles of individual utility functions. Thus, the social welfare functionals introduced in Chapter 21 provide a natural starting point for our analysis. However, we mudify their definition slightly by specifying that individual characteristics arc given to us in the form of individual utility functions u,(') rather than as individual preference relations. From now on we are given a set of alternatives X. We denote by 'fI the set of all possible utility functions on X, and by iJt the set of all possible rational (i.e., complete and transitive) preference relations on X.

I !

J

14. In addition to the previous references, you can consult Moulin (1988) for a succinct presentation of the material of this section.

A paradox: V' passes the weak com· pensation test over (V, u), and V passes the weak com· pensation test over (U', u').

832

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

Definition 22.0.1: Given a set X of alternatives. a social welfare functional F: Cfil ..... 9t is a rule that assigns a rational preference relation F(ii, • ...• iii) among the alternatives in the domain X to every possible profile of individual utility functions (u,(·) ....• UI('» defined on X. The strict preference relation derived from F(u, •. ..• u,) is denoted Fp(ii, • ...• u,) .. • As in Chapter 21. we will concern ourselves only with social welfare functionals that are Paretian. Definition 22.0.2: The social welfare functional F: Cfil ..... 9t satisfies the (weak) Pareto property. or is Paretian. if. for any profile (u, • ...• iii) e Cfil and any pair x. veX. we have that u,{x);;o: ii,{V) for all i implies x F(ii, •.. .• iii) V. and also that u,{x) > u,{V) for all i implies x Fp(ii, •. ..• ii,) V. The first issue to explore is the relationship between these social welfare functionals and the social welfare functions of Section 22.C. A social welfare function W(·) assigns a social utility value to profiles (Ul •.••• UI) e RI of individual utility values. whereas a social welfare functional assigns social preferences to profiles (u ,....• u,) of individual utility functions (or. in Section 21.C, of individual preference relations). From a social welfare function W(·) we can generate a social welfare functional simply by letting F(UI' ...• u,) be the preference relation in X induced by the utility function u(x) = W(ul(x) •...• UI(X». The converse may not be possible. however. In order to be able to "factor" a social welfare functional through a social welfare function, the following necessary condition must. at the very least. be satisfied. Suppose that the profile of utility functions changes, but that the profiles of utility values for two given alternatives remain unaltered; then the social ordering among these alternatives should not change (since the value given by the social welfare function to each alternative has not changed). That is. the social ordering among two given alternatives should depend only on the profiles of individual utility values for these alternatives. Apart from being formulated in terms of utilities, this property is analogous to the pairwise independence condition for social welfare functionals (Definition 21.C.3). We keep the same term and state the condition formally in Definition 22.0.3. Definition 22.0.3: The social welfare functional F: Cfil ..... 9t satisfies the pairwise independence condition if. whenever x. veX are two alternatives and (u, • .... UI) eCfi I • (ii; • ...• iii) e Cfil are two utility function profiles with ii,{x) = D;{x) and u,(V) = ii;(V) for all i. we have xF(ii, ..... iil)V

xF(u; ..... ul)v.

The necessary pairwise independence condition is almost sufficient: In Proposition 22.0.1 we now see that if the number of alternatives is greater than 2, and the Pareto and pairwise independence conditions are satisfied. then we can derive from the social welfare functional a social preference relation defined on profiles (u l •...• UI) e 9t1 of utility values.'6 A standard continuity condition then allows us to represent this

BARGAINING

833

---...... ----------------------------------------------------------SECTION

22.0:

IN VARIANCE

PROPERTIES

OF

SOCIAL

WELFARE

preference relation by means of a function W(u, •...• u,). thereby yielding a social welfare function. proposition 22.0.1: Suppose that there are at least three alternatives in X and that the Paretian social welfare functional F: Cfi' ..... 9t satisfies the pairwise independence condition. Then there is a rational preference relation;::; defined on HI [that is. on profiles (u, . ...• uJl e HI of individual utility values] that generates F(·). In other words. for every profile of utility functions (u, •... • iii) e Cfil and for every pair of alternatives x. veX we have x F(ii, . ...• uJl V

(u,(x) •...• iii (x)) ;::; (u,(V) •. ··• iil(v)).

Proof: The desired conclusion dictates directly how ;::; should be constructed. Consider any pair of utility profiles u = (u, •. ..• u,) e R' and u' = (u; •. ..• ui) e R'. Then we let u;::; u' if x F(u ...... u,)y for some pair x.yeX and a profile (u, •. ..• u,) e Cfi' with Ui(X) = Ui and Ui(Y) = u; for every i. We argue first that the conclusion u ;::; u'. is independent of the particular two alternatives and the profile of utility functions chosen. Independence of the utility functions chosen is an immediate consequence of the statement of the pairwise independence condition. Proving independence of the pair chosen is a bit more delicate. It sunices to show that if we have concluded that u;::; u' by means of a pair x. Y then. for any third alternative z (recall that by assumption there are third alternatives). we obtain the same conclusion using the pairs x, z or z. y." We carry out the argument for x. z (in Exercise 22.0.2 you are asked to do the same for z. y). To this effect. take a profile of utility functions (UI •...• UI) e Cfi' with Ui(X) = Ui. Ui(Y) = u;. and ui(z) = u; for every i. Because we have concluded that u;::; u' using the pair x, y. we must have x F(u, •... , il,) y. By the Pareto property. we also have Y F(u, •. .. , UI) z. Hence. by the transitivity of F(u, •. ..• UI)' we obtain x F(u, •.. . , u,) z. which is the property we wanted. It remains to prove that ;::; is complete and transitive. Completeness follows simply from the fact that the preference relation F(u, •...• UI) is complete for any (u l •...• ,i,) e Cfil. As for transitivity. let u;::; u';::; u·. where u, u'. u· e R'. Take three alternatives x. Y. z e X and a profile of utility functions (u, ..... u,) e Cfil with Ui(X) = Ui. ui(Y) = u;. and ui(z) = u, for every i. Since u;::; u' and u' ;::; UN. it must be that x F(u, •. .. , u,) y and y F(u, ... .• UI) z. Because of the transitivity of F(u, •. ..• U/). this implies x F(u , •...• u,) z. and so u ;::; UN. Hence. ;::; is transitive. _ By the Pareto condition, the social preference relation;::; obtained in Proposition 22.0.1 is monotone. You are asked to show this formally in Exercise 22.0.3.

Exercise 22.0.3: Show that if the social welfare functional F: Cfi' ..... 9t satisfies the Pareto property. then a social preference relation;::; on utility profiles for which the 17. Indeed. suppose Ihat we initially used the pair (x. y). Consider any other pair (v. w). If v = x u'. Hence, let the chain of (v. w). There y) - (x. z) -

15. Tha' is. x Fi", ..... "lb' if x F(", ..... u,)y but not y F(u, ..... "I)X. 16. In Exercise 22.D.1 you can find examples showing that the Pareto condition and Ihe restriction on the number of alternatives cannot be dispensed with for the result of Proposition

or w = y then we have just claimed that we get the same ordering between u and t' # x and w # y. If. in addition, v # y. then we reach the same ordering by replacements: (x. y) - (v. y) _ (v. w). Similarly. if w # x we can use (x. y) - (x. w) remains the case (v. w) = (y. x). Here we use a third alternative. z. and the chain (x.

22.D.1.

(y,z) - (y.x).

FUNCTIONS

834

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

conclusion of Proposition 22.0.1 holds must be monotone in the sense that if u' then u' ;:: u, and if u' » u then u' :> u.

~

u

The social preference relation;:: on R' obtained in Proposition 22.0.1 need not be continuous or representable by a utility function. Consider, for example, a lexical dictatorship (say that there are two agents and let u:> u' if U I > u; or if U I = u; and U2 > u;) and recall from Example 3.C.1 that this type of ordering is not representable by a utility function. Nonetheless, we want to focus on social welfare functions and so from now on we will simply assume that we deal only with social welfare functionals that, in addition to the assumptions of Proposition 22.0.1, yield a continuous social preference relation ;:: on R'. As in Section 3.C, such a social preference relation can then be represented by a utility function: in fact, a continuous one. This is then our social welfare function W(u" ... , u,). Note that any increasing, continuous transformation of W(·) is also an admissible social welfare function. In summary, we have seen that the existence of a social welfare function generating a given social welfare functional amounts, with some minor qualifications, to the satisfaction of the pairwise independence condition by the social welfare functional. Therefore, we will concern ourselves from now on with a social welfare functional F: Oft' -+ {It that can be generated from an increasing and continuous social welfare function W: R' -+ R, or equivalently, from a monotone and continuous rational preference relation;:: on R'. We will discover that, in this context, natural utility invariance requirements on the social welfare functional have quite drastic effects on the form that we can choose for W(·) and, therefore, on the social welfare functional itself. Definition 22.0.4: We say that the social welfare functional F: Oft' -+ {It is invariant to cammon cardinal transformations If F(ii" ... , ii,) = F(ii;, ... , iii) whenever the profiles of utility functions (ii" ... , ii,) and (ii;, ... , iii) differ only by a common change of origin and units, that Is, whenever there are numbers (J > 0 and IX such that ii,{x) = (Jii;{x) + IX for all i and x eX. If the invariance Is only with respect to common changes of origin (i.e., we require (J = 1) or of units (i.e., we require IX = 0), then we say that F(') is invariant to common changes of origin or of units, respectively. It is hard to quarrel with the requirement of in variance with respect to common cardinal transformations. Even if the policy maker has the ability to compare the utilities of different agents, the notion of an absolute unit or an absolute zero is difficult to comprehend. We begin by analyzing the implications of invariance with respect to common changes of origin. Suppose that the social welfare functional is generated from the social welfare function W(·). We claim that the invariance with respect to common changes of origin can hold only if W(u) = W(u') implies W(u + lXe) = W(u' + lXe) for all profiles of utility values u e R', u' e R' and IX e R, where e = (I, ... , I) is the unit vector. Indeed, let W(u) = W(u') and W(u + lXe) < W(u' + lXe). Consider a pair x, y e X and profile (17" ..• , 17,) e U' with u,(x) = and u,(y) = for every i. Then x F(u I' ••• , 17,) y. However, x F(u;, .•• , 17;) y does not hold when 17;0 = 17,(') + IX, can tradicting the invariance to common changes of origin. Geometrically, the assertion that W(u) = W(u') implies W(u + lXe) = W(u' + lXe) says that the indifference curves of W(·) are parallel with respect to e-they are

u,

u;

--- --

BARGAINING

SECTION

22.0:

INVARIANCE

PROPERTIES

OF

SOCIAL

WELFARE

FUNCTIONS

835

Figure 22.0.1

Indifference map of a social welfare function invariant to identical

changes of utility origins.

obtained from each other by translations along the e direction (see Figure 22.0.1). In Proposition 23.0.2 [due to Roberts (1980)], we show that this property has an important implication: up to an increasing transformation, the social welfare function can be written as a sum of a purely utilitarian social welfare function and a dispersion term. Proposition 22.0.2: Suppose that the social welfare functional F: Oft' -+ {It is generated from a continuous and increasing social welfare function. Suppose also that F(') is invariant to common changes of origins. Then the social welfare functional can be generated from a social welfare function of the form W(U" ... , Ut) = 0 - g(u, - 0, ... , u, - 0).

(22.0.1)

where 0 = (1//) LjUj. Moreover, if F(') is also independent of common changes of units, that is, fully invariant to common cardinal transformations, then g(.) is homogeneous of degree one on its domain: {s e R': LjSj = OJ. Proof: By assumption the social welfare functional F: Oft' -+ {It can be generated by a continuous and monotone preference relation;:: on R'. Moreover the invariance to identical changes of units implies that if u - u' then u + lXe - u' + lXe for any IX e R. We now construct a particular utility function W(·) for ;::. Because of continuity and monotonicity of;:: there is, for every u e R', a single number IX such that u - lXe. Let W(u) denote this number. That is, W(u) is defined by u - W(u)e. (See Figure 22.0.2 for a depiction.) Because of the monotonicity of preferences, W(·) is a legitimate utility representation for ;::.1. The fi.st part of the proof will be concluded if we show that W(u) - ii depends only on the vector of deviations (u, - 17, ... , u, - 17) = u - ae, that is, that if u - iie = u' - a'e then W(u) - 17 = W(u') - a'. But this is true because u - W(u)e and the invariance to common changes of origin imply that if u - ae = u' - a' e then u'

= u + (ii' -

ii)e - W(u)e

+ (U'

- a)e

= [W(u) + (a' -

a)]e

18. Up to here this is identical to the parallel construction in consumption theory carried out in Proposition 3.CI. We refer to the proof of the latter for details.

836

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

BARGAINING

----

--

SEC T ION

22•D:

I NVA R, ANCE

origin case.

and therefore, W(u') = W(u) + (ii' - ii) as we wanted. The construction is illustrated in Figure 22.0.2.'9 To prove the second part, suppose that F(') is also invariant to common changes of units. Because F(') is generated from W(·), this can only happen if for every u - u' and {J > 0 we have {Ju - {Ju'. But then u - W(u)e implies {Ju - {JW(u)e, and so W({Ju) = {J W(u) for any u e R' and {J > O. That is, W(·) is homogeneous of degree one, and since g(.) coincides with - W(·) on the domain where ii = 0, we conclude that g(.) is also homogeneous of degree one. _ Going further, if the policy maker is not empowered with the ability to compare the absolute levels of utility across consumers, then the social welfare functional must satisfy more demanding invariance notions. Delinltlon 22.0.5: The social welfare lunctional F: 'fI' -+ [Jt does not allow interpersonal comparisons of utility il F(ii, . ... , ii,) = F(ii; •. ... iii) whenever there are numbers p; > 0 and r1.; such that ii,{x) = p;ii/(x) + r1.; lor all i and x. If the invariance is only with respect to independent changes 01 origin (Le .. we require P; = 1 lor all i). or only with respect to independent changes of units (Le., we require that r1.; = 0 for all i), then we say that F(') is invariant to independent changes of origins or of units. respectively. We have then Proposition 22.0.3. 20 19. We can gain some intuition on the form of this utility function by noticing its similarity to the quasilinear representations in consumer theory. Here we can write any vector u e R' as u = lie + (u - lie) and indifference sets can he obtained by parallel displacements in the direction e. In consumer theory we can write any vector x E RL as x = (x,. 0, ...• 0) + (0. Xl •••• I xd and indifference sels are parallel in the direction (1.0, ...• 0). Similarly. lhe conclusion in both cases is

thatthere is a utility function that is linearly additive in the first lerm (i.e., in the direction in which indifference sets are parallel), 20. See d'Aspremont and aevers (1977) for more results of this type.

0 F

soc I A L

W ELF ARE

proposition 22.0.3: Suppose that the social welfare functional F: 'fI' -+ [Jt can be generated from an increasing. continuous social welfare lunction. If F(·) is invariant to independent changes of origins. then F(') can be generated from a social welfare function W(·) of the purely utilitarian (but possibly nonsymmetric) form. That is, there are constants b; ~ 0, not all zero. such that W(u" ...• u,) =

Flgur. 22.0.2 Construction of the social welfare fUnetion of form (22.D.\) for the invariant to identical changes of

PRO PER TIE S

L b;u;

for all i.

(22.D.2)

Moreover. if F(') is also invariant to independent changes of units [Le., if F(') does not allow lor interpersonal comparisons of utility], then F is dictatorial: There is an agent h such that. for every pair x, VEX, iih(x) >iih(y) implies x Fp(ii, •... , ii,) y. Proof: Suppose that ;::; is the continuous preference relation on R' that generates the given F(·). For a representation of the form (22.0.2) to exist, we require that the indifference sets of ;::; be parallel hyperplanes. Since we already know from Proposition 22.0.2 that those sets are all parallel in the direction e, it suffices to show that they must be hyperplanes, that is, that if we take two u, u· e R' such that u - u'. then for u" = !U + !u· we also have u· - u - u'. The invariance of F(') with respect to independent changes of origins means. in terms of ;::;. that for any r1. e H' we have u + a;::; u" + r1. if and only if u ;::; UN. Take r1. = 1(u' - u). Then u + r1. = u" and u" + r1. = u'. Hence. u;::; u· if and only if UN;::; u'. If u;:: u" then u·;::; u' and so u· - u. If u' > u then u' > u· which contradicts u - u'. We conclude that u· - u - u'. as we wanted. Once we know that indifference sets are parallel hyperplanes. the same construction as in the Proof of Proposition 22.D.2 will give us a W(·) of the form (22.0.2). In addition, the Pareto property yields b, ~ 0 for all i. Finally, suppose that F(') is also invariant to independent changes of units. Then dictatorship follows simply. Choose an agent h with b. > O. Take u, u' e R' with u, > u~. Then, by invariance to independent changes of units. we have that L, b,u, > L, b,u; if and only if b,u, + e L, .. , bl", > b.u. + eLI .. , blu; for any e > O. Therefore, since b,u, > b,u. we get. by choosing e > 0 small enough, that LI blu, > L, b,u;. Thus. agent h is a dictator (show, in Exercise 22.0.4, that in fact b, = 0 for all i >F II). _ We point out that for the dictatorship conclusion of Proposition 22.0.3, it is not necessary that F(') be generated from a social welfare function. It suffices that it be generated from a social preference relation on R'. Proposition 22.0.3 (extended in the manner indicated in the last paragraph) has as a corollary the Arrow impossibility theorem of Chapter 21 (Proposition 2I.C.I), which is, in this manner, obtained by a very different methodology. Indeed, suppose that F(') is a social welfare functional defined. as was done in Chapter 21, on profiles of preference relations (;::;" ... , ;::;,) e !Jl'. Then we can construct a social welfare functional 1"(.) oefined on profiles of utility functions (ii" .... ii,)e'fl' by letting F'(ii" ... , ii,) = F(;::;" .•.• ;::;,). where ;::;, is the preference relation induced by the utility function ii,(·). In Exercise 22.0.5 you are asked to verify, first, that F'(') inherits the Paretian and pairwise independence conditions from F('), second, that F'(') does not allow for interpersonal comparisons of utility and, third. that a dictator for F'(') is a dictator for F(·).

FUN C T , 0 N S

837

838

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

Other invariance properties of social welfare functionals have been found to be of interest. We mention two. We say that the social welfare functional F: 'fI1 .... {It is invariant to common ordinal transformations if F(ii" ... , ii,) = F(ii;, ... , iii) whenever there is an increasing function y(-) such that ii,(x) = y(ii;(x» for every x e X and all i. The interpretation of this invariance is that although the social planner has no notion of individual utility scales she can, nonetheless, recognize that one individual is better off than another (but the question "by how much?" is meaningless). An example is provided by the social welfare functional induced by the symmetric Rawlsian social welfare function W(u) = Min {u" ... , U/}, With this SWF, the ordering Over policies depends only on the ability to determine the worse-off individual (see Exercise 22.0.8 for further elaboration). We say that a social welfare function W(·) generating a given social welfare functional F: '1l' .... R is independent of irrelevant individuals if, when we split the set of agents into any two groups, the social preference among utility vectors in one of the groups is independent of the level at which we fix the utilities of the agents in the other group (we should add that, if so desired, the condition can be formulated directly in terms of the social welfare functional). This is a sensible requirement It says that the distributional judgments concerning the inhabitants of, say, California, should be independent of the individual welfare levels of the inhabitants of, say, Massachusetts. As in the formally similar situation in consumer theory (Exercise 3.G.4), a social welfare function for I > 2 agents that is continuous, increasing, and independent of irrelevant individuals has, up to an increasing transformation, the addilively separable form W(u) = L, g,(u,); that is, W(u) is generalized utilitarian, possible nonsymmetric. Moreover, under weak conditions it is also true that the only social welfare functions that, up to increasing transformations, both admit an additively separable form and are invariant to common changes of origin are the utilitarian W(u) = L, b,u" Thus, from an invariance viewpoint we can arrive at the utilitarian form for a social welfare function by two roads: one, Proposition 22.0.3, is based on invariance to independent changes of origins; the other, just mentioned, is based on independence of irrelevant individuals and invariance to common changes of origins. See Maskin (1978) for more on this. Example 22.0_1: Fix an alternative x* and define a social welfare functional F(') by associating to every profile of individual utility functions (ii., ... , ii,) the social preference relation generated by a utility function Vex) = L, g,(u,(x) - ii~x*». Then, informally, this social welfare functional is both invariant to independent changes of origins and independent of irrelevant individuals, but it is neither utilitarian nor dictatorial. Note, however, that this functional cannot be generated from a social welfare function because it is not pairwise independent the social preference among two alternatives may depend on the ueiliey of ehe lhird afternative x* . •

-- --

BARGAININQ

SECTION

22.E:

THE

AXIOMATIC

BARGAINING

bargaining games (such as those considered in Appendix A of Chapter 9) by adopting an axiomatic point of view. Thus, the approach is more related to ideas of cooperative game theory (as reviewed in Appendix A of Chapter 18).2' For current purposes, the description of a bargaining problem among 1 agents is composed of two elements: a utility possibility set U c H' and a threat, or status-quo, point u* E U. The set U represents the allocations of utility that can be settled on if there is cooperation among the different agents. The point u* is the outcome that will occur if there is a breakdown of cooperation. Note that cooperation requires the unanimous participation of all agents, in which case, to repeat, the available utility options are given by U c R/. If one agent does not participate, then the only possible outcome is the vector 11*. This setup is completely general with two agents and, because of this, the two-agent case is our central reference case in this section. With more than two agents, the assumption is a bit extreme, since we may want to allow for the possibility of partial cooperation. We take up this possibility in Section 22.F. Throughout this section we assume that U c R' is convex and closed and that it satisfies thcfree disposal property U - R I. C U (i.e. if u' ::;; u and u E Uthen u' E U). As in Delinition 22.B.1, U c R' could be generated from a set of underlying 22 alternatives X, which could well include lotteries over deterministic outcomes. For simplicity we also assume that u* is interior to U and that {u E U: u ~ u*} is bounded.

Definition 22.E.l: A bargaining solution is a rule that assigns a solution vector f(V, u*) E V to every bargaining problem (V, u*).21 We devote the rest of this section to a discussion of some of the properties one may want to impose on f(·) and to a presentation of four examples of bargaining solutions: the egalitarian, the utilitarian, the Nash and the Kalai-Smorodinsky solutions. We should emphasize, however, that a strong assumption has already been built into the formalization of our problem: we are implicitly assuming that the solution depends on the set X of feasible alternatives only through the resulting utility values. Definition 22.E.2: The bargaining solution f(·) is independent of utility origins (IUO). or invariant Co independent changes of origins, if for any Il = (Il, ... ,Il,) E H' we have for every i f,(V', u* + Il) = f,(V, u*) + Il, whenever U' = {(u.

+ Il., ...

,u,

+ <X,): u E V}.

The IUO property says that the bargaining solution does not depend on absolute scales of utility. From now on we assume that this property holds. Note that we therefore always have feU, 11*) = feu - {II'}, 0) + II'. This allows us to normalize our problems to 11* = O. From now on we do so and simply write f(U) for f(U, 0).

22_E The Axiomatic Bargaining Approach In this section, we briefly review an alternative approach to the determination of reasonable social compromises. The role of a planner endowed with her own preferences is now replaced by that of an (implicit) arbitrator who tries to distribute the gains from trade or, more generally, from cooperation in a manner that reflects "fairly" the bargaining strength of the different agents. The origin of the theory is game-theoretic. However, it sidesteps the construction of explicit noncooperative

21. For general introductions to the material of this section, see Roth (1979). Moulin (1988). and Thomson (1995). 22. In principle. the underlying set X and Ihe corresponding utility functions on X could be dilTerent for dilTerent U ell'. For the Iheory that follows all that matters is the utility sel U. 23. Thus, a bargaining solution is a choice rule in the sense of Chapter 1. If an underlying alternalive sel X is kepi fixed and. Iherefore, the form of U, generated as in Definition 22.B.I, depends only on the utility functions. we can also regard the bargaining solution as a choice of function in the sense of Definition 2t.E.1.

APPROACH

839

840

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

BARGAINING

---

u,

SECTION

22.E:

THE

AXIOMATIC

BARGAINING

APPROACH

u,

Figure 22.E.2

The property o[ independence of

Symmelric Sel Flgur. 22.E.1

Threal Poinl

It should not be forgotten, however, that a change in the threat point (which will now show up as a change in U) will affect the point settled on. Definition 22.E.3: The bargaining solution f(·) is independent of utility units (IUU). or invariant to independent changes of units. if for any P= (Pt •...• P,) e R' with Pi> 0 for all i. we have fi(U') = PJi(U)

whenever U' = {(PtUt •....

for every i

P,u,): ue U}.24

With independence of utility origins (implicitly assumed in Definition 22.E.3), independence of utility units tells us that, although the bargaining solution uses cardinal information On preferences, it does not in any way involve interpersonal comparisons of utilities. Definition 22.E.4: The bargaining solution f(') satisfies the Pareto property (P). or is Paretian. if. for every U. flU) is a (weak) Pareto optimum. that Is. there is no u e U such that u, > fAU) for every i. Definition 22.E.S: The bargaining solution f(') satiSfies the property of symmetry (S) if whenever U c R' is a symmetric set (Le .. U remains unaltered under permutations of the axes;2S see Figure 22.E.l). we have that all the entries of flU) are equal. The interpretation of the symmetry property is straightforward: if, as reflected in U, all agents are identical, then the gains from cooperation are split equally.

Definition 22.E.6: The bargaining solution f(') satisfies the property of individual rationality (IR) if flU) ~ o. In words: the cooperative solution does not give any agent less than the threat point (recall also that, after normalization, we consider only sets U with 0 e U). It is a sensible property: if some agent got less than zero, then she would do better by opting out and bringing about the breakdown of negotiation. The next property is more substantial. 24. Geomelrically. U' is obtained from U by stretching the different axes by the rescaling factors ((J, ..... {J,). 25. More precisely. if U E U then u' e U for any u' differing from u only by a permutation of ils entries.

'"

The symmetry property [or bargaining solutions.

\'

841

----------------------------------------------------------------------

Threat Point

Definition 22.E.7: The bargaining solution satisfies the property of independence of irrelevant alternatives (IIA) if, whenever U' c U and flU) e U', it follows that flU') = flU).

The I1A condition says that if flU) is the "reasonable" outcome in U and we consider a U' that is smaller than U but retains the feasibility of f(U), that is, we only eliminate from U "irrelevant alternatives," then flU) remains the reasonable outcome (sec Figure 22.E.2). This line of justification would be quite persuasive if we could replace" reasonable" by "best." Indeed, if f( U) has been obtained as the unique maximizer on U of some social welfare function W(u), then the IIA condition is clearly satisfied [if f (U) maximizes W(·) on U then it also maximizes W(·) on U' c U]. We note that while the converse is not true, it is nonetheless the case that, in practice, the interesting examples where I1A is satisfied involve the maximization of some SWF. We proceed to present four examples of bargaining solutions. To avoid repetition, we put on record that all of them satisfy the Paretian, symmetry, and individual rationality properties (as well as, by the formulation itself, the independence of utility origins). You are asked to verify this in Exercise 22.E.I. In Exercise 22.E.2 you are asked to construct examples violating some of these conditions. Example 22.E.1: Egalitarian Solution. At the egalitarian solution fe('), the gains from cooperation are split equally among the agents. That is, for every bargaining problem U c IR t , f..(U) is the vector in the frontier of U with all its coordinates equal. Figure 22.E.3 depicts the case I = 2. Note also that. as illustrated in the figure, every f..(U) maximizes the Rawlsian social welfare function Min {u" ... , ut } on U. The egalitarian solution satisfies the IIA property (verify this). Clearly, for this olution. utility units are comparable across agents. and so the lUU property is not satisfied. 20 • Example 22.E.2: Utilitariall Solution. For every U we now let J.(U) be a maximizer of L, u, on U n IR~. If U is strictly convex, then this point is uniquely defined and. therefore. on the domain of strictly convex bargaining problems the IIA property is satisfied. As witt the previous example. the solution violates the IUU condition. Figure 22.E.4 illustrates the utilitarian solution in the case I = 2. • 26. Do not forget that the utility values arc nol absolute values but rather utility differences from the threal point. It is because of this that changes of origins do not matter.

irrelevant alternatives [or bargaining

solutions.

842

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

ANO

AXIOMATIC

.ARaAININa

---....... u, /

/

45'

//

--

SECTION

22.E:

THE

AXIOMATIC

.ARGAINING

APPROACH

",

Flgur. 22.E.3 (teft) The egalitarian solution for bargaining problems.

---.-_ _f.",.(U)/

Flgur. 22.E.6 Flgur. 22.E.4 (right)

"'-

u,

Threat Point

u,

length [b, f.,(U)] = length [I.(U), aJ

U

, ,

I t

i

I

I

", ~,

-----11 1"2 = constant

Figure 22.E.5

The Nash solution for bargaining problems.

'"

Threat Point

Example 22.E.3: Nash Solution. For this solution, we take a position intermediate between the two previous examples by requiring that f.(V) be the point in V n R/+ that maximizes the product oC utilities u l x ..• x u" or, equivalently, that maximizes Li In U i (this corresponds to the case p = 1 in Example 22.C.4). In Figure 22.E.5, we provide an illustration Cor I = 2.ln this case, the Nash solution has a simple geometry: !.( V) is the boundary point oC V through which we can draw a tangent line with the property that its midpoint in the positive orthant is precisely the given boundary point f.( V); see Exercise 22.EJ. As with the egalitarian and the utilitarian examples, the Nash solution satisfies the IIA property (because it is defined by the maximization oC a strictly concave Cunction). Interestingly, and in contrast to those solutions, the condition of independence of ucility units (IUU) holds for the Nash solution. To see this, note that Li In Ui 0 it satisfies "IU, = ... = '1,U, = Y. that is, ~, = ,.( I/u,) for every i. Consider now any u' E U. We have L, q,u;!> L, 'I'"' and therefore L,( I/u,)u; !> L, (1/11,)11,. Since (I/u, .... , I/u,) is the gradient of the concave function Li In u, at (,i" ... , ,i,), this implies L, In u; !> L, In Il, (see Section M.C of the Mathematical Appendix). Hence ,i maximizes L, In u, on U, that is, u= f.,(U).". See Figure 22.E.6 for an illustration of the argument. In Exercise 22.E.3 you should show the converse-that the Nash solution is simultaneously utilitarian and egalitarian for appropriate choice of units.

• The Nash solution was proposed by Nash (1950), who also established the notable fact that it is the only solution that satisfies all the conditions so far. Proposition 22.E.l: The Nash solution is the only bargaining solution that is independent of utility origins and units, Paretian, symmetric, and independent of irrelevant alternatives. 2 • Proof: We have already shown in the discussion of Example 22,E.4 that the Nash solution satisfies the properties claimed. To establish the converse, suppose we have a candidate solution f(·) satisfying all the properties. By the independence of utility origins, we can assume, as we have done so far, that f(·) is defined on sets where the threat point has been normalized to the origin. Given now an arbitrary V, let Ii = f.(V) and consider the sets

U' = {II E IR':

L II,/Ii, S; I}

and

27. To rcpc,ll in more geometric terms: the hyperplane with normal ('110 •..• ",) passing through ii leaves U belo,,"' it (because of the utilitarian property). Thus, it suffices to show that the set (/I: L, In II,';; L, In Ii,} lies above the hyperplane. BUI note that this follows from the fact that, because of the egalitarian property. (~" ... , ~,) is proportional to (I/Ii, •... , 1/';,), which is the gradient of the concave function Li In at U. 28. Note that we do not assume individual rationality explicitly: tt turns out to be implied by the other conditions.

"i

843

844

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

--- ---

SECTION

BARGAINING

(left) The N~sh solution is determmed uniquely from . the independell({h: nth) < n(i)})

represents how much agent i contributes when she joins the group of her predecessors in the ordering. This is the amount the predecessors would agree to pay i if she had

YAL U E

847

648

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

ANO

AXIOMATIC

all the negotiating power, that is, if she could make a take-it-or-leave-it offer. 31 Note that L, g•.• U) = v(l) for all permutations It. The agents do not come to us ordered. They all stand on the same footing. We may account for this by giving every agent the same chance of being in any position, thereby making all positions equally likely. Equivalently, we could take the (equal weighting) average of agent i contributions over all permutations It (there are I! of these). This is precisely the Shapley value solution. Definition 22.F.6: The Shapley value solution f. ---

BARGAINING

II X IRe I • I •

Is it plausible to you? If you wanted to escape from it, how would you do it? What does this all say about the independenoe axiom as applied to social decisions? Suppose now that there are I agents and that in addition to the social utility function U(·) we are also given I individual preference relations ~ defined on the same set of lotteries X. We assume that they are also represented by utility functions of the expected utility form

EXERCISES

for 1-1, .... 1.

22.B.I A Give sufficient conditions for the convexity of the first·best utility possibility set in the context of the exchange economies of Example 22.8.1. 22.B.2A Derive the first·order conditions stated in Example 22.B.2. 22.B.3 A Derive the first·order conditions (22.B.2) of Example 22.B.3. 22.B.4" Show as explicitly as you can that the utility possibility set of Example 22.B.4 may not be convex. 22.C.l A Suppose that the utility possibility set U c R' is symmetric and convex. Show that the social optimum of an increasing, symmetric, strictly concave social welfare function W(·) assigns the same utility values to every agent. [Note: A set U is symmetric if u e U implies u' E U for any u' obtained from u by a permutation of its entries.] Observe that the same conelusion obtains if W(·) is allowed to be just concave, as in the utilitarian case, but U is required to be strictly convex. 22.C.2A Suppose that we contemplate a decision maker in an original position (or ex·ante, or hehind the veil oj ignorance) before the occurrenoe of a state of the world that will determine

which of I possible identities the decision maker will have. There is a finite set X, of possible final outcomes in identity i. Denote X = X, x ... X X,. (a) Appeal to the theory of state·dependent utility presented in Section 6.E to justify a utility function on X of the form

U(x" ... , x,) = ",(x,)

+ ... + u,(x,).

Interpret and discuss the implications of this utility function for the usage of a purely utilitarian social welfare function. (b) Suppose that X, = ... = X, and the preference relation on X defined by the utility function in (a) is symmetric. What does this imply for the form of the utility function? Discuss and interpret. 22.C.3" We have N final social outcomes and we consider a set of alternatives X that is the set oftotteries over these outcomes. An alternative can be represented by the list of probabilities assigned to the different final outcomes, that is, p = (p" . .. , p.) where p. :?; 0 for every nand P,

+ ... + P. = I.

We assume that we are given a social preferenoe relation ~ on X that is continuous and conforms to the independence axiom. Thus, it can be represented by a utility function of the expected utility form U(p) = u,p,

+ ... + ".P •.

From now on we assume that this social utility function U(·) defined on X is given. (0) Suppose that there are two final outcomes and that they are specified by which of two individuals will reoeive a oertain indivisible object. Suppose also that social preferenoes are symmetric in the sense that there is social indifferenoe between the lottery that gives the object to individual 1 for sure and the lottery that gives the object to individual 2 for sure. Show that all the lotteries must then be socially indifferent. Discuss and interpret this conclusion.

851

-----------------------------------------------------------------------

We say that the social utility function U(·) is Parelian if we have U(p) > U(p') whenever U,(p) > U,(p') for every i. (b) Consider a case with N - 3 and I = 2 and illustrate, in the 2-dimensional simplex of lotteries, how the indifferenoe map of the utility functions of the two agents and of the social utility function fit together when the social utility function is Paretian. (c) Exhibit a case where the Paretian condition determines uniquely the social indifferenoe map (recall that we are always assuming the independenoe axiom fot social preferences!). Argue. however. that in general the Paretian condition does not determine uniquely the social indifferenoe map. In fact. exhibit an example where any social utility function is Paretian. (d) Argue (you can restrict yourself to N - 3 and I - 2) that if the social utility function U(p) is Paretian then it can be written in the form U(p) - Il,U,(p)

+ ... + Il,U,(p)

where Il, :?; 0 for every i and Il, oF 0 for some I. What does this conclusion say for the usage of a purely utilitarian social welfare function? Interpret the Il, weights, as well as the fact that they need not be equal across individuals. 22.C.4A The 'eximin ordering, or preference relation, on R' has been mentioned in footnote 11 of this chapter when discussing the Rawlsian SWF. It is formally defined as follows. Given a vector u = (u, •. ..• u,) let u' e A' be the vector that is the nondecreaslng rearrange"",nl or u. That is, the entries of u' are in nondecreasing order and its numerical values (mUltiplicities included) are the same as for u. We then say that the vector u is at least as good as the vector Q in the leximin order if u' is at least as good as Q' in the lexicographic ordering introduced in Example 3.C.1. (a) Interpet the definition ofthe leximin as a refinement ofthe Rawlsian preference relation. (b) Show that the leximin ordering cannot be represented by a utility function. It is enough to show this for I = 2. (c) (Harder) Show that the social optimum of a leximin ordering is a Pareto optimum. You can limit yourself to the case I = 3. 22.C.s B Consider the constant elasticity family of social welfare functions (Example 22.C.4). Argue that w,,(u) .... Min {u, •...• u,} as p .... 00. 22.C.6 A Suppose that U and U' are utility possibility sets and that we associate with them Pareto optimal utility outcomes ii e U and u' e U', respectively. Show graphically that: (8) It is possibk for U' to pass the strong compensation test over U and yet for the outcome with U' to be worse than the outcome with U, as measured by the purely utilitarian SWF.

(b) If the utility possibility sets are derived from a quasilinear economy and U' passes the weak compensation test over U. then it also passes the strong compensation test and, moreover, the outcome for U' is a utilitarian improvement over the outcome for U. Is this conclusion valid if we evaluate social welfare by a nonutilitarian SWF? 22.C.7 B Construct an explicit example of two Edgeworth box economies, differing only in their distributions of the initial endowments. such that the utility possibility set of each one

852

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

EXERCISES

BARGAININQ

passes the weak compensation test over the utility possibility set of the other, when the utility outcome in the laller is chosen to correspond to one of its competitive equilibria.

22.0.3 is valid under the weakened assumption that F is generated from a social preference relation on R I.)

22.C.8 A Suppose we have two utility possibility sets U, U' with respective outcomes U E U and u' E U'. We say that (U', u') passes the Kaldor compensation test over (U, u) if U' passes the weak compensation test over (U, u) and U does not pass the weak compensation test over

12.0.68 This exercise is concerned with social welfare functions satisfying expression (22.0.1).

(U',u').

(a) For I = 2, represent graphically a situation where Kaldor comparability is possible and one where it is not. (b) Observe that Kaldor comparability is asymmetric. Oefine your terms.

22.0.18 In this exercise we verify the indispensability of the assumptions of Proposition 22.0. I.

= lx, y}. The social

(a) Suppose there are three agents and only two alternatives, that is, X welfare functional is given by if and only if

ii,(x)

~

ii,(y) for every i

Y F(ii,. ii,. ii,)x

if and only if

ii,(y)

~

u,(x) for at least one i.

and Check that the social preference relation is always complete, that the social welfare functional cannot be represented by means of a social welfare function, and that only the condition on the number of alternatives fails from Proposition 22.0.1.

X

(e) Argue that if in (22.0. I) the function g(') is homogeneous of degree one and differentiable. then it must be linear (and so we arc back to the utilitarian case). 12.0.78 Consider the constant elasticity family of social welfare functions studied in Example

F(ii,. ii,. ii,)y

(b) Now we have three agents and three alternatives, that is, X = welfare functional is given by

(c) Show that the symmetric Rawlsian social welfare function W(u) = Min lu" •.. , u/l can be wrillen in the form (22.0. I). What about nonsymmetric Rawlsian social welfare functions? [Him: Check the condition of invariance to common changes of origins.] (d) Give other examples satisfying (22.0.1), in particular, examples with g(.) ~ 0 and intermediate between the utilitarian and the Rawlsian cases. Interpret them.

(e) Show that Kaldor comparability may not be transitive.

X

(a) Show that the nonsymmetric utilitarian function W(u) = L b,u , can be wrillen in the form (22.0.1). (b) Show that if W(·) is symmetric and g(O) = 0 then g(') ~ O.

lx, y, z}.

The social

F,(ii" ... , iii) y F,(ii ... , iii) z "

for every (ii" ... , iii) E fli. Show that, again, no representation by means of a social welfare function is possible and that, of the assumptions of Proposition 22.0.1, only the Paretian property fails to be satislied. (c) Exhibit an example in which the only condition of Proposition 22.0.1 that fails to be satisfied is pairwise independence. 22.0.2A Carry out the verification requested in the second paragraph of the proof of Proposition 22.0. I. 22.0.3 A In text. 22.0.4A A social welfare functional F is lexically dictatorial if there is a list of n > 0 agents h" ... , h. such that the strict preference of h, prevails socially, the strict preference of h, prevails among the alternatives for which hi is indifferent, and so on. (a) Show that if F is lexically dictatorial then F is Paretian, is pairwise independent, and does not allow for interpersonal comparisons of utility. (b) Under what conditions can a social welfare functional that is lexically dictatorial be generated from a social welfare function? (c) Show that if a dictatorial social welfare functional is generated from a social welfare function W(u) = L b,u then b, = 0 for every i distinct from the dictator. "

22.0'sc Complete the proof of Arrow's impossibility theorem along the lines suggested in the last paragraph prior to the small-type text at the end of Section 22.0. (Assume that Proposition

22.C.4. (a) Show that the social welfare functionals derived from SWFs in this family are invariant to common changes of units. (b) Show that the only members of this family which arc also invariant to common changes of origins, and therefore admit a representation in the form (22.0.1), are the purely utilitarian (i.e., p = 0) and the Rawlsian (i.e., p = 00). 22.0.8 8 This is an exercise on the property of invariance to common ordinal transformation. (a) Show that the symmetric, Rawlsian social welfare function satisfies the property. (b) Show that the anti-Rawlsian function W(u) = Max lui, ... , u/l also satislies it. (c) Show that the property is satislied for dictatorial social welfare functionals. (d) (Harder) Suppose that I = 2 and W(u) = W(u') for two vectors u, u' E R2, with

u; < u, < u, < ul' Assume also that W(·) is increasing. Show that the induced social welfare functional cannot be invariant to identical ordinal transformations. From this, argue informally (you can do it graphically) that for I = 2 a continuous, increasing social welfare function that is also invariant to identical ordinal transformations must be either dictatorial, Rawlsian, or anti-Rawlsian. 22.E.IA Verify that the bargaining solutions in Examples 22.E.I to 22.E.4 are independent of utility origins, Paretian, symmetric, and individually rational. It is enough if you do so for I = 2. 22.E.2A State nonsymmetric versions of the four bargaining solutions studied in Section 22.E (egalitarian. utilitarian, Nash, and Kalai-Smorodinsky). Motivate them. 22.E.3 8 This is an exercise on the Nash solution. (a) Verify that for I = 2, [.(U) is the boundary point of U through which we can draw a tangent line with th. property that its midpoint in the positive orthant is precisely the given boundary point J.( U). (b) Verify that if U c RI is a bargaining problem then there are rescaling units for the individual utilities with the property that the Nash solution becomes simultaneously egalitarian and utilitarian. 22.E.4A Verify that the Kalai-Smorodinsky solution satisfies the property of independence of utility units but violates the property of independence of irrelevant alternatives. You can restrict yourself to I = 2.

853

854

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

22.E.5 B This is an exercise on Ihe monolonicily property. (0) Show Ihat Ihe egalitarian solulion is the only bargaining solution that is independent of ulilily origins, Paretian, symmetric and monotonic. [Hint: Consider first a family of symmetric ulility possibility sels with linear boundaries. Notice then that for any two sets U, U' we always have U,... U' c U and U ,... U' c U'.]

(b) (Harder) Suppose that f(') is a bargaining solution that is independent of utility origins, Paretian, and strongly monotonic [if U c U' then flU) !> flU') and, in addition, if flU) is interior to U' then f(U)« flU')]. Show that there is a curve in R' starting at the origin and strictly increasing such that, for every U, f(U) is the intersection point of the boundary of U with this curve. You can restrict yourself to the case I = 2. 22.E.6c Let I = 2. A bargaining solution fO is partially monotone if when U c U' and u'(U) = u'(U'), that is, U' expands U only in the direction of agent j ~ i, we have fi( U') :l: fi( U) for j ~ i. Argue that the Kalai-Smorodinsky solution is characterized by the following properties: independence of utility origins and units, Pareto, symmetry, and partial monotonicity. [Hint: use sets U such that U' c U and II'(U) = II'(U'), II'(U) = II'(U')]. 22.E. 7 A Consider a family of bargaining solutions f'(') such that, for every set of agents I, f'(') is independent of utility origins and is generated by maximizing the social welfare function L. g(u,) on normalized bargaining problems U c R', where g(') is increasing, strictly concave, and independent of the particular I considered. Show that the family f' is consistent. 22.E.Sc Show by example that the Kalai-Smorodinsky solution is not consistent. It is enough to consider three agents and its subgroups of two agents. 22.E.9 A This exercise is aimed at showing the independence of the assumptions of Proposition 22.E.1. To this effect, give five examples such that for each of the five assumptions of Proposition 22.E.1 there is one of the examples that violates this assumption but satisfies the

remaining four.

EXERCIIEI

22.F.1 A Show that in the transferable utility case any bargaining solution that is invariant to independent changes of origin, symmetric, and Paretian divides the gains from cooperation equally among the agents. 22.F.2A Show that the Shapley value cooperative solution presented in Section 22.F satisfies the following properties: invariance to independent changes of utility origins, in variance to common changes of utility units, Paretian, symmetry, and the dummy axiom. 22.F.3 A Suppose that for a given set of agents I we take two characteristic forms v and v' and consider their sum v + v'; that is, v + v' is the characteristic form where (v + v'XS) = v(S) + v'(S) for every ScI. (0) Verify that the Shapley value is linear in the characteristic form; that is, /.,(v f,,(v) + f,,(v') for all v, v' and i.

+ v') =

(b) Interpret the linearity property as a postulate that agents are indifferent to the timing of resolution of uncertainty when we randomize among bargaining situations. 22.F.4 c The linearity property of the previous exercise can be restated in a perhaps more intuitive form. We say that a characteristic form v(') is a IInanimity game if for some Tel we have that v(S) = v(T) if T c S, and v(S) = 0 otherwise (thus, the bargaining situations of Section 22.E correspond to T = I). (0) Show that the independence of utility origins and invariance to common changes of utility units, Pareto, symmetry, and dummy axiom properties imply that, for a unanimity game v('), any cooperative solution f(·) assigns the values [,(v) = (1/nv(1) if JET, and J.(v) = 0 otherwise.

(b) We say that the cooperative solution f(·) is weakly linear if for any v and v' differing only by a unanimity game [i.e., there is Tel and «E R such that .'(S) - v(S) + « if T c S, and v'(S) v(S) otherwise] we have that f,(v') f,(v) + «IT if JET, and ft.u) - ft.v) otherwise. Show that if, in addition to the properties listed in (a), the cooperative solution f(') is weakly linear, then it is fully linear, that is, f(v + v') f(v) + f(v') for any two characteristic forms v and v'.

=

=

=

22.E.l0A Give an example of a utilitarian bargaining solution (Example 22.E.2) that violates the property of independence of irrelevant alternatives. [Hint: It suffices to consider I = 2. Also, the violation should involve sets U that are convex but not striclly convex.] 22.E.l1 c Go back to the infinite horizon Rubinstein's bargaining model discussed in the Appendix A to Chapter 9 (specifically, Example 9.AA.2). The only modification is that the two agents are risk averse on the amount of money they get. That is, each has an increasing, concave, differentiable utility function 1I,(m,) on the nonnegative amounts of money that they receive. The factor of discount ~ < I is the same for the two agents. Also 11,(0) = O. The total amount of money is m. (a) Write down the equations for a subgame perfect Nash equilibrium (SPNE) in stationary strategies. Argue that there is a single configuration of utility payoffs that can be obtained as payoffs of a SPNE in stationary strategies. (b) Consider the utility possibility set

U = {(II,(m,), 1I,(m,» E R': m, + m, = m} - R~. Show that if ~ is close to 1 then the payoffs of a SPNE in stationary strategies are nearly equal to the Nash bargaining solution payoffs. (c) (Harder) Argue that every payoff configuration of a SPNE can be obtained as the payoff configuration of a SPNE in stationary strategies. Thus, the uniqueness result presented in Example 9.AA.2 extends to the case in which the agents have strictly concave, possibly different, utility functions for money.

855

--....... -------------------------------------------------------~~~==

BARGAININQ

(c) Show that the Shapley value is the only cooperative solution that satisfies the following prope.'ties: independence of utility origins and in variance to common changes of utility units, Parellan, symmetry, dummy axiom, and linearity. 22.F.sc In this exercise we describe another cooperative solution for a game in characteristic form: the nucleolus. For simplicity we do it for the particular case in which 1= 3, v(1) = v(2) = v(3) = 0, and 0 !> v(S) !> v(l), for any group S of two agents. Given a ulility vector II = (II" II" II,) :l: 0 and an ScI the excess of S at II is e(u, S) = v(S) - L,.s u,. We define the first maximllm excess as m,(II) = Max {_(II, S): 1 < #S < 3}. Choose a two-agent coalition S such that m,(u) = e(u, S). Then we define the second maximum excess as m,(II) = Max {e(II, S'): I < #S' < 3 and S' ~ S}. We say that an exactly feasible [i.e., II, = v(l)] utility profile II = (II" II" 1I,):l: 0 is in thr nucleolus if for any other such profile II' we have either m,(II) < m,(u') or m,(u) = m,(u') and m,(II) !> m,(u').

L,.,

(a) Show that if u = (u" u" u,) is in the nucleolus then either the three excesses for two-agent coalitions are identical or two are identical and the third is larger. (b) Show that there is one and only one utility profile in the nucleolus. [Hint: Argue first that there is a two-agent coalition S such that e(u, S) = m,(u) for every profile in the nucleolus.] From now on we refer to this profile as the nucleolus solution. (c) Argue that the nucleolus solution is symmetric.

CHAPTER 856

CHAPTER

22:

ELEMENTS

OF

WELFARE

ECONOMICS

AND

AXIOMATIC

BARGAININQ

---......

(d) Suppose that agent I is a dummy. Then ", = 0 at the nucleolus solution. (e) Suppose that tv(J):$ v(S) for any coalition S of two agents. Show then that at the nucleolus profile the three excesses for two-agent coalitions are identical.

Incentives and Mechanism

23

(f) Compute and compare the Shapley value and the nucleolus for the characteristic form: v(l) = v(2) = v(3) = 0, v({i, 2}) = v{(l, 3}) = 4, v({2, 3}) = 5, v(J) - 6. (g) Show that if the core is nonempty (see Appendix A to Chapter 18 for the definition of the core in this context) then the nucleolus utility profile belongs to the core.

Design

22.F.6" Consider a regulated firm that produces an output by means of a cost function c(q). Assuming a quasilinear economy, the consumer surplus generated by q is Seq). (a) Suppose that c(q) is strictly concave (i.e., strictly increasing returns to scale). Show that at the first·best price the firm will not cover costs. Conversely, for any q suppose that the price p(q) is determined so that the cost is covered; that is, p(q) = c(q)/q. Show that if q is then determined so as to have p(q) = S'(q), we will not reach the first-best optimum. l11ustrate graphically. (b) Suppose that the quantity produced, q, has to be determined under the constraint that with p = S'(q) we have pq :2: c(q). Solve this second-best weIrare problem. l11ustrate graphically.

(e) Interpret the units of output as "projects." For any production decision q, what is the cost allocation suggested by the Shapley value?

22.F.7 c This exercise is similar to Exercise 22.F.6, except that the firm now produces two outputs under the separable cost functions c,(q,), c,(q,). The surplus S,(q,) + S,(q,) is also separable. (a) The second-best problem [first studied by Boiteux (1956)J is now richer than in Exercise 22.F.6. Suppose that the quantities q" q, have to be determined so that with p, = S'(q,) and p, = S'(q,) we have p,q, + p,q, :2: c,(q,) + c,(q,) (equivalently, at the chosen prices demand must be served and cost covered). Derive first-order conditions for this problem. Make them as similar as possible to the Ramsey formula of Example 22.B.2. (b) (Harder) Interpret the units of outputs as projects. Suppose that these units are very small, so that a given production decision (q" q,)>> 0 represents the implementation of many projects of each of the two types. Can you guess, given (q" q,), what is an approximate value for the cost allocation suggested by the Shapley value? [Hint: For most orderings of projects, any particular project will have preceding it an almost peneet sample of all the projects.J (e) Suppose that for the productions (9" 9,), the Shapley value cost allocation assigns cost per unit of c, and c, (note that "projects" of the same type receive the same cost imputation). Suppose also that c, = oS,(q,)/oq, and c, = oS,(ii,)/oq,. Interpret. Argue that, in general, these productions will not correspond to either the first-best or the second-best optima of the problem.

23.A Introduction In Chapter 21, we studied how individual preferences might be aggregated into social preferences and ultimately into a collective decision. Howe~er, an .im~~rtant !eature of many settings in which collective decisions must be made tS that mdlVlduals actual preferences are not publicly observable. As a result, in one way or another, individuals must be relied upon to reveal this information. In this chapter, we study how this information can be elicited, and the extent to which the information revelation problem constrains the ways in which social decisions can respond to individual preferences. This topic is known as the mechanism design problem.

Mechanism design has many important applications throughout economics. The design of voting procedures, the writing of contracts among parties who will come to have private information, and the construction of procedures for deciding upon public projects or environmental standards are all examples. I The chapter is organized as follows. In Section 23.B, we introduce the mechanism design problem. We begin by illustrating the difficulties introduced by the need to elicit agents' preferences. We also define and discuss the concepts of social choice functions (already introduced in Section 21.E), ex post efficiency, mechanisms, implementation, direct revelation mechanisms, and trutliful implementation. In Section 23.C, we identify the circumstances under which a social choice function can be implemented in dominant strategy equilibria when agents' preferences are private informdtion. Our analysis begins with a formal statement and proof of the revelation principle, a result that tells us that we can restrict attention to direct revelation mechanisms that induce agents to truthfully reveal their preferences. Using this fact, we then study the constraints that the information revelation

I. Simple examples of the last two applications were encountered in Sections 14.C and J I.E, respectively. 857

858

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIGN

----------------------------------------------------------------~

problem puts on the set of implementable social choice functions. We first present the important Gibbard-Satterthwaite theorem, which provides a very negative conclusion for cases in which individual preferences can take unrestricted forms. In the rest of the section, we go on to study the special case of quasilinear environments, discussing in detail Groves-Clarke mechanisms. In Section 23.D, we study implementation in Bayesian Nash equilibria. We begin by discussing the expected externality mechanism as an example of how the weaker Bayesian implementation concept can allow us to implement a wider range of social choice functions than is possible with dominant strategy implementation. We go on to provide a characterization of Bayesian implementable social choice functions for the case in which agents have quasilinear preferences that are linear in their type. As an application of this result, we prove the remarkable revenue equivalence rheorem for auctions. In Section 23.E, we consider the possibility that participation in a mechanism may be voluntary and study how the need to satisfy the resulting parricipation ('onsrraillls limits the set of implementable social choice functions. Here we prove the important Myerson-Satterthwaite theorem, which shows that, under very general conditions. it is impossible to achieve ex post efficiency in bilateral trade settings when agents have private information and trade is voluntary. In Section 23.F, we discuss the welfare comparison of mechanisms. defining the notions of ex ante and interim incentive efficiency, and providing several illustrations of the computation of welfare optimal Bayesian mechanisms. Appendices A and B are devoted to, first, a discussion of the issue of multiple equilibria in mechanism design and. second, the issue of mechanism design when agents know each others' types but the mechanism designer does not (so-called complete information environments). References for further reading are provided at the start of the various sections. We would be remiss. however. not to mention here two early seminal articles: Mirrlees (1971) and Hurwicz (1972).

23.B The Mechanism Design Problem In this section. we provide an introduction to the mechanism design problem that we study in detail in the rest of the chapter. To begin. consider a setting with I agents. indexed by i = I •...• I. These agents must make a collective choice from some set X of possible alternatives. Prior to the choice. however. each agent i privately observes his preferences over the alternatives in X. Formally. we model this by supposing that agent i privately observes a parameter. or signal. Ii, that determines his preferences. We will often refer to Ii, as agent fs rype. The set of possible types for agent i is denoted 0,. Each agent i is assumed to be an expected utility maximizer, whose Bernoulli utility function when he is of type 0, is u,(x. 0,). The ordinal preference relation over pairs of alternatives in X that is associated with utility function u,(x,O,) is denoted ;t,(O,). Agent i's set of possible preference relations over X is therefore given by

fJP, = {;t,: ;t, = ;t,(O,) for some 0, e 0,}. Note that because 0, is observed only by agent i. in the language of Section 8.E

--

SECTION

22.1:

THE

MECHANIIM

DEIION

we are in a setting characterized by incomplete information. As in Section 8.E, we suppose that agents' types are drawn from a commonly known prior distribution. In particular. denoting a profile of the agents' types by 9 = (0 1 , . . . ,0,), the probability density over the possible realizations of 9 e 0 1 x ... X 0, is .p(.). The probability density .p(.) as well as the sets 0 ..... ,0, and the utility functions u;(·.O,) are assumed to be common knowledge among the agents. but the specific value of each agent fs type is observed only by i. 2 Because the agents' preferences depend on the realizations of 0 = (0 1 , •••• 0,). the agents may want the collective decision to depend on O. To capture this dependence formally. we introduce in Definition 23.B.I the notion of a social choice function. a concept already discussed in Section 21.E.3

Dellnltlon 23.B.1: A social choice function is a function f: 0 1 x ... x 0 t - X that. for each possible profile of the agents' types (0 1, ••• • 0,). assigns a collective choice f(Ol.···,O,)eX·

One desirable feature for a social choice function to satisfy is the property of ex post efficiency described in Definition 23.B.2. Dellnltlon 23.B.2: The social choice function f: 0 1 x ... X 0, - X is ex post efficient (or Paretian) if for no profile 0 = (0 1, •.•• 0,) is there an x e X such that Ui(X, Oil ui(f(O), 0i) for some i. Definition 23.B.2 says that a social welfare function is ex post efficient if it selects, for every profile 0 = (0 1, •••• 0,). an alternative f(O) e X that is Pareto optimal given the agents' utility functions UI(·'OI) •...• u,(·.O,). The problem faced by the agents is that the Ois are not publicly observable, and so for the social choice f(OI, .... 0,) to be chosen when the agents' types are (0 1, •••• 0,). each agent i must be relied upon to disclose his type 9,. However. for a given social choice function f(·). an agent may not find it to be in his best interest to reveal this information truthfully. We illustrate this information revelation problem in Examples 23.B.1 through 23.B.4. which range from very abstract to more applied settings.

2. The formulation here is restrictive in one sense: in some settings of interest. agents' preferences over outcomes depend not only on their own observed signals bUI also on signals observed by others (e.g .• agent ts preferences over whether 10 hold a picnic indoors may depend on agent j's knowtedge of likely weather condilions). Through most of this chapler. we focus on the case in which an agent's 'lreferences depend only on his own signal, known as the private values case. We generalize our analysis in Section 23.F. 3. In Section 21.E an agent's type was equivalent to his ordinal preferences over X, and so a social choice function was defined there simply 35 a mapping from Uti x ... X 91, to X. Moreover, it was assumed there that for all i we have {II, = 11, the set of all possible ordinal prderence orderings on X. 4. Two points should be noted about this definition. First. it restricts attention to delerministic social choice funclions. This is largely for exposilional purposes; ahhough much of Ihe chapter considers deterministic social choice functions. in Sections 23.0 to 23.F we allow social choice functions that assign lotreries over X. Second, as in Section 21.E, we limit our attention to singte·vatued choice functions.

PROBLEM

859

860

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIOH

Example 23.B.I: An Abstract Social Choice Selling. In the most abstract case, we are given a set X and, for each agent i, a set fJl l of possible rational preference orderings on X. To consider a very simple example, suppose that X = {x, y, z} and that J = 2. Suppose also that agent I has one possible type, so that 0, = {B,}, and that agent 2 has two possible types, so that 0, = {O'"Oi}. The agents' possible preference orderings fJl l = (;::I(B I )} and fJl 2 = (;::,(8'2), ;::2(02)} are given by

;::1 (B,)

----...

--

SEC T ION

23. B:

THE

ME C HAN I I M

DEli 0 N

z

y

y

Figure 23.B.1

In the social choice runction that selccts a Walrasian equilibrium ror each prererence profile, agent 2 has an incentive to claim to be type when he is really type 8;.

y x

x

[A higher positioned alternative is strictly preferred to a lower positioned one; so, for example, x :>,(B,) y :>,(B I ) z.] Now suppose that the agents wish to implement the ex post efficient social choice function f(·) with

Ao"0:,) = y

and

f(B" 0i) = x.

8,

0,

are (;:: ,(ii,), ;::2 (Oi))], an allocation that he strictly prefers to f(O"Oll when his preferences are ;::2 (0:,). _

If so, then agent 2 must be relied upon to truthfully reveal his preferences. But it is apparent that he will not find it in his interest to do so: When O2 = 0;, agent 2 will wish to lie and claim that his type is 0'2' In abstract social choice settings, a case of central interest arises when fJl l is, for each agent i, equal to fJI, the set of all possible rational preference relations on X. In this case, an agent has many possible false claims that he can make and, intuitively, it may be very difficult for a social choice function always to induce the agents to reveal their preferences truthfully. We will see a formal illustration of this point in Section 23.C when we present the Gibbard-Satterthwaite theorem. _

Example 23.B.3: A Public Project. Consider a situation in which J agents must decide whether to undertake a public project, such as building a bridge, whose cost must be funded by the agents themselves. An outcome is a vector x = (k, I" ... , I,), where k E (0, I} is the decision whether to build the bridge (k = I if the bridge is built, and k = 0 if not), and II E R is a monetary transfer to (or from, if II < 0) agent i. The cost of the project is c;:: 0 and so the set of feasible alternatives for the J agents is

Example 23.B.2: A Pure Exchange Economy. Consider a pure exchange economy with L goods and J consumers in which agent i has consumption set R~ and endowment vector WI = (W'h"" w u )>> o(see Chapter 15). The set of alternatives is

The constraint Lltl ~ -ck reflects the fact that there is no source of outside funding for the agents (so that we must have c + LI I, :s 0 if k = I, and LI II :S 0 if k = 0). We assume that type O;'s Bernoulli utility function has the quasilinear form

X = {(k, I" ... , I,): k E

to, I}, liE R for all i, and L II ~ -ck}. I

X

=

«x" ... ,XI): xIER~

and LXf/ ~ LW(i for I

(=

I, ... L}.

I

In this setting it may be natural to suppose that fJI;, each consumer i's set of possible preference relations over alternatives in X, is a subset of fJl E , the set of individualistic (i.e., depending on XI only), monotone, and convex preference relations on X. To consider a simple example, suppose that J = 2, that consumer I has only one possible type, so that 0, = (B,} and fJI, = (;::, (B,)}, and that for consumer 2 we have;1l2 = iJI E • Imagine then that we try to implement a social choice function that, for each pair (;::, (0,), ;::, (0,)), chooses a Walrasian equilibrium allocation (note that this social choice function is ex post efficient). As Figure 23.B.I illustrates, consumer 2 will not generally find it optimal to reveal his preferences truthfully. In the figure, f(BI,Oll is the unique Walrasian equilibrium when preferences are (;::, (ii,), ;::2 (0'2)) [it is the unique intersection ofthe consumers' offer curves OC, and OC:, occurring at a point other than the endowment point]. However, by claiming that he has type Oi, which has as its offer curve OC;, consumer 2 can obtain the allocation f(OI' 0;) [the unique Walrasian equilibrium allocation when preferences

861

-E-4--------------------------~~

;::2 (0:')

x

PRO B L E II

uI(x, 01) = Olk + (ml + II), where lii l is agent i's initial endowment of the numeraire ("money") and 0, E R. We can then interpret 01 as agent i's willingness to pay for the bridge. In this context, the social choice function f(O) = (k(O), 1,(0), ... ,1,(0» is ex post efficient if, for all 0,

k(O)=t

ifLO,;:: c, I

(23.B.I)

otherwise,

and

L 1,(0) =

-ck(O).

(23.B.2)

I

Suppose that the agents wish to implement a social choice function that satisfies (23.B.I) and (23.B.2) and in which an egalitarian contribution rule is fOllowed, t~at is, in which 11(0) = -(cj I)k(O). To consider a simple example, suppose that 0, = to,} for i '# I (so that all agents other than agent I have preferences that are known) and

862

CHAPTER

23:

'NCENT'VES

AND

MECHAN'SM

DES,GN

IICTION

L,,., e,

L,,., e, L,,..e, -

e,

~I = (c - '0"L e, + &) + tit. - ~I =

(C(l-I

I) _

L

,,..

e, +&) + m•.

Example 23.B.4: Allocation of a Single Unit of an Indivisible Private Good. Consider a setting in which there is a single unit of an indivisible private good to be allocated to one of 1 agents. Monetary transfers can also be made. An outcome here may be represented by a vector x = (y" ... , Y/O t., ... , t,), where y, = I if agent i gets the good, y, = 0 if agent i does not get the good, and t, is the monetary transfer received by agent i. The set of feasible alternatives is then

= {(y"

e

e

(l

But, for & > 0 small enough, this is less than m" which is agent I's utility if he instead claims that 8. = 0, a claim that results in the bridge not being built. Thus, agent I will prefer not to tell the truth. Intuitively, under this allocation rule, when agent I causes the bridge to be built he has a positive externality on the other agents (in the aggregate). Because he fails to internalize this effect, he has an incentive to understate his benefit from the project. _

x

DIIIGN

eo

Agent l's utility in this case is

+ m. -

MICHANII ..

e.

e,

8.

TNI

Two special cases that have received a great deal of attention in the literature deserve mention. The first is the case of bilateral trade. In this case we have I ... 2; agent I is interpreted as the initial owner of the good (the "seller"), and agent 2 is the potential purchaser of the good (the "buyer"). When ~ > there are certain to be gains from trade regardless of the realizations of 8. and 82; when ~, > 2 there are certain to be no gains from trade; finally, if ~2 < and ~, < 2 then there may or may not be gains from trade, depending on the realization of 8. The second special case is the auction setting. Here, one agent, whom we shall designate as agent 0, is interpreted as the seller of the good (the" auctioneer") and is assumed to derive no value from it (more generally, the seller might have a known value 80 = different from zero). The other agents, I, ... , I. are potential buyers (the "bidders").5 To illustrate the problem with information revelation in this example. consider an auction setting with two buyers = 2). In the previous examples, we simplified the discussion of information revelation by assuming that only one agent has more than one possible type. We now suppose instead that both buyers' (privately observed) valuations 8, are drawn independently from the uniform distribution on [0, I] and that this fact is common knowledge among the agents. Consider the social choice function f(8) = (Yo(8), y.(8), Y2(0), 10 (8), t.(8), t 2(8» in which

0, = [0, (0). Suppose also that C> > c(1 - 1)/1. These inequalities imply, first, that with this social choice function agent I's type is critical for whether the it is; if 8, < C it is not), and that the bridge is built (if 8, ~ C sum of the utilities of agents 2, ... , I is strictly greater if the bridge is built under c(1 - 1)/1> OJ. this egalitarian contribution rule than ifit is not built [since Let us examine agent I's incentives for truthfully revealing his type when e, = c - L,o" + & for & > O. If agent I reveals his true preferences, the bridge will be built because

L,,., e,

21 •• :

... ,y"t., ... ,t,):y,E{O, I} and t,EIR for all i, LY'

,

e

i)

=

if 0,

82 ;

= 0 if 8. < O2

(23.8.3)

if 8, < 8 2 ;

= 0 if 8. ~ 8 2

(23.B.4)

yo(8) = 0

for all 8

(23.8.5)

t.(8) = -8.y,(8)

(23.B.6)

t 2 (8) = - 8 2y,(8)

(23.8.7)

t o(8) = -(t.(8)

= I, and LI,;5; OJ.

e,y, + (Iii, + I,),

where 1ft, is once again agent i's initial endowment of the numeraire ("money"). Here IIi E IR can be viewed as agent i's valuation of the good, and we take the set of possible valuations for agent i to be 0, = [Q" 0,] c R. In this situation, a social choice function flO) = (y.(O), ... , y,(O), 1.(8), ... ,1,(0)) is ex post efficient if it always allocates the good to the agent who has the highest valuation (or to one of them if there are several) and if it involves no waste of the numeraire; that is, if for all 0 = (0., ... ,8,) E 0. X ••• x 0/0 y,(O)(O, - Max{O., ... , O,}) = 0

~

h(O) = I Y2(8) = I

+ t 2(8)).

(23.B.8)

In this social choice function. the seller gives the good to the buyer with the highest valuation (to buyer 1 if there is a tie) and this buyer gives the seller a payment equal to his valuation (the other, low-valuation buyer makes no transfer payment to the seller). Note that f(·) is not only ex post efficient but also is very attractive for the seller: if f(·) can be implemented, the seller will capture all of the consumption benefits that are generated by the good. Suppose we try to implement this social choice function. Assume that the buyers are expected utility maximizers. We now ask: If buyer 2 always announces his true value, will buyer I find it optimal to do the same? For each value of 8., buyer l's problem is to choose the valuation to announce, say ~" so as to solve

We suppose that type e;,s Bernoulli utility function takes the quasilinear form u,(x,

PlloallM

863

------------------------------------------------------------------------

Max i,

(II. - ~.) Prob (8 2 ~ ~.)

or Max i,

for all i

and

(8. - ~.)~ •.

S. Note that, for ease of notation, we take there to be I

,

.~

+ 1 agents

in the auction setting.

864

CHAPTER

23:

INCENTIYES

AND

MECHANISM

DESIGN

The solution to this problem has buyer 1 set 01 = Bd2. We see then that if buyer 2 always tells the truth, truth telling is not optimal for buyer I. A similar point applies to buyer 2. Intuitively, for this social choice function, a buyer has an incentive to understate his valuation so as to lower the transfer he must make in the event that he has the highest announced valuation and gets the good. The cost to him of doing this is that he gets the good less often, but this is a cost worth incurring to at least some degree." Thus, we again see that there may be a problem in implementing certain social choice functions in settings in which information is privately held. (For a similar point in the bilateral trade context, see Exercise 23.B.2.) Although buyers have an incentive to lie given the social choice function described in (23.B.3) to (23.B.8), this is not true of all social choice functions in this auction setting. To see this point, suppose we try to implement the social choice function j(.) that has the same allocation rule as that above [i.e., in which the functions y,(.) for i = 0, 1,2 are the same as those described in (23.8.3) to (23.B.5)] but instead has transfer functions 1,(0) = -O,y,(O) I,(B) = -OIY,(O) 10 (0) = -(1,(0)

+ t,(B».

In this social choice function, instead of buyer i paying the seller an amount equal to his own valuation 0, if he wins the object, he now pays OJ, where j ¢' i; that is, he pays an amount equal to the second-highest valuation. Consider buyer I's incentives for truth telling now. If buyer 2 announces his valuation to be 0, $ BI , buyer I can receive a utility of (0, - 0,) ;:: 0 by truthfully announcing that his valuation is B,. For any other announcement, buyer I's resulting utility is either the same (if he announces a valuation of at least 0,) or zero (if he announces a valuation below 0,). So if 0, $ 0" announcing the truth is weakly best for buyer I. On the other hand, if buyer 2's announced valuation is (), > B" then buyer l's utility is 0 if he reveals his true valuation. However, buyer 1 can receive only a negative utility by making a false claim that gets him the good (a claim that his valuation is at least (),). We conclude that truth telling is optimal for buyer I regardless of what buyer 2 announces. Formally, in the language of the theory of games, truth telling is a weakly dominant strategy for buyer I (see Section 8.B). A similar conclusion follows for buyer 2. Thus, this social choice function is implementable even though the buyers' valuations are private information: it suffices to simply ask each buyer to report his type, and then to choose j(O).' • Examples 23.B.I to 23.B.4 suggest that when agents' types are privately observed the information revelation problem may constrain the set of social choice functions that can be successfully implemented. With these examples as motivation, we can now pose the central question that is our focus in this chapter: What social choice junctiolls can be implemented when agellls' types are privale information?

6. This /rade-otT is similar 10 that faced by a monopotist (see Section t2.B): when the monopolist raises his price, he lowers his sales but makes more on his remaining sales. 7. For other examples of implementable social choice [unctions, see Exercise 23.8.1.

-- ----

SECTION

23.8:

THE

MECHAN"M

DESIGN

To answer this question, we need in principle to begin by thinking of all the possible ways in which a social choice function might be implemented. In the above examples we have implicitly imagined a very simple scenario in which each agent i is asked to directly reveal 0, and then, given the announcements (0" ... ,0,), the alternative j(O" ... , 0,) E X is chosen. But this is not the only way a social choice function might be implemented. In particular, a given social choice function might be indirectly implemented by having the agents interact through some type of institution in which there are rules governing the actions the agents may take and hoW these actions translate into a social outcome. To illustrate this point, Examples 23.B.5 and 23.B.6 study two commonly used auction institutions. Example 23.B.5: Firsl-Price Sealed-Bid Auclion. Consider again the auction setting introduced in Example 23.B.4. In a first-price sealed-bid auction each potential buyer i is allowed to submit a sealed bid, b,;:: O. The bids are then opened and the buyer with the highest bid gets the good and pays an amount equal to his bid to the seller.· To be specific, consider again the case where there are two potential buyers (I = 2) and each 0, is independently drawn from the uniform distribution on [0, I]. We will look for an equilibrium in which each buyer's strategy M') takes the form MO,) = CI.,O, for CI., E [0, I]. Suppose that buyer 2's strategy has this form, and consider buyer I's problem. For each B, he wants to solve Max bl

(0, - b,) Prob(b,(B,)

$

b,).

:
Since buyer 2's highest possible bid is CI., (he submits a bid of CI., when 0, = I), it is evident that buyer I should never bid more than cx,. Moreover, since B, is uniformly distributed on [0, I] and b,(B,) $ b, if and only if B, $ (b,/cx,), we can write buyer I's problem as Max

(B, - b,)(bdCl.,)·

bIE{O.1I2J

The solution to this problem is b,(B,) =

{W' CI.,

if tB, $ ietB, >

CI." CI.,.

By similar reasoning, iftB, $ CI." if!B,>CI.,.

to,

Letting CI., = CI., = t, we see that the strategies b,(O,) = for i = 1,2 constitute a Bayesian Nash equilibrium for this auction. Thus, there is a Bayesian Nash equilibrium of this first-price sealed-bid auction that indirectly yields the outcomes specified by the social ~hoice function j(O) = (Yo(B), y,(B), y,(O), t o(9), 1,(0), t,(O»

8. If there are several highest bids, we suppose that the lowest numbered of these bidders gets the good. We could equally well randomize among tho highest bidders if there are more than one, but 'his would require that we expand 'he set of alternatives to A(X), the set of all lotteries over X. In fact. we do precisely this when we sludy auctions in Sections 23.0 and 23.F.

PROBLEM

865

866

C H A , TEA

23:

INC E N T lYE 8

AND

M E C HAN IBM

DE' I Q N

--------------------------~~~~----------------~ in which

y,(O) = I

• E C T ION

23. 8:

THE

M E C HAN

II M

DE' I Q N

a Bayesian game of incomplete information. That is, leUing ii,(sl •...• s" 0,) = u,(g(s, •... ,s,), 0,). the game

(23.B.9)

Yl(/I) = I

if /I, < 0l;

(23.B.l0)

[I, {S,}, {ii,(')},e, x .. · x e"t/>(')]

yo(/I) = 0

for all 0

(23.B.11)

is exactly the type of Bayesian game studied in Section R.E. Note that a mechanism could in principle be a complex dynamic procedure. in which case the elements of the strategy sets S, would consist of contingent plans of action (see Chapter 7)." For the auction setting, the first-price sealed-bid auction is the mechanism in which S, = IR+ for all i and, given the bids (b ..... , b,) E R~. the outcome function g(b" ... , b,) = ({y,(b" ... , b, {t,(b, •.... b, )}I-,) is such that

t,(O) = -!O,y/(O)

(23.B.12)

tl(O) = -tOlY'(O) totO) = -(t,(8)

(23.B.l3)

+ t,(/I».

(23.B.14)

nt-"

•

y,(b, ..... b.>

Example 23.8.6: Second-Price Sealed-Bid Auction" Once again, consider the auction setting described in Example 23.B.4. In a second-price sealed-bid auction each potential buyer i is allowed to submit a sealed bid, b, ~ O. The bids are then ~pened and the buyer with the highest bid gets the good, but now he pays the seller an amount equal to the second-highest bid.'o By reasoning that parallels that at the end of Example 23.B.4, the strategy b,(O.) .= 0, for all /I, E [0, I] is a weakly dominant strategy for each buyer i (see ExerCise 23.B.3). Thus, when I = 2 the second-price sealed-bid auction implements the social choice function flO) = (yo(O). y,(O), Yl(O). t o(8). t,(O), tl(O» in which y,(O) = I

if 0, ~ 0l;

=

0 if 8, < Ol

y,(O) = I

if 0, < 0l;

=

0 if /I, ~ /l l

YolO) = 0

for all /I

=I

if and only if i

= Min{j: b) =

Max{b, ..... bdl.

t,(b, •...• b,) = -b,y,(b l •••• , b,).

In the second-price sealed-bid auction, on the other hand. we have the same strategy sets and functions y,('), but instead t,(b ...... b,) = -Max{b):J I< i}y,(b ..... ,b,). A strategy for agent i in the game of incomplete information created by a mechanism r is a function s,: e, -+ S, giving agent i's choice from S, for each possible type in e, that he might have. Loosely put, we say that a mechanism implements social choice function f(·) if there is an equilibrium of the game induced by the mechanism that yields the same outcomes as f(·) for each possible profile of types 8 = (0" ...• 0,). This is stated formally in Definition 23.B.4. Deflnlllon 23.B.4: The mechanism r = (5, ..... 5,. g(.)) implements social choice function f (.) if there is an equilibrium strategy profile (sH·) ..... s1 (.)) of the game induced by r such that g(s~(O,) •...• 51(/1,)) = f(/I, •...• /I,) for all (/I, .... . 0.) E e, x ... x e,.

1,(0) = -/lly,(/I)

Il(O) = -O'Yl(O) 10(0) = -(1,(/1)

Note, however, that we have not specified in Definition 23.B.4 exactly what we mean by an Mequilibrium". This is because, as we have seen in Part II, there is no single equilibrium concept that is universally agreed upon as the appropriate solution concept for games. As a result, the mechanism design literature has investigated the implementation question for a variety of solution concepts. In Sections 23.C and 23.D we focus on two central solution concepts: dominant strategy equilibrium and Bayesian Nash equilibrium. ll Note also that the notion of implementation that we have adopted in Definition 23.B.4 is in one sense a weak one: in particular, the mechanism r may have more (/'an one equilibrium, but Definition 23.B.4 requires only that one of them induce outcomes in accord with f(·). Implicitly, then, Definition 23.B.4 assumes that. if multiple equilibria exist, the agents will play the equilibrium that the mechanism designer wants. Throughout the chapter we shall stick to this notion of implementation. Appendix A is devoted to a further discussion of this issue.

+ tl(/I» . •

Examples 23.B.5 and 23.B.6 illustrate that, as a general matter, we need to consider not only the possibility of directly implementing social choice functions by asking ag~nts .to ~evea~ their .types but also their indirect implementation through the design of institutions In whIch the agents interact. The formal representation of such an institution is known as a mechanism. Definition 23.B.3: A mechanism r = (5, ..... 5,. g(.)) is a collection of I strategy sets (5, ..... 5,) and an outcome function g: 5, x ... X 51 -+ X. A mechanism can be viewed as an institution with rules governing the procedure for making the collective choice. The allowed actions of each agent i are summarized by the strategy set S,' and the rule for how agents' actions get turned into a social choice is given by the outcome function g('). For~alIy, th.e mechanism r combined with possible types (e" ... , e/)' probabIlIty denSity t/>(.), and Bernoulli utility functions (u,(·), ... , ul (·» defines

I 1. Note also that we are representing the game created by a mechanism using its normal form. For all the analysis Ihat follows in Ihe text this will be sufficient. In Appendix B. however. we consider a case where the extensive form representation is used. 12. Appendix B considers several other equilibrium concepts in the special context of camp/ere information settings in which the players observe each others' types.

9. This auclion is also called a Vickrey aucr;on. afler Vickrey (1961). 10. If there is more than One highest bid. we again select the lowest-numbered of these bidders.

J

, A

0 8 LEM

867

,-----------------------------------------------------

868

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIGN

The identification of all social choice functions that are implementable may seem like a daunting task because, in principle, it appears that we need to consider all possible mechanisms-a very large set. Fortunately, an important result known as the revelation principle (to be formally stated and proven in Sections 23.C and 23.D) tells us that we can often restrict attention to the very simple type of mechanisms that we were implicitly considering at the outset, that is, mechanisms in which each agent is asked to reveal his type, and given the announcements (~" ... ,~,), the alternative chosen is f(~" ... , ~,) E X.13 These are known as direct revelation mechanisms, and formally constitute a special case of the mechanisms of Definition 23.B.3.

--

SECTION

STRATEGY

IMPLEMENTATION

Because of the revelation principle. when we explore in Sections 23.C and 23.D the constraints that incomplete information about types puts on the set of implementable social choice functions, we will be able to restrict our analysis to identifying those social choice functions that can be truthfully implemented. Finally, we note that, in some applications, participation in the mechanism may be voluntary, and so a social choice function must not only induce truthful revelation of information but must also satisfy certain participation (or individual rationality) constraints if it is to be successfully implemented. In Sections 23.C and 23.D, however, we shall abstract from issues of participation to focus exclusively on the information revelation problem. We introduce participation constraints in Section 23.E.

23.C Dominant Strategy Implementation

Definition 23.B.6: The social choice function f (.) is truthfully implementable (or incentive compatible) if the direct revelation mechanism r = (El" ... ,El" f(·J) has an equilibrium (sf(·) .... , s1 (. J) in which s1 (Oi) = 0i for all 0i E Eli and all i = 1•... ,I; that is. if truth telling by each agent i constitutes an equilibrium of r = (El, ....• El" f(·J).

In this section, we study implementation in dominant strategies. '4 Throughout we follow the notation introduced in Section 23.B: The vector of agents' types 0= (0" . .. ,0,) is drawn from the set El = El, x ... x El, according to a probability density t/J('), and agent i's Bernoulli utility function over the alternatives in X given his type 0, is u,(x, 0,). We also adopt the notational convention of writing 0_, = (0 1, ••• ,0,_"0,+,, ... ,0,), 0 = (0" 0_/), and El_/ = El, x ... x El/_ I x Eli+' X • " x El,. A mechanism r = (S" ... ,'Slo g(.)) is a collection of I sets S" . .. ,S" each S/ containing agent i's possible actions (or plans of action), and an outcome function g: S -+ X, where S = S, x ... X S,. As discussed in Section 23.B, a mechanism r = (S, .... , Slo g(')) combined with possible types (El ..... , El,), density t/J('), and Bernoulli utility functions (u,(·), ... , u,(·» defines a Bayesian game of incomplete information (see Section 8.E). We will also often write L, = (5" ... ,S,_"5,+,, ... ,s,), 5 = (5/,5_/), and S_/ = S, X . . . X S,_, X S/+' x .. · X S,. Recall from Section 8.B that a strategy is a weakly dominant strategy for a player in a game if it gives him at least as large a payoff as any of his other possible strategies for every possible strategy that his rivals might play. In the present incomplete information environment, strategy 5/: El, ... S, is a weakly dominant strategy for agent i in mechanism r = (S" ... ,S" g(.» if, for all 0, E El/ and all possible strategies for agentsj '" i, L,(·) = [s,(·), ... ,5,_,('),5,+,('), ... ,51 (,)],1$

To offer a hint as to why we may be able to restrict attention to direct revelation mechanisms that induce truth telling, we briefly verify that the social choice functions that are implemented indirectly through the first-price and second-price sealed-bid auctions of Examples 23.B.5 and 23.B.6 can also be truthfully implemented using a direct revelation mechanism. In fact, for the second-price sealed-bid auction of Example 23.B.6 we have already seen this fact, because the social choice function implemented by the second-price auction is exactly the social choice function that we studied at the cnd of Example 23.B.4 in which truth telling is a weakly dominant strategy for both buyers. Example 23.B.7 considers the first-price sealed-bid auction. Example 23.B.7: Truthful Implementation of the Social Choice Function Implemented by the First-Price Sealed-Bid Auction. When facing the direct revelation mechanism (El" ... , El,,f(·)) with f(O) = (Yo(O), y,(O), Y2(0), totO), t,(O), t 2(0)) satisfying (23.B.9) to (23.B.14), buyer I's optimal announcement ~, when he has type 0, solves Max ;,

DOMINANT

The first-order condition for this problem gives ~, = 0,. So truth telling is buyer l's optimal strategy given that buyer 2 always tells the truth. A similar conclusion follows for buyer 2. Thus, the social choice function implemented by the first-price sealed-bid auction (in a Bayesian Nash equilibrium) can also be truthfully implemented (in a Bayesian Nash equilibrium) through a direct revelation mechanism. That is, the social choice function (23.B.9) to (23.B.l4) is incentive compatible. _

Definition 23.B.S: A direct revelation mechanism is a mechanism in which Si = Eli for all i and g(O) = f(O) for all 0 EEl, x ... x El,. Moreover, as we shall see, the revelation principle also tells us that we can further restrict our attention to direct revelation mechanisms in which truth telling is an optimal strategy for each agent. This fact motivates the notion of truthful implementation that we introduce in Definition 23.B.6 (we are again purposely vague in the definition about the eqUilibrium concept we wish to employ).

23.C:

(0, - t~,) Prob(02 ~ ~,)

E, .[U,(g(5,(O,), 5_,(0 _,»,0,)10,] ;;:: E,Au,(g(§" L,(O _/»,0/)10/]

for all §, E S/. (23.C.I)

or Max ;,

869

-----------------------------------------------------------------------

Condition (23. ':.1) holding for all

(0, - t~,)/l,.

L

,(-l and 0, is equivalent to the condition that,

14. Good sources for fu"her reading on the subject of this section are Dasgupta, Hammond and Maskin (1979) and Green and Larront (1979). IS. The expecta'ion in (23.C.1) is taken over realizations of E (,L,.

13. Some early versions of the revelation principle were derived by Gibbard (1973). Green and Larront (1977). Myerson (1979). and Dasgupta. Hammond and Maskin (1979).

e_,

1

870

c HAP T E R

2 3:

INC E N T lYE.

AND

III E C HAN I • III

0 E• IQ N

----------------------------------------------------------------------for all O,e e .. u,(g(s,(O,), L,), 0,) 2: u,(g(J.. L,), 0,)

(23.C.2)

for aIlJ,eS, and all s_,eS_,.'6This leads to Definition 23.C.1. DefinItIon 23.C.1: The strategy profile s·(·) = (st( .), ... ,s1(')) is a dominant strategy equilibrium of mechanism r = (S" ... ,SI' g(.)) if, for all i and all 01eel' u;(g(s7(01)' L;). 0;) 2: u;(g(si. s _1).0;)

--

• E C T ION

23. C:

DO III I N A III T

• T R AT E Q Y

whether a particular J(.) is truthfully implementable in the sense introduced in Definition 23.C.3.

DefinItion 23.C.3: The social choice function f(') is truthfully implementable in dominant strategies (or dominant strategy incentive compatible. or strategy-proof. or straightforward) if s1(01) = 01 for all O;e e; and i = 1, ... ,I is a dominant strategy equilibrium of the direct revelation mechanism r = (e ,. ... ,e l • f( That is. if for all i and all 0; e e;.

'».

for all sieS;and all s_;eS_;.

ul(f(O;. 0_1). 0;) 2: UI(f(OI. 0_;). ( 1)

We now specialize Definition 23.B.4 to the notion of dominant strategy equilibrium. DefinItion 23.C.2: The mechanism r = (S" ...• SI' g(.)) implements the social choice function f (.) in dominant strategies if there exists a dominant strategy equilibrium of r. s·(·) = (st(·) •... ,si(·)). such thatg(s·(O)) = flO) for allOee. The concept of dominant strategy implementation is of special interest because if we can find a mechanism r = (S" ... ,SI' g('» that implements J(.) in dominant strategies, then this mechanism implements 1(') in a very strong and robust way. This is true in several senses. First, we can feel fairly confident that a rational agent who has a (weakly) dominant strategy will indeed play it." Unlike the equilibrium strategies in Nash-related equilibrium concepts, a player need not correctly forecast his opponents' play to justify his play of a dominant strategy. Second, although we have assumed that the agents know the probability density t/>(.) over realizations of the types (0" ... ,01 ), and hence can deduce the correct conditional probability distribution over realizations of 0_ .. if r implements J(.) in dominant strategies this implementation will be robust even if agents have incorrect, and perhaps even contradictory, beliefs about this distribution. In particular, agent i's beliefs '8 regarding the distribution of 0 _, do not affect the dominance of his strategy Third, it follows that if r implements J(') in dominant strategies then it does so regardless of the probability density t/>( '). Thus, the same mechanism can be used to implement 1(') for any t/>(.). One advantage of this is that if the mechanism designer is an outsider (say, the "government"), he need not know t/>(.) to successfully implement J(. ). As we noted in Section 23.B, to identify whether a particular social choice function Fortunately, it turns out that for dominant strategy implementation it suffices to ask

16. Condilion (23.C.2) follows from (23.C.1) simply by selling ,_,(0_,) = ,_, for all O_,E 0 _,. To see that (23.C.2) implies (23.C.1). consider the case where S_, is a finite set. Then. for any".

Thus. (23.C.2) implies (23.C.I). 17. We leave aside the question of what might happen if an agent has several weakly dominant strategies. This is Ihe issue of multiple equilibria Ihat we discuss in Appendix A. Even so. we at least mention one conclusion from that discussion: The problem of multiple equilibria is relatively small when we arc dealing with dominant strategy equilibrium. 18. In fact. the implementation of 1(' ) using r is also robusl to substantial relaxations of the hypothesis that agents maximize expected utility.

(23.C.3)

lor all (J;e e; and all O_le e_;. The ability to restrict our inquiry, without loss of generality, to the question of whether 1(') is truthfully implementable is a consequence of what is known as the revelation principle Jor dominant strategies. Proposition 23.C.1: (The Revelation Principle for Dominant Strategies) Suppose that there exists a mechanism r = (S" ...• SI' g(.)) that implements the social choice function f(·) in dominant strategies. Then f(') is truthfully implementable in dominant strategies. Proof: If r = (S" ... , SJo g(.» implements 1(') in dominant strategies, then there exists a profile of strategies s·(·) = (sr(·), ... , s1 (.» such that g(s·(O» = J(O) for all and, for all i and all 0, e e"

°

u,(g(s:(O,), L,), 0,) 2: u,(g(J.. L,), 0,)

for all §, e S, and all i and all 0, e e ..

sr (.).

J(.) is implementable. we need, in principle, to consider all possible mechanisms.

1111 P L ElliE N TAT ION

L,

u,(g(sr(O,), s!,(O_,», 0,) 2: u,(g(snb,), s!,(O_,», 0,)

for all (J, e e, and all all i and all 0, e e"

°_, °

(23.C.4)

e S_,. Condition (23.C.4) implies, in particular, that for all

e

e_,. Since g(s·(O»

(23.C.S)

= J(O) for all 0, (23.C.S) means that, for

u,(J(O.. 0_,), 0,) 2: u,(J(b.. 8_,), 0,)

for all (J, e e, and all _I e e _,. But, this is precisely condition (23.C.3), the condition for 1(') to be truthfully implementable in dominant strategies. • The intuitive idea behind the revelation principle for dominant strategies can be put as follows: Suppose that the indirect mechanism r = (S" . . , ,Sh g(.» implements J(.) in dominant strategies, and that in this indirect mechanism each agent i finds playing s~ (0,) when his type is 0, better than playing any other 5, e S, for any choices e S_, by agentsj ¥ i. Now consider altering this mechanism simply by introducing a mediator who says to each agent i: "You tell me your type, and when you say your type is 0" I will play s~(O,) for you." Clearly, if sr(O,) is agent i's optimal choice for each 0, r- e; in the initial mechanism r for any strategies chosen by the other agents. then agent i will find telling the truth to be a dominant strategy in this new scheme. But this means that we have found a way to truthfully implement 1(')' The implication of the revelation principle is that to identify the set of social choice functions that are implementable in dominant strategies, we need only identify those that are truthfully implementable. In principle, for any 1('), this is just a matter of checking the inequalities (23.C.3).

s_,

871

872

--------------------------------------------------------------CHAPTER

23:

INCENTIVEI

AND

MECHANI8M

DEIIGN

SEC T ION

2 3 . C:

0 C MIN A H T

& T RAT E G Y

IMP L E MEN TAT

ION

The inequalities (23.C.3), which are necessary and sufficient for a social choice function f(·) to be truthfully implementable in dominant strategies, can be usefully thought of in terms of a certain weak preference reversal property. In particular, consider any agent i and any pair of possible types for j, and 8';. If truth telling is a dominant strategy for agent i, then for any O_lee_ 1 we must have

e;

uM(O;, 0_,), 0;)

~

uM(Oj, 0_,), 0;)

uM(O;', 0_,), OJ)

~

u,(f(Oj, 0_,), OJ).

/.(0;,8_,) must lie

in shaded set ~.(O:)

°_,)

L,(x, 0,) =

{z e X: u,(x, 0,)

0; 0,

Figure 23.C.2 depicts a change in some agent i's type from to in an exchange setting in which agent i's preferences satisfy the single-crossing properry that we discussed in Sections 13.C and 14.C. In the figure. we denote agent i's allocation in outcome f(O,. O2 ) by J,(O" O2 ), According to Proposition 23.C.2. h(O;'. 0_,) must lie in the shaded region of the figure if truth telling is to be a dominant strategy for agent i. Thus. the characterization in Proposition 23.C.2 can be seen as a multiperson extension of the truth-telling constraints that we encountered in Section 14.C (here they must hold for every possible 0 _, e 0 _,). In the remainder of this section we explore in more detail the characteristics of social choice functions that can be truthfully implemented in dominant strategies.

~ u,(z. 0,»).

Using this lower contour set we get the characterization of the set of social choice functions that can be truthfully implemented in dominant strategies that is given in Proposition 23.C.2.

The Gibbard-Satterthwaite Theorem

Propoaltlon 23.C.2: The social choice function f(') is truthfully implementable in dominant strategies if and only if for all i. all 0_; e u,(f(O). 0,) for all i (recall that no two alternatives can be indifferent). Because f(0) = X. there exists a O' E 0 such that flO') = y. Now choose a vector of types 0" E 0 such that, for all i, u,(y, 0;') > u;(f(O),O~) > u,(z, Oil for all z >F- flO). y. (Remember that all preferences in 9 are possible.) Since L,(y, 0;) c L,(y, Oil for all i, monotonicity implies that flO") = y. But. since L,(f(O), 0,) c L,(f(O). IJ~) for all i. monotonicity also implies that f(O") = flO): a contradiction because y >F- flO). Hence. f(') must be ex post efficient.

Step 3: A social clooice function f(·) tloar is monotonic and ex post efficient is necessarily dictatorial. Step 3 follows directly from Proposition 21.E.1. Together, steps I to 3 establish the result. _ It should be noted that the conclusion of Proposition 23.C.3 does not follow if

X contains two clements. For example. in this case, a majority voting social choice function (sec Section 21.E) is both nondictatorial and truthfully implementable in dominant strategies (Exercise 23.C.2). Note also that when .iff, = .'1' for all i, any ex post efficient social choice function muse have f(0) = X (verify this in Exercise 23.C.3). Thus, the Gibbard-Satterthwaite theorem tells us that when 91, = ? for all i, and X contains more than two elements. the only ex post efficient social choice functions that are truthfully implementable in dominant strategies are dictatorial social choice functions. Given this negative conclusion, if we are to have any hope of implementing desirable social choice functions, we must either weaken the demands of our implementation concept by accepting implementation by means of less robust equilibrium notions (such as Bayesian Nash equilibria) or we must focus on more restricted environments. In the remainder of this section, we follow the latter course, studying the possibilities for implementing desirable social choice functions in dominant strategies when preferences take a quasilinear form. Section 23.D explores the former possibility: It studies implementation in Bayesian Nash equilibria. Proposition 23.C.3 is readily extended in two ways. First. the resuh's conclusion still follows whenever dt; contains [j' (the set of all rational preference relations having the property that no two alternatives are indifferent). and so it extends to environments in which individual indifference is possible. This is stated formally in Corollary 23.C.1. Corollary 23.C.1: Suppose that X is finite and contains at least three elements. that iJ' c: iii, for all i, and that ((0) = X. Then the sociat choice function f(') is truthfully implementable in dominant strategies if and only if it is dictatorial. Proof: It is again immediate that a dictalorial social choice function is truthfully implementable. We now show that under the stated hypotheses f(·) must be dictatorial if it is truthfully implementable. An implication of Proposition 23.C.3 is that there must be an agent h such that /(0) E {x EX: u,(. ",(z, 0;') > ",(x, Oil for all x;. (f(IJ'), .}; and (iii) "/(" OJ) > "1(f(IJ'), OJ) > ",(x, OJ) for all x;. (f(O'), z}. Consider the profile of types (O~, 0'" ... ,0;). By Proposition 23.C.2, we must have flO') E L,(f(Oj, 0'" ... , 0;), O~), and so it must be that I(~, 0'" .. ,' 11;) ~ f(lJ'). The same argument can be applied iteratively for all i '" Ito show that f(~, ... , 1Ii-" 0;) = f(O')· Next, note that (by Proposition 23.C.2) we must have f(O~, ... , OJ_I' 11;) E LI(f(O"), OJ). Hence, f(O") E (t, flO')}. But (by Proposition 23.C.2) we must also have f(O") E LI(j(O;, ... , 0; -,,0;),0;), and since ",(z, 0;) > ",(f(O'), 0',) this means we cannot have flO") = z. Hence, flO") = flO'). But, since ",(z, OJ) > ",(f(O'), 0;,), this contradicts agent J being a dictator whenever ;::,(0,) E 9' for all i . •

--

SEC T ION

23•

c:

0 0 .. I NAN T

S T RAT E G Y

I .. P L E .. E N TAT ION

877

of alternatives is therefore'o

x=

{(k,'" ... ,I,): k e K,I,E R for all i, and

L I, ~ OJ.

Note that this environment encompasses the cases studied in Examples 23.B.3 and 23.B.4: Example 23.el: A Public Projecl. We can fit a generalized version of the public project setting of Example 23.8.3 into the framework outlined above. To do so, let K contain the possible levels of a public project (e.g., if K = (0, I), then either the project is "not done" or "done") and denote by elk) the cost of project level k E K. Suppose that v,(k, 0,) is agent i's gross benefit from project level k and that, in the absence of any other transfers, projects will be financed through equal contribution [i.e., each agent i will pay the amount c(k)/I).l' Then, we can write agent i's nel benefit from project level k when his type is 0, as v,(k, 0,) = v,(k, 0,) - (c(k)//). The t,'s are now transfers over and above the payments e(k)/I . •

As our second extension, we can derive a related dictatorship result for social choice

functions whose image fee) is smaller than X. We first offer Definition 23.C.6. Deflnttlon 23.C.6: The social choice function f(') is dictatorial on set X c X if there exists an age"nt i such that. for all a = (a" ... ,0,) E e, flO) E (x E X: u,(x, 0,) '" u,(y, 0,) for all YEX}.

Example n.e2: AI/ocalion of a Single Unil of an Indivisible Private Good. Consider the environment described in Example 23.B.4 in which an indivisible unit of a private good is to be allocated to one of I agents. Here the "project choice" k = (y, •... , y,) represents the allocation of the private good and K = {(y" ...• Y/): y, E {O, I} for all i and :L.v, = I}. Agent i's valuation function takes the form v,(k, 0,) = (), y,. •

This weaker notion of dictatorship requires only that f(·) select one of the dictator's most preferred ahernatives in X, rather than in X. Corollary 23.C.2: Suppose that X is finite, that the number of elements in fee) is at least three, and that 9' c!il, for all i = 1, ... ,I. Then f(·) is truthfully implementable In dominant strategies if and only if it is dictatorial on the set fee).

A social choice function in this quasilinear environment takes the form f(·) = (k(-), 1 1(-), ••. ,1,(') where, for all 0 e 0. k(O) e K and L,I,(O) ~ O. Note that if the social choice function f(·) is ex post efficient then, for all 0 e 0, k(O) must satisfy

Proof: It is immediate that f(·) is truthfully implementable if it is dictatorial on the sct I(e), and so we now show that under the stated hypotheses f(·) must be dictatorial on sct f(9). If f: e .... X is truthfully implementable in dominant strategies when the sct of alternatives is X, then the social choice function /: 9 .... f(9) which has /(0) = f(8) for all 0 E 8 is truthfully implementable in dominant strategies when the sct of alternatives is f(8). By Corollary 23.C.I, /(.) must be dictatorial. Hence, f(·) is dictatorial on the set 1(9). •

,

L

/

L

v,(k(O), 0,) ~

i-1

v,(k, ()I)

for all k e K.

(23.C.7)

1-1

We begin with a result that identifies a class of social choice functions that satisfy (23.C.7) and that are truthfully implementable in dominant strategies.

The implication flowing from Corollary 23.c.2 is therefore this: When !ii, c 9' for all i, the set of social choice functions which have an image that contains at least three elements and which are truthfully implementable in dominant strategies is exactly the sct of social choice functions that can be implemented (indirectly) by restricting the sct of possible choioes to some subset X c X and assigning a single agent i to choose frcom within this set.

Proposition 23.C.4: Let k*(') be a function satisfying (23.C.7). The social choice function f(') = (k*('), t , (·), ... , til')) is truthfully implementable in dominant strategies if, for all i = 1, ... , I, t,(O) =

[.L. ,,,,,

vj(k*(O), OJ)]

+ hi(O-i)'

(23.C.8)

where h,{-) is an arbitrary function of O-i'

Quasilinear Environments: Groves-Clarke Mechanisms

Proof: If truth is not a dominant strategy for some agent i, then there exist 0" and 0_, such that

In this subsection we focus on the special, but much studied, class of environments in which agents have quasilinear preferences. In particular. an alternative is now a vector x = (k,I" ... ,1/), where k is an element of a finite sct K, to be called the "project choice," and I, e R is a transfer of a numeraire commodity ("money") to agent i. Agent i's utility function takes the quasilinear form u,(x,O,) = v,(k, 0,)

v,(k'(b" 0 _,),0,)

b,.

+ 1,(0,.0_,) > v,(k'(O" 6_,), 0,) + 1,(0" 0_,).

20. Observe that X is not a compact set. This explains what might appear as a small paradox: in this setting, there arc no dictatorial social choice runctions because any agent i. when allowed to pick his best ahernative in X, faces no bound on how much money he can extract from the other

+ (m, + I,),

where m, is agent i's endowment of the numeraire. We assume that we are dealing with a closed system in which the I agents have no outside source of financing. The sei

agents.

21. NOlhing we do depends on this choice tor the "base" method of contribution.

,

J

s

878

c HAP TEA

2 3:

INC E N T lYE SAN D

M E C HAN IS M

Substituting from (23.C.S) for I,(~" 0_ 1) and 1,(0" 0_ 1), this implies that I

L v;(k*(~"

I

L vj(k*(O), OJ),

0_ 1), OJ) >

j O. type 01 will strictly prefer to falsely report that he is type 0, when the other agents' types are 0_,. To sec this. note first that k·(o;. 0_,) = k·(O,. 0_,) since setting k = k·(O,. 0_,) maximizes v.(k. 0;) + LI'" vl(k. 01)' Thus. truth teIling being a dominant strategy requires that v,(k·(O,. 0_,). 01)

+ 1,(01. 0_,) ~ v,(k·(O,.

°

_,).0,)

+ 1,(0" 0_,).

or. substituting. from (23.C.9) and (23.C.IO).

But by the logic of part (i). h,(D:.

t

+ h,(o;. 0_,)

t

+ h,(b,.

~

h,(O,. 0_.).

°_,) = h,(O,. °_,) because k·(o;. °_,) = k·(O,. °_,). This gives

°_,)

~ h,(O,.

°_,).

(23.C.ll)

By hypothesis we have h(O,.O_,) > h(b,.o_,). and so (23.C.ll) must be violated for smaIl enough t > O. This completes the proor. _ Thus. when all possible functions v,(·) can arise for some 8, EO,. the only social choice functions satisfying (23.C.7) that are truthfully implementable in dominant strategies are those in the Groves class. Groves mechanisms and budgel balance Up to this point. we have studied whether we can implement in dominant strategies a social choice function that always results in an efficient choice of k [one satisfying (23.C.7)]. But ex post efficiency also requires that none of the numeraire be wasted. that is. that we satisfy the budgel balance condilion: LI,(8)=0 forall8E0.

,

(23.C.12)

We now briefly explore when fully ex post efficient social choice functions [those satisfying bOlh (23.C.7) and (23.C.l2)] can be truthfully implemented in dominant strategies. Unfortunately. in many cases it is impossible to truthfully implement fully ex post efficient social choioe functions in dominant strategies. For example. the result [due to Green and Laffont (1979)] in Proposition 23.C.6. whose proof we omit, shows that if the set of possible types for each agent is sufficiently rich. then no social choice functions that are truthfully implementable in dominant strategies are ex post efficient.2> Proposition 23.C.6: Suppose that for each agent i = 1 •...• I. {vk .8j ): 8j E OJ} = 1'"; that is. every possible valuation function from K to R arises for some 8j E OJ. Then there is no social choice function ((.) = (k·(·). t,(·) •...• t,(·» that is truthfully implementable in dominant strategies and is ex post effiCient. that is. that satisfies (23.C.7) and (23.C.12). Thus. under the hypotheses of Proposition 23.C.6. the presence of private information means that the 1 agents must either accept some waste of the numeraire

---- --

SEC T ION

DO MIN ANT

& T RAT E G Y

IMP L E MEN TAT ION

[I.e.• have L, 1,(8) < 0 for some O. as in the Clarke mechanism] or give up on always having an efficient project selection [i.e .• have a project selection k(8) that does not satisfy (23.C.7) for some OJ. One special case in which a more positive result does obtain arises when there is at least one agent whose preferences are known. For notational purposes. let this agent be denoted "agent 0". and let there still be 1 agents, denoted i = I •... • 1. whose preferences are private information (so that we are now lelling there be 1 + I agents in total). The simplest case of this phenomenon. of course. occurs when agent 0 has no preferences over the project choice k. that is. when his preferences are u,(x) = "'0 + 10, We saw one example of this kind in Example 23.B.4 when we considered auction sellings (agent 0 is then the seller). Another example arises in the case of a public project when the project affects only a subset of the agents in the economy (so that agent 0 represents all of the other agents in the economy). When there is such an agent. ex post efficiency of the social choice function still requires that (23.C.7) be satisfied; but now ex post efficiency is compatible with any transfer functions c,(·) •...• 1,(') for the 1 agents with private information, as long as we set coCO) = - L, .. o 1,(0) for all 8. That is. in this (I + I)-agent selling. the Groves mechanisms identified in Proposition 23.C.4 (in which only agents i = I •... • 1 announce their types) are ex post efficient as long as we set the transfer of agent 0 to be coCO) = - L" ot,(O) for all O. In essence. the presence of an "outside" agent 0 who has no private information allows us to break the budget balance condition for those agents who do have privately observed types. We should offer. however. one immediate caveat to this seemingly positive result: Up to this point. we have not worried about whether agents will find it in their interest to participate in the mechanism. As we will see in Section 23.E. when participation is voluntary. it may be that no ex post efficient social choice function is implementable in dominant strategies even when such an outside agent exists. The differentiable case

It is common in applications to encounter cases in which K = R. the v. 2 see Laffont and Maskin (1980) and Exercise 23.C.10]. By (23.C.13). for all 0= (0,,0,), we have

E. ,[ui(g(si(Oi)' S~i(O_i»' Oil I0;] ~ E.,[U;(g(§i' S~i(O-i»' 0ill0;]

Definition 23.0.2: The mechanism r = (5" ... ,Sj. g(.» implements the social choice function f(') in Bayesian Nash equilibrium if there is a BayesIan Nash equilibrium of r, s·(·) = (si(·) • ... ,sr such that g(s'(O» = flO) for all E e.

00,

Thus, for all

°

ok

(.».

()',,(O)

_iJ'_v-,-,,(k_O-,:(O~)._O-,-,,)_ ilk_'(_O) o_k_·(_O) o~

il~

+ ov,(k·(O). 0,) o_'_kO_(O_)

00,

ft

OO,il~

(23.C.16)

Delinltlon 23.0.3: The social choice lunction f(') is truthfully implementable in Bayesian Nash equilibrium (or Bayesian incentive compatible) il Sf(Oi) = 0i for al\ O,E i and i = 1•... , J is a Bayesian Nash equilibrium 01 the direct revelation mechanism r = (e" .... e" f(·)). That is. illor all i = 1•... , J and all

and

il',,(O)

00,00,

_o'_v",,(k-:-·~(O-,-,)._O~,) _ok_'(_O) _ok_·(_O) + ov,(kO(O), 0,) _o'_k'_(_O) ok' 00, 00, ok 00,00,'

e

(23.C.17)

O,E

O'V,(kO(O). 0,) ok'

+ o'v,(k·(O). 0,)] ok'(O) ilk'(O) ok'

00,

00,

ei•

(23.D.l)

If we have budget balance. then ,,(0) = -,,(8) for all 0, and so we must have 0',,(8)/00,00, = -0',,(0)/00,00,. But this would imply, by adding (23.C.16) and (23.C.17), and using (23.C.15). that

[

°

As with implementation in dominant strategies (see Section 23.C). we will see that a social choice function is Bayesian implementable if and only if it is truthfully implementable in the sense given in Definition 23.0.3.

00,

= (0" 0,),

00,00,

23.D Bayesian Implementation

for al\ 5i E Si'

ilv,(k·(O). 0,) ok'(O)

00,

IMPLEMENTATION

e

and

0,,(0)

BAYESIAN

Definition 23.0.1: The strategy profile s·(·) = (si(·) •.... si(·» Is a Bayesian Nash equilibrium of mechanism r = (5" ...• 5,. g('» if, for all i and all O,E i •

ov,(k·(O). 0,) ok·(O) ok

23.0:

In this section. we study implementation in Bayesian Nash equilibrium. 17 Throughout we follow the notation introduced in Section 23.B: The vector of agents' types 0=(0, •...• 0,) is drawn from set e=e, x· .. xe, according to probability density 0, we

and

Proposition 23.0.2 shows that to identify all Bayesian incentive compatible social choice functions in the linear setting. we can proceed as follows: First identify which functions k( .) lead every agent i's expected benefit function v,(') to be nondecreasing. Then. for each such function. identify the expected transfer functions I, (.)..... I, (.) that satisfy condition (23.0.12) of the proposition. Substituting for U,(·). these are precisely the expected transfer functions that satisfy. for i = I, ...• I.

1,(0,) = 1M,)

+ ~,v,(~,) - O,v,(O,) +

f" v,(s) ds ~,

for some constant l,{~,). Finally. choose any set of transfer functions (tl(O) •. ..• t,(O» such that E, ,[t,(O,. 0_,)) = 1,(0,) for all 0,. In general. there are many such functions t,(· •. ); one. for example. is simply t,(O,. = 1,(0,)." We now illustrate one implication of this characterization result for the auction setting introduced in Example 23.B.4. Some further implications of Proposition 23.0.2 are derived in Sections 23.E and 23.F.

°_,)

Auctions: tile revenue equivalence theorem Thus, - (0.)

v"

~ U,(O,) - U,(II,) ~ -(0)

b,- 0,

v"

•

(23.0.13)

Expression (23.0.13) immediately implies that v,(') must be nondecreasing (recall that we have taken b, > 0,). In addition, letting 8, -+ 0, in (23.D.13) implies that for all 0, we have

Ui(O,) = v,(O,) and so

U,(O,) =

U,{~,)

+

f"

v,(s) ds

for all 0,.

!,

(ii) Sufficiency. Consider any 0, and 0, and suppose without loss of generality that 0,. If (23.0.11) and (23.0.12) hold, then

0, >

u;(o,) - U,(O,) =

', f" f'·

_ v,(s) ds

;:: _ v,(8,) ds I.

= (0, - O,)v,(8,).

32. Observe Ihat the agent's preferences here over his expected benefit jj, and expected transfer I, satisfy the single-crossing property that played a prominent role in Sections I3.C and 14.C.

889

-----------------------------------------------------------------------

Let us consider again the auction setting introduced in Example 23.B.4: Agent 0 is the seller of an indivisible object from which he derives no value. and agents 1•... , 1 are potential buyers. 34 It will be convenient. however, to generalize the set of possible alternatives relative to those considered in Example 23.B.4 by allowing for a random assignment of the object. Thus. we now take y,(O) to be buyer i's probability of getting the object when the vector of announced types is = (0" ...• 0,). Buyer i's expected utility when the profile of types for the 1 buyers is 0= (0" ... ,0,) is then 0, y,(O) + tote). Note that buyer i is risk neutral with respect to lotteries both over transfers and over the allocation of the good. This setting corresponds in the framework studied in Proposition 23.0.2 to the case where we take k=(yl .... ,y,). K={(yl .... ,y,):y,E[O.I] for all i = 1, ... ,1 and L, y, ~ I}, and v,{k) = Yo' Thus, to apply Proposition 23.0.2 we can write VitO,) = y,(O,), where y,(8,) = E._.[y,(O" 0_,)] is the probability that i gets the object conditional on announcing his type to be 8, when agents j .;. i announce their types truthfully, and U,(O,) = 0d,(Od + 1,(0,).

°

33. However. if we wish the social choice function f(') = (k('), ,,(.)•...• 1,('» to satisfy some further properties, such as budget balance. only a subset (possibly an empty one) of the transfer functions generating the expected transfer functions (t.(8.). ...• t,(8,» may have these properties. 34. We note that our assumption that the seller in an auction setting derives no value rrom the

object is not necessary for the revenue equivalence tbeorem. (As we shall see, tbe result characterizes the expected revenues generated for the seller in different auctions. and so is valid for any utility function that the seUer might bave.) In the absence of tbis assumption, however, tbe seller in an auction will generally care about more than just the expected revenue he receives.

890

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DEIIGN

We can now establish a remarkable result, known as the revenue equivalence theorem. 3 >

--- --

(.v,(O,lO, -

t

MO,) dO,) -

dO,J - U,(Q.).

I -,(0.> ,(O'»)J( ,., .n j(III») dO, ... dll,J , (23.0.16)

By inspection of (23.0.16). we see that any two Bayesian incentive compatible social choice functions that generate the same functions (y,(O) •...• y,(II» and the same values of (U,(Q,) •...• U,(Q,» generate the same expected revenue for the seller. _ As an example of the application of Proposition 23.0.3, consider the equilibria of the first-price and second-price sealed-bid auctions that we identified in Examples 23.8.5 and 23.B.6 (where the buyers' valuations were independently drawn from the uniform distribution on [0. I]). for these equilibria. the conditions of the revenue equivalence theorem are satisfied: in both auctions the buyer with the highest valuation always gets the good and a buyer with a zero valuation has an expected utility of zero. Thus, the revenue equivalence theorem tells us that the seller receives exactly the same level of expected revenue in these equilibria of the two auctions (you can confirm this fact in Exercise 23.0.3). More generally. it can be shown that in any symmetric auction setting (i.e .• one where the buyers' valuations are independently drawn from identical distributions). the conditions of the revenue equivalence theorem will be met for any Bayesian Nash equilibrium of the first-price sealed-bid auction and the (dominant strategy) equilibrium of the second-price sealed-bid auction (see Exercise 23.0.4 for a consideration of symmetric equilibria in these settings). We can conclude from Proposition 23.0.3. therefore. that in any such setting the first-price and second-price sealed-bid auctions generate exactly the same revenue for the seller.

U,(Q,).

Moreover. integration by parts implies that

.v,(s) dS),(O,) dO, =

j(Oj»)dll, ...

I

,.L, U,(Q,).

.v,(s) dS),(O,) dO,

.v,(s) dS) ,(0,) dO,J -

n

[ y,(O, •...• 0,)(0, [ f.O,' ... fO' 0, ,=,

r

[r r r(r (r (r =

E[ -1.(0,)] =

(23.0.15)

E[ -1,(0)) = E•.[ - 1,(0,))

(.v,(0;) 11, - U,(Q,) -

P A A TIC I PAT ION

Thus. the seller's expected revenue is equal to

Proof: By the revelation principle. we know that the social choice function that is (indirectly) implemented by the equilibrium of any auction procedure must be Bayesian incentive compatible. Thus, we can establish the result by showing that if two Bayesian incentive compatible social choice functions in this auction setting have the same functions (y,(O) •...• y,(O)) and the same values of (U,(Q,), ... , U,(Q,» then they generate the same expected revenue for the seller. To show this. we derive an expression for the seller's expected revenue from an arbitrary Bayesian incentive compatible mechanism. Note. first. that the seller's expected revenue is equal to :Lf., E[ -1,(0». Now.

r

23. E:

or. equivalently.

6' ... f6' y,(O, •...• 0,)(0, - I - ,(0,»)( [ f,,~, ,(0,) ,.

Proposition 23.0.3: (The Revenue Equivalence Theorem) Consider an auction setting with I risk-neutral buyers. in which buyer i's valuation Is drawn from an interval [Q;.6;] with Q; ;I< 6; and a strictly positive density 11>;(·) > 0, and in which buyers' Iypes are statistically independent. Suppose that a given pair of Bayesian Nash equilibria of two different auction procedures are such that for every buyer i: (i) For each possible realization of (0 ...• 0,). buyer i has an identical probability " of getting the good in the two auctions; and (ii) Buyer i has the same expected utility level in the two auctions when his valuation for the object is at its lowest possible level. Then these equilibria of the two auctions generate the same expected revenue for the seller.

=

5 ECT, 0 N

23.E Participation Constraints

.v,(O,),(O,) dO,)

In Sections 23. B to 23.0. we have studied the constraints that the presence of private information puts on the set of implementable social choice functions. Our analysis up to this point. however, has assumed implicitly that each agent i has no choice but to participate in any mechanism chosen by the mechanism designer. That is. agent i's discretion was limited to choosing his optimal actions within those allowed by the mechanism. In many applications. however. agents' participation in the mechanism is voluntary. As a result. the social choice function that is to be implemented by a mechanism must not only be incentive compatible but must also satisfy certain participation (or individual rationality) constraints if it is to be successfully implemented. In this section. we provide a brief discussion of these additional

Substituting. we see that (23.0.14)

35. Versions of the revenue equivalence theorem have been derived by many authors; see McAfee and McMillan (1987) and Milgrom (1987) for references as well as for a further discussion of the result.

.l

CON 5 T R A I N T 5

891

892

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIGN

--------------------------------------------------------------constraints on the set of implementable social choice functions. By way of motivating our study, Example 23.E.I provides a simple illustration of how the presence of participation constraints may limit the set of social choice functions that can be successfully implemented. Example 23.E.l: Participation Constraints in Public Project Choice. Consider the following simple example of public project choice (recall our initial discussion of public project choice in Example 23.S.3). A decision must be made whether to do a given project or not, so that K = {O, I}. There are two agents, I and 2. For each agent i, 0, = {~, 8}, so that each agent either has a valuation of~, or a valuation of iI. We shall assume that 8 > 2~ > O. The cost of the project is c E (2~, 8). Suppose that we want to implement a social choice function having an ex post efficient project choice; that is, one that has k·(III' (1 2) = 1 if either 0 1 or 112 is equal to 8, and P(OI' (1 2) = 0 if II, = 112 =~. In the absence of the need to insure voluntary participation, we know from Section 23.C that we can implement some such social choice function in dominant strategies using a Groves scheme. Suppose, however, that each agent has the option of withdrawing from the mechanism at any time (perhaps by withdrawing from the group), and that, if he does, he will not enjoy the benefits of the project if it is done, but will also avoid paying any monetary transfers. Can we implement a social choice function that achieves voluntary participation and that has an ex post efficient project choice?l6 The answer is "no." To see this, note that if agent I can withdraw at any time, then to insure his participation it must be that I,(~, 8) ~ -~. That is, it must be that whenever his valuation for the project is ~, he pays no more than ~ toward the cost of the project. Now consider what agent l's transfer must be when both agents announce that they have valuation 8: If truth telling is to be a dominant strategy, then t, (0, 0) must satisfy

Ok·(O, 0)

+ 1, (0, 0) ~

Ok·(~, 0)

+ t,(~, 0),

or, substituting for k·(O, 0) and k·(~, ii),

0+ t,(O, 8) ~ 8 + t,(~, 8). Since t,(~, 0) ~ -~, this implies that 1, (0, 8) ~ -~. Thus, we conclude that agent I must not make a contribution toward the cost of the project that exceeds ~ when (II" ( 2 ) = (0,8). Moreover, by symmetry, we have exactly the same constraint for agent 2's transfer when (II" (1 2 ) = (8, 8), namely, t , (8, 8) ~ -~. Hence, t,(O, 0) + t 2 (O, 0) ~ -2~. But if this is so, then because 2~ < c, the feasibility condition t,(O, 8) + t 2(8, 8) S -c cannot be satisfied. We conclude,therefore, that it is impossible to implement a social choice function with an ex post efficient project choice when the agents can withdraw from the mechanism at any time. Note also that the presence of an "outside agent" (say "agent 0") who does not care about the project decision does not help at all here when that agent can also withdraw from the mechanism at any time. This is because, to insure this agent's participation, his transfer to(lI .. (1 2 ) must be nonnegative for every realization of

36. Note that any social choice funclion thai fails to have both agents participate is necessarily e., post inefficient because one of the agents is excluded from the benefits of Ihe project.

--

IECTION

U.E:

PARTICIPATION

(0 (1 2 ), In particular, w.,e ..must h~v: t o(8, 82 ~ 0, and so we must fail to satisfy the " feasibility condition to(II, II) + t,(II, II) + t 2(O, II) s -c. • As a general matter, we can distinguish among three stages at which participation constraints may be relevant in any particular application. First, as in Example 23.E.I, an agent i may be able to withdraw from the mechanism at the ex post slage that arises after the agents have announced their types and an outcome in X has been chosen. Formally, suppose that agent i can receive a utility of ii,(II,) by withdrawing from the mechanism when his type is 11,.37 Then, to insure agent i's participation, we must satisfy the ex post participation (or individual rationality) constraints 3 ' (23.E.I) In other circumstances, agent i may only be able to withdraw from the mechanism at the interim stage that arises after the agents have each learned their type but before they have chosen their actions in the mechanism. Letting U,(O.lfl = E•. ,[u,(f(II" 11_,), II,) IlIa denote agent i's interim expected utility from social choice function f( . ) when his type is II" agent i will participate in a mechanism that implements social choice function f(') when he is of type II, if and only if U,(O,lfl is not less than ii,(O,). Thus, interim participation (or individual rationality) constraints for agent i require that for all 0,.

(23.E.2)

In still other cases, agent i might only be able to refuse to participate at the ex ante stage that arises before the agents learn their types. Letting U,(f) = E.,[U,(O;l fl] = E[u,(f(II.. II _,),0,)] denote agent i's ex ante expected utility from a mechanism that implements social choice function f('), the ex ante participation (or individual rationality) constraint for agent i is U,(f) ~ E.,[ii,(II,)].

(23.E.3)

Participation constraints are of the ex ante variety when the agents can agree to be bound by the mechanism prior to learning their types. When, instead, agents know their types prior to the time at which they can agree to be bound by the mechanism, we face interim participation constraints. 39 Finally, if there is no way to bind the

37. We assume that agent i's utility from withdrawal depends only on his own Iype. 38. We assume throughout that it is always optimal to insure that each agent is always willing to participate. In fact, however, there is no loss of generality from assuming this: When agents can "not participate: any outcome that can arise when some subset I' of the I agents does not participate, say x', should be included in the set X. Because we can always have the mechanism select x' in the circumstanoes when this subset of agents would have refused to participate, if the set X is defined appropriately we can always replicate the outcome of any mechanism that causes non participation with a mechanism in which all agents are always willing to participate. 39. Recall that the assumption in a Bayesian game that types are drawn from a commOn prior density is often merely a modeling device for how agents form beliefs about each others' types (see Section 8.E). That is, for analytical purposes we may be representing a setting in which agents' types are already determined but are only privately observed by assuming that there has been a prior random draw of types from a commonly known distribution; but there may not actually be any such prior stage at which the agents could possibly interact.

CONITIIAINTI

893

894

CHAPTER

23:

INCENTIVES

AND

MECHANISM

--

DEIIGN

---------------------------------------------------------------------~

agents to the assigned outcomes of the mechanism against their will, then we face ex post participation constraints.'· Note that if f(·) satisfies (23.E.I), then it satisfies (23.E.2); and, in turn, if it satisfies (23.E.2), then it satisfies (23.EJ). However, the reverse is not true. Thus, the constraints imposed by voluntary participation are most severe when agents can withdraw at the ex post stage, and least severe when they can withdraw only at the ex ante stage. In summary, when agents' types are privately observed, the set of social choice functions that can be successfully implemented are those that satisfy not only the conditions identified in Sections 23.C and 23.0 for incentive compatibility (in, respectively, either a dominant strategy or Bayesian sense, depending on the equilibrium concept we employ) but also any participation constraints that are relevant in the environment under study. In the remainder of this section, we illustrate further the limitations on the set of implementable social choice functions that may be caused by participation constraints by studying the important Myerson-Salterthwaite theorem [due to Myerson and Satterthwaite (1983)].

SECTION

23.£:

PARTlelPATION

news: Whenever gains from trade are possible, but are not certain,'1 there is no ex post efficient social choice function that is both Bayesian incentive compatible and satisfies these interim participation constraints. Thus, under the conditions of the theorem, the presence of both private information and voluntary participation implies that it is impossible to achieve ex post efficiency. (For an illustration of the result for specific functional forms, see Exercise 23.E.7.)

Proposition 23.E.1: (The Myerson-Satterthwaite Theorem) Consider a bilateral trade setting in which the buyer and seller are risk neutral, the valuations 0, and O2 are independently drawn from the intervals [~" 8,] c R and [~2' 62] c R with strictly positive densities, and (Q" 0,) ("\ (~2' 62) '" 0. Then there is no Bayesian incentive compatible social choice function that is ex post efficient and gives every buyer type and every seiler type nonnegative expected gains from participation.

Proof: The argument consists of two steps:

Slep I: In any Bayesian incentive compatible and interim individually rational social choice function f(·) = [y,(.), y,{- ),t,(' ),t,(-)) in which y,(O" 0,) + y,(O" 0,) = I and t,(O" 0,) + t,(O" 0,) = 0, we musl have

f fi, i,

The Myerson-Satterthwaite Theorem

Consider again the bilateral trade setting introduced in Example 23.B.4. Agent I is the seller of an indivisible object and has a valuation for the object that lies in the interval 0 1 = [~I' 61] c R; agent 2 is the buyer and has a valuation that lies in 0, = [~" 6,] c R. The two valuations are statistically independent, and 0, has distribution function 0 for iii]. We let y,(O) denote the probability that agent i receives the good all 0i E given types 9 = (0 1,9,), and so agent i's expected utility given 0 is O,y,(O) + t,(O) (we normalize mi = 0 for all i). The expected externality mechanism studied in Section 23.0 shows that in this setting we can Bayesian implement an ex post efficient social choice function (or what, in this environment, we might call a "trading rule"). A problem arises with the expected externality mechanism, however, when trade is voluntary. In this case, every type of buyer and seller must have nonnegative expected gains from trade if he is to participate. In particular, if a seller of type 01 is to participate in a mechanism that implements social choice function f('), that is, if participation in the mechanism is to be individually rational for this type of seller, it must be that UI(O.!f> ~ 0" because this seller can achieve an expected utility of 0, by not participating in the mechanism and simply consuming the good. Likewise, a buyer of type 0, can always earn zero by refusing to participate, and so we must have U,(O,If> ~ O. Unfortunately, these interim participation constraints are not satisfied in the expected externality mechanism (you are asked to verify this in Exercise 23.E.I). The Myerson-Satterthwaite Theorem tells us the following disappointing piece of

"

"

.

.

y,(O"0,)

[(

I - ,(0,») (0, + ,(0,»)] - 4>,(0,)4>,(0,) dO, dO, ~ O.

0, - - - - - 4>,(0,)

4>,(0,)

~~

To see this, note first that the same argument that leads to (23.0.15) can be applied here to give [throughout the proof we suppress the argument f in U,(0.lf) and simply write U,(O,)]:

mi'

fi, fi, y,(O,. 0,)(0, -

£[ -i,(O,)) = [

"

I - ,(0'»)4>,(0,)4>,(0,) dO, dO,] -

"

.

4>,(0,)

.

Also, because (23.D.12) implies that

V,(Q,)

=

VItO,) -

i, ii, f ~I

U,(~,). (23.E.5)

y,(O" 0,)4>,(0,) dO, dO"

!l

condition (23.0.15) also implies that

-

[ fi, fi, y,(O"O,) (,(0,») 0, + - - 4>,(0,)4>,(0,) dO, dO, ] - V,(O,).

£[ -1,(0,)] =

" .

"

4>,(0,)

.

(23.E.6)

Then, since J',(O,. 0,) = I - y,(O,. 0,) we have

-

£[-1,(0,))=

-

[ f" f" (0 , + ,(0.1) - - 4>,(O,)tJ>,(O,)dO,dO, ] 4>,(0,) ,,~,

[ f fi, i' ~,

"

»)

y,(O,.o,) ( II, + - '(0 - ' 4>,(0,)4>,(0,) dO, dll, 4>,(0,)

e,) '"

]-

U,(ii,).

41. That is, whenever (~" 8,)" (q" 0 (or equivalently, 8, > ~, and 8, > ~,), so that for some realizations of 0 = (0 1.8 2 ) there are gains from trade but for others there are not.

40. For example, if the mechanism can lead an agent into bankruptcy, the provisions of bankruptcy law provide an elfective lower bound on ex post utilities.

J

CONSTRAINT.

895

896

CH"PTER

23:

INCENTIVES

.. NO

"ECH .. NIS ..

--- --

DESIGN

But dO, dO,J [ f•i,, 'fi,, (0, + ,(0'»),(0,),(0,) ,(0,)

= [

fi, [0,,(0,) + ,(0,)] dO,J !,

= [0,,(0,)]::

=0,. Thus. £[ -1,(0,)]

0, - [

=

fi, fi, y,(O" 0,)(0, + ,(0'»),(0,),(0,) dO, dO,J ,(0,) "

.

U,(O,).

h

.

~~

Now. the fact that 1,(0" 0,) + 1,(0,,0,) = 0 implies that £[ -1,(0,,0,)] + £[ -1,(0,,0,)] = So. adding (23.E.5) and (23.E.7) we see that

o.

[U,(O,) - 0,] + U,(q,) =

f"i, fi," }"(O,, 0,)[(0, -

I - ,(0,») _ (0,

+ ,(O'»)J,(O,),(O,) dO, dO,.

,(0,)

,(0,)

But individual rationality implies that U,(O,) ~ 0, and U,(q,) ~ 0, which establishes (23.E.4). Seep 2: Cundilion (23.£.4) cannol be salisfied },,(O,. 0,) = 0 whenever 0, < 0,.

if

y,(O" 0,) = I whenever 0, > 0, and

Suppose it were. Then the left-hand side of (23.E.4) could be written as

f f";'I.' .• i

,

"

"

oI

[(0, - I - ,(0,) - 0,),(0,) - ,(O,)J,(O,) dO, dO, ,(0,) =

=

f"[( f"[(

J";,,.,.',I ,(0,) dO,

"

I - ,(0,) ) 0, - - - - - 0, ,(0,) ,(0,) "

!,

I - ,(0,) . - ) ,(Mm{O"O,}) . O,-----Mm{O"O,} ,(O,)dO, ,(0,)

= -

f

~ [I

J

- ,(0,)],(0,) dO,

+ f~ . [(0, - 0,),(0,) + (,(0,) -

i,

= -

f"

I)] dO,

'1

~l

I

[I - ,(0,)],(0,) dO,

+ [(0, - 0,)(,(0,) - 1)],:

23.F:

OPTI .... L

... VEII .. N

whether trade will occur and at what price.·' By the revelation principle, we know that the social choice function that is indirectly implemented in a Bayesian Nash equilibrium·) of such a mechanism must be Bayesian incentive compatible. Moreover, since participation is voluntary, this social choice function f(·) must satisfy the interim individual rationality constraints that UI (8 1 1f) ~ 8 1 for all 8 1 and U,(O,1 f) ;;>: 0 for all 0,. Thus, the Myerson-Satterthwaite theorem tells us that, under its assumptions, no voluntary trading institution can have a Bayesian Nash equilibrium that leads to an ex post efficient outcome for all realizations of the buyer's and seller's valuations.

:

1

23,F Optimal Bayesian Mechanisms In Sections 23.B to 23.E we have been concerned with the identification of implementable social choice functions in environments characterized by incomplete information about agents' preferences. In this section, we shift our focus to the welfare evaluaton of implementable social choice functions. We begin by developing several welfare criteria that extend the notion of Pareto efficiency that we have used throughout the book in the context of economies with complete information to these incomplete information settings. With these welfare notions in hand, we then discuss several examples that illustrate the characterization of optimal social choice functions (and, by implication, the optimal direct revelation mechanisms that implement them). We restrict our focus throughout this section to implementation in Bayesian Nash equilibria, discussed in detail in Section 23.0. Unless otherwise noted, we also adopt the assumptions and notation of Section 23.0. Good sources for further reading on the subject of this section are Holmstrom and Myerson (1983), Myerson (1991), and Fudenberg and Tirole (1991). For economies in which agents' preferences are known with certainty, the concept of Pareto efficiency (or Pareto optimality) provides a minimal test that any welfare optimal outcome x E X should pass: There should be no other feasible outcome:i E X with the property that some agents are strictly better off with outcome :i than with outcome x, and no agent is worse off. The extension of this welfare test to social choice functions in settings of incomplete information should read something like the following: The social choice function 1(') is efficient if it is feasible and if there is no other feasible social choice function that makes some agents strictly better off, and no agents worse off.

completes the argument. •

SECTION

q,

and

q,

< 0,. This contradicts (23.E.4) and

Recalling the revelation principle for Bayesian Nash equilibrium (Proposition 23.0.1), the implication of the Myerson-Satterthwaite theorem can be put as follows: Consider any voluntary trading institution that regulates trade between the buyer and the seller. This includes, for example, any bargaining process in which the parties can make offers and counteroffers to each other, as well as any arbitration mechanism in which the parties tell a third party their types and this third party then decides

To operationalize this idea, however, we need to be more specific about two things: First, what exactly do we mean by a social choice function being ~feasible',? Second,

42. Strictly speaking, for a direct application of Proposition 23.E.I, the date of delivery and consumption of the good must be fixed (so the bargaining processes studied in Appendix A of Chapter 9 would not count). But through a suitable reinterpretation Proposition 23.E.l can be applied to settings in which trade may take place over real time, where not only delivery of the good matters but also the rime of delivery (sec Exercise 23.E.4 for details). 43. And, hence, in any perfect Bayesian or sequential equilibrium (sec Section 9.C).

.. ECH .. NIIN.

897

898

CHAPTER

23:

INCENTIVES

AND

MECHANI8M

DEIION

precisely what do we mean when we say that no other feasible social choice function .. makes some agents strictly better off, and no agent worse off"? Let us consider the first of these issues. The identification of the set of feasible social choice functions when agents' preferences are private information has been discussed extensively in Sections 23.0 and 23.E. Suppose that we define the set FBtc = {f: El .... X: f(·) is Bayesian incentive compatible}.

(23.F.I)

The elements of set Fatc in any particular application are the social choice functions that satisfy condition (23.0.1), the condition that assures that there is a Bayesian Nash equilibrium of the direct revelation mechanism r = (0 , .. , Elto f(·» in which " truth telling is each agent's equilibrium strategy. Likewise, following the discussion in Section 23.E, we can also define the set

FI /( = {f: El .... X: f(·) is individually rational}.

(23.F.2)

The set FIR contains those social choice functions that satisfy whichever of the three types of individual rationality (or participation) constraints (23.E.I)-(23.E.3) are relevant in the application being studied. If no individual rationality constraints are relevant (i.e., if agents' participation is not voluntary), then we simply have FtR = {f: El .... X}, the set of all possible social choice functions. The content of our discussion in Sections 23.0 and 23.E is therefore that the set of feasible social choice functions in environments in which agents' types are private information is precisely F* = FBIC n F,/(. Following Myerson (1991), we call this the incentive feasible set to emphasize that it is the set of feasible social choice functions when, because of incomplete information, incentive compatibility conditions must be satisfied. Now consider the second issue: What do we mean when we say that no other feasible social choice function would "make some agents strictly better off, and no agents worse off"? The critical issue here has to do with the liming of our welfare analysis. In particular, is the welfare analysis occurring before the agents (privately) learn their types, or after? The former amounts to a welfare analysis conducted at what we called in Section 23.E the ex ante stage (the point in time at which agents have not yet learned their types); the latter corresponds to what we called in Section 23.E the interim stage (the point in time after each agent has learned his type, but before the agents' types are publicly revealed). To formally define the different welfare criteria that arise in these two cases, let us once again denote by U.(0.lf) agent i's expected utility from social choice function f(·) conditional on being of type 0 Also let U,(f) = E.,[ U,(O;lf)] denote agent i's ex ante expected utility from social "choice function f(·). We can now state Definitions 23.F.1 and 23.F.2. Definition 23.F.1: Given any set of feasible social choice functions F, the social choice function f(') E F is ex ante efficient in F if there is no I (.) E F having the property that Vi(/) ~ V;(f) for all i = 1, ... , I, and Vi(/) > Vi(f) for some i. Definition 23.F.2: Given any set of feasible social choice functions F, the social choice function f(·) E F is interim efficient in F if there is no 1(·) E F having the property that Vi(Oil/) ~ Vi(Oilf) for all OiE Eli and all i= 1, . .. ,1, and Vi(Oill) > Vi(Oilf) for some i and 0iE Eli' The motivation for the ex ante efficiency test is straightforward: If agents have not yet learned their types, then when comparing two feasible social choice functions

---

IECTION

U.F:

O,TIMAL

.AYEIIAN

MECHANIIMI

899

---------------------------------------------------------------------we should evaluate each agent's well-being using his expected utility over all of his possible types. However, when our welfare analysis occurs after agents have (privately) learned their types, things are a bit trickier. Although the agents each know their types, we-as outsiders-do not know them. Thus, the appropriate notion for us to adopt in saying that one social choice function !(.) welfare dominates another social choice function f(·) is that !(.) makes every possible type of every agent at least as well off as does f('), and makes some type of some agent strictly better off. This leads to the concept of interim efficiency given in Definition 23.F.2. Proposition 23.F.1 compares these two notions of efficiency. Proposition 23.F.1: Given any set of feasible social choice functions F, if the social choice function f(') E F is ex ante efficient in F. then It is also Interim efficient in F. Proof: Suppose that f(·) is ex ante efficient in F but is not interim efficient in F. Then there exists an !(')EF such that U,(O.!!)~ U,(O.!f) for all O,EEl, and all i = I, ... , I, and U,(O.!!) > V,(O.! f) for some i and 0, EEl,. But since, for all i, U.(f) = E.,[U,(O.!f)] and V,(!) = E.,[U,(O,I!)], it follows that V/(!) ~ V/(f) for all j = I, ... , I, and V,(!) > V,(f) for some i, contradicting the hypothesis that f(·) is ex ante efficient in F . • The ex ante efficiency concept is more demanding than is interim efficiency (and so fewer social choice functions f(·) pass the ex ante efficiency test) because a social choice function !(.) can raise every agent's ex ante expected utility relative to the social choice function f(') even though !(.) may lead some type of some agent i to have a lower expected utility than he does with f(·). Putting together the elements developed above, we conclude that when agents' types are already determined at the time we are conducting our welfare analysis, the proper notion of efficiency of a social choice function in an environment with incomplete information is interim efficiency in F·, the set of Bayesian incentive. compatible and individually rational social choice functions.·· On the other hand, if our analysis is conducted prior to agents learning their types, then the proper notion of efficiency is ex ante efficiency in F*.·5 These two notions are often called simply ex ante incentive efficiency and interim incentive efficiency [the terminology is due to Holmstrom and Myerson (1983)], where the modifier "incentive" is meant to convey the point that the set F* is being used.·' These two welfare notions differ from the ex post efficiency criterion introduced in Definition 23.B.2. To see their relationship to it more clearly, Definition 23.F.3 44. These cases often correspond to situations in which our assumption that the agents' types are drawn from a known prior distribution is being used merely as a device to model agents' beliefs about each others' types, as described in Section 8.E, rather than as a description of any actual prior lime at which the agents could interact or our welfare analysis might have been done. 45. This case often arises in contracting problems when, It the time of contracting, the agents anlicipate thatlhey will later come to acquire privale information aboutthcir types. Then the natural welfare standard to use in comparing different contracts (i.e, different mechanisms) is the ex ante criterion. The principal-agent model studied in Section 14.C and Example 23.F.1 below is an example along Ihese lines. 46. However, since the relevant individual rationality constraints Vlry from one application to another, it is usually clearer to describe precisely the sct F within which efficiency is being evaluated.

900

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIGN

develops the ex post efficiency notion in a manner that parallels Definitions 23.F.I and 23.F.2. Dellnltlon 23.F.3: Given any set of feasible social choice functions F. the social choice function f (.) e F is ex post efficient in F if there is no 1(·) e F having the property that Uj(/(O).Oj)~Uj(f(O).Oj) for all i=I ..... 1 and all Oee. and u,(i(O), OJ) > uj(f(O), OJ) for some i and 0 e e. The ex post efficiency test in Definition 23.F.3 conducts its welfare evaluation at the ex post stage at which all agents' information has been publicly revealed. Using this definition. we see that a social choice function f(·) is ex post efficient in the sense of Definition 23.B.2 if and only if it is ex post efficient in the sense of Definition 23.F.3 when we take F = {f: e ... X}. Note that the criterion of ex post efficiency in (f: e -+ X }, or more generally. of ex post efficiency in when individual rationality constraints are present. ignores issues of incentive compatibility. As a result, it is appropriate as a welfare criterion only if agents' types are in fact publicly observable. Because F* c Fill' allocations that are ex ante or interim incentive efficient need not be ex post efficient in this sense. Indeed. the Myerson-Satterthwaite theorem (Proposition 23.E.1) provides an illustration of this phenomenon for the bilateral trade setting: under its assumptions. no element of F * is ex post efficient. Examples 23.F.1 to 23.F.3 provide further illustrations. (For one way in which the notion of ex post efficiency is nevertheless still of interest in settings with privately observed types. see Exercise 23.F.1.) Note also that even in settings in which agents' types are public information, the use of ex post efficiency in F,. as our welfare criterion is appropriate only when agents' types are already determined. When our welfare analysis instead occurs prior to agents learning their types, the appropriate notion is instead the stronger criterion that f(·) be ex ante efficient in These two notions are sometimes called ex post classical efficiency and ex ante classical efficiency [again, the terminology is due to Holmstrom and Myerson (1983)] to indicate that no incentive constraints are involved in defining the feasible set of social choice functions.

F,.

F,.,

In the remainder of this section we study three examples in which we characterize welfare optimal social choice functions. In Examples 23.F.I and 23.F.2. it is supposed that one agent who receives no private information chooses a mechanism to maximize his expected utility subject to both incentive compatibility constraints and interim individual rationality constraints for the other agents. These two examples therefore amount to a characterization of one particular interim incentive efficient mechanism. In Example 23.F.3, we provide a full characterization of the sets of interim and ex ante incentive efficient social choice functions for a simple setting of bilateral trade with adverse selection. Example 23.F.l: A Principal-A gem Problem with Hidden Information. In Section 14.C we studied principal-agent problems with hidden information for the case in which the agent has two possible types. Here we consider the case where the agent may have a continuum of types. Recall from Section 14.C that in the principal-agent problem with hidden information. the principal faces the problem of designing an optimal (i.e .• payoff maximizing) contract for an agent who will come to possess private information. In doing so, the principal faces both incentive constraints and

--- ---

SEC T ION

23. F:

0 P TIM ALB AYE. I A N i l E C HAN I • II •

a reservation utility constraint for the agent. Recall also from Section 14.C that, in the limiting case in which the agent is infinitely risk averse, the agent must be guaranteed his reservation utility for each possible type he may come to have, and so this contracting problem is identical to the contracting problem that would arise if the agent already knew his type at the time of contracting. Here we shall set things up directly in these terms, assuming that the agent already possesses this information when contracting occurs. With this formulation. the principal's optimal contract can be viewed as implementing one particular interim incentive efficient social choice function. (When the agent actually does not know his type at the time of contracting and is infinitely risk averse. then this social choice function is also ex ante incentive efficient.) To introduce our notation. we suppose that the agent (individual I) may take some observable action e e R. (his "effort" or "task "level) and receives a monetary payment from the principal of t,. The agent's type is drawn from the interval [~, 6]. where ~ < 6 < O. according to the distribution function (') which has an associated density function (.) that is strictly positive on [Q,6]. We assume that this distribution satisfies the property that [0 - «I - (0»/(0))] is nondecreasing in IJ.47 The agent's Bernoulli utility function when his type is IJ is u,(e. 11.0) = I, + IJg(e). where g(.) is a differentiable function with g(O) = O. g(e) > 0 for e > O. g'(O) = O. g'(e) > 0 for e > O. and g"(') > 0; that is. Og(e) represents the agent's disutility of effort (recall that 0 < 0). with higher effort levels leading to an increasing level of disutility to the agent. Note that a larger (i.e., less negative) level of 0 lowers. at any level of e. both the agent's total level of dis utility and his marginal dis utility from any increase in e. As noted above, we suppose that the agent must be guaranteed an expected utility level of at least ii for each possible type he may have. The principal (individual 0) has no private information. His Bernoulli utility function is uo(e. (0 ) = v(e) + 10, where 10 is his net transfer and v(.) is a differentiable function satisfying v'(·) > 0 and v"(·) < O. A contract between the principal and the agent can be viewed as specifying a mechanism in the sense we have used throughout this chapter. By the revelation principle for Bayesian Nash equilibrium (Proposition 23.D.I). the equilibrium outcome induced by such a contract, formally a social choice function that maps each possible agent type into effort and transfer levels, can always be duplicated using a direct revelation mechanism that induces truth telling. Thus, the principal can confine his search for an optimal contract to the set of Bayesian incentive compatible social choice functions f(·) = (e('), 10 (' ),1 , ('» that give the agent an expected utility of at least ii for every possible value of O. In what follows. we shall (without loss of generality) restrict attention in our search for the principal's optimal contract to contracts that have to(lJ) = - t,(IJ) for all IJ (i.e., that involve no waste ofnumeraire). The principal's problem can therefore be stated as Max

E[v(e(O» - t,(IJ)]

f(·)"'(~{·I.II(·»

S.t. f(·) is Bayesian incentive compatible and individually rational. 47. For a discussion of how the analysis changes when this assumption is not satisfied, see Fudenberg and Tirol. (1991).

901

902

CH~PTEA

23:

INCENTIVES

"NO

MECH .. NISM

DEIIGN

SECTION

The present model falls into the class of models with linear utility studied in Section 23.0 [specifically, in the notation of Proposition 23.0.2, k = e, vl{k) = g{e), and VI{O) = g{e{O» here]. Letting VI{O) = II{O) + Og{e{O» denote the agent's utility if his type is 0 and he tells the truth, Proposition 23.0.2 can be used to restate the principal's problem in terms of choosing the functions e{') and VI (.) to solve E[v{e{O»

Max

+ Og{e{O» -

UI{O)]

(23.FJ)

r

s.t. (i) e{') is nondecreasing (ii) UI{O) = UI @ +

g(e(s» ds for all 0

u for all O~

(iii) UI(O);::

Constraints (i) and (ii) are the necessary and sufficient conditions for the principal's contract to be Bayesian incentive compatible, adapted from Proposition 23.0.2 [constraint (i) follows because g(.) is increasing in e], while constraint (iii) is the agent's individual rationality constraint. Note first that if constraint (ii) is satisfied, then constraint (iii) will be satisfied if and only if UI{Q) ~ u. As a result, we can replace constraint (iii) with (iii')

UI{Q)

~

U.

Next, substituting for UI(O) in the objective function from constraint (ii), and then integrating by parts in a fashion similar to the steps leading to (23.0.14), problem (23.FJ) can be restated as

[f

Max

S.t.

{v{e(o))

+ g{e{O»(0 - I ~(:O»)}4>{O)dOJ - VI @

(23.Fo4)

(i) e{') is nondecreasing (iii') UI{Q) ~ u.

It is now immediate from (23.Fo4) that in any solution we must in fact have = U. Thus, we can write the principal's problem as one of choosing e(') to solve

UI (Q)

Max n')

[f

{v{e(o»

+ g(e(O»

(0 - I

~(~~0»)}4>(0) dO] -

u

(23.F.5)

S.t. (i) e{') is nondecreasing. Suppose for the moment that we can ignore constraint (i). Then the optimal function e(') must satisfy the first·order condition·· v'(e{O»

+ g'(e(l!» ( 0 -

1- cI>{O») -- = 0 4>{I!)

for alll!.

U.F:

OPTI .... L

."'EII .. N MECHANISMS

903

----------------------------------------------------------------------

(23.F.6)

But note that, under our assumption that [I! - {(I - cl>{1!))/4>(0))] is nondecreasing in 0, the implicit function theorem applied to (23.F.6) tells us that any solution e{') to this relaxed problem must in fact be nondecreasing. Thus, (23.F.6) characterizes the solution to the principal's actual problem (see Section M.K of the Mathematical Appendix). The optimal VI {') [and, hence, ll{')] is then calculated from constraint (ii) of (23.FJ) using this optimal e{') and the fact that UI{Q) = ii.

It is interesting to compare this solution with the optimal contract for the case in which the agent's type is observable. This contract solves

Max

E[v{e{I!» - tl{I!)]

s.t. tl{l!)

+ Og(e(O)) ~ u for

all O.

Hence, the optimal task level in this complete information contract is the level e*(O) that satisfies, for all 0, v'(e*(O» + g'(e*{O))1! = O. Note that e*(O) is the level that arises in any ex post (classically) efficient social choice function. In contrast, the principal's optimal e(') when I! is private information is such that v'{e(l!» + g'(e(I!»I!{> 0 at alll!.< 0, = 0 at 0 = O. We see then that e(l!) < eO(O) for all I! < 0, and e(0) = eO{O). This is a version of the same result that we saw for the two· type case in Section 14.C. In the optimal contract, the type of agent with the lowest disutility from effort (here type 0; in Section 14.C, type 0,,) takes an ex post efficient action, while all other types have their effort levels distorted downward. The reason is also the same: doing so helps reduce the amount the agent's utility exceeds his reservation utility for types 0 > Q (his so·called information rents). To see this point heuristically, suppose that starting with some function e(') we lower e(iI) by an amount de < 0 for some type ~ E (Q, 0) and lower this type's transfer to keep his utility unchanged!' The decrease in the transfer paid to type a is ag'{e(O» de, while the direct effect on the principal is v'{e{~» de. At the same time, according to constraint (ii), this change in e{~) lowers the utility level, and hence the transfer, that must be given to all types 0 > ~ by exactly g'{e{~» de. The expected value of this reduction in the transfers paid to these types is -(I - cI>{O»g'(e{O»de. If the original e{') is an optimum, the sum of the first two changes in the principal's profits (those for type ~) weighted by the density of type ii, [v'(e(a» + ag'(e(a))] 4>(0) de, plus the reduction in payments to types 0 > a, (I - cI>(a»g'(e(a» de, must equal zero. This gives exactly (23.F.6). • Example 23.F.2: Optimal Auctions. We consider again the auction setting introduced in Example 23.Bo4. Here we determine the optimal auction for the seller of an indivisible object (agent 0) when there are I buyers, indexed by i = I, ... , I. Each buyer has a Bernoulli utility function 0, y,(I!) + t,{I!), where y,{O) is the probability that agent i gets the good when the agents' types are I! = (Ol' ... ' 0,). In addition, each buyer i's type is independently drawn according to the distribution function ,(') on [Q" 0;] c: Ii! with Q, -F 0, and associated density 4>,(') that is strictly positive on [Q" 0;]. We assume also that, for i = I •...• 1, 1- cI>,(O,)

0,----4>,{O,)

is nondecreasing in 0,. ~o 49. We say "heuristically" because to do this rigorously we need to perform this reduction in

48. It can be shown that under our assumptions, the optimal contract is interior, that is, has e(0) > 0 for (almost) all O.

e over an interval of types and then take limits.

50. For a discussion of the case in which this assumption is not met, see Myerson (1981).

904

CHAPTER

23:

'NCENT'VES

AND

MECHAN'SM

--

DES'GN

A social choice function in this environment is a function f(·) = (YO('), •.. ,y,(.), 10 (') ••••• 1,(')) having the properties that. for all fi e e. Yi(fi) e [0, I] for all i, :[,,,0 y,(fi) = I - yo(fi), and lo(fi) = - L,o'o 1.(fi).51 The seller wishes to choose the Bayesian incentive compatible social choice function that maximizes his expected revenue E,[lo(fi)) = - E.[L.o'o I,(fi)] but faces the interim individual rationality constraints that V,(fi,) = fi,Y.(fi,) + 1,(0,)
ij·

f

[y,(fi,)O, - V,(fi;)]
»)][ n' ]

cIl,{fi ... , 0,) ( 0i -I-- O. For small enough £ > 0, this alternative social choice function satisfies ali of the constraints of problem (23.F.17) (note that it satisfies ill < I because, by step 2, y~ < I; check the other constraints too). Moreover, it yields a larger value of the objective function of (23.F.17) than (Y!, I!, y~, I~)-a contradiction. This establishes step 3. Slep 4: satisfied.

tH

I_

I_ =

BAYEIIAN

= y~ = I). But we have

already noted above that no such social choice function is incentive feasible (i.e., is an element of F·). Slep 3: equality).

OPTIMAL

Figure 23.F.1 (len)

Step 2: Any solulion 10 problem (23.F.Il) has YII < I; Ihal is, in any inlerim incenlive efficienl social choice function, trade does nOI occur with certainly when Ihe good is of high qualilY. (y!,I!, y~,I~) is ex post (classically) efficient (i.e., it has Y!

40y_ + ",_

23. F:

UI = 24

+ IH

- 24YH'

U2 = 8

+ 26YH

- I H·

In Figure 23.F.3, for an arbitrary point 0 > OL and OL + OH > O. The agents' valuations are statistically independent with Prob (0, = 8Ll = ). E (0, I) for I = 1,2. In the expected externality mechanism, each agent i announces his valuation and agent;os transfer when the announced types are (8" 8,) has the form t,(8" 8_,) = E._.[O_;k*(O;,IL;)] + h,(0_,),wherek*(8 I ,O,1 = OifO I = 0, = 8L ,and k*(8 .. 8,) = I otherwise. As we saw in Section 23.D, in one Bayesian Nash equilibrium of this mechanism, truth telling is each agent's equilibrium strategy. But this truth-telling equilibrium is not the only Bayesian Nash equilibrium. In particular, there is an equilibrium in which both agents always claim that 0H is their type. To see this, consider agent j's optimal 56_ The "strong" terminology is not standard; in the literature it is not uncommon, for example. to see the strong implementation concept simply referred to as .. implementation ....

912

c H ... PTE R

23;

INC E N T I V E S

... N D

M E C H ... N ISM

-

DES I G N

------------------------------------------------------------------------------strategy if agent - j will always announce 0H' Whichever announcement agent j makes, the project is done. Thus, regardless of his actual type, agent j's direct benefit (i.e., O,k·(O" 62 » is not affected by his announcement (it is OL if he is of type OL' and 0H if he is of type 0H)' It follows that agent j's optimal strategy is to make an announcement that maximizes his expected transfer. Now, agent j's expected transfer if he announces 0" is ().OL + (1 - )')OH) + h,(O,,), whereas if he announces OL his expected transfer is (I - )')OH + h,(OH)' Hence, agent j will prefer to announce 0" regardless of his type if agent - j is doing the same. It follows that both agents always announcing 0", and the project consequently always being done, constitutes a second Bayesian Nash equilibrium of this mechanism. _

A P PEN D I X

B:

IMP L E MEN TAT tON

IN

EN V I RON MEN T'

WIT H

COM P LET E

observe 0, there is still an implementation problem: Because no outsider (such as a court) will observe 0, the agents cannot write an enforceable ex ante agreement saying that they will choose outcome f(O) when agents' preferences are O. Rather, they can only agree to participate in some mechanism in which equilibrium play yields f(O) if 0 is realized. 57 Note that a complete information setting can be viewed as a special case of the general environment considered throughout this chapter, in which the probability density 0,; =0 if 0, SO,.

(e) Prove that if [(.) satisfies IPM, and if 91, c i? for all i. then J(.) is monotonic.

+ O,)y,(O,. 0,).

1,(0,.0,) = -!(O,

(d) Prove that if [(.) is monotonic, and 91, = i? for all i. then J(') satisfies IPM.

+ O,)y,(O,. 0,).

Suppose that the seller truthfully reveals his type for all 0, worthwhile to reveal his type? Interpret.

E

[0. I]. Will the buyer find it

23.B.3 8 Show that b,(O,) = 0, for all 0, E [0. I] is a weakly dominant strategy for each agent i in the second-price sealed-bid auction. 23.B.4 c Consider a bilateral trade setting (see Example 23.B.4) in which both the seller's and the buyer's types are drawn independently from the uniform distribution on [0. I]. (a) Consider the double auclion mechanism in which the seller (agent I) and buyer (agent 2) each submit a sealed bid. bi ~ O. If b, ~ b,. the seller keeps the good and no monetary transfer is made; while if b, > b" the buyer gets the good and pays the seller the amount ! O. Show that the continuously differentiable soeial choice function f(-) - (k(·),I,(·), ... , 1,('» is truthfully implementable in dominant strategies if and only if, for all i = I, ... ,I,

23.0.4 c Consider a first-price sealed-bid auction with I symmetric buyers. Each buyer's valuation is independently drawn from the interval [~, 0] according to the strictly positive density 1/>(').

k(O) is nondecreasing in 0,

and

(a) Show that the buyer's equilibrium bid function is nondecreasing in his type. 1,(0,,0 _,) = 1,(0,.0 _.) _ -

f."

(b) Argue that in any symmetric equilibrium (b·(·), . .. ,b*('» there can be no interval of types (0',0"),0' '" 0", such that b'(O) is the same for all 0 E (0', 0"). Conclude that b'(') must therefore be strictly increasing.

ov.(k(s, 0 -.), s) ok(s, 0 -,) ds.

!,

ok

Os

23.C.IO" (8. Holmstrom) Consider the quasilinear environment studied in Section 23.C. Let

(c) Argue, using the revenue equivalence theorem, that any symmetric equilibrium of such an auction must yield the seller the same expected revenue as in the (dominant strategy) equilibrium of the second-price sealed-bid auction.

k'(') denote any project decision rule that satisfies (23.C.7). Also define the function V'(O) = L:, v,(P(O), 0,).

(a) Prove that there exists an ex post efficient soeial choice function [i.e., one that satisfies condition (23.C.7) and the budget balance condition (23.C. I 2)] that is truthfully implementable in dominant strategies if and only if the function V·(·) can be written as V'(O) = L:, 1'1(0 _,) for some functions V,(·)" .. , 1'1(.) having the property that V,(.) depends only on 0_, for all i.

=

23.0.Sc For the same assumptions as in Exercise 23.0.4. consider a sealed-bid all-pay auction in which every buyer submits a bid, the highest bidder receives the good, and every buyer pays the seller the amount of his bid regardless of whelher he wins. Argue that any symmetric equilibrium of this auction also yields the seller the same expected revenue as the sealed-bid second-price auction. [Hint: Follow similar steps as in Exercise 23.0.4.]

=

(b) Use the result in part (a) to show that when 1=3, K R, 9, R. for all i, and v,(k, 0,) = O,k - (!)k' for all i an ex posteffieient social choice function exists that is truthfully implementable in dominant strategies. (This result extends to any I > 2.)

23,0,6c Suppose that I symmetric individuals wish to acquire the single remaining ticket to a concert. The ticket office opens at 9 a.m. on Monday. Each individual must decide what time to go to get on line: the first individual to get on line will get the ticket. An individual who waits I hours incurs a (monetary equivalent) disutility of {JI. Suppose also that an individual showing up after the first one can go home immediately and so incurs no waiting cost. Individual i's value of receiving the ticket is 0., and each individual's 0, is independently drawn from a uniform distribution on [0, I]. What is the expected value of the number of hours that the first individual in line will wait? [Hint: Note the analogy to a first-price sealed-bid auction and use the revenue equivalence theorem.] How does this vary when {J doubles? When I doubles?

(c) Now suppose that the v,(k, 0,) functions are such that V·(·) is an I-times continuously differentiable function. Argue that a necessary condition for an ex post efficient social choice function to exist is that, at all 0, o'V'(O) =

o.

00, ... 00, (In fact, this is a sufficient condition as well.) (d) Usc the result in (c) to verify that, under the assumptions made in the small type discussion at the end of Section 23.C, when I = 2 no ex post effieient social choice function is truthfully implementable in dominant strategies.

23.E.I" Consider again a bilateral trade selling in which each 0, (i = 1,2) is independently drawn from a uniform distribution on [0, I]. Suppose now that by refusing to participate in the mechanism a seller with valuation 0, receives expected utility 0, (he simply consumes the good). whereas a buyer with valuation 6, receives expected utility 0 (he simply consumes his endowment of the numeraire, which we have normalized to equal 0). Show that in the expected externality mechanism there is a type of buyer or seller who will strictly prefer not to participate.

23.C.II A Consider a quasilinear environment, but now suppose that each agent i has a Bernoulli utility function of the form u,(v,(k, 0,) + m, + I,) with u;(') > O. That is, preferences over certain outcomes take a quasilinear form, but risk preferences arc unrestricted. Verify that Proposition 23.C.4 is unaffected by this change. 23.0.1" [Based on an example in Myerson (1991)] A buyer and a seller arc bargaining over the sale of an indivisible good. The buyer's valuation is O. K 10. The seller's valuation takes one of two values: 0, e {O, 9}. Let I be the period in which trade occurs (I = 1,2, ... ) and let p be the price agreed. Both the buyer and the seller have discount factor lJ < I.

23.E.2A Argue that when the assumptions of Proposition 23.E.I hold in the bilaterai trade selling: (a) There is no social choice function 1(') that is dominant strategy incentive compatible and interim individually rational (i.e., that gives each agent i nonnegative gains from participation conditional on his type 0" for all 0,).

(a) What is the set X of alternatives in this selling? (b) Suppose that in a Bayesian Nash equilibrium of this bargaining process, trade occurs

...

921

922

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESIGN

-----------------------------------------------------------------------------(b) There is no social choice function f(·) that is Bayesian incentive compatible and ex post individually rational [i.e., that gives each agent nonnegative gains from participation for every pair of types (9" 9,». 23.E.3" Show by means of an example that when the buyer and seller in a bilateral trade selling both have a discrete set of possible valuations, social choice functions may exist that are Bayesian incentive compatible, ex post efficient, and individually rational. [Hint: It suffices to let each have two possible types.] Conclude that the assumption of a strictly positive density is required for the Myerson-Sallerthwaite theorem. 23.E.4" A seller (i = I) and a buyer (i = 2) are bargaining over the sale of an indivisible good. Trade can occur at discrete periods t = 1,2, .... Both the buyer and the seller have discount factor ~ < I. The buyer's and seller's valuations are drawn independently with positive densities from [~" 0,] and [~" 0,], respectively. Assume that (Q" 0,)" (~" 0,) # 0. Note that in this selling ex post efficiency requires that trade occur in period I whenever 0, > 0" and that trade not occur whenever 0, > 0,. Use the Myerson-Sallerthwaite theorem to show that, in this selling with discounting, no voluntary trading process can achieve ex post efficiency. 23.E.S" Suppose there is a conrinuum of buyers and sellers (with quasilinear preferences). Each seller initially has one unit of an indivisible good and each buyer initially has none. A seller's valuation for consumption of the good is 0, e [Q" ii,], which is independently and identically drawn from distribution CI>,(') with associated strictly positive density 4>,('). A buyer's valuation f~om consumption of the good is 0, E [Q" 0,], which is independently and identically drawn from distribution CI>,(') with associated strictly positive density 4>,c-). (_) Characterize the trading rule in an ex post efficient social choice function. Which buyers and sellers end up with a unit of the good? (b) Exhibit a social choice function that has the trading rule you identified in (a), is Bayesian incentive compatible, and is individually rational. [Hint: Think of a "competitive" mechanism.] Conclude that the inefficiency identified in the Myerson-Sallerthwaite theorem goes away as the number of buyers and sellers grows large. [For a formal examination showing that, with a finite number of traders, the efficiency loss goes to zero as the number of traders grows large, see Gresik and Sallerthwaite (1989).] 23.E.6" Consider a bilateral trading selling in which both agents initially own one unit of a good. Each agent i's (i = 1,2) valuation per unit consumed of the good is 0,. Assume that 0, is independently drawn from a uniform distribution on [0, 1]. (a) Characterize the trading rule in an ex post efficient social choice function. (b) Consider the following mechanism: Each agent submits a bid; the highest bidder buys the other agent's unit of the good and pays him the amount of his bid. Derive a symmetric Bayesian Nash equilibrium of this mechanism. [Hine: Look for one in which an agent's bid is a linear function of his type.] (0) What is the social choice function that is implemented by this mechanism? Verify that it is Bayesian incentive compatible. Is it ex post efficient? Is it individually rational [which here requires that U,(O,) ~ 9, for all 9, and i = 1,2]? Intuitively, why is there a difference from the conclusion of the Myerson-Satterthwaite theorem? [See Cramton, Gibbons, and Klemperer (1987) for a formal analysis of these "partnership division" problems.]

23.E.7" Consider a bilateral trade setting in which the buyer's and seller's valuations are drawn independently from the uniform distribution on [0, I]. (0) Show that if f(·) is a Bayesian incentive compatible and interim individually rational

EXEIICIIEI923

----------------------------------------------------------------------------------social choice function that is ex post efficient, the sum of the buyer's and seller's expected utilities under f(·) cannot be less than 5/6. (b) Show that, in fact, there is no social choice function (whether Bayesian incentive compatible and interim individually rational or not) in which the sum of the buyer's and seller's expected utilities exceeds 2/3. 23.F.l c Consider the quasilinear setting studied in Sections 23.C and 23.D. Show that if the social choice function f(·) e F' is ex post classically efficient in FlO then it is both ex ante and interim incentive efficient in P. [From this fact, we see that if an ex post classically efficient social choice function can be implemented in a setting with privately observed types (i.e., if it is incentive feasible), then no other incentive feasible social choice function can welfare dominate it. Note, however, that there may be other ex ante or interim incentive efficient social choice functions that are not ex post efficient; for example, you can verify that in Example 23.F.1 there is an ex post classically efficient social choice function that is incentive feasible, but the particular interim incentive efficient social choice function derived in the example is not ex post efficient.] 23.F.2" [Based on Maskin and Riley (1984)] A monopolist seller produces a good with constant returns to scale at a cost of c > 0 per unit. The monopolist sells to a consumer whose preference for the product the monopolist cannot observe. A consumer of type 9 > 0 derives a utility of Ov(. 0 and VO(.) < O. The set of possible COnsumer types is [q,O] with > Q > 0, and the distribution of types is 4>('), with an associated strictly positive density function 4>(') > O. Assume that [9 - «I - 4>(0))/4>(9))] is nondecreasing in O. Characterize the monopolist's optimal selling mechanism to this consumer, assuming that a consumer of type 0 can always choose not to buy at all, thereby deriving a utility of O.

°

23.F.3 c An auction with a reserve price is an auction in which there is a minimum allowable bid. Suppose that in the auction selling of Example 23.F.2 the I buyers are symmetric and that Q = O. Argue that a second'price sealed·bid auction with a reserve price is an optimal auction in this case. What is the optimal reserve price? Can you think of a modified second·price sealed·bid auction that is optimal in the general (nonsymmetric) case? 23.F.4" Derive the optimal y,{') functions in the auction selling of Example 23.F.2 when the seller's valuation for the object is 00 > O. 23.F.5 B Suppose that a monopolist seller who has two potential buyers has a total of one divisible unit to sell; that is, production costs are zero up to one unit, and infinite beyond that. The demand function of buyer i is the decreasing function x,(p) for i = 1,2. The monopolist can name distinct prices for the two buyers. (a) Characterize the monopolist's optimal prices. (b) Relate your answer in (0) to the optimal auction derived in Example 23.F.2. [For more on this, see Bulow and Roberts (1989).] 23.F.6 c [Based on Baron and Myerson (1982)]. Consider the optimal regulatory scheme for a regulator of a monopolist who has known demand function x(p), with x'(p) < 0, and a privately observed constant marginal cost of production 9. The regulator can set the monopolist's price and can make a transfer from or to the monopolist, so the set of outcomes is X = {(p, I): p > 0 and Ie R}. The regulator must guarantee the monopolist a nonnegative profit regardless of his production costs to prevent the monopolist from shutting down. The monopolist's marginal cost 0 is drawn from [~, 0] with 8 > ~ > 0 according to the distribution function ('), which has an associated strictly positive density function 4>(') > O. Assume that

924

CHAPTER

23:

INCENTIVES

AND

MECHANISM

DESION

$(O)/IIJ(O) is nondecreasing in O. Denote a type-O monopolist's profit from outcome (p,l) by "(p, I, 0) = (p - O)x(p)

+ I.

-

EXERCISEI925

-------------------------------------------------------------------------------------------normalized valuation function, that is, a function such that v.(k o) = O. Suppose that kO(.) and the Groves transfers are calculated using these announcements. Does each agent have a unique (weakly) dominant strategy in this normalized Groves mechanism?

(a) Adapl the characterization in Proposition 23.0.2 to this application.

23.BB.IA Consider the dynamic mechanism in Example 23.BB.1.

(b) Suppose that the regulator wants to design a direct revelation regulatory scheme

(a) For each possible preference profile, write down its normal form and identify its Nash equilibria.

(p('), 1(')) that maximizes the expected value of a weighted sum of consumer and producer

surplus,

f-(I,~

xis) ds

(b) For each possible preference profile, identify this mechanism's subgame perfect Nash equilibria.

+ ",,(P(O), 1(0), 0),

where" < I. Characterize the regulator's optimal regulatory scheme. What if " :!: 1?

23.BB.28 Is a social choice function that is implementable in dominant strategies necessarily implementable in Nash equilibrium? What if we are interested in strong implementation instead?

23.F.7c [Based on Dana and Spier (1994)] Two firms, j = 1,2. compete for the right to produce in a given market. A social planner designs an optimal auction of production rights to maximize the expected value of social welfare as measured by

w=

l>J + S + (;' J

23.BB-3c Consider a setting of public project choice (see Example 23.8.3) in which K = {O, II· Let 0, denote agent i's benefit if the project is done (i.e., if k = 1); normalize the value from k = 0 to equal zero. Assume that El, = R. In this setting, the only mechanisms that involve an ex post efficient project choice are Groves mechanisms. Let kO(.) denote the project choice rule in such a mechanism. Also, suppose that I ~ 3. The transfers in a Groves mechanism are characterized by two properties:

I) ~:rJ' J

where I; denotes the transfer from firm j to the planner, S is consumer surplus, "; is the gross (pretransfer) profit of firmj, and;' > I is the shadow cost of public funds. The auction specifies transfers for each of the firms and a market structure; that is, it either awards neither firm production rights, awards only one firm production rights (thereby making that firm an unregulated monopolist), or gives production rights to both firms (thereby making them compete as unregulated duopolists). Each firm j privately observes its fixed cost of production OJ. The fixed cost levels 0, and 0, are independently distributed on [g,O] with continuously differentiable density function (') is increasing in O. The firms have common marginal cost c < 1 and produce a homogeneous product for which the market inverse demand function is p(x) = I - x (this is publicly known). If both firms are awarded production rights, they interact as Cournot competitors (see Section 12.C). Characterize the planner's optimal auction of production rights.

°

(i) if kO(O" _I) = kO(O;, 0_,), then I.{O" 0_,) = 1,(0;, 0_,); (ii) if kO(O" 0_,) = I and kO(O;, 0_,) = 0, then 1,(0,,0_,) - 1,(0;,0_,) = 2.J'" OJ. Which, if any, of these two properties must be satisfied by any Nash implementable social choice function that involves an ex post efficient project choice?

23.F.8 A Show that any ex post classically efficient social choice function in Example 23.F.3 has YL = y" = I. 23.F_9 8 Show that in the model of Example 23.F.3: (a) No feasible social choice function is ex post efficient. (b) In any feasible social choice function, y" S YL and I" S I L • (c) In any feasible social choice function, the expected gains from trade of a low-quality seller are at least as large as the expected gains from trade of a high-quality seller; that is, It - 20j't ~ I" - 40y". 23.F.10· Characterize the sets of interim and ex ante incentive efficient social choice functions in the model of Example 23.F.3 when trade is not voluntary for the seller (but it is voluntary for the buyer). 23.AA.1 • Reconsider Exercise 23.C.8. Exhibit a mechanism r = (S., . .. , S" g(')) that is not a direct revelation mechanism that truthfully implements f(') in dominant strategies and for which each agent has a unique (weakly) dominant strategy. 23.AA.2 8 Let K = {ko, k., ... , kN I be the set of possible projects and suppose that, for each agent i, {v,L 0,): Ole El,l = 'V, that is, that every possible valuation function from K to R arises for some 0, eEl,. Do players in a Groves mechanism have a unique (weakly) dominant strategy? Consider instead a mechanism in which each agent i is allowed to announce a

.....

Mathematical Appendix

SECTION

M.A:

MATRIX

NOTATION

FOR

particular, that if M = 1 (so that [(x) E R) then DI(x) is a 1 x N matrix; in fact VI(x) = [D [(x)]'. To avoid ambiguity, in some cases we write DJ(x) to indicate explicitly the variables with respect to which the function [(.) is being differentiated. For example, with this notation, if I: RNH .... RM is a function whose arguments are the vectors x E RN and y E RX, the matrix D.[(x, y) is the M x N matrix whose mn th entry is iJIm(x, y)/iJx •. Finally, for a real-valued differentiable function I: RN .... R, the Hessian matrix D2[(X) is the derivative matrix of the vector-valued gradient function V[(x); i.e., D2 [(x) = D[V[(x)). In the remainder of this section, we consider differentiable functions and examine how two well-known rules of calculus-the chain rule and the product rule---{;ome out in matrix notation. The Chain Rule

Suppose that g: RS .... RN and I: RN .... RAt are differentiable functions. The composite function [(g(')) is also differentiable. Consider any point x E RS. The chain rule allows us to evaluate the M x S derivative matrix of the composite function with respect to x, DJ(g(x» by matrix multiplication of the N x S derivative matrix of g('), Dg(x), and the M x N derivative matrix of I(') evaluated at g(x), that is, D I(y), where y = g(x). Specifically,

This appendix contains a quick and unsystematic review of some of the mathematical concepts and techniques used in the text. The formal results are quoted as "Theorems" and they are fairly rigorously stated. It seems useful in a technical appendix such as this to provide motivational remarks, examples, and general ideas for some proofs. This we often do under the label of the "Proof" of the mathematical theorem under discussion. Nonetheless, no rigor of any sort is intended here. Perhaps the heading "Discussion of Theorem" would be more accurate. It goes without saying that this appendix is no substitute for a more extensive and systematic, book-length, treatment. Good references for some or most of the material covered in this appendix, as well as for further background reading, are Simon and Blume (1993), Sydsaeter and Hammond (1994), Novshek (1993), Dixit (1990), Chang (1984), and Intriligator (1971).

D.[(g(x)) = D[(g(x» Dg(x).

(M.A.I)

The Product Rule

Here we simply provide a few illustrations. (i) Suppose that [: RN .... R has the form [(x) = g(x)h(x), where both g(.) and h(') are real-valued functions of the N variables x = (x" . .. ,XN) (so that g: RN .... R and h: RN .... R). Then the product rule of calculus tells us that D[(x) = g(x) Dh(x)

+ h(x) Dg(x).

(M.A.2)

which, transposing, can also be written as

M.A Matrix Notation for Derivatives

VI(x) = g(x) Vh(x)

+ h(x) Vg(x).

(ii) Suppose that J: RN .... R has the form I(x) = g(x)·h(x) where both g(.) and h(') are vector·valued functions which map the N variables x = (Xl' ... ' x N ) into RM. Then D[(x) = g(x)'Dh(x) + h(x)·Dg(x). (M.A.3)

We begin by reviewing some matters of notation. The first and most important is that formally and mathematically a "vector" in RN is a column. This applies to any vector; it does not matter, for example, if the vector represents quantities or prices. It applies also to the gradient vector VI(x) E RN of a function at a point x; this is the vector whose nth entry is the partial derivative with respect to the nth variable of the real-valued function I: RN .... R, evaluated at the point x ERN. Expositionally, however, because rows take less space to display, we typically describe vectors horizontally in the text, as in x = (Xl' ... ' XN)' But the rule has no exception: all vectors are mathematically columns. The inner product of two N vectors X E RN and y E RN is written as X' Y = L. x.y•. If we view these vectors as N x 1 matrices, we see that X' Y = X T y, where T is the matrix transposition operator. An expression such as "x·" can always be read as "XT"; for example, the expression X' A, where A is an N x M matrix, is the same as

Note that h(x)' Dg(x) = [h(X)]T Dg(x) is a I x N matrix, as is the other term in the right-hand side. Thus, the vector-valued case (M.AJ) implies the scalar-valued formula (M.A.2). (iii) Suppose that I: R .... RM has the form [(x) = a(x)g(x), where a(') is a real-valued function of one variable (i.e., a: R .... R) and g: R .... RM. Then D [(x) = a(x) Dg(x) + a'(x)g(x). (M.A.4) (iv) Suppose that [: RN .... RM has the form [(x) = h(x)g(x) where h: RN .... R and g: RN .... RM. Then DI(x) = h(x) Dg(x)

+ g(x) Dh(x).

(M.A.5)

Note that g(x) is an M-element vector (i.e., an M x 1 matrix) and Dh(x) is a 1 x N matrix. Hence, g(x) Dh(x) is an M x N matrix (of rank 1). Observe also that (M.A.4) follows as a special case of (M.A.5).

xTA.

If I: !\IN .... RM is a vector-valued differentiable function, then at any x ERN we denote by D[(x) the M x N matrix whose mnth entry is iJIm(x)/iJx•. Note, in 926

.....

DERIVATIVES

927

928

MATHEMATICAL

APPENDIX

M.B Homogeneous Functions and Euler's Formula

-

SECTION

M.B:

HOMOGENEOUS

FUNCTIONS

AND

EULER'S

FORMULA

929

---------------------------------------------------------------------------------X,

In this section. we consider functions of N variables, f(x l ••••• x N). defined for all nonnegative values (Xl ••••• x N ):2: O. Definition M.B.1: A function f(x" ... ,xN) is homogeneous of degree r (for r = ... , -1.0,1, ... ) if for every I> 0 we have f(IX" ... ,IXN ) = (,f(x, • ...• x N ).

As an example. f(x i • x,) = XI/X, is homogeneous of degree zero and f(x ,• x,) = (XIX,)I" is homogeneous of degree one. Note that if f(x" ...• x N) is homogeneous of degree zero and we restrict the domain to have XI > 0 then, by taking I = I/x i • we can write the function f(·) as

j(-)

= t' Flgur. M.B.1

f(-) = I X,

f(l. x,/x l ••••• XN/X I) = f(x l ••••• x N). Similarly, if the function is homogeneous of degree one then and the slope of the level set containing point IX for

f(l. x,/x I•...• XN/X I ) = (l/xI)f(x , •...• x N ).

I

An illustration of this fact is provided in Figure M.B.1. Suppose that f(·) is homogeneous of some degree r and that h(·) is an increasing function of one variable. Then the function hU(x" ... , XN» is called homothetie. Note that the family of level sets of hU('» coincides with the family of level sets of f(·). Therefore. for any homothetic function it is also true that the slopes of the level sets are unchanged along rays through the origin. A key property of homogeneous functions is given in Theorem M.B.2.

> O. By the definition of homogeneity (Definition M.S.\) we have f(IX , •...• IX N ) -I'f(x , •... , XN) = O.

Differentiating this expression with respect to x. gives t

> 0 at IX is

_ Of..~X]!~1 = _ I' - , 0 f(x)/ox I 0f(x)/ox, iJf(IX)/OX 2 1'-' of(x)/ox, = - iJf(x)/ox,'

Theorem M.B.1: If f(x" . .. , xN) is homogeneous of degree r (for r = ...• -1.0,1 •... ), then for any n = 1, ... ,N the partial derivative function of (x, , ...• xN)/ox n is homogeneous of degree r - 1. Proof: Fix a

I

Of(IX" .... tXN) ,of(xI.· ... XN) 0 -t = • ox, ox.

Theorem M.B.2: (Euler's Formula) Suppose that f(x" ... ,XN) is homogeneous of degreer(for somer = ... , -1, 0, 1, ... ) and differentiable. Then at any (x" . .. , xN ) we have

so that

of(IX " ... ,IX N ) = t,-I of(X " ...• XN). OX, OX, By Definition M.B.I, we conclude that of(X " ...• xN)/ox, is homogeneous of degree r - I. •

f!.L

of (x, , ... ,xN )

n=l

iJxn

-

-

f(-

xn = r

Xl.···

- )

I

XN •

or, in matrix notation, Vf(x)·x = rf(x).

For example. for the homogeneous of degree one function f(x ,• x,) = (XIX,)I/'. we have of(X I. x,)/ox i = f(X,/XI)I/', which is indeed homogeneous of degree zero in accordance with Theorem M.B.1. Note that if f(·) is a homogeneous function of any degree then f(x ...• XN) = " is, a radial f(x;, ... , x;") implies f(tx , •... , tXN) = f(tx;, ... , tx;") for any t > 0; that expansion of a level set of f(·) gives a new level set of f(·). I This has an interesting implication: the slopes of the level sets of f(·) are unchanged along any ray through the origin. For example, suppose that N = 2. Then, assuming that of(x)/ox, "" 0, the slope of the level set containing point x = (XI' x,) at X is -(of(x)/ox,)/(of(x)/ox,),

Proof: By definition we have

f(lx t .... ,IX N) - t'!(x, .... , XN)

= O.

Differentiating this expression with respect to t gives

Evaluating at

I

= I, we obtain Euler's formula . •

For a function that is homogeneous of degree zero, Euler's formula says that t. A level set offunction 1(,) is a set of the form {x E

R~: I(x)

= k} for some k. A radial

expansion of this set is the set of points obtained by mUltiplying each vector X in this level set by some positive scalar t > O.

....

The level sels of a homogeneous funclion.

930

MATHEMATICAL

-

APPENDIX

As an example. note that for the function f(x i • x,) = XI/X,. we have of(x I> x,)/ox i = l/x2 and of(x I' x,)/ox, = -(xl/(x,)'). and so ~ of(xl.···.XN) _ '~I aX. x.

I -

XI

= .i; XI

-

-

0

(x,)' x, = .

• E C TI 0 N

M.

c:

CON CAY E

AND

~ of(XI' •••• XN) - - f(L.

X,,-

AX,.

(M.C.2) for any collection of vectors XI E A, ...• x that 0: 1 + ... + O:K = I.

- )

X1"",XN'

For example. when f(x l• x,) = (XIX,)I/'. we have of(X I' X,)/OXI of(.x l • X2 )/OX 2 = HXI/X2)1/'. and so

= t(X,/XI)I/'

K

E

A and numbers 0:, ~ O•..• , 0:" ~ 0 such

and Let us consider again the one-variable case. We could view each number 0:, in condition (M.C.2l as the "probability" that x' occurs. Then condition (M.C.2) says that the value of the expectation is not smalier than the expected value. Indeed, a concave function f: R .... R is characterized by the condition that

~o.f'...2(-'xl,,-''_'....c'._X"-,,N) __ I (X,)I/' _ I (XI)I/' _ X - ax. • 2 -XI XI + -2 -X, X2

L.. .~,

= (X IX2)1/2 =

f(X,.

(M.C.3)

x,).

for any distribution function F: R .... [0, I). Condition (M.C.3) is known as J,nsen's inequality.

M.C Concave and Quasiconcave Functions In this section. we consider functions of N variables f(x, •...• x N) defined on a domain A that is a convex subset of RN (such as A = RN or A = R~ = {x E IR N: x ~ O}).' We denote X = (Xl •...• x N ).

The properties of convexily and slriel convexily for a function f(·) are defined analogously but with the inequality in (M.CI) reversed. In particular, for a strictly convex function f('), a straight line connecting any two points in its graph should lie entirely above its graph, as shown in Figure M.C.2. Note also that f(·) is concave if and only if - f(·) is convex. Theorem M.C'! provides a useful alternative characterization of concavity and strict concavity.

Definition M.C.l: The function f: A .... R. defined on the convex set A eRN, is con-

cave if f(rJ.X'

+ (1

- o:)x) ~ o:f(x')

+ (1

- o:)f(x)

(M.C.1)

for all x and x' E A and all 0: E [0, 1]. If the inequality is strict for all x' 0: E (0,1), then we say that the function is strictly concave.

~

x and all

Theorem M.C.l: The (continuously differentiable) function f: A .... R is concave if and only if

Figure M.CI(a) illustrates a strictly concave function of one variable. For this case, condition (M.C.I) says that the straight line connecting any two points in the graph of f(·) lies entirely below this graph. 3 In Figure M.C.I(b). we show a function

fIx

R

,f(·
Theorem M.C.3's characterization of quasiconcave functions is illustrated in Figure M.CS. The content of the theorem's condition (M.CS) is that for any quasiconcave function f(·) and any pair of points x and x' with fIx') ~ fIx), the gradient vector VfIx) and the vector (x' - x) must form an acute angle.

6. See Section M.E for a discussion of the properties of such matrices.

d

935

936

MATHEMATICAL

APPENDIX

Theorem M.D.1: Suppose that M is an N x N matrix. (i) The matrix M is negative definite if and only if the symmetric matrix M + MT is negative definite. (ii) If M is symmetric. then M is negative definite if and only if all of the characteristic values of M are negative. (iii) The matrix M is negative definite if and only if M- 1 is negative definite. (iv) If the matrix M is negative definite. then for all diagonal N x N matrices K with positive diagonal entries the matrix KM is stable.'

--- -

• E C T ION

Proof: Part (i) simply follows from the observation ihat z'(M + MT)Z = 2z'Mz for every z E RH. The logic of part (ii) is the following. Any symmetric matrix M can be diagonalized in a simple manner: There is an N x N matrix of full rank e having e T = e - I and such that CM C T is a diagonal matrix with the diagonal entries equal to the characteristic values of M. But then z· Mz = (Cz)· eMCT(Cz). and for every Z E RH there is a z such that i = Cz. Thus. the matrix M is negative definite if and only if the diagonal matrix CMC T is. But it is straightforward to verify that a diagonal matrix is negative definite if and only if everyone of its diagonal entries is negative. Part (iii): Suppose that M - I is negative definite and let z ¥- O. Then z· M z = (Z'MZ)T = z'MTz = (MTz)'M-I(MTz) < O. Part (iv): It is known that a matrix A is stable if and only if there is a symmetric positive definite matrix E such that EA is negative definite. Thus, in our case, we can take A = KM and E = K- . •

M. 0:

.. A T RIC E . :

NEG A T I V E

( • E .. II 0 E FIN I TEN E ••

AND

0 THE R

Proof: (i) The necessity part is simple. Note that by the definition of negative definiteness we have that every ,M, is negative definite. Thus, by Theorem M.D.!, the characteristic values of ,M, are negative. The determinant of a square matrix is equal to the product of its characteristic values. Hence, I,M,I has the sign of (-IY. The sufficiency part requires some computation. which we shall not carry out. It is very easy to verify for the case N = 2 [if the conclusion of (i) holds for a 2 x 2 symmetric matrix. then the determinant is positive and both diagonal entries are negative; the combination of these two facts is well known to imply the negativity of the two characteristic values]. For (ii), we simply note the requirement to consider all permutations. For example. if M is a matrix with all its entries equal to zero except the N N entry, which is positive. then M satisfies the nonnegative version of (i) but it is not negative semidefinite according to Definition M.D.1. Notice that in part (iii) we only claim necessity of the determinantal condition. [n fact, for nonsymmetric matrices the condition is not sufficient. • Example M.D.I: Consider a real-valued function of two variables, I(x" x,). In what follows. we let subscripts denote partial derivatives; for example, 112(X,. x,) = iJ' I(x I' x,)/iJx, iJx,. Theorem M.C.2 tells us that 1(') is strictly concave if

Xl)]

D'/(x" x,) = [/"(X,, x,) 112(X" I,I(X I • X,) I,,(x l , X,)

is negative definite for all (XI' x,), According to Theorem M.D.2, this is true if and only if

'

For positive definite matrices, we can simply reverse the words "positive" and "negative" wherever they appear in Theorem M.D.1. Our next result (Theorem M.D.2) provides a determinantal test for negative definiteness or negative semidefiniteness of a matrix M. Given any T x S matrix M, we denote by ,M the r x S submatrix of M where only the first t ~ Trows are retained. Analogously, we let M, be the T x s submatrix of M where the first s ~ S columns are retained, and we let ,M, be the t x s submatrix of M where only the first r oS T rows and s oS S columns are retained. Also. if M is an N x N matrix, then for any permutation It of the indices {I, ...• N} we denote by M' the matrix in which rows and columns are correspondingly permuted.

and

/ll(X I • x,) 112(X" X')I > 0, I"(x,, x,)

IIII (x " x,)

or equivalently, if and only if

and

111(X" x,)/,,(x l , x,) - U12(X" x,)]' > O. Theorem M.C.2 also tells us that 1(') is concave if and only if D' I(x" x,) is negative semidefinite for all (XI' x,). Theorem M.D.2 tells us that this is the case if and only if

Theorem M.D.2: Let M be an N x N matrix. (i) Suppose that M is symmetric. Then M is negative definite if and only if (-l),I,M,1 > 0 for every, = 1..... N. (ii) Suppose that M is symmetric. Then M is negative semidefinite If and only if ( -1 )'J,M;I 2: 0 for every' = 1•...• N and for every permutation It of the indices {1 ....• N}. (iii) Suppose that M is negative definite (not necessarily symmetric). Then ( -1 )'J,M;I > 0 for every, = 1....• N and for every permutation It of the indices {1 .... • N}.8

and

/II(X" x,) l , x,)

I111(x

ltiXI,Xl)I2:0. 1,,(xI' x,)

and, permuting the rows and columns of D'/(x l , x,).

1/,,(x l • x,)1

~

0

and

Xl)l . .

I,,(x I , x,) 111(x" O. '" 111(X" x,)

I112(X I, x,)

Thus. 1(') is concave if and only if 111(x I , x,)

7. A matrix M is stable irall or its characteristic values have negative real pans. This terminology is motivated by the ract that in this case the solution or the system or differential equations dx(r)/dl = MX(I) will converge to zero as I _ 00 ror any initial position x(O). 8. A matrix M such that - M satisfies the condition in (iii) is called a P matrix. The reason is that the detenninant of any submatrix obtained by deleting some rows (and corresponding columns)

:s 0,

I"(x,, x,) oS 0, and

•

is positive.

.....

PRO PER TIE'

937

938

MATHEMATICAL

-

SECTION

APPENDIX

-------------------------------------------------------------------------A similar test is available for positive definite and semidefinite matrices: The results for these matrices parallel conditions (i) to (iii) of Theorem M.D.2, but omit the factor (_1)'.9 Theorem M.D.3: Let M be an N x N symmetric matrix and let B be an N x S matrix with S ~ N and rank equal to S.

(-1)'!

,M:

(,B")T

= O}

tSEMI)DEFINITENESS

AND

OTHER

and (performing the appropriate permutations) f"(x,, x,)

f"(x,, x,)

f,(x" x,)

f12(X" x,)

fl1(X" x,)

f,(x" x,) ~ O.

f,(x" x,)

f,(x" x,)

0

-

To characterize matrices that are positive definite or positive semidefinite on the subspace {t e IJl:N: Bz = O}, we need only alter Theorem M.D.3 by replacing the term ( _ I)' with ( _I )s.

0

for r = S + 1, ... , N. (ii) M is negative semidefinite on {z eRN: Bz z ERN with Bz = 0 and z ,;. 0) if and only if

NEGATIVE

2f,(x" x,)f,(x" x,)fdx" x,) - [f,(x" x,)]' f"(x,, x,) - [f,(x" x,)]'fl1 (x" x,) ~ O.

,M, ,B! > 0

(,B)T

MATRICES:

Computing these two determinants gives us the necessary and sufficient condition

(i) M is negative definite on {z eRN: Bz = O} (i.e., z· Mz < 0 for any z eRN with Bz = 0 and z ,;. 0) if and only if

(-1)'!

M.D:

(I.e., z'Mz ~ 0 for any

Theorem M.D.4: Suppose that M is an N x N matrix and that for some p » 0 we have Mp = 0 and MTp = O. Denote Tp = {z eRN: p'z = O} and let if be the (N - 1) x (N - 1) matrix obtained from M by deleting one row and the corresponding column.

,B"I ~0 0

for r = S + 1, ... , N and and every permutation n, where ,8" is the matrix formed by permuting only the rows of the matrix ,8 according to the permutation n (,M; is, as before, a matrix formed by permuting both the rows and columns of ,M,).

(i) If rank M = N - 1, then rank Nt = N - 1. (ii) If z· Mz < 0 for all z e Tp with z,;. 0 (i.e., if M is negative definite on Tp), then z· Mz < 0 for any z eRN not proportional to p. (iii) The matrix M is negative definite on Tp if and only if Nt is negative definite.

Proof: We will not prove this result. Note that it is parallel to parts (i) and (ii) of Theorem M.D.2 with the bordered matrix here playing a role similar to the matrix there. _

Proof: (i) Suppose that rank M < N - I, that is, Mz = 0 for some i e RN -, with i ,;. O. Complete to a vector z e RN by letting the value of the missing coordinate be zero. Then we have that, first, z is linearly independent of p (recall that p» 0) and, second, Mz = 0 and Mp = O. Thus, rank M < N - I, which contradicts the

z

Example M.D.2: Suppose we have a function of two variables, f(x l , x,). We assume that Vf(x) ,;. 0 for every x. Theorem M.C.4 tells us that f(·) is strictly quasi concave if the Hessian matrix D' f(x l , x,) is negative definite in the subspace (z e R': Vf(x)' z = O} for every x = (x" x,). By Theorem M.D.3 the latter is true if and only if fl1(X" x,)

f12(X" x,)

fleX"~ x,)

f"(x,, x,)

f"(x,, x,)

f,(x" x,) > 0,

fleX"~ x,)

f,(x" x,)

hypothesis. (ii) Take a z e RN not proportional to p. For IX, = (P'z)/(P'p) and z* we have z* e Tp and z' ,;. O. Because MTp = Mp = 0, we have then

(iii) This is similar to part (ii). In fact, part (ii) directly implies that M is negative definite if M is negative definite on Tp (because for any ze RN -', z'Mi = z'Mz, where z has been completed from z by placing a zero in the missing coordinate, and if z ,;. 0 this z is by construction not proportional to pl. For the converse, let n denote the row and column dropped from M to obtain M. If for every z' e Tp with z'';' 0 we let z = z' - (z;/P.)p, then z. = 0 and z';' 0 [if z were equal to zero, then we would have z' = (z;/P.)p in contradiction to z'·p = 0]. Moreover, z"Mz' = z·M. = z'Mz < O. _

0

2f,(x" x,)f,(x" X,)f12(X" x,) - [flex"~ x,)]'fdx" x,) - [f,(x" x,)]' fl1(X" x,) > O.

= XIX, we get 2x,x, > 0 confirming that the function is strictly quasiconcave. By Theorem M.C.4, f(·) is quasiconcave if and only if the Hessian matrix D' f(x" x,) is negative semidefinite in the subspace (z e R': Vf(x)' z = O} for every x = (x" x,). By Theorem M.D.3 this is true if and only if

If we apply this test to f(x" x,)

fl1(X" x,)

fdx" x,)

f,(x" x,)

f"(x,, x,)

f,(x" x,) ~ 0,

f,(x l , x,)

f,(xI' x,)

IX,p,

z'Mz = (z· + IX,p)'M(z' + IX,p) = z"Mz' < O.

or equivalently, if and only if

f"(x,, x,)

=z-

Definition M.D.2: The N x N matrix M with generic entry aji has a dominant diagonal if there is (p" ... ,PN) »0 such that, for every i = 1, ... , N, !pjajA > Li*; !Pia;J Dellnltlon M.D.3: The N x N matrix M has the gross substitute sign pattern if every nondiagonal entry is positive. Theorem M.D.S: Suppose that M is an N x N matrix.

0

(i) If M has a dominant diagonal, then It Is nonsingular. (ii) Suppose that M is symmetric. If M has a negative and dominant diagonal then it is negative definite.

9. Recall that M is positive (semi)definite if and only if - M is negative (semi)definite. Moreover,

hM,1 = (-I)'I,M,I·

...

PROPERTIES

939

940

MATHEMATICAL

APPENDIX

(iii) If M has the gross substitute sign pattern and if for some p » 0 we have Mp « 0 and MTp « 0, then M is negative definite. (iv) If M has the gross substitute sign pattern and we have Mp = MTp = 0 for some p » 0, then Nt is negative definite, where Nt Is any (N - 1) x (N - 1) matrix obtained from M by deleting a row and the corresponding column. (v) Suppose that all the entries of M are nonnegative and that Mz« z for some z » 0 (i.e., M is a productive input-output matrix). Then the matrix (1- M)-' exists. In fact, (/- M)-' = L~:cf M*.

-

SECTION

IMPLICIT

FUNCTION

THEOREM

941

x

R

~(ij)

-------7!T--- ~(.)

x' --------=x x' ------- I I I I I I

I

:

: I I I I I

: I I I I I

Figure M.E.1

A locally solvable equation. (a) Solutions or f(x; q) = 0 near (;C, q). (b) The graph or ~(- ).

q' ij q'

f(·;q')

j{-; ij) f(·;q')

(a)

(b)

Suppose that x = (x, •.... xN) E A and ii = (ii, •.. ·• iiM) E B satisfy equations (M.E.I). That is. I.(x. ii) = 0 for every n. We are then interested in the possibility of solving for x = (x, •...• x N ) as a function of q = (q, •...• qM) locally around ii and .ii. Formally. we say that a set A' is an open neighborhood of a point x E IRN if A' = {x' E IR N: IIx' - xII < £} for some scalar £ > O. An open neighborhood B' of a point q E IRM is defined in the same way. Definition M.E.1: Suppose that x = (x" ... ,xN) E A and q = (q" ... ,qM) E B sa~sf! the equations (M.E.1). We say that we can locally solve equations (M.E.1) at (x, q) for x = (x, . ... ,XN) as a function of q = (q" ...• qM) if there a.re open neighborhoods A' c A and B' c B, of x and q, respectively, and N untquely determined "implicit" functions '1,( .), ... , '1N(') from B' to A' such that 'n('1,(q). ... ,'1N(q); q) = 0

for every q

E

B' and every n,

and for every n. In Figure M.E.I we represent. for the case where N = M = I, a situation in which the system of equations can be locally solved around a given solution. The implicit function theorem gives a sufficient condition for the existence of such implicit functions and tells us the first-order comparative statics effects of q on x at a solution.

M.E The Implicit Function Theorem

Theorem M.E.1: (Implicit Function Theorem) Suppose that every equation I n (·) is continuously differentiable with respect to its N + M variables and that we consider a solution x = (x" ... , XN) at parameter values q = (q" ...• qM)' that is, satisfying 'nIx; q) = 0 for every n. If the Jacobian matrix of the system (M.E.1) ~It~ respect to the endogenous variables, evaluated at (x, q). is nonsingular, that IS, If

The setting for the implicit function theorem (1FT) is as follows. We have a system of N equations depending on N endogenous variables x = (x, •...• x N) and M parameters q = (q, •...• qM): XN;

THE

I.~---------~~---------------

Proof: (i) Assume, for simplicity. that p = (I •...• I). Suppose. by way of contradiction. that Mz = 0 for z # O. Choose a coordinate n such that Iz,l ~ Iz,.1 for every other coordinate n'. Then la•• z.1 > L, •• la.,z.1 ~ L,.,la.jzjl. where 0/) is the generic entry of M. Hence. we cannot have Lj a,)z) = O. and so M z # O. Contradiction. (ii) If M has a negative dominant diagonal then so does the matrix M - aI, for any value a ~ O. Hence. by (i) we have (_I)NIM - all # O. Now if a is very large it is clear that (-ltIM - all> 0 (since (_I)NIM - all = (-I)NaNI(Mja) - II and 1-/1=(-1)''). Moreover. since (-I)NIM-aII is continuous in a and (_I)NIM - all # 0 for all a ~ O. this tells us that (-I)NIM - all> 0 for all a ~ O. Hence. (-I )NIMI > O. By the same argument. (-IYI,M,I > 0 for all r. So. if M is also symmetric then by part (i) of Theorem M.D.2 it is negative definite. (iii) The stated conditions imply that M + MT has a negative and dominant diagonal [in particular. note that Mp« 0 and AfT p «0 implies that p.(2a.. ) < - L) •• Pj(aj • + a.) for all n. where 0/) is the generic entry of M]. Because, by the gross substitute property. 0/) > 0 for i # j.this gives us Ip.(2a.. )1 > IL) .. p)(a", + 0,,)1 for all n. Hence. the conclusion follows from part (ii) of this theorem and part (i) of Theorem M.D.!. (iv) If M satisfies the condition of (iv). then the fact that M has the gross substitute sign pattern implies that M does as well and that Mp «0 and MT p «0. Hence. M satisfies the conditions of (iii) and is therefore negative definite. (v) This result was already proved in the Appendix to Chapter 5 (see the proof of Proposition 5.AA.I).

f,(x, •...•

M.E:

q, •...• qM) = 0 (M.E.I)

Of,(x, q)

Of,(x, q)

ox,

oXN

fN(x, •...• xN;q, •...• qM) = 0

# 0,

The domain of the endogenous variables is A c RN and the domain of the parameters is Be RM.'O

OfN(x, q)

OfN(x, q)

ox,

oXN

(M.E.2)

then the system can be locally solved at (x, q) by implicitly defined lunctions 'In: B' _ A' that are continuously differentiable. Moreover, the first-order effects

10. In wha, follows. we ,ake A and B to be open sets (sec Section M.F) so as to avoid boundary problems.

..

942

MATHEMATICAL

- -

APPENDIX

of q on x at (x, 17) are given by Dq'l(q) = -[Dxf(x; qlr'Dqf(x; 17).

(M.E.3)

Proof: A proof of the existence of the implicit functions 'I.: B' -+ A' is too technical for this appendix, but its common-sense logic is easy to grasp. Expression (M.E.2), a full rank condition, tells us that we can move the values of the system of equations in any direction by appropriate changes of the endogenous variables. Therefore, if there is a shock to the parameters and the values of the equation system are pushed away from zero, then we can adjust the endogenous variables so as to restore the "equilibrium." Now, given a system of implicit functions '1(q) = ('1,(q), ... ,'1N(q)) defined on some neighborhood of (x, Ii), the first-order comparative static effects iJ'I.(ii)/iJqm are readily determined. Let f(x; q) = (/, (x; q), ... ,fN(X; q)). Since we have

f('1(q); q) = 0

CONTINUOUS

FUNCTIONS

AND

COMPACT

Definition M.E.2: Given open sets A c ~ and 8 c RM , the (continuously differentiable) system 01 equations f(·;

Microeconomic Theory

Microeconomic Theory

Microeconomic theory