Вы находитесь на странице: 1из 24

Static Games of Complete Information

Introduction

We will first look at situations where players or agents -more than one-
choose actions simultaneously ( a term to be made precise soon). Importantly,
the utility or payoff to any player is not determined just by that players
choice of action - it depends on other players choice of action. So, there
is strategic interaction, and each player has to form conjectures about what
other players will choose to play. Complete Information implies that every
player knows the entire structure of payoffs, not just her own, but also that
of others- otherwise conjectures about other players actions are harder to
make.1
A convenient representation of strategic possibilities is the normal or
strategic form.
A game in strategic form is (N, {Ai , }iN , u)
(i) N = {1, 2, . . . , n} is the finite set of players.

(ii) For each i N , Ai is the pure strategy set or action set of player i.
Each strategy is a description of the action that player wants to take.

(iii) u : A <n is the payoff function, where A = A1 A2 . . . An .

Example 1 ( Cournot Oligopoly)


Firms i = 1, . . . , n.
Firm i s strategy set is Ai = <+ ,
Payoff function = profit function.
Example 2 (Sealed Bid First Price Auction)
There are n bidders, each with valuation vi for an indivisible object,
say a painting.
Each bidder announces a bid. The announcement is the strategy or
action to be chosen by bidder i. So, Ai = <+
Highest bidder gets the object, pays what he bids. So, payoff is vi bi
to the winner; all others get 0 payoff.
Example 3 (Sealed Bid Second-Price Auction)
1
We implicitly make a much stronger assumption of what is called common knowledge.

1
The set up is the same. But, the highest bidder pays the bid announced
by the second-highest bidder.

2
Example 4 (Prisoners Dilemma)

C D
C (1, 1) (1, 2)
D (2, 1) (0, 0)

Example 5:

L R
U (2, 1) (1, 0)
M (0, 0) (0, 2)
D (1, 1) (2, 0)

Example 6: (Matching Pennies)

H T
H (1, 1) (1, 1)
T (1, 1) (1, 1)
i can guarantee himself only a payoff of 1.
But, what if he keeps opponent guessing?
Such games provide a rationale for randomization or using mixed strate-
gies.

3
Mixed Strategies
A mixed strategy is a probability distribution over the set of pure strate-
gies.
For simplicity, let us assume that each Ai is finite.
Let (Ai ) i denote the set of probability distributions over Ai , and
iN i .
i (ai ) is the probability with which ai is played.
Player is randomization is statistically independent of those of other
players.
The payoffs to a profile of mixed strategies are the expected values of the
corresponding pure-strategy profiles.

X n
Y
ui () = j (aj ) ui (a) (1)
sS j=1

Solution Concepts
How should rational players play a game? Rationality is equivalent to
saying that players choose actions in order to maximise their own (expected)
payoffs given the prevailing information structure. So, as outside observers
can we predict the outcome(s) that are likely to arise in a given game? This
requires us to specify the likely choice of actions of rational players in a game
and leads to the notion of a solution concept. Of course, an appropriate
solution concept will depend on assumptions made about the information
structure.
Consider the Prisoners Dilemma game - it is in the interest of each player
to play D. But, (C,C) gives strictly higher payoffs to both players. Why cant
the 2 players cooperate and decide to play (C,C)? This is because players
cannot sign binding contracts. We are in a non-cooperative world in which
any contract has to be self-enforcing - no one should be to deviate unilaterally
from the equilibrium play and become better off.2

2
This is also why splitting up the monopoly profit equally between two Cournot
duopolists cannot be a Cournot equilibrium.

4
Domination

Definition 1 A strategy ai strictly dominates a0i if ai Ai ,

ui (ai , ai ) > ui (a0i , ai )

ai is a strictly dominant strategy if it strictly dominates every a0i Ai , a0i 6=


ai .

Definition 2 A strategy ai weakly dominates a0i if ai Ai ,

ui (ai , ai ) ui (a0i , ai )

and there is a0i such that

ui (ai , a0i ) > ui (a0i , a0i )

Definition 3 A pure strategy ai is strictly dominated for player i if there


exists i i such that

ui (i , ai ) > ui (ai , ai ) for all ai Ai (2)

The strategy ai is weakly dominated if there exists i0 such that (2) holds
with weak inequality, and the inequality is strict for at least one ai .
A pure strategy can be strictly dominated by a mixed strategy even it
is not dominated by any pure strategy in the support of the mixed strategy.
(Check Example 5).
Claim: A rational player can never play a strictly dominant strategy
in any self-enforcing equilibrium. (Check that this assertion makes sense.)
Correspondingly, a rational player should play a strictly dominant strat-
egy if it exists.
Question : Can there be more than one strictly dominant strategy?
But, most interesting games of economic interest do not have strictly
dominant strategies. However, there are games which do possess weakly
dominant strategies. Here are a couple of examples.

5
Sealed Bid Second -price auction

Theorem 1 : Truthful bidding is a weakly dominant strategy for every bid-


der in the second-price auction.

Proof. Choose bidder i, fix bids bi .


Let bj = max bi .
We want to show that player i cannot gain either by overbidding (i.e.
bidding more than true valuation) or underbidding( bidding less than valua-
tion).
Suppose player i overbids. Compare a bid of bi > vi versus bidding
truthfully. The only case in which this makes a difference is when bi > bj > vi .
In this case, she gets the object by bidding bi , but no when bidding vi . But,
she pays bj > vi . So, payoff is

vi bj < 0

By bidding vi , she gets 0.


Suppose she underbids and bids bi < vi .
If i win with bi , then she also wins with vi , and hence earns the same
payoff of vi bj .
But, if vi > bj > bi , i wins with vi but not with bi , and

ui (vi , bi ) = vi bj > 0

6
Provision of Discrete Public Good
Agent i values a discrete public good at vi . This is her net willingness
to pay for the public good. Note that vi can be negative since cost shares
may be incorporated into the description of the project.3
Efficiency : supply iff iN vi 0.
P

Truthtelling cant be optimal without taxes and transfers. (Why?).


Groves-Clark mechanism to elicit truth-telling as a weakly dominant
strategy.
Suppose each i announces vi . Agents are allowed to lie, and so vi need
not be equal to vi .
The allocation rule chosen by planner is to provide public good if
iN vi 0.
P

In this case, each agent i is paid


(
if iN vi 0,
P P
j6=i vj
ti (v1 , . . . , vn ) =
0 otherwise .

Transfer can be negative (tax).


Weakly dominant strategy to announce vi = vi .
Fix any vector of other announcements vj .
Suppose vi + j6=i vj 0.
P

Any announcement that results in public good being provided gives


P
him same payoff - vi + j6=i vj .
Any announcement that results in public good not being produced gives
him 0.
Show that vi is the optimal announcement when vi + j6=i vj < 0.
P

3
That is, the government asks the following question. If we build the bridge and
ask you to pay a sum of xi if the bridge is built, how much (in money terms) additional
utility do you get?

7
Iterated elimination of strictly dominated strategies
It is reasonable to assert players will use dominant strategies if they
exist - only need to know strategy sets, their own payoffs.
Also, it is reasonable that players will not use dominated strategies.
Suppose more information is available - the entire payoff matrix is
known, and there is common knowledge of rationality
If ai is strictly dominated, then j should know that i will not play ai .
This might result in some of js strategies which were not strictly dom-
inated earlier being dominated once ai is eliminated.
This opens up the possibility of iteratively eliminating dominated strate-
gies.

8
Example : The Cournot Duopoly
Consider 2 duopolists producing a homogeneous good, with constant
marginal cost of production

ci (qi ) = 10qi

The market demand curve is

p(q) = 100 (q1 + q2 )

Then, the profit function of (say) firm 1 is

1 (q1 , q2 ) = (100 (q1 + q2 ))q1 10q1


= 90q1 q12 q1 q2

The first order condition gives


90 q2
q1 = (3)
2
Since q2 0, firm will never produce more than 45!
Show that any q1 > 45 is strictly dominated by q10 = 45.
But, symmetry implies that any q2 > 45 is strictly dominated for firm 2.
Given common knowledge of knowledge, firm 1 should use this information,
and conclude that q2 [0, 45]. From equation 3, this implies that

q1 22.5

So, the second round of elimination implies that qi [22.5, 45]. This process
can be continued. Using the fact that the 2 firms are symmetric, the process
converges to an interval [qmin , qmax ] satisfying
90 qmax
qmin =
2
90 qmin
qmax =
2
The only solution to these is
q = 30 (4)

9
There could be a problem with iterated elimination of weakly dominated
strategies - order of deletion may matter, as shown in the enxt example.
Example 7:

L R
U (1, 1) (0, 0)
M (1, 1) (2, 1)
D (0, 0) (2, 1)

First eliminate U (weakly dominated by M), and then L(weakly domi-


nated by R). This leads to the outcome (2, 1). Alternatively, eliminate D
(weakly dominated by M), and then R (weakly dominated by R). This leads
to the outcome (1, 1).

10
The Notion of Best Response
The fundamental difference between a one-person decision problem and
game theory is the strategic interdependence between players. In a decision
problem, knowledge of the decision problem allows the agent to calculate
optimal decisions. This is not gnerally true in game theory. Optimal decisions
depend on the structure of the game as well as the choice of action of other
players unless players have dominant strategies.
This is well-illustrated in the game below. (Can you explain why?)
Example 8: (Battle of the Sexes)
O F
O (2, 1) (0, 0)
F (0, 0) (1, 2)
Here, R wants C to play O and then he will also play O. (On the other
hand, C wants both players to play to F.) But, R will play O only if he
believes that C will play O. In other words, optimal actions must depend
upon beliefs about others choice of actions.
Definition 4 A belief of player i about other players actions is a probability
distribution i over Ai .
That is, i (ai ) is the probability with which i believes that others will play
ai .
Notice that this definition allows for the possibility that player 1 may
have beliefs which dont pin down the others strategies. For example, R may
believe that C will play both F and O with probability half.
Given a particular belief about other players actions, player i should then
play a best response. That is, if is belief is represented by i , then i should
play the optimal action given this belief. So, if R believes that C is playing
F and O with equal probability, then he should play O. (Why?)
Definition 5 An action ai is a best response to belief i if no other action
a0i gives him a strictly higher expected payoff given this belief. That is, ai is
a best response to i if
X X
ui (ai , i (ai )ai ) ui (ai , i (ai )ai ) for all ai Ai
ai ai

Sometimes, more than one action may be a best response to some belief.
This gives rise to the best response correspondence BRi (i ).

11
Rationalizability
Rationalizability is a weak solution concept- it is one that derives
restrictions on players choice of actions from assumptions that rationality
and payoffs are common knowledge.
It asks : what are all the strategies that a rational player can possibly
play?
Player cant reasonably play a strategy which is not a best response to
some beliefs about his opponents strategies. Moreover, rationality means
that opponents will also use only strategies which are best responses to some
beliefs.
An action ai is never a best response if ai / BR(i ) for any belief mui .
Clearly, i should not use ai if it is never a best response. Neither should
player i expect his opponents to use actions that are in their never best
response sets. Eliminating these actions gives rise to smaller reduced sets
of actions. Clearly, one can then iteratively delete never best actions in the
same way as in iterative elimination of strictly dominated strategies. This
will give rise to the set of rationalizable strategies.

Definition 6 An action aj is rationalizable in the game (N, (Ai )iN , (ui )iN )
if for each j N , there is a set Zj Aj such that
(i) aj Zj ,
(ii) every action aj Zj is a best response to a belief j of player j whose
support is a subset of Zj .

Application of the set of rationalizable strategies in the Cournot duopoly


problem.
The p-beauty contest
Consider a game with N = {1, 2, . . . , n}. Players have to choose simul-
taneously an integer between 0 and 20. So the strategy set of each player
is
Ai = {0, 1, . . . , 20}
The winner is the player whose announcement is the closest to 34 of the
a verage announcement. So, if n = 3, and announcements are a1 = 4, a2 =
8, a3 = 6, then the average is 6 and 4.5 is 43 of 6. So, the winner is player 1.
: How would you play this game?

12
: What can be the set of rationalizable strategies
: What are the set of strategies which survive iterated elimination of
strictly dominated strategies?

13
Relationship between Rationalizability and Iterated Elimination
of Strictly Dominated Strategies

Theorem 2 If X = jN Xj survives iterated elimination of strictly domi-


nated strategies, then Xj is the set of js rationalizable strategies.

Sketch of Proof: Suppose ai is rationalizable. Let (Zj )jN be the profile


of sets that supports ai as a rationalizable strategy. Each action in Zj is a
best response to some belief over Zj and so is not dominated in the game
(N, (Xit ), (uti )). So, for all t, Zj Xjt . Hence, ai Xi .
No action in Xj is strictly dominated when strategy sets are restricted
to Xi for each i. So, each action in Xj must be a best response among the
members of Xj to some belief on Xj . (Check why.)
The proof is completed by showing that each aj Xj is a best response
amongst all actions in Aj to some belief on Aj .
Suppose not. Then, there is some value of t such that aj is a best response
amongst members of Xjt to a belief j on Xj , but is not a best response
amongst Xjt1 . Then, some bj Xjt1 must be a best response to j . But,
bj is dominated at stage t 1, which is a contradiction.

14
Nash Equilibrium

All the solution concepts discussed so far have no predictive power in the
BOS game. The Nash equilibrium solution concept ties down actions and
beliefs in a more precise manner. The concept is built around the notion
that
(i) Actions must be a best response to beliefs
(ii) Beliefs must be correct in equilibrium.
Formally,

Definition 7 A mixed strategy profile is a Nash equilibrium if for all


players i
ui ( ) ui (ai , i

) for all ai Ai . (5)

A pure strategy Nash equilibrium is a pure-strategy profile which satisfies


(5).
The condition can also be written in a different way. For any i and i ,
define the best reply correspondence for player i as follows.

Bi (i ) = {i i |ui (i , i ) ui (i0 , i )i0 i }.


Then, is a Nash equilibrium iff i Bi (i

) for all i.

So, if player i expects others to play i , then he has no reason to deviate
from i . This is the sense in which it is an equilibrium.

15
Example 8: (Battle of the Sexes)

B F
F (0, 0) (2, 1)
B (1, 2) (0, 0)

There are two pure strategy Nash equilibria - (F,F) and (B,B).

Example 9 (Bertrand Duopoly)


Ai = [0, p] ( firms compete in prices)
Q(p) market demand curve.
Firm charging lower price captures entire market.
If pi = pj = p, then both firms sell Q(p)/2.
Unique equilibrium a with ai = c for all i where c is the common
constant marginal cost of production.
Example 10 (Cournot Equilibrium )
Example 11 (Hotelling Competition)
There is a continuum of voters who vote for two political candidates.
Each candidate simultaneously chooses a policy platform ai [0, 1], which
is the policy line (i.e. the set of feasible policy platforms.) Voters ideal
points are uniformly distributed on [0, 1]. Each voter votes for the candidate
whose announced policy is closest to her ideal point.
The unique Nash equilibrium is the midpoint of the line (the median voter
theorem).
Suppose now there are 3 parties. The problem is considerably more
complicated.
Case 1: Parties care only about winning, not vote shares.
Then, the only equilibrium possible must be of the form xi = xj < xk
and k winning, or xi < xk = xk and i winning. For instance, there could
be an equilibrium where xi = xj = 1/4, xk = 3/4. Check that this is an
equilibrium.
Case 2: Parties care also about vote shares.
Then there is no pure strategy Nash equilibrium.

16
Existence
Use of fixed point theorems to prove existence of equilibrium.
Define the set-valued function B : 7 by B() = iN Bi (i ).
A Nash equilibrium is an action profile such that B( ).
That is, must be a fixed point of the mapping B.
Example: Let f : [0, 1] [0, 1] be a continuous function. Then, there
must be a number x [0, 1] such that f (x) = x.
Kakutani Fixed Point Theorem: Let X be a compact, convex, subset
of some Euclidean space <M , and let f : X X be a set-valued function
such that

(i) for all x X, f (x) is non-empty and convex.

(ii) the graph of f is closed. That is, for all sequences {xn }, {yn } such that
yn f (xn ) for all n, xn x0 , yn y0 , we have y0 f (x0 ).

Then, there exists x such that x f (x ).

Theorem 3 The game (N, {Ai }iN , {ui }iN ) has a (pure strategy) Nash equi-
librium if for all i N ,
Ai is a non-empty compact convex subset of a Euclidean space.
Each ui is continuous and quasi-concave on Ai .

Proof. Define B(a) = iN Bi (ai ).


A continuous function on a compact set has a maximum. So, each Bi (ai )
is non-empty.
To check that each Bi (.) is convex-valued, suppose ai , a0i Bi (ai ) for
some ai .
Define ai tai + (1 t)a0i for some t [0, 1].
From quasi-concavity of ui ,

ui (ai , ai ) ui (ai , ai ) = ui (a0i , ai )

Hence, ai Bi (ai ).
Now, take a sequence (aki , aki )k converging to (ai , ai ) with aki Bi (aki )
for each k.
This implies that ui (aki , aki ) ui (a0i , aki ) for all a0i Ai .

17
By continuity of ui , this implies that ui (ai , ai ) ui (a0i , ai ).
Hence, ai Bi (ai ).
So, the graph of B is closed, and B satisfies all the conditions for a fixed
point to exist by Kakutanis theorem.
This fixed point is a Nash equilibrium.

Theorem 4 Every strategic game in which each player has a finite number
of pure actions has a mixed strategy Nash equilibrium.

Proof. Suppose each Ai is finite.


Let i be typical mixed strategy. So, (A) is a compact, convex subset
of some Euclidean space. and the best response function is a mapping from
(A) to itself.
Each players payoff function is linear and hence continuous in his own
mixed strategy. So, the best response correspondence is non-empty for the
same reason as in Theorem 2.
Suppose i , i0 are in Bi (i ).
Let i ti + (1 t)i0 . Then,

ui (i , i ) = tui (i , i ) + (1 t)ui (i0 , i )

So, B is convex-valued.
Closedness also follows from continuity of ui . Hence, a fixed point must
exist.

18
Here are some obvious facts about Nash equilibria. Suppose is a
Nash equilibrium. Then,

(i) No i can be strictly dominated.

(ii) All pure strategies in the support of i must give individual i the same
(expected) payoff.

(iii) If the iterated elimination of strictly dominated strategies produces a


unique strategy profile , then that must be a Nash equilibrium.

Proof of (iii): Suppose a is the action profile obtained by iterated


elimination of strictly dominated strategies.
If it is not a Nash equilibrium, then there is i and ai such that ui (ai , ai ) >
ui (a ).
k+1
Then, there is a sequence a1i , . . . , aK i such that ai dominates aki in
round k for all k = 1, . . . K 1 with ai = a1i , ai = aK i .
Since ai survives the iterated elimination of dominated strategies,
ui (ak+1
i , ai ) > ui (ak+1
i , ai ).
Transitivity implies ui (ai , ai ) > ui (ai , ai ).

19
Can we identify games which will have pure strategy Nash equilibria?
Potential Games (Monderer and Shapley, Games and Economic Be-
havior, 1996)

Definition 8 (N, {Ai , ui }iN ) is a potential game if there is a function


P : A < such that for all i N , for all ai , a0i , for all ai ,

P (a) P (a0i , ai ) > 0 iff ui (a) ui (a0i , ai ) > 0

Theorem 5 If (N, {Ai , ui }iN ) is a potential game and P () attains a maxi-


mum over the set A, then the game has a pure strategy Nash equilibrium.

Proof. : Let P (a ) P (a) for all a A. Then, a is a Nash equilibrium.


For consider any deviation by i to ai . Since P (a ) P (ai , ai ), we must have
ui (a ) ui (ai , ai ).
Examples of Potential Games.
Cournot oligopoly game. Define the potential as
n
" #
X
P (q1 , . . . , qn ) = q1 q2 . . . qn F ( qi ) c
i=1

where F (.) is the inverse demand function and c is the constant marginal
cost.
Check that this is a potential.

20
Congestion game: defined by a set of resources E.
The strategy set of player i is Ai 2E .
For each resource e E, a cost function ce : N R gives the cost of
resource e as a function of xe , the number of players using the resource.
The cost function of each player is given by
X
ci (a) = ce (xe )
eai

Players want to minimize cost.


Example: consider a directed graph where each vertex represents a city
and each edge is a road between cities.
Each player wants to go from the origin city to the target city.
The resources here are the roads, the cost is the time taken to travel
down the road, increasing in the number of others who also travel down the
same road.
Ai is the same for all players - any set of edges which takes a player
from the origin to the target.

Theorem 6 Every congestion game is a potential game.

Proof.
For any strategy vector, define
xe
XX
P (a) = ce (j)
eE j=1

Suppose player i deviates from ai to a0i . Define

/ ai , e a0i }
E + = {e E|e

E = {e E|e ai , e
/ a0i }
For every resource that is both in ai and a0i or neither in ai or a0i , there is
no effect of the deviation on P . So,

P (a) P (a0i , ai ) =
X X
ce (xe ) ce (xe + 1)
eE eE +
= c(ai , ai ) c(a0i , ai )

21
Supermodular Games
These are games where playerss strategies can be ordered, and where
each players marginal utility of increasing his strategy rises with increases
in his rivals strategies : strategic complements.
Implication best response of a player is a nondecreasing function of
other players strategies.
Here is a definition of supermodular games (not the most general one
possible).
Suppose each Ai is a compact interval of <, and ui is twice continuously
differentiable.
2u
The game is supermodular iff a i ai
j
0.

Examples
1. Bertrand game with heterogeneous goods:
Let demand functions be:
X
Di (pi , pi ) = ai + bi pi + dij pj
j6=i

where bi < 0, dij > 0. Let

ui (pi , pi ) = (pi c)Di (pi , pi )

This game is supermodular -

2 ui
>0
pi pj
.
2. Cournot duopoly game:
Suppose inverse demand function Pi (qi , qj ) and firm is marginal revenue
Pi + qi P
qi
i
are decreasing in qj . The payoffs are

ui (qi , qj ) = qi (Pi (qi , qj ) ci )

The game is supermodular under a change of variable. Define Ai = qi and


aj = qj .

22
Tragedy of the Commons :
Nash equilibria are often Pareto suboptimal.
There are n farmers in a village.
Value per cow = v(C), when C is the total number of cows grazing.

v 0 < 0, v 00 < 0, v(C) > 0

The cost is d per cow..


Assume farmers decide simultaneously how many cows to graze.
X
i (ci , ci ) = ci v(ci + cj ) dci
j6=i

Suppose c is a Nash equilibrium. Then,

cj ) + ci v 0 ( cj ) d = 0
X X
v(
j j

Add all the first-order conditions, and divide by n


1X 0 X
cj ) +
X
v( c v ( cj ) d = 0
j n j j j

The social optimum is obtained by maximising

Cv(C) dC

The first order condition for this is:

v(C ) + C v 0 (C ) d = 0
n
c > C - overgrazing.
P
So, i
j=1

23
Correlated Equilibrium
Consider the BOS game described below.
B F
F (6, 6) (2, 7)
B (7, 2) (0, 0)
There are 3 Nash equilibria, with payoffs being (7,2), (2,7) and (14/3,
14/3).
Pre-play communication allows players to improve payoffs.
They can correlate their strategies by agreeing to play the joint actions
(B,F), (F,F) and (B,B), each with probability 1/3.
Notice that the expected payoff is then (5,5), which is not in the convex
hull of the Nash equilibrium payoffs.
Using mediator to enforce correlated strategies.
Once 1 hears the recommendation F, what can she infer about what
player 2 has received?
Check that she has no incentive to deviate from the recommendation.
A correlated strategy is a probability distribution over A.
Given any , let S() denote the support of , That is,
S() = {a A|(a) > 0}

Definition 9 A correlated strategy of the game (N, (Ai )iN , (ui )iN ) is a
correlated equilibrium if for all a S(), for all players i, and for all a0i Ai ,
ui (a0i , bi )(ai , bi )
X X
ui (ai , bi )(ai , bi )
bi Ai bi Ai

Two observations:
1. A Nash equilibrium must be a correlated equilibrium. (Why?)
2. The set of correlated equilibria is a convex set.
Theorem 7 Every action used with positive probability by some player in a
correlated equilibrium of a finite strategic game is rationalizable.

Sketch of Proof: Suppose Zi is the set of actions used by i in a correlated


equilibrium . Then, any ai Zi is a best response to the (conditional)
distribution over Ai induced by and ai . The support of this distribution
is in Zi .

24

Вам также может понравиться