Академический Документы
Профессиональный Документы
Культура Документы
Game theory is the study of strategic interaction. A game consists of several key
elements:
∼∼∼
A best response to s¬k is a strategy sk for player k that maximises her payoff, given
those strategies of the other players.
A dominant strategy s̄k is a best response to every strategy s¬k of the other players –
it gives a higher payoff to player k than any other strategy sk .
A Nash equilibrium is a collection of strategies "s∗1 , s∗2 , . . . , s∗K # such that s∗k is a
best response to s∗¬k for each player k = 1, 2, . . . , K.
Note: Don’t confuse an equilibrium with an equilibrium outcome. The latter is what
actually happens (actions, expected payoffs) when the players adopt their equilibrium
strategies.
In a sequential game where Player 1 first chooses U or D and Player 2 then chooses L
or R, an equilibrium might be the pair of strategies "U; if U then L, if D then R#, and
then the outcome would be the pair of actions "U; then L# with the associated payoffs.
1
Examples
(1) Both the Row and Column players have dominant strategies (“Prisoner’s Dilemma”):
Left Right
3 5
Top
3 −2
−2 −1
Bottom
5 −1
Left Right
3 −1
Top
3 −2
−2 5
Bottom
5 −1
Left Right
3 −1
Top
5 −1
−2 5
Bottom
−2 3
Left Right
1 −1
Top
−1 1
−1 1
Bottom
1 −1
2
1.2 Oligopoly
p(q) = α − γq
1.2.1 Cournot
Best response function (or reaction curve): Firm k chooses qk to maximise πk , taking
q¬k as given.
1.2.2 Stackelberg
As above, except sequential moves: take firm 1 as the leader, firm 2 as the follower.
Firm 1 knows that firm 2 will produce q2 (q1 ) = (α − c − γq1 )/2γ, i.e. the best response
in the Cournot game above.
3
q2 "
#
#
#
#
# Strategic
# BR1
# Substitutes
#
#
#
M2 $ #
$$ #
$
$$# C
$#!
$
# $$!
# $ S
# $$
# $$
# $ BR2
$$
# $
## $$
$ !
M1 q1
Note: If each firm’s action set is restricted to {Cournot qty., 12 Monopoly qty.}, then the
game is strategically equivalent to a Prisoner’s Dilemma.
Comparisons
α−c
Perfect competition γ
c
1 α−c
Monopoly/Collusion 2 γ
c + 12 (α − c)
4
1.2.3 Bertrand
Duopoly, homogeneous goods, each firm chooses a price; one-shot, simultaneous move
game.
p2 "
pM %
% %
%
%%
%
%
BR1 %%
%% BR2
%%
%
%
%
% Strategic
c
Complements
!
c p M
p1
Note: We get the competitive outcome in a homogeneous market with only two firms.
5
1.3 Duopoly with Differentiated Goods
1.3.1 Bertrand
p2 "
&
&
&
BR1 &
&
& '
''
& ''''
'
''' &
BR2
'' &
''
'' &
&
&
&
&
&
&
& !
p1
The reaction curves intersect at prices which are greater than marginal cost, but lower
than the prices which prevail at the Cournot equilibrium . . .
6
2 Game Theory & IO – Collusion & Repeated Games
Example: Incumbent threatens a price war if new firm enters the market:
Accommodate Fight
1 −1
Enter
1 −1
2 2
Don’t enter
0 0
Strategies: Each produce 12 Monopoly qty. as long as opponent does likewise, else punish
by producing a ‘large’ quantity.
These might constitute a NE, but certainly not a SPE – the firms will play the NE of
the one-shot Cournot game in the last period.
Knowing this, the firms cannot sustain collusion in the next to last period, and will
play the NE of the one-shot Cournot game in the next to last period too.
Knowing this, . . . the firms play the NE of the one-shot Cournot game in every period.
All simple finite games with complete & perfect information can be solved using back-
ward induction, and repeated one-shot games unwind like this.
So, if NE is unique, we have this NE at each stage.
7
2.2.2 Incomplete/imperfect information
N potential entrants, one per period. Incumbent is ‘weak’ with probability close to 1,
else ‘tough’:
(•$
( $$
Enter (( $ Don’t enter
( $$
(( $
•(( $$
•
(($$
Accommodate (( $$ Fight (0, 2)
( $
(( $$
"(
( $"
$
(1, −1) (−1, 1)
Even if the incumbent is weak, the threat to fight (act tough) is credible in early periods,
hence building a false reputation.
Since the threat is credible, entry does not occur early on.
Consider trigger strategies – play X as long as opponent doesn’t play Y , in which case
play Z.
Repeated quantity setting duopoly model. Define the following profits in the stage game:
• π ∗ from colluding
• π d from deviating
Strategies: Collude as long as opponent does likewise; if either firm deviates this period,
punish by both firms reverting to Cournot forever, starting next period.
8
Alternatively . . .
gain from deviating: π d − π ∗ today;
loss from deviating: π ∗ − π C from tomorrow on;
deviation is not profitable if
δ ! ∗ "
πd − π∗ < π − πC
1−δ
A minmax punishment is the worst that one player can do to the other, given that
the player being punished is responding optimally to its punishment. Firm 1 minmaxes
firm 2 at the quantities "q̃1 , q̃2 # if those quantities solve
# $
min
q
max
q
{π2 (q1 , q2 )}
1 2
A payoff pair is feasible if there is a pair of strategies, one for each player, that generates
it.
The Folk Theorem states that any feasible payoff pair (weakly) better than that guar-
anteed by individually rational strategies can be supported by trigger strategies incor-
porating minmax punishments, for suitable values of δ.
Points to note:
• Cournot punishments are credible but not severe
• minmax punishments are severe but not credible
9
2.3.3 “Carrot and Stick” (relenting punishments)
d δ C π∗
π + π <
1−δ 1−δ
Abreu (1986), JET: any credible punishment will do that gives the deviator a punishment
payoff of
π∗
Πp < − πd
1−δ
e.g. a price war for one or more periods, followed by a return to co-operation.
Note that Πp is the sum of the deviator’s profits during the punishment phase plus those
after the punisher has relented.
Market demand is given by P̂t = P (Qt )θt , where θt is a multiplicative shock with mean
1 and known distribution F (θ).
Firms cannot observe rivals’ output, but try to infer it from the observed market price
P̂t .
If the price is low, is it because demand is low or because some firm is over-producing?
Trigger strategies: Firms each produce q ∗ until market price falls below p̃, at which point
they punish (by producing Cournot quantity q C ) for T periods.
Find strategies (q ∗ , p̃, T ) that sustain collusion as SPE.
In equilibrium, no-one cheats, but price occasionally falls below p̃, so punishment is
triggered which hurts everyone.
So why don’t they scrap the trigger?
Because then everyone would cheat all the time.
10
3 Game Theory & IO – Entry
Revisit Stackelberg . . .
(See also Bain, Sylos, Modigliani (1950s, 60s); refs. in Tirole Ch.8.)
The leader firm says that it will produce a particular quantity, q1S .
If the follower firm believes this, then it may enter (depending on the fixed costs of entry)
and produce its best-response quantity, q2S .
If the leader firm can now change its choice, it would do so: q1S is not its best response
to q2S .
If the follower firm believed this in the first place, then (if it entered) it would produce
the best-response quantity of the Cournot game, as would the leader.
We saw in Section 2.1 that a threat by an incumbent firm to fight after a potential rival
entered the market was not credible (in a finite horizon game of complete & perfect
information).
Having observed k, firm E (potential entrant) decides whether or not it will enter the
market in period 2 – entry will entail a fixed cost F .
Period 2: Firm I produces up to k with unit cost w, beyond k with unit cost r + w.
If firm E does not enter, it incurs no costs.
If firm E enters, the two firms engage in quantity competition, but firm E incurs the
fixed cost F and has a unit cost of r + w.
11
Stage 2: Quantity competition
Firm E’s best response function is bE (qI | r, w).
Firm I’s best response function is bI (qE | r, w; k).
Firm E will enter the market if its expected post-entry profit covers the fixed cost.
qE " qE "
) )
) ## ) ##
) BRI ) BRI
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
$$ * $$ *
$ ) $ )
$$* $$*
*# *#
) )
C * $#$ )
$ ) $ )
$ $
!
$
Z
C * $$ S )
* $$ ) * $$ )
)# $ )# $ Z
* $) N * $) N
$$ $$
#
* *
# * ) $$ BRE # * ) $ BR
BRI BRI $$ E
* *
* !$
) $ )
) $$ ! * ) $$ !
qI qI
• Entry might not be deterred . . . but if it is, output and welfare increase
qE " qE "
) )
) ## ) ##
) BRI ) BRI
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
* ) * )
$$ * $$ *
$$ * ) $$ * )
# #
$* ) $* )
$$ ) $$ )
C * $$ S !$ C * $$ S #
) )
#)
* $ ) * $ )
$# $
$ $
)# $ #N
* $) N * $
* Z * Z $) $
$
BR#I * ) $$ BRE BR#I * ) $$ BRE
* *
* !$
) $$ ) $$
* ) $ ! ) $ !
qI qI
12
3.2.2 Imperfect information
Period 1: Firm I is alone in the market, and knows its own production costs.
It charges a price p1 , making a profit limited by either M L or M H , depending on its
costs – it could make less by charging a price different from its static monopoly price.
Having observed firm I’s first-period price, firm E (potential entrant) decides whether
or not to enter the market in period 2 – it can base its decision on the observed price.
• Assume that an incumbent with low costs benefits more from deterring entry than
one with high costs:
M L − DL > M H − DH
and also that
θDEL + (1 − θ)DEH < 0
i.e. θ is large enough to deter entry unless firm E learns that firm I has high costs.
Firm I would like to discourage entry either by keeping firm E unsure, or by convincing
firm E that it is low-cost.
Firm I has an incentive to charge a low price in period 1 to do this . . .
But firm E knows about this, and might not be fooled . . .
13
Gain/Loss
"M L −. M L (p1 ) M H − M H (p1 )
...
. ... .. ..
... ...
... ... ...
... .. ..
... .. ..
δ(M L − DL ) × ...
...
...
...
...
...
...
.
...
...
. .
...
...
.
... .. ..
... .. .... ...
.
... ...
... ... ... ...
... ... ... ...
... ... ... ...
... ... ... ...
...
...
... ... ...
...
...
... ... ...
.. ..
...
...
... .
... ...
.
...
... ... ... ...
... ... ... ...
... ... .. ..
... ... .
... ....
... ...
... ... ... ...
... ... ... ...
... ... ... ...
... ... ... ...
... ... ... ...
δ(M H − DH ) ...
...
...
× ...
...
... ..
...
...
...
..
...
... ... . .
... ... .. ..
... ... ... ...
... ... ... ...
... ... ... ...
....
....
.... .
....
. .
....
.
.... . .
.... .... .... ....
.... .... .... ....
.... .... ... ...
.... ....
....... .
. ....
.
.... .... ... ...
.... .... ... ...
.... .... .... ....
.... .... .... ....
....
....
....
. ...... . .
....
..... . ....
...... ...... ..... ......
...... ...... ...... ......
....... ....... ...... .......
.......
........ ....... ............ .......
.
..
.
...
.
.. .
...
..
........ ........ ................. ........
..........
.......... ..........
............
...............................................................
............
....................... ..................................
........ !
p̃ p̂ pLm pH
m p1
[P] Firm I charges the same first-period price pLm independent of its costs.
Firm E learns nothing and so stays out, leading to a total profit for a high-cost firm I
of M H (pLm ) + δM H .
Firm I cannot gain from increasing first-period price if it has high costs since this induces
entry, leading to a total profit for firm I of at most M H + δDH < M H (pLm ) + δM H .
Note: This implies that, if has high costs, firm I is, in principle, prepared to lower
first-period price as far as p̂ if this manages to deter entry.
Firm E learns the costs of firm I and enters if and only if they are high.
That is, firm E is convinced that firm I has low costs if it observes a first-period price p̂.
So firm I makes a total profit of M H + δDH if has high costs, and a total profit of
M L (p̂) + δM L if it has low costs.
Firm I cannot gain from decreasing first-period price to p̂ if it has high costs – firm E
stays out but firm I makes M H (p̂) + δM H = M H + δDH .
Firm I cannot gain from increasing first-period price above p̂ if it has low costs – this is
a credible choice for a high-cost firm I (see the note above), so firm E enters and firm I
makes at most M L + δDL < M L (p̂) + δM L .
Note: This implies that, if it has low costs, firm I is, in principle, prepared to lower
first-period price as far as p̃ in order to convince firm E.
14
3.3 Business strategies
Incumbent can make an investment K which affects both its own competitive position
in the subsequent market, and also an Entrant’s profit.
The investment can make the incumbent either ‘Tough’ (e.g. cost reduction) or ‘Soft’
(e.g. increase client base through advertising).
But . . .
If actions are strategic complements (e.g. prices) and investment makes I tough,
then underinvest.
If actions are strategic complements (e.g. prices) and investment makes I soft,
then overinvest.
K makes Incumbent · · ·
Actions are Tough Soft
..
.
Strategic
substitutes overinvest underinvest
(quantities)
Strategic
complements underinvest overinvest
(prices)
15