Вы находитесь на странице: 1из 27

g t a p

g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
GAME THEORY & APPLICATIONS
J. Ajith Kumar, TAPMI, Manipal
Pure strategy, Nash Equilibrium and Dominance
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
2
INTRODUCTION
Game Theory has been used to help model phenomena
in Economics, Business, Political Science, Sociology,
Psychology, Evolutionary Biology. Can be applied to
management also.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
3
INTRODUCTION
What is Game in Game Theory?
A game is an interaction (or a set of interactions)
between multiple agents in which the payoff to each
agent is a function of the combination of actions
enacted by all agents.
Many real-life situations can be modelled as games.
Agents are also called players. If there is only one
player, then there is no game. At least 2 are needed.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
4
INTRODUCTION
E.g. cricket, where batsman has options of multiple
strokes, and bowler has option of multiple types of ball.
Simplistically speaking, the number of runs scored by
the batsman can be thought of as a function of the
pairing between the type of ball bowled and type of
stroke played.
Analysis of a game can also be thought of as solving a
decision tree with more than one agent, with each
agent trying to maximize her returns.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
5
Game theoretic analyses can help us understand reality
better and
explain current reality why things are the way they
are.
prescribe actions what should be done to shape
reality in some desirable manner.
HOW GAME THEORY IS USEFUL
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
6
* Simultaneous game a model that assumes players
dont get to see each others actions before they
perform their own actions.
SIMULTANEOUS vs. SEQUENTIAL GAMES
* Sequential game a model that accommodates the
time dimension, and assumes players get to see what
the other players did before performing their own
actions.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
7
A game is typically characterized by
a) the context,
b) the players,
c) actions (strategies) available to the players,
d) the payoff functions of the players , and
e) whether the game is simultaneous or sequential.
CHARACTERISTICS OF A GAME
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
8
Context Players
Actions
available
Payoffs
How the game is
played*
Pricing in the
telecom
industry
Telecom
firms
Different
levels of price
(e.g. high,
medium, low)
Revenues
Profits
Market share
Simultaneous
(over multiple
rounds)
Writing an
Examination
Invigilators,
Students
Invigilators:
(strict,
lenient),
Students:
(cheat, dont
cheat)
Invigilators:
how well the
exam is
conducted.
Students:
performance
in the exam
Simultaneous /
sequential
EXERCISE (10 minutes)
List real life situations that you can model as games. See
examples below.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
9
Theory of Rational Choice
(Choice Theory, Rational Actor Theory)
Decision makers are rational actors. While making a
decision, they choose the alternative(s) that maximizes
their utility under given circumstances. They also have
adequate knowledge and ability to estimate their utilities
and to use them to discriminate between alternatives.
Axioms of Rationality
For rationality to be possible, we assume these to be true:
1. Reflexivity (trivial)
2. Completeness
3. Transitivity
4. Continuity (sufficient but not necessary)
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
App Development Project Game
10
Each student is given a slip in which s/he indicates
her/his choice after reading the problem and thinking
over it. No one should talk to each other.
Read App Development Project in Exercise Set A
How many marked Work hard and how many Hardly
Work? Why did you mark what you did?
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
App Development Project Game
Player 1s utilities are:
u
1
(a
1
= WH | a
2
= WH) = 30
u
1
(a
1
= HW | a
2
= WH) = 70
u
1
(a
1
= WH | a
2
= HW) = 10
u
1
(a
1
= HW | a
2
= HW) = 20
Player 2s utilities will be similar.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
12
App Development Project Game
PARTNER
WH HW
YOU
W
H
30, 30 10, 70
H
W
70, 10 20, 20
Strategic formor normal formrepresentation
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
App Development Project Game
What is the best action for you? Why?
What is the best action for your partner? Why?
We see that regardless of what your partner does, you
get higher payoffs if you chose HW. Thus HW is the
best action for you.
By the same reasoning as above, HW is the best action
for your partner too.
If both players are rational, then what will the
outcome be?
(HW, HW): also called the Nash Equilibrium of this
game.
13
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
14
NASH EQUILIBRIUM
A Nash Equilibrium is an action profile in which no
player can improve his payoff by changing his action,
given that the other players actions remain the same
as in that action profile.
An action profile is a Nash equilibrium if and only if
each players action is his best response to the actions
of all other players in that action profile.
In other words, iff u
i
(a
i
, a
-i
) u
i
(a
i
, a
-i
) for all i, a
i,
is
satisfied, then (a
i
, a
-i
) is a Nash Equilibrium.
Examine if (HW, HW) fulfils the above condition.
Examine if any of the other three action profiles in this game
fulfil the condition and can be called a Nash Equilibrium.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
15
NASH EQUILIBRIUM
Important Caveat:
An NE is an action profile, not a payoff profile.
To say that (HW, HW) is a NE correct
But to say that (20, 20) is an NE is incorrect.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
16
STRICT vs. NON-STRICT NASH EQUILIBRIUM
If in a NE, u
i
(a
i
, a
-i
) > u
i
(a
i
, a
-i
) for all i, a
i
,
then (a
i
, a
-i
) is said to be a Strict Nash Equilibrium.
If not, then it is a nonstrict NE.
Player 2
H T
P
l
a
y
e
r

1
H 1, 2 3, 0
T
2, 1 3, 2
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
17
NASH EQUILIBRIUM: A SOLUTION CONCEPT
1. Indicates possible actions that players might take in a
given situation.
2. Indicates possible stable steady state that a system
might reach after a period of time, over several trials
of the game.
3. An NE can be thought of as a stable social norm. If
everyone else adheres to it, no individual would wish
to deviate from it.
4. Give us a language to discuss what can be done to
change the system behavior (alter players behaviors,
alter the equilibrium). Mechanism Design.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
18
If the players had chosen WH instead of HW, then
both would have got higher payoffs. Yet, why is (HW,
HW) the rational outcome of this game?
App Development Project Game
Lack of adequate trust, understanding? If I choose
WH, how do I know for sure that my partner will also
choose WH? He might choose HW. If he does that
then my payoff will be just 10 points. On the other
hand, I can be assured of getting at least 20 points if I
choose HW and might even get 70 if my partner
chooses WH.
This reasoning implicitly assumes that each player
thinks purely selfishly.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
19
Lesson: When everyone acts rationally, it can lead to
suboptimal outcomes for everyone. The combined
effect of individual rationality can sometimes be
collective irrationality.
App Development Project Game
Can control and coordination mechanisms help?
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 20
They did not understand the question correctly.
They did not calculate all the payoffs correctly.
...other reasons?
Our model predicts that HW would be the rational
choice. Why then did everyone not choose HW?
Lesson: Bounded Rationality (Herbert Simon) the
NE may not always be reached in all real-life
situations.
Lesson: To be perfectly selfish, a person needs to
have complete knowledge and intelligence...!
Otherwise, it may not be possible. Can we say the
same of being perfectly unselfish?
App Development Project Game
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
IMPORTANT LESSONS
21
Lesson: The payoffs that each player gets are a
function of her actions in combination with the actions
of all other players in the game.
Lesson: The payoffs need not always be in terms of a
tangible measure such as money, market share etc...
Utility is a more general conceptualization of payoff.
Covers intangibles like satisfaction, happiness,
frustration etc... Utility: assessed by the modeller and
is thus, subjective in nature.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
22
Action: decision choice available to a player. More
generally, also referred to as pure strategy. Actions are
mutually exclusive and collectively exhaustive. In this
model, a player can play only one action at a time and
not a combination of actions. Often denoted as a
i
.
Action set: defined for each player in the game, as a
listing of all possible actions that the player can choose.
Denoted by A
i
. In the above game, A
1
= {WH, HW} and
A
2
= {WH, HW}.
In this example, equal number and identical actions
available to all players (i.e. two) but not necessary, in
general.
Some Terminology
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
Some Terminology
23
Action profile: a distinct combination of actions of all
the players, in which for each player, one action is
listed, and collectively, the actions are listed in a
standard order. In a two player game, the row players
action is listed first.
An action (strategy) profile is also referred to as an
outcome.
A 2 x 2 game (two players, two actions each) has four
possible action profiles. Likewise a mx n x p game will
have mnp possible action (pure strategy) profiles.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
24
Utility (payoff) profile: Corresponding to each strategy
profile is an utility profile: the distinct combination of
utilities received by all players if the corresponding
strategy profile is played.
Some Terminology
For each player, one utility value is listed, and
collectively, the utilities are listed in the same order in
which actions are listed in the action profile.
In a two player game, the row players utility is listed
first.
A 2 x 2 game has four possible utility profiles. E.g. (30,
30), (70, 10), (10, 70), (20, 20). Likewise a mx n x p
game will have mnp possible utility profiles.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES
25
The NE represents a possible outcome
But is it the best, or optimal outcome?
What would be an optimal outcome,
when there are multiple agents?
How do we determine optimality,
when there are trade-offs?
WHAT IS THE BEST OUTCOME?
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 26
PARETO OPTIMALITY: ANOTHER SOLUTION
CONCEPT
Pareto Domination:
Action profile a Pareto dominates action profile a if for
all i N, u
i
(a) u
i
(a), and there exists some j N for
which u
j
(a) > u
j
(a).
Pareto Optimal:
Action profile a is Pareto optimal (or, Pareto efficient) if
there does not exist another action profile a S, which
Pareto dominates a.
g t a p
g t a p
Prof. J. Ajith Kumar, TAPMI, Manipal GAME THEORY NOTES 27
PARETO OPTIMALITY: ANOTHER SOLUTION
CONCEPT
Player 2
H T
P
l
a
y
e
r

1
H 1, 2 3, 0
T
2, 1 3, 2
Shyam
Run Stand
R
a
m
R
u
n
1, 1 2, 2
S
t
a
n
d
3, 3 4, 4
Ashwin
R Q
M
u
r
a
l
i
M
3, 2 2, 3
L
2, 1 1, 2
India
Fight Dont
P
a
k
i
s
t
a
n
F
i
g
h
t
5, 4 5, 4
D
o
n
t
5, 4 5, 4
Find the NE(s) as well as the Pareto optimal solution(s)

Вам также может понравиться