[PDF] - Game Theory: Lecture #11 Outline: Strategic form games Best PDF Document

SLIDE 1

Game Theory: Lecture #11

Outline:

SLIDE 2

Strategic games

– Set of players, N = {1, ..., n} – A set of actions for each player i ∈ N, Ai. – This induces the set of action profiles A = A1 × A2 × ... × An – For each player, preferences over action profiles characterized by a function: Ui : A → R

what should an agent do in a given game?

for the other agents in the game

– Model of other agents: worst-case / adversarial – Reasonable choice: Security strategies – Expected performance: Security levels

L R T 2, 2 0, 0 B 0, 0 ǫ, ǫ

SLIDE 3

Nash Equilibrium

Ui(a∗) = Ui(a∗

for every ai ∈ Ai.

– {−i} represents all players other than i, i.e., {−i} = {1, . . . , i − 1, i + 1, . . . , n} – a−i represents the choice of all players other than i, i.e., a−i = {a1, . . . , ai−1, ai+1, . . . , an}

– Optimization case: An optimizer will play the best action – Nash equilibrium: An action profile in which each player is acting as an optimizer – Term “rational” implies that an agent is an “optimizer”

– Definition: The best response function of player i, Bi(·), is Bi(a−i) = {ai : Ui(ai, a−i) ≥ Ui(a′

Note that the best response “function” is actually a “set” – An action profile a∗ is a Nash equilibrium if for every player i, a∗

i.e., each player is playing a best response to the actions of other player

SLIDE 4

Descriptive question: What’s the outcome?

– Does a Nash equilibrium exist? – Is a Nash equilibrium unique? – Which Nash equilibrium? – Why Nash equilibrium?

– Setup: Cooperate vs Defect? C D C 3, 3 0, 4 D 4, 0 1, 1 – Also used to model work vs shirk? arm vs disarm? – What if played several times?

B S B 2, 1 0, 0 S 0, 0 1, 2

Stag Hare Stag 2, 2 0, 1 Hare 1, 0 1, 1

Alt Std Alt 3, 3 0, 0 Std 0, 0 1, 1

SLIDE 5

Nash equilibrium, cont

H T H 1, −1 −1, 1 T −1, 1 1, −1

players −i at stage k − 1

– Stage 1: (H, H) – Stage 2: (H, T) – Stage 3: (T, T) – Stage 4: (T, H) – Stage 5: ...

SLIDE 6

Example: Routing game

– High road: cH + nH – Low road: cL + nL

High satisfied Low satisfied cH + nH ≤ cL + nL + 1 cL + nL ≤ cH + nH + 1 cH + nH ≤ cL + (N − nH) + 1 cL + (N − nH) ≤ cH + nH + 1 2nH ≤ N + cL − cH + 1 2nH ≥ N + cL − cH − 1

85 ≤ 2nH ≤ 87 ⇒ nH = 43

84 ≤ 2nH ≤ 86 ⇒ 42 ≤ nH ≤ 43 NE is both nH = 42 or nH = 43

SLIDE 7

Example: Routing

S D High road Low road

– Players: Two agents that each control 1/2 units of splittable traffic. – Actions: Players can route 1/2 of traffic arbitrarily over H and L – Cost: The cost of an agent is just the total cost of it’s traffic Ji(f H

– Convention: Use Ji(·) for cost and Ui(·) for benefit

B1(x) = arg min

1 ≤0.5

f H

B1(x) = 1 4 − x 2

B2(y) = 1 4 − y 2

SLIDE 8

Example: Routing

f H

f H

f H

= 1/6

SLIDE 9

Example: Routing

f ∗

4 − f2 2 f ∗

4 − f1 2

f ∗

2

f ∗

4

f ∗

1 8, 1 4

f ∗

1 8, 3 16

f ∗

5 32, 3 16

f ∗

10 64, 11 64

. . f ∗

6

SLIDE 10

Dominated strategies

– Approach #1: Exhaustively check all joint actions – Approach #2: Investigate best response functions – Best approach depends on game of interest

– Prisoner’s dilemma: Defect was did better than alternatives (strict) – Second price sealed bid: Internal valuation did no worse than alternatives (weak)

L R T 2, 2 1, 1 B 1, 1 1, 2 T weakly dominates B, but both (T, L) and (B, R) are NE (see also auction example)

remaining strategies.

SLIDE 11

Iterated elimination of strictly dominated strategies

L C R T 4, 3 5, 1 6, 2 M 2, 1 8, 4 3, 6 B 3, 0 9, 6 2, 8

L R T 4, 3 6, 2 M 2, 1 3, 6 B 3, 0 2, 8

L R T 4, 3 6, 2