An Introduction to Poker Opponent Modeling Peter Chapman Brielin - - PowerPoint PPT Presentation

an introduction to poker opponent modeling
SMART_READER_LITE
LIVE PREVIEW

An Introduction to Poker Opponent Modeling Peter Chapman Brielin - - PowerPoint PPT Presentation

An Introduction to Poker Opponent Modeling Peter Chapman Brielin Brown University of Virginia 1 March 2011 It is not my aim to surprise or shock you-but the simplest way I can summarize is to say that there are now in the world machines that


slide-1
SLIDE 1

An Introduction to Poker Opponent Modeling

Peter Chapman Brielin Brown

University of Virginia

1 March 2011

slide-2
SLIDE 2

It is not my aim to surprise or shock you-but the simplest way I can summarize is to say that there are now in the world machines that think, that learn and that create. Moreover, their ability to do these things is going to increase rapidly until - in a visible future - the range of problems they can handle will be coextensive with the range to which the human mind has been applied.

slide-3
SLIDE 3

It is not my aim to surprise or shock you-but the simplest way I can summarize is to say that there are now in the world machines that think, that learn and that create. Moreover, their ability to do these things is going to increase rapidly until - in a visible future - the range of problems they can handle will be coextensive with the range to which the human mind has been applied. Herbert Simon - 1957 [1]

slide-4
SLIDE 4

Goals Basic Knowledges of General Approaches to Opponent Modeling (OM) Ability to Implement the Simple OM System Used in Loki

slide-5
SLIDE 5

Goals Basic Knowledges of General Approaches to Opponent Modeling (OM) Ability to Implement the Simple OM System Used in Loki

slide-6
SLIDE 6

Outline

1 Motivation 2 General Approaches 3 Loki Opponent Modeling

slide-7
SLIDE 7

Opponent Modeling

slide-8
SLIDE 8

Opponent Modeling

Goals:

Understanding the internal state of the opponent Predicting the opponent’s future actions

slide-9
SLIDE 9

Deep Blue

”There is no psychology at work” in Deep Blue, says IBM research scientist Murray

  • Campbell. Nor does Deep Blue

”learn” its opponent as it plays. Instead, it operates much like a turbocharged ”expert system,” drawing on vast resources of stored information (For example, a database of opening games played by grandmasters over the last 100 years) and then calculating the most appropriate response to an opponent’s move.

slide-10
SLIDE 10

Scrabble

slide-11
SLIDE 11

Rock-Paper-Scissors

slide-12
SLIDE 12

Rock-Paper-Scissors

slide-13
SLIDE 13

Rock-Paper-Scissors

slide-14
SLIDE 14

Rock-Paper-Scissors

int getComputerInput () { int t o t a l = seenPaper+seenRock+s e e n S c i s s o r s ; int choice = rand () % t o t a l ; i f ( choice < seenPaper ) return SCISSORS ; else i f ( choice < seenRock ) return PAPER ; else return ROCK; }

slide-15
SLIDE 15

Rock-Paper-Scissors

int henny () { return ((∗ o p p h i s t o r y ? o p p h i s t o r y [ random ()%∗

  • p p h i s t o r y +1]+1:random () )%3) ;

}

slide-16
SLIDE 16

Optimality and Maximality

Optimal Play

Nash Equilibrium

Maximal Play

Making non-optimal moves in order to increase expected value

slide-17
SLIDE 17

Poker opponent modeling is hard.

slide-18
SLIDE 18

Difficulties of Poker Opponent Modeling

Fundamental Uncertainties

[2]

Each hand is completely different Difficult to extract a “signal” through the noise.

slide-19
SLIDE 19

Difficulties of Poker Opponent Modeling

Fundamental Uncertainties

[2]

Each hand is completely different Difficult to extract a “signal” through the noise.

Time to Learn

[3]

Need to get a good model working in less than 100 hands

slide-20
SLIDE 20

Difficulties of Poker Opponent Modeling

Missing Information

[2]

A fold does not reveal opponent’s hand Few games make it the showdown

slide-21
SLIDE 21

Difficulties of Poker Opponent Modeling

Missing Information

[2]

A fold does not reveal opponent’s hand Few games make it the showdown

Different Criteria for Different Players

[2]

Position at the table

Generally better to have loose player on the right and tight player on the left [4]

Stack size, blind size and position, previous actions of

  • ther players

Mood of the game and players Player skill Hand strength

slide-22
SLIDE 22

Difficulties of Poker Opponent Modeling

The past is not necessarily a good predictor of the future

[5]

Looking only at the recent history does not work Humans have emotions Good opponents change strategies Your opponent is modeling you

slide-23
SLIDE 23

Rational Opponent

The implicit model in Minimax search Variations possible

slide-24
SLIDE 24

Prepared Strategies

Simple Prepared Strategy Come up with some poker strategy that works against everyone Categorical Prepared Strategy Loose, tight, passive, aggressive, etc.

slide-25
SLIDE 25

Statistical Approach

Simple Percentage of time opponent sees the flop Percentage of time caught bluffing Complex Frequency opponent goes for the straight

slide-26
SLIDE 26

Neural Networks

slide-27
SLIDE 27

Neural Networks

slide-28
SLIDE 28

Loki

Predecessor to Poki and Norse God or J¨

  • tunn or Both.

[6]

slide-29
SLIDE 29

Loki

slide-30
SLIDE 30

Loki

Keep in mind Loki’s OM only tries to figure out opponents cards.

slide-31
SLIDE 31

Hand Assessment

Hand Strength (HS)

Pre-flop strength is calculated through offline random simulation After the flop, the strength is the percentile ranking of the current hand in relation to all the other (1081) possible dealt pairs

A♦ − Q♣ with the flop 3♥ − 4♣ − J♥ 444 better hands, 9 equal hands, and 628 worse hands

628+ 9

2

1081 = 58.5%

slide-32
SLIDE 32

Hand Assessment

Hand Potential

Positive Potential (PpotN): The probability of improving to the best hand after N more cards Negative Potential (NpotN): The probability of falling behind after N more cards For each 1,081 hands, look at 990 combinations of the two cards after the flop

Effective Hand Strength (EHS)

Hands where player is ahead or have a positive potential

slide-33
SLIDE 33

Opponent Modeling Calculate a weight for each of the 1,081 possible

  • pponent hands

Assumes ”reasonable” behavior, seems vulnerable to bluffing Can include specific opponent history to increase accuracy

slide-34
SLIDE 34

Opponent Modeling

Initial Weights

slide-35
SLIDE 35

Opponent Modeling

Re-weighting

slide-36
SLIDE 36

Knowledge is power, if you know it about the right person. Erastus Flavel Beadle (1821-1894)

slide-37
SLIDE 37

References

  • S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, 3rd ed.

Upper Saddle River, NJ, USA: Prentice Hall Press, 2009.

  • A. Davidson, A. Davidson, D. Szafron, R. Holte, and W. Pedrycz,

“Opponent modeling in poker: Learning and acting in a hostile and uncertain environment,” Tech. Rep., 2002.

  • D. Billings and D. Billings, “Algorithms and assessment in computer poker,”
  • Tech. Rep., 2006.
  • K. Glocer and M. Deckert, “Opponent modeling in poker,” Tech. Rep., 2007.
  • M. Salim and P. Rohwer, “Poker opponent modeling.”
  • D. Billings, D. Papp, J. Schaeffer, and D. Szafron, “Opponent modeling in

poker,” Proceedings of the Fifteenth National Conference of the American Association for Artificial Intelligence (AAAI), 1998. [Online]. Available: http://www.cs.ualberta.ca/∼{}games/poker/publications/AAAI98.pdf

slide-38
SLIDE 38

Initial Explanation Detailed