The Game of Life, Decision & Communication Roland M uhlenbernd - - PowerPoint PPT Presentation

the game of life decision communication
SMART_READER_LITE
LIVE PREVIEW

The Game of Life, Decision & Communication Roland M uhlenbernd - - PowerPoint PPT Presentation

T HE G AME O F L IFE P RE -D ECISION L EARNING C OMMUNICATION The Game of Life, Decision & Communication Roland M uhlenbernd T HE G AME O F L IFE P RE -D ECISION L EARNING C OMMUNICATION O VERVIEW 1. Introduction: The Game Of Life 2.


slide-1
SLIDE 1

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

The Game of Life, Decision & Communication

Roland M¨ uhlenbernd

slide-2
SLIDE 2

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

OVERVIEW

  • 1. Introduction: The Game Of Life
  • 2. Pre-Decision
  • 3. Learning
  • 4. Communication
slide-3
SLIDE 3

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

GAME OF LIFE’S RULES OF NATURE

  • 1. under-population: any alive cell

with fewer then two alive neighbor cells dies

  • 2. surviving: any alive cell with two
  • r three alive neighbor cells lives
  • n to the next generation
  • 3. overcrowding: any alive cell with

more than three alive neighbor cells dies

  • 4. reproduction: any dead cell with

exactly three alive neighbors becomes an alive cell

Xdu Xdo Xs Xs Xs Xr

slide-4
SLIDE 4

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

GAME OF LIFE’S RULES OF NATURE

Play the Game of Life on http://www.bitstorm.org/gameoflife/

  • r

http://www.denkoffen.de/Games/SpieldesLebens/

slide-5
SLIDE 5

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

DECREASING OCCUPATION SHARE

200 400 600 800 1000 1200 1400 1600 500 1000 1500 2000 2500 3000 Basic Game of Life

Figure : The number of alive cells decreases from initially around 1225 (25%) to finally 158 (3.2%) on average over 15 runs (70x70 grid).

slide-6
SLIDE 6

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

THE NON-DETERMINISTIC n-DIE GAME

P1: Initialization

  • 1. Create a list of all alive cells in a random order

P2: Sacrifice Decision

  • 2. Delete successively all cells with n neighbors

P3: Rules of Nature

  • 3. Apply the rules of nature of the game of life
slide-7
SLIDE 7

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

200 400 600 800 1000 1200 1400 500 1000 1500 2000 2500 3000 3-die modification

3-die game (1.8%)

200 400 600 800 1000 1200 1400 1600 500 1000 1500 2000 2500 3000 4-die modification

4-die game (6.9%)

200 400 600 800 1000 1200 1400 1600 500 1000 1500 2000 2500 3000 5-die modification

5-die game (14.7%)

200 400 600 800 1000 1200 1400 1600 500 1000 1500 2000 2500 3000 6-die modification

6-die game (3%)

slide-8
SLIDE 8

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

basic game.3.die game.4.die game.5.die game.6.die 0.05 0.10 0.15

slide-9
SLIDE 9

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

FROM SITUATIONS TO ACTIONS

◮ Set of states T = {t1, t2, t3, t4, t5, t6, t7, t8} ◮ Set of situations Γ = {γ = ti, tj|ti ∈ T is the state of an

alive cell c, tj ∈ T the state of an alive neighbor cell of c}

◮ Set of actions A = {adie, astay}

X t1, t3 X t3, t1 X t2, t2 X t2, t3

slide-10
SLIDE 10

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

REINFORCEMENT LEARNING

Reinforcement learning account RL = {σ, Ω}

◮ response rule σ ∈ (Γ → ∆(A)) ◮ update rule Ω: if action a is successful in situation γ, then

increase the probability σ(a|γ)

◮ an action a is considered as successful, if and only if

OSa > OS¬a

slide-11
SLIDE 11

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

THE n × m-DIE LEARNING GAME

P1: Initialization

  • 1. Initialize an RL account for Γ and A

P2: Sacrifice Decision

  • 2. For all ci ∈ C:

2.1 pick randomly a neighbor cj ∈ Ni and request its state tm 2.2 play action a via response rule σ(a|tn, tm), where tn is the state of ci 2.3 if a = adie delete cell ci, RL update Ω

P3: Rules of Nature

  • 3. Apply the rules of nature of the game of life
slide-12
SLIDE 12

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

RESULTS

500 1000 1500 2000 2500 500 1000 1500 2000 2500 3000 Signaling with given meaning

Course of 20 simulation runs

all successful failed 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35

Box plots of all, successful and failed runs

◮ the average occupation share over all runs is 17.6% (862 cells) ◮ the average occupation share of successful runs is 28.4% (1392

cells), for failed runs 1.4% (69 cells)

slide-13
SLIDE 13

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

RESULTS

Definition (Neighbor treatment rules)

For the n × m-die learning game a successful strategy can be characterized by the following two rules:

  • 1. Sacrifice if your neighbor has exactly 4 neighbors.
  • 2. Never sacrifice if your neighbor has less than 4 neighbors.
slide-14
SLIDE 14

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

BUT...

”In our opinion, the property of access restriction to direct neighborhood information is an important requirement for all following pre-games since this property reflects the spatial character of the rules of nature of the game of life. We denote this requirement as the local information rule.”

X t1, t3 X t3, t2 X t2, t2 X t2, t3

slide-15
SLIDE 15

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

SIGNALING GAMES

A signaling game SG = (S, R), T, M, A, U is

◮ played between a sender S and a receiver R ◮ S has private information state t ∈ T ◮ S sends a message m ∈ M ◮ R responds with a choice of action a ∈ A ◮ U : T × A → R defines the success of communication

slide-16
SLIDE 16

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

THE n-MESSAGES SIGNALING GAME

P1: Initialization

  • 1. Create a RL account for the signaling game

SGn = (S, R), T, M, A, U witn n messages P2: Sacrifice Decision

  • 2. For all ci ∈ C:

2.1 pick randomly a neighbor cj ∈ Ni and make a state request for its state t 2.2 cj sends a message m ∈ M via response rule σ(m|t) 2.3 ci plays action a ∈ A via response rule σ(a|m) 2.4 if a = adie delete cell ci, RL-update Ω P3: Rules on Nature

  • 3. Apply the rules of nature of the game of life
slide-17
SLIDE 17

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

RESULTING SUCCESSFUL STRATEGIES

t1 t2 t3 t4 t5 t6 t7 t8 ma mb adie astay

Result for 2 messages

t1 t2 t3 t4 t5 t6 t7 t8 ma mb mc md adie astay

Result for 4 messages

slide-18
SLIDE 18

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

RESULTS

◮ Always one ”death message”, but often multiple ”survive

messages” and unused messages

◮ Successful strategies realize ”Neighbor treatment rules” ◮ Strong tendency for t5, astay and t6, adie ◮ The rate for learning a successful strategy increases with

the number of messages

percentage of runs 0.2 0.4 0.6 0.8 1 2 4 6 8 n =

slide-19
SLIDE 19

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

OUTLOOK

◮ How do rules of nature affect evolving signaling systems?

→ Experiments with changed rules of nature

◮ General question: how do signaling strategies evolve

under selective pressure determined by environmental / nature rules?

slide-20
SLIDE 20

THE GAME OF LIFE PRE-DECISION LEARNING COMMUNICATION

Thanks for attention!