CSE$473:$Ar+ficial$Intelligence$
$ Reinforcement$Learning$
Dan$Weld$ University$of$Washington$
[Most$of$these$slides$were$created$by$Dan$Klein$and$Pieter$Abbeel$for$CS188$Intro$to$AI$at$UC$Berkeley.$$All$CS188$materials$are$available$ at$hNp://ai.berkeley.edu.]$
Midterm$Postmortem$
! It$was$long,$hard…$"$
! Max $ $ $41$$ ! Min $ $ $13$ ! Mean$&$Median $27$
! Final$
! Will$include$some$of$the$midterm$problems$
Office$Hour$Change$(this$week)$
! Thurs$ 10`11am$
! CSE$588$ ! (Not$Fri)$
“Listen Simkins, when I said that you could always come to me with your problems, I meant during office hours!”
Reinforcement$Learning$ Two$Key$Ideas$
! Credit$assignment$problem$ ! Explora+on`exploita+on$tradeoff$
Reinforcement$Learning$
! Basic$idea:$
! Receive$feedback$in$the$form$of$rewards$ ! Agent’s$u+lity$is$defined$by$the$reward$func+on$ ! Must$(learn$to)$act$so$as$to$maximize$expected$ rewards$ ! All$learning$is$based$on$observed$samples$of$
- utcomes!$
Environm ent$
$
Age nt$
Ac+ons:$a$ State:$s$ Reward:$r$