de Scale-free adaptive PLANNING for deterministic dynamics & discounted rewards
Peter Bartlett, Victor Gabillon, Jennifer Healey, Michal Valko
ICML - June 13th, 2019
1/11
de Scale-free adaptive PLANNING for deterministic dynamics & - - PowerPoint PPT Presentation
de Scale-free adaptive PLANNING for deterministic dynamics & discounted rewards Peter Bartlett, Victor Gabillon, Jennifer Healey, Michal Valko ICML - June 13th, 2019 1/11 An MCTS setting MDP with starting state x 0 X , action space A
1/11
2/11
2/11
2/11
3/11
3/11
4/11
4/11
5/11
5/11
5/11
5/11
...
6/11
7/11
8/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
r02 r03 r04 r35 r56
9/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
f105 h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
9/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
f134 h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
9/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
f134 h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
9/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
r04 r04 r04 r04 r04 r04 r04 r04 r04 r04 r04 r04
h=0 h=1
...
...
10/11
h=0 h=1 h=2 h=3 h=4 h=5
x4 x3 x2 x5 x6 x7
r04 r04 r04 r04 r04 r04 r04 r04 r04 r04 r04 r04
h=0 h=1
...
...
10/11
11/11