Wentworth Institute of Technology COMP4050 – Machine Learning | Fall 2015 | Derbinsky
Reinforcement Learning
Lecture 8
November 24, 2015 Reinforcement Learning 1
– Supervised learning: given a training set and a target variable, generalize; performance is measured over a testing set
– Unsupervised learning: given a dataset, find “interesting” patterns; potentially no “right” answer
– Reinforcement learning: learn an optimal action policy over time; given an environment that provides states, affords actions, and provides feedback as numerical reward, maximize the expected future reward
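[Figure: agent-environment interaction loop. The agent observes state s_t and takes action a_t; the (stochastic) environment returns reward r_{t+1} and next state s_{t+1}.]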
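The interaction loop in the diagram above can be sketched in a few lines of Python. This is a minimal illustration, not the course's code: `CorridorEnv`, its `slip` parameter, and `run_episode` are hypothetical names made up for this example, and the corridor task itself is an assumption chosen only because it is small and stochastic.

```python
import random


class CorridorEnv:
    """Hypothetical toy environment: a 1-D corridor with positions 0..4.

    The agent starts at position 2. Reaching position 4 yields reward 1.0,
    reaching position 0 yields reward 0.0; either one ends the episode.
    With probability `slip` the environment reverses the chosen action.
    """

    def __init__(self, slip=0.2, seed=0):
        self.slip = slip
        self.rng = random.Random(seed)
        self.state = 2

    def reset(self):
        self.state = 2
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the environment may flip it.
        if self.rng.random() < self.slip:
            action = -action
        self.state += action
        done = self.state in (0, 4)
        reward = 1.0 if self.state == 4 else 0.0
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Run one episode; return a list of (s_t, a_t, r_{t+1}, s_{t+1}) tuples."""
    trajectory = []
    state = env.reset()
    for _ in range(max_steps):
        action = policy(state)                       # agent chooses a_t from s_t
        next_state, reward, done = env.step(action)  # environment responds
        trajectory.append((state, action, reward, next_state))
        state = next_state
        if done:
            break
    return trajectory
```

Calling `run_episode(CorridorEnv(), lambda s: 1)` runs a policy that always moves right; each tuple in the result corresponds to one pass around the loop in the figure.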
$Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$
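The bracketed term above is a temporal-difference error of the kind used in tabular Q-learning. The sketch below assumes the standard Q-learning update with a step size `alpha` and a discount factor `gamma`; those two names, the `done` flag, and the function name `q_learning_update` are assumptions for illustration and do not come from the slides.

```python
from collections import defaultdict


def q_learning_update(Q, s, a, r, s_next, actions,
                      alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular Q-learning update in place; return the TD error.

    Q maps (state, action) pairs to value estimates.
    alpha (step size) and gamma (discount) are assumed hyperparameters.
    """
    # Value of the best action in the next state; zero if s_next is terminal.
    best_next = 0.0 if done else max(Q[(s_next, a2)] for a2 in actions)
    # The bracketed term: r + gamma * max_a' Q(s', a') - Q(s, a).
    td_error = r + gamma * best_next - Q[(s, a)]
    Q[(s, a)] += alpha * td_error
    return td_error


# Unseen (state, action) pairs default to a value of 0.0.
Q = defaultdict(float)
```

A learner would call `q_learning_update` once per observed transition (s, a, r, s'), so the table is refined incrementally as experience arrives.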
[Figure: plot of games won against blocks of training (250 games per block).]
[Figure: plot of games won against blocks of training (250 games per block).]
[Figure: plot of games won against blocks of training (250 games per block). Series: PMH, PM, PH, P, PMH-0, PM-0, PH-0, P-0.]