CSC413/2516 Lecture 11: Q-Learning & the Game of Go
Jimmy Ba
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 1 / 29
CSC413/2516 Lecture 11: Q-Learning & the Game of Go Jimmy Ba - - PowerPoint PPT Presentation
CSC413/2516 Lecture 11: Q-Learning & the Game of Go Jimmy Ba Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 1 / 29 Overview Second lecture on reinforcement learning Optimize a policy directly, dont represent
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 1 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 2 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 3 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 4 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 5 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 6 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 7 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 7 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 8 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 9 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 10 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 11 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 12 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 13 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 14 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 15 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 16 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 17 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 18 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 19 / 29
https://www.cs.cmu.edu/~adamchik/15-121/lectures/Game%20Trees/Game%20Trees.html Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 20 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 21 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 22 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 23 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 23 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 24 / 29
Silver et al., 2016
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 25 / 29
Silver et al., 2016 Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 26 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 27 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 28 / 29
Jimmy Ba CSC413/2516 Lecture 11: Q-Learning & the Game of Go 29 / 29