What Can Learned Intrinsic Rewards Capture?
zeyu@umich.edu junhyuk@google.com
Zeyu Zheng*, Junhyuk Oh*, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
What Can Learned Intrinsic Rewards Capture? Zeyu Zheng*, Junhyuk - - PowerPoint PPT Presentation
What Can Learned Intrinsic Rewards Capture? Zeyu Zheng*, Junhyuk Oh*, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh zeyu@umich.edu junhyuk@google.com Motivation: Loci of Knowledge in RL Common
zeyu@umich.edu junhyuk@google.com
Zeyu Zheng*, Junhyuk Oh*, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh
Episode 1 Episode 2
Episode 1 Episode 2
Intrinsic Reward
Episode 1 Episode 2
Intrinsic Reward
Episode 1 Episode 2
Intrinsic Reward
Episode 1 Episode 2
Intrinsic Reward
Episode 1 Episode 2
Intrinsic Reward
Inner loop
Inner loop Outer loop
Inner loop Outer loop
Inner loop Outer loop
Agent
Agent Goal
Good or bad Bad Mildly good
Change Change
Change Change