Lecture 2: Infinite Horizon and Indefinite Horizon MDPs
B9140 Dynamic Programming & Rienforcement Learning. – Prof. Daniel Russo
Last time:
- RL overview and motivation
- Finite Horizon MDPs: formulation and the DP algorithm
Today:
- Infinite horizon discounted MDPs
- Basic theory of Bellman operators; contraction mappings; existence of
- ptimal policies;
- Analogous theory for indefinite horizon (episodic) MDPs.