Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes

Alekh Agarwal (Microsoft Research Redmond) https://simons.berkeley.edu/talks/tba-83 Emerging Challenges in Deep Learning