This video explains the proof of Policy Gradient Methods and explains about REINFORCE Algorithm To follow along with the course schedule and syllabus, visit: https://chandar-lab.github.io/INF8953... Sarath Chandar Assistant Professor @ École Polytechnique de Montréal, Core faculty member @ Mila - The Quebec AI Institute http://sarathchandar.in/ Happy Learning