DRL Lecture 1: Policy Gradient (Review)