Policy Gradients Are Easy In Keras | Deep Reinforcement Learning Tutorial

Today you’re going to learn how to code a policy gradient agent in the Keras framework. As a bonus, you’ll get to see how to use custom loss functions. The policy gradient algorithm, REINFORCE specifically, is a Monte Carlo reinforcement learning method that approximates the optimal policy for the reinforcement learning agent. It works by shifting the policy, a probability distribution for action selection, in the direction of the actions that produce the largest advantage. Here advantage is defined as t

11 views