REINFORCE Algorithm

An approach to computing gradients where stochastic nodes are involved. Originally introduced in the context of policy-based Reinforcement learning (RL).