REINFORCE Algorithm
An approach to computing gradients where stochastic nodes are involved. Originally introduced in the context of policy-based Reinforcement learning (RL).
An approach to computing gradients where stochastic nodes are involved. Originally introduced in the context of policy-based Reinforcement learning (RL).