-
Intro to Vanilla Policy Gradient, continued...
Extending the Theory and intuition behind one of our introductory algorithms
-
Intro to Vanilla Policy Gradient
Theory and intuition behind one of our introductory algorithms
-
The Reinforcement Learning Problem
Before anything else, define the problem you need to solve.
-
Some Acme Speedbumps
Messing around with parallelization.
-
CoachRL Back End
Digging in to dirty details