Now we'll see how to use Meta-SGD in reinforcement learning. Meta-SGD is compatible with any RL algorithm that can be trained with gradient descent.
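To make the gradient-descent requirement concrete, here is a minimal sketch of a single Meta-SGD adaptation step, $\theta' = \theta - \alpha \circ \nabla_\theta L(\theta)$, applied to a policy-gradient loss. The two-armed softmax bandit policy, the function names, and the initialization range for `alpha` are assumptions chosen purely for illustration; they are not taken from the text.

```python
import numpy as np

def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()

def reinforce_loss_grad(theta, action, reward):
    """Gradient of the loss -reward * log pi(action) w.r.t. the logits theta.

    Hypothetical toy policy: pi = softmax(theta) over discrete actions.
    """
    pi = softmax(theta)
    one_hot = np.zeros_like(theta)
    one_hot[action] = 1.0
    # d/dtheta log pi(action) = one_hot - pi for a softmax policy
    return -reward * (one_hot - pi)

def meta_sgd_inner_step(theta, alpha, grad):
    """Meta-SGD update: alpha has the same shape as theta and scales
    each gradient component element-wise."""
    return theta - alpha * grad

rng = np.random.default_rng(0)
theta = rng.normal(size=2)                        # policy parameters
alpha = rng.uniform(0.01, 0.1, size=theta.shape)  # per-parameter step sizes (illustrative range)

# One sampled transition: action 1 was taken and received reward 1.0
grad = reinforce_loss_grad(theta, action=1, reward=1.0)
theta_adapted = meta_sgd_inner_step(theta, alpha, grad)
```

Because the rewarded action was action 1, the adapted parameters assign it a higher probability than the original ones. In full Meta-SGD, both `theta` and `alpha` would then be updated by an outer meta-optimization step, which this sketch leaves out.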
- Let's say we have a model $f$ parameterized by $\theta$, and a distribution over tasks $p(T)$. First, we randomly initialize the model parameter $\theta$, and we randomly ...