Now let's see how gradient agreement works step by step:
- Let's say we have a model parameterized by a parameter and a distribution over tasks . First, we randomly initialize the model parameter .
- We sample some batch of tasks from a distribution ...