MAML理解

line6，each $\theta_i$ is the one-step updated parameter for each task, which would not be used as parameters for testing. Instead, it is used to update $\theta$ , which is the final initilization that we wish to get.

This way, it is asctually minimizing the average loss across different tasks after one-step update. In other word, MAML could find the initilization parameters that is optimal for training with only one step update.

Line 4-8 could be expressed equivalently as follow:

$\theta=\theta-\beta\nabla_{\theta}L(\theta) \\L(\theta)=\sum_{i=1}^{N}l_i(\theta_i') \\\theta_i'=\theta-\alpha\nabla_{\theta}l(\theta)$

2021-05-31
2021-04-28
2021-07-20
2021-11-09
2022-01-14
2021-04-05
2022-12-23
2021-05-15