MAML理解

line6,each θi\theta_i is the one-step updated parameter for each task, which would not be used as parameters for testing. Instead, it is used to update θ\theta, which is the final initilization that we wish to get.

This way, it is asctually minimizing the average loss across different tasks after one-step update. In other word, MAML could find the initilization parameters that is optimal for training with only one step update.

Line 4-8 could be expressed equivalently as follow:

θ=θβθL(θ)L(θ)=i=1Nli(θi)θi=θαθl(θ)\theta=\theta-\beta\nabla_{\theta}L(\theta) \\L(\theta)=\sum_{i=1}^{N}l_i(\theta_i') \\\theta_i'=\theta-\alpha\nabla_{\theta}l(\theta)

相关文章:

  • 2021-05-31
  • 2021-04-28
  • 2021-07-20
  • 2021-11-09
  • 2022-01-14
  • 2021-04-05
  • 2022-12-23
  • 2021-05-15
猜你喜欢
  • 2021-05-27
  • 2022-12-23
  • 2021-05-05
  • 2021-08-04
  • 2021-08-13
  • 2022-01-10
相关资源
相似解决方案