1.思想

好的学习率,那么loss应该要下降的很快,那么可以绘制处学习率和loss的函数。

例如:
pytorch 如何选择合适的学习率(翻译)

此时,该如何选学习率呢?选10^-2可以使训练的比较快,并且不会使梯度爆炸,
10^-1可能已经太大了。

2.算法

假设初始lr为10^-8,最大为10,共测试N step,那么我们可以记录每经过一个step的lr和loss,其中,每次lr增加q:
pytorch 如何选择合适的学习率(翻译)

loss,把它平滑一下:
pytorch 如何选择合适的学习率(翻译)

pytorch 如何选择合适的学习率(翻译)
实现代码如下:

        #Compute the smoothed loss
        avg_loss = beta * avg_loss + (1-beta) *loss.data[0]
        smoothed_loss = avg_loss / (1 - beta**batch_num)

3.整体代码

def find_lr(init_value = 1e-8, final_value=10., beta = 0.98):
    num = len(trn_loader)-1
    mult = (final_value / init_value) ** (1/num)
    lr = init_value
    optimizer.param_groups[0]['lr'] = lr
    avg_loss = 0.
    best_loss = 0.
    batch_num = 0
    losses = []
    log_lrs = []
    for data in trn_loader:
        batch_num += 1
        #As before, get the loss for this mini-batch of inputs/outputs
        inputs,labels = data
        inputs, labels = Variable(inputs), Variable(labels)
        optimizer.zero_grad()
        outputs = net(inputs)
        loss = criterion(outputs, labels)
        #Compute the smoothed loss
        avg_loss = beta * avg_loss + (1-beta) *loss.data[0]
        smoothed_loss = avg_loss / (1 - beta**batch_num)
        #Stop if the loss is exploding
        if batch_num > 1 and smoothed_loss > 4 * best_loss:
            return log_lrs, losses
        #Record the best loss
        if smoothed_loss < best_loss or batch_num==1:
            best_loss = smoothed_loss
        #Store the values
        losses.append(smoothed_loss)
        log_lrs.append(math.log10(lr))
        #Do the SGD step
        loss.backward()
        optimizer.step()
        #Update the lr for the next step
        lr *= mult
        optimizer.param_groups[0]['lr'] = lr
    return log_lrs, losses

参考:
原文:https://sgugger.github.io/how-do-you-find-a-good-learning-rate.html

相关文章:

  • 2021-12-27
  • 2021-04-01
  • 2021-11-09
  • 2022-01-15
  • 2021-11-01
  • 2021-12-18
  • 2022-12-23
  • 2021-12-05
猜你喜欢
  • 2022-01-11
  • 2022-12-23
  • 2021-11-18
  • 2021-12-18
  • 2022-01-22
  • 2021-09-03
  • 2022-12-23
相关资源
相似解决方案