【问题标题】:Retrain mxnet model in R在 R 中重新训练 mxnet 模型
【发布时间】:2017-11-16 10:18:17
【问题描述】:

我用 mxnet 创建了一个神经网络。现在我想在新数据点上迭代训练这个模型。在我模拟了一个新的数据点之后,我想在这个模型上做一个新的梯度下降更新。我不想将模型保存到外部文件并再次加载。

我编写了以下代码,但在新的训练步骤后权重不会改变。我还收到NaN 作为训练错误。

library(mxnet)
data <- mx.symbol.Variable("data")
fc1 <- mx.symbol.FullyConnected(data, num_hidden = 2, no.bias = TRUE)
lro <- mx.symbol.LinearRegressionOutput(fc1)

# first data observation
train.x = matrix(0, ncol = 3)
train.y = matrix(0, nrow = 2)

# first training step
model = mx.model.FeedForward.create(lro,
  X = train.x, y = train.y, initializer = mx.init.uniform(0.001),
  num.round = 1, array.batch.size = 1, array.layout = "rowmajor",
  learning.rate = 0.1, eval.metric = mx.metric.mae)
print(model$arg.params)

# second data observation
train.x = matrix(0, ncol = 3)
train.x[1] = 1
train.y = matrix(0, nrow = 2)
train.y[1] = -33

# retrain model on new data
# pass on params of old model
model = mx.model.FeedForward.create(symbol = model$symbol,
  arg.params = model$arg.params, aux.params = model$aux.params,
  X = train.x, y = train.y, num.round = 1,
  array.batch.size = 1, array.layout = "rowmajor",
  learning.rate = 0.1, eval.metric = mx.metric.mae)
# weights do not change
print(model$arg.params)

【问题讨论】:

    标签: r mxnet


    【解决方案1】:

    我找到了解决方案。第二步训练的begin.round必须大于第一步训练的num.round,这样模型才能继续训练。

    library(mxnet)
    data <- mx.symbol.Variable("data")
    fc1 <- mx.symbol.FullyConnected(data, num_hidden = 2, no.bias = TRUE)
    lro <- mx.symbol.LinearRegressionOutput(fc1)
    
    # first data observation
    train.x = matrix(0, ncol = 3)
    train.y = matrix(0, nrow = 2)
    
    # first training step
    model = mx.model.FeedForward.create(lro,
      X = train.x, y = train.y, initializer = mx.init.uniform(0.001),
      num.round = 1, array.batch.size = 1, array.layout = "rowmajor",
      learning.rate = 0.1, eval.metric = mx.metric.mae)
    print(model$arg.params)
    
    # second data observation
    train.x = matrix(0, ncol = 3)
    train.x[1] = 1
    train.y = matrix(0, nrow = 2)
    train.y[1] = -33
    
    # retrain model on new data
    # pass on params of old model
    model = mx.model.FeedForward.create(symbol = model$symbol,
      arg.params = model$arg.params, aux.params = model$aux.params,
      X = train.x, y = train.y, begin.round = 2, num.round = 3,
      array.batch.size = 1, array.layout = "rowmajor",
      learning.rate = 0.1, eval.metric = mx.metric.mae)
    
    print(model$arg.params)
    

    【讨论】:

      【解决方案2】:

      您是否尝试只调用一次mx.model.FeedForward.create,然后使用fit 函数进行增量训练?

      【讨论】:

      • 我在mxnet 中找不到合适的函数。您使用哪个软件包版本?你能提供一些示例代码吗
      • 感谢您以这种方式重新训练 mxnet。
      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 2018-10-29
      • 1970-01-01
      • 2022-01-01
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      相关资源
      最近更新 更多