【发布时间】:2019-09-29 23:16:45
【问题描述】:
我目前正在 Tensorflow 中设计一个 NoisyNet,我需要为此定义一个自定义层。复制包含该自定义层的模型时,python 会引发错误 ValueError: Unknown layer: NoisyLayer。层的实现提供here。
目标是复制一个网络,创建它的第二个实例。为此,我使用命令net_copy = copy.deepcopy(net_original),只要我不在要复制的模型中包含上面提到的自定义层,它就可以工作。
我看到对于保存和加载,存在一种指定自定义属性(例如自定义图层)的方法,但是我找不到适用于 copy.deepcopy() 的类似命令,其中副本是通过 import copy 导入的。
我在 Python3 中使用 Tensorflow 1.12.0。
同样,自定义层在上面的链接下提供。 使用自定义层的网络如下所示:
class Network:
def __init__(self, actionspace_size, learning_rate, gradient_momentum, gradient_min):
frames_input = keras.layers.Input((84, 84, 4))
actions_input = keras.layers.Input((actionspace_size,))
conv1 = keras.layers.Conv2D(16, (8, 8), strides=(4, 4), activation="relu")(frames_input)
conv2 = keras.layers.Conv2D(32, (4, 4), strides=(2, 2), activation="relu")(conv1)
flattened = keras.layers.Flatten()(conv2)
# NoisyNet
hidden = NoisyLayer(activation=tf.nn.relu)(inputs=flattened, resample_noise_flag=True)
output = NoisyLayer(in_shape=(1,256), out_units=actionspace_size)(inputs=hidden, resample_noise_flag=True)
filtered_output = keras.layers.merge.Multiply()([output, actions_input])
self.model = keras.models.Model(inputs=[frames_input, actions_input], outputs=filtered_output)
self.model.compile(loss='mse', optimizer=keras.optimizers.RMSprop(lr=learning_rate, rho=gradient_momentum, epsilon=gradient_min))
打电话时
q_net = Network(actionspace_size, learning_rate, gradient_momentum, gradient_min).
target_net = copy.deepcopy(q_net)
出现以下错误:
Traceback (most recent call last):
File "DQN_tf_NoisyNet.py", line 315, in <module>
main()
File "DQN_tf_NoisyNet.py", line 252, in main
target_net = copy.deepcopy(q_net)
File "/usr/lib/python3.5/copy.py", line 182, in deepcopy
y = _reconstruct(x, rv, 1, memo)
File "/usr/lib/python3.5/copy.py", line 299, in _reconstruct
y.__setstate__(state)
File "/usr/local/lib/python3.5/dist-packages/keras/engine/network.py", line 1266, in __setstate__
model = saving.unpickle_model(state)
File "/usr/local/lib/python3.5/dist-packages/keras/engine/saving.py", line 435, in unpickle_model
return _deserialize_model(f)
File "/usr/local/lib/python3.5/dist-packages/keras/engine/saving.py", line 225, in _deserialize_model
model = model_from_config(model_config, custom_objects=custom_objects)
File "/usr/local/lib/python3.5/dist-packages/keras/engine/saving.py", line 458, in model_from_config
return deserialize(config, custom_objects=custom_objects)
File "/usr/local/lib/python3.5/dist-packages/keras/layers/__init__.py", line 55, in deserialize
printable_module_name='layer')
File "/usr/local/lib/python3.5/dist-packages/keras/utils/generic_utils.py", line 145, in deserialize_keras_object
list(custom_objects.items())))
File "/usr/local/lib/python3.5/dist-packages/keras/engine/network.py", line 1022, in from_config
process_layer(layer_data)
File "/usr/local/lib/python3.5/dist-packages/keras/engine/network.py", line 1008, in process_layer
custom_objects=custom_objects)
File "/usr/local/lib/python3.5/dist-packages/keras/layers/__init__.py", line 55, in deserialize
printable_module_name='layer')
File "/usr/local/lib/python3.5/dist-packages/keras/utils/generic_utils.py", line 138, in deserialize_keras_object
': ' + class_name)
ValueError: Unknown layer: NoisyLayer
我知道网络本身不是问题(deepcopy 方法也不是),因为只要我用标准密集层替换 NoisyLayers(自定义),两者都会再次正常工作。
有人知道如何复制包含自定义层的 Tensorflow 模型吗?提前致谢!
【问题讨论】:
-
是否有另一种方法可以绕过制作整个网络的副本,而只是在等效网络架构之间复制单个权重和偏差?
标签: python python-3.x tensorflow valueerror