Tensorflow 错误：广播的不兼容形状答案

【问题标题】：Tensorflow Error: Incompatible Shapes for BroadcastingTensorflow 错误：广播的不兼容形状
【发布时间】：2016-07-13 02:21:19
【问题描述】：

我目前正在 Tensorflow 中开发一个程序，该程序可以读取 1750 x 1750 像素的数据。我通过卷积网络运行它：

import os
import sys

import tensorflow as tf
import Input

FLAGS = tf.app.flags.FLAGS

tf.app.flags.DEFINE_integer('batch_size', 100, "hello")
tf.app.flags.DEFINE_string('data_dir',     '/Volumes/Machine_Learning_Data',  "hello")

def inputs():
  if not FLAGS.data_dir:
    raise ValueError('Please supply a data_dir')
  data_dir = os.path.join(FLAGS.data_dir, 'Data')
  images, labels = Input.inputs(data_dir = data_dir, batch_size =     FLAGS.batch_size)
  return images, labels

def weight_variable(shape):
    initial = tf.truncated_normal(shape, stddev=0.1)
    return tf.Variable(initial)

def bias_variable(shape):
    initial = tf.constant(0.1, shape = shape)
    return tf.Variable(initial)

def conv2d(images, W):
    return tf.nn.conv2d(images, W, strides = [1, 1, 1, 1], padding =     'SAME')

def max_pool_5x5(images):
    return tf.nn.max_pool(images, ksize = [1, 5, 5, 1], strides = [1, 1, 1, 1], padding = 'SAME')

def forward_propagation(images):
  with tf.variable_scope('conv1') as scope:
      W_conv1 = weight_variable([5, 5, 1, 32])
      b_conv1 = bias_variable([32])
      image_matrix = tf.reshape(images, [-1, 1750, 1750, 1])
      h_conv1 = tf.nn.sigmoid(conv2d(image_matrix, W_conv1) + b_conv1)
      h_pool1 = max_pool_5x5(h_conv1)

  with tf.variable_scope('conv2') as scope:
      W_conv2 = weight_variable([5, 5, 32, 64])
      b_conv2 = bias_variable([64])
      h_conv2 = tf.nn.sigmoid(conv2d(h_pool1, W_conv2) + b_conv2)
      h_pool2 = max_pool_5x5(h_conv2)

  with tf.variable_scope('conv3') as scope:
      W_conv3 = weight_variable([5, 5, 64, 128])
      b_conv3 = bias_variable([128])
      h_conv3 = tf.nn.sigmoid(conv2d(h_pool2, W_conv3) + b_conv3)
      h_pool3 = max_pool_5x5(h_conv3)

  with tf.variable_scope('local3') as scope:
      W_fc1 = weight_variable([10 * 10 * 128, 256])
      b_fc1 = bias_variable([256])
      h_pool3_flat = tf.reshape(h_pool3, [-1, 10 * 10 * 128])
      h_fc1 = tf.nn.sigmoid(tf.matmul(h_pool3_flat, W_fc1) + b_fc1)
      keep_prob = tf.placeholder(tf.float32)
      h_fc1_drop = tf.nn.dropout(h_fc1, keep_prob)
      W_fc2 = weight_variable([256, 4])
      b_fc2 = bias_variable([4])

      y_conv = tf.nn.softmax(tf.matmul(h_fc1_drop, W_fc2) + b_fc2)
      return y_conv

def error(forward_propagation_results, labels):
    labels = tf.cast(labels, tf.float32)
    mean_squared_error = tf.square(tf.sub(labels, forward_propagation_results))
    cost = tf.reduce_mean(mean_squared_error)
    train = tf.train.GradientDescentOptimizer(learning_rate = 0.3).minimize(cost)
    return train

print cost

不幸的是，出现了一个错误

广播不兼容的形状：TensorShape([Dimension(100)]) 和 TensorShape([Dimension(9187500), Dimension(4)])

我无法调试这个。

矩阵维度有什么问题？解释器说错误发生在 tf.sub 行。

编辑：

这是调用函数的代码的主要部分。

import Input
import Process

import tensorflow as tf


def train():
    with tf.Session() as sess:
        images, labels = Process.inputs()

        forward_propgation_results = Process.forward_propagation(images)

        train_loss = Process.error(forward_propgation_results, labels)

        init = tf.initialize_all_variables()

        sess.run(init)

def main(argv = None):
    train()

if __name__ == '__main__':
  tf.app.run()

【问题讨论】：

很难判断发生了什么，因为您似乎错过了调用error() 的脚本部分。
谢谢！我会尽快添加的

标签： python machine-learning neural-network tensorflow

【解决方案1】：

我发现了以下问题：

您的labels 输入是一个简单的一维标签标识符数组，但它需要被单热编码为大小为[batch_size, 4] 的矩阵，其中填充有 1 或 0。
您的最大池化操作需要具有不同于 1 的步幅才能真正减小图像的宽度和高度。所以设置strides=[1, 5, 5, 1] 应该可以工作。
修复该问题后，您的最大池操作实际上并没有像您假设的那样将宽度/高度从 1750 降低到 10，而只是降低到 14（因为 1750 / 5 / 5 / 5 == 14）。所以你可能想在这里增加你的权重矩阵，但也有其他选择。
您的图像是否可能从 3 个通道开始？您在这里假设灰度，因此您应该将image_matrix 重塑为具有 3 个通道，或者将图像转换为灰度。

应用这些修复后，网络输出和标签都应该具有[batch_size, 4] 的形状，并且您应该能够计算出差异。

编辑：我在下面的聊天中讨论了代码后对此进行了调整。

【讨论】：

您能否详细说明第一点？我应该如何实现将数组转换为矩阵？我还查看了 CIFAR-10 Tensorflow 代码，它们的实现使用了我正在使用的一维数组。
他们使用一维数组，因为they're using 函数tf.softmax_cross_entropy_with_logits() 来计算损失函数，此处记录：github.com/tensorflow/tensorflow/blob/r0.9/tensorflow/models/… 如果您使用该函数，那么您不需要执行一次热编码，因为它已经处理好了。
我不认为 tensorflow 中的 MSE 损失可以为您做同样的事情，因此您可能必须使用 tf.one_hot 函数自己执行 one-hot 编码：tensorflow.org/versions/master/api_docs/python/…
“VALID”填充和“SAME”填充有什么区别？在其中一个 Deep MNIST 示例中，他们使用了 SAME 填充。
你是说这个例子吗？ tensorflow.org/versions/r0.9/tutorials/mnist/pros/… 请注意，它们使用的步幅与 1 不同。此处解释了 SAME 和 VALID：tensorflow.org/versions/r0.9/api_docs/python/…

【解决方案2】：

One_hot 标签为其输入添加维度。例如，如果 labels 张量大小为 [batch,1]，则使用 tf.one_hot(batch_labels, depth=2, axis=-1) 返回 [batch,1,2] 维度张量。对于 labels 张量的大小 [batch_size,1] 的情况，以下脚本可以成为摆脱额外维度的解决方案：

tf.one_hot(tf.squeeze(batch_labels,[1]), depth=2, axis=-1)

基本上labels 张量的大小必须为 [batch_size,]。 tf.squeeze() 函数，消除特定维度。 [1] 参数，提示函数消除第二维1。

【讨论】：