Tf.keras model.predict() 返回大于 1 的类概率？答案

【问题标题】：Tf.keras model.predict() returns class probabilities that are higher than 1?Tf.keras model.predict() 返回大于 1 的类概率？
【发布时间】：2020-05-28 02:03:52
【问题描述】：

我正在尝试在 CNN 上调用 tf.keras 中的 model.predict() 来预测单个图像的类别。出于某种原因，类概率返回高于 1，这是荒谬的。我不确定为什么会发生这种情况。以下是我训练 CNN 的方法：

class_names = ['Angry','Disgust','Fear','Happy','Sad','Surprise','Neutral']
model = models.Sequential()
model.add(layers.Conv2D(64, (3, 3), activation='relu', input_shape=(48, 48, 1), kernel_regularizer=tf.keras.regularizers.l1(0.01)))
model.add(layers.Conv2D(128, (3, 3), padding='same', activation='relu'))
model.add(layers.MaxPooling2D((2, 2)))
model.add(tf.keras.layers.Dropout(0.5))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))
model.add(layers.MaxPooling2D((2, 2)))
model.add(tf.keras.layers.Dropout(0.5))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))


model.summary()

model.add(layers.Flatten())
model.add(layers.Dense(64, activation='relu'))
model.add(layers.Dense(7))


#model.summary()
model.compile(optimizer='adam',loss=tf.keras.losses.CategoricalCrossentropy(from_logits=True),metrics=['accuracy'])
lr_reducer = tf.keras.callbacks.ReduceLROnPlateau(monitor='val_loss', factor=0.9, patience=3) #monitors the validation loss for signs of a plateau and then alter the learning rate by the specified factor if a plateau is detected

early_stopper = tf.keras.callbacks.EarlyStopping(monitor='val_accuracy', min_delta=0, patience=6, mode='auto')  #This will monitor and stop the model training if it is not further converging

checkpointer = tf.keras.callbacks.ModelCheckpoint('C:\\Users\\rtlum\\Documents\\DataSci_Projects\\PythonTensorFlowProjects\\Datasets\\FER2013_Model_Weights\\Model\\weights.hd5', monitor='val_loss', verbose=1, save_best_only=True) #This allows checkpoints to be saved each epoch just in case the model stops training

epochs = 100
batch_size = 64
learning_rate = 0.001

model.fit(
          train_data,
          train_labels,
          epochs = epochs,
          batch_size = batch_size,
          validation_split = 0.2,
          shuffle = True,
          callbacks=[lr_reducer, checkpointer, early_stopper]
          )

以下是我如何调用 model.predict() 并传入单个图像进行预测：

    model = tf.keras.models.load_model('Model\\weights.hd5')
    img = Image.open(test_image).convert('L')
    img = img.resize([48, 48])
    image_data = np.asarray(img, dtype=np.uint8)
    #image_data = np.resize(img,3072)
    image_data = image_data / 255
    image_data_test = image_data.reshape((1, 48, 48, 1))
    class_names = ['Angry','Disgust','Fear','Happy','Sad','Surprise','Neutral']
    x = model.predict(image_data_test)
    app.logger.info(x)
    image_pred = np.argmax(x)
    y = round(x[0][np.argmax(x)], 2)
    confidence = y * 100
    print(class_names[image_pred], confidence)

最后，下面是我从 model.predict() 收到的类概率：

>>> x = model.predict(image_data_test)
>>> x
array([[ 1.0593076 , -3.5140653 ,  0.7505076 ,  2.1341033 ,  0.02394461,
        -0.08749148,  0.6640976 ]], dtype=float32)

【问题讨论】：

回答没有帮助？

标签： python tensorflow machine-learning keras conv-neural-network

【解决方案1】：

您的最后一层model.add(layers.Dense(7)) 正在使用线性激活函数。要获得 7 个类别的概率，您应该使用softmax 激活。

将最后一层更改为

model.add(layers.Dense(7 , activation='softmax'))

【讨论】：

这解决了问题。感谢您的帮助~

【解决方案2】：

添加一个激活层以将您的输出值转换为 [0,1] 的值

【讨论】：

这并没有提供问题的答案；究竟是什么激活？