如何获得模型预测的最后一个索引？答案

【问题标题】：How to get the last index of model's prediction?如何获得模型预测的最后一个索引？
【发布时间】：2021-04-01 06:38:56
【问题描述】：

我是 PyTorch 的新手。我有一个变量 pred，它有一个张量列表。

print(pred)
output: [tensor([[176.64380, 193.86154, 273.84702, 306.30405, 0.83492, 2.00000]])]

所以我想访问最后一个元素，即类。我首先将列表转换为张量。

x = torch.stack(pred)
output: tensor([[[176.64380, 193.86154, 273.84702, 306.30405, 0.83492, 2.00000]]])

现在，我如何访问最后一个元素，或者有什么更好/更有效的方法吗？

编辑：为了进一步参考，这里是执行分类任务的代码。

def classify_face(image):
    device = torch.device("cpu")
    img = process_image(image)
    print('Image processed')
    # img = image.unsqueeze_(0)
    # img = image.float()
    
    pred = model(img)[0]
    
    # Apply NMS
    pred = non_max_suppression(pred, 0.4, 0.5, classes = [0, 1, 2], agnostic = None )
    if classify:
        pred = apply_classifier(pred, modelc, img, im0s)
    #print(pred)
    
    model.eval()
    model.cpu()
    
    print(pred)
        
    # output = non_max_suppression(output, 0.4, 0.5, classes = class_names, agnostic = False)
        
    #_, predicted = torch.max(output[0], 1)
    #print(predicted.data[0], "predicted")

    classification = torch.cat(pred)[:, -1]
    index = int(classification)
    print(names[index])
    return names[index]

在预测期间 pred 由x1、y1、x2、y2、conf 和 class 组成。

例如pred = [tensor([[176.64380, 193.86154, 273.84702, 306.30405, 0.83492, 2.00000]])]

如果模型没有做出预测，那么pred 就是空的。

例如pred = [tensor([], size=(0, 6))]

目前，如果我的程序收到一个空张量并抛出错误，它就会停止预测：

Traceback (most recent call last):
  File "WEBCAM_DETECT.py", line 168, in <module>
    label = classify_face(frame)
  File "WEBCAM_DETECT.py", line 150, in classify_face
    index = int(classification)
ValueError: only one element tensors can be converted to Python scalars

编辑1：当我检查 pred 的长度时它似乎有效，但是当张量中有两行或更多行时出现此错误。

[tensor([[212.38568, 117.47020, 339.35773, 266.00513,   0.74144,   2.00000],
        [214.60651, 118.50694, 339.90192, 265.91696,   0.44277,   0.00000]])]
#################
#################
Traceback (most recent call last):
  File "WEBCAM_DETECT.py", line 172, in <module>
    label = classify_face(frame)
  File "WEBCAM_DETECT.py", line 154, in classify_face
    index = int(classification)
ValueError: only one element tensors can be converted to Python scalars

如果在某一帧没有做出预测，我如何让我的程序忽略并继续下一帧？

【问题讨论】：

标签： python numpy pytorch tensor

【解决方案1】：

您可以选择第 3 轴上带有索引符号的最后一个元素，然后广播到一维张量：

x[:, :, -1].view(-1)

但是，我宁愿在pred 上使用torch.cat，这样可以避免创建新轴：

torch.cat(pred)[:, -1]

编辑 - 你可以预先检查张量是否为空：

if len(pred) == 0:
    return None

【讨论】：

好的，谢谢。我同意第二个建议。但是在预测过程中，一些帧只是空的，即没有什么可以预测返回空张量的。在这种情况下，我收到一个错误：ValueError: only one element tensors can be converted to Python scalars 当我将带有最后一个元素的张量转换为整数时。我该如何克服这个问题？？
或者有没有其他方法可以在不使用 int() 的情况下获取元素？？
我明白了，您能否举一个带有多个预测的起始 pred 张量的示例，其中一个是空的，好吗？在这种情况下，我不确定我是否看到您如何使用cat...
嗨伊万。我已对帖子进行了编辑，请查看。
但这不会退出函数吗？