【问题标题】:OpenCV feature matching multiple objectsOpenCV 特征匹配多个对象
【发布时间】:2017-08-13 18:29:28
【问题描述】:

如何在一张图片上找到一种类型的多个对象。 我使用 ORB 特征查找器和蛮力匹配器 (opencv = 3.2.0)。

我的源代码:

import numpy as np
import cv2
from matplotlib import pyplot as plt

MIN_MATCH_COUNT = 10

img1 = cv2.imread('box.png', 0)  # queryImage
img2 = cv2.imread('box1.png', 0) # trainImage

#img2 = cv2.cvtColor(img1, cv2.COLOR_BGR2GRAY)

# Initiate ORB detector
# 
orb = cv2.ORB_create(10000, 1.2, nlevels=9, edgeThreshold = 4)
#orb = cv2.ORB_create()

# find the keypoints and descriptors with SIFT
kp1, des1 = orb.detectAndCompute(img1, None)
kp2, des2 = orb.detectAndCompute(img2, None)

FLANN_INDEX_KDTREE = 0
index_params = dict(algorithm = FLANN_INDEX_KDTREE, trees = 5)
search_params = dict(checks = 50)

flann = cv2.FlannBasedMatcher(index_params, search_params)

des1 = np.float32(des1)
des2 = np.float32(des2)

# matches = flann.knnMatch(des1, des2, 2)

bf = cv2.BFMatcher()
matches = bf.knnMatch(des1, des2, k=2)

# store all the good matches as per Lowe's ratio test.
good = []
for m,n in matches:
    if m.distance < 0.7*n.distance:
        good.append(m)

if len(good)>3:
    src_pts = np.float32([ kp1[m.queryIdx].pt for m in good ]).reshape(-1,1,2)
    dst_pts = np.float32([ kp2[m.trainIdx].pt for m in good ]).reshape(-1,1,2)

    M, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 2)

    if M is None:
        print ("No Homography")
    else:
        matchesMask = mask.ravel().tolist()

        h,w = img1.shape
        pts = np.float32([ [0,0],[0,h-1],[w-1,h-1],[w-1,0] ]).reshape(-1,1,2)
        dst = cv2.perspectiveTransform(pts,M)

        img2 = cv2.polylines(img2,[np.int32(dst)],True,255,3, cv2.LINE_AA)

else:
    print ("Not enough matches are found - %d/%d" % (len(good),MIN_MATCH_COUNT))
    matchesMask = None

draw_params = dict(matchColor = (0,255,0), # draw matches in green color
                   singlePointColor = None,
                   matchesMask = matchesMask, # draw only inliers
                   flags = 2)

img3 = cv2.drawMatches(img1,kp1,img2,kp2,good,None,**draw_params)

plt.imshow(img3, 'gray'),plt.show()

但它只能找到查询图像的一个实例。

查询图片

测试图片

结果

所以它只找到了两张图片中的一张。 我做错了什么?

【问题讨论】:

  • 找到第一个对象,计算变换,找到对象的遮罩区域,重复直到得到所有对象。
  • @m3h0w 谢谢!
  • @m3h0w,可能是: 1. 计算特征 2. 计算变换 3. 找到第一个对象 4. 遮罩区域重复直到得到所有对象
  • 目前没有时间阅读文档,但我认为匹配算法正在寻找最佳匹配对象而不是多个对象是一个公平的假设。
  • @V.Gai 你也可以在文献中查看处理这种情况的常用方法是什么(严格的特征匹配处理匹配描述符,没有对象假设)。 Lowe 的 SIFT 论文提出了一种基于 Hough 投票的方法。我最近发现了这篇论文:MAC-RANSAC: a robust algorithm for the recognition of multiple objects,但它应该存在很多其他的。

标签: opencv image-processing computer-vision orb


【解决方案1】:

为了解决这个任务可以使用下一个方法:

  1. SIFT (SURF) + MEAN SHIFT
  2. Haar Cascades
  3. HOG + SVM

【讨论】:

    【解决方案2】:

    我使用 ORB 描述符查找多个对象的来源

    import cv2
    from matplotlib import pyplot as plt
    
    MIN_MATCH_COUNT = 10
    
    img1 = cv2.imread('box.png', 0)  # queryImage
    img2 = cv2.imread('box1.png', 0) # trainImage
    
    orb = cv2.ORB_create(10000, 1.2, nlevels=8, edgeThreshold = 5)
    
    # find the keypoints and descriptors with ORB
    kp1, des1 = orb.detectAndCompute(img1, None)
    kp2, des2 = orb.detectAndCompute(img2, None)
    
    import numpy as np
    from sklearn.cluster import MeanShift, estimate_bandwidth
    
    x = np.array([kp2[0].pt])
    
    for i in xrange(len(kp2)):
        x = np.append(x, [kp2[i].pt], axis=0)
    
    x = x[1:len(x)]
    
    bandwidth = estimate_bandwidth(x, quantile=0.1, n_samples=500)
    
    ms = MeanShift(bandwidth=bandwidth, bin_seeding=True, cluster_all=True)
    ms.fit(x)
    labels = ms.labels_
    cluster_centers = ms.cluster_centers_
    
    labels_unique = np.unique(labels)
    n_clusters_ = len(labels_unique)
    print("number of estimated clusters : %d" % n_clusters_)
    
    s = [None] * n_clusters_
    for i in xrange(n_clusters_):
        l = ms.labels_
        d, = np.where(l == i)
        print(d.__len__())
        s[i] = list(kp2[xx] for xx in d)
    
    des2_ = des2
    
    for i in xrange(n_clusters_):
    
        kp2 = s[i]
        l = ms.labels_
        d, = np.where(l == i)
        des2 = des2_[d, ]
    
        FLANN_INDEX_KDTREE = 0
        index_params = dict(algorithm = FLANN_INDEX_KDTREE, trees = 5)
        search_params = dict(checks = 50)
    
        flann = cv2.FlannBasedMatcher(index_params, search_params)
    
        des1 = np.float32(des1)
        des2 = np.float32(des2)
    
        matches = flann.knnMatch(des1, des2, 2)
    
        # store all the good matches as per Lowe's ratio test.
        good = []
        for m,n in matches:
            if m.distance < 0.7*n.distance:
                good.append(m)
    
        if len(good)>3:
            src_pts = np.float32([ kp1[m.queryIdx].pt for m in good ]).reshape(-1,1,2)
            dst_pts = np.float32([ kp2[m.trainIdx].pt for m in good ]).reshape(-1,1,2)
    
            M, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 2)
    
            if M is None:
                print ("No Homography")
            else:
                matchesMask = mask.ravel().tolist()
    
                h,w = img1.shape
                pts = np.float32([ [0,0],[0,h-1],[w-1,h-1],[w-1,0] ]).reshape(-1,1,2)
                dst = cv2.perspectiveTransform(pts,M)
    
                img2 = cv2.polylines(img2,[np.int32(dst)],True,255,3, cv2.LINE_AA)
    
                draw_params = dict(matchColor=(0, 255, 0),  # draw matches in green color
                                   singlePointColor=None,
                                   matchesMask=matchesMask,  # draw only inliers
                                   flags=2)
    
                img3 = cv2.drawMatches(img1, kp1, img2, kp2, good, None, **draw_params)
    
                plt.imshow(img3, 'gray'), plt.show()
    
        else:
            print ("Not enough matches are found - %d/%d" % (len(good),MIN_MATCH_COUNT))
            matchesMask = None
    

    结果图片

    【讨论】:

    • 你的答案是所有搜索中最好的,但我需要一个 c++ 实现,我在 c++ 中找不到 estimate_bandwidthMeanShift 的替代品,你能指出我吗?如何在 C++ 中将找到的好的匹配项分割成集群?非常感谢!
    • 嗨!我想你可以在这里找到 C++ 的 MeanShift:docs.opencv.org/3.4.3/dc/d6b/…。关于这里的集群:docs.opencv.org/2.4/modules/core/doc/clustering.html
    • MeanShiftPython scikit.lean 中的差别很大,它是用来跟踪相机中的物体,不能跟踪多个物体。虽然OpenCVkmeans 需要先估计集群数,但这种情况让我很无奈。
    • 我把我的尝试发到了这个帖子,你能看一下吗?非常感谢,你的方法太完美了!stackoverflow.com/questions/52425355/…
    猜你喜欢
    • 2018-07-03
    • 2018-04-27
    • 2018-09-12
    • 2014-04-11
    • 1970-01-01
    • 2018-07-11
    • 2018-05-08
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多