【问题标题】:Google Cloud Vision- Method: files.annotate - Response Object ChangingGoogle Cloud Vision- 方法:files.annotate - 响应对象更改
【发布时间】:2020-02-07 14:30:06
【问题描述】:

根据Cloud Vision Docs 中提供的文档,blocks 数组中的 BoundingPoly 对象应该具有这样的格式

{
  "vertices": [
    {
      object (Vertex)
    }
  ],
  "normalizedVertices": [
    {
      object (NormalizedVertex)
    }
  ]
}

但是当我们尝试https://vision.googleapis.com/v1/files:annotate?key=xxxxxx 对带有请求的 PDF 文件执行 OCR 时:

{
    "requests": [{
        "inputConfig": {
            "content": "encoded content",
            "mimeType": "application/pdf"
        },
        "features": [{
            "type": "DOCUMENT_TEXT_DETECTION",
            "maxResults": 50
        }]
    }]
}

来自服务器的响应是

{
    "responses": [
        {
            "responses": [
                {
                    "fullTextAnnotation": {
                        "pages": [
                            {
                                "property": {
                                    "detectedLanguages": [
                                        {
                                            "languageCode": "en",
                                            "confidence": 0.65
                                        },
                                        {
                                            "languageCode": "fil",
                                            "confidence": 0.01
                                        }
                                    ]
                                },
                                "width": 841,
                                "height": 595,
                                "blocks": [
                                    {
                                        "property": {
                                            "detectedLanguages": [
                                                {
                                                    "languageCode": "en",
                                                    "confidence": 1
                                                }
                                            ]
                                        },
                                        "boundingBox": {
                                            "normalizedVertices": [
                                                {
                                                    "x": 0.4351962,
                                                    "y": 0.057142857
                                                },
                                                {
                                                    "x": 0.6052319,
                                                    "y": 0.057142857
                                                },
                                                {
                                                    "x": 0.6052319,
                                                    "y": 0.08571429
                                                },
                                                {
                                                    "x": 0.4351962,
                                                    "y": 0.08571429
                                                }
                                            ]
                                        },
                                        "paragraphs": [
                                            {
                                                "property": {
                                                    "detectedLanguages": [
                                                        {
                                                            "languageCode": "en",
                                                            "confidence": 1
                                                        }
                                                    ]
                                                },
                                                "boundingBox": {
                                                    "normalizedVertices": [
                                                        {
                                                            "x": 0.4351962,
                                                            "y": 0.057142857
                                                        },
                                                        {
                                                            "x": 0.6052319,
                                                            "y": 0.057142857
                                                        },
                                                        {
                                                            "x": 0.6052319,
                                                            "y": 0.08571429
                                                        },
                                                        {
                                                            "x": 0.4351962,
                                                            "y": 0.08571429
                                                        }
                                                    ]
                                                },
                                                "words": [
                                                    {
                                                        "property": {
                                                            "detectedLanguages": [
                                                                {
                                                                    "languageCode": "en"
                                                                }
                                                            ]
                                                        },
                                                        "boundingBox": {
                                                            "normalizedVertices": [
                                                                {
                                                                    "x": 0.4351962,
                                                                    "y": 0.057142857
                                                                },
                                                                {
                                                                    "x": 0.49346018,
                                                                    "y": 0.057142857
                                                                },
                                                                {
                                                                    "x": 0.49346018,
                                                                    "y": 0.08571429
                                                                },
                                                                {
                                                                    "x": 0.4351962,
                                                                    "y": 0.08571429
                                                                }
                                                            ]
                                                        },
                                                        "symbols": [
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "F",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "a",
                                                                "confidence": 1
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "c",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "t",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "o",
                                                                "confidence": 1
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "r",
                                                                "confidence": 1
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ],
                                                                    "detectedBreak": {
                                                                        "type": "SPACE"
                                                                    }
                                                                },
                                                                "text": "y",
                                                                "confidence": 1
                                                            }
                                                        ],
                                                        "confidence": 0.99
                                                    },                                              
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "i",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "n",
                                                                "confidence": 1
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ],
                                                                    "detectedBreak": {
                                                                        "type": "SPACE"
                                                                    }
                                                                },
                                                                "text": "g",
                                                                "confidence": 1
                                                            }
                                                        ],
                                                        "confidence": 0.99
                                                    },
                                                    {
                                                        "property": {
                                                            "detectedLanguages": [
                                                                {
                                                                    "languageCode": "en"
                                                                }
                                                            ]
                                                        },
                                                        "boundingBox": {
                                                            "normalizedVertices": [
                                                                {
                                                                    "x": 0.57431626,
                                                                    "y": 0.057142857
                                                                },
                                                                {
                                                                    "x": 0.6052319,
                                                                    "y": 0.057142857
                                                                },
                                                                {
                                                                    "x": 0.6052319,
                                                                    "y": 0.08571429
                                                                },
                                                                {
                                                                    "x": 0.57431626,
                                                                    "y": 0.08571429
                                                                }
                                                            ]
                                                        },
                                                        "symbols": [
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "L",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "i",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ]
                                                                },
                                                                "text": "s",
                                                                "confidence": 0.99
                                                            },
                                                            {
                                                                "property": {
                                                                    "detectedLanguages": [
                                                                        {
                                                                            "languageCode": "en"
                                                                        }
                                                                    ],
                                                                    "detectedBreak": {
                                                                        "type": "LINE_BREAK"
                                                                    }
                                                                },
                                                                "text": "t",
                                                                "confidence": 1
                                                            }
                                                        ],
                                                        "confidence": 0.99
                                                    }
                                                ],
                                                "confidence": 0.99
                                            }
                                        ],
                                        "blockType": "TEXT",
                                        "confidence": 0.99
                                    }

BoundingPoly对象(boundingBox)property in the above json

中缺少vertices对象有什么需要考虑的

在拖放demo 中尝试时,在图像上完成的 OCR 的 json 响应是

  "fullTextAnnotation": {
    "pages": [
      {
        "blocks": [
          {
            "blockType": "TEXT",
            "boundingBox": {
              "vertices": [
                {
                  "x": 31,
                  "y": 63
                },
                {
                  "x": 147,
                  "y": 63
                },
                {
                  "x": 147,
                  "y": 81
                },
                {
                  "x": 31,
                  "y": 81
                }
              ]
            },
            "confidence": 0.99,
            "paragraphs": [
              {
                "boundingBox": {
                  "vertices": [
                    {
                      "x": 31,
                      "y": 63
                    },
                    {
                      "x": 147,
                      "y": 63
                    },
                    {
                      "x": 147,
                      "y": 81
                    },
                    {
                      "x": 31,
                      "y": 81
                    }
                  ]
                },

这是预期的行为还是任何问题?我们应该遵循哪个字段 normalizedVerticesvertices !!

【问题讨论】:

    标签: google-cloud-platform google-cloud-vision


    【解决方案1】:

    不同之处在于,在从代码发出的请求中,您发送的是 PDF。在拖放演示中,您发送的是图像(该演示不接受文件)。

    我复制了这个并且行为似乎是不变的:PDF 文件使用 NormalizedVertices 进行注释,而图像使用 Vertices 进行注释。我的猜测是,这是提高大型 PDF 文件注释请求(由于页数很大)的性能的预期行为。

    我向 Google 文档发送了一个请求,以便他们可以在他们的文档中添加此信息。

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 2019-04-03
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 2018-06-10
      • 1970-01-01
      相关资源
      最近更新 更多