文本块内的正方体整数识别答案

【问题标题】：tesseract whole number recognition within a text block文本块内的正方体整数识别
【发布时间】：2015-01-11 17:18:35
【问题描述】：

我对 tesseract 和 openCV 还是很陌生。我正在构建一个简单的 Linux 应用程序来口述纸上的打印文本。使用 tesseract 我设法进行文本块识别，但是如果文本块中有一个整数，则该数字将被省略。例子输入： “你好，这是我 2014 年的 1080p 3D 视频” 输出： “你好，这是我的 1080p 3D 视频”

有人遇到过这个问题吗？

openCV：2.4.9 正方体：V3.02 莱普托尼卡：1.71 操作系统：Ubuntu 64bit 14.04 LTS

问候

【问题讨论】：

你是否可靠地在其他包含纯数字的句子上重复了这一点？注意 3D 和 1080p 工作，所以我认为训练数据不是问题。

标签： linux opencv ubuntu ocr tesseract

【解决方案1】：

我以前从未见过这种情况，但我知道您有时可以使用 Page Seg 模式更改这种行为。您是否使用自动页面分割？尝试其他一些设置并输入每个块、行甚至单词的文本。您可以通过以下方式根据手册更改 Page Seg Mode：

-psm N
Set Tesseract to only run a subset of layout analysis and assume a certain form of image. The options for N are:

0 = Orientation and script detection (OSD) only.
1 = Automatic page segmentation with OSD.
2 = Automatic page segmentation, but no OSD, or OCR.
3 = Fully automatic page segmentation, but no OSD. (Default)
4 = Assume a single column of text of variable sizes.
5 = Assume a single uniform block of vertically aligned text.
6 = Assume a single uniform block of text.
7 = Treat the image as a single text line.
8 = Treat the image as a single word.
9 = Treat the image as a single word in a circle.
10 = Treat the image as a single character.

【讨论】：