当前位置：文江博客话题详情

图像 OCR - 过滤不需要的数据

发布于 2024-10-18 05:51:54 字数 713 浏览 7 评论 0原文

基本上，我正在使用 tessract OCR 读取车辆牌照，然而，尽管能够通过改变对比度、减少噪音等轻松地强调文本，但车辆的某些“部分”仍然保留在图像上，这确实会导致 OCR 出错结果。

例如：

在此处输入图像描述

我可以很容易地更改它，例如：

在此处输入图像描述

我希望消除每个板的边缘，这是另一个示例：

在此处输入图像描述

我可以使用像素操作算法删除边缘，但我认为这不是正确的方法，并且会导致很多问题。

我一直在使用以下应用程序来测试各种方法，例如形态学和消除不需要的数据，到目前为止我还没有成功。

http://www.codeproject.com/KB/GDI-plus/Image_Processing_Lab。 aspx

然而，了解这一点的人可以使用上面文章中的应用程序来实现我正在尝试的目标，所以请随意尝试一下。

谢谢

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

天涯离梦残月幽梦 2024-10-25 05:51:54

请尝试使用笔画宽度变换概念。

这个概念用于从自然图像中分割文本......

回复收藏 0 原文

梦屿孤独相伴 2024-10-25 05:51:54

我已经做了这样的算法。我只能说效果很好。秘密在于，您需要知道光线可能仅来自一侧。您不能仅使用一个阈值将图像设置为“黑/白”。

检测图像各部分的平均亮度，并使用此亮度计算来设置每个区域的阈值。

例如，如果左上角较亮，则需要较低的阈值来使这些部分不那么亮。而如果右下角光线较弱，则需要将阈值设置得更高，才能接收到所有现有的光线信息。

然后，您只需使用以下方法从两侧驱动到图像：

IsPixelAboveThreshold ?

如果它在下面，则您位于边界上，如果它在上面，您可以说您位于图像的中间，亮度更高。

问候

I already did such an algorithm. I just can say that it works great. The secret is, that you need to know that the light is coming just from one side perhaps. You cannot set the image to "black/white" just by using ONE threshold.

Detect the average luminance of parts of the image and use this luminance calculation to set the threshold for each region.

For example, if the left top is lighter, you need a lower threshold to make these parts not to bright. And if the bottom right has low light, you need to set the threshold higher to receive all existing light information.

Then, you need just to drive into the image from each side by using the method: