用于文档 OCR/可读性的图像处理/增强算法?
我正在寻找算法、论文或软件来增强传真、手机摄像头图像以及其他类似来源的可读性和 OCR。
我主要对简单的增强功能感兴趣(例如,使用 ImageMagick 可以做的事情),但我也对更复杂的技术感兴趣。 我已经在与供应商交谈,所以对于这个问题我主要是在寻找算法或开源软件。
进一步澄清:我不是在寻找 OCR 软件或算法;我只是在寻找 OCR 软件或算法。 我正在寻找算法来清理图像,使其看起来更适合人眼阅读,并且可能用于 OCR。
I'm looking for algorithms, papers, or software to enhance faxes, images from cell phone cameras, and other similar source for readability and OCR.
I'm mainly interested in simple enhancements (eg. things you could do using ImageMagick), but I'm also interested in more sophisticated techniques. I'm already talking to vendors, so for this question I'm mostly looking for algorithms or open source software.
To further clarify: I'm not looking for OCR software or algorithms; I'm looking for algorithms to clean up the image so it looks more readable to the human eye, and can possibly be used for OCR.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
当我编写一些书籍扫描软件时,我遇到了类似的问题; 互联网上流传着一个名为 pagetools 的程序,它使用相当聪明的数学技巧称为拉东变换。
我还编写了一个小例程,可以将页面上的空白区域变白; 当 OCR 算法不必应对背景噪音时,它们往往会做得更好。 我所做的就是寻找距离深色像素超过小半径的浅色像素,然后将它们提升为纯白色。
不过,已经过去几年了,所以我手头没有确切的实现细节。
I had a similar problem when I was writing some software to do book scanning; floating around on the internet is a program called pagetools that does straightening of scanned-in pages using a fairly clever mathematical trick called the Radon transform.
I also wrote a small routine that would white out the blank space on the page; OCR algorithms tend to do a lot better when they don't have to contend with background noise. What I did, was look for light-colored pixels that were more than a small radius away from dark-colored ones, and then boost those up to being pure white.
It's been a few years, though, so I don't have the exact implementation details handy.
一个简单的图像过滤器是“中值过滤器”,它是一种非常简单、易于自己实现的过滤器,可帮助清理扫描/拍摄的文本。 http://en.wikipedia.org/wiki/Median_filter
One simple image filter to look into is the "Median Filter" which is a very straightforward, easy to implement yourself, filter to help clean up scanned/photographed text. http://en.wikipedia.org/wiki/Median_filter
根据要求,链接到维基百科:光学字符识别
Microsoft Research:光学字符识别论文
CiteSeerX : 光学字符识别论文
As requested, link to Wikipedia: Optical character recognition
Microsoft Research: Optical character recognition papers
CiteSeerX : Papers on optical character recognition