整体文字识别算法详解
在哪里可以找到整体单词识别的算法详细信息?我需要在硬件(实际上是 FPGA)中构建一个简单的 OCR 系统,而科学期刊似乎很抽象?
有没有用于整体单词识别的开源(开放核心)代码?
谢谢
Where Can I find algorithm details for holistic word recognition? I need to build a simple OCR system in hardware (FPGAs actually), and the scientific journals seems so abstract?
Are there any open source (open core) codes for holistic word recognition?
Thanks
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
对于非常适合 FPGA 实现(并行)的算法,您可以查看:
http:// en.wikipedia.org/wiki/Cross-correlation
它速度快且易于实现。
唯一的事情是:它识别一个形状(在你的情况下是一些文本),取决于旋转和大小/拉伸/倾斜等。但如果这不是问题,它可以非常快并且非常强大。您应该只注意相似字符(如 o 和 c)的解释问题。
我用它来查找扫描表单上的默认文本,以获得感兴趣区域所在的方位,并且在这些图像(600 万像素)中进行搜索仅花费了大约 15 毫秒,而我们在 Core2 CPU 上的单线程实现。
For an algorithm that is quite suitable for FPGA implementation (embarrassingly parallel) you might look at:
http://en.wikipedia.org/wiki/Cross-correlation
It is fast, and easily implemented.
The only thing is: it recognizes a shape (in your case some text) DEPENDENT of the rotation and size / stretch / skew etc. But if that isn't a problem, it can be very fast and is quite robust. You should only watch out for interpretation problems with characters that are similar (like o and c).
I used it to find default texts on scanned forms to obtain bearings where Region of Interests are and searching in those images (6M pixels) only took around 15 ms with our implementation on a Core2 CPU in a single thread.