用于识别 .NET 中的手写内容的 OCR
我需要在 .NET 应用程序中使用可以识别手写内容的 OCR 组件。 我们从 MS Office 获取 MODI 组件并运行,它可以识别打印文本,但不能识别手写文本。
我不介意付钱。
有人有什么建议吗?
非常感谢 吉姆
I need to use an OCR component within a .NET application that can recognise handwriting. We got the MODI component from MS office up and running, and it recognises printed text, but not handwriting.
I don't mind paying.
Does anyone have any recommendations?
Many thanks
Jim
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(5)
我推荐Pegasus Imaging,我们每天都使用它,它非常棒。 对于手写,您可以使用 ICR 模块。
I recommend Pegasus Imaging, we use it every day and it is great. For handwriting you have the ICR module.
在我的应用程序中,我们需要识别一些手写文本填写的表格。 只要用户使用格式正确的字符,准确性就很好。 我使用的 SDK 名为 Leadtools。 我的应用程序使用 MFC,但它们也有 .NET 类。
In my application, we had a need to recognize some forms that were filled in with hand-printed text. The accuracy was good as long as the users used well-formed characters. The SDK I used is called Leadtools. My application uses MFC but they also have .NET classes.
大多数 OCR 工具包旨在识别打印文本,而不是手写文本。 我会推荐 Atalasoft 的工具包,因为您可以灵活地插入各种 OCR 引擎。
识别手写文本的准确性要低得多,而且更加困难。 它通常被标记为智能字符识别(ICR),我不确定Atalasoft是否, Pegasus 或大多数商业产品都支持 ICR。 手写识别是通过在网格上的单元格中写下每个字母来辅助的,就像在 SAT 考试中填写你的名字一样。 您是在谈论自由格式文本还是在网格上书写?
Most OCR toolkits are aimed at recognizing printed text, not handwritten text. I would recommend Atalasoft's toolkit for flexibility in that you can plug-in various OCR engines.
Recognizing handwritten text is much less accurate and more difficult. It is usually labelled as Intelligent Character Recognition (ICR), and I am not sure whether Atalasoft, Pegasus, or most commercial products support ICR. Hand-written recognition is aided by writing each letter in a cell on a grid, like filling out your name on the SAT. Are you talking about free form text or writing on a grid?
尝试 tessnet,它是一个基于 tesseract 的开源 .NET OCR 引擎
Try tessnet, it's an open-source .NET OCR engine based on tesseract
区分手写文本和草书书写很重要。 术语 ICR 通常用于指手写文本。
Accusoft Pegasus 提供了 SmartZone ICR .NET SDK,可以让您快速识别文本。 当您控制文本的输入方式(例如在表单上等)时,准确性会变得越来越好。
认识到可用于实时识别的提示(例如在平板电脑上)是完全不同的也很重要比扫描图像中可用的内容。 这就是为什么实时草书识别比事后识别(扫描图像)容易得多。 你可以读出笔的动作,笔画的顺序对识别很有帮助。
It's important to distinguish between hand-printed text and cursive script writing. The term ICR generally is used to refer to hand-printed text.
Accusoft Pegasus provides a SmartZone ICR .NET SDK that can get you recognizing text very quickly. Accuracy get better and better as you gain control over how the text is entered, such as on a form, etc.
It also is important to realize that the cues available to real-time recognition, such as on a tablet PC, are completely different than those available in a scanned image. This is why cursive recognition is so much easier in real-time than after the fact (scanned image). You can read the movements of the pen, and the order of strokes is very helpful in recognition.