OCR 和文字审阅
我正在使用 Tesseract 进行我的字母识别项目,目前识别效果非常好。图像处理部分是使用OpenCv库完成的。 这些字母是手写的。但是当我用它来识别字母“O”和数字“0”时,出现了一些问题。这些字母在数据区域中用作输入名称的字段。所以名字中不能有任何数字。当我们使用数据字段系统作为出生日期时,它只包含数字。所以我愿意对识别系统进行限制,说对应的数据字段只有数字或字母。
我也愿意用可能的单词来检查已识别的字母,以便我们可以提高数据的准确性。我愿意使用 openCv 库来完成这项任务。但我不知道有哪些库可以帮助完成这项任务以及它们的功能是什么。所以请有人帮助我。谢谢。
问候, 蒂兰卡。
I'm using Tesseract for my letter recognition project and currently the recognitions is quite good. The image processing part was done using OpenCv libraries.
The letters are hand written.But there are some problems when I used it to recognise the letter "O" and number "0". These letters are used in data areas as the fields that enter names. So names cannot have any numbers with it. And when we are using the the system of the data fields as date of birth it only contains numbers. So I'm willing to give restriction to the recognition system saying that the corresponding data fields have only numbers or the letters.
And also I'm willing to review the recognised letters with the possible words so we can improve the accuracy of the data. I'm willing to use the openCv libraries for this task. But I don't know what are the libraries that help for this task and what are the functionalities of those. So please can some one help me. Thank you.
Regards,
Thilanka.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
我从来没有使用过 Tesseract。但是,常见问题解答中提到
如何仅识别数字?
想必您可以使用常见问题解答条目的模式来设置它,以便它仅正确识别字母或数字。
如果您已经尝试过此操作,您能否提供更多详细信息来说明为什么它不起作用?
I've never used Tesseract. However, in the FAQ it says
How do I recognise only digits?
Presumably you could use the pattern of the FAQ entry to set it up so it only recognises letters or just digits appropriately.
If you have already tried this, can you give more details of why it doesn't work?