光学字符识别
我必须编写一个能够识别模式,特别是字符的程序。我已经在 C# 中实现了反向传播,现在我想将其用于模式识别。我还创建了一个表单应用程序并使用画笔/图形,以便用户可以借助鼠标编写一些内容(就像 MS Paint 中的“铅笔工具”)。所以我需要一些关于“如何在我的应用程序中实现字符识别方法?”的帮助材料。
通过互联网提供的帮助主要与反向传播和软件演示有关。
I've to write a program which is able to recognize patterns, specially characters. I've implemented back-propagation in c# and now I want to use it for the pattern recognition. I've also created a form application and used brush/graphics so that user can write something with the help of mouse (just like 'pencil tool' in MS Paint). So I need some helping material about "How to implement character recognition method in my application?".
Helping stuff over the internet mostly related to back-propagation and software demos.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
如果您的项目是其他项目,但您希望在项目中使用 OCR,则应该搜索执行此操作的第三方工具。但是,如果您的项目是这样的并且您想自己做,请阅读这个答案:
识别字符有两种方法。线上和线下。
在线方式采用笔(或鼠标)输入数据。离线方式仅使用像素。
您的第一步将从其中选择一个。离线方式没有笔数据,这是一个有用的功能。但在离线状态下,您可以识别图像文件中的字符(通过绘画创建并保存甚至扫描)。
其次,您应该预处理数据(此步骤仅适用于离线方式)。您应该从中消除噪音,缩放它,然后对其进行细化。
接下来,您应该从预处理的数据(在线或离线)中提取有用的特征。为此,您可以阅读一些有关光学字符识别及其特征提取的文章。有一个关于预处理和特征提取的很好的幻灯片演示 这里。此外,在 Google 中搜索词末尾的
pdf
关键字和filetype:pdf
也会对您有所帮助!然后你应该使用神经网络或类似的东西来识别角色。输入应该是提取的特征。
但请记住,这个项目并不容易,可能需要一些时间! (这是我的波斯语项目)
If your project is something else but you want to have OCR in your project, you should search for third party tools that do this. But if your project is this and you want to do that yourself, read this answer:
There are two ways of recognizing characters. Online and offline.
Online way uses the pen (or mouse) input data. and offline way uses just the pixels.
Your first step will be choose from one of them. offline way hasn't the pen data, this is a useful feature. but in offline, you can recognize characters from image files (created by paint and saved or even scanned)
Second, you should preprocess data (this step is for only offline way). you should remove noises from it, scale it, and do the Thinning to it.
Next, you should extract useful features from the preprocessed data (online or offline). for this, you can read some articles about optical character recognition and feature extractions of it. there is a good powerpoint presentation about preprocessing and feature extraction here. Also
pdf
keyword andfiletype:pdf
at the end of your search term in google would help you!Then you should use neural networks or something like that to recognize the character. inputs should be extracted features.
But remember, this project is not easy and may take some time! (This was my project for Persian language)