如何使Keras-Or默认模型仅识别数字？

发布于 2025-02-02 16:59:17 字数 983 浏览 4 评论 0 原文

我使用Python和Keras OCR。我希望Keras只识别数字，所以在管道中我这样做。

recognizer = keras_ocr.recognition.Recognizer(alphabet="0123456789")
pipeline = keras_ocr.pipeline.Pipeline(recognizer=recognizer)

但是，它没有将字母转向数字并提高诸如Tesseract白名单之类的识别质量，而是发生了。因此，这些数字根本无法识别。

使用默认字母，结果更好。但是有些数字与字母混淆。但是，将字母更改为“替换（“ O”，“ 0”）”之类的数字是一个坏主意。

识别功能简单而复制:)


    _image = keras_ocr.tools.read(_path)
    plt.figure(figsize=(10, 20))
    plt.imshow(_image)

    prediction = pipeline.recognize([_image])[0]
    fig, axs = plt.subplots(1, figsize=(10, 20))
    keras_ocr.tools.drawAnnotations(image=_image, predictions=prediction, ax=axs)
    plt.show()

原文

I use python and keras ocr.
I want keras to recognize only numbers, so in pipeline i do this.

recognizer = keras_ocr.recognition.Recognizer(alphabet="0123456789")
pipeline = keras_ocr.pipeline.Pipeline(recognizer=recognizer)

But instead of turning letters to digits and improving quality of recognition like tesseract whitelist it happens.

So the numbers are not recognized at all.

With default alphabet the result is better. But some digits are confused with letters. However change letters to digits like "replace("O", "0")" is quite a bad idea.

Function for recognizing is simple and copied :)


    _image = keras_ocr.tools.read(_path)
    plt.figure(figsize=(10, 20))
    plt.imshow(_image)

    prediction = pipeline.recognize([_image])[0]
    fig, axs = plt.subplots(1, figsize=(10, 20))
    keras_ocr.tools.drawAnnotations(image=_image, predictions=prediction, ax=axs)
    plt.show()

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

呆° 2025-02-09 16:59:17

我没有发现比使用Keras OCR工具学习模型更多的简单方法。
但是，用于合成数据的文本生成器使用具有想法，含义的书籍，期刊或SMTH的文本（我不知道用英语说:)）。因此，数字很少，有时如果您的字母为“ 0123456789”，生成器会返回空字符串。因此，我写了自己的发电机，只能用数字制作字符串。
https：////keras-ocr.readthedocs.io oredocs.ioo of示例/end_to_end_training.html