生成 Tesseract 训练数据失败
我在 Windows 10 上使用 Tesseract v5.0.1.20220118,训练只有字母“P”和“Q”的字体。
当我执行到此步骤时,
mftraining -F font_properties.txt -U unicharset -O normal.unicharset pq.normal.exp0.tr
未生成 pffmtable
文件。
当我运行代码 cntraining pq.normal.exp0.tr
它告诉我
Reading pq.normal.exp0.tr ...
Clustering ...
N == sizeof(Cluster->Mean):Error:Assert failed:in file ../../../src/classify/cluster.cpp, line 2526
为什么会出错?我该如何修复它?
我只生成了 inttemp
和 shapetable
,但教程说会有四个文件,包括 shapetable
、inttemp
、 pffmtable
和 normproto
,我想知道可能是因为字体只有字母“P”和“Q”,但我不知道如何解决。
I'm using Tesseract v5.0.1.20220118 on Windows 10, training a font only have letter "P" and "Q".
When I get to the step
mftraining -F font_properties.txt -U unicharset -O normal.unicharset pq.normal.exp0.tr
The pffmtable
file is not generated.
And when I run code cntraining pq.normal.exp0.tr
It shows me
Reading pq.normal.exp0.tr ...
Clustering ...
N == sizeof(Cluster->Mean):Error:Assert failed:in file ../../../src/classify/cluster.cpp, line 2526
Why it goes wrong? How can I fix it?
I only have inttemp
and shapetable
generated, but the tutorial says there will be four files include shapetable
, inttemp
, pffmtable
and normproto
, I wonder that maybe is beacuse of the font only have letter "P" and "Q", but I have no idea how to solve it.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
请阅读文档:
https://tesseract-ocr.github。 io/tessdoc/#training-for-tesseract-5
使用正确的工具:
https://github.com/tesseract-ocr/tesstrain
Please read the docs:
https://tesseract-ocr.github.io/tessdoc/#training-for-tesseract-5
Use the right tools:
https://github.com/tesseract-ocr/tesstrain