Sphinx II: recognition of read-aloud text
I'm trying to implement a speech recognizer client using Sphinx II. My goal is to have someone read a written sentence from a text aloud and to get a confidence score for each word.
For that I'm using finite state grammars and a limited dictionary.
I got something working, but the confidence scores are very low (< 0.30). I'm using the same settings as the simple Reco example of Sphinx II.
Since there is only one possibility for each word pronounced (each word is one state in the grammar), I would expect high scores and time-efficient recognition, but the hypotheses take time to compute. Also, I get the hypothesis as soon as the word has been pronounced.
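To make the setup concrete, here is a minimal sketch of the kind of linear grammar I mean, generated from one sentence. The helper name `sentence_to_fsg` is hypothetical, and the output layout (`FSG_BEGIN`, `NUM_STATES`, `START_STATE`, `FINAL_STATE`, `TRANSITION`, `FSG_END`) is the Sphinx FSG text format as I understand it, so it should be checked against the Sphinx documentation. Words sit on the transitions, so a sentence of N words needs N + 1 states. Upper-casing the words is also an assumption; the case has to match whatever the dictionary uses.

```python
import re


def sentence_to_fsg(sentence, name="sentence"):
    """Build a linear FSG: one transition per word, from state i to i + 1.

    Hypothetical helper; the text layout is assumed to follow the
    Sphinx FSG file format.
    """
    # Keep letters and apostrophes only; upper-case to match a CMU-style
    # dictionary (assumption: adjust if the dictionary is lower-case).
    words = [w.upper() for w in re.findall(r"[A-Za-z']+", sentence)]
    if not words:
        raise ValueError("sentence contains no words")

    lines = [
        f"FSG_BEGIN {name}",
        f"NUM_STATES {len(words) + 1}",
        "START_STATE 0",
        f"FINAL_STATE {len(words)}",
    ]
    # Only one path exists through the grammar, so each transition
    # gets probability 1.0.
    for i, word in enumerate(words):
        lines.append(f"TRANSITION {i} {i + 1} 1.0 {word}")
    lines.append("FSG_END")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(sentence_to_fsg("The cat sat on the mat."), end="")
```

Every word in the generated grammar also has to be present in the dictionary, otherwise the grammar cannot be loaded as intended.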
So now I don't know where to go to improve this. Am I using the right solution for this kind of problem? Should I use language models instead, and if so, how do I restrict the context to a given sentence in a story?
Also, do I need to use the Sphinx trainer? So far I've generated the language models and dictionary using the web tool.
Your help would be much appreciated.
Thanks, Boris Gougeon