如何将语音转换为文字?
我正在尝试使用 sphinx4 将我的 wav 文件转换为文本文件。是否可以识别语法文件中未包含的单词?
I am trying to converting my wav file to text file using sphinx4. Is it possible to recognize the word which is not include in grammar file?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
这是一个常见的误解,您应该在没有任何语法的情况下工作,以便能够有效地识别语音并解决应用程序任务。相反,使用一些可以解决问题的解决方案是实用的,该解决方案可以工作并解码您的文件。
如果您不确定领域或语言,您始终可以使用非常通用的语言模型,辅以基于音节的语法来解码未知单词。之后通常使用网络查询将基于音节的变体转换为单词,从而允许系统获取词汇。这将使您能够为非常通用的语音类型获得良好的结果。
Sphinx-4 支持这一点。
This is a common misconception that you should exactly work without any grammar to be able to recognize speech efficiently and solve the application task. Instead, it's practical to use some solution to the problem which will work and decode you files.
If you are not sure about the domain or the language you can always use a very generic language model assisted with syllable-based grammar to decode unknown words. It's often common to use web queries after that to turn syllable-based variants into words thus allowing system to aquire vocabulary. That will allow you to get a good result for very generic types of speech.
Sphinx-4 supports that.