在 Linux 下我可以使用哪些库(C 或 Python)来编写识别语音命令的软件?
我需要一份软件合同,要求从非常有限的集合中识别口头命令,例如“向上”、“向下”、“关闭”、“打开”等。
我似乎不需要完整的语音识别,即我不必进行波形到文本的转换。有具体的术语我可以谷歌吗?
我对这个主题还不够了解,无法自己编写这些东西。如果我想解决这个问题,我需要一个相当成熟的库来解决这个子问题,最好是开源的。
目标平台是某种运行 Linux 的工业 PC,我将使用 C 或 Python 来执行该应用程序。
I have a request for a software contract that calls for recognizing spoken commands from a very limited set, such as "up", "down", "close", "open" etc.
I don't seem to need full-blown speech recognition, i. e. I don't have to do wave-to-text transformation. Is there a specific term I could google?
I simply don't know enough about the subject to be able to code that stuff myself. If I want to tackl ethis at all, I would need a fairly mature library for this subproblem, preferrably open source.
The target platform would be some kind of industrial PC running Linux and I would use either C or Python for the application.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
这看起来是一个合适的候选者:CMU Sphinx。
不过还没有尝试过。
This looks like a suitable candidate: CMU Sphinx.
Haven't tried it, though.
你可以查看 pyspeech,但我认为它仅限于 Windows。请参阅Python 中用于语音转文本的语音处理库
You could check out pyspeech, but I think it is Windows only. See Speech processing library in Python for speech to text