VoiceXML 和语音合成之间的工作流程是怎样的?
我想知道语音服务器的文本到语音引擎如何呈现 VoiceXML 文档。 VXML 文档将包含要转换为音频文件的文本。如果TTS服务器理解MRCP,那么VXML文档转换成什么,以便语音服务器能够理解它以及如何......?
I would like to know how a VoiceXML document is rendered by a text to speech engine of a speech server. The VXML document would be containing the text which is supposed to be converted into an audio file. If the TTS server understands MRCP, to what is the VXML doc converted into, so that the speech server can understand it and how..?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
VoiceXML 文档作为一个整体不被 TTS 引擎解析。相反,VoiceXML 浏览器负责提取提示,包括 VoiceXML 文档中包含的任何语音合成标记语言 (SSML) 标记,并通过 MRCP 将该文本传递到 TTS 引擎。
您可以从 W3C 规范中找到有关 SSML 的更多信息:SSML 1.0 规范
The VoiceXML document as a whole is not parsed by the TTS engine. Instead, the VoiceXML browser is responsible for extracting the prompt, including any Speech Synthesis Markup Language (SSML) markup included in the VoiceXML document, and passing just that text to the TTS engine via MRCP.
You can find more info on SSML from the W3C specification: SSML 1.0 Specification