Grapheme-to-phoneme heuristic for OOV words
I need to do grapheme-to-phoneme (G2P) conversion in C++. However, most G2P libraries are written in Python or Rust (PocketSphinx might work, but it seems like a headache).
G2P is usually done by:
- Dictionary lookup for non-homographs
- Part-of-speech (POS) tagging to disambiguate homographs
- Deep learning for OOV (out-of-vocabulary) words
However, for now it doesn't matter if vowels get substituted in homographs and OOV words.
I just need to get my other code up and running, and then I can forget about this disaster, so a quick heuristic is all I need.
I'll post an answer soon.
Here's an attempt: https://godbolt.org/z/z6G4o6c4K
It pronounces "school" as "s-chul" (in ARPAbet: S CH UW L). But that's good enough, I suppose.
It first tries to parse the first two characters. If they match a common digraph (e.g. "ch", "sh", "th"), it returns the corresponding phoneme and passes the remainder along. Otherwise it converts the first letter to the closest phoneme.
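For readers who can't open the link, the approach described above might look roughly like the following. This is a minimal sketch and not the code behind the Godbolt link: the function name `naive_g2p`, the digraph table (including the extra "ng" and "oo" entries) and the single-letter ARPAbet mapping are all assumptions made for illustration. It needs C++17.

```cpp
#include <array>
#include <cstddef>
#include <string>
#include <string_view>
#include <utility>

// Hypothetical helper: crude spelling-to-ARPAbet conversion for OOV words.
// Both tables below are illustrative guesses, not a real pronunciation model.
std::string naive_g2p(std::string_view word) {
    // Two-letter spellings that map to a single phoneme.
    static constexpr std::array<std::pair<std::string_view, std::string_view>, 5>
        kDigraphs{{{"ch", "CH"}, {"sh", "SH"}, {"th", "TH"},
                   {"ng", "NG"}, {"oo", "UW"}}};
    // Fallback: the "closest" phoneme for each letter a..z.
    static constexpr std::array<std::string_view, 26> kLetters{
        "AE", "B", "K", "D", "EH", "F", "G", "HH", "IH", "JH", "K", "L", "M",
        "N", "AA", "P", "K", "R", "S", "T", "AH", "V", "W", "K S", "Y", "Z"};

    // Lowercase ASCII letters and drop everything else.
    std::string lower;
    for (char c : word) {
        if (c >= 'A' && c <= 'Z') c = static_cast<char>(c - 'A' + 'a');
        if (c >= 'a' && c <= 'z') lower += c;
    }

    std::string out;
    auto emit = [&out](std::string_view phoneme) {
        if (!out.empty()) out += ' ';
        out += phoneme;
    };

    std::size_t i = 0;
    while (i < lower.size()) {
        bool matched = false;
        // Try the next two characters as a digraph first.
        if (i + 1 < lower.size()) {
            const std::string_view two(lower.data() + i, 2);
            for (const auto& [graph, phoneme] : kDigraphs) {
                if (two == graph) {
                    emit(phoneme);
                    i += 2;
                    matched = true;
                    break;
                }
            }
        }
        // Otherwise convert the single letter.
        if (!matched) {
            emit(kLetters[static_cast<std::size_t>(lower[i] - 'a')]);
            ++i;
        }
    }
    return out;
}
```

With these tables, `naive_g2p("school")` returns `S CH UW L`, the same mispronunciation mentioned above, because "ch" is matched as a digraph before the "c" can be read as K. The loop stands in for "pass the remainder"; a recursive version would behave the same way.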