HTK speaker adaptation
I am trying to adapt a monophone-based recogniser to a specific speaker, following the recipe given in HTKBook 3.4.1, section 3.6.2. I am stuck on the HHEd step, which I am invoking like so:
HHEd -A -D -T 1 -H hmm15/hmmdefs -H hmm15/macros -M classes regtree.hed monophones1eng
The error I end up with is as follows:
ERROR [+999] Components missing from Base Class list (2413 3375)
ERROR [+999] BaseClass check failed
The folder classes contains the file global, which has the following contents:
~b ‘‘global’’
<MMFIDMASK> *
<PARAMETERS> MIXBASE
<NUMCLASSES> 1
<CLASS> 1 {*.state[2-4].mix[1-25]}
The hmmdefs file within hmm15 had some mixture components missing (I am using 25 mixture components per state of each phone). I tried to "fill in the blanks" by adding mixture components with random mean and variance values but zero weights. This has had no effect either.
The HMMs are left-to-right models with 5 states (3 emitting), each state modelled by a 25-component mixture. Each component in turn models MFCC features with EDA (energy, delta and acceleration) components. There are 46 phones in all.
My questions are:
1. Is the way I am invoking HHEd correct? Can it be invoked in the above manner for monophones?
2. I know that the base class list (rtree.base) must contain every single mixture component, but where do I find these missing mixture components?
NOTE: Please let me know in case more information is needed.
Edit 1: The file regtree.hed contains the following:
RN "models"
LS "stats_engOnly_3_4"
RC 32 "rtree"
Thanks,
Sriram
Comments (1)
The way you invoke HHEd looks fine. The components are missing because they have become defunct. To deal with defunct components, read HTKBook 3.4.1, Section 8.4 (page 137).
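To see exactly which components have gone defunct, a rough Python sketch along these lines may help. It is not part of HTK; the function name find_missing_mixtures is made up for illustration. It assumes hmmdefs is saved in HTK's text format, uses a single stream, has no tied states, and that (as described in the question) defunct components are simply absent from the file while <NUMMIXES> still declares the full count.

```python
from collections import defaultdict


def find_missing_mixtures(hmmdefs_text):
    """Return {(hmm_name, state_index): [missing mixture indices]}.

    Hypothetical helper, not an HTK tool. Assumes a text-format MMF with
    one stream per state and untied states.
    """
    declared = {}                # (hmm, state) -> count given by <NUMMIXES>
    present = defaultdict(set)   # (hmm, state) -> mixture indices actually listed
    hmm = None
    state = None

    for line in hmmdefs_text.splitlines():
        tokens = line.split()
        if not tokens:
            continue
        key = tokens[0].upper()
        if key == "~H":
            # Start of a new model definition, e.g. ~h "aa"
            hmm = tokens[1].strip('"')
            state = None
        elif key == "<STATE>":
            state = int(tokens[1])
        elif key == "<NUMMIXES>" and state is not None:
            declared[(hmm, state)] = int(tokens[1])
        elif key == "<MIXTURE>" and state is not None:
            # <MIXTURE> index weight
            present[(hmm, state)].add(int(tokens[1]))

    missing = {}
    for where, count in declared.items():
        gaps = [m for m in range(1, count + 1) if m not in present[where]]
        if gaps:
            missing[where] = gaps
    return missing
```

Calling find_missing_mixtures(open("hmm15/hmmdefs").read()) would then list, per phone and state, the mixture indices that the base class pattern expects but the model set no longer contains.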
Questions:
- What does regtree.hed contain?
- How much data (in hours) are you using? 25 mixtures might be excessive.
You might want to increase the number of mixtures more gradually, using MU +1 or MU +2, and limit the total number of mixtures (a guess: 3-8, depending on the amount of training data).
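As an illustration of the gradual increase, here is a small Python sketch that generates the HHEd edit-script contents for each step. The function name mixture_schedule and its defaults are my own, not anything from HTK. It writes the absolute form of the command (MU n), which at each step amounts to the same thing as MU +step, and it assumes you re-estimate with HERest between steps and that the item list {*.state[2-4].mix} matches your 5-state models.

```python
def mixture_schedule(target, step=2, item_list="{*.state[2-4].mix}"):
    """Return HHEd script bodies that raise the mixture count from 1 to target.

    Hypothetical helper: each returned string is meant to be saved as its
    own .hed file and applied with HHEd, followed by re-estimation.
    """
    if target < 1 or step < 1:
        raise ValueError("target and step must be positive")

    scripts = []
    current = 1  # single-Gaussian starting point
    while current < target:
        # Never overshoot the requested final number of mixtures.
        current = min(current + step, target)
        scripts.append(f"MU {current} {item_list}\n")
    return scripts
```

For example, mixture_schedule(8, step=2) yields four scripts that go to 3, 5, 7 and then 8 components per state.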