从 Apache Mahout LDA 包访问 LDA 参数
我刚刚测试了使用 Apache Mahout 在 30 个文档的语料库上构建潜在狄利克雷分配模型。我的系统上没有安装 Hadoop,这就是 Mahout 的本地执行产生结果模型的原因。我想访问模型参数,如估计的 \alpha、\beta、\Phi、\Theta
我如何访问这些参数?
/mahout lda -i /tf-vectors -o -k 4-v 27
我可以看到它有学习算法每次迭代(我假设)的文件夹。每个都有一个文件part-r-0000,我不知道如何访问。
任何帮助将不胜感激。
I have just tested using Apache Mahout for building an Latent dirichlet allocation model on a corpus of 30 documents. I did not have Hadoop installed on the system thats why a local execution of the Mahout yielded the resulting model. I would like to access the model parameters, as in the estimated \alpha, \beta, \Phi, \Theta
How can I access these?
/mahout lda -i /tf-vectors -o -k 4-v 27
I can see that has folder for each iteration(i presume) of the learning algorithm. Each has a single file part-r-0000 which I do not know how to access.
Any help will be appreciated.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
我无法立即帮助你,只能说能够帮助你的人
阅读 mahout-user 邮件列表并在那里回答问题。我建议你
在那里重复你的问题。
有关如何订阅的更多信息,请参阅此处:https://cwiki.apache.org/MAHOUT /mailing-lists.html
I can't help you off-hand except to say that the people who can help you all
read the mahout-user mailing list and answer questions there. I suggest you
repeat your question there.
See here for more information about how to subscribe: https://cwiki.apache.org/MAHOUT/mailing-lists.html