IBM Cloud Watson Discovery:相关性训练从未成功运行
我将包含 9 个文档的 CSV 文件上传到 Watson Discovery 中的集合。我尝试使用一些查询来搜索该集合,但尽管返回了正确的文档,但置信度确实很低(0.01 -> 0.02)。这让我接受了相关性培训。我输入了大约 60 个问题并对返回的结果进行评分(在改进工具面板上)。然而,在我看来,训练似乎从未开始。 IBM不断展示“IBM将很快开始学习”。 这是通过 python-sdk API 检查的项目状态。这样的情况已经持续了好几天了。
我的问题是:
- 相关性培训可能出现什么问题,导致培训过程无法运行?
- 置信度是0.01->对于未经训练的集合(未经训练的策略),0.02 是正常的吗?
先感谢您。
I uploaded a CSV file containing 9 documents to a collection in Watson Discovery. I've tried searching this collection with some queries but the confidences are really low(0.01 -> 0.02), despite returning the correct document. That led me to Relevancy training. I input around 60 questions and rate the returning results (on the Improvement tools panel). However, it seems to me that the training never starts. IBM keeps showing "IBM will begin learning soon".
Here is the project status checked by python-sdk API. It has been like this for a couple of days.
My questions are:
- What could be possibly wrong with the relevancy training that lead to the training process not running?
- Is confidence of 0.01 -> 0.02 normal for an untrained collection (untrained strategy)?
Thank you in advance.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
原来是文档格式不对。我的同事上传了一个包含 HTML 代码的 CSV 文件,但 IBM Discovery 似乎不喜欢它。
我将它们转换为一组 pdf 文件,它可以工作。
It turns out that the format of the document is off. My coworker uploaded a CSV file with HTML code and IBM Discovery doesn't seem to like it.
I converted them to a set of pdf files and it works.