如何调整 Solr 匹配参数?
我使用 Solr 作为应用程序的搜索组件,并希望尝试不同的因素以了解它如何影响结果。
具体来说,Solr 文档提到了基本评分因素:
- tf -->词频
- idf -->逆文档频率
- 坐标 -->协调因子
- lengthNorm -->基于字段长度的匹配
有人能告诉我如何“调整”这些值使用的任何数字因素吗? (如果可能的话,还没有找到太多说明“ye”或“ney”的文档)
在我尝试过这些之后,我将继续讨论诸如提升等方法。
谢谢你们!
I'm using Solr for the search component of my application, and am looking to play around with different factors to see how it affects results.
Specifically Solr docs make mention of the basic scoring factors:
- tf --> term frequency
- idf --> inverse document frequency
- coord --> coordination factor
- lengthNorm --> matches based on length of field
Could anyone tell me how to "adjust" whatever numerical factors are being used for these values? (If that's possible, haven't found much documentation saying ye or ney)
After I've played around with these I'll move on to methods such as boosting and so on.
Thanks guys!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以从自定义相似度类开始。
这将允许您修改上述参数和评分因素。
检查 lucene DefaultSimilarity 类供参考,这是实际的实现。
You can start with the Custom Similarity class.
This would allow you to modify the above parameters and scoring factors.
Check the lucene DefaultSimilarity class for reference which is the actual implementation.