使用 Lucene 进行个性化搜索
我想询问有关个性化搜索的问题。我即将使用 Lucene 设计/实现个性化搜索。我对此做了一些谷歌搜索,但似乎没有找到可以与 Lucene 一起使用的模块/工具。所以我想我需要自己实现一个。我想知道是否有任何可用的工具,这样我就不会重新发明轮子。
如果没有什么可以利用的,那也没关系。但我想知道是否有关于如何实施的任何建议。我当前的想法是向每个文档添加额外的位向量或布隆过滤器。每个位代表文档与一个用户或一组相似用户的相关性。相关性信息来自数据挖掘算法。我不确定这是否是最佳解决方案(同时实施成本低廉)。
任何建议/评论将不胜感激!谢谢你!
I'd like to ask questions about personalized search. I'm about to design/implement a personalized search with Lucene. I did some googling about that, but didn't seem to find module/tools ready to use with Lucene. So I guess I need to implement one on my own. I would like to know if there's any tools available so that I don't reinvent the wheel.
If nothing is available, it's fine too. But I would like to know if there's any suggestion on how to implement it. My current idea is about adding an extra bit-vector or bloom filter to each document. Each bit represents the relevancy of the document to a user or a group of similar users. The relevancy info is derived from data mining algorithms. I am not exactly sure if this is the optimal solution (cheap to implement at the same time).
Any suggestion/comment will highly appreciated! Thank you!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以添加一个表示匹配组/用户的字段,并提高各个值。例如,文档“horses and butterflies”的字段组将具有增强标记组:“schoolgirls”^10 组:“bikers”^-1,而文档“road horses”将具有增强标记组:“schoolgirls” “^2 组:“骑自行车的人”^10。如果您是骑自行车的人,搜索马会首先返回公路马;如果您是女学生,则搜索马会首先返回蝴蝶。
You could add a field representing the matched groups / users, and boost the individual values. For example, the field group for the document "horses and butterflies" would have the boosted tokens group:"schoolgirls"^10 group:"bikers"^-1 while the document "road horses" would have the boosted tokens group:"schoolgirls"^2 group:"bikers"^10. Searching for horses would return road horses first in case you are a biker, and butterflies first in case you are a schoolgirl.