Mahout rescorer implementation

Posted on 2024-11-30 15:46:43

I'd like to weight every PearsonItemSimilarity value between two items by the number of co-ratings the items share, divided by 50.

In other words, I want to update the generic Pearson similarity between two items (say, items a and b) as follows:

    similarity_new_ab = similarity_ab * numCoRatings_ab / 50

  1. How does one get the number of co-ratings between two games using the existing Mahout framework?

  2. Can someone please link me to (or illustrate) an example implementation of a rescorer?

My reasoning is as follows.

I postulate that most of the Pearson similarities calculated are based on a small number of co-ratings (1 or 2 in most cases). This leads to games having a Pearson correlation of 1 with each other, which would probably not hold if more co-ratings existed.

To account for this, I'd like to change these "naive" Pearson similarities to a similarity that is also based on the number of co-ratings.
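To make the small-sample concern concrete, here is a minimal sketch in plain Java (no Mahout classes) that computes a Pearson correlation over only the users who rated both items. The class name, the method name, and the `Map<Long, Double>` layout (user ID to rating) are assumptions made for illustration. With exactly two co-ratings the result is always +1 or -1 whenever it is defined, and with a single co-rating the variance is zero, so the correlation is undefined.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration; not part of Mahout.
class SmallSamplePearson {

    /** Pearson correlation over users who rated both items; NaN when undefined. */
    static double pearson(Map<Long, Double> ratingsA, Map<Long, Double> ratingsB) {
        // Collect the rating pairs of users present in both maps (the co-ratings).
        List<double[]> pairs = new ArrayList<>();
        for (Map.Entry<Long, Double> e : ratingsA.entrySet()) {
            Double other = ratingsB.get(e.getKey());
            if (other != null) {
                pairs.add(new double[] {e.getValue(), other});
            }
        }
        int n = pairs.size();
        if (n == 0) {
            return Double.NaN;
        }
        double meanA = 0.0, meanB = 0.0;
        for (double[] p : pairs) {
            meanA += p[0];
            meanB += p[1];
        }
        meanA /= n;
        meanB /= n;
        double cov = 0.0, varA = 0.0, varB = 0.0;
        for (double[] p : pairs) {
            double da = p[0] - meanA;
            double db = p[1] - meanB;
            cov += da * db;
            varA += da * da;
            varB += db * db;
        }
        double denom = Math.sqrt(varA * varB);
        // Zero variance on either side (e.g. a single co-rating) leaves Pearson undefined.
        return denom == 0.0 ? Double.NaN : cov / denom;
    }

    public static void main(String[] args) {
        // Two users rated both games: the correlation is forced to an extreme value.
        System.out.println(pearson(Map.of(1L, 4.0, 2L, 5.0), Map.of(1L, 1.0, 2L, 3.0)));
        // Only user 1 rated both games: the correlation is undefined.
        System.out.println(pearson(Map.of(1L, 4.0, 2L, 5.0), Map.of(1L, 1.0, 3L, 3.0)));
    }
}
```

Two points always lie on a straight line, which is why a pair of co-ratings cannot produce anything other than a perfect correlation.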

I thought this was what the rescorer was built for, but I guess I was wrong.

Comments (1)

℡Ms空城旧梦 2024-12-07 15:46:43

You want the getNumUsersWithPreferenceFor() method on DataModel; pass it the two item IDs.
I don't think this weighting is the best thing to do with this similarity metric, though. If you are using co-occurrence, look at LogLikelihoodSimilarity instead.
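For reference, the weighting from the question can be sketched in plain Java as below. This is only an illustration under stated assumptions: the class and method names are hypothetical, the co-rating count is computed from two in-memory sets of user IDs rather than from a Mahout DataModel, and the capped variant is an addition that the question does not ask for. It is included because the formula as written yields values above 1 once two items share more than 50 co-ratings.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper for illustration; not part of Mahout.
class CoRatingWeighting {

    /** Number of users who rated both items (the co-rating count). */
    static int coRatingCount(Set<Long> usersOfA, Set<Long> usersOfB) {
        Set<Long> both = new HashSet<>(usersOfA);
        both.retainAll(usersOfB);
        return both.size();
    }

    /** The question's formula as written: similarity * numCoRatings / divisor. */
    static double weighted(double similarity, int numCoRatings, int divisor) {
        return similarity * numCoRatings / divisor;
    }

    /** Assumption, not from the question: cap the weight at 1 so the result stays in [-1, 1]. */
    static double weightedCapped(double similarity, int numCoRatings, int divisor) {
        return similarity * Math.min(numCoRatings, divisor) / divisor;
    }

    public static void main(String[] args) {
        // Users 2 and 3 rated both items, so the count is 2.
        int n = coRatingCount(Set.of(1L, 2L, 3L), Set.of(2L, 3L, 4L));
        System.out.println("co-ratings: " + n);
        // A "perfect" similarity backed by two co-ratings shrinks to a small value.
        System.out.println("weighted: " + weighted(1.0, n, 50));
        // With 100 co-ratings the uncapped formula leaves the usual [-1, 1] range.
        System.out.println("uncapped: " + weighted(0.8, 100, 50));
        System.out.println("capped:   " + weightedCapped(0.8, 100, 50));
    }
}
```

In a Mahout setup the count would presumably come from the DataModel call mentioned above instead of the set intersection, and the weighting would live in a custom item similarity that wraps the Pearson one.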

This has nothing to do with Rescorer, though. What is your question there?
