LightFM model: internal scores and the sigmoid function

Published 2025-02-12 17:05:52

I have two questions related to the LightFM model:

  1. I read the article about the model and I see that it uses a sigmoid f(.) function. I also checked the library's Cython code and I see that the function is implemented there as well. However, the model is applicable to ranking items in the rating setting (ratings from 1 to 5). Why isn't the sigmoid harming the ranking? It returns values between 0 and 1, so why does the model still work for ratings?
  2. Am I correct that the score the model returns is q_u * p_i + b_u + b_i (see the article)? If not, how can I calculate the scores myself? Where do they come from, and why is their magnitude so high? I get scores ranging roughly from -100000 to +100000.

UPD1: I followed the comments and found out the following function:

cdef inline flt compute_prediction_from_repr(flt *user_repr,
                                             flt *item_repr,
                                             int no_components) nogil:

    cdef int i
    cdef flt result

    # Biases
    result = user_repr[no_components] + item_repr[no_components]

    # Latent factor dot product
    for i in range(no_components):
        result += user_repr[i] * item_repr[i]

    return result

It seems the scores are indeed given by the formula above, but it would be helpful if someone could also take a look - I'm not very good with Cython.
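For anyone who would rather not read Cython, the function above translates directly to plain Python/NumPy. This is just an illustrative re-implementation of the arithmetic, not LightFM code; in the real library the user/item representations can be retrieved with `get_user_representations()` / `get_item_representations()`. The convention below (latent factors first, bias as the last entry) mirrors the Cython function:

```python
import numpy as np

def compute_prediction_from_repr(user_repr, item_repr, no_components):
    """Python translation of LightFM's Cython scoring function.

    user_repr / item_repr are arrays of length no_components + 1:
    the first no_components entries are the latent factors,
    the last entry is the bias term.
    """
    # Biases: b_u + b_i
    result = user_repr[no_components] + item_repr[no_components]
    # Latent factor dot product: q_u . p_i
    for i in range(no_components):
        result += user_repr[i] * item_repr[i]
    return result

def score(user_repr, item_repr):
    """Vectorized equivalent: q_u . p_i + b_u + b_i."""
    return user_repr[:-1] @ item_repr[:-1] + user_repr[-1] + item_repr[-1]

rng = np.random.default_rng(0)
u = rng.normal(size=11)  # 10 latent components + 1 bias
v = rng.normal(size=11)
assert np.isclose(compute_prediction_from_repr(u, v, 10), score(u, v))
```

Note that nothing bounds this raw score, which is consistent with the large magnitudes mentioned in the question.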

UPD2: the sigmoid is used only for the logistic variant of the model. It's not used if you train with WARP.
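A quick illustration of why this distinction doesn't affect ranking (plain Python, not LightFM code): the sigmoid is strictly monotonic, so even in the logistic variant it only squashes the unbounded raw score into (0, 1) and never changes the ordering of items:

```python
import math

def sigmoid(x):
    """Squash an unbounded raw score into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

# Monotonicity: a higher raw score always maps to a higher probability,
# so rankings induced by raw scores and by sigmoid(scores) are identical.
raw_scores = [-100.0, -1.0, 0.0, 1.0, 100.0]
squashed = [sigmoid(s) for s in raw_scores]
assert squashed == sorted(squashed)
assert sigmoid(0.0) == 0.5
```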


Answers (1)

梦境 · answered 2025-02-19 17:05:52


The model still works for ratings despite the sigmoid because LightFM binarizes the recommendation problem.

For ratings from 1 to 5, with 5 being the highest:

  • ratings 4 and 5 indicate the user is interested in the item -> positive
  • ratings from 1 to 3 indicate the user is not interested in the item -> negative

This is the reason model performance is reported using an AUC score.
For an individual user, AUC corresponds to the probability that a randomly chosen positive item will be ranked higher than a randomly chosen negative item.
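That per-user definition of AUC can be sketched directly as a pairwise comparison. This is a plain NumPy illustration of the definition, not the implementation LightFM's `auc_score` uses:

```python
import numpy as np

def per_user_auc(pos_scores, neg_scores):
    """Probability that a randomly chosen positive item outscores
    a randomly chosen negative item (ties count as half a win)."""
    pos = np.asarray(pos_scores, dtype=float)[:, None]  # column vector
    neg = np.asarray(neg_scores, dtype=float)[None, :]  # row vector
    # Broadcasting yields every (positive, negative) pair.
    wins = (pos > neg).sum() + 0.5 * (pos == neg).sum()
    return wins / (pos.size * neg.size)

# Every positive outscores every negative -> perfect AUC of 1.0
assert per_user_auc([3.0, 2.0], [1.0, 0.0]) == 1.0
# Indistinguishable scores -> chance-level AUC of 0.5
assert per_user_auc([1.0], [1.0]) == 0.5
```

Because only pairwise orderings matter, the unbounded magnitude of the raw scores is irrelevant to this metric.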

In my case I applied the WARP loss and use the WARP score as an indicator of how close an item is to the user in feature space, i.e. how likely it is to be liked by the user. For probabilistic scores or ratings prediction, other, more sophisticated models may be considered.
