如何实现推荐引擎？

发布于 2024-10-03 20:20:15 字数 369 浏览 4 评论 0原文

请耐心等待我的写作，因为我的英语不熟练。

作为一名程序员，我想了解在推荐系统或相关系统下实现的算法或机器学习智能。例如，最明显的例子来自亚马逊。他们有一个非常好的推荐系统。他们会知道：如果您喜欢这个，您可能也会喜欢那个，或者类似的东西：喜欢这个和<的人所占的比例是多少em>那在一起。

当然我知道亚马逊是一个大网站，他们在这些系统上投入了大量的人力和金钱。但是，在最基本的核心上，我们如何在数据库中实现类似的功能？我们如何识别一个对象与其他对象的关系？我们怎样才能建立一个统计单元来处理这种事情呢？

如果有人能指出一些算法，我将不胜感激。或者，基本上，指出一些我们都可以学习的好的直接参考资料/书籍。谢谢大家！

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

挖个坑埋了你 2024-10-10 20:20:15

有两种不同类型的推荐引擎。

最简单的是基于项目，即“购买产品 A 的客户也购买了产品 B”。这很容易实现。存储稀疏对称矩阵 nxn（其中 n 是项目数）。每个元素 (m[a][b]) 是任何人购买商品“a”和商品“b”的次数。

另一个是基于用户的。那就是“像你这样的人常常喜欢这样的事情”。该问题的一个可能的解决方案是 k 均值聚类。即构建一组集群，将具有相似品味的用户放置在同一集群中，并根据同一集群中的用户提出建议。

一种更好但更复杂的解决方案是一种称为“受限玻尔兹曼机”的技术。此处有对它们的介绍

回复收藏 0 原文

浮生未歇 2024-10-10 20:20:15

第一次尝试可能如下所示：

//First Calculate how often any product pair was bought together
//The time/memory should be about Sum over all Customers of Customer.BoughtProducts^2
Dictionary<Pair<ProductID,ProductID>> boughtTogether=new Dictionary<Pair<ProductID,ProductID>>();
foreach(Customer in Customers)
{
    foreach(product1 in Customer.BoughtProducts)
        foreach(product2 in Customer.BoughtProducts)
            {
                int counter=boughtTogether[Pair(product1,product2)] or 0 if missing;
                counter++;
                boughtTogether[Pair(product1,product2)]=counter;
            }
}

boughtTogether.GroupBy(entry.Key.First).Select(group.OrderByDescending(entry=>entry.Value).Take(10).Select(new{key.Second as ProductID,Value as Count}));

首先，我计算每对产品一起购买的频率，然后按产品对它们进行分组，并选择与其一起购买的前 20 个其他产品。结果应该放入某种由产品 ID 键入的字典中。

对于大型数据库来说，这可能会变得太慢或消耗太多内存。

A first attempt could look like this:

//First Calculate how often any product pair was bought together
//The time/memory should be about Sum over all Customers of Customer.BoughtProducts^2
Dictionary<Pair<ProductID,ProductID>> boughtTogether=new Dictionary<Pair<ProductID,ProductID>>();
foreach(Customer in Customers)
{
    foreach(product1 in Customer.BoughtProducts)
        foreach(product2 in Customer.BoughtProducts)
            {
                int counter=boughtTogether[Pair(product1,product2)] or 0 if missing;
                counter++;
                boughtTogether[Pair(product1,product2)]=counter;
            }
}

boughtTogether.GroupBy(entry.Key.First).Select(group.OrderByDescending(entry=>entry.Value).Take(10).Select(new{key.Second as ProductID,Value as Count}));

First I calculate how often each pair of products was bought together, and then I group them by the product and select the top 20 other products bought with it. The result should be put into some kind of dictionary keyed by product ID.

This might get too slow or cost too much memory for large databases.

回复收藏 0 原文