Apache Mahout 数据集
我正在寻找可用于实现 Apache Mahout 推荐系统用例的数据集。我只知道 MovieLens 数据集 来自 GroupLens 研究小组。
有人知道可用于推荐系统实施的其他数据集吗?我对基于项目的数据集特别感兴趣,尽管其他数据集也是最受欢迎的。
I am looking for datasets that can be used for implementing recommendation system usecase of Apache Mahout. I know of only MovieLens Data Sets from GroupLens Research group.
Anyone knows any other datasets that can be used for recommendation system implementation? I am particularly interested in item-based data sets though other datasets are most welcome.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
这是来自 Mahout 的塞巴斯蒂安。
您可能会对捷克约会网站上的数据集感兴趣:http://www .occamslab.com/petricek/data/
顺便说一句,术语“基于项目”是指一种特殊的协作过滤方法,而不是数据集本身,它通常采用大多数协作过滤的用户项目评分三元组的常见形式方法与.
我们很乐意在我们的用户邮件列表中听取您的实验结果和经验(如果您想分享),网址为 [电子邮件受保护]
this is Sebastian from Mahout.
There is a dataset from a czech dating website available that might be of interest to you: http://www.occamslab.com/petricek/data/
Btw the term item-based refers to a special collaborative filtering approach not to the dataset itself, which is usually in the common form of user-item-rating tripels that most collaborative filtering approaches work with.
We would love to hear from your experimentation results and experiences (if you wanna share them) on our user mailinglist at [email protected]
在搜索数据集时,我发现很少有网站列出可用于数据挖掘的公开数据集。其中一些也可以用于 Mahout。
Bixo 实验室
UCI 数据集
KDnuggets一个>
While searching for data sets, I found few sites that list publicly available data sets which can used for data mining. Some of these can be used for Mahout too.
Bixo Labs
UCI Datasets
KDnuggets
您可以查看品友RTB竞价数据集
知乎:http://qr.ae/OrqgM
http://contest.ipinyou.com/data-release.html
You can look at iPinYou RTB Bidding Data Set
Quora : http://qr.ae/OrqgM
http://contest.ipinyou.com/data-release.html