Alfresco 的推荐引擎?
我想在 Alfresco 中实现类似 Amazon 的建议。
例如,如果员工搜索“2007 年财务报告”,搜索 UI 将显示相关文档,例如之前搜索过相同内容的用户下载/查看的文档。
它可能会显示 Lucene(Alfresco 使用的)无法找到的文档。
例如,是否有人将 Alfresco 与 Apache Mahout 或 pysuggest?
I want to implement Amazon-like recommendations in Alfresco.
For instance, if an employee searches for "financial reports 2007", the search UI will show related documents, for instance documents that were downloaded/viewed by users who previously searched for the same thing.
It might show documents that would not have been found by Lucene (which Alfresco uses).
For instance, has anyone integrated Alfresco with Apache Mahout or pysuggest?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
好处是 alfresco 默认支持引用(关联)。因此您可以定义文档之间的许多有用的关系。例如:
文档->用户=>查看-
文档->用户=>搜索-
文档->用户=>下载-按
文档->文档=>相关
文档->文档=>同年
...
您可以使用露天策略/行为捕获/实现大多数事件 (http://wiki.alfresco.com/wiki/Policy_Component)。例如:当 onCreate 事件发生时(创建文档),搜索具有相同作者的文档并将该文档链接到它们(添加关联)。
然后,您可以实现自定义搜索(可能是网页脚本),该搜索将返回结果,并且对于每个结果还返回其引用(关联)。
唯一让我担心的是,某些事件可能只能通过审核日志访问,我不知道如何使用 java 以编程方式捕获这些事件。
最后,您可以将这些内容输入到您的引擎中,引擎将从中学习。
The good thing is that alfresco by default supports references (associations). So you can define many usefull relations between documents. For example:
Document->User => viewed-by
Document->User => searched-by
Document->User => downloaded-by
Document->Document => Related-to
Document->Document => Same-year
...
You can catch/implement most of the events using alfresco policies/behaviours (http://wiki.alfresco.com/wiki/Policy_Component). For example: when onCreate event occurs (document is created) do a search for documents with same author and link this document (add associations) to them.
Then you can implement a custom search (webscript maybe) that will return results and for each result also return it's references (associations).
The only thing that worries me is that some events would probably be only accessible via audit log which I have no idea how to capture programatically using java.
In the end you can then feed this stuff to your engine that will learn on that.
有趣的话题!最近我在 Lucene/Solr 的背景下读到了 Mahout。 Lucidimaginations 有一些人深入参与 Mahout,请参阅:
由于 Lucene/Solr 是 Alfresco 的一部分,您可以考虑在搜索引擎级别集成它。此外,您还可以询问canoo company(瑞士巴塞尔)。过去,他们为我们基于 Solr 开发的多平台相关文档引擎提供了一个有趣的解决方案。
Interesting topic! Recently I read about Mahout in context of Lucene/Solr. There are some people deeply involved in Mahout at Lucidimaginations, see:
Since Lucene/Solr is part of Alfresco you could think about integrating it at search engine level. Additionally you could ask to canoo company (Basel, Switzerland). In the past they offered us an interesting solution for a multi-platform related-document engine they developed based on Solr.
我们已将 Mahout 集成到 Alfresco 中,以根据用户查看过的类似内容以及用户对内容的评分方式提供内容推荐。 Alfresco Mahout 集成代码位于
这提供了亚马逊风格的内容推荐服务。它可以扩展到推荐类似的搜索短语。
We've integrated Mahout into Alfresco to provide Content Recommendation based on similar content users have viewed and also based on how users have been rating content. The Alfresco Mahout integration code is available at
This provides Amazon style content recommendation services. It can be extended to recommend similar search phrases.