需要帮助选择数据挖掘/神经网络 API
我计划为我用 Java 开发的电子商务平台构建一个功能,以与 Amazon 大致相同的方式显示相关产品。我想探索一些不同的关联产品指标。
- 购买历史(同时购买)
- 按系列/类型相关(类似的产品分类)
- 有意相关(提升结果;“买这个!”)
虽然我可能能够开发自己的数据挖掘库,但这不会很便携式,我敢说它也不会很好。
有几个软件包可以做这类事情,但我觉得我无法评估哪个软件包或解决方案最适合我。任何轶事或个人经验的输入将不胜感激。
注意:我将其标记为神经网络,因为我参加了一次 Python 演讲,其中使用类神经网络进行数据挖掘,我不相信神经网络是这项工作的最佳选择。
I'm planning on building a feature for an e-commerce platform I developed in Java to display related products in much the same way Amazon does. There are a few different metrics for relating products that I want to explore.
- Purchase history (purchased at the same time)
- Related by family/type (similar product classifications)
- Intentionally related (boosting results; "Buy this!")
While I would probably be able to develop my own datamining library, it wouldn't be very portable and I dare say it wouldn't be very good either.
There are several packages out there for doing this sort of thing but I don't feel like I am in a position to evaluate which package or solution would work best for me. Any input anecdotal or from personal experience would be greatly appreciated.
Note: I've tagged this as Neural networking because of a python talk I was at where a neural-like-network was used for datamining, I'm not convinced a neural network is the best choice for this job.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
看看 Apache Mahout
Take a look at Apache Mahout
有一些用于数据挖掘的人工算法技术,例如 C4.5 或 < a href="http://en.wikipedia.org/wiki/ID3_algorithm" rel="nofollow noreferrer">ID3。这些算法进行分类。其他技术(例如蚂蚁聚类、神经网络或遗传算法)用于数据挖掘中的分类目的。
至于算法,我不太了解,但ID3/C4.5可以轻松编程。
希望这有帮助。
There are some artificial algorithm techniques used for data mining, such as C4.5 or ID3. These algorithm does classification. Other techniques such as ant clustering, neural networks or genetic algorithms are used for classification purposes in data mining.
As far as algorithms, I don't know much but ID3/C4.5 can be easily programmed.
Hope this helps.