Object-oriented Bayesian spam filtering?
I was wondering if there is any good and clean object-oriented programming (OOP) implementation of Bayesian filtering for spam and text classification? This is just for learning purposes.
Comments (6)
In French, but you should be able to find the download link :)
PHP Naive Bayesian Filter
nBayes - another C# implementation hosted on CodePlex
Here is an implementation of Bayesian filtering in C#: A Naive Bayesian Spam Filter for C# (hosted on CodeProject).
Maybe https://ci-bayes.dev.java.net/ or http://www.cs.cmu.edu/~javabayes/Home/node2.html?
I never played with it either.
Check out Chapter 6 of Programming Collective Intelligence
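Since the question is about a clean OOP design for learning purposes, here is a minimal sketch of what such a classifier could look like in Python. It is not taken from the book or from any of the libraries linked in this thread; the class name `NaiveBayesClassifier`, its methods, and the toy training messages are all made up for illustration. It assumes a multinomial naive Bayes model with add-one (Laplace) smoothing and a very simple regex tokenizer.

```python
import math
import re
from collections import Counter, defaultdict


class NaiveBayesClassifier:
    """Illustrative multinomial naive Bayes with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of training docs
        self.vocabulary = set()                  # every word seen during training

    @staticmethod
    def tokenize(text):
        # Lowercase and keep runs of letters, digits and apostrophes.
        return re.findall(r"[a-z0-9']+", text.lower())

    def train(self, text, label):
        words = self.tokenize(text)
        self.word_counts[label].update(words)
        self.doc_counts[label] += 1
        self.vocabulary.update(words)

    def log_scores(self, text):
        # Returns log P(label) + sum of log P(word | label) for each label.
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        words = self.tokenize(text)
        scores = {}
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            total_words = sum(counts.values())
            score = math.log(n_docs / total_docs)
            for word in words:
                # Add-one smoothing keeps unseen words from zeroing the product.
                score += math.log((counts[word] + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def classify(self, text):
        if not self.doc_counts:
            raise ValueError("classifier has not been trained")
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Toy usage with invented messages.
clf = NaiveBayesClassifier()
clf.train("cheap pills buy now", "spam")
clf.train("win money now, click here", "spam")
clf.train("meeting agenda for tomorrow", "ham")
clf.train("lunch tomorrow with the project team", "ham")

print(clf.classify("buy cheap money now"))          # spam
print(clf.classify("agenda for the team meeting"))  # ham
```

Working in log space avoids numeric underflow on long messages, and keeping the tokenizer as a separate method makes it easy to swap in something smarter later.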
I definitely recommend Weka, which is open-source data mining software written in Java.
As mentioned above, it ships with a bunch of different classifiers like SVM, Winnow, C4.5, Naive Bayes (of course) and many more (see the API doc).
Note that many classifiers are known to perform much better than Naive Bayes at spam detection and text classification.
Furthermore, Weka comes with a very powerful GUI…