用于垃圾邮件过滤的贝叶斯网络
我想使用贝叶斯网络机制进行垃圾邮件过滤。 您认为它应该看起来如何才是正确的网络拓扑? 朴素贝叶斯模型怎么样? (朴素贝叶斯模型有时称为贝叶斯分类器)
I want to use Bayesian Network mechanism for spam filtering. How do you think it should look a proper topology of the network? What about naive Bayes model? (The naive Bayes model is sometimes called a Bayesian classifier)
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
如果您不熟悉垃圾邮件过滤,最好从朴素贝叶斯分类器等简单的东西开始。 这样您就可以熟悉处理数据所涉及的问题(阅读电子邮件、对其进行分类、存储词典等),而不会陷入实际的分类代码中。 一旦您掌握了程序运行的基础知识,您就可以继续进行更高级的过滤类型。
我发现结束垃圾邮件一书中的讨论非常有帮助。
If you're new to spam filtering, it'd be a good idea to start with something simple like a naive Bayesian classifier. That way you get familiar with the issues involved in handling the data (reading the email, classifying it, storing your lexicon, etc.) without getting too bogged down in the actually classification code. Once you have the basics of your program working, you can go on to more advanced types of filtering.
I found the discussion in the book Ending Spam to be quite helpful.