How do you implement a good profanity filter?

Posted on 2024-07-08 18:00:28

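Many of us need to deal with user input, search queries, and situations where the input text may contain profanity or undesirable language. Often this needs to be filtered out.

Where can one find a good list of swear words in various languages and dialects?

Are there APIs available for sources that contain good lists? Or maybe an API that simply says "yes, this is clean" or "no, this is dirty", with some parameters?

What are some good methods of catching people who try to trick the system, such as a$$, azz, or a55?

Bonus points if you offer solutions for PHP. :)

Edit: Response to answers that say simply avoid the programmatic issue:

I think there is a place for this kind of filter when, for instance, a user can use a public image search to find pictures that get added to a sensitive community pool. If they can search for "penis", they will likely get many pictures of, yep, exactly that. If we don't want pictures of that, then blocking the word as a search term is a good gatekeeper, though admittedly not a foolproof method. Getting the list of words in the first place is the real question.

So what I'm really after is a way to determine whether a single token is dirty and then simply disallow it. I wouldn't bother trying to prevent a sentiment like the totally hilarious "long-necked giraffe". There's nothing you can do there. :)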

傲性难收 2024-07-15 18:00:29

I know this question is fairly old, but it's a commonly occurring one...

There is both a reason and a distinct need for profanity filters (see the Wikipedia entry), but they often fall short of being 100% accurate for two distinct reasons: context and accuracy.

It depends (wholly) on what you're trying to achieve - at its most basic, you're probably trying to cover the "seven dirty words" and then some... Some businesses need to filter the most basic of profanity: basic swear words, URLs or even personal information and so on, but others need to prevent illicit account naming (Xbox Live is an example) or far more...

User-generated content doesn't just contain potential swear words; it can also contain offensive references to:

Sexual acts
Sexual orientation
Religion
Ethnicity
Etc...

And potentially, in multiple languages. Shutterstock has developed basic dirty-words lists in 10 languages to date, but it's still basic and very much oriented towards their 'tagging' needs. There are a number of other lists available on the web.

I agree with the accepted answer that it's not a defined science, since language is continually evolving, but it is a challenge where a 90% catch rate is better than 0%. It depends purely on your goals - what you're trying to achieve, the level of support you have and how important it is to remove profanities of different types.

In building a filter, you need to consider the following elements and how they relate to your project:

Words/phrases
Acronyms (FOAD/LMFAO etc)
False positives (words, places and names like 'mishit', 'scunthorpe' and 'titsworth')
URLs (porn sites are an obvious target)
Personal information (email, address, phone etc - if applicable)
Language choice (usually English by default)
Moderation (how, if at all, you can interact with user generated content and what you can do with it)
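
As a rough illustration of the URL and personal-information items above, here is a minimal Python sketch that redacts URLs and email addresses before any word filtering runs. The regular expressions and the `redact` helper are my own assumptions for illustration, not taken from any particular library, and they are deliberately loose; real URL and email detection needs far more care.

```python
import re

# Loose, illustrative patterns (assumptions): they will miss some valid
# URLs and addresses and may over-match in unusual text.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def redact(text):
    """Replace email addresses and URLs with placeholder tags."""
    text = EMAIL_RE.sub("[email]", text)  # handle emails before URLs
    return URL_RE.sub("[url]", text)
```

Whether you redact, reject or queue such content for review ties back to the moderation item in the list.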

You can easily build a profanity filter that captures 90%+ of profanities, but you'll never hit 100%. It's just not possible. The closer you want to get to 100%, the harder it becomes... Having built a complex profanity engine in the past that dealt with more than 500K realtime messages per day, I'd offer the following advice:

A basic filter would involve:

Building a list of applicable profanities
Developing a method of dealing with derivations of profanities
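
To make the basic tier concrete, here is a minimal Python sketch (the question asked for PHP, but the idea ports directly). It assumes a tiny, deliberately mild word list and a handful of suffixes to cover simple derivations; `BLOCKLIST`, `SUFFIXES` and the function names are hypothetical, and a real list would be much longer.

```python
import re

# Hypothetical, deliberately mild word list; a real list would be far longer.
BLOCKLIST = {"ass", "damn"}

# Hypothetical suffixes used to catch simple derivations such as plurals.
SUFFIXES = ("", "s", "es", "ed", "ing")


def build_variants(words, suffixes=SUFFIXES):
    """Expand each base word with each suffix to form the lookup set."""
    return {word + suffix for word in words for suffix in suffixes}


VARIANTS = build_variants(BLOCKLIST)


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def is_dirty(text):
    """Return True if any whole token is a blocked word or a simple derivation."""
    return any(token in VARIANTS for token in tokenize(text))
```

Because it matches whole tokens only, this does not flag "classic" or "bass", but it also misses disguised spellings such as a$$, which is what the next tier is for.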

A moderately complex filter would involve (in addition to a basic filter):

Using complex pattern matching to deal with extended derivations (using advanced regex)
Dealing with Leetspeak (l33t)
Dealing with false positives
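
A sketch of how these three pieces might fit together in Python, under stated assumptions: the `LEET_MAP` substitution table, the mild `BLOCKLIST` and the `ALLOWLIST` of known-clean words are all hypothetical and far from complete. Each blocked word becomes a regex in which every letter may repeat, look-alike characters are mapped back to letters before matching, and the allowlist is checked against the raw token so that a disguised spelling cannot hide behind it.

```python
import re

BLOCKLIST = ["ass", "damn"]                      # hypothetical, deliberately mild
ALLOWLIST = {"class", "bass", "jazz", "assess"}  # hypothetical known-clean words

# Hypothetical look-alike table: maps common substitutions back to letters.
LEET_MAP = str.maketrans({"$": "s", "5": "s", "z": "s", "@": "a",
                          "4": "a", "3": "e", "1": "i", "0": "o"})


def build_pattern(words):
    """Let every letter repeat, so 'asssss' still matches 'ass'."""
    alternatives = ("".join(re.escape(ch) + "+" for ch in word) for word in words)
    return re.compile("|".join(alternatives))


PATTERN = build_pattern(BLOCKLIST)


def normalize(chunk):
    """Map look-alike characters to letters, then drop everything else."""
    return re.sub(r"[^a-z]", "", chunk.lower().translate(LEET_MAP))


def is_dirty_token(chunk):
    """Check one whitespace-separated chunk of input."""
    raw = re.sub(r"[^a-z0-9]", "", chunk.lower())
    if raw in ALLOWLIST:  # compared before leet mapping on purpose
        return False
    return PATTERN.search(normalize(chunk)) is not None


def is_dirty(text):
    """Return True if any chunk of the text looks like a blocked word."""
    return any(is_dirty_token(chunk) for chunk in text.split())
```

With these assumptions, a$$, azz, a55, "A.S.S" and "d4mn" are all flagged. The pattern is searched as a substring, so a clean word like "passing" is flagged too unless it is added to the allowlist; that is exactly the 'mishit'/'scunthorpe' problem, and it is why the allowlist needs ongoing maintenance.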

A complex filter would involve a number of the following (in addition to a moderate filter):

Whitelists and blacklists
Naive Bayesian inference filtering of phrases/terms
Soundex functions (where a word sounds like another)
Levenshtein distance
Stemming
Human moderators to help guide a filtering engine to learn by example or where matches aren't accurate enough without guidance (a self/continually-improving system)
Perhaps some form of AI engine
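
To illustrate just one item from this list, here is a minimal Python sketch of Levenshtein-distance matching against a blocklist. The distance function is the standard dynamic-programming edit distance; the `BLOCKLIST`, the `near_blocked` helper and the `max_distance` threshold are hypothetical choices for illustration, not how my engine worked.

```python
BLOCKLIST = ["damn", "bastard"]  # hypothetical, deliberately mild


def levenshtein(a, b):
    """Edit distance counting insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def near_blocked(token, blocklist=BLOCKLIST, max_distance=1):
    """Return the blocked words within max_distance edits of the token."""
    token = token.lower()
    return [word for word in blocklist if levenshtein(token, word) <= max_distance]
```

For example, `near_blocked("dmn")` returns `["damn"]`, and `near_blocked("bastrad", max_distance=2)` returns `["bastard"]`. The catch is that `near_blocked("dawn")` also returns `["damn"]`, so fuzzy matches like this are better treated as candidates for a whitelist check or human review than as automatic rejections.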
