儿童游戏中玩家间聊天的白名单
我们正在为儿童开发一款教育性多人游戏,并希望允许玩家使用白名单系统互相聊天。使用白名单聊天时,玩家将只能输入白名单中出现的单词。
我们知道白名单的局限性,但我们认为白名单聊天系统可以让我们的玩家在游戏中更好地表达自己,同时比审核或黑名单聊天提供更高级别的安全性。
虽然该系统很容易实现,但我们无法在网上找到“安全”单词白名单示例。有谁知道我们在哪里可以找到这样的列表,最好有允许我们在商业项目中使用它的许可证?
谢谢。
We're developing an educational multiplayer game for kids and want to allow players to chat with each other using a whitelist system. When using whitelist chat, players will be able to type only words which appear in the whitelist.
We're aware of the limitations of whitelists in general, but we think a whitelist chat system is something that would allow our players to express themselves better in the game, while allowing a higher level of security than moderated or blacklist chat.
While the system is easy enough to implement, we haven't been able to find a sample whitelist of "safe" words online. Does anyone know of where we can find such a list, preferably with a license that allows us to use it in a commercial project?
Thanks.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
我不相信简单的单词白名单就能解决这个问题。对于很多东西,有相当多的委婉说法,白名单永远不会阻止(例如“他像杂草一样生长”可以,“他正在种植杂草”则不然)。我们更不用提基本的“你想见面吗?”如果会议发生在游戏中,那还好,但如果会议发生在游戏外,那就非常危险了。还有一个问题是阻止罕见的、外来的或输入错误的单词,这可能会让你的聊天系统令人沮丧,以至于无法使用。
在我看来,绝对没有办法比得上积极且称职的人类版主所提供的安全性。当然,根据聊天流量和任何实时要求,使用人工进行此操作会遇到很多实际问题。然而,考虑到您的应用程序是针对儿童的,尽管成本要高得多,但人工审核可能是完全可以接受的。
第二种选择,但与人类调节能力相去甚远,是使用一些统计过滤器,例如 Bogofilter ,如果你训练得好,它会很乐意对任意文本进行排序。黑名单还有助于立即减少含有小孩子不应该(但通常)知道的单词的消息。您还需要一堆过滤器来减少包含电话号码、电子邮件、街道地址和网络链接等内容的消息。
也许具有最佳效果/成本比的方法是使用人工主持人并在多个统计过滤器的协助下更好地利用他们的时间。但请记住,如果存在恶意用户(即教室中除同龄孩子之外的其他用户),则无法确保不会发生任何可疑或危险的情况。
I do not believe that a simple whitelist of words will cut it. There are quite a few euphemisms for a lot of stuff out there, that a whitelist would never block (e.g. "he is growing like a weed" is fine, "he is growing weed" is NOT). And let's not mention the basic "would you like to meet?" which would be fine if the meeting were to happen in-game, but very dangerous if it were to happen out of it. Then there is also the issue of blocking rare, foreign or mistyped words, that might make your chat system frustrating enough that it would not be used.
In my opinion, there is absolutely no way you could ever match the security offered by an active and competent human moderator. Of course, depending on the volume of chat traffic and any real-time requirements there are quite a few practical issues with using humans for this. Considering that your application is targeted at children, however, human moderation might be quite acceptable, despite its much higer cost.
A second choise, but one very far from the abilities of human moderation, is to use some statistical filter such as Bogofilter, which will happily sort arbitrary text if you train it well. A blacklist would also help to immediately cut down messages with words that little kids should not (but usually do) know. You would also need a bunch of filters that would cut down messages with stuff like telephone numbers, email and street addresses and web links.
Perhaps the method with the best effectiveness/cost ratio would be to use human moderators assisted by multiple statistical filters to better make use of their time. Keep in mind, however, that if there are malicious users (i.e. anything else than same-age kids in a classroom) there is no way to make sure that nothing questionable or dangerous ever goes through.
你可以尝试标准的unix字典。 /usr/share/dict/words。但你必须修改它以删除顽皮的词语。
http://en.wikipedia.org/wiki/Words_%28Unix%29
http://www.openwall.com/wordlists/
You can try the standard unix dictionary. /usr/share/dict/words. But you'll have to modify it to remove the naughty words.
http://en.wikipedia.org/wiki/Words_%28Unix%29
http://www.openwall.com/wordlists/
虽然这并不能完全回答您的问题,但 Runescape 使用短语白名单,而不是单词。
Runescape 中的实现很尴尬,因为有太多短语可供选择。有时您必须浏览 3 或 4 个菜单才能找到您想要的短语。
如果您能想出更好的短语组织方式,那么这可能对您有用。
While this doesn't exactly answer your question, Runescape uses a white list of phrases, rather than words.
The implementation in Runescape is awkward, because there are so many phrases to choose from. You have to go through 3 or 4 menus sometimes to get to the phrase you want.
If you can come up with a better organization of phrases, then this might work for you.