在Python或Web服务中搜索语言识别库
有没有提供语言识别功能的工具?
就像我输入一个英文句子,它会提示字符串可能是英文。
我想我需要以下东西之一。
- Python语言识别库。
- 提供此类功能的 Web 服务。
有人可以帮忙吗?
Is there any tool that provides language recognition functionality?
Like if I input a sentence of English, it will suggest that the string may be English.
I think I need one of the following stuff.
- Python language recognition library.
- Web service that provides such functionality.
Anyone can help?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
我非常广泛地使用 GuessLanguage 模块。它可以检测 100 多种语言,并且非常适合长文本。英语很好。其他语言我不确定。如果您想一次处理短句子或推文,那么这是非常糟糕的,因为它是为对完整文章进行检测而设计的。
我的公司(社交媒体分析)目前也在寻找替代包,并正在探索 NLTK 作为一个选项。
I have used the GuessLanguage module pretty extensively. It detects 100+ languages and works really well for LONG pieces of text. English is very good. Other languages I am not sure. If you want to process short sentences at a time or tweets it is pretty bad as it was designed for doing detection on full articles.
My company (social media analytics) is currently looking for a replacement package as well and are exploring NLTK as an option.
您尝试过 nltk 吗?有关更多信息,请参阅此问题。
还有一个 来自 xerox 的 Web 服务,它声称可以执行您想要的操作。
Have you tried nltk? See this SO question for further information.
There is also a web service from xerox which claims to do what you want.