C/C++/Obj-C 的词性标记和命名实体识别
需要一些帮助!
我正在尝试在 Objective-C 中编写一些需要词性标记的代码,并且理想情况下还需要命名实体识别。我对“自己动手”没有太大兴趣,所以我正在寻找一个像样的库来用于此目的。显然,越准确越好,但我们在这里讨论的不是任何关键的东西——所以只要它总体上相当准确就足够了。
至少目前只有英语版本,但我不想自己对模型进行任何培训。因此,无论采用什么解决方案,都必须已经构建了英语语言模型。
最后,它必须通过商业友好的许可证(例如 BSD/Berkeley、LGPL)提供。不能使用 GPL 或任何类似的限制,但如果这是唯一的选择,我愿意支付少量费用来购买商业许可证。
C、C++ 或 Obj-C 代码都可以。
那么:有人熟悉可以在这里解决问题的东西吗?谢谢!!
need some help!
I'm trying to write some code in objective-c that requires part-of-speech tagging, and ideally also named entity recognition. I don't have much interest in "rolling my own", so I'm looking for a decent library to use for this purpose. Obviously the more accurate the better, but we're not talking anything critical here -- so as long as it's generally pretty accurate that's good enough.
It's going to be English-only, at least for the time being, but I don't want to have to do any training of models myself. So whatever the solution, it has to have an English language model already built.
And finally, it has to be available via a commercial-friendly license (e.g. BSD/Berkeley, LGPL). Can't do GPL or anything restrictive like that, though I'm open to paying a small amount for a commercial license if that's the only option.
C, C++ or Obj-C code is all fine.
So: Anyone familiar with something that'd do the trick here? Thanks!!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
我建议您查看 iOS 5 beta 发行说明。
I suggest you check out the iOS 5 beta release notes.
您可能已经发现,大多数免费提供的 NLP 代码都是用 python、perl 或 java 编写的。然而,快速浏览一下斯坦福大学的 NLP 工具页面会发现 C/C++ 中的一些内容可用。另一个工具列表可以在博客文章。
在词性标注器中,YamCha 是众所周知的,尽管我自己没有使用过它(作为一个 java/python/perl 人员)。
不幸的是,我无法推荐任何 NER nlp 工具。不过,我敢打赌 C/C++ 中有 maxent 或 svm 实现可供您使用:
1)创建训练数据并对其进行注释
2)定义你的特征
3)使用ml库
抱歉我无法提供更多帮助,但如果我想到其他任何内容,我会添加它。
也许一旦我对 Objective-C 有了一定程度的了解,我就会为它编写一个 NLP 库!
As you've probably figured out most of the NLP code that's freely available is in python, perl or java. However, a quick look at Stanford's NLP tools page shows a few things in C/C++ that are available. Another list of tools can be found at a blog post.
Of the POS taggers, YamCha is well-known, though I have not used it myself (being a java/python/perl guy).
Unfortunately, I cannot suggest any NER nlp tools. However, I bet there's a maxent or svm implentation in C/C++ that you can work with:
1) create your training data and annotate it
2) define your features
3) use the ml library
Sorry I can't be of more help, but if anything else comes to mind I'll add it.
Maybe once I figure out objective-c to a respectable degree I'll write an NLP library for it!