Java 命名实体识别库

发布于 2024-07-07 05:01:08 字数 1557 浏览 6 评论 0原文

如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

扫码二维码加入Web技术交流群

发布评论

需要 登录 才能够评论, 你可以免费 注册 一个本站的账号。

评论(4

回眸一笑 2024-07-14 05:01:08

您可能想查看我之前的回答 类似的问题。

除此之外,大多数较轻的 NER 系统在很大程度上取决于所使用的域。 例如,您会发现大量有关生物医学 NER 系统的工具和论文。 除了我之前的文章(如果你想做 NER,它已经包含了我的主要建议),这里还有一些你可能想要研究的工具:

附加说明:如果不对输入进行标记化,您将无法逃脱。 自然语言的标记化有点不简单,这就是为什么我建议您使用一个可以同时完成这两件事的工具箱。

You might want to have a look at one of my earlier answers to a similar problem.

Other than that, most lighter NER systems depend a lot on the domain used. You will find a whole lot of tools and papers about biomedical NER systems, for example. In addition to my previous post (which already contains my main recommendation if you want to do NER), here are some more tools you might want to look into:

  • The Stanford CER-NER
  • The Postech Biomedical NER System if you are interested in this particular domain
  • OpenCalais seems to be a commercial system. There are UIMA wrappers for OpenCalais but they seem dated. There is also a dictionary based Context-Mapper annotator for UIMA that may help you out. Be aware that UIMA implies significant overhead in learning curve ;-)
  • OpenNLP also have an NER tool.
  • Balie does NER, too, among other things.
  • ABNER does NER, but again its focused on the biomedical domain.
  • The JULIE Lab Tools from the university of Jena, Germany also do NER. They have standalone versions and UIMA analysis engines.

One additional remark: you won't get away without tokenization on the input. Tokenization of natural language is slightly non-trivial, that's why I suggest you use a toolbox that does both for you.

闻呓 2024-07-14 05:01:08

顺便说一句,我最近遇到了 OpenCalais ,它似乎具有我一直在寻找的功能。

BTW, I recently ran across OpenCalais which seems to havethe functionality I was looking after.

攀登最高峰 2024-07-14 05:01:08

您可能还想尝试 Alchemy API。 它类似于开放加来。

You might want to try Alchemy API as well. Its similar to Open Calais.

~没有更多了~
我们使用 Cookies 和其他技术来定制您的体验包括您的登录状态等。通过阅读我们的 隐私政策 了解更多相关信息。 单击 接受 或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
原文