
Coreference resolution using OpenNLP

Posted 2024-12-22 14:33:23

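I want to do coreference resolution using OpenNLP. The Apache documentation (Coreference Resolution) doesn't cover how to actually do it. Does anyone have any docs or a tutorial on how to do this?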


命硬 2024-12-29 14:33:23

I recently ran into the same problem and wrote up some blog notes for using OpenNLP 1.5.x tools. It's a bit dense to copy in its entirety, so here's a link with more details.

At a high level, you need to load the appropriate OpenNLP coreference model libraries and also the WordNet 3.0 dictionary. Given those dependencies, initializing the linker object is pretty straightforward:

```
// LinkerMode should be TEST
// Note: I tried LinkerMode.EVAL before realizing that this was the problem
Linker _linker = new DefaultLinker("lib/opennlp/coref", LinkerMode.TEST);
```

Using the Linker, however, is a bit less obvious. You need to:

Break the content down into sentences and the corresponding tokens
Create a Parse object for each sentence
Wrap each sentence Parse so as to indicate the sentence ordering:
```
final DefaultParse parseWrapper = new DefaultParse(parse, idx);
```
Iterate over each sentence parse and use the Linker to get the Mention objects from each parse:
```
final Mention[] extents =
   _linker.getMentionFinder().getMentions(parseWrapper);
```
Finally, use the Linker to identify the distinct entities across all of the Mention objects:
```
DiscourseEntity[] entities = _linker.getEntities(arrayOfAllMentions);
```
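
The snippets above don't show how `arrayOfAllMentions` gets built up across sentences. Here is a minimal sketch of just that accumulation step in plain Java, so it compiles without OpenNLP on the classpath. The class name `MentionCollector` and its `collect` method are hypothetical helpers made up for illustration, not part of OpenNLP; the `finder` parameter stands in for whatever produces the mentions of one sentence.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BiFunction;

// Hypothetical helper (not an OpenNLP class): gathers per-sentence results
// into one list, keeping document order.
final class MentionCollector {

    private MentionCollector() {
    }

    /**
     * Calls finder once per sentence with the sentence and its 0-based index,
     * then concatenates the returned arrays in sentence order.
     */
    static <S, M> List<M> collect(List<S> sentences,
                                  BiFunction<S, Integer, M[]> finder) {
        List<M> all = new ArrayList<>();
        for (int idx = 0; idx < sentences.size(); idx++) {
            // idx plays the role of the sentence number given to DefaultParse
            M[] found = finder.apply(sentences.get(idx), idx);
            all.addAll(Arrays.asList(found));
        }
        return all;
    }
}
```

With the OpenNLP calls from this answer, `finder` would presumably be something like `(parse, idx) -> _linker.getMentionFinder().getMentions(new DefaultParse(parse, idx))`, and the resulting list would be turned into an array with `toArray(new Mention[0])` before being handed to `_linker.getEntities(...)`.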
