单词着色和语法分析
我想根据文本中的单词的分类(类别/偏角等)对它们进行着色。我有一本功能齐全的字典,但问题是有很多歧义。例如,foedere
可以是动词“fornicate”或名词“treaty”的形式。
解决这些歧义或产生良好猜测的一般策略是什么?
谢谢!
I want to colorize the words in a text according to their classification (category/declination etc). I have a fully working dictionary, but the problem is that there is a lot of ambiguity. foedere
, for instance, can be forms of either the verb "fornicate" or the noun "treaty".
What the general strategies for solving these ambiguities or generating good guesses are?
Thanks!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
一般策略是首先对数据运行词性标注器确定单词类别(名词、动词等)。然而,这需要数据(上下文统计)和工具。 这篇研究论文可能是一个起点。
The general strategy is to first run a part-of-speech tagger on the data to determine the word category (noun, verb, etc.). That, however, requires data (context statistics) and tools. This research paper may be a starting point.