Related words
I am developing a program but am stuck on a particular hurdle. I need to find words associated with other words. E.g., "green" might be associated with "environment", "leaf", "earth", "wind", "electric", "hybrid", etc. All I can find is Google Sets. Is there a better resource?
Comments (2)
If you have a large text collection (say Wikipedia or Project Gutenberg), you can use co-occurrence scores to extract this kind of data. See e.g. Padó and Lapata and the references therein.
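To make the co-occurrence idea concrete, here is a minimal sketch that scores word pairs by pointwise mutual information (PMI) at the document level. It is a simpler stand-in for illustration, not the method from the cited paper, and the function names (`cooccurrence_pmi`, `related_words`) and the toy corpus are my own assumptions. On a real corpus you would want proper tokenization, stop-word filtering, and a minimum count threshold, since PMI tends to overrate rare words.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_pmi(documents, min_count=1):
    """Return {(word_a, word_b): PMI} for words sharing a document.

    `documents` is a list of token lists. Pairs are keyed in
    alphabetical order, so ("green", "leaf") rather than ("leaf", "green").
    """
    word_counts = Counter()  # number of documents containing each word
    pair_counts = Counter()  # number of documents containing both words
    for tokens in documents:
        vocab = sorted(set(tokens))  # count each word once per document
        word_counts.update(vocab)
        pair_counts.update(combinations(vocab, 2))

    n_docs = len(documents)
    scores = {}
    for (a, b), n_ab in pair_counts.items():
        if n_ab < min_count:
            continue  # skip pairs seen too rarely to be reliable
        # PMI = log( P(a, b) / (P(a) * P(b)) ), with document frequencies
        scores[(a, b)] = math.log((n_ab * n_docs) / (word_counts[a] * word_counts[b]))
    return scores


def related_words(word, scores, top_n=5):
    """Return up to `top_n` (other_word, PMI) pairs for `word`, best first."""
    ranked = []
    for (a, b), score in scores.items():
        if a == word:
            ranked.append((b, score))
        elif b == word:
            ranked.append((a, score))
    # Highest score first; ties broken alphabetically for stable output
    ranked.sort(key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]


if __name__ == "__main__":
    # Toy corpus, purely illustrative
    docs = [
        "green leaf tree".split(),
        "green energy wind".split(),
        "green leaf environment".split(),
        "red car engine".split(),
    ]
    for other, score in related_words("green", cooccurrence_pmi(docs)):
        print(f"{other}\t{score:.3f}")
```

Counting each word once per document keeps the sketch short; a sliding window over the tokens is the usual refinement when documents are long.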
I recently built a tool that mines this kind of association from Wikipedia database dumps by another method. It requires a lot of memory, though; other folks have tried to do the same using randomized methods.
If you're still looking for a resource of semantically related words, I've just recently developed an API that takes a query and returns semantically related words. It offers parts of speech, relationships to the query word, and a word similarity measurement.
https://kiingo.co/rapid-associations-api
Disclaimer: I'm the developer of this API.