是否可以从给定特定行的全文索引中获取相关单词的列表?
我想在我们的系统中对传入文本进行一些自动标记,我想知道全文索引是否能够提供给定索引行的单词排名列表。
如果没有,您对如何做到这一点有什么建议吗?我们已经有了一个自动标记系统,但其方法相当暴力(又名缓慢),并且并不总是能产生良好的单词列表。
I wuld like to do som automatic tagging of incoming text in our system and I was wondering if a full-text index is capable of providing a a ranked list of words given an indexed row.
If not, do you have any suggestions on how to do this? We already have a system in place for auto tagging but its fairly brute-force (aka. slow) in its method and it is not always yielding a good list of words.
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
我想是的。来自此处
给出的使用< code>sys.dm_fts_index_keywords_by_document 是
“我想知道全文索引包含多少个关键字”
“我想知道某个关键字是否是给定文档/行的一部分”
“我想知道某个关键字出现了多少次关键字在整个全文索引中出现了” (sum(occurrence_Count) where keywords=……)
“我想知道某个关键字在给定文档/行中出现了多少次”
“我想知道给定文档/行包含多少个关键字”
“我想检索属于给定文档/行的所有关键字doc/row”
I think so. From here
The scenarios given for using
sys.dm_fts_index_keywords_by_document
are“I want to know how many keywords the full-text index contains”
“I want to know if a keyword is part of a given doc/row”
“I want to know how many times a keyword appears in the whole full-text index” (sum(occurrence_Count) where keyword=……)
“I want to know how many times a keyword appears in a given doc/row”
“I want to know how many keywords a given doc/row contains”
“I want to retrieve all the keywords belonging to a given doc/row”