Search engine string matching
What is the typical algorithm used by online search engines to suggest corrections for misspelled words? I'm not necessarily talking about Google, but any site with a search feature, such as Amazon.com. Say I search for the word "shoo"; the site will come back and say "Did you mean: shoe".
Is this some variation of the Levenshtein distance algorithm? If they are using a full-text search framework (Lucene, for instance), is this perhaps built in? Or is it fully custom?
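To make the question concrete, here is a minimal sketch of the naive Levenshtein-based lookup I have in mind. The `VOCABULARY` list, the `suggest` helper, and the `max_distance` threshold are all hypothetical and chosen only for illustration; this is not a claim about how Lucene or any real site does it.

```python
from typing import Iterable, Optional


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def suggest(query: str, vocabulary: Iterable[str],
            max_distance: int = 1) -> Optional[str]:
    """Return the closest known word within max_distance, or None.

    Hypothetical helper: returns None when the query is already a known
    word or when nothing is close enough to be worth suggesting.
    """
    words = list(vocabulary)
    if not words or query in words:
        return None
    # Ties are broken alphabetically so the result is deterministic.
    best = min(words, key=lambda w: (levenshtein(query, w), w))
    return best if levenshtein(query, best) <= max_distance else None


# Assumed toy vocabulary; a real site would draw this from its search index.
VOCABULARY = ["shoe", "shirt", "sock", "boot"]

if __name__ == "__main__":
    print("Did you mean:", suggest("shoo", VOCABULARY))  # Did you mean: shoe
```

A linear scan over every known word like this presumably would not scale to a large catalogue, which is part of why I am asking what is typically done instead.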
I know the answer varies a lot; I'm just looking for pointers on how to get started with this (in an enterprise environment).