FORMSOF Thesaurus in SQL Server
Has anyone measured the speed of FORMSOF thesaurus searches when a given word has a large number of substitutes? For instance, I want to use the thesaurus to store common misspellings, and I expect 4-10 variations per word.
<expansion>
<sub>administration</sub>
<sub>administraton</sub>
<sub>aministraton</sub>
</expansion>
When you run a full-text search, how does performance degrade with that number of variations? For instance, I assume it has to run a separate full-text search for each variation and OR the results together?
Also, does having, say, 20-30K entries in the thesaurus XML file affect performance?
No, but performance testing is very often quite system-specific. I'd suggest putting together some sample data and running your own test cases; that is your best bet.
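For the sample data, one option is to generate a synthetic thesaurus file at the size you care about and time your queries against it. Below is a minimal sketch in Python, not a tested benchmark setup. The function names `single_deletions` and `build_thesaurus` are made up for illustration, the "misspellings" are just single-character deletions, and the wrapper elements follow the layout of the stock SQL Server thesaurus files as I understand it, so check the output against the file shipped with your instance.

```python
import random
from xml.sax.saxutils import escape


def single_deletions(word):
    """Return the distinct strings obtained by deleting one character from word."""
    seen = []
    for i in range(len(word)):
        candidate = word[:i] + word[i + 1:]
        if candidate and candidate not in seen:
            seen.append(candidate)
    return seen


def build_thesaurus(words, variants_per_word, seed=0):
    """Build thesaurus XML with one expansion set per word.

    Each set holds the word plus up to variants_per_word crude misspellings.
    A term is never reused across sets, so the sets stay disjoint.
    """
    rng = random.Random(seed)
    used = set(words)  # every term already placed in some expansion set
    lines = [
        '<XML ID="Microsoft Search Thesaurus">',
        '  <thesaurus xmlns="x-schema:tsSchema.xml">',
        '    <diacritics_sensitive>0</diacritics_sensitive>',
    ]
    for word in words:
        candidates = [c for c in single_deletions(word) if c not in used]
        rng.shuffle(candidates)
        chosen = candidates[:variants_per_word]
        used.update(chosen)
        lines.append('    <expansion>')
        for term in [word] + chosen:
            lines.append(f'      <sub>{escape(term)}</sub>')
        lines.append('    </expansion>')
    lines += ['  </thesaurus>', '</XML>']
    return "\n".join(lines) + "\n"


def write_thesaurus(path, words, variants_per_word):
    """Write the file as UTF-16 with a BOM (assumed to be what SQL Server expects)."""
    with open(path, "w", encoding="utf-16") as f:
        f.write(build_thesaurus(words, variants_per_word))
```

Calling `write_thesaurus` with a word list of 20-30K entries and 4-10 variants per word gives a file matching the sizes in the question. After copying it over the thesaurus file for your language and reloading it (I believe `sys.sp_fulltext_load_thesaurus_file` does that on SQL Server 2008 and later), you can time the same `FORMSOF(THESAURUS, ...)` query with and without the large file and compare.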