狮身人面像 + NoSQL 帮助
因此,我希望在 NoSQL 系统(例如 MongoDB、HBase、Cassandra 等)上运行 Sphinx。
现在,我们正在比较所有 NoSQL 系统。基本上,我们需要每秒数千次全文搜索来查询 50+ 百万行产品数据,因此我们正在尝试找到最高效的 NoSQL 系统。
不过,这是我们的问题。如果我们将任何NoSQL系统与Sphinx一起使用,当我们执行实际搜索时,搜索是否会与NoSQL系统本身进行任何交互,或者Sphinx是否会在数据索引时完成工作?如果只是Sphinx,那么NoSQL系统的性能不就只能是次要的了吗?
谢谢!
So I'm looking to run Sphinx over a NoSQL system such as MongoDB, HBase, Cassandra, etc.
Right now, we're comparing all the NoSQL systems out there. Basically, we need to query 50+ Million rows of product data with fulltext searches thousands of times a second, so we're trying to find the most efficient NoSQL system.
Here is our question, though. If we use any NoSQL system with Sphinx, when we perform the actual searches, will the search have any interaction with the NoSQL system itself, or will Sphinx be doing the work as it has the data indexed? If it's only Sphinx, then wouldn't the performance of the NoSQL system be only secondary?
Thanks!
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
使用最新的字符串属性,您可以完全删除数据库部分的搜索,这将更加高效。
Using the latest string attribute, you can cut of the database part of the search completely, that will be much more efficient.
据我了解,我认为你可以做到。因为我只熟悉mongodb和hbase,所以只能根据2个数据库来谈这个问题。您需要在索引器上做一些工作,并将数据/属性构建到sphinx索引文件中,并将主键(标记数据库中的唯一记录)也包含到其中(对于mongodb,它是object_id,对于hbase,它是行键),那么在进行全文搜索后,您可以通过主键从数据库中获取整个数据/属性。
此外,另一个全文搜索引擎也很好地支持no-sql db,它是solr。如果它的性能能够满足您的要求,您可以尝试一下。
As my understanding, I think you can do it. Because I'm only familiar with mongodb and hbase, i can only talk about this question based on the 2 databases. You need to do some work on the indexer and build the data/attributes into the sphinx index file, and to include the primary key(which mark the sole record in the database) into it too(for mongodb, it's object_id, for hbase, it's row key), then after you do the fulltext search, you can get the whole data/attributes from databases by the primary key.
Besides, another full-text search engine supports no-sql db very well, it's solr. you can try it if the performance of it can satisfy your request.