我阅读以下文档
-
- https://solr.apache.org/guide/8_11/getting-started-with-with-with-with-solrcloud.html
- https://solr-user.lucene..apache。 narkive.com/b1bl4zmq/does-cloudsolrserrserrserver-hit-zookeeper-for-for-ever-ever-request
基于这些,我的理解是
- 我们可以使用嵌入式的Zookeeper或
- 的嵌入式Zookeepers
- Zookeepers 的效果更好或正确 /最佳实践是什么?
solrcloud Zookeepers的查询,索引等在性能或击中solr节点 http:// localhost:8983/solr/getstarted/select?q = :)在所有碎片上查询,但我看到Solr kafka connect,solrj等有助于通过ZookeEpereper和Solr节点进行连接。
我看到一些地方说Zookeeper提供了故障转移和负载平衡,我还看到一些地方说Zookeeper维持群集状态,Zookeeper持有Solr Nodes的配置等。
是否有任何官方文档解释了该角色是什么solr/solrcloud中的Zookeeper,何时/为什么有人击中Zookeeper地址而不是Solr Node ?
任何线索都非常感谢。谢谢
I read the following documents
- https://solr.apache.org/guide/8_11/shards-and-indexing-data-in-solrcloud.html
- https://solr.apache.org/guide/8_11/distributed-requests.html
- https://solr.apache.org/guide/8_11/getting-started-with-solrcloud.html
- https://solr-user.lucene.apache.narkive.com/b1bL4ZMQ/does-cloudsolrserver-hit-zookeeper-for-every-request
Based on these my understanding is
- We can have SolrCloud with embedded Zookeeper or
- External ensemble of Zookeepers
- But irrespective of having any type of Zookeepers, for querying, indexing etc is hitting Zookeeper better in performance or hitting Solr node better or what is the correct / best practise?
To elaborate on #3 - distributed requests for example, says to hit the solr node (http://localhost:8983/solr/gettingstarted/select?q=:) to query across all shards but I see Solr kafka connect, Solrj etc facilitating to connect via Zookeeper and to Solr node.
I see some places it says ZooKeeper provides failover and load balancing, I also see some places saying ZooKeeper maintains cluster state, ZooKeeper holds the configs of Solr nodes etc
Is there any official documentation that explains what is the role of Zookeeper in Solr/SolrCloud and when/why would someone hit Zookeeper address and not Solr node?
Any leads are much appreciated. Thanks
发布评论