How can Apache ZooKeeper be used for sharding?

Posted on 2024-11-13 06:01:34

We are thinking of centralizing our configuration information, and ZooKeeper looks like a good choice. We are also interested in sharding and have a scheme in mind. The PoweredBy page [1] says that Rackspace and Yahoo are using ZooKeeper for sharding. We would appreciate pointers and details.

[1] https://cwiki.apache.org/confluence/display/ZOOKEEPER/PoweredBy


Comments (4)

迷荒 2024-11-20 06:01:34

Solr is going to use ZooKeeper for sharding. The ZooKeeper Integration design doc might be of interest to you.

淡笑忘祈一世凡恋 2024-11-20 06:01:34

I can think of two things that they could be referring to.

  1. They could be referring to the built-in ensemble features. With those you can set up a group management protocol for your service. As you add more servers to the ensemble, you effectively shard your pool out to greater numbers. The data is kept in sync between the member servers of the ensemble. This is especially useful for applications that shard the same data set out to multiple read pools, such as index servers, search servers, read caches, etc.

  2. They could be using ZooKeeper for configuration management. Assume that your application has thousands of clients that all need to update their config files at the same time. Say that your application currently accesses a data storage layer of 50 servers, but that pool needs to be sharded out to 200. You can set up a slaving relationship in which each existing server gets a 1-to-4 slave relationship. ZooKeeper could then be used to update the config file, in essence changing every client's copy within a second of each other.
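The 50-to-200 split in point 2 can be made concrete with a small sketch. It assumes modulo placement of keys on a stable hash, which the answer does not specify; the helper names (`stable_hash`, `shard_for`, `children_of`) are hypothetical. Under that assumption each old shard hands its keys to exactly four new shards, and the new mapping is the kind of small config value that could be published to all clients through ZooKeeper.

```python
import hashlib
from typing import List

OLD_SHARDS = 50   # pool size before the split (from the answer)
NEW_SHARDS = 200  # pool size after the split (from the answer)


def stable_hash(key: str) -> int:
    """Hash a key to a non-negative integer that is stable across processes."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def shard_for(key: str, shard_count: int) -> int:
    """Modulo placement. This scheme is an assumption, not part of the answer."""
    return stable_hash(key) % shard_count


def children_of(old_shard: int) -> List[int]:
    """New shards that inherit keys from one old shard under modulo placement.

    Because 50 divides 200, (h % 200) % 50 == h % 50, so the keys of old
    shard s can only land on s, s + 50, s + 100 or s + 150.
    """
    return list(range(old_shard, NEW_SHARDS, OLD_SHARDS))


# Usage: a key never leaves the four children of its old shard.
key = "user:12345"
old = shard_for(key, OLD_SHARDS)
new = shard_for(key, NEW_SHARDS)
print(f"{key}: old shard {old}, new shard {new}, children {children_of(old)}")
```

With a different placement scheme (range-based or consistent hashing, for example) the parent-to-child relationship would look different, but the idea of publishing one small mapping for every client to read stays the same.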

筑梦 2024-11-20 06:01:34

You should also take a look at how HBase uses ZooKeeper, specifically to maintain information about regions. This would be analogous to using ZooKeeper to maintain DB sharding info.
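As a rough illustration of region-style sharding info, here is a minimal sketch of a range-based shard table and its lookup. The table contents, the server names and the `locate` helper are all hypothetical; this is not HBase's actual data layout, only the general idea of mapping key ranges to servers, which is the kind of small mapping one could keep in a coordination service.

```python
from bisect import bisect_right
from typing import List, Tuple

# Hypothetical shard table: (start_key, server) pairs sorted by start_key.
# Each entry covers keys from its start_key up to the next entry's start_key.
# The empty start key makes the first entry cover everything below "g".
SHARD_TABLE: List[Tuple[str, str]] = [
    ("", "db-1"),
    ("g", "db-2"),
    ("n", "db-3"),
    ("t", "db-4"),
]


def locate(key: str, table: List[Tuple[str, str]]) -> str:
    """Return the server whose key range contains `key`."""
    starts = [start for start, _ in table]
    # bisect_right finds the first start_key greater than key;
    # the entry just before it is the one that owns the key.
    index = bisect_right(starts, key) - 1
    return table[index][1]


# Usage
for k in ("apple", "grape", "monkey", "zebra"):
    print(k, "->", locate(k, SHARD_TABLE))
```

Splitting a shard then amounts to inserting one new `(start_key, server)` entry, which keeps the shared table small enough to republish on every change.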

夜访吸血鬼 2024-11-20 06:01:34

It is used for managing the lookup table.
Since this lookup table has to be strongly consistent, this is where ZooKeeper comes into the picture.
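One reason consistency matters here is that two coordinators must not overwrite each other's changes to the table. ZooKeeper's conditional update takes an expected version and is rejected with a bad-version error when the node has changed in the meantime. The sketch below mimics that behaviour with an in-memory stand-in; `VersionedTable` and `BadVersion` are hypothetical names, and nothing here talks to a real ZooKeeper server.

```python
import threading
from typing import Dict, Tuple


class BadVersion(Exception):
    """Raised when a conditional write names a stale version."""


class VersionedTable:
    """In-memory stand-in for a single node holding the shard lookup table."""

    def __init__(self, table: Dict[str, str]) -> None:
        self._lock = threading.Lock()
        self._table = dict(table)
        self._version = 0

    def read(self) -> Tuple[Dict[str, str], int]:
        """Return a copy of the table together with its current version."""
        with self._lock:
            return dict(self._table), self._version

    def write(self, table: Dict[str, str], expected_version: int) -> int:
        """Replace the table only if nobody has written since `expected_version`."""
        with self._lock:
            if expected_version != self._version:
                raise BadVersion(
                    f"expected {expected_version}, current {self._version}"
                )
            self._table = dict(table)
            self._version += 1
            return self._version


# Usage: two coordinators read the same version, only the first write wins.
store = VersionedTable({"shard-3": "db-a"})
table_a, version_a = store.read()
table_b, version_b = store.read()

table_a["shard-3"] = "db-b"
store.write(table_a, version_a)      # succeeds, version becomes 1

table_b["shard-3"] = "db-c"
try:
    store.write(table_b, version_b)  # stale version, rejected
except BadVersion as err:
    print("second writer must re-read and retry:", err)
```

The losing writer re-reads the table, reapplies its change on top of the new state and tries again, so no update is silently lost.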
