Hadoop的单点故障可以通过集成Zookeeper来解决吗?
我正在寻找通过集成 Zookeeper 并允许 Hadoop 使用 Zookeeper 的命名空间来解决单点故障问题?是否可以?我们该如何解决这个问题 ?
I am looking for addressing the issue of single point failure by integrating Zookeeper and allow Hadoop to use the namespace from Zookeeper? Is it possible? how can we address this
?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(2)
当然,这在技术上可能是可行的,但这还没有以任何形式或形式实现。您不能只是将其插入 ZooKeeper...这需要一些工程工作才能完成(也许是您?!)。
以下是一些解决方案的指针:
Sure, it might be technically possible, but this is not something that has been implemented in any shape or form just yet. You can't just plug it into ZooKeeper... it'll take some engineering effort to get it done (by you perhaps?!).
Here are a couple pointers to solutions of this:
那效果不太好。 Zookeeper 的写入成本很高。 ZK集群中的节点越多,成本就越高。 ZK将所有数据保存在所有节点的内存中。相反,Namenode 是专注于写入的组件,特别是当您谈论 Hadoop 时,它的写入量非常大。
That would not work very well. Cost of write in Zookeeper is high. The more nodes you have in ZK cluster the higher the cost. ZK keeps all the data in memory on all nodes. Namenode is on the contrary write focused component, especially if you talking about Hadoop, which is super write heavy.