Is HBase stable and production-ready?

Posted 2024-07-25 00:13:20

For folks who have deployed HBase on their own clusters, do you feel that it's sufficiently stable for production use? What types of troubles or issues have you run into?

I do see a bunch of companies listed as using HBase in production (http://wiki.apache.org/hadoop/Hbase/PoweredBy), but I'm curious whether a lot of maintenance, patching, and fire drills go into keeping an HBase cluster up and running.

Comments (1)

七秒鱼° 2024-08-01 00:13:20

HBase is about to hit a major milestone with HBase-0.20. There is an alpha now, and an RC is coming soon. It has had major performance improvements. StumbleUpon reportedly serves its site live off the trunk version of HBase with no additional caching layer, as do others. So I'd say it's definitely ready for production use.

Ryan Rawson (of StumbleUpon) gave a nice talk on it at the NoSQL conference recently, mostly about how far it has come over the last 6 months. There are slides if you don't want to watch the whole thing. Apart from the performance improvements, the other major addition is that it now integrates with ZooKeeper, so the master is no longer a single point of failure.

HBase used to run into memory issues and fall over with small cell sizes because of a limitation of the file format. This has also been addressed with a new custom file format, which brought performance gains as well.

I've been experimenting with HBase for about a year now. I'm ready to trust 0.20 with a production service, which I wasn't quite ready to do with older versions. I recommend at least a 4 or 5 node dev cluster when experimenting.

I can't really comment on what it's like to look after a production cluster, because we have only just started running one. One thing that helps is that the mailing list is extremely active and the IRC channel is in constant use, so there is at least a very strong community to turn to for help.
