Hadoop 的变体
我的一个项目是比较 Hadoop 的不同变体,据说有很多,但是谷歌搜索对我来说效果不太好:(
有谁知道 Hadoop 的不同变体吗?我发现的唯一一个是哈洛普。
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
我的一个项目是比较 Hadoop 的不同变体,据说有很多,但是谷歌搜索对我来说效果不太好:(
有谁知道 Hadoop 的不同变体吗?我发现的唯一一个是哈洛普。
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
接受
或继续使用网站,即表示您同意使用 Cookies 和您的相关数据。
发布评论
评论(5)
我认为更通用的术语是“地图缩减”:
http://www.google.com/search?gcx=c&sourceid=chrome&ie=UTF-8&q=map+reduce&safe=active
I think the more generic term is "map reduce":
http://www.google.com/search?gcx=c&sourceid=chrome&ie=UTF-8&q=map+reduce&safe=active
不太清楚 Hadoop 的不同变体是什么意思。
但是,有很多公司提供商业支持或提供自己的 Hadoop 版本(开源和专有)。您可以在此处找到更多详细信息。
例如,MapR 有自己专有的 Hadoop 实现,但他们声称它与 Apache Hadoop 兼容,有点模糊,因为 Apache Hadoop 正在不断发展,而且 Hadoop API 还没有标准。 Cloudera 有自己的 Hadoop 版本 CDH,它基于 Apache Hadoop。 HortonWorks 是从 Yahoo 分离出来的,它为 Hadoop 提供商业支持。
您可以在此处找到更多信息。 Hadoop 发展得非常快,所以这可能有点陈旧。
Not exactly sure what you mean by different variants for Hadoop.
But, there are a lot of companies providing commercial support or providing their own versions of Hadoop (open-source and proprietary). You can find more details here.
For ex., MapR has it's own proprietary implementation of Hadoop, but they claim it's compatible with Apache Hadoop, which is a bit vague because Apache Hadoop is evolving and there are no standards around Hadoop API. Cloudera has it's own version of Hadoop CDH which is based on the Apache Hadoop. HortonWorks has been spun from Yahoo, which provides commercial support for Hadoop.
You can find more information here. Hadoop is evolving very fast, so this might be a bit stale.
这个可以参考
- hadoops文件系统,
- 或者它对地图缩减的有效支持......
- 或者更一般地说,云/分布式存储系统的想法。
最好阐明您对 hadoop 的哪些方面感兴趣。
当然,在学术上比较hadoop时,首先要开始看GFS——因为那是hadoop的起源。
This can refer to
- hadoops file system,
- or its effective support for map reduce...
- or even more generally, to the idea of cloud / distributed storage systems.
Best to clarify what aspects of hadoop you are interested In.
Of course when comparing hadoop academically, you must first start looking at GFS- since that is the origin of hadoop.
抛开 HBase,我们可以将 hadoop 视为两层:存储层和 Map-Reduce 层。
存储层具有以下非常不同的实现,比较起来很有趣:标准 hadoop 文件系统、Cassandra 上的 HDFS(Brisk)、S3 上的 HDFS、MapR hadoop 实现。
MapR 也改变了 Map-reduce 的实现。
Taking aside HBase we can see hadoop as two layers - storage layer and map-reduce layer.
Storage layer has the following really different implementation which would be interesting to compare: standard hadoop file system, HDFS over Cassandra (Brisk), HDFS over S3, MapR hadoop implementation.
MapR also have changed Map-reduce implementation.
这个网站http://www.nosql-database.org/有很多NoSql的列表DB 就在那里。也许它可以帮助你。
This site http://www.nosql-database.org/ has a list of a lot of NoSql DBs out there. Maybe it can help you.