HBase、MapReduce 和 HDFS 可以在安装并运行 Hadoop 的单台机器上工作吗?
我正在研究一个搜索引擎设计,该引擎将在云上运行。 我们刚刚开始,对 Hdoop 还没有太多了解。 谁能告诉我 HBase、MapReduce 和 HDFS 是否可以在安装并运行 Hdoop 的单台机器上工作?
I am working on a search engine design, which is to be run on cloud.
We have just started, and have not much idea about Hdoop.
Can anyone tell if HBase , MapReduce and HDFS can work on a single machine having Hdoop installed and running on it ?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(3)
是的,你可以。您甚至可以创建一个虚拟机并在一台“计算机”上运行它(这就是我所拥有的:))。
关键是简单地以“伪分布式模式”安装 Hadoop,甚至在Hadoop 快速入门中进行了描述。
如果您使用 Cloudera 发行版,他们甚至已经在 RPM 中创建了所需的配置。 在此处查看更多信息。
Yes you can. You can even create a Virtual Machine and run it on there on a single "computer" (which is what I have :) ).
The key is to simply install Hadoop in "Pseudo Distributed Mode" which is even described in the Hadoop Quickstart.
If you use the Cloudera distribution they have even created the configs needed for that in an RPM. Look here for more info in that.
HTH
是的。在我的开发环境中,我运行
此外,我运行我的应用程序,以及映射和减少任务跟踪器启动的任务。
在同一台机器上运行如此多的进程会导致对 CPU 核心、内存和磁盘 I/O 的大量争用,因此对于高性能来说肯定不是很好,但除了可用资源量之外没有任何限制。
Yes. In my development environment, I run
In addition, I run my applications, and map and reduce tasks launched by the task tracker.
Running so many processes on the same machine results in a lot of contention for CPU cores, memory, and disk I/O, so it's definitely not great for high performance, but there is no limitation other than the amount of resources available.
同样,我在一台计算机上运行 hadoop/hbase/hive。
如果你真的想在一台计算机上看到分布式计算,请获取大量 RAM、一些硬盘空间,然后像这样进行 -
这可能只是一个实验,因为除非你有一个像样的多CPU或多核系统,这样的配置实际上会消耗更多的维护本身而不是给你带来任何性能。
祝你好运。
--l4l
same here, I am running hadoop/hbase/hive on a single computer.
If you really really want to see distributed computing on a single computer, grab lots of RAM, some hard disk space and go like this -
this could just be an experiment, because unless you have a decent multi-cpu or multi-core system, such a configuration will actually consume more on maintaining itself than giving you any performance.
gud luck.
--l4l