Idle hadoop master - how do I make it do some work?
I have launched a small cluster of two nodes and noticed that the master stays completely idle while the slave does all the work. I was wondering how to let the master run some of the tasks. I understand that a dedicated master may be necessary for a larger cluster, but on a 2-node cluster it seems like overkill.
Thanks for any tips,
Vaclav
Some more details:
The two boxes have 2 CPUs each. The cluster was set up on Amazon Elastic MapReduce, but I am running hadoop from the command line.
The cluster I just tried it on has:
Hadoop 0.18
java version "1.6.0_12"
Java(TM) SE Runtime Environment (build 1.6.0_12-b04)
Java HotSpot(TM) Server VM (build 11.2-b01, mixed mode)
hadoop jar /home/hadoop/contrib/streaming/hadoop-0.18-streaming.jar \
-jobconf mapred.job.name=map_data \
-file /path/map.pl \
-mapper "map.pl x aaa" \
-reducer NONE \
-input /data/part-* \
-output /data/temp/mapped-data \
-jobconf mapred.output.compress=true
where the input consists of 18 files.
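(In case it matters: I have been confirming that tasks land only on the slave via the jobtracker's status pages and the job client; 50030 is the default jobtracker web UI port in 0.18, if I am not mistaken, so adjust if yours differs.)
$ bin/hadoop job -list        # running jobs, from the command line
# per-node running task counts are on the jobtracker web UI:
#   http://<master-hostname>:50030/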
3 Answers
Actually, the hadoop master is not the one doing the work (the tasks you run).
You can start a datanode and a tasktracker on the same machine the master runs on.
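A minimal sketch of that, assuming a stock 0.18 layout with the daemon script under bin/ of the hadoop install directory:
$ bin/hadoop-daemon.sh start datanode      # node now stores HDFS blocks
$ bin/hadoop-daemon.sh start tasktracker   # node now accepts map/reduce tasks
With both daemons up, the jobtracker should begin scheduling tasks on the master box as well.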
Steve Loughran on the hadoop-users list suggested that starting a tasktracker on the master would do the trick.
$ bin/hadoop-daemon.sh start tasktracker
Seems to work. You may want to adjust the number of slots for this tasktracker.
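If the master should take on less work than the slave (to leave headroom for the namenode and jobtracker daemons), the slot counts can be lowered in that node's configuration. A minimal sketch, assuming the 0.18 property names and the 2-CPU boxes from the question; the values are illustrative:
<!-- in conf/hadoop-site.xml on the master (defaults are 2 per type) -->
<property>
  <name>mapred.tasktracker.map.tasks.maximum</name>
  <value>1</value>
</property>
<property>
  <name>mapred.tasktracker.reduce.tasks.maximum</name>
  <value>1</value>
</property>
Restart the tasktracker after changing these for the new limits to take effect.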
It may be different for Hadoop 0.18, but you can try adding the IP address of the master to the conf/slaves file, then restart the cluster.
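A minimal sketch of that, assuming the stock start/stop scripts and a made-up master address of 10.0.0.1 (substitute your own):
# on the master, from the hadoop install directory
$ echo "10.0.0.1" >> conf/slaves          # the master will now also host a datanode/tasktracker
$ bin/stop-all.sh
$ bin/start-all.sh                        # restart so the new slaves list takes effect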