How to invoke a Partitioner in Hadoop v0.21
In my application I want to create as many reduce tasks as possible based on the keys. Currently my implementation writes all the keys and values to a single (reducer) output file. To solve this I wrote a partitioner, but the class is never invoked. The partitioner should be called after the map task and before the reduce task, but it is not. The code of the partitioner is the following:
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Partitioner;

public class MultiWayJoinPartitioner extends Partitioner<Text, Text> {
    @Override
    public int getPartition(Text key, Text value, int nbPartitions) {
        // Mask off the sign bit so the hash is non-negative, then map the
        // key into one of the nbPartitions reducer buckets. (Note: Text has
        // no getFirst() method — that belongs to a composite key class such
        // as TextPair — so the hash is taken on the key itself here.)
        return (key.hashCode() & Integer.MAX_VALUE) % nbPartitions;
    }
}
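As a side note, the masking-and-modulo arithmetic in `getPartition` can be sanity-checked in plain Java. The sketch below uses `String` keys in place of `Text` purely for illustration; `partitionFor` is a hypothetical stand-in for the partitioner method:

```java
public class PartitionDemo {
    // Same arithmetic as getPartition: clear the sign bit, then take the modulo.
    static int partitionFor(String key, int nbPartitions) {
        return (key.hashCode() & Integer.MAX_VALUE) % nbPartitions;
    }

    public static void main(String[] args) {
        int nbPartitions = 4;
        for (String key : new String[] {"a", "b", "join-key", "another-key"}) {
            int p = partitionFor(key, nbPartitions);
            // The result is always a valid reducer index: 0 <= p < nbPartitions,
            // even for keys whose raw hashCode() is negative.
            assert p >= 0 && p < nbPartitions;
            System.out.println(key + " -> partition " + p);
        }
    }
}
```

The `& Integer.MAX_VALUE` mask matters because `hashCode()` may be negative, and in Java the `%` operator would then return a negative (invalid) partition number.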
Is this code correct for partitioning the output based on the keys and values, and will the partitioned output be transferred to the reducers automatically?
You don't show all of your code, but there is usually a class (often called the "Job" or "MR" class) that configures the mapper, reducer, partitioner, etc. and then actually submits the job to Hadoop. In this class you will have a job configuration object with many properties, one of which is the number of reducers. Set this property to whatever number your Hadoop configuration can handle.
Once the job is configured with a given number of reducers, that number will be passed into your partitioner (which looks correct, by the way). Your partitioner will then return the appropriate reducer/partition for each key/value pair. That's how you get as many reducers as possible.
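A minimal driver sketch along these lines is shown below, using the new (`org.apache.hadoop.mapreduce`) API that the `Partitioner<Text, Text>` signature implies. The mapper and reducer class names are hypothetical placeholders, and the reducer count of 4 is arbitrary; this is a configuration sketch, not the asker's actual driver:

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class MultiWayJoinDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = new Job(conf, "multi-way join"); // Job.getInstance(conf, ...) in later versions

        job.setJarByClass(MultiWayJoinDriver.class);
        job.setMapperClass(MultiWayJoinMapper.class);   // hypothetical mapper class
        job.setReducerClass(MultiWayJoinReducer.class); // hypothetical reducer class

        // Register the custom partitioner explicitly; without this line,
        // Hadoop silently falls back to the default HashPartitioner, which is
        // the usual reason a custom getPartition() is never invoked.
        job.setPartitionerClass(MultiWayJoinPartitioner.class);

        // The partitioner is only consulted when there is more than one
        // reducer; the default of 1 sends everything to a single output file.
        job.setNumReduceTasks(4);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

The two calls that directly address the question are `setPartitionerClass` (so the custom class is used at all) and `setNumReduceTasks` (so there is more than one partition to choose between).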