How to pass objects to Mappers and Reducers
I have an application that runs on Hadoop. How can I pass objects to the mappers and reducers so they can use them while processing the data? For example, I declare a FieldFilter object to filter the rows processed in the mappers. The filter contains many filter rules, which are specified by users. So I am wondering: how can I pass the filter and its rules to the mappers and reducers?
My idea is to serialize the object into a String, pass the string around via the configuration, and then reconstruct the object from the string. But that does not seem good to me! Are there any other approaches?
Thanks!
public class FieldFilter {
    private final ArrayList<FieldFilterRule> rules = new ArrayList<FieldFilterRule>();

    public FieldFilter addRule(FieldFilterRule... rules) {
        for (int i = 0; i < rules.length; i++) {
            this.rules.add(rules[i]);
            rules[i].setFieldFilter(this);
        }
        return this;
    }
}
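For what it's worth, the serialize-to-String idea can be sketched with standard Java serialization plus Base64; only the `conf.set(...)`/`conf.get(...)` calls are Hadoop-specific, and they are shown in comments. This is a minimal sketch: `FieldFilterRule` here is a simplified stand-in (one field, no `setFieldFilter` back-reference), and both classes are assumed to implement `Serializable`.

```java
import java.io.*;
import java.util.ArrayList;
import java.util.Base64;

// Simplified stand-ins for the classes in the question; the real ones
// would need to implement Serializable for this approach to work.
class FieldFilterRule implements Serializable {
    final String field;
    FieldFilterRule(String field) { this.field = field; }
}

class FieldFilter implements Serializable {
    final ArrayList<FieldFilterRule> rules = new ArrayList<FieldFilterRule>();
    FieldFilter addRule(FieldFilterRule... rs) {
        for (FieldFilterRule r : rs) rules.add(r);
        return this;
    }
}

public class FilterCodec {
    // Driver side: serialize the filter to a Base64 string.
    // In the job driver you would then call conf.set("my.field.filter", encoded).
    static String encode(FieldFilter f) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(bos)) {
            oos.writeObject(f);
        }
        return Base64.getEncoder().encodeToString(bos.toByteArray());
    }

    // Task side: reconstruct the filter from the string,
    // e.g. from conf.get("my.field.filter") inside Mapper.setup().
    static FieldFilter decode(String s) throws IOException, ClassNotFoundException {
        byte[] bytes = Base64.getDecoder().decode(s);
        try (ObjectInputStream ois = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return (FieldFilter) ois.readObject();
        }
    }
}
```

The string survives the round trip through the job configuration unchanged, so the reconstructed filter is equivalent to the one the driver built.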
Answers (2)
You want to use setClass() on the Configuration, as you can see here. You can then instantiate your class via newInstance(). Remember to do the instantiation in the setup() method of the mapper/reducer, so that you don't instantiate the filter every time the map/reduce method is invoked. Good luck.
Edit: I should add that you have access to the Configuration through the context, and that is how you will get the class you need. There is a getClass() method in the Configuration API.
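To make the mechanism concrete without requiring a Hadoop classpath, here is a pure-Java sketch of what setClass()/getClass() do under the hood: the class *name* is stored as a configuration string, and the class is re-resolved and instantiated on the task side. In real Hadoop code the equivalents would be `conf.setClass("my.filter.class", NonEmptyFilter.class, Filter.class)` at job setup and `ReflectionUtils.newInstance(conf.getClass(...), conf)` inside Mapper.setup(); the `Filter` interface and key name below are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical filter interface, standing in for the user's FieldFilter contract.
interface Filter {
    boolean accept(String row);
}

// A concrete filter that must have a no-arg constructor so it can be
// instantiated reflectively, just like classes passed via setClass().
class NonEmptyFilter implements Filter {
    public boolean accept(String row) { return !row.isEmpty(); }
}

public class ClassConfigSketch {
    // Stand-in for org.apache.hadoop.conf.Configuration's string key/value store.
    static final Map<String, String> conf = new HashMap<>();

    // What Configuration.setClass() does: store the fully qualified class name.
    static void setClass(String key, Class<? extends Filter> cls) {
        conf.put(key, cls.getName());
    }

    // What conf.getClass() + newInstance() do on the task side:
    // resolve the name back to a Class and construct a fresh instance.
    static Filter newFilterInstance(String key) throws Exception {
        Class<?> cls = Class.forName(conf.get(key));
        return (Filter) cls.getDeclaredConstructor().newInstance();
    }
}
```

Because only the class name crosses the wire, any rule *data* (as opposed to rule *code*) still has to travel separately, e.g. via ordinary conf.set() string properties read in setup().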
Serialize FieldFilter, put it in HDFS, and later read it in the mapper/reducer functions using the HDFS API. If you have a large cluster, you might want to increase the replication factor (which defaults to 3) for the serialized FieldFilter file, since a large number of mapper and reducer tasks will be reading it.
If the new MapReduce API is used, the serialized FieldFilter file can be read in the Mapper.setup() function, which is called during initialization of the map task. I could not find something similar for the old MapReduce API.
You can also consider using DistributedCache to distribute the serialized FieldFilter file to the different nodes.
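The write-to-shared-storage pattern above can be sketched with the local filesystem standing in for HDFS, so it runs anywhere. On a real cluster you would write with `FileSystem.create(path)` at job-submission time and read with `FileSystem.open(path)` (or from a file shipped via DistributedCache) inside Mapper.setup(); the single-rule `SerializableFilter` below is a hypothetical simplification of the question's FieldFilter.

```java
import java.io.*;
import java.nio.file.*;

// Hypothetical single-rule filter standing in for FieldFilter;
// it must be Serializable for this approach to work.
class SerializableFilter implements Serializable {
    final String fieldName;
    SerializableFilter(String fieldName) { this.fieldName = fieldName; }
}

public class FilterDistribution {
    // Driver side: serialize the filter to a path all tasks can reach
    // (on a cluster this would be an HDFS path, written via FileSystem.create()).
    static void writeFilter(Path path, SerializableFilter f) throws IOException {
        try (ObjectOutputStream oos = new ObjectOutputStream(Files.newOutputStream(path))) {
            oos.writeObject(f);
        }
    }

    // Task side: deserialize once per task; in Hadoop this belongs in
    // Mapper.setup() so the read happens once, not per record.
    static SerializableFilter readFilter(Path path) throws IOException, ClassNotFoundException {
        try (ObjectInputStream ois = new ObjectInputStream(Files.newInputStream(path))) {
            return (SerializableFilter) ois.readObject();
        }
    }
}
```

DistributedCache changes only where the file comes from on the task side (a local cached copy instead of an HDFS read); the serialize-once, deserialize-in-setup() shape stays the same.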