Re-reading an Avro File and Running a Simple MapReduce Computation on It - Alibaba Cloud Developer Community

2017-10-19 1786

Summary:

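import java.io.IOException;

import org.apache.avro.mapred.AvroKey;
import org.apache.avro.mapreduce.AvroJob;
import org.apache.avro.mapreduce.AvroKeyInputFormat;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// UserActionLog is the Avro-generated record class; import it from its own
// package if it does not live alongside this class.

/** Reads an Avro file of UserActionLog records and counts the records per province. */
public class ReadAvroInput {

    /** Emits (province, 1) for every UserActionLog record in the input. */
    public static class ReadAvroInputMap extends Mapper<AvroKey<UserActionLog>, NullWritable, Text, IntWritable> {

        // Reused across map() calls to avoid allocating new objects per record.
        private final Text oKey = new Text();
        private final IntWritable ONE = new IntWritable(1);

        @Override
        protected void map(AvroKey<UserActionLog> key, NullWritable value, Context context)
                throws IOException, InterruptedException {
            // AvroKeyInputFormat delivers each record as the key; the value is a NullWritable placeholder.
            UserActionLog keyData = key.datum();
            // getProvience() keeps the spelling of the generated accessor.
            oKey.set(keyData.getProvience().toString());
            context.write(oKey, ONE);
        }
    }

    /** Sums the counts for each province. Also used as the combiner. */
    public static class ReadAvroInputReducer extends Reducer<Text, IntWritable, Text, IntWritable> {

        private final IntWritable oValue = new IntWritable();

        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable value : values) {
                sum += value.get();
            }
            oValue.set(sum);
            context.write(key, oValue);
        }
    }

    public static void main(String[] args) throws IOException, ClassNotFoundException, InterruptedException {
        Configuration configuration = new Configuration();
        Job job = Job.getInstance(configuration);
        job.setJarByClass(ReadAvroInput.class);
        job.setJobName("Re-read an Avro file and run an MR computation");

        job.setMapperClass(ReadAvroInputMap.class);
        // Summation is associative and commutative, so the reducer can double as the combiner.
        job.setCombinerClass(ReadAvroInputReducer.class);
        job.setReducerClass(ReadAvroInputReducer.class);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        // Read Avro container files and deserialize each record with the UserActionLog schema.
        job.setInputFormatClass(AvroKeyInputFormat.class);
        AvroJob.setInputKeySchema(job, UserActionLog.getClassSchema());

        FileInputFormat.addInputPath(job, new Path("/ReducerJoin/part-r-00000.avro"));
        Path outputPath = new Path("/ReadAvroInput");
        // Delete any previous output; the job fails if the output directory already exists.
        outputPath.getFileSystem(configuration).delete(outputPath, true);
        FileOutputFormat.setOutputPath(job, outputPath);
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}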
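
To see what the job computes without a cluster, the map, combine, and reduce steps can be imitated in plain Java. This is only a rough sketch: `ProvinceCountSketch`, its method names, and the sample province values are made up for illustration, and no Hadoop or Avro classes are involved. It also suggests why reusing the reducer as the combiner is safe here: summing partial sums gives the same totals as summing the raw ones.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java stand-in for the job above; no Hadoop or Avro classes are used.
final class ProvinceCountSketch {

    // "Map + combine" for one input split: emit (province, 1) per record and
    // sum locally, which is the effect of running the reducer as a combiner.
    static Map<String, Integer> mapAndCombine(List<String> provincesInSplit) {
        Map<String, Integer> partial = new TreeMap<>();
        for (String province : provincesInSplit) {
            partial.merge(province, 1, Integer::sum);
        }
        return partial;
    }

    // "Reduce": sum the partial counts that arrive for each key.
    static Map<String, Integer> reduce(List<Map<String, Integer>> partials) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map<String, Integer> partial : partials) {
            partial.forEach((province, count) -> totals.merge(province, count, Integer::sum));
        }
        return totals;
    }

    public static void main(String[] args) {
        // Hypothetical province values, split across two mappers.
        List<String> split1 = List.of("Zhejiang", "Jiangsu", "Zhejiang");
        List<String> split2 = List.of("Jiangsu", "Zhejiang");

        List<Map<String, Integer>> partials = new ArrayList<>();
        partials.add(mapAndCombine(split1));
        partials.add(mapAndCombine(split2));

        System.out.println(reduce(partials)); // {Jiangsu=2, Zhejiang=3}
    }
}
```

In the real job the framework decides whether and how often the combiner runs, so the reducer must produce the same result either way; a plain sum meets that requirement.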

The UserActionLog class is generated from an Avro schema by running a mvn command.
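
The mapper calls `toString()` on the value returned by `getProvience()`. A likely reason, assuming the class was generated with Avro's default string type, is that the accessor returns a `CharSequence` rather than a `String`. The sketch below is a hand-written stand-in (the name `UserActionLogSketch` is hypothetical, and the real generated class has more members); it only illustrates why the conversion matters before the value is used as a key.

```java
// Hand-written stand-in for illustration; the real UserActionLog is generated
// by Avro and is not reproduced here.
final class UserActionLogSketch {

    // Assumed type: CharSequence, as with Avro's default string handling.
    private CharSequence provience;

    CharSequence getProvience() {
        return provience;
    }

    void setProvience(CharSequence value) {
        this.provience = value;
    }

    public static void main(String[] args) {
        UserActionLogSketch log = new UserActionLogSketch();
        // StringBuilder stands in for any non-String CharSequence implementation.
        log.setProvience(new StringBuilder("Zhejiang"));

        CharSequence raw = log.getProvience();
        System.out.println(raw.equals("Zhejiang"));            // false
        System.out.println(raw.toString().equals("Zhejiang")); // true
    }
}
```

Converting to `String` first gives the key value-based equality, and it matches the `String` overload of `Text.set` used in the mapper.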
