把单词按照ASCII码奇偶分区(Partitioner)

1、分析

黑猴子的家:MapReduce WordCount奇偶分区(Partitioner)

2、自定义分区

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Partitioner;

public class WordCountPartitioner extends Partitioner<Text, IntWritable>{

    @Override
    public int getPartition(Text key, IntWritable value, int numPartitions) {
        
        // 1 获取单词key  
        String firWord = key.toString().substring(0, 1);
        char[] charArray = firWord.toCharArray();
        int result = charArray[0];
        // int result  = key.toString().charAt(0);

        // 2 根据奇数偶数分区
        if (result % 2 == 0) {
            return 0;
        }else {
            return 1;
        }
    }
}

3、在驱动中配置加载分区,设置reducetask个数

job.setPartitionerClass(WordCountPartitioner.class);
job.setNumReduceTasks(2);

4、基于wordcount案例

https://www.jianshu.com/p/c8726f1ccd8f

5、Code -> GitHub

https://github.com/liufengji/hadoop_mapreduce.git

相关文章:

  • 2021-06-12
  • 2021-07-17
  • 2021-12-19
  • 2021-08-13
  • 2021-04-07
  • 2022-01-09
  • 2021-12-07
猜你喜欢
  • 2021-08-12
  • 2021-08-05
  • 2021-04-24
  • 2021-09-21
  • 2021-11-03
  • 2021-12-20
  • 2021-08-30
相关资源
相似解决方案