我不确定您的期望如下。
JavaRDD<String> lines = sc.textFile("readcsvyear.csv");
JavaRDD<String> filtered = lines.filter(line -> {
String[] list = line.split(",");
if(list.length >= 5 && list[4].equalsIgnoreCase("2018")) {
return true;
}
return false;
});
System.out.println(filtered);
for(String line:filtered.collect()){
System.out.println("filtered "+line);
}
输入文件
1,Date,desc,type,2018
2,Date,desc,type,2018
3,Date,desc,type,2018
4,Date,desc,type,2018
5,Date,desc,type,2018
6,Date,desc,type,2017
7,Date,desc,type,2017
8,Date,desc,type,2017
输出
filtered 1,Date,desc,type,2018
filtered 2,Date,desc,type,2018
filtered 3,Date,desc,type,2018
filtered 4,Date,desc,type,2018
filtered 5,Date,desc,type,2018