【发布时间】:2025-11-27 13:15:02
【问题描述】:
我正在尝试使用 Cascading 读取文件并在特定字段上使用 Group By。
但它返回源文件中的所有行。
源文件:
no,date,amt
1,3/10/2016,1000
1,3/10/2016,2000
1,3/11/2016,400
232,2/10/2016,1500
代码:
Fields tscnFields = new Fields("no", "date", "amt");
FileTap tscnFileTap = new FileTap(new TextDelimited(tscnFields,true, ","), "C://Users//Test//tscn.txt");
final Fields groupField = new Fields("date");
Pipe pipe = new Pipe("test");
pipe = new Each(pipe, new Debug());
pipe = new GroupBy("group by date", pipe, groupField);
Fields outFields = new Fields("no", "date", "amt");
FileTap sinkTap = new FileTap(new TextDelimited(outFields,true, ","), "C://Users//Test//out.txt", SinkMode.REPLACE); Flow flow = flowConnector.connect("FlowMonitorTest", tscnFileTap, sinkTap, pipe);
flow.complete();
我得到的输出是
['1', '3/10/2016', '1000']
['1', '3/10/2016', '2000']
['1', '3/11/2016', '400']
['232', '2/10/2016', '1500']
tuples count: 4
【问题讨论】:
-
你想要什么样的输出?
标签: cascading