【发布时间】:2020-08-07 00:40:50
【问题描述】:
我有一个如下所示的数据框:
+--------------+-------+-------+-------+-------+-------+-------+-------+
|Country/Region| 3/7/20| 3/8/20| 3/9/20|3/10/20|3/11/20|3/12/20|3/13/20|
+--------------+-------+-------+-------+-------+-------+-------+-------+
| Senegal| 0| 4| 10| 18| 27| 31| 35|
+--------------+-------+-------+-------+-------+-------+-------+-------+
| Tunisia| 1| 8| 15| 21| 37| 42| 59|
+--------------+-------+-------+-------+-------+-------+-------+-------+
对于每个国家/地区,我都有一个唯一的行,但我有很多列代表天数。 我想遍历每一列并从中减去上一列中的相应值,例如生成的df应该如下:
+--------------+-------+-------+-------+-------+-------+-------+-------+
|Country/Region| 3/7/20| 3/8/20| 3/9/20|3/10/20|3/11/20|3/12/20|3/13/20|
+--------------+-------+-------+-------+-------+-------+-------+-------+
| Senegal| 0| 4| 6| 8| 9| 4| 4|
+--------------+-------+-------+-------+-------+-------+-------+-------+
| Tunisia| 1| 7| 7| 6| 16| 5| 17|
+--------------+-------+-------+-------+-------+-------+-------+-------+
【问题讨论】:
标签: scala apache-spark apache-spark-sql