【Posted】: 2017-09-08 00:43:00
【Question】:
I have a DataFrame that looks like this:
+----------+-------+-------------------------------------------------+
| WindowID | State | Details                                         |
+----------+-------+-------------------------------------------------+
| 6        | SD    | [[29916,3], [156570,4], [245934,1], [329748,8]] |
| 3        | CO    | [[524586,2], [1548,3], [527220,1]]              |
+----------+-------+-------------------------------------------------+
Now I want to sort the Details array in each row in descending order by the second element of each inner list. The result should be:
+----------+-------+-------------------------------------------------+
| WindowID | State | Details                                         |
+----------+-------+-------------------------------------------------+
| 6        | SD    | [[329748,8], [156570,4], [29916,3], [245934,1]] |
| 3        | CO    | [[1548,3], [524586,2], [527220,1]]              |
+----------+-------+-------------------------------------------------+
How can I do this in PySpark? Thanks in advance.
【Discussion】:
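One possible approach, offered as a sketch rather than a definitive answer, is to sort each array with a plain Python function and apply it through a UDF. The names `sort_details` and `with_sorted_details` are made up for illustration, and the code assumes that Details has the type array<array<bigint>>; if the column actually holds structs or strings, the return type would need to change.

```python
from typing import List, Optional


def sort_details(details: Optional[List[List[int]]]) -> Optional[List[List[int]]]:
    """Sort [id, count] pairs by the second element, largest first."""
    if details is None:
        # Leave null cells as null instead of failing inside the UDF.
        return None
    # sorted() is stable, so pairs with equal counts keep their original order.
    return sorted(details, key=lambda pair: pair[1], reverse=True)


def with_sorted_details(df):
    """Return df with the Details column sorted row by row (needs pyspark)."""
    # Imported lazily so sort_details can be tried without a Spark install.
    from pyspark.sql import functions as F
    from pyspark.sql.types import ArrayType, LongType

    # Assumption: Details is array<array<bigint>>; adjust the type otherwise.
    sort_udf = F.udf(sort_details, ArrayType(ArrayType(LongType())))
    return df.withColumn("Details", sort_udf(F.col("Details")))
```

With a DataFrame `df` shaped like the one in the question, `with_sorted_details(df).show(truncate=False)` should display the sorted arrays. A Python UDF serializes every row through the Python worker, so on large data a built-in function may be faster if your Spark version offers a suitable one.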
Tags: python-3.x apache-spark dataframe