【发布时间】:2021-12-31 12:49:35
【问题描述】:
我有一个包含熊猫系列(第 2 列)的数据框,如下所示:
| column 1 | column 2 | column 3 |
|---|---|---|
| 1123 | Requested By = John Doe 1\n Requested On = 12 October 2021\n Comments = This is a generic request | INC29192 |
| 1251 | NaN | INC18217 |
| 1918 | Requested By = John Doe 2\n Requested On = 2 September 2021\n Comments = This is another generic request | INC19281 |
我正在努力将第 2 列数据提取、拆分和映射到一系列新列名,其中包含该记录的适当数据(如果可能,那是有可用数据的地方,因为我有 NaN)。
所需的输出类似于(为了便于阅读,我删除了第 3 列数据):
| column 1 | column 3 | Requested By | Requested On | Comments |
|---|---|---|---|---|
| 1123 | INC29192 | John Doe 1 | 12 October 2021 | This is a generic request |
| 1251 | INC18217 | NaN | NaN | NaN |
| 1918 | INC19281 | John Doe 2 | 2 September 2021 | This is another generic request |
我花了相当长的时间尝试各种方法,从 lambda 函数到理解再到分解方法,但还没有找到提供所需输出的解决方案 - 任何想法都非常感谢!
【问题讨论】:
-
请将数据添加为代码sn-p
标签: python-3.x pandas list lambda split