【发布时间】:2020-05-21 21:33:41
【问题描述】:
我有一个大数据框,我正在尝试找出如何将具有不同值的单元格连接到一个单元格中,给定以下数据框:
DF1:以数据和名称作为标题
Data, Name
Address 213, Name1
Household = 1, Name1
1012 ..alpha numeric value..., Name1
1013 ..alpha numeric value..., Name1
1012 ..alpha num val2... Name1
1013 ..alpha num val2... Name1
Address 234, Name2
Household = 2, Name2
1012 ..alpha numeric value..., Name2
1013 ..alpha numeric value..., Name2
1012 ..alpha num val2... Name2
1013 ..alpha num val2... Name2
.......... dataframe repeats with different values for 10,000+ lines
1012 和 1013 是重复序列 X 次。
我想要得到的输出是:
DF输出:
Data Name
Address 213 Household = 1 1012 ..alpha numeric value.. 1013 ..alpha numeric value..., Name1
Address 213 Household = 1 1012 ..alpha num val2... 1013 ..alpha num val2..., Name1
Address 234 Household = 2 1012 ..alpha numeric value.. 1013 ..alpha numeric value..., Name2
Address 234 Household = 2 1012 ..alpha num val2... 1013 ..alpha num val2..., Name2
..... repeats for entire dataframe 10,000+ lines in DF1
另外,DFOut 中Data 列中的单元格也可以分开:
Data, Number, Seq, Seq1, Name
Address 213, Household = 1, 1012 ..alpha numeric value.., 1013 ..alpha numeric value..., Name1
Address 213, Household = 1, 1012 ..alpha num val2..., 1013 ..alpha num val2..., Name1
Address 234, Household = 2, 1012 ..alpha numeric value.., 1013 ..alpha numeric value..., Name2
Address 234, Household = 2, 1012 ..alpha num val2..., 1013 ..alpha num val2..., Name2
..... repeats for entire dataframe 10,000+ lines in DF1
我尝试使用几个for 循环根据值搜索Data 列,然后将不同的值连接到一列中,但这样做之后由于某种原因我丢失了Name 列。我对 Python 相当陌生,任何帮助将不胜感激。提前致谢!
【问题讨论】:
标签: python python-3.x excel pandas csv