【问题标题】:Removing duplicated headers in panda (Python)删除熊猫中的重复标题(Python)
【发布时间】:2020-11-15 13:49:20
【问题描述】:

基于下面的代码,我正在合并基于减法的科学和数学。

for f in Science['Name']:
     Math=(df[df['Name'].str.startswith(f)])
     Math['Sub'] = Math['Name'].str.extract(r'(\w*)', expand=False)
     Field1= pd.merge(Science, Math, left_on='Sub', right_on='Sub')
     
     print(Field1)

上述代码的结果如下,但是我想删除重复的标题,即 Name-x、Name-y、Sub 和 Name。我将我的预期结果分享如下,我也想将其存储在 csv 文件中

    Name_x      Name_y    Sub            Name
0  Numbers  Math-Numbers  Math         Math-01
1  Numbers  Math-Numbers  Math         Math-01
2  Numbers  Math-Numbers  Math         Math-01

    Name_x      Name_y    Sub           Name
0  Numbers  Math-Numbers  Math         Math-02
1  Numbers  Math-Numbers  Math         Math-02
2  Numbers  Math-Numbers  Math         Math-02

    Name_x      Name_y    Sub           Name
0  Numbers  Math-Numbers  Math         Math-03
1  Numbers  Math-Numbers  Math         Math-03
2  Numbers  Math-Numbers  Math         Math-03

预期的 CSV 文件:

    Name_x      Name_y    Sub           Name
0  Numbers  Math-Numbers  Math         Math-01
1  Numbers  Math-Numbers  Math         Math-01
2  Numbers  Math-Numbers  Math         Math-01
0  Numbers  Math-Numbers  Math         Math-02
1  Numbers  Math-Numbers  Math         Math-02
2  Numbers  Math-Numbers  Math         Math-02
0  Numbers  Math-Numbers  Math         Math-03
1  Numbers  Math-Numbers  Math         Math-03
2  Numbers  Math-Numbers  Math         Math-03

【问题讨论】:

    标签: python python-3.x pandas python-2.7 dataframe


    【解决方案1】:

    您所需要的只是连接生成的数据帧。 即

    result = []
    for f in Science['Name']:
         Math=(df[df['Name'].str.startswith(f)])
         Math['Sub'] = Math['Name'].str.extract(r'(\w*)', expand=False)
         Field1= pd.merge(Science, Math, left_on='Sub', right_on='Sub')
         result.append(Field1)
    out_df = pd.concat(result)
    print(out_df)
    

    【讨论】:

      猜你喜欢
      • 2017-02-08
      • 2016-01-30
      • 2013-10-28
      • 1970-01-01
      • 2021-08-31
      • 1970-01-01
      • 2019-04-12
      • 2019-12-08
      • 1970-01-01
      相关资源
      最近更新 更多