【发布时间】:2017-08-15 22:58:33
【问题描述】:
我对 pandas 很陌生,希望您能就如何解决我的问题提供意见。我有以下数据框:
df = pd.DataFrame({'A' : ["me","you","you","me","me","me","me"],
'B' : ["Y","X","X","X","X","X","Z"],
'C' : ["1","2","3","4","5","6","7"]
})
我需要根据 A 列和 B 列中的行值对其进行转换。逻辑应该是,只要 A 列和 B 列中的值在连续行上相同,则应保留此序列中的第一行,但以下行应在 B 列中设置“A”。
例如:第 1 行和第 2 行中 A 列和 B 列中的值相同。B 列第 2 行中的值应替换为 A。这是我的预期输出:
df2= pd.DataFrame({'A' : ["me","you","you","me","me","me","me"],
'B' : ["Y","X","A","X","A","A","Z"],
'C' : ["1","2","3","4","5","6","7"]})
【问题讨论】:
标签: python-2.7 pandas replace duplicates cumsum