【发布时间】:2018-06-28 08:24:19
【问题描述】:
我有一个 df,
Sr.No Name Class Data
0 1 Sri 1 sri is a good player
1 '' Sri 2 sri is good in cricket
2 '' Sri 3 sri went out
3 2 Ram 1 Ram is a good player
4 '' Ram 2 sri is good in cricket
5 '' Ram 3 Ram went out
6 3 Sri 1 sri is a good player
7 '' Sri 2 sri is good in cricket
8 '' Sri 3 sri went out
9 4 Sri 1 sri is a good player
10 '' Sri 2 sri is good in cricket
11 '' Sri 3 sri went out
12 '' Sri 4 sri came back
我正在尝试根据 ["Name","Class","Data"] 删除重复项。目标是根据每个 Sr 编号的所有句子删除重复项。
我的预期输出是,
out_df
Sr.No Name Class Data
0 1 Sri 1 sri is a good player
1 Sri 2 sri is good in cricket
2 Sri 3 sri went out
3 2 Ram 1 Ram is a good player
4 Ram 2 sri is good in cricket
5 Ram 3 Ram went out
9 4 Sri 1 sri is a good player
10 Sri 2 sri is good in cricket
11 Sri 3 sri went out
12 Sri 4 sri came back
【问题讨论】:
-
您能否打印
df.to_dict()并将输出粘贴到您的问题中?您的数据框很难复制。 -
您的 to_dict 输出与您发布的数据框不同。请务必使其保持一致,以便您的预期输出清晰;)
-
@cᴏʟᴅsᴘᴇᴇᴅ,我用正确的
df.to_dict()编辑了我的问题,请检查 -
我不明白你,当我做
pd.DataFrame(my_dict)它正确地给出了我的实际df。 -
没关系,我最初误解了这个问题。
标签: python pandas dataframe group-by duplicates