【发布时间】:2019-03-20 01:40:15
【问题描述】:
我有输入数据框:
df1 = pandas.DataFrame( {
"Name" : ["Alice", "Bob", "Mallory", "Mallory","Mallory", "Bob" ,"Bob", "Mallory", "Alice"] ,
"City" : ["Seattle", "Seattle", "Portland", "Seattle", "Seattle", "Portland", "Portland", "Seattle", "Seattle"] } )
我想按名称分组,但不是唯一的,所以输出应该是:
["Alice","Bob","Mallory","Bob","Mallory", "Alice"]
我找不到任何有效的方法 - 有没有不迭代所有行的方法?
【问题讨论】:
-
df1.groupby(df1.Name.ne(df1.Name.shift()).cumsum()).Name.first()
标签: python pandas dataframe pandas-groupby data-science