【发布时间】:2020-08-26 13:57:09
【问题描述】:
import pandas as pd
list = ['apple','banana','cherries','dragonfruit','elderberry']
data = {'name': ['Alpha', 'Bravo','Charlie','Delta','Echo'],
'favorite_fruit': ['apple banana cherries', 'banana cherries dragonfruit',
'cherries dragonfruit','dragonfruit','apple elderberry']}
df = pd.DataFrame (data, columns = ['name','favorite_fruit'])
我想计算df中list中每个水果的频率。
预期输出:
df2
Fruit | Frequency
Apple | 2
Banana | 2
Cherries | 3
Dragonfruit | 3
Elderberry | 1
代码 df.favorite_fruit.str.split(expand=True).stack().value_counts() 适用于小型 DataFrame。
如果df.favorite_fruit包含数千行不同的水果组合,
如何仅找到list 中单词的频率?
【问题讨论】:
标签: python python-3.x regex pandas