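[Posted]: 2022-01-04 01:18:18
[Question]:
I want to count the words longer than 8 letters in the "text" column. The table has more than 500,000 values. I am not yet familiar enough with pandas.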
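import re

def howmany8(words):  # returns the number of words longer than 8 letters
    counter = 0
    for word in words:
        if len(word) > 8:
            counter += 1  # was counter+=counter, which always stays 0
    return counter

# dropna returns a new frame, so assign the result; copy() leaves df untouched
newdf = df.dropna(subset=['text']).copy()
# count per row without chained indexing such as newdf['wordssum'][i] = ...
newdf['wordssum'] = newdf['text'].apply(lambda s: howmany8(re.split(r"\s", s)))
print(newdf['wordssum'].sum())  # the column is 'wordssum', not 'words'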
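For a table of this size, a vectorized version may be worth trying instead of a Python-level loop. The following is only a sketch under assumptions: the sample frame is made up for illustration, and a "word" is taken to be any run of non-whitespace characters, so punctuation counts toward the length just as it does with the whitespace split above.

```python
import pandas as pd

# Hypothetical sample data standing in for the real 500,000-row table.
df = pd.DataFrame(
    {"text": ["a remarkably straightforward explanation", None, "short words only"]}
)

# \S{9,} matches each run of 9 or more non-whitespace characters,
# i.e. each word longer than 8 characters. Missing values stay missing.
per_row = df["text"].str.count(r"\S{9,}")

# sum() skips missing values, so no explicit dropna is needed here.
total = int(per_row.sum())
print(total)
```

If the per-row counts are also needed, `per_row` can be assigned to a new column such as `df['wordssum']`.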
[Discussion]: