【发布时间】:2019-09-06 17:02:16
【问题描述】:
我必须绘制一个 wordcloud。 “tweets.csv”是一个 Pandas 数据框,其中有一列名为“text”。绘制的图表并非基于最常见的词,艰难。单词大小如何与它们在数据框中的频率相关联?
text = df_final.text.values
wordcloud = WordCloud(
#mask = logomask,
max_words = 1000,
width = 600,
height = 400,
#max_font_size = 1000,
#min_font_size = 100,
normalize_plurals = True,
#scale = 5,
#relative_scaling = 0,
background_color = 'black',
stopwords = STOPWORDS.union(stopwords)
).generate(str(text))
fig = plt.figure(
figsize = (50,40),
facecolor = 'k',
edgecolor = 'k')
plt.imshow(wordcloud, interpolation = 'bilinear')
plt.axis('off')
plt.tight_layout(pad=0)
plt.show()
我的数据框如下所示:
0 RT @Pontifex_pt: Temos que descobrir as riquezezas ...
1 RT @Pontifex_pt: Todos estamos em viagem rumo ...
2 RT @Pontifex_pt: Unamos as forças, em todos ...
3 RT @GeneralMourao: #Segurançapública, preocupa ...
4 RT @FIFAcom: The Brasileirao U-17 final provided ...
【问题讨论】:
标签: python pandas dataframe frequency word-cloud