【发布时间】:2018-11-26 16:14:09
【问题描述】:
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans
cc_tfid = TfidfVectorizer().fit_transform(cc_corpus)
cc_km = KMeans(n_clusters = 3, init = 'k-means++', max_iter = 99, n_init = 4, verbose = False )
cc_km.fit(cc_tfid)
plt.scatter(cc_tfid[:, 0], cc_tfid[:, 1])
centroids = cc_km.cluster_centers_
plt.scatter(centroids[:, 0], centroids[:, 1], c='red', s=200, alpha=0.5)
plt.show()
我可以可视化质心,但不能可视化点,因为它们来自稀疏矩阵。请问这个怎么画?
【问题讨论】:
-
你能分享复制问题的数据吗?
-
@MohammedKashif 给你dropbox.com/s/knwsoto5sa8blbg/cc_corpus?dl=0
标签: python matplotlib scikit-learn k-means sklearn-pandas