【发布时间】:2018-11-22 16:39:37
【问题描述】:
var_vector = TfidfVectorizer()
train_var = var_vector.fit_transform(t_df['var'])
top_100 = np.array(var_vector.get_feature_names())
tfidf_100 = np.argsort(var_vector.idf_)[::-1]
n = 100
top_n = top_100[tfidf_100][:n]
从 tfidf Vectorizer 中选择前 100 个单词后如何将维度更新为 100?
【问题讨论】:
标签: python nltk tfidfvectorizer