【Posted】:2021-05-30 00:27:57
【Question】:
tfidf = TfidfVectorizer(lowercase=False, )
tfidf.fit_transform(questions)
# dict key:word and value:tf-idf score
word2tfidf = dict(zip(tfidf.get_feature_names(), tfidf.idf_))
【Comments】:
Tags: machine-learning scikit-learn nlp
idf_: array of shape (n_features,)
The inverse document frequency (IDF) vector; only defined if use_idf is True.
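Since idf_ has one entry per feature, the dict built from it maps each word to a corpus-wide IDF weight, not to a per-document tf-idf score. Here is a minimal standard-library sketch of what those values look like, assuming the default smooth_idf=True formula from the scikit-learn docs, idf(t) = ln((1 + n) / (1 + df(t))) + 1. The three sample questions and the name word2idf are made up for illustration; this is not the library's actual implementation.

```python
import math
import re

# Hypothetical stand-in for the `questions` corpus in the question.
questions = [
    "What is machine learning",
    "What is deep learning",
    "How does TF-IDF work",
]

# Same default token pattern as scikit-learn: words of 2+ characters.
token_pattern = re.compile(r"(?u)\b\w\w+\b")

# lowercase=False in the question, so tokens keep their original case.
# A set per document is enough, because IDF only needs document frequency.
docs = [set(token_pattern.findall(q)) for q in questions]
vocabulary = sorted(set().union(*docs))
n_docs = len(docs)

# Smoothed IDF: idf(t) = ln((1 + n) / (1 + df(t))) + 1
word2idf = {}
for term in vocabulary:
    df = sum(term in doc for doc in docs)  # number of documents containing term
    word2idf[term] = math.log((1 + n_docs) / (1 + df)) + 1

for term, idf in word2idf.items():
    print(f"{term}: {idf:.4f}")
```

Words that occur in fewer documents get a larger weight, and every word gets exactly one value no matter how many documents it appears in. Per-document tf-idf scores live in the sparse matrix returned by fit_transform instead.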
【Discussion】: