【发布时间】:2016-02-21 18:40:10
【问题描述】:
如何将以下带有多个文档中每个单词的 tf-idf 分数的 pandas 数据帧转换为一个名为“tfdif”的矩阵,以便我可以实现例如
from sklearn.feature_extraction.text import TfidfVectorizer
from nltk.stem.porter import PorterStemmer
str = 'this sentence has unseen text such as computer but also king lord juliet'
response = tfidf.transform([str])
【问题讨论】: