【发布时间】:2020-08-16 20:08:04
【问题描述】:
我想一次在两列上应用 Sklearn 的 CountVectorizer。 我试过这个:
features = df[['col 1', 'col2']]
results = df[['col 3']
vectorizer = CountVectorizer(lowercase=False)
features = vectorizer.fit_transform(features)
results = vectorizer.fit_transform(results)
但我收到此错误:
TypeError: expected string or bytes-like object
然后我尝试了这个:
from sklearn.compose import make_column_transformer
vectorizer = CountVectorizer(lowercase=False)
transformer = make_column_transformer((vectorizer, 'col 1'), (vectorizer, 'col 2'))
features = transformer.fit_transform(features)
results = vectorizer.fit_transform(results)
但我收到此错误:
ValueError: Specifying the columns using strings is only supported for pandas DataFrames
我做错了什么,我在这里看到了第二个解决方案:
【问题讨论】:
标签: python pandas scikit-learn