【发布时间】:2021-05-27 18:57:05
【问题描述】:
我有一个非常大的数据集,我必须训练一个模型。我不知道 Nan 或缺失值在哪里。 svm代码启动时出现错误。
df = pd.read_csv('Data.txt',delimiter=',')
df.head()
X = df[['a', 'b', 'c']].values
Y=df['Label'].values
cv = KFold(n_splits=2, random_state=42, shuffle=False)
for train_index, test_index in cv.split(X):
print("Train Index: ", train_index, "\n")
print("Test Index: ", test_index)
X_train, X_test, Y_train, Y_test = X[train_index], X[test_index], Y[train_index], Y[test_index]
svm1 = svm.SVC(gamma='scale', probability=True)
pred = svm1.fit(X_train, Y_train).predict(X_test)
错误
raise ValueError(msg_err.format(type_err, X.dtype))
ValueError: Input contains NaN, infinity or a value too large for dtype('float64')
【问题讨论】:
标签: python pandas scikit-learn svm