【发布时间】:2020-12-22 19:32:13
【问题描述】:
我使用permutatation_importance 来查找最重要的值
from sklearn.neighbors import KNeighborsClassifier
import numpy as np
from sklearn.inspection import permutation_importance
columns=['progresion', 'tipo']
X = df_cat.drop(columns, axis = 1)
y = df_cat['progresion']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state = 42)
knn = KNeighborsClassifier()
knn.fit(X_train,y_train)
results = permutation_importance(knn, X, y, scoring='accuracy')
importance = results.importances_mean
for i,v in enumerate(importance):
print('Feature: %0d, Score: %.5f' % (i,v))
但我想做的是评估每对变量的 KNN 分类器,以找出哪对变量更相关,从而获得更好的模型性能。
【问题讨论】:
标签: python validation knn