【发布时间】:2022-08-16 23:58:49
【问题描述】:
我是熊猫新手。 所以我尝试在相关性大于零的列表中附加列名。
这是我的代码
corr_matrix = df_train.corr()
corr_matrix[\"failure\"].sort_values(ascending=False)
useful_features = []
for f in corr_matrix[\"failure\"]:
if f > 0:
useful_features.append(df_train.columns)
print(useful_features)
但这是将所有列名附加到列表中
[Index([\'id\', \'product_code\', \'loading\', \'attribute_0\', \'attribute_1\',
\'attribute_2\', \'attribute_3\', \'measurement_0\', \'measurement_1\',
\'measurement_2\', \'measurement_3\', \'measurement_4\', \'measurement_5\',
\'measurement_6\', \'measurement_7\', \'measurement_8\', \'measurement_9\',
\'measurement_10\', \'measurement_11\', \'measurement_12\', \'measurement_13\',
\'measurement_14\', \'measurement_15\', \'measurement_16\', \'measurement_17\',
\'failure\', \'kfold\'],
.
.
.
I am not pasting complete output
我想要的是
useful_features = [\'failure\',\'loading\',...,\'kfold\']
的输出 corr_matrix[\"失败\"].sort_values(ascending=False)
failure 1.000000
loading 0.129089
measurement_17 0.033905
measurement_5 0.018079
measurement_8 0.017119
measurement_7 0.016787
measurement_2 0.015808
measurement_6 0.014791
measurement_0 0.009646
attribute_2 0.006337
measurement_14 0.006211
measurement_12 0.004398
measurement_3 0.003577
measurement_16 0.002237
kfold 0.000130
measurement_10 -0.001515
measurement_13 -0.001831
measurement_15 -0.003544
measurement_9 -0.003587
measurement_11 -0.004801
id -0.007545
measurement_4 -0.010488
measurement_1 -0.010810
attribute_3 -0.019222
Name: failure, dtype: float64
有没有办法附加列名? df_train.columns.values 还附加列表中的所有名称