【发布时间】:2021-09-13 05:41:44
【问题描述】:
我已经创建了 xgboost 回归模型,想看看随着训练集数量的增加,训练和测试性能如何变化。
xgbm_reg = XGBRegressor()
tr_sizes, tr_scs, test_scs = learning_curve(estimator=xgbm_reg,
X=ori_X,y=y,
train_sizes=np.linspace(0.1, 1, 5),
cv=5)
tr_scs 和 test_scs 的性能如何?
Sklearn doc 告诉我
scoring : str or callable, default=None
A str (see model evaluation documentation) or a scorer callable object / function
with signature scorer(estimator, X, y)
所以我查看了XGboost documentation,它说目标是default = reg:squarederror,这是否意味着 tr_scs 和 test_scs 的结果是平方误差?
我想使用 cross_val_score 检查
scoring = "neg_mean_squared_error"
cv_results = cross_val_score(xgbm_reg, ori_X, y, cv=5, scoring=scoring)
但不太清楚如何从 cross_val_score 中得到 squared_error
【问题讨论】:
标签: python scikit-learn xgboost