【发布时间】:2020-11-01 15:01:36
【问题描述】:
我是 SciKit-Learn 的新手,我一直在研究 kaggle 上的回归问题(国王县 csv)。我一直在训练一个回归模型来预测房子的价格,我想绘制图表,但我不知道该怎么做。我正在使用python 3.6。任何意见或建议将不胜感激。
#importing numpy and pandas, seaborn
import numpy as np #linear algebra
import pandas as pd #datapreprocessing, CSV file I/O
import seaborn as sns #for plotting graphs
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold
from sklearn.model_selection import cross_val_score
import matplotlib.pyplot as plt
data = pd.read_csv('kc_house_data.csv')
data = data.drop('date',axis=1)
data = data.drop('id',axis=1)
X = data
Y = X['price'].values
X = X.drop('price', axis = 1).values
X_train, X_test, Y_train, Y_test = train_test_split (X, Y, test_size = 0.30, random_state=21)
reg = LinearRegression()
kfold = KFold(n_splits=15, random_state=21)
cv_results = cross_val_score(reg, X_train, Y_train, cv=kfold, scoring='r2')
print(cv_results)
round(np.mean(cv_results)*100, 2)
【问题讨论】:
标签: python plot scikit-learn linear-regression