【发布时间】:2017-01-30 06:50:53
【问题描述】:
我正在浏览这个odds ratios in logistic regression tutorial,并尝试使用 scikit-learn 的逻辑回归模块获得完全相同的结果。使用下面的代码,我可以获得系数和截距,但我找不到找到教程中列出的模型的其他属性的方法,例如 log-likelyhood、Odds Ratio、Std。错误,z,P>|z|,[95% Conf.间隔]。如果有人能告诉我如何使用sklearn 包计算它们,我将不胜感激。
import pandas as pd
from sklearn.linear_model import LogisticRegression
url = 'https://stats.idre.ucla.edu/wp-content/uploads/2016/02/sample.csv'
df = pd.read_csv(url, na_values=[''])
y = df.hon.values
X = df.math.values
y = y.reshape(200,1)
X = X.reshape(200,1)
clf = LogisticRegression(C=1e5)
clf.fit(X,y)
clf.coef_
clf.intercept_
【问题讨论】:
-
仅供参考,您应该以
from sklearn.linear_model import LogisticRegression身份进行导入 -
当我运行这段代码时,我得到
databricks/python/lib/python3.7/site-packages/sklearn/utils/validation.py:760: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples, ), for example using ravel(). y = column_or_1d(y, warn=True) Out[2]: LogisticRegression(C=100000.0, class_weight=None, dual=False, fit_intercept=True, intercept_scaling=1, l1_ratio=None, max_iter=100, multi_class='auto', n_jobs=None, penalty='l2', random_state=None, solver='lbfgs', tol=0.0001, verbose=0,warm_start=False) -
我很确定在遇到这个问题时我一直在使用 Python 2.7。
标签: python scikit-learn