【发布时间】:2018-11-22 04:19:37
【问题描述】:
我是 Python 新手,试图做一个时间序列回归模型。我有 3 列,X、Y 和日期。我导入了下面的所有内容,但我遇到了一个错误。
import numpy as np
from sklearn import linear_model
import matplotlib.pyplot as plt
import pandas as pd
%matplotlib inline
from matplotlib.pylab import rcParams
rcParams['figure.figsize'] = 15, 6
import statsmodels.api as sm
import statsmodels.formula.api as smf
from statsmodels.tsa.stattools import adfuller
raw_data = pd.read_csv("IMF and BBG Fair Values.csv")
ISO_TH = raw_data[["IMF_VALUE", "BBG_FV", "IMF_DATE"]]
过滤掉NaN:
filtered_TH = ISO_TH[np.isfinite(raw_data['BBG_FV'])]
我收到此错误:
C:\Program Files\Anaconda3\lib\site-packages\pandas\core\generic.py:2698: >SettingWithCopyWarning: 试图在 DataFrame 中的切片副本上设置值。 尝试改用 .loc[row_indexer,col_indexer] = value
查看documentation中的注意事项
【问题讨论】:
-
我实际上得到了这行代码的错误:filtered_TH.IMF_DATE = pd.DatetimeIndex(filtered_TH.IMF_DATE)
-
import numpy as np from sklearn import linear_model import matplotlib.pyplot as plt import pandas as pd %matplotlib inline from matplotlib.pylab import rcParams rcParams['figure.figsize'] = 15, 6 import statsmodels. api as sm import statsmodels.formula.api as smf from statsmodels.tsa.stattools import adfuller raw_data=pd.DataFrame([[np.inf,22,333,44], [3,4,5,2],[1,2 ,3,4],[np.inf,0,0,0]],columns=["BBG_FV", "IMF_VALUE", "IMF_DATE", "未使用"]) ISO_TH = raw_data.loc[:,["IMF_VALUE ", "BBG_FV", "IMF_DATE"]] ISO_TH.IMF_VALUE=[0,0,0,0]
标签: python numpy statistics regression