【发布时间】:2021-12-31 01:14:43
【问题描述】:
在尝试从 github 端点导入流行的 UCL bank marketing dataset 时,我遇到了一些问题。读取语句未正确获取 17 列的数据集。我检查了分隔符和标题,但我不确定如何更正索引。
# URL endoint
url = 'https://raw.githubusercontent.com/ThamuMnyulwa/bankMarketing/main/bank-additional-full.csv'
column_names = ["age","job","marital","education","default","balance","housing","loan","contact","day","month"
,"duration","campaign","pdays","previous","poutcome", "y"]
raw_dataset = pd.read_csv(url, names=column_names,
na_values='?',sep=';'
, skipinitialspace=False, index_col=None)
相反,它给了我这样的东西:
如何使用 pandas read_csv 从 URL 正确导入数据集 (link)?
【问题讨论】:
标签: python pandas dataframe csv import