【发布时间】:2019-05-05 08:49:04
【问题描述】:
我正在尝试制作工资与时间的关系图。这是我得到的:
import pandas as pd
import matplotlib.pyplot as plt
df = pd.read_csv('Pay051118_1.csv')
df.to_csv("Newcsv", columns = ['Dates worked', 'Pay'])
df = pd.read_csv('Newcsv', parse_dates = True, index_col = 0)
df = df.fillna(0)
df = ((df[['Dates worked','Pay']])[df.Pay > 1])
print (df)
df.plot()
plt.show()
代码的第 5 行为我删除了其中一个索引列。如果不输入 parse_dates 和 index_col 参数,我将获得 2 列索引。当我打印 (df) 时,这就是我得到的:
Dates worked Pay
0 Monday, 5 November 2018 44.5
2 Wednesday, 7 November 2018 44.5
3 Thursday, 8 November 2018 44.5
4 Friday, 9 November 2018 44.5
6 Sunday, 11 November 2018 50.5
7 Monday, 12 November 2018 62.0
8 Tuesday, 13 November 2018 38.5
9 Wednesday, 14 November 2018 65.5
10 Thursday, 15 November 2018 62.0
16 Wednesday, 21 November 2018 65.5
17 Thursday, 22 November 2018 62.0
18 Friday, 23 November 2018 65.5
20 Sunday, 25 November 2018 62.5
21 Monday, 26 November 2018 58.5
22 Tuesday, 27 November 2018 42.0
23 Wednesday, 28 November 2018 65.5
25 Friday, 30 November 2018 62.0
如您所见,索引列仍然存在。我怎样才能摆脱它?
【问题讨论】:
-
数据框总是有一个索引,你可以做的就是将它重置为其他东西,例如日期。这可以通过
df.set_index来实现。 -
您好,您不需要导入、导出和重新导入 CSV。您可以在导入时选择列。请参阅 pandas.pydata.org/pandas-docs/stable/io.html#io-read-csv-table。之后,请参阅pandas.pydata.org/pandas-docs/stable/generated/…。我们无权访问您的 csv。