[Posted]: 2023-03-13 04:02:02
[Question]:
I have a csv that looks like this:
Date,Sentiment
2014-01-03,0.4
2014-01-04,-0.03
2014-01-09,0.0
2014-01-10,0.07
2014-01-12,0.0
2014-02-24,0.0
2014-02-25,0.0
2014-02-25,0.0
2014-02-26,0.0
2014-02-28,0.0
2014-03-01,0.1
2014-03-02,-0.5
2014-03-03,0.0
2014-03-08,-0.06
2014-03-11,-0.13
2014-03-22,0.0
2014-03-23,0.33
2014-03-23,0.3
2014-03-25,-0.14
2014-03-28,-0.25
etc
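My goal is to aggregate the dates by month and compute the average for each month. The dates may not start on the 1st of a month or in January. The catch is that I have a lot of data spanning several years. So I want to find the earliest date (month) and count months from there, computing the average for each one. For example: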
Month count, average
1, 0.4 (<= the earliest month)
2, -0.3
3, 0.0
...
12, 0.1
13, -0.4 (<= new year but counting of month is continuing)
14, 0.3
I am using Pandas to open the csv:
import pandas as pd
data = pd.read_csv("pks.csv", sep=",")
So data['Date'] holds the dates and data['Sentiment'] holds the values. Any idea how to do this?
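One possible way to get the running month count, sketched under a few assumptions: the Date column parses cleanly as dates, and a month number can be built as year * 12 + month and then shifted so that the earliest month becomes 1. The inline sample (standing in for pks.csv) and the names abs_month, MonthCount and monthly are illustrative only and are not part of the original data.

```python
import io
import pandas as pd

# A few rows in the same shape as the CSV above (hypothetical stand-in for pks.csv).
sample = io.StringIO(
    "Date,Sentiment\n"
    "2014-01-03,0.4\n"
    "2014-01-04,-0.2\n"
    "2014-03-01,0.1\n"
    "2015-01-10,0.3\n"
)

# parse_dates turns the Date column into datetime64 so the .dt accessor works.
data = pd.read_csv(sample, parse_dates=["Date"])

# Absolute month number: year * 12 + month. Subtracting the minimum and
# adding 1 makes the earliest month 1 and keeps counting across years.
abs_month = data["Date"].dt.year * 12 + data["Date"].dt.month
data["MonthCount"] = abs_month - abs_month.min() + 1

# Mean sentiment per running month.
monthly = data.groupby("MonthCount")["Sentiment"].mean()
print(monthly)
```

With this sketch, months that have no rows simply do not appear in the result. If they should show up as gaps, something like monthly.reindex(range(1, monthly.index.max() + 1)) would add them with NaN averages.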
[Discussion]:
Tags: python date csv pandas aggregate