【发布时间】:2021-12-14 22:49:00
【问题描述】:
如何编写代码来输出dates 和data 之间的差异。 data 代码中缺少数据点,其中 1 分钟时间范围的 dates 数据帧中存在跳过。例如,在2015-10-08 13:53:00 之后有 6 个数据点丢失,因此它将其打印为'2015-10-08 13:54:00', '2015-10-08 14:00:00' 输出丢失的data 的范围。将其记录在Expected Output 中的二维数组中。我将如何编写产生预期输出的函数。
import pandas as pd
import datetime
dates = pd.date_range("2015-10-08 13:40:00", "2015-10-08 14:12:00", freq="1min")
data = pd.to_datetime(['2015-10-08 13:41:00',
'2015-10-08 13:42:00', '2015-10-08 13:43:00',
'2015-10-08 13:44:00', '2015-10-08 13:45:00',
'2015-10-08 13:46:00', '2015-10-08 13:47:00',
'2015-10-08 13:48:00', '2015-10-08 13:49:00',
'2015-10-08 13:50:00', '2015-10-08 13:51:00',
'2015-10-08 13:52:00', '2015-10-08 13:53:00',
'2015-10-08 13:54:00', '2015-10-08 14:01:00',
'2015-10-08 14:02:00', '2015-10-08 14:03:00',
'2015-10-08 14:04:00', '2015-10-08 14:05:00',
'2015-10-08 14:06:00', '2015-10-08 14:07:00',
'2015-10-08 14:10:00', '2015-10-08 14:11:00',
'2015-10-08 14:12:00'])
预期输出:
[['2015-10-08 13:40:00'],
['2015-10-08 13:54:00', '2015-10-08 14:00:00'],
['2015-10-08 14:08:00', '2015-10-08 14:09:00']]
【问题讨论】:
-
这个问题或多或少在 20 分钟前得到了回答。转换为
DatetimeIndex并使用difference函数stackoverflow.com/questions/69775788/…
标签: python pandas numpy datetime time