【发布时间】:2018-08-11 11:46:58
【问题描述】:
是否有任何现有的库能够将日期时间列分成仅包含一个变量的列,例如年、月、日、小时、分钟等。
我这样做是为了对我打算尝试在(Kaggle 纽约出租车费)上使用机器学习的数据进行预处理。
我已经能够使用以下方法做到这一点:
df_raw["pickup_year"] = df_raw['pickup_datetime'].dt.year
df_raw["pickup_month"] = df_raw['pickup_datetime'].dt.month
df_raw["pickup_day"] = df_raw['pickup_datetime'].dt.day
df_raw["pickup_hour"] = df_raw['pickup_datetime'].dt.hour
df_raw["pickup_minute"] = df_raw['pickup_datetime'].dt.minute
df_raw["pickup_second"] = df_raw['pickup_datetime'].dt.second
df_raw["pickup_dayofyear"] = df_raw['pickup_datetime'].dt.dayofyear
df_raw["pickup_week"] = df_raw['pickup_datetime'].dt.week
df_raw["pickup_weekofyear"] = df_raw['pickup_datetime'].dt.weekofyear
df_raw["pickup_dayofweek"] = df_raw['pickup_datetime'].dt.dayofweek
df_raw["pickup_weekday"] = df_raw['pickup_datetime'].dt.weekday
df_raw["pickup_quarter"] = df_raw['pickup_datetime'].dt.quarter
df_raw.head()
但我想,这肯定是在某个图书馆之前做过的吗?
【问题讨论】:
标签: python pandas datetime dataframe