【发布时间】:2021-07-08 04:02:30
【问题描述】:
我正在尝试使用 tidyr 的提取功能从 Notes 列中提取日期。我正在处理的数据如下所示:
dates <- data.frame(col1 = c("customer", "customer2", "customer3"),
Notes = c("DOB: 12/10/62
START: 09/01/2019
END: 09/01/2020", "
S/DATE: 28/08/19
R/DATE: 27/08/20", "DOB: 13/01/1980
Start:04/12/2018"),
End_date = NA,
Start_Date = NA )
我尝试像这样提取字符串“S/DATE”之后的日期:
extract <- extract(
dates,
col = "Notes",
into = "Start_date",
regex = "(?<=(S\\/DATE:)).*" # Using regex lookahead
)
但是,这只会提取字符串“S/DATE:”,而不是之后的日期。当我在 regex101.com 上尝试此操作时,它按预期工作。
谢谢。易卜拉欣
【问题讨论】: