【Posted】: 2015-11-17 14:38:32
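【Question】:
Suppose I have a vector containing some character strings that I want to split based on a regular expression.
More precisely, I want to split each string wherever a comma and a space are followed by a capital letter. As far as I understand, the regex would look like this: /(, [A-Z])/g, which worked fine when I tried it here.
When I try to implement this in R, the regex does not seem to work, for example: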
x <- c("Non MMF investment funds, Insurance corporations, Assets (Net Acquisition of), Loans, Long-term original maturity (over 1 year or no stated maturity)",
"Non financial corporations, Financial corporations other than MFIs, insurance corporations, pension funds and non-MMF investment funds, Assets (Net Acquisition of), Loans, Short-term original maturity (up to 1 year)")
strsplit(x, "/(, [A-Z])/g")
[[1]]
[1] "Non MMF investment funds, Insurance corporations, Assets (Net Acquisition of), Loans, Long-term original maturity (over 1 year or no stated maturity)"
[[2]]
[1] "Non financial corporations, Financial corporations other than MFIs, insurance corporations, pension funds and non-MMF investment funds, Assets (Net Acquisition of), Loans, Short-term original maturity (up to 1 year)"
It does not find any split points. What am I doing wrong here?
Any help is much appreciated!
【Comments】:
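- You should not use /.../g. This is not JS. I guess you do not want to drop the letter, do you? Try this.
- R does not use regex delimiters, and the strsplit function consumes the characters it matches.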
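
To illustrate the two comments, here is a minimal sketch of one way this could be written in R. The input is a shortened, hypothetical version of the vector from the question, and the lookahead pattern is an assumption about what the commenter had in mind, since the linked suggestion is not preserved here.

```r
# Shortened, hypothetical input in the same shape as the question's x.
x <- c("Non MMF investment funds, Insurance corporations, Loans",
       "Non financial corporations, insurance corporations, Loans")

# Plain pattern without the /.../g delimiters. The capital letter is part of
# the match, so strsplit() removes it together with the comma and the space.
consumed <- strsplit(x, ", [A-Z]")

# A lookahead only checks that a capital letter follows; the match itself is
# just ", ", so the letter stays in the output. Lookarounds need perl = TRUE.
kept <- strsplit(x, ", (?=[A-Z])", perl = TRUE)

print(consumed[[1]])  # "Non MMF investment funds" "nsurance corporations" "oans"
print(kept[[1]])      # "Non MMF investment funds" "Insurance corporations" "Loans"
print(kept[[2]])      # ", insurance" is not split because the "i" is lowercase
```

With the original "/(, [A-Z])/g" string, R looks for the slashes and the trailing g as literal characters, which is why nothing was split.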