【发布时间】:2015-08-06 17:44:00
【问题描述】:
我正在寻找一种在 R 中标记句子开头和结尾的方法。为此,我想消除除句末标记(如句号、感叹号、问号和连字符)之外的所有标点符号。我想用标记 *** 代替。同时,我也想保留包含撇号的单词。举一个具体的例子,给定这个字符串:
txt <- "We have examined all the possibilities, however we have not reached a solid conclusion - however we keep and open mind! Have you considered any other approach? Haven't you?"
期望的结果是
txt <- "We have examined all the possibilities however he have not reached a solid conclusion *** however we keep and open mind*** Have you considered any other approach*** Haven't you***"
我还没能拿出一个正则表达式来做到这一点。非常感谢任何提示。
【问题讨论】: