【问题标题】:Extract first N digits from a string从字符串中提取前 N 个数字
【发布时间】:2018-07-20 07:26:09
【问题描述】:

我想从一些字符串中提取前两个数字。 假设数据是:

ABC Conference Room Monitor - Z5580J    
ABC 19 Monitor    
ABC 24 Monitor for Video-Conferencing
ABC UltraSharp 24 Monitor -QU2482Z

所需的输出:

55
19
24
24

【问题讨论】:

标签: r


【解决方案1】:

使用正则表达式\\D 匹配非数字字符和\\d{2} 匹配前两位数字的解决方案。

as.numeric(sub("\\D*(\\d{2}).*", "\\1", INPUT))
# [1] 55 19 24 24

数据:

INPUT <- c("ABC Conference Room Monitor - Z5580J",
           "ABC 19 Monitor",
           "ABC 24 Monitor for Video-Conferencing",
           "ABC UltraSharp 24 Monitor -QU2482Z")

【讨论】:

    【解决方案2】:

    另一种解决方案:

    strings <- c('ABC Conference Room Monitor - Z5580J','ABC 19 Monitor','ABC 24 Monitor for Video-Conferencing','ABC UltraSharp 24 Monitor -QU2482Z')
    x <- as.numeric(gsub("\\D", "", strings))
    as.numeric(substring(as.character(x*100), 1, 2))
    
    [1] 55 19 24 24
    

    【讨论】:

    • 为什么不只是as.numeric(substr(gsub("\\D", "", strings),1,2))
    【解决方案3】:

    stringr 可能是最干净的解决方案:

    stringr::str_extract(string, "\\d{2}")
     "55" "19" "24" "24"
    

    【讨论】:

      【解决方案4】:

      stringr 的一个解决方案是:

      library(stringr)
      string <- str_extract_all("ABC Conference Room Monitor - Z5580J","\\(?[0-9,.]+\\)?")[[1]]
      # "\\(?[0-9,.]+\\)?" is the regex, extracts only numbers
      as.numeric(substr(string , 1,2)) # this selects the first two elements
      #as.numeric is optional
      

      【讨论】:

        猜你喜欢
        • 1970-01-01
        • 2021-10-14
        • 2016-12-09
        • 1970-01-01
        • 1970-01-01
        • 1970-01-01
        • 2018-06-02
        • 2011-12-19
        相关资源
        最近更新 更多