R：使用变量名而不是值进行变异答案

【问题标题】：R: Mutate using of variable name instead of valueR：使用变量名而不是值进行变异
【发布时间】：2017-10-26 08:01:34
【问题描述】：

我正在尝试创建一个循环并为每次迭代（其数量可能因源文件而异）构造一个 mutate 语句以根据另一列的值添加一列。

有我的 php 编程背景，在我看来这应该可行：

for(i in number){
         colname <- paste("Column",i,sep="")
         filtercol <- paste("DateDiff_",i,sep="")
         dataset <- mutate(dataset, a = ifelse(b >= 0 & b <= 364,1,NA))
     }

但是...正如我现在已经注意到的几次 R 函数有时该函数会完全忽略您已经定义了一个具有该名称的变量 - 因为mutate() 在这里。

因此，我没有得到标题为“a1”、“a2”、“a3”等的几列，而是得到一个标题为“a”的列，每次迭代都会被覆盖。

首先，有人可以向我指出我在哪里出错了，但是其次有人可以向我解释在什么情况下 R 忽略变量名，因为它已经发生了几次，而且看起来非常不一致观点。我敢肯定它不是，这里面有逻辑，但它肯定被很好地混淆了。

还值得一提的是，我最初是这样尝试的：

just.dates <- just.dates %>%
     for(i in number){
         a <- paste("a",i,sep="")
         filtercol <- paste("DateDiff_",i,sep="")
         mutate(a = ifelse(filtercol >= 0 & filtercol <= 364),1,NA)
     }

但这种方式决定了我在 for() 循环中只需要三个参数时传递了 4 个参数。

【问题讨论】：

也许这会有所帮助：stackoverflow.com/questions/26003574/…。这个想法是字符串与变量不同。并且 R 中的某些函数使用非标准评估，其中变量被视为符号名称，而不是像往常一样评估。此外，当使用命名参数调用函数时，变量永远不会在等号（参数名称）的左侧进行计算。
a. 顶级版本中colname 和filtercol 的意义何在？ b. 如果名称已存在，mutate 将覆盖列。 c. 在 R 中编写代码几乎总是比for 循环更好的方法。在这里，我可以使用purrr::map_df 制作一个data.frame 并使用bind_cols，但有很多选择。 d. 如果您真的想在 dplyr 中将字符串变量作为参数传递，则需要使用标准评估。 0.5 意味着mutate_ 和lazyeval；即将推出的 0.6 意味着 rlang。
和 e. You should make your example reproducible.
使用base R 来做这件事很直接。你真的不需要一个包来做这个。使用DF 作为您的数据框：DF[[a]] <- ifelse(DF[[filtercol]] >= 0 & DF[[filtercol]] <= 364, 1, NA)

标签： r loops for-loop dplyr

【解决方案1】：

这样的事情可能对你有用。 mutate_() 函数而不是 mutate() 应该可以帮助您解决这个问题。

# Create dataframe for testing
dataset <- data.frame(date = as.Date(c("06/07/2000","15/09/2000","15/10/2000","03/01/2001","17/03/2001",
                                       "06/08/2010","15/09/2010","15/10/2010","03/01/2011","17/03/2011"), "%d/%m/%Y"),
                      event=c(0,0,1,0,1, 1,0,1,0,1),
                      id = c(rep(1,5),rep(2,5)),
                      DateDiff_1 = c(-2,0,34,700,rep(5,6)), 
                      DateDiff_2 = c(20,-12,360,900,rep(5,6))
                     )

# Set test number vector
number <- c(1:2)

# Begin loop through numbers
for(i in number){
  # Set the name of the new column to be created
  newcolumn <- paste("Column",i,sep="")

  # Set the name of the column to be filtered
  filtercolumn <- paste("DateDiff_",i,sep="")

  # Create the function to be passed into the mutate command
  mutate_function = lazyeval::interp(~ ifelse(fc >= 0 & fc <= 364, 1, NA), fc = as.name(filtercolumn))

  # Apply the mutate command to the dataframe
  dataset  <- dataset  %>% 
              mutate_(.dots = setNames(list(mutate_function), newcolumn)) 
}

【讨论】：