【发布时间】:2018-02-17 19:21:59
【问题描述】:
我想从 df 的 column1 中的每个值中选择 5 行,以便输出对于 column2 中的每个唯一值至少有 1 个值。 输出中也不应该有任何重复
编辑: (column1, column3) 对中不应有重复项: 即对于 column1 中的每个值,column3 中的所有值都应该是唯一的
column1 = rep(c("a","b"), each = 12)
column2 = rep(c(1,2,3), each = 4)
column3 = c("x1","x2","x3","x4","x5","x3","x6","x7","x8","x1","x9","x5","x6","x2","x3","x4","x7","x5","x6","x1","x4","x1","x6","x9")
df = data.frame(column1, column2, column3)
这是一个有效的解决方案
sample_output_1 = data.frame(column1 = rep(c("a","b"), each = 5),
column2 = c(1,1,2,2,3,1,1,2,2,3),
column3 = c("x1","x2","x5","x3","x8","x6","x2","x5","x1","x9"))
【问题讨论】: