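【Posted】: 2020-10-01 17:38:43
【Question】:
I am trying to find out how many X values each ID has, given that some values are repeated, and then use that result to find the overall minimum, maximum, IQR, and median: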
ID <- c("ID004", "ID004", "ID004", "ID004", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID009", "ID009", "ID009", "ID009", "ID009", "ID009", "ID020", "ID020")
D <- c("CMP-001", "CMP-001","CMP-001","CMP-001","CMP-001", "CMP-001","CMP-002", "CMP-002", "CMP-002", "CMP-003", "CMP-003", "CMP-003", "CMP-004", "CMP-004", "CMP-004", "CMP-001", "CMP-001", "CMP-001", "CMP-001", "CMP-002", "CMP-002", "CMP-001", "CMP-001")
X <- c(3,3,3,3,1,1,3,3,3,1,1,1,4,4,4,4,4,4,4,2,2,2,2)
data <- data.frame(ID, D, X)
First, we find how many X values each ID has:
ID       No. of X values
ID004    1
ID006    4
ID009    2
ID020    1
Based on that result, we should then get the following:
                     Min   Median   Max   IQR
Number of X per ID   1     1.5      4     3-1
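I think we need to create a new variable containing the number of X values for each ID, and then compute summary statistics on that new variable.
Thanks for your help.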
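One possible way to sketch the idea described above in base R. This assumes that "X values per ID" means one X value per distinct D within an ID, since that reading reproduces the count of 4 for ID006; the names `pairs`, `per_id`, `n_x`, and `stats` are made up for illustration. It also assumes R's default quantile definition (type = 7), so the quartiles may differ from a hand calculation.

```r
# Example data copied from the question
ID <- c("ID004", "ID004", "ID004", "ID004", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID006", "ID009", "ID009", "ID009", "ID009", "ID009", "ID009", "ID020", "ID020")
D <- c("CMP-001", "CMP-001", "CMP-001", "CMP-001", "CMP-001", "CMP-001", "CMP-002", "CMP-002", "CMP-002", "CMP-003", "CMP-003", "CMP-003", "CMP-004", "CMP-004", "CMP-004", "CMP-001", "CMP-001", "CMP-001", "CMP-001", "CMP-002", "CMP-002", "CMP-001", "CMP-001")
X <- c(3, 3, 3, 3, 1, 1, 3, 3, 3, 1, 1, 1, 4, 4, 4, 4, 4, 4, 4, 2, 2, 2, 2)
data <- data.frame(ID, D, X)

# Assumption: repeated rows collapse to one row per distinct (ID, D, X)
pairs <- unique(data[, c("ID", "D", "X")])

# New variable: number of remaining X values for each ID
per_id <- aggregate(X ~ ID, data = pairs, FUN = length)
names(per_id)[2] <- "n_x"
print(per_id)

# Summary statistics of the per-ID counts (default quantile type = 7)
stats <- c(
  Min    = min(per_id$n_x),
  Median = median(per_id$n_x),
  Max    = max(per_id$n_x),
  Q1     = unname(quantile(per_id$n_x, 0.25)),
  Q3     = unname(quantile(per_id$n_x, 0.75)),
  IQR    = IQR(per_id$n_x)
)
print(stats)

# Quartiles under an alternative definition (type = 2), for comparison
quartiles_t2 <- quantile(per_id$n_x, c(0.25, 0.75), type = 2)
print(quartiles_t2)
```

With this data the per-ID counts come out as 1, 4, 2, 1. Under the default quantile type the quartiles are 1 and 2.5, while `type = 2` gives 1 and 3, which appears to be what the "3-1" in the expected output refers to. If plain distinct X values per ID are wanted instead, dropping `D` from the `unique()` call would give 3 for ID006 rather than 4.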
【Discussion】: