【发布时间】:2020-07-27 04:51:09
【问题描述】:
我有一个只有 NA 值的数据集,我正在尝试生成一个表格,显示该特定数据集 100% 缺失。
但输出显示 NA 值同时计为“1”和“0”。此代码适用于不包含缺失值的不同数据子集。为什么这个数据集不一样?
t1 <- data.frame(characteristic = rep(NA, 5), year = sample(x = 1990:1995, size = 100, replace = TRUE))
t1 %>%
select(YEAR, CHARACTERISTIC) %>%
group_by(YEAR) %>%
mutate(YES = length(CHARACTERISTIC[CHARACTERISTIC == "1"]),
NO = length(CHARACTERISTIC[CHARACTERISTIC == "0"]),
COUNT = n(),
MISSING = sum(is.na(CHARACTERISTIC))) %>%
summarize(CHARACTERISTIC = paste(round(first(YES / COUNT) * 100, 2), "%"),
NO_CHARACTERISTIC= paste(round(first(NO / COUNT) * 100, 2), "%"),
MISSING = paste(round(first(MISSING / COUNT) * 100, 2), "%"))
【问题讨论】: