【发布时间】:2017-11-21 16:45:14
【问题描述】:
我习惯了 SPSS,我真的很喜欢使用自定义表格来报告调查数据。如果我能在 R 中做类似的事情,我会很高兴。
我想做的是一个包含多行和多列的表格,其中包含列百分比和计数(N - 百分比的基数)
这是一个调查数据的示例代码:
set.seed(321)
ID <- seq(1:200)
Age <- sample(c("18-34", "35-59"), 200, replace = TRUE)
Sex <- sample(c("Male", "Female"), 200, replace = TRUE)
TOTAL <- rep(c("TOTAL"), 200)
Edu <- sample(c("Lower", "Middle", "Higher"), 200, replace = TRUE)
PurchaseInt <- sample(c("Definitely yes", "Somewhat yes", "Somewhat not", "Definitely not"),200, replace=TRUE)
Relevance <- sample(c("Definitely fits my needs", "Somewhat fits my needs", "Somewhat does not fit", "Definitely does not fit"),200, replace=TRUE)
DF <- data.frame(ID,TOTAL,Sex,Age,Edu,PurchaseInt,Relevance)
head(DF)
ID TOTAL Sex Age Edu PurchaseInt Relevance
1 1 TOTAL Male 35-59 Lower Definitely yes Somewhat fits my needs
2 2 TOTAL Male 35-59 Higher Somewhat not Definitely does not fit
3 3 TOTAL Male 18-34 Higher Definitely yes Somewhat does not fit
4 4 TOTAL Female 18-34 Lower Somewhat not Definitely does not fit
5 5 TOTAL Female 18-34 Higher Definitely yes Somewhat does not fit
6 6 TOTAL Female 18-34 Higher Definitely not Definitely does not fit
# Simple table, 1 variable by 1 variable, no N (BASE) BAD TABLE :(
prop.table(table(DF$PurchaseInt, DF$Sex),2)
Female Male
Definitely not 0.28 0.30
Definitely yes 0.25 0.28
Somewhat not 0.29 0.24
Somewhat yes 0.17 0.18
我真的很想得到这样的东西(从 SPSS 完成):
我意识到将计数与 col 百分比结合起来可能会更加棘手。对我来说至关重要的是找到在一个表中报告多行和多列(尤其是多列)的可能性,因为这有助于数据分析很多。
【问题讨论】:
-
可能类似于
prop.table(ftable(PurchaseInt + Relevance ~ Sex + Age + Edu, DF), margin = 2) * 100,虽然我不认为是这样 -
您可以使用
kableExtra中的分组列和行来执行此操作。 cran.r-project.org/web/packages/kableExtra/vignettes/…
标签: r