【发布时间】:2026-01-07 15:20:02
【问题描述】:
我有以下列标题的数据:
Sheet Number, Year, Term, Class, Debate #, Role in Debate, Gender of Evaluator, Person #1 Clarity, Person #1 Effort, Person #1 Gender, Person #1 Origin, Debater Number, Person #2 Clarity, Person #2 Effort, Person #2 Gender, Person #2 Origin, Debater Number, Person #3 Clarity, Person #3 Effort, Person #3 Gender, Person #3 Origin, Debater Number, Person #4 Clarity, Person #4 Effort, Person #4 Gender, Person #4 Origin, Debater Number, Person #5 Clarity, Person #5 Effort, Person #5 Gender, Person #5 Origin, Debater Number, Person #6 Clarity, Person #6 Effort, Person #6 Gender, Person #6 Origin, Debater Number, Person #7 Clarity, Person #7 Effort, Person #7 Gender, Person #7 Origin, Debater Number, Person #8 Clarity, Person #8 Effort, Person #8 Gender, Person #8 Origin, Debater Number, Learned from Team 1, Learned from Team 2, Who won?, Sheet all 10s?, Evaluator Class Year
我想将其转换为具有以下列标题的格式:
Sheet Number, Year, Term, Class, Debate #, Gender of Evaluator, Evaluator Class Year, Role in Debate, Debate Team Member #, Debater Number, Gender of Debate Team Member, Origin of Debate Team Member, Clarity of Debate Team Member, Effort of Debate Team Member, Learned from Team 1, Learned from Team 2, Who won?, Sheet all 10s?, =1 if Gender of Evaluator==Gender of Debater
两者之间的主要区别在于,在第一种格式中,每个工作表编号都有 5-8 个编号的“人员”与之关联。第二种格式,每个表号都有一个与之关联的人(因此每个表号出现多次并且数据被“扩展”)。
我如何在 R 中实现这一点?我一直在尝试使用“重塑”包。谢谢!
【问题讨论】:
-
欢迎来到 *!尤其是这类数据的“宽度”,我敦促你做一个reproducible minimal working example。在这种情况下,将问题推广到少数几列,您可以在不淹没我们的情况下获得概念。 (此外,提供具有代表性的数据结构非常有帮助。)
-
所以原始格式类似于
Sheet Number, Person #1 Clarity, Person #1 Effort, Person #1 Gender, Person #1 Origin, Debater Number, Person #2 Clarity, Person #2 Effort, Person #2 Gender, Person #2 Origin, Debater Number, Person #3 Clarity, Person #3 Effort, Person #3 Gender, Person #3 Origin, Debater Number,我希望它类似于Sheet Number, Debate Team Member #, Debater Number, Gender of Debate Team Member, Origin of Debate Team Member, Clarity of Debate Team Member, Effort of Debate Team Member(即每个工作表编号仅与一个人相关联) -
这不是一个最小的可重现示例。我建议你: 1. 做一个小数据集。 2. 发布您针对数据集运行的 R 代码及其生成的内容。 3. 你期望它产生什么。那么我们应该能够为您提供帮助。