【Question title】: How to create clustering plots with long and wide format data for multiple variables
【Posted】: 2020-05-07 06:29:25
【Question description】:

I have a dataset like this:

data.frame(id = c(1,1,1,2,2,3) snames  = c("stockA","stockB","stockC","stockA","stockB","stockc"), var1 = c(0.13,1.2,-1.5,3.45,-0.26,-2.1), var2 = c(-2.1,2.34,3.56,-1.53,-0.48,-0.29), var3 = c(0.04,-3.45,-0.22,-0.29,1.34,0.32), var4 = c(2.14,-1.34,-4.35,-1.56,0.13,-2.35), var5 = c(1.53,1.24,-0,32,-0.3,-4.25,-2.49))

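How can I create clusters using long- and wide-format data at the same time?

With this data, is there a way to cluster the values of the snames column together with var1, var2, var3, var4 and var5, so as to find groups, for example a first cluster that contains stockB together with var2 and var3?

Something like the one from here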

【Comments】:

    Tags: r cluster-analysis


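    【Solution 1】:

    I have been working with your code, but you need to provide more information before I can answer.

    First, the code you provided contains typos; I have fixed them here.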

    df <- data.frame(
      id     = c(1, 1, 1, 2, 2, 3),
      snames = c("stockA", "stockB", "stockC", "stockA", "stockB", "stockC"),
      var1   = c(0.13, 1.2, -1.5, 3.45, -0.26, -2.1),
      var2   = c(-2.1, 2.34, 3.56, -1.53, -0.48, -0.29),
      var3   = c(0.04, -3.45, -0.22, -0.29, 1.34, 0.32),
      var4   = c(2.14, -1.34, -4.35, -1.56, 0.13, -2.35),
      var5   = c(1.53, 1.24, -0.32, -0.3, -4.25, -2.49)
    )
    # Fixes: you wrote stockc instead of stockC, var5 contained -0,32 where it
    # needs to be -0.32, and the comma after the id vector was missing
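The corrected data frame is in wide format (one column per variable). Since the question asks about long and wide format, here is a minimal sketch of producing the long form with base R only. The names `vars`, `long`, `value` and `variable` are illustrative choices, not part of the original post.

```r
# Corrected data from the answer (wide format: one column per variable)
df <- data.frame(
  id     = c(1, 1, 1, 2, 2, 3),
  snames = c("stockA", "stockB", "stockC", "stockA", "stockB", "stockC"),
  var1   = c(0.13, 1.2, -1.5, 3.45, -0.26, -2.1),
  var2   = c(-2.1, 2.34, 3.56, -1.53, -0.48, -0.29),
  var3   = c(0.04, -3.45, -0.22, -0.29, 1.34, 0.32),
  var4   = c(2.14, -1.34, -4.35, -1.56, 0.13, -2.35),
  var5   = c(1.53, 1.24, -0.32, -0.3, -4.25, -2.49)
)

vars <- paste0("var", 1:5)  # names of the measurement columns

# stack() concatenates the five columns into one "values" column and
# records the source column in a factor "ind"; the id/snames columns are
# repeated once per variable so that they line up with the stacked values.
long <- cbind(
  df[rep(seq_len(nrow(df)), times = length(vars)), c("id", "snames")],
  stack(df[vars])
)
names(long)[names(long) == "values"] <- "value"
names(long)[names(long) == "ind"] <- "variable"
rownames(long) <- NULL

head(long)
```

The wide form is usually what distance-based clustering functions expect, while the long form tends to be more convenient for plotting one point per measurement.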
    

    Please provide the code you used to make the figure above.

    I believe you can simply set pch=levels(df$snames) and col=c("df$var1","df$var2","df$var3","df$var4","df$var5") in the plot command, and it should do what you want.
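The pch/col idea can be sketched as follows. This is one possible reading of the question, not the asker's actual plot: it assumes hierarchical clustering of the six observations on the standardised variables, an arbitrary choice of two clusters, and a scatter plot of var1 against var2. Note that `pch` expects integer symbol codes or single characters and `col` expects colour values, so the stock names are converted to a factor and then to integers; in R 4.0 and later, `data.frame()` no longer turns strings into factors, so `levels(df$snames)` would return `NULL` here.

```r
df <- data.frame(
  id     = c(1, 1, 1, 2, 2, 3),
  snames = c("stockA", "stockB", "stockC", "stockA", "stockB", "stockC"),
  var1   = c(0.13, 1.2, -1.5, 3.45, -0.26, -2.1),
  var2   = c(-2.1, 2.34, 3.56, -1.53, -0.48, -0.29),
  var3   = c(0.04, -3.45, -0.22, -0.29, 1.34, 0.32),
  var4   = c(2.14, -1.34, -4.35, -1.56, 0.13, -2.35),
  var5   = c(1.53, 1.24, -0.32, -0.3, -4.25, -2.49)
)

# Numeric matrix of the five variables, one labelled row per observation
m <- as.matrix(df[paste0("var", 1:5)])
rownames(m) <- paste(df$snames, df$id, sep = ".")

# Hierarchical clustering of the observations on the standardised variables
hc <- hclust(dist(scale(m)), method = "complete")
cluster <- cutree(hc, k = 2)  # k = 2 is an arbitrary illustrative choice

# Scatter plot: plotting symbol encodes the stock, colour encodes the cluster
stock <- factor(df$snames)
plot(df$var1, df$var2,
     pch = as.integer(stock),
     col = cluster,
     xlab = "var1", ylab = "var2")
legend("topright", legend = levels(stock), pch = seq_along(levels(stock)))

# heatmap() clusters rows (observations) and columns (variables) at once,
# which is one way to see which stocks group with which variables
heatmap(m, scale = "column")
```

With only six observations the resulting groups are illustrative at best; `plot(hc)` draws the dendrogram if a tree view is preferred.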

    【Comments】:
