【发布时间】:2016-10-27 16:51:39
【问题描述】:
这实际上是this question 的重复。但是,我想问一个非常具体的问题,关于根据我通过基本的“手动”编码实验获得的感知器系数绘制决策边界线。如您所见,从逻辑回归中提取的系数会产生一条不错的决策边界线:
基于glm() 结果:
(Intercept) test1 test2
1.718449 4.012903 3.743903
感知器实验的系数完全不同:
bias test1 test2
9.131054 19.095881 20.736352
为了方便回答,here is the data,这里是代码:
# DATA PRE-PROCESSING:
dat = read.csv("perceptron.txt", header=F)
dat[,1:2] = apply(dat[,1:2], MARGIN = 2, FUN = function(x) scale(x)) # scaling the data
data = data.frame(rep(1,nrow(dat)), dat) # introducing the "bias" column
colnames(data) = c("bias","test1","test2","y")
data$y[data$y==0] = -1 # Turning 0/1 dependent variable into -1/1.
data = as.matrix(data) # Turning data.frame into matrix to avoid mmult problems.
# PERCEPTRON:
set.seed(62416)
no.iter = 1000 # Number of loops
theta = rnorm(ncol(data) - 1) # Starting a random vector of coefficients.
theta = theta/sqrt(sum(theta^2)) # Normalizing the vector.
h = theta %*% t(data[,1:3]) # Performing the first f(theta^T X)
for (i in 1:no.iter){ # We will recalculate 1,000 times
for (j in 1:nrow(data)){ # Each time we go through each example.
if(h[j] * data[j, 4] < 0){ # If the hypothesis disagrees with the sign of y,
theta = theta + (sign(data[j,4]) * data[j, 1:3]) # We + or - the example from theta.
}
else
theta = theta # Else we let it be.
}
h = theta %*% t(data[,1:3]) # Calculating h() after iteration.
}
theta # Final coefficients
mean(sign(h) == data[,4]) # Accuracy
问题:如果我们只有感知器系数,如何绘制边界线(就像我上面使用逻辑回归系数所做的那样)?
【问题讨论】:
标签: r plot machine-learning