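[Posted]: 2015-09-08 05:58:52
[Question]:
I noticed that when I kill the ZooKeeper leader, the Spark master stops responding (leader election is, of course, delegated to ZooKeeper). Below is the error log I see on the Spark master node. Do you have any suggestions for resolving this?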
15/06/22 10:44:00 INFO ClientCnxn: Unable to read additional data from server sessionid 0x14dd82e22f70ef1, likely server has closed socket, closing socket connection and attempting reconnect
15/06/22 10:44:00 INFO ClientCnxn: Unable to read additional data from server sessionid 0x24dc5a319b40090, likely server has closed socket, closing socket connection and attempting reconnect
15/06/22 10:44:01 INFO ConnectionStateManager: State change: SUSPENDED
15/06/22 10:44:01 INFO ConnectionStateManager: State change: SUSPENDED
15/06/22 10:44:01 WARN ConnectionStateManager: There are no ConnectionStateListeners registered.
15/06/22 10:44:01 INFO ZooKeeperLeaderElectionAgent: We have lost leadership
15/06/22 10:44:01 ERROR Master: Leadership has been revoked -- master shutting down.
[Discussion]:
- What are the exact values of your spark.deploy.recoveryMode and spark.zookeeper.url configuration parameters? Did you start with --supervise? What is your cluster manager?
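For context on what the comment is asking about: in standalone mode, ZooKeeper-based master recovery is normally enabled through `SPARK_DAEMON_JAVA_OPTS` in `conf/spark-env.sh` on every master node. The fragment below is only a minimal sketch. The ZooKeeper hostnames and the `/spark` directory are placeholders, not values taken from the question, and the documented property name for the ensemble address is `spark.deploy.zookeeper.url`.

```shell
# conf/spark-env.sh (sketch; hostnames and the znode directory are assumed placeholders)
# Enable ZooKeeper-backed recovery for the standalone Spark master.
export SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER \
-Dspark.deploy.zookeeper.url=zk1.example.com:2181,zk2.example.com:2181,zk3.example.com:2181 \
-Dspark.deploy.zookeeper.dir=/spark"
```

With this mode, a master that loses leadership shuts itself down, as the log above shows, so failover depends on at least one standby master running on another node with the same settings.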
Tags: apache-spark apache-zookeeper