[Title]: Mesos agent always in Deactivated state
[Posted]: 2018-02-23 22:39:51
[Question]:

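I deployed a Mesos cluster on two virtual machines in VMware Workstation:

  • heron01 (IP 192.168.201.131): runs the Mesos master and ZooKeeper
  • heron02 (IP 192.168.201.128): runs the Mesos slave

However, the slave always stays in the Deactivated state. The Mesos master ERROR log is as follows: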

Log file created at: 2018/02/18 02:08:35
Running on machine: ubuntu
Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
W0218 02:08:35.859475  5857 authenticator.cpp:513] No credentials provided, authentication requests will be refused
E0218 02:08:40.518481  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39882: Transport endpoint is not connected
E0218 02:08:40.523883  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39884: Transport endpoint is not connected
W0218 02:08:40.532027  5854 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S0 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:41.131724  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39886: Transport endpoint is not connected
W0218 02:08:41.135860  5857 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S1 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:41.580379  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39888: Transport endpoint is not connected
E0218 02:08:41.583258  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39890: Transport endpoint is not connected
W0218 02:08:41.585355  5858 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S2 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:08:48.556628  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39892: Transport endpoint is not connected
E0218 02:08:48.562399  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39894: Transport endpoint is not connected
E0218 02:08:48.566049  5859 process.cpp:2577] Failed to shutdown socket with fd 28, address 127.0.0.1:39896: Transport endpoint is not connected
W0218 02:08:48.567793  5853 master.cpp:7557] Master returning resources offered because agent eae9d24b-3cf3-4a0b-9546-dfde4288fbc8-S3 at slave(1)@127.0.1.1:5051 (ubuntu) is disconnected
E0218 02:09:00.063712  5859 process.cpp:2577] Failed to shutdown socket with fd 35, address 127.0.0.1:39914: Transport endpoint is not connected

The Mesos slave WARNING log is as follows:

Log file created at: 2018/02/17 08:25:51
Running on machine: ubuntu
Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
E0217 08:25:51.034782 48017 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45090: Transport endpoint is not connected
E0217 08:25:51.040766 48017 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45092: Transport endpoint is not connected
W0217 08:25:51.041786 48017 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:51.631784 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45094: Transport endpoint is not connected
W0217 08:25:51.632076 48017 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
W0217 08:25:52.095075 48011 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:52.095427 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45096: Transport endpoint is not connected
W0217 08:25:59.021628 48012 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:25:59.022001 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45098: Transport endpoint is not connected
W0217 08:26:10.564131 48016 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:10.564538 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45100: Transport endpoint is not connected
W0217 08:26:12.141916 48012 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:12.142215 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45102: Transport endpoint is not connected
W0217 08:26:39.090140 48018 slave.cpp:5010] Master disconnected! Waiting for a new master to be elected
E0217 08:26:39.090345 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45104: Transport endpoint is not connected
E0217 08:27:38.279918 48019 process.cpp:2577] Failed to shutdown socket with fd 8, address 192.168.201.129:45106: Transport endpoint is not connected


I configure the cluster by editing the configuration files. The configuration is as follows.
1. mesos-master-env.sh

export MESOS_log_dir=/home/yitian/mesosdata/log
export MESOS_work_dir=/home/yitian/mesosdata/data
export MESOS_ZK=zk://heron01:2181/mesos
export MESOS_quorum=1

2. mesos-slave-env.sh and mesos-agent-env.sh

export MESOS_master=heron01:5050
export MESOS_log_dir=/home/yitian/mesosdata/log
export MESOS_work_dir=/home/yitian/mesosdata/run

3. masters

heron01

4. slaves

heron02

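In addition, the hostnames and IPs have been added to /etc/hosts, and both hosts use the same configuration files. How can I fix this? Thanks for your help!

[Comments]:

    Tags: cluster-computing mesos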


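    [Answer 1]:

    I believe you have not set the master IP correctly; the commands below show a correct setup. Also, FYI, you cannot use 127.0.0.1 if you are using zk.

    Master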

    mesos-master --ip=192.168.201.131 --work_dir=/tmp/mesos
    

    Agent

    mesos-agent --ip=192.168.201.128 --master=192.168.201.131:5050 --work_dir=/tmp/mesos
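
Since the question configures the cluster through env files rather than command-line flags, the same settings could probably be expressed there instead. The following is only a sketch: it assumes the usual Mesos convention that a flag such as `--ip` can be supplied as the environment variable `MESOS_ip`, and it reuses the addresses from the question. Note that the two hosts would then no longer share identical files, because each host exports its own address.

```shell
# mesos-master-env.sh on heron01 (assumed addition next to the existing exports)
export MESOS_ip=192.168.201.131

# mesos-agent-env.sh on heron02 (assumed addition; the master is given by IP
# so that the agent does not depend on hostname resolution)
export MESOS_ip=192.168.201.128
export MESOS_master=192.168.201.131:5050
```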
    

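    [Discussion]:

    • I configured the cluster by editing the configuration files. I forgot to explain this earlier and have now edited the question. Thanks for your help.
    • @YitianZhang From the line "Running on machine: ubuntu" in your logs, it looks like both machines identify themselves as ubuntu. I guess ubuntu was the initial hostname chosen when you installed the OS. Mesos therefore picks up ubuntu as the node name, which then resolves to a loopback address (your master log shows the agent at 127.0.1.1). Could you set the IP directly to confirm?
    • Thank you very much. You are right. After I edited /etc/hostname to give each host its own hostname, the problem was solved. I did not try fixing it by changing the IP address in the Mesos configuration files, but that should work too.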
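
The diagnosis in this thread can be checked on each host with a short script. This is a minimal sketch, not part of the original answer: `is_loopback` is a hypothetical helper, and the script assumes the standard `hostname`, `getent`, and `awk` utilities are available. It reports whether the machine's own hostname resolves to a loopback address, which is the situation the master log shows (`slave(1)@127.0.1.1:5051 (ubuntu)`).

```shell
#!/bin/sh
# Hypothetical helper: succeed (exit status 0) when the given address
# is a loopback address (127.0.0.0/8 or ::1).
is_loopback() {
    case "$1" in
        127.*|::1) return 0 ;;
        *)         return 1 ;;
    esac
}

# Look up the address that this machine's own hostname resolves to.
# getent consults /etc/hosts and DNS in the order set by nsswitch.conf.
name="$(hostname)"
addr="$(getent hosts "$name" | awk '{print $1; exit}')"

if [ -z "$addr" ]; then
    echo "$name does not resolve to any address"
elif is_loopback "$addr"; then
    echo "$name resolves to $addr (loopback): other hosts cannot reach it"
else
    echo "$name resolves to $addr"
fi
```

If the script reports a loopback address on the agent, either giving each host a distinct hostname that maps to its real IP in /etc/hosts (the fix the asker used) or passing the IP explicitly should avoid the problem.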