【问题标题】:Ambari shows zeppelin server not started but the server is actually up and runningAmbari 显示 zeppelin 服务器未启动,但服务器实际上已启动并正在运行
【发布时间】:2016-10-05 17:53:26
【问题描述】:

我使用的是 HDP 2.4.2,并且之前安装了 zeppelin 服务器。它工作正常,但今天当我重新启动集群(AWS 节点已重新启动)时,Ambari 显示 Zeppelin 服务器没有运行并且无法启动服务器并出现以下错误:

Traceback (most recent call last):
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.4/services/ZEPPELIN/package/scripts/master.py", line 235, in <module>
    Master().execute()
  File "/usr/lib/python2.6/site-packages/resource_management/libraries/script/script.py", line 219, in execute
    method(env)
  File "/var/lib/ambari-agent/cache/stacks/HDP/2.4/services/ZEPPELIN/package/scripts/master.py", line 169, in start
    + params.zeppelin_log_file, user=params.zeppelin_user)
  File "/usr/lib/python2.6/site-packages/resource_management/core/base.py", line 154, in __init__
    self.env.run()
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 158, in run
    self.run_action(resource, action)
  File "/usr/lib/python2.6/site-packages/resource_management/core/environment.py", line 121, in run_action
    provider_action()
  File "/usr/lib/python2.6/site-packages/resource_management/core/providers/system.py", line 238, in action_run
    tries=self.resource.tries, try_sleep=self.resource.try_sleep)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 70, in inner
    result = function(command, **kwargs)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 92, in checked_call
    tries=tries, try_sleep=try_sleep)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 140, in _call_wrapper
    result = _call(command, **kwargs_copy)
  File "/usr/lib/python2.6/site-packages/resource_management/core/shell.py", line 291, in _call
    raise Fail(err_msg)
resource_management.core.exceptions.Fail: Execution of '/usr/hdp/current/zeppelin-server/lib/bin/zeppelin-daemon.sh start >> /var/log/zeppelin/zeppelin-setup.log' returned 1. /usr/hdp/current/zeppelin-server/lib/bin/zeppelin-daemon.sh: line 187: /var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid: Permission denied
cat: /var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid: No such file or directory

在飞艇日志中:

错误 [2016-06-06 03:20:36,714] ({main} VFSNotebookRepo.java[list]:140) - 无法读取注释文件://usr/hdp/current/zeppelin-server/ lib/notebook/screenshots java.io.IOException: file:///usr/hdp/current/zeppelin-server/lib/notebook/screenshots/note.json not found

错误 [2016-06-06 03:34:12,795] ({main} Notebook.java[loadNoteFromRepo]:330) - 无法加载 2BHU1G67J java.io.IOException: file:///usr/hdp/current /zeppelin-server/lib/notebook/2BHU1G67J 不是目录

但由于某种原因,zeppelin 端口正在侦听,尽管存在这些错误,但 zeppelin 服务器运行良好并执行所有查询。请就如何纠正 Ambari 中的问题并从 ambari 无错误地启动服务提出建议。

【问题讨论】:

    标签: ambari apache-zeppelin


    【解决方案1】:

    问题出在 zeppelin 服务的 PID 文件上。它要么归错误的用户所有,要么拥有错误的权限。手动停止 zeppelin 服务,然后删除位于:/var/run/zeppelin-notebook/zeppelin-zeppelin-ip-10-0-0-11.eu-west-1.compute.internal.pid 的 pid 文件。仔细检查/var/run/zeppelin-notebook 文件夹的所有者/权限。然后,您应该能够在 Ambari UI 中重新启动服务。

    【讨论】:

    • 谢谢 cjackson!我想了很多.. 但我发现问题是因为 zeppelin 用来查找 pid 文件的默认位置是 /var/run/zeppelin 而不是 /var/run/zeppelin-notebook。一旦我改变了它,服务就从 Ambari 开始。我也向开发人员报告了同样的错误。
    猜你喜欢
    • 2021-08-29
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2020-11-17
    • 2011-07-10
    • 1970-01-01
    相关资源
    最近更新 更多