【问题标题】:kubernetes dashboard CrashLoopBackOffKubernetes 仪表板 CrashLoopBackOff
【发布时间】:2021-05-12 00:42:00
【问题描述】:

kubernetes 的新手,但设法安装了 kubernetes、ubuntu 20.04 LTS,但仪表板有问题。按照程序,使用法兰绒作为CNF。

日志指出与 10.96.0.1:443 的连接存在问题,但 telnet 似乎可以工作?有什么建议可以走得更远吗?

bwa@prod3:~$ kubectl get pods -o wide --all-namespaces
NAMESPACE              NAME                                         READY   STATUS             RESTARTS   AGE   IP             NODE    NOMINATED NODE   READINESS GATES
kube-system            coredns-66bff467f8-jgmpl                     0/1     Running            1          27h   10.244.0.6     prod3   <none>           <none>
kube-system            coredns-66bff467f8-ldr9d                     0/1     Running            1          27h   10.244.0.9     prod3   <none>           <none>
kube-system            etcd-prod3                                   1/1     Running            1          27h   192.168.0.93   prod3   <none>           <none>
kube-system            kube-apiserver-prod3                         1/1     Running            1          27h   192.168.0.93   prod3   <none>           <none>
kube-system            kube-controller-manager-prod3                1/1     Running            1          27h   192.168.0.93   prod3   <none>           <none>
kube-system            kube-flannel-ds-amd64-xm26h                  1/1     Running            2          27h   192.168.0.93   prod3   <none>           <none>
kube-system            kube-proxy-7lk5d                             1/1     Running            1          27h   192.168.0.93   prod3   <none>           <none>
kube-system            kube-scheduler-prod3                         1/1     Running            1          27h   192.168.0.93   prod3   <none>           <none>
kubernetes-dashboard   dashboard-metrics-scraper-6b4884c9d5-xrdbh   1/1     Running            1          27h   10.244.0.7     prod3   <none>           <none>
kubernetes-dashboard   kubernetes-dashboard-7f99b75bf4-lfqtf        0/1     CrashLoopBackOff   310        27h   10.244.0.8     prod3   <none>           <none>


bwa@prod3:~$ kubectl logs kubernetes-dashboard-7f99b75bf4-lfqtf --namespace=kubernetes-dashboard --tail=100
    2020/08/05 12:02:31 Starting overwatch
    2020/08/05 12:02:31 Using namespace: kubernetes-dashboard
    2020/08/05 12:02:31 Using in-cluster config to connect to apiserver
    2020/08/05 12:02:31 Using secret token for csrf signing
    2020/08/05 12:02:31 Initializing csrf token from kubernetes-dashboard-csrf secret
    panic: Get "https://10.96.0.1:443/api/v1/namespaces/kubernetes-dashboard/secrets/kubernetes-dashboard-csrf": dial tcp 10.96.0.1:443: i/o timeout
    
    goroutine 1 [running]:
    github.com/kubernetes/dashboard/src/app/backend/client/csrf.(*csrfTokenManager).init(0xc00000c640)
            /home/runner/work/dashboard/dashboard/src/app/backend/client/csrf/manager.go:41 +0x446
    github.com/kubernetes/dashboard/src/app/backend/client/csrf.NewCsrfTokenManager(...)
            /home/runner/work/dashboard/dashboard/src/app/backend/client/csrf/manager.go:66
    github.com/kubernetes/dashboard/src/app/backend/client.(*clientManager).initCSRFKey(0xc00044f800)
            /home/runner/work/dashboard/dashboard/src/app/backend/client/manager.go:501 +0xc6
    github.com/kubernetes/dashboard/src/app/backend/client.(*clientManager).init(0xc00044f800)
            /home/runner/work/dashboard/dashboard/src/app/backend/client/manager.go:469 +0x47
    github.com/kubernetes/dashboard/src/app/backend/client.NewClientManager(...)
            /home/runner/work/dashboard/dashboard/src/app/backend/client/manager.go:550
    main.main()
            /home/runner/work/dashboard/dashboard/src/app/backend/dashboard.go:105 +0x20d
    bwa@prod3:~$ telnet 10.96.0.1 443
    Trying 10.96.0.1...
    Connected to 10.96.0.1.
    Escape character is '^]'.
    ^CConnection closed by foreign host.
    bwa@prod3:~$

【问题讨论】:

  • 从另一个 pod telnet 到该 IP 和端口是否有效?
  • 这个 Pod 的事件中有什么? (kubectl describe pod kubernetes-dashboard-7f99b75bf4-lfqtf) 10.96.0.1 是某个服务的地址吗?
  • IP 只是为了尝试错误: dial tcp 10.96.0.1:443: i/o timeout describe does not give any: bwa@prod3:~$ kubectl describe kubernetes-dashboard-7f99b75bf4-lfqtf错误:服务器没有资源类型“kubernetes-dashboard-7f99b75bf4-lfqtf”bwa@prod3:~$ kubectl describe pod kubernetes-dashboard-7f99b75bf4-lfqtf 服务器错误(未找到):pods“kubernetes-dashboard-7f99b75bf4 -lfqtf" 未找到 bwa@prod3:~$
  • @bwa,解决这个问题有什么进展吗?向您的问题添加任何其他信息(尤其是代码示例、命令输出等)时,请使用edit 选项,而不是在 cmets 中发布。它更具可读性。请注意,您的 pod 部署在 kubernetes-dashboard 命名空间中,因此您需要将 --namespace=kubernetes-dashboard 选项添加到 kubectl describe pod &lt;pod_name&gt;,与运行 kubectl logs 命令时的方式相同。

标签: kubernetes kubernetes-dashboard


【解决方案1】:

从该集群的外观来看,您没有安装网络插件 (CNI)。我在 kube-system 命名空间中没有看到任何 flannel pod,并且 coredns pod 没有启动。

这也可以解释为什么仪表板会出现恐慌,因为它无法通过 10.96.0.1 服务访问 K8s API 服务器。

你能检查一下 flannel 的安装(或者只是在集群上重新安装 flannel)吗?

【讨论】:

    猜你喜欢
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2020-01-23
    • 2020-06-02
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 2022-08-08
    相关资源
    最近更新 更多