【发布时间】:2021-04-14 21:10:20
【问题描述】:
它工作了一段时间,然后崩溃了CrashLoopBackOff。当它偶尔工作时,我会收到Unauthorized 错误。 5 到 10 分钟后,它会崩溃。
Error from server (InternalError): an error on the server ("Internal Server Error: \"/apis/metrics.k8s.io/v1beta1/nodes\": Unauthorized") has prevented the request from succeeding (get nodes.metrics.k8s.io)
我正在使用最新版本的 metric-server。
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 27m default-scheduler Successfully assigned kube-system/metrics-server-59ff97d56-xjbh4 to gke-test-test-node-pool-05539c92-26z1
Normal Created 20m (x3 over 27m) kubelet Created container metrics-server
Normal Started 20m (x3 over 27m) kubelet Started container metrics-server
Warning Unhealthy 20m (x7 over 21m) kubelet Liveness probe failed: HTTP probe failed with statuscode: 500
Warning Unhealthy 20m (x8 over 21m) kubelet Readiness probe failed: HTTP probe failed with statuscode: 500
Normal Killing 12m (x8 over 20m) kubelet Container metrics-server failed liveness probe, will be restarted
Normal Pulled 7m19s (x9 over 27m) kubelet Container image "k8s.gcr.io/metrics-server/metrics-server:v0.4.1" already present on machine
Warning BackOff 2m15s (x71 over 18m) kubelet Back-off restarting failed container
我尝试按照其他人answers 的建议更改设置,但它们都不起作用。还有其他建议吗?
135a136,137
> - --kubelet-insecure-tls
> - --kubelet-preferred-address-types=InternalIP
151a154
> initialDelaySeconds: 300
【问题讨论】:
-
你的 GKE 版本是多少?
-
您最近是否对集群进行了任何更改?您是否更改了默认指标服务器参数?您使用的是哪个 GKE 版本,是否进行了任何升级?您能否提供来自 metrics pod 的日志?
-
只是为了澄清您正在使用
Google Kubernetes Engine或者您已经使用Google Cloud VMs创建了 Kubeadm 集群?
标签: kubernetes google-cloud-platform google-kubernetes-engine