【问题标题】:Unresponsive GCP VM after suspend/resume - corrupted Boot Disk or DHCP config挂起/恢复后 GCP 虚拟机无响应 - 启动磁盘或 DHCP 配置损坏
【发布时间】:2021-03-24 19:59:46
【问题描述】:

我最近在 GCP 中暂停了一个虚拟机。 几天后,我恢复了它,但我无法以任何方式连接实例。

使用指向外部 IP 的 ping 命令,我没有得到任何响应。当然,使用 Web 浏览器发送 HTTP 请求也不起作用。

如果我尝试使用 SSH 连接(使用 gcloud 或浏览器窗口),我会收到连接超时 (Putty) 或 Code: 4003 Reason: failed to connect to backend。我确实尝试将 SSH 公钥再次显式添加到 VM 设置中。另外,我有一个用于不同环境的虚拟机,我从未暂停过,也没有发出连接。

我检查了防火墙规则,一切似乎都正常。

如果我启用并连接到串行控制台,我会看到它不断记录这些奇怪的消息:

Mar 23[28087280.507763] sd 0:0:1:0: rejecting I/O to offline device
 05:41:56 xxx[28087280.515939] sd 0:0:1:0: rejecting I/O to offline device
 dhclient[600]: DHCPDISCOVER on eth0 to 255.255.255.255 port 67 [28087280.527691] sd 0:0:1:0: rejecting I/O to offline device
interval 8
Mar [28087280.535079] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 xxx dhclient[600]: DHCPREQUEST of 10.xxx.xxx.xxx on eth0 to 255.255.2[28087280.547472] sd 0:0:1:0: rejecting I/O to offline device
55.255 port 67
[28087280.555395] sd 0:0:1:0: rejecting I/O to offline device
Mar 23 05:41:57 xxx dhclient[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx[28087280.567655] sd 0:0:1:0: rejecting I/O to offline device
54
Mar 23 05:41[28087280.575244] sd 0:0:1:0: rejecting I/O to offline device
:57 xxx dhclient[600]: DHCPACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:[28087280.710564] sd 0:0:1:0: rejecting I/O to offline device
41:57 xxx dhc[28087280.717131] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.724725] sd 0:0:1:0: rejecting I/O to offline device
DECLINE on eth0 [28087280.731449] sd 0:0:1:0: rejecting I/O to offline device
to 255.255.255.2[28087280.738646] sd 0:0:1:0: rejecting I/O to offline device
55 port 67
Mar [28087280.745723] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 hoc-[28087280.752348] sd 0:0:1:0: rejecting I/O to offline device
qa dhclient[600][28087280.759874] sd 0:0:1:0: rejecting I/O to offline device
: DHCPDISCOVER o[28087280.766707] sd 0:0:1:0: rejecting I/O to offline device
n eth0 to 255.25[28087280.773666] sd 0:0:1:0: rejecting I/O to offline device
5.255.255 port 6[28087280.780721] sd 0:0:1:0: rejecting I/O to offline device
7 interval 4
Ma[28087280.787709] sd 0:0:1:0: rejecting I/O to offline device
r 23 05:41:57 ho[28087280.794742] sd 0:0:1:0: rejecting I/O to offline device
c-qa dhclient[60[28087280.801786] sd 0:0:1:0: rejecting I/O to offline device
0]: DHCPREQUEST [28087280.808534] sd 0:0:1:0: rejecting I/O to offline device
of 10.xxx.xxx.xxx on[28087280.815978] sd 0:0:1:0: rejecting I/O to offline device
 eth0 to 255.255[28087280.822758] sd 0:0:1:0: rejecting I/O to offline device
.255.255 port 67[28087280.829765] sd 0:0:1:0: rejecting I/O to offline device

Mar 23 05:41:57 xxx dhclien[28087280.963410] sd 0:0:1:0: rejecting I/O to offline device
t[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:41:57 xxx dhc[28087280.975106] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.982881] sd 0:0:1:0: rejecting I/O to offline device
ACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx

我已经看到一些 SSH 出现问题的线程,但这似乎是引导磁盘的某种错误,或者可能是 DHCP 配置错误。

【问题讨论】:

    标签: google-cloud-platform google-compute-engine


    【解决方案1】:

    以防万一有人遇到同样的问题,我现在通过在我的 VM 上执行 RESET 操作来解决它。

    在努力寻找(未成功)一个合适的解决方案之后,我认为这将是更简单的解决方案,特别是请记住我的虚拟机来自暂停状态。

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 2015-11-14
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 1970-01-01
      • 2020-06-10
      • 2019-06-04
      相关资源
      最近更新 更多