suspend/resume 后 GCP VM 无响应 - 损坏的启动磁盘或 DHCP 配置
Unresponsive GCP VM after suspend/resume - corrupted Boot Disk or DHCP config
我最近在 GCP 中暂停了一个 VM。
几天后,我恢复了,但是无论如何都连接不上实例
使用指向外部IP的ping
命令,我没有得到任何回应。当然,使用网络浏览器发送 HTTP 请求也不行。
如果我尝试使用 SSH 连接(使用 gcloud
或浏览器 window),我会遇到连接超时 (Putty) 或 Code: 4003 Reason: failed to connect to backend
。我确实尝试再次编辑显式添加 SSH public 密钥到 VM 设置。另外,我有一个用于不同环境的虚拟机,我从未暂停过,也没有发布连接。
我检查了防火墙规则,似乎一切正常。
如果我启用并连接到串行控制台,我会看到它不断地记录这些奇怪的消息:
Mar 23[28087280.507763] sd 0:0:1:0: rejecting I/O to offline device
05:41:56 xxx[28087280.515939] sd 0:0:1:0: rejecting I/O to offline device
dhclient[600]: DHCPDISCOVER on eth0 to 255.255.255.255 port 67 [28087280.527691] sd 0:0:1:0: rejecting I/O to offline device
interval 8
Mar [28087280.535079] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 xxx dhclient[600]: DHCPREQUEST of 10.xxx.xxx.xxx on eth0 to 255.255.2[28087280.547472] sd 0:0:1:0: rejecting I/O to offline device
55.255 port 67
[28087280.555395] sd 0:0:1:0: rejecting I/O to offline device
Mar 23 05:41:57 xxx dhclient[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx[28087280.567655] sd 0:0:1:0: rejecting I/O to offline device
54
Mar 23 05:41[28087280.575244] sd 0:0:1:0: rejecting I/O to offline device
:57 xxx dhclient[600]: DHCPACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:[28087280.710564] sd 0:0:1:0: rejecting I/O to offline device
41:57 xxx dhc[28087280.717131] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.724725] sd 0:0:1:0: rejecting I/O to offline device
DECLINE on eth0 [28087280.731449] sd 0:0:1:0: rejecting I/O to offline device
to 255.255.255.2[28087280.738646] sd 0:0:1:0: rejecting I/O to offline device
55 port 67
Mar [28087280.745723] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 hoc-[28087280.752348] sd 0:0:1:0: rejecting I/O to offline device
qa dhclient[600][28087280.759874] sd 0:0:1:0: rejecting I/O to offline device
: DHCPDISCOVER o[28087280.766707] sd 0:0:1:0: rejecting I/O to offline device
n eth0 to 255.25[28087280.773666] sd 0:0:1:0: rejecting I/O to offline device
5.255.255 port 6[28087280.780721] sd 0:0:1:0: rejecting I/O to offline device
7 interval 4
Ma[28087280.787709] sd 0:0:1:0: rejecting I/O to offline device
r 23 05:41:57 ho[28087280.794742] sd 0:0:1:0: rejecting I/O to offline device
c-qa dhclient[60[28087280.801786] sd 0:0:1:0: rejecting I/O to offline device
0]: DHCPREQUEST [28087280.808534] sd 0:0:1:0: rejecting I/O to offline device
of 10.xxx.xxx.xxx on[28087280.815978] sd 0:0:1:0: rejecting I/O to offline device
eth0 to 255.255[28087280.822758] sd 0:0:1:0: rejecting I/O to offline device
.255.255 port 67[28087280.829765] sd 0:0:1:0: rejecting I/O to offline device
Mar 23 05:41:57 xxx dhclien[28087280.963410] sd 0:0:1:0: rejecting I/O to offline device
t[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:41:57 xxx dhc[28087280.975106] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.982881] sd 0:0:1:0: rejecting I/O to offline device
ACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
我已经看到一些 SSH 出现问题的线程,但这似乎是引导磁盘或 DHCP 配置的某种错误。
为了防止有人遇到同样的问题,我现在通过在我的 VM 上执行 RESET
操作解决了这个问题。
在努力找出(未成功)适合此问题的解决方案后,我认为这将是更简单的解决方案,尤其是要记住我的虚拟机处于挂起状态。
我最近在 GCP 中暂停了一个 VM。 几天后,我恢复了,但是无论如何都连接不上实例
使用指向外部IP的ping
命令,我没有得到任何回应。当然,使用网络浏览器发送 HTTP 请求也不行。
如果我尝试使用 SSH 连接(使用 gcloud
或浏览器 window),我会遇到连接超时 (Putty) 或 Code: 4003 Reason: failed to connect to backend
。我确实尝试再次编辑显式添加 SSH public 密钥到 VM 设置。另外,我有一个用于不同环境的虚拟机,我从未暂停过,也没有发布连接。
我检查了防火墙规则,似乎一切正常。
如果我启用并连接到串行控制台,我会看到它不断地记录这些奇怪的消息:
Mar 23[28087280.507763] sd 0:0:1:0: rejecting I/O to offline device
05:41:56 xxx[28087280.515939] sd 0:0:1:0: rejecting I/O to offline device
dhclient[600]: DHCPDISCOVER on eth0 to 255.255.255.255 port 67 [28087280.527691] sd 0:0:1:0: rejecting I/O to offline device
interval 8
Mar [28087280.535079] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 xxx dhclient[600]: DHCPREQUEST of 10.xxx.xxx.xxx on eth0 to 255.255.2[28087280.547472] sd 0:0:1:0: rejecting I/O to offline device
55.255 port 67
[28087280.555395] sd 0:0:1:0: rejecting I/O to offline device
Mar 23 05:41:57 xxx dhclient[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx[28087280.567655] sd 0:0:1:0: rejecting I/O to offline device
54
Mar 23 05:41[28087280.575244] sd 0:0:1:0: rejecting I/O to offline device
:57 xxx dhclient[600]: DHCPACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:[28087280.710564] sd 0:0:1:0: rejecting I/O to offline device
41:57 xxx dhc[28087280.717131] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.724725] sd 0:0:1:0: rejecting I/O to offline device
DECLINE on eth0 [28087280.731449] sd 0:0:1:0: rejecting I/O to offline device
to 255.255.255.2[28087280.738646] sd 0:0:1:0: rejecting I/O to offline device
55 port 67
Mar [28087280.745723] sd 0:0:1:0: rejecting I/O to offline device
23 05:41:57 hoc-[28087280.752348] sd 0:0:1:0: rejecting I/O to offline device
qa dhclient[600][28087280.759874] sd 0:0:1:0: rejecting I/O to offline device
: DHCPDISCOVER o[28087280.766707] sd 0:0:1:0: rejecting I/O to offline device
n eth0 to 255.25[28087280.773666] sd 0:0:1:0: rejecting I/O to offline device
5.255.255 port 6[28087280.780721] sd 0:0:1:0: rejecting I/O to offline device
7 interval 4
Ma[28087280.787709] sd 0:0:1:0: rejecting I/O to offline device
r 23 05:41:57 ho[28087280.794742] sd 0:0:1:0: rejecting I/O to offline device
c-qa dhclient[60[28087280.801786] sd 0:0:1:0: rejecting I/O to offline device
0]: DHCPREQUEST [28087280.808534] sd 0:0:1:0: rejecting I/O to offline device
of 10.xxx.xxx.xxx on[28087280.815978] sd 0:0:1:0: rejecting I/O to offline device
eth0 to 255.255[28087280.822758] sd 0:0:1:0: rejecting I/O to offline device
.255.255 port 67[28087280.829765] sd 0:0:1:0: rejecting I/O to offline device
Mar 23 05:41:57 xxx dhclien[28087280.963410] sd 0:0:1:0: rejecting I/O to offline device
t[600]: DHCPOFFER of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
Mar 23 05:41:57 xxx dhc[28087280.975106] sd 0:0:1:0: rejecting I/O to offline device
lient[600]: DHCP[28087280.982881] sd 0:0:1:0: rejecting I/O to offline device
ACK of 10.xxx.xxx.xxx from 169.xxx.xxx.xxx
我已经看到一些 SSH 出现问题的线程,但这似乎是引导磁盘或 DHCP 配置的某种错误。
为了防止有人遇到同样的问题,我现在通过在我的 VM 上执行 RESET
操作解决了这个问题。
在努力找出(未成功)适合此问题的解决方案后,我认为这将是更简单的解决方案,尤其是要记住我的虚拟机处于挂起状态。