Rancher's RKE fails to start on a new cluster
/opt/kubernetes/bin/rke up --config /home/msh/rancher-cluster.yml
The rancher-cluster.yml file contains:
nodes:
  - address: 192.168.10.34
    internal_address: 172.17.0.2
    user: bsh
    role: [controlplane,etcd]
  - address: 192.168.10.35
    internal_address: 172.17.0.3
    user: bsh
    role: [worker]
  - address: 192.168.10.36
    internal_address: 172.17.0.4
    user: bsh
    role: [worker]
add_job_timeout: 120
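Note: I have not configured an interface with the internal_address on any of the nodes. My understanding was that rancher/k8s would set those up for me, or something along those lines.
Here is the tail end of the rke startup failure.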
INFO[0039] Removing container [rke-bundle-cert] on host [192.168.10.34], try #1
INFO[0039] Image [rancher/rke-tools:v0.1.69] exists on host [192.168.10.34]
INFO[0039] Starting container [rke-log-linker] on host [192.168.10.34], try #1
INFO[0040] [etcd] Successfully started [rke-log-linker] container on host [192.168.10.34]
INFO[0040] Removing container [rke-log-linker] on host [192.168.10.34], try #1
INFO[0040] [remove/rke-log-linker] Successfully removed container on host [192.168.10.34]
INFO[0040] [etcd] Successfully started etcd plane.. Checking etcd cluster health
WARN[0055] [etcd] host [192.168.10.34] failed to check etcd health: failed to get /health for host [192.168.10.34]: Get https://172.17.0.2:2379/health: Unable to access the service on 172.17.0.2:2379. The service might be still starting up. Error: ssh: rejected: connect failed (Connection refused)
FATA[0055] [etcd] Failed to bring up Etcd Plane: etcd cluster is unhealthy: hosts [192.168.10.34] failed to report healthy. Check etcd container logs on each host for more information
Using:
rancher v2.5.2
rke version v1.0.16
docker-ce-19.03.14-3.el8.x86_64
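As far as I understand, the interface configuration must already exist. RKE does not take care of interface configuration.
So either set up your internal subnet and assign interfaces on each node to it, or use the external address for internal communication as well.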
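A minimal sketch of the second option, assuming the same three hosts and the `bsh` user from the question. When `internal_address` is left out, RKE uses `address` for inter-host communication, so no extra interface needs to exist on the nodes. Any other settings from the original file are omitted here and would carry over unchanged.

```yaml
# Sketch only: internal_address removed, so RKE falls back to `address`
# for traffic between nodes (including the etcd health check).
nodes:
  - address: 192.168.10.34
    user: bsh
    role: [controlplane,etcd]
  - address: 192.168.10.35
    user: bsh
    role: [worker]
  - address: 192.168.10.36
    user: bsh
    role: [worker]
```

If a separate internal network is wanted instead, each `internal_address` has to be an address that is already assigned to an interface on that node before `rke up` is run.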