添加新的 etcd 成员时遇到问题

Have troubles while adding new etcd members

我打算将新成员添加到 etcd 的单个实例中,但遇到了问题。

我使用以下命令启动了第一个 etcd 成员:

nohup etcd \
  --advertise-client-urls=https://192.168.22.34:2379 \
  --cert-file=/etc/kubernetes/pki/etcd/server.crt \
  --client-cert-auth=true \
  --data-dir=/var/lib/etcd \
  --initial-advertise-peer-urls=https://192.168.22.34:2380 \
  --initial-cluster=test-master-01=https://192.168.22.34:2380 \
  --key-file=/etc/kubernetes/pki/etcd/server.key \
  --listen-client-urls=https://0.0.0.0:2379 \
  --listen-peer-urls=https://192.168.22.34:2380 \
  --name=test-master-01 \
  --peer-cert-file=/etc/kubernetes/pki/etcd/peer.crt \
  --peer-client-cert-auth=true \
  --peer-key-file=/etc/kubernetes/pki/etcd/peer.key \
  --peer-trusted-ca-file=/etc/kubernetes/pki/etcd/ca.crt \
  --snapshot-count=10000 \
  --trusted-ca-file=/etc/kubernetes/pki/etcd/ca.crt &

然后我检查了集群的健康状况,似乎是健康的:

member f13d668ae0cba84 is healthy: got healthy result from https://192.168.22.34:2379
cluster is healthy  

我也查了成员:

f13d668ae0cba84: name=test-master-01 peerURLs=http://192.168.22.34:2380 clientURLs=https://192.168.22.34:2379 isLeader=true

然后我尝试添加第二个成员:

etcdctl \
  --endpoints=https://127.0.0.1:2379 \
  --ca-file=/etc/kubernetes/pki/etcd/ca.crt \
  --cert-file=/etc/kubernetes/pki/etcd/healthcheck-client.crt \
  --key-file=/etc/kubernetes/pki/etcd/healthcheck-client.key \
  member add test-master-02 https://192.168.22.37:2380

Added member named test-master-02 with ID 65bec874cca265d8 to cluster ETCD_NAME="test-master-02"
ETCD_INITIAL_CLUSTER="test-master-01=http://192.168.22.34:2380,test-master-02=https://192.168.22.37:2380"
ETCD_INITIAL_CLUSTER_STATE="existing"

然后使用以下命令启动第二个 etcd 成员:

etcd \
  --name test-master-02 \
  --listen-client-urls https://192.168.22.37:2379 \
  --advertise-client-urls https://192.168.22.37:2379 \
  --listen-peer-urls https://192.168.22.37:2380 \
  --cert-file=/etc/kubernetes/pki/etcd/server.crt \
  --client-cert-auth=true \
  --trusted-ca-file=/etc/kubernetes/pki/etcd/ca.crt \
  --peer-cert-file=/etc/kubernetes/pki/etcd/peer.crt \
  --peer-client-cert-auth=true \
  --peer-key-file=/etc/kubernetes/pki/etcd/peer.key \
  --key-file=/etc/kubernetes/pki/etcd/server.key \
  --initial-cluster-state=existing \
  --initial-cluster=test-master-01=https://192.168.22.34:2380,test-master-02=https://192.168.22.37:2380

但是我得到一个错误:

etcdmain: error validating peerURLs {ClusterID:bc8c76911939f2de Members:[&{ID:f13d668ae0cba84 RaftAttributes:{PeerURLs:[http://192.168.22.34:2380]} Attributes:{Name:test-master-01 ClientURLs:[https://192.168.22.34:2379]}} &{ID:65bec874cca265d8 RaftAttributes:{PeerURLs:[https://192.168.22.37:2380]} Attributes:{Name: ClientURLs:[]}}] RemovedMemberIDs:[]}: unmatched member while checking PeerURLs

更新 看起来我在没有从快照恢复的情况下从头开始集群时没有这样的问题。

发现在添加新成员之前我需要更新我的主要 etcd 成员,因为成员列表命令在 peerurl

上返回 127.0.0.1 而不是 etcd 配置,