v1.16 - HA master join failure - etcdserver: leader changed #1843
Labels: area/etcd, kind/bug, kind/support, triage/needs-information
BUG REPORT
Versions
kubeadm version:
Environment:
Cloud provider or hardware configuration:
In a VirtualBox VM network. 6 VMS. 3 masters, 3 workers.
OS (e.g. from /etc/os-release):
Ubuntu 16.04
Kernel (e.g. uname -a): Linux 192-168-123-102 4.15.0-65-generic #74~16.04.1-Ubuntu SMP Wed Sep 18 09:51:44 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Others:
We use an automated script which uses kubeadm to spin up the first master. It then captures the relevant details to spin up 2 additional masters "simultaneously".
What happened?
Upon attempting to bring up the 3rd of 3 HA masters using kubeadm, the kubeadm join command fails with the error below. It seems pretty self-explanatory: kubeadm doesn't deal well with an etcd leader change, and I'm guessing that the leader changes when the 2nd node joins the cluster. We can consistently reproduce this, even if we wait a while between spinning up master #2 and master #3.
This has never occurred, to my knowledge, in version 1.14. We have spun up hundreds of clusters in 1.14.
What you expected to happen?
I expected kubeadm join to succeed and the current node to join the HA master quorum.
How to reproduce it (as minimally and precisely as possible)?
Create a master node, collect the relevant details (token, cert hash, etc.), then use them to start 2 additional masters, as close to simultaneously as possible.
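For anyone trying to reproduce this, here is a rough sketch of what the "simultaneous" join step could look like. This is not our actual script. The helper names, the use of ssh, and the assumption that control-plane certificates are distributed with --certificate-key (rather than copied by hand) are all hypothetical; the endpoint, token, hash, and key values have to come from the first master.

```python
import shlex
import subprocess
from concurrent.futures import ThreadPoolExecutor


def build_join_command(endpoint, token, ca_cert_hash, certificate_key):
    """Return the argv that joins a node as an additional control-plane member.

    Assumes certificates were uploaded on the first master, so that
    --certificate-key can be used here (an assumption, not from the report).
    """
    return [
        "kubeadm", "join", endpoint,
        "--token", token,
        "--discovery-token-ca-cert-hash", ca_cert_hash,
        "--control-plane",
        "--certificate-key", certificate_key,
    ]


def remote_command(join_argv):
    """Quote the argv into a single shell line to run as root on the remote host."""
    return "sudo " + " ".join(shlex.quote(arg) for arg in join_argv)


def join_over_ssh(host, join_argv):
    """Run the join on `host` via ssh; return (host, exit code, combined output)."""
    proc = subprocess.run(
        ["ssh", host, remote_command(join_argv)],
        capture_output=True,
        text=True,
    )
    return host, proc.returncode, proc.stdout + proc.stderr


def join_simultaneously(hosts, join_argv):
    """Start the join on every host at (roughly) the same time, one thread per host."""
    with ThreadPoolExecutor(max_workers=len(hosts)) as pool:
        return list(pool.map(lambda h: join_over_ssh(h, join_argv), hosts))
```

Calling join_simultaneously(["master-2", "master-3"], build_join_command(...)) with real values (the host names here are placeholders) and checking which result has a non-zero exit code should show which join failed and what it printed.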
Anything else we need to know?
You people rock. I love kubeadm.