kubelet do not restart kube-apiserver #7616
-
Environmental Info: Node(s) CPU architecture, OS, and Version: Cluster Configuration:
basic config file: node-ip: 192.168.99.90
token: deadbeef
tls-san:
- 192.168.99.100
container-runtime-endpoint: /var/run/cri-dockerd.sock
etcd-arg:
- heartbeat-interval=1000
- election-timeout=10000 Describe the bug:
Steps To Reproduce:
Expected behavior:
Actual behavior:
Additional context / logs: kubelet known that apiserver is down, but do not retry to restart it:
|
Beta Was this translation helpful? Give feedback.
Replies: 12 comments
-
kube-apiserver is always down:
etcd is fine:
rke2-server and kubelet is also running, but waiting for API server to become available:
|
Beta Was this translation helpful? Give feedback.
-
May relate to this upstream k8s bug, but I am not sure: kubernetes/kubernetes#105543 |
Beta Was this translation helpful? Give feedback.
-
etcd is running:
|
Beta Was this translation helpful? Give feedback.
-
restart kubelet do not fix this problem, and kubelet stuck on "Starting controller":
|
Beta Was this translation helpful? Give feedback.
-
restart rke2-server do not fix this problem:
|
Beta Was this translation helpful? Give feedback.
-
restart cri-docker do not fix. |
Beta Was this translation helpful? Give feedback.
-
delete the pause container of apiserver do not fix. |
Beta Was this translation helpful? Give feedback.
-
restart docker and then restart rke2-server do not fix, now all docker container is down and rke2-server is waiting for multiple services:
|
Beta Was this translation helpful? Give feedback.
-
This node has a dead nfs mount, force umount it and kubelet run as expected, kube-apiserver is started successfully. |
Beta Was this translation helpful? Give feedback.
-
For anyone facing my problem, check if you have a dead nfs(or other network filesystem) mount in your node, try to umount it ( I do not if this is a bug or this is desired, I hope the rke2 developer may notice this problem. If this is not a but, just close this issue. |
Beta Was this translation helpful? Give feedback.
-
We do not test or support rke2 with the docker container runtime. You're welcome to use it (or any other CRI implementation) but we don't support it. Why are you trying to use docker/cri-dockerd in the first place? |
Beta Was this translation helpful? Give feedback.
-
@brandond And all of us are familier with docker instead of cri/containerd, some of our applications are deployed directly to docker but not k8s. |
Beta Was this translation helpful? Give feedback.
For anyone facing my problem, check if you have a dead nfs(or other network filesystem) mount in your node, try to umount it (
umount -f /PATH
orumount -f -l /PATH
), and check if your kubelet run as expected.I do not if this is a bug or this is desired, I hope the rke2 developer may notice this problem. If this is not a but, just close this issue.