 

kubelet won't start after kubernetes/manifest update

We are seeing some strange behavior in our K8s cluster.

When we try to deploy a new version of our applications we get:

Failed to create pod sandbox: rpc error: code = Unknown desc = failed to set up sandbox container "<container-id>" network for pod "application-6647b7cbdb-4tp2v": networkPlugin cni failed to set up pod "application-6647b7cbdb-4tp2v_default" network: Get "https://[10.233.0.1]:443/api/v1/namespaces/default": dial tcp 10.233.0.1:443: connect: connection refused

I ran kubectl get cs and found the controller-manager and scheduler in an Unhealthy state.
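For anyone unfamiliar with that check, the unhealthy state typically looks something like this (illustrative output, not copied from our cluster; exact messages and ports may differ):

    NAME                 STATUS      MESSAGE                                                                                     ERROR
    controller-manager   Unhealthy   Get http://127.0.0.1:10252/healthz: dial tcp 127.0.0.1:10252: connect: connection refused
    scheduler            Unhealthy   Get http://127.0.0.1:10251/healthz: dial tcp 127.0.0.1:10251: connect: connection refused
    etcd-0               Healthy     {"health":"true"}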

As described here, I updated /etc/kubernetes/manifests/kube-scheduler.yaml and /etc/kubernetes/manifests/kube-controller-manager.yaml by commenting out --port=0.
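For anyone following along, the change looks roughly like the sketch below; this is a generic kubeadm-style static pod manifest excerpt, not a copy of our files, and the surrounding flags may differ on your cluster:

    # /etc/kubernetes/manifests/kube-scheduler.yaml (illustrative excerpt)
    spec:
      containers:
      - command:
        - kube-scheduler
        - --kubeconfig=/etc/kubernetes/scheduler.conf
        - --leader-elect=true
        #- --port=0    # commented out so the insecure health port used by "kubectl get cs" is served again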

When I checked systemctl status kubelet, it was running:

Active: active (running) since Mon 2020-10-26 13:18:46 +0530; 1 years 0 months ago

I then restarted the kubelet service, and the controller-manager and scheduler were shown as healthy.
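For completeness, the restart was done roughly like this (standard systemd commands, nothing cluster-specific assumed):

    sudo systemctl daemon-reload
    sudo systemctl restart kubelet
    kubectl get cs    # controller-manager and scheduler reported Healthy at this point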

But now systemctl status kubelet shows the following (immediately after the restart it briefly showed a running state):

Active: activating (auto-restart) (Result: exit-code) since Thu 2021-11-11 10:50:49 +0530; 3s ago
    Docs: https://github.com/GoogleCloudPlatform/kubernetes
 Process: 21234 ExecStart=/usr/bin/kubelet $KUBELET_KUBECONFIG_ARGS $KUBELET_CONFIG_ARGS $KUBELET_KUBEADM_ARGS $KUBELET

I tried adding Environment="KUBELET_SYSTEM_PODS_ARGS=--pod-manifest-path=/etc/kubernetes/manifests --allow-privileged=true --fail-swap-on=false" to /etc/systemd/system/kubelet.service.d/10-kubeadm.conf as described here, but it is still not working properly.
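The drop-in ends up looking roughly like the sketch below (illustrative; the stock 10-kubeadm.conf differs between versions). Worth noting: the --allow-privileged flag was removed from the kubelet around v1.15, so passing it to a v1.18 kubelet can itself prevent it from starting.

    # /etc/systemd/system/kubelet.service.d/10-kubeadm.conf (illustrative excerpt, not our exact file)
    [Service]
    Environment="KUBELET_KUBECONFIG_ARGS=--bootstrap-kubeconfig=/etc/kubernetes/bootstrap-kubelet.conf --kubeconfig=/etc/kubernetes/kubelet.conf"
    Environment="KUBELET_CONFIG_ARGS=--config=/var/lib/kubelet/config.yaml"
    # line added per the linked suggestion; note --allow-privileged no longer exists on newer kubelets
    Environment="KUBELET_SYSTEM_PODS_ARGS=--pod-manifest-path=/etc/kubernetes/manifests --allow-privileged=true --fail-swap-on=false"
    ExecStart=
    ExecStart=/usr/bin/kubelet $KUBELET_KUBECONFIG_ARGS $KUBELET_CONFIG_ARGS $KUBELET_KUBEADM_ARGS $KUBELET_SYSTEM_PODS_ARGS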

I also un-commented --port=0 again in the above-mentioned manifests and tried restarting; still the same result.

Edit: This issue was due to the kubelet certificate having expired, and it was fixed by following these steps. If someone faces this issue, make sure the /var/lib/kubelet/pki/kubelet-client-current.pem certificate and key values are base64 encoded when placing them in /etc/kubernetes/kubelet.conf.
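A quick way to confirm the expiry on an affected node is to inspect the certificates directly; these are generic openssl checks, nothing kubespray-specific:

    # expiry of the kubelet client certificate on the node
    sudo openssl x509 -in /var/lib/kubelet/pki/kubelet-client-current.pem -noout -enddate
    # expiry of the apiserver certificate (control plane nodes only)
    sudo openssl x509 -in /etc/kubernetes/pki/apiserver.crt -noout -enddate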

Many others suggested running kubeadm init again, but this cluster was created using Kubespray; no nodes were added manually.

We have bare-metal Kubernetes running on Ubuntu 18.04. Kubernetes version: v1.18.8

We would welcome any debugging and fixing suggestions.

PS:
When we try telnet 10.233.0.1 443 from any node, the first attempt fails and the second attempt succeeds.
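10.233.0.1 is the ClusterIP of the default kubernetes Service, so one sanity check is to confirm the Service and its endpoints actually point at a reachable apiserver (default names shown, nothing custom assumed):

    kubectl get svc kubernetes -n default         # ClusterIP should be 10.233.0.1
    kubectl get endpoints kubernetes -n default   # should list the apiserver address(es), e.g. <control-plane-ip>:6443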

Edit: Found this in the kubelet service logs:

Nov 10 17:35:05 node1 kubelet[1951]: W1110 17:35:05.380982    1951 docker_sandbox.go:402] failed to read pod IP from plugin/docker: networkPlugin cni failed on the status hook for pod "app-7b54557dd4-bzjd9_default": unexpected command output nsenter: cannot open /proc/12311/ns/net: No such file or directory
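For reference, these entries can be followed on the node with standard journalctl commands:

    # follow the kubelet logs live
    sudo journalctl -u kubelet -f
    # or show just the most recent entries
    sudo journalctl -u kubelet --no-pager -n 100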
Asked Nov 11 '21 by Sachith Muhandiram


People also ask

How can I check my Kubelet status?

Use kubectl describe pods to check kube-system. If the output from a specific pod is desired, run kubectl describe pod pod_name --namespace kube-system. The Status field should be "Running"; any other status indicates issues with the environment.
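For example (the pod name below is illustrative):

    kubectl get pods --namespace kube-system
    kubectl describe pod kube-scheduler-node1 --namespace kube-system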

What is difference between kubectl and Kubelet?

kubelet: the component that runs on all of the machines in your cluster and does things like starting Pods and containers. kubectl: the command line utility to talk to your cluster.

Does Kubelet run as a container?

One of the kubelet's jobs is to start and stop containers, and the CRI is the interface that the kubelet uses to interact with container runtimes. For example, containerd is categorized as a container runtime because it takes an image and creates a running container.

How to check if kubelet is working or not?

1) Check the status of your docker service. 2) If it is stopped, start it with sudo systemctl start docker. 3) If it is not installed, install it: yum install -y kubelet kubeadm kubectl docker. 4) Now try kubeadm init, and after that check systemctl status kubelet; it should be running.

How to create a Kubernetes kubelet from a failed node?

From a working control plane node in the cluster that has /etc/kubernetes/pki/ca.key, execute kubeadm kubeconfig user --org system:nodes --client-name system:node:$NODE > kubelet.conf. $NODE must be set to the name of the existing failed node in the cluster. Then modify the resulting kubelet.conf.
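As a sketch, the invocation looks roughly like this; on older releases such as v1.18 the same subcommand lives under kubeadm alpha:

    # run on a control plane node that has /etc/kubernetes/pki/ca.key
    NODE=node1   # name of the failed node (illustrative)
    kubeadm kubeconfig user --org system:nodes --client-name system:node:$NODE > kubelet.conf
    # on kubeadm v1.18:
    # kubeadm alpha kubeconfig user --org system:nodes --client-name system:node:$NODE > kubelet.conf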

What to do if bootstrap-kubelet is not working?

A missing bootstrap-kubelet.conf should not be an issue as long as there is a kubelet.conf; see the kubelet flags documentation: "Path to a kubeconfig file that will be used to get client certificate for kubelet."

How to migrate off the dynamic kubelet configuration feature?

If you are using kubeadm, refer to Configuring each kubelet in your cluster using kubeadm. To migrate off the Dynamic Kubelet Configuration feature, an alternative mechanism should be used to distribute kubelet configuration files. To apply the configuration, the config file must be updated and the kubelet restarted.


1 Answer

Posting the comment as a community wiki answer for better visibility:


This issue was due to the kubelet certificate having expired and was fixed by following these steps. If someone faces this issue, make sure the /var/lib/kubelet/pki/kubelet-client-current.pem certificate and key values are base64 encoded when placing them in /etc/kubernetes/kubelet.conf.
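As a sketch of that last step, assuming the combined PEM file contains both the certificate and the key blocks, the values can be extracted and base64-encoded like this; verify the output before pasting it into client-certificate-data / client-key-data in /etc/kubernetes/kubelet.conf:

    # certificate block, single-line base64 -> client-certificate-data
    sudo openssl x509 -in /var/lib/kubelet/pki/kubelet-client-current.pem | base64 -w0; echo
    # private key block, single-line base64 -> client-key-data
    sudo openssl pkey -in /var/lib/kubelet/pki/kubelet-client-current.pem | base64 -w0; echo
    # then restart the kubelet
    sudo systemctl restart kubelet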

Answered Oct 19 '22 by Bazhikov